vllm/attention at 5a4b4b3729e1a1594bf56d38b7c8d3f556754634 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-04-23 12:57:09 +08:00

History

Yongye Zhu 007dd90859

[gpt-oss] Enable gpt-oss on ampere (#22714 )

Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>

2025-08-12 03:21:44 -07:00

..

[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588 )

2025-08-06 18:40:52 -07:00

[Docs] Fix warnings in docs build (#22588 )

2025-08-10 05:49:51 -07:00

[TPU] kv cache update kernel doesn't need to be padded slices to multiple of num_slices_per_block (#22394 )

2025-08-09 20:49:04 -07:00

[MISC] Add init files for python package (#20908 )

2025-07-15 12:16:33 +00:00

__init__.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

layer.py

[gpt-oss] Enable gpt-oss on ampere (#22714 )

2025-08-12 03:21:44 -07:00

selector.py

[gpt-oss] Enable gpt-oss on ampere (#22714 )

2025-08-12 03:21:44 -07:00