vllm/attention at d23539549a6db54ab152ce4e566c31f6891ddab5 - vllm - 丝路新云 Code Repository

xinyun/vllm

Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-04-17 22:07:05 +08:00

Lukas Geiger 76e4dcf225

[Misc] Remove unused attention prefix prefill ops functions (#26971)

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

2025-11-11 18:26:04 +00:00

[Attention] Refactor CUDA attention backend selection logic (#24794)

2025-11-11 07:40:44 -05:00

[V0 deprecation] Remove VLLM_USE_V1 usage in most modules (#27955)

2025-11-04 20:51:16 -08:00

[Misc] Remove unused attention prefix prefill ops functions (#26971)

2025-11-11 18:26:04 +00:00

[XPU] Add gpt-oss model support for Intel GPU (#27786)

2025-11-05 02:17:23 +00:00

__init__.py

…

layer.py

[Attention] Refactor CUDA attention backend selection logic (#24794)

2025-11-11 07:40:44 -05:00

selector.py

[Attention] Refactor CUDA attention backend selection logic (#24794)

2025-11-11 07:40:44 -05:00