Logo
Explore Help
Sign In
xinyun/vllm
1
0
Fork 0
You've already forked vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-01-04 08:46:31 +08:00
Code Issues Packages Projects Releases Wiki Activity
vllm/vllm/platforms
History
Chendi.Xue 460d02a417
[NIXL] Fix after virtual block_size for host_buffer with heter kv_layout (#29122)
Signed-off-by: Chendi Xue <chendi.xue@intel.com>
2025-11-21 08:55:27 -08:00
..
__init__.py
[TPU] Rename path to tpu platform (#28452)
2025-11-11 19:16:47 +00:00
cpu.py
[Misc] Make SchedulerConfig.max_model_len init-only (#28733)
2025-11-15 01:59:31 -08:00
cuda.py
[Attention] FlashAttention ViT support, make default backend (#28763)
2025-11-18 20:06:21 -08:00
interface.py
[CI Failure] Fix backend selection for encoder-only models (#28534)
2025-11-13 10:11:27 -05:00
rocm.py
[Attention] Add ROCM_AITER_MLA_SPARSE to attention backend registry (#29103)
2025-11-20 20:24:43 -08:00
tpu.py
[Misc] Make SchedulerConfig.max_model_len init-only (#28733)
2025-11-15 01:59:31 -08:00
xpu.py
[NIXL] Fix after virtual block_size for host_buffer with heter kv_layout (#29122)
2025-11-21 08:55:27 -08:00
Powered by Gitea Version: 1.23.1 Page: 1120ms Template: 3ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API