vllm/platforms at 23637dcdef9ecc39df6a0e33871ed48c5f9dfcbd - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-23 18:57:20 +08:00

History

elvischenv 2dec7c1a5d

[Bugfix][CUDA] fixes CUDA FP8 kv cache dtype supported (#21420 )

Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>

2025-07-22 20:34:50 -07:00

..

__init__.py

[V0 deprecation] Remove V0 HPU backend (#21131 )

2025-07-17 16:37:36 -07:00

cpu.py

[CPU] Enable shared-memory based pipeline parallel for CPU backend (#21289 )

2025-07-21 09:07:08 -07:00

cuda.py

[Bugfix][CUDA] fixes CUDA FP8 kv cache dtype supported (#21420 )

2025-07-22 20:34:50 -07:00

interface.py

Fix kv_cache_dtype handling for out-of-tree HPU plugin (#21302 )

2025-07-21 23:35:14 -07:00

neuron.py

[Refactor]Abstract Platform Interface for Distributed Backend and Add xccl Support for Intel XPU (#19410 )

2025-07-07 04:32:32 +00:00

rocm.py

Fix kv_cache_dtype handling for out-of-tree HPU plugin (#21302 )

2025-07-21 23:35:14 -07:00

tpu.py

Fix kv_cache_dtype handling for out-of-tree HPU plugin (#21302 )

2025-07-21 23:35:14 -07:00

xpu.py

[BugFix] Fix VllmConfig() construction on all platforms (#20695 )

2025-07-10 07:00:20 +00:00