vllm/platforms at 239ef0c1ac0dfe68d8d2e28c54ecf9aa9bcd945b - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-27 06:14:25 +08:00

History

Michael Goin 239ef0c1ac

[CI Failure] Fix fp8 kv cache on <SM90 (#25396 )

Signed-off-by: mgoin <mgoin64@gmail.com>

2025-09-22 18:27:51 +00:00

..

__init__.py

[V0 deprecation] Deprecate V0 Neuron backend (#21159 )

2025-09-06 16:15:18 -07:00

cpu.py

[V0 Deprecation] Remove async_output_proc, preemption mode, delay factor (#25334 )

2025-09-21 08:52:32 -07:00

cuda.py

[CI Failure] Fix fp8 kv cache on <SM90 (#25396 )

2025-09-22 18:27:51 +00:00

interface.py

[V1][Attention] Split triton_attn in triton-only and rocm specific backends (#24648 )

2025-09-22 15:20:28 +00:00

rocm.py

[V1][Attention] Split triton_attn in triton-only and rocm specific backends (#24648 )

2025-09-22 15:20:28 +00:00

tpu.py

[V0 Deprecation] Remove async_output_proc, preemption mode, delay factor (#25334 )

2025-09-21 08:52:32 -07:00

xpu.py

refactor: abstract graph mode support into platform interface (#25161 )

2025-09-22 10:22:29 +00:00