This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-01-19 16:24:30 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
platforms
History
Michael Goin
239ef0c1ac
[CI Failure] Fix fp8 kv cache on <SM90 (
#25396
)
...
Signed-off-by: mgoin <mgoin64@gmail.com>
2025-09-22 18:27:51 +00:00
..
__init__.py
[V0 deprecation] Deprecate V0 Neuron backend (
#21159
)
2025-09-06 16:15:18 -07:00
cpu.py
[V0 Deprecation] Remove async_output_proc, preemption mode, delay factor (
#25334
)
2025-09-21 08:52:32 -07:00
cuda.py
[CI Failure] Fix fp8 kv cache on <SM90 (
#25396
)
2025-09-22 18:27:51 +00:00
interface.py
[V1][Attention] Split triton_attn in triton-only and rocm specific backends (
#24648
)
2025-09-22 15:20:28 +00:00
rocm.py
[V1][Attention] Split triton_attn in triton-only and rocm specific backends (
#24648
)
2025-09-22 15:20:28 +00:00
tpu.py
[V0 Deprecation] Remove async_output_proc, preemption mode, delay factor (
#25334
)
2025-09-21 08:52:32 -07:00
xpu.py
refactor: abstract graph mode support into platform interface (
#25161
)
2025-09-22 10:22:29 +00:00