This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-01-19 10:04:38 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
platforms
History
Thien Tran
a0933c3bd6
[Bugfix] Enable FP8 KV cache for FlashInfer and Triton backend on non-sm100 GPUs (
#24577
)
...
Signed-off-by: Thien Tran <gau.nernst@yahoo.com.sg>
2025-09-10 12:33:41 -07:00
..
__init__.py
[V0 deprecation] Deprecate V0 Neuron backend (
#21159
)
2025-09-06 16:15:18 -07:00
cpu.py
[Hardware][Apple-CPU] Enable native bfloat16 on Apple Silicon (M2 and later) (
#24129
)
2025-09-10 03:50:21 +00:00
cuda.py
[Bugfix] Enable FP8 KV cache for FlashInfer and Triton backend on non-sm100 GPUs (
#24577
)
2025-09-10 12:33:41 -07:00
interface.py
Feature/vit attention unification# 23880 (
#23978
)
2025-09-10 06:10:14 -07:00
rocm.py
[rocm] enable torchao quantization for rocm (
#24400
)
2025-09-10 06:16:21 -07:00
tpu.py
[XPU][P/D] Add XPU support in NixlConnector (
#22436
)
2025-09-04 21:03:12 -07:00
xpu.py
[XPU][P/D] Add XPU support in NixlConnector (
#22436
)
2025-09-04 21:03:12 -07:00