This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-01-02 17:54:10 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
platforms
History
elvischenv
2dec7c1a5d
[Bugfix][CUDA] fixes CUDA FP8 kv cache dtype supported (
#21420
)
...
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>
2025-07-22 20:34:50 -07:00
..
__init__.py
[V0 deprecation] Remove V0 HPU backend (
#21131
)
2025-07-17 16:37:36 -07:00
cpu.py
[CPU] Enable shared-memory based pipeline parallel for CPU backend (
#21289
)
2025-07-21 09:07:08 -07:00
cuda.py
[Bugfix][CUDA] fixes CUDA FP8 kv cache dtype supported (
#21420
)
2025-07-22 20:34:50 -07:00
interface.py
Fix kv_cache_dtype handling for out-of-tree HPU plugin (
#21302
)
2025-07-21 23:35:14 -07:00
neuron.py
[Refactor]Abstract Platform Interface for Distributed Backend and Add xccl Support for Intel XPU (
#19410
)
2025-07-07 04:32:32 +00:00
rocm.py
Fix kv_cache_dtype handling for out-of-tree HPU plugin (
#21302
)
2025-07-21 23:35:14 -07:00
tpu.py
Fix kv_cache_dtype handling for out-of-tree HPU plugin (
#21302
)
2025-07-21 23:35:14 -07:00
xpu.py
[BugFix] Fix
VllmConfig()
construction on all platforms (
#20695
)
2025-07-10 07:00:20 +00:00