This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-05-02 04:57:54 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
config
History
nopperl
2b30afa442
Use hidden_size_per_head as head_size fallback (
#24221
)
...
Signed-off-by: nopperl <54780682+nopperl@users.noreply.github.com>
2025-09-04 12:59:16 +01:00
..
__init__.py
Use hidden_size_per_head as head_size fallback (
#24221
)
2025-09-04 12:59:16 +01:00
cache.py
[V1] Enable prefill optimization for Gemma3n (
#22628
)
2025-08-28 14:54:30 -07:00
compilation.py
[V1] v1 engine + full CUDA graph support for PLaMo2 (
#23998
)
2025-09-03 08:24:02 -07:00
parallel.py
fix some typos (
#24071
)
2025-09-02 20:44:50 -07:00
scheduler.py
[V0 Deprecation] Remove args for multi-step scheduling (
#22779
)
2025-08-12 20:38:18 -07:00
utils.py
Extract
CompilationConfig
from
config.py
(
#22524
)
2025-08-08 16:34:25 -07:00