xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-03-29 13:51:27 +08:00)
vllm/vllm/v1/spec_decode
Lucas Wilkinson  de71747655  2025-12-22 13:06:10 -08:00
[SpecDecode] Simplified alternative padded-speculation acceptance rate fix (#29845)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
File                 Last commit                                                                            Date
__init__.py          …
eagle.py             [SpecDecode] Simplified alternative padded-speculation acceptance rate fix (#29845)   2025-12-22 13:06:10 -08:00
medusa.py            [V1][Spec Decode] Optimize Medusa proposer to avoid GPU-CPU sync (#29723)             2025-12-10 00:15:01 +00:00
metadata.py          …
metrics.py           [mypy] Enable type checking for more directories (#29674)                             2025-11-28 08:39:27 -08:00
ngram_proposer.py    [BugFix] Fix index error in ngram_proposer (#29779)                                   2025-12-02 04:48:11 +00:00
suffix_decoding.py   Revert "[Redo] #26368 (#28771)" (#29121)                                              2025-11-20 21:27:45 -08:00
utils.py             [SpecDecode] Simplified alternative padded-speculation acceptance rate fix (#29845)   2025-12-22 13:06:10 -08:00