xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2025-12-10 05:45:00 +08:00
vllm/tests/spec_decode
Latest commit: 69e1d2fb69 | Antoni Baum | [Core] Refactor model loading code (#4097) | 2024-04-16 11:34:39 -07:00
e2e | [Speculative decoding] Adding configuration object for speculative decoding (#3706) | 2024-04-03 00:40:57 +00:00
__init__.py | [Speculative decoding 3/9] Worker which speculates, scores, and applies rejection sampling (#3103) | 2024-03-08 23:32:46 -08:00
test_batch_expansion.py | [Misc] Add pytest marker to opt-out of global test cleanup (#3863) | 2024-04-04 21:54:16 -07:00
test_metrics.py | [CI] Try introducing isort. (#3495) | 2024-03-25 07:59:47 -07:00
test_multi_step_worker.py | [CI] Try introducing isort. (#3495) | 2024-03-25 07:59:47 -07:00
test_spec_decode_worker.py | [Misc] [Core] Implement RFC "Augment BaseExecutor interfaces to enable hardware-agnostic speculative decoding" (#3837) | 2024-04-09 11:44:15 -07:00
test_utils.py | [CI] Try introducing isort. (#3495) | 2024-03-25 07:59:47 -07:00
utils.py | [Core] Refactor model loading code (#4097) | 2024-04-16 11:34:39 -07:00