vllm/spec_decode at 69e1d2fb6922b2d388bae41286d8867976cbd6c6 - vllm - 丝路新云 (code repository)

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-21 02:57:12 +08:00

Antoni Baum 69e1d2fb69

[Core] Refactor model loading code (#4097)

2024-04-16 11:34:39 -07:00

..

[Speculative decoding] Adding configuration object for speculative decoding (#3706)

2024-04-03 00:40:57 +00:00

__init__.py

[Speculative decoding 3/9] Worker which speculates, scores, and applies rejection sampling (#3103)

2024-03-08 23:32:46 -08:00

test_batch_expansion.py

[Misc] Add pytest marker to opt-out of global test cleanup (#3863)

2024-04-04 21:54:16 -07:00

test_metrics.py

[CI] Try introducing isort. (#3495)

2024-03-25 07:59:47 -07:00

test_multi_step_worker.py

[CI] Try introducing isort. (#3495)

2024-03-25 07:59:47 -07:00

test_spec_decode_worker.py

[Misc] [Core] Implement RFC "Augment BaseExecutor interfaces to enable hardware-agnostic speculative decoding" (#3837)

2024-04-09 11:44:15 -07:00

test_utils.py

[CI] Try introducing isort. (#3495)

2024-03-25 07:59:47 -07:00

utils.py

[Core] Refactor model loading code (#4097)

2024-04-16 11:34:39 -07:00