vllm/spec_decode at cc63d03fbb93f2b984d38e1f5626f523c1f9f1a4 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-28 03:41:20 +08:00

History

Zhuohan Li 2f8844ba08

Re-enable the 80 char line width limit (#3305 )

2024-03-10 19:49:14 -07:00

..

batch_expansion.py

Re-enable the 80 char line width limit (#3305 )

2024-03-10 19:49:14 -07:00

interfaces.py

[Speculative decoding 3/9] Worker which speculates, scores, and applies rejection sampling (#3103 )

2024-03-08 23:32:46 -08:00

metrics.py

[Speculative decoding 3/9] Worker which speculates, scores, and applies rejection sampling (#3103 )

2024-03-08 23:32:46 -08:00

multi_step_worker.py

Re-enable the 80 char line width limit (#3305 )

2024-03-10 19:49:14 -07:00

spec_decode_worker.py

Re-enable the 80 char line width limit (#3305 )

2024-03-10 19:49:14 -07:00

util.py

[Speculative decoding 3/9] Worker which speculates, scores, and applies rejection sampling (#3103 )

2024-03-08 23:32:46 -08:00