vllm/engine at 221d93ecbf51102df69deaf153d35df6d93370f6 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-27 22:47:20 +08:00

History

Simon Mo a134ef6f5e

Support eos_token_id from generation_config.json (#4182 )

2024-04-19 04:13:36 +00:00

..

output_processor

[Speculative decoding 6/9] Integrate speculative decoding with LLMEngine (#3894 )

2024-04-16 13:09:21 -07:00

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

arg_utils.py

[Bugfix] Get available quantization methods from quantization registry (#4098 )

2024-04-18 00:21:55 -07:00

async_llm_engine.py

[CI/CD] add neuron docker and ci test scripts (#3571 )

2024-04-18 15:26:01 -07:00

llm_engine.py

Support eos_token_id from generation_config.json (#4182 )

2024-04-19 04:13:36 +00:00

metrics.py

[Bugfix] fix_log_time_in_metrics (#4050 )

2024-04-13 07:52:36 -07:00

ray_utils.py

[Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024 )

2024-04-17 08:34:33 +00:00