This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-04-15 15:47:05 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
engine
History
Simon Mo
a134ef6f5e
Support eos_token_id from generation_config.json (
#4182
)
2024-04-19 04:13:36 +00:00
..
output_processor
[Speculative decoding 6/9] Integrate speculative decoding with LLMEngine (
#3894
)
2024-04-16 13:09:21 -07:00
__init__.py
Change the name to vLLM (
#150
)
2023-06-17 03:07:40 -07:00
arg_utils.py
[Bugfix] Get available quantization methods from quantization registry (
#4098
)
2024-04-18 00:21:55 -07:00
async_llm_engine.py
[CI/CD] add neuron docker and ci test scripts (
#3571
)
2024-04-18 15:26:01 -07:00
llm_engine.py
Support eos_token_id from generation_config.json (
#4182
)
2024-04-19 04:13:36 +00:00
metrics.py
[Bugfix] fix_log_time_in_metrics (
#4050
)
2024-04-13 07:52:36 -07:00
ray_utils.py
[Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (
#4024
)
2024-04-17 08:34:33 +00:00