vllm/engine at 4ca65a97638054ed04b37c2bf3e868d4c1209e9c - vllm - 丝路新云 Code Repository

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-24 09:47:13 +08:00

Isotr0py 4ca65a9763

[Core][Bugfix] Accept GGUF model without .gguf extension (#8056)

2024-09-02 08:43:26 -04:00

output_processor

[BugFix][Core] Multistep Fix Crash on Request Cancellation (#8059)

2024-08-31 19:44:03 +00:00

__init__.py

Change the name to vLLM (#150)

2023-06-17 03:07:40 -07:00

arg_utils.py

[Core][Bugfix] Accept GGUF model without .gguf extension (#8056)

2024-09-02 08:43:26 -04:00

async_llm_engine.py

[Core] Logprobs support in Multi-step (#7652)

2024-08-29 19:19:08 -07:00

async_timeout.py

[Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654)

2024-06-19 13:57:12 -07:00

llm_engine.py

[Core] Increase default max_num_batched_tokens for multimodal models (#8028)

2024-08-30 08:20:34 -07:00

metrics_types.py

[MISC] Add prefix cache hit rate to metrics (#7606)

2024-08-19 11:52:07 -07:00

metrics.py

[MISC] Add prefix cache hit rate to metrics (#7606)

2024-08-19 11:52:07 -07:00

protocol.py

[Core] Logprobs support in Multi-step (#7652)

2024-08-29 19:19:08 -07:00