vllm/engine at b5b647b084de3a5a29d35ca527c9901f8e6a4e7e - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-28 20:27:24 +08:00

History

tomeras91 7c32b6861e

[Frontend] correctly record prefill and decode time metrics (#10853 )

Signed-off-by: Tomer Asida <tomera@ai21.com>

2024-12-03 19:13:31 +00:00

..

multiprocessing

[Core][Performance] Add XGrammar support for guided decoding and set it as default (#10785 )

2024-12-03 15:17:00 +08:00

output_processor

[V1] AsyncLLM Implementation (#9826 )

2024-11-11 23:05:38 +00:00

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

arg_utils.py

[Core][Performance] Add XGrammar support for guided decoding and set it as default (#10785 )

2024-12-03 15:17:00 +08:00

async_llm_engine.py

[Core][Performance] Add XGrammar support for guided decoding and set it as default (#10785 )

2024-12-03 15:17:00 +08:00

async_timeout.py

[Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654 )

2024-06-19 13:57:12 -07:00

llm_engine.py

[Core][Performance] Add XGrammar support for guided decoding and set it as default (#10785 )

2024-12-03 15:17:00 +08:00

metrics_types.py

[Metrics] add more metrics (#4464 )

2024-11-12 00:17:38 +08:00

metrics.py

[Frontend] correctly record prefill and decode time metrics (#10853 )

2024-12-03 19:13:31 +00:00

protocol.py

[Misc] Rename embedding classes to pooling (#10801 )

2024-12-01 14:36:51 +08:00