vllm/vllm/engine (mirror of https://git.datalinker.icu/vllm-project/vllm.git)

Latest commit: 076169f603 by Kunshang Ji, 2024-08-27 10:07:02 -07:00
[Hardware][Intel GPU] Add intel GPU pipeline parallel support. (#7810)
Name                  Last commit                                                                          Last updated
output_processor/     [Core] Asynchronous Output Processor (#7049)                                         2024-08-26 20:53:20 -07:00
__init__.py           Change the name to vLLM (#150)                                                       2023-06-17 03:07:40 -07:00
arg_utils.py          [Model] Add Mistral Tokenization to improve robustness and chat encoding (#7739)    2024-08-27 12:40:02 +00:00
async_llm_engine.py   [Hardware][Intel GPU] Add intel GPU pipeline parallel support. (#7810)              2024-08-27 10:07:02 -07:00
async_timeout.py      [Bugfix] AsyncLLMEngine hangs with asyncio.run (#5654)                               2024-06-19 13:57:12 -07:00
llm_engine.py         [Hardware][Intel GPU] Add intel GPU pipeline parallel support. (#7810)              2024-08-27 10:07:02 -07:00
metrics_types.py      [MISC] Add prefix cache hit rate to metrics (#7606)                                  2024-08-19 11:52:07 -07:00
metrics.py            [MISC] Add prefix cache hit rate to metrics (#7606)                                  2024-08-19 11:52:07 -07:00
protocol.py           [misc] Add Torch profiler support (#7451)                                            2024-08-21 15:39:26 -07:00
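
For orientation: llm_engine.py holds the synchronous LLMEngine that the top-level vllm.LLM class drives, arg_utils.py defines the EngineArgs/AsyncEngineArgs configuration dataclasses, and async_llm_engine.py wraps the engine as AsyncLLMEngine for serving. The sketch below shows how both paths are typically exercised. It assumes vllm is installed, the model name is only an example, and exact call signatures vary between vLLM versions, so treat it as illustrative rather than a definitive API reference.

# Synchronous path: the LLM wrapper builds an LLMEngine (llm_engine.py)
# internally from the engine arguments defined in arg_utils.py.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")  # example model, not prescribed by this repo
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=32)
for out in llm.generate(["Hello, my name is"], params):
    print(out.outputs[0].text)

# Asynchronous path: AsyncLLMEngine (async_llm_engine.py) streams cumulative
# RequestOutput objects per request, following the pattern used by vLLM's
# own API server.
import asyncio
from vllm.engine.arg_utils import AsyncEngineArgs
from vllm.engine.async_llm_engine import AsyncLLMEngine

async def main():
    engine = AsyncLLMEngine.from_engine_args(
        AsyncEngineArgs(model="facebook/opt-125m")
    )
    final = None
    async for request_output in engine.generate(
        "Hello, my name is", SamplingParams(max_tokens=32), request_id="req-0"
    ):
        final = request_output  # each yield carries the generation so far
    print(final.outputs[0].text)

asyncio.run(main())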