vllm/worker at 95e7d4a97cd64f8c6dc226ec0bbceebef6458701 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-21 17:27:26 +08:00

History

bigPYJ1151 8afca50889

[Hardware][Intel] Isolate CPUModelRunner and ModelRunner for better maintenance (#3824 )

2024-04-11 11:56:49 -07:00

..

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

cache_engine.py

[Misc] [Core] Implement RFC "Augment BaseExecutor interfaces to enable hardware-agnostic speculative decoding" (#3837 )

2024-04-09 11:44:15 -07:00

cpu_model_runner.py

[Hardware][Intel] Isolate CPUModelRunner and ModelRunner for better maintenance (#3824 )

2024-04-11 11:56:49 -07:00

cpu_worker.py

[Hardware][Intel] Isolate CPUModelRunner and ModelRunner for better maintenance (#3824 )

2024-04-11 11:56:49 -07:00

model_runner.py

[Core][5/N] Fully working chunked prefill e2e (#3884 )

2024-04-10 17:56:48 -07:00

neuron_model_runner.py

[Misc] Minor fix in KVCache type (#3652 )

2024-03-26 23:14:06 -07:00

neuron_worker.py

[Misc] [Core] Implement RFC "Augment BaseExecutor interfaces to enable hardware-agnostic speculative decoding" (#3837 )

2024-04-09 11:44:15 -07:00

worker_base.py

[Misc] [Core] Implement RFC "Augment BaseExecutor interfaces to enable hardware-agnostic speculative decoding" (#3837 )

2024-04-09 11:44:15 -07:00

worker.py

[Core][Refactor] move parallel_utils into vllm/distributed (#3950 )

2024-04-10 15:33:30 -07:00