vllm/worker at 379da6dcb5f5d062d0452b2fc23291e5113dcf04 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-26 18:27:13 +08:00

History

Woosuk Kwon 0ee535b294

[Misc] Set block size at initialization & Fix test_model_runner (#4705 )

2024-05-09 09:04:59 -07:00

..

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

cache_engine.py

[Core][Optimization] change python dict to pytorch tensor for blocks to swap (#4659 )

2024-05-08 12:07:05 -07:00

cpu_model_runner.py

[Misc] Set block size at initialization & Fix test_model_runner (#4705 )

2024-05-09 09:04:59 -07:00

cpu_worker.py

[Misc] Set block size at initialization & Fix test_model_runner (#4705 )

2024-05-09 09:04:59 -07:00

model_runner.py

[Misc] Set block size at initialization & Fix test_model_runner (#4705 )

2024-05-09 09:04:59 -07:00

neuron_model_runner.py

[Core][Model runner refactoring 1/N] Refactor attn metadata term (#4518 )

2024-05-03 10:20:12 -07:00

neuron_worker.py

[Core] RayWorkerVllm --> WorkerWrapper to reduce duplication (#4024 )

2024-04-17 08:34:33 +00:00

worker_base.py

[Misc][Refactor] Introduce ExecuteModelData (#4540 )

2024-05-03 17:47:07 -07:00

worker.py

[Misc] Set block size at initialization & Fix test_model_runner (#4705 )

2024-05-09 09:04:59 -07:00