Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-01-19 03:14:29 +08:00.
vllm / vllm / model_executor / model_loader (History)
Latest commit: 614aa51203 by youkaichao, "[misc][cuda] use nvml to avoid accidentally cuda initialization" (#6007), 2024-06-30 20:07:34 -07:00 (see the NVML sketch after the file listing).
__init__.py       [Misc] Enhance attention selector (#4751)  2024-05-13 10:47:25 -07:00
loader.py         [misc][cuda] use nvml to avoid accidentally cuda initialization (#6007)  2024-06-30 20:07:34 -07:00
neuron.py         [Typing] Mypy typing part 2 (#4043)  2024-04-17 17:28:43 -07:00
openvino.py       [Hardware][Intel] OpenVINO vLLM backend (#5379)  2024-06-28 13:50:16 +00:00
tensorizer.py     [Frontend] Add FlexibleArgumentParser to support both underscore and dash in names (#5718)  2024-06-20 17:00:13 -06:00
utils.py          [Kernel] FP8 support for MoE kernel / Mixtral (#4244)  2024-04-24 01:18:23 +00:00
weight_utils.py   [mypy] Enable type checking for test directory (#5017)  2024-06-15 04:45:31 +00:00
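
The loader.py entry above (and the directory's latest commit, #6007) is about querying GPU properties through NVML so that simply importing the loader does not accidentally create a CUDA context. The snippet below is a minimal sketch of that general pattern using the pynvml bindings; it illustrates the technique named in the commit message and is not the actual vLLM code. It assumes the pynvml package is installed.

```python
# Minimal sketch (assumption: pynvml is installed) of reading GPU properties
# through NVML instead of torch.cuda, so the querying process never initializes
# a CUDA context as a side effect.
import pynvml

pynvml.nvmlInit()
try:
    for i in range(pynvml.nvmlDeviceGetCount()):
        handle = pynvml.nvmlDeviceGetHandleByIndex(i)
        name = pynvml.nvmlDeviceGetName(handle)
        # Some pynvml versions return bytes for string fields.
        if isinstance(name, bytes):
            name = name.decode()
        major, minor = pynvml.nvmlDeviceGetCudaComputeCapability(handle)
        print(f"GPU {i}: {name} (compute capability {major}.{minor})")
finally:
    pynvml.nvmlShutdown()
```

Keeping device introspection on the NVML side matters most before forking worker processes, since a CUDA context created in the parent cannot be safely reused after fork.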