vllm/model_loader at db35186391a2abfc6c91d703527dac20d2488107 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-25 11:37:14 +08:00

History

Woosuk Kwon 23993a7997

[Bugfix][TPU] Do not use torch.Generator for TPUs (#6981 )

2024-07-31 18:50:28 -07:00

..

__init__.py

[vlm] Remove vision language config. (#6089 )

2024-07-03 22:14:16 +00:00

loader.py

[Bugfix] Support cpu offloading with fp8 quantization (#6960 )

2024-07-31 12:47:46 -07:00

neuron.py

[Typing] Mypy typing part 2 (#4043 )

2024-04-17 17:28:43 -07:00

openvino.py

[Hardware][Intel] OpenVINO vLLM backend (#5379 )

2024-06-28 13:50:16 +00:00

tensorizer.py

[Frontend] Add FlexibleArgumentParser to support both underscore and dash in names (#5718 )

2024-06-20 17:00:13 -06:00

utils.py

[Kernel] FP8 support for MoE kernel / Mixtral (#4244 )

2024-04-24 01:18:23 +00:00

weight_utils.py

[Bugfix][TPU] Do not use torch.Generator for TPUs (#6981 )

2024-07-31 18:50:28 -07:00