vllm/model_executor at 665cbcec4b963f6ab7b696f3d7e3393a7909003d - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-02 03:07:08 +08:00

History

Woosuk Kwon 7c600440f7

Fix model docstrings (#1764 )

2023-11-23 23:04:44 -08:00

..

[Build] Avoid building too many extensions (#1624 )

2023-11-23 16:31:19 -08:00

Fix model docstrings (#1764 )

2023-11-23 23:04:44 -08:00

TP/quantization/weight loading refactor part 2 - Refactor quantized linear logic and extend quantization support to all models (#1622 )

2023-11-15 22:50:41 -08:00

__init__.py

[Quality] Add code formatter and linter (#326 )

2023-07-03 11:31:55 -07:00

input_metadata.py

Delay GPU->CPU sync in sampling (#1337 )

2023-10-30 09:01:34 -07:00

model_loader.py

Migrate linter from pylint to ruff (#1665 )

2023-11-20 11:58:01 -08:00

utils.py

TP/quantization/weight loading refactor part 2 - Refactor quantized linear logic and extend quantization support to all models (#1622 )

2023-11-15 22:50:41 -08:00

weight_utils.py

[BugFix] Fix a bug in loading safetensors (#1732 )

2023-11-20 15:51:18 -08:00