xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-01-26 13:04:41 +08:00
Directory listing: vllm/vllm/model_executor
Latest commit: e1464c3a08 by Isotr0py, 2025-11-30 06:04:28 +00:00
[Quantization] Enable compressed-tensors AWQ for Turing GPU (#29732)
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
layers
    [Quantization] Enable compressed-tensors AWQ for Turing GPU (#29732)  2025-11-30 06:04:28 +00:00
model_loader
    [Chore]: Reorganize model repo operating functions in transformers_utils (#29680)  2025-11-28 08:46:51 -08:00
models
    [Chore] Enable passing tokenizer=None into MM processor (#29724)  2025-11-29 06:25:10 -08:00
warmup
    [Core] Encoder separation for Encode-Prefill-Decode Disaggregation (#25233)  2025-11-11 18:58:33 -08:00
__init__.py
    …
custom_op.py
    …
parameter.py
    …
utils.py
    [CI] Fix mypy for vllm/v1/worker (#29037)  2025-11-21 11:36:07 +08:00