vllm/model_executor at d44e9df7d49a9bb3400b002c38c06fae2dd7d1e8 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-17 11:35:49 +08:00

History

Shanshan Shen d44e9df7d4

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487 )

Signed-off-by: shen-shanshan <467638484@qq.com>

2025-11-19 16:24:55 +00:00

..

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487 )

2025-11-19 16:24:55 +00:00

Move online quantization to model.load_weights (#26327 )

2025-11-18 16:52:41 -08:00

[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487 )

2025-11-19 16:24:55 +00:00

[Core] Encoder separation for Encode-Prefill-Decode Disaggregation (#25233 )

2025-11-11 18:58:33 -08:00

__init__.py

…

custom_op.py

…

parameter.py

…

utils.py

…