vllm/model_executor at 8de4315229f6da4eb9d29cbceb2849033ff3418a - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-03-31 11:57:02 +08:00

History

yuantao 8de4315229 Add support for openpangu_pro_moe_v2, which characterized by its different kv head size and sink kv in attention.

Signed-off-by: yuantao <2422264527@qq.com>

2025-11-15 12:00:40 +08:00

2025-11-14 22:55:42 +00:00

2025-11-12 23:43:57 +00:00

2025-11-15 12:00:40 +08:00

2025-11-11 18:58:33 -08:00

__init__.py

…

custom_op.py

…

parameter.py

2025-10-17 02:47:34 -07:00

utils.py

2025-10-18 09:48:22 -07:00