vllm/model_executor at 5a5506c6617136bceee3ce3d81277d08fd0bdb71 - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-04-20 14:57:02 +08:00


Jhao-Ting Chen 5a5506c661 enable DeepGEMM swapAB from FlashInfer for M<32 linear gemms

Signed-off-by: Jhao-Ting Chen <jhaotingc@nvidia.com>

2025-12-24 11:19:39 -08:00

2025-12-24 11:19:39 -08:00

2025-12-24 05:38:46 -08:00

2025-12-24 09:54:01 -08:00

2025-12-17 20:21:51 -08:00

__init__.py

…

custom_op.py

2025-12-15 11:02:09 +08:00

parameter.py

…

utils.py

2025-12-09 13:54:32 -08:00