vllm/kernels at 2e7035dd8cc2e6c907873462b4ac0bb9f08e0abb - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-07 23:42:23 +08:00

rasmith 7618dc973d

[CI/Build] Make test_mha_attn.py run on correct platform only and check for flash_attn_varlen_func in layer.py (#29145)

2025-12-09 20:18:17 +00:00

[CI/Build] Make test_mha_attn.py run on correct platform only and check for flash_attn_varlen_func in layer.py (#29145)

2025-12-09 20:18:17 +00:00

[Performance] Fused blockwise quant RMS norm (#27883)

2025-12-07 16:38:04 +00:00

Add SpecDec support to selective_state_update (#29488)

2025-12-08 16:45:18 -05:00

[Kernel][MoE] optimize moe_align_block_size (#29642)

2025-12-07 01:58:47 -08:00

[Kernel]Support W4A8 Grouped GEMM on Hopper (#29691)

2025-12-08 19:29:06 -08:00

__init__.py

…

allclose_default.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

quant_utils.py

[CI/Test] Fix FP8 per-tensor quant test reference scale shape (#30352)

2025-12-09 12:52:20 -06:00

test_apply_repetition_penalties.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_cache_kernels.py

[Bugfix][cache_kernels]: Fix OOB in cache_kernels.cu (#28760)

2025-11-20 02:52:02 -08:00

test_fla_layernorm_guard.py

[PERF] [Qwen3-next] Speed up gated RMSNorm (#26207)

2025-10-12 08:27:50 +00:00

test_flex_attention.py

[V0 Deprecation] Remove VLLM_USE_V1 from tests (#26341)

2025-10-07 15:42:31 +00:00

test_fused_quant_activation.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_onednn.py

[CPU] Refactor CPU attention backend (#27954)

2025-11-12 09:43:06 +08:00

test_shuffle_rows.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_top_k_per_row.py

[DeepSeek v3.2] Make top-k work for any logit values. (#27568)

2025-12-08 06:55:58 -08:00

utils.py

[Feat] Support non-gated activations in NVFP4 modelopt path (#29004)

2025-11-30 11:02:40 -05:00