vllm/kernels at e5e9067e61600eedd4e75bd1c512ec52872916aa - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-08-02 04:49:08 +08:00

History

[Bugfix] Fix test fused quant layernorm tests (#27865 )

Signed-off-by: ElizaWszola <ewszola@redhat.com>
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>

2025-11-08 14:31:33 -08:00

attention

Update Flashinfer from v0.4.1 to v0.5.2 (#27952 )

2025-11-07 16:24:42 -08:00

core

[Bugfix] Fix test fused quant layernorm tests (#27865 )

2025-11-08 14:31:33 -08:00

mamba

[V1] [Hybrid] Mamba1 Automatic Prefix Caching (#26377 )

2025-11-02 04:16:23 -08:00

moe

Bugfix: Cutlass FP8 FusedMoE bad scaling factors (#27255 )

2025-11-05 06:06:06 -05:00

quantization

[Bugfix] Use latency MOE backend as default for Flashinfer and other misc fixes (#27439 )

2025-11-07 04:18:39 -08:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

allclose_default.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

quant_utils.py

[Chore]:Extract math and argparse utilities to separate modules (#27188 )

2025-10-26 04:03:32 -07:00

test_apply_repetition_penalties.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_fla_layernorm_guard.py

[PERF] [Qwen3-next] Speed up gated RMSNorm (#26207 )

2025-10-12 08:27:50 +00:00

test_flex_attention.py

[V0 Deprecation] Remove VLLM_USE_V1 from tests (#26341 )

2025-10-07 15:42:31 +00:00

test_fused_quant_activation.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_onednn.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

test_shuffle_rows.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_top_k_per_row.py

[Deepseek v3.2] Remove extra logics in indexer (#26465 )

2025-10-21 23:34:03 +00:00

test_triton_flash_attention.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

utils.py

[Chore] Clean up pytorch helper functions in vllm.utils (#26908 )

2025-10-18 09:48:22 -07:00