vllm/kernels at 314fa8abbf9d4f6dc89eba1d8fdf80e6cd4432ed - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-31 14:47:54 +08:00

[ROCm][FEAT] Fuse DeepSeek shared experts into AITER fused_moe ops (#24097)

Signed-off-by: chenjun <junchen2@amd.com>
Signed-off-by: kliuae <kuanfu.liu@embeddedllm.com>
Co-authored-by: valarLip <103567126+valarLip@users.noreply.github.com>
Co-authored-by: TJian <tunjian.tan@embeddedllm.com>

2025-10-16 10:41:34 +08:00

attention

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)

2025-10-12 09:51:31 -07:00

core

Pruning kernel Core Tests (#26727)

2025-10-13 23:08:18 +00:00

mamba

[CI Perf]Prune Tests in kernel/mamba (#26538)

2025-10-13 18:22:31 -04:00

moe

[ROCm][FEAT] Fuse DeepSeek shared experts into AITER fused_moe ops (#24097)

2025-10-16 10:41:34 +08:00

quantization

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)

2025-10-12 09:51:31 -07:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425)

2024-05-13 23:50:09 +09:00

allclose_default.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

quant_utils.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)

2025-10-12 09:51:31 -07:00

test_apply_repetition_penalties.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_fla_layernorm_guard.py

[PERF] [Qwen3-next] Speed up gated RMSNorm (#26207)

2025-10-12 08:27:50 +00:00

test_flex_attention.py

[V0 Deprecation] Remove VLLM_USE_V1 from tests (#26341)

2025-10-07 15:42:31 +00:00

test_fused_quant_activation.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_onednn.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)

2025-10-12 09:51:31 -07:00

test_shuffle_rows.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_top_k_per_row.py

Added test_top_k_per_row to test-pipeline.yaml. (#26569)

2025-10-10 10:48:33 -04:00

test_triton_flash_attention.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

utils.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)

2025-10-12 09:51:31 -07:00