vllm/kernels at 7c9b2c8f8132e47fa9b04c0ae9a49872e0172f5f - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-02 03:17:08 +08:00

Access partial_rotary_factor from rope_parameters (#29966)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-12-04 18:42:49 +00:00

attention

[Misc] Remove redundant attention var constants (#29650)

2025-11-28 04:35:19 -08:00

core

Access partial_rotary_factor from rope_parameters (#29966)

2025-12-04 18:42:49 +00:00

mamba

[V1] [Hybrid] Mamba1 Automatic Prefix Caching (#26377)

2025-11-02 04:16:23 -08:00

moe

[Kernels] Remove BatchedTritonOrDeepGemmExperts and default fallback to Triton (#29929)

2025-12-03 20:49:00 +00:00

quantization

[Kernel][Quantization] add w4a8 support for marlin kernel (#24722)

2025-11-29 07:19:33 -08:00

__init__.py

…

allclose_default.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

quant_utils.py

[Chore]:Extract math and argparse utilities to separate modules (#27188)

2025-10-26 04:03:32 -07:00

test_apply_repetition_penalties.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_cache_kernels.py

[Bugfix][cache_kernels]: Fix OOB in cache_kernels.cu (#28760)

2025-11-20 02:52:02 -08:00

test_fla_layernorm_guard.py

[PERF] [Qwen3-next] Speed up gated RMSNorm (#26207)

2025-10-12 08:27:50 +00:00

test_flex_attention.py

[V0 Deprecation] Remove VLLM_USE_V1 from tests (#26341)

2025-10-07 15:42:31 +00:00

test_fused_quant_activation.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_onednn.py

[CPU] Refactor CPU attention backend (#27954)

2025-11-12 09:43:06 +08:00

test_shuffle_rows.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

test_top_k_per_row.py

[Deepseek v3.2] Remove extra logics in indexer (#26465)

2025-10-21 23:34:03 +00:00

utils.py

[Feat] Support non-gated activations in NVFP4 modelopt path (#29004)

2025-11-30 11:02:40 -05:00