vllm/determinism at 6366c098d7c76120b6a55a6829a2649c727a2862 - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-23 09:57:15 +08:00

History

[Perf] Optimize batch invariant BMM, 18.1% Throughput improvement, 10.7% TTFT improvement (#29345 )

Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

2025-11-26 09:38:52 -07:00

conftest.py

[Bug] Fix torch dynamo warning Dynamo detected a call to a functools.lru_cache (#29038 )

2025-11-20 16:52:23 +08:00

test_batch_invariance.py

[Perf] Optimize batch invariant BMM, 18.1% Throughput improvement, 10.7% TTFT improvement (#29345 )

2025-11-26 09:38:52 -07:00

test_online_batch_invariance.py

[Bug] Fix torch dynamo warning Dynamo detected a call to a functools.lru_cache (#29038 )

2025-11-20 16:52:23 +08:00

test_rms_norm_batch_invariant.py

…

utils.py

[Perf] Optimize batch invariant BMM, 18.1% Throughput improvement, 10.7% TTFT improvement (#29345 )

2025-11-26 09:38:52 -07:00