vllm/compile at 59f935300c4818cb10db8a0efadb431a2f169506 - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-21 21:51:23 +08:00

History

[Misc] Refactor AllReduceFusionPass. Remove parameter (#20918 )

Signed-off-by: ilmarkov <imarkov@redhat.com>
Co-authored-by: ilmarkov <imarkov@redhat.com>

2025-07-15 06:57:40 +00:00

piecewise

[Feature][ROCm] Add full graph capture support for TritonAttentionBackend (#19158 )

2025-06-17 17:03:06 -04:00

__init__.py

[torch.compile] register allreduce operations as custom ops (#8526 )

2024-09-16 22:57:57 -07:00

backend.py

[torch.compile][ROCm] Fuse quantization onto attention using a torch.compile pass (#16756 )

2025-06-12 08:31:04 -07:00

test_async_tp.py

[torch.compile][ROCm] Fuse quantization onto attention using a torch.compile pass (#16756 )

2025-06-12 08:31:04 -07:00

test_basic_correctness.py

Support embedding models in V1 (#16188 )

2025-06-18 21:36:33 -07:00

test_config.py

[BugFix] VLLM_DISABLE_COMPILE_CACHE=1 should disable all reads and writes from the cache (#20942 )

2025-07-15 01:26:18 +00:00

test_full_graph.py

[Bugfix] Upgrade depyf to 0.19 and streamline custom pass logging (#20777 )

2025-07-11 00:15:22 -07:00

test_functionalization.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_fusion_all_reduce.py

[Misc] Refactor AllReduceFusionPass. Remove parameter (#20918 )

2025-07-15 06:57:40 +00:00

test_fusion_attn.py

[Perf][fp8] Use CustomOp abstraction for fp8 quant for better perf (#19830 )

2025-07-11 04:56:28 +00:00

test_fusion.py

[Perf][fp8] Use CustomOp abstraction for fp8 quant for better perf (#19830 )

2025-07-11 04:56:28 +00:00

test_pass_manager.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_sequence_parallelism.py

[Feature] Support sequence parallelism for static fp8 quantization (#19181 )

2025-06-23 16:09:02 -04:00

test_silu_mul_quant_fusion.py

[Perf][fp8] Use CustomOp abstraction for fp8 quant for better perf (#19830 )

2025-07-11 04:56:28 +00:00

test_wrapper.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00