xinyun / vllm, a mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-03-23 18:35:53 +08:00)
vllm / tests / compile
Latest commit: b90b0852e9 by Richard Zou, 2025-05-02 15:27:43 -07:00
[easy] Print number of needed GPUs in skip message (#17594)
Signed-off-by: rzou <zou3519@gmail.com>
piecewise/: …
__init__.py: …
backend.py: …
conftest.py: …
test_basic_correctness.py: [easy] Print number of needed GPUs in skip message (#17594), 2025-05-02 15:27:43 -07:00
test_full_graph.py: Make name of compressed-tensors quant method consistent across vLLM (#17255), 2025-04-28 16:28:13 +00:00
test_functionalization.py: [torch.compile] Add torch inductor pass for fusing silu_and_mul with subsequent scaled_fp8_quant operations (#10867), 2025-05-01 07:59:28 -07:00
test_fusion.py: [Feature] support sequence parallelism using compilation pass (#16155), 2025-04-27 06:29:35 -07:00
test_pass_manager.py: [Feature] support sequence parallelism using compilation pass (#16155), 2025-04-27 06:29:35 -07:00
test_sequence_parallelism.py: [Feature] support sequence parallelism using compilation pass (#16155), 2025-04-27 06:29:35 -07:00
test_silu_mul_quant_fusion.py: [torch.compile] Add torch inductor pass for fusing silu_and_mul with subsequent scaled_fp8_quant operations (#10867), 2025-05-01 07:59:28 -07:00
test_wrapper.py: …