xinyun/vllm - vllm - 丝路新云-代码仓

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-31 20:47:58 +08:00

Author	SHA1	Message	Date
Matthew Bonanni	f29aeb5a25	Add FLASHINFER_MLA to test_mla_backends and add B200 CI run (#27663 ) Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>	2025-10-31 11:12:19 -07:00
Jee Jee Li	0384aa7150	[CI/Build] Add gpt-oss LoRA test (#27870 ) Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>	2025-10-31 22:17:21 +08:00
Wentao Ye	2bf0bcc1fc	[CI Test] Add Scheduled Integration Test (#27765 ) Signed-off-by: yewentao256 <zhyanwentao@126.com>	2025-10-30 17:29:26 -07:00
Jakub Sochacki	697f507a8e	[CI/Build][Intel] Enable performance benchmarks for Intel Gaudi 3 (#26919 ) Signed-off-by: jakub-sochacki <jakub.sochacki@wp.pl>	2025-10-31 07:57:22 +08:00
Zhewen Li	e806178d2a	[BugFix][VL] Fix FA selection on Qwen2.5-VL (#27790 ) Signed-off-by: zhewenli <zhewenli@meta.com> Co-authored-by: Roger Wang <hey@rogerw.io>	2025-10-30 07:54:44 +00:00
Huamin Li	5be1bed790	[CI/Build]Add eval config for Qwen3-235B-A22B-Instruct-2507-FP8 (#27113 ) Signed-off-by: Huamin Li <3ericli@gmail.com>	2025-10-30 07:50:56 +00:00
Kuntai Du	8bff831f0a	[Benchmark] Cleanup deprecated nightly benchmark and adjust the docstring for performance benchmark (#25786 ) Signed-off-by: KuntaiDu <kuntai@uchicago.edu>	2025-10-30 04:43:37 +00:00
Kunshang Ji	b5bae42f91	[XPU] Update latest IPEX 2.8 release (#27735 ) Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>	2025-10-30 11:17:13 +08:00
22quinn	f7a6682872	[CI/Build] Test torchrun with 8 cards (#27548 ) Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>	2025-10-29 10:26:06 -07:00
bnellnm	1891cf605a	[Bugfix] Fix modular kernel tests (#27707 ) Signed-off-by: Bill Nell <bnell@redhat.com>	2025-10-29 16:14:33 +08:00
Cyrus Leung	4fb8771cc0	[CI/Build] Move pre-commit only scripts to `tools/pre_commit` (#27657 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-29 08:04:33 +00:00
Zhewen Li	8b62495076	[Bugfix] Fix non-contiguous tensor error in `rocm_unquantized_gemm_impl` (#27605 ) Signed-off-by: zhewenli <zhewenli@meta.com>	2025-10-29 00:00:15 -07:00
Zhewen Li	83fd49b1fc	[CI/Build][Bugfix]Fix Quantized Models Test on AMD (#27712 ) Signed-off-by: zhewenli <zhewenli@meta.com>	2025-10-29 06:27:30 +00:00
Mohammad Miadh Angkad	a8c02fb5bf	[Bugfix][CI] Fix v1 attention backend tests and add CI coverage (#26597 ) Signed-off-by: Mohammad Miadh Angkad <MAngkad.BSDSBA2027@aim.edu> Signed-off-by: Mohammad Miadh Angkad <mangkad.bsdsba2027@aim.edu> Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>	2025-10-28 11:42:05 -04:00
Zhewen Li	0291fbf65c	[CI/Build] Fix amd model executor test (#27612 ) Signed-off-by: zhewenli <zhewenli@meta.com>	2025-10-28 08:58:11 +00:00
Cyrus Leung	55cba4a05c	[CI/Build] Update causal-conv1d installation (#27529 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-26 22:14:22 +08:00
Cyrus Leung	c7abff2990	Revert "[CI/Build] Use CPU for mm processing test on CI (#27522 )" (#27531 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-26 04:44:27 -07:00
Isotr0py	d63cd9ff10	[CI/Build] Use CPU for mm processing test on CI (#27522 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-10-26 13:09:18 +08:00
Jiangyun Zhu	29c9cb8007	[CI] Add tests for cudagraph (#27391 ) Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>	2025-10-25 02:37:33 +00:00
Zhewen Li	fc168c33f3	[CI/Build] Fix test_torch_utils in AMD CI (#27317 ) Signed-off-by: zhewenli <zhewenli@meta.com>	2025-10-24 12:26:00 -07:00
ioana ghiban	435be10db9	Fix AArch64 CPU Docker pipeline (#27331 ) Signed-off-by: Ioana Ghiban <ioana.ghiban@arm.com>	2025-10-24 05:11:01 -07:00
Alexei-V-Ivanov-AMD	295c7f0267	Mirroring the test definitions (2025-10-22) (#27362 ) Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>	2025-10-24 00:02:26 +08:00
Louie Tsai	3b7bdf983b	add SLA information into comparison graph for vLLM Benchmark Suite (#25525 ) Signed-off-by: Tsai, Louie <louie.tsai@intel.com> Signed-off-by: louie-tsai <louie.tsai@intel.com> Signed-off-by: Louie Tsai <louie.tsai@intel.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-10-23 08:04:59 +00:00
Alexei-V-Ivanov-AMD	49c00fe304	Mirroring changes in test-pipeline.yaml into test-amd.yaml (#27242 ) Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>	2025-10-22 09:59:45 -04:00
Huy Do	ed540d6d4c	Update release pipeline for PyTorch 2.9.0 (#27303 ) Signed-off-by: Huy Do <huydhn@gmail.com>	2025-10-22 09:18:01 +00:00
Huy Do	becb7de40b	Update PyTorch to 2.9.0+cu129 (#24994 ) Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>	2025-10-21 17:20:18 -04:00
Chen Wu	5f6cbf60d6	[Feature][Kernel]FusedMoE LoRA (#21229 ) Signed-off-by: wuchen <cntryroa@gmail.com> Signed-off-by: banjuede <lmklhc@163.com> Signed-off-by: Chen Wu <cntryroa@gmail.com> Signed-off-by: Danielle Robinson <dmmaddix@amazon.com> Signed-off-by: Jee Jee Li <pandaleefree@gmail.com> Signed-off-by: bk-201 <joy25810@foxmail.com> Co-authored-by: wuchen <wuchen@zetyun.com> Co-authored-by: Nathan Van Gheem <vangheem@gmail.com> Co-authored-by: banjuede <lmklhc@163.com> Co-authored-by: Danielle Robinson <dmmaddix@amazon.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com> Co-authored-by: bk-201 <joy25810@foxmail.com>	2025-10-21 03:01:37 +00:00
Lunwen He	0eb8f2b880	create is_in_the_same_node on cpu (#26832 ) Co-authored-by: Lunwen He <lunwenh@meta.com>	2025-10-21 02:04:14 +00:00
ioana ghiban	1c691f4a71	AArch64 CPU Docker pipeline (#26931 )	2025-10-20 07:09:40 -04:00
Tova Movshovitz	83e760c57d	[V1][Metrics][Plugin] Add plugin support for custom `StatLoggerBase` implementations (#22456 ) Signed-off-by: tovam <tovam@pliops.com>	2025-10-18 15:12:46 -07:00
Nicolò Lucchesi	99722d5f0e	[CI] Remove forbidden slash (#27112 ) Signed-off-by: NickLucche <nlucches@redhat.com>	2025-10-17 09:38:00 -07:00
Nicolò Lucchesi	2ba60ec7fe	[CI] Nixl integration tests (#27010 ) Signed-off-by: NickLucche <nlucches@redhat.com>	2025-10-17 07:13:31 -07:00
Luka Govedič	bd7157a071	[torch.compile] Enable attention and allreduce fusion without custom ops enabled (#24604 ) Signed-off-by: Luka Govedič <lgovedic@redhat.com> Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com>	2025-10-17 08:10:23 -06:00
Li, Jiang	5550ff9c25	[CI/Build] Update compressed tensor test path to fix CPU CI (#27068 ) Signed-off-by: jiang1.li <jiang1.li@intel.com>	2025-10-16 22:34:56 -07:00
Zhewen Li	9c2c2287a0	[CI/Build] Update Llama4 eval yaml (#27070 ) Signed-off-by: zhewenli <zhewenli@meta.com>	2025-10-17 04:59:47 +00:00
Michael Goin	f8a0acbdbe	[CI] Enable Blackwell Llama4 MoE tests (#26731 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-10-15 21:02:57 -06:00
Alexei-V-Ivanov-AMD	938c43ea7f	[ci] Adjusting AMD test composition 2025-10-14 (#26852 ) Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>	2025-10-15 23:52:13 +00:00
Zhewen Li	f3c378ffa7	[CI/Build] Add Qwen2.5-VL-7B-Instruct ChartQA Accuracy Tests in CI (#21810 ) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com> Signed-off-by: zhewenli <zhewenli@meta.com> Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com> Co-authored-by: Ye (Charlotte) Qi <ye.charlotte.qi@gmail.com>	2025-10-15 08:09:56 +00:00
Michael Goin	7e0ef4084a	[CI Failure] Fix torchao dep failure for Quantization Test (#26824 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-10-14 16:41:43 -07:00
Michael Goin	04b5f9802d	[CI] Raise VLLM_MAX_SIZE_MB to 500 due to failing Build wheel - CUDA 12.9 (#26722 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-10-14 10:52:05 -07:00
Alexei-V-Ivanov-AMD	d3cc8427c0	[ci] Adding the test-amd.yaml for test definitions for the AMD backend. (alternative PR) (#26718 ) Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>	2025-10-13 23:10:23 -07:00
Yibo Cai	f89f599395	[CI][Release][Arm64]: Build arm64 release for gpu arch 8.9 (#26698 )	2025-10-13 18:42:12 +00:00
liuzhenwei	27ed39a347	[XPU] Upgrade NIXL to remove CUDA dependency (#26570 ) Signed-off-by: zhenwei-intel <zhenwei.liu@intel.com>	2025-10-11 05:15:23 +00:00
Nishidha Panpaliya	8f8474fbe3	[CI/Build] Fix ppc64le CPU build and tests (#22443 ) Signed-off-by: Nishidha Panpaliya <nishidha.panpaliya@partner.ibm.com>	2025-10-11 13:04:42 +08:00
Zhengxu Chen	eef921f45e	AOT Compilation for torch.compile (Bundled) (#24274 ) Signed-off-by: zhxchen17 <zhxchen17@fb.com>	2025-10-10 19:02:11 -04:00
Will Eaton	3b780a4bbb	Update CUDA architecture list in build pipeline for 12.9.1 wheels (#26592 ) Signed-off-by: Will Eaton <wseaton@users.noreply.github.com>	2025-10-10 11:15:27 -07:00
Roberto L. Castro	96ad65b7fe	[Transform] [Quantization] Add QuTLASS support to vLLM (#24440 ) Signed-off-by: LopezCastroRoberto <roberto.lopez.castro@udc.es> Signed-off-by: Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Signed-off-by: Andrei Panferov <andrei@panferov.org> Co-authored-by: Andrei Panferov <andrei@panferov.org> Co-authored-by: Michael Goin <mgoin64@gmail.com>	2025-10-10 09:43:40 -07:00
Daniel Cámpora	0e67102d93	Added test_top_k_per_row to test-pipeline.yaml. (#26569 ) Signed-off-by: Daniel Campora <961215+dcampora@users.noreply.github.com>	2025-10-10 10:48:33 -04:00
Jason Li	f4ba2061cf	[BugFix][torch.compile] Fix fused_scaled_matmul_reduce_scatter signature for PyTorch 2.8 (#26038 ) Signed-off-by: jasonlizhengjian <jasonlizhengjian@gmail.com> Signed-off-by: <> Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>	2025-10-10 07:42:13 -07:00
Johnny Yang	59012df99b	[TPU] update TPU benchmark threshold (#25713 ) Signed-off-by: Johnny Yang <johnnyyang@google.com>	2025-10-07 13:53:09 -07:00

1 2 3 4 5 ...

821 Commits