xinyun/vllm - vllm - 丝路新云-代码仓

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-29 05:27:04 +08:00

Author	SHA1	Message	Date
Benjamin Chislett	975676d174	[Feat] Drop-in Torch CUDA Profiler (#27841) Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>	2025-11-08 14:07:37 -08:00
Kuntai Du	8bff831f0a	[Benchmark] Cleanup deprecated nightly benchmark and adjust the docstring for performance benchmark (#25786) Signed-off-by: KuntaiDu <kuntai@uchicago.edu>	2025-10-30 04:43:37 +00:00
Cyrus Leung	ecca3fee76	[Frontend] Add `vllm bench sweep` to CLI (#27639) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-29 05:59:48 -07:00
Matvei Pashkovskii	130aa8cbcf	Add load pattern configuration guide to benchmarks (#26886) Signed-off-by: Matvei Pashkovskii <mpashkov@amd.com> Signed-off-by: Matvei Pashkovskii <matvei.pashkovskii@amd.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-28 10:49:15 -07:00
Cyrus Leung	8fb7b2fab9	[Doc] Fix links to GH projects (#27530) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-26 17:55:51 +08:00
Cyrus Leung	ceacedc1f9	[Benchmark] Add plot utility for parameter sweep (#27168) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-21 20:30:03 -07:00
Huy Do	becb7de40b	Update PyTorch to 2.9.0+cu129 (#24994) Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>	2025-10-21 17:20:18 -04:00
Cyrus Leung	b3aba04e5a	[Benchmark] Convenience script for multiple parameter combinations (#27085) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-18 23:57:01 -07:00
dongbo910220	a1946c9f61	[Chore] Separate out profiling utilities from vllm.utils (#27150) Signed-off-by: dongbo910220 <1275604947@qq.com>	2025-10-18 19:12:01 +00:00
Harry Mellor	483ea64611	[Docs] Replace all explicit anchors with real links (#27087) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-17 02:22:06 -07:00
Harry Mellor	4ffd6e8942	[Docs] Reduce custom syntax used in docs (#27009) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-16 20:05:34 -07:00
Cyrus Leung	ef9676a1f1	[Doc] ruff format some Python examples (#26767) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-14 03:21:53 -07:00
Maximilien de Bayser	fe3edb4cf0	Add support for the /rerank endpoint in vllm bench serve (#26602) Signed-off-by: Max de Bayser <mbayser@br.ibm.com>	2025-10-14 04:25:43 +00:00
Harry Mellor	8fcaaf6a16	Update `Optional[x]` -> `x | None` and `Union[x, y]` to `x | y` (#26633) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-12 09:51:31 -07:00
Harry Mellor	e09d1753ec	Remove Python 3.9 support ahead of PyTorch 2.9 in v0.11.1 (#26416) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-08 10:40:42 -07:00
Cyrus Leung	44b9af5bb2	[Benchmark] Enable MM Embedding benchmarks (#26310) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-06 19:51:58 +00:00
Wenlong Wang	79aa244678	[Multi Modal] Configurable MM Profiling (#25631) Signed-off-by: wwl2755 <wangwenlong2755@gmail.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-03 03:59:10 -07:00
Cyrus Leung	d00d652998	[CI/Build] Replace `vllm.entrypoints.openai.api_server` entrypoint with `vllm serve` command (#25967) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-02 10:04:57 -07:00
Naman Lalit	9bedac9623	[Doc] Add documentation for vLLM continuous benchmarking and profiling (#25819) Signed-off-by: Naman Lalit <nl2688@nyu.edu>	2025-09-29 20:49:49 +00:00
Jialin Ouyang	c216119d64	[Core] GC Debug callback (#24829) Signed-off-by: Jialin Ouyang <jialino@meta.com> Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com> Co-authored-by: Jialin Ouyang <jialino@meta.com>	2025-09-27 17:53:31 +00:00
Cyrus Leung	27d7638b94	[Bugfix] Merge MM embeddings by index instead of token IDs (#16229) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: NickLucche <nlucches@redhat.com> Signed-off-by: Roger Wang <hey@rogerw.io> Co-authored-by: NickLucche <nlucches@redhat.com> Co-authored-by: Roger Wang <hey@rogerw.io>	2025-09-27 08:15:12 +00:00
vllmellm	0d9fe260dd	[docs] Benchmark Serving Incorrect Arg (#25474) Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>	2025-09-23 06:05:11 -07:00
Roger Wang	21da73343a	[Misc] Clean up flags in `vllm bench serve` (#25138) Signed-off-by: Roger Wang <hey@rogerw.io>	2025-09-18 12:43:33 +00:00
Harry Mellor	32baf1d036	[Docs] Clean up the contributing README (#25099) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-17 21:05:18 -07:00
yyzxw	5672ba90bd	[Docs] fix invalid doc link (#25017) Signed-off-by: zxw <1020938856@qq.com>	2025-09-16 20:53:23 -07:00
Isotr0py	5a411ef6c4	[Benchmarks] Add MMVU video dataset support and clean up deprecated datasets (#24719) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-17 03:29:43 +00:00
elvischenv	3059b9cc6b	[Doc] Add --force-overwrite option to generate_cmake_presets.py (#24375) Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>	2025-09-16 18:45:29 -07:00
Ye (Charlotte) Qi	85e0df1392	[Docs] move benchmarks README to contributing guides (#24820)	2025-09-16 05:52:57 -07:00
Woosuk Kwon	759ef49b15	Remove V0 Encoder-Decoder Support (#24907) Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>	2025-09-15 21:17:14 -07:00
Harry Mellor	361ae27f8a	[Docs] Fix formatting of transcription doc (#24676) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-11 11:18:06 -07:00
Wentao Ye	4984a291d5	[Doc] Fix Markdown Pre-commit Error (#24670) Signed-off-by: yewentao256 <zhyanwentao@126.com>	2025-09-11 09:05:59 -07:00
Nicolò Lucchesi	404c85ca72	[Docs] Add transcription support to model (#24664) Signed-off-by: NickLucche <nlucches@redhat.com>	2025-09-11 07:39:01 -07:00
Michael Yao	2f0b833a05	[Docs] Fix a tip indentation and typo (#24419) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-09-08 00:19:40 -07:00
Louie Tsai	006e7a34ae	Adding int4 and int8 models for CPU benchmarking (#23709) Signed-off-by: Tsai, Louie <louie.tsai@intel.com>	2025-09-05 20:08:50 +08:00
Julien Debache	41c80698b3	Document multi-proc method selection for profiling (#23802) Signed-off-by: jdebache <jdebache@nvidia.com>	2025-09-01 06:28:26 -07:00
Thomas Parnell	1c26b42296	[Docs] [V1] [Hybrid] Add new documentation re: contributing mamba-based models (#23824) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>	2025-08-29 18:47:58 +00:00
Didier Durand	d99c3a4f7b	[Doc]: fix typos in .md files (including those of #23751) (#23825) Signed-off-by: Didier Durand <durand.didier@gmail.com>	2025-08-28 04:38:19 -07:00
Cyrus Leung	27e8d1ea3e	[Refactor] Define MultiModalKwargsItems separate from MultiModalKwargs (#23053) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-18 09:52:00 +00:00
Louie Tsai	00e3f9da46	vLLM Benchmark suite improvement (#22119) Signed-off-by: Tsai, Louie <louie.tsai@intel.com> Signed-off-by: Louie Tsai <louie.tsai@intel.com> Co-authored-by: Li, Jiang <bigpyj64@gmail.com>	2025-08-14 07:12:17 +00:00
Harry Mellor	00976db0c3	[Docs] Fix warnings in docs build (#22588) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-08-10 05:49:51 -07:00
Harry Mellor	c49848396d	Refactor sliding window configuration to Transformers best practice (#21927) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-08-09 20:50:48 -07:00
Thomas Parnell	8a0ffd6285	Remove mamba_ssm from vLLM requirements; install inside test container using `--no-build-isolation` (#22541) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>	2025-08-08 23:05:32 -07:00
Csrayz	b917da442b	Expose PyTorch profiler configuration to environment variables (#21803) Signed-off-by: Csrayz <33659823+Csrayz@users.noreply.github.com>	2025-07-29 19:46:31 -07:00
Harry Mellor	ba5c5e5404	[Docs] Switch to better markdown linting pre-commit hook (#21851) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-07-29 19:45:08 -07:00
David Xia	7b49cb1c6b	[Doc] update Contributing page's testing section (#18272) Signed-off-by: David Xia <david@davidxia.com>	2025-07-29 10:32:46 -07:00
Kay Yan	2470419119	[Docs] Fix the outdated URL for installing from vLLM binaries (#21523) Signed-off-by: Kay Yan <kay.yan@daocloud.io> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-07-29 04:56:27 -07:00
Ye (Charlotte) Qi	01a395e9e7	[CI/Build][Doc] Clean up more docs that point to old bench scripts (#21667) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>	2025-07-27 04:02:12 +00:00
Ye (Charlotte) Qi	e7c4f9ee86	[CI/Build][Doc] Move existing benchmark scripts in CI/document/example to vllm bench CLI (#21355) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>	2025-07-26 07:10:14 -07:00
Zhou Fang	807a328bb6	[Docs] Add `requirements/common.txt` to run unit tests (#21572) Signed-off-by: Zhou Fang <fang.github@gmail.com>	2025-07-24 20:51:15 -07:00
elvischenv	5a19a6c670	[Fix] Update mamba_ssm to 2.2.5 (#21421) Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>	2025-07-24 03:25:41 -07:00

85 Commits