xinyun/vllm - vllm - 丝路新云 (Code Repository)

Mirror of https://git.datalinker.icu/vllm-project/vllm.git, last synced 2026-06-28 10:17:12 +08:00

Author	SHA1	Message	Date
samzong	ce75e15373	refactor(benchmarks): add type annotations to wait_for_endpoint parameters (#25218 ) Signed-off-by: samzong <samzong.lu@gmail.com>	2025-09-19 16:36:52 +00:00
Roger Wang	21da73343a	[Misc] Clean up flags in `vllm bench serve` (#25138 ) Signed-off-by: Roger Wang <hey@rogerw.io>	2025-09-18 12:43:33 +00:00
Punitvara	05b044e698	[Doc] Fix cross-reference warnings (#25058 ) Signed-off-by: Punit Vara <punitvara@gmail.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-18 02:05:16 -07:00
Simon Mo	a904ea78ea	[benchmark] add peak throughput metrics and plot (#23867 ) Signed-off-by: simon-mo <simon.mo@hey.com>	2025-09-17 22:30:02 -07:00
samzong	4a2d33e371	[Docs] vllm/benchmarks/datasets.py fix docstring param format. (#24970 ) Signed-off-by: samzong <samzong.lu@gmail.com>	2025-09-17 08:11:51 -07:00
samzong	47f670b03b	[Docs] improve code formatting and comments for eliminate griffe build warning. (#25010 ) Signed-off-by: samzong <samzong.lu@gmail.com>	2025-09-17 07:31:20 -07:00
Zhuohan Li	6c47f6bfa4	[Core] Remove tokenizer group in vLLM (#24078 ) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>	2025-09-17 08:42:59 +00:00
Isotr0py	5a411ef6c4	[Benchmarks] Add MMVU video dataset support and clean up deprecated datasets (#24719 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-17 03:29:43 +00:00
Ye (Charlotte) Qi	ff68035932	[Benchmarks] Throw usage error when using dataset-name random and dataset-path together (#24819 ) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>	2025-09-14 17:50:01 +00:00
Clayton Coleman	bc636f21a6	[Benchmark] Allow arbitrary headers to be passed to benchmarked endpoints (#23937 ) Signed-off-by: Clayton Coleman <smarterclayton@gmail.com>	2025-09-12 13:57:53 -07:00
Didier Durand	bcb06d7baf	[Doc]: fix typos in various files (#24726 ) Signed-off-by: Didier Durand <durand.didier@gmail.com>	2025-09-12 06:43:12 -07:00
Tomas Ruiz	ee0bc5e1b4	Enable --profile in 'vllm bench throughput' (#24575 ) Signed-off-by: Tomas Ruiz <tomas.ruiz.te@gmail.com>	2025-09-10 23:06:19 -07:00
Ekagra Ranjan	fb1a8f932a	[Benchmark] Add option to skip oversampling in benchmark (#24457 ) Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>	2025-09-09 22:00:17 +00:00
Ming Yang	1823a00d67	[Misc] Support bench serve long context (#24373 ) Signed-off-by: Ming Yang <minos.future@gmail.com>	2025-09-08 22:53:10 -07:00
Ekagra Ranjan	41183c1fe0	[Spec Decode] Fix offline spec_decode.py (#24257 ) Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com> Co-authored-by: Roger Wang <hey@rogerw.io>	2025-09-08 20:44:13 +00:00
Ekagra Ranjan	cd08636926	[Spec Decode][Benchmark] Add Blitzedit dataset (#23605 ) Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com> Co-authored-by: Roger Wang <hey@rogerw.io>	2025-09-08 10:32:52 -07:00
Ekagra Ranjan	3feeeb9fea	[Spec Decode][Benchmark] Add Spec Bench Dataset for benchmarking (#23563 ) Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>	2025-09-08 10:32:42 -07:00
co63oc	1bd007f234	fix some typos (#24071 ) Signed-off-by: co63oc <co63oc@users.noreply.github.com>	2025-09-02 20:44:50 -07:00
Jiangyun Zhu	c83c4ff815	[Benchmark] Add support for local hf dataset path in benchmark (#23999 ) Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>	2025-09-02 17:49:16 +00:00
Huy Do	6ace2f72b0	Fix writing benchmark results with tuple keys (#23633 ) Signed-off-by: Huy Do <huydhn@gmail.com>	2025-08-26 19:16:09 +08:00
Jiangyun Zhu	3ecbb14b81	[Benchmarks] add benchmark for embedding models (#23000 ) Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>	2025-08-25 23:57:08 -07:00
Breno Baldas Skuk	0cb7b065c3	Feature/benchmark/random mm data/images (#23119 ) Signed-off-by: breno.skuk <breno.skuk@hcompany.ai>	2025-08-25 01:28:35 -07:00
Jared O'Connell	31282401b6	[BugFix] Fix Python 3.9 Support (#23306 ) Signed-off-by: Jared O'Connell <46976761+jaredoconnell@users.noreply.github.com> Signed-off-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-08-20 23:23:56 -07:00
Cyrus Leung	0c31e28e95	[Bugfix] Fix extra whitespace in strings caused by newline (#23272 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-20 22:03:00 -07:00
Zhewen Li	f729023272	[CI/Build] Also check DP in benchmarks throughput script (#23038 ) Co-authored-by: Simon Mo <simon.mo@hey.com>	2025-08-20 04:09:27 +00:00
Chenheli Hua	1630cc8d0f	[Benchmarks] Add video inputs to ShareGPTDataset. (#23199 ) Signed-off-by: Chenheli Hua <huachenheli@outlook.com>	2025-08-19 23:42:31 +00:00
Ruixiang Tan	03d4235fd2	[Misc] Fix the benchmark's README and improve the error messages for the benchmark's argument checks (#22654 ) Signed-off-by: tanruixiang <tanruixiang0104@gmail.com>	2025-08-19 10:18:51 -07:00
hustxiayang	31436e8b4f	[Misc] Add request_id into benchmark_serve.py (#23065 ) Signed-off-by: yangxia <yangxiast@gmail.com>	2025-08-19 08:32:18 +00:00
Seiji Eicher	de9cb61763	Add docs for PrefixRepetitionDataset + enable usage with `vllm bench throughput` (#23012 ) Signed-off-by: Seiji Eicher <seiji@anyscale.com> Co-authored-by: Roger Wang <hey@rogerw.me>	2025-08-16 10:21:20 +00:00
Seiji Eicher	00d6cba0cf	Add PrefixRepetitionRandomDataset to `vllm bench serve` datasets (#20638 ) Signed-off-by: Seiji Eicher <seiji@anyscale.com>	2025-08-15 14:09:23 -07:00
Chenheli Hua	993d3d122b	[Benchmarks] Include image data when ShareGPT4V dataset is used. (#22955 ) Signed-off-by: Chenheli Hua <huachenheli@outlook.com>	2025-08-15 18:23:06 +00:00
Harry Mellor	bc1d02ac85	[Docs] Add comprehensive CLI reference for all large `vllm` subcommands (#22601 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-08-11 00:13:33 -07:00
Breno Baldas Skuk	65a7917be4	Fix(benchmarks): allow multiple mm contents in OpenAI Chat Completion Benchmarks (#22534 ) Signed-off-by: breno.skuk <breno.skuk@hcompany.ai>	2025-08-10 09:03:15 -07:00
lkchen	808a7b69df	[bench] Fix benchmark/serve.py to ignore unavailable results (#22382 ) Signed-off-by: Linkun <github@lkchen.net>	2025-08-07 23:15:50 -07:00
lkchen	4d4297e8fe	[Bench] Split serve.py:main into async/async versions (#22405 ) Signed-off-by: Linkun <github@lkchen.net>	2025-08-06 23:05:07 -07:00
Lionel Villard	ad6c655dde	preload heavy modules when mp method is forkserver (#22214 ) Signed-off-by: Lionel Villard <villard@us.ibm.com>	2025-08-06 20:33:24 -07:00
Seiji Eicher	6f5478298d	Use `aiohttp` connection pool for benchmarking (#21981 ) Signed-off-by: Seiji Eicher <seiji@anyscale.com>	2025-08-03 19:23:32 -07:00
Ye (Charlotte) Qi	3f36c325fa	[Benchmark] Support ready check timeout in `vllm bench serve` (#21696 ) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com> Co-authored-by: Roger Wang <hey@rogerw.me>	2025-08-03 00:52:38 -07:00
Peter Pan	533db0935d	[benchmark] add max-concurrency in result table (#21095 ) Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>	2025-07-30 01:15:43 -07:00
rongfu.leng	18cc33dd60	[bugfix] fix profile impact benchmark results (#21507 ) Signed-off-by: rongfu.leng <rongfu.leng@daocloud.io>	2025-07-27 22:44:24 -07:00
Huy Do	971948b846	Handle non-serializable objects in vllm bench (#21665 )	2025-07-27 03:35:22 +00:00
Cyrus Leung	34ddcf9ff4	[Frontend] `run-batch` supports V1 (#21541 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-07-24 20:05:55 -07:00
Jialin Ouyang	10904e6d75	[benchmark] Port benchmark request sent optimization to benchmark_serving (#21209 ) Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>	2025-07-22 05:28:00 -07:00
Jialin Ouyang	1bf65138f6	[benchmark] Sending request strictly follows the random intervals (#21108 ) Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>	2025-07-18 06:22:08 +00:00
Michael Goin	8bb43b9c9e	Add benchmark dataset for mlperf llama tasks (#20338 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-07-14 19:10:07 +00:00
Li Wang	9ff2af6d2b	[Benchmark] Parameterization of streaming loading of multimodal datasets (#20528 ) Signed-off-by: wangli <wangli858794774@gmail.com>	2025-07-09 13:35:16 +00:00
Kebe	b1c1fe35a5	[Misc] remove redundant char (#20287 ) Signed-off-by: Kebe <mail@kebe7jun.com>	2025-07-01 15:33:22 +08:00
Ekagra Ranjan	9502c38138	[Benchmark][Bug] Fix multiple bugs in bench and add args to spec_decode offline (#20083 )	2025-06-25 22:06:27 -07:00
d.transposed	c635c5f744	[Misc][Benchmarking] Add variable request-rate ("ramp-up") to the benchmarking client. (#19423 ) Signed-off-by: dtransposed <damian@damian-ml-machine.europe-west3-b.c.jetbrains-grazie.internal> Co-authored-by: dtransposed <damian@damian-ml-machine.europe-west3-b.c.jetbrains-grazie.internal> Co-authored-by: Roger Wang <hey@rogerw.me>	2025-06-24 18:41:49 +00:00
Wang, Yi	202c5df935	[Benchmark] fix request loss if "ping" is returned (#19535 ) Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-06-22 07:21:04 +00:00


70 Commits