xinyun/vllm - vllm - 丝路新云 Code Repository

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-22 04:25:01 +08:00

Author	SHA1	Message	Date
Chenheli Hua	7f2ea7074e	[Frontend][Multimodal] Allow skipping media data when UUIDs are provided. (#23950) Signed-off-by: Roger Wang <hey@rogerw.io> Signed-off-by: Chenheli Hua <huachenheli@outlook.com> Signed-off-by: Roger Wang <hey@rogerw.me> Co-authored-by: Roger Wang <hey@rogerw.io> Co-authored-by: Roger Wang <hey@rogerw.me>	2025-09-13 02:16:06 +00:00
Chenheli Hua	009d689b0c	[Core] Simplify and unify mm uuid handling & auto-generated mm hash overrides processing. (#24271) Signed-off-by: Chenheli Hua <huachenheli@outlook.com>	2025-09-09 21:36:09 -07:00
Roger Wang	749be00a98	[Core][Multimodal] Allow passing `multi_modal_uuids` as multimodal identifiers. (#23394) Signed-off-by: Roger Wang <hey@rogerw.io>	2025-08-30 18:01:22 -07:00
Roger Wang	8bf6266a17	[Multimodal] Generate mm_hash based on request metadata when caching is turned off (#23690) Signed-off-by: Roger Wang <hey@rogerw.io>	2025-08-27 20:24:31 +00:00
Cyrus Leung	69244e67e6	[Core] Use key-only cache for `BaseMultiModalProcessor` (#23018) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-27 14:19:13 +08:00
Cyrus Leung	6879cd80ae	[Refactor] Pass `tokenizer` explicitly instead of binding to prompt update (#23542) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-25 06:31:57 -07:00
Cyrus Leung	712d0f88d8	[Refactor] Dynamic `target` and `content` for prompt updates (#23411) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-24 23:39:58 -07:00
Roger Wang	79f05e4436	[Multimodal] Always enable hashing mm data (#23308) Signed-off-by: Roger Wang <hey@rogerw.io> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-21 07:23:28 -07:00
Cyrus Leung	d3f71f1224	[Refactor] Get prompt updates earlier (#23097) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-18 12:31:53 +00:00
Cyrus Leung	27e8d1ea3e	[Refactor] Define MultiModalKwargsItems separate from MultiModalKwargs (#23053) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-18 09:52:00 +00:00
Cyrus Leung	5c32143b9d	[Refactor] Defer tensor data construction in MultiModalKwargs (#23030) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-16 21:05:50 -07:00
Cyrus Leung	8c9da6be22	[Core] Simplify mm processing cache (#22457) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-07 09:47:07 -07:00
Cyrus Leung	766bc8162c	[Core] Store only the keys for multi-modal data in P0 (#22198) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-07 01:45:04 -07:00
Cyrus Leung	f5d0f4784f	[Frontend] Improve error message for too many mm items (#22114) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-08-02 02:20:38 -07:00
Cyrus Leung	32dffc2772	[Core] Rename `get_max_tokens_per_item` for backward compatibility (#20630) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-07-08 23:11:30 +00:00
Kyle Sayers	d8cf819a9a	[Core] [Bugfix] [Multimodal] Fix multimodal profiling and generation for SFT/PTQed models (#20058) Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>	2025-06-30 17:26:49 +00:00
Woosuk Kwon	2c5302fadd	[Multimodal] Optimize Qwen2/2.5-VL startup time (#19756) Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Roger Wang <hey@rogerw.me> Co-authored-by: Roger Wang <hey@rogerw.me>	2025-06-21 20:01:07 +00:00
Simon Mo	02f0c7b220	[Misc] Add SPDX-FileCopyrightText (#19100) Signed-off-by: simon-mo <simon.mo@hey.com>	2025-06-03 11:20:17 -07:00
Hyogeun Oh (오효근)	a68e293cb9	[Doc] Convert Sphinx directives ( `{class}`, `{meth}`, `{attr}`, ...) to MkDocs format for better documentation linking (#18663) Signed-off-by: Zerohertz <ohg3417@gmail.com>	2025-05-27 01:44:20 -07:00
Feng XiaoLong	4fc1bf813a	[Bugfix] Migrate to REGEX Library to prevent catastrophic backtracking (#18454) Signed-off-by: Crucifixion-Fxl <xmufxl@gmail.com> Co-authored-by: Crucifixion-Fxl <xmufxl@gmail.com>	2025-05-23 16:16:26 -07:00
Harry Mellor	2edb533af2	Replace `{func}` with mkdocs style links (#18610) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-05-23 05:51:38 -07:00
David Xia	749f792553	[Frontend] decrease import time of vllm.multimodal (#18031) Co-authored-by: Aaron Pham <Aaronpham0103@gmail.com>	2025-05-14 15:43:32 -07:00
Cyrus Leung	61e0a506a3	[Bugfix] Avoid repeatedly creating dummy data during engine startup (#17935) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-05-12 22:40:19 -07:00
Harry Mellor	d6484ef3c3	Add full API docs and improve the UX of navigating them (#17485) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-05-03 19:42:43 -07:00
Cyrus Leung	cb234955df	[Misc] Clean up input processing (#17582) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-05-02 08:11:53 -07:00
Marko Rosenmueller	77073c77bc	[Core] Prevent side-channel attacks via cache salting (#17045) Signed-off-by: Marko Rosenmueller <5467316+dr75@users.noreply.github.com>	2025-04-30 20:27:21 +08:00
Cyrus Leung	506475de5f	[Optim] Compute multimodal hash only once per item (#17314) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-29 09:40:35 +08:00
Cyrus Leung	8b464d9660	[Misc] Clean up Qwen2.5-Omni code (#17301) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-28 06:20:45 -07:00
Harry Mellor	e78587a64c	Improve-mm-and-pooler-and-decoding-configs (#16789) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-17 22:13:32 -07:00
Cyrus Leung	d9fc8cd9da	[V1] Enable multi-input by default (#15799) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-12 08:52:39 +00:00
Cyrus Leung	56d4aefa33	[VLM] Avoid unnecessary dummy multimodal data during processing (#16416) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-10 19:32:14 +00:00
Cyrus Leung	83b824c8b4	[VLM] Remove `BaseProcessingInfo.get_mm_max_tokens_per_item` (#16408) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-10 09:06:58 -07:00
Roger Wang	f2ebb6f541	[V1] Scatter and gather placeholders in the model runner (#16076) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: mgoin <mgoin64@gmail.com> Signed-off-by: Roger Wang <ywang@roblox.com> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: mgoin <mgoin64@gmail.com> Co-authored-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>	2025-04-08 10:43:41 +08:00
Isotr0py	fc0f87768a	[Bugfix] Make dummy encoder prompt padding alternative and add missing warnings (#16129) Signed-off-by: Isotr0py <2037008807@qq.com>	2025-04-07 04:07:15 +00:00
Roger Wang	af51d80fa1	Revert "[V1] Scatter and gather placeholders in the model runner" (#16075)	2025-04-04 14:50:57 -07:00
Cyrus Leung	f5722a5052	[V1] Scatter and gather placeholders in the model runner (#15712) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: mgoin <mgoin64@gmail.com> Signed-off-by: Roger Wang <ywang@roblox.com> Co-authored-by: mgoin <mgoin64@gmail.com> Co-authored-by: Roger Wang <ywang@roblox.com>	2025-04-04 21:26:44 +00:00
Bella kira	f4c98b4d4c	[Misc] Consolidate LRUCache implementations (#15481) Signed-off-by: Bella kira <2374035698@qq.com>	2025-03-27 06:43:43 +00:00
Cyrus Leung	ffa443afed	[Bugfix] Fix embedding assignment for InternVL-based models (#15086) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-03-20 03:40:13 +00:00
Cyrus Leung	3d446433ec	[Bugfix] Fix size calculation of processing cache (#15114) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-03-19 05:53:19 -07:00
Cyrus Leung	61f412187d	[Bugfix] Re-enable Gemma3 for V1 (#14980) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-03-18 23:58:22 -07:00
Rémi Delacourt	61c6a5a796	[VLM] Merged multi-modal processor for Pixtral (#12211) Signed-off-by: remi <remi@mistral.ai> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-03-15 06:28:27 -07:00
Cyrus Leung	3556a41434	[VLM] Limit multimodal input cache by memory (#14805) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-03-15 02:52:05 -07:00
Cyrus Leung	ab93f1360f	[VLM] Various cleanup and fixes (#14806) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-03-14 05:58:19 -07:00
Cyrus Leung	05fb6718f0	[Bugfix] Clean up multi-modal processors (#14417) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-03-07 10:33:38 +00:00
Roger Wang	ec79b67c77	[Misc][V1] Avoid using `envs.VLLM_USE_V1` in mm processing (#14256) Signed-off-by: Roger Wang <ywang@roblox.com>	2025-03-05 07:37:16 +00:00
Cyrus Leung	f7bee5c815	[VLM][Bugfix] Enable specifying prompt target via index (#14038)	2025-02-28 07:35:55 -08:00
Cyrus Leung	f1579b229d	[VLM] Generalized prompt updates for multi-modal processor (#13964) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-02-27 17:44:25 +00:00
Isotr0py	edf309ebbe	[VLM] Support multimodal inputs for Florence-2 models (#13320)	2025-02-27 02:06:41 -08:00
Isotr0py	ba5106e519	[LMM] Implement merged multimodal processor for whisper (#13278)	2025-02-23 01:46:03 -08:00
Cyrus Leung	4da1f667e9	[VLM] Keep track of whether prompt replacements have been applied (#13215)	2025-02-14 04:20:46 -08:00

80 Commits