xinyun/vllm - vllm - 丝路新云-代码仓

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-01-16 12:54:32 +08:00

Author	SHA1	Message	Date
sangbumlikeagod	092bb73b8a	[Frontend] add 'verbose_json' and 'timestamp' feature on Whisper Transcription/Translation (#24209 ) Signed-off-by: sangbumlikeagod <oironese@naver.com> Signed-off-by: sangbumlikeagod <98077576+sangbumlikeagod@users.noreply.github.com>	2025-12-01 18:19:17 +01:00
Cyrus Leung	f0a28bf661	[Misc] Unify tokenizer registration (#29767 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-12-01 11:34:58 +00:00
daniel-salib	014ece97c7	[Frontend] Add tool filtering support to ToolServer (#29224 ) Signed-off-by: Daniel Salib <danielsalib@meta.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-12-01 08:03:57 +00:00
wang.yuqi	62de4f4257	[Frontend] Resettle pooling entrypoints (#29634 ) Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>	2025-12-01 15:30:43 +08:00
Cyrus Leung	2afcec4dec	[Misc] Update `TokenizerLike` interface and move `get_cached_tokenizer` (#29730 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-30 14:59:47 +08:00
Cyrus Leung	fe3398fab2	[Chore] Enable passing `tokenizer=None` into MM processor (#29724 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-29 06:25:10 -08:00
Cyrus Leung	34a984274e	[Misc] Refactor tokenizer interface (#29693 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-29 04:02:21 -08:00
Didier Durand	04a797cd0e	[Doc]: fixing typos in various files. (#29717 ) Signed-off-by: Didier Durand <durand.didier@gmail.com>	2025-11-29 01:15:39 -08:00
Cyrus Leung	8d9338fae4	[Chore] Rename `Processor` to `InputProcessor` (#29682 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-28 09:35:41 -08:00
Cyrus Leung	0808eb813b	[Misc] Remove `yapf` directives (#29675 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-28 15:07:23 +00:00
HappyAmazonian	f8151b66fa	Revert "Supress verbose logs from model_hosting_container_standards (… (#29335 ) Signed-off-by: Shen Teng <sheteng@amazon.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2025-11-28 05:29:05 -08:00
maang-h	51906c8c55	[Docs] Improve `priority` parameter documentation (#29572 ) Signed-off-by: maang <maang_h@163.com> Signed-off-by: maang-h <55082429+maang-h@users.noreply.github.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-11-27 02:09:24 -08:00
Andrew Xia	b07555d26f	[responsesAPI][2] parse ResponseFunctionToolCallOutputItem (#29383 ) Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-11-25 10:27:26 -08:00
Harry Mellor	a1f2676879	Scheduled removal of `override_pooler_config` and `disable_log_requests` (#29402 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-11-25 16:08:57 +00:00
Ben Browning	e1dd706cd1	[Frontend] Respect Chat Completion parallel_tool_calls param (#26233 ) Signed-off-by: Ben Browning <bbrownin@redhat.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-11-25 09:56:15 +00:00
Andrew Xia	a685b47c57	[responsesAPI] refactor construct_input_messages (#29359 ) Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-11-25 09:47:10 +00:00
Nick Hill	db2906108a	[Misc] Streamline unique id generation (#29375 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-11-25 08:30:11 +00:00
Nick Hill	7992324f23	[BugFix] Use unique ids for different transcription prompts (#29372 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-11-25 06:55:16 +00:00
Harry Mellor	316c8492bf	Scheduled removal of `guided_*` config fields (#29326 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-25 05:24:05 +00:00
Nick Hill	a178a0b40b	[BugFix] Fix duplicate id tool-call race condition (#29355 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-11-25 01:54:26 +00:00
Aydin Abiar	656516c315	[Bugfix] properly handle nested json with llama3 tool parser (#27701 ) Signed-off-by: Aydin Abiar <aydin@anyscale.com> Signed-off-by: Aydin Abiar <62435714+Aydin-ab@users.noreply.github.com> Co-authored-by: Aydin Abiar <aydin@anyscale.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-11-24 15:28:51 +00:00
Mads Kildegård	ea38474ac5	[Frontend][Responses API] Multi-turn (with type: "output_text") support for non-harmony requests (#29175 ) Signed-off-by: Mads Kildegård <mkildegaard99@gmail.com>	2025-11-22 09:58:22 +00:00
Andrew Xia	742e9ff6b3	[responsesAPI] parse reasoning item input (#28248 ) Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2025-11-22 15:42:11 +08:00
sfbemerk	2092ce8c39	Tool Call Parser logs should not contain user input / model output except on DEBUG (#29160 ) Signed-off-by: Benjamin Merkel <benjamin.merkel@tngtech.com> Co-authored-by: Benjamin Merkel <benjamin.merkel@tngtech.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-11-21 20:57:19 +08:00
Cyrus Leung	aab0102a26	[V0 deprecation] Remove more V0 references (#29088 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-21 11:56:59 +00:00
Alex Brooks	b4734b9550	[Bugfix] Fix default MM LoRA alignment for single str prompts (#29140 ) Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>	2025-11-21 13:32:30 +08:00
Cyrus Leung	56e96b37e4	[V0 Deprecation] Remove `best_of` (#29090 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-11-21 11:40:40 +08:00
jeremyteboul	0730414999	[Core] Add audio_embeds support to chat completions (#29059 ) Signed-off-by: Jeremy Teboul <jeremyteboul@fb.com> Co-authored-by: Jeremy Teboul <jeremyteboul@fb.com>	2025-11-21 11:39:47 +08:00
Software Developer	4d01b64284	[Bugfix] - Add Trace Headers to Beam Search Path (#29100 ) Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>	2025-11-20 20:00:33 +00:00
rookie	56f45eddaf	[Frontend] Optimize beam search loop by sorting and then splicing (#19347 ) Signed-off-by: zhangguozhu <zhangguozhu@360.cn> Signed-off-by: mgoin <mgoin64@gmail.com> Co-authored-by: zhangguozhu <zhangguozhu@360.cn> Co-authored-by: mgoin <mgoin64@gmail.com>	2025-11-20 09:02:30 -08:00
Samit	371b1d4c61	[RL] Add Pause and Resume Generation for Asynchronous RL Training (#28037 ) Signed-off-by: SamitHuang <285365963@qq.com> Signed-off-by: Samit <285365963@qq.com> Signed-off-by: samithuang <285365963@qq.com> Co-authored-by: 22quinn <33176974+22quinn@users.noreply.github.com>	2025-11-20 03:01:03 -08:00
Quentin Gallouédec	1c7bcc55b8	[Frontend] Allow parsed tool arguments (#28820 ) Signed-off-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-11-19 22:20:12 -08:00
Michael Goin	67745d189f	Supress verbose logs from model_hosting_container_standards (#28949 ) Signed-off-by: mgoin <mgoin64@gmail.com>	2025-11-18 12:29:06 -08:00
Benjamin Bartels	b6e04390d3	[Bugfix] Fix Kimi-K2 tool parser concatenated tool calls parsing (#28831 ) Signed-off-by: Thomas Mao <yiyeguhu@gmail.com> Signed-off-by: bbartels <benjamin@bartels.dev> Co-authored-by: Thomas Mao <yiyeguhu@gmail.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-11-17 19:13:25 -08:00
Jay Caldwell	6f37419244	[Bugfix][Model] Prevent special token leakage in KimiK2ToolParser streaming mode (#28543 ) Signed-off-by: Jscaldwell55 <jay.s.caldwell@gmail.com>	2025-11-17 13:54:46 +08:00
Lucia Fang	b316ac6589	[V1] Support MP Executor for multi node distributed inference (#23691 ) Signed-off-by: Lu Fang <fanglu@fb.com> Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Signed-off-by: Lucia Fang <fanglu@fb.com> Signed-off-by: Lucia Fang <116399278+luccafong@users.noreply.github.com> Signed-off-by: Nick Hill <nhill@redhat.com> Co-authored-by: Nick Hill <nhill@redhat.com>	2025-11-16 09:01:21 +00:00
Zhuohan Li	dd6ac1c2bb	[RL] [V1] Remove unused device argument from reset_kv_cache (#28766 ) Signed-off-by: Zhuohan Li <zhuohan123@gmail.com>	2025-11-14 23:59:42 -08:00
Nicolò Lucchesi	6f1e7f7226	[DisaggEverything] Tokens in<>out `/generate` endpoint (#24261 ) Signed-off-by: NickLucche <nlucches@redhat.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-14 09:58:01 -07:00
Srreyansh Sethi	360bd8762f	[Frontend] Added chat-style multimodal support to /classify. (#27516 ) Signed-off-by: WorldExplored <srreyansh.sethi@gmail.com> Signed-off-by: Srreyansh Sethi <107075589+WorldExplored@users.noreply.github.com> Signed-off-by: vnadathur <glvikramn@gmail.com> Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by: vnadathur <236933696+vnadathur@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: vnadathur <glvikramn@gmail.com> Co-authored-by: wang.yuqi <noooop@126.com> Co-authored-by: wang.yuqi <yuqi.wang@daocloud.io>	2025-11-14 11:03:55 +00:00
baonudesifeizhai	c428e8d80b	Fix io processor pooling #28273 (#28484 ) Signed-off-by: baonudesifeizhai <baonudesifeizhai@gmail.com>	2025-11-13 11:34:14 +00:00
Chauncey	5c9ad138d5	[Frontend] supports interleaved thinking (#28531 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-11-13 16:14:13 +08:00
Andrew Xia	1a0b157a2e	[Frontend][responsesAPI][1/n] convert responses API tool input to chat completions tool format (#28231 ) Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-11-13 04:47:22 +00:00
Andrew Xia	7c38ed0f1c	[Frontend] split append tool output (#28333 ) Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-11-13 04:03:23 +00:00
Yanan Cao	48c879369f	[Frontend] Change CompilationMode to a proper Enum (#28165 ) Signed-off-by: Yanan Cao <gmagogsfm@gmail.com>	2025-11-11 19:46:18 -05:00
Zuyi Zhao	bca74e32b7	[Frontend] Add sagemaker_standards dynamic lora adapter and stateful session management decorators to vLLM OpenAI API server (#27892 ) Signed-off-by: Zuyi Zhao <zhaozuy@amazon.com> Signed-off-by: Shen Teng <sheteng@amazon.com> Co-authored-by: Shen Teng <sheteng@amazon.com> Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>	2025-11-11 04:57:01 +00:00
Jialin Ouyang	b30372cbd0	[Perf] Move gc.freeze logic from EngineCoreProc to EngineCore for better coverage (#27896 ) Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>	2025-11-10 15:34:18 -08:00
Andrew Xia	4b94ed8f92	[Frontend][2/n] remove empty content from _parse_tool_calls_from_content (#28331 ) Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-11-10 14:07:49 -08:00
Benjamin Chislett	975676d174	[Feat] Drop-in Torch CUDA Profiler (#27841 ) Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>	2025-11-08 14:07:37 -08:00
Harry Mellor	d9ab1ad9d1	`reasoning_content` -> `reasoning` (#27752 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-11-08 12:15:08 +00:00
Iceber Gu	e0d6b4a867	[CLI] add --max-tokens to `vllm complete` (#28109 ) Signed-off-by: Iceber Gu <caiwei95@hotmail.com>	2025-11-07 12:21:40 +00:00

1 2 3 4 5 ...

1101 Commits