xinyun/vllm - vllm - 丝路新云-代码仓

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-16 05:35:01 +08:00

Author	SHA1	Message	Date
Matthew Bonanni	44b5ce956d	[Bugfix] In LongRoPE, decide short vs long based on max_model_len (#27431 ) Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>	2025-10-28 12:00:56 +00:00
Andrew Xia	53a56e658b	[gpt-oss][2/N] Support input_messages in responsesRequest (#26962 ) Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-10-27 23:15:49 +00:00
Ben Browning	3b96f85c36	[Chore]: Stream tokens vs characters in tool call parser tests (#26513 ) Signed-off-by: Ben Browning <bbrownin@redhat.com>	2025-10-27 23:06:25 +08:00
Chauncey	a4fc21895e	[Bugfix] Fixed when return_token_ids=False, the first event still contains prompt_token_ids. (#27561 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-27 11:06:43 +00:00
Yeshwanth N	71b1c8b667	[Chore]:Extract math and argparse utilities to separate modules (#27188 ) Signed-off-by: Yeshwanth Surya <yeshsurya@gmail.com> Signed-off-by: Yeshwanth N <yeshsurya@gmail.com> Signed-off-by: yeshsurya <yeshsurya@gmail.com>	2025-10-26 04:03:32 -07:00
Chauncey	41a62564a7	Fix test named tool use (#27458 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-24 20:27:45 +08:00
strinczer	074475541a	[Bugfix] Fix Pydantic union resolution for ResponseFunctionToolCall in Responses API (#26706 ) Signed-off-by: Shai Trinczer <strinczer@icloud.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com> Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>	2025-10-23 22:53:42 -07:00
Cyrus Leung	fe2016de2d	[CI/Build] Remove unnecessary flags from test registry (#27353 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-23 14:42:40 +00:00
Chauncey	d00ce29d89	[CI] Reorganize entrypoints tests (#27403 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-23 10:10:06 +00:00
Russell Bryant	58fab50d82	[Frontend] Require flag for loading text and image embeds (#27204 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-22 15:52:02 +00:00
ExtReMLapin	a4c29e6e82	fixed reasoning streaming with tool_choice="required" (#24108 ) Signed-off-by: CNE Pierre FICHEPOIL <pierre-1.fichepoil@gendarmerie.interieur.gouv.fr> Signed-off-by: ExtReMLapin <3909752+ExtReMLapin@users.noreply.github.com> Co-authored-by: CNE Pierre FICHEPOIL <pierre-1.fichepoil@gendarmerie.interieur.gouv.fr> Co-authored-by: Chauncey <chaunceyjiang@gmail.com>	2025-10-22 09:42:55 +00:00
iAmir97	7a6c8c3fa1	[Chore] Separate out `vllm.utils.network_utils` (#27164 ) Signed-off-by: iAmir97 <Amir.balwel@embeddedllm.com> Co-authored-by: iAmir97 <Amir.balwel@embeddedllm.com>	2025-10-19 03:06:32 -07:00
dongbo910220	83004020fd	[Test] Add test for /health endpoint on engine failure (#26074 ) Signed-off-by: dongbo910220 <1275604947@qq.com>	2025-10-18 09:59:05 +00:00
Hanchenli	7c572544e4	[GPT-OSS] Structure_Tag support for gpt-oss tool-call in cot (#25515 ) Signed-off-by: Hanchenli <lihanc2002@gmail.com> Signed-off-by: Hanchenli <61769611+Hanchenli@users.noreply.github.com> Signed-off-by: Wei Wei <wwei6@meta.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Wei Wei <wwei6@meta.com> Co-authored-by: Wei Wei <weiweinpu@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2025-10-17 21:55:54 -07:00
Tahsin Tunan	43721bc67f	[CI] Replace large models with tiny alternatives in tests (#24057 ) Signed-off-by: Tahsin Tunan <tahsintunan@gmail.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Nick Hill <nhill@redhat.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-16 15:51:27 +01:00
Cyrus Leung	76f0d05bc6	[CI/Build] Update expected beam search output for Phi3V (#26978 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-16 05:12:44 +00:00
InChang Jeong	0ecc553ee6	[Bugfix] reasoning_parser parameter handling in run_batch.py (#26225 ) Signed-off-by: inc-jeong <inc.jeong@navercorp.com> Signed-off-by: InChang Jeong <inc.jeong@navercorp.com> Co-authored-by: USER <user@AL02367916.local>	2025-10-16 10:24:05 +08:00
Pradeep Dasigi	4794c2bd92	Olmo 3 tool parser and tests (#26143 ) Signed-off-by: Pradeep Dasigi <pradeepd@allenai.org>	2025-10-15 16:36:12 +00:00
Cyrus Leung	b8a4572157	[Misc] Use helper function to generate dummy messages in OpenAI MM tests (#26875 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-15 07:17:37 +00:00
Chauncey	df850c4912	[Feature][Responses API] Stream Function Call - harmony (#24317 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-14 08:31:43 -07:00
Chauncey	780eb03d9b	[CI] Fix test_tool_id_kimi_k2 (#26787 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-14 10:27:07 +00:00
Max Wittig	fd85c9f426	[Bugfix][FE]: Always include usage with `--enable-force-include-usage` (#20983 ) Signed-off-by: Max Wittig <max.wittig@siemens.com> Signed-off-by: Antoine Auger <antoineauger@users.noreply.github.com> Co-authored-by: Antoine Auger <antoineauger@users.noreply.github.com>	2025-10-14 09:17:39 +02:00
Jialin Ouyang	35bc22f23c	[ResponseAPI] Further polish message serialization and unit tests (#26728 ) Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>	2025-10-13 23:31:35 +00:00
Jialin Ouyang	4073c82c4e	[ResponseAPI] Simplify input/output message serialization (#26620 ) Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>	2025-10-13 09:59:15 +00:00
Harry Mellor	8fcaaf6a16	Update `Optional[x]` -> `x \| None` and `Union[x, y]` to `x \| y` (#26633 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-12 09:51:31 -07:00
Chauncey	910abdbd08	[Bugfix] fixed top_logprobs: -1 does not appear to work as intended (#26470 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-11 00:41:17 +08:00
Chauncey	1e6848a65d	[CI] fix test_run_batch.py::test_completions - AssertionError (#26578 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-10 22:16:28 +08:00
Chauncey	720d3cd0f0	[CI] fix ruff format (#26579 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-10 03:02:12 -07:00
Ashwin Phadke	ab196edefb	Remove LoRA bias support (#25807 ) Signed-off-by: Ashwin Phadke <ashwinphadke12@rediffmail.com> Signed-off-by: Ashwin Phadke <23502062+ashwin-phadke@users.noreply.github.com> Signed-off-by: Jee Jee Li <pandaleefree@gmail.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>	2025-10-10 09:50:33 +00:00
Luis Tomas Bolivar	3ee202ea1e	[GPT-OSS] Add support for arrays at tool message content (#25593 ) Signed-off-by: Luis Tomas Bolivar <ltomasbo@redhat.com>	2025-10-10 09:00:45 +00:00
Cyrus Leung	ad430a67ca	[Metrics] Log multi-modal cache stats and fix reset (#26285 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-10 01:45:55 -07:00
Ben Browning	da4455609d	[Chore]: One pythonic tool parser test uses the wrong parser (#26515 ) Signed-off-by: Ben Browning <bbrownin@redhat.com>	2025-10-10 04:03:55 +00:00
Cyrus Leung	4bdf7ac593	[Bugfix] Fix SHM cache initialization (#26427 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-09 02:48:04 -07:00
Cyrus Leung	dc7976dd9f	[Misc] Upgrade more code to Python 3.10 (#26463 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-09 10:43:53 +01:00
Cyrus Leung	1e4ecca1d0	[V0 Deprecation] Remove `VLLM_USE_V1` from tests (#26341 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-07 15:42:31 +00:00
Andrew Xia	185d8ed44f	[responsesAPI][bugfix] serialize harmony messages (#26185 ) Signed-off-by: Andrew Xia <axia@meta.com> Co-authored-by: Ye (Charlotte) Qi <yeq@meta.com>	2025-10-07 07:07:53 +00:00
Harry Mellor	6c04638214	Fix per file ruff ignores related to line length (#26262 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-06 05:12:40 +00:00
wuhang	91ac7f764d	[CI][gpt-oss] Enable python tool tests in CI (#24315 ) Signed-off-by: wuhang <wuhang6@huawei.com>	2025-10-06 04:20:06 +00:00
Harry Mellor	1c0c68202c	Fix per file ruff ignores related to typing (#26254 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-05 16:37:55 +00:00
Harry Mellor	d6953beb91	Convert formatting to use `ruff` instead of `yapf` + `isort` (#26247 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-05 07:06:22 -07:00
Cyrus Leung	a964e5e6c3	[Bugfix] Allow `--skip-tokenizer-init` with `echo and return_token_ids` (#26238 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-05 05:38:53 +00:00
Cyrus Leung	119f00630b	[Renderer] Clean up renderer code (#26216 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-04 17:05:29 +00:00
Ben Browning	ea25a76c05	[BugFix] Use async Mistral Tokenizer in Chat Completions (#26134 ) Signed-off-by: Ben Browning <bbrownin@redhat.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2025-10-04 09:42:08 +08:00
Andrew Xia	831b124151	[responsesAPI] add better error messaging for long prompts (#25724 ) Signed-off-by: Andrew Xia <axia@meta.com> Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-10-03 14:33:13 -07:00
Yang Liu	812b7f54a8	[Renderer] Move Processor out of AsyncLLM (#24138 ) Signed-off-by: Yang <lymailforjob@gmail.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-03 11:29:45 +00:00
kyt	2ed3f20dba	[openai] Fix missing tool usage check (system message) (#24768 ) Signed-off-by: kyt <eluban4532@gmail.com>	2025-10-03 18:55:44 +08:00
Andrew Xia	e5017cd6d6	[gpt-oss] disable tool server initialization if no tool in request (#25790 ) Signed-off-by: Andrew Xia <axia@meta.com> Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-10-03 05:08:35 +00:00
Andrew Xia	5db1870bb9	[gpt-oss] use vLLM instead of openai types for streaming (#25186 ) Signed-off-by: Andrew Xia <axia@meta.com> Signed-off-by: Andrew Xia <axia@fb.com> Co-authored-by: Andrew Xia <axia@fb.com>	2025-09-30 22:47:07 +00:00
Andrew Sansom	78a47f87ce	Test Prompt Embeds/LoRA compatibility and Enable LoRA Support for OPT Models (#25717 ) Signed-off-by: Andrew Sansom <andrew@protopia.ai>	2025-09-30 08:10:58 +08:00
Russell Bryant	3958b96bf5	Add option to restrict media domains (#25783 ) Signed-off-by: Chenheli Hua <huachenheli@outlook.com> Signed-off-by: Russell Bryant <rbryant@redhat.com> Co-authored-by: Chenheli Hua <huachenheli@outlook.com>	2025-09-27 01:23:52 +00:00

1 2 3 4 5 ...

393 Commits