xinyun/vllm - vllm - 丝路新云-代码仓

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-08-03 21:37:12 +08:00

Author	SHA1	Message	Date
paolovic	da224daaa9	[Bugfix] add hf_token to EngineArgs (#16093 ) Signed-off-by: paolovic <paul-philipp.luley@uzh.ch> Co-authored-by: paolovic <paul-philipp.luley@uzh.ch>	2025-04-06 14:47:33 +00:00
Isotr0py	c2a9671510	[Misc] Improve model redirect to accept json dictionary (#16119 ) Signed-off-by: Isotr0py <2037008807@qq.com>	2025-04-06 05:51:45 -07:00
Reid	86cbd2eee9	[Misc] improve gguf check (#15974 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-04 01:33:36 +00:00
yihong	37bfee92bf	fix: better error message for get_config close #13889 (#15943 ) Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2025-04-03 03:53:19 +00:00
yihong	70fedd0f79	fix: Comments to English for better dev experience (#15768 ) Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2025-03-30 10:47:57 -07:00
pengyuange	de1cb38769	[Model] Support Skywork-R1V (#15397 ) Signed-off-by: jiacai.liu <932997367@qq.com> Co-authored-by: jiacai.liu <932997367@qq.com>	2025-03-28 20:39:21 -07:00
Kebe	432cf22a6a	[Bugfix] Fix regex compile display format (#15368 ) Signed-off-by: Kebe <mail@kebe7jun.com>	2025-03-28 08:58:44 -07:00
wang.yuqi	3f532cb6a6	[Misc] Use model_redirect to redirect the model name to a local folder. (#14116 )	2025-03-27 02:21:23 -07:00
Bryan Lu	781d056280	[Feature] Enhance EAGLE Architecture with Proper RMS Norms (#14990 ) Signed-off-by: Bryan Lu <yuzhelu@amazon.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-03-26 08:24:07 +00:00
Nick Hill	da6ea29f7a	[V1] Avoid redundant input processing in n>1 case (#14985 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-03-20 22:24:10 -07:00
Wang Ran (汪然)	c607a2652b	Fixing Imprecise Type Annotations (#15192 )	2025-03-20 01:19:55 -07:00
Brayden Zhong	8b3e94a357	[Model] Remove duplicated message check in Mistral chat completion request (#15069 ) Signed-off-by: Brayden Zhong <b8zhong@uwaterloo.ca>	2025-03-19 05:09:32 +00:00
Rémi Delacourt	61c6a5a796	[VLM] Merged multi-modal processor for Pixtral (#12211 ) Signed-off-by: remi <remi@mistral.ai> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-03-15 06:28:27 -07:00
Harry Mellor	3b352a2f92	Correct capitalisation: `VLLM` -> `vLLM` (#14562 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-03-10 16:36:21 +00:00
Jee Jee Li	6a84164add	[Bugfix] Add file lock for ModelScope download (#14060 ) Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>	2025-03-01 06:10:28 +00:00
Florian Greinacher	215bf150a6	[Bugfix] Handle None parameters in Mistral function calls. (#13786 )	2025-02-26 03:06:21 -08:00
Jee Jee Li	5157338ed9	[Misc] Improve LoRA spelling (#13831 )	2025-02-25 23:43:01 -08:00
Chen1022	340e39e387	Fix string parsing error (#13825 )	2025-02-25 08:20:29 -08:00
Chen1022	32c3b6bfd1	[Misc]Clarify Error Handling for Non-existent Model Paths and HF Repo IDs (#13724 ) Signed-off-by: Chen-0210 <chenjincong11@gmail.com> Co-authored-by: Michael Goin <mgoin64@gmail.com>	2025-02-25 10:12:19 +00:00
Shanshan Shen	2d87d7d1ac	[Bugfix] Modify modelscope api usage in transformer_utils (#13807 )	2025-02-25 00:36:07 -08:00
cjackal	51010a1807	[Misc] set single whitespace between log sentences (#13771 ) Signed-off-by: cjackal <44624812+cjackal@users.noreply.github.com>	2025-02-25 10:26:12 +08:00
燃	041e294716	[Misc] add mm_processor_kwargs to extra_body for Qwen2.5-VL (#13533 )	2025-02-19 23:04:30 -08:00
Isotr0py	550d97eb58	[Misc] Avoid calling unnecessary `hf_list_repo_files` for local model path (#13348 ) Signed-off-by: isotr0py <2037008807@qq.com>	2025-02-19 18:57:48 +00:00
Cyrus Leung	377d10bd14	[VLM][Bugfix] Pass processor kwargs properly on init (#13516 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-02-19 13:13:50 +00:00
Kevin H. Luu	d5d214ac7f	[1/n][CI] Load models in CI from S3 instead of HF (#13205 ) Signed-off-by: <> Co-authored-by: EC2 Default User <ec2-user@ip-172-31-20-117.us-west-2.compute.internal>	2025-02-19 07:34:59 +00:00
Isotr0py	29fc5772c4	[Bugfix] Remove noisy error logging during local model loading (#13458 )	2025-02-18 03:15:48 -08:00
r.4ntix	ce77eb9410	[Bugfix] Fix VLLM_USE_MODELSCOPE issue (#13384 )	2025-02-17 14:22:01 +00:00
Rafael Vasquez	314cfade02	[Frontend] Generate valid tool call IDs when using `tokenizer-mode=mistral` (#12332 )	2025-02-12 08:29:56 -08:00
Shiyan Deng	f1042e86f0	[Misc] AMD Build Improvements (#12923 )	2025-02-12 02:36:10 -08:00
Maximilien de Bayser	7c4033acd4	Further reduce the HTTP calls to huggingface.co (#13107 )	2025-02-12 02:34:09 -08:00
Keyun Tong	3ee696a63d	[RFC][vllm-API] Support tokenizer registry for customized tokenizer in vLLM (#12518 ) Signed-off-by: Keyun Tong <tongkeyun@gmail.com>	2025-02-12 12:25:58 +08:00
Florian Greinacher	cb080f32e3	[Bugfix] Support missing tool parameters in mistral tokenizer (#12884 ) Signed-off-by: Florian Greinacher <florian.greinacher@siemens.com>	2025-02-11 03:33:33 +00:00
Farzad Abdolhosseini	08b2d845d6	[Model] Ultravox Model: Support v0.5 Release (#12912 ) Signed-off-by: Farzad Abdolhosseini <farzad@fixie.ai>	2025-02-10 22:02:48 +00:00
Kevin H. Luu	fde71262e0	[misc] Add retries with exponential backoff for HF file existence check (#13008 )	2025-02-10 01:15:02 -08:00
Patrick von Platen	d366ccc4e3	[RFC] [Mistral] FP8 format (#10130 ) Signed-off-by: mgoin <mgoin64@gmail.com> Co-authored-by: mgoin <mgoin64@gmail.com>	2025-02-08 14:12:53 -07:00
zifeitong	d01f66b039	[Bugfix] Fix multi-round chat error when mistral tokenizer is used (#12859 ) Signed-off-by: Zifei Tong <zifeitong@gmail.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-02-08 07:04:34 +00:00
afeldman-nm	0630d4537a	[V1] Logprobs and prompt logprobs support (#9880 ) This PR is adding support for sample logprobs & prompt logprobs to vLLM v1. New behavior: - During model execution, model runner computes sample logprobs (if user-provided logprobs setting is not None) and prompt logprobs (if user-provided prompt_logprobs setting is not None). For both sample and prompt logprobs, the engine core returns 3 vectors: token ids, token logprob values, token ranks. Ranks reflect tokens' 1-indexed positions in the vocabulary vector after sorting the vocabulary by log probability in descending order. - In scheduler.update_from_output(), sample and prompt logprobs are incorporated into the EngineCoreOutput data structure which is transferred to the engine client. If multiprocessing is enabled, then sample and prompt logprobs will be (de)serialized when the EngineCoreOutput data structure is (de)serialized. - During output processing, the LogprobsProcessor transforms the triplet of token ids, token logprobs values, and token ranks into the OpenAI-compatible List[Dict[token id,Logprob]] format (for sample and prompt logprobs respectively.) - Each Logprob instance (whether sample- or prompt-) consists of a token's log-probability, rank, and detokenized string representation. Note that logprob detokenization is handled by the LogprobsProcessor not the detokenizer. Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com> Signed-off-by: Nick Hill <nhill@redhat.com> Signed-off-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com> Co-authored-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com> Co-authored-by: Nick Hill <nhill@redhat.com>	2025-02-07 07:26:20 -08:00
Maximilien de Bayser	6e1fc61f0f	Prevent unecessary requests to huggingface hub (#12837 )	2025-02-06 21:37:41 -08:00
Kevin H. Luu	e152f29502	[misc] Reduce number of config file requests to HuggingFace (#12797 ) Signed-off-by: EC2 Default User <ec2-user@ip-172-31-20-117.us-west-2.compute.internal> Signed-off-by: <> Co-authored-by: EC2 Default User <ec2-user@ip-172-31-20-117.us-west-2.compute.internal>	2025-02-06 14:59:18 +00:00
youkaichao	20579c0fae	make sure mistral_common not imported for non-mistral models (#12669 ) When people use deepseek models, they find that they need to solve cv2 version conflict, see https://zhuanlan.zhihu.com/p/21064432691 . I added the check, and make all imports of `cv2` lazy. --------- Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-02-03 13:40:25 +08:00
Russell Bryant	e489ad7a21	[Misc] Add SPDX-License-Identifier headers to python source files (#12628 ) - Add SPDX license headers to python source files - Check for SPDX headers using pre-commit commit 9d7ef44c3cfb72ca4c32e1c677d99259d10d4745 Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:18:24 2025 -0500 Add SPDX license headers to python source files This commit adds SPDX license headers to python source files as recommended to the project by the Linux Foundation. These headers provide a concise way that is both human and machine readable for communicating license information for each source file. It helps avoid any ambiguity about the license of the code and can also be easily used by tools to help manage license compliance. The Linux Foundation runs license scans against the codebase to help ensure we are in compliance with the licenses of the code we use, including dependencies. Having these headers in place helps that tool do its job. More information can be found on the SPDX site: - https://spdx.dev/learn/handling-license-info/ Signed-off-by: Russell Bryant <rbryant@redhat.com> commit 5a1cf1cb3b80759131c73f6a9dddebccac039dea Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:36:32 2025 -0500 Check for SPDX headers using pre-commit Signed-off-by: Russell Bryant <rbryant@redhat.com> --------- Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-02-02 11:58:18 -08:00
Roger Wang	7a8987dac5	[Bugfix] Gracefully handle huggingface hub http error (#12571 )	2025-01-31 08:19:35 +00:00
Harry Mellor	823ab79633	Update `pre-commit` hooks (#12475 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-01-27 17:23:08 -07:00
omer-dayan	5e5630a478	[Bugfix] Path join when building local path for S3 clone (#12353 ) Signed-off-by: Omer Dayan (SW-GPU) <omer@run.ai>	2025-01-24 11:06:07 +08:00
Cyrus Leung	cd7b6f0857	[VLM] Avoid unnecessary tokenization (#12310 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-01-22 11:08:31 +00:00
Cyrus Leung	b37d82791e	[Model] Upgrade Aria to transformers 4.48 (#12203 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-01-20 17:58:48 +08:00
Cyrus Leung	630eb5b5ce	[Bugfix] Fix multi-modal processors for transformers 4.48 (#12187 )	2025-01-18 19:16:34 -08:00
Isotr0py	02798ecabe	[Model] Port deepseek-vl2 processor, remove dependency (#12169 ) Signed-off-by: Isotr0py <2037008807@qq.com>	2025-01-18 13:59:39 +08:00
Kunshang Ji	54cacf008f	[Bugfix] Mistral tokenizer encode accept list of str (#12149 ) Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>	2025-01-17 16:47:53 +00:00
Joe Runde	edce722eaa	[Bugfix] use right truncation for non-generative tasks (#12050 ) Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>	2025-01-16 00:31:01 +08:00

1 2 3 4 5

224 Commits