xinyun/vllm - vllm - 丝路新云 code repository

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-13 03:04:55 +08:00

Author	SHA1	Message	Date
Harry Mellor	00976db0c3	[Docs] Fix warnings in docs build (#22588 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-08-10 05:49:51 -07:00
22quinn	54de71d0df	[Sampler] Support returning all logprobs or logits (#21792 ) Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>	2025-08-04 03:04:12 -07:00
Harry Mellor	313ae8c16a	[Deprecation] Remove everything scheduled for removal in v0.10.0 (#20979 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-07-15 15:57:53 +00:00
afeldman-nm	dfada85eee	[Frontend] Expose custom args in OpenAI APIs (#16862 ) Signed-off-by: Andrew Feldman <afeldman@neuralmagic.com> Signed-off-by: Andrew Feldman <afeldman@redhat.com> Co-authored-by: Nick Hill <nhill@redhat.com>	2025-06-18 17:41:11 -07:00
Simon Mo	02f0c7b220	[Misc] Add SPDX-FileCopyrightText (#19100 ) Signed-off-by: simon-mo <simon.mo@hey.com>	2025-06-03 11:20:17 -07:00
CYJiang	d054da1992	[Misc] fix: add miss best_of param validation (#18555 ) Signed-off-by: googs1025 <googs1025@gmail.com>	2025-06-03 11:02:07 -07:00
Harry Mellor	c6798baa9c	Change `top_k` to be disabled with `0` (still accept `-1` for now) (#17773 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-05-09 10:01:49 +00:00
Qiong Zhou Huang	39956efb3f	[Bugfix] Fix bad words for Mistral models (#17753 ) Signed-off-by: Qiong Zhou Huang <qiong@phonic.co>	2025-05-07 23:32:10 -07:00
Yang Wang	b8b0859b5c	add more pytorch related tests for torch nightly (#17422 ) Signed-off-by: Yang Wang <elainewy@meta.com>	2025-05-02 03:29:59 -07:00
Gabriel Marinho	1c2bc7ead0	Truncation control for embedding models (#14776 ) Signed-off-by: Gabriel Marinho <gmarinho@ibm.com> Signed-off-by: Max de Bayser <mbayser@br.ibm.com> Co-authored-by: Max de Bayser <mbayser@br.ibm.com>	2025-04-30 09:24:57 +08:00
Harry Mellor	a6977dbd15	Simplify (and fix) passing of guided decoding backend options (#17008 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-29 19:02:23 +00:00
Nick Hill	8262a3e23b	[Misc] Validate `stop_token_ids` contents (#17268 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-04-28 03:54:05 +00:00
Russell Bryant	f8acd01ff7	[V1] Add `structural_tag` support using xgrammar (#17085 )	2025-04-26 14:06:37 +00:00
Chauncey	acba33a0f1	[Bugfix] Fix the issue where llm.generate cannot be called repeatedly after setting GuidedDecodingParams (#16767 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com> Signed-off-by: Russell Bryant <rbryant@redhat.com> Co-authored-by: Russell Bryant <rbryant@redhat.com>	2025-04-22 06:02:20 +00:00
Alex Brooks	f8f9c0ba62	[Bugfix] Don't set an upper bound on repetition penalty (#16403 ) Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com> Co-authored-by: Nick Hill <nhill@redhat.com>	2025-04-11 14:19:40 +08:00
yihong	04149cce27	[BugFix] fix some typos found by typos. (#16314 ) Signed-off-by: yihong0618 <zouzou0208@gmail.com>	2025-04-09 03:43:59 -07:00
Nick Hill	623e2ed29f	[BugFix][V1] Quick fix for min_tokens with multiple EOS (#15407 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-03-24 15:58:59 -07:00
Nick Hill	fc1f67715d	[BugFix][V1] Fix overhead related to bad_words sampling when not in use (#14894 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-03-16 14:53:34 -07:00
Nick Hill	4059adc31b	[Misc][Minor] Simplify `SamplingParams.__post_init__()` (#14772 ) Signed-off-by: Nick Hill <nhill@redhat.com>	2025-03-14 11:44:20 +08:00
22quinn	eb8b5eb183	[V1] Support bad_words in sampler (#13376 ) Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com> Co-authored-by: Nick Hill <nhill@redhat.com>	2025-03-08 14:50:26 -08:00
Aviv Keshet	4aae667668	[core] add `extra_args` to `SamplingParams` (#13300 ) Signed-off-by: Aviv Keshet <akeshet@scaledcognition.com>	2025-03-08 14:41:18 +08:00
Harry Mellor	bf0560bda9	Reinstate `best_of` for V0 (#14356 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-03-06 08:34:22 -08:00
Vincent	a4f1ee35d6	Deprecate `best_of` Sampling Parameter in anticipation for vLLM V1 (#13997 ) Signed-off-by: vincent-4 <vincentzhongy+githubvincent4@gmail.com> Signed-off-by: Brayden Zhong <b8zhong@uwaterloo.ca> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Brayden Zhong <b8zhong@uwaterloo.ca> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-03-05 20:22:43 +00:00
Harry Mellor	cf069aa8aa	Update deprecated Python 3.8 typing (#13971 )	2025-03-02 17:34:51 -08:00
Joe Runde	bfbc0b32c6	[Frontend] Add backend-specific options for guided decoding (#13505 ) Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>	2025-02-20 15:07:58 -05:00
Lu Fang	6224a9f620	Support logit_bias in v1 Sampler (#13079 )	2025-02-14 04:34:59 -08:00
Russell Bryant	e489ad7a21	[Misc] Add SPDX-License-Identifier headers to python source files (#12628 ) - Add SPDX license headers to python source files - Check for SPDX headers using pre-commit commit 9d7ef44c3cfb72ca4c32e1c677d99259d10d4745 Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:18:24 2025 -0500 Add SPDX license headers to python source files This commit adds SPDX license headers to python source files as recommended to the project by the Linux Foundation. These headers provide a concise way that is both human and machine readable for communicating license information for each source file. It helps avoid any ambiguity about the license of the code and can also be easily used by tools to help manage license compliance. The Linux Foundation runs license scans against the codebase to help ensure we are in compliance with the licenses of the code we use, including dependencies. Having these headers in place helps that tool do its job. More information can be found on the SPDX site: - https://spdx.dev/learn/handling-license-info/ Signed-off-by: Russell Bryant <rbryant@redhat.com> commit 5a1cf1cb3b80759131c73f6a9dddebccac039dea Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:36:32 2025 -0500 Check for SPDX headers using pre-commit Signed-off-by: Russell Bryant <rbryant@redhat.com> --------- Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-02-02 11:58:18 -08:00
Michael Goin	74fa1d123c	[Bugfix] Fix OpenAI parallel sampling when using xgrammar (#11637 ) Signed-off-by: mgoin <michael@neuralmagic.com>	2024-12-31 03:43:54 +00:00
jianzheng	8db957ee3a	[bugfix] fixed parameter “n” when set parameter “bestof” > 1 (#10854 ) Signed-off-by: jianzheng <57654625+o2363286@users.noreply.github.com>	2024-12-04 08:48:22 +00:00
Joe Runde	031a7995f3	[Bugfix][Frontend] Reject guided decoding in multistep mode (#9892 ) Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>	2024-11-01 01:09:46 +00:00
Zhong Qishuai	ef7865b4f9	[Frontend] re-enable multi-modality input in the new beam search implementation (#9427 ) Signed-off-by: Qishuai Ferdinandzhong@gmail.com	2024-10-29 11:49:47 +00:00
Vasiliy Alekseev	07e981fdf4	[Frontend] Bad words sampling parameter (#9717 ) Signed-off-by: Vasily Alexeev <alvasian@yandex.ru>	2024-10-26 16:29:38 +00:00
Nick Hill	1325872ec8	[Frontend] Avoid creating guided decoding LogitsProcessor unnecessarily (#9521 )	2024-10-18 20:21:01 -07:00
youkaichao	cbc2ef5529	[misc] hide best_of from engine (#9261 ) Co-authored-by: Brendan Wong <bjwpokemon@gmail.com>	2024-10-10 21:30:44 -07:00
Travis Johnson	480b7f40cf	[Misc] Improve validation errors around best_of and n (#9167 ) Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>	2024-10-09 04:54:48 +00:00
youkaichao	18b296fdb2	[core] remove beam search from the core (#9105 )	2024-10-07 05:47:04 +00:00
Brendan Wong	168cab6bbf	[Frontend] API support for beam search (#9087 ) Co-authored-by: youkaichao <youkaichao@126.com>	2024-10-05 23:39:03 -07:00
Joe Runde	062c89e7c9	[Frontend][Core] Move guided decoding params into sampling params (#8252 ) Signed-off-by: Joe Runde <Joseph.Runde@ibm.com> Co-authored-by: Nick Hill <nickhill@us.ibm.com>	2024-10-01 09:34:25 +08:00
youkaichao	1e7d5c01f5	[misc] soft drop beam search (#8763 )	2024-09-24 15:48:39 -07:00
saumya-saran	b28298f2f4	[Bugfix] Validate SamplingParam n is an int (#8548 )	2024-09-20 12:46:02 -07:00
Nick Hill	551ce01078	[Core] Add engine option to return only deltas or final output (#7381 )	2024-09-12 12:02:00 -07:00
Cyrus Leung	baaedfdb2d	[mypy] Enable following imports for entrypoints (#7248 ) Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by: Fei <dfdfcai4@gmail.com>	2024-08-20 23:28:21 -07:00
SangBin Cho	ff7ec82c4d	[Core] Optimize SPMD architecture with delta + serialization optimization (#7109 )	2024-08-18 17:57:20 -07:00
Chang Su	c134a46402	Fix empty output when temp is too low (#2937 ) Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2024-08-14 05:31:44 +00:00
Atilla Akkuş	7b261092de	[BUGFIX]: top_k is expected to be an integer. (#7227 )	2024-08-07 00:32:16 -07:00
Peng Guanwen	db9e5708a9	[Core] Reduce unnecessary compute when logprobs=None (#6532 )	2024-07-29 16:47:31 +00:00
Woosuk Kwon	bdf5fd1386	[Misc] Remove deprecation warning for beam search (#6659 )	2024-07-23 00:21:58 +00:00
Simon Mo	32c9d7f765	Report usage for beam search (#6404 )	2024-07-14 19:37:35 -07:00
Woosuk Kwon	eeceadaecc	[Misc] Add deprecation warning for beam search (#6402 )	2024-07-13 11:52:22 -07:00
Nick Hill	365791ff81	[BugFix] Fix `min_tokens` behaviour for multiple eos tokens (#5849 )	2024-06-27 11:31:11 -07:00

85 Commits