xinyun/vllm - vllm - 丝路新云-代码仓

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-18 21:37:32 +08:00

Author	SHA1	Message	Date
Reid	d92879baf6	[doc] Add feature status legend (#17257 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-27 08:17:02 -07:00
Russell Bryant	52b4f4a8d7	[Docs] Update structured output doc for V1 (#17135 ) Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-04-26 15:12:18 +00:00
Cyrus Leung	909fdaf152	[Bugfix] Fix standard models tests (#17217 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-26 02:26:41 -07:00
yarongmu-google	7bd0c7745c	[Doc] Minor fix for the vLLM TPU setup page (#17206 ) Signed-off-by: Yarong Mu <ymu@google.com>	2025-04-26 04:39:56 +00:00
Reid	537d5ee025	[doc] add Anything LLM integration (#17216 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-25 21:03:23 -07:00
Cyrus Leung	9d98ab5ec6	[Misc] Inline Molmo requirements (#17190 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-25 16:41:44 +00:00
Reid	df5c879527	[doc] update wrong hf model links (#17184 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-25 16:40:54 +00:00
Michael Yao	f851b84266	[Doc] Add two links to disagg_prefill.md (#17168 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-25 10:23:57 +00:00
Michael Yao	ef19e67d2c	[Doc] Add headings to improve gptqmodel.md (#17164 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-25 01:13:13 -07:00
Michael Goin	649818995f	[Docs] Fix True->true in supported_models.md (#17141 )	2025-04-25 04:20:04 +00:00
Varun Sundar Rabindranath	7a0a9da72b	[Doc] V1 : Update LoRA status (#17133 ) Signed-off-by: varun sundar rabindranath <vsundarr@redhat.com> Co-authored-by: varun sundar rabindranath <vsundarr@redhat.com>	2025-04-24 20:17:22 -07:00
Maximilien de Bayser	05e1fbfc52	Add chat template for Llama 4 models (#16428 ) Signed-off-by: Max de Bayser <mbayser@br.ibm.com>	2025-04-24 20:19:36 +00:00
Russell Bryant	6d0df0ebeb	[Docs] Generate correct github links for decorated functions (#17125 ) Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-04-24 10:39:43 -07:00
Harry Mellor	0422ce109f	Add `:markdownhelp:` to `EngineArgs` docs so markdown docstrings render properly (#17124 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-24 10:28:45 -07:00
Eyshika Agarwal	47bdee409c	Molmo Requirements (#17026 ) Signed-off-by: Eyshika Agarwal <eyshikaengineer@gmail.com> Signed-off-by: eyshika <eyshikaengineer@gmail.com>	2025-04-24 10:08:37 -07:00
Atilla	49f189439d	existing torch installation pip command fix for docs (#17059 )	2025-04-24 10:07:21 -07:00
wang.yuqi	67309a1cb5	[Frontend] Using matryoshka_dimensions control the allowed output dimensions. (#16970 )	2025-04-24 07:06:28 -07:00
omer-dayan	2bc0f72ae5	Add docs for runai_streamer_sharded (#17093 ) Signed-off-by: Omer Dayan (SW-GPU) <omer@run.ai> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-04-24 01:03:21 -07:00
Reid	9c1244de57	[doc] update to hyperlink (#17096 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-24 00:58:08 -07:00
Reid	db2f8d915c	[V1] Update structured output (#16812 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-23 23:57:17 -07:00
Harry Mellor	2c8ed8ee48	More informative error when using Transformers backend (#16988 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-23 19:54:03 -07:00
Michael Yao	f7912cba3d	[Doc] Add top anchor and a note to quantization/bitblas.md (#17042 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-23 07:32:16 -07:00
Reid	eb8ef4224d	[doc] add download path tips (#17013 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-23 04:06:30 +00:00
Lei Wang	8d32dc603d	[Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (#6036 ) Signed-off-by: xinyuxiao <xinyuxiao2024@gmail.com> Co-authored-by: xinyuxiao <xinyuxiao2024@gmail.com>	2025-04-22 09:01:36 +01:00
Michael Yao	3097ce3a32	[Doc] Update ai_accelerator/hpu-gaudi.inc.md (#16956 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-22 05:33:27 +00:00
Cyrus Leung	29f395c97c	[Doc] Remove unnecessary V1 flag (#16924 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-21 21:04:38 -04:00
David Xia	f728ab8e35	[Doc] mention how to install in CPU editable mode (#16923 ) Signed-off-by: David Xia <david@davidxia.com>	2025-04-21 17:45:51 +00:00
David Xia	63e26fff78	[doc] install required python3-dev apt package (#16888 ) Signed-off-by: David Xia <david@davidxia.com>	2025-04-21 16:15:18 +00:00
Yan Ma	fe3462c774	[XPU][Bugfix] minor fix for XPU (#15591 ) Signed-off-by: yan ma <yan.ma@intel.com>	2025-04-22 00:02:57 +08:00
Alex Brooks	b34f33438a	[Doc] Split dummy_processor_inputs() in Multimodal Docs (#16915 ) Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>	2025-04-21 11:10:01 +00:00
Reid	d6195a748b	[doc] update hyperlink (#16877 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-19 16:40:38 +00:00
Roger Wang	5124f5bf51	[Model] Qwen2.5-Omni Cleanup (#16872 )	2025-04-19 09:37:02 +00:00
Isotr0py	83f3c3bd91	[Model] Refactor Phi-4-multimodal to use merged processor and support V1 (#15477 ) Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-19 02:26:11 -07:00
Nicolò Lucchesi	2ef0dc53b8	[Frontend] Add sampling params to `v1/audio/transcriptions` endpoint (#16591 ) Signed-off-by: Jannis Schönleber <joennlae@gmail.com> Signed-off-by: NickLucche <nlucches@redhat.com> Co-authored-by: Jannis Schönleber <joennlae@gmail.com>	2025-04-19 07:03:54 +00:00
Yang Fan	2c1bd848a6	[Model][VLM] Add Qwen2.5-Omni model support (thinker only) (#15130 ) Signed-off-by: fyabc <suyang.fy@alibaba-inc.com> Signed-off-by: Roger Wang <ywang@roblox.com> Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com> Co-authored-by: Roger Wang <ywang@roblox.com> Co-authored-by: Xiong Wang <wangxiongts@163.com>	2025-04-18 23:14:36 -07:00
Justin Ho	490b1698a5	[Doc] Updated Llama section in tool calling docs to have llama 3.2 config info (#16857 ) Signed-off-by: jmho <jaylenho734@gmail.com>	2025-04-18 23:28:53 +00:00
Michael Yao	26507f8973	[Docs] Fix a link and grammar issue in production-stack.md (#16809 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-18 06:42:58 +00:00
Nathan Weinberg	9c1d5b456d	[Doc] add podman setup instructions for official image (#16796 ) Signed-off-by: Nathan Weinberg <nweinber@redhat.com>	2025-04-18 06:10:49 +00:00
Harry Mellor	e78587a64c	Improve-mm-and-pooler-and-decoding-configs (#16789 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-17 22:13:32 -07:00
Cyrus Leung	c16fb5dae8	[Doc] Improve help examples for `--compilation-config` (#16729 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-17 21:22:34 -07:00
Mark McLoughlin	e4755f7fac	[V1][Metrics] Fix http metrics middleware (#15894 )	2025-04-17 19:52:18 +00:00
Insu Kim	7c02d6a137	[Doc] Changed explanation of generation_tokens_total and prompt_tokens_total counter type metrics to avoid confusion (#16784 ) Signed-off-by: insukim1994 <insu.kim@moreh.io>	2025-04-17 14:10:08 +00:00
wang.yuqi	11c3b98491	[Doc] Document Matryoshka Representation Learning support (#16770 )	2025-04-17 13:37:37 +00:00
Cyrus Leung	dbe7f07001	[Doc] Make sure to update vLLM when installing latest code (#16781 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-17 06:53:31 -06:00
Reid	c69bf4ee06	fix: hyperlink (#16778 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-17 11:34:20 +00:00
Michael Yao	207da28186	[Doc] Fix a 404 link in installation/cpu.md (#16773 ) Signed-off-by: windsonsea <haifeng.yao@daocloud.io>	2025-04-17 10:46:21 +00:00
intervitens	5b1aca2ae3	[Bugfix] Fix GLM4 model (#16618 ) Signed-off-by: intervitens <intervitens@tutanota.com>	2025-04-17 03:35:07 -07:00
Reid	d8e557b5e5	[doc] add open-webui example (#16747 ) Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>	2025-04-17 18:27:32 +08:00
Cyrus Leung	61a44a0b22	[Doc] Add more tips to avoid OOM (#16765 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-04-17 09:54:34 +00:00
Harry Mellor	3cd91dc955	Help user create custom model for Transformers backend remote code models (#16719 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-04-17 01:05:59 +00:00

1 2 3 4 5 ...

926 Commits