xinyun/vllm - vllm - 丝路新云 Code Repository

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-23 17:27:31 +08:00

Author	SHA1	Message	Date
Harry Mellor	483ea64611	[Docs] Replace all explicit anchors with real links (#27087 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-17 02:22:06 -07:00
Chauncey	acb1bfa601	[CI] fix docs build failed (#27082 ) Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>	2025-10-17 07:53:40 +00:00
Said Taghadouini	3aeb19a39e	[Model] Add support for LightOnOCR (#26916 ) Signed-off-by: Said Taghadouini <taghadouinisaid@gmail.com> Signed-off-by: Said Taghadouini <84044788+staghado@users.noreply.github.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>	2025-10-17 05:05:24 +00:00
Cyrus Leung	8c017b3490	[Model] Always use Transformers backend for PaliGemma and Gemma3-MM (#26715 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-17 05:03:35 +00:00
Harry Mellor	4ffd6e8942	[Docs] Reduce custom syntax used in docs (#27009 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-16 20:05:34 -07:00
Cyrus Leung	6256697997	[Doc] ruff format remaining Python examples (#26795 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-15 01:25:49 -07:00
Cyrus Leung	9c4cb68339	[Chore] Remove `SupportsV0Only` interface and update supported models docs (#26783 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-14 04:55:10 -07:00
wang.yuqi	767c3ab869	[Model][0/N] Improve all pooling task | clean up (#25817 ) Signed-off-by: wang.yuqi <noooop@126.com>	2025-10-13 16:44:50 +08:00
Xiong Wang	19a9b169bf	Add Qwen3-Omni moe thinker (#25550 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Roger Wang <hey@rogerw.io> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Xiong Wang <feizi.wx@alibaba-inc.com> Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by: Roger Wang <hey@rogerw.io> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-10-10 17:00:56 +00:00
Shane A	8d2b8c0ff2	[Model] Add FlexOlmo model implementation (#24923 ) Signed-off-by: Shane A <shanea@allenai.org>	2025-10-10 09:43:15 -07:00
Paul Pak	320feae6f5	[Model] Lfm2Moe (#26344 ) Signed-off-by: Paul Pak <paulpak58@gmail.com>	2025-10-07 16:03:05 +00:00
antrec	6f59beaf0b	[Model] Add support for ModernBertForTokenClassification (#26340 ) Signed-off-by: Antoine Recanati Le Goat <antoine.recanati@sancare.fr> Signed-off-by: antrec <antoine.recanati@gmail.com> Co-authored-by: Antoine Recanati Le Goat <antoine.recanati@sancare.fr> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-10-07 14:29:19 +00:00
Cyrus Leung	4570535ec4	[Model] CLIP Embedding Support (#26010 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-04 06:21:42 -07:00
Harry Mellor	d3d649efec	Support expert parallel in Transformers backend (#26162 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-10-04 04:35:04 +00:00
Cyrus Leung	f9a8084e48	[Model] Use `merge_by_field_config` for MM models (InternVL family) (#26153 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-10-03 01:59:06 -07:00
Harry Mellor	10d765482d	`FusedMoE` support for the Transformers backend (#22650 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-10-02 23:12:15 -07:00
pwschuurman	be22bb6f3d	Run:ai model streamer add GCS package support (#24909 ) Signed-off-by: Peter Schuurman <psch@google.com>	2025-10-01 20:59:13 -07:00
Cyrus Leung	2f652e6cdf	[Doc] Improve MM Pooling model documentation (#25966 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-09-30 18:58:29 +00:00
Andrew Sansom	78a47f87ce	Test Prompt Embeds/LoRA compatibility and Enable LoRA Support for OPT Models (#25717 ) Signed-off-by: Andrew Sansom <andrew@protopia.ai>	2025-09-30 08:10:58 +08:00
Jee Jee Li	e61eb5e09d	[Model] Remove MotifForCausalLM (#25866 ) Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>	2025-09-30 00:36:30 +08:00
Yuxuan Zhang	b1ded114b9	Update GLM-4.5 Doc transformers version (#25830 ) Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>	2025-09-28 12:05:51 +00:00
XuruiYang	845adb3ec6	[Model] Add LongCat-Flash (#23991 ) Signed-off-by: yangxurui <yangxurui@meituan.com> Co-authored-by: yangxurui <yangxurui@meituan.com>	2025-09-24 21:53:40 -07:00
Harry Mellor	8c853050e7	[Docs] Enable `fail_on_warning` for the docs build in CI (#25580 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-24 19:30:33 +00:00
Roger Wang	7b57a433da	[Model] Support Dots OCR (#24645 ) Signed-off-by: Roger Wang <hey@rogerw.io> Co-authored-by: yinz-aizip <yinz@aizip.ai>	2025-09-22 02:24:40 +00:00
Harry Mellor	12aed7e453	Encoder model support for the Transformers backend (#25174 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-19 19:15:22 +01:00
Harry Mellor	058525b997	Move `PoolerConfig` from `config/__init__.py` to `config/pooler.py` (#25181 ) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-19 11:02:55 +00:00
wang.yuqi	5f696c33b1	[New Model] Support BertForTokenClassification / Named Entity Recognition (NER) task (#24872 ) Signed-off-by: wang.yuqi <noooop@126.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-18 23:22:01 +08:00
Roger Wang	0f7acdd73c	[Model] Support Qwen3-VL Model Series (#24727 ) Signed-off-by: Roger Wang <hey@rogerw.io> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Huang Jie <92386084+JJJYmmm@users.noreply.github.com> Co-authored-by: 松灵 <26085463+wulipc@users.noreply.github.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-17 05:01:04 +00:00
Woosuk Kwon	759ef49b15	Remove V0 Encoder-Decoder Support (#24907 ) Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>	2025-09-15 21:17:14 -07:00
ant-yy	72c99f2a75	[Model]: support Ling2.0 (#24627 ) Signed-off-by: vito.yy <vito.yy@antgroup.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-15 05:09:30 -07:00
wang.yuqi	bf214ca226	[Misc] Fix examples openai_pooling_client.py (#24853 ) Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-09-15 11:57:30 +00:00
Shane A	89e08d6d18	[Model] Add Olmo3 model implementation (#24534 ) Signed-off-by: Shane A <shanea@allenai.org> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-13 03:26:21 +00:00
Tao He	f946197473	[Docs] Fixes a typo in the qwen3next model name. (#24654 ) Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>	2025-09-11 19:35:14 +08:00
Russell Bryant	ba6011027d	[Docs] Update V1 doc to reflect whisper support (#24606 ) Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-09-11 01:50:08 -07:00
Tao He	e93f4cc9e3	Add the support for the qwen3 next model (a hybrid attention model). (#24526 ) Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>	2025-09-11 15:32:09 +08:00
TaehyunKim	9bd831f501	[Model] New model support for Motif-1-Tiny (#23414 ) Signed-off-by: ca1207 <ca1207zzz@gmail.com> Signed-off-by: TaehyunKim <73943231+ca1207@users.noreply.github.com> Co-authored-by: WyldeCat <skan1543@gmail.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>	2025-09-10 23:29:40 -07:00
Yash Pratap Singh	9e3c3a7df2	[LoRA]: Add LoRA support to Mistral's Voxtral models (#24517 ) Signed-off-by: Yash Pratap Singh <yashsingh20001@gmail.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>	2025-09-10 06:12:03 -07:00
Nicolò Lucchesi	3707cb2505	[Docs] Gemma3n `transcriptions` endpoint support (#24512 ) Signed-off-by: NickLucche <nlucches@redhat.com>	2025-09-09 11:03:32 -07:00
Cyrus Leung	948dd3443b	[Bugfix] Fix Apertus HF repo name (#24447 ) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>	2025-09-08 21:40:29 -07:00
wang.yuqi	6d6c6b05d3	[New Model]: google/embeddinggemma-300m (#24318 ) Signed-off-by: wang.yuqi <noooop@126.com>	2025-09-05 22:58:36 -07:00
Yash Pratap Singh	c9f7081f9c	[LoRA]: Add lora support to qwen-2.5-omni (#24231 )	2025-09-04 05:50:50 -07:00
Jiangyun Zhu	eafa8dcde6	[Model] Add pp support for hunyuan (#24212 ) Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>	2025-09-04 03:58:26 -07:00
bingchen-mi	e7fc70016f	[Model] Add MiDashengLM model support (#23652 ) Signed-off-by: chenbing8 <chenbing8@xiaomi.com> Signed-off-by: bingchen-mi <chenbing8@xiaomi.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-09-04 00:08:09 -07:00
nopperl	fa4311d85f	[V1] v1 engine + full CUDA graph support for PLaMo2 (#23998 ) Signed-off-by: Hemmi Shinichi <shemmi@preferred.jp> Signed-off-by: nopperl <54780682+nopperl@users.noreply.github.com> Co-authored-by: Hemmi Shinichi <shemmi@preferred.jp> Co-authored-by: Thomas Parnell <tom.parnell@gmail.com>	2025-09-03 08:24:02 -07:00
Kwai-Keye	7c8271cd1e	[Model]: support KeyeVL-1_5-8B (#23838 ) Signed-off-by: wangruitao <wangruitao@kuaishou.com> Co-authored-by: wangruitao <wangruitao@kuaishou.com>	2025-09-01 03:50:27 -07:00
sadegh.shokatian	379ea2823a	Add LoRA support for DeepSeek models (V2, V3, R1-0528) (#23971 ) Signed-off-by: sadeghja1070 <sadegh.ja1070@gmail.com> Signed-off-by: Jee Jee Li <pandaleefree@gmail.com> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>	2025-08-30 06:40:02 -07:00
Didier Durand	d99c3a4f7b	[Doc]: fix typos in .md files (including those of #23751 ) (#23825 ) Signed-off-by: Didier Durand <durand.didier@gmail.com>	2025-08-28 04:38:19 -07:00
Isotr0py	c5d004aaaf	[Model] Add PP support and VLM backbone compatability for GPT-OSS (#23680 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-08-28 16:03:28 +08:00
wang.yuqi	11a7fafaa8	[New Model]: Support GteNewModelForSequenceClassification (#23524 ) Signed-off-by: wang.yuqi <noooop@126.com>	2025-08-28 15:36:42 +08:00
Isotr0py	841490434a	[Model] Enable native HF format InternVL support (#23742 ) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>	2025-08-27 14:45:17 +00:00

162 Commits