xinyun/vllm - vllm - 丝路新云-代码仓

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-20 07:25:01 +08:00

Author	SHA1	Message	Date
Isotr0py	09500f7dde	[Model] Add BNB quantization support for Mllama (#9720 )	2024-10-29 08:20:02 -04:00
Michael Goin	b7df53cd42	[Bugfix] Use "vision_model" prefix for MllamaVisionModel (#9628 ) Signed-off-by: mgoin <michael@neuralmagic.com>	2024-10-24 10:07:44 +08:00
Michael Goin	bb01f2915e	[Bugfix][Model] Fix Mllama SDPA illegal memory access for batched multi-image (#9626 ) Signed-off-by: mgoin <michael@neuralmagic.com>	2024-10-24 10:03:44 +08:00
Cyrus Leung	c18e1a3418	[VLM] Enable overriding whether post layernorm is used in vision encoder + fix quant args (#9217 ) Co-authored-by: Isotr0py <2037008807@qq.com>	2024-10-23 11:27:37 +00:00
Cyrus Leung	cee711fdbb	[Core] Rename input data types (#8688 )	2024-10-16 10:49:37 +00:00
Xiang Xu	f0fe4fe86d	[Model] Make llama3.2 support multiple and interleaved images (#9095 )	2024-10-14 15:24:26 -07:00
Michael Goin	7193774b1f	[Misc] Support quantization of MllamaForCausalLM (#8822 )	2024-09-25 14:46:22 -07:00
Chen Zhang	770ec6024f	[Model] Add support for the multi-modal Llama 3.2 model (#8811 ) Co-authored-by: simon-mo <xmo@berkeley.edu> Co-authored-by: Chang Su <chang.s.su@oracle.com> Co-authored-by: Simon Mo <simon.mo@hey.com> Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com> Co-authored-by: Roger Wang <ywang@roblox.com>	2024-09-25 13:29:32 -07:00