vllm/model_executor at 61f412187d972a006aef1653bfe348aeaefb6a0b - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-25 05:35:02 +08:00

History

Cyrus Leung 61f412187d

[Bugfix] Re-enable Gemma3 for V1 (#14980 )

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2025-03-18 23:58:22 -07:00

..

guided_decoding

[Fix][Structured Output] using vocab_size to construct matcher (#14868 )

2025-03-17 11:42:45 -04:00

[Bugfix] Fix broken CPU quantization due to triton import (#15038 )

2025-03-18 08:57:39 -07:00

[Bugfix] Fix bnb quantization for models with both HF-format and Mistral-format weights (#14950 )

2025-03-17 23:27:26 +00:00

[Bugfix] Re-enable Gemma3 for V1 (#14980 )

2025-03-18 23:58:22 -07:00

__init__.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

custom_op.py

[Neuron] Add custom_ops for neuron backend (#13246 )

2025-02-25 11:47:49 -08:00

parameter.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

pooling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

sampling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

utils.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00