vllm/model_executor at 6b42a56d462ee17867f752b1d4d1ab1d0067cd5b - vllm - 丝路新云 Code Repository

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-04-08 09:17:03 +08:00

Lucas Wilkinson d47807ba08

[Attention] Remove slow setattr in MLA (#14769)

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>

2025-03-13 21:31:14 +00:00

guided_decoding

[Bugfix][Structured Output] Support outlines engine with reasoning outputs for DeepSeek R1 (#14114)

2025-03-06 03:49:20 +00:00

[Attention] Remove slow setattr in MLA (#14769)

2025-03-13 21:31:14 +00:00

[BugFix][TritonMLA] Process weights after model loading for GGUF (#14555)

2025-03-12 20:14:36 -07:00

[Bugfix] Fix prompt format of GLM4V (#14539)

2025-03-13 11:37:17 +00:00

__init__.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

custom_op.py

[Neuron] Add custom_ops for neuron backend (#13246)

2025-02-25 11:47:49 -08:00

parameter.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

pooling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

sampling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

utils.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00