vllm/model_executor at 83b824c8b4ee55824b30f0509fd312b0cddb35e5 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-15 12:16:13 +08:00

History

Cyrus Leung 83b824c8b4

[VLM] Remove BaseProcessingInfo.get_mm_max_tokens_per_item (#16408 )

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2025-04-10 09:06:58 -07:00

..

guided_decoding

[Core] Upgrade to xgrammar 0.1.18, add cache size limit (#16283 )

2025-04-08 19:13:22 -07:00

[Kernel] Use moe_wna16 kernel for compressed tensors wna16 moe models (#16038 )

2025-04-10 15:08:47 +08:00

[TPU] Fix dummy loading OOM (#16372 )

2025-04-10 04:06:16 +00:00

[VLM] Remove BaseProcessingInfo.get_mm_max_tokens_per_item (#16408 )

2025-04-10 09:06:58 -07:00

__init__.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

custom_op.py

[Neuron] Add custom_ops for neuron backend (#13246 )

2025-02-25 11:47:49 -08:00

parameter.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

pooling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

sampling_metadata.py

[Bugfix] Fix extra comma (#15851 )

2025-03-31 22:57:28 -07:00

utils.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00