vllm/model_executor at a92842454ca824ce6fcf356f31e3bf0daf53629b - vllm - 丝路新云 code repository

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-25 23:07:25 +08:00

rasmith e3d0a1d190

[Quantization] [AMD] Add support for running DeepSeek int8 w8a8 MoE on ROCm (#17558)

Signed-off-by: Randall Smith <Randall.Smith@amd.com>

2025-05-02 21:41:10 -07:00

..

guided_decoding

[Feature][Frontend]: Deprecate --enable-reasoning (#17452)

2025-05-01 06:46:16 -07:00

[Quantization] [AMD] Add support for running DeepSeek int8 w8a8 MoE on ROCm (#17558)

2025-05-02 21:41:10 -07:00

Add pt_load_map_location to allow loading to cuda (#16869)

2025-05-01 23:23:42 -07:00

permute/unpermute kernel for moe optimization (#14568)

2025-05-02 11:31:55 -07:00

__init__.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

custom_op.py

[Fix] Support passing args to logger (#17425)

2025-04-30 08:06:58 -07:00

parameter.py

[Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (#6036)

2025-04-22 09:01:36 +01:00

pooling_metadata.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00

sampling_metadata.py

[Bugfix] Fix extra comma (#15851)

2025-03-31 22:57:28 -07:00

utils.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628)

2025-02-02 11:58:18 -08:00