vllm/model_executor at 2fd1a40a54cf9a5af6f0a8ce4700faf4a1a5108b - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-27 03:37:11 +08:00

History

Wentao Ye 930a24144c

[Bug] R1 Accuracy: Fix routed_scaling_factor Double Mul Issue (#24119 )

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2025-09-02 22:22:30 +00:00

..

[Bugfix] Fix packed_factor missing attribute error (#23902 )

2025-09-02 10:56:31 -07:00

[Chore][V0 Deprecation] Move LogProb to a separate file (#24055 )

2025-09-01 12:07:53 -07:00

[Bug] R1 Accuracy: Fix routed_scaling_factor Double Mul Issue (#24119 )

2025-09-02 22:22:30 +00:00

[Kernel] Add nvfp4 gemm flashinfer backends (#22346 )

2025-08-14 16:03:55 -04:00

__init__.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

custom_op.py

Optimize configuration access with LRU cache in custom ops (#22204 )

2025-08-04 21:43:24 -07:00

parameter.py

[Transform] [Quantization] Add transforms to compressed tensors (#22486 )

2025-08-28 02:43:48 -04:00

sampling_metadata.py

[Doc]: fix typos in Python comments (#24042 )

2025-09-01 19:07:45 -07:00

utils.py

[Quantization] Enable BNB support for InternS1 (#21953 )

2025-08-01 11:09:54 +00:00