vllm/core at woosuk/router-nixl - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-28 01:01:19 +08:00

History

Bram Wasti e317414ce1

Cache the environment variable check for batch invariance (#26510 )

Signed-off-by: Bram Wasti <bwasti@meta.com>

2025-10-10 22:47:34 +00:00

..

batch_invariant.hpp

Cache the environment variable check for batch invariance (#26510 )

2025-10-10 22:47:34 +00:00

exception.hpp

[Bugfix] Fix Marlin MoE act order when is_k_full == False (#8741 )

2024-09-28 18:19:40 -07:00

math.hpp

[torch.compile] Add torch inductor pass for fusing silu_and_mul with subsequent scaled_fp8_quant operations (#10867 )

2025-05-01 07:59:28 -07:00

registration.h

[CI/Build] Per file CUDA Archs (improve wheel size and dev build times) (#8845 )

2024-10-03 22:55:25 -04:00

scalar_type.hpp

[Kernel] [Quantization] Add MXFP4 and bias support for marlin kernel (#22428 )

2025-08-14 11:23:22 -07:00