xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2025-12-28 16:58:43 +08:00)
vllm/csrc/cpu
Li, Jiang
e2f56c309d
[CPU] Update torch 2.9.1 for CPU backend (#29664)
Signed-off-by: jiang1.li <jiang1.li@intel.com>
2025-11-28 13:37:54 +00:00
..
micro_gemm
…
sgl-kernels
…
activation.cpp
…
cpu_attn_amx.hpp
…
cpu_attn_impl.hpp
[CPU][IBM Z] Fix BF16 support and vectorize math operations for s390x (#28926)
2025-11-24 12:08:09 +00:00
cpu_attn_macros.h
…
cpu_attn_neon.hpp
[perf][cpu] Accelerate paged attention GEMMs (QK, PV) on Arm CPUs with NEON (#29193)
2025-11-22 09:04:36 -08:00
cpu_attn_vec16.hpp
…
cpu_attn_vec.hpp
…
cpu_attn.cpp
[perf][cpu] Accelerate paged attention GEMMs (QK, PV) on Arm CPUs with NEON (#29193)
2025-11-22 09:04:36 -08:00
cpu_types_arm.hpp
…
cpu_types_scalar.hpp
refactor(cpu_types_scalar.hpp): Unify scalar loop implementations using unroll_loop (#28847)
2025-11-19 11:05:44 +00:00
cpu_types_vsx.hpp
…
cpu_types_vxe.hpp
[CPU][IBM Z] Fix BF16 support and vectorize math operations for s390x (#28926)
2025-11-24 12:08:09 +00:00
cpu_types_x86.hpp
…
cpu_types.hpp
…
cpu_wna16.cpp
…
dnnl_helper.cpp
…
dnnl_helper.h
…
dnnl_kernels.cpp
…
float_convert.hpp
…
layernorm.cpp
…
mla_decode.cpp
…
pos_encoding.cpp
…
scratchpad_manager.cpp
…
scratchpad_manager.h
…
shm.cpp
…
torch_bindings.cpp
cleanup at::Tag::needs_fixed_stride_order (#28974)
2025-11-20 02:51:36 -08:00
utils.cpp
[CPU] Update torch 2.9.1 for CPU backend (#29664)
2025-11-28 13:37:54 +00:00
utils.hpp
[CI/Build] Fix broken build on Apple M1 (#28999)
2025-11-19 11:07:22 +00:00