This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-01-04 13:13:09 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
csrc
History
Mor Zusman
fdd9daafa3
[Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (
#7651
)
2024-08-28 15:06:52 -07:00
..
attention
…
core
[Bugfix] Allow ScalarType to be compiled with pytorch 2.3 and add checks for registering FakeScalarType and dynamo support. (
#7886
)
2024-08-27 23:13:45 -04:00
cpu
…
cutlass_extensions
…
mamba
[Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (
#7651
)
2024-08-28 15:06:52 -07:00
moe
[Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (
#7766
)
2024-08-27 15:07:09 -07:00
prepare_inputs
…
quantization
[Bugfix] Don't build machete on cuda <12.0 (
#7757
)
2024-08-22 08:28:52 -04:00
activation_kernels.cu
…
cache_kernels.cu
…
cache.h
…
cuda_compat.h
…
cuda_utils_kernels.cu
…
cuda_utils.h
…
custom_all_reduce_test.cu
…
custom_all_reduce.cu
…
custom_all_reduce.cuh
…
dispatch_utils.h
…
layernorm_kernels.cu
…
moe_align_block_size_kernels.cu
…
ops.h
[Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (
#7651
)
2024-08-28 15:06:52 -07:00
pos_encoding_kernels.cu
…
torch_bindings.cpp
[Kernel/Model] Migrate mamba_ssm and causal_conv1d kernels to vLLM (
#7651
)
2024-08-28 15:06:52 -07:00