This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-16 10:26:07 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
models
History
Qiu
46cbbca05c
[CI][DCP][Perf] reduce DCP CI execution time (
#29858
)
...
Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
2025-12-04 17:28:21 +00:00
..
fixtures
…
language
[Model][6/N] Improve all pooling task | Support chunked prefill with ALL pooling (
#27145
)
2025-12-04 13:44:15 +00:00
multimodal
[Chore] Deprecate
merge_by_field_config
arg (
#30035
)
2025-12-04 17:21:24 +00:00
quantization
[Bugfix][Quantization] Support BF16 tensors on GGUF (
#29948
)
2025-12-03 10:33:46 +00:00
__init__.py
…
registry.py
[CI][DCP][Perf] reduce DCP CI execution time (
#29858
)
2025-12-04 17:28:21 +00:00
test_gguf_download.py
[Chore]: Reorganize gguf utils funtions under
transformers_utils
(
#29891
)
2025-12-02 17:33:23 +00:00
test_initialization.py
[Attention] Refactor CUDA attention backend selection logic (
#24794
)
2025-11-11 07:40:44 -05:00
test_oot_registration.py
…
test_registry.py
…
test_terratorch.py
…
test_transformers.py
[Hardware][AMD] Remove ROCm skip conditions for transformers backend tests (
#29782
)
2025-12-02 02:03:13 +08:00
test_utils.py
…
test_vision.py
…
utils.py
[Chore] Move tokenizer initialization methods (
#29793
)
2025-12-02 13:33:37 +08:00