This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-01-01 13:15:18 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
models
History
Cyrus Leung
6aa8f9a4e7
[Core] Rework dtype resolution (
#18751
)
...
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
2025-06-01 11:04:23 +08:00
..
fixtures
…
language
[Core] Rework dtype resolution (
#18751
)
2025-06-01 11:04:23 +08:00
multimodal
[Core] Rework dtype resolution (
#18751
)
2025-06-01 11:04:23 +08:00
quantization
[V1][Quantization] Add CUDA graph compatible v1 GGUF support (
#18646
)
2025-05-27 04:40:28 +00:00
__init__.py
…
registry.py
[Feature] minicpm eagle support (
#18943
)
2025-05-30 06:45:56 -07:00
test_initialization.py
[CI] Enable test_initialization to run on V1 (
#16736
)
2025-05-23 15:09:44 -07:00
test_oot_registration.py
…
test_registry.py
…
test_transformers.py
Enable hybrid attention models for Transformers backend (
#18494
)
2025-05-23 10:12:08 +08:00
test_utils.py
…
test_vision.py
…
utils.py
[Bugfix] Fix the failing gte embedding test (
#18720
)
2025-05-29 07:39:25 -07:00