This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-04-03 06:37:02 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
v1
History
Robert Shaw
2b10ba7491
[Bugfix][Nixl] Fix Preemption Bug (
#18631
)
...
Signed-off-by:
rshaw@neuralmagic.com
<robertgshaw2@gmail.com>
2025-05-23 23:30:16 +00:00
..
core
[v1] Redo "Support multiple KV cache groups in GPU model runner (
#17945
)" (
#18593
)
2025-05-23 09:39:47 -07:00
e2e
…
engine
…
entrypoints
[Bugfix] Migrate to REGEX Library to prevent catastrophic backtracking (
#18454
)
2025-05-23 16:16:26 -07:00
kv_connector
[Bugfix][Nixl] Fix Preemption Bug (
#18631
)
2025-05-23 23:30:16 +00:00
metrics
…
sample
[Bugfix] Migrate to REGEX Library to prevent catastrophic backtracking (
#18454
)
2025-05-23 16:16:26 -07:00
shutdown
…
spec_decode
[V1][Spec Decoding] Use model_loader.get_model() to load models (
#18273
)
2025-05-23 02:05:44 +00:00
structured_output
…
tpu
…
worker
[v1] Redo "Support multiple KV cache groups in GPU model runner (
#17945
)" (
#18593
)
2025-05-23 09:39:47 -07:00
__init__.py
…
test_async_llm_dp.py
…
test_oracle.py
…
test_serial_utils.py
…
test_utils.py
…