xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2025-12-16 10:35:52 +08:00
vllm/tests/core
Cody Yu  d11bf435a0  [MISC] Consolidate cleanup() and refactor offline_inference_with_prefix.py (#9510)  2024-10-18 14:30:55 -07:00
Name                                Last updated                Last commit
block                               2024-10-18 14:30:55 -07:00  [MISC] Consolidate cleanup() and refactor offline_inference_with_prefix.py (#9510)
__init__.py                         2024-03-05 18:23:34 -08:00  [Tests] Add block manager and scheduler tests (#3108)
test_chunked_prefill_scheduler.py   2024-10-18 11:31:58 -07:00  [Model] Add user-configurable task for models that support both generation and embedding (#9424)
test_num_computed_tokens_update.py  2024-10-17 11:38:15 -05:00  [Core] Deprecating block manager v1 and make block manager v2 default (#8704)
test_scheduler_encoder_decoder.py   2024-10-18 11:31:58 -07:00  [Model] Add user-configurable task for models that support both generation and embedding (#9424)
test_scheduler.py                   2024-10-18 11:31:58 -07:00  [Model] Add user-configurable task for models that support both generation and embedding (#9424)
test_serialization.py               2024-08-18 17:57:20 -07:00  [Core] Optimize SPMD architecture with delta + serialization optimization (#7109)
utils.py                            2024-10-07 05:47:04 +00:00  [core] remove beam search from the core (#9105)