This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-04-05 12:37:03 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
v1
/
engine
History
Jeffrey Li
0e4254492f
[Bugfix]: fix issue with n>1 sampling on v1 requests overriding each other (
#16863
)
...
Signed-off-by: Jeffrey Li <jeffrey.dot.li@gmail.com>
2025-04-22 11:40:19 +08:00
..
__init__.py
…
conftest.py
Update deprecated Python 3.8 typing (
#13971
)
2025-03-02 17:34:51 -08:00
test_async_llm.py
[V1] Minor V1 async engine test refactor (
#15075
)
2025-03-19 10:37:17 -07:00
test_engine_args.py
[V1] Revert the default
max_num_seqs
to V0 values for most hardware (
#16158
)
2025-04-07 13:54:36 -04:00
test_engine_core_client.py
[V1][BugFix] Exit properly if engine core fails during startup (
#16137
)
2025-04-07 15:30:15 -07:00
test_engine_core.py
[V1] Refactor num_computed_tokens logic (
#15307
)
2025-03-27 04:54:36 +00:00
test_llm_engine.py
[Core] Update dtype detection and defaults (
#14858
)
2025-03-19 13:49:33 +08:00
test_output_processor.py
[Bugfix]: fix issue with n>1 sampling on v1 requests overriding each other (
#16863
)
2025-04-22 11:40:19 +08:00
utils.py
[V1] Detokenizer: Respect Stop Tokens + not include_stop_str_in_output (
#14624
)
2025-03-13 19:07:34 +00:00