vllm/v1 at 0bf6e9749366baed9794a34cf32fbedabcc60c35 - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-04-16 18:37:04 +08:00

History

Woosuk Kwon 0bf6e97493 sched

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

2025-03-11 23:35:10 -07:00

attention

sched

2025-03-11 23:35:10 -07:00

core

sched

2025-03-11 23:35:10 -07:00

engine

sched

2025-03-11 23:35:10 -07:00

executor

Update deprecated Python 3.8 typing (#13971 )

2025-03-02 17:34:51 -08:00

metrics

[V1][Metrics] Fix traceback with preemptions+LoRA (#14220 )

2025-03-07 15:36:16 -05:00

sample

[V1] Support bad_words in sampler (#13376 )

2025-03-08 14:50:26 -08:00

spec_decode

[V1][Spec Decode] Optimize N-gram matching with Numba (#13365 )

2025-02-18 13:19:58 -08:00

stats

Update deprecated Python 3.8 typing (#13971 )

2025-03-02 17:34:51 -08:00

structured_output

[V1][Core] Support MistralTokenizer for Structured Output (#14625 )

2025-03-12 10:40:09 +08:00

worker

sched

2025-03-11 23:35:10 -07:00

__init__.py

[V1] AsyncLLM Implementation (#9826 )

2024-11-11 23:05:38 +00:00

kv_cache_interface.py

[Bugfix][V1] Handle MLA in kv_cache_interface (#14462 )

2025-03-07 22:18:25 -08:00

outputs.py

[V1] Eagerly remove finished requests from the batch (#14388 )

2025-03-07 10:56:00 -08:00

request.py

[V1][Core] Support for Structured Outputs (#12388 )

2025-03-07 07:19:11 -08:00

serial_utils.py

[V1] Use msgpack for core request serialization (#12918 )

2025-02-10 11:35:56 +08:00

utils.py

Update deprecated Python 3.8 typing (#13971 )

2025-03-02 17:34:51 -08:00