Logo
Explore Help
Sign In
xinyun/vllm
1
0
Fork 0
You've already forked vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-04-16 18:37:04 +08:00
Code Issues Packages Projects Releases Wiki Activity
vllm/vllm/v1
History
Woosuk Kwon 0bf6e97493 sched
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2025-03-11 23:35:10 -07:00
..
attention
sched
2025-03-11 23:35:10 -07:00
core
sched
2025-03-11 23:35:10 -07:00
engine
sched
2025-03-11 23:35:10 -07:00
executor
Update deprecated Python 3.8 typing (#13971)
2025-03-02 17:34:51 -08:00
metrics
[V1][Metrics] Fix traceback with preemptions+LoRA (#14220)
2025-03-07 15:36:16 -05:00
sample
[V1] Support bad_words in sampler (#13376)
2025-03-08 14:50:26 -08:00
spec_decode
[V1][Spec Decode] Optimize N-gram matching with Numba (#13365)
2025-02-18 13:19:58 -08:00
stats
Update deprecated Python 3.8 typing (#13971)
2025-03-02 17:34:51 -08:00
structured_output
[V1][Core] Support MistralTokenizer for Structured Output (#14625)
2025-03-12 10:40:09 +08:00
worker
sched
2025-03-11 23:35:10 -07:00
__init__.py
[V1] AsyncLLM Implementation (#9826)
2024-11-11 23:05:38 +00:00
kv_cache_interface.py
[Bugfix][V1] Handle MLA in kv_cache_interface (#14462)
2025-03-07 22:18:25 -08:00
outputs.py
[V1] Eagerly remove finished requests from the batch (#14388)
2025-03-07 10:56:00 -08:00
request.py
[V1][Core] Support for Structured Outputs (#12388)
2025-03-07 07:19:11 -08:00
serial_utils.py
[V1] Use msgpack for core request serialization (#12918)
2025-02-10 11:35:56 +08:00
utils.py
Update deprecated Python 3.8 typing (#13971)
2025-03-02 17:34:51 -08:00
Powered by Gitea Version: 1.23.1 Page: 609ms Template: 9ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API