70 Commits

Author SHA1 Message Date
Simon Mo
dc0b6066ab
[CI] Change PR remainder to avoid at-mentions (#8134) 2024-09-03 14:11:42 -07:00
Kevin H. Luu
f1575dc99f
[ci] Fix GHA workflow (#8129)
Signed-off-by: kevin <kevin@anyscale.com>
2024-09-03 13:25:09 -07:00
Kevin H. Luu
95a178f861
[CI] Only PR reviewers/committers can trigger CI on PR (#8124)
Signed-off-by: kevin <kevin@anyscale.com>
2024-09-03 11:32:27 -07:00
Jonathan Berkhahn
9c71c97ae2
[mypy] Enable mypy type checking for vllm/core (#7229) 2024-08-28 07:11:14 +08:00
youkaichao
35ee2ad6b9
[github][misc] promote asking llm first (#7809) 2024-08-23 09:38:50 -07:00
Cyrus Leung
baaedfdb2d
[mypy] Enable following imports for entrypoints (#7248)
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: Fei <dfdfcai4@gmail.com>
2024-08-20 23:28:21 -07:00
Kevin H. Luu
5469146bcc
[ci] Remove fast check cancel workflow (#7455) 2024-08-12 21:19:51 -07:00
Kevin H. Luu
1137f343aa
[ci] Cancel fastcheck when PR is ready (#7433)
Signed-off-by: kevin <kevin@anyscale.com>
2024-08-12 10:59:14 -07:00
Kevin H. Luu
9b3e2edd30
[ci] Cancel fastcheck run when PR is marked ready (#7427)
Signed-off-by: kevin <kevin@anyscale.com>
2024-08-12 10:56:52 -07:00
Cyrus Leung
7eb4a51c5f
[Core] Support serving encoder/decoder models (#7258) 2024-08-09 10:39:41 +08:00
Michael Goin
e14fb22e59
[Doc] Put collect_env issue output in a <detail> block (#7310) 2024-08-08 11:22:49 -07:00
Isotr0py
360bd67cf0
[Core] Support loading GGUF model (#5191)
Co-authored-by: Michael Goin <michael@neuralmagic.com>
2024-08-05 17:54:23 -06:00
Jee Jee Li
f80ab3521c
Clean up remaining Punica C information (#7027) 2024-08-04 15:37:08 -07:00
Michael Goin
b482b9a5b1
[CI/Build] Add support for Python 3.12 (#7035) 2024-08-02 13:51:22 -07:00
Cyrus Leung
3bb4b1e4cd
[mypy] Speed up mypy checking (#7056) 2024-08-01 19:49:43 -07:00
Sage Moore
7e0861bd0b
[CI/Build] Update PyTorch to 2.4.0 (#6951)
Co-authored-by: Michael Goin <michael@neuralmagic.com>
2024-08-01 11:11:24 -07:00
Jee Jee Li
7ecee34321
[Kernel][RFC] Refactor the punica kernel based on Triton (#5036) 2024-07-31 17:12:24 -07:00
Cyrus Leung
da1f7cc12a
[mypy] Enable following imports for some directories (#6681) 2024-07-31 10:38:03 +08:00
Kevin H. Luu
4fbf4aa128
[ci] GHA workflow to remove ready label upon "/notready" comment (#6921)
Signed-off-by: kevin <kevin@anyscale.com>
2024-07-29 17:03:45 -07:00
Kevin H. Luu
2d4733ba2d
Fix PR comment bot (#6554)
Signed-off-by: kevin <kevin@anyscale.com>
2024-07-18 14:48:29 -07:00
Kevin H. Luu
ecdb462c24
[ci] Reword Github bot comment (#6534) 2024-07-18 08:01:45 -07:00
Simon Mo
9bfece89fd
Add FUNDING.yml (#6435) 2024-07-14 20:36:16 -07:00
Kevin H. Luu
16ff6bd58c
[ci] Fix wording for GH bot (#6398)
Signed-off-by: kevin <kevin@anyscale.com>
2024-07-12 16:34:37 -07:00
Cody Yu
aa48e502fb
[MISC] Upgrade dependency to PyTorch 2.3.1 (#5327) 2024-07-12 12:04:26 -07:00
Kevin H. Luu
4dbebd03cc
[ci] Add GHA workflows to enable full CI run (#6381)
Signed-off-by: kevin <kevin@anyscale.com>
2024-07-12 11:36:26 -07:00
Benjamin Muskalla
b422d4961a
[CI/Build] Enable mypy typing for remaining folders (#6268) 2024-07-10 22:15:55 +08:00
Cyrus Leung
0e9164b40a
[mypy] Enable type checking for test directory (#5017) 2024-06-15 04:45:31 +00:00
SangBin Cho
847cdcca1c
[CI] Upgrade codespell version. (#5381) 2024-06-12 10:06:14 -07:00
Cyrus Leung
7a64d24aad
[Core] Support image processor (#4197) 2024-06-02 22:56:41 -07:00
youkaichao
325c119961
[Misc] add logging level env var (#5045) 2024-05-24 23:49:49 -07:00
Michael Goin
5f6d10c14c
[CI/Build] Enforce style for C++ and CUDA code with clang-format (#4722) 2024-05-22 07:18:41 +00:00
Simon Mo
e186d37cb1
[CI] use ccache actions properly in release workflow (#4629) 2024-05-06 22:23:36 +00:00
Simon Mo
c7f2cf2b7f
[CI] Reduce wheel size by not shipping debug symbols (#4602) 2024-05-04 21:28:58 -07:00
Simon Mo
8d8357c8ed
bump version to v0.4.2 (#4600) 2024-05-04 17:09:49 -07:00
SangBin Cho
fb087af52e
[mypy][7/N] Cover all directories (#4555) 2024-05-02 10:47:41 -07:00
SangBin Cho
cf8cac8c70
[mypy][6/N] Fix all the core subdirectory typing (#4450)
Co-authored-by: Cade Daniel <edacih@gmail.com>
2024-05-02 03:01:00 +00:00
Michael Goin
d627a3d837
[Misc] Upgrade to torch==2.3.0 (#4454) 2024-04-29 20:05:47 -04:00
SangBin Cho
df29793dc7
[mypy][5/N] Support all typing on model executor (#4427) 2024-04-28 19:01:26 -07:00
youkaichao
aba47be3fe
[Misc] add RFC issue template (#4401)
Co-authored-by: Simon Mo <simon.mo@hey.com>
2024-04-26 15:47:45 -07:00
SangBin Cho
b5b4a398a7
[Mypy] Typing lora folder (#4337) 2024-04-25 19:13:50 +00:00
Simon Mo
62b5166bd4
[CI] Add ccache for wheel builds job (#4281) 2024-04-23 09:51:41 -07:00
SangBin Cho
0ae11f78ab
[Mypy] Part 3 fix typing for nested directories for most of directory (#4161) 2024-04-22 21:32:44 -07:00
youkaichao
15b86408a8
[Misc] add nccl in collect env (#4211) 2024-04-19 19:44:51 +00:00
youkaichao
8a7a3e4436
[Core] add an option to log every function call to for debugging hang/crash in distributed inference (#4079)
Co-authored-by: Simon Mo <simon.mo@hey.com>
2024-04-18 16:15:12 -07:00
SangBin Cho
533d2a1f39
[Typing] Mypy typing part 2 (#4043)
Co-authored-by: SangBin Cho <sangcho@sangcho-LT93GQWG9C.local>
2024-04-17 17:28:43 -07:00
youkaichao
546e721168
[CI/Test] expand ruff and yapf for all supported python version (#4037) 2024-04-13 01:43:37 +00:00
SangBin Cho
09473ee41c
[mypy] Add mypy type annotation part 1 (#4006) 2024-04-12 14:35:50 -07:00
Woosuk Kwon
cfaf49a167
[Misc] Define common requirements (#3841) 2024-04-05 00:39:17 -07:00
youkaichao
ca81ff5196
[Core] manage nccl via a pypi package & upgrade to pt 2.2.1 (#3805) 2024-04-04 10:26:19 -07:00
youkaichao
205b94942e
[CI/Build] fix TORCH_CUDA_ARCH_LIST in wheel build (#3801) 2024-04-02 11:54:33 -07:00