47 Commits

Author SHA1 Message Date
Cyrus Leung
eeec9e3390
[Frontend] Separate pooling APIs in offline inference (#11129)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
2024-12-13 10:40:07 +00:00
Cyrus Leung
d2f058e76c
[Misc] Rename embedding classes to pooling (#10801)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
2024-12-01 14:36:51 +08:00
Cyrus Leung
3b00b9c26c
[Core] renamePromptInputs and inputs (#8876) 2024-09-26 20:35:15 -07:00
Simon Mo
4f1ba0844b
Revert "rename PromptInputs and inputs with backward compatibility (#8760) (#8810) 2024-09-25 10:36:26 -07:00
Cyrus Leung
28e1299e60
rename PromptInputs and inputs with backward compatibility (#8760) 2024-09-25 09:36:47 -07:00
Simon Mo
3185fb0cca
Revert "[Core] Rename PromptInputs to PromptType, and inputs to prompt" (#8750) 2024-09-24 05:45:20 +00:00
Daniele
ee5f34b1c2
[CI/Build] use setuptools-scm to set __version__ (#4738)
Co-authored-by: youkaichao <youkaichao@126.com>
2024-09-23 09:44:26 -07:00
Cyrus Leung
0057894ef7
[Core] Rename PromptInputs and inputs(#8673) 2024-09-20 19:00:54 -07:00
Cyrus Leung
739b61a348
[Frontend] Refactor prompt processing (#4028)
Co-authored-by: Roger Wang <ywang@roblox.com>
2024-07-22 10:13:53 -07:00
Michael Goin
111fc6e7ec
[Misc] Add generated git commit hash as vllm.__commit__ (#6386) 2024-07-12 22:52:15 +00:00
Cyrus Leung
03dccc886e
[Misc] Add vLLM version getter to utils (#5098) 2024-06-13 11:21:39 -07:00
Simon Mo
114332b88e
Bump version to v0.5.0 (#5384) 2024-06-10 15:56:06 -07:00
Simon Mo
87a658c812
Bump version to v0.4.3 (#5046) 2024-05-30 11:13:46 -07:00
Cyrus Leung
5ae5ed1e60
[Core] Consolidate prompt arguments to LLM engines (#4328)
Co-authored-by: Roger Wang <ywang@roblox.com>
2024-05-28 13:29:31 -07:00
Chang Su
e254497b66
[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (#3734) 2024-05-11 11:30:37 -07:00
Simon Mo
8d8357c8ed
bump version to v0.4.2 (#4600) 2024-05-04 17:09:49 -07:00
Nick Hill
479d69fad0
[Core] Move ray_utils.py from engine to executor package (#4347) 2024-04-25 06:52:22 +00:00
Simon Mo
221d93ecbf
Bump version of 0.4.1 (#4177) 2024-04-19 01:00:22 -07:00
youkaichao
95baec828f
[Core] enable out-of-tree model register (#3871) 2024-04-06 17:11:41 -07:00
youkaichao
a3c226e7eb
[CI/Build] 0.4.0.post1, fix sm 7.0/7.5 binary (#3803) 2024-04-02 12:57:04 -07:00
Simon Mo
430530fc18
bump version to v0.4.0 (#3712) 2024-03-29 12:28:33 -07:00
youkaichao
f342153b48
Revert "bump version to v0.4.0" (#3708) 2024-03-28 18:49:42 -07:00
Simon Mo
27a57cad52
bump version to v0.4.0 (#3705) 2024-03-28 18:26:51 -07:00
Zhuohan Li
4c922709b6
Add distributed model executor abstraction (#3191) 2024-03-11 11:03:45 -07:00
Woosuk Kwon
1cb0cc2975
[FIX] Make flash_attn optional (#3269) 2024-03-08 10:52:20 -08:00
Woosuk Kwon
2daf23ab0c
Separate attention backends (#3005) 2024-03-07 01:45:50 -08:00
Woosuk Kwon
82091b864a
Bump up to v0.3.3 (#3129) 2024-03-01 12:58:06 -08:00
Zhuohan Li
8fbd84bf78
Bump up version to v0.3.2 (#2968)
This version is for more model support. Add support for Gemma models (#2964) and OLMo models (#2832).
2024-02-21 11:47:25 -08:00
Woosuk Kwon
5f08050d8d
Bump up to v0.3.1 (#2887) 2024-02-16 15:05:18 -08:00
Zhuohan Li
1af090b57d
Bump up version to v0.3.0 (#2656) 2024-01-31 00:07:07 -08:00
Woosuk Kwon
2e0b6e7757
Bump up to v0.2.7 (#2337) 2024-01-03 17:35:56 -08:00
Woosuk Kwon
671af2b1c0
Bump up to v0.2.6 (#2157) 2023-12-17 10:34:56 -08:00
Woosuk Kwon
31c1f3255e
Bump up to v0.2.5 (#2095) 2023-12-13 23:56:15 -08:00
Woosuk Kwon
4dd4b5c538
Bump up to v0.2.4 (#2034) 2023-12-11 11:49:39 -08:00
Woosuk Kwon
0f90effc66
Bump up to v0.2.3 (#1903) 2023-12-03 12:27:47 -08:00
Woosuk Kwon
c5f7740d89
Bump up to v0.2.2 (#1689) 2023-11-18 21:57:07 -08:00
Zhuohan Li
651c614aa4
Bump up the version to v0.2.1 (#1355) 2023-10-16 12:58:57 -07:00
Woosuk Kwon
e2fb71ec9f
Bump up the version to v0.2.0 (#1212) 2023-09-28 15:30:38 -07:00
Woosuk Kwon
90eb3f43ca
Bump up the version to v0.1.7 (#1013) 2023-09-11 00:54:30 -07:00
Zhuohan Li
1117aa1411
Bump up the version to v0.1.6 (#989) 2023-09-08 00:07:46 -07:00
Woosuk Kwon
852ef5b4f5
Bump up the version to v0.1.5 (#944) 2023-09-07 16:15:31 -07:00
Woosuk Kwon
791d79de32
Bump up the version to v0.1.4 (#846) 2023-08-25 12:28:00 +09:00
Zhuohan Li
aa84c92ef6
Bump up version to 0.1.3 (#657) 2023-08-02 16:46:53 -07:00
Woosuk Kwon
1c395b4eaa
Bump up the version (#300) 2023-07-04 21:41:53 -07:00
Zhuohan Li
d6fa1be3a8
[Quality] Add code formatter and linter (#326) 2023-07-03 11:31:55 -07:00
Zhuohan Li
83658c8ace
Bump up version to 0.1.1 (#204) 2023-06-22 15:33:32 +08:00
Woosuk Kwon
0b98ba15c7
Change the name to vLLM (#150) 2023-06-17 03:07:40 -07:00