Cyrus Leung
|
eeec9e3390
|
[Frontend] Separate pooling APIs in offline inference (#11129)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
2024-12-13 10:40:07 +00:00 |
|
Cyrus Leung
|
d2f058e76c
|
[Misc] Rename embedding classes to pooling (#10801)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
2024-12-01 14:36:51 +08:00 |
|
Cyrus Leung
|
3b00b9c26c
|
[Core] renamePromptInputs and inputs (#8876)
|
2024-09-26 20:35:15 -07:00 |
|
Simon Mo
|
4f1ba0844b
|
Revert "rename PromptInputs and inputs with backward compatibility (#8760) (#8810)
|
2024-09-25 10:36:26 -07:00 |
|
Cyrus Leung
|
28e1299e60
|
rename PromptInputs and inputs with backward compatibility (#8760)
|
2024-09-25 09:36:47 -07:00 |
|
Simon Mo
|
3185fb0cca
|
Revert "[Core] Rename PromptInputs to PromptType, and inputs to prompt" (#8750)
|
2024-09-24 05:45:20 +00:00 |
|
Daniele
|
ee5f34b1c2
|
[CI/Build] use setuptools-scm to set __version__ (#4738)
Co-authored-by: youkaichao <youkaichao@126.com>
|
2024-09-23 09:44:26 -07:00 |
|
Cyrus Leung
|
0057894ef7
|
[Core] Rename PromptInputs and inputs(#8673)
|
2024-09-20 19:00:54 -07:00 |
|
Cyrus Leung
|
739b61a348
|
[Frontend] Refactor prompt processing (#4028)
Co-authored-by: Roger Wang <ywang@roblox.com>
|
2024-07-22 10:13:53 -07:00 |
|
Michael Goin
|
111fc6e7ec
|
[Misc] Add generated git commit hash as vllm.__commit__ (#6386)
|
2024-07-12 22:52:15 +00:00 |
|
Cyrus Leung
|
03dccc886e
|
[Misc] Add vLLM version getter to utils (#5098)
|
2024-06-13 11:21:39 -07:00 |
|
Simon Mo
|
114332b88e
|
Bump version to v0.5.0 (#5384)
|
2024-06-10 15:56:06 -07:00 |
|
Simon Mo
|
87a658c812
|
Bump version to v0.4.3 (#5046)
|
2024-05-30 11:13:46 -07:00 |
|
Cyrus Leung
|
5ae5ed1e60
|
[Core] Consolidate prompt arguments to LLM engines (#4328)
Co-authored-by: Roger Wang <ywang@roblox.com>
|
2024-05-28 13:29:31 -07:00 |
|
Chang Su
|
e254497b66
|
[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (#3734)
|
2024-05-11 11:30:37 -07:00 |
|
Simon Mo
|
8d8357c8ed
|
bump version to v0.4.2 (#4600)
|
2024-05-04 17:09:49 -07:00 |
|
Nick Hill
|
479d69fad0
|
[Core] Move ray_utils.py from engine to executor package (#4347)
|
2024-04-25 06:52:22 +00:00 |
|
Simon Mo
|
221d93ecbf
|
Bump version of 0.4.1 (#4177)
|
2024-04-19 01:00:22 -07:00 |
|
youkaichao
|
95baec828f
|
[Core] enable out-of-tree model register (#3871)
|
2024-04-06 17:11:41 -07:00 |
|
youkaichao
|
a3c226e7eb
|
[CI/Build] 0.4.0.post1, fix sm 7.0/7.5 binary (#3803)
|
2024-04-02 12:57:04 -07:00 |
|
Simon Mo
|
430530fc18
|
bump version to v0.4.0 (#3712)
|
2024-03-29 12:28:33 -07:00 |
|
youkaichao
|
f342153b48
|
Revert "bump version to v0.4.0" (#3708)
|
2024-03-28 18:49:42 -07:00 |
|
Simon Mo
|
27a57cad52
|
bump version to v0.4.0 (#3705)
|
2024-03-28 18:26:51 -07:00 |
|
Zhuohan Li
|
4c922709b6
|
Add distributed model executor abstraction (#3191)
|
2024-03-11 11:03:45 -07:00 |
|
Woosuk Kwon
|
1cb0cc2975
|
[FIX] Make flash_attn optional (#3269)
|
2024-03-08 10:52:20 -08:00 |
|
Woosuk Kwon
|
2daf23ab0c
|
Separate attention backends (#3005)
|
2024-03-07 01:45:50 -08:00 |
|
Woosuk Kwon
|
82091b864a
|
Bump up to v0.3.3 (#3129)
|
2024-03-01 12:58:06 -08:00 |
|
Zhuohan Li
|
8fbd84bf78
|
Bump up version to v0.3.2 (#2968)
This version is for more model support. Add support for Gemma models (#2964) and OLMo models (#2832).
|
2024-02-21 11:47:25 -08:00 |
|
Woosuk Kwon
|
5f08050d8d
|
Bump up to v0.3.1 (#2887)
|
2024-02-16 15:05:18 -08:00 |
|
Zhuohan Li
|
1af090b57d
|
Bump up version to v0.3.0 (#2656)
|
2024-01-31 00:07:07 -08:00 |
|
Woosuk Kwon
|
2e0b6e7757
|
Bump up to v0.2.7 (#2337)
|
2024-01-03 17:35:56 -08:00 |
|
Woosuk Kwon
|
671af2b1c0
|
Bump up to v0.2.6 (#2157)
|
2023-12-17 10:34:56 -08:00 |
|
Woosuk Kwon
|
31c1f3255e
|
Bump up to v0.2.5 (#2095)
|
2023-12-13 23:56:15 -08:00 |
|
Woosuk Kwon
|
4dd4b5c538
|
Bump up to v0.2.4 (#2034)
|
2023-12-11 11:49:39 -08:00 |
|
Woosuk Kwon
|
0f90effc66
|
Bump up to v0.2.3 (#1903)
|
2023-12-03 12:27:47 -08:00 |
|
Woosuk Kwon
|
c5f7740d89
|
Bump up to v0.2.2 (#1689)
|
2023-11-18 21:57:07 -08:00 |
|
Zhuohan Li
|
651c614aa4
|
Bump up the version to v0.2.1 (#1355)
|
2023-10-16 12:58:57 -07:00 |
|
Woosuk Kwon
|
e2fb71ec9f
|
Bump up the version to v0.2.0 (#1212)
|
2023-09-28 15:30:38 -07:00 |
|
Woosuk Kwon
|
90eb3f43ca
|
Bump up the version to v0.1.7 (#1013)
|
2023-09-11 00:54:30 -07:00 |
|
Zhuohan Li
|
1117aa1411
|
Bump up the version to v0.1.6 (#989)
|
2023-09-08 00:07:46 -07:00 |
|
Woosuk Kwon
|
852ef5b4f5
|
Bump up the version to v0.1.5 (#944)
|
2023-09-07 16:15:31 -07:00 |
|
Woosuk Kwon
|
791d79de32
|
Bump up the version to v0.1.4 (#846)
|
2023-08-25 12:28:00 +09:00 |
|
Zhuohan Li
|
aa84c92ef6
|
Bump up version to 0.1.3 (#657)
|
2023-08-02 16:46:53 -07:00 |
|
Woosuk Kwon
|
1c395b4eaa
|
Bump up the version (#300)
|
2023-07-04 21:41:53 -07:00 |
|
Zhuohan Li
|
d6fa1be3a8
|
[Quality] Add code formatter and linter (#326)
|
2023-07-03 11:31:55 -07:00 |
|
Zhuohan Li
|
83658c8ace
|
Bump up version to 0.1.1 (#204)
|
2023-06-22 15:33:32 +08:00 |
|
Woosuk Kwon
|
0b98ba15c7
|
Change the name to vLLM (#150)
|
2023-06-17 03:07:40 -07:00 |
|