7880 Commits

Author SHA1 Message Date
Robert Shaw
e540aa41b8 revert logger changes
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 19:11:43 +00:00
Robert Shaw
1b488f8d5a Merge branch 'main' into one-pod-per-node-lb
2025-07-20 19:07:51 +00:00
Robert Shaw
840d3812de updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 19:07:31 +00:00
Robert Shaw
3f4ae353c2 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 17:19:42 +00:00
Robert Shaw
e9e180da5f cleanup
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 17:19:13 +00:00
Robert Shaw
5ea4fa206d updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 17:17:15 +00:00
Robert Shaw
f477b50493 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 17:15:20 +00:00
Robert Shaw
876c864d3c updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 17:12:27 +00:00
Jiayi Yan
7ba34b1241 [bugfix] fix syntax warning caused by backslash (#21251)
2025-07-20 17:12:10 +00:00
Robert Shaw
d9291f998e cleanup
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 17:04:35 +00:00
Robert Shaw
c08fb6d456 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 16:57:45 +00:00
Robert Shaw
5e6114df5d Merge pull request #19 from robertgshaw2-redhat/fix-prometheus-logging
Improve code structure
2025-07-20 12:53:23 -04:00
Robert Shaw
4eae5cbeea updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 16:50:36 +00:00
Robert Shaw
1358836fa0 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 16:48:31 +00:00
Robert Shaw
02ecfa80a4 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 16:46:35 +00:00
Robert Shaw
54e405bd92 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 16:45:46 +00:00
Robert Shaw
896b0a271e updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 15:56:59 +00:00
Robert Shaw
fd0650f258 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 15:56:50 +00:00
Robert Shaw
cad9670547 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 15:23:30 +00:00
Robert Shaw
3956d8ccad updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 15:22:43 +00:00
Robert Shaw
9a2e26d049 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 14:44:54 +00:00
Robert Shaw
d39cf9380d updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 14:42:48 +00:00
Robert Shaw
e08e1e99ee cleanup prometheus logging
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 14:41:55 +00:00
Robert Shaw
de91a3cd6a convert to use only one prometheus stat logger per async llm
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 14:38:45 +00:00
Robert Shaw
a69edca369 convert to use only one prometheus stat logger per async llm
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 13:52:50 +00:00
Robert Shaw
1e5303a801 stash
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 13:37:34 +00:00
Robert Shaw
6569facd3b stash
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 13:34:38 +00:00
Raushan Turganbay
9499e26e2a [Model] Support VLMs with transformers backend (#20543)
Signed-off-by: raushan <raushan@huggingface.co>
Signed-off-by: Isotr0py <2037008807@qq.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
2025-07-20 13:25:50 +00:00
Calvin Chen
51ba839555 [Model] use AutoWeightsLoader for bart (#18299)
Signed-off-by: calvin chen <120380290@qq.com>
2025-07-20 08:15:50 +00:00
Robert Shaw
471fa4ae68 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 03:54:43 +00:00
Robert Shaw
dbc51d6e98 nits
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 03:48:11 +00:00
Robert Shaw
b9c0f658ca nits
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 03:47:45 +00:00
Robert Shaw
1ced153eec updatedd
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 03:47:23 +00:00
Robert Shaw
2a68433a82 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 03:45:48 +00:00
Robert Shaw
4438796b48 fix lb issues
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 03:44:38 +00:00
Seiji Eicher
d1fb65bde3 Enable v1 metrics tests (#20953)
Signed-off-by: Seiji Eicher <seiji@anyscale.com>
v0.10.0rc1
2025-07-20 03:22:02 +00:00
Chengji Yao
3a1d8940ae [TPU] support fp8 kv cache quantization (#19292)
Signed-off-by: Chengji Yao <chengjiyao@google.com>
2025-07-20 03:01:00 +00:00
Robert Shaw
d2d54e9c72 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:46:54 +00:00
Robert Shaw
e1843b7e6c updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:30:23 +00:00
Robert Shaw
b142571366 cleanup
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:24:49 +00:00
Robert Shaw
2aa497571d updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:23:34 +00:00
Robert Shaw
14db6606f2 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:21:51 +00:00
Robert Shaw
4f5d3eabc8 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:21:19 +00:00
Robert Shaw
14cf3c4786 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:20:54 +00:00
Robert Shaw
2fd05875d4 updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:20:36 +00:00
Robert Shaw
48cf09be0b updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:19:52 +00:00
Robert Shaw
59a958362f updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:19:30 +00:00
Robert Shaw
aefeeed64d updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:18:01 +00:00
Robert Shaw
b90d33163c updated
Signed-off-by: Robert Shaw <robshaw@redhat.com>
2025-07-20 02:15:19 +00:00
Thomas Parnell
2b504eb770 [Docs] [V1] Update docs to remove enforce_eager limitation for hybrid models. (#21233)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
2025-07-19 16:09:58 -07:00