Default Branch

3125d79950 · [Chore] Remove unused PolyNorm layer (#27110) · Updated 2025-10-18 03:03:43 +08:00

Branches

ce24bc8a9e · Merge branch 'main' into prune-samplers-test · Updated 2025-08-22 01:15:16 +08:00

-1
-1

1da94e673c · Do not use eval() to convert unknown types (#23266) · Updated 2025-08-21 04:39:42 +08:00

3869
6

d983769c41 · fix cuda graph (#22721) · Updated 2025-08-20 14:24:37 +08:00

3800
0
Included

de92ab523b · single deepep handle · Updated 2025-08-20 04:01:22 +08:00

4291
144

dabc03baa7 · updated · Updated 2025-08-20 01:05:49 +08:00

-1
-1

2ad6985c49 · opt · Updated 2025-08-16 05:24:50 +08:00

3918
3

36ccdcad2c · updated · Updated 2025-08-14 11:34:37 +08:00

-1
-1

69dbcc56bf · remove metrics and tracing · Updated 2025-08-14 11:29:47 +08:00

-1
-1

0470cac520 · updaed · Updated 2025-08-14 10:14:03 +08:00

-1
-1

75c7fdc016 · updated · Updated 2025-08-14 07:01:11 +08:00

-1
-1

5667ed8788 · Merge branch 'main' into seemethere/cuda_arm64 · Updated 2025-08-14 05:07:44 +08:00

3975
3

42018e8d96 · Revert "Implicit language-model-only mode via limit-mm-per-prompt (#22299)" · Updated 2025-08-09 13:51:13 +08:00

4113
1

ddb65dad96 · fix · Updated 2025-08-07 07:53:32 +08:00

4192
2

a772948c9d · add gemma3 to test · Updated 2025-08-05 03:56:13 +08:00

4249
3

bc3b20f81f · accepted length code · Updated 2025-08-04 11:06:15 +08:00

6400
5

c3d9640b09 · Use gpu_1_queue · Updated 2025-07-31 06:08:58 +08:00

4420
7

bcc0a3cbef · fix: do not install lmcache as it overrides torch version · Updated 2025-07-30 03:15:13 +08:00

4596
1

5c2a80c37d · fix bad merge · Updated 2025-07-29 00:30:25 +08:00

4466
6

f1c9ef3afd · Merge remote-tracking branch 'nm/lwilkinson/fix-flashmla-full-cudagraph' into wide_ep_working_branch · Updated 2025-07-28 05:22:09 +08:00

4493
4

c0a8db461c · Revert "[TPU][Bugfix] fix OOM issue in CI test (#21550)" · Updated 2025-07-25 14:04:37 +08:00

4551
1