9 Commits

Author SHA1 Message Date
Jason
ed11ac644a
Merge f10ff9c26237af8a96a7b3eff70d37d43609f7f4 into 9b4e9788e4a3a731f7567338ed15d3ec549ce03b 2025-09-01 14:01:14 +00:00
youkaichao
adecc0efbe fix rmsnorm and act_quant_kernel 2025-08-27 17:12:13 +08:00
youkaichao
82f6008c8c
fix act_quant_kernel (#968)
Signed-off-by: youkaichao <youkaichao@gmail.com>
2025-08-27 16:23:30 +08:00
youkaichao
b15f0dbbbe
support scale_fmt=ue8m0 (#964)
* support scale_fmt=ue8m0

* keep improving

Signed-off-by: youkaichao <youkaichao@gmail.com>

* keep improving

Signed-off-by: youkaichao <youkaichao@gmail.com>

* add clamp min of 1e-4

Signed-off-by: youkaichao <youkaichao@gmail.com>

* rename config

Signed-off-by: youkaichao <youkaichao@gmail.com>

---------

Signed-off-by: youkaichao <youkaichao@gmail.com>
2025-08-27 15:30:21 +08:00
oyzh
4a65fd9221 fix an args description. 2025-02-15 11:02:28 +08:00
messagezsl
f10ff9c262 Update kernel.py 2025-02-08 16:07:47 +08:00
Roman Fitzjalen
2756e130c2 clarify assertion error 2025-01-28 13:16:54 +01:00
enoch kan
a1296f099e Enhance documentation and update .gitignore for model conversion scripts 2025-01-05 18:18:18 +00:00
stack-heap-overflow
4c2fdb8f55 Release DeepSeek-V3 2024-12-26 19:01:57 +08:00