Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Optional[x]
x | None
Union[x, y]
x | y
w8a8
from_blob
get_cuda_view_from_cpu_tensor
fused_qknorm_rope_kernel