Signed-off-by: Daniel Campora <961215+dcampora@users.noreply.github.com> Co-authored-by: Michael Goin <mgoin64@gmail.com> Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Optional[x]
x | None
Union[x, y]
x | y
moe_align_block_size
w8a8
from_blob
get_cuda_view_from_cpu_tensor
fused_qknorm_rope_kernel