Optimize scalar RHS packing function NxK F32 <- QAI8DXP x QSU4C32

Gian Marco Iodice requested to merge rhs_pack_scalar into main
  • Optimize the generic NxK RHS packing function. The performance improvement is approximately 1.5x.

Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
