Skip to content

Optimize the NxK RHS packing function for qsu4c32s1s0

Gian Marco Iodice requested to merge opt_rhs_pack_qsi4c32p into main
  • Added a specialization for nr=4, kr=16, sr=2
  • Improved the RHS packing function performance by 55%

Signed-off-by: Gian Marco Iodice gianmarco.iodice@arm.com

Merge request reports