Add SME F32 GEMV micro-kernel
Compute the Vector-Matrix multiply of F32 inputs to produce an F32 matrix, optimized using SME instructions.
Signed-off-by: Jakub Sujak jakub.sujak@arm.com
Compute the Vector-Matrix multiply of F32 inputs to produce an F32 matrix, optimized using SME instructions.
Signed-off-by: Jakub Sujak jakub.sujak@arm.com