Fix out-of-bounds write in SME1 F16 GEMV
The kernel incorrectly made use of st1w
instructions instead of st1h
meaning on odd output widths the kernel would write back 2 bytes past the end of the row.
Signed-off-by: Jakub Sujak jakub.sujak@arm.com