Add scheduling arguments to FP16 example
Translation of a 1x1 Convolution into a GEMM use case is shown along with breaking the output matrix calculation into smaller blocks as per the underlying microkernel used.
Signed-off-by: Felix Thomasmathibalan felixjohnny.thomasmathibalan@arm.com