Skip to content

Improve SME2 FFTs with FMOPA instructions

Paolo requested to merge improving_sme2 into main

Improve SME2 code for FFT

Description

This patch modifies the sme2 c2c plan for 256x256 FFTs by introducing some new twiddle factor matrices, for faster load of twiddle factors, and reorder the outer product instructions to make full use of the computation capabilities of the C1-SME2 unit.

Checklist

Merge request reports

Loading