Outer product approach to 2D c2c FFTs
Description
This patch introduces a new approach to calculating a 2D complex-to-complex (c2c) FFT of size 256x256. The new method uses the SME outer product instructions FMOPA (outer product and accumulate) and FMOPS (outer product and subtract) to calculate multiple FFTs at the same time.
The new code can be enabled by adding -DARMRAL_ENABLE_SME2=On to the cmake command when configuring the project.
Checklist
- [v] Contribution meets RAL's license terms
- [v] New functions adhere to RAL's naming scheme
- [v] Contribution conforms to RAL's directory structure
- [v] Documentation updated
- [v] "Unreleased" section of the Changelog updated
- [v] clang-format and clang-tidy run and changes included (C/C++ code)
- [v] flake8 run and changes included (Python code)
- [v] cmake-format run and changes included (CMake code)
- [v] Tests added or updated
- [v] Tests pass when run with AddressSanitizer
- [v] Benchmarks added or updated
For any items that are not checked, please provide details.