Performance improvements and automations
- Adding frame rate throttling feature to all benchmarks (enabled using '--max_fps' command line flag)
- Switching Tensorflow tag to v2.20.0 with SME support. Turning off ArmNN builds.
- Adding automated benchmarking scripts that collect perfetto/simpleperf traces
- Fixing segmentation fault on Android in CollapseNet convolve_and_blur_za32 function
- Moving heap intermediate buffer allocations to stack due to page faulting overhead on Android (new/delete usage for data buffers inside hot compute functions is unsafe)
Edited by Tarush Bansal