- Use of the cpuinfo thirdparty library to perform CPU feature detection, including detection of dot-product instructions on iOS and detection of cache sizes. Combined with the ARM64 build change below, this means that iPhone11/SE devices now use dot-product instructions. We measured 250 Gop/s on 1 Lightning core in iPhone 11 Pro.
- ARM32: @lissyx's fix in google/ruy#69
- ARM64: build the kNeonDotprod path unconditionally (used to be linux-only).
- x86-64: enabling AVX512 instructions in the general opensource build.
PiperOrigin-RevId: 315467315
Change-Id: I0788ddc68da8d0aa2eee71f820fb5d8c188d35b2
- ARM32: @lissyx's fix in google/ruy#69
- ARM64: enabling dotprod instructions outside of Linux.
- x86-64: enabling AVX512 instructions in the general opensource build. A previous version of this CL was rolled back due to lack of support for -march=skylake-avx512 in some toolchains. Hopefully this will fare better thanks to b68dcd8713 .
PiperOrigin-RevId: 314766242
Change-Id: I56027a328248f2d85c9c74d3d8e27328ac25f332
- ARM32: @lissyx's fix in google/ruy#69
- ARM64: enabling dotprod instructions outside of Linux.
- x86-64: enabling AVX512 instructions in the general opensource build. A previous version of this CL was rolled back due to lack of support for -march=skylake-avx512 in some toolchains. Hopefully this will fare better thanks to b68dcd8713 .
PiperOrigin-RevId: 314597853
Change-Id: Ia64d5aa5bb9e47140e084e0896a5685cac247886