FFmpeg/libavutil/x86
Lynne 27cffd16aa
x86/tx_float: replace fft_sr_avx with fft_sr_fma3
When the SLOW_GATHER flag was added to the AVX2 version, this
made FMA3-features not enabled on Zen CPUs.
As FMA3 adds 6-7% across all platforms that support it, in
the interest of saving space, this commit removes the AVX
version and replaces it with an FMA3 version.
The only CPUs affected are Sandy Bridge and Bulldozer, which
have AVX support, but no FMA3 support.
In the future, if there's a demand for it, a version of the
function duplicated for AVX can be added.
2022-05-21 02:11:50 +02:00
..
asm.h
bswap.h
cpu.c avutil/cpu: add AVX512 Icelake flag 2022-03-10 16:45:48 -03:00
cpu.h avutil/cpu: add AVX512 Icelake flag 2022-03-10 16:45:48 -03:00
cpuid.asm
emms.asm
emms.h
fixed_dsp_init.c
fixed_dsp.asm
float_dsp_init.c
float_dsp.asm
imgutils_init.c
imgutils.asm
intmath.h
intreadwrite.h
lls_init.c
lls.asm
Makefile
pixelutils_init.c
pixelutils.asm
pixelutils.h
timer.h
tx_float_init.c x86/tx_float: replace fft_sr_avx with fft_sr_fma3 2022-05-21 02:11:50 +02:00
tx_float.asm x86/tx_float: replace fft_sr_avx with fft_sr_fma3 2022-05-21 02:11:50 +02:00
w64xmmtest.h
x86inc.asm avutil/cpu: add AVX512 Icelake flag 2022-03-10 16:45:48 -03:00
x86util.asm