Audio: Optimize IIR performance #4940

singalsu · 2021-10-29T17:06:07Z

This patch optimizes the buffer copying and output scaling to
other format than int32_t. The main saving is from not using
read/write frag buffer access functions for every sample.

The saving of processing cycles consumption varies per platform
but a second order IIR stereo EQ on TGL shows 43% improvement for
average copy() duration.

Signed-off-by: Seppo Ingalsuo seppo.ingalsuo@linux.intel.com

src/audio/eq_iir/eq_iir.c

singalsu · 2021-10-29T17:11:53Z

Note: There's still plenty of read/write frag buffer access remaining elsewhere. I will address them later in another PR.

lgirdwood · 2021-10-29T21:04:43Z

src/math/iir_df2t_generic.c

These should all be static inline in the header.

Sure, need to think how to make in a nice way the generic vs. hifi3 headers. Here I was hoping compiler does the inline but it's of course not guaranteed.

not when this is an archive. you can do

/* the header file */ #if USE_THE_STATIC_INLINE_VERSION static inline int16_t iir_df2t_s16(struct iir_state_df2t *iir, int16_t x) { return sat_int16(Q_SHIFT_RND(iir_df2t(iir, ((int32_t)x) << 16), 31, 15)); } #else /* just declare func */ int16_t iir_df2t_s16(struct iir_state_df2t *iir, int16_t x); #endif

@lgirdwood I'm testing approach to have in iir_df2t.h next code chunk.

/* Inline functions with or without HiFi3 intrinsics */ #if IIR_HIFI3 #include "iir_df2t_hifi3.h" #else #include "iir_df2t_generic.h" #endif

The inline functions e.g. in iir_df2t_hifi3.h are like:

static inline int16_t iir_df2t_s16(struct iir_state_df2t *iir, int16_t x) { ae_f32x2 y = iir_df2t(iir, ((int32_t)x) << 16); return AE_ROUND16X4F32SSYM(y, y); }

Is such OK?

should be , but what is the iir_df2t definition ?

It's https://github.com/thesofproject/sof/blob/main/src/math/iir_df2t_generic.c or https://github.com/thesofproject/sof/blob/main/src/math/iir_df2t_hifi3.c depending on build.

keyonjie · 2021-11-01T01:55:52Z

Looks nice improvement to me, thanks a lot @singalsu

lyakh

A nice optimisation! Not very obvious either - this mostly just seems to remove checking for buffer wrapping on each sample access! But yes, making some of those one-line wrapper functions inline would help a bit more!

singalsu · 2021-11-02T12:30:20Z

src/math/iir_df2t_hifi3.c

Here's a mistake. This instruction does not saturate to 24 bits. Need in addition to shift left with saturation by 8, shift right by 8. I think my cmocka test for 24 bit treated S24_LE as S32_LE so it can't detect overflow in 24 bits.

This patch optimizes the buffer copying and output scaling to other format than int32_t. The main saving is from not using read/write frag buffer access functions for every sample. The saving of processing cycles consumption varies per platform but a second order IIR stereo EQ on TGL shows 43% improvement for average copy() duration. Signed-off-by: Seppo Ingalsuo <seppo.ingalsuo@linux.intel.com>

kv2019i

Nice!

singalsu requested a review from keyonjie October 29, 2021 17:06

singalsu commented Oct 29, 2021

View reviewed changes

src/audio/eq_iir/eq_iir.c Outdated Show resolved Hide resolved

singalsu force-pushed the eqiir_readwritefrags_optimize branch from 0d11189 to 203160e Compare October 29, 2021 17:22

lgirdwood reviewed Oct 29, 2021

View reviewed changes

lyakh approved these changes Nov 2, 2021

View reviewed changes

singalsu commented Nov 2, 2021

View reviewed changes

kv2019i mentioned this pull request Nov 2, 2021

[DRAFT] cavs-nocodec: drop the extra mixers #4948

Closed

singalsu force-pushed the eqiir_readwritefrags_optimize branch from 203160e to 53ed0ff Compare November 3, 2021 17:00

singalsu marked this pull request as ready for review November 4, 2021 10:05

singalsu requested review from dbaluta, lbetlej, mmaka1 and plbossart as code owners November 4, 2021 10:05

kv2019i approved these changes Nov 4, 2021

View reviewed changes

singalsu mentioned this pull request Nov 5, 2021

[FEATURE] Optimize audio processing components' source/sink buffer access, deprecate read/write frag #4967

Closed

lgirdwood approved these changes Nov 8, 2021

View reviewed changes

lgirdwood merged commit 73f9814 into thesofproject:main Nov 8, 2021

singalsu deleted the eqiir_readwritefrags_optimize branch September 15, 2022 13:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Audio: Optimize IIR performance #4940

Audio: Optimize IIR performance #4940

Uh oh!

singalsu commented Oct 29, 2021

Uh oh!

Uh oh!

singalsu commented Oct 29, 2021

Uh oh!

lgirdwood Oct 29, 2021

Uh oh!

singalsu Nov 1, 2021

Uh oh!

lgirdwood Nov 1, 2021

Uh oh!

singalsu Nov 3, 2021

Uh oh!

lgirdwood Nov 3, 2021

Uh oh!

singalsu Nov 4, 2021

Uh oh!

keyonjie commented Nov 1, 2021

Uh oh!

lyakh left a comment •

edited

Loading

Uh oh!

singalsu Nov 2, 2021

Uh oh!

kv2019i left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Audio: Optimize IIR performance #4940

Audio: Optimize IIR performance #4940

Uh oh!

Conversation

singalsu commented Oct 29, 2021

Uh oh!

Uh oh!

singalsu commented Oct 29, 2021

Uh oh!

lgirdwood Oct 29, 2021

Choose a reason for hiding this comment

Uh oh!

singalsu Nov 1, 2021

Choose a reason for hiding this comment

Uh oh!

lgirdwood Nov 1, 2021

Choose a reason for hiding this comment

Uh oh!

singalsu Nov 3, 2021

Choose a reason for hiding this comment

Uh oh!

lgirdwood Nov 3, 2021

Choose a reason for hiding this comment

Uh oh!

singalsu Nov 4, 2021

Choose a reason for hiding this comment

Uh oh!

keyonjie commented Nov 1, 2021

Uh oh!

lyakh left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

singalsu Nov 2, 2021

Choose a reason for hiding this comment

Uh oh!

kv2019i left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

lyakh left a comment •

edited

Loading