Audio: DRC: DRC math function optimization #8431

Closed

andrula-song wants to merge 1 commit into thesofproject:main from andrula-song:drc

Contributor

andrula-song commented Nov 2, 2023

Using the Xtensa intrinsic instructions directly saves at least 10% of cycles for those functions, and saves about 0.92 MCPS for the DRC component.

andrula-song requested a review from singalsu

November 2, 2023 02:40

Contributor Author

andrula-song commented Nov 2, 2023

Here are the Xtensa simulator results.

Compared with the original functions, using the instructions directly saves about:
12.8% of cycles for log10_fixed;
11.1% of cycles for drc_lin2db_fixed;
10.0% of cycles for drc_log_fixed;
17.1% of cycles for drc_asin_fixed;
12.7% of cycles for drc_inv_fixed.

Tested with xt-testbench (32-bit, 48 kHz) on TGL: before the optimization we measured 150.05 MCPS (including many trace prints and module adapter operations), and after the optimization 149.13 MCPS, a saving of about 0.92 MCPS for the DRC component.
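
The whole-pipeline saving looks small next to the per-function percentages because the 150.05 MCPS baseline includes tracing and module adapter work. A minimal sketch of that arithmetic, using only the two measurements quoted above (the function names are hypothetical, not part of SOF):

```c
/* Absolute saving in MCPS: load before the change minus load after it. */
double mcps_saved(double before_mcps, double after_mcps)
{
	return before_mcps - after_mcps;
}

/* The same saving expressed as a percentage of the load before the change. */
double mcps_saved_percent(double before_mcps, double after_mcps)
{
	return 100.0 * (before_mcps - after_mcps) / before_mcps;
}
```

With the quoted figures, `mcps_saved(150.05, 149.13)` is about 0.92 MCPS, which is roughly 0.6% of the measured total.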

andrula-song force-pushed the drc branch from 4cd3d24 to d79b226 Compare

November 2, 2023 03:25

btian1 reviewed

lyakh reviewed

singalsu reviewed

src/audio/drc/drc_math_hifi3.c

    
	ae_f32 exp; /* Q7.25 */
	ae_f32 acc; /* Q6.26 */
	ae_f32 tmp; /* Q6.26 */
	ae_f64 tmp64;

Collaborator

singalsu Nov 3, 2023

So, there's overhead in inline functions, maybe it's from 26 as literal instead of variable? You could comment that the instructions normalize the value after the function was removed.

Contributor Author

andrula-song Nov 6, 2023

I tested this on the Xtensa test bench; using the function wrapper really does cost more cycles than using the instructions directly.

src/audio/drc/drc_math_hifi3.c Outdated

    
		x = drc_mult_lshift(x, ONE_OVER_SQRT2_Q30, lshift);
		tmp64 = AE_MULF32R_LL(x, ONE_OVER_SQRT2_Q30);
		/* drc_get_lshift(30, 30, 30) = 1 */
		tmp64 = AE_SLAI64S(tmp64, 1);

Collaborator

singalsu Nov 3, 2023

You could use a #define macro for the magic shift values, to show that they are for a Qx to Qy conversion.

Contributor Author

andrula-song Nov 6, 2023

If I use a macro, I cannot use AE_SLAI64S, and with AE_SLAA64S the cycle saving for log10_fixed drops from 12.8% to 12.2%. So it is better to bring back the lshift calculation for code readability.
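
For readers without the HiFi3 toolchain, the multiply-then-shift sequence discussed in this thread can be modelled in portable C. This is only a sketch, not the code from the PR: the name `q_mult_lshift_ref` is hypothetical, and it assumes that the fractional multiply followed by rounding to 32 bits drops a net 31 fractional bits before the left shift is applied. It also rounds half up, which may differ from the hardware rounding in the last bit.

```c
#include <assert.h>
#include <stdint.h>

/* Portable reference model (hypothetical helper, not SOF code).
 * Multiplies two fixed-point values and applies a left shift of lshift,
 * assuming the multiply itself removes 31 fractional bits, so the net
 * operation is a right shift of the 64-bit product by (31 - lshift).
 */
int32_t q_mult_lshift_ref(int32_t a, int32_t b, int lshift)
{
	int64_t prod = (int64_t)a * b;	/* fractional bits: qa + qb */
	int rshift = 31 - lshift;	/* net right shift of the product */

	assert(rshift > 0 && rshift < 62);

	/* Round to nearest (half up), then shift. The right shift of a
	 * negative value is arithmetic on mainstream compilers.
	 */
	prod += (int64_t)1 << (rshift - 1);
	prod >>= rshift;

	/* Saturate to the 32-bit range. */
	if (prod > INT32_MAX)
		return INT32_MAX;
	if (prod < INT32_MIN)
		return INT32_MIN;
	return (int32_t)prod;
}
```

With `lshift = 1`, a Q30 value times a Q30 constant stays in Q30: multiplying 1.0 in Q30 (`1 << 30`) by 0.5 in Q30 (`1 << 29`) gives `1 << 29` again.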

src/audio/drc/drc_math_hifi3.c Outdated

    
	int32_t lshift;
	int32_t e;
	/* drc_get_lshift(25, 30, 25) = 1 */
	int32_t lshift = 1;

Collaborator

singalsu Nov 3, 2023

Use a #define macro for the shift value of a Qx by Qy multiply with a Qz result?
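
One possible shape for such a macro is sketched below. The macro name is hypothetical, and the formula is an assumption: it is chosen so that it reproduces the two values quoted in the code comments in this review, `drc_get_lshift(30, 30, 30) = 1` and `drc_get_lshift(25, 30, 25) = 1`, on the premise that the fractional multiply leaves `qx + qy - 31` fractional bits.

```c
/* Hypothetical macro, not taken from the PR. Left shift needed after a
 * 32x32 fractional multiply of a Qx operand and a Qy operand so that the
 * result is in Qz, assuming the multiply leaves qx + qy - 31 fractional bits.
 */
#define Q_MULT_LSHIFT(qx, qy, qz) ((qz) - (qx) - (qy) + 31)

/* Compile-time checks against the two values quoted in the review. */
_Static_assert(Q_MULT_LSHIFT(30, 30, 30) == 1, "Q30 * Q30 -> Q30");
_Static_assert(Q_MULT_LSHIFT(25, 30, 25) == 1, "Q25 * Q30 -> Q25");
```

Because the macro expands to a constant expression, the Q formats stay visible at the call site instead of a bare `1`.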

Audio: DRC: DRC math function optimization

39a4d6d

Using the Xtensa intrinsic instructions directly can save
at least 10% of cycles for those functions, and about
0.9 MCPS for the DRC component.

Signed-off-by: Andrula Song <andrula.song@intel.com>

andrula-song force-pushed the drc branch from d79b226 to 39a4d6d Compare

November 6, 2023 06:57

lgirdwood approved these changes

Member

lgirdwood left a comment

Good improvement.

andrula-song closed this

Member

lgirdwood commented Jan 17, 2024

@andrula-song what's the reason for closing this if it saves MCPS?

Contributor Author

andrula-song commented Jan 18, 2024

> @andrula-song what's the reason for closing this if it saves MCPS?

Sorry, this was closed by mistake. I had force-pushed to the branch, so it cannot be reopened. I created a new one: #8757
