Modal Absorption Coefficients

Modal analysis, synthesis, and absorption measurement of room impulse responses using the PolyMax frequency-domain estimator.

Author: Michele Ducceschi, March 2026

Overview

polymaxRIR is a MATLAB pipeline for physically-grounded analysis of room impulse responses (RIRs). It estimates modal parameters — frequencies f_p, damping coefficients c_p, and complex residues r_p = a_p + j b_p — and reconstructs a causal synthetic RIR as a direct modal sum:

h_syn(n) = 2 Re { Σ_p  r_p  exp(λ_p · n/fs) },    λ_p = −c_p + j√(ω_p² − c_p²)

Synthesis is done directly in the time domain — not via IFFT — so decay tails are physically correct regardless of RIR length.

The analysis range is split at the Schroeder frequency f_Sch into two physically motivated regimes:

Regime	Range	Method	Rationale
LF	f < f_Sch	PolyMax stability-diagram fitting	Modes are individually resolvable
HF	f ≥ f_Sch	Spectral peak-picking on \|H(f)\|	Dense modal overlap prevents pole-by-pole resolution

f_Sch is computed from the mid-frequency T20 (500–1000 Hz):

f_Sch = 2000 √(T20_mid / V_room)

Files

File	Role
`polymaxRIR.m`	Main batch script — modal analysis, synthesis, per-file `.mat` and `.wav` output
`polymax.m`	PolyMax frequency-domain estimator — standalone reusable function
`acoustic_indices.m`	Room-acoustic indices (T20, C80, Ts, BR, TR) — standalone reusable function
`analyseResults.m`	Post-processing comparison of two conditions (e.g. empty vs specimen)
`computeAbsorption.m`	Sound absorption coefficient estimation from the comparison results

All pipeline helper functions are local functions inside their respective scripts.

Pipeline

measured_IRs/
  empty/*.wav          ──► polymaxRIR.m ──► polymaxRIR_results/empty/*_polymaxRIR.mat
  specimen/*.wav       ──► polymaxRIR.m ──► polymaxRIR_results/specimen/*_polymaxRIR.mat
                                                          │
                                                   analyseResults.m
                                                          │
                                             comparison/comparison.mat
                                             comparison/fig1_T60distributions.fig
                                             comparison/fig2_T60spaghetti.fig
                                             comparison/fig3_T60comparison.fig
                                             comparison/empty/*_commonBasis.mat
                                             comparison/empty/*_commonBasis_SYNTH.wav
                                             comparison/specimen/*_commonBasis.mat
                                             comparison/specimen/*_commonBasis_SYNTH.wav
                                                          │
                                              computeAbsorption.m
                                                          │
                                             comparison/absorption.mat
                                             comparison/absorption_alpha.fig

Dependencies

MATLAB Toolbox	Functions used
Signal Processing Toolbox	`filtfilt`, `designfilt`, `findpeaks`
Statistics and Machine Learning Toolbox	`linkage`, `cluster`
Audio Toolbox	`audioread`, `audiowrite`

Step 1 — polymaxRIR.m

Quick start

Place .wav files in a folder (one condition per folder).
Edit the USER PARAMETERS block at the top of polymaxRIR.m.
Run the script. All results are saved to outRootDir/<folderName>/.

Batch mode

polymaxRIR processes all .wav files found in inputFolder in a single run. Each file is processed independently inside a try/catch block. A batch summary table is printed and saved as batchSummary.mat.

inputFolder = './measured_IRs/empty';
outRootDir  = './polymaxRIR_results/';

Two-band-grid architecture

Processing bands are narrow linear bands of fixed width procBandWidth_Hz [Hz]. PolyMax and peak-picking run on these. A constant bandwidth keeps polynomial orders tractable across the whole frequency range.

Display bands are ISO 266 / IEC 61260 octave or third-octave bands. All statistics, figures, and the saved poleStats struct use these. Poles are assigned to display bands by frequency after estimation.

Both grids are independent: bandMode and procBandWidth_Hz can be tuned separately.

User parameters

File and output

inputFolder = './measured_IRs/St Gobain';   % folder containing .wav files
outRootDir  = './polymaxRIR_results/';       % root for output subfolders

Analysis frequency range

fLow  = 20;      % [Hz]  lower analysis limit
fHigh = 12000;   % [Hz]  upper analysis limit

Display bands

bandMode        = 'thirdOctave';    % 'thirdOctave' | 'fullOctave' | 'custom'
customBandEdges = [50 100 200 ...]; % only used when bandMode = 'custom'

Processing bands

procBandWidth_Hz = 25;   % [Hz]  linear processing band width

Typical range: 20–100 Hz. Smaller values give lower polynomial orders and more bands.

Room geometry

V_room  = 150;   % [m³]  room volume
c_sound = 343;   % [m/s] speed of sound

Used for Weyl mode density, polynomial order sizing, and Schroeder frequency.

Transition frequency

fTransition = [];   % [Hz]  set to [] for automatic

When empty, computed from the mid-frequency T20 from acoustic_indices. Set a numeric value to override.

PolyMax parameters

maxPolyOrderLF = 2000;   % hard ceiling on polynomial order
maxPolyIters   = 100;    % target number of order steps per band

tolf_init   = 0.05;   % relative frequency stability tolerance, initial pass
told_init   = 0.10;   % relative damping  stability tolerance, initial pass
tolf_refine = 0.05;   % relative frequency stability tolerance, refinement
told_refine = 0.10;   % relative damping  stability tolerance, refinement

Per-band polynomial order is set to min(6 × N_expected, Nbins/4, maxPolyOrderLF, 900) where N_expected is the Weyl mode count for the band.

T60 rejection ceiling

T60cutMult  = 1.5;   % T60cut = T60cutMult × T20est  (per band)
T60cutFloor = 5.0;   % [s]  minimum T60cut

Poles with T60 > T60cut are rejected as spurious.

Gain outlier rejection

enableGainOutlierRejection = true;
gainOutlierMethod          = 'mad';   % 'mad' | 'std'
gainOutlierZthresh         = 6.0;
maxGainOutlierLSIters      = 5;
minModesToKeepAfterReject  = 3;

After the LS amplitude solve, poles with residue amplitudes more than gainOutlierZthresh robust Z-scores from the band median are discarded and the solve is repeated.

Iterative refinement

useIterativeRefinement = false;
nRefinementIters       = 8;

When enabled, re-estimates poles from the spectral residual of the previous iteration.

FIR correction

useFIRCorrection = false;

When enabled, a linear-phase FIR filter is fitted to the residual between reference and synthetic signal to correct for broadband spectral mismatch.

Synthesis quality threshold

nRMSE_time_keep_threshold = 5e-3;

Files with time-domain nRMSE above this value are flagged in the batch summary (but still saved).

Output files (per IR)

File	Description
`<name>_polymaxRIR.mat`	Full results struct (see fields below)
`<name>_polymaxRIR_SYNTH.wav`	Direct modal synthesis (peak-normalised)
`<name>_spectrum.fig`	Spectrum + residual + time-domain comparison
`<name>_bandStats.fig`	Per-band mode count, T60, and spacing
`<name>_LFcomparison.fig`	PolyMax T20 vs acoustic-indices T20 for LF bands

MAT file struct fields

Field	Description
`fAll`, `cAll`, `aAll`, `bAll`	All poles: frequency [Hz], damping [rad/s], residue components
`bandIdxAll`	Display-band index for each pole
`poleStats`	Per-display-band statistics struct (see below)
`globalStats`	Wideband summary struct
`refAI`	Full `acoustic_indices` output struct
`fTransition`	Schroeder frequency used [Hz]
`dispEdgesVec`	Display band edges [Hz]
`outSyn`	Synthetic IR time series
`nRMSE_IIR`, `nRMSE_final`	Time-domain normalised RMSE
`fs`	Sample rate [Hz]

poleStats struct fields (per display band)

Field	Description
`flow`, `fhigh`, `fcBand`	Band edges and ISO centre [Hz]
`N`	Number of poles in band
`regime`	`'LF'` or `'HF'`
`modalDensity_Hz`	Measured modal density [modes/Hz]
`densityRatio`	Measured / Weyl theoretical density
`meanT60`, `stdT60`, `cvT60`	T60 statistics [s]
`meanC`, `stdC`	Damping coefficient statistics [rad/s]
`meanF`, `stdF`	Modal frequency statistics [Hz]
`meanSpacing_Hz`, `cvSpacing`	Inter-mode spacing statistics [Hz]

Figures produced

Figure	Contents
acoustic_indices — aligned RIR	Original vs onset-aligned impulse response
acoustic_indices — decay and clarity	Per-band T20 [s] and C80 [dB]
polymaxRIR — Spectrum	Target vs synthetic spectrum; spectral residual; time-domain comparison with nRMSE
polymaxRIR — Per-band statistics	Mode count; mean ± 1σ T60; mean ± 1σ inter-mode spacing
polymaxRIR — LF T60 comparison	PolyMax T20 vs acoustic-indices T20 per LF band

Step 2 — analyseResults.m

Compares modal parameters across two conditions (empty room vs room with absorption specimen). Loads all *_polymaxRIR.mat files from two result folders.

Cache-skip behaviour

On each run, analyseResults checks whether comparison.mat and all per-IR _commonBasis.mat files already exist in outDir. If so, the expensive modal basis / amplitude re-fit pipeline is skipped and results are loaded directly from disk — only the figures are regenerated. Delete comparison.mat to force a full re-run.

Algorithm

Common pole basis: vote on shared modal frequencies from the empty-room IRs. A frequency bin enters the basis when at least minFileFraction of IRs contribute a pole. Representative (f, c) per basis pole = median across contributing IRs.
Specimen damping: for each basis frequency, find the nearest pole in each specimen IR within binWidth_Hz and take the median c across IRs. Frequencies are NOT re-voted — they are geometry-locked.
Amplitude re-fit: banded linear LS solve for residues (a_p, b_p) on the shared (f, c) basis, per IR.
Statistics: per-display-band T60 mean ± 1σ from both the original per-IR pole sets and the common basis.

User parameters

emptyDir    = './polymaxRIR_results/empty';
specimenDir = './polymaxRIR_results/specimen';
outDir      = './polymaxRIR_results/comparison';

binWidth_Hz     = 0.5;    % [Hz]  frequency bin width for pole voting
minFileFraction = 0.4;    % majority vote threshold [0, 1]
bandMode        = 'fullOctave';
lsBandWidth_Hz  = 50;     % [Hz]  linear LS band width for amplitude re-fit
overlapFrac     = 0.10;   % LS window extension beyond band edges
MinSecondsForFFT = 10;    % [s]   FFT zero-padding floor

Figures produced

Figure	Contents
`fig1_T60distributions.fig`	T60 histograms (LF and HF), empty vs specimen
`fig2_T60spaghetti.fig`	Per-IR T60 curves (faint) + mean (solid), both conditions in one panel
`fig3_T60comparison.fig`	Mean ± 1σ T60 per band: PolyMax modal (solid) and AI T20 (dashed), both conditions

Output files

File	Description
`comparison.mat`	Full comparison struct (bases, fits, T60 statistics, parameters)
`empty/<name>_commonBasis.mat`	Per-IR LS amplitudes on shared basis
`empty/<name>_commonBasis_SYNTH.wav`	Resynthesised IR on common basis (peak-normalised)
`specimen/<name>_commonBasis.mat`	As above for specimen condition
`specimen/<name>_commonBasis_SYNTH.wav`	As above for specimen condition

Step 3 — computeAbsorption.m

Estimates the sound absorption coefficient α of the specimen per octave or third-octave band from the comparison.mat output of analyseResults.

Method

LF regime (f < f_Sch) — per-mode all-pairs WLS regression

Every (empty IR i, specimen IR j) pair is one observation of the modal damping shift:

Δc_{ij,p} = c_{S,j,p} − c_{E,i,p}   →   K_abs · α_p

with pair weight w_{ij} = w_{E,i} · w_{S,j} where w = |r| = √(a² + b²). This all-pairs formulation uses N_E × N_S equations per mode (vs N_E + N_S for marginal means), maximising statistical efficiency. Standard errors come from the actual residuals, with MAD-based outlier rejection.

α_p = Σ_{ij} w_{ij} · Δc_{ij} / (K_abs · Σ_{ij} w_{ij})

K_abs = c_sound · S_spec / (8 V)

Per-band α is the reliability-weighted mean of α_p over all modes in the band, where the reliability weight combines amplitude magnitude and spatial consistency (coefficient of variation of |r| across IRs).

HF regime (f ≥ f_Sch) — Sabine formula on PolyMax T60

ΔA    = 55.3 V / c · (1/T60_S − 1/T60_E)
α_HF  = ΔA / S_spec

Uncertainty is propagated analytically from the inter-IR T60 standard deviations.

Advantages over ISO Schroeder / Sabine

Issue	ISO method	This method
Nonlinearity bias	Averages T60 then inverts	Works directly in c (linear)
Noise floor	Schroeder tail corrupts late decay	PolyMax robust to late-decay noise
Room absorption	Must measure or subtract	Cancels exactly via Δc
Uncertainty	Single T60 value	Per-mode spread → natural σ
Below f_Sch	T60 unreliable (few modes)	Mode-by-mode resolution
Spatial sampling	All IRs averaged equally	Amplitude-weighted by

All α values are hard-clamped to [0, 1].

User parameters

compMatPath = './polymaxRIR_results/comparison/comparison.mat';
outDir      = './polymaxRIR_results/comparison';

V_room   = 150;    % [m³]  room volume
c_sound  = 343;    % [m/s] speed of sound
S_spec   = 1.2;    % [m²]  specimen area (one-sided, facing the room)
bandMode = 'fullOctave';

useAmplitudeWeighting = true;   % weight per-mode α by |r_p|
OUTLIER_SIGMA         = 3.0;    % MAD outlier rejection threshold

Figures produced

Figure	Contents
`absorption_alpha.fig`	α per band: LF WLS (blue, errorbars), LF median (light blue), HF Sabine/PolyMax (red, errorbars), ISO reference Sabine/AI T20 (grey, full range)
Per-mode absorption	Scatter of α_p vs f_p coloured by reliability; reliability distribution histogram

Output files

File	Description
`absorption.mat`	α curve, uncertainties, per-mode estimates, reliability weights, all parameters

Standalone usage

polymax.m

% LF call: NoCluster=1, poles returned directly from stability diagram
[fBand, cBand, ~] = polymax(H, fv, NordMin, NordMax, ordStep, ...
                             tolf, told, Ncluster, T60cut, AutoCluster, NoCluster, LivePlot);

Argument	Description
`H`	Complex single-sided FFT of the RIR
`fv`	Frequency vector [Hz] matching H
`NorderMin/Max`	Polynomial order sweep range
`orderstep`	Order increment
`tolf`, `told`	Relative frequency and damping stability tolerances
`Ncluster`	Number of clusters (when `NoCluster=0`, HF regime)
`T60cut`	Reject poles with T60 > T60cut [s]
`AutoCluster`	1 = set Ncluster automatically from mean stable pole count
`NoCluster`	1 = return stability-diagram poles directly (LF regime)
`LivePlot`	1 = draw stability diagram in current figure

acoustic_indices.m

[h, fs] = audioread('myRIR.wav');
res = acoustic_indices(h, fs, bandsPerOctave, freqLims, tEarly, alignThreshold_dB);

Input	Description
`h`	Mono RIR vector
`fs`	Sample rate [Hz]
`bandsPerOctave`	1 for octave, 3 for third-octave
`freqLims`	`[fMin fMax]` [Hz]
`tEarly`	Early/late boundary for C80 [s] (ISO 3382: 0.08 s)
`alignThreshold_dB`	Onset detection threshold below peak [dB] (e.g. −20)

Output field	Description
`bandCentersHz`	ISO band centre frequencies [Hz]
`T20`	Reverberation time per band [s] — ISO 3382 T20 estimator (fit −5 to −25 dB, extrapolated to 60 dB)
`C80`	Clarity per band [dB]
`Ts`	Wideband centre time [s]
`BR`	Bass ratio: (T20₁₂₅ + T20₂₅₀) / (T20₅₀₀ + T20₁₀₀₀)
`TR`	Treble ratio: (T20₂₀₀₀ + T20₄₀₀₀) / (T20₅₀₀ + T20₁₀₀₀)
`hAligned`	Onset-aligned, mid-T60-cropped RIR (column vector)

Algorithm notes

Direct time-domain synthesis

Synthesis uses the modal sum directly rather than IFFT:

h_syn(n) = 2 Re { Σ_p  (a_p + j b_p)  exp(λ_p · n/fs) }

An IFFT-based approach treats the spectrum as periodic with period T = N/fs. Any mode whose T60 exceeds T wraps its tail back onto the start of the block, producing a spurious burst at the end. Direct synthesis has no such window constraint — each mode decays correctly to zero. Poles are processed in memory-safe chunks (~50 MB per chunk).

T20 as the T60 estimator

acoustic_indices uses exclusively the T20 estimator throughout: the Schroeder decay curve is fitted over −5 to −25 dB and the slope is extrapolated to a 60 dB drop. Per ISO 3382, T20 is an unbiased estimate of T60 — no factor of 2 is applied. T20 is used instead of T30 because it requires only 25 dB of usable dynamic range, making it more robust for low-SNR or short IRs.

PolyMax polynomial model

At each order N, a scalar right-matrix-fraction model is fitted to H(f):

H(z) ≈ B(z) / A(z),    z = exp(−jω / (2(f_high − f_low)))

The denominator polynomial A(z) is solved from the Schur complement of the real normal equations with amplitude-invariant Tikhonov regularisation (ε × trace(R)/N × I). Eigenvalues of the companion matrix give discrete-time poles; only upper-half-plane (Im > 0), left-half-plane (Re < 0) poles are retained.

A pole is accepted when between consecutive orders:

|f_p(N) − f_p(N−1)| / f_p  ≤  tolf
|c_p(N) − c_p(N−1)| / c_p  ≤  told

Amplitude solve

Residues are found by banded LS from the partial-fraction model:

H(jω) = Σ_p  r_p / (jω − λ_p)  +  r_p* / (jω − λ_p*)

Solved per processing band over a window extended by overlapFrac to reduce boundary artefacts. Tikhonov regularisation with ε = 10⁻⁸ × mean(diag(R'R)).

ISO band edge construction

fc(n) = 1000 · 2^(n/B)   Hz,    B = bandsPerOctave
lo(n) = fc(n) / 2^(1/2B),    hi(n) = fc(n) · 2^(1/2B)

Edges are always exactly N+1 values; fLow and fHigh select which bands to include without modifying their widths.

References

Peeters B., Van der Auweraer H., Guillaume P., Leuridan J. (2004). The PolyMAX frequency-domain method: a new standard for modal parameter estimation? Shock and Vibration, 11(3–4), 395–409.

Schroeder M. R. (1965). New method of measuring reverberation time. Journal of the Acoustical Society of America, 37(3), 409–412.

ISO 3382-1:2009. Acoustics — Measurement of room acoustic parameters — Part 1: Performance spaces.

ISO 266:1997. Acoustics — Preferred frequencies.

IEC 61260-1:2014. Electroacoustics — Octave-band and fractional-octave-band filters.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
measured_IRs		measured_IRs
.DS_Store		.DS_Store
.gitattributes		.gitattributes
README.md		README.md
acoustic_indices.m		acoustic_indices.m
analyseResults.m		analyseResults.m
computeAbsorption.m		computeAbsorption.m
polymax.m		polymax.m
polymaxRIR.m		polymaxRIR.m

Folders and files

Latest commit

History

Repository files navigation

Modal Absorption Coefficients

Overview

Files

Pipeline

Dependencies

Step 1 — polymaxRIR.m

Quick start

Batch mode

Two-band-grid architecture

User parameters

File and output

Analysis frequency range

Display bands

Processing bands

Room geometry

Transition frequency

PolyMax parameters

T60 rejection ceiling

Gain outlier rejection

Iterative refinement

FIR correction

Synthesis quality threshold

Output files (per IR)

MAT file struct fields

poleStats struct fields (per display band)

Figures produced

Step 2 — analyseResults.m

Cache-skip behaviour

Algorithm

User parameters

Figures produced

Output files

Step 3 — computeAbsorption.m

Method

User parameters

Figures produced

Output files

Standalone usage

polymax.m

acoustic_indices.m

Algorithm notes

Direct time-domain synthesis

T20 as the T60 estimator

PolyMax polynomial model

Amplitude solve

ISO band edge construction

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages