2024_04_25_349b87b3a8e4b37a3ffeg

Aup10 Audio Engineering Society Convention Paper 10242
Aup10 音频工程学会大会论文 10242

Presented at the Convention
在大会上发言2019 October 16-19, New York, USA
2019 年 10 月 16-19 日，美国纽约

Abstract 摘要

This Convention paper was selected based on a submitted abstract and 750-word precis that have been peer reviewed by at least two qualified anonymous reviewers. The complete manuscript was not peer reviewed.
这篇大会论文是根据已提交的摘要和 750 字的前言筛选出来的，至少有两名合格的匿名评审员对摘要和前言进行了同行评审。完整稿件未经同行评审。
This convention paper has been reproduced from the author's advance manuscript without editing, corrections, or consideration by the Review Board. The AES takes no responsibility for the contents. This paper is available in the AES E-Library, http://www.aes.org/e-lib. All rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society.
本大会论文系根据作者的预发手稿转载，未经编辑、修改或审查委员会审议。AES 对其内容不承担任何责任。本文可在 AES 电子图书馆查阅，http://www.aes.org/e-lib。保留所有权利。未经《音频工程学会学报》直接许可，不得复制本文或其中任何部分。

Perceptual Assessment of Distortion In Low-Frequency Loudspeakers
低频扬声器失真感知评估

Louis D. Fielder , and Michael J. Smithers
Louis D. Fielder 和 Michael J. Smithers Millbrae, CA, USA
美国加利福尼亚州米尔布拉 Dolby Laboratories, Sydney, Australia
澳大利亚悉尼杜比实验室Correspondence should be addressed to Louis D. Fielder (louisfielder@earthlink.net)
通讯作者：Louis D. Fielder (louisfielder@earthlink.net)

Abstract 摘要

A perceptually-driven distortion metric for loudspeakers is proposed which is based on a critical-band spectral comparison of the distortion and noise to an appropriate masking threshold. The loudspeaker is excited by a sine-wave signal composed of windowed 0.3 second bursts.
该指标基于失真和噪声与适当掩蔽阈值的临界频段频谱比较。扬声器由窗口 0.3 秒脉冲串组成的正弦波信号激励。
Loudspeaker masking curves for sine waves between are derived from previously published ones for headphone distortion evaluation and expanded to curves at 1 decibel increments by linear interpolation and extrapolation.
之间正弦波的扬声器掩蔽曲线源自之前发布的耳机失真评估曲线，并通过线性插值和外推法扩展为 1 分贝增量的曲线。
For each burst, the ratios of measured distortion and noise levels to the appropriate masking curve values are determined for each critical band starting at the second harmonic.
对于每个脉冲串，从二次谐波开始，确定每个临界频段的测量失真和噪声水平与相应屏蔽曲线值的比率。
Once this is done the audibility of all these contributions are combined into various audibility values.
一旦完成这项工作，所有这些贡献的可听度就会合并成不同的可听度值。

1 Introduction 1 引言

This study assesses the audible degradation due to nonlinear distortion for low-frequency loudspeakers, i.e. those reproducing sounds at or below

. Unfortunately, simple nonlinear distortion metrics such as harmonic distortion, total harmonic distortion and noise, and intermodulation distortion do not correlate well with perceived quality.
本研究评估了低频扬声器非线性失真引起的听觉衰减，即重现

或以下声音的扬声器。遗憾的是，谐波失真、总谐波失真和噪声以及互调失真等简单的非线性失真指标与感知质量并没有很好的关联。
Instead this approach models the perceptual process of nonlinear distortion detection via a spectral comparison of loudspeaker acoustic output at the second harmonic and above to auditory masking values on a critical-band basis and then uses a critical-band combination model for overall audibility.
相反，这种方法通过将扬声器二次谐波及以上的声学输出与听觉掩蔽值进行临界频段的频谱比较，对非线性失真检测的感知过程进行建模，然后使用临界频段组合模型进行整体可听性建模。
Sine waves between

and at a sample rate (SR) of either 48 or

stimuli are used for this test. For the purposes of this study distortion will be used to designate nonlinear distortion
该测试使用

之间的正弦波，采样率 (SR) 为 48 或

。在本研究中，失真指的是非线性失真。

The problem of finding meaningful distortion evaluation has resulted in a number of efforts to find perceptually relevant metrics for transducer distortion.
为了找到有意义的失真评估方法，许多人都在努力寻找与感知相关的传感器失真指标。
Lin [1] proposed using subjective harmonic distortion audibility experiments to develop a weighting for a traditional distortion analyzer. Another paper by Boer et al.
Lin [1] 建议使用主观谐波失真可听性实验来为传统失真分析仪制定权重。Boer 等人的另一篇论文
[2] investigated the audibility of harmonic distortion, various intermodulation distortions, and a loudspeaker nonlinearity model using subjective paired-comparison tests with music.
[2]利用音乐主观配对比较测试，研究了谐波失真、各种互调失真和扬声器非线性模型的可听性。
Lee and Geddes [3] and [4] developed a metric based on evaluating the nonlinear characteristics of a transducer's transfer function and compared this metric to the results from subjective tests. Tan, Moore, and Zacharov [5] used artificially generated distortions
Lee 和 Geddes [3] 和 [4] 在评估传感器传递函数非线性特性的基础上开发了一种度量方法，并将该方法与主观测试结果进行了比较。Tan、Moore 和 Zacharov [5]使用人为产生的失真来评估传感器的非线性特性。
and recordings of real transducers on speech and music to assess the effectiveness of a proposed spectral distortion measure.
和真实传感器对语音和音乐的录音，以评估所提出的频谱失真测量方法的有效性。
Voishvillo [6] summarized the effectiveness of some of the above distortion metrics along with the PEAQ standard, "Method for Objective Measurement of Perceived Audio Quality" [7].
Voishvillo [6]总结了上述一些失真指标的有效性，以及 PEAQ 标准 "客观测量感知音频质量的方法"[7]。
Temme, Brunet, and Keele [8] also applied the PEAQ model to the assessment of loudspeaker distortion and Temme, Brunet, and Qarabaqi [9] adapted the PEAQ model for improved performance in distortion assessment by comparison to subjective tests.
Temme、Brunet 和 Keele[8]还将 PEAQ 模型应用于扬声器失真评估，Temme、Brunet 和 Qarabaqi[9]对 PEAQ 模型进行了改进，通过与主观测试进行比较，提高了失真评估的性能。
Fielder and Benjamin [10] evaluated subwoofer nonlinear performance via 20

sine-wave stimuli and a comparison of thirdoctave spectral levels to masking curves.
Fielder 和 Benjamin [10]通过 20

正弦波刺激和第三倍频程频谱水平与掩蔽曲线的比较，对亚低音扬声器的非线性性能进行了评估。

The method presented here is different than these earlier methods in that the present model includes use of measured masking curves and a method to combine critical-band audibility into an overall audibility number.
这里介绍的方法与之前的方法不同，因为本模型包括使用测量的掩蔽曲线和将临界频段可听度合并为整体可听度数字的方法。
This method is very similar to what was recently proposed for the assessment of headphone distortion by Fielder [11], except that the masking curves have been adapted for loudspeaker listening, the acoustic signal is taken from a microphone located closely to the loudspeaker, the acoustic level is referenced at a 1 meter or greater distance, a shorter 16,384-sample FFT is employed, and

sine-wave bursts are used instead of steady-state sine-wave signals. The sequence of sine-wave bursts is composed of windowed

bursts with sequentially increasing amplitudes in 1 decibel increments that are spaced apart at 2 second intervals.
该方法与 Fielder [11]最近提出的耳机失真评估方法十分相似，不同之处在于：掩蔽曲线经调整后适用于扬声器聆听；声学信号取自靠近扬声器的麦克风；声级以 1 米或更远的距离为基准；采用较短的 16,384 样本 FFT；以及使用

正弦波脉冲串代替稳态正弦波信号。正弦波脉冲串是由窗口化的

脉冲串组成，其振幅以 1 分贝为单位依次增大，间隔为 2 秒。
The use of short bursts reduces the possibility of damage by limiting the duration of the test stimulus and the sequence of bursts more quickly provide results over a range of drive levels.
使用短脉冲串可以限制测试刺激的持续时间，从而降低损坏的可能性，而且脉冲串的顺序可以更快地在一定范围内提供驱动水平的结果。

This distortion-audibility approach analyzes the acoustic output of the loudspeaker for each sine-wave burst and produces the spectrum of the acoustic signals using a 16,384-sample FFT.
这种失真可听度方法分析扬声器对每个正弦波脉冲的声学输出，并使用 16,384 样本 FFT 生成声学信号的频谱。
A spectral comparison of the acoustic signal to auditory-masking levels is made on a critical-band by critical-band basis and then the audibility values for each critical band are combined into several audibility numbers.
在逐个临界频段的基础上，将声音信号与听觉掩蔽水平进行频谱比较，然后将每个临界频段的可听度值合并为几个可听度数值。
The audibility of all distortion products, the combination of

harmonics, and high-frequency products above

for drive frequencies of

, respectively is calculated.
计算所有失真产物、

谐波组合以及高于

或

的高频产物的可听度，驱动频率分别为

或

。

This test method shares some similarities to a test of powered subwoofers, denoted as CEA-2010 [12]. Both tests use a sequence of short windowed sine-wave bursts, time-windowed spectral analysis, and comparison to a frequency dependent threshold.
这种测试方法与 CEA-2010 [12] 中的有源低音炮测试有一些相似之处。这两项测试都使用了短窗口正弦波脉冲串、时间窗口频谱分析以及与频率相关阈值的比较。
Differences are the length of the bursts, burst window shapes, analysis window lengths/shapes, range of applied frequencies, and type of frequency-dependent threshold employed.
不同之处在于脉冲串长度、脉冲串窗口形状、分析窗口长度/形状、应用频率范围以及所采用的频率阈值类型。
Additionally, the method proposed here generates distortion audibility values, rather than imposing a hard limit for acceptable performance.
此外，本文提出的方法可生成失真可听度值，而不是为可接受的性能设定硬性限制。
Another significant difference in this proposed method is that it more accurately focuses on the audibility of higher-frequency buzz and noise distortion products, which can significantly degrade sound quality but be low in numerical level.
这种拟议方法的另一个重要区别是，它更准确地关注高频嗡嗡声和噪声失真产品的可听性，这些产品会显著降低音质，但数值水平较低。
Further to CEA-2010 [12], whilst the thresholds for the acceptable level of harmonics have some similarity to perceptual masking in terms of fall-off with increasing frequency, their absolute levels are much higher than perceptual masking levels and so subwoofers deemed acceptable under this method still exhibit quite audible distortion.
此外，根据 CEA-2010 [12]，虽然谐波可接受水平的阈值在随频率增加而下降方面与感知掩蔽有一定的相似性，但其绝对水平远高于感知掩蔽水平，因此根据这种方法认为可接受的超低音仍然会表现出相当明显的失真。

A number of distortion assessment examples are examined by this new method. It will be shown that nonlinear impairments in loudspeakers can be quite significant and dependent on the fundamental frequency.
通过这种新方法，对一些失真评估实例进行了研究。结果表明，扬声器中的非线性损耗可能相当严重，并与基频有关。
Typically distortion products become less and less significant as the frequency of the signal increases because of the combined effect of increased upward-frequency auditory masking and reduced excursion of the loudspeaker diaphragm.
通常情况下，随着信号频率的增加，失真产品的重要性会越来越小，这是因为上行频率听觉掩蔽增加和扬声器振膜偏移减小的共同作用。

2 Design of Sine-Wave Bursts
2 正弦波脉冲串的设计

The design of the sine-wave bursts is a compromise between analysis time, reducing spectral splatter, and limiting damage at high drive levels.
正弦波脉冲串的设计是在分析时间、减少频谱飞溅和限制高驱动级损坏之间的折衷方案。
The compromise chosen results in bursts composed of an integer number of cycles of the fundamental frequency to result in a time interval that is equal or just less than

. It is assumed that a 24-bit drive signal is used, with either a 48 or

SR.
折中的结果是，脉冲串由整数个基频周期组成，时间间隔等于或刚好小于

。假设使用的是 24 位驱动信号，SR 为 48 或

。

The burst is then windowed with the same length, flat-top window with fade up/down shapes based on the first and last halves of a 6000-sample,

Kaiser-Bessel window. This window shape is chosen to reduce the effect of the spectral spatter of the sine-wave burst while maximizing the time spent at the desired level. The

sine-wave bursts all have virtually the same length and window shaping. Fig 1 shows the shape of a

sine-wave burst at a

SR.
然后，使用相同长度的平顶窗口对脉冲串进行窗口处理，窗口的上/下渐变形状基于 6000 个样本的前半部分和后半部分，

Kaiser-Bessel 窗口。选择这种窗口形状是为了减少正弦波脉冲串频谱散射的影响，同时最大限度地延长所需电平的时间。

正弦波脉冲串的长度和窗口形状几乎相同。图 1 显示了

SR 下

正弦波脉冲串的形状。

Figure 1.20 Hz windowed sine-wave burst
图 1.20 Hz 窗口正弦波脉冲串

Examination of figure 1 shows that a burst is mostly composed of sine-wave cycles at full amplitude but smoothly fades to zero at the burst boundaries. Compared to a steady-state sine wave of the same length, the RMS level is reduced 1.41 or

for 48 or

SR, respectively. This also means that the distortion products are slightly underestimated due to the shorter duration of the full-amplitude, sine-wave signal portion.
对图 1 的研究表明，脉冲串主要由全振幅正弦波周期组成，但在脉冲串边界处会平滑渐弱为零。与相同长度的稳态正弦波相比，48 或

SR 的有效值电平分别降低了 1.41 或

。这也意味着，由于全振幅正弦波信号部分的持续时间较短，失真产物被略微低估。

The bursts are assembled into a sequence, spaced at 2 second intervals and sequentially increasing in level by

increments. The use of a sequence of bursts allows rapid testing of distortion audibility over a range of levels. The

range of burst levels enables assessment of the loudspeaker performance from fairly linear operation at modest displacement, up to the maximum anticipated or designed displacement.
这些脉冲串组合成一个序列，间隔为 2 秒钟，音量以

为增量依次递增。使用脉冲串可以在一定范围内快速测试失真可听度。

的脉冲串电平范围可以评估扬声器的性能，从适度位移时的相当线性运行，一直到最大预期或设计位移。
It is also possible to examine the quiet intervals between bursts to determine the effect of background noise on the measurement.
还可以检查脉冲串之间的静音间隔，以确定背景噪声对测量的影响。
Figure 2 shows a sequence of

bursts covering the level range of

. Often it is useful to include an additional low-level burst at the beginning to allow for start-up effects. This additional burst is not included in the analysis
图 2 显示了

脉冲串序列，其电平范围为

至

。通常，在开始时加入一个额外的低电平突发以考虑启动效应是很有用的。这个额外的突发不包括在分析中。

Figure 2. Sequence of

bursts from

图 2.从

到

的脉冲串序列

The effect of spectral spatter due to the burst duration and window shape produces spectral components other than the fundamental frequency. These spectral components can cause errors in the perceptual distortion assessment process. These errors are greatest for

since the masking curve at

is minimal. Figure 3 shows a comparison of the spectrum of a

burst of figure 1 to the associated masking curve.
由于脉冲串持续时间和窗口形状造成的频谱散射效应会产生基频以外的频谱成分。这些频谱成分会在感知失真评估过程中造成误差。

的这些误差最大，因为

的掩蔽曲线最小。图 3 显示了图 1 中

突发的频谱与相关屏蔽曲线的对比。

Figure 3. Worst case spectral splatter

sinewave burst @

)
图 3.最坏情况下的频谱飞溅

正弦波突发 @

)

Examination of this figure shows that the burst spectral components are significantly below the masking curve for frequencies

. If the measurement of the

harmonic level does not include frequencies below

, the spectral splatter of the burst will not create significant errors in this measurement. Frequencies at

and above result in even less significant burst spectral splatter errors and the tests can employ a critical band centered at the second harmonic.
从图中可以看出，脉冲串的频谱成分明显低于

的掩蔽曲线。如果

谐波电平的测量不包括

以下的频率，则脉冲串的频谱飞溅不会对这一测量造成重大误差。频率在

及以上的脉冲串频谱飞溅误差更小，测试可采用以二次谐波为中心的临界频段。

3 Test Configuration 3 测试配置

The assessment of loudspeaker distortion is performed by placing the loudspeaker in a half-plane environment, preferably in an anechoic half plane or a full-anechoic environment and correcting the level to the half-plane one.
扬声器失真评估是通过将扬声器置于半平面环境中进行的，最好是半消声环境或全消声环境，并将电平校正至半平面电平。
The acoustic level reference is typically measured at 1 meter or greater. Non-anechoic environments require a frequency correction factor for both the level reference microphone and the more closely placed measurement microphone to account for room modes and effects.
声级参考通常在 1 米或更高处测量。在非消声环境中，需要对声级参考传声器和距离较近的测量传声器进行频率校正，以考虑房间模式和影响。
This is best measured using the same type of stimulus and analysis windowing. In the examples shown later, a high-quality listening room is used and the loudspeakers placed inward from the wall boundaries, but otherwise the effect of room modes is ignored.
最好使用相同类型的刺激和分析窗口进行测量。在后面的示例中，使用了高质量的聆听室，扬声器放置在墙壁边界向内的位置，除此之外，房间模式的影响都被忽略。
A Schoeps MK2 microphone and CMC6 microphone amplifier are used as the measurement microphone because of their low self noise and high overload capabilities.
使用 Schoeps MK2 麦克风和 CMC6 麦克风放大器作为测量麦克风，是因为它们具有低自噪声和高过载能力。

The sequence of bursts is used to drive the loudspeaker and calibrated-level 24-bit recordings are made by placing the measurement microphone close to the loudspeaker (

meters), which feed a wide dynamic range

as the recording device. The sine-wave burst sequence to the loudspeaker is supplied by a DAC driven by the 24-bit sine-wave burst-sequence file. Ideally, the

and DAC would use the same clocks, but this test does not require that. The test set up should be configured to maximize the SNR so as not to create errors due to environmental or measurement-device self noise.
猝发序列用于驱动扬声器，将测量麦克风靠近扬声器（

米）并馈入宽动态范围

作为记录设备，就能录制校准电平的 24 位录音。扬声器的正弦波突发序列由 24 位正弦波突发序列文件驱动的 DAC 提供。理想情况下，

和 DAC 使用相同的时钟，但本测试并不要求如此。测试装置的配置应使信噪比最大化，以免因环境或测量设备自身噪声而产生误差。
The distortion-assessment algorithm is also run during quiet intervals following each burst to determine the effect of this background and equipment noise.
失真评估算法还在每个脉冲串之后的安静间歇期间运行，以确定背景噪声和设备噪声的影响。

Figure 4. Experimental Setup
图 4.实验装置

4 Masking Curve Derivation
4 掩蔽曲线推导

The masking curves used for this analysis are from an earlier study of the perceptual assessment of headphone distortion by Fielder [11]. These are derived from masking tests for frequencies of 20,50 ,

, and

, where the masker is a sine wave and the masked signal is narrow-band noise that is less than or equal in width to a critical bandwidth.
本分析中使用的掩蔽曲线来自 Fielder [11] 早先对耳机失真感知评估的研究。这些曲线来自频率为 20、50、

和

的掩蔽测试，其中掩蔽器为正弦波，被掩蔽信号为宽度小于或等于临界带宽的窄带噪声。
Typical masking curves in the psychoacoustic literature are different in that they use single sine waves as both the masker and masked signals, see Fastl and Zwicker [13].
心理声学文献中的典型掩蔽曲线与此不同，它们使用单正弦波作为掩蔽信号和被掩蔽信号，见 Fastl 和 Zwicker [13]。
Perceived beating effects occur due to the interaction of the sine-wave masker and masked sine wave with nonlinearities in the human auditory system.
由于正弦波掩蔽器和被掩蔽的正弦波与人类听觉系统中的非线性相互作用，会产生感知跳动效应。
The use of narrow-band noise as the masked signal minimizes this and acts as a good compromise for the audibility of sine waves, noises, and
使用窄带噪声作为掩蔽信号可最大限度地减少这种情况，并对正弦波、噪声和窄带噪声的可听性起到良好的折中作用。
combinations of the two. Additionally, the Author [14] had found that lower-level sine wave and critical-bandwidth noises produced similar average masking levels.
两者的组合。此外，作者[14]还发现，低电平正弦波和临界带宽噪声产生的平均掩蔽水平相似。

The masking tests took place over a 16-month period, employed 25 listeners with an average age of 30.4 years. Test subjects were approximately a

mix of male and female. Listeners with abnormal hearing were excluded. Each masking frequency test included a threshold-in-quiet test. The

masking test used 10 test subjects while the remaining frequency tests used 6 subjects each. The masking tests employed Oppo PM3 headphones because of their very low distortion and high-output capability.
掩蔽测试历时 16 个月，共使用了 25 名平均年龄为 30.4 岁的听众。测试对象男女比例约为

。听力异常的听众被排除在外。每个掩蔽频率测试都包括一个安静阈值测试。

屏蔽测试使用了 10 名测试者，其余频率测试各使用了 6 名测试者。由于 Oppo PM3 耳机具有极低的失真和高输出能力，因此掩蔽测试使用了该耳机。

The masking curves for headphone distortion measurement at the ear drum are converted to ones appropriate for loudspeaker measurements by correction by the average transfer function from the loudspeaker measurement point to the ear drum for situations of frontally-positioned loudspeakers.
耳鼓处耳机失真测量的掩蔽曲线通过扬声器测量点到耳鼓的平均传递函数修正后，转换为适合扬声器测量的曲线，以适应前置扬声器的情况。
Therefore the effect of the acoustic gain of the head, torso, pinna, and ear canal in typical rooms are used to adjust the masking curves. Figure 5 shows the acoustic gain due to head, torso, pinna, and ear canal as determined in [11].
因此，典型房间中的头部、躯干、耳廓和耳道的声增益效应被用来调整掩蔽曲线。图 5 显示了 [11] 中确定的头部、躯干、耳廓和耳道的声增益。

Figure 5. Acoustic gain for the head and ear
图 5.头部和耳朵的声增益

Headphone-masking thresholds referenced at the ear drum position were specified at ISO

-octave frequencies between one

-octave frequency below the

harmonic to

and at levels between

for

for 50

for

for 200,315 &

, and

for

. These are converted to masking curves for loudspeaker listening conditions by the inverse of the curve shown in figure 5. Additionally, the designated drive levels for the masking curves at 315,400 , and 500

need to be reduced by the acoustic gain of the head and ear. At 315, 400, and

, these gains are 0.86 . 1.35, and

, respectively. Rounded to the nearest decibel, the drive levels for the masking curves are reduced 1,1 , and

for masking curves at 315, 400, and

, respectively. For instance, the

headphone masking curve at

becomes the loudspeaker masking curve at

after correction by the inverse of the curve in figure 5 .
以耳鼓位置为基准的耳机掩蔽阈值指定为 ISO

-倍频程频率，介于

谐波与

之间的一个

-倍频程频率，以及介于

for 50

for 200,315 &

和

for

之间的电平。通过图 5 所示曲线的倒数，将这些转换为扬声器聆听条件下的掩蔽曲线。此外，315、400 和 500

的掩蔽曲线的指定驱动电平需要根据头部和耳朵的声增益进行降低。在 315、400 和

时，这些增益分别为 0.86 .1.35 和

。四舍五入到最接近的分贝，315、400 和

的掩蔽曲线的驱动水平分别降低了 1、1 和

。例如，

耳机掩蔽曲线

，经图 5 中曲线的逆校正后，变为扬声器掩蔽曲线

。

The above process results in these masking curves:
上述过程产生了这些遮蔽曲线：

94, 104,

(119 dB excluded)

94、104、（不包括 119 分贝）

The values for these masking curves are indicated in table form for

-octave ISO frequencies in appendix 1. It should be noted that the hearing threshold below

may be too high by as much as

due to the amplification of physiologic noise of the test subjects by the closed volume of the Oppo PM3 headphones, see [11]. As a result, the hearing threshold and low-level masking curves may be too forgiving for low-level distortion components below

. Fortunately, masking thresholds higher than the hearing threshold should not be significantly affected.
附录 1 中以表格形式列出了

倍频程 ISO 频率的这些掩蔽曲线值。需要注意的是，由于 Oppo PM3 耳机的封闭音量放大了测试对象的生理噪音，因此

以下的听阈可能过高，高达

，见 [11]。因此，听阈和低电平掩蔽曲线可能对

以下的低电平失真成分过于宽容。幸运的是，高于听阈的掩蔽阈值应该不会受到明显影响。

Next, an estimate for the masking curves at 1 decibel intervals between

are obtained by linear interpolation and extrapolation. If it is assumed that low-level masking curves are never lower the hearing threshold, downward extrapolation proceeds in a linear manner but limits at the hearing-threshold values.
接下来，通过线性内插法和外推法，对

之间 1 分贝间隔的掩蔽曲线进行估计。如果假定低电平掩蔽曲线永远不会低于听阈，则向下外推法以线性方式进行，但以听阈值为限。
See appendix 2 for the appropriate equations.
相关公式见附录 2。

The masking curves for 20, 50100,200 , and

fundamentals, plus interpolated curves are shown in figures

and 10 , respectively.
图

和图 10 分别显示了 20、50100、200 和

基本数据的屏蔽曲线以及内插曲线。

Figure 6.

masking curves
图 6.

遮蔽曲线

Figure 7.

masking curves
图 7.

遮蔽曲线

Figure 8.

masking curves
图 8.

遮蔽曲线

Figure 9.

masking curves
图 9.

遮蔽曲线

Figure 10.315 Hz masking curves
图 10.315 Hz 屏蔽曲线

Examination of figures 6-10 show the masking thresholds derived from the headphone data in [11] as heavy lines. The interpolated/extrapolated curves are shown as fine lines. Note that the masking curves at lower levels merge with the hearing threshold.
图 6-10 显示了根据 [11] 中的耳机数据得出的掩蔽阈值，以粗线表示。内插/外推曲线显示为细线。请注意，较低水平的掩蔽曲线与听阈合并。

5 Distortion-Assessment Algorithm
5 失真评估算法

The distortion-assessment process analyzes the results for each sine-wave burst individually. Additionally, it performs a duplicate analysis for the quiet region

after the start of each burst, for a

SR, respectively. This analysis process allows the evaluation of the effect of background noise in the distortion-assessment process.
失真评估程序会单独分析每个正弦波脉冲串的结果。此外，它还对每个脉冲串开始后的安静区域

或

（分别为

或

SR）进行重复分析。通过这一分析过程，可以在失真评估过程中评估背景噪声的影响。

This results in two assessment processes occurring for each burst, one for the distortion-audibility values and the second for effect of background-noise errors.
这就导致对每个脉冲串进行两次评估，一次评估失真可听度值，另一次评估背景噪声误差的影响。

The analysis is begun by first centering the burst in the middle of the FFT interval of 16,384 samples, which is further windowed by a flat-top window with 1024-sample fade up/down with the shape of the first and last half of a 2048-sample,

Kaiser-Bessel window. At a

SR the burst has a small overlap of 32 samples with the fade up/down regions, while at

SR a gap of 553 samples on each side of the burst exists to the start of the fade regions.
分析开始时，首先将脉冲串置于 16,384 个采样点的 FFT 间隔中间，然后用一个平顶窗口对其进行窗口化，该窗口具有 1024 个采样点的上/下渐变，其形状为 2048 个采样点的前半部分和后半部分，

Kaiser-Bessel 窗口。在

SR 时，脉冲串与淡入淡出区域有 32 个采样点的微小重叠，而在

SR 时，脉冲串两侧与淡入淡出区域的起始点之间有 553 个采样点的间隔。

The FFT windowing for the transform does not affect the direct burst signal but does prevent spectral splatter of the low-frequency room-noise components upward in frequency from creating errors.
变换的 FFT 窗口不会影响直接猝发信号，但会防止低频室内噪声成分的频谱飞溅造成误差。
Because the transform is longer than the burst signal, the effect of background noise is amplified by the small amounts of 0.56 or

for 48 or

SR, respectively. This is true because the FFT analysis samples a longer interval for the background noise than the

sine-wave burst.
由于变换的时间比脉冲串信号长，背景噪声的影响被放大了，48 或

SR 的背景噪声分别为 0.56 或

。这是因为 FFT 分析对背景噪声的采样间隔比

正弦波脉冲串的采样间隔要长。

The perceptual distortion assessment method is shown in figure 11.
感知失真评估方法如图 11 所示。

Figure 11. Distortion-assessment algorithm
图 11.失真评估算法

Examination of this figure shows that the distortion-assessment algorithm combines the values from a level-scaled and windowed FFT with
从图中可以看出，失真评估算法将来自平移和加窗 FFT 的值与下列参数结合在一起
frequency-interpolated masking-curve values to produce distortion-audibility numbers.
频率内插的掩蔽曲线值，以产生失真可听度数字。

The FFT-level scaling is first done to match the

value of the fundamental at the reference position. This is found using the RMS sum of the five FFT-spectral components centered as much as possible around the fundamental frequency. This scaled level is rounded to the nearest

number and used to select the appropriate masking curve. These scaled FFT-spectral values are denoted

, where

is the "bin" or spectral component index. The selected masking curve with values at the

-octave ISO frequencies is linearly interpolated to create masking values at the FFT-basis frequencies associated with each index

, and denoted

. Next, the ratio of

is calculated, with these ratios designated as audibility components.
首先进行 FFT 级缩放，以匹配参考位置上基频的

值。这是用尽可能以基频为中心的五个 FFT 频谱分量的均方根和求得的。该缩放电平四舍五入为最接近的

数字，并用于选择适当的屏蔽曲线。这些经过缩放的 FFT 频谱值以

表示，其中

是 "bin "或频谱分量索引。选定的屏蔽曲线在

倍频程 ISO 频率上的值经过线性插值，在与每个索引

相关的 FFT 基准频率上创建屏蔽值，并表示为

。然后，计算

的比率，并将这些比率指定为可听分量。

These audibility components are first combined using RMS summation over critical bands to create audibility values in

. These audibility values in

are denoted sensation levels (SL). Each SL represents the number of

relative to the threshold of audibility for each ERB band individually.
首先使用临界频段的有效值求和法将这些可听度分量组合起来，以创建

中的可听度值。

中的这些可听度值被称为感觉级 (SL)。每个 SL 表示相对于每个 ERB 波段可听度阈值的

数量。

Moore's equivalent-rectangular-bandwidth (ERB) bands [15], are used to represent auditory critical bandwidths, which are used to assess the audible effect of the combination of noise and sine-wave components.
摩尔等效矩形带宽 (ERB) 波段 [15] 用来表示听觉临界带宽，用于评估噪声和正弦波成分组合的听觉效果。
Within a critical band, all sound components sum together on a power basis. The bandwidth of ERB bands in

is given by equation 1 :
在临界频段内，所有声音成分的功率相加。

中的 ERB 波段带宽由公式 1 得出：

The first ERB band is centered over the second harmonic. Succeeding ERB bands are then created adjacently, upward in frequency using the lower band-edge frequency and the assumption that the ERB bandwidth is divided equally in

above and below the center frequency. This process continues until the uppermost band has a center frequency slightly less than or equal

. The ERB as a function of lower band-edge frequency is given by equation 2 :
第一个 ERB 波段以二次谐波为中心。然后，利用较低的带边频率，并假设ERB 带宽在中心频率上下平分

，依次向上创建 ERB 频带。这个过程一直持续到最上层频带的中心频率略低于或等于

。ERB 与低频带边缘频率的函数关系如公式 2 所示：

When the burst frequency is less than

, the

level from first ERB band centered on the second harmonic is replaced with the

value of the RMS sum of the 3 spectral components centered as much as possible around the

harmonic of the burst frequency. This reduces the effect of spectral splatter components just below the second harmonic, takes advantage of the fact that the only significant distortion product around the second harmonic frequency is the second harmonic, and is within 0.25

of the level obtained by summation over a larger bandwidth.
当猝发频率小于

时，以二次谐波为中心的第一个 ERB 波段的

电平将被尽可能以猝发频率的

谐波为中心的 3 个频谱分量的有效值之和的

值所取代。这样可以减少低于二次谐波的频谱飞溅分量的影响，利用二次谐波频率附近唯一重要的失真产物是二次谐波这一事实，并且与在更大带宽上求和得到的电平在 0.25

范围内。

The final step in the distortion-assessment method is determining the effect that the audibility of the multiple ERB or critical bands have on the overall perception of distortion.
失真评估方法的最后一步是确定多个 ERB 或临界频段的可听度对失真总体感知的影响。

Buus et al. [16] found that multiple critical-band components had increased audibility because of a statistical sharing of detection probabilities. This effect was shown to increase the audibility for

equally audible components by equation 3 .
Buus 等人[16] 发现，由于检测概率的统计共享，多个临界频段成分的可听度增加了。等式 3 表明，这种效应会增加

同样可听的分量的可听度。

In order to accommodate components with varying sensation levels and be accurate for a single component, equation 4 was derived in [11].
为了适应感官水平不同的组件，并对单个组件进行精确计算，[11] 推导出方程 4。

The Buus combination of individual audibility values is used to calculate a total-distortion audibility metric. Additionally, distortion audibility metrics are generated for the combination of

harmonics, designated low-order harmonics (LOH), and high-frequency distortion including ERB bands above

for sine-wave burst frequencies of

, respectively. The noise analysis of the quiet regions performs the calculation used for the total-distortion audibility assessment. As a result, each burst
单个可听度值的 Buus 组合用于计算总失真可听度指标。此外，还为

谐波、指定的低阶谐波 (LOH) 和高频失真（包括高于

或

的 ERB 波段，正弦波突发频率分别为

或

）组合生成失真可听度指标。对安静区域的噪声分析进行用于总失真可听性评估的计算。因此，每个脉冲串
generates the 4 audibility values for each burst in the sequence and are associated with the appropriate acoustic level.
为序列中的每个脉冲串生成 4 个可听度值，并与相应的声级相关联。

6 Distortion Assessment Examples
6 失真评估示例

Five examples are used to demonstrate the perceptual distortion assessment process. These are a subwoofer tested at

, a 3-way loudspeaker with a 15 " woofer tested at

, another 15 " woofer tested at

, a coaxial loudspeaker with a 12 " woofer tested at

, and a final coaxial loudspeaker with a

woofer tested at

. Three listening rooms were used for the tests, with the first two using their own listening rooms and the third listening room used for the last three tests. The first two examples employed a

SR, while the remaining three a

SR.
我们用五个例子来演示感知失真评估过程。它们是在

测试的超低音扬声器、在

测试的配备 15 英寸低音扬声器的三路扬声器、在

测试的另一个 15 英寸低音扬声器、在

测试的配备 12 英寸低音扬声器的同轴扬声器，以及在

测试的配备

低音扬声器的最后一个同轴扬声器。测试使用了三个试听室，前两个试听室使用了各自的试听室，后三个测试使用了第三个试听室。前两个例子使用的是

SR，其余三个使用的是

SR。

These five examples are shown to demonstrate the perceptual-distortion-assessment process at the designated reference position but ignores the effect of transfer of the fundamental and distortion products into the room and at other listener positions.
这五个例子展示了在指定参考位置上的感知失真评估过程，但忽略了基音和失真产品转移到房间和其他听者位置上的影响。

The examples shown use burst sequences ranging -21-0 dBFS, but sometimes are limited in the upper level so as not to exceed

at the reference position. The initial

burst is only used to allow for possible start-up effects and is ignored in all the tests. The perceptual-distortion assessment is performed for each burst and assembled into plots for the audibility of the total,

, and high-frequency distortion components in decibels. Additionally, the error due to background and equipment noises for each burst measurement are calculated using the same total-distortion audibility process but set to the quiet period

after the beginning of each the sine-wave burst.
所示示例使用范围为 -21-0 dBFS 的脉冲串序列，但有时会限制上限，以免在参考位置超过

。

初始脉冲串仅用于考虑可能的启动效应，在所有测试中均被忽略。对每个脉冲串进行感知失真评估，并以分贝为单位将总失真、

、高频失真和高频失真分量的可听度绘制成图。此外，使用相同的总失真可听度流程计算每个脉冲串测量的背景噪声和设备噪声造成的误差，但将其设置为正弦波脉冲串开始后的安静期

。

Each example includes plots of the distortion audibility values versus output level at the reference position and 1 or 2 individual-burst analyses to demonstrate the distortion-assessment process in detail.
每个示例都包括失真可听性值与参考位置输出电平的对比图，以及 1 或 2 个单个脉冲串分析，以详细演示失真评估过程。
In the distortion-audibility-versus-level plots the total-distortion audibility is shown by squares with crosses at each tested level, the LOH audibility as a dashed line, the high-frequency distortion audibility as a heavy solid line, and the noise error as a thin line connecting open circles.
在失真可听度与电平对比图中，总失真可听度用每个测试电平上带十字的正方形表示，LOH 可听度用虚线表示，高频失真可听度用粗实线表示，噪声误差用连接开放圆圈的细线表示。
The individual-burst analysis examples used for the second set of figures are indicated.
第二组图中使用的单个脉冲串分析示例已标出。

The individual burst-level plots have the appropriate masked threshold curve shown by a heavy black line and the fine-frequency-resolution distortion-product spectrum indicated by a thin solid black line.
单个脉冲串级图的相应屏蔽阈值曲线用粗黑线表示，细黑线实线表示细频率分辨率失真产物频谱。
The perceptually-relevant distortion level for each ERB band are shown by solid circles at the center frequencies of each ERB band, with ERB segments of the masking curve within each ERB band shown as a thin blue line going through each relevant ERB-band distortion level. The

test eliminates the ERB band centered at the

harmonic and replaces it with RMS sum of the 3 spectral components. This is indicated by a heavy solid line on each side of the solid circle at the

-harmonic frequency.
每个 ERB 波段的感知相关失真水平用每个 ERB 波段中心频率的实心圆圈表示，每个 ERB 波段内的掩蔽曲线 ERB 段用蓝色细线表示，穿过每个相关 ERB 波段的失真水平。

测试消除了以

谐波为中心的 ERB 波段，取而代之的是 3 个频谱分量的均方根和。在

谐波频率处的实心圆两侧各用一条粗实线表示。

The SL value for each ERB band is the difference between the solid-circle level and the masking-curve value at each associated ERB-band center frequency.
每个 ERB 波段的 SL 值是每个相关 ERB 波段中心频率的实心圆电平与掩蔽曲线值之差。
The audibility components of figure 11 are ratios between the fine-frequency-resolution distortion component levels to the masking-curve values at the same frequencies. The noise error is shown by open circles connected by a heavy black line.
图 11 中的可听度分量是相同频率下细微频率分辨率失真分量水平与掩蔽曲线值之间的比率。噪声误差用粗黑线连接的圆圈表示。
The level of the fundamental frequency and level are indicated by a small solid square. The three specified overall distortion audibility values, plus noise-error estimates are shown in text form at the upper right hand corner of these figures.
基频和电平以小实心方形表示。三个指定的总体失真可听度值以及噪声误差估计值以文字形式显示在这些数字的右上角。

The first example considered is a self-powered subwoofer consisting of six 10 " woofers driven at 20

and referenced to the level at

. This test is shown to be very challenging because of the large cone excursions and the relatively small auditory masking created by a

fundamental.
考虑的第一个例子是由六个 10 英寸低音扬声器组成的自供电低音扬声器，其驱动电压为 20

，参考电平为

。由于音盆偏移较大，而

基本音量所产生的听觉掩蔽相对较小，因此该测试具有很大的挑战性。

The measurement was performed in the first listening room that was

long

wide

high. The front face of the subwoofer was located

from the right side wall,

from the front wall. The microphone position was

from the front face of one pair of the 10 " woofers, away
测量在第一听音室进行，该听音室长

，宽

，高

。超低音扬声器的正面距离右侧墙壁

，距离前墙壁

。麦克风的位置是

，距离一对 10 英寸低音扬声器的正面，距离
from the front wall and at a height of the center of the subwoofer woofers. The listening-room noise level was very low at

below an

level and the reverberation time was

and

between

. The reverberation time at

was not measured but is expected to be

or greater. Because of this the microphone and subwoofer position were checked to confirm that low-frequency room modes did not significantly distort the results.
距离前墙和低音炮低音单元中心的高度。在

处，听音室的噪音水平非常低，低于

的水平，在

处的混响时间为

，在

之间的混响时间为

。

处的混响时间没有测量，但预计为

或更长。因此，对麦克风和低音炮的位置进行了检查，以确认低频房间模式不会严重扭曲结果。

Figure 12 shows the audibility values versus level between

, while figure 13 and 14 show the detailed analysis at 76.3 and

drive levels.
图 12 显示了

和

之间的可听度值与电平的关系，图 13 和 14 则显示了在 76.3 和

驱动电平下的详细分析。

Figure 12.20 Hz audible distortion vs. level for a subwoofer @ 1 meter
图 12.1 米处超重低音扬声器的 20 赫兹可听失真与电平的关系

Examining figure 12 shows that the total distortion is slightly audible at the

level. The distortion audibility values remain relatively constant until

. As the drive level increases above that, the distortion audibility increases to 14.1

. At the higher drive levels, the distortion components at and above

become the most audible. At this high level the

are

above audibility but are less likely to impair the sound quality because of their lower audibility value and the fact that

are less annoying than the high-frequency distortion components. Figure 12 also shows the presence of background noise only produces a slight error since it is assessed to be slightly below or just at audibility.
图 12 显示，总失真在

级时略微可闻。在

之前，失真可听度值保持相对稳定。随着驱动电平的增加，失真可听度增加到 14.1

。在较高的驱动水平下，位于

及以上的失真成分变得最清晰可闻。

在此高电平下，

，但由于其可听值较低，而且

与高频失真成分相比不那么令人讨厌，因此对音质造成损害的可能性较小。图 12 还显示，背景噪声的存在只会产生轻微的误差，因为它被评估为略低于或刚刚达到可听度。

Figure 13.20 Hz audible distortion @

for a subwoofer
图 13.20 赫兹可听失真 @

低音扬声器

Examining figure 13 shows that the harmonics between

are audible for the

drive level even though the fundamental signal is inaudible because it is below

, the

hearing threshold, as specified by the ISO standard 226 [17]. This is an undesirable condition where the distortion is audible but the fundamental is not.
图 13 显示，

之间的谐波在

驱动电平下是可以听到的，尽管基波信号是听不到的，因为它低于

，即

的听阈，如 ISO 标准 226 所规定[17]。这是一种不理想的情况，即可以听到失真信号，但听不到基波信号。
The background-noise error is assessed to be just below audibility and so it has only a minor effect on the distortion-audibility assessment.
经评估，背景噪声误差略低于可听度，因此对失真-可听度评估的影响很小。

Figure 14.

audible distortion @

for a subwoofer
图 14.

亚低音扬声器的可听失真 @

Figure 14 shows the situation at the

drive level. This shows that the subwoofer is producing audible-distortion components over the frequency range of

. The combination of these results in a total-distortion audibility of

. This test shows the difficulty in producing distortion-free sound reproduction at

.
图 14 显示了

驱动电平下的情况。这表明亚低音扬声器在

的频率范围内产生了可听到的失真成分。这些成分的组合导致

的总失真可听度。该测试表明，在

下很难重现无失真声音。

The second example examines the performance of the 15 " woofer in a powered 3-way loudspeaker at

. Figure 15 shows the audibility values versus level between

, while figure 16 and 17 show the detailed analysis at 102 and 108.8

drive levels.
第二个例子是在

上检测有源 3 路扬声器中 15 英寸低音扬声器的性能。图 15 显示了

与

之间的可听度值与电平的关系，图 16 和 17 则显示了在 102 和 108.8

驱动电平下的详细分析。

The measurement was performed in the second listening room that was

long

wide

high. The front face of the loudspeaker was located diagonally at

from the right side wall and

from the front wall. The microphone position was

from 15" woofer, away from the front wall and at a height at the center of the woofer. The listening-room noise level was very low at

below an

level and the reverberation time was

and

between

. The reverberation time at

was not measured but is expected to be approximately

.
测量在第二个聆听室进行，该聆听室长

，宽

，高

。扬声器的正面距离右侧墙壁的对角线位置为

，距离前墙壁的对角线位置为

。麦克风的位置是

，距离 15 英寸低音扬声器、前墙和低音扬声器中心的高度。听音室的噪音水平很低，

，低于

，混响时间为

，

。

处的混响时间没有测量，但预计大约为

。

Figure 15.

audible distortion vs. level for a 3-way loudspeaker with a 15 " woofer @ 1 meter
图 15.

15 英寸低音扬声器的三路扬声器在 1 米处的可听失真与电平的关系
Figure 15 shows that the 3-way loudspeaker has inaudible distortion to just below

. Above that, the distortion audibly values rise to

, primarily due to high-frequency distortion products above

. LOH-audibility values also rise to

at the maximum drive level. The error due to background noise is shown to be negligible at

.
图 15 显示，三路扬声器的失真在

以下时听不到。在此之上，失真可听值上升至

，这主要是由于高频失真产品高于

。在最大驱动电平时，LOH 的可听值也升至

。在

处，背景噪声造成的误差可以忽略不计。

Figure 16.50 Hz audible distortion @

for a 3 -way loudspeaker with a 15 " woofer
图 16.50 Hz 可听失真（

），三路扬声器，15 英寸低音扬声器

Figure 16 shows the distortion audibility at

where total distortion is just audible, demonstrating extremely good linearity because the combination of the

, and

harmonics just reach audibility. Because the combined audibility values calculated by equation 4 are more weighted by the maximum levels than even in an RMS calculation, ERB bands with lower SLs rapidly become insignificant.
图 16 显示了

处的失真可听度，在该处总失真刚好可听，显示了极好的线性度，因为

和

谐波的组合刚好达到可听度。由于公式 4 计算出的综合可听度值比有效值计算出的最大电平更有权重，因此 SL 值较低的 ERB 频段很快就变得不重要了。
The background noise does not affect the calculation of distortion audibility values in this example.
在本例中，背景噪声不会影响失真可听度值的计算。

Figure 17.

audible distortion @

for a 3 -way loudspeaker with a 15 " woofer
图 17.

带有 15 英寸低音扬声器的三路扬声器的可听失真

Figure 17 demonstrates the distortion audibility at the highest tested level of

at 1 meter. This circumstance shows that many of the ERB bands individually are possessing audible distortions. In addition, the ERB bands between

have even higher audibility values. The combination of all the ERB-band audibility values raises the total-distortion audibility value to

. The

harmonic is the most audible individual distortion product at approximately

above audibility, but probably not as annoying as the other distortion products.
图 17 显示了在 1 米处

的最高测试电平下的失真可听度。这种情况表明，许多 ERB 波段都存在可听失真。此外，

之间的 ERB 波段的可听度值甚至更高。所有 ERB 波段的可听度值加在一起，使总失真可听度值达到

。

谐波是可听度最高的单个失真产品，约高于

，但可能不如其他失真产品那样令人讨厌。

The third distortion assessment example is a test at

of a 15" woofer designed for high acoustic output pro-audio use. This time the level is quite high, at 91.5-110.3 dB referenced at a 3 meter distance.
第三个失真评估示例是在

测试专为高声学输出专业音响用途设计的 15 英寸低音扬声器。这次的水平相当高，在 3 米的距离上达到 91.5-110.3 dB。

The listening room the test was performed in had dimensions of

long

wide

high, with minor reductions in the height along the upper side and front walls. The listening-room noise level was low at

and the reverberation time was very constant between 121-169 ms for the range of

. The loudspeaker was placed diagonally

away from the measurement microphone. The microphone position was placed

from the front and

from one side and placed at the height of the center of the loudspeaker under test.
进行测试的听音室尺寸为

，长

，宽

，高沿上侧墙和前墙略有降低。听音室的噪音水平较低，为

或

，在

的范围内，混响时间非常稳定，介于 121-169 毫秒之间。扬声器放置在距离测量麦克风对角线

的位置。麦克风位置为

，从正面和一侧

，高度为被测扬声器的中心位置。

Figure 18 shows the distortion audibility values as a function of level between 91.5-110.3 dB. Additionally figures 19 and 20 show the detailed analyses for the lowest level of

and the highest level of

tested.
图 18 显示了 91.5-110.3 dB 之间的失真可听度值与电平的函数关系。此外，图 19 和图 20 显示了对

和

测试的最低电平和最高电平的详细分析。

Figure 18.

audible distortion vs. level for a 15"woofer @ 3 meters
图 18.

15 "低音扬声器在 3 米处的可听失真与电平的关系

Figure 18 shows the distortion performance at levels as high as

at 3 meters, where the distortion is just above or at audibility up to

. Also demonstrated is the utility of the noise-error estimate because this error sits at the just audible level for the lower drive levels and likely raises the distortion-assessment values to indicate distortion audibility when actually they are less audible.
图 18 显示了在 3 米处高达

的电平下的失真性能，其中

的失真刚刚超过或达到可听水平。噪声误差估计值的实用性也得到了证明，因为该误差位于较低驱动电平的可听水平，很可能会提高失真评估值，以显示失真可听度，而实际上它们的可听度较低。
The decrease in the effect of the noise error with rising drive level occurs because the masking curve rises for higher drive levels. At the highest drive level of

, the high-frequency distortion components drive the distortion to

above audibility. This time the LOH-distortion products are

higher than audibility but probably not as annoying as the high-frequency ones, as previously discussed.
噪声误差的影响随着驱动电平的上升而减小，这是因为驱动电平越高，掩蔽曲线越高。在最高驱动电平

时，高频失真成分将失真驱动到高于可听度的

。此时，LOH 失真产品

高于可听度，但可能不像前面讨论的高频失真产品那样令人讨厌。

Figure 19.100 Hz audible distortion @

for a 15 " woofer
图 19.100 赫兹可听失真 @

（15 英寸低音扬声器

Figure 19 shows that the highest ERB-band audibility values are just below the masking curve and result in virtually no audible impairment at

since the

and

harmonics and the high-frequency distortion products between 3.5-5.2

are barely audible. The effect of background noise only produces negligible error since it is has dropped to

below audibility, due to the rising of the masking curve for this drive level.
图 19 显示，ERB 频段的最高可听度值刚好低于屏蔽曲线，在

时几乎没有可听度损害，因为

和

谐波以及 3.5-5.2

之间的高频失真产品几乎听不到。背景噪声的影响只会产生可忽略不计的误差，因为它已下降到

以下，低于可听度，这是因为该驱动电平的掩蔽曲线在上升。

Figure 20.100 Hz audible distortion @

for a

woofer
图 20.100 赫兹可听失真 @

低音扬声器

Figure 20 shows the situation is different at 110.3

drive level. Now the combination of the

and

harmonics are

above audibility, while the high-frequency distortions in the

region are even more audible, at

above audibility. Again the

and

harmonics are less annoying than the high-frequency ones because of their lower audibility number and the fact they sound more musical in nature.
图 20 显示，在 110.3

的驱动电平下，情况有所不同。现在，

和

谐波的组合超过了

的可听度，而

区域的高频失真则更加明显，超过了

的可听度。同样，由于

和

谐波的可听度较低，而且听起来更具音乐性，因此它们比高频谐波更不令人讨厌。
Despite the predicted audible distortion, this loudspeaker demonstrates good distortion performance for the high acoustic levels tested.
尽管预计会出现听觉失真，但这款扬声器在高声级测试中表现出良好的失真性能。

The fourth example shows the performance of a high-output coaxial loudspeaker with a 12" woofer driven with a

sine wave, again referenced at a distance of 3 meters. The loudspeaker was placed in a similar position and in the same room as the previous example. Again, the Schoeps microphone was placed at 0.5 meters from the woofer to prevent its overload.
第四个示例显示的是高输出同轴扬声器的性能，该扬声器配有一个 12 英寸低音扬声器，由

正弦波驱动，同样以 3 米距离为基准。扬声器放置的位置和房间与上一示例相似。同样，Schoeps 麦克风放置在距离低音扬声器 0.5 米处，以防止其超载。
Figure 21 shows the distortion audibility values as a function of level between

and figure 22 shows the detailed analysis at the highest drive level.
图 21 显示了

之间的失真可听度值与电平的函数关系，图 22 显示了最高驱动电平下的详细分析。

Figure 21.200 Hz audible distortion vs. level for a coaxial loudspeaker with a 12" woofer @ 3 meters
图 21.12 英寸低音扬声器同轴扬声器在 3 米处的 200 赫兹可听失真与电平的关系

Figure 21 shows that this loudspeaker produces just audible distortion products only at the highest drive level. The effect of noise error is also minor and decreases with level due to increased masking at higher drive levels.

are equal or more than

below audibility.
图 21 显示，该扬声器仅在最高驱动电平下产生可听的失真产品。噪声误差的影响也很微小，并且随着音量的增加而减小，这是因为在较高的驱动音量下，掩蔽会增加。

等于或大于

，低于可听度。

Figure 22.200 Hz audible distortion @ 109.8 dB for a coaxial loudspeaker with a 12 " woofer
图 22.12 英寸低音扬声器同轴扬声器的 200 赫兹可听失真（109.8 分贝

Figure 22 shows that the distortion products are inaudible to

and only slightly audible in the

frequency range. This indicates very good distortion performance at this very high output level.
图 22 显示，在

中听不到失真产物，在

频率范围内只能略微听到。这表明，在这种极高的输出电平下，失真性能非常好。

The final example shows the performance of a high-output coaxial loudspeaker with an 8 " woofer driven with

sine-wave bursts at levels referenced at a distance of 3 meters. The loudspeaker was placed in a similar position and room as the previous example and the Schoeps microphone was placed at 0.5 meters from the woofer to prevent its overload.
最后一个示例显示了高输出同轴扬声器的性能，该扬声器配有一个 8 英寸低音扬声器，使用

正弦波脉冲串驱动，电平参考距离为 3 米。扬声器放置的位置和房间与上一示例相似，Schoeps 麦克风放置在距离低音扬声器 0.5 米处，以防止其过载。
Figure 23 shows the distortion audibility values as a function of level between

and figure 24 shows the detailed analyses at the highest drive level.
图 23 显示了

之间的失真可听度值与电平的函数关系，图 24 显示了最高驱动电平下的详细分析。

Figure 23.

audible distortion vs. level for a coaxial loudspeaker with a 8" woofer @ 3 meters
图 23.

带有 8 英寸低音扬声器的同轴扬声器在 3 米处的可听失真与电平的关系

Figure 23 shows a situation where above

the audible distortion products are

and only at levels above

do the high-frequency distortion products become audible. The effect of background and equipment noise is seen to be negligible, being more than

below audibility and are not visible in figure 23 .
图 23 显示的情况是，在

以上，可听到的失真产物是

，只有在

以上，高频失真产物才变得可听到。从图 23 中可以看出，背景噪声和设备噪声的影响微乎其微，低于

，在图 23 中看不到。

Figure 24.315 Hz audible distortion @

for a coaxial loudspeaker with a 8 " woofer
图 24.8 英寸低音扬声器同轴扬声器的 315 赫兹可听失真 @

Examining figure 24 shows that the

harmonic is the most audible distortion product with high-order harmonics above

just becoming audible. Since the

harmonic distortion is often present in music and is only

above audibility this is
图 24 显示，

谐波是最容易听到的失真产物，而

以上的高阶谐波才刚刚可以听到。由于

谐波失真经常出现在音乐中，而且仅

高于可听度，这就意味着
likely to not create a significant audible problem. The high-frequency distortion components are likely to be more objectionable but still only rise to

above audibility at the maximum drive level.
可能不会造成明显的听觉问题。高频失真成分可能会更令人反感，但在最大驱动电平下，仍只会上升到

以上。

7 Conclusions 7 结论

A perceptually relevant nonlinear distortion analysis method has been presented based on

sine-wave bursts with frequencies between

. These bursts are analyzed by a spectral comparison of critical-band levels to previously measured auditory masking curves for fundamental levels between 60-110 dB.
基于

频率在

之间的正弦波脉冲串，提出了一种与感知相关的非线性失真分析方法。这些脉冲串的分析方法是将临界频段电平与先前测量的基频电平在 60-110 dB 之间的听觉掩蔽曲线进行频谱比较。
This critical-band comparison generates audibility or sensation-level values for each critical band at the 2 nd harmonic and above to

. These are then combined to create audibility values for the total,

harmonic combination (LOH), and high-frequency distortion components above 1 or

, depending on the fundamental frequency. The effect of background or equipment noise is evaluated to test its effect on the nonlinear-distortion assessment process.
这种临界频段比较可为 2 次谐波及以上的每个临界频段生成可听度或感觉级值，

。然后，根据基频的不同，将这些值组合起来，得出总谐波、

谐波组合 (LOH) 和高于 1 或

的高频失真成分的可听度值。对背景噪声或设备噪声的影响进行评估，以测试其对非线性失真评估过程的影响。

Five examples where shown using sine-wave bursts of 20,50,100,200, and

to demonstrate this analysis method and the distortion-audibility characteristics of the measured loudspeakers. One important result was that degradation due to high-frequency distortion products of buzz and noise was very significant.
使用 20、50、100、200 和

的正弦波脉冲串展示了五个例子，以演示这种分析方法和所测扬声器的失真-可听特性。一个重要的结果是，由于嗡嗡声和噪声的高频失真产物造成的衰减非常明显。
Although the tested loudspeakers were not always shown to produce inaudible distortions, these examples actually had distortion properties much better than average loudspeakers.
虽然测试的扬声器并不总能产生听不见的失真，但这些例子的失真特性实际上比普通扬声器要好得多。

Because this distortion assessment method is based on how many decibels the distortion is above audibility for given a sine-wave drive, it is a very sensitive test.
由于这种失真评估方法是基于正弦波驱动器的失真高于可听度多少分贝，因此是一种非常灵敏的测试。
This is true because typical music reproduction involves signals with broad spectral content that may mask distortion components.
之所以如此，是因为典型的音乐重放涉及的信号具有宽广的频谱内容，可能会掩盖失真成分。
Despite this limitation and the fact that this test does not deal with annoyance, this test is a rapid method for determining what might need improvement in a loudspeaker design or an individual loudspeaker driver.
尽管有这一局限性，而且该测试并不涉及烦扰，但该测试是一种快速方法，可用于确定扬声器设计或单个扬声器驱动器中可能需要改进的地方。
The five examples shown were tests that referenced the distortion performance at one distance and sampled the acoustic output close to the test loudspeaker.
图中所示的五个示例都是参照一个距离的失真性能进行的测试，并对靠近测试扬声器的声音输出进行了采样。
Room effects and the effect of the directivity characteristics of the distortion mechanisms on the acoustic transfer to the listener was not included in the analyses.
房间效应和失真机制的指向性特征对听者的声学传递的影响未包括在分析中。
Sampling the acoustic output at close distances minimized the effect of room rattles excited by the loudspeaker under test, reducing a confounding factor unrelated to the loudspeaker.
在近距离对声音输出进行采样，最大程度地减少了被测扬声器所激发的房间响声的影响，从而减少了与扬声器无关的干扰因素。

Additionally, the effect of distortion audibility with the same drive but referenced to other distances was not included.
此外，还不包括使用相同驱动器但以其他距离为参照的失真可听度的影响。
This can be an important consideration because, as the acoustic attenuation varies, the signal and distortion products vary in level, causing the masking to vary.
这可能是一个重要的考虑因素，因为随着声衰减的变化，信号和失真产物的电平也会变化，从而导致掩蔽的变化。
This can cause the distortion audibility to increase or decrease, depending on whether the masking curve varies more rapidly or slowly than the distortion products.
这可能会导致失真可听度增加或降低，具体取决于掩蔽曲线的变化速度比失真产品快还是慢。

References 参考资料

[1] H. Y. Lin, "Measurement of Auditory Distortion with Relation Between Harmonic Distortion and Human Auditory Sensation," Transactions on Instrumentation and Measurement, vol. IM-35, no. 2, pp. 195-200 (1986 June).
[1] H. Y. Lin，"测量听觉失真与谐波失真和人的听觉感觉之间的关系"，《仪器仪表与测量》，第 IM-35 卷，第 2 期，第 195-200 页（1986 年 6 月）。

[2] M. A. Boer, A. G. J. Nijmeijer, H. Schurer, W. F. Druyvesteyn, C. H. Slump, and O. E. Herrmann, "Audibility of Nonlinear Distortion in Loudspeakers," presented at the

Audio Eng. Soc. Convention, convention paper 4718 (1998 October).
[2] M. A. Boer、A. G. J. Nijmeijer、H. Schurer、W. F. Druyvesteyn、C. H. Slump 和 O. E. Herrmann，"扬声器非线性失真可听性"，在

Audio Eng.会议论文 4718（1998 年 10 月）。

[3] E. R. Geddes and L. W. Lee," Auditory Perception of Nonlinear Distortion - Theory," presented at the

Audio Eng. Soc. Convention, convention paper 5890 (2003 October).
[3] E. R. Geddes 和 L. W. Lee，"非线性失真听觉感知--理论"，在

Audio Eng.学会大会，大会论文 5890（2003 年 10 月）。

[4] L. W. Lee and E. R. Geddes, "Auditory Perception of Nonlinear Distortion," presented at the

Audio Eng. Soc. Convention, convention paper 5891 (2003 October)
[4] L. W. Lee 和 E. R. Geddes，"对非线性失真的听觉感知"，在

Audio Eng.学会大会，大会论文 5891（2003 年 10 月）

[5] C. T. Tan, B. C. J. Moore and N. Zacharov, "The Effect of Nonlinear Distortion of the Perceived Quality of Music and Speech Signals," J. Audio Eng. Soc., vol. 51, no. 11, pp. 1012-1031 (2003 November).
[5] C. T. Tan、B. C. J. Moore 和 N. Zacharov，《非线性失真对音乐和语音信号感知质量的影响》，J. Audio Eng.51, no. 11, pp.

[6] A. Voishvillo, "Assessment of Nonlinearity in Transducers and Sound Systems - from THD to Perceptual Models," presented at the

Audio Eng. Soc. Convention, convention paper 6910 (2006 October).
[6] A. Voishvillo，"换能器和声音系统中的非线性评估 - 从 THD 到感知模型"，在

Audio Eng.学会大会，大会论文 6910（2006 年 10 月）。

[7] ITU BS.1387-1, "Method for Objective Measurement of Perceived Audio Quality," International Telecommunications Union, Geneva, Switzerland (2001).
[7] ITU BS.1387-1，《客观测量感知音频质量的方法》，国际电信联盟，瑞士日内瓦（2001 年）。

[8] S. Temme, P. Brunet, and D. B. Keele, "Practical Measurement of Loudspeaker Distortion Using a Simplified Auditory Perceptual Model," presented at the

Audio Eng. Soc. Convention, convention paper 7905 (2009 October).
[8] S. Temme、P. Brunet 和 D. B. Keele，"使用简化的听觉感知模型实际测量扬声器失真"，在

Audio Eng.会议论文 7905（2009 年 10 月）。

[9] S. Temme, P. Brunet, and P. Qarabaqi, "Measurement of Harmonic Distortion Audibility Using a Simplified Psychoacoustic Model," presented at the

Audio Eng. Soc. Convention, convention paper 8704 (2012 October).
[9] S. Temme、P. Brunet 和 P. Qarabaqi，《使用简化心理声学模型测量谐波失真可听性》，在

Audio Eng.会议，会议论文 8704（2012 年 10 月）。

[10] L. D. Fielder and E. M. Benjamin, "Subwoofer Performance for Accurate Reproduction of Music," J. Audio Eng. Soc., Vol. 36, pp. 443-455 (1988 June).
[10] L. D. Fielder 和 E. M. Benjamin，"准确再现音乐的低音炮性能"，J. Audio Eng.Soc.，第 36 卷，第 443-455 页（1988 年 6 月）。

[11] L. D. Fielder, "Perceptual Assessment of Headphone Distortion," presented at the

Audio Eng. Soc. Convention, convention paper 9841 (2017 October)
[11] L. D. Fielder，"耳机失真的感知评估"，在

Audio Eng.学会大会，大会论文 9841（2017 年 10 月）

[12] ANSI/CEA Standard, "Standard Method of Measurement for Powered Subwoofers, ANSI/CEA-2010" (2006 November)
[12] ANSI/CEA 标准，"有源超低音的标准测量方法，ANSI/CEA-2010"（2006 年 11 月）

[13] H. Fastl and E. Zwicker, Psychoacoustics, Facts and Models, third edition, pp. 66-70, Springer-Verlag (2007).
[13] H. Fastl 和 E. Zwicker，《心理声学、事实与模型》，第三版，第 66-70 页，Springer-Verlag（2007 年）。
[14] L. D. Fielder, "Evaluation of the Audible Distortion and Noise Produced by Digital Audio Converters," J. Audio Eng. Soc., Vol. 35, pp. 517-535 (1987 July/Aug.).
[14] L. D. Fielder，"评估数字音频转换器产生的可听失真和噪音"，《音频工程学报》，第 35 卷，第 517-535 页（1987 年 7 月/8 月）。Soc.，第 35 卷，第 517-535 页（1987 年 7 月/8 月）。

[15] B. C. J. Moore, An Introduction to the Psychology of Hearing, fifth edition, pp. 66-78, Academic Press (2003).
[15] B. C. J. Moore，《听觉心理学导论》，第五版，第 66-78 页，学术出版社（2003 年）。

[16] S. Buus, E. Schorer, M. Florentine, and E. Zwicker, "Decision Rules in Detection of Simple and Complex Tones," J. Acoust. Soc. Am., vol. 80, no. 6, pp. 1646-1657 (1986 December).
[16] S. Buus、E. Schorer、M. Florentine 和 E. Zwicker，"简单和复杂音调检测中的决策规则"，《声学杂志》，美国，第 80 卷，第 6 期，第 1646-1657 页（1986 年 12 月）。Soc.Am.，第 80 卷，第 6 期，第 1646-1657 页（1986 年 12 月）。

[17] "Acoustics - Normal equal-loudness-level contours," ISO Recommendation R226: 2003(E), Geneva, Switzerland, International Organization for Standardization (2003 August).
[17] "声学 - 正常等效声级等值线"，国际标准化组织建议书 R226：2003(E)，瑞士日内瓦，国际标准化组织（2003 年 8 月）。

Appendix 1 附录 1

These tables represent the masking curve values at

-octave ISO frequencies for

, 315,400 , and

fundamentals. The hearing threshold values derived from [11] are included in table 1 .
这些表格代表了

倍频程 ISO 频率下的掩蔽曲线值，分别为

、315,400 和

基本频率。表 1 列出了从 [11] 中得出的听阈值。

Frequency

114

104

Threshold

31.5

86.3

79.5

72.1

69.3

81.2

71.8

64.9

61.6

77.6

65.4

58.8

56.3

72.3

49.6

65.8

50.7

43.5

100

60.3

45.2

125

39.1

32.4

160

32.7

27.3

200

40.1

27.8

250

36.3

24.4

20.5

315

30.6

19.8

16.8

400

24.2

14.7

11.8

500

19.6

10.5

7.8

630

15.9

7.7

5.8

800

12.8

5.2

1000

10.2

5.2

1250

8.1

5.5

1600

5.7

2000

2.6

2500

-0.9

3150

-2.6

4000

-1.5

5000

0.9

6300

8000

10000

17.3

12500

15.8

16000

55.1

Table 1:

masking values + hearing threshold
表 1：

屏蔽值 + 听力阈值

Frequency

111

101

80.4

73.4

62.9

100

77.7

68.2

57.7

125

72.5

62.3

49.2

38.4

160

55.8

41.4

200

64.1

51.4

35.5

25.6

250

58.2

45.2

30.1

20.5

315

52.6

37.1

22.3

16.8

400

45.7

30.2

15.9

11.8

500

39.9

23.8

11.6

7.8

630

35.2

18.6

8.8

5.8

800

29.9

14.4

6.2

5.2

1000

25.7

11.1

5.2

1250

7.5

5.5

1600

14.7

5.7

2000

2.6

2500

-0.9

3150

-0.8

-2.6

4000

-1.5

5000

0.9

6300

8000

10000

17.3

12500

15.8

16000

55.1

Table 2:

masking values
表 2：

屏蔽值

Frequency

110

100

160

79.9

71.6

60.1

52.6

200

76.2

67.3

54.7

44.5

27.9

250

72.5

63.3

49.4

37.3

22.6

315

57.6

41.5

28.5

18.1

400

65.6

51.9

32.5

22.2

12.5

500

62.7

48.1

28.2

16.6

7.8

630

59.7

20.8

5.8

800

57.8

40.6

15.9

9.4

5.2

1000

54.6

35.6

13.5

5.2

1250

31.8

11.7

5.5

1600

46.5

25.2

9.7

5.7

2000	40	18.3	4.9	2.6	2.6
2500	30.1	10	-0.9	-0.9	-0.9
3150	19.2	3	-2.6	-2.6	-2.6
4000	11.4	-1.8	-1.5	-1.5	-1.5
5000	6.1	0.9	0.9	0.9	0.9
6300	7	7	7	7	7
8000	12	12	12	12	12
10000	17.3	17.3	17.3	17.3	17.3
12500	15.8	15.8	15.8	15.8	15.8
16000	55.1	55.1	55.1	55.1	55.1

Table 3:

masking values
表 3：

屏蔽值

Frequency

105

100

315

69.2

66.8

57.5

23.8

400

64.1

60.4

50.9

37.5

14.9

500

62.1

56.6

44.6

29.1

7.8

630

61.1

54.3

5.8

800

60.2

53.6

36.5

18.9

5.2

1000

58.1

51.6

31.3

13.1

5.2

1250

55.6

49.6

28.6

5.5

1600

53.1

45.3

23.8

5.7

2000

47.1

36.3

13.6

2.6

2500

35.5

22.6

3.6

-0.9

3150

24.1

14.1

-1.3

-2.6

4000

9.8

3.9

-1.5

5000

3.5

0.9

6300

8000

10000

17.3

12500

15.8

16000

55.1

Table 4:

masking values
表 4：

屏蔽值

Frequency

104

500

64.1

62.2

50.6

40.3

17.6

630

62.8

59.4

48.7

35.4

9.5

800

61.7

59.3

46.1

30.3

6.4

1000	61.8	58.4	44.5	23.6	5.2
1250	63.7	59.8	45	20.7	5.5
1600	64.3	58.1	43.1	16.1	5.7
2000	58.4	52.6	34.1	11.2	2.6
2500	52.9	45	24.6	5.1	-0.9
3150	47.2	39.2	17.7	2.3	-2.6
4000	41.5	29.9	11.4	2.1	-1.5
5000	33.1	22.4	7.3	0.9	0.9
6300	27.2	18.3	7	7	7
8000	16.1	12	12	12	12
10000	17.3	17.3	17.3	17.3	17.3
12500	15.8	15.8	15.8	15.8	15.8
16000	55.1	55.1	55.1	55.1	55.1

Table 5:

masking values
表 5：

屏蔽值

Frequency

104

630

59.2

50.8

40.1

800

66.2

61.9

49.1

35.6

10.9

1000

64.3

62.6

42.8

29.5

6.8

1250

63.5

59.7

38.8

24.2

5.5

1600

61.2

56.1

33.6

19.3

5.7

2000

57.2

50.6

29.2

12.8

2.6

2500

49.6

41.1

21.2

4.6

-0.9

3150

43.5

35.1

12.9

0.5

-2.6

4000

30.9

25.7

7.4

-1.5

5000

20.6

14.1

0.9

6300

11.4

8000

16.8

10000

17.3

12500

15.8

16000

55.1

Table 6:

masking values
表 6：

屏蔽值

Frequency

800

57.7

53.5

45.1

14.3

1000

55.2

42.6

9.4

1250

64.6

52.7

40.2

8.4

1600	61.7	51	34.1	7.5
2000	56.8	46.3	26.6	3.3
2500	49.8	39.6	16.7	-0.9
3150	45.5	30	11.1	-2.6
4000	37.9	22.5	7.4	-1.5
5000	26.8	16.6	4.6	0.9
6300	23.3	12.9	7	7
8000	19.3	12	12	12
10000	17.3	17.3	17.3	17.3
12500	15.8	15.8	15.8	15.8
16000	55.1	55.1	55.1	55.1

Table 7:

masking values
表 7：

屏蔽值

Appendix 2 附录 2

This defines the process to produce new masking curves at

increments by linear interpolation and extrapolation from those of appendix 1.
这定义了通过对附录 1 中的曲线进行线性插值和外推法，以

为增量生成新遮蔽曲线的过程。

To generate a new masking curve

for a particular fundamental frequency

,
为特定基频生成新的屏蔽曲线

、

where: 在哪里？

fundamental frequency

基频

vector of

octave ISO frequency values from one ISO value below 2 (FF) to

倍频程 ISO 频率值向量，从低于 2 (FF) 的一个 ISO 值到

the new fundamental frequency level in

中的新基频水平。

level: existing lower masking curve

水平：现有的下遮蔽曲线

level: existing higher masking curve

水平：现有较高的遮蔽曲线

maximum

for upward extrapolation

向上外推的最大

minimum

for downward extrapolation threshold

hearing threshold curve

向下外推阈值的最小听力阈值曲线

Interpolation process 插值过程

interpolation(

:
插值（

：

for

对于

(4)

new masking curves are defined by equation (5)

fractL

新的遮蔽曲线由公式 (5) 定义 fractL

Extrapolation upward 向上推断

upward_extrapolation(FF,dBmax,dBL,dBH) (6)
上推（FF、dBmax、dBL、dBH） (6)

for

对于

(9)

new masking curves are defined by equation (10):

fract

新的遮蔽曲线由公式 (10) 定义：

fract

Extrapolation downward 向下推断

downward_extrapolation(FF,dBmin,dBL,dBH)
向下外推法（FF,dBmin,dBL,dBH）

for

对于

fract

碎裂

(14)

New masking curves are defined by equation (15):

新的遮蔽曲线由公式 (15) 定义：

The interpolated and extrapolated masking curves between

for

, and

are generated by equations

by the succeeding processes.

之间的内推和外推掩蔽曲线为

，而

则是由方程

通过后续过程生成的。

For masking curves at

对于在

interpolation

插值

interpolation

插值

downward_extrapolation(20,60,94,104)
向下外推法（20,60,94,104）

For masking curves at

对于在

interpolation(50,101,111)

interpolation(50,91,101)

interpolation

插值

downward_extrapolation(50,60,81,91)
向下外推法（50,60,81,91）

For masking curves at

对于在

interpolation

插值

interpolation(100,90,100)

interpolation

插值

interpolation(100,60,80)

For masking curves at

对于在

upward_extrapolation(200,110,100,105)

interpolation(200,100,105)

interpolation

插值

interpolation

插值

interpolation(200,60,80)

For masking curves at

对于在

upward_extrapolation(315,110,99,104)

interpolation

插值

interpolation

插值

interpolation

插值

interpolation(315,59,79)

For masking curves at

对于在

upward_extrapolation(400,110,99,104)

interpolation

内插

interpolation

内插

interpolation

内插

interpolation

内插

For masking curves at

对于在

upward_extrapolation

向上外推法

interpolation

内插

interpolation

内插

interpolation(500,58,78)

AES 147th Convention, New York, USA, 2019 October 16-19
AES 第 147 届大会，美国纽约，2019 年 10 月 16-19 日

Page 20 of 20
第 20 页，共 20 页

Aup10 Audio Engineering Society Convention Paper 10242 Aup10 音频工程学会大会论文 10242

Abstract 摘要

Perceptual Assessment of Distortion In Low-Frequency Loudspeakers低频扬声器失真感知评估

Abstract 摘要

1 Introduction 1 引言

2 Design of Sine-Wave Bursts2 正弦波脉冲串的设计

3 Test Configuration 3 测试配置

4 Masking Curve Derivation4 掩蔽曲线推导

5 Distortion-Assessment Algorithm5 失真评估算法

6 Distortion Assessment Examples6 失真评估示例

7 Conclusions 7 结论

References 参考资料

Appendix 1 附录 1

Appendix 2 附录 2

Interpolation process 插值过程

Extrapolation upward 向上推断

Extrapolation downward 向下推断

Aup10 Audio Engineering Society Convention Paper 10242
Aup10 音频工程学会大会论文 10242

Perceptual Assessment of Distortion In Low-Frequency Loudspeakers
低频扬声器失真感知评估

2 Design of Sine-Wave Bursts
2 正弦波脉冲串的设计

4 Masking Curve Derivation
4 掩蔽曲线推导

5 Distortion-Assessment Algorithm
5 失真评估算法

6 Distortion Assessment Examples
6 失真评估示例