Multimedia Communication-4-5-2018

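These notes introduce Delta Modulation (DM) and Adaptive Differential Pulse Code Modulation (ADPCM) in multimedia communication, then cover the psychoacoustic principles behind MPEG audio coding, including the range of human hearing, frequency masking, and critical bands.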

Multimedia Communication

Instructor: Huang Xiaoyan (黄晓燕)

DM(Delta Modulation)

DM is a simplified version of DPCM. Uniform-Delta DM: uses only a single quantized error value, either positive or negative, which makes it a 1-bit coder.

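if e_n > 0, ẽ_n = +k; else ẽ_n = -k, where e_n is the prediction error, ẽ_n is its quantized value, and k is a constant step size.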
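
The quantization rule above can be sketched in a few lines of Python. This is a minimal illustration, not code from the lecture: the names `dm_encode` and `dm_decode`, the `start` parameter, and the choice of the previous reconstructed value as the prediction are all assumptions made for the example.

```python
def dm_encode(samples, k, start=0.0):
    """Uniform delta modulation: emit one bit per input sample."""
    bits = []
    recon = start  # reconstruction the decoder will also have; used as the prediction
    for x in samples:
        e = x - recon            # prediction error e_n
        bit = 1 if e > 0 else 0  # 1 stands for +k, 0 stands for -k
        recon += k if bit else -k
        bits.append(bit)
    return bits


def dm_decode(bits, k, start=0.0):
    """Rebuild the staircase approximation from the bit stream."""
    out = []
    recon = start
    for bit in bits:
        recon += k if bit else -k
        out.append(recon)
    return out


signal = [10, 11, 13, 15]
bits = dm_encode(signal, k=4, start=10)
decoded = dm_decode(bits, k=4, start=10)
print(bits)     # [0, 1, 1, 1]
print(decoded)  # [6, 10, 14, 18]
```

With a fixed step k the reconstruction can only move by k per sample, so it lags behind steep signals and oscillates around flat ones.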

Adaptive DM: changes the step size k adaptively.
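
The notes do not say how the step size is adapted, so the sketch below assumes one simple rule purely for illustration: grow the step when two consecutive bits agree (the signal is running away from the reconstruction) and shrink it when they differ. The function names, the factors 1.5 and 0.5, and the clamping limits are all hypothetical.

```python
def next_step(k, bit, prev_bit, grow=1.5, shrink=0.5, k_min=0.25, k_max=16.0):
    """Assumed adaptation rule: same bit twice grows k, a sign flip shrinks it."""
    if prev_bit is None:
        return k  # first sample: keep the initial step
    k = k * grow if bit == prev_bit else k * shrink
    return min(max(k, k_min), k_max)


def adm_encode(samples, k0=1.0, start=0.0):
    bits, recon, k, prev = [], start, k0, None
    for x in samples:
        bit = 1 if x - recon > 0 else 0
        k = next_step(k, bit, prev)  # depends only on bits, so the decoder can mirror it
        recon += k if bit else -k
        bits.append(bit)
        prev = bit
    return bits


def adm_decode(bits, k0=1.0, start=0.0):
    out, recon, k, prev = [], start, k0, None
    for bit in bits:
        k = next_step(k, bit, prev)
        recon += k if bit else -k
        out.append(recon)
        prev = bit
    return out


ramp = [5, 10, 15, 20, 25, 30]
adm_bits = adm_encode(ramp, k0=1.0)
adm_out = adm_decode(adm_bits, k0=1.0)
print(adm_bits)
print(adm_out)
```

Because the step update uses only the transmitted bits, no side information is needed. On this steep ramp the reconstruction climbs much faster than a fixed step of 1 would allow.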

ADPCM (Adaptive DPCM)

ADPCM makes the predictor coefficients adaptive.

Contents

MPEG Audio

Psychoacoustics
The range of human hearing is about 20 Hz to 20 kHz.
The frequency range of the voice is typically only about 500 Hz to 4 kHz.
Threshold of Hearing
Fletcher-Munson Equal-Loudness Curves
The human ear's sensitivity varies with frequency. It is most sensitive from 1 kHz to 5 kHz.
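
To make the threshold of hearing concrete, the sketch below evaluates a commonly used closed-form approximation of the threshold in quiet (usually attributed to Terhardt). The formula is not given in these notes and is brought in here as an assumption; the function name `threshold_db` is illustrative only.

```python
import math


def threshold_db(f_hz):
    """Approximate threshold in quiet, in dB, for a frequency in Hz."""
    khz = f_hz / 1000.0
    return (3.64 * khz ** -0.8
            - 6.5 * math.exp(-0.6 * (khz - 3.3) ** 2)
            + 1e-3 * khz ** 4)


# Scan the audible range and find where the ear is most sensitive.
freqs = range(20, 20001, 10)
most_sensitive = min(freqs, key=threshold_db)

for f in (100, 1000, 3300, 15000):
    print(f, round(threshold_db(f), 2))
print("lowest threshold near", most_sensitive, "Hz")
```

Under this approximation the curve is lowest a little above 3 kHz, which agrees with the 1 kHz to 5 kHz sensitivity range stated above, and it rises steeply toward both ends of the audible range.
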
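Frequency masking

If a very loud tone is produced, quieter sounds at nearby frequencies cannot be heard.
Masking by pure tones broadly follows these rules: low tones easily mask high tones, whereas high tones mask low tones less readily; pure tones that are close in frequency easily mask each other; raising the sound pressure level of the masker raises the masking threshold and widens the range of masked frequencies.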

Critical bands
A critical band represents the ear's resolving power for simultaneous tones or partials.

Because of masking, the ear is not very discriminating within a critical band.
At low frequencies the critical band is narrow; as frequency increases, the band becomes wider.
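
The widening of critical bands with frequency can be illustrated numerically. The sketch below uses the Zwicker-Terhardt analytic approximation of critical bandwidth, which does not appear in these notes and is assumed here only as one well-known model; `critical_bandwidth_hz` is a hypothetical helper name.

```python
def critical_bandwidth_hz(f_hz):
    """Approximate critical bandwidth in Hz around a centre frequency in Hz."""
    khz = f_hz / 1000.0
    return 25.0 + 75.0 * (1.0 + 1.4 * khz ** 2) ** 0.69


for f in (100, 500, 1000, 4000, 10000):
    print(f, "Hz ->", round(critical_bandwidth_hz(f)), "Hz wide")
```

Under this model the band is roughly 100 Hz wide at low frequencies and grows to a couple of kilohertz near 10 kHz.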

Temporal masking

Test tones with frequencies near the masking tone are the most masked.

MPEG Audio (Moving Picture Experts Group)
Most of the complexity increase is at the encoder, not the decoder.

Reposted from: https://www.cnblogs.com/zysps1/p/multi_media_communication_4_8.html
