/   Home   /   Newsroom   /   Research News

Novel Single Channel Speech Dereverberation Method Using Guided Spectrogram Filtering Developed

Apr 16, 2018     Email"> PrintText Size

Speech dereverberation is important for hands-free speech communication systems and human-machine speech interfaces. In reverberant environments, speech quality and speech intelligibility may degrade dramatically due to acoustic reverberation.

In addition, speech recognition often fails in highly reverberant conditions. How to effectively suppress late reverberation noise becomes a difficult and hot topic in recent years.

Conventional single-channel speech dereverberation methods usually blindly estimate some room acoustic parameters, such as the reverberation time (T60) or the damping constant.

By comparing a speech spectrogram into an image, ZHENG Chengshi and his colleagues from the Institute of Acoustics (IOA) of the Chinese Academy of Sciences proposed a novel single channel speech dereverberation method using guided spectrogram filtering. This can significantly improve speech quality with less computational cost and requires neither room acoustic parameter estimation nor late reverberant spectral variance estimation.

The paper entitled "Guided Spectrogram Filtering for Speech Dereverberation" was published in Applied Acoustics.

Acoustic reverberation has impact on clean speech spectrograms. Considering a clean speech spectrogram as a clean image, its corresponding reverberant version is a corrupted image covered by mist (Figure 1). Based on this fact, researchers proposed a guided spectrogram filtering method to reduce acoustic reverberation.

 

Figure 1. Waveforms and normalized spectrograms of the clean speech (a), (b), the reverberant speech with T60 = 400 ms (c), (d). (Image by IOA)

In the proposed guided spectrogram filtering (GSF) method, there is no need to estimate the late reverberant spectral variance to suppress the late reverberant speech component. As a result, neither the late reverberant spectral variance estimation nor room acoustic parameter estimation is necessary to implement the proposed method.

Objective comparison results, as shown in Figure 2, indicate that the proposed GSF method is economical in single-channel speech dereverberation compared with conventional methods.

 

Figure 2. Waveforms and normalized spectrograms of the clean speech (a), (b), the reverberant speech with T60 = 400 ms enhanced by the proposed guided spectrogram filtering method (c), (d). (Image by IOA)

Attachment:

(Editor: LI Yuan)

Contact

Related Articles

noise;algorithms;speech;enhancement;studies;reduction;signal;Acoustics;MC-SE;TC-PF;

Two-Channel Post-filtering Based on Adaptive Smoothing and Noise Properties

Sep 01, 2011

Multi-channel speech enhancement(MC-SE)could solve the drawbacks of single-channel speech enhancement(SC-SE)algorithms. Among the MC-SE algorithms,the two-channel adaptive wiener post-filtering(TC-PF)is a simple and effective way for both noise reducti...

Research;progress;in;speech;translation;National;Laboratory;of;Pattern;Recognitionthe;CAS;Institute;of;Automation

Research progress in speech translation

Jun 02, 2003


Hemisphere;dominance;for;early;speech;processing;depends;on;acoustic;cues;;CAS;scientists;findUniversity;of;Science;and;Technology;of;China

Hemisphere dominance for early speech processing depends on acoustic cues, CAS scientists find

Jan 11, 2007


Contact Us

Copyright © 2002 - Chinese Academy of Sciences