A comparative study of glottal source estimation techniques

Publication year: 2012
Source: Computer Speech & Language, Volume 26, Issue 1, January 2012, Pages 20-34

Thomas Drugman, Baris Bozkurt, Thierry Dutoit

 Abstract: Source-tract decomposition (or glottal flow estimation) is one of the basic problems of speech processing. For this, several techniques have been proposed in the literature. However, studies comparing different approaches are almost nonexistent. Besides, experiments have been systematically performed either on synthetic speech or on sustained vowels. In this study we compare three of the main representative state-of-the-art methods of glottal flow estimation: closed-phase inverse filtering, iterative and adaptive inverse filtering, and mixed-phase decomposition. These techniques are first submitted to an objective assessment test on synthetic speech signals. Their sensitivity to various factors affecting the estimation quality, as well as…

 Highlights: ► CPIF and CCD give the best results on synthetic speech. ► NAQ, H1–H2 and HRF are useful glottal parameters. ► Glottal flow can be used for voice quality analysis. ► Methods are tested on a large speech database.