Publications
2015
Liberatore, C; Gutierrez-Osuna, R. Joint Optimization of Anatomical and Gestural Parameters in a Physical Vocal Tract Model. Proceedings Article. In: ICASSP, IEEE, 2015. Tags: Accent conversion, Articulatory inversion, Articulatory synthesis
2013
Aryal, S; Gutierrez-Osuna, R. Articulatory inversion and synthesis: towards articulatory-based modification of speech. Proceedings Article. In: 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7952-7956, 2013. Tags: Articulatory inversion, Articulatory synthesis, Speech
2012
Aryal, S; Gutierrez-Osuna, R. Articulatory Inversion and Synthesis: Towards Articulatory-Based Modification of Speech. Technical Report, 2012. Tags: Articulatory inversion, Articulatory synthesis, Speech
Abstract: Certain speech modifications, such as changes in foreign/regional accents or articulatory styles, are performed more effectively in the articulatory domain than in the acoustic domain. Though measuring articulators is cumbersome, articulatory parameters may be estimated from acoustics through inversion. In this paper, we study the impact on synthesis quality when articulators predicted from acoustics are used in articulatory synthesis. For this purpose, we trained a GMM articulatory synthesizer and drove it with articulators predicted with an RBF-based inversion model. Using inverted instead of measured articulators degraded synthesis quality, as measured through Mel cepstral distortion and subjective tests. However, retraining the synthesizer with predicted articulators not only reversed the effect of errors introduced during inversion but also improved synthesis quality relative to using measured articulators. These results suggest that inverted articulators do not compromise synthesis quality, and open up the possibility of performing speech modification in the articulatory domain through inversion.
Aryal, S; Huang, J; Felps, D; Gutierrez-Osuna, R. Boosting Automatic Speech Recognition through Articulatory Inversion. Technical Report, 2012. Tags: Articulatory inversion
Abstract: This paper explores whether articulatory features predicted from speech acoustics through inversion may be used to boost the recognition of context-dependent units when combined with acoustic features. For this purpose, we performed articulatory inversion on a corpus containing acoustic and electromagnetic articulography recordings from a single speaker. We then compared the performance of an HMM-based diphone classifier on the individual feature sets (acoustic, articulatory, inversion) as well as on their combinations. To make good use of the limited corpus, we used a factorized representation that first classified diphones into broad overlapping categories and then combined them using a maximum-a-posteriori criterion. When comparing the individual feature sets, our results show no degradation in classification performance when predicted articulators are used instead of ground-truth articulators. Further, performance on the acoustic feature set improved by 10% when adding ground-truth articulators and by 5% when adding predicted articulators.