2024
|
Quamer, W.; Das, A.; Gutierrez-Osuna, R. Speech synthesis and pronunciation teaching Book Chapter In: Chapelle, C. A.; Levis, J.; Munro, M.; Nagle, C.; Huensch, A. (Ed.): 2024. @inbook{quamer2024chapter,
title = {Speech synthesis and pronunciation teaching},
author = {W. Quamer and A. Das and R. Gutierrez-Osuna},
editor = {C. A. Chapelle and J. Levis and M. Munro and C. Nagle and A. Huensch},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2024/09/book-chapter-preprint.pdf},
year = {2024},
date = {2024-12-31},
urldate = {2024-12-31},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inbook}
}
|
Quamer, W.; Gutierrez-Osuna, R. End-to-end streaming model for low-latency speech anonymization Proceedings Article In: Proc. IEEE Spoken Language Technology Workshop (SLT 2024), 2024. @inproceedings{waris2024slt,
title = {End-to-end streaming model for low-latency speech anonymization},
author = {W. Quamer and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2024/09/waris2024slt.pdf},
year = {2024},
date = {2024-12-02},
urldate = {2024-12-02},
booktitle = {Proc. IEEE Spoken Language Technology Workshop (SLT 2024)},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Das, A.; Gutierrez-Osuna, R. Improving mispronunciation detection using speech reconstruction Journal Article Forthcoming In: IEEE/ACM Transactions on Audio, Speech and Language Processing, Forthcoming. @article{anurag2024taslp,
title = {Improving mispronunciation detection using speech reconstruction},
author = {A. Das and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2024/07/anurag-taslp-2024.pdf},
year = {2024},
date = {2024-07-19},
journal = {IEEE/ACM Transactions on Audio, Speech and Language Processing},
keywords = {Accent conversion, Speech},
pubstate = {forthcoming},
tppubtype = {article}
}
|
Silpachai, A.; Neiriz, R.; Novotny, M.; Gutierrez-Osuna, R.; Levis, J. M.; Chukharev, E. Corrective feedback accuracy and pronunciation improvement: Feedback that is ‘good enough’ Journal Article In: Language Learning & Technology, vol. 28, iss. 1, pp. 1–16, 2024. @article{silpachai2024llt,
title = {Corrective feedback accuracy and pronunciation improvement: Feedback that is ‘good enough’},
author = {A. Silpachai and R. Neiriz and M. Novotny and R. Gutierrez-Osuna and J. M. Levis and E. Chukharev},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2024/07/llt-2024-good-enough.pdf},
year = {2024},
date = {2024-06-17},
urldate = {2024-06-17},
journal = {Language Learning & Technology},
volume = {28},
issue = {1},
pages = {1–16},
keywords = {Speech},
pubstate = {published},
tppubtype = {article}
}
|
2023
|
Quamer, W.; Das, A.; Gutierrez-Osuna, R. Decoupling segmental and prosodic cues of non-native speech through vector quantization Proceedings Article In: Proc. Interspeech, 2023. @inproceedings{waris2023interspeech,
title = {Decoupling segmental and prosodic cues of non-native speech through vector quantization},
author = {Waris Quamer and Anurag Das and Ricardo Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2023/11/decouplingProsody2023waris.pdf},
year = {2023},
date = {2023-08-20},
urldate = {2023-08-20},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Speech, Voice conversion},
pubstate = {published},
tppubtype = {inproceedings}
}
|
2022
|
Quamer, W.; Das, A.; Levis, J.; Chukharev-Hudilainen, E.; Gutierrez-Osuna, R. Zero-Shot Foreign Accent Conversion without a Native Reference Proceedings Article In: Proc. Interspeech, 2022. @inproceedings{waris2022interspeech,
title = {Zero-Shot Foreign Accent Conversion without a Native Reference},
author = {W. Quamer and A. Das and J. Levis and E. Chukharev-Hudilainen and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2023/02/quamer2022interspeech.pdf},
year = {2022},
date = {2022-09-18},
urldate = {2022-09-18},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Deep learning, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Liberatore, C.; Gutierrez-Osuna, R. Minimizing residuals for native-nonnative voice conversion in a sparse, anchor-based representation of speech Proceedings Article In: Proc. ICASSP, 2022. @inproceedings{liberatore2022icassp,
title = {Minimizing residuals for native-nonnative voice conversion in a sparse, anchor-based representation of speech},
author = {C. Liberatore and R. Gutierrez-Osuna},
year = {2022},
date = {2022-05-22},
booktitle = {Proc. ICASSP},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
2021
|
Ding, S.; Zhao, G.; Gutierrez-Osuna, R. Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning Journal Article In: Computer Speech & Language, 2021. @article{shaojin2021accentron,
title = {Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning},
author = {S. Ding and G. Zhao and R. Gutierrez-Osuna},
url = {https://www.sciencedirect.com/science/article/pii/S0885230821001029
https://psi.engr.tamu.edu/wp-content/uploads/2021/10/1-s2.0-S0885230821001029-main.pdf},
year = {2021},
date = {2021-10-14},
urldate = {2021-10-14},
journal = {Computer Speech & Language},
keywords = {Accent conversion, Deep learning, Speech},
pubstate = {published},
tppubtype = {article}
}
|
Liberatore, C.; Gutierrez-Osuna, R. An Exemplar Selection Algorithm For Native-Nonnative Voice Conversion Proceedings Article In: Proc. Interspeech, 2021. @inproceedings{chris2021ARS,
title = {An Exemplar Selection Algorithm For Native-Nonnative Voice Conversion},
author = {C. Liberatore and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2022/07/liberatore21_interspeech.pdf},
year = {2021},
date = {2021-08-30},
urldate = {2021-08-30},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Silpachai, A.; Rehman, I.; Barriuso, T. A.; Levis, J.; Chukharev-Hudilainen, E.; Zhao, G.; Gutierrez-Osuna, R. Effects Of Voice Type And Task On L2 Learners’ Awareness Of Pronunciation Errors Proceedings Article In: Proc. Interspeech, 2021. @inproceedings{alif2021interspeech,
title = {Effects Of Voice Type And Task On L2 Learners’ Awareness Of Pronunciation Errors},
author = {A. Silpachai and I. Rehman and T. A. Barriuso and J. Levis and E. Chukharev-Hudilainen and G. Zhao and R. Gutierrez-Osuna},
year = {2021},
date = {2021-08-30},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Hair, A.; Zhao, G.; Ahmed, B.; Ballard, K. J.; Gutierrez-Osuna, R. Assessing Posterior-Based Mispronunciation Detection on Field-Collected Recordings from Child Speech Therapy Sessions Proceedings Article In: Proc. Interspeech, 2021. @inproceedings{adam2021interspeech,
title = {Assessing Posterior-Based Mispronunciation Detection on Field-Collected Recordings from Child Speech Therapy Sessions},
author = {A. Hair and G. Zhao and B. Ahmed and K. J. Ballard and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2021/08/hair_interspeech_2021.pdf},
year = {2021},
date = {2021-08-30},
booktitle = {Proc. Interspeech},
keywords = {Automatic Speech Recognition, Childhood apraxia of speech, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Zhao, G.; Ding, S.; Gutierrez-Osuna, R. Converting Foreign Accent Speech Without a Reference Journal Article In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 2367, 2021. @article{guanlong2021reference-free,
title = {Converting Foreign Accent Speech Without a Reference},
author = {G. Zhao and S. Ding and R. Gutierrez-Osuna},
url = {https://ieeexplore.ieee.org/abstract/document/9477581
https://psi.engr.tamu.edu/wp-content/uploads/2021/08/zhao2021reference.pdf},
year = {2021},
date = {2021-07-01},
urldate = {2021-07-01},
journal = {IEEE/ACM Transactions on Audio, Speech, and Language Processing},
volume = {29},
pages = {2367},
keywords = {Accent conversion, Deep learning, Speech},
pubstate = {published},
tppubtype = {article}
}
|
Hair, A.; Ballard, K. J.; Markoulli, C.; Monroe, P.; McKechnie, J.; Ahmed, B.; Gutierrez-Osuna, R. A Longitudinal Evaluation of Tablet-Based Child Speech Therapy with Apraxia World Journal Article In: ACM Transactions on Accessible Computing, vol. 14, no. 1, 2021. @article{adam2021TACCESS,
title = {A Longitudinal Evaluation of Tablet-Based Child Speech Therapy with Apraxia World},
author = {A. Hair and K. J. Ballard and C. Markoulli and P. Monroe and J. McKechnie and B. Ahmed and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2021/08/hair_taccess_2021.pdf},
doi = {10.1145/3433607},
year = {2021},
date = {2021-03-15},
journal = {ACM Transactions on Accessible Computing},
volume = {14},
number = {1},
abstract = {Digital games can make speech therapy exercises more enjoyable for children and increase their motivation during therapy. However, many such games developed to date have not been designed for long-term use. To address this issue, we developed Apraxia World, a speech therapy game specifically intended to be played over extended periods. In this study, we examined pronunciation improvements, child engagement over time, and caregiver and automated pronunciation evaluation accuracy while using our game over a multi-month period. Ten children played Apraxia World at home during two counterbalanced four-week treatment blocks separated by a two-week break. In one treatment phase, children received pronunciation feedback from caregivers and in the other treatment phase, utterances were evaluated with an automated framework built into the game. We found that children made therapeutically significant speech improvements while using Apraxia World, and that the game successfully increased engagement during speech therapy practice. Additionally, in offline mispronunciation detection tests, our automated pronunciation evaluation framework outperformed a traditional method based on goodness of pronunciation scoring. Our results suggest that this type of speech therapy game is a valid complement to traditional home practice.},
keywords = {Games, Health, Speech},
pubstate = {published},
tppubtype = {article}
}
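For context on the baseline mentioned in this abstract: the traditional goodness-of-pronunciation (GoP) score is commonly defined per phone as a duration-normalized log-posterior ratio (Witt & Young's formulation; the paper's exact variant may differ):
\mathrm{GoP}(p) \;=\; \frac{1}{N_p}\,\Bigl|\,\log P(O_p \mid p) \;-\; \max_{q \in \mathcal{Q}} \log P(O_p \mid q)\Bigr|
where O_p denotes the acoustic frames force-aligned to phone p, N_p is the number of those frames, and \mathcal{Q} is the phone inventory; a low GoP flags a likely mispronunciation.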
|
2020
|
Ding, S.; Zhao, G.; Gutierrez-Osuna, R. Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition Proceedings Article In: Proc. Interspeech, 2020. @inproceedings{shaojin-2020-interspeech,
title = {Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition},
author = {S. Ding and G. Zhao and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2020/08/IS2020_shaojin_Adversarial_speaker_classifier_camera_ready.pdf},
year = {2020},
date = {2020-10-25},
urldate = {2020-10-25},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Deep learning, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Das, A.; Zhao, G.; Levis, J.; Chukharev-Hudilainen, E.; Gutierrez-Osuna, R. Understanding the Effect of Voice Quality and Accent on Talker Similarity Proceedings Article In: Proc. Interspeech, 2020. @inproceedings{anurag-2020-interspeech,
title = {Understanding the Effect of Voice Quality and Accent on Talker Similarity},
author = {A. Das and G. Zhao and J. Levis and E. Chukharev-Hudilainen and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2023/02/das2020interspeech.pdf},
year = {2020},
date = {2020-10-24},
urldate = {2020-10-24},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Deep learning, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Lučić, I.; Silpachai, A.; Levis, J.; Zhao, G.; Gutierrez-Osuna, R. The English Pronunciation of Arabic Speakers - A Data-Driven Approach to Segmental Error Identification Journal Article In: Language Teaching Research, 2020. @article{ivana2020ltr,
title = {The English Pronunciation of Arabic Speakers - A Data-Driven Approach to Segmental Error Identification},
author = {I. Lučić and A. Silpachai and J. Levis and G Zhao and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2020/06/1362168820931888.pdf},
year = {2020},
date = {2020-06-18},
journal = {Language Teaching Research},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {article}
}
|
McKechnie, J.; Ahmed, B.; Gutierrez-Osuna, R.; Murray, E.; McCabe, P.; Ballard, K. The influence of type of feedback during tablet-based delivery of intensive treatment for childhood apraxia of speech Journal Article In: Journal of Communication Disorders, 2020. @article{McKechnie2020tabbyTalks,
title = {The influence of type of feedback during tablet-based delivery of intensive treatment for childhood apraxia of speech},
author = {J. McKechnie and B. Ahmed and R. Gutierrez-Osuna and E. Murray and P. McCabe and K. Ballard},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2020/08/1-s2.0-S0021992420300940-main.pdf
https://www.sciencedirect.com/science/article/pii/S0021992420300940},
year = {2020},
date = {2020-05-20},
journal = {Journal of Communication Disorders},
keywords = {Health, Speech},
pubstate = {published},
tppubtype = {article}
}
|
Hair, A; Markoulli, C; Monroe, P; McKechnie, J; Ballard, K J; Ahmed, B; Gutierrez-Osuna, R Preliminary Results From a Longitudinal Study of a Tablet-Based Speech Therapy Game Proceedings Article In: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems, ACM, 2020, ISBN: 978-1-4503-6819-3/20/04. @inproceedings{hair2020chi,
title = {Preliminary Results From a Longitudinal Study of a Tablet-Based Speech Therapy Game},
author = {A Hair and C Markoulli and P Monroe and J McKechnie and K J Ballard and B Ahmed and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2020/02/hair2020chi.pdf},
doi = {10.1145/3334480.3382886},
isbn = {978-1-4503-6819-3/20/04},
year = {2020},
date = {2020-04-25},
booktitle = {Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems},
publisher = {ACM},
abstract = {We previously developed a tablet-based speech therapy game called Apraxia World to address barriers to treatment and increase child motivation during therapy. In this study, we examined pronunciation improvements, child engagement over time, and caregiver evaluation performance while using our game. We recruited ten children to play Apraxia World at home during two four-week treatment blocks, separated by a two-week break; nine of ten have completed the protocol at time of writing. In the treatment blocks, children’s utterances were evaluated either by caregivers or an automated pronunciation framework. Preliminary analysis suggests that children made significant therapy gains with Apraxia World, even though caregivers evaluated pronunciation leniently. We also collected a corpus of child speech for offline examination. We will conduct additional analysis once all participants complete the protocol.},
keywords = {Automatic Speech Recognition, Childhood apraxia of speech, Health, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
2019
|
Ding, S.; Zhao, G; Liberatore, C.; Gutierrez-Osuna, R. Learning Structured Sparse Representations for Voice Conversion Journal Article In: IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 28, pp. 343-354, 2019. @article{shaojin-2019-taslp,
title = {Learning Structured Sparse Representations for Voice Conversion},
author = {S. Ding and G Zhao and C. Liberatore and R. Gutierrez-Osuna},
url = {https://ieeexplore.ieee.org/document/8910392
https://psi.engr.tamu.edu/wp-content/uploads/2020/04/shaojin2019taslp.pdf},
doi = {10.1109/TASLP.2019.2955289},
year = {2019},
date = {2019-11-15},
journal = {IEEE/ACM Transactions on Audio, Speech and Language Processing},
volume = {28},
pages = {343-354},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {article}
}
|
Ding, S.; Liberatore, C.; Sonsaat, S.; Lučić, I.; Silpachai, A.; Zhao, G; Chukharev-Hudilainen, E.; Levis, J.; Gutierrez-Osuna, R. Golden speaker builder – An interactive tool for pronunciation training Journal Article In: Speech Communication, vol. 115, pp. 51-66, 2019. @article{shaojin-2019-speechcomm,
title = {Golden speaker builder – An interactive tool for pronunciation training},
author = {S. Ding and C. Liberatore and S. Sonsaat and I. Lučić and A. Silpachai and G Zhao and E. Chukharev-Hudilainen and J. Levis and R. Gutierrez-Osuna},
url = {https://www.sciencedirect.com/science/article/pii/S0167639319302675
https://psi.engr.tamu.edu/wp-content/uploads/2019/11/1-s2.0-S0167639319302675-main.pdf
},
year = {2019},
date = {2019-11-14},
journal = {Speech Communication},
volume = {115},
pages = {51-66},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {article}
}
|
Hair, A; Ballard, K J; Ahmed, B; Gutierrez-Osuna, R Evaluating Automatic Speech Recognition for Child Speech Therapy Applications Proceedings Article In: ACM SIGACCESS Conference on Computers and Accessibility, ACM 2019, ISBN: 978-1-4503-6676-2/19/10. @inproceedings{hair2019evaluating,
title = {Evaluating Automatic Speech Recognition for Child Speech Therapy Applications},
author = {A Hair and K J Ballard and B Ahmed and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2019/08/hair2019evaluating.pdf},
doi = {10.1145/3308561.3354606},
isbn = {978-1-4503-6676-2/19/10},
year = {2019},
date = {2019-10-28},
booktitle = {ACM SIGACCESS Conference on Computers and Accessibility},
organization = {ACM},
abstract = {Automatic speech recognition (ASR) technology can be a useful tool in mobile apps for child speech therapy, empowering children to complete their practice with limited caregiver supervision. However, little is known about the feasibility of performing ASR on mobile devices, particularly when training data is limited. In this study, we investigated the performance of two low-resource ASR systems on disordered speech from children. We compared the open-source PocketSphinx (PS) recognizer using adapted acoustic models and a custom template-matching (TM) recognizer. TM and the adapted models significantly outperform the default PS model. On average, maximum likelihood linear regression and maximum a posteriori adaptation increased PS accuracy from 59.4% to 63.8% and 80.0%, respectively, suggesting that the models successfully captured speaker-specific word production variations. TM reached a mean accuracy of 75.8%.},
keywords = {Automatic Speech Recognition, Childhood apraxia of speech, Health, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
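To illustrate the template-matching (TM) approach evaluated above, here is a minimal sketch that classifies an utterance by dynamic-time-warping (DTW) distance between MFCC sequences and per-word templates. The function names, single-speaker template store, and plain Euclidean frame costs are illustrative assumptions, not the paper's implementation.

import numpy as np
import librosa

def mfcc_of(wav_path, n_mfcc=13):
    # MFCC sequence for one utterance, shape (frames, n_mfcc)
    y, sr = librosa.load(wav_path, sr=16000)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc).T

def dtw_distance(a, b):
    # classic dynamic program over Euclidean frame-to-frame costs
    D = np.full((len(a) + 1, len(b) + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            cost = np.linalg.norm(a[i - 1] - b[j - 1])
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[-1, -1] / (len(a) + len(b))  # length-normalized path cost

def recognize(wav_path, templates):
    # templates: dict mapping each word to a list of template MFCC sequences
    test = mfcc_of(wav_path)
    return min(templates,
               key=lambda w: min(dtw_distance(test, t) for t in templates[w]))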
|
Ding, S.; Gutierrez-Osuna, R. Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion Proceedings Article In: Proc. Interspeech, 2019. @inproceedings{shaojin2019-interspeech,
title = {Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion},
author = {S. Ding and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2019/06/ding2019interspeech.pdf},
doi = {10.21437/Interspeech.2019-1198},
year = {2019},
date = {2019-09-15},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Zhao, G.; Ding, S.; Gutierrez-Osuna, R. Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams Proceedings Article In: Proc. Interspeech, 2019. @inproceedings{guanlong2019-interspeech,
title = {Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams},
author = {G. Zhao and S. Ding and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2019/07/zhao2019interspeech.pdf},
year = {2019},
date = {2019-09-15},
urldate = {2019-09-15},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Deep learning, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Zhao, G; Gutierrez-Osuna, R Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion Journal Article In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 27, no. 10, pp. 1649-1660, 2019, ISSN: 2329-9290. @article{zhao-2019-taslp,
title = {Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion},
author = {G Zhao and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2020/04/zhao2019taslp.pdf},
doi = {10.1109/TASLP.2019.2926754},
issn = {2329-9290},
year = {2019},
date = {2019-07-04},
urldate = {2019-07-04},
journal = {IEEE/ACM Transactions on Audio, Speech, and Language Processing},
volume = {27},
number = {10},
pages = {1649-1660},
abstract = {Accent conversion (AC) aims to transform non-native utterances to sound as if the speaker had a native accent. This can be achieved by mapping source speech spectra from a native speaker into the acoustic space of the target non-native speaker. In prior work, we proposed an AC approach that matches frames between the two speakers based on their acoustic similarity after compensating for differences in vocal tract length. In this paper, we propose a new approach that matches frames between the two speakers based on their phonetic (rather than acoustic) similarity. Namely, we map frames from the two speakers into a phonetic posteriorgram using speaker-independent acoustic models trained on native speech. We thoroughly evaluate the approach on a speech corpus containing multiple native and non-native speakers. The proposed algorithm outperforms the prior approach, improving ratings of acoustic quality (22% increase in mean opinion score) and native accent (69% preference) while retaining the voice quality of the non-native speaker. Furthermore, we show that the approach can be used in the reverse conversion direction, i.e., generating speech with a native speaker's voice quality and a non-native accent. Finally, we show that this approach can be applied to non-parallel training data, achieving the same accent conversion performance.},
keywords = {Accent conversion, Deep learning, Speech},
pubstate = {published},
tppubtype = {article}
}
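The key step in this abstract, pairing frames by phonetic rather than acoustic similarity, can be sketched in a few lines: each native frame is matched to the non-native frame whose phonetic posteriorgram (PPG) is closest. Symmetric KL divergence is one plausible frame distance; the paper's exact distance and any alignment constraints are not assumed here.

import numpy as np

def sym_kl(p, q, eps=1e-10):
    # symmetric KL divergence between two phoneme posterior vectors
    p, q = p + eps, q + eps
    return float(np.sum(p * np.log(p / q)) + np.sum(q * np.log(q / p)))

def pair_frames(native_ppg, nonnative_ppg):
    # PPGs: (frames, n_phones) arrays from a speaker-independent acoustic model
    pairs = []
    for i, p in enumerate(native_ppg):
        j = min(range(len(nonnative_ppg)),
                key=lambda k: sym_kl(p, nonnative_ppg[k]))
        pairs.append((i, j))
    return pairs  # (native, non-native) frame pairs for the spectral mapping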
|
Monteiro, C. D. D.; Shipman III, F. M.; Duggina, S.; Gutierrez-Osuna, R. Tradeoffs in the Efficient Detection of Sign Language Content in Video Sharing Sites Journal Article In: ACM Transactions on Accessible Computing, vol. 12, no. 2, pp. 1-16, 2019. @article{caio2019asl,
title = {Tradeoffs in the Efficient Detection of Sign Language Content in Video Sharing Sites},
author = {C. D. D. Monteiro and F. M. Shipman III and S. Duggina and R. Gutierrez-Osuna},
url = {https://dl.acm.org/doi/10.1145/3325863
https://psi.engr.tamu.edu/wp-content/uploads/2020/04/caio2019asl.pdf},
year = {2019},
date = {2019-06-01},
journal = {ACM Transactions on Accessible Computing},
volume = {12},
number = {2},
pages = {1-16},
keywords = {Computer vision, Speech},
pubstate = {published},
tppubtype = {article}
}
|
2018
|
Ahmed, B; Monroe, P; Hair, A; Tan, C-T; Gutierrez-Osuna, R; Ballard, K J Speech-driven mobile games for speech therapy: User experiences and feasibility Journal Article In: International Journal of Speech-Language Pathology, vol. 20, no. 6, pp. 644-658, 2018. @article{ahmed2018ijslp,
title = {Speech-driven mobile games for speech therapy: User experiences and feasibility},
author = {B Ahmed and P Monroe and A Hair and C-T Tan and R Gutierrez-Osuna and K J Ballard},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2019/06/ahmed-2018-ijslp.pdf
https://doi.org/10.1080/17549507.2018.1513562},
year = {2018},
date = {2018-10-30},
journal = {International Journal of Speech-Language Pathology},
volume = {20},
number = {6},
pages = {644-658},
keywords = {Childhood apraxia of speech, Games, Health, Speech},
pubstate = {published},
tppubtype = {article}
}
|
Levis, J.; Chukharev-Hudilainen, E.; Gutierrez-Osuna, R.; Lucic, I.; Silpachai, A.; Sonsaat, S. Golden Speaker: Learner Experience with Computer-assisted Pronunciation Practice Proceedings Article In: Proc. Pronunciation in Second Language Learning and Teaching Conference, 2018. @inproceedings{levis2018psslt,
title = {Golden Speaker: Learner Experience with Computer-assisted Pronunciation Practice},
author = {J. Levis and E. Chukharev-Hudilainen and R. Gutierrez-Osuna and I. Lucic and A. Silpachai and S. Sonsaat},
year = {2018},
date = {2018-09-06},
booktitle = {Proc. Pronunciation in Second Language Learning and Teaching Conference},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Ding, S.; Zhao, G; Liberatore, C.; Gutierrez-Osuna, R. Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function Proceedings Article In: Proc. Interspeech, 2018. @inproceedings{ding2018interspeech2,
title = {Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function},
author = {S. Ding and G Zhao and C. Liberatore and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/09/ding2018interspeech2.pdf},
doi = {10.21437/Interspeech.2018-1272},
year = {2018},
date = {2018-09-02},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
McKechnie, J.; Ahmed, B.; Gutierrez-Osuna, R.; Monroe, P.; McCabe, P.; Ballard, K. J. Automated speech analysis tools for children’s speech production: A systematic literature review Journal Article In: International Journal of Speech-Language Pathology, vol. 20, no. 6, pp. 583–598, 2018. @article{mcKechnie2018review,
title = {Automated speech analysis tools for children’s speech production: A systematic literature review},
author = {J. McKechnie and B. Ahmed and R. Gutierrez-Osuna and P. Monroe and P. McCabe and K. J. Ballard},
url = {https://doi.org/10.1080/17549507.2018.1477991
https://psi.engr.tamu.edu/wp-content/uploads/2019/06/mckechnie-2018-ijslp.pdf},
year = {2018},
date = {2018-09-02},
journal = {International Journal of Speech-Language Pathology},
volume = {20},
number = {6},
pages = {583–598},
keywords = {Childhood apraxia of speech, Health, Speech},
pubstate = {published},
tppubtype = {article}
}
|
Zhao, G; Sonsaat, S; Silpachai, A; Lucic, I; Chukharev-Hudilainen, E; Levis, J; Gutierrez-Osuna, R L2-ARCTIC: A Non-Native English Speech Corpus Proceedings Article In: Proc. Interspeech, 2018. @inproceedings{zhao2018interspeech,
title = {L2-ARCTIC: A Non-Native English Speech Corpus},
author = {G Zhao and S Sonsaat and A Silpachai and I Lucic and E Chukharev-Hudilainen and J Levis and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/08/zhao2018interspeech.pdf
https://psi.engr.tamu.edu/l2-arctic-corpus/},
year = {2018},
date = {2018-09-02},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Ding, S.; Liberatore, C.; Gutierrez-Osuna, R. Learning Structured Dictionaries for Exemplar-based Voice Conversion Proceedings Article In: Proc. Interspeech, 2018. @inproceedings{ding2018interspeech1,
title = {Learning Structured Dictionaries for Exemplar-based Voice Conversion},
author = {S. Ding and C. Liberatore and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/09/ding2018interspeech1.pdf},
doi = {10.21437/Interspeech.2018-1295},
year = {2018},
date = {2018-09-02},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Hair, A; Monroe, P; Ahmed, B; Ballard, K J; Gutierrez-Osuna, R Apraxia World: A Speech Therapy Game for Children with Speech Sound Disorders Proceedings Article In: Proceedings of the 2018 Conference on Interaction Design and Children, ACM, 2018, ISBN: 978-1-4503-5152-2/18/06. @inproceedings{hair2018idc,
title = {Apraxia World: A Speech Therapy Game for Children with Speech Sound Disorders},
author = {A Hair and P Monroe and B Ahmed and K J Ballard and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/04/hair2018idc.pdf},
doi = {10.1145/3202185.3202733},
isbn = {978-1-4503-5152-2/18/06},
year = {2018},
date = {2018-06-19},
booktitle = {Proceedings of the 2018 Conference on Interaction Design and Children},
publisher = {ACM},
abstract = {This paper presents Apraxia World, a remote therapy tool for speech sound disorders that integrates speech exercises into an engaging platformer-style game. In Apraxia World, the player controls the avatar with virtual buttons/joystick, whereas speech input is associated with assets needed to advance from one level to the next. We tested performance and child preference of two strategies for delivering speech exercises: during each level, and after it. Most children indicated that doing exercises after completing each level was less disruptive and preferable to doing exercises scattered through the level. We also found that children liked having perceived control over the game (character appearance, exercise behavior). Our results indicate that (i) a familiar style of game successfully engages children, (ii) speech exercises function well when decoupled from game control, and (iii) children are willing to complete required speech exercises while playing a game they enjoy.},
keywords = {Childhood apraxia of speech, Health, Mobile computing, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Liberatore, C; Zhao, G; Gutierrez-Osuna, R Voice Conversion through Residual Warping in a Sparse, Anchor-Based Representation of Speech Proceedings Article In: Proc. ICASSP, 2018. @inproceedings{liberatore2018icassp,
title = {Voice Conversion through Residual Warping in a Sparse, Anchor-Based Representation of Speech},
author = {C Liberatore and G Zhao and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/03/liberatore-icassp2018.pdf},
year = {2018},
date = {2018-04-15},
booktitle = {Proc. ICASSP},
abstract = {In previous work we presented a Sparse, Anchor-Based Representation of speech (SABR) that uses phonemic “anchors” to represent an utterance with a set of sparse non-negative weights. SABR is speaker-independent: combining weights from a source speaker with anchors from a target speaker can be used for voice conversion. Here, we present an extension of the original SABR that significantly improves voice conversion synthesis. Namely, we take the residual signal from the SABR decomposition of the source speaker’s utterance, and warp it to the target speaker’s space using a weighted warping function learned from pairs of source-target anchors. Using subjective and objective evaluations, we examine the performance of adding the warped residual (SABR+Res) to the original synthesis (SABR). Specifically, listeners rated SABR+Res with an average mean opinion score (MOS) of 3.6, a significant improvement compared to 2.2 MOS for SABR alone (p < 0.01) and 2.5 MOS for a baseline GMM method (p < 0.01). In an XAB speaker identity test, listeners correctly identified the identity of SABR+Res (81%) and SABR (84%) as frequently as a GMM method (82%) (p = 0.70, p = 0.35). These results indicate that adding the warped residual can dramatically improve synthesis while retaining the desirable independent qualities of SABR models.},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
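To make the SABR decomposition concrete: each spectral frame is approximated as a non-negative combination of per-phoneme anchor vectors, and conversion reuses the source speaker's weights with the target speaker's anchors. Below is a minimal sketch using a plain non-negative least-squares solve; the paper's actual objective adds sparsity constraints, and anchor construction is assumed given.

import numpy as np
from scipy.optimize import nnls

def sabr_weights(frame, anchors):
    # frame: (d,) spectral vector; anchors: (d, K) matrix whose columns are
    # per-phoneme anchor spectra for one speaker; returns non-negative weights
    w, _ = nnls(anchors, frame)
    return w

def convert_frame(src_frame, src_anchors, tgt_anchors):
    w = sabr_weights(src_frame, src_anchors)   # speaker-independent weights
    converted = tgt_anchors @ w                # re-synthesis in target space
    residual = src_frame - src_anchors @ w     # SABR+Res warps this residual
    return converted, residual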
|
Zhao, G; Sonsaat, S; Levis, J; Chukharev-Hudilainen, E; Gutierrez-Osuna, R Accent conversion using phonetic posteriorgrams Proceedings Article In: Proc. ICASSP, 2018. @inproceedings{zhao2018icassp,
title = {Accent conversion using phonetic posteriorgrams},
author = {G Zhao and S Sonsaat and J Levis and E Chukharev-Hudilainen and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/03/zhao2018icassp.pdf
http://people.tamu.edu/~guanlong.zhao/icassp18_demo.html
https://psi.engr.tamu.edu/l2-arctic-corpus/
https://github.com/guanlongzhao/ppg-gmm},
year = {2018},
date = {2018-04-15},
urldate = {2018-04-15},
booktitle = {Proc. ICASSP},
keywords = {Accent conversion, Deep learning, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Angello, G.; Zhao, G.; Manam, A. B.; Gutierrez-Osuna, R. Training Behavior of Successful Tacton-Phoneme Learners Proceedings Article In: IEEE Haptics Symposium, 2018. @inproceedings{genna2018tactons,
title = {Training Behavior of Successful Tacton-Phoneme Learners},
author = {G. Angello and G. Zhao and A. B. Manam and R. Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2022/01/AngelloHaptics2018-2.pdf},
year = {2018},
date = {2018-03-25},
booktitle = {IEEE Haptics Symposium},
keywords = {Other, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
2017
|
Shipman, F; Duggina, S; Monteiro, C; Gutierrez-Osuna, R Speed-Accuracy Tradeoffs for Detecting Sign Language Content in Video Sharing Sites Proceedings Article In: Proceedings of ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2017), pp. 185-189, 2017. @inproceedings{shipman2017assets,
title = {Speed-Accuracy Tradeoffs for Detecting Sign Language Content in Video Sharing Sites},
author = {F Shipman and S Duggina and C Monteiro and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/shipman2017assets.pdf},
year = {2017},
date = {2017-11-21},
booktitle = {Proceedings of ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2017)},
pages = {185-189},
keywords = {Computer vision, Gestures, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Zhao, G; Gutierrez-Osuna, R Exemplar selection methods in voice conversion Proceedings Article In: Proc. 42nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5525-5529, 2017. @inproceedings{zhao-2017-icassp,
title = {Exemplar selection methods in voice conversion},
author = {G Zhao and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/zhao2017icassp.pdf
http://people.tamu.edu/~guanlong.zhao/spring17/icassp17},
doi = {10.1109/ICASSP.2017.7953213},
year = {2017},
date = {2017-03-05},
booktitle = {Proc. 42nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
pages = {5525-5529},
keywords = {Speech, Voice conversion},
pubstate = {published},
tppubtype = {inproceedings}
}
|
2016
|
Aryal, S; Gutierrez-Osuna, R Comparing Articulatory and Acoustic Strategies for Reducing Non-Native Accents Proceedings Article In: Proc. Interspeech, 2016. @inproceedings{aryal-2016-interspeech,
title = {Comparing Articulatory and Acoustic Strategies for Reducing Non-Native Accents},
author = {S Aryal and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/aryal2016interspeech.pdf},
year = {2016},
date = {2016-09-08},
booktitle = {Proc. Interspeech},
keywords = {Accent conversion, Articulatory synthesis, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Liberatore, C; Gutierrez-Osuna, R Generating Gestural Scores from Acoustics Through a Sparse Anchor-Based Representation of Speech Proceedings Article In: Proc. Interspeech, 2016. @inproceedings{liberatore2016interspeech,
title = {Generating Gestural Scores from Acoustics Through a Sparse Anchor-Based Representation of Speech},
author = {C Liberatore and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/liberatore2016interspeech.pdf},
year = {2016},
date = {2016-09-08},
booktitle = {Proc. Interspeech},
keywords = {Articulatory synthesis, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Shahin, M; Gutierrez-Osuna, R; Ahmed, B Classification of bisyllabic lexical stress patterns in disordered speech using deep learning Proceedings Article In: Proc. International Conference on Acoustics, Speech, and Signal Processing, 2016. @inproceedings{shahin2016icassp,
title = {Classification of bisyllabic lexical stress patterns in disordered speech using deep learning},
author = {M Shahin and R Gutierrez-Osuna and B Ahmed},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/shahin2016icassp.pdf},
year = {2016},
date = {2016-03-20},
booktitle = {Proc. International Conference on Acoustics, Speech, and Signal Processing},
keywords = {Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
McKechnie, J; Ballard, K J; McCabe, P; Murray, E; Lan, T; Gutierrez-Osuna, R; Ahmed, B Influence of type of feedback on effect of tablet-based delivery of intensive speech therapy in children with Childhood Apraxia of Speech Proceedings Article In: Proceedings of the Motor Speech Conference, 2016. @inproceedings{mckechnie-2016-motorspeech,
title = {Influence of type of feedback on effect of tablet-based delivery of intensive speech therapy in children with Childhood Apraxia of Speech},
author = {J McKechnie and K J Ballard and P McCabe and E Murray and T Lan and R Gutierrez-Osuna and B Ahmed},
year = {2016},
date = {2016-03-03},
booktitle = {Proceedings of the Motor Speech Conference},
journal = {Motor Speech Conference},
keywords = {Childhood apraxia of speech, Games, Health, Mobile computing, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Aryal, S; Gutierrez-Osuna, R Data driven articulatory synthesis with deep neural networks Journal Article In: Computer Speech and Language, vol. 36, pp. 260-273, 2016. @article{aryal-2015-cls,
title = {Data driven articulatory synthesis with deep neural networks},
author = {S Aryal and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/aryal2016csl.pdf},
year = {2016},
date = {2016-03-01},
urldate = {2016-03-01},
journal = {Computer Speech and Language},
volume = {36},
pages = {260-273},
keywords = {Accent conversion, Articulatory synthesis, Deep learning, Speech},
pubstate = {published},
tppubtype = {article}
}
|
2015
|
Parnandi, A; Karappa, V; Lan, T; Shahin, M; McKechnie, J; Ballard, K; Ahmed, B; Gutierrez-Osuna, R Development of a remote therapy tool for childhood apraxia of speech Journal Article In: ACM Transactions on Accessible Computing, vol. 7, no. 3, pp. 10:1-10:23, 2015. @article{parnandi2015taccess,
title = {Development of a remote therapy tool for childhood apraxia of speech},
author = {A Parnandi and V Karappa and T Lan and M Shahin and J McKechnie and K Ballard and B Ahmed and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/parnandi2015taccess.pdf},
year = {2015},
date = {2015-11-01},
journal = {ACM Transactions on Accessible Computing},
volume = {7},
number = {3},
pages = {10:1-10:23},
keywords = {Childhood apraxia of speech, Games, Health, Mobile computing, Speech},
pubstate = {published},
tppubtype = {article}
}
|
Aryal, S; Gutierrez-Osuna, R Articulatory-based conversion of foreign accents with deep neural networks Proceedings Article In: Proc. Interspeech, pp. 3385-3389, 2015. @inproceedings{aryal2015interspeech,
title = {Articulatory-based conversion of foreign accents with deep neural networks},
author = {S Aryal and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/aryal2015interspeech.pdf},
year = {2015},
date = {2015-09-06},
urldate = {2015-09-06},
booktitle = {Proc. Interspeech},
pages = {3385-3389},
keywords = {Accent conversion, Articulatory synthesis, Deep learning, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Shahin, M; Ahmed, B; Parnandi, A; Karappa, V; McKechnie, J; Ballard, K; Gutierrez-Osuna, R Tabby Talks: an automated tool for the assessment of childhood apraxia of speech Journal Article In: Speech Communication, vol. in press, 2015. @article{shahin2015specom,
title = {Tabby Talks: an automated tool for the assessment of childhood apraxia of speech},
author = {M Shahin and B Ahmed and A Parnandi and V Karappa and J McKechnie and K Ballard and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/shahin2015specom.pdf},
year = {2015},
date = {2015-04-02},
urldate = {2015-04-02},
journal = {Speech Communication},
volume = {in press},
keywords = {Childhood apraxia of speech, Games, Health, Mobile computing, Speech},
pubstate = {published},
tppubtype = {article}
}
|
Aryal, S; Gutierrez-Osuna, R Reduction of non-native accents through statistical parametric articulatory synthesis Journal Article In: Journal of the Acoustical Society of America, vol. 137, no. 1, pp. 433-446, 2015. @article{aryal2015jasa,
title = {Reduction of non-native accents through statistical parametric articulatory synthesis},
author = {S Aryal and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/aryal2015jasa.pdf},
year = {2015},
date = {2015-01-23},
journal = {Journal of the Acoustical Society of America},
volume = {137},
number = {1},
pages = {433-446},
keywords = {Accent conversion, Articulatory synthesis, Speech},
pubstate = {published},
tppubtype = {article}
}
|
2014
|
Lan, T; Aryal, S; Ahmed, B; Ballard, K; Gutierrez-Osuna, R Flappy Voice: An Interactive Game for Childhood Apraxia of Speech Therapy Proceedings Article In: Proc. CHI-PLAY, 2014. @inproceedings{lan2014chiplay,
title = {Flappy Voice: An Interactive Game for Childhood Apraxia of Speech Therapy},
author = {T Lan and S Aryal and B Ahmed and K Ballard and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/lan2014chiplay.pdf},
year = {2014},
date = {2014-10-19},
booktitle = {Proc. CHI-PLAY},
keywords = {Childhood apraxia of speech, Games, Health, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Shahin, M; Ahmed, B; McKechnie, J; Ballard, K; Gutierrez-Osuna, R A comparison of GMM-HMM and DNN-HMM based pronunciation verification techniques for use in the assessment of childhood apraxia of speech Proceedings Article In: Proc. Interspeech, 2014. @inproceedings{mostafa2014interspeech,
title = {A comparison of GMM-HMM and DNN-HMM based pronunciation verification techniques for use in the assessment of childhood apraxia of speech},
author = {M Shahin and B Ahmed and J McKechnie and K Ballard and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/mostafa2014interspeech.pdf},
year = {2014},
date = {2014-09-14},
booktitle = {Proc. Interspeech},
keywords = {Childhood apraxia of speech, Health, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
McKechnie, J; Ballard, K; McCabe, P; Gutierrez-Osuna, R; Karappa, V; Parnandi, A; Shahin, M; Murray, E; Ahmed, B Tablet-based delivery of intensive speech therapy in children with Childhood Apraxia of Speech - Pilot Phase Proceedings Article In: Speech Pathology Australia National Conference, 2014. @inproceedings{jacqui2013australiaSLPconference,
title = {Tablet-based delivery of intensive speech therapy in children with Childhood Apraxia of Speech - Pilot Phase},
author = {J McKechnie and K Ballard and P McCabe and R Gutierrez-Osuna and V Karappa and A Parnandi and M Shahin and E Murray and B Ahmed},
year = {2014},
date = {2014-05-14},
urldate = {2014-05-14},
booktitle = {Speech Pathology Australia National Conference},
keywords = {Games, Health, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Felps, D; Aryal, S; Gutierrez-Osuna, R Normalization of articulatory data through Procrustes transformations and analysis-by-synthesis Proceedings Article In: Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 3051-3055, 2014. @inproceedings{danielprocrustes2014icassp,
title = {Normalization of articulatory data through Procrustes transformations and analysis-by-synthesis},
author = {D Felps and S Aryal and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/danielprocrustes2014icassp.pdf},
year = {2014},
date = {2014-05-09},
booktitle = {Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
pages = {3051-3055},
keywords = {Accent conversion, Articulatory synthesis, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Aryal, S; Gutierrez-Osuna, R Can voice conversion be used to reduce non-native accents Proceedings Article In: Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7929-7933, 2014. @inproceedings{sandeshaccentconversion2014icassp,
title = {Can voice conversion be used to reduce non-native accents},
author = {S Aryal and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/sandeshaccentconversion2014icassp.pdf},
year = {2014},
date = {2014-05-09},
booktitle = {Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
pages = {7929-7933},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Aryal, S; Gutierrez-Osuna, R Accent conversion through cross-speaker articulatory synthesis Proceedings Article In: Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7744-7748, 2014. @inproceedings{sandesh2014icassp,
title = {Accent conversion through cross-speaker articulatory synthesis},
author = {S Aryal and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/sandesh2014icassp.pdf},
year = {2014},
date = {2014-05-09},
booktitle = {Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
pages = {7744-7748},
keywords = {Accent conversion, Articulatory synthesis, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
2013
|
Parnandi, A; Karappa, V; Son, Y; Shahin, M; McKechnie, J; Ballard, K; Ahmed, B; Gutierrez-Osuna, R Architecture of an automated therapy tool for childhood apraxia of speech Conference The 15th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2013. @conference{avinashassets2013,
title = {Architecture of an automated therapy tool for childhood apraxia of speech},
author = {A Parnandi and V Karappa and Y Son and M Shahin and J McKechnie and K Ballard and B Ahmed and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/avinashassets2013.pdf},
year = {2013},
date = {2013-10-21},
urldate = {2013-10-21},
booktitle = {The 15th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)},
keywords = {Childhood apraxia of speech, Games, Health, Mobile computing, Speech},
pubstate = {published},
tppubtype = {conference}
}
|
Aryal, S; Felps, D; Gutierrez-Osuna, R Foreign Accent Conversion through Voice Morphing Proceedings Article In: Interspeech, pp. 3077-3081, 2013. @inproceedings{aryal2013interspeech,
title = {Foreign Accent Conversion through Voice Morphing},
author = {S Aryal and D Felps and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/aryal2013interspeech.pdf},
year = {2013},
date = {2013-08-25},
booktitle = {Interspeech},
pages = {3077-3081},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
Aryal, S; Gutierrez-Osuna, R Articulatory inversion and synthesis: towards articulatory-based modification of speech Proceedings Article In: 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7952-7956, 2013. @inproceedings{aryal2013icassp,
title = {Articulatory inversion and synthesis: towards articulatory-based modification of speech},
author = {S Aryal and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/aryal2013icassp.pdf},
year = {2013},
date = {2013-02-28},
booktitle = {38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
pages = {7952-7956},
keywords = {Articulatory inversion, Articulatory synthesis, Speech},
pubstate = {published},
tppubtype = {inproceedings}
}
|
2012
|
Aryal, S; Gutierrez-Osuna, R Articulatory Inversion and Synthesis: Towards Articulatory-Based Modification of Speech Technical Report 2012. @techreport{aryal2012techreport,
title = {Articulatory Inversion and Synthesis: Towards Articulatory-Based Modification of Speech},
author = {S Aryal and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/aryal2012techreport.pdf},
year = {2012},
date = {2012-12-04},
abstract = {Certain speech modifications, such as changes in foreign/regional accents or articulatory styles, are performed more effectively in the articulatory domain than in the acoustic domain. Though measuring articulators is cumbersome, articulatory parameters may be estimated from acoustics through inversion. In this paper, we study the impact on synthesis quality when articulators predicted from acoustics are used in articulatory synthesis. For this purpose, we trained a GMM articulatory synthesizer and drove it with articulators predicted with an RBF-based inversion model. Using inverted instead of measured articulators degraded synthesis quality, as measured through Mel cepstral distortion and subjective tests. However, retraining the synthesizer with predicted articulators not only reversed the effect of errors introduced during inversion but also improved synthesis quality relative to using measured articulators. These results suggest that inverted articulators do not compromise synthesis quality, and open up the possibility of performing speech modification in the articulatory domain through inversion.},
keywords = {Articulatory inversion, Articulatory synthesis, Speech},
pubstate = {published},
tppubtype = {techreport}
}
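A compact sketch of the inversion-then-synthesis loop the report studies: a regressor predicts articulatory trajectories from acoustics, and those predictions (rather than measured EMA) are used to retrain the synthesizer. Kernel ridge regression with an RBF kernel stands in for the report's RBF network, and the data below are synthetic stand-ins.

import numpy as np
from sklearn.kernel_ridge import KernelRidge

rng = np.random.default_rng(0)
X_ac = rng.normal(size=(500, 13))                  # stand-in acoustic frames
Y_art = np.tanh(X_ac @ rng.normal(size=(13, 6)))   # stand-in EMA trajectories

# acoustic-to-articulatory inversion
inversion = KernelRidge(kernel="rbf", alpha=1e-2).fit(X_ac, Y_art)
Y_pred = inversion.predict(X_ac)

# the report's key move: retrain the (GMM) articulatory synthesizer on the
# *predicted* articulators so training matches what it will see at run time
# synthesizer.fit(Y_pred, X_ac)   # synthesizer itself omitted from this sketch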
|
Parnandi, A; Son, Y; Shahin, M; Ahmed, B; Gutierrez-Osuna, R Architecture of an Automated Therapy Tool for Childhood Apraxia of Speech Technical Report 2012. @techreport{parnandi2012techreport-2,
title = {Architecture of an Automated Therapy Tool for Childhood Apraxia of Speech},
author = {A Parnandi and Y Son and M Shahin and B Ahmed and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/parnandi2012techreport-2.pdf},
year = {2012},
date = {2012-08-21},
urldate = {2012-08-21},
abstract = {We present a multi-tier architecture for automating the administration of speech therapy to children suffering from apraxia of speech. This architecture follows a client-server model and facilitates task-oriented remote therapeutic training in home settings. The therapy regimen is remotely assigned to the child by a speech therapist based on a standardized protocol. We utilize tablet PCs to provide stimuli to the children and record their speech response. The speech data is then streamed to a back-end server running a specialized speech-processing module to identify errors and quantify the progress of the child. These automated results allow the therapist to closely monitor the performance of each child, provide relevant feedback, and adapt the training program as needed. Our proposed architecture can accommodate a variety of interaction modalities that can serve as a complement to traditional face-to-face speech practice. In this paper we describe the client-server architecture, the middleware tools upon which the system has been built, and the speech-processing tools for automatically scoring the patients’ speech.},
keywords = {Games, Health, Mobile computing, Speech},
pubstate = {published},
tppubtype = {techreport}
}
We present a multi-tier architecture for automating the administration of speech therapy to children suffering from apraxia of speech. This architecture follows a client-server model and facilitates task-oriented remote therapeutic training in home settings. The therapy regimen is remotely assigned to the child by a speech therapist based on a standardized protocol. We utilize tablet PCs to provide stimuli to the children and record their speech response. The speech data is then streamed to a back-end server running a specialized speech-processing module to identify errors and quantify the progress of the child. These automated results allow the therapist to closely monitor the performance of each child, provide relevant feedback, and adapt the training program as needed. Our proposed architecture can accommodate a variety of interaction modalities that can serve as a complement to traditional face-to-face speech practice. In this paper we describe the client-server architecture, the middleware tools upon which the system has been built, and the speech-processing tools for automatically scoring the patients’ speech. |
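The client-server pattern described above can be sketched in a few lines; the snippet below is illustrative only, and the host, port, and length-prefixed framing are assumptions rather than details of the deployed system.

# Sketch of the tablet-to-server streaming path (not the project's code).
import socket
import struct

HOST, PORT = "127.0.0.1", 9000  # hypothetical back-end scoring server

def send_response(wav_bytes: bytes) -> None:
    """Client side: stream one recorded speech response to the server."""
    with socket.create_connection((HOST, PORT)) as sock:
        sock.sendall(struct.pack("!I", len(wav_bytes)))  # length prefix
        sock.sendall(wav_bytes)

def serve_once() -> bytes:
    """Server side: receive one length-prefixed response, then hand the
    audio to the speech-processing module for scoring."""
    with socket.create_server((HOST, PORT)) as srv:
        conn, _ = srv.accept()
        with conn:
            (n,) = struct.unpack("!I", conn.recv(4))
            chunks, received = [], 0
            while received < n:
                chunk = conn.recv(min(65536, n - received))
                if not chunk:
                    break
                chunks.append(chunk)
                received += len(chunk)
    return b"".join(chunks)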
Felps, D; Geng, C; Gutierrez-Osuna, R Foreign accent conversion through concatenative synthesis in the articulatory domain Journal Article In: IEEE Transactions on Audio, Speech and Language Processing, 2012. @article{felps2012taslp,
title = {Foreign accent conversion through concatenative synthesis in the articulatory domain},
author = {D Felps and C Geng and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/felps2012taslp.pdf},
year = {2012},
date = {2012-01-01},
journal = {IEEE Transactions on Audio, Speech and Language Processing},
abstract = {We propose a concatenative synthesis approach to the problem of foreign accent conversion. The approach consists of replacing the most accented portions of nonnative speech with alternative segments from a corpus of the speaker’s own speech based on their similarity to those from a reference native speaker. We propose and compare two approaches for selecting units, one based on acoustic similarity [e.g., mel frequency cepstral coefficients (MFCCs)] and a second one based on articulatory similarity, as measured through electromagnetic articulography (EMA). Our hypothesis is that articulatory features provide a better metric for linguistic similarity across speakers than acoustic features. To test this hypothesis, we recorded an articulatory-acoustic corpus from a native and a nonnative speaker, and evaluated the two speech representations (acoustic versus articulatory) through a series of perceptual experiments. Formal listening tests indicate that the approach can achieve a 20% reduction in perceived accent, but also reveal a strong coupling between accent and speaker identity. To address this issue, we disguised original and resynthesized utterances by altering their average pitch and normalizing vocal tract length. An additional listening experiment supports the hypothesis that articulatory features are less speaker dependent than acoustic features.},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {article}
}
We propose a concatenative synthesis approach to the problem of foreign accent conversion. The approach consists of replacing the most accented portions of nonnative speech with alternative segments from a corpus of the speaker’s own speech based on their similarity to those from a reference native speaker. We propose and compare two approaches for selecting units, one based on acoustic similarity [e.g., mel frequency cepstral coefficients (MFCCs)] and a second one based on articulatory similarity, as measured through electromagnetic articulography (EMA). Our hypothesis is that articulatory features provide a better metric for linguistic similarity across speakers than acoustic features. To test this hypothesis, we recorded an articulatory-acoustic corpus from a native and a nonnative speaker, and evaluated the two speech representations (acoustic versus articulatory) through a series of perceptual experiments. Formal listening tests indicate that the approach can achieve a 20% reduction in perceived accent, but also reveal a strong coupling between accent and speaker identity. To address this issue, we disguised original and resynthesized utterances by altering their average pitch and normalizing vocal tract length. An additional listening experiment supports the hypothesis that articulatory features are less speaker dependent than acoustic features. |
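The unit-selection step at the core of this approach can be illustrated with a small sketch: for each frame of the native reference, pick the closest unit from the learner's own corpus, using either acoustic or articulatory features. This is a simplification (segmentation, join costs, and smoothing are omitted), and all names and data are hypothetical.

# Sketch of similarity-based unit selection. The same function serves
# both representations compared in the paper: pass MFCC matrices for
# acoustic similarity or EMA matrices for articulatory similarity.
import numpy as np

def select_units(native_targets, candidate_units):
    """Return, for each native target frame, the index of the closest
    unit in the learner's own corpus (Euclidean distance)."""
    dists = np.linalg.norm(
        native_targets[:, None, :] - candidate_units[None, :, :], axis=-1)
    return dists.argmin(axis=1)

rng = np.random.default_rng(1)
native_mfcc = rng.standard_normal((20, 13))     # hypothetical native frames
learner_units = rng.standard_normal((200, 13))  # learner's own units
print(select_units(native_mfcc, learner_units)[:5])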
2010
|
Felps, D; Gutierrez-Osuna, R Normalization of Articulatory Data through Procrustes Transformations and Analysis-by-synthesis Technical Report 2010. @techreport{felps2010techreport,
title = {Normalization of Articulatory Data through Procrustes Transformations and Analysis-by-synthesis},
author = {D Felps and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/felps2010techreport.pdf},
year = {2010},
date = {2010-05-05},
abstract = {We describe and compare three methods that can be used to normalize articulatory data across speakers. The methods seek to explain systematic anatomical differences between a source and target speaker without modifying the articulatory velocities of the source speaker. The first method is the classical Procrustes transform, which allows for a global translation, rotation, and scaling of articulator positions. An extension to the Procrustes transform is presented that allows independent translations of each articulator. The additional parameters provide a 35% increase in articulatory similarity between two speakers when compared to classical Procrustes. The proposed extension is also coupled with a data-driven articulatory synthesizer to select model parameters that best explain the predicted acoustic (rather than articulatory) differences.},
keywords = {Articulatory synthesis, Speech},
pubstate = {published},
tppubtype = {techreport}
}
We describe and compare three methods that can be used to normalize articulatory data across speakers. The methods seek to explain systematic anatomical differences between a source and target speaker without modifying the articulatory velocities of the source speaker. The first method is the classical Procrustes transform, which allows for a global translation, rotation, and scaling of articulator positions. An extension to the Procrustes transform is presented that allows independent translations of each articulator. The additional parameters provide a 35% increase in articulatory similarity between two speakers when compared to classical Procrustes. The proposed extension is also coupled with a data-driven articulatory synthesizer to select model parameters that best explain the predicted acoustic (rather than articulatory) differences. |
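Both normalizations compared above are easy to prototype. Below is a minimal sketch using SciPy's classical Procrustes routine, with the per-articulator-translation extension approximated by matching each articulator's centroid before the global fit; the data, shapes, and offsets are synthetic assumptions.

# Sketch: classical Procrustes vs. adding independent per-articulator
# translations (an approximation of the extension described above).
import numpy as np
from scipy.spatial import procrustes

rng = np.random.default_rng(2)
source = rng.standard_normal((100, 8, 2))   # frames x pellets x 2-D
R = np.array([[0.9, -0.2], [0.2, 0.9]])     # global rotation/scale
offsets = rng.standard_normal((1, 8, 2))    # anatomical per-pellet shifts
target = source @ R + offsets

def classical(src, tgt):
    _, _, disparity = procrustes(tgt.reshape(-1, 2), src.reshape(-1, 2))
    return disparity

def with_articulator_offsets(src, tgt):
    # Match each articulator's centroid, then do the global fit.
    src0 = src - src.mean(axis=0, keepdims=True) + tgt.mean(axis=0, keepdims=True)
    return classical(src0, tgt)

print("classical Procrustes disparity:", classical(source, target))
print("per-articulator translations:  ", with_articulator_offsets(source, target))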
Gutierrez-Osuna, R; Felps, D Foreign Accent Conversion through Voice Morphing Technical Report 2010. @techreport{gutierrez2010techreport,
title = {Foreign Accent Conversion through Voice Morphing},
author = {R Gutierrez-Osuna and D Felps},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/gutierrez2010techreport.pdf},
year = {2010},
date = {2010-05-05},
abstract = {We present a voice morphing strategy that can be used to generate a continuum of accent transformations between a foreign speaker and a native speaker. The approach performs a cepstral decomposition of speech into spectral slope and spectral detail. Accent conversions are then generated by combining the spectral slope of the foreign speaker with a morph of the spectral detail of the native speaker. Spectral morphing is achieved by representing the spectral detail through pulse density modulation and averaging pulses in a pair-wise fashion. The technique is evaluated on parallel recordings from two ARCTIC speakers using objective measures of acoustic quality, speaker identity and foreign accent that have recently been shown to correlate with perceptual results from listening tests.},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {techreport}
}
We present a voice morphing strategy that can be used to generate a continuum of accent transformations between a foreign speaker and a native speaker. The approach performs a cepstral decomposition of speech into spectral slope and spectral detail. Accent conversions are then generated by combining the spectral slope of the foreign speaker with a morph of the spectral detail of the native speaker. Spectral morphing is achieved by representing the spectral detail through pulse density modulation and averaging pulses in a pair-wise fashion. The technique is evaluated on parallel recordings from two ARCTIC speakers using objective measures of acoustic quality, speaker identity and foreign accent that have recently been shown to correlate with perceptual results from listening tests. |
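The slope/detail split lends itself to a compact sketch. In the snippet below, the first few cepstral coefficients stand in for spectral slope and the remainder for spectral detail; the paper's pulse-density-modulation morph is replaced by simple linear interpolation, and the cepstral order and data are assumptions.

# Sketch of the accent continuum: foreign slope + morphed native detail.
import numpy as np

N_SLOPE = 2  # low-order cepstral coefficients ~ spectral slope (assumed)

def accent_morph(cep_foreign, cep_native, alpha):
    """alpha in [0, 1] moves the spectral detail from fully foreign (0)
    to fully native (1) while keeping the foreign speaker's slope."""
    out = cep_foreign.copy()
    out[:, N_SLOPE:] = ((1 - alpha) * cep_foreign[:, N_SLOPE:]
                        + alpha * cep_native[:, N_SLOPE:])
    return out

rng = np.random.default_rng(3)
foreign = rng.standard_normal((50, 20))  # hypothetical cepstral frames
native = rng.standard_normal((50, 20))   # time-aligned native frames
halfway = accent_morph(foreign, native, alpha=0.5)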
Felps, D; Geng, C; Berger, M; Richmond, K; Gutierrez-Osuna, R Relying on critical articulators to estimate vocal tract spectra in an articulatory-acoustic database Conference Interspeech, 2010. @conference{felps2010interspeech,
title = {Relying on critical articulators to estimate vocal tract spectra in an articulatory-acoustic database},
author = {D Felps and C Geng and M Berger and K Richmond and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/felps2010interspeech.pdf},
year = {2010},
date = {2010-01-01},
booktitle = {Interspeech},
abstract = {We present a new phone-dependent feature weighting scheme that can be used to map articulatory configurations (e.g. EMA) onto vocal tract spectra (e.g. MFCC) through table lookup. The approach consists of assigning feature weights according to a feature's ability to predict the acoustic distance between frames. Since an articulator's predictive accuracy is phone-dependent (e.g., lip location is a better predictor for bilabial sounds than for palatal sounds), a unique weight vector is found for each phone. Inspection of the weights reveals a correspondence with the expected critical articulators for many phones. The proposed method reduces overall cepstral error by 6% when compared to a uniform weighting scheme. Vowels show the greatest benefit, though improvements occur for 80% of the tested phones.},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {conference}
}
We present a new phone-dependent feature weighting scheme that can be used to map articulatory configurations (e.g. EMA) onto vocal tract spectra (e.g. MFCC) through table lookup. The approach consists of assigning feature weights according to a feature's ability to predict the acoustic distance between frames. Since an articulator's predictive accuracy is phone-dependent (e.g., lip location is a better predictor for bilabial sounds than for palatal sounds), a unique weight vector is found for each phone. Inspection of the weights reveals a correspondence with the expected critical articulators for many phones. The proposed method reduces overall cepstral error by 6% when compared to a uniform weighting scheme. Vowels show the greatest benefit, though improvements occur for 80% of the tested phones. |
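The weighted table lookup can be sketched as follows. The weight-learning step shown (correlating each feature's frame-pair distance with the acoustic distance) is a simplified stand-in for the paper's scheme, and all shapes and names are assumptions; in the paper, a separate weight vector would be learned per phone.

# Sketch of weighted lookup from EMA configurations to MFCC frames.
import numpy as np

def lookup(query_ema, table_ema, table_mfcc, weights):
    """Return the MFCC frame whose EMA entry is closest to the query
    under a weighted Euclidean distance."""
    d = (((table_ema - query_ema) ** 2) * weights).sum(axis=1)
    return table_mfcc[d.argmin()]

def learn_weights(ema, mfcc, n_pairs=2000, seed=0):
    """Weight each EMA dimension by how well its frame-pair distance
    predicts acoustic distance (a correlation-based simplification)."""
    rng = np.random.default_rng(seed)
    i = rng.integers(0, len(ema), n_pairs)
    j = rng.integers(0, len(ema), n_pairs)
    d_feat = (ema[i] - ema[j]) ** 2                     # per dimension
    d_acou = np.linalg.norm(mfcc[i] - mfcc[j], axis=1)  # scalar
    corr = np.array([np.corrcoef(d_feat[:, k], d_acou)[0, 1]
                     for k in range(ema.shape[1])])
    w = np.clip(corr, 0.0, None)
    return w / w.sum()

rng = np.random.default_rng(0)
ema = rng.standard_normal((500, 12))   # hypothetical articulatory table
mfcc = rng.standard_normal((500, 13))  # paired acoustic table
w = learn_weights(ema, mfcc)
nearest = lookup(ema[0], ema, mfcc, w)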
Felps, D; Gutierrez-Osuna, R Developing objective measures of foreign-accent conversion Journal Article In: IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 1030–1040, 2010. @article{felps2010talsp,
title = {Developing objective measures of foreign-accent conversion},
author = {D Felps and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/felps2010talsp.pdf},
year = {2010},
date = {2010-01-01},
journal = {IEEE Transactions on Audio, Speech, and Language Processing},
volume = {18},
number = {5},
pages = {1030--1040},
publisher = {IEEE},
abstract = {Various methods have recently appeared to transform foreign-accented speech into its native-accented counterpart. Evaluation of these accent conversion methods requires extensive listening tests across a number of perceptual dimensions. This article presents three objective measures that may be used to assess the acoustic quality, degree of foreign accent, and speaker identity of accent-converted utterances. Accent conversion generates novel utterances: those of a foreign speaker with a native accent. Therefore, the acoustic quality in accent conversion cannot be evaluated with conventional measures of spectral distortion, which assume that a clean recording of the speech signal is available for comparison. Here we evaluate a single-ended measure of speech quality, ITU-T Recommendation P.563 for narrow-band telephony. We also propose a measure of foreign accent that exploits a weakness of automatic speech recognizers: their sensitivity to foreign accents. Namely, we use phoneme-level match scores given by the HTK recognizer trained on a large number of American English speakers to obtain a measure of native accent. Finally, we propose a measure of speaker identity that projects acoustic vectors (e.g., Mel cepstral, F0) onto the linear discriminant that maximizes separability for a given pair of source and target speakers. The three measures are evaluated on a corpus of accent-converted utterances that had been previously rated through perceptual tests. Our results show that the three measures have a high degree of correlation with their corresponding subjective ratings, suggesting that they may be used to accelerate the development of foreign-accent conversion tools. Applications of these measures in the context of computer assisted pronunciation training and voice conversion are also discussed.},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {article}
}
Various methods have recently appeared to transform foreign-accented speech into its native-accented counterpart. Evaluation of these accent conversion methods requires extensive listening tests across a number of perceptual dimensions. This article presents three objective measures that may be used to assess the acoustic quality, degree of foreign accent, and speaker identity of accent-converted utterances. Accent conversion generates novel utterances: those of a foreign speaker with a native accent. Therefore, the acoustic quality in accent conversion cannot be evaluated with conventional measures of spectral distortion, which assume that a clean recording of the speech signal is available for comparison. Here we evaluate a single-ended measure of speech quality, ITU-T Recommendation P.563 for narrow-band telephony. We also propose a measure of foreign accent that exploits a weakness of automatic speech recognizers: their sensitivity to foreign accents. Namely, we use phoneme-level match scores given by the HTK recognizer trained on a large number of American English speakers to obtain a measure of native accent. Finally, we propose a measure of speaker identity that projects acoustic vectors (e.g., Mel cepstral, F0) onto the linear discriminant that maximizes separability for a given pair of source and target speakers. The three measures are evaluated on a corpus of accent-converted utterances that had been previously rated through perceptual tests. Our results show that the three measures have a high degree of correlation with their corresponding subjective ratings, suggesting that they may be used to accelerate the development of foreign-accent conversion tools. Applications of these measures in the context of computer assisted pronunciation training and voice conversion are also discussed. |
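The speaker-identity measure described above is essentially a two-class discriminant projection, which is straightforward to sketch with scikit-learn; the feature dimensionality and synthetic frames below are assumptions, not the paper's setup.

# Sketch of the LDA-based identity measure: converted frames are
# projected onto the discriminant separating source and target speakers.
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.default_rng(4)
src = rng.standard_normal((300, 14)) + 1.0  # hypothetical source frames
tgt = rng.standard_normal((300, 14)) - 1.0  # hypothetical target frames
converted = rng.standard_normal((100, 14))  # accent-converted frames

X = np.vstack([src, tgt])
y = np.array([0] * len(src) + [1] * len(tgt))
lda = LinearDiscriminantAnalysis(n_components=1).fit(X, y)

# Identity is preserved when converted frames project near the source.
print("source mean:   ", float(lda.transform(src).mean()))
print("target mean:   ", float(lda.transform(tgt).mean()))
print("converted mean:", float(lda.transform(converted).mean()))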
2009
|
Pazarloglou, A; Stoleru, R; Gutierrez-Osuna, R High-resolution speech signal reconstruction in wireless sensor networks Conference Consumer Communications and Networking Conference, IEEE 2009. @conference{pazarloglou2009high,
title = {High-resolution speech signal reconstruction in wireless sensor networks},
author = {A Pazarloglou and R Stoleru and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/pazarloglou2009high.pdf},
year = {2009},
date = {2009-01-01},
booktitle = {Consumer Communications and Networking Conference},
pages = {1--5},
organization = {IEEE},
abstract = {Data streaming is an emerging class of applications for sensor networks that has very high bandwidth and processing power requirements. In this paper, a new approach for speech data streaming is proposed, which is based on a distributed scheme. This scheme focuses on balancing the energy consumption among nodes in a sensor network by allowing low-resolution streams from multiple nodes to be fused at a central processing node in order to produce an enhanced resolution speech signal. Simulations and experimental results with real microphone signals are presented.},
keywords = {Speech},
pubstate = {published},
tppubtype = {conference}
}
Data streaming is an emerging class of applications for sensor networks that has very high bandwidth and processing power requirements. In this paper, a new approach for speech data streaming is proposed, which is based on a distributed scheme. This scheme focuses on balancing the energy consumption among nodes in a sensor network by allowing low-resolution streams from multiple nodes to be fused at a central processing node in order to produce an enhanced resolution speech signal. Simulations and experimental results with real microphone signals are presented. |
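The fusion idea can be illustrated numerically: several nodes contribute coarsely quantized copies of the same signal, and averaging them at the central node raises the effective resolution. The snippet below is a toy demonstration under stated assumptions (independent sensor noise large enough to act as dither), not the paper's scheme.

# Toy demonstration: averaging low-resolution streams improves SNR.
import numpy as np

def quantize(x, bits):
    half = 2 ** (bits - 1)
    return np.round(x * half) / half

def snr_db(ref, est):
    return 10 * np.log10(np.sum(ref ** 2) / np.sum((ref - est) ** 2))

rng = np.random.default_rng(5)
t = np.linspace(0, 1, 8000, endpoint=False)
clean = np.sin(2 * np.pi * 220 * t)  # stand-in for a speech-band signal

# Each node: independent sensor noise (which dithers the quantizer),
# then coarse 4-bit quantization before transmission.
streams = [quantize(clean + 0.1 * rng.standard_normal(t.shape), bits=4)
           for _ in range(8)]
fused = np.mean(streams, axis=0)  # fusion at the central processing node

print(f"single node SNR: {snr_db(clean, streams[0]):.1f} dB")
print(f"fused SNR:       {snr_db(clean, fused):.1f} dB")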
Felps, D; Bortfeld, H; Gutierrez-Osuna, R Foreign accent conversion in computer assisted pronunciation training Journal Article In: Speech Communication, vol. 51, no. 10, pp. 920–932, 2009. @article{felps2009foreign,
title = {Foreign accent conversion in computer assisted pronunciation training},
author = {D Felps and H Bortfeld and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/felps2009foreign.pdf},
year = {2009},
date = {2009-01-01},
journal = {Speech Communication},
volume = {51},
number = {10},
pages = {920--932},
publisher = {Elsevier},
abstract = {Learners of a second language practice their pronunciation by listening to and imitating utterances from native speakers. Recent research has shown that choosing a well-matched native speaker to imitate can have a positive impact on pronunciation training. Here we propose a voice-transformation technique that can be used to generate the (arguably) ideal voice to imitate: the learner’s own voice with a native accent. Our work extends previous research, which suggests that providing learners with prosodically corrected versions of their utterances can be a suitable form of feedback in computer assisted pronunciation training. Our technique provides a conversion of both prosodic and segmental characteristics by means of a pitch-synchronous decomposition of speech into glottal excitation and spectral envelope. We apply the technique to a corpus containing parallel recordings of foreign-accented and native-accented utterances, and validate the resulting accent conversions through a series of perceptual experiments. Our results indicate that the technique can reduce foreign accentedness without significantly altering the voice quality properties of the foreign speaker. Finally, we propose a pedagogical strategy for integrating accent conversion as a form of behavioral shaping in computer assisted pronunciation training.},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {article}
}
Learners of a second language practice their pronunciation by listening to and imitating utterances from native speakers. Recent research has shown that choosing a well-matched native speaker to imitate can have a positive impact on pronunciation training. Here we propose a voice-transformation technique that can be used to generate the (arguably) ideal voice to imitate: the learner’s own voice with a native accent. Our work extends previous research, which suggests that providing learners with prosodically corrected versions of their utterances can be a suitable form of feedback in computer assisted pronunciation training. Our technique provides a conversion of both prosodic and segmental characteristics by means of a pitch-synchronous decomposition of speech into glottal excitation and spectral envelope. We apply the technique to a corpus containing parallel recordings of foreign-accented and native-accented utterances, and validate the resulting accent conversions through a series of perceptual experiments. Our results indicate that the technique can reduce foreign accentedness without significantly altering the voice quality properties of the foreign speaker. Finally, we propose a pedagogical strategy for integrating accent conversion as a form of behavioral shaping in computer assisted pronunciation training. |
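A compact way to illustrate the segmental part of this transformation is a source-filter swap: keep the learner's excitation but impose the native speaker's spectral envelope. The sketch below uses frame-level LPC as a stand-in for the paper's pitch-synchronous decomposition; the frame contents, LPC order, and alignment are assumptions.

# Sketch: LPC source-filter swap on one pair of time-aligned frames.
import numpy as np
import librosa
from scipy.signal import lfilter

def swap_envelope(frame_learner, frame_native, order=16):
    """Inverse-filter the learner's frame to get its excitation, then
    refilter through the native speaker's all-pole envelope."""
    a_learner = librosa.lpc(frame_learner, order=order)
    a_native = librosa.lpc(frame_native, order=order)
    excitation = lfilter(a_learner, [1.0], frame_learner)  # A_learner(z) x
    return lfilter([1.0], a_native, excitation)            # 1 / A_native(z)

rng = np.random.default_rng(6)
learner = rng.standard_normal(1024)  # placeholder for a learner frame
native = rng.standard_normal(1024)   # placeholder aligned native frame
converted = swap_envelope(learner, native)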
2008
|
Felps, D; Bortfeld, H; Gutierrez-Osuna, R Prosodic and segmental factors in foreign-accent conversion Technical Report 2008. @techreport{felps08prosodic,
title = {Prosodic and segmental factors in foreign-accent conversion},
author = {D Felps and H Bortfeld and R Gutierrez-Osuna},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/felps08prosodic.pdf},
year = {2008},
date = {2008-07-11},
abstract = {We propose a signal processing method that transforms foreign-accented speech to resemble its native-accented counterpart. The problem is closely related to voice conversion, except that our method seeks to preserve the organic properties of the foreign speaker’s voice; i.e., only those features which cue foreign-accentedness are to be transformed. Our method operates at two levels: prosodic and segmental. Prosodic transformation is performed by means of time and pitch scaling. Segmental transformation is performed by convolving the foreign speaker’s excitation with the warped spectral envelope of the native speaker. Perceptual results indicate that our model is able to provide a 63% reduction in foreign-accentedness. Multidimensional scaling also shows that the segmental transformation causes the perception of a new speaker to emerge, though the identity of this new speaker is three times closer to the foreign speaker than to the native speaker.},
keywords = {Accent conversion, Speech},
pubstate = {published},
tppubtype = {techreport}
}
We propose a signal processing method that transforms foreign-accented speech to resemble its native-accented counterpart. The problem is closely related to voice conversion, except that our method seeks to preserve the organic properties of the foreign speaker’s voice; i.e., only those features which cue foreign-accentedness are to be transformed. Our method operates at two levels: prosodic and segmental. Prosodic transformation is performed by means of time and pitch scaling. Segmental transformation is performed by convolving the foreign speaker’s excitation with the warped spectral envelope of the native speaker. Perceptual results indicate that our model is able to provide a 63% reduction in foreign-accentedness. Multidimensional scaling also shows that the segmental transformation causes the perception of a new speaker to emerge, though the identity of this new speaker is three times closer to the foreign speaker than to the native speaker. |
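The prosodic stage (time and pitch scaling) can be approximated with off-the-shelf phase-vocoder tools. The snippet below uses librosa as a stand-in for the paper's own processing; the scaling factors and the synthetic input are illustrative assumptions, not measured values.

# Sketch of prosodic transformation: rate and pitch moved toward the
# native speaker's values (factors are made up for illustration).
import librosa

sr = 22050
y = librosa.chirp(fmin=110, fmax=220, sr=sr, duration=2.0)  # placeholder audio

rate_factor = 1.15  # e.g., speak 15% faster
pitch_steps = -1.0  # e.g., lower pitch by one semitone

y_timed = librosa.effects.time_stretch(y, rate=rate_factor)
y_prosody = librosa.effects.pitch_shift(y_timed, sr=sr, n_steps=pitch_steps)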
Choi, H; Gutierrez-Osuna, R; Choi, S; Choe, Y Kernel oriented discriminant analysis for speaker-independent phoneme spaces Conference International Conference on Pattern Recognition, IEEE 2008. @conference{choi2008kernel,
title = {Kernel oriented discriminant analysis for speaker-independent phoneme spaces},
author = {H Choi and R Gutierrez-Osuna and S Choi and Y Choe},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/choi2008kernel.pdf},
year = {2008},
date = {2008-01-01},
booktitle = {International Conference on Pattern Recognition},
pages = {1--4},
organization = {IEEE},
abstract = {Speaker-independent feature extraction is a critical problem in speech recognition. Oriented principal component analysis (OPCA) is a potential solution that can find a subspace robust against noise in the data set. The objective of this paper is to find a speaker-independent subspace by generalizing OPCA in two steps: First, we find a nonlinear subspace with the help of a kernel trick, which we refer to as kernel OPCA. Second, we generalize OPCA to problems with more than two phonemes, which leads to oriented discriminant analysis (ODA). In addition, we equip ODA with the kernel trick again, which we refer to as kernel ODA. The models are tested on the CMU ARCTIC speech database. Our results indicate that our proposed kernel methods can outperform linear OPCA and linear ODA at finding a speaker-independent phoneme space.},
keywords = {Speech},
pubstate = {published},
tppubtype = {conference}
}
Speaker-independent feature extraction is a critical problem in speech recognition. Oriented principal component analysis (OPCA) is a potential solution that can find a subspace robust against noise in the data set. The objective of this paper is to find a speaker-independent subspace by generalizing OPCA in two steps: First, we find a nonlinear subspace with the help of a kernel trick, which we refer to as kernel OPCA. Second, we generalize OPCA to problems with more than two phonemes, which leads to oriented discriminant analysis (ODA). In addition, we equip ODA with the kernel trick again, which we refer to as kernel ODA. The models are tested on the CMU ARCTIC speech database. Our results indicate that our proposed kernel methods can outperform linear OPCA and linear ODA at finding a speaker-independent phoneme space. |
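Linear OPCA reduces to a generalized eigenproblem: maximize the signal-to-noise Rayleigh quotient w^T S w / w^T N w over projection directions w. The kernel variants discussed above solve the same problem in a kernel-induced feature space; the sketch below shows only the linear case, on synthetic covariances, with names and dimensions assumed.

# Sketch of linear OPCA via SciPy's generalized eigensolver.
import numpy as np
from scipy.linalg import eigh

rng = np.random.default_rng(7)
signal = rng.standard_normal((400, 10))       # phoneme-relevant variation
noise = 0.5 * rng.standard_normal((400, 10))  # speaker/noise variation

S = np.cov(signal, rowvar=False)                     # signal covariance
N = np.cov(noise, rowvar=False) + 1e-6 * np.eye(10)  # noise covariance

# Generalized eigenvectors of (S, N) maximize w^T S w / w^T N w.
eigvals, eigvecs = eigh(S, N)
W = eigvecs[:, ::-1][:, :3]  # top-3 oriented components
projected = signal @ W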
2005
|
Gutierrez-Osuna, R; Kakumanu, P; Esposito, A; Garcia, ON; Bojorquez, A; Castillo, JL; Rudomin, I Speech-driven facial animation with realistic dynamics Journal Article In: IEEE Transactions on Multimedia, vol. 7, no. 1, pp. 33–42, 2005. @article{gutierrez2005tmm,
title = {Speech-driven facial animation with realistic dynamics},
author = {R Gutierrez-Osuna and P Kakumanu and A Esposito and ON Garcia and A Bojorquez and JL Castillo and I Rudomin},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/gutierrez2005tmm.pdf},
year = {2005},
date = {2005-01-01},
journal = {IEEE Transactions on Multimedia},
volume = {7},
number = {1},
pages = {33--42},
publisher = {IEEE},
abstract = {This work presents an integral system capable of generating animations with realistic dynamics, including the individualized nuances, of three-dimensional (3-D) human faces driven by speech acoustics. The system is capable of capturing short phenomena in the orofacial dynamics of a given speaker by tracking the 3-D location of various MPEG-4 facial points through stereovision. A perceptual transformation of the speech spectral envelope and prosodic cues are combined into an acoustic feature vector to predict 3-D orofacial dynamics by means of a nearest-neighbor algorithm. The Karhunen-Loève transformation is used to identify the principal components of orofacial motion, decoupling perceptually natural components from experimental noise. We also present a highly optimized MPEG-4 compliant player capable of generating audio-synchronized animations at 60 frames/s. The player is based on a pseudo-muscle model augmented with a nonpenetrable ellipsoidal structure to approximate the skull and the jaw. This structure adds a sense of volume that provides more realistic dynamics than existing simplified pseudo-muscle-based approaches, yet it is simple enough to work at the desired frame rate. Experimental results on an audiovisual database of compact TIMIT sentences are presented to illustrate the performance of the complete system.},
keywords = {Facial animation, Speech},
pubstate = {published},
tppubtype = {article}
}
This work presents an integral system capable of generating animations with realistic dynamics, including the individualized nuances, of three-dimensional (3-D) human faces driven by speech acoustics. The system is capable of capturing short phenomena in the orofacial dynamics of a given speaker by tracking the 3-D location of various MPEG-4 facial points through stereovision. A perceptual transformation of the speech spectral envelope and prosodic cues are combined into an acoustic feature vector to predict 3-D orofacial dynamics by means of a nearest-neighbor algorithm. The Karhunen-Loève transformation is used to identify the principal components of orofacial motion, decoupling perceptually natural components from experimental noise. We also present a highly optimized MPEG-4 compliant player capable of generating audio-synchronized animations at 60 frames/s. The player is based on a pseudo-muscle model augmented with a nonpenetrable ellipsoidal structure to approximate the skull and the jaw. This structure adds a sense of volume that provides more realistic dynamics than existing simplified pseudo-muscle-based approaches, yet it is simple enough to work at the desired frame rate. Experimental results on an audiovisual database of compact TIMIT sentences are presented to illustrate the performance of the complete system. |
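The Karhunen-Loève step above is, in practice, a PCA over the tracked facial-point trajectories: high-variance components capture coherent orofacial motion while low-variance dimensions absorb tracking noise. A minimal sketch follows, with synthetic data and an assumed variance threshold.

# Sketch: PCA over motion-capture trajectories to separate principal
# orofacial motion from experimental noise.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(8)
# Hypothetical frames x flattened 3-D facial-point coordinates,
# generated with low-rank structure plus tracking noise.
motion = rng.standard_normal((600, 3)) @ rng.standard_normal((3, 30))
motion += 0.05 * rng.standard_normal((600, 30))

pca = PCA(n_components=0.95)  # keep 95% of the motion variance
scores = pca.fit_transform(motion)
denoised = pca.inverse_transform(scores)
print("retained components:", pca.n_components_)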
2001
|
Kakumanu, P; Gutierrez-Osuna, R; Esposito, A; Bryll, R; Goshtasby, A; Garcia, ON Speech driven facial animation Conference Proceedings of the 2001 workshop on Perceptive user interfaces, ACM 2001. @conference{kakumanu2001speech,
title = {Speech driven facial animation},
author = {P Kakumanu and R Gutierrez-Osuna and A Esposito and R Bryll and A Goshtasby and ON Garcia},
url = {https://psi.engr.tamu.edu/wp-content/uploads/2018/01/kakumanu2001speech.pdf},
year = {2001},
date = {2001-01-01},
booktitle = {Proceedings of the 2001 workshop on Perceptive user interfaces},
pages = {1--5},
organization = {ACM},
abstract = {The results reported in this article are an integral part of a larger project aimed at achieving perceptually realistic animations, including the individualized nuances, of three-dimensional human faces driven by speech. The audiovisual system that has been developed for learning the spatio-temporal relationship between speech acoustics and facial animation is described, including video and speech processing, pattern analysis, and MPEG-4 compliant facial animation for a given speaker. In particular, we propose a perceptual transformation of the speech spectral envelope, which is shown to capture the dynamics of articulatory movements. An efficient nearest-neighbor algorithm is used to predict novel articulatory trajectories from the speech dynamics. The results are very promising and suggest a new way to approach the modeling of synthetic lip motion of a given speaker driven by his/her speech. This would also provide clues toward a more general cross-speaker realistic animation.},
keywords = {Facial animation, Speech},
pubstate = {published},
tppubtype = {conference}
}
The results reported in this article are an integral part of a larger project aimed at achieving perceptually realistic animations, including the individualized nuances, of three-dimensional human faces driven by speech. The audiovisual system that has been developed for learning the spatio-temporal relationship between speech acoustics and facial animation is described, including video and speech processing, pattern analysis, and MPEG-4 compliant facial animation for a given speaker. In particular, we propose a perceptual transformation of the speech spectral envelope, which is shown to capture the dynamics of articulatory movements. An efficient nearest-neighbor algorithm is used to predict novel articulatory trajectories from the speech dynamics. The results are very promising and suggest a new way to approach the modeling of synthetic lip motion of a given speaker driven by his/her speech. This would also provide clues toward a more general cross-speaker realistic animation. |
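The nearest-neighbor mapping at the heart of this approach fits in a few lines with scikit-learn; feature extraction, temporal smoothing, and the MPEG-4 player are outside the snippet, and all shapes and data below are assumptions.

# Sketch: k-NN regression from acoustic feature frames to facial
# animation parameters, then prediction on novel speech.
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

rng = np.random.default_rng(9)
acoustic = rng.standard_normal((1000, 16))  # hypothetical feature frames
fap = rng.standard_normal((1000, 20))       # aligned animation parameters

knn = KNeighborsRegressor(n_neighbors=5).fit(acoustic, fap)
novel = rng.standard_normal((30, 16))       # frames from new speech
trajectory = knn.predict(novel)             # drives the face model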