Publications

Show all

2023

Anurag Das Waris Quamer, Ricardo Gutierrez-Osuna

Decoupling segmental and prosodic cues of non-native speech through vector quantization Proceedings Article

In: Proc. Interspeech, 2023.

Links | BibTeX | Tags: Accent conversion, Speech, Voice conversion

2022

Quamer, W.; Das, A.; Levis, J.; Chukharev-Hudilainen, E.; Gutierrez-Osuna, R.

Zero-Shot Foreign Accent Conversion without a Native Reference Proceedings Article Forthcoming

In: Proc. Interspeech, Forthcoming.

Links | BibTeX | Tags: Accent conversion, Deep learning, Speech

Liberatore, C.; Gutierrez-Osuna, R.

Minimizing residuals for native-nonnative voice conversion in a sparse, anchor-based representation of speech Proceedings Article

In: Proc. ICASSP, 2022.

BibTeX | Tags: Accent conversion, Speech

2021

Ding, S.; Zhao, G.; Gutierrez-Osuna, R.

Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning Journal Article

In: Computer Speech & Language, 2021.

Links | BibTeX | Tags: Accent conversion, Deep learning, Speech

Liberatore, C.; Gutierrez-Osuna, R.

An Exemplar Selection Algorithm For Native-Nonnative Voice Conversion Proceedings Article

In: Proc. Interspeech, 2021.

Links | BibTeX | Tags: Accent conversion, Speech

Silpachai, A.; Rehman, I.; Barriuso, T. A.; Levis, J.; Chukharev-Hudilainen, E.; Zhao, G.; Gutierrez-Osuna, R.

Effects Of Voice Type And Task On L2 Learners’ Awareness Of Pronunciation Errors Proceedings Article

In: Proc. Interspeech, 2021.

BibTeX | Tags: Accent conversion, Speech

Hair, A.; Zhao, G.; Ahmed, B.; Ballard, K. J.; Gutierrez-Osuna, R.

Assessing Posterior-Based Mispronunciation Detection on Field-Collected Recordings from Child Speech Therapy Sessions Proceedings Article

In: Proc. Interspeech, 2021.

Links | BibTeX | Tags: Automatic Speech Recognition, Childhood apraxia of speech, Speech

Zhao, G.; Ding, S.; Gutierrez-Osuna, R.

Converting Foreign Accent Speech Without a Reference Journal Article

In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 2367, 2021.

Links | BibTeX | Tags: Accent conversion, Deep learning, Speech

Hair, A.; Ballard, K. J.; Markoulli, C.; Monroe, P.; McKechnie, J.; Ahmed, B.; Gutierrez-Osuna, R.

A Longitudinal Evaluation of Tablet-Based Child Speech Therapy with Apraxia World Journal Article

In: ACM Transactions On Accessible Computing, vol. 14, no. 1, 2021.

Abstract | Links | BibTeX | Tags: Games, Health, Speech

2020

Ding, S.; Zhao, G.; Gutierrez-Osuna, R.

Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition Proceedings Article

In: Proc. Interspeech, 2020.

Links | BibTeX | Tags: Accent conversion, Deep learning, Speech

Das, A.; Zhao, G.; Levis, J.; Chukharev-Hudilainen, E.; Gutierrez-Osuna, R.

Understanding the Effect of Voice Quality and Accent on Talker Similarity Proceedings Article

In: Proc. Interspeech, 2020.

Links | BibTeX | Tags: Accent conversion, Deep learning, Speech

Lučić, I.; Silpachai, A.; Levis, J.; Zhao, G; Gutierrez-Osuna, R.

The English Pronunciation of Arabic Speakers - A Data-Driven Approach to Segmental Error Identification Journal Article

In: Language Teaching Research, 2020.

Links | BibTeX | Tags: Accent conversion, Speech

McKechnie, J.; Ahmed, B.; Gutierrez-Osuna, R.; Murray, E.; McCabe, P.; Ballard, K.

The influence of type of feedback during tablet-based delivery of intensive treatment for childhood apraxia of speech Journal Article

In: Journal of Communication Disorders, 2020.

Links | BibTeX | Tags: Health, Speech

Hair, A; Markoulli, C; Monroe, P; McKechnie, J; Ballard, K J; Ahmed, B; Gutierrez-Osuna, R

Preliminary Results From a Longitudinal Study of a Tablet-Based Speech Therapy Game Proceedings Article

In: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing, ACM, 2020, ISBN: 978-1-4503-6819-3/20/04.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Childhood apraxia of speech, Health, Speech

2019

Ding, S.; Zhao, G; Liberatore, C.; Gutierrez-Osuna, R.

Learning Structured Sparse Representations for Voice Conversion Journal Article

In: IEEE Transactions on Audio, Speech and Language Processing, vol. 28, pp. 343-354, 2019.

Links | BibTeX | Tags: Accent conversion, Speech

Ding, S.; Liberatore, C.; Sonsaat, S.; Lučić, I.; Silpachai, A.; Zhao, G; Chukharev-Hudilainen, E.; Levis, J.; Gutierrez-Osuna, R.

Golden speaker builder – An interactive tool for pronunciation training Journal Article

In: Speech Communication, vol. 115, pp. 51-66, 2019.

Links | BibTeX | Tags: Accent conversion, Speech

Hair, A; Ballard, K J; Ahmed, B; Gutierrez-Osuna, R

Evaluating Automatic Speech Recognition for Child Speech Therapy Applications Proceedings Article

In: ACM SIGACCESS Conference on Computers and Accessibility, ACM 2019, ISBN: 978-1-4503-6676-2/19/10.

Abstract | Links | BibTeX | Tags: Automatic Speech Recognition, Childhood apraxia of speech, Health, Speech

Ding, S.; Gutierrez-Osuna, Ricardo

Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion Proceedings Article

In: Proc. Interspeech, 2019.

Links | BibTeX | Tags: Accent conversion, Speech

Zhao, G; Ding, S.; Gutierrez-Osuna, Ricardo

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams Proceedings Article

In: Proc. Interspeech, 2019.

Links | BibTeX | Tags: Accent conversion, Deep learning, Speech

Zhao, G; Gutierrez-Osuna, R

Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion Journal Article

In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 27, no. 10, pp. 1649-1660, 2019, ISSN: 2329-9290.

Abstract | Links | BibTeX | Tags: Accent conversion, Deep learning, Speech

Monteiro, C. D. D.; Shipman, F. M.; III, S. Duggina; Gutierrez-Osuna, R.

Tradeoffs in the Efficient Detection of Sign Language Content in Video Sharing Sites Journal Article

In: ACM Transactions on Accessible Computing, vol. 12, no. 2, pp. 1-16, 2019.

Links | BibTeX | Tags: Computer vision, Speech

2018

Ahmed, B; Monroe, P; Hair, A; Tan, C-T; Gutierrez-Osuna, R; Ballard, K J

Speech-driven mobile games for speech therapy: User experiences and feasibility Journal Article

In: International Journal of Speech-Language Pathology , vol. 20, no. 6, pp. 644-658, 2018.

Links | BibTeX | Tags: Childhood apraxia of speech, Games, Health, Speech

Levis, J.; Chukharev-Hudilainen, E.; Gutierrez-Osuna, R.; Lucic, I.; Silpachai, A.; Sonsaat, S.

Golden Speaker: Learner Experience with Computer-assisted Pronunciation Practice Proceedings Article

In: Proc. Pronunciation in Second Language Learning and Teaching Conference, 2018.

BibTeX | Tags: Accent conversion, Speech

Ding, S.; Zhao, G; Liberatore, C.; Gutierrez-Osuna, R.

Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function Proceedings Article

In: Proc. Interspeech, 2018.

Links | BibTeX | Tags: Accent conversion, Speech

McKechnie, J.; Ahmed, B.; Gutierrez-Osuna, R.; Monroe, P.; McCabe, P.; Ballard, K. J.

Automated speech analysis tools for children’s speech production: A systematic literature review Journal Article

In: International Journal of Speech-Language Pathology, vol. 20, no. 6, pp. 583–598, 2018.

Links | BibTeX | Tags: Childhood apraxia of speech, Health, Speech

Zhao, G; Sonsaat, S; Silpachai, A; Lucic, I; Chukharev-Hudilainen, E; Levis, J; Gutierrez-Osuna, R

L2-ARCTIC: A Non-Native English Speech Corpus Proceedings Article

In: Proc. Interspeech, 2018.

Links | BibTeX | Tags: Accent conversion, Speech

Ding, S.; Liberatore, C.; Gutierrez-Osuna, R.

Learning Structured Dictionaries for Exemplar-based Voice Conversion Proceedings Article

In: Proc. Interspeech, 2018.

Links | BibTeX | Tags: Accent conversion, Speech

Hair, A; Monroe, P; Ahmed, B; Ballard, K J; Gutierrez-Osuna, R

Apraxia World: A Speech Therapy Game for Children with Speech Sound Disorders Proceedings Article

In: Proceedings of the 2018 Conference on Interaction Design and Children, ACM, 2018, ISBN: 978-1-4503-5152-2/18/06.

Abstract | Links | BibTeX | Tags: Childhood apraxia of speech, Health, Mobile computing, Speech

Liberatore, C; Zhao, G; Gutierrez-Osuna, R

Voice Conversion through Residual Warping in a Sparse, Anchor-Based Representation of Speech Proceedings Article

In: Proc. ICASSP, 2018.

Abstract | Links | BibTeX | Tags: Accent conversion, Speech

Zhao, G; Sonsaat, S; Levis, J; Chukharev-Hudilainen, E; Gutierrez-Osuna, R

Accent conversion using phonetic posteriorgrams Proceedings Article

In: Proc. ICASSP, 2018.

Links | BibTeX | Tags: Accent conversion, Deep learning, Speech

Angello, G.; Zhao, G.; Manam, A. B.; Gutierrez-Osuna, R.

Training Behavior of Successful Tacton-Phoneme Learners Proceedings Article

In: IEEE Haptics Symposium, 2018.

Links | BibTeX | Tags: Other, Speech

2017

Shipman, F; Duggina, S; Monteiro, C; Gutierrez-Osuna, R

Speed-Accuracy Tradeoffs for Detecting Sign Language Content in Video Sharing Sites Proceedings Article

In: Proceedings of ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2017), pp. 185-189, 2017.

Links | BibTeX | Tags: Computer vision, Gestures, Speech

Zhao, G; Gutierrez-Osuna, R

Exemplar selection methods in voice conversion Proceedings Article

In: Proc. 42nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5525-5529, 2017.

Links | BibTeX | Tags: Speech, Voice conversion

2016

Aryal, S; Gutierrez-Osuna, R

Comparing Articulatory and Acoustic Strategies for Reducing Non-Native Accents Proceedings Article

In: Proc. Interspeech, 2016.

Links | BibTeX | Tags: Accent conversion, Articulatory synthesis, Speech

Liberatore, C; Gutierrez-Osuna, R

Generating Gestural Scores from Acoustics Through a Sparse Anchor-Based Representation of Speech Proceedings Article

In: Proc. Interspeech, 2016.

Links | BibTeX | Tags: Articulatory synthesis, Speech

Shahin, M; Gutierrez-Osuna, R; Ahmed, B

Classification of bisyllabic lexical stress patterns in disordered speech using deep learning Proceedings Article

In: Proc. International Conference on Acoustics, Speech, and Signal Processing, 2016.

Links | BibTeX | Tags: Speech

McKechnie, J; Ballard, K J; McCabe, P; Murray, E; Lan, T; Gutierrez-Osuna, R; Ahmed, B

Influence of type of feedback on effect of tablet-based delivery of intensive speech therapy in children with Childhood Apraxia of Speech Proceedings Article

In: Proceedings of the Motor Speech Conference, 2016.

BibTeX | Tags: Childhood apraxia of speech, Games, Health, Mobile computing, Speech

Aryal, S; Gutierrez-Osuna, R

Data driven articulatory synthesis with deep neural networks Journal Article

In: Computer Speech and Language, vol. 36, pp. 260-273, 2016.

Links | BibTeX | Tags: Accent conversion, Articulatory synthesis, Deep learning, Speech

2015

Parnandi, A; Karappa, V; Lan, T; Shahin, M; McKechnie, J; Ballard, K; Ahmed, B; Gutierrez-Osuna, R

Development of a remote therapy tool for childhood apraxia of speech Journal Article

In: ACM Transactions on Accessible Computing, vol. 7, no. 3, pp. 10:1-10:23, 2015.

Links | BibTeX | Tags: Childhood apraxia of speech, Games, Health, Mobile computing, Speech

Aryal, S; Gutierrez-Osuna, R

Articulatory-based conversion of foreign accents with deep neural networks Proceedings Article

In: Proc. Interspeech, pp. 3385-3389, 2015.

Links | BibTeX | Tags: Accent conversion, Articulatory synthesis, Deep learning, Speech

Shahin, M; Ahmed, B; Parnandi, A; Karappa, V; McKechnie, J; Ballard, K; Gutierrez-Osuna, R

Tabby Talks: an automated tool for the assessment of childhood apraxia of speech Journal Article

In: Speech Communication, vol. in press, 2015.

Links | BibTeX | Tags: Childhood apraxia of speech, Games, Health, Mobile computing, Speech

Aryal, S; Gutierrez-Osuna, R

Reduction of non-native accents through statistical parametric articulatory synthesis Journal Article

In: Journal of the Acoustical Society of America, vol. 137, no. 1, pp. 433-446, 2015.

Links | BibTeX | Tags: Accent conversion, Articulatory synthesis, Speech

2014

Lan, T; Aryal, S; Ahmed, B; Ballard, K; Gutierrez-Osuna, R

Flappy Voice: An Interactive Game for Childhood Apraxia of Speech Therapy Proceedings Article

In: Proc. CHI-PLAY, 2014.

Links | BibTeX | Tags: Childhood apraxia of speech, Games, Health, Speech

Shahin, M; Ahmed, B; McKechnie, J; Ballard, K; Gutierrez-Osuna, R

A comparison of GMM-HMM and DNN-HMM based pronunciation verification techniques for use in the assessment of childhood apraxia of speech Proceedings Article

In: Proc. Interspeech, 2014.

Links | BibTeX | Tags: Childhood apraxia of speech, Health, Speech

McKechnie, J; Ballard, K; McCabe, P; Gutierrez-Osuna, R; Karappa, V; Parnandi, A; Shahin, M; Murray, E; Ahmed, B

Tablet-based delivery of intensive speech therapy in children with Childhood Apraxia of Speech - Pilot Phase Proceedings Article

In: Speech Pathology Australia National Conference, 2014.

BibTeX | Tags: Games, Health, Speech

Aryal, S; Gutierrez-Osuna, R

Accent conversion through cross-speaker articulatory synthesis Proceedings Article

In: Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7744-7748, 2014.

Links | BibTeX | Tags: Accent conversion, Articulatory synthesis, Speech

Aryal, S; Gutierrez-Osuna, R

Can voice conversion be used to reduce non-native accents Proceedings Article

In: Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7929-7933, 2014.

Links | BibTeX | Tags: Accent conversion, Speech

Felps, D; Aryal, S; Gutierrez-Osuna, R

Normalization of articulatory data through Procrustes transformations and analysis-by-synthesis Proceedings Article

In: Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 3051-3055, 2014.

Links | BibTeX | Tags: Accent conversion, Articulatory synthesis, Speech

2013

Parnandi, A; Karappa, V; Son, Y; Shahin, M; McKechnie, J; Ballard, K; Ahmed, B; Gutierrez-Osuna, R

Architecture of an automated therapy tool for childhood apraxia of speech Conference

The 15th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2013.

Links | BibTeX | Tags: Childhood apraxia of speech, Games, Health, Mobile computing, Speech

Aryal, S; Felps, D; Gutierrez-Osuna, R

Foreign Accent Conversion through Voice Morphing Proceedings Article

In: Interspeech, pp. 3077-3081, 2013.

Links | BibTeX | Tags: Accent conversion, Speech

Aryal, S; Gutierrez-Osuna, R

Articulatory inversion and synthesis: towards articulatory-based modification of speech Proceedings Article

In: 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 7952-7956, 2013.

Links | BibTeX | Tags: Articulatory inversion, Articulatory synthesis, Speech

2012

Aryal, S; Gutierrez-Osuna, R

Articulatory Inversion and Synthesis: Towards Articulatory-Based Modification of Speech Technical Report

2012.

Abstract | Links | BibTeX | Tags: Articulatory inversion, Articulatory synthesis, Speech

Parnandi, A; Son, Y; Shahin, M; Ahmed, B; Gutierrez-Osuna, R

Architecture of an Automated Therapy Tool for Childhood Apraxia of Speech Technical Report

2012.

Abstract | Links | BibTeX | Tags: Games, Health, Mobile computing, Speech

Felps, D; Geng, C; Gutierrez-Osuna, R

Foreign accent conversion through concatenative synthesis in the articulatory domain Journal Article

In: IEEE Transactions on Audio, Speech and Language Processing, 2012.

Abstract | Links | BibTeX | Tags: Accent conversion, Speech

2010

Felps, D; Gutierrez-Osuna, R

Normalization of Articulatory Data through Procrustes Transformations and Analysis-by-synthesis Technical Report

2010.

Abstract | Links | BibTeX | Tags: Articulatory synthesis, Speech

Gutierrez-Osuna, R; Felps, D

Foreign Accent Conversion through Voice Morphing Technical Report

2010.

Abstract | Links | BibTeX | Tags: Accent conversion, Speech

Felps, D; Gutierrez-Osuna, R

Developing objective measures of foreign-accent conversion Journal Article

In: Audio, Speech, and Language Processing, IEEE Transactions on, vol. 18, no. 5, pp. 1030–1040, 2010.

Abstract | Links | BibTeX | Tags: Accent conversion, Speech

Felps, D; Geng, C; Berger, M; Richmond, K; Gutierrez-Osuna, R

Relying on critical articulators to estimate vocal tract spectra in an articulatory-acoustic database Conference

Interspeech, 2010.

Abstract | Links | BibTeX | Tags: Accent conversion, Speech

2009

Felps, D; Bortfeld, H; Gutierrez-Osuna, R

Foreign accent conversion in computer assisted pronunciation training Journal Article

In: Speech communication, vol. 51, no. 10, pp. 920–932, 2009.

Abstract | Links | BibTeX | Tags: Accent conversion, Speech

Pazarloglou, A; Stoleru, R; Gutierrez-Osuna, R

High-resolution speech signal reconstruction in wireless sensor networks Conference

Consumer Communications and Networking Conference, IEEE 2009.

Abstract | Links | BibTeX | Tags: Speech

2008

Felps, D; Bortfeld, H; Gutierrez-Osuna, R

Prosodic and segmental factors in foreign-accent conversion Technical Report

2008.

Abstract | Links | BibTeX | Tags: Accent conversion, Speech

Choi, H; Gutierrez-Osuna, R; Choi, S; Choe, Y

Kernel oriented discriminant analysis for speaker-independent phoneme spaces Conference

International Conference on Pattern Recognition, IEEE 2008.

Abstract | Links | BibTeX | Tags: Speech

2005

Gutierrez-Osuna, R; Kakumanu, P; Esposito, A; Garcia, ON; Bojorquez, A; Castillo, JL; Rudomin, I

Speech-driven facial animation with realistic dynamics Journal Article

In: Multimedia, IEEE Transactions on, vol. 7, no. 1, pp. 33–42, 2005.

Abstract | Links | BibTeX | Tags: Facial animation, Speech

2001

Kakumanu, P; Gutierrez-Osuna, R; Esposito, A; Bryll, R; Goshtasby, A; Garcia, ON

Speech driven facial animation Conference

Proceedings of the 2001 workshop on Perceptive user interfaces, ACM 2001.

Abstract | Links | BibTeX | Tags: Facial animation, Speech