Automatic speech recognizers for Mexican Spanish and its open resources

Carlos Daniel Hernández Mena; Ivan V. Meza Ruiz; José Abel Herrera Camacho

doi:10.1016/j.jart.2017.02.001

PDF

Published: Jun 7, 2019

DOI: https://doi.org/10.1016/j.jart.2017.02.001

Keywords:

Automatic speech recognition, Mexican Spanish, Language resources, Language model, Acoustic model

Carlos Daniel Hernández Mena

Ivan V. Meza Ruiz

José Abel Herrera Camacho

Abstract

Development of automatic speech recognition systems relies on the availability of distinct language resources such as speech recordings, pronunciation dictionaries, and language models. These resources are scarce for the Mexican Spanish dialect. In this work, we present a revision of the CIEMPIESS corpus that is a resource for spontaneous speech recognition in Mexican Spanish of Central Mexico. It consists of 17 h of segmented and transcribed recordings, a phonetic dictionary composed by 53,169 unique words, and a language model composed by 1,505,491 words extracted from 2489 university newsletters. We also evaluate the CIEMPIESS corpus using three well known state of the art speech recognition engines, having satisfactory results. These resources are open for research and development in the field. Additionally, we present the methodology and the tools used to facilitate the creation of these resources which can be easily adapted to other variants of Spanish, or even other languages.

How to Cite

Mena, C. D. H., Ruiz, I. V. M., & Camacho, J. A. H. (2019). Automatic speech recognizers for Mexican Spanish and its open resources. Journal of Applied Research and Technology, 15(3). https://doi.org/10.1016/j.jart.2017.02.001

Issue

Vol. 15 No. 3

Section

Articles

Author Biographies

Carlos Daniel Hernández Mena

Laboratorio de Tecnologías del Lenguaje (LTL), Universidad Nacional Autónoma de México (UNAM), Mexico

Ivan V. Meza Ruiz

Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas (IIMAS), Universidad Nacional Autónoma de México (UNAM), Mexico

José Abel Herrera Camacho

Laboratorio de Tecnologías del Lenguaje (LTL), Universidad Nacional Autónoma de México (UNAM), Mexico

Article Sidebar

Main Article Content

Abstract

Article Details

Carlos Daniel Hernández Mena

Ivan V. Meza Ruiz

José Abel Herrera Camacho