A Video Corpus of Spanish Spoken in Texas
The goal of the Spanish in Texas project is to develop a corpus of Spanish and bilingual Spanish-English speech samples culled from interviews and conversations among speakers of diverse personal profiles and regional origins throughout Texas.
What’s Available
The corpus consists of over 500,000 words from 97 bilingual speakers living in Texas. Video files, audio files, full transcripts, and POS annotations are available for download.
Free Access for Researchers and Educators
The Spanish in Texas Corpus can be downloaded from the Texas Data Repository. Access is free and requires agreeing to abide by our Code of Ethics and providing your name and contact information.