Toward the compilation of C-ORAL-ANGOLA: an informal spontaneous speech corpus of Angolan Portuguese

DOI:

https://doi.org/10.11606/issn.2176-9419.v20iEspecialp139-157

Keywords:

Angolan Portuguese, Spontaneous speech, Corpus, Compilation

Abstract

This paper introduces the architecture and compilation criteria for a spontaneous speech corpus of Angolan Portuguese. After a brief overview of the linguistic situation in Angola, we describe in depth the recording modalities and the treatment of the multiple kinds of sociolinguistic variation documented, with special attention to diaphasic variation. The first twenty-seven recorded texts are then detailed. These will make up a minicorpus of at least 30,000 words, which will be prosodically segmented and provided with text-to-speech alignment. The last part of the article is dedicated to the methodological steps taken in compiling the corpus: definition of acoustic quality, transcription criteria, prosodic segmentation procedures, revision, alignment, and statistical validation.

Author Biographies

  • Bruno Rocha, Federal University of Pará
    Professor Adjunto, Faculdade de Letras
  • Heliana Mello, Federal University of Minas Gerais
    Professora Titular, Faculdade de Letras
  • Tommaso Raso, Federal University of Minas Gerais
    Professor Titular, Faculdade de Letras

Published

2018-12-30

Section

Papers

How to Cite

Rocha, B., Mello, H., & Raso, T. (2018). Toward the compilation of C-ORAL-ANGOLA: an informal spontaneous speech corpus of Angolan Portuguese. Filologia E Linguística Portuguesa, 20(Especial), 139-157. https://doi.org/10.11606/issn.2176-9419.v20iEspecialp139-157