Toward the compilation of C-ORAL-ANGOLA
an informal spontaneous speech corpus of Angolan Portuguese
DOI:
https://doi.org/10.11606/issn.2176-9419.v20iEspecialp139-157Keywords:
Angolan Portuguese, Spontaneous speech, Corpus, CompilationAbstract
The paper introduces the architecture and compilation criteria for an Angolan Portuguese spontaneous speech corpus. After a brief introduction about the linguistic scenario in Angola, we present an in-depth description of the recording modalities and treatment related to the multiple sociolinguistic variations documented, with special attention to diaphasic variation. The first twenty-seven recorded texts are then detailed. These will make up a minicorpus, portraying at least 30,000 words. The minicorpus will be prosodically segmented and will display text-to-speech alignment. The last part of the article is dedicated to the methodological steps taken for the corpus compilation: acoustic quality definition, transcription criteria, prosodic segmentation procedures, revision, alignment and statistic validation.
Downloads
References
Downloads
Published
Issue
Section
License
Copyright is transferred to the journal for the online publication, with free access, and for the printing in paper documents. Copyright may be preserved for authors who wish to republish their work in collections.






