Leo Corpus

Taxonomy :

The Leo Corpus documents the simultaneous development of Mandarin, Cantonese and English in a Hong Kong child from 1;06-2;11. The current corpus contains monthly audio recordings and corresponding transcripts in three languages for 18 months from 1;06 to 2;11 (54 files, 27 hours in total), featuring Leo interacting with his main providers of input in the three languages: Mandarin from father and grandmother, Cantonese from mother, and English from mother, domestic helper and school teachers who are native speakers of English (represented by an American research assistant in the recordings).


- Other info -

Language(s) :

Mandarin Chinese
Cantonese
English

Types : multilingual corpus
Domain : Everyday spoken language
Size : 27 hours of recording
Developer : Ziyin Mai Virginia Yip
Availability : Free
Update: 2018