Corpus of American Soap Operas

Taxonomy :

The SOAP corpus contains 100 million words of data from 22,000 transcripts from American soap operas from the early 2000s


- Other info -

Language(s) :

American English

Types : monolingual corpus
Domain : American soap operas from 2001-2012
Size : 100 million
Developer : Mark Davies
Availability : Free
Update: 07/2012