Corpora

Multilingual Web Data

Social Media and Public Discourse (blogs, news, cosial networks, etc.)

Task-Oriented Corpora

Large Annotated Corpora

Parallel Corpora

Regulatory and Institutional Texts

Corpora for Speech Technologies

COVID-19 Text and Data Resources