LDC
The Linguistic Data Consortium (LDC) is an open consortium of universities, companies, and government research laboratories that serves as a repository and distribution point for language resources. It creates and distributes a wide array of such resources, including speech and text databases, lexicons, data collections, corpora, software, research papers and specifications, and other materials for research and development.