This is a list of links to lexical databases and corpora, organized by language or language group. The resources on this page were initially compiled from announcements on the LINGUIST list and web-search results. This is not intended to be an exhaustive list, but rather a place to organize and store potentially useful links as I encounter them. Suggestions for additional links to include on this page are welcome.

Contents of this page:

Lexical database resources (lemmas, wordforms, frequency information)

Resources for: Collections | French | German | English | Italian | Spanish

Collections

French

German

English

Italian

Spanish

Corpora

Collections | Chinese (Mandarin) | English | Icelandic | Indo-European | Italian | Japanese | Persian/Farsi | Polish | Portuguese | Spanish | Sumerian | Swedish | Turkish

Collections

Chinese (Mandarin)

English

Icelandic

Indo-European

Italian

Japanese

Persian/Farsi

Polish

Portuguese

Spanish

Sumerian

Swedish

Turkish

Lists of lexical-database and corpus resources


Last update and link check: March 2012