david-hutchinson.ca

Resources for applied linguistics and language teaching

Corpus Linguistics

One of my main interests in linguistics is corpus linguistics. Here are some resources for it:

Corpus Portals

These sites let you query existing language corpora.

The British National Corpus (BNC): http://corpus.byu.edu/bnc/ contains 100 million words of British English.

The Corpus of Contemporary American English (COCA): http://corpus.byu.edu/coca/ is an expanding corpus, updated every year.  It now stands at 450 million words of American English.

The Corpus of Historical American English (COHA): http://corpus.byu.edu/coha/ contains subcorpora divided by decade, from 1810 to 2009.

The Wortschatz project at the University of Leipzig: http://corpora.uni-leipzig.de/ hosts dozens of corpora in many different languages.