Corpus of Historical American English (COHA)

Synonyms Thesaurus with Antonyms

Shakespeare corpus
The source texts came from the Online Library of Liberty. Their original source is the OUP edition of 1916. You get 37 plays, plus all the speeches of all the characters; i.e., you get the whole play Hamlet, plus, separately, all the speeches of Prince Hamlet, all the speeches of Horatio, etc. There is also a list of the plays and their dates. All the files are saved in 16-bit Unicode. The plays are in the root of 3 folders (comedies, historical, tragedies) as appropriate. Mike Scott, mike (at) lexically.net

Baldwin Library of Historical Children's Literature - Special and Area Studies Collections - University of Florida Smathers Libraries
The Baldwin Library of Historical Children’s Literature in the Department of Special and Area Studies Collections at the University of Florida’s George A. Smathers Libraries contains more than 130,000 books and periodicals published in the United States and Great Britain from the mid-1600s to the present day. The Library also has manuscript collections, original artwork, and assorted ephemera such as board games, puzzles, and toys. Other strengths and distinctions of the Baldwin Library include marginalia and inscriptions, the Hans Christian Andersen Awards Collection, Little Golden Books, religious tracts, and illustrated editions from the Golden Age of Children's Literature. The Baldwin Library also runs the Louise Seaman Bechtel Fellowship in conjunction with the Association for Library Service to Children in the American Library Association and has a year-long Speaker Series, which featured Dr.

Acronym Finder

BASE (British Academic Spoken English) and BASE Plus Collections
Overview of BASE
The British Academic Spoken English (BASE) project took place at the Universities of Warwick and Reading between 2000 and 2005, under the directorship of Hilary Nesi (Warwick), with Paul Thompson (Reading). Natalie Snodgrass and Sarah Creer were employed as research assistants, and Tim Kelly was the video producer of the project. Lou Burnard (Oxford University) and Adam Kilgarriff (Lexicography MasterClass Ltd) acted as consultants. The BASE Corpus consists of 160 lectures and 40 seminars recorded in a variety of departments (video-recorded at the University of Warwick and audio-recorded at the University of Reading). It contains 1,644,942 tokens in total (lectures and seminars). The corpus has been deposited in the Oxford Text Archive and is catalogued by the Arts and Humanities Data Service.
Funding
Overview of BASE Plus
BASE Plus is a larger collection of British Academic Spoken English data held at the Centre for Applied Linguistics.

Zentrales Verzeichnis Digitalisierter Drucke

English to French, Italian, German & Spanish Dictionary

Corpora, Collections, Data Archives
1. British National Corpus (BNC) [100m wds; 1990s British English, spoken & written]: Many different web sites give free but limited access to the corpus. Access is limited due to copyright: you cannot expand the concordance context to read more of the surrounding text, and you cannot read the entire source texts (only snippets).
BNCweb: User-friendly, free interface (limited features without a paid licence).
JustTheWord: The most accessible site for students from non-English-speaking backgrounds (and the most pedagogically useful), because it immediately gives you a list of collocations for your search word or phrase instead of concordances. Results are categorized by POS-based patterns and by approximate sense clusters, and bar graphs indicate how common each combination is. Results are based on an 80K-word subset of the BNC.
2. Corpus of Contemporary American English (COCA) [450m wds; 20m wds of American English each year from 1990 to 2012].

Münchener Digitalisierungszentrum
