The International Corpus of English (ICE) began in 1990 with the primary aim of collecting material for comparative studies of English worldwide. Twenty-four research teams around the world are preparing electronic corpora of their own national or regional variety of English. Each ICE corpus consists of one million words of spoken and written English produced after 1989. For most participating countries, the ICE project is stimulating the first systematic investigation of the national variety. To ensure compatibility among the component corpora, each team is following a common corpus design, as well as a common scheme for grammatical annotation.
Contact information: Professor Gerald Nelson, Department of English, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong SAR. Email: firstname.lastname@example.org; Fax: +852 2603 5270
Contact information for individual ICE teams may be found here.
News April 2013
Tagged ICE corpora now available
The tagging of all currently available ICE corpora with CLAWS7 and the USAS semantic tagger is now complete, and the corpora are available for non-profit, academic research. If you wish to download any of the tagged corpora, please send an email to email@example.com with the subject line "Tagged ICE Corpora". Your email should also indicate your academic affiliation. I will then get back to you with details of how to proceed.
Thanks to Dr Paul Rayson, Director of the UCREL research centre at Lancaster University, for his generous cooperation in this initiative.
The following recent publications have made extensive use of ICE corpus materials:
Lange, Claudia (2012) The Syntax of Spoken Indian English. VEAW G45, Amsterdam: Benjamins.
Hundt, Marianne and Ulrike Gut (eds) (2012) Mapping Unity and Diversity Worldwide: Corpus-based Studies of New Englishes. VEAW G43, Amsterdam: Benjamins.
Aarts, Bas (2011) Oxford Modern English Grammar. Oxford: OUP.
Hasselgård, Hilde (2010) Adjunct Adverbials in English. Cambridge: CUP.
ICAME Journal No 34, April 2010, dedicated to 'new' ICE corpora.
Release of ICE Sri Lanka (written)
I am very pleased to announce the release of the written component of the ICE Sri Lanka (ICE-SL) corpus. The corpus is available in standard SGML format and in a POS-tagged version, using the CLAWS C7 tagset. To obtain a copy of the corpus and Manual, please email firstname.lastname@example.org.
Release of ICE USA (written)
Launch of ICE Uganda project
I am very pleased to announce the launch of ICE Uganda. The project is directed by Prof. Dr. Christiane Meierkord at Ruhr-University Bochum, Germany.
Release of ICE Ireland
The ICE Ireland corpus is now available for academic research. Also available is ICE Ireland: A User's Guide, by Dr John Kirk (Belfast) and Dr Jeffrey Kallen (Dublin). For more information, see the ICE Ireland website.
Last updated: 30 April 2013 © The ICE Project
ICE corpora are available for non-commercial,