ICE-corpora.net

 


ICELite  An Internet-sourced International Corpus of English

Funded by the Arts Faculty, The Chinese University of Hong Kong. Project code: 2010317.


Aims:
a) To evaluate the technical feasibility of using webcrawler technology to collect ICE corpora via the Internet, as well as other software to convert Internet-sourced files to ICE format.

b) To collect, classify, and annotate samples of written English via the Internet from the following five domains: Uganda, Papua New Guinea, Sudan, Sierra Leone, and Oceania (Solomon Islands, Tonga, and Guam).

c) To assess the extent to which the existing ICE text categories can be replicated among Internet-sourced texts, and to consider what adjustments will need to be made to the original ICE design to accommodate Internet-sourced data.

Duration: Jan 1 - Sept 30 2009
The report on ICELite is available here (MS Word).

Principal Investigator: Professor Gerald Nelson, Department of English, The Chinese University of Hong Kong.

Research Assistants: Ren Hongtao and Dora Huang Zeping

<< Back to ICE