EPSRC Reference: |
GR/L39704/01 |
Title: |
PARALLEL WORDCLASS TAGGING OF THE PAROLE 2 CORPUS AND THE BRITISH NATIONAL CORPUS |
Principal Investigator: |
Leech, Professor G |
Other Investigators: |
|
Researcher Co-Investigators: |
|
Project Partners: |
|
Department: |
Linguistics and English Language |
Organisation: |
Lancaster University |
Scheme: |
Standard Research (Pre-FEC) |
Starts: |
01 July 1997 |
Ends: |
30 June 1998 |
Value (£): |
43,265
|
EPSRC Research Topic Classifications: |
Human Communication in ICT |
|
|
EPSRC Industrial Sector Classifications: |
No relevance to Underpinning Sectors |
|
|
Related Grants: |
|
Panel History: |
|
Summary on Grant Application Form |
To undertake a parallel word-class tagging of (a) 250000 words of the PAROLE 2 CORPUS, and (b) 1 million words of the British National Corpus (BNC), using the CLAWS4 tagger developed at Lancaster. At the same time, subcorpora (a) and (b) will also be tagged at Birmingham by the LUCID tagger developed there. The tagged versions will both make use of the same PAROLE tagset. A comparison of the two tagged versions will then be carried out, systematically across tags and lexemes multitagger at Leeds (Atwell) will be undertaken if and as opportunities arise. The three main deliverables of the project will be (I) the tagged Parole 2 subcorpus, (II) the tagged BNC subcorpus, and (III) the optmisation of the output of the combination of taggers.
|
Key Findings |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Potential use in non-academic contexts |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Impacts |
Description |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk |
Summary |
|
Date Materialised |
|
|
Sectors submitted by the Researcher |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Project URL: |
|
Further Information: |
|
Organisation Website: |
http://www.lancs.ac.uk |