EPSRC Reference: |
GR/T19919/01 |
Title: |
Accurate and Comprehensive Lexical Classification for Natural Language Processing Applications (ACLEX) |
Principal Investigator: |
Briscoe, Professor EJ |
Other Investigators: |
|
Researcher Co-Investigators: |
|
Project Partners: |
|
Department: |
Computer Science and Technology |
Organisation: |
University of Cambridge |
Scheme: |
Standard Research (Pre-FEC) |
Starts: |
01 August 2005 |
Ends: |
31 July 2008 |
Value (£): |
206,957
|
EPSRC Research Topic Classifications: |
Comput./Corpus Linguistics |
|
|
EPSRC Industrial Sector Classifications: |
|
Related Grants: |
|
Panel History: |
|
Summary on Grant Application Form |
Lexical classes which capture useful generalizations over a range of (cross-)linguistic properties can be used to support a number of important computational linguistic tasks and applications (e.g. parsing, anaphora resolution, information extraction, open-domain question-answering, machine translation). However, to date their use in NLP has been limited because no technology for accurate and comprehensive (i.e. automatic) lexical classification is available. We will build on the preliminary research on automatic lexical classification, and develop a system capable of acquiring (i) large-scale cross-domain and (ii) domain-specific classifications from corpus data. We will evaluate and demonstrate the capabilities of this system directly and in the context of a number of NLP tasks, such as parsing and biomedical text mining. We will use the final version of the system to acquire a substantial, relatively domain-independent lexical database from standard corpora and the web which we will enrich with additional relevant information from corpora and public-domain manual classifications. The resulting resource, which will enable large-scale exploitation of lexical classes, will be distributed freely via the internet, along with the evaluation tools and the software which can be used to tune the frequency information stored in the database to particular domains/tasks.
|
Key Findings |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Potential use in non-academic contexts |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Impacts |
Description |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk |
Summary |
|
Date Materialised |
|
|
Sectors submitted by the Researcher |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Project URL: |
http://www.cl.cam.ac.uk/~alk23/aclex.html |
Further Information: |
|
Organisation Website: |
http://www.cam.ac.uk |