EPSRC Reference: |
GR/M34041/01 |
Title: |
ROPA:MEASURING TEXT REUSE |
Principal Investigator: |
Gaizauskas, Professor R |
Other Investigators: |
|
Researcher Co-Investigators: |
|
Project Partners: |
|
Department: |
Computer Science |
Organisation: |
University of Sheffield |
Scheme: |
ROPA |
Starts: |
01 July 1999 |
Ends: |
28 February 2002 |
Value (£): |
180,872
|
EPSRC Research Topic Classifications: |
Human Communication in ICT |
|
|
EPSRC Industrial Sector Classifications: |
No relevance to Underpinning Sectors |
|
|
Related Grants: |
|
Panel History: |
|
Summary on Grant Application Form |
This work proposes experiments to find a satisfactory algorithm to determine, of pairs of short texts, if one is a rewrite of the other, within some likelihood. The work is part of a general program of content extraction from text, but a quite separate venture, using novel methods on sparse long n-grams of words. It will be tested against newspaper articles, hand coded for derivation as a gold standard , which will be of immediate interest to the Press Association, although the technique, if successful, will be of very wide application.
|
Key Findings |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Potential use in non-academic contexts |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Impacts |
Description |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk |
Summary |
|
Date Materialised |
|
|
Sectors submitted by the Researcher |
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
|
Project URL: |
|
Further Information: |
|
Organisation Website: |
http://www.shef.ac.uk |