Details of Grant

EPSRC Reference:

EP/C536762/1

Title:

CRONOPATH: Timeline and Named Entity Extraction for Hyperlink Corpora

Principal Investigator:

Wilks, Professor Y

Other Investigators:

Gaizauskas, Professor R

Researcher Co-Investigators:

Mr A Dalli

Project Partners:

Department:

Computer Science

Organisation:

University of Sheffield

Scheme:

Standard Research (Pre-FEC)

Starts:

01 July 2005

Ends:

31 December 2007

Value (£):

294,632

EPSRC Research Topic Classifications:

Artificial Intelligence

Information & Knowledge Mgmt

EPSRC Industrial Sector Classifications:

Information Technologies

Related Grants:

Panel History:

Summary on Grant Application Form

Have you ever had to go through hundreds of results when using a search engine to find what you are looking for? And then perhaps found that many results are outdated or just someone's latest blog entry? Imaging trying to collect from the web information on a famous person and being able to tie all the bits of information, together while discarding all the bits about someone else with the same name? Imaging, too, being able to find the latest information, concert locations and dates of your favourite music without being trapped in old information still up on the web after years?This research project, Cronopath, will lead to a new type of search engine that solves these common problems. Cronopath provides timely results in a friendly but informative manner, piecing together information about people, organisations and events automatically. Cronopath looks at different websites, spotting information about items that matter most. As it gathers information, Cronopath tries to determine the time and date when a particular item was created. This time and date information lets it piece information together in a chronological order, making it easy to identify the latest news and information about some subject or the latest gossip about some superstar.Much of this rests on Cronopath's core ability to distinguish between different people that have exactly the same name or very similar names - such as George Bush and George W. Bush. Cronopath also knows that people and organisations may change their name over time - if you are looking at Madonna's life before she launched her career, Cronopath will know that she was known as Louise Veronica Ciccone, and it will also know that she now wants to be called Esther.Currently, search engines work by working out which results are relevant by looking at the popularity of each web page that contains the result. The most popular web pages will have lots of links pointing to them, including some from other popular pages. However, the most popular kid on the block may not always be the most knowledgeable. Current search engines also lump everything on a single page together without trying to identify important pieces of meta information in the page itself. Thus, a site on Semiconductor Physics that mentions Britney Spears will get a very high placing if you search for Britney Spears. Cronopath should have the common sense to understand that you will probably be looking for songs or photos of Britney rather than Physics (unlike all other popular search engines!). Cronopath will thus let people around the world find more relevant, more up-to-date and more informative results quickly and efficiently.

Key Findings

This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk

Potential use in non-academic contexts

This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk

Impacts

Description	This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Summary
Date Materialised

Sectors submitted by the Researcher

This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk

Project URL:

Further Information:

Organisation Website:

http://www.shef.ac.uk