EEMCS

Home > Publications
Home University of Twente
Education
Research
Prospective Students
Jobs
Publications
Intranet (internal)
 
 Nederlands
 Contact
 Search
 Organisation

EEMCS EPrints Service


20392 Named Entity Extraction and Disambiguation: The Reinforcement Effect.
Home Policy Brochure Browse Search User Area Contact Help

Habib, M.B. and van Keulen, M. (2011) Named Entity Extraction and Disambiguation: The Reinforcement Effect. In: Proceedings of the 5th International Workshop on Management of Uncertain Data, MUD 2011, 29 Aug 2011, Seatle, USA. pp. 9-16. CTIT Workshop Proceedings Series WP11-02. Centre for Telematics and Information Technology University of Twente. ISSN 0929-0672

Full text available as:

PDF

653 Kb

Official URL: http://www.ctit.utwente.nl/library/proceedings/wp1102.pdf

Exported to Metis

Abstract

Named entity extraction and disambiguation have received much attention in recent years. Typical fields addressing these topics are information retrieval, natural language processing, and semantic web. Although these topics are highly dependent, almost no existing works examine this dependency. It is the aim of this paper to examine the dependency and show how one affects the other, and vice versa. We conducted experiments with a set of descriptions of holiday homes with the aim to extract and disambiguate toponyms as a representative example of named entities. We experimented with three approaches for disambiguation with the purpose to infer the country of the holiday home. We examined how the effectiveness of extraction influences the effectiveness of disambiguation, and reciprocally, how filtering out ambiguous names (an activity that depends on the disambiguation process) improves the effectiveness of extraction. Since this, in turn, may improve the effectiveness of disambiguation again, it shows that extraction and disambiguation may reinforce each other.

Item Type:Conference or Workshop Paper (Full Paper, Talk)
Research Group:EWI-DB: Databases
Research Program:CTIT-NICE: Natural Interaction in Computer-mediated Environments
Research Project:Neogeography: The Challenge of Channelling Large and Ill-Behaved Data Streams
ID Code:20392
Status:Published
Deposited On:10 August 2011
Refereed:Yes
International:Yes
More Information:statisticsmetis

Export this item as:

To correct this item please ask your editor

Repository Staff Only: edit this item