EEMCS

Home > Publications
Home University of Twente
Education
Research
Prospective Students
Jobs
Publications
Intranet (internal)
 
 Nederlands
 Contact
 Search
 Organisation

EEMCS EPrints Service


17797 MIREX: MapReduce Information Retrieval Experiments
Home Policy Brochure Browse Search User Area Contact Help

Hiemstra, D. and Hauff, C. (2010) MIREX: MapReduce Information Retrieval Experiments. Technical Report TR-CTIT-10-15, Centre for Telematics and Information Technology University of Twente, Enschede. ISSN 1381-3625

Full text available as:

PDF

113 Kb
Open Access


Exported to Metis

Abstract

We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use a cluster of 15 low cost machines to search a web crawl of 0.5 billion pages showing that sequential scanning is a viable approach to running large-scale information retrieval experiments with little effort. The code is available to other researchers at: http://sourceforge.net/projects/mirex/

Item Type:Internal Report (Technical Report)
Research Group:EWI-DB: Databases, EWI-HMI: Human Media Interaction
Research Program:CTIT-NICE: Natural Interaction in Computer-mediated Environments
Research Project:DIRKA: Distributed Information Retrieval by means of Keyword Auctions
ID Code:17797
Deposited On:23 April 2010
More Information:statisticsmetis

Export this item as:

To correct this item please ask your editor

Repository Staff Only: edit this item