EEMCS EPrints Service
Hiemstra, D. and Hauff, C.
(2010)
MIREX: MapReduce Information Retrieval Experiments.
Technical Report TR-CTIT-10-15,
Centre for Telematics and Information Technology, University of Twente, Enschede.
ISSN 1381-3625
Abstract: We propose to use MapReduce to quickly test new retrieval approaches on a cluster of machines by sequentially scanning all documents. We present a small case study in which we use a cluster of 15 low-cost machines to search a web crawl of 0.5 billion pages, showing that sequential scanning is a viable approach to running large-scale information retrieval experiments with little effort. The code is available to other researchers at: http://sourceforge.net/projects/mirex/
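
The sequential-scanning idea in the abstract can be illustrated with a small in-process sketch. This is not the MIREX code: the function names (`map_document`, `reduce_query`, `run_experiment`), the term-count scoring, and the toy data are all assumptions made for illustration, and a real experiment would run the two steps on a MapReduce cluster rather than in a single Python process.

```python
import heapq
from collections import Counter, defaultdict


def map_document(doc_id, text, queries):
    """Map step (hypothetical): scan one document once and score it
    against every query, emitting (query_id, (score, doc_id)) pairs."""
    counts = Counter(text.lower().split())
    for query_id, terms in queries.items():
        # Assumed scoring: sum of query-term frequencies in the document.
        score = sum(counts[term] for term in terms)
        if score > 0:
            yield query_id, (score, doc_id)


def reduce_query(query_id, scored_docs, k=10):
    """Reduce step (hypothetical): keep the k best documents per query."""
    return query_id, heapq.nlargest(k, scored_docs)


def run_experiment(documents, queries, k=10):
    """Simulate the shuffle phase in memory: group map output by query,
    then reduce each group to a ranked list."""
    grouped = defaultdict(list)
    for doc_id, text in documents.items():
        for query_id, pair in map_document(doc_id, text, queries):
            grouped[query_id].append(pair)
    return dict(reduce_query(q, pairs, k) for q, pairs in grouped.items())


if __name__ == "__main__":
    documents = {
        "d1": "mapreduce cluster scan",
        "d2": "cluster cluster web",
        "d3": "nothing here",
    }
    queries = {"q1": ["cluster"], "q2": ["mapreduce", "scan"]}
    print(run_experiment(documents, queries, k=2))
```

In this sketch no index is built: every document is read once and scored against all queries in the same pass, so trying a different retrieval approach only means changing the scoring line in `map_document`.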
