Home > Publications
Home University of Twente
Prospective Students
Intranet (internal)

EEMCS EPrints Service

22746 Deep web search: an overview and roadmap
Home Policy Brochure Browse Search User Area Contact Help

Tjin-Kam-Jet, K.T.T.E. and Trieschnigg, R.B. and Hiemstra, D. (2011) Deep web search: an overview and roadmap. Technical Report TR-CTIT-12-32, Centre for Telematics and Information Technology, University of Twente, Enschede. ISSN 1381-3625

Full text available as:


1321 Kb
Open Access

Exported to Metis


We review the state-of-the-art in deep web search and propose a novel classification scheme to better compare deep web search systems.
The current binary classification (surfacing versus virtual integration) hides a number of implicit decisions that must be made by a developer. We make these decisions explicit by distinguishing 7 system aspects that describe a system in terms of its functionality (what it can, and what it cannot do) and in terms of its solution to a specific problem.

We then motivate the need for a search system which has a single-field free-text query interface that supports real-time structured search over multiple sources.
To this end, we discuss two possible federated architectures and state the scientific challenges. Finally, we present the findings of our ongoing project and briefly outline related work to free-text interfaces over structured data.

Item Type:Internal Report (Technical Report)
Research Group:EWI-DB: Databases, EWI-HMI: Human Media Interaction
Research Program:CTIT-NICE: Natural Interaction in Computer-mediated Environments
Research Project:DIRKA: Distributed Information Retrieval by means of Keyword Auctions, EfFoRT: Effective Focused Retrieval Techniques
Uncontrolled Keywords:review, survey, deep web, deep web search, interfaces, OneBox, natural language, free text, surfacing
ID Code:22746
Deposited On:20 December 2012
More Information:statisticsmetis

Export this item as:

To correct this item please ask your editor

Repository Staff Only: edit this item