Tjin-Kam-Jet, K.T.T.E. and Trieschnigg, R.B. and Hiemstra, D.
Deep web search: an overview and roadmap.
Technical Report TR-CTIT-12-32,
Centre for Telematics and Information Technology, University of Twente, Enschede.
Full text available as:
We review the state-of-the-art in deep web search and propose a novel classification scheme to better compare deep web search systems.
The current binary classification (surfacing versus virtual integration) hides a number of implicit decisions that must be made by a developer. We make these decisions explicit by distinguishing 7 system aspects that describe a system in terms of its functionality (what it can, and what it cannot do) and in terms of its solution to a specific problem.
We then motivate the need for a search system which has a single-field free-text query interface that supports real-time structured search over multiple sources.
To this end, we discuss two possible federated architectures and state the scientific challenges. Finally, we present the findings of our ongoing project and briefly outline related work to free-text interfaces over structured data.
|Item Type:||Internal Report (Technical Report)|
|Research Group:||EWI-DB: Databases, EWI-HMI: Human Media Interaction|
|Research Program:||CTIT-NICE: Natural Interaction in Computer-mediated Environments|
|Research Project:||DIRKA: Distributed Information Retrieval by means of Keyword Auctions, EfFoRT: Effective Focused Retrieval Techniques|
|Uncontrolled Keywords:||review, survey, deep web, deep web search, interfaces, OneBox, natural language, free text, surfacing|
|Deposited On:||20 December 2012|
Export this item as:
To correct this item please ask your editor
Repository Staff Only: edit this item