EEMCS EPrints Service


Database Optimization Aspects for Information Retrieval

Blok, H.E. (2002) Database Optimization Aspects for Information Retrieval. PhD thesis, University of Twente. CTIT Ph.D.-thesis series No. 02-41. ISBN 903651732X.

Full text available as: PDF (872 Kb)

Abstract

There is a growing need for systems that can process queries combining structured data and text. One way to provide such functionality is to integrate information retrieval (IR) techniques into a database management system (DBMS). However, IR and database research have been separate fields for decades, resulting in different, even conflicting, approaches to data management.

Each DBMS has a component called a "query optimizer", which plays a crucial role in the efficiency and flexibility of the system. For successful integration, therefore, both the IR techniques and data structures and the DBMS query optimizer should be adapted so that they can cooperate.

The author concentrates on top-N queries, a common class of IR queries. An IR top-N query asks for the N best documents given a set of keywords. The author proposes processing the data in batches as a compromise between IR and DBMS query processing. Experiments with this technique show that porting IR optimization techniques is (still) not a promising option because of the additional administrative overhead. Two new mathematical models are introduced to eliminate this overhead: a model that predicts selectivity, which is a crucial factor in the execution costs, and a model that predicts the quality of the top-N.
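For readers unfamiliar with the term, a top-N query can be illustrated with a minimal sketch. The function name `top_n`, the toy document collection, and the scoring rule (raw keyword frequency) are assumptions made purely for illustration; they are not the ranking model, the batch technique, or the cost models of the thesis.

```python
import heapq
from collections import Counter


def top_n(documents, keywords, n):
    """Return the n best (doc_id, score) pairs for the given keywords.

    Scoring is plain keyword frequency: an illustrative assumption,
    not the ranking function used in the thesis.
    """
    wanted = [k.lower() for k in keywords]
    scored = []
    for doc_id, text in documents.items():
        counts = Counter(text.lower().split())
        score = sum(counts[k] for k in wanted)
        if score > 0:  # skip documents that match no keyword
            scored.append((doc_id, score))
    # Keep only the n highest-scoring documents.
    return heapq.nlargest(n, scored, key=lambda pair: pair[1])


# Hypothetical toy collection.
docs = {
    "d1": "database query optimizer",
    "d2": "information retrieval query query",
    "d3": "cooking recipes",
}
result = top_n(docs, ["query", "retrieval"], 2)
print(result)  # [('d2', 3), ('d1', 1)]
```

The sketch scores every document before selecting the best N. Avoiding that full scan is the kind of optimization the abstract is concerned with.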

Item Type: PhD Thesis
Supervisors: Apers, P.M.G.
Assistant Supervisors: Blanken, H.M.
Research Group: EWI-DB: Databases
Research Program: CTIT-NICE: Natural Interaction in Computer-mediated Environments
Research Project: AMIS: Advanced Multimedia Indexing and Searching
Additional Information: Imported from EWI/DB PMS [db-utwente:phdt:0000000009]
ID Code: 6341
Deposited On: 01 November 2006