 |
 |
Mihajlovic, V. and Hiemstra, D. and Blok, H.E. and Apers, P.M.G.
(2006)
Exploiting Query Structure and Document Structure to Improve Document Retrieval Effectiveness.
Technical Report TR-CTIT-06-57,
Centre for Telematics and Information Technology University of Twente, Enschede.
ISSN 1381-3625
Full text available as:  AbstractIn this paper we present a systematic analysis of document
retrieval using unstructured and structured queries within
the score region algebra (SRA) structured retrieval framework. The behavior of di®erent retrieval models, namely
Boolean, tf.idf, GPX, language models, and Okapi, is tested
using the transparent SRA framework in our three-level structured retrieval system called TIJAH. The retrieval models are implemented along four elementary retrieval aspects: element and term selection, element score computation, score combination, and score propagation.
The analysis is performed on a numerous experiments
evaluated on TREC and CLEF collections, using manually
generated unstructured and structured queries. Unstructured queries range from the short title queries to long title
+ description + narrative queries. For generating structured
queries we exploit the knowledge of the document structure
and the content used to semantically describe or classify
documents. We show that such structured information can
be utilized in retrieval engines to give more precise answers to user queries then when using unstructured queries. | Item Type: | Internal Report (Technical Report) |
|---|
| Research Group: | EWI-DB: Databases |
|---|
| Research Program: | CTIT-NICE: Natural Interaction in Computer-mediated Environments |
|---|
| Research Project: | CIRQUID: Complex Information Retrieval Queries in a DBMS |
|---|
| ID Code: | 6918 |
|---|
| Deposited On: | 23 October 2006 |
|---|
| Refereed: | No |
|---|
| More Information: | statisticsmetis |
|---|
Export this item as: To correct this item please ask your editor Repository Staff Only: edit this item
|
 |
 |