By Norbert Fuhr, Mounia Lalmas, Saadia Malik, Gabriella Kazai

Content-oriented XML retrieval has been receiving expanding curiosity because of the common use of eXtensible Markup Language (XML), that is changing into a customary record structure on the internet, in electronic libraries,and publishing. by way of exploiting the enriched resource of syntactic and semantic info that XML markup presents, XML info retrieval (IR) structures objective to enforce a extra targeted retrieval technique and go back record elements, so-called XML parts – rather than entire records – based on a consumer question. This concentrated retrieval technique is of specific bene?t for collections containing lengthy files or records protecting a wide selection of issues (e.g., books, person manuals, felony files, etc.), the place clients’ e?ort to find suitable content material might be decreased by way of directing them to the main proper components of the records. enforcing this, extra centred, retrieval paradigm signifies that an XML IR process wishes not just to ?nd correct info within the XML records, however it additionally has to figure out the precise point of granularity to be again to the consumer. furthermore, the relevance of a retrieved part could be depending on assembly either content material and structural question conditions.

Show description

Read Online or Download Advances in XML Information Retrieval and Evaluation: 4th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2005, Dagstuhl Castle, Germany, November 28-30, 2005. Revised Selected Papers PDF

Similar storage & retrieval books

Internet Resources for Leisure and Tourism

'Internet assets for relaxation and Tourism' is designed to permit scholars, teachers and practitioners in the rest and tourism fields to get the very such a lot out of the area huge internet, assisting them music down and entirely make the most the main valuable assets to be had. This ebook comprises tips on how to define and utilise, between different issues: the newest fiscal records and demographics, information regarding executive firms and their courses, the content material of universities' web content, up to the moment records on customer arrivals and departures, details on coming near near conferences and meetings, and info of contents in periodicals.

Managing Gigabytes: Compressing and Indexing Documents and Images, Second Edition

During this absolutely up-to-date moment variation of the hugely acclaimed coping with Gigabytes, authors Witten, Moffat, and Bell proceed to supply remarkable assurance of cutting-edge innovations for compressing and indexing info. no matter what your box, in case you paintings with huge amounts of knowledge, this e-book is vital reading--an authoritative theoretical source and a realistic consultant to assembly the hardest garage and entry demanding situations.

The Google Model: Managing Continuous Innovation in a Rapidly Changing World

This e-book exhibits how businesses like Google have reinvented the typical perform in administration with the intention to always innovate in quickly altering industries. With the ever-increasing velocity of switch, reinventing current administration rules may well turn into a need and end up the most important within the long term competitiveness of many businesses.

Image databases : search and retrieval of digital imagery

The explosive development of multimedia facts transmission has generated a severe want for effective, high-capacity snapshot databases, in addition to robust se's to retrieve photograph information from them. This e-book brings jointly contributions by means of a world all-star crew of innovators within the box who percentage their insights into all key points of photo database and seek engine building.

Extra info for Advances in XML Information Retrieval and Evaluation: 4th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2005, Dagstuhl Castle, Germany, November 28-30, 2005. Revised Selected Papers

Example text

E1) is 2; (E2) is resp. 12 , 12 , 12 × ( 12 − 0) = 14 , and 14 for lists A, B, C and D. Precisions is 1, 1, 12 and 12 . 2. If the user is satisfied with an element of exhaustivity at least 1 (25 % of the users): 1 Recall 1 (level 1/3). (E1) is 1; (E2) is resp. 9 2 ) = 4 for lists A, B, C and D. Precision is 1, 1, 1, and 4 . Recall 2 (level 2/3). (E1) is 1; (E2) is resp. 9 2 − 2 ) = 4 for lists A, B, C and D. 8 2 , 4 and 4 . 40 B. Piwowarski Recall 3 (level 1). (E1) is 2; (E2) is resp. 9 4 for lists A, B, C and D.

1, 1, 1 × ( 13 − 0) + 12 × (1 − 13 ) = 23 , and 1 for lists A, B, C and D. Precision is 1, 1, 23 , and 1. 7 Recall 2 (level 2/3). (E1) is 2; (E2) is resp. 12 , 12 , 12 × ( 13 − 0) + 13 × (1 − 13 ) = 18 , 7 7 7 and 18 for lists A, B, C and D. Precisions are 1, 1, 9 and 9 . Recall 3 (level 1). (E1) is 3; (E2) is resp. 13 , 13 , 13 × ( 13 − 0) = 19 , and 19 for lists A, B, C and D. Precisions are 1, 1, 13 and 13 . There is a way to combine the two sets of precisions that we do not describe here but give an example instead.

0 OVERLAP: on, off QUANT FUNCTIONS: gen, strict, genLifted DCV: 10, 25, 50 26 G. Kazai and M. 4 SSCAS, SVCAS, VSCAS and VVCAS These tasks are evaluated based on the Thorough task assumption, with “overlap=off”. 1 Correlation Analysis of Results Correlation of XCG Measures We examined correlation among the different XCG measures by calculating the Kendall τ correlation [1] between their resulting respective system rankings. The correlation measure of Kendall’s τ is a nonparametric measure of the agreement between two rankings.

Download PDF sample

Rated 4.28 of 5 – based on 35 votes