Statistical Significance Testing in Information Retrieval: Theory and Practice

Author:   Ben Carterette
Publisher:   Morgan & Claypool Publishers
ISBN:  

9781627055277


Pages:   120
Publication Date:   01 August 2021
Format:   Paperback
Availability:   Temporarily unavailable   Availability explained
The supplier advises that this item is temporarily unavailable. It will be ordered for you and placed on backorder. Once it does come back in stock, we will ship it out to you.

Our Price $104.79 Quantity:  
Add to Cart

Share |

Statistical Significance Testing in Information Retrieval: Theory and Practice


Add your own review!

Overview

The past 20 years have seen a great improvement in the rigor of information retrieval experimentation, due primarily to two factors: high-quality, public, portable test collections such as those produced by TREC (the Text REtrieval Conference), and the increased practice of statistical hypothesis testing to determine whether measured improvements can be ascribed to something other than random chance. Together these create a very useful standard for reviewers, program commit- tees, and journal editors; work in information retrieval (IR) increasingly cannot be published unless it has been evaluated using a well-constructed test collection and shown to produce a statistically significant improvement over a good baseline. But, as the saying goes, any tool sharp enough to be useful is also sharp enough to be dangerous. Statistical tests of significance are widely misunderstood. Most researchers and developers treat them as a black box : evaluation results go in and a p-value comes out. But because significance is such an important factor in determining what research directions to explore and what is published, using p-values obtained without thought can have consequences for everyone doing research in IR. Ioannidis has argued that the main consequence in the biomedical sciences is that most published research findings are false; could that be the case in IR as well? Our goal with this work is to help researchers and developers gain a better understanding of how tests work and how they should be interpreted so that they can both use them more effectively in their day-to-day work as well as better understand how to interpret them when reading the work of others. We will do this primarily with three tools: (a) mathematical analysis; (b) simulation; and (c) experimentation with TREC data - because of the availability of TREC data, IR as a field is uniquely positioned to be able to evaluate significance testing in the presence of a wide variety of failed experiments.

Full Product Details

Author:   Ben Carterette
Publisher:   Morgan & Claypool Publishers
Imprint:   Morgan and Claypool Life Sciences
Weight:   0.525kg
ISBN:  

9781627055277


ISBN 10:   1627055274
Pages:   120
Publication Date:   01 August 2021
Audience:   General/trade ,  General
Format:   Paperback
Publisher's Status:   Active
Availability:   Temporarily unavailable   Availability explained
The supplier advises that this item is temporarily unavailable. It will be ordered for you and placed on backorder. Once it does come back in stock, we will ship it out to you.

Table of Contents

Reviews

Author Information

University of Delaware

Tab Content 6

Author Website:  

Customer Reviews

Recent Reviews

No review item found!

Add your own review!

Countries Available

All regions
Latest Reading Guide

MRG2025CC

 

Shopping Cart
Your cart is empty
Shopping cart
Mailing List