Laboratory Experiments in Information Retrieval

Sample Sizes, Effect Sizes, and Statistical Power

Nonfiction, Computers, Database Management, Information Storage & Retrievel, Science & Nature, Mathematics, Statistics, General Computing
Cover of the book Laboratory Experiments in Information Retrieval by Tetsuya Sakai, Springer Singapore
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Tetsuya Sakai ISBN: 9789811311994
Publisher: Springer Singapore Publication: September 22, 2018
Imprint: Springer Language: English
Author: Tetsuya Sakai
ISBN: 9789811311994
Publisher: Springer Singapore
Publication: September 22, 2018
Imprint: Springer
Language: English

Covering aspects from principles and limitations of statistical significance tests to topic set size design and power analysis, this book guides readers to statistically well-designed experiments. Although classical statistical significance tests are to some extent useful in information retrieval (IR) evaluation, they can harm research unless they are used appropriately with the right sample sizes and statistical power and unless the test results are reported properly. The first half of the book is mainly targeted at undergraduate students, and the second half is suitable for graduate students and researchers who regularly conduct laboratory experiments in IR, natural language processing, recommendations, and related fields.

Chapters 1–5 review parametric significance tests for comparing system means, namely, t-tests and ANOVAs, and show how easily they can be conducted using Microsoft Excel or R. These chapters also discuss a few multiple comparison procedures for researchers who are interested in comparing every system pair, including a randomised version of Tukey's Honestly Significant Difference test. The chapters then deal with known limitations of classical significance testing and provide practical guidelines for reporting research results regarding comparison of means.

Chapters 6 and 7 discuss statistical power. Chapter 6 introduces topic set size design to enable test collection builders to determine an appropriate number of topics to create. Readers can easily use the author’s Excel tools for topic set size design based on the paired and two-sample t-tests, one-way ANOVA, and confidence intervals. Chapter 7 describes power-analysis-based methods for determining an appropriate sample size for a new experiment based on a similar experiment done in the past, detailing how to utilize the author’s R tools for power analysis and how to interpret the results. Case studies from IR for both Excel-based topic set size design and R-based power analysis are also provided.

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Covering aspects from principles and limitations of statistical significance tests to topic set size design and power analysis, this book guides readers to statistically well-designed experiments. Although classical statistical significance tests are to some extent useful in information retrieval (IR) evaluation, they can harm research unless they are used appropriately with the right sample sizes and statistical power and unless the test results are reported properly. The first half of the book is mainly targeted at undergraduate students, and the second half is suitable for graduate students and researchers who regularly conduct laboratory experiments in IR, natural language processing, recommendations, and related fields.

Chapters 1–5 review parametric significance tests for comparing system means, namely, t-tests and ANOVAs, and show how easily they can be conducted using Microsoft Excel or R. These chapters also discuss a few multiple comparison procedures for researchers who are interested in comparing every system pair, including a randomised version of Tukey's Honestly Significant Difference test. The chapters then deal with known limitations of classical significance testing and provide practical guidelines for reporting research results regarding comparison of means.

Chapters 6 and 7 discuss statistical power. Chapter 6 introduces topic set size design to enable test collection builders to determine an appropriate number of topics to create. Readers can easily use the author’s Excel tools for topic set size design based on the paired and two-sample t-tests, one-way ANOVA, and confidence intervals. Chapter 7 describes power-analysis-based methods for determining an appropriate sample size for a new experiment based on a similar experiment done in the past, detailing how to utilize the author’s R tools for power analysis and how to interpret the results. Case studies from IR for both Excel-based topic set size design and R-based power analysis are also provided.

More books from Springer Singapore

Cover of the book Design Aids for Stiffened Composite Shells with Cutouts by Tetsuya Sakai
Cover of the book Domain Specific High-Level Synthesis for Cryptographic Workloads by Tetsuya Sakai
Cover of the book Soft Computing: Theories and Applications by Tetsuya Sakai
Cover of the book Implementing Mobile Language Learning Technologies in Japan by Tetsuya Sakai
Cover of the book Point-of-Interest Recommendation in Location-Based Social Networks by Tetsuya Sakai
Cover of the book Orbital Data Applications for Space Objects by Tetsuya Sakai
Cover of the book Conceptual Evolution of Newtonian and Relativistic Mechanics by Tetsuya Sakai
Cover of the book Economics of Urban Externalities by Tetsuya Sakai
Cover of the book Electromagnetic Actuation and Sensing in Medical Robotics by Tetsuya Sakai
Cover of the book Vulnerability of Watersheds to Climate Change Assessed by Neural Network and Analytical Hierarchy Process by Tetsuya Sakai
Cover of the book Medicinal Plants and Fungi: Recent Advances in Research and Development by Tetsuya Sakai
Cover of the book Bangladeshi Migration to Singapore by Tetsuya Sakai
Cover of the book A Polymer Cochlear Electrode Array: Atraumatic Deep Insertion, Tripolar Stimulation, and Long-Term Reliability by Tetsuya Sakai
Cover of the book Beyond Sociology by Tetsuya Sakai
Cover of the book Computational Intelligence, Communications, and Business Analytics by Tetsuya Sakai
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy