Introduction to Information Retrieval

This is the companion website for the following book.

Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press. 2008.

The book aims to provide a modern approach to information retrieval from a computer science perspective. It is based on a course we have been teaching in various forms at Stanford University and at the University of Stuttgart.

We'd be pleased to get feedback about how this book works as a textbook: what is missing, what is covered in too much detail, or what is simply wrong. Please send any feedback or comments to:

yahoogroups: informationretrieval

Print edition

Cambridge University Press is currently copy-editing the manuscript. The book is scheduled to appear in July 2008.

HTML edition

You can browse the HTML edition of the book here.

PDF edition

A preliminary version of the book is available for download. Last update: May 11, 2008.

PDF for online viewing (with nice hyperlink features)
PDF for printing (best for black and white printers)

These PDFs will remain online after the publication of the print edition.

Links to individual chapters can be found in the table of contents at the bottom of this page.

Slides

Slides that are somewhat out of date are available. We are in the process of creating an updated set of slides here.

Solutions to exercises

Solutions to the exercises in the book are available from Cambridge University Press.

Information retrieval resources

A list of information retrieval resources is also available.

Introduction to Information Retrieval: Table of Contents

no. chapter                                                  file  slides  resources
    Front matter (incl. table of notations)                  pdf
01  Boolean retrieval                                        pdf   slides
02  The term vocabulary & postings lists                     pdf   slides
03  Dictionaries and tolerant retrieval                      pdf   slides
04  Index construction                                       pdf   slides
05  Index compression                                        pdf   slides
06  Scoring, term weighting & the vector space model         pdf   slides
07  Computing scores in a complete search system             pdf   slides
08  Evaluation in information retrieval                      pdf   slides
09  Relevance feedback & query expansion                     pdf   slides
10  XML retrieval                                            pdf   slides
11  Probabilistic information retrieval                      pdf   slides
12  Language models for information retrieval                pdf   slides
13  Text classification & Naive Bayes                        pdf   slides
14  Vector space classification                              pdf   slides
15  Support vector machines & machine learning on documents  pdf   slides
16  Flat clustering                                          pdf   slides  html
17  Hierarchical clustering                                  pdf   slides
18  Matrix decompositions & latent semantic indexing         pdf   slides
19  Web search basics I                                      pdf   slides
    Web search basics II                                           slides
20  Web crawling and indexes                                 pdf   slides
21  Link analysis                                            pdf   slides
    Bibliography & Index                                     pdf
    bibtex file                                              bib