Books on Web Information Retrieval
Mining the Web: Analysis of Hypertext and Semi Structured Data. S. Chakrabarti. Morgan Kaufmann, 2002. The best introduction for web-centric IR.
Google's PageRank and Beyond: The Science of Search Engine Rankings. Amy N. Langville, Carl D. Meyer. Princeton University Press, 2006. More focused on the algorithms of PageRank, but also covers general web IR.
Modeling the Internet and the Web: Probabilistic Methods and Algorithms. P. Baldi, P. Frasconi, P. Smyth. Wiley, 2003. A bit terse. Recommended for those who have a good foundation in probability theory but are new to IR.
Online Books - Browsable
Introduction to Information Retrieval. C.D. Manning, P. Raghavan, H. Schütze. Cambridge UP, 2007. Draft. Focuses on algorithms and mathematical foundations without neglecting practical issues in building search systems. Equal coverage of classical IR and newer topics like XML, machine learning techniques and web search engines.
Finding Out About. R. Belew's book (without figures and equations), see above.
Information Retrieval. C. J. van Rijsbergen. Butterworths, 1979. The classic. Almost 40 years old, but still worth reading.
Information Retrieval. T. van der Weide. 2004. Introduction to IR and hypertext.
Online Books - PDF
Introduction to Information Retrieval. C.D. Manning, P. Raghavan, H. Schütze. Cambridge UP, 2007.
Information Retrieval in Practice. B. Croft, D. Metzler, T. Strohman. Pearson Education, 2009. (two chapters)
Information Retrieval. C. J. van Rijsbergen. Butterworths, 1979.
Information Retrieval Interaction. P. Ingwersen. Taylor Graham, 1992. Focuses on user interaction in IR.
Information Retrieval: A Survey. Ed Greengrass. 2000. Good survey of "classical" IR, but little or no coverage of recent work (e.g., language models, PageRank, SVMs).
Various tutorials at Mi Islita
Research Centers
CMU (LTI)
Dublin CU
Geneva (Viper)
Glasgow
Helsinki Institute for Information Technology
IBM
Illinois Institute of Technology
Information Retrieval Facility (IRF)
Microsoft Research
NIST
Peking
Pittsburgh
Queen Mary
Sheffield
UIUC
UMASS
Courses
Berkeley (SIMS)
CMU
Cornell
DePaul
IIT
Johns Hopkins I
Johns Hopkins II
Maryland
MPI
Otago
Princeton
Stanford
Stuttgart
Texas
UMASS
Problem Sets / Assignments
Bilkent
DePaul
Georgetown
Minas Gerais
North Texas
Stuttgart
Tennessee
Web Information Retrieval
webir.org
Search Engine Watch
Users' Guide to Web Searching
PageRank
Subareas, Applications, Methods
Information Retrieval & Extraction
Information Retrieval & Machine Learning
Text Mining & Web Mining
INEX: XML retrieval
Geographic Information Retrieval
Music Information Retrieval
CLIR & Multilingual Information Retrieval
Cross-Language Information Retrieval (CLIR) Resources
N-Grams in Information Retrieval
Agent-based Information Retrieval
Audio Information Retrieval
Adversarial Information Retrieval
Conferences
TREC
Cross Language Evaluation Forum (CLEF)
SIGIR 2007 (last), SIGIR 2008 (next)
CIKM 2007, CIKM 2008
WWW 2008, WWW 2009
JCDL 2008, JCDL 2009
RIAO 2004, RIAO 2007
ECIR 2008, ECIR 2009
AIRS 2006, AIRS 2008
SPIRE 2007, SPIRE 2008
Norbert Fuhr's IR conference calendar
Journals
ACM Transactions on Information Systems (TOIS): dblp home
Information Processing and Management (IP&M): dblp home
Information Retrieval: dblp home
International Journal on Digital Libraries: dblp home
Journal of the American Society for Information Science and Technology (JASIST): dblp home
SIGIR Forum: dblp home
Journal of Documentation
D-Lib Magazine
Data & Knowledge Engineering: dblp home
Information Processing Letters: dblp home
Information Research
Information Systems: dblp home
Journal of Intelligent Information Systems: dblp home
Knowledge and Information Systems: dblp home
Foundations and Trends in Information Retrieval: home
Popular Articles
Wikipedia: Information Retrieval
A. Singhal: Modern Information Retrieval: A Brief Overview
S.E. Robertson, K. Sparck Jones: Simple, proven approaches to text retrieval
Bruce Croft: What Do People Want From IR
Information Retrieval on the World Wide Web
Michael Lesk: The Seven Ages of Information Retrieval
Software
C. Middleton, R. Baeza-Yates: A Comparison of Open Source Search Engines (contains an up-to-date list of available search engine software)
Doug Oard's list of available text retrieval systems
Avi Rappoport: open source search engines
MySQL full text search
Text to Matrix Generator, a MATLAB toolbox for indexing, retrieval and other text processing tasks
Collections
U. of Glasgow list of available text retrieval collections
NLP/IR corpus list at NUS
NLP/IR corpus list at Edinburgh
Internet archive (limited availability)
Linguistic Data Consortium
Professional Organizations
ACM SIGIR
BCS IRSG
Other Collections of Information Retrieval Links
ACM SIGIR
David Karger
Other Resources
Glossary (Modern Information Retrieval)
Information retrieval research links @ Search Tools
BUBL: Information Retrieval Links
LSU: Information Retrieval Systems
Open Directory: Information Retrieval Links
UBC: Indexing Resources
IR & Neural Networks, Symbolic Learning, Genetic Algorithms
A stop list (a list of stop words)
Chris Manning's NLP resources
Weiguo Patrick Fan's text mining links