Stephen Robertson
Computer scientist well known for his work on information retrieval
Stephen Robertson 's AcademicInfluence.com Rankings

Download Badge
Computer Science
Stephen Robertson 's Degrees
- PhD Computer Science Stanford University
- Masters Computer Science Stanford University
- Bachelors Computer Science University of California, Berkeley
Similar Degrees You Can Earn
Why Is Stephen Robertson Influential?
(Suggest an Edit or Addition)According to Wikipedia, Stephen Robertson is a British computer scientist. He is known for his work on probabilistic information retrieval together with Karen Spärck Jones and the Okapi BM25 weighting model. Okapi BM25 is very successful in experimental search evaluations and found its way in many information retrieval systems and products, including open source search systems like Lucene, Lemur, Xapian, and Terrier. BM25 is used as one of the most important signals in large web search engines, certainly in Microsoft Bing, and probably in other web search engines too. BM25 is also used in various other Microsoft products such as Microsoft SharePoint and SQL Server.
Stephen Robertson 's Published Works
Published Works
- Relevance weighting of search terms (1976) (2440)
- Okapi at TREC (1992) (2138)
- Okapi at TREC-3 (1994) (1963)
- The Probabilistic Relevance Framework: BM25 and Beyond (2009) (1940)
- Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval (1994) (1546)
- Understanding inverse document frequency: on theoretical arguments for IDF (2004) (1362)
- The probability ranking principle in IR (1997) (1099)
- A probabilistic model of information retrieval: development and comparative experiments - Part 2 (2000) (752)
- A probabilistic model of information retrieval: development and comparative experiments - Part 1 (2000) (749)
- Simple BM25 extension to multiple weighted fields (2004) (725)
- Selecting good expansion terms for pseudo-relevance feedback (2008) (449)
- Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive (1998) (447)
- Okapi/Keenbow at TREC-8 (1999) (397)
- Okapi at TREC-4 (1995) (386)
- Probabilistic models of indexing and searching (1980) (372)
- On Term Selection for Query Expansion (1991) (371)
- SoftRank: optimizing non-smooth rank metrics (2008) (333)
- Simple, proven approaches to text retrieval (1994) (320)
- Effective site finding using link anchor information (2001) (313)
- Experimentation as a way of life: Okapi at TREC (2000) (285)
- A new rank correlation coefficient for information retrieval (2008) (267)
- The TREC 2002 Filtering Track Report (2002) (256)
- Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002 (2003) (243)
- The TREC-8 Filtering Track Final Report (1999) (229)
- Overview of the Okapi projects (1997) (206)
- On the Evaluation of IR Systems (1992) (202)
- Microsoft Cambridge at TREC 13: Web and Hard Tracks (2004) (201)
- On relevance weights with little relevance information (1997) (187)
- Relevance weighting for query independent evidence (2005) (173)
- Parsimonious language models for information retrieval (2004) (152)
- A new interpretation of average precision (2008) (148)
- A probabilistic model of information and retrieval: development and status (1998) (136)
- Okapi at TREC{7: automatic ad hoc, ltering, VLC and interactive track (1999) (131)
- Optimisation methods for ranking functions with multiple parameters (2006) (128)
- On GMAP: and other transformations (2006) (124)
- On the history of evaluation in IR (2008) (122)
- Large Test Collection Experiments on an Operational, Interactive System: Okapi at TREC (1995) (120)
- The methodology of information retrieval experiment (1981) (111)
- Extending average precision to graded relevance judgments (2010) (109)
- Okapi at TREC-5 (1996) (108)
- Information science and the phenomenon of information (1976) (105)
- Comparing citation contexts for information retrieval (2008) (104)
- Okapi at TREC-6 Automatic ad hoc, VLC, routing, filtering and QSDR (1997) (102)
- THEORIES AND MODELS IN INFORMATION RETRIEVAL (1977) (95)
- Building a filtering test collection for TREC 2002 (2003) (94)
- Expected browsing utility for web search evaluation (2010) (91)
- INEX 2007 Evaluation Measures (2008) (90)
- Interactive Thesaurus Navigation: Intelligence Rules OK? (1995) (88)
- Rethinking the ESP game (2009) (86)
- Probabilistic group recommendation via information matching (2013) (84)
- Where to stop reading a ranked list?: threshold optimization using truncated score distributions (2009) (84)
- On Relevance weight estimation and Query Expansion (1986) (80)
- Field-Weighted XML Retrieval Based on BM25 (2005) (79)
- On Collection Size and Retrieval Effectiveness (2004) (78)
- Hits hits TREC: exploring IR evaluation results with network analysis (2007) (77)
- Applying Machine Learning to Text Segmentation for Information Retrieval (2003) (73)
- Advances in Information Retrieval Theory, Second International Conference on the Theory of Information Retrieval, ICTIR 2009, Cambridge, UK, September 10-12, 2009, Proceedings (2009) (73)
- Parallel search using partitioned inverted files (2000) (70)
- Evaluating Interactive Systems in TREC (1996) (66)
- Query Expansion with Long-Span Collocates (2003) (65)
- A few good topics: Experiments in topic set reduction for retrieval evaluation (2009) (64)
- THE PARAMETRIC DESCRIPTION OF RETRIEVAL TESTS: PART I: THE BASIC PARAMETERS (1969) (64)
- Probabilistic relevance ranking for collaborative filtering (2008) (63)
- Ambiguous requests: implications for retrieval tests, systems and theories (2007) (61)
- Microsoft Cambridge at TREC 2002: Filtering Track (2002) (59)
- Threshold setting in adaptive filtering (2000) (58)
- Microsoft Cambridge at TREC-9: Filtering Track (2000) (55)
- Using Terms from Citations for IR: Some First Results (2008) (53)
- How to Find Better Index Terms Through Citations (2006) (51)
- On per-topic variance in IR evaluation (2012) (49)
- On rank-based effectiveness measures and optimization (2007) (49)
- Relevance Feedback Track Overview: TREC 2008 (2008) (48)
- Simple Evaluation Metrics for Diversified Search Results (2010) (47)
- The probabilistic character of relevance (1977) (47)
- Microsoft Cambridge at TREC 14: Enterprise Track (2005) (46)
- Modelling A User Population for Designing Information Retrieval Metrics (2008) (45)
- Okapi at TREC-2 (1993) (45)
- INEX 2006 Evaluation Measures (2006) (44)
- On Event Spaces and Probabilistic Models in Information Retrieval (2005) (43)
- Ranking in Principle (1978) (43)
- On Score Distributions and Relevance (2007) (42)
- Modeling score distributions in information retrieval (2011) (41)
- Karen Spärck Jones (2008) (37)
- Creating a Test Collection for Citation-based IR Experiments (2006) (36)
- Weighting, ranking and relevance feedback in a front—end system (1986) (36)
- The TREC-9 filtering track (1999) (33)
- Threshold Setting and Performance Optimization in Adaptive Filtering (2002) (33)
- On the choice of effectiveness measures for learning to rank (2010) (32)
- The Unified Probabilistic Model for IR (1982) (32)
- Probabilistic Automatic Indexing by Learning from Human indexers (1984) (32)
- THE PARAMETRIC DESCRIPTION OF RETRIEVAL TESTS (1969) (31)
- Deep versus shallow judgments in learning to rank (2009) (30)
- Microsoft Research at TREC 2009: Web and Relevance Feedback Track (2009) (30)
- Language Modelling and Relevance (2003) (30)
- On the Contributions of Topics to System Evaluation (2011) (30)
- Salton Award Lecture on theoretical argument in information retrieval (2000) (28)
- On document relevance and lexical cohesion between query terms (2006) (27)
- Language Modeling and Relevance (2003) (27)
- Relevance Feedback for Best Match Term Weighting Algorithms in Information Retrieval (2001) (25)
- On the nature of fuzz: A diatribe (1978) (24)
- TREC-10 Web Track Experiments at MSRA (2001) (23)
- A new unified probabilistic model (2004) (23)
- Evaluation of online catalogues: Eliciting information from the user (1991) (23)
- Evaluation in Information Retrieval (2000) (22)
- Microsoft Cambridge at TREC-12: HARD track (2003) (22)
- A domain-independent approach to finding related entities (2012) (21)
- On Using Fewer Topics in Information Retrieval Evaluations (2013) (21)
- Flexible pseudo-relevance feedback using optimization tables (2001) (20)
- Statistical problems in the application of probabilistic models to information retrieval (1982) (20)
- Deciphering cluster representations (2001) (19)
- Documentation note Query-Document Symmetry and Dual Models (1994) (18)
- On sample sizes for non-matched-pair IR experiments (1990) (17)
- Laboratory experiments with Okapi: participation in the TREC programme (1997) (17)
- Evaluation of online catalogues : an assessment of methods (1990) (17)
- Modelling Score Distributions Without Actual Scores (2013) (16)
- Clustering Information Retrieval Search Outputs (1999) (16)
- Effective and Robust Query-Based Stemming (2013) (15)
- Research and evaluation in information retrieval (1997) (15)
- Evaluation of Interfaces for IRS: Modelling End-User Searching Behaviour (1998) (15)
- Using self-supervised word segmentation in Chinese information retrieval (2002) (15)
- Window-based Enterprise Expert Search (2006) (15)
- Okapi Chinese Text Retrieval Experiments at TREC-6 (1997) (14)
- Integration of Collocation Statistics into the Probabilistic Retrieval Model (2002) (13)
- Score Distributions in Information Retrieval (2009) (13)
- Parallel computing in information retrieval - an updated review (1997) (13)
- Language models and probability of relevance (2001) (12)
- The unified model revisited (2003) (11)
- An algorithm for weighted searching on a Boolean system (1984) (11)
- On Smoothing Average Precision (2012) (11)
- Probability‐Based Chinese Text Processing and Retrieval (2000) (11)
- Introduction to the Special Issue: Overview of the TREC Routing and Filtering Tasks (2002) (11)
- Selecting Query Term Alternations for Web Search by Exploiting Query Contexts (2008) (11)
- Term frequency and term value (1981) (11)
- Incorporating User Behavior Information in IR Evaluation (2009) (10)
- ON DOCUMENT POPULATIONS AND MEASURES OF IR EFFECTIVENESS (2007) (10)
- On Bayesian models and event spaces in information retrieval (2002) (10)
- Microsoft Cambridge at TREC-10: Filtering and Web Tracks (2001) (9)
- The TREC-2001 Filtering Track Report | NIST (2002) (9)
- Probabilistic models in IR and their relationships (2014) (8)
- Ambiguous requests: implications for retrieval tests (2007) (8)
- Application of probabilistic methods to Chinese (1997) (8)
- Journal Acquisition by Libraries: Scatter and Cost Effectiveness. (1975) (7)
- INEX 2007 Evaluation Measures (Draft) (2007) (7)
- On Concurrency Control for Inverted Files (1995) (7)
- Indexing Theory and Retrieval Effectiveness. (1978) (7)
- A STATISTICAL ANALYSIS OF RETRIEVAL TESTS: A BAYESIAN APPROACH (1974) (7)
- Comparing the Performance of Adaptive Filtering and Ranked Output Systems (2002) (7)
- Salton Award lecture: on theoretical argument in information retrieval (summary only): on theoretical argument in information retrieval (2000) (7)
- XML-Structured Documents: Retrievable Units and Inheritance (2006) (6)
- Flexible Pseudo-Relevance Feedback for NTCIR-2 (2001) (6)
- Challenges posed by web-based retrieval of scientific papers: Okapi participation in TIPS (2002) (6)
- Average Precision at n (2009) (6)
- Creating a test collection: relevance judgements of cited & non-cited papers (2007) (5)
- Explicit and implicit variables in information retrieval (IR) systems (1975) (5)
- Flexible Pseudo-Relevance Feedback via Direct Mapping and Categorization of Search Requests (2006) (5)
- PLIERS at TREC8 (1999) (5)
- PLIERS: A Parallel Information Retrieval System Using MPI (1999) (4)
- A Study of Document Relevance and Lexical Cohesion between Query Terms (2005) (4)
- CISR at INEX 2006 (2006) (4)
- Parallel methods for the generation of partitioned inverted files (2005) (4)
- Computer retrieval as seen through the pages of Journal of Documentation (1994) (3)
- Parallel methods for the update of partitioned inverted files (2007) (3)
- On the Early History of Evaluation in IR (2005) (3)
- Advances in Information Retrieval Theory: Second International Conference on the Theory of Information Retrieval, ICTIR 2009 Cambridge, UK, September ... (Lecture Notes in Computer Science) (2009) (3)
- Introduction to special issue on the second international conference on the theory of information retrieval (2011) (3)
- Advances in Information Retrieval Theory: Second International Conference on the Theory of Information Retrieval, ICTIR 2009 Cambridge, UK, September 10-12, ... Applications, incl. Internet/Web, and HCI (2009) (3)
- Okapi at TREC { 6 Automatic ad hoc , VLC , routing , ltering and (1997) (3)
- Exploiting hyperlink recommendation evidence in navigational web search (2004) (2)
- On Fuzzy sets: Reply to Cerny (1979) (2)
- A Theory of Information Matching (2012) (2)
- Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model (2009) (2)
- Process and Outcome: On the Evaluation of IR Systems in the Age of Interaction, GUIs and Multimedia (1999) (2)
- Parallel Methods for the Search of Partitioned Inverted Files (2)
- Relative and absolute term selection criteria: a comparative study for English and Japanese IR (2002) (2)
- Parallel Computing for Term Selection in Routing/Filtering (2003) (2)
- A Brief History of Search Results Ranking (2019) (2)
- An operational evaluation of weighting , ranking and relevance feedback via a front-end system (1985) (2)
- The TREC-8 Filtering Track Final Report - Figures (1999) (2)
- Relevance, Retrieval and Document Spaces (1979) (1)
- Parallel computing for passage retrieval (2004) (1)
- Documentation Note: Specificity and Weighted Retrieval. (1974) (1)
- Obituary: In Memoriam (2007) (1)
- Relevance, Retrieval and Document Spaces (1979) (1)
- In Defence of Relevance (1974) (1)
- Retrieval System Models: What’s New? (2004) (1)
- On real-time ad-hoc retrieval evaluation (2012) (1)
- Information Retrieval Research, Proc. Joint ACM/BCS Symposium in Information Storage and Retrieval, Cambridge, UK, June 1980 (1981) (1)
- Are Evaluation Metrics Identical With Binary Judgements ? (2009) (1)
- Machine Learning and Relevance Feedback (1992) (1)
- In memoriam: Karen Spärck Jones (2007) (1)
- The study of information retrieval: a long view (2008) (1)
- A tool for comparative evaluation in an interactive environment (2002) (1)
- On the science of search: statistical approaches, evaluation, optimisation (2006) (1)
- Bayesian Extension to the Language Model (2003) (0)
- The Last Half-Century: A Perspective on Experimentation in Information Retrieval (2007) (0)
- Workshop Report - Use of Training Materials in Constructing Routing Queries (1993) (0)
- Cyril W. Cleverdon (In Memoriam) (1998) (0)
- PLIERS AT VLC 2 (2007) (0)
- The Web, the Home and the Search Engine (2012) (0)
- Special Issue on the Second International Conference on the Theory of Information Retrieval (ICTIR 2009) (2011) (0)
- A Unified Theory of Information Matching (2012) (0)
- Retrieval and relevance : on the evaluation of IR systems (2011) (0)
- Proceedings of the 2nd International Conference on Theory of Information Retrieval: Advances in Information Retrieval Theory (2009) (0)
- In memoriam: Cyril W. Cleverdon (1998) (0)
- Forward to the Past: Notes towards a Pre-history of Web Search (2017) (0)
- Expanding a Test Collection for Citation-based IR Experiments (2007) (0)
- Final Report on International Research Forum in Information Science the Theoretical Basis of Information Science, 29 July-2 August, 1975 (1976) (0)
- Probabilistic retrieval: thresholding for automatic filtering (1999) (0)
- Advances in Information Retrieval Theory (Proceedings of the 2nd International Conference on the Theory of Information Retrieval, ICTIR 2009) (2009) (0)
- A Unified Relevance Retrieval Model by Eliteness Hypothesis (2011) (0)
- Development of the unified probabilistic model : report on an overseas study visit to Berkeley, California, in August 1986 (1986) (0)
- On retrieval system theory (2011) (0)
- Preface Organizing Committee Program Committee Usefulness as the Criterion for Evaluation of Interactive Information Retrieval Systems Semi-supervised Priors for Microblog Language Identification Scope of Negation Detection in Sentiment Analysis a Multi-dimensional Model for Search Intent Result Div (0)
- A Theory of Information Matching (TIM) (2012) (0)
- Cyril W. Cleverdon (1998) (0)
- Gerard (Gerry) Salton (1996) (0)
- PLIERS at TREC8 - Appendix (1999) (0)
This paper list is powered by the following services:
Other Resources About Stephen Robertson
What Schools Are Affiliated With Stephen Robertson ?
Stephen Robertson is affiliated with the following schools: