Gerald James Tesauro

Gerald James Tesauro's AcademicInfluence.com Rankings

Computer Science

#8537

World Rank

#8974

Historical Rank

Machine Learning

#3519

World Rank

#3562

Historical Rank

Artificial Intelligence

#3827

World Rank

#3882

Historical Rank

Database

#5538

World Rank

#5745

Historical Rank

computer-science Degrees

Download Badge

Computer Science

Gerald James Tesauro's Degrees

PhD Computer Science University of Massachusetts Amherst
Masters Computer Science University of Massachusetts Amherst
Bachelors Computer Science University of Massachusetts Amherst

Similar Degrees You Can Earn

Why Is Gerald James Tesauro Influential?

(Suggest an Edit or Addition)

(See a Problem?)

Gerald James Tesauro's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Temporal difference learning and TD-Gammon (1995) (1946)
TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play (1994) (894)
Practical issues in temporal difference learning (1992) (699)
Utility functions in autonomic systems (2004) (472)
Learning to Learn without Forgetting By Maximizing Transfer and Minimizing Interference (2018) (444)
A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation (2006) (372)
Agent-Human Interactions in the Continuous Double Auction (2001) (280)
On-line Policy Improvement using Monte-Carlo Search (1996) (256)
A multi-agent systems approach to autonomic computing (2004) (251)
Extending Q-Learning to General Adaptive Multi-Agent Systems (2003) (220)
Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation (2016) (185)
Coordinating Multiple Autonomic Managers to Achieve Specified Power-Performance Tradeoffs (2007) (182)
Analyzing Complex Strategic Interactions in Multi-Agent Systems (2002) (180)
R3: Reinforced Ranker-Reader for Open-Domain Question Answering (2018) (179)
Programming backgammon using self-teaching neural nets (2002) (176)
Diverse Few-Shot Text Classification with Multiple Metrics (2018) (175)
A Parallel Network that Learns to Play Backgammon (1989) (170)
Neural networks for computer virus recognition (1996) (165)
Pricing in Agent Economies Using Multi-Agent Q-Learning (2002) (162)
Metric Learning for Kernel Regression (2007) (161)
High-performance bidding agents for the continuous double auction (2001) (151)
Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies (2007) (149)
Evidence Aggregation for Answer Re-Ranking in Open-Domain Question Answering (2017) (147)
Biologically Inspired Defenses Against Computer Viruses (1995) (143)
Autonomic multi-agent management of power and performance in data centers (2008) (136)
Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning (2007) (131)
Strategic sequential bidding in auctions using dynamic programming (2002) (126)
On the use of hybrid reinforcement learning for autonomic resource allocation (2007) (123)
TD-Gammon: A Self-Teaching Backgammon Program (1995) (120)
Scaling Relationships in Back-propagation Learning (1988) (115)
Online Resource Allocation Using Decompositional Reinforcement Learning (2005) (115)
Eigenoption Discovery through the Deep Successor Representation (2017) (110)
Monte-Carlo simulation balancing (2009) (104)
Utility-Function-Driven Resource Allocation in Autonomic Systems (2005) (101)
The Hebb Rule for Synaptic Plasticity: Algorithms and Implementations (1989) (100)
Neurogammon Wins Computer Olympiad (1989) (100)
Connectionist Learning of Expert Preferences by Comparison Training (1988) (99)
Strategic pricebot dynamics (1999) (97)
Learning to Teach in Cooperative Multiagent Reinforcement Learning (2018) (93)
AUTOMATICALLY GENERATED WIN32 HEURISTIC VIRUS DETECTION (2000) (73)
Neurogammon: a neural-network backgammon program (1990) (73)
Playing repeated Stackelberg games with unknown opponents (2012) (72)
Cooperative Negotiation in Autonomic Systems using Incremental Utility Elicitation (2002) (72)
How Tight Are the Vapnik-Chervonenkis Bounds? (1992) (69)
Scaling and Generalization in Neural Networks: A Case Study (1988) (68)
Hierarchical Memory Networks (2016) (67)
Reinforcement learning in board games (2004) (67)
R3: Reinforced Reader-Ranker for Open-Domain Question Answering (2017) (65)
Learning Abstract Options (2018) (60)
Temporal Difference Learning of Backgammon Strategy (1992) (57)
Selecting Near-Optimal Learners via Incremental Data Allocation (2015) (56)
Pricing in Agent Economies Using Neural Networks and Multi-agent Q-Learning (2001) (50)
Asymptotic Convergence of Backpropagation (1989) (44)
Multi-agent Q-learning and regression trees for automated pricing decisions (2000) (44)
Scaling Relationships in Back-Propagation Learning: Dependence on Training Set Size (1987) (40)
A 'Neural' Network that Learns to Play Backgammon (1987) (38)
Bayesian Inference in Monte-Carlo Tree Search (2010) (36)
Foresight-based pricing algorithms in an economy of software agents (1998) (35)
Analysis of Watson's Strategies for Playing Jeopardy! (2013) (34)
Olfactory Processing and Associative Memory: Cellular and Modeling Studies (1989) (29)
Practical Issues in Temporal Difference Learning (1991) (29)
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines (2020) (28)
Pseudo-convergent Q-Learning by Competitive Pricebots (2000) (28)
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning (2020) (27)
Can Neural Networks Do Better Than the Vapnik-Chervonenkis Bounds? (1990) (27)
Visualizing processes in neural networks (1991) (26)
A study of scaling and generalization in neural networks (1988) (26)
Comparison training of chess evaluation functions (2001) (26)
Neural Network Visualization (1989) (25)
A strategic decision model for multi-attribute bilateral negotiation with alternating (2003) (25)
Hybrid Reinforcement Learning with Expert State Sequences (2019) (23)
Simple neural models of classical conditioning (1986) (23)
Advances in neural information processing systems : proceedings of the ... conference (1989) (22)
Budgeted Prediction with Expert Advice (2015) (22)
Learning to Query, Reason, and Answer Questions On Ambiguous Texts (2016) (22)
Model-Based and Model-Free Approaches to Autonomic Resource Allocation (2005) (21)
New Approaches to Optimization and Utility Elicitation in Autonomic Computing (2005) (21)
Simulation, learning, and optimization techniques in Watson's game strategies (2012) (19)
Foresight-based pricing algorithms in agent economies (2000) (19)
Learning Hierarchical Teaching Policies for Cooperative Agents (2019) (19)
Active Collaborative Prediction with Maximum Margin Matrix Factorization (2008) (18)
Efficient search techniques for multi-attribute bilateral negotiation strategies (2002) (15)
Asymptotic Convergence of Backpropagation: Numerical Experiments (1989) (14)
The Eigenoption-Critic Framework (2017) (13)
Comments on “Co-Evolution in the Successful Learning of Backgammon Strategy” (1998) (13)
Decentralized TD Tracking with Linear Function Approximation and its Finite-Time Analysis (2020) (13)
A plausible neural circuit for classical conditioning without synaptic plasticity. (1988) (12)
Towards Cognitive Automation of Data Science (2015) (12)
Estimating End-to-End Performance by Collaborative Prediction with Active Sampling (2007) (12)
Neural models of classical conditioning: A theoretical viewpoint. (1990) (10)
Building network learning algorithms from Hebbian synapses (1990) (10)
On the Role of Weight Sharing During Deep Option Learning (2019) (9)
Multi-agent implementation of asymmetric protocol for bilateral negotiations (extended abstract) (2003) (8)
Online Performance Management Using Hybrid Reinforcement Learning (2005) (7)
Learning Hierarchical Teaching in Cooperative Multiagent Reinforcement Learning (2019) (7)
Optimal Sequential Drilling for Hydrocarbon Field Development Planning (2017) (6)
Connectionist Learning of Expert Backgammon Evaluations (1988) (6)
Context-Specific Representation Abstraction for Deep Option Learning (2021) (4)
Influencing Long-Term Behavior in Multiagent Reinforcement Learning (2022) (4)
Deep RL With Information Constrained Policies: Generalization in Continuous Control (2020) (4)
Statistical Approaches to Question Answering in Watson (2012) (3)
Introduction to the special issue on deep reinforcement learning: An editorial (2018) (3)
Efficient Black-Box Planning Using Macro-Actions with Focused Effects (2020) (2)
Proceedings of the 6th International Conference on Neural Information Processing Systems (1993) (2)
Reports of the AAAI 2014 Conference Workshops (2015) (2)
Capacity-Limited Decentralized Actor-Critic for Multi-Agent Games (2021) (2)
AI Planning Annotation in Reinforcement Learning: Options and Beyond (2021) (2)
Improvement of Systems Management Policies Using Hybrid Reinforcement Learning (2006) (2)
Finding Macro-Actions with Disentangled Effects for Efficient Planning with the Goal-Count Heuristic (2020) (1)
Robust Task Clustering for Deep Many-Task Learning (2017) (1)
Proceedings of the 7th International Conference on Neural Information Processing Systems (1994) (1)
Applying a framework for healthcare incentives simulation (2012) (1)
Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games (2020) (0)
Cognitive Computing (2017) (0)
Learning in Factored Domains with Information-Constrained Visual Representations (2023) (0)
Former NASA chief unveils $ 100 million neural chip maker KnuEdge (2016) (0)
MONTE-CARLO BACKGAMMON (2007) (0)
A Parallel Network Play Backgammon that Learns to (1989) (0)
The 3rd Advanced Chess match (León, June 2-5, 2000) (2000) (0)
RL Generalization in a Theory of Mind Game Through a Sleep Metaphor (Student Abstract) (2021) (0)
AI Planning Annotation for Sample Efficient Reinforcement Learning (2022) (0)
Institutional Knowledge at Singapore Management University Evidence aggregation for answer re-ranking in open-domain question answering (2019) (0)
Advances in Neural Information Processing Systems 6, [7th NIPS Conference, Denver, Colorado, USA, 1993] (1994) (0)
Game-Theoretical Perspectives on Active Equilibria: A Preferred Solution Concept over Nash Equilibria (2022) (0)
FORGETTING BY MAXIMIZING TRANSFER AND MINIMIZING INTERFERENCE (2018) (0)
Vc Dimension Further Information 3.3 Optimization Algorithms 3 Learning from Data (1996) (0)
Institutional Knowledge at Singapore Management University R3: Reinforced Ranker-Reader for open-domain Question Answering (2019) (0)
Scaling R elationships in Backpropagation Learning (2006) (0)

This paper list is powered by the following services:

Gerald James Tesauro's Academic­Influence.com Rankings

Gerald James Tesauro's Degrees

Similar Degrees You Can Earn

Why Is Gerald James Tesauro Influential?

Gerald James Tesauro's Published Works

Published Works

Gerald James Tesauro's AcademicInfluence.com Rankings