Peter Stone | Academic Influence

Peter Stone 's AcademicInfluence.com Rankings

Peter Stone

Computer Science

#1361

World Rank

#1408

Historical Rank

#681

USA Rank

computer-science Degrees

Download Badge

Computer Science

Peter Stone 's Degrees

PhD Computer Science Carnegie Mellon University
Masters Computer Science Carnegie Mellon University
Bachelors Mathematics University of Chicago

Similar Degrees You Can Earn

Why Is Peter Stone Influential?

(Suggest an Edit or Addition)

According to Wikipedia, Peter Stone is an American computer scientist who is the David Bruton Jr. Centennial Professor of Computer Science at the University of Texas at Austin. He is also an Alfred P. Sloan Research Fellow, Guggenheim Fellow, AAAI Fellow, and Fulbright Scholar.

(See a Problem?)

Peter Stone 's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Transfer Learning for Reinforcement Learning Domains: A Survey (2009) (1665)
Reinforcement Learning (2010) (1490)
Multiagent Systems: A Survey from a Machine Learning Perspective (2000) (1347)
Deep Recurrent Q-Learning for Partially Observable MDPs (2015) (1259)
A Multiagent Approach to Autonomous Intersection Management (2008) (1038)
Policy gradient reinforcement learning for fast quadrupedal locomotion (2004) (616)
Multiagent traffic management: a reservation-based intersection control mechanism (2004) (597)
Task Decomposition, Dynamic Role Assignment, and Low-Bandwidth Communication for Real-Time Strategic Teamwork (1999) (527)
Reinforcement Learning for RoboCup Soccer Keepaway (2005) (461)
Interactively shaping agents via human reinforcement: the TAMER framework (2009) (433)
The RoboCup Synthetic Agent Challenge 97 (1997) (374)
Scalable training of artificial neural networks with adaptive sparse connectivity inspired by network science (2017) (372)
Behavioral Cloning from Observation (2018) (362)
Layered learning in multiagent systems - a winning approach to robotic soccer (2000) (343)
Autonomous agents modelling other agents: A comprehensive survey and open problems (2017) (336)
Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination (2010) (327)
PAC Subset Selection in Stochastic Multi-armed Bandits (2012) (317)
Multiagent traffic management: an improved intersection control mechanism (2005) (312)
Layered Learning in Multiagent Systems (1997) (263)
Deep Reinforcement Learning in Parameterized Action Space (2015) (254)
Evolutionary Function Approximation for Reinforcement Learning (2006) (238)
Transfer Learning via Inter-Task Mappings for Temporal Difference Learning (2007) (234)
Scaling Reinforcement Learning toward RoboCup Soccer (2001) (221)
Combining manual feedback with subsequent MDP reward signals for reinforcement learning (2010) (219)
Cross-domain transfer for reinforcement learning (2007) (207)
Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey (2020) (205)
Layered Approach to Learning Client Behaviors in the Robocup Soccer Server (1998) (200)
Machine Learning for Fast Quadrupedal Locomotion (2004) (196)
Auction-based autonomous intersection management (2013) (192)
A Neuroevolution Approach to General Atari Game Playing (2014) (189)
Boosting for Regression Transfer (2010) (183)
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces (2017) (171)
Layered Learning (2000) (170)
When Security Games Go Green: Designing Defender Strategies to Prevent Poaching and Illegal Fishing (2015) (168)
Reinforcement learning from simultaneous human and MDP reward (2012) (162)
Learning to Interpret Natural Language Commands through Human-Robot Dialog (2015) (160)
Keepaway Soccer: From Machine Learning Testbed to Benchmark (2005) (155)
Team-partitioned, opaque-transition reinforcement learning (1999) (155)
A polynomial-time nash equilibrium algorithm for repeated games (2003) (153)
A multi-robot system for continuous area sweeping tasks (2006) (152)
The 2001 trading agent competition (2002) (150)
A social reinforcement learning agent (2001) (149)
Behavior transfer for value-function-based reinforcement learning (2005) (144)
Task Decomposition and Dynamic Role Assignment for Real-Time Strategic Teamwork (1998) (143)
Generative Adversarial Imitation from Observation (2018) (143)
State Abstraction Discovery from Irrelevant State Variables (2005) (141)
Learning Predictive State Representations (2003) (138)
Transfer via inter-task mappings in policy search reinforcement learning (2007) (137)
Automated Intersection Control (2011) (135)
Transferring Instances for Model-Based Reinforcement Learning (2008) (134)
Autonomous Intersection Management: Multi-intersection optimization (2011) (132)
Autonomous bidding agents - strategies and lessons from the trading agent competition (2007) (132)
Training a Robot via Human Feedback: A Case Study (2013) (132)
Sharing the Road: Autonomous Vehicles Meet Human Drivers (2007) (129)
Autonomous Bidding Agents in the Trading Agent Competition (2001) (129)
Empirical evaluation of ad hoc teamwork in the pursuit domain (2011) (126)
Autonomous transfer for reinforcement learning (2008) (125)
Artificial Intelligence and Life in 2030: The One Hundred Year Study on Artificial Intelligence (2016) (123)
Towards collaborative and adversarial learning: a case study in robotic soccer (1998) (123)
General Game Learning Using Knowledge Transfer (2007) (121)
Function Approximation via Tile Coding: Automating Parameter Choice (2005) (120)
Automatic feature selection in neuroevolution (2005) (120)
TEXPLORE: real-time sample-efficient reinforcement learning for robots (2012) (113)
Evolving Keepaway Soccer Players through Task Decomposition (2003) (113)
ATTac-2000: an adaptive autonomous bidding agent (2001) (111)
RoboCup 2000: Robot Soccer World Cup IV (2001) (111)
Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study (2006) (105)
Armed Conflict (2019) (104)
Generalized model learning for Reinforcement Learning on a humanoid robot (2010) (103)
Comparing evolutionary and temporal difference methods in a reinforcement learning domain (2006) (103)
BWIBots: A platform for bridging the gap between AI and human–robot interaction research (2017) (101)
The CMUnited-99 Champion Simulator Team (2000) (99)
Efficient Selection of Multiple Bandit Arms: Theory and Practice (2010) (98)
Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems (2006) (97)
Learning Multi-Modal Grounded Linguistic Semantics by Playing "I Spy" (2016) (96)
A synthesis of automated planning and reinforcement learning for efficient, robust decision-making (2016) (96)
Improving Action Selection in MDP's via Knowledge Transfer (2005) (94)
Source Task Creation for Curriculum Learning (2016) (92)
The CMUnited-98 Champion Simulator Team (1998) (91)
Evolving Soccer Keepaway Players Through Task Decomposition (2005) (91)
Protecting against evaluation overfitting in empirical reinforcement learning (2011) (91)
Value Functions for RL-Based Behavior Transfer: A Comparative Study (2005) (90)
Empowerment for continuous agent—environment systems (2011) (89)
Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping (2006) (89)
Recent Advances in Imitation Learning from Observation (2019) (89)
Replacing the stop sign: unmanaged intersection control for autonomous vehicles (2008) (88)
Outracing champion Gran Turismo drivers with deep reinforcement learning (2022) (86)
Stochastic Grounded Action Transformation for Robot Learning in Simulation (2017) (85)
Automatic Heuristic Construction in a Complete General Game Player (2006) (85)
Humanoid robots learning to walk faster: from the real world to simulation and back (2013) (84)
Batch reinforcement learning in a complex domain (2007) (83)
Autonomous Intersection Management for Semi-Autonomous Vehicles (2015) (82)
The Nature of Belief-Directed Exploratory Choice in Human Decision-Making (2011) (81)
Automatic Curriculum Graph Generation for Reinforcement Learning Agents (2017) (81)
Intrinsically motivated model learning for developing curious robots (2017) (80)
Implicit Negotiation in Repeated Games (2001) (78)
Model-based function approximation in reinforcement learning (2007) (77)
Practical Vision-Based Monte Carlo Localization on a Legged Robot (2005) (77)
Teamwork with Limited Knowledge of Teammates (2013) (77)
Autonomous Task Sequencing for Customized Curriculum Design in Reinforcement Learning (2017) (77)
Cobot in LambdaMOO: A Social Statistics Agent (2000) (76)
Motion Planning Algorithms for Autonomous Intersection Management (2010) (76)
Transfer learning for reinforcement learning on a physical robot (2010) (75)
Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation (2016) (74)
The utility of temporal abstraction in reinforcement learning (2008) (73)
Decision-Theoretic Bidding Based on Learned Density Models in Simultaneous, Interacting Auctions (2003) (71)
Learning Curriculum Policies for Reinforcement Learning (2018) (71)
Graph-Based Domain Mapping for Transfer Learning in General Games (2007) (70)
Traffic Intersections of the Future (2006) (70)
The Robocup Physical Agent Challenge: Phase I (1998) (70)
Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork (2015) (69)
TacTex'13: A Champion Adaptive Power Trading Agent (2014) (69)
Design and Optimization of an Omnidirectional Humanoid Walk: A Winning Approach at the RoboCup 2011 3D Simulation Competition (2012) (68)
Making friends on the fly: Cooperating with new teammates (2017) (68)
To teach or not to teach?: decision making under uncertainty in ad hoc teams (2010) (66)
A Protocol for Mixed Autonomous and Human-Operated Vehicles at Intersections (2017) (66)
RTMBA: A Real-Time Model-Based Reinforcement Learning Architecture for robot control (2011) (66)
Leading Best-Response Strategies in Repeated Games (2001) (66)
An Introduction to Intertask Transfer for Reinforcement Learning (2011) (66)
Bringing simulation to life: A mixed reality autonomous intersection (2010) (65)
Leveraging Human Guidance for Deep Reinforcement Learning Tasks (2019) (65)
Modeling Auction Price Uncertainty Using Boosting-based Conditional Density Estimation (2002) (65)
Using decision tree confidence factors for multi-agent control (1998) (64)
Anticipation as a key for collaboration in a team of agents: a case study in robotic soccer (1999) (64)
The CMUnited-97 Small Robot Team (1997) (64)
The CMUnited-97 robotic soccer team: perception and multiagent control (1998) (64)
Dynamic lane reversal in traffic management (2011) (63)
DJ-MC: A Reinforcement-Learning Agent for Music Playlist Recommendation (2014) (62)
Cobot in LambdaMOO: An Adaptive Social Statistics Agent (2006) (62)
Empirical Studies in Action Selection with Reinforcement Learning (2007) (62)
HyperNEAT-GGP: a hyperNEAT-based atari general game player (2012) (61)
Defining and using ideal teammate and opponent agent models: a case study in robotic soccer (2000) (60)
Multiagent learning is not the answer. It is the question (2007) (59)
The Need for Different Domain-independent Heuristics (1994) (58)
The RoboCup Soccer Server and CMUnited Clients: Implemented Infrastructure for MAS Research (2003) (58)
Reinforcement learning from human reward: Discounting in episodic tasks (2012) (57)
Designing safe, profitable automated stock trading agents using evolutionary algorithms (2006) (56)
Three automated stock-trading agents: a comparative study (2004) (56)
Improving Grounded Natural Language Understanding through Human-Robot Dialog (2018) (55)
Concurrent layered learning (2003) (54)
On optimizing interdependent skills: a case study in simulated 3D humanoid robot soccer (2011) (54)
How Humans Teach Agents (2012) (54)
Leading ad hoc agents in joint action settings with multiple teammates (2012) (53)
Learning and Multiagent Reasoning for Autonomous Agents (2007) (53)
Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance (2015) (53)
The First International Trading Agent Competition: Autonomous Bidding Agents (2005) (53)
Bidding for customer orders in TAC SCM (2004) (52)
The CMUnited-98 champion small-robot team (1998) (52)
CORPP: Commonsense Reasoning and Probabilistic Planning, as Applied to Dialog with a Mobile Robot (2015) (52)
Model-Based Exploration in Continuous State Spaces (2007) (52)
Learning and Using Models (2012) (51)
Hierarchical model-based reinforcement learning: R-max + MAXQ (2008) (51)
An architecture for action selection in robotic soccer (2001) (51)
Importance Sampling Policy Evaluation with an Estimated Behavior Policy (2018) (50)
Variety Wins: Soccer-Playing Robots and Infant Walking (2018) (50)
Towards Self-Configuring Hardware for Distributed Computer Systems (2005) (50)
Enforcing Liveness in Autonomous Traffic Management (2011) (49)
Multiagent Traffic Management: Opportunities for Multiagent Learning (2005) (49)
FLECS: Planning with a Flexible Commitment Strategy (1995) (49)
SCRAM: Scalable Collision-avoiding Role Assignment with Minimal-Makespan for Formational Positioning (2014) (49)
Generalized model learning for reinforcement learning in factored domains (2009) (47)
Opportunistic Active Learning for Grounding Natural Language Descriptions (2017) (47)
Keepaway Soccer: A Machine Learning Testbed (2001) (47)
Know Thine Enemy: A Champion RoboCup Coach Agent (2006) (46)
Adaptive job routing and scheduling (2004) (46)
Agents teaching agents: a survey on inter-agent transfer learning (2019) (46)
Layered Disclosure: Revealing Agents' Internals (2000) (45)
Reasoning about Hypothetical Agent Behaviours and their Parameters (2017) (45)
Predictive Planning for Supply Chain Management (2006) (44)
TacTex-03: a supply chain management agent (2004) (43)
A Platform for Evaluating Autonomous Intersection Management Policies (2012) (43)
A learning agent for heat-pump thermostat control (2013) (43)
Autonomous Learning of Stable Quadruped Locomotion (2006) (43)
Team-Partitioned, Opaque-Transition Reinforced Learning (1998) (42)
Real-time vision on a mobile robot platform (2005) (42)
Multiagent Patrol Generalized to Complex Environmental Conditions (2011) (42)
Machine Learning for On-Line Hardware Reconfiguration (2007) (41)
Reinforcement Learning for 3 vs. 2 Keepaway (2000) (41)
Using a million cell simulation of the cerebellum: Network scaling and task generality (2013) (41)
Individual and collaborative behaviors in a team of homogeneous robotic soccer agents (1998) (40)
Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function's In-Degree (2011) (40)
APPLD: Adaptive Planner Parameter Learning From Demonstration (2020) (40)
Leading a Best-Response Teammate in an Ad Hoc Team (2009) (40)
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning (2018) (40)
Mitigating catastrophic failure at intersections of autonomous vehicles (2008) (40)
Intelligent Robots and Autonomous Agents (2002) (39)
Towards Illumination Invariance in the Legged League (2005) (39)
Learning Inter-Task Transferability in the Absence of Target Task Samples (2015) (39)
Approximately Orchestrated Routing and Transportation Analyzer: Large-scale traffic simulation for autonomous vehicles (2012) (39)
Ad hoc teamwork for leading a flock (2013) (38)
Beating a Defender in Robotic Soccer: Memory-Based Learning of a Continuous Function (1995) (38)
Communicating with Unknown Teammates (2014) (38)
UT Austin Villa: RoboCup 2016 3D Simulation League Competition and Technical Challenges Champions (2015) (38)
Learning Complementary Multiagent Behaviors: A Case Study (2009) (38)
UT Austin Villa 2011: a champion agent in the RoboCup 3D soccer simulation competition (2012) (38)
The RoboCup Physical Agent Challenge: Goals and Protocols for Phase 1 (1997) (38)
A Lifelong Learning Approach to Mobile Robot Navigation (2021) (38)
Kernel-Based Models for Reinforcement Learning (2006) (37)
From pixels to multi-robot decision-making: A study in uncertainty (2006) (37)
ESSENTIALS OF GAME THEORY (2007) (37)
The Impact of Determinism on Learning Atari 2600 Games (2015) (37)
Towards autonomous sensor and actuator model induction on a mobile robot (2006) (37)
Evolutionary Training of Sparse Artificial Neural Networks: A Network Science Perspective (2017) (37)
UT Austin Villa 2014: RoboCup 3D Simulation League Champion via Overlapping Layered Learning (2015) (37)
Cobot: A Social Reinforcement Learning Agent (2001) (36)
Color learning and illumination invariance on mobile robots: A survey (2009) (36)
CMUnited: a team of robotics soccer agents collaborating in an adversarial environment (1998) (36)
Jointly Improving Parsing and Perception for Natural Language Commands through Human-Robot Dialog (2020) (36)
Intrinsically motivated model learning for a developing curious agent (2012) (36)
Negative information and line observations for Monte Carlo localization (2008) (36)
ATTac-2001: A Learning, Autonomous Bidding Agent (2002) (36)
RoboCup 2012: Robot Soccer World Cup XVI (2013) (36)
Layered Learning and Flexible Teamwork in RoboCup Simulation Agents (1999) (36)
Conflict-Averse Gradient Descent for Multi-task Learning (2021) (35)
Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning (2010) (35)
Planning in Action Language BC while Learning Action Costs for Mobile Robots (2014) (35)
The UT Austin Villa 2003 Champion Simulator Coach: A Machine Learning Approach (2005) (35)
TacTex-05: A Champion Supply Chain Management Agent (2006) (35)
Multiagent learning in the presence of memory-bounded agents (2014) (34)
Network-wide adaptive tolling for connected and automated vehicles (2017) (34)
An analysis framework for ad hoc teamwork tasks (2012) (34)
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-like Exploration (2010) (34)
The CMUnited-97 Simulator Team (1997) (34)
Motion Control for Mobile Robot Navigation Using Machine Learning: a Survey (2020) (33)
TPOT-RL Applied to Network Routing (2000) (33)
Teaching and leading an ad hoc teammate: Collaboration without pre-coordination (2013) (33)
Representation Transfer for Reinforcement Learning (2007) (32)
Reinforcement Learning for Optimization of COVID-19 Mitigation policies (2020) (32)
Overlapping layered learning (2018) (32)
An Assessment of Autonomous Vehicles: Traffic Impacts and Infrastructure Needs—Final Report (2017) (32)
The CMUnited-97 robotic soccer team: Perception and multi-agent control (1999) (32)
Multiagent Competitions and Research: Lessons from RoboCup and TAC (2002) (32)
Positioning to Win: A Dynamic Role Assignment and Formation Positioning System (2012) (32)
The EMPATHIC Framework for Task Learning from Implicit Human Feedback (2020) (31)
Learning Teammate Models for Ad Hoc Teamwork (2012) (31)
Real-time Adaptive Tolling Scheme for Optimized Social Welfare in Traffic Networks (2017) (31)
Minimum Cost Matching for Autonomous Carsharing (2016) (31)
Continuous area sweeping: a task definition and initial approach (2005) (31)
The UT Austin Villa 2004 RoboCup Four-Legged Team: Coming of Age (31)
Flood disaster mitigation: a real-world challenge problem for multi-agent unmanned surface vehicles (2011) (30)
Adaptive mechanism design: a metalearning approach (2006) (30)
RoboCup-2001: The Fifth Robotic Soccer World Championships (2002) (30)
CARVE: A Cognitive Agent for Resource Value Estimation (2008) (30)
Progress in learning 3 vs. 2 keepaway (2003) (30)
Multiagent interactions in urban driving (2008) (30)
UT Austin Villa: RoboCup 2012 3D Simulation League Champion (2012) (30)
On coordination in practical multi-robot patrol (2012) (29)
Role-Based Ad Hoc Teamwork (2011) (29)
Anticipation: A Key for Collaboration in a Team of Agents (1999) (29)
How We Turned around a Problem School. (1992) (29)
User-guided interleaving of planning and execution (1996) (28)
Autonomous Color Learning on a Mobile Robot (2005) (28)
Benchmarking Metric Ground Navigation (2020) (28)
An empirical analysis of value function-based and policy search reinforcement learning (2009) (28)
Learning non-myopically from human-generated reward (2013) (28)
Dynamically Constructed (PO)MDPs for Adaptive Robot Planning (2017) (28)
Bidding for Customer Orders in TAC SCM: A Learning Approach (2004) (28)
Reward (Mis)design for Autonomous Driving (2021) (27)
Online kernel selection for Bayesian reinforcement learning (2008) (27)
Data-Efficient Policy Evaluation Through Behavior Policy Search (2017) (27)
Guiding Exploratory Behaviors for Multi-Modal Grounding of Linguistic Descriptions (2018) (27)
The Right Music at the Right Time: Adaptive Personalized Playlists Based on Sequence Modeling (2019) (27)
On-line evolutionary computation for reinforcement learning in stochastic domains (2006) (27)
FAucS : An FCC Spectrum Auction Simulator for Autonomous Bidding Agents (2001) (26)
Two Stock-Trading Agents: Market Making and Technical Analysis (2003) (26)
Learning Inverse Kinodynamics for Accurate High-Speed Off-Road Navigation on Unstructured Terrain (2021) (26)
Motion planning and control for mobile robot navigation using machine learning: a survey (2020) (26)
Color Learning on a Mobile Robot: Towards Full Autonomy under Changing Illumination (2007) (26)
Convergence, Targeted Optimality, and Safety in Multiagent Learning (2010) (26)
Keyframe Sampling, Optimization, and Behavior Integration: Towards Long-Distance Kicking in the RoboCup 3D Simulation League (2014) (26)
Communication in Domains with Unreliable, Single-Channel, Low-Bandwidth Communication (1998) (25)
The Impact of Nondeterminism on Reproducibility in Deep Reinforcement Learning (2018) (25)
Using RoboCup in university-level computer science education (2004) (25)
Learning to Order Objects Using Haptic and Proprioceptive Exploratory Behaviors (2016) (25)
Overview of RoboCup-99 (2000) (25)
Planning in Answer Set Programming while Learning Action Costs for Mobile Robots (2014) (25)
Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison (2007) (24)
Toward Agile Maneuvers in Highly Constrained Spaces: Learning From Hallucination (2020) (24)
Ad Hoc Teamwork With Behavior Switching Agents (2019) (24)
Passive Demonstrations of Light-Based Robot Signals for Improved Human Interpretability (2018) (24)
Machine Learning Capabilities of a Simulated Cerebellum (2017) (24)
TacTex09: a champion bidding agent for ad auctions (2010) (24)
A Low Cost Ground Truth Detection System for RoboCup Using the Kinect (2012) (24)
Keeping in Touch: Maintaining Biconnected Structure by Homogeneous Robots (2006) (24)
Setpoint scheduling for autonomous vehicle controllers (2012) (24)
Imitation Learning from Video by Leveraging Proprioception (2019) (24)
Evasion planning for autonomous vehicles at intersections (2012) (24)
Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks (2021) (23)
Learning Powerful Kicks on the Aibo ERS-7: The Quest for a Striker (2010) (23)
Semi-autonomous intersection management (2014) (23)
Cooperating with a markovian ad hoc teammate (2013) (23)
Simultaneous Calibration of Action and Sensor Models on a Mobile Robot (2005) (23)
PETLON: Planning Efficiently for Task-Level-Optimal Navigation (2018) (23)
APPLR: Adaptive Planner Parameter Learning from Reinforcement (2020) (23)
Agent-based supply chain management (2004) (23)
Modeling uncertainty in leading ad hoc teams (2014) (22)
Traffic Optimization For a Mixture of Self-interested and Compliant Agents (2017) (22)
Developing adaptive auction mechanisms (2005) (22)
Multirobot Symbolic Planning under Temporal Uncertainty (2017) (22)
The CMUnited-98 Small-Robot Team (1998) (22)
The RoboCup 2013 drop-in player challenges: Experiments in ad hoc teamwork (2014) (22)
Adversarial Imitation Learning from State-only Demonstrations (2019) (22)
CMUNITED-97: RoboCup-97 Small-Robot World Champion Team (1998) (22)
UT Austin Villa 2012: Standard Platform League World Champions (2012) (21)
Design Principles for Creating Human-Shapable Agents (2009) (21)
Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data (2016) (21)
Online Multiagent Learning against Memory Bounded Adversaries (2008) (21)
Fast and Precise Black and White Ball Detection for RoboCup Soccer (2017) (21)
Open-World Reasoning for Service Robots (2019) (21)
Intelligent Autonomous Robotics: A Robot Soccer Case Study (2007) (21)
A Penny for Your Thoughts: The Value of Communication in Ad Hoc Teamwork (2020) (21)
The Chin Pinch: A Case Study in Skill Learning on a Legged Robot (2006) (21)
An Architecture for Person-Following using Active Target Search (2018) (21)
An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch (2020) (21)
Structure-based color learning on a mobile robot under changing illumination (2007) (21)
Adaptive Auction Mechanism Design and the Incorporation of Prior Knowledge (2010) (20)
Learning exploration strategies in model-based reinforcement learning (2013) (20)
Characterizing reinforcement learning methods through parameterized learning problems (2011) (20)
Determining Placements of Influencing Agents in a Flock (2015) (20)
Task-Motion Planning with Reinforcement Learning for Adaptable Mobile Service Robots (2019) (20)
APPLI: Adaptive Planner Parameter Learning From Interventions (2020) (20)
Video: RoboCup robot soccer history 1997 – 2011 (2012) (19)
Agile Robot Navigation through Hallucinated Learning and Sober Deployment (2020) (19)
A Study of Layered Learning Strategies Applied to Individual Behaviors in Robot Soccer (2015) (19)
The UT Austin Villa 2006 RoboCup Four-Legged Team (2005) (19)
The Lottery as a Democratic Institution (2013) (19)
Socially CompliAnt Navigation Dataset (SCAND): A Large-Scale Dataset Of Demonstrations For Social Navigation (2022) (19)
Influencing a Flock via Ad Hoc Teamwork (2014) (19)
Selective Visual Attention for Object Detection on a Legged Robot (2006) (19)
RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration (2019) (19)
Prevention and Resolution of Conflicts in Social Navigation - a Survey (2021) (18)
RoboCup-2000: The Fourth Robotic Soccer World Championships (2001) (18)
Accelerating Search with Transferred Heuristics (2007) (18)
Real time targeted exploration in large domains (2010) (18)
Task planning in robotics: an empirical comparison of PDDL- and ASP-based systems (2019) (18)
A century-long commitment to assessing artificial intelligence and its impact on society (2018) (18)
Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition (2021) (17)
Robot Representing and Reasoning with Knowledge from Reinforcement Learning (2018) (17)
Learning a Policy for Opportunistic Active Learning (2018) (17)
Mobile Robot Planning Using Action Language BC with an Abstraction Hierarchy (2015) (17)
Controlled Kicking under Uncertainty (2010) (17)
DyETC: Dynamic Electronic Toll Collection for Traffic Congestion Alleviation (2018) (17)
Interactive, repair-based planning and scheduling for Shuttle payload operations (1997) (16)
APPL: Adaptive Planner Parameter Learning (2021) (16)
Dynamic Sparse Training for Deep Reinforcement Learning (2021) (16)
Towards autonomic computing: adaptive network routing and scheduling (2004) (16)
Generalized Domains for Empirical Evaluations in Reinforcement Learning (2009) (16)
APPLE: Adaptive Planner Parameter Learning From Evaluative Feedback (2021) (16)
Guest Editors' Introduction: Agents and Markets (2003) (16)
Model-Based Reinforcement Learning in a Complex Domain (2008) (16)
Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles (2022) (16)
VOILA: Visual-Observation-Only Imitation Learning for Autonomous Navigation (2021) (16)
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks (2020) (15)
Enhanced Delta-tolling: Traffic Optimization via Policy Gradient Reinforcement Learning (2018) (15)
Adding Influencing Agents to a Flock (2016) (15)
Leading the Way: An Efficient Multi-robot Guidance System (2015) (15)
Importance sampling in reinforcement learning with an estimated behavior policy (2021) (15)
UT Austin Villa 2011: 3D Simulation Team Report (2011) (15)
On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search (2016) (15)
From Agile Ground to Aerial Navigation: Learning from Learned Hallucination (2021) (14)
LAAIR: A Layered Architecture for Autonomous Interactive Robots (2018) (14)
Transfer Learning and Intelligence: an Argument and Approach (2008) (14)
Expected Value of Communication for Planning in Ad Hoc Teamwork (2021) (14)
The 2007 TAC SCM Prediction Challenge (2008) (14)
Leveraging commonsense reasoning and multimodal perception for robot spoken dialog systems (2017) (14)
A Comparison of Two Approaches for Vision and Self-Localization on a Mobile Robot (2007) (14)
Unclogging Our Arteries: Using Human-Inspired Signals to Disambiguate Navigational Intentions (2019) (14)
Autonomous Electricity Trading Using Time-of-Use Tariffs in a Competitive Market (2016) (14)
Multi-robot planning with conflicts and synergies (2019) (14)
Breaking Bellman's Curse of Dimensionality: Efficient Kernel Gradient Temporal Difference (2017) (14)
Teaching agents with human feedback: a demonstration of the TAMER framework (2013) (13)
Towards reinforcement learning representation transfer (2007) (13)
A Distributed Biconnectivity Check (2006) (13)
An MDP-Based Winning Approach to Autonomous Power Trading: Formalization and Empirical Analysis (2016) (13)
Towards Eliminating Manual Color Calibration at RoboCup (2005) (13)
Robot-Centric Activity Recognition 'in the Wild' (2015) (13)
TacTex-05: An Adaptive Agent for TAC SCM (2006) (13)
Recent advances in leveraging human guidance for sequential decision-making tasks (2021) (13)
Inferring User Intention using Gaze in Vehicles (2018) (13)
WrightEagle and UT Austin Villa: RoboCup 2011 Simulation League Champions (2012) (13)
Balancing Individual Preferences and Shared Objectives in Multiagent Reinforcement Learning (2020) (13)
CMUNITED-98: RoboCup-98 Small-Robot World Champion Team (2000) (13)
A Model-Based Approach to Robot Joint Control (2005) (13)
Temporal-Logic-Based Reward Shaping for Continuing Learning Tasks (2020) (13)
Self-Enforcing Strategic Demand Reduction (2002) (13)
Austin Villa 2011 : Sharing is Caring : Better Awareness through Information Sharing (2012) (13)
Scalable Multiagent Driving Policies For Reducing Traffic Congestion (2021) (13)
TD Learning with Constrained Gradients (2018) (13)
Adapting in agent-based markets: a study from TAC SCM (2007) (12)
Policy Evaluation in Continuous MDPs With Efficient Kernelized Gradient Temporal Difference (2017) (12)
UT Austin Villa 2008: Standing On Two Legs (2008) (12)
Broad Learning from Narrow Training: A Case Study in Robotic Soccer (1995) (12)
RoboCup in Higher Education: A Preliminary Report (2003) (12)
Multi-modal Predicate Identification using Dynamically Learned Robot Controllers (2018) (12)
Causal Dynamics Learning for Task-Independent State Abstraction (2022) (12)
Reinforcement Learning with Human Feedback in Mountain Car (2011) (12)
IFSA: incremental feature-set augmentation for reinforcement learning tasks (2007) (12)
The RoboCup Soccer Server and CMUnited: Implemented Infrastructure for MAS Research (2000) (12)
Polynomial Regression with Automated Degree: A Function Approximator for Autonomous Agents (2006) (12)
Autonomous Planned Color Learning on a Mobile Robot Without Labeled Data (2006) (12)
Grounded action transformation for sim-to-real reinforcement learning (2021) (12)
Turning the corner: improved intersection control for autonomous vehicles (2005) (11)
Austin Villa 2010 Standard Platform Team Report (2010) (11)
Decision mechanisms underlying mood-congruent emotional classification (2018) (11)
Adversarial Intrinsic Motivation for Reinforcement Learning (2021) (11)
Batch reservations in autonomous intersection management (2011) (11)
Reducing Sampling Error in Batch Temporal Difference Learning (2020) (11)
RoboCup Soccer Leagues (2014) (11)
Machine versus Human Attention in Deep Reinforcement Learning Tasks (2020) (11)
Compositional Models for Reinforcement Learning (2009) (11)
Reinforced Grounded Action Transformation for Sim-to-Real Transfer (2020) (11)
Individual and Collaborative Behaviors in a Team of Robotic Soccer Agents (1998) (11)
Bottom-Up Skill Discovery From Unsegmented Demonstrations for Long-Horizon Robot Manipulation (2021) (10)
Special issue on multiagent interaction without prior coordination: guest editorial (2017) (10)
Keeping the Ball from CMUnited-99 (2000) (10)
UT Austin Villa RoboCup 3D Simulation Base Code Release (2016) (10)
Comparing Agents' Success against People in Security Domains (2011) (10)
Bringing Smart Transport to Texans: Ensuring the Benefits of a Connected and Autonomous Transport System in Texas (2016) (10)
MARIOnET: motion acquisition for robots through iterative online evaluative training (2010) (10)
RoboCup as an Introduction to CS Research (2003) (10)
The Fifth Robotic Soccer World Championships (2002) (10)
Three years of the RoboCup standard platform league drop-in player competition (2017) (10)
Dynamic behaviors on the NAO robot with closed-loop whole body operational space control (2016) (10)
A particle filter for bid estimation in ad auctions with periodic ranking observations (2011) (10)
Instance-Based Action Models for Fast Action Planning (2006) (10)
Toward the Robot Butler: the HUMABOT Challenge (2015) (10)
Automated Design of Robust Mechanisms (2017) (10)
Learning Policy Selection for Autonomous Intersection Management (2007) (10)
AD HOC TEAMWORK BEHAVIORS FOR INFLUENCING A FLOCK (2016) (9)
Global action selection for illumination invariant color modeling (2007) (9)
TT-UT Austin Villa 2009: Naos Across Texas (2009) (9)
VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics (2022) (9)
Multi-Robot Human Guidance: Human Experiments and Multiple Concurrent Requests (2017) (9)
CMUnited-98: RoboCup-98 Simulator World Champion Team (1999) (9)
Towards on-board color constancy on mobile robots (2004) (9)
Point of use production of liposomal solubilised products (2018) (9)
Benchmarking robot cooperation without pre-coordination in the RoboCup Standard Platform League drop-in player competition (2015) (9)
Learning to Solve Complex Planning Problems: Finding Useful Auxiliary Problems (1994) (9)
Sample-efficient Adversarial Imitation Learning from Observation (2019) (9)
Adapting Price Predictions in TAC SCM (2007) (9)
Marginal cost pricing for system optimal traffic assignment with recourse under supply-side uncertainty (2018) (9)
Lifelong Navigation (2020) (9)
Overview of RoboCup-2000 (2001) (9)
Agent-based supply chain management: bidding for customer orders (2004) (9)
A Neural Network-Based Approach to Robot Motion Control (2008) (8)
Maximum likelihood estimation of sensor and action model functions on a mobile robot (2008) (8)
The Textual History of King Lear (1980) (8)
Three Humanoid Soccer Platforms: Comparison and Synthesis (2009) (8)
Task Planning in Robotics: an Empirical Comparison of PDDL-based and ASP-based Systems. (2018) (8)
A layered approach for an autonomous robotic soccer system (1997) (8)
Feature Selection for Value Function Approximation Using Bayesian Model Selection (2009) (8)
Towards a Data Efficient Off-Policy Policy Gradient (2018) (8)
Designing Better Playlists with Monte Carlo Tree Search (2017) (8)
Monte Carlo Hierarchical Model Learning (2015) (8)
Multi-Robot Human Guidance Using Topological Graphs (2014) (8)
Prioritized Role Assignment for Marking (2016) (8)
Performance analysis of a counter-intuitive automated stock-trading agent (2003) (8)
Deep R-Learning for Continual Area Sweeping (2020) (7)
Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning (2006) (7)
How Music Alters Decision Making - Impact of Music Stimuli on Emotional Classification (2015) (7)
Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings (2021) (7)
UT Austin Villa: Project-Driven Research in AI and Robotics (2016) (7)
A New Experimental Perspective (2012) (7)
Layered disclosure: why is the agent doing what it's doing? (2000) (7)
On learning with imperfect representations (2011) (7)
Beyond Teleoperation: Exploiting Human Motor Skills with MARIOnET (2010) (7)
ATT-CMUnited-2000: Third Place Finisher in the RoboCup-2000 Simulator League (2000) (7)
Autonomous Return on Investment Analysis of Additional Processing Resources (2007) (7)
Action-Space Knowledge Transfer in MDP ’ s : Formalism , Suboptimality Bounds , and Algorithms ? (2005) (7)
Bin-based estimation of the amount of effort for embedded software development projects with support vector machines (2016) (7)
Skeletal Feature Compensation for Imitation Learning with Embodiment Mismatch (2021) (7)
A Study of Human-Robot Copilot Systems for En-route Destination Changing (2018) (6)
Teaching Social Behavior through Human Reinforcement for Ad hoc Teamwork - The STAR Framework: Extended Abstract (2018) (6)
Randomized strategic demand reduction: getting more by asking for less (2002) (6)
Leading Multiple Ad Hoc Teammates in Joint Action Settings (2011) (6)
The PETLON Algorithm to Plan Efficiently for Task-Level-Optimal Navigation (2020) (6)
Prediction , Behaviors , and Collaboration in a Team of Robotic Soccer Agents (1998) (6)
Progress in RoboCup Soccer Research in 2000 (2000) (6)
Mechanism Design for Correlated Valuations: Efficient Methods for Revenue Maximization (2021) (6)
PRISM: Pose Registration for Integrated Semantic Mapping (2018) (6)
Inter-Classifier Feedback for Human-Robot Interaction in a Domestic Setting (2008) (6)
Three years of the RoboCup standard platform league drop-in player competition (2016) (6)
On Continuous-Action Q-Learning via Tile Coding Function Approximation (2004) (6)
Speeding Up Reinforcement Learning with Behavior Transfer (2004) (6)
ATTUnited-2001: Using Heterogeneous Players (2001) (6)
Mechanism Design with Unknown Correlated Distributions: Can We Learn Optimal Mechanisms? (2017) (6)
Integer Linear Programming Formulations (2007) (6)
Training a Tetris agent via interactive shaping: a demonstration of the TAMER framework (2010) (6)
Improving particle filter performance using SSE instructions (2009) (6)
The Open-Source TEXPLORE Code Release for Reinforcement Learning on Robots (2013) (6)
Desiderata for Planning Systems in General-Purpose Service Robots (2019) (6)
Multirobot Systems (2017) (6)
Ultrasonography in obstetrics. (1987) (6)
Distributional Reinforcement Learning Applied to Robot Soccer Simulation WorkIn-Progress Paper-ALA Workshop at AAMAS-19 (2019) (5)
Representative Selection in Nonmetric Datasets (2015) (5)
The CMUnited-99 Simulator Team (1999) (5)
Solving Service Robot Tasks: UT Austin Villa@Home 2019 Team Report (2019) (5)
Evaluating Ad Hoc Teamwork Performance in Drop-In Player Challenges (2017) (5)
Naos Across Texas (2009) (5)
Machine Learning Methods for Local Motion Planning: A Study of End-to-End vs. Parameter Learning (2021) (5)
A reservation-based multiagent system for intersection control (2004) (5)
The Cmunited-97 Simulator Team in Robocup-97: Robot Soccer World Cup I (1998) (5)
Layered Learning on a Physical Robot (2005) (5)
Optimal Use of Verbal Instructions for Multi-robot Human Navigation Guidance (2019) (5)
CMUnited-98: A Team of Robotic Soccer Agents (1999) (5)
Robust Motion Planning and Safety Benchmarking in Human Workspaces (2019) (5)
Visually Grounded Task and Motion Planning for Mobile Manipulation (2022) (5)
UT Austin Villa: RoboCup 2018 3D Simulation League Champions (2018) (5)
DIPD: Gaze-Based Intention Inference in Dynamic Environments (2018) (5)
High-Speed Accurate Robot Control using Learned Forward Kinodynamics and Non-linear Least Squares Optimization (2022) (5)
Autonomous Planned Color Learning on a Legged Robot (2006) (5)
Autonomous Model Management via Reinforcement Learning (2018) (4)
Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake (2022) (4)
Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning (2022) (4)
Making Autonomous Intersection Management Backwards-Compatible (2006) (4)
Instantiating the Contingent Bids Model of Truthful Interdependent Value Auctions (2006) (4)
A Stitch in Time - Autonomous Model Management via Reinforcement Learning (2018) (4)
UT Austin Villa 2013: Advances in Vision, Kinematics, and Strategy (2013) (4)
Delta-Tolling: Adaptive Tolling for Optimizing Traffic Throughput (2016) (4)
Escape Room: A Configurable Testbed for Hierarchical Reinforcement Learning (2018) (4)
Placing Influencing Agents in a Flock (2015) (4)
RoboCup: A Treasure Trove of Rich Diversity for Research Issues and Interdisciplinary Connections [TC Spotlight] (2019) (4)
High Confidence Off-Policy Evaluation with Models (2016) (4)
CMUNITED-98 Simulator Team (2000) (4)
Robot Scavenger Hunt: A Standardized Framework for Evaluating Intelligent Mobile Robots (2016) (4)
Lucid dreaming for experience replay: refreshing past states with the current policy (2020) (4)
Domestic Interaction on a Segway Base (2009) (4)
TacTex09: Champion of the First Trading Agent Competition on Ad Auctions (2010) (4)
Artificial Musical Intelligence: A Survey (2020) (4)
Link-based Parameterized Micro-tolling Scheme for Optimal Traffic Management (2018) (4)
Integrating Task-Motion Planning with Reinforcement Learning for Robust Decision Making in Mobile Robots (2018) (4)
Selecting Compliant Agents for Opt-in Micro-Tolling (2019) (4)
Using Dynamic Rewards to Learn a Fully Holonomic Bipedal Walk (2012) (4)
Planning for Improving Throughput in Autonomous Intersection Management (2010) (4)
Team Orienteering Coverage Planning with Uncertain Reward (2021) (4)
The RoboCup 2014 SPL Drop-in Player Competition: Encouraging Teamwork without Pre-coordination (2015) (4)
Bayesian Models of Nonstationary Markov Decision Processes (2005) (4)
The RoboCup 2013 drop-in player challenges: a testbed for ad hoc teamwork (2014) (3)
RoboCup-97 Small-Robot World Champion Team (1998) (3)
Inter-Task Action Correlation for Reinforcement Learning Tasks (2006) (3)
Marginal Cost Pricing with a Fixed Error Factor in Traffic Networks (2019) (3)
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 1 (2011) (3)
Building Self-Play Curricula Online by Playing with Expert Agents in Adversarial Games (2019) (3)
A Task Specification Language for Bootstrap Learning (2009) (3)
What's Hot at RoboCup (2016) (3)
An Empirical Comparison of PDDL-based and ASP-based Task Planners (2018) (3)
Adapting to Workload Changes Through On-The-Fly Reconfiguration (2006) (3)
Multiagent Learning Paradigms (2017) (3)
Autonomous Ground Navigation in Highly Constrained Spaces: Lessons Learned From the Benchmark Autonomous Robot Navigation Challenge at ICRA 2022 [Competitions] (2022) (3)
Interaction and Autonomy in RoboCup@Home and Building-Wide Intelligence. (2018) (3)
JANET: A report on its use for libraries (1990) (3)
Automatic Heuristic Construction for General Game Playing (2006) (3)
iCORPP: Interleaved Commonsense Reasoning and Probabilistic Planning on Robots (2020) (3)
Towards Safe Motion Planning in Human Workspaces: A Robust Multi-agent Approach (2021) (3)
Watch Where You’re Going! Gaze and Head Orientation as Predictors for Social Robot Navigation (2021) (3)
Impact of Music on Decision Making in Quantitative Tasks (2016) (3)
Varieties of Indeterminacy (2012) (3)
Defender Strategies In Domains Involving Frequent Adversary Interaction (2015) (3)
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation (2022) (3)
Robust structure-based autonomous color learning on a mobile robot (2007) (3)
5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), Hakodate, Japan, May 8-12, 2006 (2006) (3)
The 2012 UT Austin Villa Code Release (2013) (3)
On linking cognitive mechanisms to game play: A critique of Morikawa, Hanley, and Orbell (2003) (3)
DEALIO: Data-Efficient Adversarial Learning for Imitation from Observation (2021) (3)
RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning (2021) (3)
Adaptive auctions: Learning to adjust to bidders (2005) (3)
Special issue on autonomous agents modelling other agents: Guest editorial (2020) (3)
Person tracking on a mobile robot with heterogeneous inter-characteristic feedback (2008) (3)
Building a Dedicated Robotic Soccer (1996) (3)
Models of human preference for learning reward functions (2022) (3)
Incorporating Gaze into Social Navigation (2021) (2)
Multi-robot Learning for Continuous Area Sweeping (2005) (2)
Advances in Vision , Kinematics , and Strategy (2013) (2)
Who speaks for AI? (2016) (2)
Agent Behaviors for Joining and Leaving a Flock (2017) (2)
Continual Learning and Private Unlearning (2022) (2)
Aux-AIRL: End-to-End Self-Supervised Reward Learning for Extrapolating beyond Suboptimal Demonstrations (2021) (2)
Person recognition on a Segway Robot: A video of UT Austin Villa Robocup@Home 2007 finals demonstration (2008) (2)
Autonomous Ground Navigation in Highly Constrained Spaces: Lessons learned from The BARN Challenge at ICRA 2022 (2022) (2)
Reasoning about Human Behavior in Ad Hoc Teamwork (2021) (2)
Task-Independent Causal State Abstraction (2021) (2)
State Abstraction Synthesis for Discrete Models of Continuous Domains (2018) (2)
Generalizing Curricula for Reinforcement Learning (2020) (2)
Human versus Machine Attention in Deep Reinforcement Learning Tasks (2020) (2)
Learning and Reasoning for Robot Dialog and Navigation Tasks (2020) (2)
Conflict Avoidance in Social Navigation -- a Survey (2021) (2)
Capturing Skill State in Curriculum Learning for Human Skill Acquisition∗ (2021) (2)
On the Impact of Music on Decision Making in Cooperative Tasks (2018) (2)
State Aggregation through Reasoning in Answer Set Programming (2016) (2)
Layered Learning in Multiagent (1997) (2)
Interactive shaping of a tetris agent using the TAMER framework (2009) (2)
An Imitation from Observation Approach to Sim-to-Real Transfer (2020) (2)
Comparing Two Action Planning Approaches for Color Learning on a Mobile Robot (2008) (2)
Optimizing Interdependent Skills for Simulated 3D Humanoid Robot Soccer (2011) (2)
Intelligent Disobedience and AI Rebel Agents in Assistive Robotics (2021) (1)
Multi-robot planning with conflicts and synergies (2019) (1)
Ship patrol: multiagent patrol under complex environmental conditions (2011) (1)
Adversarial Imitation Learning from Video Using a State Observer (2022) (1)
Adversarial Goal Generation for Intrinsic Motivation (2018) (1)
Intersections of the Future: Using Fully Autonomous Vehicles (2011) (1)
Darpa Urban Challenge Technical Report Executive Summary (1)
Inverse Kinematics Kicking in the Humanoid RoboCup Simulation League (2012) (1)
DynaBARN: Benchmarking Metric Ground Navigation in Dynamic Environments (2022) (1)
What's hot at RoboCup (extended abstract) (2016) (1)
Ten years of autonomous agents and multiagent systems (2012) (1)
Agents teaching agents: a survey on inter-agent transfer learning (2019) (1)
RoboCup 2021 Worldwide: A Successful Robotics Competition During a Pandemic [Competitions] (2021) (1)
Learning a Robust Multiagent Driving Policy for Traffic Congestion Reduction (2021) (1)
Model-Selection for Non-parametric Function Approximation in Continuous Control Problems: A Case Study in a Smart Energy System (2013) (1)
Multiagent Epidemiologic Inference through Realtime Contact Tracing (2021) (1)
Collaborative learning agents : papers from the 2002 AAAI Symposium, March 25-27, Stanford, California (2002) (1)
Adaptive Tile Coding for Reinforcement Learning (2006) (1)
A Scavenger Hunt for Service Robots (2021) (1)
TT-UT Austin Villa 2010 Team Description Paper for the Standard Platform League (2010) (1)
Biconnected Structure for Multi-Robot Systems (2006) (1)
Reducing Sampling Error in Policy Gradient Learning (2019) (1)
Planning Actions to Enable Color Learning on a Mobile Robot (2007) (1)
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2 (2011) (1)
Layered Extrospe tion : Why is the agent doing what it ' s doing ? (1999) (1)
Efficient Real-Time Inference in Temporal Convolution Networks (2021) (1)
Towards Autonomic Computing: Adaptive Job Routing and Scheduling (2004) (1)
Simultaneous Learning and Reshaping of an Approximated Optimization Task (2013) (1)
Online model learning in adversarial Markov decision processes (2010) (1)
VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors (2022) (1)
Designing adaptive trading agents (2011) (1)
Learning Real-world Autonomous Navigation by Self-Supervised Environment Synthesis (2022) (1)
Detecting Motion in the World with a Moving Quadruped Robot (2005) (1)
Query Content in Sequential One-shot Multi-Agent Limited Inquiries when Communicating in Ad Hoc Teamwork (2020) (1)
Offline training of multi-agent reinforcement agents for grid-interactive buildings control (2022) (1)
Learning and Multiagent Reasoning for Autonomous Agents IJCAI-07 Computers and Thought Paper (2007) (1)
D Simulation Base Code Release (2016) (1)
The Concept of Picking (2011) (1)
A Broader, More Inclusive Definition of AI (2020) (1)
On Linking Cognitive Mechanisms to Game Play (2003) (1)
Relaxation therapy. (2008) (1)
Pi and the Movie Mind (2007) (1)
Allotted chambers as defenders of democracy (2021) (1)
VIOLA: Object-Centric Imitation Learning for Vision-Based Robot Manipulation (2022) (1)
DEEP REINFORCEMENT LEARNING IN PARAMETER- IZED ACTION SPACE (2016) (1)
B Robotic Soccer Agent Skills (2000) (0)
October 2018 PRISM : Pose Registration for Integrated Semantic Mapping (2018) (0)
Is the Cerebellum a Model-Based Reinforcement Learning Agent? (2021) (0)
Ad hoc Teamwork and Moral Feedback as a Framework for Safe Agent Behavior (2018) (0)
Robocup-99 Team Descriptions the Cmunited-99 Simulator Team (1999) (0)
Evaluation with an Estimated Behavior Policy (2019) (0)
A Protocol for Multi-Agent Traffic Control at Intersections (0)
ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning (2022) (0)
Experimental Methods and Strategic Analysis (2007) (0)
UT Austin Villa 3D Simulation Soccer Team 2014 (2014) (0)
The CMUnited-99 Simulator Team CMUnited 99 (1999) (0)
Learning Perceptual Hallucination for Multi-Robot Navigation in Narrow Hallways (2022) (0)
62 When EPR fixed the protocol problem (2021) (0)
Frugal Forests : Learning a Dynamic and Cost Sensitive Feature Extraction Policy for Anytime Activity Classification (2017) (0)
Foreign judgments (2021) (0)
The TAC Travel-Shopping Game (2007) (0)
Orienting a flock via ad hoc teamwork (2014) (0)
Preprint 0 (2000) 1-36 1 The First International Trading Agent Competition: (2000) (0)
Task planning in robotics: an empirical comparison of PDDL- and ASP-based systems (2019) (0)
Multiagent learning in the presence of memory-bounded agents (2013) (0)
Learning an Individual Skill (2000) (0)
Extended Abstract: Motion Planners Learned from Geometric Hallucination (2020) (0)
Comparing Human and AI Attention in Visuomotor Tasks (2021) (0)
CC-Log: Drastically Reducing Storage Requirements for Robots Using Classification and Compression (2017) (0)
Extended Abstract: Safe Learning from Hallucination for Navigation in the Wild (2021) (0)
Role selection in ad hoc teamwork (2012) (0)
Long-Term vs. Greedy Action Planning for Color Learning on a Mobile Robot (2008) (0)
Invited Talk: PRISM - Practical RL: Representation, Interaction, Synthesis, and Mortality (2011) (0)
Learning to Correct Mistakes: Backjumping in Long-Horizon Task and Motion Planning (2022) (0)
Predictive Memory for an Inaccessible EnvironmentMike Bowling (1996) (0)
Learning a Multiagent Behavior (2000) (0)
Book announcement: autonomous bidding agents (2008) (0)
A Task Specification Language for Bootstrap Learning (Extended Abstract) (2009) (0)
THE NEW ZEALAND UNDERGRADUATE OBSTETRICS AND GYNAECOLOGY CURRICULUM (2019) (0)
Ten Years of AAMAS: Introduction to the Special Issue (2012) (0)
Ad Hoc Teamwork in Variations of the Pursuit Domain (2011) (0)
The Art of Jacques Lipchitz (2013) (0)
Designing Defender Strategies Against Frequent Adversary Interaction (2015) (0)
Communicating with Unknown Teammates (Extended Abstract) (2014) (0)
Bidding in Interdependent Markets (2007) (0)
Integrated Task and Motion Planning for Mobile Service Robots (2010) (0)
C CMUnited-98 Simulator Team Behavior Modes (2000) (0)
UT Austin Villa@Home 2022 Team Description Paper (2021) (0)
ON SAMPLING ERROR IN BATCH ACTION-VALUE PREDICTION ALGORITHMS (2020) (0)
The Old, the New, and the Eternal (2013) (0)
Learning a Team Behavior (2000) (0)
Multiagent Traffic Management: Driver Agent Improvements A nd A Protocol for Intersection Control (2005) (0)
EXTENDED TABLE OF CONTENTS (2018) (0)
Randomized Strategic Demand Reduction (2002) (0)
TaskPlanning inRobotics : anEmpiricalComparison of PDDL-based andASP-based Systems (2019) (0)
DM2: Distributed Multi-Agent Reinforcement Learning for Distribution Matching (2022) (0)
How Humans Teach Agents (2012) (0)
Challenges and Opportunities of Applying Reinforcement Learning to Autonomous Racing (2022) (0)
Safe Evaluation For Offline Learning: Are We Ready To Deploy? (2022) (0)
List of acronyms (2000) (0)
Market-Specific Bidding Strategies (2007) (0)
@ Home 2018 DSPL Team Description Paper (2018) (0)
Table of Statutory Instruments (2006) (0)
New Results - Life-Long Robot Learning and Development of Motor and Social Skills (2013) (0)
MAINTENANCE AND PROPERTY (2018) (0)
Provisional measures and taking evidence (2010) (0)
MAIN INSOLVENCY PROCEEDINGS (2018) (0)
D CMUnited Simulator Team Source Code (2000) (0)
DM$^2$: Decentralized Multi-Agent Reinforcement Learning for Distribution Matching (2022) (0)
Towards a Real-Time, Low-Resource, End-to-end Object Detection Pipeline for Robot Soccer (2022) (0)
Monte Carlo Hierarchical Model Learning: (Doctoral Consortium) (2015) (0)
Behavior Policy Gradient Supplemental Material (2017) (0)
JEWS IN ENGLISH ART (2013) (0)
Multimodal embodied attribute learning by robots for object-centric action policies (2023) (0)
Presentation: Autonomous Robots Playing Soccer and Traversing Intersections (2010) (0)
RoboCup-2001 Engineering Challenge Award Fast Object Detection in Middle-Size RoboCup (2002) (0)
UvA-DARE ( Digital Academic Repository ) Extending virtual robots towards RoboCup Soccer Simulation (2012) (0)
The 2002 AAAI Spring Symposium Series (2002) (0)
Lifelong Learning for Navigation (2020) (0)
Advances in Adding Influencing Agents to a Flock (2015) (0)
Domestic Interaction on a Segway Base with Heterogeneous Inter-Classifier Feedback (2008) (0)
Machine Learning and Adaptivity (2007) (0)
Personalized agents : papers from the 2002 AAAI Fall Symposium, November 15-17, North Falmouth, Massachusetts (2002) (0)
Hanging Out with Russell, Brando and Lennon (2005) (0)
9. Pottery of Building A (2014) (0)
Robot Behavioral Exploration and Multi-modal Perception using Dynamically Constructed Controllers (2018) (0)
Work-in-progress: Corrected Self Imitation Learning via Demonstrations (2020) (0)
ATT-CMUnited-2000 : Third Pla e Finisher inthe Robo up-2000 Simulator (2001) (0)
C ONTINUAL L EARNING AND P RIVATE U NLEARNING (2022) (0)
Orienting a Flock via Ad Hoc Teamwork ( Extended Abstract) (2014) (0)
The Idea of Sortition (2011) (0)
Expectation-Based Vision for Self-Localization on a Legged Robot (2006) (0)
Designing Incentives for Boolean Games (2011) (0)
HR-TD: A Regularized TD Method to Avoid Over-Generalization (2018) (0)
Visually Adaptive Geometric Navigation (2022) (0)
Multiagent Pathfinding, Papers from the 2012 AAAI Workshop, MAPF@AAAI 2012, Toronto, Ontario, Canada, July 22, 2012 (2012) (0)
SOME PARTICULAR TORTS (2018) (0)
Goal Blending for Responsive Shared Autonomy in a Navigating Vehicle (2021) (0)
D-Shape: Demonstration-Shaped Reinforcement Learning via Goal Conditioning (2022) (0)
TacTex-05: Winner of the 2005 trading agent competition - Supply chain management scenario (2005) (0)
Representation Transfer via Elaboration (2007) (0)
The Sequential Online Chore Division Problem - Definition and Application (2020) (0)
Sophomoric (2010) (0)
The UT Austin Villa 2003 Legged Robot Team (2013) (0)
Russell Needed to Write (2016) (0)
Alfred Cohen—Atmospheric Expressionist (2013) (0)
Table of statutes (2006) (0)
TABLE OF LEGISLATION (2018) (0)
“An Aristotle’s Eye View” (2010) (0)
The Implications of Impartiality (2011) (0)
OTHER INSOLVENCY PROCEEDINGS (2018) (0)
Russell in the Philippines (2003) (0)
Table of EU directives and regulations (2006) (0)
Sortition and Mini-Publics (2020) (0)
Preface: (2022) (0)
Other family matters (2010) (0)
Government's AI principles overlook two important issues (2020) (0)
Recognition and Enforcement of Judgements (2006) (0)
Russell the Political Theorist (2000) (0)
Book Review: The healing sun (2005) (0)
Autonomous Learning Agents: Layered Learning and Ad Hoc Teamwork (2016) (0)
Multi-Agent Social Simulation (2010) (0)
Creating a sustainable business for the 21st century (2012) (0)
Efficient Robot Skill Learning: Grounded Simulation Learning and Imitation Learning from Observation (2021) (0)
Zombie Movie Morals (2013) (0)
Relevance-Weighted Action Selection in MDP ’ s (2005) (0)
Table of Treaties (2018) (0)
Familial Maintenance and Matrimonial Property (2006) (0)
Power: A New Social Analysis (2006) (0)
History, Outline and Scope (2010) (0)
Sequential Online Chore Division for Autonomous Vehicle Convoy Formation (2021) (0)
Research Summary and Plans (0)
Ucko, Peter (Indigenous Archaeology) (2020) (0)
‘Guaranteed Rotation in Office’: A Comment (2015) (0)
Adaptation of Surrogate Tasks for Bipedal Walk Optimization (2016) (0)
OTHER LEGISLATION ON JUDGMENTS (2018) (0)
The Negative and the Positive Side of Democratic Institutional Design (2016) (0)
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3 (2011) (0)
Recognition and enforcement of judgments (2010) (0)
SUCCESSION ON DEATH (2018) (0)
Theorizing Presidential Rotation (2019) (0)
Contractual issues and exceptions (2014) (0)
Curriculum Development for Transfer Learning in Dynamic Multiagent Settings (2016) (0)
Team Member Agent Architecture (2000) (0)
Intelligent Autonomous Robotics (2007) (0)
Agent Mediated Electronic Commerce IV : Designing Mechanisms and Systems , Springer Verlag , 2002 . ATTac-2001 : A Learning , Autonomous Bidding Agent (2015) (0)
Bidding with Price Predictions (2007) (0)
Foreword (1992) (0)
D Simulation Soccer Team 2013 (2013) (0)
Early community engagement provides a strong foundation for developing trust—a case study (2014) (0)
Training Champion-level Race Car Drivers Using Deep Reinforcement Learning (2021) (0)
An Unmanaged Intersection Protocol and Improved Intersection Safety for Autonomous Vehicles (2009) (0)
RoboCup 2018 3 D Simulation League Champions (2019) (0)
TEXPLORE: real-time sample-efficient reinforcement learning for robots (2012) (0)
In AAAI Fall Symposium on Artificial Intelligence and Human-Robot Interaction for Service Robots in Human Environments (2019) (0)
Holistic Action Transform Harsh Goyal Supervisors : (2019) (0)
Senior Program Committee Members (2004) (0)
Cognitive Robotics - Papers from the AAAI Workshop, Technical Report (2006) (0)
Multiagent Driving Policy for Congestion Reduction in a Large Scale Scenario (2020) (0)
Expectation-Based Vision for Precise Self-Localization on a Mobile Robot (2006) (0)
Drop-in games at RoboCup (2014) (0)
Task Phasing: Automated Curriculum Learning from Demonstrations (2022) (0)
AAAI-14 preface (2014) (0)
A Neuroevolution Approach to General (2014) (0)
Transfer Learning in Integrated Cognitive Systems (2010) (0)
Learning Complementary Multiagent Behaviors: a Case Study (Extended Abstract) (2009) (0)
Autonomous Model Management via Reinforcement Learning: Extended Abstract (2017) (0)
Role Selection in Ad Hoc Teamwork (Extended Abstract) (2012) (0)

This paper list is powered by the following services:

Other Resources About Peter Stone

What Schools Are Affiliated With Peter Stone ?

Peter Stone is affiliated with the following schools:

Image Attributions

Image Source for Peter Stone