Authors

* External authors

Date

A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

Megan M. Baker*

Alexander New*

Mario Aguilar-Simon*

Ziad Al-Halah*

Sébastien M. R. Arnold*

Ese Ben-Iwhiwhu*

Andrew P. Brna*

Ethan Brooks*

Ryan C. Brown*

Zachary Daniels*

Anurag Daram*

Fabien Delattre*

Ryan Dellana*

Eric Eaton*

Haotian Fu*

Kristen Grauman*

Jesse Hostetler*

Shariq Iqbal*

Cassandra Kent*

Nicholas Ketz*

Soheil Kolouri*

George Konidaris*

Dhireesha Kudithipudi*

Seungwon Lee*

Michael L. Littman*

Sandeep Madireddy*

Jorge A. Mendez*

Eric Q. Nguyen*

Christine D. Piatko*

Praveen K. Pilly*

Aswin Raghavan*

Abrar Rahman*

Santhosh Kumar Ramakrishnan*

Neale Ratzlaff*

Andrea Soltoggio*

Peter Stone

Indranil Sur*

Zhipeng Tang*

Saket Tiwari*

Kyle Vedder*

Felix Wang*

Zifan Xu*

Angel Yanguas-Gil*

Harel Yedidsion*

Shangqun Yu*

Gautam K. Vallabha*

2023

Abstract

Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to “real world” events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed. This critical gap may be addressed through the development of “Lifelong Learning” systems that are capable of (1) Continuous Learning, (2) Transfer and Adaptation, and (3) Scalability. Unfortunately, efforts to improve these capabilities are typically treated as distinct areas of research that are assessed independently, without regard to the impact of each separate capability on other aspects of the system. We instead propose a holistic approach, using a suite of metrics and an evaluation framework to assess Lifelong Learning in a principled way that is agnostic to specific domains or system techniques. Through five case studies, we show that this suite of metrics can inform the development of varied and complex Lifelong Learning systems. We highlight how the proposed suite of metrics quantifies performance trade-offs present during Lifelong Learning system development — both the widely discussed Stability-Plasticity dilemma and the newly proposed relationship between Sample Efficient and Robust Learning. Further, we make recommendations for the formulation and use of metrics to guide the continuing development of Lifelong Learning systems and assess their progress in the future.

Related Publications

Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning

AAAI, 2024
Zizhao Wang*, Caroline Wang*, Xuesu Xiao*, Yuke Zhu*, Peter Stone

Two desiderata of reinforcement learning (RL) algorithms are the ability to learn from relatively little experience and the ability to learn policies that generalize to a range of problem specifications. In factored state spaces, one approach towards achieving both goals is …

Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents

AAAI, 2024
Arrasy Rahman*, Jiaxun Cui*, Peter Stone

Robustly cooperating with unseen agents and human partners presents significant challenges due to the diverse cooperative conventions these partners may adopt. Existing Ad Hoc Teamwork (AHT) methods address this challenge by training an agent with a population of diverse tea…

Learning Optimal Advantage from Preferences and Mistaking it for Reward

AAAI, 2024
W. Bradley Knox*, Stephane Hatgis-Kessell*, Sigurdur Orn Adalgeirsson*, Serena Booth*, Anca Dragan*, Peter Stone, Scott Niekum*

We consider algorithms for learning reward functions from human preferences over pairs of trajectory segments---as used in reinforcement learning from human feedback (RLHF)---including those used to fine tune ChatGPT and other contemporary language models. Most recent work o…
