Abstract
Developing reinforcement learning agents that can generalise effectively to new tasks is one of the main challenges in AI research. This paper introduces Fracture Cluster Options (FraCOs), a multi-level hierarchical reinforcement learning method designed to improve generalisation performance. FraCOs identifies patterns in agent behaviour and forms temporally-extended actions (options) based on the expected future usefulness of those patterns, enabling rapid adaptation to new tasks. In tabular settings, FraCOs demonstrates effective transfer and improves performance as the depth of the hierarchy increases. In several complex procedurally-generated environments, FraCOs consistently outperforms state-of-the-art deep reinforcement learning algorithms, achieving superior results in both in-distribution and out-of-distribution scenarios.
| Original language | English |
|---|---|
| Title of host publication | The 13h International Conference on Learning Representations |
| Place of Publication | Singapore |
| Publisher | International Conference on Learning Representations, ICLR |
| Pages | 51284 - 51326 |
| Number of pages | 43 |
| ISBN (Electronic) | 9798331320850 |
| Publication status | Published - 28 Apr 2025 |
Keywords
- Reinforcement Learning
- Generalisation
- Hierarchical reinforcement learning
Fingerprint
Dive into the research topics of 'Accelerating Task Generalisation with Multi-Level Skill Hierarchies'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS