TY - GEN
T1 - One-Shot Learning of Autonomous Behaviour
T2 - 4th International Joint Conference on Learning and Reasoning, IJCLR 2024, and 33rd International Conference on Inductive Logic Programming, ILP 2024
AU - Varghese, Dany
AU - Cyrus, Daniel
AU - Patsantzis, Stassa
AU - Trewern, James
AU - Treloar, Alfie Anthony
AU - Hunter, Alan
AU - Tamaddoni-Nezhad, Alireza
N1 - .
PY - 2025/11/21
Y1 - 2025/11/21
N2 - “One-shot learning" traditionally refers to classifying a single instance using a machine learning model pre-trained on extensive datasets. In contrast, Inductive Logic Programming (ILP) approaches such as Meta-Interpretive Learning (MIL) and Meta Inverse Entailment (MIE), can generate complex logic programs from just a single positive example and minimal background knowledge without prior extensive training. This approach offers a human-centred form of machine learning that is more controllable, reliable, and comprehensible due to its small training data size and the inherent interpretability of logic programs. We use PyGol, a Python-based implementation of Meta Inverse Entailment, and compare its performance with ExpGen-PPO in learning autonomous behaviour. ExpGen-PPO is a state-of-the-art reinforcement learning framework designed to address the challenge of generalisation across diverse tasks through experience diversification and robust policy optimisation. Our experiments focus on two domains: maze-solving and obstacle avoidance for mobile robotics. In both domains, we first train the systems in simplified environments without obstacles and then test their ability to generalise to more complex environments with obstacles. Our results show that PyGol effectively learns generalisable solutions from a single example in both domains, whereas ExpGen-PPO requires more training and significantly more exploration to achieve similar performance.
AB - “One-shot learning" traditionally refers to classifying a single instance using a machine learning model pre-trained on extensive datasets. In contrast, Inductive Logic Programming (ILP) approaches such as Meta-Interpretive Learning (MIL) and Meta Inverse Entailment (MIE), can generate complex logic programs from just a single positive example and minimal background knowledge without prior extensive training. This approach offers a human-centred form of machine learning that is more controllable, reliable, and comprehensible due to its small training data size and the inherent interpretability of logic programs. We use PyGol, a Python-based implementation of Meta Inverse Entailment, and compare its performance with ExpGen-PPO in learning autonomous behaviour. ExpGen-PPO is a state-of-the-art reinforcement learning framework designed to address the challenge of generalisation across diverse tasks through experience diversification and robust policy optimisation. Our experiments focus on two domains: maze-solving and obstacle avoidance for mobile robotics. In both domains, we first train the systems in simplified environments without obstacles and then test their ability to generalise to more complex environments with obstacles. Our results show that PyGol effectively learns generalisable solutions from a single example in both domains, whereas ExpGen-PPO requires more training and significantly more exploration to achieve similar performance.
KW - Autonomous Systems
KW - Meta Inverse Entailment (MIE)
KW - Meta-Interpretive Learning (MIL)
KW - One-shot learning
KW - PyGol
KW - Reinforcement Learning
UR - https://www.scopus.com/pages/publications/105023488215
U2 - 10.1007/978-3-032-09087-4_4
DO - 10.1007/978-3-032-09087-4_4
M3 - Chapter in a published conference proceeding
AN - SCOPUS:105023488215
SN - 9783032090867
T3 - Lecture Notes in Computer Science
SP - 48
EP - 65
BT - Learning and Reasoning - 4th International Joint Conference on Learning and Reasoning, IJCLR 2024, and 33rd International Conference on Inductive Logic Programming, ILP 2024, Proceedings
A2 - Dai, Wang-Zhou
PB - Springer
CY - Cham, Switzerland
Y2 - 20 September 2024 through 22 September 2024
ER -