Towards Value Alignment for Opaque Agents Through Concept Analysis and Inter-Agent Value Modelling

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

Abstract

Value alignment, ensuring that artificial intelligence acts in ways aligned with humans, is a key challenge in the deployment of AI systems and an important aspect in the development of hybrid intelligence. A notable issue in value alignment is the lack of precision around what value alignment actually is and how to evaluate its success. Another issue is the fact that multiple agents are involved in value alignment, but these agents' values may not be easily understood or expressed. In order for autonomous agents to readily adapt to these agents' needs, and to promote interpretability of the system, an evidence-based means to assess the value priorities of agents is required. In my PhD, I am proposing a precise definition of value alignment through a qualitative concept analysis, synthesising themes from across the literature to produce a cohesive description. In addition, I am developing a novel mechanism for inter-agent value inference to enable agents to assess other agents' values in an interpretable way. My PhD will generate a cohesive description of the value alignment problem, identifying sub-problems for further research, and build a means for autonomous agents to perform value inference in pursuit of achieving value alignment.

Original languageEnglish
Title of host publicationHHAI 2024
Subtitle of host publicationHybrid Human AI Systems for the Social Good - Proceedings of the 3rd International Conference on Hybrid Human-Artificial Intelligence
EditorsFabian Lorig, Jason Tucker, Adam Dahlgren Lindstrom, Frank Dignum, Pradeep Murukannaiah, Andreas Theodorou, Pinar Yolum
Place of PublicationNetherlands
PublisherIOS Press BV
Pages386-393
Number of pages8
ISBN (Electronic)9781643685229
DOIs
Publication statusPublished - 5 Jun 2024
Event3rd International Conference on Hybrid Human-Artificial Intelligence, HHAI 2024 - Hybrid, Malmo, Sweden
Duration: 10 Jun 202414 Jun 2024

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume386
ISSN (Print)0922-6389
ISSN (Electronic)1879-8314

Conference

Conference3rd International Conference on Hybrid Human-Artificial Intelligence, HHAI 2024
Country/TerritorySweden
CityHybrid, Malmo
Period10/06/2414/06/24

Funding

FundersFunder number
UKRI CDT in Accountable, Responsible and Transparent AIEP/S023437/1

    Keywords

    • Value Alignment
    • Value Inference
    • Agent Modelling

    ASJC Scopus subject areas

    • Artificial Intelligence

    Fingerprint

    Dive into the research topics of 'Towards Value Alignment for Opaque Agents Through Concept Analysis and Inter-Agent Value Modelling'. Together they form a unique fingerprint.

    Cite this