Worrisome Properties of Neural Network Controllers and Their Symbolic Representations

Cyranka, Jacek; Church, Kevin E.M.; Lessard, Jean-Philippe

doi:10.3233/FAIA230311

Worrisome Properties of Neural Network Controllers and Their Symbolic Representations

Authors

Jacek Cyranka, Kevin E.M. Church, Jean-Philippe Lessard

Pages

517 - 525

DOI

10.3233/FAIA230311

Category

Research Article

Series

Frontiers in Artificial Intelligence and Applications

Ebook

Volume 372: ECAI 2023

Abstract

We raise concerns about controllers’ robustness in simple reinforcement learning benchmark problems. We focus on neural network controllers and their low neuron and symbolic abstractions. A typical controller reaching high mean return values still generates an abundance of persistent low-return solutions, which is a highly undesirable property, easily exploitable by an adversary. We find that the simpler controllers admit more persistent bad solutions. We provide an algorithm for a systematic robustness study and prove existence of persistent solutions and, in some cases, periodic orbits, using a computer-assisted proof methodology.

This website uses cookies

This website uses cookies