Citation

BibTeX format

@inproceedings{Ward:2022,
author = {Ward, F and Toni, F and Belardinelli, F},
pages = {1--16},
publisher = {CEUR Workshop Proceedings},
title = {A causal perspective on AI deception in games},
url = {https://ceur-ws.org/Vol-3193/paper2CAUSAL.pdf},
year = {2022}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB  - Deception is a core challenge for AI safety and we focus on the problem that AI agents might learn deceptive strategies in pursuit of their objectives. We define the incentives one agent has to signal to and deceive another agent. We present several examples of deceptive artificial agents and show that our definition has desirable properties.
AU  - Ward,F
AU  - Toni,F
AU  - Belardinelli,F
EP  - 16
PB  - CEUR Workshop Proceedings
PY  - 2022///
SP  - 1
TI  - A causal perspective on AI deception in games
UR  - https://ceur-ws.org/Vol-3193/paper2CAUSAL.pdf
UR  - http://hdl.handle.net/10044/1/104464
ER  - 
