cakcora / MachineBehavior

Reading list for UTD Data Privacy and Security Lab

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Machine Behavior

Reading list for UT Dallas Data Privacy and Security Lab (CS) and Data Insights Lab (Statistics)

1- Machine behaviour by Rahwan et al. Nature 2019, 10 pages. https://www.nature.com/articles/s41586-019-1138-y.pdf

Machines powered by artificial intelligence increasingly mediate our social, cultural, economic and political interactions. Understanding the behaviour of artificial intelligence systems is essential to our ability to control their actions, reap their benefits and minimize their harms. Here we argue that this necessitates a broad scientific research agenda to study machine behaviour that incorporates and expands upon the discipline of computer science and includes insights from across the sciences. We first outline a set of questions that are fundamental to this emerging field and then explore the technical, legal and institutional constraints on the study of machine behaviour.

2- Explaining Explanations: An Overview of Interpretability of Machine Learning by Gilpin et al. 10 pages. https://arxiv.org/pdf/1806.00069.pdf

There has recently been a surge of work in explanatory artificial intelligence (XAI). This research area tackles the important problem that complex machines and algorithms often cannot provide insights into their behavior and thought processes. XAI allows users and parts of the internal system to be more transparent, providing explanations of their decisions in some level of detail. These explanations are important to ensure algorithmic fairness, identify potential bias/problems in the training data, and to ensure that the algorithms perform as expected. However, explanations produced by these systems is neither standardized nor systematically assessed. In an effort to create best practices and identify open challenges, we describe foundational concepts of explainability and show how they can be used to classify existing literature. We discuss why current approaches to explanatory methods especially for deep neural networks are insufficient. Finally, based on our survey, we conclude with suggested future research directions for explanatory artificial intelligence.

About

Reading list for UTD Data Privacy and Security Lab