Awesome Responsible AI

A curated list of awesome academic research, books, codes of ethics, courses, data sets, frameworks, institutes, newsletters, principles, podcasts, reports, tools, regulations and standards related to Responsible, Trustworthy, and Human-Centered AI.

Main Concepts

What is AI Governance?

AI governance is a system of rules, processes, frameworks, and tools within an organization to ensure the ethical and responsible development of AI.

What is Human-Centered AI?

Human-Centered Artificial Intelligence (HCAI) is an approach to AI development that prioritizes human users' needs, experiences, and well-being.

What is Open Source AI?

When we refer to a “system,” we are speaking both broadly about a fully functional structure and its discrete structural elements. To be considered Open Source, the requirements are the same, whether applied to a system, a model, weights and parameters, or other structural elements.

An Open Source AI is an AI system made available under terms and in a way that grant the freedoms to:

Use the system for any purpose and without having to ask for permission.
Study how the system works and inspect its components.
Modify the system for any purpose, including to change its output.
Share the system for others to use with or without modifications, for any purpose.

Source

What is Responsible AI?

Responsible AI (RAI) refers to the development, deployment, and use of artificial intelligence (AI) systems in ways that are ethical, transparent, accountable, and aligned with human values.

What is a Responsible AI framework?

Responsible AI frameworks often encompass guidelines, principles, and practices that prioritize fairness, safety, and respect for individual rights.

What is Trustworthy AI?

Trustworthy AI (TAI) refers to artificial intelligence systems designed and deployed to be transparent, robust and respectful of data privacy.

Why is Responsible, Trustworthy, and Human-Centered AI important?

AI is a transformative technology poised to reshape industries, yet it requires careful governance to balance the benefits of automation and insight with protections against unintended social, economic, and security impacts. You can read more about the current wave here.

Academic Research

Evaluation (of model explanations)

Agarwal, C., Krishna, S., Saxena, E., Pawelczyk, M., Johnson, N., Puri, I., ... & Lakkaraju, H. (2022). Openxai: Towards a transparent evaluation of model explanations. Advances in Neural Information Processing Systems, 35, 15784-15799. Article
Liesenfeld, A., and Dingemanse, M. (2024). Rethinking Open Source Generative AI: Open-Washing and the EU AI Act. In The 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT ’24). Rio de Janeiro, Brazil: ACM. Article Benchmark

Bias

Schwartz, R., Vassilev, A., Greene, K., Perine, L., Burt, A., & Hall, P. (2022). Towards a standard for identifying and managing bias in artificial intelligence (NIST Special Publication 1270). US Department of Commerce, National Institute of Standards and Technology. Article NIST

Challenges

D'Amour, A., Heller, K., Moldovan, D., Adlam, B., Alipanahi, B., Beutel, A., ... & Sculley, D. (2022). Underspecification presents challenges for credibility in modern machine learning. Journal of Machine Learning Research, 23(226), 1-61. Article Google

Drift

Ackerman, S., Dube, P., Farchi, E., Raz, O., & Zalmanovici, M. (2021, June). Machine learning model drift detection via weak data slices. In 2021 IEEE/ACM Third International Workshop on Deep Learning for Testing and Testing for Deep Learning (DeepTest) (pp. 1-8). IEEE. Article IBM
Ackerman, S., Raz, O., & Zalmanovici, M. (2020, February). FreaAI: Automated extraction of data slices to test machine learning models. In International Workshop on Engineering Dependable and Secure Machine Learning Systems (pp. 67-83). Cham: Springer International Publishing. Article IBM

Explainability

Dhurandhar, A., Chen, P. Y., Luss, R., Tu, C. C., Ting, P., Shanmugam, K., & Das, P. (2018). Explanations based on the missing: Towards contrastive explanations with pertinent negatives. Advances in neural information processing systems, 31. Article University of Michigan IBM Research
Dhurandhar, A., Shanmugam, K., Luss, R., & Olsen, P. A. (2018). Improving simple models with confidence profiles. Advances in Neural Information Processing Systems, 31. Article IBM Research
Gurumoorthy, K. S., Dhurandhar, A., Cecchi, G., & Aggarwal, C. (2019, November). Efficient data representation by selecting prototypes with importance weights. In 2019 IEEE International Conference on Data Mining (ICDM) (pp. 260-269). IEEE. Article Amazon Development Center IBM Research
Hind, M., Wei, D., Campbell, M., Codella, N. C., Dhurandhar, A., Mojsilović, A., ... & Varshney, K. R. (2019, January). TED: Teaching AI to explain its decisions. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society (pp. 123-129). Article IBM Research
Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Advances in neural information processing systems, 30. Article, Github University of Washington
Luss, R., Chen, P. Y., Dhurandhar, A., Sattigeri, P., Zhang, Y., Shanmugam, K., & Tu, C. C. (2021, August). Leveraging latent features for local explanations. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining (pp. 1139-1149). Article IBM Research University of Michigan
Ribeiro, M. T., Singh, S., & Guestrin, C. (2016, August). "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 1135-1144). Article, Github University of Washington
Wei, D., Dash, S., Gao, T., & Gunluk, O. (2019, May). Generalized linear rule models. In International conference on machine learning (pp. 6687-6696). PMLR. Article IBM Research
Contrastive Explanations Method with Monotonic Attribute Functions (Luss et al., 2019)
Boolean Decision Rules via Column Generation (Light Edition) (Dash et al., 2018) IBM Research
Towards Robust Interpretability with Self-Explaining Neural Networks (Alvarez-Melis et al., 2018) MIT

Fairness

Caton, S., & Haas, C. (2024). Fairness in machine learning: A survey. ACM Computing Surveys, 56(7), 1-38. Article
Chouldechova, A. (2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big data, 5(2), 153-163. Article
Coston, A., Mishler, A., Kennedy, E. H., & Chouldechova, A. (2020, January). Counterfactual risk assessments, evaluation, and fairness. In Proceedings of the 2020 conference on fairness, accountability, and transparency (pp. 582-593). Article
Jesus, S., Saleiro, P., Jorge, B. M., Ribeiro, R. P., Gama, J., Bizarro, P., & Ghani, R. (2024). Aequitas Flow: Streamlining Fair ML Experimentation. arXiv preprint arXiv:2405.05809. Article
Saleiro, P., Kuester, B., Hinkson, L., London, J., Stevens, A., Anisfeld, A., ... & Ghani, R. (2018). Aequitas: A bias and fairness audit toolkit. arXiv preprint arXiv:1811.05577. Article
Vasudevan, S., & Kenthapadi, K. (2020, October). Lift: A scalable framework for measuring fairness in ml applications. In Proceedings of the 29th ACM international conference on information & knowledge management (pp. 2773-2780). Article LinkedIn

Ethical Data Products

Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J. W., Wallach, H., Daumé III, H., & Crawford, K. (2021). Datasheets for datasets. Communications of the ACM, 64(12), 86-92. Article Google
Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., Hutchinson, B., ... & Gebru, T. (2019, January). Model cards for model reporting. In Proceedings of the conference on fairness, accountability, and transparency (pp. 220-229). Article Google
Pushkarna, M., Zaldivar, A., & Kjartansson, O. (2022, June). Data cards: Purposeful and transparent dataset documentation for responsible ai. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency (pp. 1776-1826). Article Google
Rostamzadeh, N., Mincu, D., Roy, S., Smart, A., Wilcox, L., Pushkarna, M., ... & Heller, K. (2022, June). Healthsheet: development of a transparency artifact for health datasets. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency (pp. 1943-1961). Article Google
Saint-Jacques, G., Sepehri, A., Li, N., & Perisic, I. (2020). Fairness through Experimentation: Inequality in A/B testing as an approach to responsible design. arXiv preprint arXiv:2002.05819. Article LinkedIn

Sustainability

Lacoste, A., Luccioni, A., Schmidt, V., & Dandres, T. (2019). Quantifying the carbon emissions of machine learning. arXiv preprint arXiv:1910.09700. Article
P. Li, J. Yang, M. A. Islam, S. Ren, (2023) Making AI Less “Thirsty”: Uncovering and Addressing the Secret Water Footprint of AI Models, arXiv:2304.03271 Article
Parcollet, T., & Ravanelli, M. (2021). The energy and carbon footprint of training end-to-end speech recognizers. Article
Patterson, D., Gonzalez, J., Le, Q., Liang, C., Munguia, L.M., Rothchild, D., So, D., Texier, M. and Dean, J. (2021). Carbon emissions and large neural network training. arXiv preprint arXiv:2104.10350. Article
Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., ... & Dennison, D. (2015). Hidden technical debt in machine learning systems. Advances in neural information processing systems, 28. Article Google
Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., ... & Young, M. (2014, December). Machine learning: The high interest credit card of technical debt. In SE4ML: software engineering for machine learning (NIPS 2014 Workshop) (Vol. 111, p. 112). Article Google
Strubell, E., Ganesh, A., & McCallum, A. (2019). Energy and policy considerations for deep learning in NLP. arXiv preprint arXiv:1906.02243. Article
Sustainable AI: AI for sustainability and the sustainability of AI (van Wynsberghe, A. 2021). AI and Ethics, 1-6
Green Algorithms: Quantifying the carbon emissions of computation (Lannelongue, L. et al. 2020)
C.-J. Wu, R. Raghavendra, U. Gupta, B. Acun, N. Ardalani, K. Maeng, G. Chang, F. Aga, J. Huang, C. Bai, M. Gschwind, A. Gupta, M. Ott, A. Melnikov, S. Candido, D. Brooks, G. Chauhan, B. Lee, H.-H. Lee, K. Hazelwood, Sustainable AI: Environmental implications, challenges and opportunities in Proceedings of the 5th Conference on Machine Learning and Systems (MLSys) (2022) vol. 4, pp. 795–813. Article

Collections

Google Research on Responsible AI: https://research.google/pubs/?collection=responsible-ai Google
Pipeline-Aware Fairness: http://fairpipe.dssg.io

Reproducible/Non-Reproducible Research

Computational reproducibility (when the results in a paper can be replicated using the exact code and dataset provided by the authors) is becoming a significant problem not only for academics but also for practitioners who want to implement AI in their organizations and aim to reuse ideas from academia. Read more about this problem here.
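As a rough illustration of what "exact code and dataset" implies in practice, the sketch below pins the two usual sources of divergence: the data (via a checksum) and the randomness (via a fixed seed). It is a minimal, standard-library-only sketch, not taken from any paper in this list; the function names `dataset_fingerprint`, `run_experiment`, and `is_reproducible`, the toy "experiment", and the seed value are all hypothetical.

```python
import hashlib
import random


def dataset_fingerprint(rows):
    """Return a SHA-256 digest of the rows so others can confirm they hold the same data."""
    digest = hashlib.sha256()
    for row in rows:
        digest.update(repr(row).encode("utf-8"))
    return digest.hexdigest()


def run_experiment(rows, seed):
    """Toy stand-in for an experiment: seeded shuffle, hold out 20%, return the held-out mean."""
    rng = random.Random(seed)  # local generator, so global random state cannot leak in
    shuffled = list(rows)
    rng.shuffle(shuffled)
    cut = max(1, len(shuffled) // 5)
    held_out = shuffled[:cut]
    return sum(held_out) / len(held_out)


def is_reproducible(rows, seed, published_fingerprint, published_result, tol=1e-12):
    """Check that both the data and the rerun result match what the authors published."""
    same_data = dataset_fingerprint(rows) == published_fingerprint
    same_result = abs(run_experiment(rows, seed) - published_result) <= tol
    return same_data and same_result


# Hypothetical "published" artefacts: the data fingerprint, the seed, and the reported number.
data = list(range(100))
fingerprint = dataset_fingerprint(data)
result = run_experiment(data, seed=42)

print(is_reproducible(data, 42, fingerprint, result))          # True
print(is_reproducible(data + [100], 42, fingerprint, result))  # False: the dataset changed
```

Real projects typically also need to pin library versions and hardware-dependent settings, which this sketch does not attempt to cover.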

Books

Open Access

Barrett, M., Gerke, T. & D’Agostino McGowan, L. (2024). Causal Inference in R Book Causal Inference R
Biecek, P., & Burzykowski, T. (2021). Explanatory model analysis: explore, explain, and examine predictive models. Chapman and Hall/CRC. Book Explainability Interpretability Transparency R
Biecek, P. (2024). Adversarial Model Analysis. Book Safety Red Teaming
Cunningham, Scott. (2021) Causal inference: The mixtape. Yale university press. Book Causal Inference
Fourrier, C. et al. (2024) LLM Evaluation Guidebook. Github Repository. Web LLM Evaluation
Freiesleben, T. & Molnar, C. (2024). Supervised Machine Learning for Science: How to stop worrying and love your black box. Book
Matloff, N. et al. (2024) Data Science Looks at Discrimination Book Fairness R
Molnar, C. (2020). Interpretable Machine Learning. Lulu.com. Book Explainability Interpretability Transparency R
Huntington-Klein, Nick. (2021) The effect: An introduction to research design and causality. Chapman and Hall/CRC. Book Causal Inference

Commercial / Proprietary / Closed Access

Trust in Machine Learning (Varshney, K., 2022) Safety Privacy Drift Fairness Interpretability Explainability
Interpretable AI (Thampi, A., 2022) Explainability Fairness Interpretability
AI Fairness (Mahoney, T., Varshney, K.R., Hind, M., 2020) Report Fairness
Practical Fairness (Nielsen, A., 2021) Fairness
Hands-On Explainable AI (XAI) with Python (Rothman, D., 2020) Explainability
AI and the Law (Kilroy, K., 2021) Report Trust Law
Responsible Machine Learning (Hall, P., Gill, N., Cox, B., 2020) Report Law Compliance Safety Privacy
Privacy-Preserving Machine Learning
Human-In-The-Loop Machine Learning: Active Learning and Annotation for Human-Centered AI
Interpretable Machine Learning With Python: Learn to Build Interpretable High-Performance Models With Hands-On Real-World Examples
Responsible AI (Hall, P., Chowdhury, R., 2023) Governance Safety Drift

Code of Ethics

ACS Code of Professional Conduct by Australian ICT (Information and Communication Technology)
AI Standards Hub
Association for Computing Machinery's Code of Ethics and Professional Conduct
IEEE Global Initiative for Ethical Considerations in Artificial Intelligence (AI) and Autonomous Systems (AS)
ISO/IEC's Standards for Artificial Intelligence

Courses

AI Alignment

AI Alignment BlueDot Impact
AI Fast-Track BlueDot Impact

AI Governance

AI Governance BlueDot Impact

Data Sets

Frameworks

A Framework for Ethical Decision Making Markkula Center for Applied Ethics
Data Ethics Canvas Open Data Institute
Deon Python Drivendata
Ethics & Algorithms Toolkit
RAI Toolkit US Department of Defense

Institutes

Ada Lovelace Institute United Kingdom
AI Safety Institutes (or equivalent):
- Canada AISI Canada
- EU AI Office Europe
- Japan AISI Japan
- Korea AISI South Korea
- Singapore AISI Singapore
- UK AISI United Kingdom
- US AISI United States of America
Centre pour la Sécurité de l'IA, CeSIA France
European Centre for Algorithmic Transparency
Center for Human-Compatible AI UC Berkeley United States of America
Center for Responsible AI New York University United States of America
Montreal AI Ethics Institute Canada
Munich Center for Technology in Society (IEAI) TUM School of Social Sciences and Technology Germany
National AI Centre's Responsible AI Network Australia
Open Data Institute United Kingdom
Stanford University Human-Centered Artificial Intelligence (HAI) United States of America
The Institute for Ethical AI & Machine Learning
UNESCO Chair in AI Ethics & Governance IE University Spain
University of Oxford Institute for Ethics in AI University of Oxford United Kingdom

Newsletters

Principles

Allianz's Principles for a responsible usage of AI Allianz
Asilomar AI principles
European Commission's Guidelines for Trustworthy AI
Google's AI Principles Google
IEEE's Ethically Aligned Design IEEE
Microsoft's AI principles Microsoft
OECD's AI principles OECD
Telefonica's AI principles Telefonica
The Institute for Ethical AI & Machine Learning: The Responsible Machine Learning Principles

Additional:

FAIR Principles Findability Accessibility Interoperability Reuse

Podcasts

Reports

AI Governance

Araujo, R. 2024. Understanding the First Wave of AI Safety Institutes: Characteristics, Functions, and Challenges. Institute for AI Policy and Strategy (IAPS) Article
Buchanan, B. 2020. The AI triad and what it means for national security strategy. Center for Security and Emerging Technology. Article
Corrigan, J. et al. 2023. The Policy Playbook: Building a Systems-Oriented Approach to Technology and National Security Policy. CSET (Center for Security and Emerging Technology) Article
Curto, J. 2024. How Can Spain Remain Internationally Competitive in AI under EU Legislation? Article
CSIS. 2024 The AI Safety Institute International Network: Next Steps and Recommendations. CSIS (Center for Strategic and International Studies) Article
Gupta, Ritwik, et al. (2024). Data-Centric AI Governance: Addressing the Limitations of Model-Focused Policies. arXiv preprint arXiv:2409.17216 [Article](https://arxiv.org/pdf/2409.17216)
Hendrycks, D. et al. 2023. An overview of catastrophic AI risks. Center for AI Safety. arXiv preprint arXiv:2306.12001. Article
Janjeva, A., et al. (2023). Strengthening Resilience to AI Risk. A guide for UK policymakers. CETaS (Centre for Emerging Technology and Security) Article
Piattini, M. and Fernández C.M. 2024. Marco Confiable. Revista SIC 162 Article
Sastry, G., et al. 2024. Computing Power and the Governance of Artificial Intelligence. arXiv preprint arXiv:2402.08797. Article

(AI) Incidents databases

Market Analysis

State of AI - from 2018 up to now -
The AI Index Report - from 2017 up to now - Stanford Institute for Human-Centered Artificial Intelligence

Other

Four Principles of Explainable Artificial Intelligence NIST Explainability
Psychological Foundations of Explainability and Interpretability in Artificial Intelligence NIST Explainability
Inferring Concept Drift Without Labeled Data, 2021 Drift
Interpretability, Fast Forward Labs, 2020 Interpretability
Towards a Standard for Identifying and Managing Bias in Artificial Intelligence (NIST Special Publication 1270) NIST Bias
Auditing machine learning algorithms Auditing

Tools

Assessments

The Assessment List for Trustworthy Artificial Intelligence

Bias

balance Python Facebook
smclarify Python Amazon
SolasAI Python

Causal Inference

CausalAI Python Salesforce
CausalNex Python
CausalImpact R
Causalinference Python
Causal Inference 360 Python
CausalPy Python
CIMTx: Causal Inference for Multiple Treatments with a Binary Outcome R
dagitty R
DoWhy Python Microsoft
mediation: Causal Mediation Analysis R
MRPC R

Drift

Alibi Detect Python
Deepchecks Python
drifter R
Evidently Python
nannyML Python
phoenix Python

Fairness

Aequitas' Bias & Fairness Audit Toolkit Python
AI360 Toolkit Python R IBM
dsld: Data Science Looks at Discrimination R
EDFfair: Explicitly Deweighted Features R
EquiPy Python
Fairlearn Python Microsoft
Fairmodels R University of California
fairness R
FairRankTune Python
FairPAN - Fair Predictive Adversarial Network R
OxonFair Python Oxford Internet Institute
Themis ML Python
What-If Tool Python Google

Interpretability/Explicability

Alibi Explain Python
Automated interpretability Python OpenAI
AI360 Toolkit Python R IBM
aorsf: Accelerated Oblique Random Survival Forests R
breakDown: Model Agnostic Explainers for Individual Predictions R
captum Python PyTorch
ceterisParibus: Ceteris Paribus Profiles R
DALEX: moDel Agnostic Language for Exploration and eXplanation Python R
DALEXtra: extension for DALEX Python R
Dianna Python
Diverse Counterfactual Explanations (DiCE) Python Microsoft
dtreeviz Python
ecco article Python
effectplots R
eli5 Python
explabox Python National Police Lab AI
eXplainability Toolbox Python
ExplainaBoard Python Carnegie Mellon University
ExplainerHub in github Python
fastshap R
fasttreeshap Python LinkedIn
FAT Forensics Python
flashlight R
Human Learn Python
hstats R
innvestigate Python Neural Networks
interpretML Python
interactions: Comprehensive, User-Friendly Toolkit for Probing Interactions R
kernelshap: Kernel SHAP R
Learning Interpretability Tool Python Google
lime: Local Interpretable Model-Agnostic Explanations R
Network Dissection Python Neural Networks MIT
OmniXAI Python Salesforce
Shap Python
Shapash Python
shapper R
shapviz R
Skater Python Oracle
survex R
teller Python
TCAV (Testing with Concept Activation Vectors) Python
trulens Python Truera
trulens-eval Python Truera
pre: Prediction Rule Ensembles R
Vetiver R Python Posit
vip R
vivid R
XAI - An eXplainability toolbox for machine learning Python The Institute for Ethical Machine Learning
xplique Python
XAIoGraphs Python Telefonica
Zennit Python

Interpretable Models

imodels Python
imodelsX Python
interpretML Python Microsoft R
PiML Toolbox Python
Tensorflow Lattice Python Google

LLM Regulation Compliance Regulation

COMPL-AI Python ETH Zurich Insait LatticeFlow AI

LLM Evaluation and Benchmarks

AILuminate
AlignEval: Making Evals Easy, Fun, and Semi-Automated Motivation
Azure AI Evaluation Python Microsoft
BALROG Python
DeepEval Python
evals Python OpenAI
FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists Python
FrontierMath
Geekbench AI
Giskard Python
Inspect UK AISI Python
Jailbreakbench Python
LightEval HuggingFace Python
LiveBench: A Challenging, Contamination-Free LLM Benchmark Contamination free
LM Evaluation Harness Python
lmms-eval Python
MixEval Python
ML Commons Safety Benchmark for general purpose AI chat model
MLPerf Training Benchmark Training
MMMU Apple Python
Moonshot AI Verify Foundation Python
NaturalBench Python
opik Comet Python
Phoenix Arize AI Python
Prometheus Python
Promptfoo Python
ragas Python
Rouge Python
simple evals Python OpenAI
StrongREJECT jailbreak benchmark Python
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains Python
Yet Another Applied LLM Benchmark Python
VLMEvalKit Python
WindowsAgentArena Python Microsoft

Performance (& Automated ML)

auditor R
automl: Deep Learning with Metaheuristic R
AutoKeras Python
Auto-Sklearn Python
DataPerf Python Google
deepchecks Python
EloML R
Featuretools Python
LOFO Importance Python
forester R
metrica: Prediction performance metrics R
model-diagnostics Python
NNI: Neural Network Intelligence Python Microsoft
performance R
rliable Python Google
SLmetrics R
TensorFlow Model Analysis Python Google
TPOT Python
Unleash Python
Yellowbrick Python
WeightWatcher (Examples) Python

(AI/Data) Poisoning

Copyright Traps for Large Language Models Python
Nightshade University of Chicago Tool
Glaze University of Chicago Tool
Fawkes University of Chicago Tool

Privacy

BackPACK Python
DataSynthesizer: Privacy-Preserving Synthetic Datasets Python Drexel University University of Washington
diffpriv R
Diffprivlib Python IBM
Discrete Gaussian for Differential Privacy Python IBM
Opacus Python Facebook
Privacy Meter Python National University of Singapore
PyVacy: Privacy Algorithms for PyTorch Python
SEAL Python Microsoft
SmartNoise Python OpenDP
Tensorflow Privacy Python Google

Reliability Evaluation (of post hoc explanation methods)

BetterBench Database
openXAI Python

Robustness

Adversarial Robustness Toolbox (ART) Python
BackdoorBench Python
Foolbox Python
Guardrails Python
NeMo Guardrails Python Nvidia

Safety

https://github.com/usnistgov/dioptra Python NIST
Garak Python Nvidia

Security

Counterfit Python Microsoft
Modelscan Python
NB Defense Python
Rebuff Playground Python
Turing Data Safe Haven Python The Alan Turing Institute

For consumers:

Sustainability

Azure Sustainability Calculator Microsoft
Carbon Tracker Website Python
CodeCarbon Website Python
Computer Progress
Impact Framework API

(RAI) Toolkit

Dr. Why R Warsaw University of Technology
Responsible AI Widgets R Microsoft
The Data Cards Playbook Python Google
Mercury Python BBVA
Deepchecks Python

(AI) Watermarking

AudioSeal: Proactive Localized Watermarking Python Facebook
MarkLLM: An Open-Source Toolkit for LLM Watermarking Python
SynthID Text Python Google

Regulations

Definition

What are regulations?

Regulations are requirements established by governments.

Interesting resources

Canada

European Union

Short Name	Code	Description	Status	Website	Legal text
Cyber Resilience Act (CRA) - horizontal cybersecurity requirements for products with digital elements	2022/0272(COD)	It introduces mandatory cybersecurity requirements for hardware and software products, throughout their whole lifecycle.	Proposal	Website	Source
Data Act	EU/2023/2854	It enables a fair distribution of the value of data by establishing clear and fair rules for accessing and using data within the European data economy.	Published	Website	Source
Data Governance Act	EU/2022/868	It supports the setup and development of Common European Data Spaces in strategic domains, involving both private and public players, in sectors such as health, environment, energy, agriculture, mobility, finance, manufacturing, public administration and skills.	Published	Website	Source
Digital Markets Act	EU/2022/1925	It establishes a set of clearly defined objective criteria to identify “gatekeepers”. Gatekeepers are large digital platforms providing so-called core platform services, such as online search engines, app stores, and messenger services. Gatekeepers will have to comply with the do’s (i.e. obligations) and don’ts (i.e. prohibitions) listed in the DMA.	Published	Website	Source
Digital Services Act	EU/2022/2065	It regulates online intermediaries and platforms such as marketplaces, social networks, content-sharing platforms, app stores, and online travel and accommodation platforms. Its main goal is to prevent illegal and harmful activities online and the spread of disinformation. It ensures user safety, protects fundamental rights, and creates a fair and open online platform environment.	Published	Website	Source
DSM Directive	EU/2019/790	It is intended to ensure a well-functioning marketplace for copyright.	Published	Website	Source
Energy Efficiency Directive	EU/2023/1791	It establishes ‘energy efficiency first’ as a fundamental principle of EU energy policy, giving it legal-standing for the first time. In practical terms, this means that energy efficiency must be considered by EU countries in all relevant policy and major investment decisions taken in the energy and non-energy sectors.	Published	Website	Source
EU AI Act	EU/2024/1689	It assigns applications of AI to three risk categories. First, applications and systems that create an unacceptable risk are banned. Second, high-risk applications are subject to specific legal requirements. Lastly, applications not explicitly banned or listed as high-risk are largely left unregulated.	Published	Website	Source
General Data Protection Regulation (GDPR)	EU/2016/679	It strengthens individuals' fundamental rights in the digital age and facilitates business by clarifying rules for companies and public bodies in the digital single market.	Published	Website	Source

Hiroshima Process International Guiding Principles for Advanced AI system

Singapore

Singapore’s Approach to AI Governance - Verify

United States

State consumer privacy laws: California (CCPA and its amendment, CPRA), Virginia (VCDPA), and Colorado (ColoPA).
Specific and limited privacy data laws: HIPAA, FCRA, FERPA, GLBA, ECPA, COPPA, VPPA and FTC.
EU-U.S. and Swiss-U.S. Privacy Shield Frameworks - The EU-U.S. and Swiss-U.S. Privacy Shield Frameworks were designed by the U.S. Department of Commerce and the European Commission and Swiss Administration to provide companies on both sides of the Atlantic with a mechanism to comply with data protection requirements when transferring personal data from the European Union and Switzerland to the United States in support of transatlantic commerce.
Executive Order on Maintaining American Leadership in AI - Official mandate by the President of the US.
Privacy Act of 1974 - The Privacy Act of 1974 establishes a code of fair information practices that governs the collection, maintenance, use and dissemination of information about individuals that is maintained in systems of records by federal agencies.
Privacy Protection Act of 1980 - The Privacy Protection Act of 1980 protects journalists from being required to turn over to law enforcement any work product and documentary materials, including sources, before it is disseminated to the public.
AI Bill of Rights - The Blueprint for an AI Bill of Rights is a guide for a society that protects all people from AI threats based on five principles: Safe and Effective Systems, Algorithmic Discrimination Protections, Data Privacy, Notice and Explanation, and Human Alternatives, Consideration, and Fallback.

Standards

Definition

What are standards?

Standards are voluntary, consensus solutions. They document an agreement on how a material, product, process, or service should be specified, performed or delivered. They keep people safe and ensure things work. They create confidence and provide security for investment.

Standards can be understood as formal specifications of best practices as well. There is a growing number of standards related to AI. You can search for the latest in the Standards Database from AI Standards Hub.

Standards

CEN Standards

The European Committee for Standardization is one of three European Standardization Organizations (together with CENELEC and ETSI) that have been officially recognized by the European Union and by the European Free Trade Association (EFTA) as being responsible for developing and defining voluntary standards at European level.

Domain	Standard	Status	URL
Data governance and quality for AI within the European context	CEN/CLC/TR 18115:2024	Published	Source

CEN AI Work programme can be found here.

IEEE Standards

Domain	Standard	Status	URL
IEEE Guide for an Architectural Framework for Explainable Artificial Intelligence	IEEE 2894-2024	Published	Source
IEEE Standard for Ethical Considerations in Emulated Empathy in Autonomous and Intelligent Systems	IEEE 7014-2024	Published	Source

UNE Standards

UNE is Spain's only Standardisation Organisation, designated by the Spanish Ministry of Economy, Industry and Competitiveness to the European Commission. It helps Spanish organizations to keep up-to-date on all aspects related to standardisation:

Discover the new regulatory developments;
Take part in developing standards;
Learn how to integrate standardisation in your R&D&i project;

Domain	Standard	Status	URL
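Data quality (Calidad del dato)	UNE 0079:2023	Published	Source
Data management (Gestión del dato)	UNE 0078:2023	Published	Source
Data governance (Gobierno del dato)	UNE 0077:2023	Published	Source
Guide for assessing the quality of a data set (Guía de evaluación de la Calidad de un Conjunto de Datos)	UNE 0081:2023	Published	Source
Guide for assessing data governance, data management, and data quality management (Guía de evaluación del Gobierno, Gestión y Gestión de la Calidad del Dato)	UNE 0080:2023	Published	Source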

Additional translations in Spanish can be found here.

ISO/IEC Standards

Domain	Standard	Status	URL
AI Concepts and Terminology	ISO/IEC 22989:2022 Information technology — Artificial intelligence — Artificial intelligence concepts and terminology	Published	https://www.iso.org/standard/74296.html
AI Risk Management	ISO/IEC 23894:2023 Information technology - Artificial intelligence - Guidance on risk management	Published	https://www.iso.org/standard/77304.html
AI Management System	ISO/IEC 42001:2023 Information technology — Artificial intelligence — Management system	Published	https://www.iso.org/standard/81230.html
Biases in AI	ISO/IEC TR 24027:2021 Information technology — Artificial intelligence (AI) — Bias in AI systems and AI aided decision making	Published	https://www.iso.org/standard/77607.html
AI Performance	ISO/IEC TS 4213:2022 Information technology — Artificial intelligence — Assessment of machine learning classification performance	Published	https://www.iso.org/standard/79799.html
Ethical and societal concerns	ISO/IEC TR 24368:2022 Information technology — Artificial intelligence — Overview of ethical and societal concerns	Published	https://www.iso.org/standard/78507.html
Explainability	ISO/IEC AWI TS 6254 Information technology — Artificial intelligence — Objectives and approaches for explainability of ML models and AI systems	Under Development	https://www.iso.org/standard/82148.html
AI Sustainability	ISO/IEC AWI TR 20226 Information technology — Artificial intelligence — Environmental sustainability aspects of AI systems	Under Development	https://www.iso.org/standard/86177.html
AI Verification and Validation	ISO/IEC AWI TS 17847 Information technology — Artificial intelligence — Verification and validation analysis of AI systems	Under Development	https://www.iso.org/standard/85072.html
AI Controllability	ISO/IEC CD TS 8200 Information technology — Artificial intelligence — Controllability of automated artificial intelligence systems	Published	https://www.iso.org/standard/83012.html
Biases in AI	ISO/IEC TS 12791:2024 Information technology — Artificial intelligence — Treatment of unwanted bias in classification and regression machine learning tasks	Published	https://www.iso.org/standard/84110.html
AI Impact Assessment	ISO/IEC AWI 42005 Information technology — Artificial intelligence — AI system impact assessment	Under Development	https://www.iso.org/standard/44545.html
Data Quality for AI/ML	ISO/IEC DIS 5259 Artificial intelligence — Data quality for analytics and machine learning (ML) (1 to 6)	Published	https://www.iso.org/standard/81088.html
Data Lifecycle	ISO/IEC 8183:2023 Information technology — Artificial intelligence — Data life cycle framework	Published	https://www.iso.org/standard/83002.html
Audit and Certification	ISO/IEC CD 42006 Information technology — Artificial intelligence — Requirements for bodies providing audit and certification of artificial intelligence management systems	Under Development	https://www.iso.org/standard/44546.html
Transparency	ISO/IEC AWI 12792 Information technology — Artificial intelligence — Transparency taxonomy of AI systems	Under Development	https://www.iso.org/standard/84111.html
AI Quality	ISO/IEC AWI TR 42106 Information technology — Artificial intelligence — Overview of differentiated benchmarking of AI system quality characteristics	Under Development	https://www.iso.org/standard/86903.html
Trustworthy AI	ISO/IEC TR 24028:2020 Information technology — Artificial intelligence — Overview of trustworthiness in artificial intelligence	Published	https://www.iso.org/standard/77608.html
Synthetic Data	ISO/IEC AWI TR 42103 Information technology — Artificial intelligence — Overview of synthetic data in the context of AI systems	Under Development	https://www.iso.org/standard/86899.html
AI Security	ISO/IEC AWI 27090 Cybersecurity — Artificial Intelligence — Guidance for addressing security threats and failures in artificial intelligence systems	Under Development	https://www.iso.org/standard/56581.html
AI Privacy	ISO/IEC AWI 27091 Cybersecurity and Privacy — Artificial Intelligence — Privacy protection	Under Development	https://www.iso.org/standard/56582.html
AI Governance	ISO/IEC 38507:2022 Information technology — Governance of IT — Governance implications of the use of artificial intelligence by organizations	Published	https://www.iso.org/standard/56641.html
AI Safety	ISO/IEC TR 5469:2024 Artificial intelligence — Functional safety and AI systems	Published	https://www.iso.org/standard/81283.html
Beneficial AI Systems	ISO/IEC AWI TR 21221 Information technology — Artificial intelligence — Beneficial AI systems	Under Development	https://www.iso.org/standard/86690.html

NIST Standards

Additional standards can be found using the Standards Database.

Citing this repository

Contributors with more than 50 edits can be named as coauthors in the citation. All other contributors are included under "et al."

BibTeX

@misc{arai_repo,
  author={Josep Curto et al.},
  title={Awesome Responsible Artificial Intelligence},
  year={2024},
  note={\url{https://github.com/AthenaCore/AwesomeResponsibleAI}}
}

ACM, APA, Chicago, and MLA

ACM (Association for Computing Machinery)

Curto, J., et al. 2024. Awesome Responsible Artificial Intelligence. GitHub. https://github.com/AthenaCore/AwesomeResponsibleAI.

APA (American Psychological Association) 7th Edition

Curto, J., et al. (2024). Awesome Responsible Artificial Intelligence. GitHub. https://github.com/AthenaCore/AwesomeResponsibleAI.

Chicago Manual of Style 17th Edition

Curto, J., et al. "Awesome Responsible Artificial Intelligence." GitHub. Last modified 2024. https://github.com/AthenaCore/AwesomeResponsibleAI.

MLA (Modern Language Association) 9th Edition

Curto, J., et al. "Awesome Responsible Artificial Intelligence." GitHub, 2024, https://github.com/AthenaCore/AwesomeResponsibleAI. Accessed 3 Dec. 2024.
