eXplainable Machine Learning course for Machine Learning (MSc) studies at the University of Warsaw.
Winter semester 2022/23 @pbiecek @hbaniecki
The course consists of lectures, computer labs and a project.
The design of this course is based on four principles:
- Mixing experiences during studies is good. It allows you to generate more ideas. Also, in mixed groups, we can improve our communication skills,
- In eXplainable AI (XAI), the interface/esthetic of the solution is important. Like earlier Human-Computer Interaction (HCI), XAI is on the borderline between technical, domain and cognitive aspects. Therefore, apart from the purely technical descriptions, the results must be grounded in the domain and should be communicated aesthetically and legibly.
- Communication of results is important. Both in science and business, it is essential to be able to present the results concisely and legibly. In this course, it should translate into the ability to describe one XAI challenge in the form of a short report/article.
- It is worth doing valuable things. Let's look for new applications for XAI methods discussed on typical predictive problems.
Plan for the winter semester 2022/2023. UW classes are on Fridays.
- 2022-10-07 -- Introduction, slides, audio
- 2022-10-14 -- Break-Down / SHAP, slides, audio, code examples
- 2022-10-21 -- LIME / LORE, slides, audio, code examples
- 2022-10-28 -- CP / PDP, slides, code examples
- 2022-11-04 -- PROJECT: First checkpoint - Choose a topic and be familiar with the attached materials.
- 2022-11-18 -- VIP / MCR, slides, audio, code examples
- 2022-11-25 -- Fairness, slides, audio, code examples
- 2022-12-02 -- Explanations specific to neural networks, slides & Evaluation of explanations, slides
- 2022-12-09 -- PROJECT: Second checkpoint - Provide initial experimental results and/or code implementation.
- 2022-12-16 -- Fairness and project consultations
- 2022-12-22 -- Project consultations
- 2023-01-13 -- Student presentations
- 2023-01-20 -- Student presentations
- 2023-01-27 -- PROJECT: Final presentation - Present final experimental results and/or code implementation.
From different activities, you can get from 0 to 100 points. 51 points are needed to pass this course.
Grades:
- 51-60: (3) dst
- 61-70: (3.5) dst+
- 71-80: (4) db
- 81-90: (4.5) db+
- 91-100: (5) bdb
There are four key components:
- Homeworks (0-24)
- Presentations (0-10)
- Project (0-36)
- Exam (0-30)
- Homework 1 for 0-4 points. Deadline: 2022-10-13
- Homework 2 for 0-4 points. Deadline: 2022-10-20
- Homework 3 for 0-4 points. Deadline: 2022-10-27
- Homework 4 for 0-4 points. Deadline: 2022-11-17
- Homework 5 for 0-4 points. Deadline: 2022-11-24
- Homework 6 for 0-4 points. Deadline: 2022-12-08
Presentations can be prepared by one or two students. Each group should present a single paper related to XAI published in the last 3 years (journal or conference). Each group should choose a different paper. Here are some suggestions:
- J. DeYoung et al. ERASER : A Benchmark to Evaluate Rationalized NLP Models. ACL, 2020.
- D. Slack et al. Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods. AIES, 2020.
- S. Lundberg et al. From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2020.
- U. Bhatt et al. Evaluating and Aggregating Feature-based Model Explanations. IJCAI, 2020.
- J. Adebayo et al. Debugging Tests for Model Explanations. NeurIPS, 2020.
- S. Sinha et al. Perturbing Inputs for Fragile Interpretations in Deep Natural Language Processing. EMNLP BlackboxNLP Workshop, 2021.
- L. Yang et al. Synthetic Benchmarks for Scientific Research in Explainable Machine Learning. NeurIPS Datasets and Benchmarks, 2021.
- M. Neely et al. Order in the Court: Explainable AI Methods Prone to Disagreement. ICML XAI Workshop, 2021.
- F. Poursabzi-Sangdeh et al. Manipulating and Measuring Model Interpretability. CHI, 2021.
- X. Zhao et al. BayLIME: Bayesian Local Interpretable Model-Agnostic Explanations. UAI, 2021.
- Y. Zhou et al. Do Feature Attribution Methods Correctly Attribute Features?. AAAI, 2022.
- J. Adebayo et al. Post hoc Explanations may be Ineffective for Detecting Unknown Spurious Correlation. ICLR, 2022.
- Papers from: NeurIPS 2021 Workshop on eXplainable AI approaches for debugging and diagnosis
// More suggestions for computer vision:
- Z. Huang & Y. Li. Interpretable and Accurate Fine-grained Recognition via Region Grouping. CVPR, 2020.
- K. Hanawa et al. Evaluation of Similarity-based Explanations. ICLR, 2021.
- A. Kapishnikov et al. Guided Integrated Gradients: An Adaptive Path Method for Removing Noise. CVPR, 2021.
- V. Petsiuk et al. Black-Box Explanation of Object Detectors via Saliency Maps. CVPR, 2021.
- L. Arras et al. CLEVR-XAI: A benchmark dataset for the ground truth evaluation of neural network explanations. Information Fusion, 2022.
- A. Khakzar et al. Do Explanations Explain? Model Knows Best. CVPR, 2022.
- S. Chen & Q. Zhao. REX: Reasoning-Aware and Grounded Explanation. CVPR, 2022.
- T. Makino et al. Differences between human and machine perception in medical diagnosis. Scientific Reports, 2022.
- M. Watson et al. Agree to Disagree: When Deep Learning Models With Identical Architectures Produce Distinct Explanations. WACV, 2022.
- Q. Zheng et al. Shap-CAM: Visual Explanations for Convolutional Neural Networks based on Shapley Value. ECCV, 2022.
- XAI-related papers from:
XAI stories ebook (previous editions):
- first edition https://pbiecek.github.io/xai_stories/
- second edition https://pbiecek.github.io/xai_stories_2/
A written exam will consist of simple exercises based on the materials from Lectures and Homeworks.
We recommend to dive deep into the following books and explore their references on a particular topic of interest:
- Explanatory Model Analysis. Explore, Explain and Examine Predictive Models by Przemysław Biecek, Tomasz Burzykowski
- Fairness and Machine Learning: Limitations and Opportunities by Solon Barocas, Moritz Hardt, Arvind Narayanan
- Interpretable Machine Learning. A Guide for Making Black Box Models Explainable by Christoph Molnar