Skip to content

Research Paper Library

A curated collection of seminal and recent papers in trustworthy machine learning. Papers are organized by topic and include our commentary on significance and practical implications.

Search & Filter

Use Ctrl+F to search for specific topics, authors, or venues. Papers are tagged with key concepts for easy discovery.

Foundational Papers

Fairness & Bias

Seminal Works

  • Fairness Through Awareness (Dwork et al., 2012)
    ITCS 2012 | individual-fairness awareness
    Introduces the concept of individual fairness and awareness in algorithmic decision-making.

  • Equality of Opportunity in Supervised Learning (Hardt et al., 2016)
    NIPS 2016 | group-fairness equalized-odds
    Defines equalized odds and equality of opportunity for binary classification.

  • Fairness Definitions Explained (Verma & Rubin, 2018)
    IEEE FATES 2018 | survey fairness-metrics
    Comprehensive survey of 20+ fairness definitions with mathematical formulations.

Recent Advances

Robustness & Adversarial ML

Foundational

Certified Defenses

Interpretability & Explainability

Core Methods

Evaluation & Benchmarking

Privacy-Preserving ML

Differential Privacy

Federated Learning

Recent Research (2023-2024)

Emerging Topics

Benchmark Papers

Paper Collections by Venue

Top-Tier Conferences

  • FAccT 2024 - Latest fairness and accountability research
  • FAccT 2023 - Includes algorithmic auditing advances
  • FAccT 2022 - Focus on intersectionality and bias
  • Focus on theoretical foundations and scalable algorithms
  • Strong representation in robustness and privacy research
  • Recent emphasis on LLM safety and alignment
  • Cutting-edge deep learning approaches to trustworthy ML
  • Novel architectures for interpretable models
  • Adversarial robustness innovations

Specialized Venues

  • AIES (AI, Ethics, and Society): Interdisciplinary perspectives
  • S&P, CCS, USENIX Security: Security and privacy focus
  • CHI, CSCW: Human-computer interaction and social impacts
  • AAAI: Broad AI applications and theoretical work

Reading Lists by Course Module

For Assignment 1: Bias Detection

  1. Verma & Rubin (2018) - Fairness definitions overview
  2. Bellamy et al. (2019) - Practical fairness toolkit usage
  3. Choose one: Group fairness vs. individual fairness comparison

For Assignment 2: Adversarial Robustness

  1. Goodfellow et al. (2014) - FGSM and basic concepts
  2. Madry et al. (2017) - PGD and evaluation methodology
  3. Cohen et al. (2019) - Certified defenses introduction

For Midterm Preparation

Core papers from each topic area marked with ⭐ in the full bibliography.


Contributing

Found an important paper we missed? Submit a suggestion to help keep this library comprehensive and current.

Last updated: {{ git_revision_date }}