Social Foundations of Computation
Max Planck Institute for Intelligent Systems, Tübingen
Popular repositories Loading
-
-
error-parity
error-parity PublicAchieve error-rate fairness between societal groups for any score-based classifier.
-
benchbench
benchbench PublicBenchBench is a Python package to evaluate multi-task benchmarks.
Repositories
Showing 10 of 12 repositories
- causal-features Public Forked from mlfoundations/tableshift
Code to reproduce the paper "Do causal predictors generalize better to new domains?"
socialfoundations/causal-features’s past year of commit activity - surveying-language-models Public
Code to reproduce the paper "Questioning the Survey Responses of Large Language Models"
socialfoundations/surveying-language-models’s past year of commit activity - lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
socialfoundations/lm-evaluation-harness’s past year of commit activity - training-on-the-test-task Public
Code to reproduce the experiments in the paper Training on the Test Task Confounds Evaluation and Emergence.
socialfoundations/training-on-the-test-task’s past year of commit activity - error-parity Public
Achieve error-rate fairness between societal groups for any score-based classifier.
socialfoundations/error-parity’s past year of commit activity