Cited by The Thousand Faces of Explainable AI Along the Machine Learning Life Cycle: Industrial Reality and Current State of Research

Unifying Corroborative and Contributive Attributions in Large Language Models

Theodora Worledge, Judy Hanwen Shen, Nicole Meister, et al.

Published: April 9, 2024

As businesses, products, and services spring up around large language models, the trustworthiness of these models hinges on the verifiability of their outputs. However, methods for explaining language model outputs largely fall across two distinct fields of study, which both use the term "attribution" to refer to entirely separate techniques: citation generation and training data attribution. In many modern applications, such as legal document generation and medical question answering, both types of attributions are important. In this work, we argue for and present a unified framework of large language model attributions. We show how existing methods of different types of attribution fall under the unified framework. We also use the framework to discuss real-world use cases where one or both types of attributions are required. We believe that this unified framework will guide the use-case-driven development of systems that leverage both types of attribution, as well as the standardization of their evaluation.

Language: English

ModelPred: A Framework for Predicting Trained Model from Training Data

Yingyan Zeng, Jiachen T. Wang, Si Chen, et al.

Published: Feb. 1, 2023

In this work, we propose ModelPred, a framework that helps to understand the impact of changes in training data on a trained model. This is critical for building trust in various stages of a machine learning pipeline: from cleaning poor-quality samples and tracking important ones to be collected during data preparation, to calibrating the uncertainty of model prediction, to interpreting why certain behaviors of a model emerge during deployment. Specifically, ModelPred learns a parameterized function that takes a dataset S as input and predicts the model obtained by training on S. Our work differs from the recent work on Datamodels [1] in that we aim to predict the trained model parameters directly instead of the trained model behaviors. We demonstrate that a neural network-based set function class is capable of learning the complex relationships between the training data and model parameters. We introduce novel global and local regularization techniques to prevent overfitting, and we rigorously characterize the expressive power of neural networks (NN) in approximating the end-to-end training process. Through extensive empirical investigations, we show that ModelPred enables a variety of applications that boost the interpretability and accountability of machine learning (ML), such as data valuation, data selection, memorization quantification, and model calibration.

Language: English

Approximating Full Conformal Prediction at Scale via Influence Functions

Javier Abad Martinez, Umang Bhatt, Adrian Weller, et al.

Proceedings of the AAAI Conference on Artificial Intelligence, Journal Year: 2023, Volume and Issue: 37(6), P. 6631 - 6639

Published: June 26, 2023

Conformal prediction (CP) is a wrapper around traditional machine learning models, giving coverage guarantees under the sole assumption of exchangeability; in classification problems, CP guarantees that the error rate is at most a chosen significance level, irrespective of whether the underlying model is misspecified. However, the prohibitive computational costs of full CP led researchers to design scalable alternatives, which alas do not attain the same guarantees or statistical power of full CP. In this paper, we use influence functions to efficiently approximate full CP. We prove that our method is a consistent approximation of full CP, and empirically show that the approximation error becomes smaller as the training set increases; e.g., for 1,000 training points the two methods output p-values that are ...
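As a rough illustration of why full CP is expensive, the following minimal sketch computes full-CP p-values for a toy one-dimensional classifier. It is not the paper's method: the nonconformity score (distance to the class mean), the function names, and the data layout are all assumptions chosen for brevity. The point it shows is that every candidate label requires recomputing the model on the training set augmented with the test point.

```python
from statistics import mean

def nonconformity(points, i):
    """Hypothetical score: distance of example i to the mean of its own class.

    The class mean is recomputed on the augmented set passed in, which
    plays the role of "retraining" in this toy setting.
    """
    x_i, y_i = points[i]
    same_class = [x for x, y in points if y == y_i]
    return abs(x_i - mean(same_class))

def full_cp_pvalue(train, x_new, y_candidate):
    """Full-CP p-value for assigning label y_candidate to x_new."""
    # Augment the training set with the test point under the candidate label.
    augmented = train + [(x_new, y_candidate)]
    scores = [nonconformity(augmented, i) for i in range(len(augmented))]
    test_score = scores[-1]
    # Fraction of examples at least as nonconforming as the test point.
    return sum(s >= test_score for s in scores) / len(augmented)

def prediction_set(train, x_new, labels, epsilon):
    """Labels whose p-value exceeds the significance level epsilon."""
    return {y for y in labels if full_cp_pvalue(train, x_new, y) > epsilon}
```

With `n` training points and `k` candidate labels, this loop refits the toy model `k` times per test point, and each fit rescans all `n + 1` examples; for a real model each refit is a full training run, which is the cost the abstract describes.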

Language: English

Rethinking Influence Functions of Neural Networks in the Over-Parameterized Regime

Rui Zhang, Shihua Zhang

Proceedings of the AAAI Conference on Artificial Intelligence, Journal Year: 2022, Volume and Issue: 36(8), P. 9082 - 9090

Published: June 28, 2022

Understanding the black-box prediction of neural networks is challenging. To achieve this, early studies designed the influence function (IF) to measure the effect of removing a single training point on neural networks. However, the classic implicit Hessian-vector product (IHVP) method for calculating IF is fragile, and theoretical analysis of IF in the context of neural networks is still lacking. To this end, we utilize neural tangent kernel (NTK) theory to calculate IF for a neural network trained with regularized mean-square loss, and prove that the approximation error can be arbitrarily small when the width is sufficiently large for two-layer ReLU networks. We analyze the error bound for the classic IHVP method in the over-parameterized regime to understand when and why it fails or not. In detail, our theoretical analysis reveals that (1) the accuracy of IHVP depends on the regularization term, and is pretty low under weak regularization; (2) the accuracy of IHVP has a significant correlation with the probability density of the corresponding training points. We further borrow the theory from NTK to understand IFs better, including quantifying the complexity of influential samples and depicting the variation of IFs during the training dynamics. Numerical experiments on real-world data confirm our theoretical results and demonstrate our findings.
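To make "the effect of removing a single training point" concrete, here is a minimal sketch of the classic first-order influence-function estimate for a one-parameter ridge regression, compared against exact retraining without that point. This is a toy setting chosen so that the Hessian is a scalar; it is not the NTK-based calculation from the paper, and all function names are hypothetical. The objective is assumed to be (1/(2n)) Σ (θ·x_j − y_j)² + (λ/2)·θ², with n kept fixed when a point is removed.

```python
def fit_ridge(xs, ys, lam, n):
    """Minimizer of (1/(2n)) * sum((theta*x - y)^2) + (lam/2) * theta^2."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + n * lam)

def influence_of_removal(xs, ys, lam, i):
    """First-order estimate of the parameter change when point i is removed."""
    n = len(xs)
    theta = fit_ridge(xs, ys, lam, n)
    # Hessian of the regularized objective (a scalar for one parameter).
    hessian = sum(x * x for x in xs) / n + lam
    # Gradient of point i's squared-error loss at the fitted parameter.
    grad_i = (theta * xs[i] - ys[i]) * xs[i]
    # Removing a point corresponds to upweighting it by -1/n.
    return grad_i / (n * hessian)

def exact_removal_effect(xs, ys, lam, i):
    """Parameter change from actually refitting without point i."""
    n = len(xs)
    theta = fit_ridge(xs, ys, lam, n)
    xs_loo = xs[:i] + xs[i + 1:]
    ys_loo = ys[:i] + ys[i + 1:]
    return fit_ridge(xs_loo, ys_loo, lam, n) - theta
```

In this toy model the estimate equals the exact change scaled by (A − x_i²)/A, where A = Σ x_j² + nλ, so it is accurate when a single point contributes little to the curvature and degrades for high-leverage points.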

Language: English

The Thousand Faces of Explainable AI Along the Machine Learning Life Cycle: Industrial Reality and Current State of Research

Thomas Decker, Ralf Gross, Alexander Koebler, et al.

Lecture Notes in Computer Science, Journal Year: 2023, Volume and Issue: unknown, P. 184 - 208

Published: Jan. 1, 2023

Language: English