Lecture notes in computer science, Journal Year: 2023, Volume and Issue: unknown, P. 184 - 208
Published: Jan. 1, 2023
Language: Английский
Lecture notes in computer science, Journal Year: 2023, Volume and Issue: unknown, P. 184 - 208
Published: Jan. 1, 2023
Language: Английский
Published: April 9, 2024
As businesses, products, and services spring up around large language models, the trustworthiness of these models hinges on verifiability their outputs. However, methods for explaining model outputs largely fall across two distinct fields study which both use term "attribution" to refer entirely separate techniques: citation generation training data attribution. In many modern applications, such as legal document medical question answering, types attributions are important. this work, we argue present a unified framework attributions. We show how existing different attribution under framework. also discuss real-world cases where one or required. believe that will guide case driven development systems leverage attribution, well standardization evaluation.
Language: Английский
Citations
1Published: Feb. 1, 2023
In this work, we propose ModelPred, a framework that helps to understand the impact of changes in training data on trained model. This is critical for building trust various stages machine learning pipeline: from cleaning poor-quality samples and tracking important ones be collected during preparation, calibrating uncertainty model prediction, interpreting why certain behaviors emerge deployment. Specifically, ModelPred learns parameterized function takes dataset S as input predicts obtained by S. Our work differs recent Datamodels [1] aim predicting parameters directly instead behaviors. We demonstrate neural network-based set class capable complex relationships between parameters. introduce novel global local regularization techniques prevent overfitting rigorously characterize expressive power networks (NN) approximating end-to-end process. Through extensive empirical investigations, show enables variety applications boost interpretability accountability (ML), such valuation, selection, memorization quantification, calibration.
Language: Английский
Citations
3Proceedings of the AAAI Conference on Artificial Intelligence, Journal Year: 2023, Volume and Issue: 37(6), P. 6631 - 6639
Published: June 26, 2023
Conformal prediction (CP) is a wrapper around traditional machine learning models, giving coverage guarantees under the sole assumption of exchangeability; in classification problems, CP that error rate at most chosen significance level, irrespective whether underlying model misspecified. However, prohibitive computational costs full led researchers to design scalable alternatives, which alas do not attain same or statistical power CP. In this paper, we use influence functions efficiently approximate We prove our method consistent approximation CP, and empirically show becomes smaller as training set increases; e.g., for 1,000 points two methods output p-values are
Language: Английский
Citations
3Proceedings of the AAAI Conference on Artificial Intelligence, Journal Year: 2022, Volume and Issue: 36(8), P. 9082 - 9090
Published: June 28, 2022
Understanding the black-box prediction for neural networks is challenging. To achieve this, early studies have designed influence function (IF) to measure effect of removing a single training point on networks. However, classic implicit Hessian-vector product (IHVP) method calculating IF fragile, and theoretical analysis in context still lacking. this end, we utilize tangent kernel (NTK) theory calculate network trained with regularized mean-square loss, prove that approximation error can be arbitrarily small when width sufficiently large two-layer ReLU We analyze bound IHVP over-parameterized regime understand why it fails or not. In detail, our reveals (1) accuracy depends regularization term, pretty low under weak regularization; (2) has significant correlation probability density corresponding points. further borrow from NTK IFs better, including quantifying complexity influential samples depicting variation during dynamics. Numerical experiments real-world data confirm results demonstrate findings.
Language: Английский
Citations
5Lecture notes in computer science, Journal Year: 2023, Volume and Issue: unknown, P. 184 - 208
Published: Jan. 1, 2023
Language: Английский
Citations
2