There are 0 repository under data-pruning topic.
pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning".
Learning Large-scale Neural Fields via Context Pruned Meta-Learning (NeurIPS 2023)
Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information"
Official repository of the paper "DiffProb: Data Pruning for Face Recognition" (accepted at FG 2025)
code for the paper Beyond Neural scaling laws for fast proven robust certification of nearest prototype classifiers