There are 2 repositories under data-selection topic.
Preprint: Less: Selecting Influential Data for Targeted Instruction Tuning
:no_entry: [DEPRECATED] Adapt Transformer-based language models to new text domains
InstructionGPT-4
Keras sentence classification
Enhanced spatio-temporal electric load forecasts with less data using active deep learning
Dynamic Transfer Learning for Low-Resource Neural Machine Translation
Repository for the experiments in my paper accepted to the CLIN Journal: "Selecting Parallel In-domain Sentences for Neural Machine Translation Using Monolingual Texts"
This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (NeurIPS 2023).
This repository contains the data and code for the paper "Self-training with Two-phase Self-augmentation for Few-shot Dialogue Generation" (EMNLP2022-Findings).
Introducing you to the fundamentals of the quintessential Python data analysis library, pandas, and its core data structures – the Series and DataFrame objects.
Quilt: Robust Data Segment Selection against Concept Drifts (AAAI 2024)
NU Bootcamp Module 14
A quick-start project that helps you to perform different types of selection in Vue Grid and know about different modes of selection – Row, Cell and Both. This project contains code snippet about cell, checkbox and toggle selection, and the way to get row index of selected cells using row selection events.