njuhugn

Code for paper: “What Data Benefits My Classifier?” Enhancing Model Performance and Interpretability through Influence-Based Data Selection (ICLR 2024 ORAL)

InfoBatch

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

LESS

Preprint: LESS: Selecting Influential Data for Targeted Instruction Tuning

License: MIT

LiveBench

LiveBench: A Challenging, Contamination-Free LLM Benchmark

License: NOASSERTION

llm-feedback

LLMRec

[WSDM'2024 Oral] "LLMRec: Large Language Models with Graph Augmentation for Recommendation"

LLMs-Finetuning-Safety

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Language: Python, License: MIT

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python, License: Apache-2.0

MoDS

NineRec

Multimodal Dataset and Benchmark for Multi-domain and Cross-domain Recommendation System

QuRating

[ICML 2024] Selecting High-Quality Data for Training Language Models

RecFormer

Replication of the KDD'23 paper "Text Is All You Need: Learning Language Representations for Sequential Recommendation".

RLMRec

[WWW'2024] "RLMRec: Representation Learning with Large Language Models for Recommendation"

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through Self-Reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

License: MIT