Guangneng Hu's repositories
njuhugn.github.io
Guangneng Hu, Assoc. Prof. @ Xidian Univ, PhD at HKUST, BA/MS at Nanjing Univ.
ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
ALLaVA
Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model
clip_text_span
official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
data-selection-survey
A Survey on Data Selection for Language Models
DL-Fairness-Study
This repository is for our ISSTA 2024 paper: A Large-Scale Empirical Study on Improving the Fairness of Image Classification Models
esci-data
Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search
InfDataSel
Code for paper: “What Data Benefits My Classifier?” Enhancing Model Performance and Interpretability through Influence-Based Data Selection (ICLR 2024 ORAL)
InfoBatch
Lossless Training Speed Up by Unbiased Dynamic Data Pruning
LESS
Preprint: Less: Selecting Influential Data for Targeted Instruction Tuning
LLMRec
[WSDM'2024 Oral] "LLMRec: Large Language Models with Graph Augmentation for Recommendation"
LLMs-Finetuning-Safety
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
NineRec
Multimodal Dataset and Benchmark for Multi-domain and Cross-domain Recommendation System
QuRating
[ICML 2024] Selecting High-Quality Data for Training Language Models
RecFormer
Replication of the paper "Text Is All You Need: Learning Language Representations for Sequential Recommendation" on KDD'23.
RLMRec
[WWW'2024] "RLMRec: Representation Learning with Large Language Models for Recommendation"
self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
SELFRec
An open-source framework for self-supervised recommender systems.
SOUL
[EMNLP 2023] Code and Data for "SOUL: Towards Sentiment and Opinion Understanding of Language"
SWE-bench
[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?
TASTE
[CIKM 2023] This is the code repo for our CIKM‘23 paper "Text Matching Improves Sequential Recommendation by Reducing Popularity Biases".
trak
A fast, effective data attribution method for neural networks in PyTorch
Vit-RGTS
Open source implementation of "Vision Transformers Need Registers"
Wuerstchen
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models