Guangneng Hu's repositories
njuhugn.github.io
Guangneng Hu, Assoc. Prof. @ Xidian Univ, PhD at HKUST, BA/MS at Nanjing Univ.
LiveBench
LiveBench: A Challenging, Contamination-Free LLM Benchmark
VILA
VILA - A multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
trak
A fast, effective data attribution method for neural networks in PyTorch
data-selection-survey
A Survey on Data Selection for Language Models
QuRating
[ICML 2024] Selecting High-Quality Data for Training Language Models
SELFRec
An open-source framework for self-supervised recommender systems.
SWE-bench
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
IDvs.MoRec
End-to-end Training for Multimodal Recommendation Systems
InfDataSel
Code for paper: “What Data Benefits My Classifier?” Enhancing Model Performance and Interpretability through Influence-Based Data Selection (ICLR 2024 ORAL)
LESS
Preprint: LESS: Selecting Influential Data for Targeted Instruction Tuning
RecFormer
Replication of the paper "Text Is All You Need: Learning Language Representations for Sequential Recommendation" at KDD'23.
DIMO
SIGIR 2024, Session-based recommendation, Co-occurrence patterns of ID, Fine-grained preferences of Modality, Disentanglement learning.
esci-data
Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search
NineRec
Multimodal Dataset and Benchmark for Multi-domain and Cross-domain Recommendation System
InfoBatch
Lossless Training Speed Up by Unbiased Dynamic Data Pruning
DL-Fairness-Study
This repository is for our ISSTA 2024 paper: A Large-Scale Empirical Study on Improving the Fairness of Image Classification Models
Wuerstchen
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
clip_text_span
Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
LLMRec
[WSDM'2024 Oral] "LLMRec: Large Language Models with Graph Augmentation for Recommendation"
DiffKG
[WSDM'2024 Oral] "DiffKG: Knowledge Graph Diffusion Model for Recommendation"
TASTE
[CIKM 2023] This is the code repo for our CIKM'23 paper "Text Matching Improves Sequential Recommendation by Reducing Popularity Biases".
Vit-RGTS
Open source implementation of "Vision Transformers Need Registers"
ALLaVA
Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model
ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627