Paiheng Xu's starred repositories
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
qa_metrics
An easy python package to run quick basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluation metrics: Black-box and Open-Source large language model prompting and evaluation, exact match, F1 Score, PEDANT semantic match, transformer match. Our package also supports prompting OPENAI and Anthropic API.
vaderSentiment
VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool that is specifically attuned to sentiments expressed in social media, and works well on texts from other domains.
edu-convokit
Edu-ConvoKit: An Open-Source Framework for Education Conversation Data
NLP4SocialGood_Papers
A reading list of up-to-date papers on NLP for Social Good.
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
multi-task-NLP
multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
dataset_difficulty
"Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)
vert-papers
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
COVID-19-TweetIDs
The repository contains an ongoing collection of tweets IDs associated with the novel coronavirus COVID-19 (SARS-CoV-2), which commenced on January 28, 2020.
Awesome-Fair-Graph-Learning
Paper List for Fair Graph Learning (FairGL).
liwc-python
Linguistic Inquiry and Word Count (LIWC) analyzer
mrc-for-flat-nested-ner
Code for ACL 2020 paper `A Unified MRC Framework for Named Entity Recognition`
conversational-uptake
Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions"
unsupervised_gender_bias
Code for https://arxiv.org/pdf/2004.08361.pdf
DocRE-reading-list
a paper reading list on Document level Relation Extraction