Paiheng Xu's starred repositories

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:26086Issues:0Issues:0

qa_metrics

An easy python package to run quick basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluation metrics: Black-box and Open-Source large language model prompting and evaluation, exact match, F1 Score, PEDANT semantic match, transformer match. Our package also supports prompting OPENAI and Anthropic API.

Language:PythonLicense:MITStargazers:22Issues:0Issues:0

vaderSentiment

VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool that is specifically attuned to sentiments expressed in social media, and works well on texts from other domains.

Language:PythonLicense:MITStargazers:4344Issues:0Issues:0

lloom

Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-level concepts to analyze unstructured text.

Language:PythonLicense:BSD-3-ClauseStargazers:47Issues:0Issues:0

sammo

A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)

Language:PythonLicense:MITStargazers:286Issues:0Issues:0

edu-convokit

Edu-ConvoKit: An Open-Source Framework for Education Conversation Data

Language:Jupyter NotebookLicense:MITStargazers:66Issues:0Issues:0

NLP4SocialGood_Papers

A reading list of up-to-date papers on NLP for Social Good.

Stargazers:270Issues:0Issues:0

tokreate

A minimal library to create tokens using LLMs.

Language:PythonStargazers:6Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29246Issues:0Issues:0

LRV-Instruction

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Language:PythonLicense:BSD-3-ClauseStargazers:239Issues:0Issues:0

ecco

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:1948Issues:0Issues:0

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1240Issues:0Issues:0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:28541Issues:0Issues:0

multi-task-NLP

multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.

Language:PythonLicense:Apache-2.0Stargazers:364Issues:0Issues:0

Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3163Issues:0Issues:0
Language:PythonLicense:MITStargazers:25Issues:0Issues:0

dataset_difficulty

"Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)

Language:Jupyter NotebookStargazers:76Issues:0Issues:0

zoe

Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.

Language:PythonStargazers:43Issues:0Issues:0

vert-papers

This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).

Language:PythonLicense:MITStargazers:265Issues:0Issues:0

COVID-19-TweetIDs

The repository contains an ongoing collection of tweets IDs associated with the novel coronavirus COVID-19 (SARS-CoV-2), which commenced on January 28, 2020.

Language:PythonLicense:NOASSERTIONStargazers:714Issues:0Issues:0

Awesome-Fair-Graph-Learning

Paper List for Fair Graph Learning (FairGL).

License:MITStargazers:119Issues:0Issues:0

EntLM

Codes for "Template-free Prompt Tuning for Few-shot NER".

Language:PythonStargazers:114Issues:0Issues:0

brat

brat rapid annotation tool (brat) - for all your textual annotation needs

Language:PythonLicense:NOASSERTIONStargazers:1806Issues:0Issues:0

SCPR

Interactive Path Reasoning on Graph for Conversational Recommendation

Language:PythonStargazers:24Issues:0Issues:0

liwc-python

Linguistic Inquiry and Word Count (LIWC) analyzer

Language:PythonLicense:MITStargazers:190Issues:0Issues:0

mrc-for-flat-nested-ner

Code for ACL 2020 paper `A Unified MRC Framework for Named Entity Recognition`

Language:PythonStargazers:649Issues:0Issues:0

conversational-uptake

Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions"

Language:PythonLicense:MITStargazers:20Issues:0Issues:0

unsupervised_gender_bias

Code for https://arxiv.org/pdf/2004.08361.pdf

Language:PythonStargazers:8Issues:0Issues:0
Language:PythonStargazers:16Issues:0Issues:0

DocRE-reading-list

a paper reading list on Document level Relation Extraction

Stargazers:59Issues:0Issues:0