Yifeng Ding (natedingyifeng)

natedingyifeng

Geek Repo

Location:Urbana, Illinois

Home Page:yifeng-ding.com

Twitter:@YifengDing_

Github PK Tool:Github PK Tool

Yifeng Ding's starred repositories

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31784Issues:203Issues:4907

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15993Issues:108Issues:1042

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:9571Issues:74Issues:1124

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7763Issues:97Issues:1582

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:6016Issues:74Issues:534

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:5398Issues:36Issues:182

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4461Issues:50Issues:290

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language:PythonLicense:Apache-2.0Stargazers:4090Issues:56Issues:19

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2183Issues:23Issues:58

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License:MITStargazers:1574Issues:20Issues:0

robustness

Corruption and Perturbation Robustness (ICLR 2019)

Language:PythonLicense:Apache-2.0Stargazers:1006Issues:14Issues:55

PrefixTuning

Prefix-Tuning: Optimizing Continuous Prompts for Generation

java-callgraph

Programs for producing static and dynamic (runtime) call graphs for Java programs

awesome-directed-fuzzing

A curated list of awesome directed fuzzing research papers

graph4code

GraphGen4Code: a toolkit for creating code knowledge graphs based on WALA code analysis and extraction of documentation and forum content.

Language:Jupyter NotebookLicense:EPL-2.0Stargazers:216Issues:10Issues:19

intercode

[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898

Language:PythonLicense:MITStargazers:186Issues:7Issues:17

Prompt-Tuning

Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"

Language:Jupyter NotebookLicense:MITStargazers:158Issues:3Issues:4

LeTI

Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."

Language:PythonLicense:Apache-2.0Stargazers:61Issues:1Issues:2
Language:PythonLicense:MITStargazers:46Issues:2Issues:16

SemanticFlowGraph

This repository provides the code and guidance for reproducing the results in our ESEC/FSE 2023 submission "Pre-training Code Representations with Semantic Flow Graph for Effective Bug Localization".

Language:JavaLicense:MITStargazers:21Issues:2Issues:5

java-call-graph-plotter

Create and visualize static call graphs of Spring/Java applications

CCKG

Repository to create CCKGs from the paper "Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-intense Argumentation Tasks"

Language:PythonStargazers:10Issues:4Issues:0

modular_code_search

Code for NS3: Neuro-Symbolic Semantic Code Search

Language:PythonStargazers:7Issues:1Issues:0

bound_propagation

Linear and interval bound propagation in Pytorch with easy-to-use API and GPU support.

Language:PythonLicense:GPL-3.0Stargazers:7Issues:2Issues:8

APRNN

Code from PLDI '23 paper "Architecture-Preserving Provable Repair of Deep Neural Networks."

Language:HCLLicense:NOASSERTIONStargazers:6Issues:2Issues:1

specrepair

SpecRepair is a neural network repair algorithm.

Language:PythonLicense:Apache-2.0Stargazers:3Issues:1Issues:1