Linear95

Pengyu Cheng's repositories

CLUB

Code for ICML 2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

Language: Jupyter Notebook | 297 7 26

bert-intent-slot-detector

BERT-based intent and slots detector for chatbots.

Language: Python | 93 1 8

SPAG

Self-playing Adversarial Language Game Enhances LLM Reasoning

Language: Python | License: Apache-2.0 | 77 3 4

APO

Code for ACL 2024 paper - Adversarial Preference Optimization (APO).

Language: Python | License: Apache-2.0 | 45 1 2

BinarySentEmb

Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.

Language: Python | 44 6 3

DSP

Domain-specific preference (DSP) data and customized RM fine-tuning.

Language: Python | License: Apache-2.0 | 25 10

TC-estimation

Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators

Language: Jupyter Notebook | 12 30

DetGP

Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.

Language: Python | 1000

RLM

Code for the paper - Replacing Language Model for Style Transfer

Language: Python | 3 10

Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

License: BSD-3-Clause | 200

linear95.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | License: MIT | 2 10

ECC_classification

The implementation of ECC classification

Language: Python | 1 10

LLM-with-RL-papers

A collection of LLM with RL papers

100

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook | License: Apache-2.0 | 000

emacs-init

My Emacs init file for Python coding in deep learning

Language: Emacs Lisp | 020

Linear95

My personal repository

020

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | 000

awesome-auto-alignment

000

Awesome-LLM-Reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

License: MIT | 000

Awesome-LLM-RL

A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

000

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License: Apache-2.0 | 000