Pengyu Cheng (Linear95)

Linear95

Geek Repo

Company:Tencent AI Lab

Location:Shenzhen, China

Home Page:https://linear95.github.io/

Twitter:@cheng_pengyu

Github PK Tool:Github PK Tool

Pengyu Cheng's repositories

CLUB

Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

Language:Jupyter NotebookStargazers:288Issues:7Issues:26

bert-intent-slot-detector

BERT-based intent and slots detector for chatbots.

SPAG

Self-playing Adversarial Language Game Enhances LLM Reasoning

Language:PythonLicense:Apache-2.0Stargazers:55Issues:3Issues:4

BinarySentEmb

Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.

APO

Code for ACL2024 paper - Adversarial Preference Optimization (APO).

Language:PythonLicense:Apache-2.0Stargazers:42Issues:1Issues:2

DSP

Domain-specific preference (DSP) data and customized RM fine-tuning.

Language:PythonLicense:Apache-2.0Stargazers:25Issues:1Issues:0

TC-estimation

Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators

Language:Jupyter NotebookStargazers:12Issues:3Issues:0

DetGP

Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.

Language:PythonStargazers:10Issues:0Issues:0

RLM

Code for the paper - Replacing Language Model for Style Transfer

Language:PythonStargazers:3Issues:1Issues:0

Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

License:BSD-3-ClauseStargazers:2Issues:0Issues:0

linear95.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:2Issues:1Issues:0

ECC_classification

The implement of ECC classification

Language:PythonStargazers:1Issues:1Issues:0

LLM-with-RL-papers

A collection of LLM with RL papers

Stargazers:1Issues:0Issues:0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

emacs-init

My emacs init file for python coding in deep learning

Language:Emacs LispStargazers:0Issues:2Issues:0

Linear95

My personal repository

Stargazers:0Issues:2Issues:0

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Awesome-LLM-Reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

License:MITStargazers:0Issues:0Issues:0

Awesome-LLM-RL

A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

Stargazers:0Issues:0Issues:0

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License:Apache-2.0Stargazers:0Issues:0Issues:0