shizhediao

shizhediao

Geek Repo

Company:HKUST

Location:Hong Kong

Home Page:https://shizhediao.github.io/

Github PK Tool:Github PK Tool

shizhediao's starred repositories

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonStargazers:338Issues:0Issues:0

RLHFlow.github.io

Webpage for RLHFlow

Language:HTMLStargazers:7Issues:0Issues:0

HPT

HPT - Open Multimodal LLMs from HyperGAI

Language:PythonLicense:Apache-2.0Stargazers:290Issues:0Issues:0

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonLicense:BSD-3-ClauseStargazers:3331Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:21452Issues:0Issues:0

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonLicense:Apache-2.0Stargazers:8066Issues:0Issues:0

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonLicense:Apache-2.0Stargazers:229Issues:0Issues:0

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonLicense:MITStargazers:26820Issues:0Issues:0

bootstrapped-preference-optimization-BPO-

code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"

Language:PythonLicense:Apache-2.0Stargazers:24Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:Apache-2.0Stargazers:10592Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8452Issues:0Issues:0

sleeper-agents-paper

Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".

Stargazers:71Issues:0Issues:0

MLLM-protector

The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"

Language:PythonLicense:Apache-2.0Stargazers:27Issues:0Issues:0

Directional-Preference-Alignment

Directional Preference Alignment

License:Apache-2.0Stargazers:35Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5363Issues:0Issues:0

Multi-LoRA-Composition

Repository for the Paper "Multi-LoRA Composition for Image Generation"

Language:PythonStargazers:403Issues:0Issues:0

Automate-CoT

Findings of EMNLP 2023

Language:PythonStargazers:7Issues:0Issues:0

UniTime

UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting (WWW 2024)

Language:PythonLicense:Apache-2.0Stargazers:46Issues:0Issues:0
Language:PythonStargazers:4Issues:0Issues:0

GradSafe

Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"

Language:PythonLicense:Apache-2.0Stargazers:23Issues:0Issues:0

ChemistryHTMLPaperParser

Convert HTML/XML Chemistry/Material Science articles into plain text

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

ConstraintChecker

Official code repository for the EACL2024 paper "ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases"

Language:Jupyter NotebookLicense:MITStargazers:8Issues:0Issues:0

Awesome-Scientific-Language-Models

A Curated List of Language Models in Scientific Domains

License:MITStargazers:285Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3755Issues:0Issues:0

Contamination_For_PreTraining

The source code for the paper contamination analysis for pre-training language models.

Language:PythonStargazers:6Issues:0Issues:0

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonLicense:MITStargazers:7516Issues:0Issues:0

magicoder

Magicoder: Source Code Is All You Need

Language:PythonLicense:MITStargazers:1912Issues:0Issues:0

CoDA_NeurIPS2023

Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection

Language:Jupyter NotebookLicense:MITStargazers:156Issues:0Issues:0

doremi

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language:HTMLLicense:MITStargazers:254Issues:0Issues:0

promptbench

A unified evaluation framework for large language models

Language:PythonLicense:MITStargazers:2210Issues:0Issues:0