Beast code in Giters

shizhediao's starred repositories

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:Python33800

RLHFlow.github.io

Webpage for RLHFlow

Language:HTML700

HPT

HPT - Open Multimodal LLMs from HyperGAI

Language:PythonApache-2.029000

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonBSD-3-Clause333100

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION2145200

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonApache-2.0806600

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonApache-2.022900

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonMIT2682000

bootstrapped-preference-optimization-BPO-

code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"

Language:PythonApache-2.02400

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonApache-2.01059200

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonMIT845200

sleeper-agents-paper

Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".

7100

MLLM-protector

The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"

Language:PythonApache-2.02700

Directional-Preference-Alignment

Directional Preference Alignment

Apache-2.03500

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION536300

Multi-LoRA-Composition

Repository for the Paper "Multi-LoRA Composition for Image Generation"

Language:Python40300

Automate-CoT

Findings of EMNLP 2023

Language:Python700

UniTime

UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting (WWW 2024)

Language:PythonApache-2.04600

R2PE

Language:Python400

GradSafe

Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"

Language:PythonApache-2.02300

ChemistryHTMLPaperParser

Convert HTML/XML Chemistry/Material Science articles into plain text

Language:PythonMIT600

ConstraintChecker

Official code repository for the EACL2024 paper "ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases"

Language:Jupyter NotebookMIT800

Awesome-Scientific-Language-Models

A Curated List of Language Models in Scientific Domains

MIT28500

alphageometry

Language:PythonApache-2.0375500

Contamination_For_PreTraining

The source code for the paper contamination analysis for pre-training language models.

Language:Python600

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonMIT751600

magicoder

Magicoder: Source Code Is All You Need

Language:PythonMIT191200

CoDA_NeurIPS2023

Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection

Language:Jupyter NotebookMIT15600

doremi

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language:HTMLMIT25400

promptbench

A unified evaluation framework for large language models

Language:PythonMIT221000