Beast code in Giters

Jiaxian Guo's starred repositories

VL-RLHF

A RLHF Infrastructure for Vision-Language Models

Language:PythonApache-2.06500

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonApache-2.0425700

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonMIT54900

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language:PythonNOASSERTION157700

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT1388100

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Language:Python75700

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonApache-2.047400

Vitron

A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Language:Python26200

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Language:PythonMIT433800

PLLaVA

Official repository for the paper PLLaVA

Language:Python49200

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonApache-2.0202900

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonNOASSERTION263000

orpo

Official repository for ORPO

Language:PythonApache-2.038300

Next-Token-Failures

Language:Python5400

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonApache-2.0310400

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language:PythonApache-2.067000

ToolDec

Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding

Language:PythonMIT2800

Muffin

Language:Python4700

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonNOASSERTION113500

lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Language:PythonMIT13800

RLHF-APA

RL algorithm: Advantage induced policy alignment

Language:PythonMIT6200

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonApache-2.0185600

LlamaGym

Fine-tune LLM agents with online reinforcement learning

Language:PythonMIT94900

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language:Jupyter NotebookMIT398500

LLM-FineTuning-Large-Language-Models

LLM (Large Language Model) FineTuning

Language:Jupyter Notebook40800

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Language:PythonMIT47700

ml-engineering

Machine Learning Engineering Open Book

Language:PythonCC-BY-SA-4.01028500

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language:PythonApache-2.0396400

KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Language:PythonNOASSERTION104800

corr2cause

Data and code for the Corr2Cause paper (ICLR 2024)

Language:PythonMIT7800