Jiaxian Guo (CR-Gjx)

CR-Gjx

Geek Repo

Company:The University of Tokyo

Location:Tokyo, Japan

Home Page:https://cr-gjx.github.io/

Github PK Tool:Github PK Tool

Jiaxian Guo's starred repositories

VL-RLHF

A RLHF Infrastructure for Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:65Issues:0Issues:0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:4257Issues:0Issues:0

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language:PythonLicense:MITStargazers:549Issues:0Issues:0

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language:PythonLicense:NOASSERTIONStargazers:1577Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13881Issues:0Issues:0

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Language:PythonStargazers:757Issues:0Issues:0

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonLicense:Apache-2.0Stargazers:474Issues:0Issues:0

Vitron

A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Language:PythonStargazers:262Issues:0Issues:0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:4338Issues:0Issues:0

PLLaVA

Official repository for the paper PLLaVA

Language:PythonStargazers:492Issues:0Issues:0

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonLicense:Apache-2.0Stargazers:2029Issues:0Issues:0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonLicense:NOASSERTIONStargazers:2630Issues:0Issues:0

orpo

Official repository for ORPO

Language:PythonLicense:Apache-2.0Stargazers:383Issues:0Issues:0
Language:PythonStargazers:54Issues:0Issues:0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3104Issues:0Issues:0

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language:PythonLicense:Apache-2.0Stargazers:670Issues:0Issues:0

ToolDec

Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding

Language:PythonLicense:MITStargazers:28Issues:0Issues:0
Language:PythonStargazers:47Issues:0Issues:0

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonLicense:NOASSERTIONStargazers:1135Issues:0Issues:0

lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Language:PythonLicense:MITStargazers:138Issues:0Issues:0

RLHF-APA

RL algorithm: Advantage induced policy alignment

Language:PythonLicense:MITStargazers:62Issues:0Issues:0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1856Issues:0Issues:0

LlamaGym

Fine-tune LLM agents with online reinforcement learning

Language:PythonLicense:MITStargazers:949Issues:0Issues:0

Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Language:Jupyter NotebookLicense:MITStargazers:3985Issues:0Issues:0

LLM-FineTuning-Large-Language-Models

LLM (Large Language Model) FineTuning

Language:Jupyter NotebookStargazers:408Issues:0Issues:0

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Language:PythonLicense:MITStargazers:477Issues:0Issues:0

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:10285Issues:0Issues:0

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language:PythonLicense:Apache-2.0Stargazers:3964Issues:0Issues:0

KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Language:PythonLicense:NOASSERTIONStargazers:1048Issues:0Issues:0

corr2cause

Data and code for the Corr2Cause paper (ICLR 2024)

Language:PythonLicense:MITStargazers:78Issues:0Issues:0