zhuyichen's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:24682Issues:208Issues:208
Language:PythonLicense:NOASSERTIONStargazers:8231Issues:153Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7410Issues:110Issues:150
Language:PythonLicense:Apache-2.0Stargazers:7030Issues:67Issues:69

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:5972Issues:36Issues:956

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5772Issues:46Issues:75

moondream

tiny vision language model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4613Issues:54Issues:98

lerobot

🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch

Language:PythonLicense:Apache-2.0Stargazers:4554Issues:51Issues:70

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4181Issues:47Issues:266

mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Language:Jupyter NotebookLicense:MITStargazers:3688Issues:71Issues:14

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:3453Issues:33Issues:457

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonLicense:MITStargazers:2184Issues:31Issues:31

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:1763Issues:21Issues:179
Language:CLicense:NOASSERTIONStargazers:1742Issues:40Issues:157

dora

DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.

Language:RustLicense:Apache-2.0Stargazers:1380Issues:27Issues:119

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonLicense:Apache-2.0Stargazers:1365Issues:23Issues:56

awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Language:PythonLicense:Apache-2.0Stargazers:947Issues:8Issues:9

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language:PythonLicense:MITStargazers:887Issues:9Issues:17

mamba-chat

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

Language:PythonLicense:Apache-2.0Stargazers:883Issues:6Issues:27

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

Language:PythonLicense:MITStargazers:562Issues:17Issues:25

Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:253Issues:10Issues:8

BLIVA

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

Language:PythonLicense:BSD-3-ClauseStargazers:252Issues:12Issues:24

model-metrics-plot

🔨🔨🔨(mmplot)used to draw graphs of multiple index parameters such as algorithm accuracy and speed of multiple deep learning models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:68Issues:5Issues:4
Language:PythonLicense:BSD-3-ClauseStargazers:50Issues:4Issues:1
Language:Jupyter NotebookStargazers:38Issues:2Issues:0