austingg

Yubin Wang's starred repositories

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonMIT36268 368 315

LLM101n

LLM101n: Let's build a Storyteller

28358 20950

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.021636 182 478

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

11768 269 109

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.010820 64 243

rathole

A lightweight and high-performance reverse proxy for NAT traversal, written in Rust. An alternative to frp and ngrok.

Language:RustApache-2.09371 63 217

ChatTTS-ui

一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

Language:PythonNOASSERTION5888 39 221

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonApache-2.03739 33 508

build-nanogpt

Video+code lecture on building nanoGPT from scratch

Language:Python3403 34 19

stack

Open-source Auth0/Clerk alternative

Language:TypeScriptNOASSERTION3167 12 52

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

GPL-3.02496 82 6

LLaVA-NeXT

Language:PythonApache-2.02419 32 221

Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.

Language:Jupyter NotebookMIT2248 16 62

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonApache-2.01683 23 65

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language:PythonNOASSERTION1170 12 27

Awesome-ChatTTS

ChatTTS资源大全，免费体验地址，音色库等

1132 100

VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks

Language:PythonApache-2.01030 10 156

dataline

Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...

Language:TypeScriptGPL-3.0747 10 136

Llama3-Tutorial

Llama3-Tutorial（XTuner、LMDeploy、OpenCompass）

Language:Python479 10 7

py-pkgs

Open source book about making Python packages.

Language:Jupyter NotebookNOASSERTION288 12 88

MMVP

Language:Python277 10 26

OmniCorpus

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Language:Python246 13 3

RLAIF-V

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Language:Python200 4 27

GlyphControl-release

[NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"

Language:PythonMIT199 4 13

TACO

Language:PythonApache-2.0131 6 11

llm-structured-output-benchmarks

Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on tasks like multi-label classification, named entity recognition, synthetic data generation, etc.

Language:PythonApache-2.0118 5 1

TensorHue

TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor contents easier.

Language:Python101 3 2

CVLface

Language:PythonMIT4500

NCL-IML

Offical implement of NCL-IML (Pre-training-free Image Manipulation Localization through Non-Mutually Contrastive Learning), ICCV2023

Language:Jupyter Notebook38 2 6

ocr-dataset-rendering

Language:PythonMIT15 10