Dickachu Yang's starred repositories

gpt4all

gpt4all: run open-source LLMs anywhere

OpenDevin

šŸš OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 26064 | Issues: 286 | Issues: 931

dspy

DSPy: The framework for programming, not prompting, foundation models

Language: Python | License: MIT | Stargazers: 11803 | Issues: 114 | Issues: 474

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

Language: Python | License: MIT | Stargazers: 10525 | Issues: 78 | Issues: 217

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 10335 | Issues: 152 | Issues: 156

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Python | License: GPL-3.0 | Stargazers: 3588 | Issues: 33 | Issues: 291

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3556 | Issues: 111 | Issues: 60

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python | License: NOASSERTION | Stargazers: 2559 | Issues: 36 | Issues: 128

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language: Python | License: NOASSERTION | Stargazers: 2437 | Issues: 31 | Issues: 41

RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Language: Python | License: Apache-2.0 | Stargazers: 2245 | Issues: 21 | Issues: 143

T-Rex

API for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Language: Python | License: NOASSERTION | Stargazers: 1926 | Issues: 39 | Issues: 56

aici

AICI: Prompts as (Wasm) Programs

Language: Rust | License: MIT | Stargazers: 1786 | Issues: 19 | Issues: 74

BrushNet

The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Language: Python | License: NOASSERTION | Stargazers: 1036 | Issues: 45 | Issues: 35

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language: Python | License: Apache-2.0 | Stargazers: 1014 | Issues: 38 | Issues: 8

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language: Python | License: NOASSERTION | Stargazers: 695 | Issues: 3 | Issues: 56

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Language: Python | License: Apache-2.0 | Stargazers: 534 | Issues: 16 | Issues: 5

long-form-factuality

Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".

Language: Python | License: NOASSERTION | Stargazers: 465 | Issues: 10 | Issues: 1

vitaGL

OpenGL wrapper for PS Vita.

Language: C | License: LGPL-3.0 | Stargazers: 437 | Issues: 21 | Issues: 34

megalodon

Reference implementation of Megalodon 7B model

Language: Cuda | License: MIT | Stargazers: 385 | Issues: 9 | Issues: 6

DINOv

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

SEED-X

Multimodal Models in Real World

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 246 | Issues: 15 | Issues: 2

cobra

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Language: Python | License: MIT | Stargazers: 193 | Issues: 0 | Issues: 0

ARENA_2.0

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

FastV

Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

WSDM-Cup-2024

1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc

Griffon

The official repo of Griffon

Language: Python | License: Apache-2.0 | Stargazers: 75 | Issues: 2 | Issues: 8

ovtrack

OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 66 | Issues: 4 | Issues: 14

transformers-crash-course

A collection of tutorials and notebooks explaining transformer models in deep learning.

Language: Jupyter Notebook | License: MIT | Stargazers: 58 | Issues: 1 | Issues: 0

InstaGen

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024

Language: Jupyter Notebook | License: MIT | Stargazers: 48 | Issues: 4 | Issues: 2