Yilun Chen's starred repositories

ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18296Issues:158Issues:1409

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonLicense:NOASSERTIONStargazers:12866Issues:112Issues:839

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9290Issues:76Issues:454
Language:PythonLicense:NOASSERTIONStargazers:8224Issues:152Issues:0

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Language:PythonLicense:NOASSERTIONStargazers:5170Issues:73Issues:191

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4652Issues:60Issues:356

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4531Issues:46Issues:121

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:4062Issues:53Issues:113

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonLicense:Apache-2.0Stargazers:3159Issues:39Issues:243

T-Rex

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Language:PythonLicense:NOASSERTIONStargazers:2016Issues:36Issues:75

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:1895Issues:18Issues:43

gpts-works

A Third-party GPTs store

Language:TypeScriptLicense:Apache-2.0Stargazers:1419Issues:8Issues:27

HuatuoGPT

HuatuoGPT, Towards Taming Language Models To Be a Doctor. (An Open Medical GPT)

Language:PythonLicense:Apache-2.0Stargazers:1006Issues:20Issues:50

salt

Segment Anything Labelling Tool

Language:PythonLicense:MITStargazers:1002Issues:9Issues:37

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonLicense:MITStargazers:629Issues:13Issues:52

wise-ft

Robust fine-tuning of zero-shot models

Language:PythonLicense:NOASSERTIONStargazers:607Issues:6Issues:25

nanosam

A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT

Language:PythonLicense:Apache-2.0Stargazers:585Issues:7Issues:26

RepoToTextForLLMs

Automate the analysis of GitHub repositories for LLMs with RepoToTextForLLMs. Fetch READMEs, structure, and non-binary files efficiently. Outputs include analysis prompts to aid in comprehensive repo evaluation

ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Language:PythonLicense:NOASSERTIONStargazers:489Issues:15Issues:0

MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Language:PythonLicense:BSD-3-ClauseStargazers:479Issues:12Issues:30

CLIP_Surgery

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Language:Jupyter NotebookStargazers:325Issues:5Issues:30

Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Language:PythonLicense:MITStargazers:323Issues:2Issues:75

aistudio-copilot-sample

Sample quickstart repo for getting started building an enterprise chat copilot in Azure AI Studio

Language:PythonLicense:MITStargazers:310Issues:143Issues:36

CoSeR

[CVPR 2024] CoSeR: Bridging Image and Language for Cognitive Super-Resolution

CLIPA

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Language:PythonLicense:Apache-2.0Stargazers:289Issues:13Issues:11

LabelConvert

🔄 A tool for object detection and image segmentation dataset format conversion.

Language:PythonLicense:Apache-2.0Stargazers:276Issues:4Issues:9

camera_calibration_tool

OpenCV-Python 相机标定及矫正,张正友相机标定法

Invariant-TemplateMatching

Rotation & scale invariant template matching

Language:PythonLicense:MITStargazers:101Issues:3Issues:3

books

WeChat Moments 小众图书馆