Yilun Chen's starred repositories

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17585Issues:156Issues:1360

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8978Issues:78Issues:444
Language:PythonLicense:NOASSERTIONStargazers:8163Issues:152Issues:0

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Language:PythonLicense:NOASSERTIONStargazers:5004Issues:72Issues:184

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4440Issues:44Issues:121

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4367Issues:61Issues:338

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:3961Issues:55Issues:106

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonLicense:Apache-2.0Stargazers:3099Issues:40Issues:238

T-Rex

API for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Language:PythonLicense:NOASSERTIONStargazers:1962Issues:37Issues:67

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:1825Issues:17Issues:41

gpts-works

A Third-party GPTs store

Language:TypeScriptLicense:Apache-2.0Stargazers:1394Issues:8Issues:27

salt

Segment Anything Labelling Tool

Language:PythonLicense:MITStargazers:996Issues:9Issues:36

HuatuoGPT

HuatuoGPT, Towards Taming Language Models To Be a Doctor. (An Open Medical GPT)

Language:PythonLicense:Apache-2.0Stargazers:978Issues:19Issues:50

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonLicense:MITStargazers:590Issues:13Issues:51

wise-ft

Robust fine-tuning of zero-shot models

Language:PythonLicense:NOASSERTIONStargazers:586Issues:6Issues:25

nanosam

A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT

Language:PythonLicense:Apache-2.0Stargazers:566Issues:7Issues:24

RepoToTextForLLMs

Automate the analysis of GitHub repositories for LLMs with RepoToTextForLLMs. Fetch READMEs, structure, and non-binary files efficiently. Outputs include analysis prompts to aid in comprehensive repo evaluation

MiniGPT4-video

Official code for MiniGPT4-video

Language:PythonLicense:BSD-3-ClauseStargazers:432Issues:10Issues:27

ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Language:PythonLicense:NOASSERTIONStargazers:429Issues:15Issues:0

CLIP_Surgery

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Language:Jupyter NotebookStargazers:312Issues:5Issues:30

aistudio-copilot-sample

Sample quickstart repo for getting started building an enterprise chat copilot in Azure AI Studio

Language:PythonLicense:MITStargazers:306Issues:144Issues:36

Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Language:PythonLicense:MITStargazers:285Issues:2Issues:71

CLIPA

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Language:PythonLicense:Apache-2.0Stargazers:283Issues:13Issues:11

LabelConvert

🔄 A tool for object detection and image segmentation dataset format conversion.

Language:PythonLicense:Apache-2.0Stargazers:267Issues:4Issues:9

camera_calibration_tool

OpenCV-Python 相机标定及矫正,张正友相机标定法

jetson-intro-to-distillation

A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson

Language:PythonLicense:NOASSERTIONStargazers:131Issues:4Issues:2

Invariant-TemplateMatching

Rotation & scale invariant template matching

Language:PythonLicense:MITStargazers:95Issues:3Issues:3

books

WeChat Moments 小众图书馆