Yilun Chen's starred repositories

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonLicense:NOASSERTIONStargazers:12469Issues:0Issues:0

camera_calibration_tool

OpenCV-Python 相机标定及矫正,张正友相机标定法

Language:PythonStargazers:145Issues:0Issues:0

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language:PythonLicense:MITStargazers:611Issues:0Issues:0

MiniGPT4-video

Official code for MiniGPT4-video

Language:PythonLicense:BSD-3-ClauseStargazers:440Issues:0Issues:0

T-Rex

API for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Language:PythonLicense:NOASSERTIONStargazers:1977Issues:0Issues:0

aistudio-copilot-sample

Sample quickstart repo for getting started building an enterprise chat copilot in Azure AI Studio

Language:PythonLicense:MITStargazers:307Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:8191Issues:0Issues:0

books

WeChat Moments 小众图书馆

Stargazers:10Issues:0Issues:0

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4478Issues:0Issues:0

RepoToTextForLLMs

Automate the analysis of GitHub repositories for LLMs with RepoToTextForLLMs. Fetch READMEs, structure, and non-binary files efficiently. Outputs include analysis prompts to aid in comprehensive repo evaluation

Language:PythonStargazers:547Issues:0Issues:0

ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Language:GoLicense:MITStargazers:76663Issues:0Issues:0

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:1852Issues:0Issues:0

HuatuoGPT

HuatuoGPT, Towards Taming Language Models To Be a Doctor. (An Open Medical GPT)

Language:PythonLicense:Apache-2.0Stargazers:993Issues:0Issues:0

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Language:PythonLicense:NOASSERTIONStargazers:5071Issues:0Issues:0

ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Language:PythonLicense:NOASSERTIONStargazers:467Issues:0Issues:0
Stargazers:294Issues:0Issues:0

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:4003Issues:0Issues:0

Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Language:PythonLicense:MITStargazers:296Issues:0Issues:0

Invariant-TemplateMatching

Rotation & scale invariant template matching

Language:PythonLicense:MITStargazers:98Issues:0Issues:0

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4486Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17901Issues:0Issues:0

LabelConvert

🔄 A tool for object detection and image segmentation dataset format conversion.

Language:PythonLicense:Apache-2.0Stargazers:269Issues:0Issues:0

wise-ft

Robust fine-tuning of zero-shot models

Language:PythonLicense:NOASSERTIONStargazers:591Issues:0Issues:0

CLIP_Surgery

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Language:Jupyter NotebookStargazers:315Issues:0Issues:0

CLIPA

[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"

Language:PythonLicense:Apache-2.0Stargazers:285Issues:0Issues:0

gpts-works

A Third-party GPTs store

Language:TypeScriptLicense:Apache-2.0Stargazers:1405Issues:0Issues:0

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:9088Issues:0Issues:0

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonLicense:Apache-2.0Stargazers:3124Issues:0Issues:0

salt

Segment Anything Labelling Tool

Language:PythonLicense:MITStargazers:998Issues:0Issues:0

nanosam

A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT

Language:PythonLicense:Apache-2.0Stargazers:571Issues:0Issues:0