Kaidi Zhang's starred repositories

llama-stack

Model components of the Llama Stack APIs

Language:PythonLicense:MITStargazers:3127Issues:0Issues:0

A2J-Transformer

[CVPR 2023] Code for paper 'A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation from a Single RGB Image'

Language:PythonLicense:Apache-2.0Stargazers:90Issues:0Issues:0

detr

End-to-End Object Detection with Transformers

Language:PythonLicense:Apache-2.0Stargazers:13427Issues:0Issues:0

tiny_computer

Click-to-run debian 12 xfce on android for Chinese users, with fcitx pinyin input method and some useful packages preinstalled. No termux required.

Language:CLicense:GPL-3.0Stargazers:973Issues:0Issues:0

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:1783Issues:0Issues:0

sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:11317Issues:0Issues:0

langchain-tutorials

A set of LangChain Tutorials from my youtube channel

Language:Jupyter NotebookStargazers:1327Issues:0Issues:0

CVPR2024-Papers-with-Code

CVPR 2024 论文和开源项目合集

Stargazers:17861Issues:0Issues:0

HDSA-reID

Official code for paper “Person Re-identification with Hierarchical Discriminative Spatial Aggregation”

Language:PythonLicense:AGPL-3.0Stargazers:2Issues:0Issues:0

PyQt

PyQt Examples(PyQt各种测试和例子) PyQt4 PyQt5

Language:PythonLicense:LGPL-2.1Stargazers:6598Issues:0Issues:0

U-Mamba

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Language:PythonLicense:Apache-2.0Stargazers:654Issues:0Issues:0

CrossGLG

The code for "CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner"

Stargazers:4Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:18538Issues:0Issues:0

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2104Issues:0Issues:0

SimpleITK-Notebooks

Jupyter notebooks for learning how to use SimpleITK

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:831Issues:0Issues:0

onebot

OneBot:统一的聊天机器人应用接口标准

Language:CSSLicense:MITStargazers:1739Issues:0Issues:0

iDVC

Digital Volume Correlation user interface

Language:PythonLicense:Apache-2.0Stargazers:5Issues:0Issues:0

labelme

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Language:PythonLicense:NOASSERTIONStargazers:13274Issues:0Issues:0

Dicom-Viewer

An application displaying 2D/3D Dicom

Language:PythonLicense:MITStargazers:59Issues:0Issues:0

PyOCT

Image reconstruction and data processing for spectral-domain optical coherence tomography

Language:PythonLicense:MITStargazers:15Issues:0Issues:0

MedSAM

Segment Anything in Medical Images

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2846Issues:0Issues:0

pytorch-template

PyTorch deep learning projects made easy.

Language:PythonLicense:MITStargazers:4714Issues:0Issues:0

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14879Issues:0Issues:0

segment-anything-with-clip

Segment Anything combined with CLIP

Language:PythonLicense:Apache-2.0Stargazers:328Issues:0Issues:0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3189Issues:0Issues:0

InstMatt

Official repository for Instance Human Matting via Mutual Guidance and Multi-Instance Refinement

Language:PythonStargazers:101Issues:0Issues:0

MPEblink

[CVPR 2023] Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video

Language:PythonLicense:Apache-2.0Stargazers:49Issues:0Issues:0
Language:PythonLicense:MITStargazers:33Issues:0Issues:0

DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:2189Issues:0Issues:0

GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Language:PythonLicense:MITStargazers:1040Issues:0Issues:0