Kaidi Zhang's starred repositories

PyQt

PyQt Examples (various PyQt tests and examples) for PyQt4 and PyQt5

Language: Python | License: LGPL-2.1 | Stargazers: 6460 | Issues: 0

U-Mamba

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Language: Python | License: Apache-2.0 | Stargazers: 586 | Issues: 0

CrossGLG

The code for "CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner"

Stargazers: 3 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 12557 | Issues: 0

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2000 | Issues: 0

SimpleITK-Notebooks

Jupyter notebooks for learning how to use SimpleITK

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 817 | Issues: 0

onebot

OneBot: a unified application interface standard for chatbots

Language: CSS | License: MIT | Stargazers: 1679 | Issues: 0

iDVC

Digital Volume Correlation user interface

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 0

labelme

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Language: Python | License: NOASSERTION | Stargazers: 12875 | Issues: 0

Dicom-Viewer

An application displaying 2D/3D Dicom

Language: Python | License: MIT | Stargazers: 59 | Issues: 0

PyOCT

Image reconstruction and data processing for spectral-domain optical coherence tomography

Language: Python | License: MIT | Stargazers: 15 | Issues: 0

MedSAM

Segment Anything in Medical Images

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2489 | Issues: 0

pytorch-template

PyTorch deep learning projects made easy.

Language: Python | License: MIT | Stargazers: 4654 | Issues: 0

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 14296 | Issues: 0

segment-anything-with-clip

Segment Anything combined with CLIP

Language: Python | License: Apache-2.0 | Stargazers: 321 | Issues: 0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3108 | Issues: 0

InstMatt

Official repository for Instance Human Matting via Mutual Guidance and Multi-Instance Refinement

Language: Python | Stargazers: 99 | Issues: 0

MPEblink

[CVPR 2023] Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video

Language: Python | License: Apache-2.0 | Stargazers: 48 | Issues: 0

Language: Python | License: MIT | Stargazers: 29 | Issues: 0

DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 2099 | Issues: 0

GLEE

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python | License: MIT | Stargazers: 978 | Issues: 0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language: Python | License: MIT | Stargazers: 19007 | Issues: 0

NaViT

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Language: Python | License: MIT | Stargazers: 150 | Issues: 0

learning_research

My personal research experience

Stargazers: 4996 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 67 | Issues: 0

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Stargazers: 7253 | Issues: 0

ptp

[CVPR 2023] The code for "Position-guided Text Prompt for Vision-Language Pre-training"

Language: Python | License: Apache-2.0 | Stargazers: 147 | Issues: 0

Awesome-state-space-models

Collection of papers on state-space models

Stargazers: 487 | Issues: 0

mamba

The Fast Cross-Platform Package Manager

Language: C++ | License: BSD-3-Clause | Stargazers: 6587 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 23810 | Issues: 0