yangmin09 (feymanpriv)

Company: BUPT

Location: Beijing

yangmin09's starred repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38228 | Issues: 381 | Issues: 1591

milvus

A cloud-native vector database, storage for next generation AI applications

Language: Go | License: Apache-2.0 | Stargazers: 27852 | Issues: 274 | Issues: 11052

open_clip

An open source implementation of CLIP.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 8970 | Issues: 78 | Issues: 444

glide-text2im

GLIDE: a diffusion-based text-conditional image synthesis model

Language: Python | License: MIT | Stargazers: 3493 | Issues: 164 | Issues: 44

pytorchvideo

A deep learning library for video understanding research.

Language: Python | License: Apache-2.0 | Stargazers: 3213 | Issues: 162 | Issues: 178

NUWA

A unified 3D Transformer Pipeline for visual synthesis

VQGAN-CLIP

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

Language: Python | License: NOASSERTION | Stargazers: 2586 | Issues: 54 | Issues: 120

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

bootcamp

Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.

Language: HTML | License: Apache-2.0 | Stargazers: 1691 | Issues: 32 | Issues: 260

pytorch-distributed

A quickstart and benchmark for pytorch distributed training.

Language: Python | License: MIT | Stargazers: 1580 | Issues: 16 | Issues: 20

CMake-Cookbook

:book: A Chinese translation of the "CMake Cookbook".

Language: Python | License: Apache-2.0 | Stargazers: 954 | Issues: 19 | Issues: 96

google-landmark

Dataset with 5 million images depicting human-made and natural landmarks spanning 200 thousand classes.

SLIP

Code release for SLIP: Self-supervision meets Language-Image Pre-training

Language: Python | License: MIT | Stargazers: 731 | Issues: 18 | Issues: 27

xcit

Official code for the Cross-Covariance Image Transformer (XCiT)

Language: Python | License: Apache-2.0 | Stargazers: 654 | Issues: 18 | Issues: 29

DenseCLIP

[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

self_supervised

Implementation of popular SOTA self-supervised learning algorithms as Fastai Callbacks.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 317 | Issues: 6 | Issues: 33

mvits_for_class_agnostic_od

[ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".

Language: Python | License: MIT | Stargazers: 298 | Issues: 8 | Issues: 30

pydegensac

Advanced RANSAC (DEGENSAC) with bells and whistles for H and F estimation

Language: C++ | License: MIT | Stargazers: 278 | Issues: 10 | Issues: 15

BriVL

Bridging Vision and Language Model

Language: Python | License: MIT | Stargazers: 276 | Issues: 4 | Issues: 13

zero-shot-image-to-text

Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic

Practical_Python_Programming_2021

BUPT course "Python Programming and Practice" (2021)

RerankingTransformer

[ICCV 2021] Instance-level Image Retrieval using Reranking Transformers

Language: Python | License: GPL-3.0 | Stargazers: 120 | Issues: 4 | Issues: 18

CLIPfa

CLIPfa: Connecting Farsi Text and Images

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 78 | Issues: 2 | Issues: 0

DSANet

[ACMMM'2021] DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

Language: Python | License: MIT | Stargazers: 49 | Issues: 1 | Issues: 1
Language: Python | License: NOASSERTION | Stargazers: 40 | Issues: 3 | Issues: 3