stwrd's starred repositories

ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

Language:TypeScriptLicense:MITStargazers:74803Issues:408Issues:2984

sd-webui-controlnet

WebUI extension for ControlNet

Language:PythonLicense:GPL-3.0Stargazers:16793Issues:146Issues:1493

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10741Issues:125Issues:217

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonLicense:GPL-3.0Stargazers:8820Issues:55Issues:510

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Language:PythonLicense:MITStargazers:5449Issues:49Issues:525

moondream

tiny vision language model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4874Issues:51Issues:113

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language:PythonLicense:MITStargazers:4311Issues:35Issues:321

pytorchvideo

A deep learning library for video understanding research.

Language:PythonLicense:Apache-2.0Stargazers:3277Issues:156Issues:181

recognize-anything

Open-source and strong foundation image recognition models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2738Issues:26Issues:156

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language:PythonLicense:Apache-2.0Stargazers:2444Issues:41Issues:382

hiera

Hiera: A fast, powerful, and simple hierarchical vision transformer.

Language:PythonLicense:Apache-2.0Stargazers:852Issues:19Issues:33

GODEL

Large-scale pretrained models for goal-directed dialog

Language:PythonLicense:MITStargazers:848Issues:20Issues:32

GenerativeImage2Text

GIT: A Generative Image-to-text Transformer for Vision and Language

Language:PythonLicense:MITStargazers:542Issues:9Issues:58

NeuralCompression

A collection of tools for neural compression enthusiasts.

Language:PythonLicense:MITStargazers:497Issues:21Issues:71

query2labels

Official implementation of paper "Query2Label: A Simple Transformer Way to Multi-Label Classification".

Language:PythonLicense:MITStargazers:405Issues:4Issues:57

ML_Decoder

Official PyTorch implementation of "ML-Decoder: Scalable and Versatile Classification Head" (2021)

Language:PythonLicense:MITStargazers:315Issues:3Issues:67

TrafficFlowForecasting

Some TrafficFlowForecasting Solutions(交通流量预测解决方案)

UniFormerV2

[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

Language:PythonLicense:Apache-2.0Stargazers:282Issues:7Issues:76

ClipCap-Chinese

基于ClipCap的看图说话Image Caption模型

catr

Image Captioning Using Transformer

Language:PythonLicense:Apache-2.0Stargazers:255Issues:4Issues:26

HumanBench

This repo is official implementation of HumanBench (CVPR2023)

Language:PythonLicense:MITStargazers:229Issues:10Issues:20

Awesome-Multi-label-Image-Recognition

Awesome Multi-label Image Recognition Paper List

UniHCP

Official PyTorch implementation of UniHCP

Language:PythonLicense:MITStargazers:147Issues:4Issues:16

bioclip

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Language:PythonLicense:NOASSERTIONStargazers:142Issues:15Issues:9

ExpansionNet_v2

Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"

Language:PythonLicense:MITStargazers:83Issues:5Issues:17

python-speech-enhancement

a python library for speech enhancement

Language:PythonLicense:BSD-3-ClauseStargazers:70Issues:2Issues:2

awesome-colab-project

Awesome Colab Projects Collection

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:24Issues:2Issues:0

Awesome-Multi-label-Image-Recognition

Awesome Multi-label Image Recognition Paper List

CLIPCap

ClipCap implementation with API for an easy inference + baselines for the HL Dataset

Language:PythonLicense:Apache-2.0Stargazers:5Issues:1Issues:1