Vinson's starred repositories

rustdesk

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Language:RustLicense:AGPL-3.0Stargazers:69210Issues:460Issues:2916

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:34039Issues:317Issues:424

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:22096Issues:397Issues:636

plotly.py

The interactive graphing library for Python :sparkles: This project now includes Plotly Express!

Language:PythonLicense:MITStargazers:15799Issues:272Issues:2893

albumentations

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Language:PythonLicense:MITStargazers:13836Issues:129Issues:963

dask

Parallel computing with task scheduling

Language:PythonLicense:BSD-3-ClauseStargazers:12293Issues:212Issues:5101

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:11978Issues:98Issues:449

statsmodels

Statsmodels: statistical modeling and econometrics in Python

Language:PythonLicense:BSD-3-ClauseStargazers:9799Issues:282Issues:5412

FreeAskInternet

FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to LLM and generate the answer based on search results. It's all FREE to use.

Language:PythonLicense:Apache-2.0Stargazers:8361Issues:56Issues:77
Language:PythonLicense:Apache-2.0Stargazers:7039Issues:67Issues:69

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3895Issues:114Issues:73

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:PythonLicense:Apache-2.0Stargazers:2677Issues:30Issues:102

s4

Structured state space sequence models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2292Issues:52Issues:132

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2011Issues:24Issues:64

VMamba

VMamba: Visual State Space Models,code is based on mamba

Language:PythonLicense:MITStargazers:1909Issues:16Issues:275

Unicorn

[ECCV'22 Oral] Towards Grand Unification of Object Tracking

Language:PythonLicense:MITStargazers:951Issues:20Issues:45

sam-pt

SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

Language:PythonLicense:Apache-2.0Stargazers:944Issues:41Issues:34

Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks

License:MITStargazers:710Issues:36Issues:0

Cutie

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Language:PythonLicense:MITStargazers:623Issues:3Issues:89

ProFusion

Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:461Issues:16Issues:20

Matcher

[ICLR'24] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching

Language:PythonLicense:MITStargazers:406Issues:28Issues:26
Language:PythonLicense:MITStargazers:309Issues:7Issues:55

VideoFlow

Official implementation of ICCV2023 VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation

owlvit_segment_anything

Combining OwlViT with Segment Anything - Open-vocabulary Detection and Segmentation (Text-conditioned, and Image-conditioned)

Language:Jupyter NotebookLicense:MITStargazers:142Issues:3Issues:4

cocoapi

Contains the "pycocotools" package on PyPI. Changes made to the official cocoapi about packaging.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:123Issues:6Issues:0

meru

Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023

Language:PythonLicense:NOASSERTIONStargazers:120Issues:9Issues:7

udvd

Unsupervised Deep Video Denoising, ICCV 2021

Language:Jupyter NotebookLicense:MITStargazers:73Issues:5Issues:9

selective_search

Python implementation of selective search

Language:PythonLicense:MITStargazers:48Issues:2Issues:4
Language:PythonStargazers:1Issues:0Issues:0