enod-bataa's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:134630Issues:1047Issues:7487

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language:PythonLicense:AGPL-3.0Stargazers:25365Issues:151Issues:7588

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonLicense:MITStargazers:18606Issues:144Issues:258

triton

Development repository for the Triton language and compiler

vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Language:PythonLicense:MITStargazers:8215Issues:143Issues:1209

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language:HTMLLicense:Apache-2.0Stargazers:7303Issues:51Issues:1011

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6739Issues:59Issues:137

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6699Issues:98Issues:703

YOLOv6

YOLOv6: a single-stage object detection framework dedicated to industrial applications.

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:5602Issues:62Issues:786

flyte

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Language:GoLicense:Apache-2.0Stargazers:5151Issues:258Issues:3042

sort

Simple, online, and realtime tracking of multiple objects in a video sequence.

Language:PythonLicense:GPL-3.0Stargazers:3804Issues:73Issues:156

stylegan2-pytorch

Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement

Language:PythonLicense:MITStargazers:3665Issues:70Issues:259

s5cmd

Parallel S3 and local filesystem execution tool.

neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Language:PythonLicense:Apache-2.0Stargazers:2063Issues:34Issues:187

innvestigate

A toolbox to iNNvestigate neural networks' predictions!

Language:PythonLicense:NOASSERTIONStargazers:1237Issues:35Issues:258

NeRF-SLAM

NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields. https://arxiv.org/abs/2210.13641 + Sigma-Fusion: Probabilistic Volumetric Fusion for Dense Monocular SLAM https://arxiv.org/abs/2210.01276

Language:PythonLicense:BSD-2-ClauseStargazers:1138Issues:27Issues:66

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language:PythonLicense:MITStargazers:1083Issues:24Issues:76

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:885Issues:71Issues:22

pix2seq

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:835Issues:18Issues:48

phenaki-pytorch

Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch

Language:PythonLicense:MITStargazers:731Issues:39Issues:30

Multilingual-CLIP

OpenAI CLIP text encoders for multiple languages!

Language:Jupyter NotebookLicense:MITStargazers:722Issues:19Issues:27

sjc

Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation (CVPR 2023)

Language:PythonLicense:NOASSERTIONStargazers:497Issues:20Issues:29

sports

Cool experiments at the intersection of Computer Vision and Sports ⚽🏃

Language:Jupyter NotebookStargazers:455Issues:12Issues:3

4K-NeRF

Official implementation of arxiv paper "4K-NeRF: High Fidelity Neural Radiance Fields at Ultra High Resolutions"

MOTRv2

[CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors

Language:PythonLicense:NOASSERTIONStargazers:338Issues:8Issues:70

cc2dataset

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...

Language:PythonLicense:MITStargazers:300Issues:9Issues:33

label-studio-converter

Tools for converting Label Studio annotations into common dataset formats

tet

Implementation of Tracking Every Thing in the Wild, ECCV 2022

Language:PythonLicense:Apache-2.0Stargazers:90Issues:13Issues:6

realtalk

The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."

ternaus-cleantext

Cleans text as in the CLIP model

Language:PythonLicense:MITStargazers:2Issues:2Issues:0