enod-bataa's starred repositories

realtalk

The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."

Stargazers:13Issues:0Issues:0

YOLOv6

YOLOv6: a single-stage object detection framework dedicated to industrial applications.

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:5620Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:11925Issues:0Issues:0

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6735Issues:0Issues:0

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language:PythonLicense:AGPL-3.0Stargazers:25923Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:135700Issues:0Issues:0

NeRF-SLAM

NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields. https://arxiv.org/abs/2210.13641 + Sigma-Fusion: Probabilistic Volumetric Fusion for Dense Monocular SLAM https://arxiv.org/abs/2210.01276

Language:PythonLicense:BSD-2-ClauseStargazers:1144Issues:0Issues:0

innvestigate

A toolbox to iNNvestigate neural networks' predictions!

Language:PythonLicense:NOASSERTIONStargazers:1241Issues:0Issues:0

flyte

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Language:GoLicense:Apache-2.0Stargazers:5193Issues:0Issues:0

sports

Cool experiments at the intersection of Computer Vision and Sports ⚽🏃

Language:Jupyter NotebookStargazers:457Issues:0Issues:0

sort

Simple, online, and realtime tracking of multiple objects in a video sequence.

Language:PythonLicense:GPL-3.0Stargazers:3822Issues:0Issues:0

MOTRv2

[CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors

Language:PythonLicense:NOASSERTIONStargazers:343Issues:0Issues:0

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:902Issues:0Issues:0

ternaus-cleantext

Cleans text as in the CLIP model

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

Multilingual-CLIP

OpenAI CLIP text encoders for multiple languages!

Language:Jupyter NotebookLicense:MITStargazers:730Issues:0Issues:0

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language:HTMLLicense:Apache-2.0Stargazers:7530Issues:0Issues:0

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6785Issues:0Issues:0

4K-NeRF

Official implementation of arxiv paper "4K-NeRF: High Fidelity Neural Radiance Fields at Ultra High Resolutions"

Language:PythonStargazers:376Issues:0Issues:0

vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Language:PythonLicense:MITStargazers:8225Issues:0Issues:0

neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Language:PythonLicense:Apache-2.0Stargazers:2077Issues:0Issues:0

label-studio-converter

Tools for converting Label Studio annotations into common dataset formats

Language:PythonStargazers:250Issues:0Issues:0

cc2dataset

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...

Language:PythonLicense:MITStargazers:301Issues:0Issues:0

pix2seq

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:837Issues:0Issues:0

sjc

Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation (CVPR 2023)

Language:PythonLicense:NOASSERTIONStargazers:499Issues:0Issues:0

tet

Implementation of Tracking Every Thing in the Wild, ECCV 2022

Language:PythonLicense:Apache-2.0Stargazers:90Issues:0Issues:0

s5cmd

Parallel S3 and local filesystem execution tool.

Language:GoLicense:MITStargazers:2444Issues:0Issues:0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language:PythonLicense:MITStargazers:1096Issues:0Issues:0

stylegan2-pytorch

Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement

Language:PythonLicense:MITStargazers:3668Issues:0Issues:0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonLicense:MITStargazers:18793Issues:0Issues:0

phenaki-pytorch

Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch

Language:PythonLicense:MITStargazers:734Issues:0Issues:0