faizan170

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

Language:PythonNOASSERTION273600

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT2013900

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.04530400

SoundStorm

The reproduced code for Google's SoundStorm

Language:Python23400

mPLUG-Owl

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

Language:PythonMIT201000

yolov8-object-tracking

YOLOv8 Object Tracking Using PyTorch, OpenCV and Ultralytics

Language:PythonAGPL-3.024900

PaLM

An open-source implementation of Google's PaLM models

Language:PythonMIT79900

spleeter

Deezer source separation library including pretrained models.

Language:PythonMIT2524100

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonMIT16351200

Teenage-AGI

Language:PythonMIT89600

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03168600

MAXINE-AR-SDK

NVIDIA AR SDK - API headers and sample applications

Language:CMIT73200

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonMIT3435700

antispoofing

Language:PythonMIT5600

facetorch

Python library for analysing faces using PyTorch

Language:PythonApache-2.045200

Silent-Face-Anti-Spoofing

静默活体检测（Silent-Face-Anti-Spoofing）

Language:PythonApache-2.0128500

FaceBagNet

FaceBagNet - Patch-based Methods for Multi-modal Face Anti-spoofing (FAS)

Language:Python65100

faizan170

Faizan Amin's starred repositories

awesome-multimodal-in-medical-imaging

gpu-finder

gemma_pytorch

GPT-SoVITS

OpenVoice

LLM-Finetuning

VoiceTyping

ladi-vton

CDCN-Face-Anti-Spoofing.pytorch

WarpFusion

Flask-React-Google-Login

generative-models

faster-whisper

ijepa