Ayaan-Sharif's starred repositories
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
awesome-computer-vision
A curated list of awesome computer vision resources
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
ml-engineering
Machine Learning Engineering Open Book
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
facenet-pytorch
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Liger-Kernel
Efficient Triton Kernels for LLM Training
transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
slot-attention
Implementation of Slot Attention from GoogleAI
Open-LLaVA-NeXT
An open-source implementation for training LLaVA-NeXT.
RetinaFace-tf2
RetinaFace (RetinaFace: Single-stage Dense Face Localisation in the Wild, published in 2019) reimplemented in Tensorflow 2.0, with pretrained weights available !
fullstack-assignment
Nexxtjs and django repo for assignments