Ayaan-Sharif's starred repositories

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:35646Issues:210Issues:1324

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:30541Issues:391Issues:3513

LLM101n

LLM101n: Let's build a Storyteller

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language:PythonLicense:MITStargazers:26855Issues:273Issues:790

awesome-computer-vision

A curated list of awesome computer vision resources

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:20236Issues:156Issues:1535

mlx

MLX: An array framework for Apple silicon

flux

Official inference repo for FLUX.1 models

Language:PythonLicense:Apache-2.0Stargazers:15898Issues:143Issues:156

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Language:PythonLicense:MITStargazers:14672Issues:151Issues:1131

exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Language:PythonLicense:GPL-3.0Stargazers:14670Issues:107Issues:289

facenet

Face recognition using Tensorflow

Language:PythonLicense:MITStargazers:13824Issues:562Issues:1129

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:12593Issues:103Issues:576

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:12120Issues:206Issues:2298

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:11640Issues:117Issues:30

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11545Issues:154Issues:352

pytorch3d

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Language:PythonLicense:NOASSERTIONStargazers:8806Issues:148Issues:1602

courses

Anthropic's educational courses

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7988Issues:62Issues:17

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7125Issues:76Issues:212

facenet-pytorch

Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Language:PythonLicense:MITStargazers:4550Issues:53Issues:179

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language:PythonLicense:BSD-2-ClauseStargazers:3427Issues:39Issues:98

transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

Language:JavaScriptLicense:MITStargazers:3331Issues:34Issues:17

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language:PythonLicense:Apache-2.0Stargazers:2521Issues:43Issues:389

video-diffusion-pytorch

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Language:PythonLicense:MITStargazers:1253Issues:28Issues:35

mflux

A MLX port of FLUX based on the Huggingface Diffusers implementation.

Language:PythonLicense:MITStargazers:963Issues:16Issues:51

DiffiT

[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation

slot-attention

Implementation of Slot Attention from GoogleAI

Language:PythonLicense:MITStargazers:394Issues:11Issues:7

Open-LLaVA-NeXT

An open-source implementation for training LLaVA-NeXT.

RetinaFace-tf2

RetinaFace (RetinaFace: Single-stage Dense Face Localisation in the Wild, published in 2019) reimplemented in Tensorflow 2.0, with pretrained weights available !

Language:PythonLicense:MITStargazers:258Issues:5Issues:13

fullstack-assignment

Nexxtjs and django repo for assignments

Language:PythonLicense:AGPL-3.0Stargazers:1Issues:0Issues:0