Ayaan-Sharif

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | License: Apache-2.0 | 12120 206 2298

ml-engineering

Machine Learning Engineering Open Book

Language: Python | License: CC-BY-SA-4.0 | 11640 117 30

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Language: Python | License: MIT | 11545 154 352

pytorch3d

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Language: Python | License: NOASSERTION | 8806 148 1602

courses

Anthropic's educational courses

Language: Jupyter Notebook | License: NOASSERTION | 7988 62 17

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language: Jupyter Notebook | License: Apache-2.0 | 7125 76 212

facenet-pytorch

Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Language: Python | License: MIT | 4550 53 179

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language: Python | License: BSD-2-Clause | 3427 39 98

transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

Language: JavaScript | License: MIT | 3331 34 17

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language: Python | License: Apache-2.0 | 2521 43 389

video-diffusion-pytorch

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation, in PyTorch

Language: Python | License: MIT | 1253 28 35

mflux

An MLX port of FLUX based on the Hugging Face Diffusers implementation.

Language: Python | License: MIT | 963 16 51

DiffiT

[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation

454 56 4

slot-attention

Implementation of Slot Attention from GoogleAI

Language: Python | License: MIT | 394 11 7

Open-LLaVA-NeXT

An open-source implementation for training LLaVA-NeXT.

Language: Python | 393 7 21

RetinaFace-tf2

RetinaFace (RetinaFace: Single-stage Dense Face Localisation in the Wild, published in 2019) reimplemented in TensorFlow 2.0, with pretrained weights available!

Language: Python | License: MIT | 258 5 13

fullstack-assignment

Next.js and Django repo for assignments

Language: Python | License: AGPL-3.0 | 100