awsaf49

followers

following

stars

@Google

Dhaka, Bangladesh

https://awsaf49.github.io

Awsaf's starred repositories

grok-1

Grok open release

Language:PythonApache-2.049196 561 202

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT27562 209 212

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION23770 198 200

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.020868 180 403

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookNOASSERTION7266 88 112

LWM

Language:PythonApache-2.07028 66 68

flax

Flax is a neural network library for JAX that is designed for flexibility.

Language:PythonApache-2.05832 87 865

moondream

tiny vision language model

Language:Jupyter NotebookApache-2.04590 54 98

dm-haiku

JAX-based neural network library

Language:PythonApache-2.02845 39 249

MLQuestions

Machine Learning and Computer Vision Engineer - Technical Interview Questions

uvadlc_notebooks

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Language:Jupyter NotebookMIT2349 33 86

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonMIT1896 18 43

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language:PythonBSD-3-Clause1378 22 38

torchtitan

A native PyTorch Library for large model training

Language:PythonBSD-3-Clause1367 36 111

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonApache-2.01359 23 56

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonNOASSERTION1140 4 84

sam-pt

SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

Language:PythonApache-2.0938 41 34

keras-nlp

Modular Natural Language Processing workflows with Keras

Language:PythonApache-2.0740 30 608

Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks

MIT696 350

awesome-visual-question-answering

A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.

fast-DiT

Fast Diffusion Models with Transformers

Language:PythonNOASSERTION629 7 11

sdxs

Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"

Language:PythonApache-2.0573 26 16

ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Language:PythonNOASSERTION490 150

DoRA

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Language:PythonNOASSERTION472 9 13

DesignEdit

Code for DesignEdit

Language:PythonMIT286 9 6

ml-veclip

The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"

Language:Jupyter NotebookNOASSERTION204 150

ml-tic-clip

Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".

Language:PythonNOASSERTION88 150

ml-mofi

Language:PythonNOASSERTION56 100

diffuseMix

Language:Python48 1 3

diffusion_memorization

Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)

Language:Python3000