Homayoun's starred repositories

whisper.cpp

Port of OpenAI's Whisper model in C/C++

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | License: MIT | Stargazers: 18344 | Issues: 116 | Issues: 507

web-llm

High-performance In-browser LLM Inference Engine

Language: TypeScript | License: Apache-2.0 | Stargazers: 11896 | Issues: 113 | Issues: 270

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 8134 | Issues: 75 | Issues: 320

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | Stargazers: 7418 | Issues: 45 | Issues: 517

instructor

Structured outputs for LLMs

Language: Python | License: MIT | Stargazers: 6892 | Issues: 48 | Issues: 258

corenet

CoreNet: A library for training deep neural networks

Language: Python | License: NOASSERTION | Stargazers: 6850 | Issues: 62 | Issues: 19

awesome-generative-ai-guide

A one-stop repository for generative AI research updates, interview resources, notebooks, and much more!

marvin

✨ Build AI interfaces that spark joy

Language: Python | License: Apache-2.0 | Stargazers: 5044 | Issues: 37 | Issues: 204

moondream

Tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4639 | Issues: 54 | Issues: 98

MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Language: Python | License: Apache-2.0 | Stargazers: 4479 | Issues: 52 | Issues: 143

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language: Python | License: MIT | Stargazers: 4393 | Issues: 76 | Issues: 169

marqo

Unified embedding generation and search engine. Also available in the cloud at cloud.marqo.ai

Language: Python | License: Apache-2.0 | Stargazers: 4370 | Issues: 36 | Issues: 237

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language: Python | License: MIT | Stargazers: 3510 | Issues: 175 | Issues: 104

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 2982 | Issues: 47 | Issues: 77

rivet

The open-source visual AI programming environment and TypeScript library

Language: TypeScript | License: MIT | Stargazers: 2598 | Issues: 60 | Issues: 191

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language: Jupyter Notebook | Stargazers: 1507 | Issues: 20 | Issues: 36

deep-chat

Fully customizable AI chatbot component for your website

Language: TypeScript | License: MIT | Stargazers: 1342 | Issues: 29 | Issues: 225

fastRAG

Efficient Retrieval Augmentation and Generation Framework

Language: Python | License: Apache-2.0 | Stargazers: 1162 | Issues: 10 | Issues: 27

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language: Python | License: Apache-2.0 | Stargazers: 1123 | Issues: 40 | Issues: 11

cog-consistent-character

Create images of a given character in different poses

Language: Python | License: MIT | Stargazers: 485 | Issues: 6 | Issues: 12

DragAnything

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

HairFastGAN

Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"

Language: Python | License: MIT | Stargazers: 380 | Issues: 9 | Issues: 17

fashion-clip

FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.

Language: Python | License: MIT | Stargazers: 305 | Issues: 13 | Issues: 31

BakedAvatar

PyTorch code for "BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis"

Language: Python | License: MIT | Stargazers: 287 | Issues: 15 | Issues: 15

StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

Language: Python | License: NOASSERTION | Stargazers: 108 | Issues: 4 | Issues: 18

Language: Python | License: Apache-2.0 | Stargazers: 79 | Issues: 4 | Issues: 11

GCL

Generalised Contrastive Learning. This repository hosts the Google Shopping dataset and benchmarks, followed by our novel fine-grained contrastive learning framework.

Language: Python | License: Apache-2.0 | Stargazers: 22 | Issues: 5 | Issues: 0