Raymond's starred repositories

UniPortrait

UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalizations

License:Apache-2.0Stargazers:85Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8211Issues:0Issues:0

TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language:Jupyter NotebookLicense:MPL-2.0Stargazers:9134Issues:0Issues:0

mimic1

Mycroft's TTS engine, based on CMU's Flite (Festival Lite)

Language:CLicense:NOASSERTIONStargazers:810Issues:0Issues:0

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonStargazers:842Issues:0Issues:0

athina-evals

Python SDK for running evaluations on LLM generated responses

Language:PythonStargazers:181Issues:0Issues:0

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Language:PythonLicense:AGPL-3.0Stargazers:22725Issues:0Issues:0

llamacoder

Open source Claude Artifacts – built with Llama 3.1 405B

Language:TypeScriptStargazers:1705Issues:0Issues:0

ai-agents-masterclass

Follow along with my AI Agents Masterclass videos! All of the code I create and use in this series on YouTube will be here for you to use and even build on top of!

Language:PythonLicense:MITStargazers:39Issues:0Issues:0

Autogen_GraphRAG_Ollama

Microsoft's GraphRAG + AutoGen + Ollama + Chainlit = Fully Local & Free Multi-Agent RAG Superbot

Language:PythonStargazers:274Issues:0Issues:0

llm-graph-builder

Neo4j graph construction from unstructured data using LLMs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1844Issues:0Issues:0

black

The uncompromising Python code formatter

Language:PythonLicense:MITStargazers:38175Issues:0Issues:0
Language:TypeScriptStargazers:55Issues:0Issues:0

stackwise

The open source AI app collection

Language:TypeScriptLicense:MITStargazers:158Issues:0Issues:0

fal

⚡ Fastest way to serve open source ML models to millions

Language:PythonLicense:Apache-2.0Stargazers:454Issues:0Issues:0

CogVideo

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language:PythonLicense:Apache-2.0Stargazers:5574Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24633Issues:0Issues:0

airllm

AirLLM 70B inference with single 4GB GPU

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3820Issues:0Issues:0
Language:TypeScriptLicense:MITStargazers:114Issues:0Issues:0

scikit-lego

Extra blocks for scikit-learn pipelines.

Language:PythonLicense:MITStargazers:1230Issues:0Issues:0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:11237Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:3588Issues:0Issues:0

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:2347Issues:0Issues:0

whisper-flamingo

[Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:55Issues:0Issues:0

vibe

Transcribe on your own!

Language:TypeScriptLicense:MITStargazers:632Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:66026Issues:0Issues:0

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4311Issues:0Issues:0

posthog

🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.

Language:PythonLicense:NOASSERTIONStargazers:20124Issues:0Issues:0
Language:PythonLicense:MITStargazers:1563Issues:0Issues:0

mactop

mactop - Apple Silicon Monitor Top written in pure Golang! Under 1,000 lines of code.

Language:GoLicense:MITStargazers:1243Issues:0Issues:0