ludwig

followers

following

stars

Luis Armendariz's starred repositories

mlx

MLX: An array framework for Apple silicon

Language:C++MIT14074 127 368

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.010038 183 2048

pipx

Install and Run Python Applications in Isolated Environments

Language:PythonMIT8810 75 693

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonMIT8720 112 543

taipy

Turns Data and AI algorithms into production-ready web applications in no time.

Language:PythonApache-2.08380 59 510

wavesurfer.js

Audio waveform player

Language:TypeScriptBSD-3-Clause8149 164 2045

trl

Train transformer language models with reinforcement learning.

Language:PythonApache-2.08060 73 861

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT5009 62 959

mlx-examples

Examples in the MLX framework

Language:PythonMIT4909 55 314

podman-compose

a script to run docker-compose.yml using podman

Language:PythonGPL-2.04704 44 577

big-AGI

Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.

Language:TypeScriptMIT4164 49 391

awesome-conformal-prediction

A professionally curated list of awesome Conformal Prediction videos, tutorials, books, papers, PhD and MSc theses, articles and open-source libraries.

CC0-1.03379 77 10

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Language:CGPL-3.02867 102 975

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookNOASSERTION2809 46 75

CTranslate2

Fast inference engine for Transformer models

Language:C++MIT2786 55 609

nbdime

Tools for diffing and merging of Jupyter notebooks.

Language:TypeScriptNOASSERTION2594 42 339

dateparser

python parser for human readable dates

Language:PythonBSD-3-Clause2461 134 652

DeepFilterNet

Noise supression using deep filtering

Language:PythonNOASSERTION1904 30 251

jupyterlab-git

A Git extension for JupyterLab

Language:TypeScriptBSD-3-Clause1390 39 584

musicinformationretrieval.com

Instructional notebooks on music information retrieval.

Language:Jupyter NotebookMIT1193 53 45

Thorsten-Voice

Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.

Language:PythonCC0-1.0479 18 54

life2vec

Language:Jupyter NotebookMIT431 19 4

edu

Educational materials on deep learning by Weights & Biases

Language:Jupyter NotebookGPL-2.0429 13 70

ctc-segmentation

Segment an audio file and obtain utterance alignments. (Python package)

Language:PythonApache-2.0292 13 26

wandbot

wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk

Language:Jupyter NotebookApache-2.0210 8 6

fastfeedforward

A repository for log-time feedforward networks

Language:PythonMIT191 6 7

pyannote-metrics

A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems

Language:PythonMIT176 10 48

VocalForge

Your one-stop solution for voice dataset creation

Language:PythonMIT92 7 11

ozen-toolkit

Audio datasets, easier.

Language:Python79 4 17

VoiceDatasetCreation

Language:Python13 10