Luis Armendariz's starred repositories

mlx

MLX: An array framework for Apple silicon

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10038Issues:183Issues:2048

pipx

Install and Run Python Applications in Isolated Environments

Language:PythonLicense:MITStargazers:8810Issues:75Issues:693

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:8720Issues:112Issues:543

taipy

Turns Data and AI algorithms into production-ready web applications in no time.

Language:PythonLicense:Apache-2.0Stargazers:8380Issues:59Issues:510

wavesurfer.js

Audio waveform player

Language:TypeScriptLicense:BSD-3-ClauseStargazers:8149Issues:164Issues:2045

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8060Issues:73Issues:861

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:5009Issues:62Issues:959

mlx-examples

Examples in the MLX framework

Language:PythonLicense:MITStargazers:4909Issues:55Issues:314

podman-compose

a script to run docker-compose.yml using podman

Language:PythonLicense:GPL-2.0Stargazers:4704Issues:44Issues:577

big-AGI

Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.

Language:TypeScriptLicense:MITStargazers:4164Issues:49Issues:391

awesome-conformal-prediction

A professionally curated list of awesome Conformal Prediction videos, tutorials, books, papers, PhD and MSc theses, articles and open-source libraries.

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Language:CLicense:GPL-3.0Stargazers:2867Issues:102Issues:975

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2809Issues:46Issues:75

CTranslate2

Fast inference engine for Transformer models

Language:C++License:MITStargazers:2786Issues:55Issues:609

nbdime

Tools for diffing and merging of Jupyter notebooks.

Language:TypeScriptLicense:NOASSERTIONStargazers:2594Issues:42Issues:339

dateparser

python parser for human readable dates

Language:PythonLicense:BSD-3-ClauseStargazers:2461Issues:134Issues:652

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:1904Issues:30Issues:251

jupyterlab-git

A Git extension for JupyterLab

Language:TypeScriptLicense:BSD-3-ClauseStargazers:1390Issues:39Issues:584

musicinformationretrieval.com

Instructional notebooks on music information retrieval.

Language:Jupyter NotebookLicense:MITStargazers:1193Issues:53Issues:45

Thorsten-Voice

Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.

Language:PythonLicense:CC0-1.0Stargazers:479Issues:18Issues:54
Language:Jupyter NotebookLicense:MITStargazers:431Issues:19Issues:4

edu

Educational materials on deep learning by Weights & Biases

Language:Jupyter NotebookLicense:GPL-2.0Stargazers:429Issues:13Issues:70

ctc-segmentation

Segment an audio file and obtain utterance alignments. (Python package)

Language:PythonLicense:Apache-2.0Stargazers:292Issues:13Issues:26

wandbot

wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:210Issues:8Issues:6

fastfeedforward

A repository for log-time feedforward networks

Language:PythonLicense:MITStargazers:191Issues:6Issues:7

pyannote-metrics

A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems

Language:PythonLicense:MITStargazers:176Issues:10Issues:48

VocalForge

Your one-stop solution for voice dataset creation

Language:PythonLicense:MITStargazers:92Issues:7Issues:11

ozen-toolkit

Audio datasets, easier.