boringtaskai

Boring Task AI's repositories

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonMIT000

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language:Jupyter NotebookMIT000

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookNOASSERTION000

big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Language:Jupyter NotebookApache-2.0000

common-voice

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

Language:TypeScriptMPL-2.0000

common-voice-l10n

l10n for project common-voice, since pontoon sync is too long

Apache-2.0020

Compose_and_Embellish

Official PyTorch implementation of ICASSP 2023 paper "Compose & Embellish: Well-Structured Piano Performance Generation via A Two-Stage Approach"

MIT000

CorporaCreator

Command line tool to create corpora for Common Voice

Language:PythonMPL-2.0000

dataspeech

Language:PythonMIT000

ddsp-piano

MIDI Piano synthesizer using DDSP.

Apache-2.0000

eShop

A reference .NET application implementing an eCommerce site

MIT000

fullcontrol

Python version of FullControl for toolpath design (and more) - the readme below is best source of information

GPL-3.0000

grok-1

Grok open release

Language:PythonApache-2.0000

Image-Captioning-using-llava-and-llama3

lmage Caption Generator using llava and llama3 through the ollama library

000

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonNOASSERTION000

nendo-platform

Nendo is an open source platform for AI-driven audio management, intelligence, and generation.

Language:MakefileNOASSERTION000

nendo-server

The Nendo API Server.

Language:PythonNOASSERTION000

nendo-web

The Nendo Web Frontend.

Language:VueNOASSERTION000

nendo_plugin_stemify_demucs

Nendo Plugin for Music Source Separation.

Language:PythonMIT000

nerfies.github.io

Language:JavaScript000

openscad-playground

OpenSCAD Web Playground

Language:TypeScriptNOASSERTION000

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonApache-2.0000

pontoon

Mozilla's Localization Platform

Language:PythonBSD-3-Clause000

pontoon-intro

Introduction to Pontoon

BSD-3-Clause000

Restore

000

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

NOASSERTION000

stable-audio-tools

Generative models for conditional audio generation

Language:PythonMIT000

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookNOASSERTION000

whatsapp-api

This project is a REST API wrapper for the whatsapp-web.js library, providing an easy-to-use interface to interact with the WhatsApp Web platform.

Language:JavaScriptNOASSERTION000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT000