MahdeenSky

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT20615 203 372

StableLM

StableLM: Stability AI Language Models

Language:Jupyter NotebookApache-2.015841 200 76

Whisky

A modern Wine wrapper for macOS built with SwiftUI

Language:SwiftGPL-3.011964 50 749

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonBSD-2-Clause11438 133 684

insanely-fast-whisper

Language:Jupyter NotebookApache-2.07350 63 187

video-subtitle-extractor

视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Language:PythonApache-2.05703 45 264

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language:Jupyter NotebookApache-2.04354 43 178

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonMIT3986 50 227

semantra

Multi-tool for semantic search

Language:PythonMIT2485 34 60

aiXcoder-7B

official repository of aiXcoder-7B Code Large Language Model

Language:PythonApache-2.02183 21 30

DragGAN

Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold

Language:PythonMIT2161 49 10

text-generation-webui-colab

A colab gradio web UI for running Large Language Models

Language:Jupyter NotebookUnlicense2070 32 35

astrofox

Astrofox is a motion graphics program that lets you turn audio into amazing videos.

Language:JavaScriptMIT1733 27 66

audio-webui

A webui for different audio related Neural Networks

Language:PythonMIT1022 21 192

voltaML-fast-stable-diffusion

Beautiful and Easy to use Stable Diffusion WebUI

Language:PythonGPL-3.0971 24 78

gh-copilot

Ask for assistance right in your terminal.

697 10 78

chat-analytics

Generate interactive, beautiful and insightful chat analysis reports

Language:TypeScriptAGPL-3.0645 8 69

lora-svc

singing voice change based on whisper, and lora for singing voice clone

Language:PythonMIT617 24 69

uberlayer

Mac app: With the Uberlayer app you can put floating images on top of your computer screen.

Language:Objective-C29 1 5

YorkURMP

Rate My Professors extension for the YorkU course portal and VSB

Language:JavaScriptGPL-3.012 2 10

SoftVC-VITS-MusicSingerChanger

Google collab for testing SoftVC VITS Singing Voice Conversion for AI capable of changing the singer within music files.

Language:Jupyter Notebook11 2 1

PixAI-Wrapper

Python scripts that log into PixAI's website automatically using an account you provide. Then, it uses realtime generation to generate images for you.

Language:PythonGPL-3.02 2 1

PixAI-Wrapper

Python scripts that log into PixAI's website automatically using an account you provide. Then, it uses realtime generation to generate images for you.

Language:PythonGPL-3.0100