mpottinger's starred repositories

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Language: Svelte | License: MIT | Stargazers: 22706 | Issues: 115 | Issues: 992
Language: Python | License: NOASSERTION | Stargazers: 7913 | Issues: 149 | Issues: 0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 6862 | Issues: 87 | Issues: 97

piper

A fast, local neural text to speech system

Language: C++ | License: MIT | Stargazers: 4393 | Issues: 62 | Issues: 379

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | Stargazers: 4145 | Issues: 78 | Issues: 158

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language: Python | License: MIT | Stargazers: 3746 | Issues: 38 | Issues: 112

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language: Python | License: MIT | Stargazers: 3236 | Issues: 63 | Issues: 88

cookbook

A collection of guides and examples for the Gemini API.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3199 | Issues: 44 | Issues: 49

parler-tts

Inference and training library for high-quality TTS models.

Language: Python | License: Apache-2.0 | Stargazers: 2609 | Issues: 43 | Issues: 35

ollama-python

Ollama Python library

Language: Python | License: MIT | Stargazers: 2175 | Issues: 18 | Issues: 84

elevenlabs-python

The official Python API for ElevenLabs Text to Speech.

Language: Python | License: MIT | Stargazers: 1882 | Issues: 31 | Issues: 204

FEX

A fast usermode x86 and x86-64 emulator for Arm64 Linux

Language: C++ | License: MIT | Stargazers: 1853 | Issues: 34 | Issues: 699

enchanted

Enchanted is an iOS and macOS app for chatting with private, self-hosted language models such as Llama2, Mistral, or Vicuna using Ollama.

Language: Swift | License: Apache-2.0 | Stargazers: 1765 | Issues: 20 | Issues: 78

OpenSwiftUI

WIP — OpenSwiftUI is an OpenSource implementation of Apple's SwiftUI DSL.

Language: Swift | License: MIT | Stargazers: 1371 | Issues: 42 | Issues: 4

claude-to-chatgpt

This project converts the API of Anthropic's Claude model to the OpenAI Chat API format.

Language: Python | License: MIT | Stargazers: 1203 | Issues: 19 | Issues: 25

Suno-API

I provide a Suno API; no deployment and no Suno subscription are required. It's convenient to use. 👇

Language: Python | License: MIT | Stargazers: 1158 | Issues: 4 | Issues: 20

schedule_free

Schedule-Free Optimization in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1111 | Issues: 13 | Issues: 12

MonoGS

[CVPR'24 Highlight] Gaussian Splatting SLAM

Language: Python | License: NOASSERTION | Stargazers: 942 | Issues: 12 | Issues: 87

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift

Language: C++ | License: Apache-2.0 | Stargazers: 936 | Issues: 29 | Issues: 270

suno-api

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

Language: TypeScript | License: LGPL-3.0 | Stargazers: 632 | Issues: 19 | Issues: 56

anthropic-sdk-typescript

Access to Anthropic's safety-first language model APIs

Language: TypeScript | License: MIT | Stargazers: 413 | Issues: 64 | Issues: 74

3DGS.cpp

A cross-platform, high performance renderer for Gaussian Splatting using Vulkan Compute. Supports ✅ Windows, Linux, macOS, iOS, and visionOS

Language: C++ | License: LGPL-2.1 | Stargazers: 339 | Issues: 11 | Issues: 11

hlb-gpt

Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to larger models with one parameter change (feature currently in alpha).

Language: Python | License: Apache-2.0 | Stargazers: 253 | Issues: 9 | Issues: 5

macosvm

Tool for running macOS guest virtual machines on a macOS 12 or higher host on M1 arm64 Macs

Language: Objective-C | License: NOASSERTION | Stargazers: 150 | Issues: 10 | Issues: 16
Language: Cuda | License: NOASSERTION | Stargazers: 92 | Issues: 0 | Issues: 0

StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

Language: Python | License: NOASSERTION | Stargazers: 86 | Issues: 4 | Issues: 16

chaquopy-console

Chaquopy console template

Language: Java | License: MIT | Stargazers: 50 | Issues: 5 | Issues: 6

SwiftAnthropic

An open-source Swift package for interacting with Anthropic's public API.

PcmDataPlayer

Playing raw audio data with AVAudioPlayer

Language: Objective-C | Stargazers: 23 | Issues: 3 | Issues: 1