Jonathan Fly (JonathanFly)

JonathanFly

Geek Repo

Company:iforcedabot.com

Home Page:https://twitter.com/jonathanfly

Github PK Tool:Github PK Tool

Jonathan Fly's starred repositories

foam

A personal knowledge management and sharing system for VSCode

Language:TypeScriptLicense:NOASSERTIONStargazers:15110Issues:121Issues:689

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonLicense:MITStargazers:7675Issues:491Issues:123

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:6872Issues:59Issues:274

marimo

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

Language:PythonLicense:Apache-2.0Stargazers:5605Issues:29Issues:479

tonal

A functional music theory library for Javascript

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:2929Issues:47Issues:61

MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:2301Issues:33Issues:46

AnyGPT

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

neutone_sdk

Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp

Language:PythonLicense:LGPL-2.1Stargazers:453Issues:18Issues:22

gazelle

Joint speech-language model - respond directly to audio!

Language:PythonLicense:Apache-2.0Stargazers:296Issues:12Issues:1

rho

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

llm_steer

Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors

Language:PythonLicense:MITStargazers:188Issues:6Issues:2

toon3d

Code for Toon3D https://toon3d.studio/

agc

Audiogen Codec

Language:PythonLicense:MITStargazers:106Issues:3Issues:1

Duolando

Code for ICLR 2024 paper "Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment"

myopic_defocus

Myopic Defocus Browser Extension.

Language:JavaScriptLicense:GPL-3.0Stargazers:79Issues:4Issues:3

hallo-webui

Webui for Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

FlashSpeech

FlashSpeech: Efficient Zero-Shot Speech Synthesis

NAST

Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11037

Language:PythonLicense:MITStargazers:38Issues:0Issues:0

obsidian-ai-note-suggestion

An plugin for Obsidian.md for effortlessly get note suggestions base on semantic meaning as you type, eliminating the need for complex tagging. Simplifying note-taking

Language:TypeScriptLicense:MITStargazers:32Issues:1Issues:7

audiostretchy

AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 files without changing their pitch. Works well for speech, can time-stretch silence separately.

Language:PythonLicense:BSD-3-ClauseStargazers:30Issues:2Issues:9

FlowNodes

Flow control nodes for comfyUI, allowing for more diverse workflows

Language:PythonLicense:MITStargazers:7Issues:1Issues:0

kimchi-grammar

Grammar definitions for Kimchi Reader

License:CC-BY-4.0Stargazers:5Issues:1Issues:0

DawnniExpanded

A mod for Dawnsbury Days by ComradeDanni

Language:C#License:MITStargazers:3Issues:1Issues:0

suno-clone

A clone of the Suno AI website UI using NextJS and Tailwind

Language:JavaScriptStargazers:1Issues:1Issues:0