Kenneth Estanislao's repositories

Deep-Live-Cam

Real-time face swap and one-click video deepfake with only a single image

Language: Python | License: AGPL-3.0 | Stargazers: 75447 | Issues: 459 | Issues: 1053

suna

Suna - Open Source Generalist AI Agent

Language: TypeScript | License: Apache-2.0 | Stargazers: 14 | Issues: 0 | Issues: 0

LLPlayer

The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!

Language: C# | License: GPL-3.0 | Stargazers: 13 | Issues: 0 | Issues: 0

dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Language: Python | License: Apache-2.0 | Stargazers: 10 | Issues: 0 | Issues: 0

fantasy-talking

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Language: Python | Stargazers: 10 | Issues: 0 | Issues: 0

Deep-Translate-Engine

A live subtitle engine with translation

Language: Python | Stargazers: 8 | Issues: 0 | Issues: 0

Wan2GP

Wan 2.1 for the GPU Poor

Language: Python | License: NOASSERTION | Stargazers: 6 | Issues: 0 | Issues: 0

chatterbox

SoTA open-source TTS

License: MIT | Stargazers: 5 | Issues: 0 | Issues: 0

KrillinAI

A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube, TikTok, and Shorts, as well as Douyin, Xiaohongshu, Bilibili, and WeChat Channels.

License: GPL-3.0 | Stargazers: 5 | Issues: 0 | Issues: 0

SurfSense

Open Source Alternative to NotebookLM / Perplexity / Glean, connected to external sources such as search engines (Tavily, Linkup), Slack, Linear, Notion, YouTube, GitHub and more.

Language: TypeScript | License: Apache-2.0 | Stargazers: 5 | Issues: 0 | Issues: 0

Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

License: Apache-2.0 | Stargazers: 4 | Issues: 0 | Issues: 0

MAGI-1

MAGI-1: Autoregressive Video Generation at Scale

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 0 | Issues: 0

deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Language: TypeScript | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0
License: NOASSERTION | Stargazers: 2 | Issues: 0 | Issues: 0
Language: HTML | Stargazers: 2 | Issues: 0 | Issues: 0

ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

Bagel

Open-source unified multimodal model

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

Real-Time-Latent-Consistency-Model

App showcasing multiple real-time diffusion models pipelines with Diffusers

License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

ToonComposer

Streamlining Cartoon Production with Generative Post-Keyframing

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

trae-agent

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

Voost

[Official] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

whispering-ui

Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)

License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

comfyui-vrgamedevgirl

Custom ComfyUI nodes for film grain, color matching, and video enhancement.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

delayed-streams-modeling

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

fantasy-portrait

FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

kilocode

Open Source AI coding assistant for planning, building, and fixing code. We're a superset of Roo, Cline, and our own features. Follow us: kilocode.ai/social

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

XVerse

Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0