Kenneth Estanislao's repositories

Deep-Live-Cam

Real-time face swap and one-click video deepfake with only a single image

Language: Python | License: AGPL-3.0 | Stargazers: 44698 | Issues: 276 | Issues: 665

roop-cam

Real-time face swap and one-click video deepfake with only a single image (Uncensored)

Language: Python | License: AGPL-3.0 | Stargazers: 462 | Issues: 9 | Issues: 0

ShortsGenerator

Automate the creation of Shorts content locally with a couple of simple steps.

Language: Python | License: MIT | Stargazers: 30 | Issues: 1 | Issues: 0

Webcam_Live_Portrait

Bring portraits to life via webcam!

Language: Python | License: MIT | Stargazers: 22 | Issues: 0 | Issues: 0

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language: TypeScript | License: MIT | Stargazers: 17 | Issues: 0 | Issues: 0

real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

Language: Tcl | License: GPL-2.0 | Stargazers: 11 | Issues: 0 | Issues: 0

hallo-for-windows

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | License: MIT | Stargazers: 9 | Issues: 0 | Issues: 0

aidialer

A full-stack app for interruptible, low-latency, near-human-quality AI phone calls, built by stitching together LLMs, speech understanding tools, text-to-speech models, and Twilio’s phone API.

Language: Python | License: MIT | Stargazers: 8 | Issues: 0 | Issues: 0

call-gpt

Generative AI phone call toolkit using Twilio Media Streams.

Language: JavaScript | License: MIT | Stargazers: 7 | Issues: 0 | Issues: 0

Thin-plate-spline-motion-model-ONNX-Faceswap

Thin Plate Spline Motion Model - ONNX. Extended version for FaceSwap - HeadSwap - PartSwap

Language: Python | Stargazers: 6 | Issues: 0 | Issues: 0

maxun

Free, open-source no-code web data extraction platform. Build custom robots to automate data scraping [In Beta]

Language: TypeScript | Stargazers: 5 | Issues: 0 | Issues: 0

Integuru

The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.

Language: Python | License: AGPL-3.0 | Stargazers: 4 | Issues: 0 | Issues: 0

kestra

Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

License: Apache-2.0 | Stargazers: 4 | Issues: 0 | Issues: 0

Short-Video-Creator

Automatic | AI-generated captions - No API Key | Background Video | YouTube Shorts | TikTok

Language: Python | License: NOASSERTION | Stargazers: 4 | Issues: 0 | Issues: 0

ColorFlow

The official implementation of the paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"

Language: Python | License: NOASSERTION | Stargazers: 3 | Issues: 0 | Issues: 0

face-censor

Detect and blur faces in any input images or videos with AI.

License: GPL-3.0 | Stargazers: 3 | Issues: 0 | Issues: 0

SVFR

Official implementation of SVFR.

Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0

AniPortrait-for-windows

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

Live_Portrait_Monitor

Bring portraits to life via Monitor!

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

s3_upload_shell

Upload all files to S3 every day and delete the files in the folder every 10 days.

Language: Shell | Stargazers: 2 | Issues: 1 | Issues: 0

ai-data-analysis-MulitAgent

AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, data analysis, visualization, and report writing. Perfect for researchers and data scientists seeking to enhance their workflow and productivity.

License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

echomimic_v2

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

LLPlayer

The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!

Language: C# | License: GPL-3.0 | Stargazers: 1 | Issues: 0 | Issues: 0

paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

Language: Python | License: GPL-3.0 | Stargazers: 1 | Issues: 0 | Issues: 0

Zonos

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers.

License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

ComfyUI-WanVideoWrapper-blackwell

For all 50xx-series GPUs

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

FacePoke

Select a portrait, click to move the head around (please use your own space / GPU!)

Language: JavaScript | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

mimic_head

Unofficial One-click Version of LivePortrait, with Webcam Support

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0