alter-sachin's starred repositories

openai-node

The official Node.js / Typescript library for the OpenAI API

Language:TypeScriptLicense:Apache-2.0Stargazers:7507Issues:0Issues:0

deepgram-ai-agent-demo

Deepgram Conversational AI demo

Language:TypeScriptLicense:MITStargazers:309Issues:0Issues:0

Fooocus

Focus on prompting and generating

Language:PythonLicense:GPL-3.0Stargazers:39375Issues:0Issues:0

noisereduce

Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Language:Jupyter NotebookLicense:MITStargazers:1382Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:3670Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21244Issues:0Issues:0

dreamoving-project

Official implementation of DreaMoving

License:Apache-2.0Stargazers:1789Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:10881Issues:0Issues:0

offline_sst

repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commons CC0. https://creativecommons.org/share-your-work/public-domain/cc0/

Language:PythonLicense:CC0-1.0Stargazers:12Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4378Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:7496Issues:0Issues:0

julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Language:CLicense:BSD-3-ClauseStargazers:1818Issues:0Issues:0

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10302Issues:0Issues:0

awesome-ai-agents

A list of AI autonomous agents

License:NOASSERTIONStargazers:9207Issues:0Issues:0

awesome-text-to-video

A Survey on Text-to-Video Generation/Synthesis.

License:Apache-2.0Stargazers:536Issues:0Issues:0

DINet_optimized

An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing work.

Language:PythonStargazers:95Issues:0Issues:0

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

Language:PythonStargazers:926Issues:0Issues:0
Language:JavaScriptLicense:MITStargazers:27Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:11448Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:110Issues:0Issues:0

Personal-AI-News-Podcast

AI-gen news podcast🎙 customized to personal preferences | Stay updated with the most important and personally interesting🧡 AI news in just a few-minute podcast each day.

Language:PythonStargazers:39Issues:0Issues:0

streamlit-stt-app

Real time web based Speech-to-Text app with Streamlit

Language:PythonLicense:MITStargazers:210Issues:0Issues:0

genai-stack

Langchain + Docker + Neo4j + Ollama

Language:PythonLicense:CC0-1.0Stargazers:3650Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2Issues:0Issues:0

openai-cookbook

Examples and guides for using the OpenAI API

Language:MDXLicense:MITStargazers:58120Issues:0Issues:0

awesome-langchain

😎 Awesome list of tools and projects with the awesome LangChain framework

License:CC0-1.0Stargazers:7315Issues:0Issues:0

lm-hackers

Hackers' Guide to Language Models

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1759Issues:0Issues:0

roop

one-click face swap

Language:PythonLicense:GPL-3.0Stargazers:26099Issues:0Issues:0

StyleGAN-Human

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Language:PythonStargazers:1130Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:138020Issues:0Issues:0