songfang's repositories
ai-collection
The Generative AI Landscape - A Collection of Awesome Generative AI Applications
AutoGroq
AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically generating tailored teams of AI agents based on your project requirements, AutoGroq eliminates the need for manual configuration and allows you to tackle any question, problem, or project with ease and efficiency.
Av1an
Cross-platform command-line AV1 / VP9 / HEVC / H264 encoding framework with per scene quality encoding
awesome-digital-human
A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.
awesome-generative-ai
A curated list of modern Generative Artificial Intelligence projects and services
azure-docs
Open source documentation of Microsoft Azure
blog-auto-publishing-tools
博客自动发布工具,一键把你的博客发到CSDN,掘金,知乎,头条,51blog,腾讯云,公众号等等,支持GPT重写!
ChatTTS
ChatTTS is a generative speech model for daily dialogue.
dub
Open-source link management infrastructure.
Edubot
基于Linly-Talker数字人改版的教育系统,包含网课总结、数字人对话、Chatbot对话,项目可在autodl部署
faster-whisper
Faster Whisper transcription with CTranslate2
GaussianTalker
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
generative-ai
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
generative-ai-docs
Documentation for Google's Gen AI site - including the Gemini API and Gemma
GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Omost
Your image is almost there!
pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
quivr
Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.
RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
RealtimeSTT_LLM_TTS
实时STT,连接智谱AI(流式LLM)和GPT-SOVITS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果
stream-wav2lip
优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs
ToonCrafter
a research paper for generative cartoon interpolation
V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
vt-transformer
Transformer framework for edge computing based on C++.