songfang's repositories
PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
flowblade
Video Editor for Linux
social-auto-upload
Automated video uploads to social media: Douyin, Xiaohongshu, WeChat Channels, TikTok, YouTube, Bilibili
robotframework
Generic automation framework for acceptance testing and RPA
awesome-ai-in-finance
🔬 A curated list of awesome LLMs, deep learning strategies, and tools for financial markets.
RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
lab
Showcase and examples lab for TresJS
BlenderGIS
Blender addons to make the bridge between Blender and geographic data
Omost
Your image is almost there!
liteflow
Lightweight, fast, stable, and programmable component-based rule engine/process engine. Component reuse, synchronous/asynchronous orchestration, dynamic orchestration, multi-language scripting support, complex nested rules, hot deployment, smooth refreshing. Helps you improve your development efficiency!
autogen-ui
Web UI for AutoGen (a framework for multi-agent LLM applications)
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
RealtimeSTT_LLM_TTS
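Real-time STT connected to Zhipu AI (streaming LLM) and GPT-SOVITS, with cross-network service calls made through a web page to achieve real-time conversation.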
ToonCrafter
A research paper for generative cartoon interpolation
awesome-digital-human
A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.
V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
vt-transformer
Transformer framework for edge computing based on C++.
ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Edubot
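An education system adapted from the Linly-Talker digital human, including online course summarization, digital human dialogue, and chatbot dialogue. The project can be deployed on autodl.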
azure-docs
Open source documentation of Microsoft Azure
AutoGroq
AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically generating tailored teams of AI agents based on your project requirements, AutoGroq eliminates the need for manual configuration and allows you to tackle any question, problem, or project with ease and efficiency.
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
generative-ai
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
GaussianTalker
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
haystack
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
ai-collection
The Generative AI Landscape - A Collection of Awesome Generative AI Applications
quivr
Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.
generative-ai-docs
Documentation for Google's Gen AI site - including the Gemini API and Gemma