songfang's repositories

ai-collection

The Generative AI Landscape - A Collection of Awesome Generative AI Applications

License: MIT | Stargazers: 0 | Issues: 0

AutoGroq

AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically generating tailored teams of AI agents based on your project requirements, AutoGroq eliminates the need for manual configuration and allows you to tackle any question, problem, or project with ease and efficiency.

Stargazers: 0 | Issues: 0

Av1an

Cross-platform command-line AV1 / VP9 / HEVC / H264 encoding framework with per-scene quality encoding

License: GPL-3.0 | Stargazers: 0 | Issues: 0

awesome-digital-human

A collection of resources on digital humans, including clothed-human digitalization, virtual try-on, and other related directions.

License: MIT | Stargazers: 0 | Issues: 0

awesome-generative-ai

A curated list of modern Generative Artificial Intelligence projects and services

License: CC0-1.0 | Stargazers: 0 | Issues: 0

azure-docs

Open source documentation of Microsoft Azure

License: CC-BY-4.0 | Stargazers: 0 | Issues: 0

blog-auto-publishing-tools

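Blog auto-publishing tool: publish your blog posts with one click to CSDN, Juejin, Zhihu, Toutiao, 51blog, Tencent Cloud, WeChat Official Accounts, and more, with support for GPT rewriting.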

License: GPL-2.0 | Stargazers: 0 | Issues: 0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

License: NOASSERTION | Stargazers: 0 | Issues: 0

dub

Open-source link management infrastructure.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

Edubot

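An education system adapted from the Linly-Talker digital human project, including online course summarization, digital human dialogue, and chatbot dialogue. The project can be deployed on AutoDL.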

License: MIT | Stargazers: 0 | Issues: 0

faster-whisper

Faster Whisper transcription with CTranslate2

License: MIT | Stargazers: 0 | Issues: 0

GaussianTalker

Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim

License: NOASSERTION | Stargazers: 0 | Issues: 0

generative-ai

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

License: Apache-2.0 | Stargazers: 0 | Issues: 0

generative-ai-docs

Documentation for Google's Gen AI site - including the Gemini API and Gemma

License: Apache-2.0 | Stargazers: 0 | Issues: 0

GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

haystack

LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

License: NOASSERTION | Stargazers: 0 | Issues: 0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Omost

Your image is almost there!

License: Apache-2.0 | Stargazers: 0 | Issues: 0

pyvideotrans

Translate a video from one language to another and add dubbing.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

quivr

Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) and apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, and Groq, and share it with users! A local and private alternative to OpenAI GPTs and ChatGPT, powered by retrieval-augmented generation.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

License: MIT | Stargazers: 0 | Issues: 0

RealtimeSTT_LLM_TTS

Real-time STT connected to Zhipu AI (streaming LLM) and GPT-SOVITS, with services called across networks through a web page to achieve real-time conversation.

License: MIT | Stargazers: 0 | Issues: 0

stream-wav2lip

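Optimizes the wav2lip pipeline by splitting it into three separate steps (head/face separation, mouth-shape replacement, and background restoration), adds gfpgan face enhancement, and implements advance frame extraction, streaming loop processing, and integration with obs.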

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ToonCrafter

A research paper for generative cartoon interpolation

License: Apache-2.0 | Stargazers: 0 | Issues: 0

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.

Stargazers: 0 | Issues: 0

vt-transformer

Transformer framework for edge computing based on C++.

License: MIT | Stargazers: 0 | Issues: 0