songfang's repositories

ai-collection

The Generative AI Landscape - A Collection of Awesome Generative AI Applications

License: MIT | Stargazers: 0 | Issues: 0

AutoGroq

AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically generating tailored teams of AI agents based on your project requirements, AutoGroq eliminates the need for manual configuration and allows you to tackle any question, problem, or project with ease and efficiency.

Stargazers: 0 | Issues: 0

awesome-ai-tools

A curated list of Artificial Intelligence Top Tools

Stargazers: 0 | Issues: 0

awesome-digital-human

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

License: MIT | Stargazers: 0 | Issues: 0

azure-docs

Open source documentation of Microsoft Azure

License: CC-BY-4.0 | Stargazers: 0 | Issues: 0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

License: NOASSERTION | Stargazers: 0 | Issues: 0

Edubot

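An education system adapted from the Linly-Talker digital human project, featuring online course summarization, digital human conversation, and chatbot conversation. The project can be deployed on AutoDL.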

License: MIT | Stargazers: 0 | Issues: 0

faster-whisper

Faster Whisper transcription with CTranslate2

License: MIT | Stargazers: 0 | Issues: 0

GaussianTalker

Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim

License: NOASSERTION | Stargazers: 0 | Issues: 0

generative-ai

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

License: Apache-2.0 | Stargazers: 0 | Issues: 0

generative-ai-docs

Documentation for Google's Gen AI site - including the Gemini API and Gemma

License: Apache-2.0 | Stargazers: 0 | Issues: 0

haystack

LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

lab

Showcase and examples lab for TresJS

Stargazers: 0 | Issues: 0

liteflow

Lightweight, fast, stable, and programmable component-based rule engine/process engine. Supports component reuse, synchronous/asynchronous orchestration, dynamic orchestration, multi-language scripting, complex nested rules, hot deployment, and smooth refreshing, to help improve development efficiency.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

License: NOASSERTION | Stargazers: 0 | Issues: 0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Omost

Your image is almost there!

License: Apache-2.0 | Stargazers: 0 | Issues: 0

quivr

Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users! A local and private alternative to OpenAI GPTs & ChatGPT, powered by retrieval-augmented generation.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

License: MIT | Stargazers: 0 | Issues: 0

RealtimeSTT_LLM_TTS

Real-time STT connected to Zhipu AI (streaming LLM) and GPT-SOVITS, using a web page to call the services across networks and achieve real-time conversation.

License: MIT | Stargazers: 0 | Issues: 0

robotframework

Generic automation framework for acceptance testing and RPA

License: Apache-2.0 | Stargazers: 0 | Issues: 0

social-auto-upload

Automates uploading videos to social media: Douyin, Xiaohongshu, WeChat Channels, TikTok, YouTube, and Bilibili.

Stargazers: 0 | Issues: 0

stream-wav2lip

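Optimizes the wav2lip execution pipeline by separating it into three steps (head/face separation, mouth-shape replacement, and background restoration), adds GFPGAN face enhancement, and implements advance frame extraction and streaming loop processing, with OBS integration.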

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ToonCrafter

A research paper for generative cartoon interpolation

License: Apache-2.0 | Stargazers: 0 | Issues: 0

tts-now

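A cross-platform text-to-speech assistant based on cloud speech synthesis APIs (Alibaba Cloud, iFlytek, etc.). Supports quick single-text synthesis and batch synthesis. Runs on Windows, macOS, and Linux.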

License: Apache-2.0 | Stargazers: 0 | Issues: 0

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.

Stargazers: 0 | Issues: 0

vt-transformer

Transformer framework for edge computing based on C++.

License: MIT | Stargazers: 0 | Issues: 0