Kedreamix's repositories

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel method of human-AI interaction. 🤝🤖 It integrates technologies such as Whisper, Linly, Microsoft Speech Services, and the SadTalker talking-head generation system. 🌟🔬

Language: Python | License: MIT | Stargazers: 680 | Issues: 15 | Issues: 36

Awesome-Talking-Head-Synthesis

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

Pytorch-Image-Classification

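Image classification with PyTorch, covering a range of models such as AlexNet, VGG, GoogLeNet, ResNet, and DenseNet, with complete runnable code. Colab notebooks are also provided, so the code can be run online in Colab and the results viewed directly. The models can also be applied to your own dataset for transfer learning.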

Language: Jupyter Notebook | Stargazers: 162 | Issues: 2 | Issues: 3

PaddleAvatar

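Have you ever imagined interacting with a virtual version of yourself? With PaddleAvatar, you can turn your own images, audio, and video into a realistic digital human video and interact with it. PaddleAvatar is a digital human generation tool built on the PaddlePaddle deep learning framework and many of the Paddle toolkits; it synthesizes your images, audio, and video into a realistic digital human video. PaddleAvatar also supports further development, for example using natural language processing to turn the digital human video into a complete human-computer interaction system, so that you can hold real conversations and interact with your virtual self. Digital human videos made with PaddleAvatar can be used in many settings, such as games, education, and virtual reality. PaddleAvatar gives you a digital world for free creation where your imagination can run free!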

Language: Jupyter Notebook | Stargazers: 126 | Issues: 2 | Issues: 7

YoloGesture

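YoLoGesture, a computer-vision-based gesture recognition control system built on YOLO. The system uses YOLO for gesture recognition and is deployed with Streamlit. An online demo is available at https://kedreamix-yologesture.streamlit.app , and a Hugging Face Space at https://huggingface.co/spaces/Kedreamix/YoloGesture . The same approach can also be applied to other datasets to carry out object detection and deployment, so what you learn here transfers directly to other tasks.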

Language: Python | Stargazers: 20 | Issues: 2 | Issues: 0

pytorch-cppcuda-tutorial

Tutorial for writing custom PyTorch C++/CUDA kernels, applied to volume rendering (NeRF)

Language: Python | Stargazers: 16 | Issues: 0 | Issues: 0

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

metahuman-stream

Real-time streaming digital human based on NeRF

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

prompt-engineering-for-developers

An introductory LLM tutorial for developers; the Chinese version of Andrew Ng's large language model course series

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0 | Issues: 0

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

d2l-ai-solutions-manual

Solutions to the exercises in "Dive into Deep Learning" (《动手学深度学习》); it can be read online at the following address:

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

DiffTalk

[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.

Stargazers: 0 | Issues: 0 | Issues: 0

gaussian-head

Official repository for 'GaussianHead: Impressive Head Avatars with Learnable Gaussian Diffusion'

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

gaussian_splatting_notes

A detailed explanation of the formulae behind Gaussian splatting

Stargazers: 0 | Issues: 0 | Issues: 0
License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

GPT-SoVITS

Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

HDTF

The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

huawei-od-python

Solutions to Huawei OD algorithm problems. Goal: add C++ solutions and learn the related material.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

OpenVoice

Instant voice cloning by MyShell.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

R2-Talker-code

R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

RealOrGPT

RealOrGPT

Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0
Language: HTML | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0