skic's starred repositories
Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
CVPR-2023-24-Papers
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!
lobe-chat
🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.
SadTalker-Video-Lip-Sync
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
FaceStudio
Put Your Face Everywhere in Seconds.
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
sketch-code
Keras model to generate HTML code from hand-drawn website mockups. Implements an image captioning architecture to drawn source images.
handeye-calib
基于ROS的手眼标定
CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
VINS-Mono-noted
detailed chinese notes for vins-mono
ORB_SLAM2_detailed_comments
Detailed comments for ORB-SLAM2 with trouble-shooting, key formula derivation, and diagrammatic drawing
ORB_SLAM3_detailed_comments
Detailed comments for ORB-SLAM3