skic's starred repositories

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel method of human-AI interaction. 🤝🤖 It integrates technologies such as Whisper, Linly, Microsoft Speech Services, and the SadTalker talking-head generation system. 🌟🔬

Language: Python, License: MIT, Stargazers: 1025, Issues: 0

CVPR-2023-24-Papers

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

Language: Python, License: MIT, Stargazers: 316, Issues: 0

lobe-chat

🤯 Lobe Chat - an open-source, modern-design LLM/AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity), multimodal features (Vision/TTS), and a plugin system. One-click FREE deployment of your private ChatGPT chat application.

Language: TypeScript, License: NOASSERTION, Stargazers: 32912, Issues: 0

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language: Python, License: NOASSERTION, Stargazers: 2010, Issues: 0

SadTalker-Video-Lip-Sync

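This project builds on SadTalker to implement Wav2Lip-style lip synthesis for video. It takes a video file as input and generates speech-driven lip shapes, applying a configurable enhancement to the face region to sharpen the synthesized lip (face) area. It then uses DAIN, a deep-learning frame interpolation algorithm, to add frames to the generated video, filling in the motion transitions between frames so that the synthesized lips look smoother, more realistic, and more natural.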

Language: Python, Stargazers: 1665, Issues: 0

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

License: MIT, Stargazers: 1, Issues: 0

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

Language: Go, License: MIT, Stargazers: 74362, Issues: 0

Rope

GUI-focused roop

Language: Python, License: GPL-3.0, Stargazers: 3857, Issues: 0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python, License: MIT, Stargazers: 27617, Issues: 0

QAnything

Question and Answer based on Anything.

Language: Python, License: Apache-2.0, Stargazers: 10307, Issues: 0

ChatRTX

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Language: Python, License: NOASSERTION, Stargazers: 2495, Issues: 0

SCTNet

Official implementation of SCTNet (AAAI2024)

Language: Python, License: MIT, Stargazers: 142, Issues: 0

point2cad

Code for "Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds"

Language: Python, License: Apache-2.0, Stargazers: 196, Issues: 0

FaceStudio

Put Your Face Everywhere in Seconds.

License: Apache-2.0, Stargazers: 307, Issues: 0

edgeyolo

An edge-real-time, anchor-free object detector with decent performance

Language: Python, License: Apache-2.0, Stargazers: 411, Issues: 0

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python, License: Apache-2.0, Stargazers: 1561, Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 45101, Issues: 0

sketch-code

Keras model to generate HTML code from hand-drawn website mockups. Applies an image captioning architecture to hand-drawn source images.

Language: Python, Stargazers: 5082, Issues: 0

Language: C++, License: Apache-2.0, Stargazers: 42, Issues: 0

handeye-calib

ROS-based hand-eye calibration

Language: Python, License: GPL-3.0, Stargazers: 75, Issues: 0

VIO-Doc

Derivations and code walkthroughs for mainstream VIO papers

Stargazers: 914, Issues: 0

hof

Histogram of Optical Flow by OpenCV

Language: C++, Stargazers: 2, Issues: 0

CVPR2024-Papers-with-Code

A collection of CVPR 2024 papers and open-source projects

Stargazers: 16804, Issues: 0

VINS-Mono

A Robust and Versatile Monocular Visual-Inertial State Estimator

Language: C++, License: GPL-3.0, Stargazers: 4808, Issues: 0

VINS-Mono-noted

Detailed Chinese notes for VINS-Mono

Language: C++, License: GPL-3.0, Stargazers: 253, Issues: 0

ORB_SLAM2_detailed_comments

Detailed comments for ORB-SLAM2, with troubleshooting notes, key formula derivations, and diagrams

Language: C++, License: GPL-3.0, Stargazers: 1537, Issues: 0

ORB_SLAM3_detailed_comments

Detailed comments for ORB-SLAM3

Language: C++, License: GPL-3.0, Stargazers: 1201, Issues: 0

tandem

[CoRL '21] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Language: C++, Stargazers: 908, Issues: 0

MonoRec

Official implementation of the paper: MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera (CVPR 2021)

Language: Python, License: MIT, Stargazers: 575, Issues: 0

CPP

Lecture notes, projects, and other materials for the course 'CS205 C/C++ Program Design' at Southern University of Science and Technology.

Language: C++, Stargazers: 1912, Issues: 0