robert zou's repositories
DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
DataGrip
DataGrip 是一款,可以导出Mysql、Postgres 、Oracle 建表语句,视图,索引以及序列等DDL语句的开源程序 未来将支持数据库架构转换,以及数据同步等特性,希望大家一起参与开发与优化
mmyolo
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.
ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite
h2ogpt
Join us at H2O.ai to make the world's best open-source GPT with document and image Q&A, 100% private chat, no data leaks, Apache 2.0 https://arxiv.org/pdf/2306.08161.pdf Live Demo: https://gpt.h2o.ai/
RT-DETR
Official RT-DETR, RT-DETR, Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs
Autoformer
About Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008
GirlfriendGPT
Girlfriend GPT is a Python project to build your own AI girlfriend using ChatGPT4.0
LangChain-Chinese-Getting-Started-Guide
LangChain 的中文入门教程
learnopencv
Learn OpenCV : C++ and Python Examples
STVT
Video Summarization With Spatiotemporal Vision Transformer
Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Linux-Kernel-Filesystem-Hook
Simple system file hook driver for control open, read, write and close.
Ask-Anything
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
CnOCR
CnOCR: Awesome Chinese/English OCR toolkits based on PyTorch/MXNet, It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】
FastSAM
Fast Segment Anything
ChatGLM-6B
ChatGLM-6B:开源双语对话语言模型 | An Open Bilingual Dialogue Language Model
pytorch-vsumm-reinforce
Unsupervised video summarization with deep reinforcement learning (AAAI'18)
GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
stable-diffusion-webui
Stable Diffusion web UI
LangChain-ChatGLM-Webui
基于LangChain和ChatGLM-6B等系列LLM的针对本地知识库的自动问答
Fay
Fay是一个完整的开源项目,包含Fay控制器及数字人模型,可灵活组合出不同的应用场景:虚拟主播、现场推销货、商品导购、语音助理、远程语音助理、数字人互动、数字人面试官及心理测评、贾维斯、Her。 开源项目,非产品试用!!!
Robby-chatbot
AI chatbot 🤖 for chat with CSV, PDF, TXT files 📄 and YTB videos 🎥 | using Langchain🦜 | OpenAI | Streamlit ⚡
SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
MOSS
An open-source tool-augmented conversational language model from Fudan University
DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
SadTalker-Video-Lip-Sync
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。