robert zou (zouxiaodong)

robert zou's repositories

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

License: NOASSERTION | Stargazers: 0 | Issues: 0

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

License: MIT | Stargazers: 0 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

License: MIT | Stargazers: 0 | Issues: 0

DataGrip

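DataGrip is an open-source program that exports DDL statements from MySQL, Postgres, and Oracle, including CREATE TABLE statements, views, indexes, and sequences. Support for database schema conversion and data synchronization is planned, and contributions to development and optimization are welcome.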

License: Apache-2.0 | Stargazers: 0 | Issues: 0

mmyolo

OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

h2ogpt

Join us at H2O.ai to make the world's best open-source GPT with document and image Q&A, 100% private chat, no data leaks, Apache 2.0 https://arxiv.org/pdf/2306.08161.pdf Live Demo: https://gpt.h2o.ai/

License: Apache-2.0 | Stargazers: 0 | Issues: 0

RT-DETR

Official RT-DETR (Real-Time DEtection TRansformer): DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

License: Apache-2.0 | Stargazers: 0 | Issues: 0

h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Autoformer

Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008

License: MIT | Stargazers: 0 | Issues: 0

GirlfriendGPT

Girlfriend GPT is a Python project to build your own AI girlfriend using ChatGPT4.0

Stargazers: 0 | Issues: 0

LangChain-Chinese-Getting-Started-Guide

A Chinese-language getting-started tutorial for LangChain

Stargazers: 0 | Issues: 0

learnopencv

Learn OpenCV : C++ and Python Examples

Stargazers: 0 | Issues: 0

STVT

Video Summarization With Spatiotemporal Vision Transformer

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Fengshenbang-LM

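Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.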

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Linux-Kernel-Filesystem-Hook

A simple filesystem hook driver for controlling open, read, write, and close operations.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

Ask-Anything

[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

License: MIT | Stargazers: 0 | Issues: 0

CnOCR

CnOCR: Awesome Chinese/English OCR Python toolkit based on PyTorch/MXNet. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

FastSAM

Fast Segment Anything

License: Apache-2.0 | Stargazers: 0 | Issues: 0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

pytorch-vsumm-reinforce

Unsupervised video summarization with deep reinforcement learning (AAAI'18)

License: MIT | Stargazers: 0 | Issues: 0

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

License: NOASSERTION | Stargazers: 0 | Issues: 0

stable-diffusion-webui

Stable Diffusion web UI

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

LangChain-ChatGLM-Webui

Automated question answering over a local knowledge base, built on LangChain and LLMs such as the ChatGLM-6B series

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Fay

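Fay is a complete open-source project comprising the Fay controller and digital-human models, which can be flexibly combined for different application scenarios: virtual streamer, live sales, shopping guide, voice assistant, remote voice assistant, digital-human interaction, digital-human interviewer and psychological assessment, Jarvis, Her. This is an open-source project, not a product trial!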

License: GPL-3.0 | Stargazers: 0 | Issues: 0

Robby-chatbot

AI chatbot 🤖 for chat with CSV, PDF, TXT files 📄 and YTB videos 🎥 | using Langchain🦜 | OpenAI | Streamlit ⚡

License: MIT | Stargazers: 0 | Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

License: MIT | Stargazers: 0 | Issues: 0

MOSS

An open-source tool-augmented conversational language model from Fudan University

License: Apache-2.0 | Stargazers: 0 | Issues: 0

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

SadTalker-Video-Lip-Sync

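This project builds on SadTalker to implement Wav2Lip-style lip synthesis for video. It takes a video file as input and generates lip shapes driven by speech, then applies a configurable enhancement to the facial region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips. It uses DAIN, a deep-learning frame-interpolation algorithm, to add frames to the generated video, filling in the motion transitions between synthesized lip frames so that the result looks smoother, more realistic, and more natural.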

Stargazers: 0 | Issues: 0