BeyondYourself

Implementations of several works from Davar-Lab. Currently the code for Text Perceptron (AAAI 2020) is available. Code for other works will be published soon, including YORO (ACMMM 2019), TRIE (ACMMM 2020), FREE (TIP 2020), SPIN (AAAI 2021), MANGO (AAAI 2021), etc.

Language: Python | License: Apache-2.0 | 0 | 1 | 0

DeepFaceLive

Real-time face swap for PC streaming or video calls

License: GPL-3.0 | 0 | 0 | 0

gpt-researcher

GPT-based autonomous agent that does comprehensive online research on any given topic

Language: Python | License: MIT | 0 | 0 | 0

inst-inpaint

A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.

Language: Python | License: MIT | 0 | 0 | 0

llm_babyCare

A parenting handbook

License: Apache-2.0 | 0 | 1 | 1

OMML

Multi-modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval, and image captioning.

Language: Python | License: Apache-2.0 | 0 | 0 | 0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)

Language: Python | License: Apache-2.0 | 0 | 1 | 0

paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

License: GPL-3.0 | 0 | 0 | 0

Pix2Text

Pix in, LaTeX & text out. Recognizes Chinese and English text and math formulas in images.

Language: Python | License: MIT | 0 | 0 | 0

PyTorch-Tutorial-2nd

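"PyTorch Practical Tutorial" (2nd edition). Whether you are starting from zero, applying PyTorch to CV, NLP, or LLM projects, or moving on to production engineering and deployment, it is all covered here. With the help of this book, readers should be able to master PyTorch with ease and become excellent deep learning engineers.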

0 | 0 | 0

RingRWKV

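Fixes compatibility issues with RWKV in the official Transformers library, so that after conversion every RWKV model series can be deployed and fine-tuned through the RingRWKV library as simply and conveniently as any other transformer model.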

Language: Python | License: Apache-2.0 | 0 | 0 | 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

License: NOASSERTION | 0 | 0 | 0

sd-webui-EasyPhoto

📷 EasyPhoto | Your Smart AI Photo Generator.

Language: Python | License: Apache-2.0 | 0 | 0 | 0

Serving

A flexible, high-performance carrier for machine learning models (the PaddlePaddle serving deployment framework)

Language: C++ | License: Apache-2.0 | 0 | 1 | 0

Streamer-Sales

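Streamer-Sales (销冠, "Sales Champion"): an LLM for live-stream sales hosts 🛒🎁 that presents a product from its given features with the aim of stimulating viewers' desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 It also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that looks up real-time information online 🌐, and ASR (speech-to-text) 🎙️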

License: Apache-2.0 | 0 | 0 | 0

TableGeneration

Generates table images by rendering them in a browser

Language: Python | License: MIT | 0 | 0 | 0

Topic-on-Table-Recognition

This is a survey on the topic of table recognition

0 | 0 | 0

TransGPT

Language: Python | License: MIT | 0 | 0 | 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | 0 | 0 | 0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

License: Apache-2.0 | 0 | 0 | 0

VIMER

Repository of visual pre-training foundation models

Language: Python | 0 | 1 | 0

WenxinWorkshop-Python-SDK

A third-party Python SDK for the Wenxin Qianfan (WenxinWorkshop) platform.

License: Apache-2.0 | 0 | 0 | 0

BeyondYourself

shaohua.zhang's repositories
