shaohua.zhang (BeyondYourself)

Location: Hangzhou, China

shaohua.zhang's repositories

AI_Tutorial

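A curated collection of learning materials for AI fields such as machine learning, NLP, image recognition, and deep learning, plus technical material on the architecture and algorithms of search, recommendation, and advertising systems.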

AlphX-Code-For-DAR

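Source code of team Alphx for the ancient document image recognition and analysis algorithm contest of the Guangdong-Hong Kong-Macao Greater Bay Area (Huangpu) International Algorithm Case Competition.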

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

awesome-digital-human

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

cat-catch

cat-catch (猫抓): a Chrome extension for sniffing resources.

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CenterNet

Object detection, 3D detection, and pose estimation using center point detection.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

char-detection

🔥 Character (single-character) detection based on CRNN

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

Code-LMs

Guide to using pre-trained large language models of source code

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

CVprojects

Computer vision projects | fun AI projects related to computer vision (Python, C++)

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

danbooru-diffusion-prompt-builder

Danbooru / NovelAI tag supermarket

Language: TypeScript | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DAVAR-Lab-OCR

Implementations of some works from Davar-Lab. Currently we have the code of Text Perceptron (AAAI 2020). Code for other works will be published soon, including YORO (ACMMM 2019), TRIE (ACMMM 2020), FREE (TIP 2020), SPIN (AAAI 2021), MANGO (AAAI 2021), etc.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

DeepFaceLive

Real-time face swap for PC streaming or video calls

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

gpt-researcher

GPT-based autonomous agent that does comprehensive online research on any given topic

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

inst-inpaint

A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

llm_babyCare

Parenting handbook

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

OMML

Multi-modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval, and image captioning.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Pix2Text

Pix In, LaTeX & Text Out. Recognize Chinese and English text and math formulas from images.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

RingRWKV

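Fixes RWKV compatibility issues in the official Transformers library. After conversion, all RWKV model series can be deployed and fine-tuned through the RingRWKV library as simply and conveniently as other transformer models.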

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

sd-webui-EasyPhoto

📷 EasyPhoto | Your Smart AI Photo Generator.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Serving

A flexible, high-performance carrier for machine learning models (the PaddlePaddle serving deployment framework)

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

TableGeneration

Generates table images by rendering them in a browser

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Token-Path-Prediction

This is an unofficial re-implementation of the EMNLP 2023 paper: Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Topic-on-Table-Recognition

This is a survey on the topic of table recognition

Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

VIMER

Repository of vision pre-trained foundation models

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

WenxinWorkshop-Python-SDK

A third-party Python SDK for the WenxinWorkshop (文心千帆) platform.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0