shaohua.zhang (BeyondYourself)

BeyondYourself

Geek Repo

Location:Hangzhou China

Github PK Tool:Github PK Tool

shaohua.zhang's repositories

AlphX-Code-For-DAR

粤港澳大湾区(黄埔)国际算法算例大赛-古籍文档图像识别与分析算法比赛 Alphx队源码

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

awesome-digital-human

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

License:MITStargazers:0Issues:0Issues:0

cat-catch

猫抓 chrome资源嗅探扩展

License:GPL-3.0Stargazers:0Issues:0Issues:0

CenterNet

Object detection, 3D detection, and pose estimation using center point detection:

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

char-detection

🔥Char detection base on crnn 字符(单字)检测基于CRNN

Language:PythonStargazers:0Issues:1Issues:0

Code-LMs

Guide to using pre-trained large language models of source code

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CVprojects

computer vision projects | 计算机视觉相关好玩的AI项目(Python、C++)

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

danbooru-diffusion-prompt-builder

Danbooru / NovelAI 标签超市

Language:TypeScriptLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

DAVAR-Lab-OCR

The implementations of some works from Davar-Lab. Currently we have the code of Text Perceptron (AAAI 2020). Some works' code will be published soon, including YORO (ACMMM 2019) , TRIE (ACMMM2020), FREE(TIP 2020), SPIN (AAAI 2021), MANGO (AAAI2021), etc.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

DeepFaceLive

Real-time face swap for PC streaming or video calls

License:GPL-3.0Stargazers:0Issues:0Issues:0

gpt-researcher

GPT based autonomous agent that does online comprehensive research on any given topic

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

inst-inpaint

A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

llm_babyCare

育儿宝典

License:Apache-2.0Stargazers:0Issues:1Issues:1

OMML

Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

License:GPL-3.0Stargazers:0Issues:0Issues:0

Pix2Text

Pix In, Latex & Text Out. Recognize Chinese, English Texts, and Math Formulas from Images.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

PyTorch-Tutorial-2nd

《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。

Stargazers:0Issues:0Issues:0

RingRWKV

修复Transformer官方库中RWKV的适配问题,支持RWKV所有系列模型在转换后,通过RingRWKV库,与其他transfomer模型一样简单方便地部署和微调。

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

License:NOASSERTIONStargazers:0Issues:0Issues:0

sd-webui-EasyPhoto

📷 EasyPhoto | Your Smart AI Photo Generator.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Serving

A flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️

License:Apache-2.0Stargazers:0Issues:0Issues:0

TableGeneration

通过浏览器渲染生成表格图像

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Topic-on-Table-Recognition

This is a survey on the topic of table recognition

Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:0Issues:0Issues:0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

License:Apache-2.0Stargazers:0Issues:0Issues:0

VIMER

视觉预训练基础模型仓库

Language:PythonStargazers:0Issues:1Issues:0

WenxinWorkshop-Python-SDK

一个文心千帆平台的第三方 Python SDK。A third-party Python SDK for a WenxinWorkshop.

License:Apache-2.0Stargazers:0Issues:0Issues:0