OKC0's repositories

General-Documents-Layout-parser

通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser

Language:PythonLicense:NOASSERTIONStargazers:37Issues:4Issues:1

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型集合

License:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

CCL2022-CLTC

CCL 2022 汉语学习者文本纠错评测

Stargazers:0Issues:0Issues:0

Chinese-OCR3

从NLP出发对于OCR的深度实践集锦,重在实战

Stargazers:0Issues:0Issues:0

CLIP-Chinese

中文CLIP预训练模型

Stargazers:0Issues:0Issues:0

DAVAR-Lab-OCR

OCR toolbox from Davar-Lab

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DCN

Dynamic Connected Networks for Chinese Spelling Check

License:Apache-2.0Stargazers:0Issues:0Issues:0

EntityCorrect_Tool

利用actrie树进行句子实体纠错和标准实体抽取,实体纠错:找商银行 -> 招商银行;实体标准化,招行 -> 招商银行

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

JDBrandMember

京东自动入会获取京豆

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

JioNLP

中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com

License:Apache-2.0Stargazers:0Issues:0Issues:0

learn_python3_spider

python爬虫教程系列、从0到1学习python爬虫,包括浏览器抓包,手机APP抓包,如 fiddler、mitmproxy,各种爬虫涉及的模块的使用,如:requests、beautifulSoup、selenium、appium、scrapy等,以及IP代理,验证码识别,Mysql,MongoDB数据库的python使用,多线程多进程爬虫的使用,css 爬虫加密逆向破解,JS爬虫逆向,分布式爬虫,爬虫项目实战实例等

License:MITStargazers:0Issues:0Issues:0

lit-ie

A training and inference framework for open ner and re models! 信息抽取模型的统一训练和推理框架,包含丰富的开源SOTA模型

License:Apache-2.0Stargazers:0Issues:0Issues:0

LLMs_interview_notes

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

License:Apache-2.0Stargazers:0Issues:0Issues:0

NLP-Interview-Notes

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。

Stargazers:0Issues:0Issues:0

NLP_ability

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力

Stargazers:0Issues:0Issues:0

nlp_paper_study

研读顶会论文,复现论文相关代码

Stargazers:0Issues:0Issues:0

nlp_simple_task_impl

NLP的一些常见任务的具体实践

Stargazers:0Issues:0Issues:0

pdfstructure

`pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

pycorrector

pycorrector is a toolkit for text error correction. 文本纠错,Kenlm,Seq2Seq_Attention,BERT,MacBERT,ELECTRA,ERNIE,Transformer等模型实现,开箱即用。

License:Apache-2.0Stargazers:0Issues:0Issues:0

python-random-car-plate-generator

可以随机生成制定数量的车牌号,因为用到停车场的虚假数据生成,所以地区集中在一个地方。支持各类车辆的生成,只需在注释的地方修改即可。

Language:PythonStargazers:0Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

License:Apache-2.0Stargazers:0Issues:0Issues:0

SimCSE-Chinese-Pytorch

SimCSE在中文上的复现,有监督+无监督

License:MITStargazers:0Issues:0Issues:0

so-vits-svc

SoftVC VITS Singing Voice Conversion

License:AGPL-3.0Stargazers:0Issues:0Issues:0

sql-mother

免费的闯关式 SQL 自学教程网站,从 0 到 1 带大家掌握常用 SQL 语法,纯前端实现,简单易学~

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

tr

Free Offline OCR 离线的中文文本检测+识别SDK

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Stargazers:0Issues:0Issues:0

wiki-error-extract

根据维基百科历史编辑数据提取纠错语料。

Language:PythonStargazers:0Issues:0Issues:0