suolyer

suolyer

Geek Repo

Github PK Tool:Github PK Tool

suolyer's starred repositories

ChineseGLUE

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

Language:PythonStargazers:1772Issues:0Issues:0
Language:PythonStargazers:133Issues:0Issues:0
Language:PythonStargazers:741Issues:0Issues:0

xDeepFM

a project of eXDeepFM

Language:PythonStargazers:1Issues:0Issues:0

fucking-algorithm

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

Language:MarkdownStargazers:124493Issues:0Issues:0

nlpcda

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda

Language:PythonLicense:Apache-2.0Stargazers:1729Issues:0Issues:0

pdf2word

60行代码实现多线程PDF转Word

Language:PythonLicense:MITStargazers:777Issues:0Issues:0

NeZha_Chinese_PyTorch

NEZHA: Neural Contextualized Representation for Chinese Language Understanding

Language:PythonLicense:MITStargazers:260Issues:0Issues:0

longformer-chinese

chinese version of longformer

Language:PythonStargazers:106Issues:0Issues:0
License:MITStargazers:3Issues:0Issues:0

NLP_ability

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力

Language:PythonStargazers:6539Issues:0Issues:0

MarkTool

DoTAT 是一款基于web、面向领域的通用文本标注工具,支持大规模实体标注、关系标注、事件标注、文本分类、基于字典匹配和正则匹配的自动标注以及用于实现归一化的标准名标注,同时也支持迭代标注、嵌套实体标注和嵌套事件标注。标注规范可自定义且同类型任务中可“一次创建多次复用”。通过分级实体集合扩大了实体类型的规模,并设计了全新高效的标注方式,提升了用户体验和标注效率。此外,本工具增加了审核环节,可对多人的标注结果进行一致性检验、自动合并和手动调整,提高了标注结果的准确率。

Language:VueLicense:Apache-2.0Stargazers:578Issues:0Issues:0

competition_baselines

开源的各大比赛baseline

Language:Jupyter NotebookStargazers:372Issues:0Issues:0
Language:PythonLicense:MITStargazers:45Issues:0Issues:0

Chinese-Text-Classification-Pytorch

中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。

Language:PythonLicense:MITStargazers:5213Issues:0Issues:0

Transformers_for_Text_Classification

基于Transformers的文本分类

Language:PythonStargazers:332Issues:0Issues:0

pytorch-transformer

Transformer模型的PyTorch实现

Language:PythonLicense:Apache-2.0Stargazers:8Issues:0Issues:0

Transformer_Pytorch

Transformer(attention-is-all-you-need)的pytorch实现,带run demo,可以跑通

Language:PythonStargazers:10Issues:0Issues:0

TextMatch

基于Pytorch的,中文语义相似度匹配模型(ABCNN、Albert、Bert、BIMPM、DecomposableAttention、DistilBert、ESIM、RE2、Roberta、SiaGRU、XlNet)

Language:PythonStargazers:776Issues:0Issues:0

QA

信息检索实验: 问答系统设计与实现

Language:PythonStargazers:57Issues:0Issues:0

fanqiang

翻墙-科学上网

Language:KotlinStargazers:37786Issues:0Issues:0

tensorflow_practice

tensorflow实战练习,包括强化学习、推荐系统、nlp等

Language:PythonStargazers:6631Issues:0Issues:0

TransformerDemo

Pytorch nn.Transformer Demo

Language:PythonLicense:MITStargazers:52Issues:0Issues:0

ChineseEHRBert

A Chinese EHR Bert Pretrained Model.

Language:PythonStargazers:249Issues:0Issues:0

Language_Understanding_based_BERT

基于BERT的预训练语言模型实现,分为两步:预训练和微调。目前已包括BERT、Roberta、ALbert三个模型,且皆可支持Whole Word Mask模式。

Stargazers:2Issues:0Issues:0

OpenCLaP

Open Chinese Language Pre-trained Model Zoo

License:MITStargazers:976Issues:0Issues:0
License:Apache-2.0Stargazers:672Issues:0Issues:0

ChineseNLPCorpus

中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。

Language:PythonStargazers:4184Issues:0Issues:0

LexiconAugmentedNER

Reject complicated operations for incorporating lexicon for Chinese NER.

Language:PythonStargazers:432Issues:0Issues:0