CurisZhou's repositories
bert-topics
A clustering based topic model using BERT embeddings and attention weights in conjunction with Tf-Idf. Trained, tested and evaluated on Twitter data.
ExtractionPro
利用哈工大的ltp,连接工具使用pyltp(从3.4版本改到4)实现了简单的分句,分词,词性分析,语义角色标注,依存句法分析,并以此为基础提出简单的知识图谱三元组抽取
pretrained-models
Open Language Pre-trained Model Zoo
rasa_chatbot_cn
building a chinese dialogue system based on the newest version of rasa(基于最新版本rasa搭建的对话系统)
Administrative-divisions-of-China
中华人民共和国行政区划:省级(省份直辖市自治区)、 地级(城市)、 县级(区县)、 乡级(乡镇街道)、 村级(村委会居委会) ,**省市区镇村二级三级四级五级联动地址数据。
bert4keras
keras implement of transformers for humans
BertBasedCorrectionModels
PyTorch impelementations of BERT-based Spelling Error Correction Models. 基于BERT的文本纠错模型,使用PyTorch实现。
coder2gwy
互联网首份程序员考公指南,由3位已经进入体制内的前大厂程序员联合献上。
Cool-NLPCV
Some Cool NLP and CV Repositories and Solutions (收集NLP中常见任务的开源解决方案、数据集、工具、学习资料等)
cross-transformers-pytorch
Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch
CRSLab
CRSLab is an open-source toolkit for building Conversational Recommender System (CRS).
geocoding-1
地理编码技术,提供地址标准化和相似度计算。
jieba
结巴中文分词
lac
百度NLP:分词,词性标注,命名实体识别,词重要性
ltp
Language Technology Platform
parser
A collection of state-of-the-art models for Dependency Parsing, Constituency Parsing and Semantic Dependency Parsing.
pinyin-data
汉字拼音数据
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
pyBTM
Python wrapper for Biterm Model algorithm
pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
pyunit-address
字符串地址查询,支持自定义地址词库,解析地址,地址识别,地址抽取,中文地址.
pyunit-ner
NER实体识别模型
simbert
a bert for retrieval and generation
text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
WoBERT_pytorch
WoBERT_pytorch