bjutliulei's repositories
AntSpider
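Source code for collecting 10 million Douban movie, review, celebrity, and rating records (includes a downloadable dataset of ten million movie records)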
fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Summarization-Papers
Summarization Papers
transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, code, datasets, applications, tutorials.
Research
Novel deep learning research work with PaddlePaddle
pycorrector
pycorrector is a toolkit for text error correction. It implements Kenlm, Seq2Seq_Attention, BERT, MacBERT, ELECTRA, ERNIE, Transformer, and other models, and works out of the box.
ctx-rewriter-for-summ
Contextualized Rewriting for Text Summarization
Process-Data-of-CNN-DailyMail
This repository holds the output of the repository: https://github.com/abisee/cnn-dailymail
ABSA-Reading-List
Reading list of aspect-based sentiment analysis.
HanLP
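Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, semantic dependency parsing, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing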
CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
EasyBert
BERT applications built on PyTorch, including named entity recognition, sentiment analysis, text classification, and text similarity
asap
ASAP: A Chinese Review Dataset Towards Aspect Category Sentiment Analysis and Rating Prediction
RoBERTaABSA
Implementation of the paper "Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa".
text2vec
text2vec, text to vector. Text vectorization, including word vectors, sentence vectors, long-text vectors, and text similarity computation.
fastText
Library for fast text representation and classification.
vision
Datasets, Transforms and Models specific to Computer Vision
text
Data loaders and abstractions for text and NLP
faceswap
Deepfakes Software For All
NeuralNLP-NeuralClassifier
An Open-source Neural Hierarchical Multi-label Text Classification Toolkit
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
PreSumm
Code for the EMNLP 2019 paper "Text Summarization with Pretrained Encoders"
Chinese-Word-Vectors
100+ pre-trained Chinese word vectors
face_recognition
The world's simplest facial recognition api for Python and the command line
mongolian-nlp
Useful resources for Mongolian NLP
pu-learning-1
Pytorch implementation of risk estimators for unbiased and non-negative positive-unlabeled learning
BaiduSpider
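BaiduSpider, a crawler for scraping Baidu search results. It currently supports Baidu web search, image search, Zhidao (Q&A) search, video search, news search, Wenku (document library) search, Jingyan (how-to) search, and Baike (encyclopedia) search.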
arbitrary_pu
NeurIPS'20 Paper: "Learning from Positive and Unlabeled Data with Arbitrary Positive Shift"
Bert-Chinese-Text-Classification-Pytorch
Chinese text classification using Bert and ERNIE
Synonyms
:herb: Chinese synonyms: a toolkit for chatbots and intelligent question answering