There are 80 repositories under the text-classification topic.
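Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyphrase extraction, automatic summarization, text classification and clustering, pinyin and simplified-traditional conversion, natural language processing
Large Scale Chinese Corpus for NLP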
all kinds of text classification models and more with deep learning
Natural Language Processing Best Practices & Examples
CNN-RNN Chinese text classification, based on TensorFlow
Transformers for Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
State of the Art Natural Language Processing
Accelerated deep learning R&D
Search all Chinese NLP datasets, with commonly used English NLP datasets included
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification, including Word2Vec, BERT, and GPT2 language embeddings.
✨ Argilla: Open-source platform empowering teams to make better LLM and NLP-based products through human feedback and curation
Text Classification Algorithms: A Survey
The Python Code Tutorials
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
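Chinese long-text classification, short-sentence classification, multi-label classification, and two-sentence similarity (Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short); base classes for building character, word, and sentence embedding layers (embeddings) and network layers (graph); FastText, TextCNN, CharCNN, TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, Capsule Network (CapsuleNet), Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
Natural language processing (NLP): Xiaojiang robot (retrieval-based chit-chat chatbot), BERT sentence embeddings and similarity (Sentence Similarity), XLNET sentence embeddings and similarity (text xlnet embedding), text classification, entity extraction (ner, bert+bilstm+crf), data augmentation (text augment, data enhance), paraphrase and synonym generation, sentence main-part extraction (mainpart), Chinese short-text similarity, text feature engineering, and calls through keras-http-service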
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Data augmentation for NLP, presented at EMNLP 2019
A list of NLP(Natural Language Processing) tutorials
Graph Convolutional Networks for Text Classification. AAAI 2019
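An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes.
A chatbot for the finance and legal domains (with some chit-chat capability). Its main modules include information extraction, NLU, NLG, and a knowledge graph. The front-end display is integrated with Django, and RESTful interfaces for the NLP and KG components have been packaged.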
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
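A PyTorch implementation of BERT for seq2seq tasks using the UniLM approach. It can now also handle automatic summarization, text classification, sentiment analysis, NER, part-of-speech tagging, and other tasks. Supports the T5 model and GPT2 for article continuation.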
Text classifier for Hierarchical Attention Networks for Document Classification
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
A tool for learning vector representations of words and entities from Wikipedia
Natural language detection library for Rust. Try demo online: https://whatlang.org/
Obsei is a low-code, AI-powered automation tool. It can be used in various business flows like social listening, AI-based alerting, brand image analysis, comparative study, and more.
🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!