There are 94 repositories under text-classification topic.
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
all kinds of text classification models and more with deep learning
Natural Language Processing Best Practices & Examples
CNN-RNN中文文本分类,基于TensorFlow
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
搜索所有中文NLP数据集,附常用英文NLP数据集
State of the Art Natural Language Processing
Accelerated deep learning R&D
The Python Code Tutorials
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
Text Classification Algorithms: A Survey
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Data augmentation for NLP, presented at EMNLP 2019
自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
A list of NLP(Natural Language Processing) tutorials
MTEB: Massive Text Embedding Benchmark
Graph Convolutional Networks for Text Classification. AAAI 2019
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
An elegent pytorch implement of transformers
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more .