text-classification

There are 94 repositories under text-classification topic.

HanLP
hankcs / HanLP
中文分词词性标注命名实体识别依存句法分析成分句法分析语义依存分析语义角色标注指代消解风格转换语义相似度新词发现关键词短语提取自动摘要文本分类聚类拼音简繁转换自然语言处理
dependency-parser hanlp named-entity-recognition natural-language-processing nlp pos-tagging semantic-parsing text-classification
Language:Python 32517
explosion / spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
natural-language-processing data-science machine-learning python cython nlp artificial-intelligence ai spacy nlp-library neural-network neural-networks deep-learning named-entity-recognition entity-linking text-classification tokenization
Language:Python 28857
brightmart / nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
chinese-dataset chinese-corpus pretrain word2vec nlp bert language-model wiki news question-answering chinese corpus chinese-nlp dataset text-classification
9200
brightmart / text_classification
all kinds of text classification models and more with deep learning
classification nlp fasttext textcnn textrnn tensorflow multi-label multi-class attention-mechanism text-classification convolutional-neural-networks sentence-classification memory-networks
Language:Python 7750
microsoft / nlp-recipes
Natural Language Processing Best Practices & Examples
azure-ml best-practices deep-learning machine-learning mlflow natural-language natural-language-inference natural-language-processing natural-language-understanding nli nlp nlu pretrained-models sota text text-classification transfomer
Language:Python 6334
gaussic / text-classification-cnn-rnn
CNN-RNN中文文本分类，基于TensorFlow
tensorflow cnn text-classification chinese classification rnn tensorboard
Language:Python 4080
simpletransformers
ThilinaRajapakse / simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
conversational-ai information-retrival named-entity-recognition question-answering text-classification transformers
Language:Python 4000
CLUEbenchmark / CLUEDatasetSearch
搜索所有中文NLP数据集，附常用英文NLP数据集
nlp datasets chinese ner qa match text-classification machine-translation knowledge-graph corpus machine-reading-comprehension sentiment-analysis text-similarity text-summarization
Language:Python 3914
snipsco / snips-nlu
Snips Python library to extract meaning from text
nlp nlu python machine-learning text-classification intent-classification ner named-entity-recognition slot-filling intent-parser information-extraction snips machine-learning-library chatbot bot ml
Language:Python 3867
spark-nlp
JohnSnowLabs / spark-nlp
State of the Art Natural Language Processing
albert bert entity-extraction language-detection language-model lemmatizer llm machine-translation named-entity-recognition natural-language-processing nlp part-of-speech-tagger pyspark question-answering sentiment-analysis spark spell-checker tensorflow text-classification transformers
Language:Scala 3711
catalyst-team / catalyst
Accelerated deep learning R&D
deep-learning reinforcement-learning machine-learning computer-vision pytorch python distributed-computing infrastructure research reproducibility image-processing image-classification image-segmentation object-detection natural-language-processing text-classification text-segmentation information-retrieval recommender-system metric-learning
Language:Python 3233
fastnlp / fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
chinese-nlp deep-learning natural-language-processing nlp-library nlp-parsing text-classification text-processing
Language:Python 3034
BrikerMan / Kashgari
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.
bert bert-model gpt-2 machine-learning named-entity-recognition ner nlp nlp-framework seq2seq sequence-labeling text-classification text-labeling transfer-learning
Language:Python 2378
x4nth055 / pythoncode-tutorials
The Python Code Tutorials
python python3 scapy ethical-hacking network-programming network-security network-analysis python-tutorials tutorials scapy-tutorials machine-learning text-classification socket-programming face-detection computer-vision programming-tutorial natural-language-processing web-scraping
Language:Jupyter Notebook 2015
HarderThenHarder / transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
information-extraction nlp reinforcement-learning text-classification text-generation text-matching transformers
Language:Jupyter Notebook 1991
EasyNLP
alibaba / EasyNLP
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
bert deep-learning fewshot-learning knowledge-distillation knowledge-pretraining machine-learning nlp pretrained-models pytorch text-classification text-image-retrieval text-to-image-synthesis transfer-learning transformers
Language:Python 1957
kk7nc / Text_Classification
Text Classification Algorithms: A Survey
text-classification nlp-machine-learning document-classification text-processing dimensionality-reduction rocchio-algorithm boosting-algorithms logistic-regression naive-bayes-classifier k-nearest-neighbours support-vector-machines decision-trees random-forest conditional-random-fields deep-learning deep-neural-network recurrent-neural-networks convolutional-neural-networks deep-belief-network hierarchical-attention-networks
Language:Python 1779
xlang-ai / instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
embeddings information-retrieval language-model prompt-retrieval text-classification text-clustering text-embedding text-evaluation text-reranking text-semantic-similarity
Language:Python 1723
yongzhuo / Keras-TextClassification
中文长文本分类、短句子分类、多标签分类、两句子相似度（Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short），字词句向量嵌入层（embeddings）和网络层（graph）构建基类，FastText，TextCNN，CharCNN，TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
albert bert capsule charcnn crnn dcnn dpcnn embeddings fasttext han keras keras-textclassification leam nlp rcnn text-classification textcnn transformer vdcnn xlnet
Language:Python 1710
text-analytics-with-python
dipanjanS / text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
text-analytics text-summarization text-classification python natural-language natural-language-processing clustering sentiment semantic sentiment-analysis nltk stanford-nlp spacy pattern scikit-learn gensim
Language:Jupyter Notebook 1624
Delta-ML / delta
DELTA is a deep learning based natural language and speech processing platform.
nlp deep-learning tensorflow speech sequence-to-sequence seq2seq speech-recognition text-classification speaker-verification nlu text-generation emotion-recognition tensorflow-serving tensorflow-lite inference asr serving front-end custom-ops ops
Language:Python 1585
jasonwei20 / eda_nlp
Data augmentation for NLP, presented at EMNLP 2019
nlp data-augmentation text-classification synonyms embeddings sentence classification rnn cnn swap position
Language:Python 1552
yongzhuo / nlp_xiaojiang
自然语言处理（nlp），小姜机器人（闲聊检索式chatbot），BERT句向量-相似度（Sentence Similarity），XLNET句向量-相似度（text xlnet embedding），文本分类（Text classification），实体提取（ner，bert+bilstm+crf），数据增强（text augment, data enhance），同义句同义词生成，句子主干提取（mainpart），中文汉语短文本相似度，文本特征工程，keras-http-service调用
bert chatbot chinese data-augmentation distance enhance feature nlp text-augment text-classification xlnet
Language:Python 1512
bfelbo / DeepMoji
State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.
ai deep-learning keras machine-learning natural-language-processing neural-networks nlp python sentiment-analysis tensorflow text-classification
Language:Python 1495
embeddings-benchmark / mteb
MTEB: Massive Text Embedding Benchmark
benchmark bitext-mining clustering information-retrieval multilingual-nlp neural-search reranking retrieval sbert semantic-search sentence-transformers sgpt sts text-classification text-embedding
Language:Python 1448
microsoft / NeuronBlocks
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
question-answering deep-learning pytorch natural-language-processing text-classification artificial-intelligence dnn qna text-matching knowledge-distillation model-compression sequence-labeling
Language:Python 1440
refinery
code-kern-ai / refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
annotations data-centric-ai data-labeling deep-learning labeling labeling-tool machine-learning natural-language-processing neural-search nlp text-annotation transformers python human-in-the-loop spacy artificial-intelligence data-science text-classification active-learning supervised-learning
Language:Python 1365
lyeoni / nlp-tutorial
A list of NLP(Natural Language Processing) tutorials
nlp natural-language-processing nlp-tutorial neural-machine-translation text-classification sentiment-classification
Language:Jupyter Notebook 1363
yao8839836 / text_gcn
Graph Convolutional Networks for Text Classification. AAAI 2019
deep-learning graph-convolutional-networks nlp text-classification
Language:Python 1340
zhanlaoban / EDA_NLP_for_Chinese
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
chinese chinese-data-augmentation data-augmentation easy-data-augmentation eda text-classification
Language:Python 1317
charlesXu86 / Chatbot_CN
基于金融-司法领域(兼有闲聊性质)的聊天机器人，其中的主要模块有信息抽取、NLU、NLG、知识图谱等，并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
deep-learning chatbot-cn tenserflow-serving intent-detection django-restful reinforcement-learning knowledge-graph slot-filling dialogue-systems ir oriented-dialogs sentiment-analysis tensorflow ner nlu nlg attention-mechanism text-correct text-classification recommendation
1270
920232796 / bert_seq2seq
pytorch实现 Bert 做seq2seq任务，使用unilm方案,现在也可以做自动摘要，文本分类，情感分析，NER，词性标注等任务,支持t5模型，支持GPT2进行文章续写。
autotitle bert crf gpt2 ner pytorch roberta seq2seq t5-model text-classification unilm
Language:Python 1264
Hello-SimpleAI / chatgpt-comparison-detection
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
ai chatbot chatgpt dataset nlp openai text-classification python gpt2 gpt3 gpt-3 ml machine-learning deep-learning
Language:Python 1192
Tongjilibo / bert4torch
An elegent pytorch implement of transformers
bert nlp pytorch bert4keras named-entity-recognition relation-extraction seq2seq text-classification transformers bert4torch belle chatglm llama llm large-language-models
Language:Python 1143
obsei
obsei / obsei
Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more .
artificial-intelligence natural-language-processing sentiment-analysis workflow social-network-analysis customer-engagement text-analysis text-analytics python nlp issue-tracking-system customer-support lowcode text-classification anonymization low-code business-process-automation workflow-automation process-automation social-listening
Language:Python 1141
nlp-in-practice
kavgan / nlp-in-practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
gensim machine-learning natural-language-processing nlp text-classification text-mining tf-idf word2vec
Language:Jupyter Notebook 1120