There are 43 repositories under text-mining topic.
:book: A curated list of resources dedicated to Natural Language Processing (NLP)
extract text from any document. no muss. no fuss.
Beautiful visualizations of how language differs among document types.
a curated list of R tutorials for Data Science, NLP and Machine Learning
A curated list of resources dedicated to text summarization
Manuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
Text mining using tidy tools :sparkles::page_facing_up::sparkles:
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
A collection of notebooks for Natural Language Processing from NLP Town
从新浪财经、每经网、金融界、中国证券网、证券时报网上,爬取上市公司(个股)的历史新闻文本数据进行文本分析、提取特征集,然后利用SVM、随机森林等分类器进行训练,最后对实施抓取的新闻数据进行分类预测
A Node.Js / Neo4J tool that translates words and relations into network graphs and shows you how it all connects.
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Repository with all what is necessary for sentiment analysis and related areas
Resources for learning about Text Mining and Natural Language Processing
Various Algorithms for Short Text Mining
Language, Knowledge, Cognition
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Machine Learning Lectures at the European Space Agency (ESA) in 2018
Python文本挖掘系统 Research of Text Mining System
This repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
Fake News Detection in Python
A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI :octocat:)