DanK (iamdank)

Location: Australia

DanK's starred repositories

Enron2mbox

Converting the Enron email collection to mbox format

Language: Python | Stargazers: 10 | Issues: 0

Topic-modelling-using-LDA

The Enron database is analysed using Latent Dirichlet allocation.

Language: Jupyter Notebook | Stargazers: 11 | Issues: 0

EnronTopicModelling

Topic Modelling the Enron Emails

Stargazers: 1 | Issues: 0

enron-nlp-mining

Text analysis on Enron emails data

Language: Jupyter Notebook | Stargazers: 5 | Issues: 0

TopicModelComparison

Scripts and code for replicating the experiments published in "Exploring Topic Coherence over Many Models and Many Topics"

Language: Scala | Stargazers: 82 | Issues: 0
Language: Jupyter Notebook | License: MIT | Stargazers: 48 | Issues: 0

wayward

Wayward is a Python package that helps to identify characteristic terms from single documents or groups of documents. It can be used for keyword extraction and several related tasks, and can create efficient sparse representations for classifiers. It was originally created to provide term weights for word clouds.

Language: Python | License: NOASSERTION | Stargazers: 9 | Issues: 0

Doc2Map

Transform a corpus of text documents (of any kind) into a map with different zoom levels and topic names that summarise sub-corpora of similar documents.

Language: HTML | License: MIT | Stargazers: 25 | Issues: 0

financial-news-data

Construct a structured DataFrame from the Reuters news corpus

Language: Python | License: MIT | Stargazers: 19 | Issues: 0

mailinator-box

📬 Stream public Mailinator emails.

Language: Go | License: MIT | Stargazers: 1 | Issues: 0

disposable-emails.github.io

The complete list of disposable email domains

Language: HTML | License: MIT | Stargazers: 1 | Issues: 0

topic-modeling-textPrep

Text preprocessing library for topic models

Language: Python | License: MIT | Stargazers: 5 | Issues: 0

Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

Language: Python | License: BSD-3-Clause | Stargazers: 2909 | Issues: 0

Collab-Rdp

Use Google Colab as a temporary server, with RDP.

Language: Jupyter Notebook | Stargazers: 27 | Issues: 0

OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Language: Python | License: MIT | Stargazers: 709 | Issues: 0

harry_potter_nlp

Harry Potter and the Allocation of Dirichlet

Language: Jupyter Notebook | Stargazers: 123 | Issues: 0
Language: Jupyter Notebook | Stargazers: 116 | Issues: 0

polyleven

Fast Levenshtein Distance Library for Python 3

Language: C | License: MIT | Stargazers: 79 | Issues: 0

topic-modelling

Handy Jupyter Notebooks that I use for topic modeling, including text mining from PDF files, text preprocessing, Latent Dirichlet Allocation (LDA), hyperparameter grid search, and topic modeling visualization.

Language: Jupyter Notebook | License: MIT | Stargazers: 39 | Issues: 0

Colab-Hacks

Simple Hacks for Google Colaboratory to boost your productivity and help you to perform daily tasks.

Language: Jupyter Notebook | License: MIT | Stargazers: 879 | Issues: 0

Smart-Literature-Review

This is the repository for the files and documents used in the Smart Literature Review paper (Boye, Møller, 2019).

Language: HTML | Stargazers: 18 | Issues: 0

T2-Ubuntu

Ubuntu for T2 Macs

Language: Shell | Stargazers: 567 | Issues: 0

mirrors-china

Mirrors and registries in Mainland China

Stargazers: 20 | Issues: 0

Top2Vec-Demo

Demo of Top2Vec generating topics using a BERT model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4 | Issues: 0

Chapter-18-Topic-Modeling

Chapter 18: Topic Modeling

Language: Jupyter Notebook | Stargazers: 2 | Issues: 0

ToModAPI

Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 36 | Issues: 0

topic-model-tutorial

Tutorial on topic models in Python with scikit-learn

Language: Jupyter Notebook | Stargazers: 156 | Issues: 0

brigadier

Fetch and install Boot Camp ESDs with ease.

Language: Python | License: MIT | Stargazers: 2018 | Issues: 0

Sifter

Indexed search and clustering tool for digital forensics

Language: JavaScript | License: Apache-2.0 | Stargazers: 25 | Issues: 0

open-semantic-search

Open-source research tool to search, browse, analyze, and explore large document collections with a semantic search engine and an open-source text mining and text analytics platform (integrates ETL for document processing, OCR for images and PDFs, named entity recognition for persons, organizations, and locations, metadata management by thesauri and ontologies, a search user interface and search apps for full-text search, faceted search, and a knowledge graph)

Language: Shell | License: GPL-3.0 | Stargazers: 947 | Issues: 0