Jessica Martin (jlmartin100)

Location: Chicago, IL

Jessica Martin's starred repositories

corex_topic

Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx

Language: Python | License: Apache-2.0 | Stargazers: 626 | Issues: 0

topic_modelling_demo

A workflow for CorEx-based topic modeling

Language: Jupyter Notebook | Stargazers: 4 | Issues: 0

tqdm

A Fast, Extensible Progress Bar for Python and CLI

Language: Python | License: NOASSERTION | Stargazers: 28069 | Issues: 0

nlp-resources

Natural language processing resources for multiple languages, with an eye towards use for digital humanities.

License: GPL-3.0 | Stargazers: 123 | Issues: 0

EPIC

EPIC: a large collection of over 30 million epidemic-related tweets

Stargazers: 12 | Issues: 0

twitter-protest-analysis

Analysis of 10 million tweets

Language: Python | Stargazers: 1 | Issues: 0

metis-project4

Investigating the impact of Twitter Bots on the 2020 U.S. Presidential Election's Twitter Discourse

Language: Jupyter Notebook | Stargazers: 2 | Issues: 0

Twitter_NLP

Metis NLP project on Twitter Customer Service data

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0

tweet-clustering

Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms

Language: Jupyter Notebook | Stargazers: 36 | Issues: 0

COVID-19-Arabic-Tweets-Dataset

This repository contains a collection of Arabic tweet IDs associated with the novel coronavirus COVID-19. The dataset covers tweet IDs from 2020-01-01 to 2020-04-30. The Twitter search API was used to gather real-time tweets that contained specific keywords in the Arabic language. The dataset contains almost four and a half million Arabic tweets.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 27 | Issues: 0

WIDH_2020_Arabic_Text_Analysis

Material for the Text Analysis of Arabic course taught at the NYU Abu Dhabi Winter Institute in Digital Humanities 2020.

Language: Jupyter Notebook | Stargazers: 12 | Issues: 0

arabic-stop-words

Largest list of Arabic stop words on GitHub.

License: MIT | Stargazers: 302 | Issues: 0

dldiy-practicals

Slides, Jupyter Notebooks and scripts for the Deep Learning: Do-It-Yourself! lectures at ENS

Language: Jupyter Notebook | Stargazers: 21 | Issues: 0

Topic-Modeling-of-Tweets-Related-to-NFL-and-National-Anthem

My fourth project completed at Metis, which uses topic modeling to detect structure in tweets related to the NFL and the national anthem.

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0

arabic_word_embeddings_CNN

Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification (Coling 2016)

Language: Python | Stargazers: 3 | Issues: 0

twint

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

Language: Python | License: MIT | Stargazers: 15671 | Issues: 0

Language: Jupyter Notebook | Stargazers: 3 | Issues: 0

hULMonA

hULMonA (حلمنا)​: tHe first Universal Language MOdel iN Arabic

Language: Jupyter Notebook | Stargazers: 8 | Issues: 0

Arabic-Image-Captioning

Generate Arabic captions for images using Deep Learning

Language: Jupyter Notebook | Stargazers: 26 | Issues: 0

Arabic-Image-Captioning

Generate Arabic captions for images using Deep Learning

Language: Jupyter Notebook | Stargazers: 16 | Issues: 0

Arabic-Empathetic-Chatbot

Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model

Language: Jupyter Notebook | Stargazers: 55 | Issues: 0

hULMonA

hULMonA (حلمنا)​: tHe first Universal Language MOdel iN Arabic

Language: Jupyter Notebook | Stargazers: 46 | Issues: 0

arabert

Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA)

Language: Python | Stargazers: 613 | Issues: 0

Arabic-named-entity-recognition

Arabic named entity recognition using the AnerCorp corpus (location, organisation, person, miscellaneous)

Language: Jupyter Notebook | Stargazers: 37 | Issues: 0

document_cluster

A guide to document clustering in Python

Language: Jupyter Notebook | Stargazers: 505 | Issues: 0

Text-Scraping-Document-Clustering-Topic-modeling

The objective of this project is to scrape a corpus of news articles from a set of web pages, pre-process the corpus, and then to apply unsupervised clustering algorithms to explore and summarise the contents of the corpus.

Part 1. Text Data Scraping
This part of the project should be implemented as a Python script.
1. Identify the URLs for all news articles listed on the website: http://mlg.ucd.ie/modules/COMP41680/news/index.html
2. Retrieve all web pages corresponding to these article URLs.
3. From the web pages, extract the main body text containing the content of each news article. Save the body of each article as plain text.

Part 2. Corpus Exploration
Tasks to be completed in your IPython notebook:
1. Load the text corpus generated in Part 1. Apply any appropriate pre-processing steps and construct a document-term matrix representation of the corpus.
2. Summarise the overall corpus by identifying the most characteristic terms and phrases in the corpus.
3. Apply two alternative clustering algorithms of your choice to the document-term matrix to produce clusters of related documents. This might require applying each algorithm several times with different parameter values.
4. For each clustering generated in Step 3, summarise the contents of the clusters. Based on your summary, suggest a topic/theme for each cluster.

Language: Jupyter Notebook | Stargazers: 49 | Issues: 0

04_biden_election_tweets_NLP

METIS PROJECT 4: NATURAL LANGUAGE PROCESSING & UNSUPERVISED LEARNING // Skills: NLTK, scikit-learn (TF-IDF vectorizer, K-means clustering, PCA, t-SNE), Wordcloud library

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0

nlp-in-python-tutorial

Comparing stand-up comedians using natural language processing

Language: Jupyter Notebook | Stargazers: 1699 | Issues: 0

gt-nlp-class

Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"

Language: TeX | Stargazers: 4914 | Issues: 0

open-data-registry

A registry of publicly available datasets on AWS

Language: Python | License: Apache-2.0 | Stargazers: 1365 | Issues: 0