Dr Mo El-Haj's repositories
NLP_ML_Visualization_Tutorial
This is a step by step tutorial for text analyst who want an easy start to basic and and common techniques in NLP, Text Analysis, Machine Learning, Topic Modelling and corpus Linguistics. The tutorial is pat of the "Data Visualisation Workshop for Critical Computational Discourse" at the Data Science Institute at Lancaster University, UK. Presented by Dr Mahmoud El-Haj https://www.lancaster.ac.uk/staff/elhaj
OsmanReadability
Open Source tool for Arabic text readability
ArabicDialects
test files not a real project
MachineLearning
Text Classification using Machine Learning session at Lancaster Summer Schools in Corpus Linguistics
CFIE-FRSE-2019-Runnable
This is the testing version for CFIE-FRSE for the Stable version please use https://github.com/drelhaj/CFIE-FRSE
Java_WordCloud_LogLikelihood
Java tool to create word cloud by calculating frequencies and log Likelihood for a word between two large corpora
Tweepy_Academic
Search and download tweets from Twitter using Tweepy and Twitter API v.2. The code is for those with an Academic Research Twitter Account, which has a limit of 10 million tweets a month and allows fully archive search back to March 2006.
acl-anthology
Data and software for building the ACL Anthology.
BLC-WSD-Frontend
Word sense disambiguation task front-end for translation of USAS lexicons
BioTextMining
Repository for the Bio text mining project. PI: Jo Knight Lancaster University
TypographicalSentimentValues
Human ratings of sentiment for typographical variants for IEEE TAC paper
visualize_my_corpus
This is a step by step tutorial for text analyst who want an easy start to basic and and common techniques in NLP, Text Analysis, Topic Modelling and corpus Linguistics.