Layla Bouzoubaa's starred repositories
simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
pytextrank
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
AutoPhrase
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
nlp-in-practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
NLP4SocialGood_Papers
A reading list of up-to-date papers on NLP for Social Good.
tokenizers
Fast, Consistent Tokenization of Natural Language Text
hate-speech-dataset
Hate speech dataset from Stormfront forum manually labelled at sentence level.
DynamicWord2Vec
Dynamic Word Embeddings for Evolving Semantic Discovery code.
reddit-analysis
Perform network analysis on reddit
needs_detection
Detecting needs during a crisis