Layla Bouzoubaa (labouz)

Company: Drexel University

Location: Philadelphia, PA

Home Page: laylab.me

Twitter: @bouzoulay


Organizations
social-nlp-lab

Layla Bouzoubaa's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | Stargazers: 90121 | Issues: 674 | Issues: 7300

llama.cpp

LLM inference in C/C++

networkx

Network Analysis in Python

Language: Python | License: NOASSERTION | Stargazers: 14549 | Issues: 281 | Issues: 3189

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language: Python | License: MIT | Stargazers: 5869 | Issues: 53 | Issues: 1650

marqo

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Language: Python | License: Apache-2.0 | Stargazers: 4376 | Issues: 36 | Issues: 237

simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Language: Python | License: Apache-2.0 | Stargazers: 4046 | Issues: 64 | Issues: 1119

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.

Language: Jupyter Notebook | License: MIT | Stargazers: 3184 | Issues: 39 | Issues: 107

alpa

Training and serving large-scale neural networks with auto parallelization.

Language: Python | License: Apache-2.0 | Stargazers: 3024 | Issues: 45 | Issues: 296

pytextrank

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction

Language: Python | License: MIT | Stargazers: 2122 | Issues: 64 | Issues: 99

jellyfish

🪼 a python library for doing approximate and phonetic matching of strings.

Language: Jupyter Notebook | License: MIT | Stargazers: 2026 | Issues: 42 | Issues: 134

AutoPhrase

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

Language: C++ | License: Apache-2.0 | Stargazers: 1167 | Issues: 39 | Issues: 82

nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Language: Jupyter Notebook | Stargazers: 1139 | Issues: 51 | Issues: 8

OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Language: Python | License: MIT | Stargazers: 707 | Issues: 14 | Issues: 102

acl-style-files

Official style files for papers submitted to venues of the Association for Computational Linguistics

potato

potato: portable text annotation tool

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 276 | Issues: 10 | Issues: 41

NLP4SocialGood_Papers

A reading list of up-to-date papers on NLP for Social Good.

tokenizers

Fast, Consistent Tokenization of Natural Language Text

Language: R | License: NOASSERTION | Stargazers: 184 | Issues: 17 | Issues: 64

hate-speech-dataset

Hate speech dataset from Stormfront forum manually labelled at sentence level.

embed

Extra recipes for predictor embeddings

Language: R | License: NOASSERTION | Stargazers: 141 | Issues: 12 | Issues: 95

asyncpraw

Async PRAW, an abbreviation for "Asynchronous Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.

Language: Python | License: BSD-2-Clause | Stargazers: 107 | Issues: 4 | Issues: 27

DynamicWord2Vec

Dynamic Word Embeddings for Evolving Semantic Discovery code.

Euphemism

Self-Supervised Euphemism Detection and Identification for Content Moderation, IEEE S&P (Oakland) 2021

Language: Python | License: MIT | Stargazers: 29 | Issues: 3 | Issues: 2

IaMaN

It's a Machine and Natural

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 14 | Issues: 6 | Issues: 0

tutorials

Hands-on advanced machine learning for information extraction from tweets: tasks, data, and open source tools

License: Apache-2.0 | Stargazers: 14 | Issues: 4 | Issues: 0

GPS

Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech (ACL-IJCNLP 2021 Findings)

Language: Python | License: MIT | Stargazers: 12 | Issues: 3 | Issues: 1

reddit-analysis

Perform network analysis on reddit

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 11 | Issues: 2 | Issues: 0

needs_detection

Detecting needs during a crisis

Language: Python | Stargazers: 1 | Issues: 4 | Issues: 0