Manuel (manueltonneau)

manueltonneau

Geek Repo

Location:Berlin, Germany

Home Page:manueltonneau.com

Twitter:@ManuelTonneau

Github PK Tool:Github PK Tool

Manuel 's starred repositories

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6422Issues:110Issues:292

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Language:PythonLicense:NOASSERTIONStargazers:2926Issues:55Issues:130

GNNs-Recipe

🟠 A study guide to learn about Graph Neural Networks (GNNs)

License:CC0-1.0Stargazers:1072Issues:19Issues:0

small-text

Active Learning for Text Classification in Python

Language:PythonLicense:MITStargazers:532Issues:26Issues:55

gsdmm

GSDMM: Short text clustering

Language:PythonLicense:MITStargazers:350Issues:5Issues:13

academic-budget-bert

Repository containing code for "How to Train BERT with an Academic Budget" paper

Language:PythonLicense:Apache-2.0Stargazers:308Issues:16Issues:22

hlda

Gibbs sampler for the Hierarchical Latent Dirichlet Allocation topic model

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:145Issues:6Issues:12

whatsapp-public-groups

Code to get data from WhatsApp public groups

timelms

TimeLMs: Diachronic Language Models from Twitter

Language:Jupyter NotebookStargazers:98Issues:3Issues:1

embedded-topic-model

A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM

Language:PythonLicense:MITStargazers:83Issues:3Issues:9

telegram

Pushshift Telegram Ingest

Language:PythonLicense:MITStargazers:83Issues:10Issues:8

twitter-demographer

A python package to enrich Twitter Data

Language:PythonLicense:MITStargazers:73Issues:3Issues:7

geographconv

Semi-supervised User Geolocation via Graph Convolutional Networks

IndoBERTweet

IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)

Language:PythonStargazers:55Issues:4Issues:0

NQTM

Code for Short Text Topic Modeling with Topic Distribution Quantization and Negative Sampling Decoder (EMNLP2020).

pt-avitm

PyTorch implementation of AVITM (Autoencoding Variational Inference For Topic Models)

Language:PythonLicense:MITStargazers:35Issues:3Issues:7

seegull

SeeGULL is a broad-coverage stereotype dataset in English containing stereotypes about identity groups spanning 178 countries across 8 different geo-political regions across 6 continents, as well as state-level identities within the US and India.

reuters_loader

Load and convert dataset RCV1-v2 to csv file

Language:PythonLicense:MITStargazers:28Issues:0Issues:1

twauth-web

A simple Python Flask web app that demonstrates the flow of obtaining a Twitter user OAuth access token

Language:HTMLLicense:MITStargazers:25Issues:4Issues:1

CSSReview

This repository contains the paperlist of CSS.

BotPercent

implementation of "BotPercent: Estimating Twitter Bot Populations from Groups to Crowds"

Language:PythonStargazers:18Issues:1Issues:0

climate-news-db

A database of climate change newspaper articles

TWilBert

Specialization of BERT architecture both for the Spanish language and the Twitter domain

Language:PythonLicense:NOASSERTIONStargazers:13Issues:3Issues:1

twitwi

Collection of Twitter-related helper functions for python.

Language:PythonLicense:MITStargazers:10Issues:5Issues:46

income-prediction

predicting the occupation and income of Twitter users using graph embeddings

Language:Jupyter NotebookStargazers:9Issues:2Issues:1
Language:Jupyter NotebookLicense:MITStargazers:9Issues:0Issues:0

get-tweets

Single Python script to get tweet JSON objects from a list of tweet IDs

Language:PythonLicense:GPL-3.0Stargazers:7Issues:1Issues:0
Language:PythonStargazers:7Issues:4Issues:0

py_misinfo_exposure

A Python package that can be used to calculate misinformation-exposure scores for a user based on the falsity scores of public figures they follow on Twitter.

Language:PythonLicense:CC0-1.0Stargazers:6Issues:1Issues:5

ETM_tf

Tensorflow inplements of "Topic Modeling in Embedding Spaces" by Adji B. Dieng, Francisco J. R. Ruiz, and David M. Blei. (Arxiv link: https://arxiv.org/abs/1907.04907)

Language:PythonStargazers:6Issues:0Issues:0