zhanglipku's repositories
text
Using Transformers from HuggingFace in R
garc
Python library and command line tool for collecting JSON data from Gab.ai. Scrape posts, users and comments from the "free-speech" social media platform Gab.
lightning
Build and train PyTorch models and connect them to the ML lifecycle using Lightning App templates, without handling DIY infrastructure, cost management, scaling, and other headaches.
gogettr
Public API client for GETTR, a "non-bias [sic] social network," designed for data archival and analysis.
gpt-j
A GPT-J API to use with Python 3 to generate text, blogs, code, and more
hSBM_Topicmodel
Using stochastic block models for topic modeling
BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
2019-10-fcc-comments
Data, code, and methodology supporting BuzzFeed News' analysis of comments submitted to three Federal Communications Commission (FCC) dockets.
webXray
webXray is a tool for analyzing webpage traffic and content, extracting legal policies, and identifying the companies which collect user data.
JSON-QAnon
A machine readable archive of QAnon drops for research only
rainette
R implementation of the Reinert text clustering method
LegalPLMs
Source code and checkpoints for legal pre-trained language models.
voson.tcn
R package for collecting threaded Twitter conversations and generating networks.
US_County_Level_Election_Results_08-20
United States General Election Presidential Results by County from 2008 to 2020
crack-detection
Crack detection using a softmax classifier and a pre-trained ResNet-18 model
gssr
General Social Survey (GSS) data files packaged for R
textnets
R package to perform automated text analysis using network techniques
sentence-transformers
Sentence Embeddings with BERT & XLNet
reddit_incivility
Classification of incivility in Reddit posts
ANTMN
Supplementary code for "News Frame Analysis: An Inductive Mixed-method Computational Approach" http://dx.doi.org/10.1080/19312458.2019.1639145
netwulf
Interactive visualization of networks based on Ulf Aslak's d3 web app.
sentimentr
Dictionary-based sentiment analysis that considers valence shifters
lit
The Language Interpretability Tool: Interactively analyze NLP models for model understanding in an extensible and framework-agnostic interface.
UnsupervisedStanceDetection
Code for Embeddings-Based Clustering for Target Specific Stances
texthero
Text preprocessing, representation and visualization from zero to hero.
jamovi
jamovi - open software to bridge the gap between researcher and statistician
stLDA-C_public
Single-topic LDA (DMM) with unsupervised clustering
botnet-detection
Topological botnet detection datasets and graph neural network applications
MediaCloud-API-Tutorial-Notebooks
A set of Jupyter notebooks demonstrating how to use the Media Cloud API.