isr-wang's repositories
BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
ChatGPT-Dataset-Reddit
Reddit comments about ChatGPT.
Contrastive-Clustering
Code for the paper "Contrastive Clustering" (AAAI 2021)
Dual-Contrastive-Learning
Code for our paper "Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation"
Learning-Disentangled-Representations-via-Mutual-Information-Estimation
PyTorch implementation of Learning Disentangled Representations via Mutual Information Estimation (ECCV 2020)
LibMTL
A PyTorch Library for Multi-Task Learning
lrec22-d3-dataset
The official repository for the LREC'22 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research"
MultiObjectiveOptimization
Source code for the Neural Information Processing Systems (NeurIPS) 2018 paper "Multi-Task Learning as Multi-Objective Optimization"
ncsnv2
The official PyTorch implementation for NCSNv2 (NeurIPS 2020)
NeuralOptimalTransport
PyTorch implementation of "Neural Optimal Transport" (ICLR 2023)
NeuralSinkhornTopicModel
Neural Topic Model via Optimal Transport, ICLR 2021
OCTIS
OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at the EACL 2021 demo track)
ot-4-ml-reading-group
Reading Group @mila-iqia on Computational Optimal Transport for Machine Learning Applications
prompt-engineering-for-developers
Chinese version of Andrew Ng's large language model course series, including "Prompt Engineering", "Building System", and "LangChain"
Pytorch-PCGrad
PyTorch reimplementation of "Gradient Surgery for Multi-Task Learning"
snscrape
A social networking service scraper in Python
Tensorflow_Pytorch_Sinkhorn_OT
TensorFlow (1.0 or 2.0) and PyTorch implementations of the Sinkhorn algorithm [1] for computing the optimal transport (OT) distance between two discrete distributions.
TopClus
[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations
tweepy
Twitter for Python!
Tweepy_Academic
Search and download tweets from Twitter using Tweepy and Twitter API v2. The code is for those with an Academic Research Twitter account, which has a limit of 10 million tweets a month and allows full-archive search back to March 2006.
twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
twitter-reddit-sentiment-pipline
Twitter & Reddit Data Pipeline for ChatGPT Sentiment Analysis
Twitter-Sentiment-Analysis-about-ChatGPT
A quantitative study of over 1.25 million tweets about ChatGPT, employing data scraping, data cleaning, EDA, topic modeling, and sentiment analysis.
vicreg
VICReg official code base
WassersteinFisherRaoDistance
An optimization method for computing the unbalanced Wasserstein-Fisher-Rao optimal transport distance between two measures on S^2. Includes an application to computing the SRNF shape distance and color transfer.