Wei Coco Xu's repositories
simplification
Text Simplification System and Dataset
SemEval-PIT2015
data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015
CS7650_spring2024
CS 7650 (graduate-level NLP class) at Georgia Tech
tweet_deduplicator
Remove duplicate (identical or near-identical) tweets; sentence splitter for Twitter data.
acl17-handbook
ACL 2017 conference handbook
alignment-scripts
Scripts to preprocess training and test data and to run fast_align and giza
awesome-bert
BERT NLP papers, applications, and GitHub resources, including the newest XLNet; BERT- and XLNet-related papers and GitHub projects
cocoxu.github.io
Wei Xu's Homepage
CS7650_spring2024_projects
CS 7650 (graduate-level NLP class) at Georgia Tech
CS8803-LLM-fall2024
Class website for CS 8803 - LLM
english-words
A text file containing 479k English words for all your dictionary/word-based projects, e.g. auto-completion / autosuggestion
lexi-frontend
Frontend for the Lexi web extension
NeuralTextSimplification
Exploring Neural Text Simplification
OpenNMT-py
Open Source Neural Machine Translation in PyTorch
socialmedia-class.github.io
Social Media and Text Analytics Course at UPenn
SurveyMan
SurveyMan programming language.
ubscrape
ubscrape is an Urban Dictionary scraper for NLP or other large scale analyses.
WLP-Parser
This repository contains a collection of neural network models that we used to demonstrate the utility of our dataset.