Jan Philip Wahle's repositories
lrec22-d3-dataset
The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research"
iconf22-paraphrase
The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".
cs-insights
The main controller for the services of the cs-insights project, run through docker-compose.
emnlp23-paraphrase-types
The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"
emnlp22-transforming
The official implementation of the EMNLP 2022 paper "How Large Language Models are Transforming Machine-Paraphrased Plagiarism".
iconf22-covid-misinformation
The official implementation of the iConference 2022 paper "Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection"
cs-insights-crawler
This repository implements the interaction with DBLP, information extraction and pre-processing of papers, and a client that stores data in the cs-insights-backend.
cs-insights-backend
API server of the cs-insights project. It is the main component for storing data and accessing an external data analysis endpoint. It stores all data in a MongoDB instance and queries the cs-insights-prediction-endpoint for machine learning results.
cs-insights-frontend
React frontend of the cs-insights project. It is the main component for visualizing data. It uses the cs-insights-backend and the cs-insights-prediction-endpoint.
cs-insights-prediction-endpoint
Python prediction backend of the cs-insights project, which does the heavy lifting for topic analysis and other semantic analysis features using parent and child Docker containers that can run on different servers.
21-word-sense-disambiguation
The official implementation of the paper "Incorporating Word Sense Disambiguation into Neural Language Models".
acl23-big-tech-nlp
The official implementation of the ACL 2023 paper "The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research"
emnlp23-citation-field-influence
The official implementation of the EMNLP 2023 paper "We are Who We Cite: Bridges of Influence Between Natural Language Processing and Other Academic Fields"
cs-insights-uptime
Uptime tracker for endpoints of the cs-insights project.
citation-age-recession
This is the official implementation of the paper "Citation Amnesia: NLP and Other Academic Fields Are in a Citation Age Recession"
acl-anthology
Data and software for building the ACL Anthology.
cs-insights-webapp
The main frontend and backend of the cs-insights project.
jpwahle
My GitHub Profile Page.