Wissenschaftszentrum Berlin für Sozialforschung / WZB Berlin Social Science Center (WZBSocialScienceCenter)

Wissenschaftszentrum Berlin für Sozialforschung / WZB Berlin Social Science Center

WZBSocialScienceCenter

Geek Repo

Repository with scripts & tools used & developed @ WZB Berlin Social Science Center. Public money – public code! See also https://datascience.blog.wzb.eu.

Location:Berlin, Germany

Home Page:https://wzb.eu

Twitter:@WZB_Berlin

Github PK Tool:Github PK Tool

Wissenschaftszentrum Berlin für Sozialforschung / WZB Berlin Social Science Center's repositories

pdftabextract

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Language:PythonLicense:Apache-2.0Stargazers:2168Issues:86Issues:21

tmtoolkit

Text Mining and Topic Modeling Toolkit for Python with parallel processing power

Language:PythonLicense:Apache-2.0Stargazers:191Issues:16Issues:19

geovoronoi

a package to create and plot Voronoi regions within geographic boundaries

Language:PythonLicense:Apache-2.0Stargazers:128Issues:5Issues:18

pdf2xml-viewer

A simple viewer and inspection tool for text boxes in PDF documents

Language:HTMLLicense:Apache-2.0Stargazers:89Issues:10Issues:1

germalemma

A lemmatizer for German language text

Language:PythonLicense:Apache-2.0Stargazers:86Issues:13Issues:4

plz_geocoord

Dataset of all German postal codes and their geographic center as geo-coordinates.

otreeutils

Facilitate oTree experiment implementation with extensions for custom data models, surveys, understanding questions, timeout warnings and more.

Language:PythonLicense:Apache-2.0Stargazers:17Issues:4Issues:2

pandas-excel-styler

Styling individual cells in Excel output files created with pandas.

otree_iat

Implicit Association Test (IAT) experiment for oTree

Language:PythonLicense:Apache-2.0Stargazers:6Issues:3Issues:0

gemeindeverzeichnis

Python-Modul zum Einlesen von Gemeindeverzeichnisdaten des Statistischen Bundesamts als pandas DataFrame

Language:PythonLicense:Apache-2.0Stargazers:5Issues:4Issues:1

mdb-twitter-network

Twitter network of members of the 19th German Bundestag

Language:RStargazers:5Issues:4Issues:0

r-geodata-workshop

Workshop held at WZB: Working with geo-spatial data in R - Obtaining, linking and plotting geographic data

Language:RStargazers:5Issues:3Issues:0

tm_corona

A small showcase for topic modeling with the tmtoolkit Python package. I use a corpus of articles from the German online news website Spiegel Online (SPON) to create a topic model for before and during the COVID-19 pandemic.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3Issues:2Issues:0

patternlite

Lightweight, Python 3.6+ fork of the original Pattern package: Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Language:PythonLicense:BSD-3-ClauseStargazers:2Issues:2Issues:0

r_clustered_se

Code for blog post "Clustered standard errors with R: Three ways, one result".

Language:RLicense:Apache-2.0Stargazers:2Issues:2Issues:0

r_simplify_features

Code for blog post showing how to simplify spatial features with R.

Language:RLicense:Apache-2.0Stargazers:2Issues:2Issues:0

spatially_weighted_avg

Code for "Spatially weighted averages in R with sf"

Language:RLicense:Apache-2.0Stargazers:2Issues:1Issues:0

covid19-placesapi

Code to obtain and analyse "popular times" data from Google Places. Also contains data fetched between March 22nd and April 15th 2020 for different places world-wide.

Language:RLicense:Apache-2.0Stargazers:1Issues:4Issues:0

dataverse

Open source research data repository software

Language:JavaLicense:NOASSERTIONStargazers:1Issues:2Issues:0

lda

Topic modeling with latent Dirichlet allocation using Gibbs sampling

Language:PythonLicense:MPL-2.0Stargazers:1Issues:2Issues:0

wzb_r_tutorial

Documents for R tutorial given at WZB accompanying the lecture "Studying Social Stratification with Big Data" (Hipp, Ulbricht) in winter semester 2018

aas-chronik-scraper

Web-Scraper für Chronik von antisemitischen Vorfällen erstellt von der Amadeu Antonio Stiftung und publiziert unter https://www.amadeu-antonio-stiftung.de/chronik/

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:TeXLicense:NOASSERTIONStargazers:0Issues:1Issues:0

fetch_google_places_api_data

Script to fetch data of Google Places in Berlin using the Google Places API and popularity data. Used at the beginning of the COVID-19 pandemic to measure change of popularity of different places.

Language:PythonStargazers:0Issues:1Issues:0

github_covid_gender_jfr

Replication code for article "Has Covid-19 increased gender inequalities in professional advancement? Cross-country evidence on productivity differences between male and female software developers" published in the Journal of Family Research.

Language:RStargazers:0Issues:1Issues:0

gmapswrapper

Google Maps API wrapper for Python enables convenient caching of Maps API results.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

hipp_konrad_2021

Replication code for Men’s and women’s productivity before and during the COVID-19 pandemic: Evidence from a cross-country comparison of software developers

Language:HTMLStargazers:0Issues:1Issues:0

otree_amp

Affect Misattribution Procedure (AMP) experiment for oTree

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

wzbsocialsciencecenter.github.io

wzbsocialsciencecenter.github.io landing page.

Stargazers:0Issues:3Issues:0