stephbuon

https://stephbuon.github.io/

Organizations

rOpenGov

Steph Buongiorno's repositories

democracy-viewer

Language: JavaScript | License: GPL-3.0 | 200

stephbuon.github.io

My portfolio.

License: GPL-3.0 | 000

AdHominem

Authorship Verification in Social Media via Attention-based Similarity Learning

Language: Python | License: NOASSERTION | 000

hansard-speakers

A data processing pipeline to disambiguate speakers in the 19th-century British Parliamentary debates.

Language: Python | License: MIT | 100

democracy-lab

Code, manuals, and concepts for Democracy Lab research and affiliate projects.

Language: Jupyter Notebook | License: MIT | 000

faha-2023

Code for "Foundations and Applications of Humanities Analytics" (2023) at the Santa Fe Institute

Language: Jupyter Notebook | License: MIT | 100

faha-2022

Code for "Foundations and Applications of Humanities Analytics" (2022) at the Santa Fe Institute

Language: Jupyter Notebook | License: MIT | 100

dhmeasures

"White box" statistical functions for analyzing textual corpora.

Language: C++ | License: NOASSERTION | 000

posextract

Grammatical information extraction methods designed for the analysis of historical and contemporary textual corpora.

Language: Python | License: MIT | 300

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

License: MIT | 000

SCM4LLMs

100

hpc_docs

HPC Documentation and Examples

000

homepage

Source code for ropengov.org

Language: HTML | License: MIT | 000

noaa

For Accessing Current and Historic Weather Data by the National Oceanic and Atmospheric Administration (NOAA)

Language: R | License: MIT | 000

posextractr

Grammatical information extraction methods designed for the analysis of historical and contemporary textual corpora.

Language: R | License: MIT | 000

hansardr

Access a cleaned version of the c19 Hansard corpus with improved speaker names in the R environment.

Language: R | License: NOASSERTION | 000

rogtemplate

pkgdown template for rOpenGov packages

Language: R | License: NOASSERTION | 000

entascope

Extract named entities from the websites of news outlets.

Language: Jupyter Notebook | License: MIT | 100

congressional-data-scraper

Export an analysis-ready version of the Daily Editions of the U.S. Congressional Records.

Language: Python | License: MIT | 100

hansard-shiny

Code for the "Hansard Viewer" web app (a prototype app for applying to future support).

Language: R | License: MIT | 500

congress-shiny

Code for the "Congress Viewer" web app (a prototype app for applying to future support).

Language: R | License: MIT | 000

pytorch_active_learning

PyTorch Library for Active Learning to accompany Human-in-the-Loop Machine Learning book

License: MIT | 000

digital-history

Instructional repository for "Text Mining as Historical Method"

Language: Jupyter Notebook | License: GPL-3.0 | 700

twitterscraper

Scrape Twitter for Tweets

License: MIT | 100

text_mining_data_sets

Notebooks for accessing data for text mining tutorials and projects on M2

000

text_mining_with_python

000

concept-lab-viewer-march

Latest version of the Shiny app created for the Concept Lab for viewing conceptual networks

License: MIT | 000

think-play-hack

Think-Play-Hack: World Views

License: MIT | 000

get_hansard_data

Script to pull down Hansard data until API works.

License: MIT | 000

box_archive

Scripts to tar, compress, and upload large datasets to Box. The scripts use GNU Tar's multivolume feature to keep each file's size less than 15 GB and Slurm to parallelize uploading the archives.

License: MIT | 000