Zachary Yocum's repositories

reader

Extract clean(er), readable text from web pages via Mercury Web Parser.

Language:PythonLicense:MITStargazers:112Issues:3Issues:2

dedup

Find duplicate text files.

Language:PythonLicense:MITStargazers:11Issues:4Issues:0

pdf2md

Convert PDF to Markdown via OpenAI multi-modal text/vision model.

Language:PythonLicense:MITStargazers:9Issues:0Issues:0

IAA

Inter-annotator agreement

Language:PythonLicense:MITStargazers:5Issues:4Issues:0

ipa-grammar

Basic grammar for parsing International Phonetic Alphabet (IPA) transcriptions

Language:Jupyter NotebookLicense:MITStargazers:3Issues:3Issues:0

text-alpha

Python implementation of character-level, textual inter-annotator agreement with Krippendorff's alpha.

Language:PythonLicense:MITStargazers:3Issues:2Issues:0

map-marker

Create Folium maps with markers from natural language queries.

Language:Jupyter NotebookLicense:MITStargazers:2Issues:3Issues:0

cohens_kappa

Functions for computing Cohen’s kappa coefficient

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

newshowrss

Fetch new torrents for a specific NewShowRSS user and kick-off a download.

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

phoible-notebook

Exploratory notebook for inspecting the PHOIBLE data set.

Language:Jupyter NotebookLicense:MITStargazers:1Issues:3Issues:0

ptllm-events-extraction

Demonstration of using OpenAI's pre-trained LLMs for the linguistic annotation task of event extraction.

Language:ShellLicense:MITStargazers:1Issues:0Issues:0

simphon

Proof-of-concept for measuring similarity of phoneme sequences using locality sensitive hashing (LSH).

Language:Jupyter NotebookLicense:MITStargazers:1Issues:1Issues:0

tic-tac-toe

Run a game of Tic-Tac-Toe on the command line via curses

Language:PythonLicense:MITStargazers:1Issues:3Issues:0

wikibase

Python wrapper of Wikibase API: https://www.mediawiki.org/wiki/Wikibase/API#API_modules

Language:PythonLicense:MITStargazers:1Issues:3Issues:0

word_game

Find solutions to a word game where the goal is to find sets of five words that share an interchangeable vowel.

Language:PythonStargazers:1Issues:2Issues:0

annovis

Visualize offset-based annotations of text spans as SVG.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

bitmap

Search 2D bitmaps for blocks of adjacent cells whose values are 1

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

bnf_parser

A Backus-Naur form (BNF) grammar parser implemented in Python.

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

edh-point-system-alpha

Jupyter notebook to explore inter-rater reliability of EDH point-rating system.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

finite_state_machine

A simple Python implementation of a finite state machine.

Language:PythonStargazers:0Issues:2Issues:0

motion-type-classifier

An ISO-Space PATH vs. MANNER vs. COMPOUND MOTION attribute classifier.

Language:HTMLLicense:NOASSERTIONStargazers:0Issues:2Issues:0

paths

Find solution to a graph traversal problem.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

pcfg

Probabilistic context free grammar

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

quotidian

A time-tracking webapp.

Language:JavaScriptStargazers:0Issues:0Issues:0

rolldice

A script for simulating rolling n-sided dice.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

scrape_fcc_ecfs

Scrape Federal Communications Commssion (FCC) Electronic Comments Filing System (ECFS) via publicapi.fcc.gov

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

scrape_wikinews

Python script for scraping Wikinews articles

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0

torrent

Some basic BitTorrent utilities.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

wlcount

Count frequency of word lengths in text files.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0