John Schriner's repositories

chan-b

expands threads, scrapes, edits html for corpus linguistics usage

Language: HTML, Stargazers: 4, Issues: 4, Issues: 0

4chan-a-b-pol

A corpus collection of 4chan's /a/ and /b/ from June 2015 and /a/, /b/, and /pol/ from July 2019

Language: HTML, Stargazers: 3, Issues: 2, Issues: 0

chan-a

expands threads, scrapes, edits html for corpus linguistics usage

Language: HTML, Stargazers: 2, Issues: 0, Issues: 0

chan-pol

a script using selenium that expands /pol/ threads, scrapes, and cleans up the text for corpus use

Stargazers: 0, Issues: 0, Issues: 0
Stargazers: 0, Issues: 2, Issues: 0

librarycode

some html and js code for the site

Stargazers: 0, Issues: 0, Issues: 0

NLP

NLP Projects

Language: Jupyter Notebook, Stargazers: 0, Issues: 0, Issues: 0

presentations

Presentations and Works in Progress

Stargazers: 0, Issues: 0, Issues: 0

RU-Stress-Prediction

using Zaliznjak's dictionary and stresscodes, I use FairSeq to predict Russian stress

Stargazers: 0, Issues: 2, Issues: 0

SFWpy

categorizes images and gives them an NSFW evaluation

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

swapscrape

An automation tool using Selenium and ImageScraper to grab images interactively from a page

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

TACIT

We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components: (1) crawling plugins, (2) corpus management, and (3) analysis plugins. TACIT's open-source plugin platform allows the architecture to adapt easily to rapid developments in text analysis.

Language: Java, Stargazers: 0, Issues: 1, Issues: 0

xkcd-substitutions-mozilla

A Firefox extension that replaces ordinary words with much more fun ones.

Language: JavaScript, Stargazers: 0, Issues: 0, Issues: 0