Connor Boyle's repositories
pinyin-tapt-wav2vec2
(Re)-Pre-training Wav2Vec2 on Converting Pinyin to Chinese Characters
bert-phi-annotator
PHI Annotator based on BERT trained on 2014 I2B2 data
NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
person-name-annotator
My entry for a person name annotator in NLP Sandbox
date-annotator-example
Example implementation of the NLP Sandbox Date Annotator
dragonmapper
Identification and conversion functions for Chinese text processing
linked-list
Linked list tutorial in Rust
MarkovMerge
Program for training n-gram-based Markov models on multiple textual sources
mergesort
Implementing mergesort in Rust
mood-reddit
Reddit analytics for Mood
nlp-sandbox-deidentifier
NLP Sandbox de-identification service
nlp-sandbox-schemas
The OpenAPI specifications implemented by NLP Sandbox Methods.
nlpsandbox-client
Python client to interact with the NLP Sandbox
nlpsandbox-website-synapse
Pages of the NLP Sandbox website on Synapse
PeKo
(Connor Boyle's fork of) PeKo: A Large scale Precondition Knowledge dataset
phi-annotator
Connor's PHI Annotator for NLP Sandbox
phi-annotator-huggingface-bert
BERT NLP Sandbox PHI annotator
phi-deidentifier-app
React client for the NLP Sandbox PHI Deidentifier
rustlings
🦀 Small exercises to get you used to reading and writing Rust code!
scikit-learn
scikit-learn: machine learning in Python
sliding-window-demo
Demo of a sliding window for text classification
sorts-rs
Implementing quicksort (and others) in Rust
TaBERT
This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and could be used as a drop-in replacement of a semantic parser's original encoder to compute representations for utterances and table schemas (columns).
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
web-markov
A web service to allow users to train and query text generation bots built on Markov models.