Cassandra Jacobs's starred repositories
openlexicon
Access to lexical databases
hopsparser
A neural dependency parser that does its best
noun-compound-interpretation
UBC Summer 2022 Undergraduate Research Project
intergroupEntropy
Measuring entropy in communication between and within groups
GPT2ForwardBackward
Code for running forward and backward versions of GPT2
Openreview
data from ICLR OpenReview and code for data analysis
TransformerDemo
Pytorch nn.Transformer Demo
zeugma_norms
Relatedness norms of ambiguous words using zeugmatic sentences.
prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
AusterweilLab.github.io
Lab Website. site files copied to wisc.edu
Phrase-Detectives-Corpus-2.1.4
Phrase Detectives Corpus 2.1.4
Polyseme-Word-Sense-Similarity-Dataset-v1
This is the first version of a Polyseme Word Sense Similarity Dataset collected by Janosch Haber and Massimo Poesio for the DALI Project at queen Mary University of London.
UniversalAnaphora
An initiative to collect and distribute resources for co-reference resolution in a unified standard.
Mask-Language-Model
pytorch; mask language model ; bert
github-typo-corpus
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).