BayesForDays

Cassandra Jacobs's starred repositories

yoyodyne

Small-vocabulary sequence-to-sequence generation with optional feature conditioning

Language: Python | Apache-2.0 | 2500

A static site generator for data apps, dashboards, reports, and more. Observable Framework combines JavaScript on the front-end for interactive graphics with any language on the back-end for data analysis.

Language: TypeScript | ISC | 219700

openlexicon

Access to lexical databases

Language: HTML | CC-BY-SA-4.0 | 11100

stat_rethinking_2024

Language: R | CC0-1.0 | 81200

gmm-torch

Gaussian mixture models in PyTorch.

Language: Python | MIT | 49400

hopsparser

A neural dependency parser that does its best

Language: Python | NOASSERTION | 1300

noun-compound-interpretation

UBC Summer 2022 Undergraduate Research Project

Language: Jupyter Notebook | 300

intergroupEntropy

Measuring entropy in communication between and within groups

Language: Jupyter Notebook | 100

jsPsych

Create behavioral experiments in a browser using JavaScript

Language: TypeScript | MIT | 101800

GPT2ForwardBackward

Code for running forward and backward versions of GPT2

Language: Python | 1000

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python | NOASSERTION | 1373900

dockerize

Utility to simplify running applications in docker containers

Language: Go | MIT | 495900

wordnet-homonymy

Language: Python | 100

Openreview

Data from ICLR OpenReview and code for data analysis

Language: Python | 4800

TransformerDemo

PyTorch nn.Transformer demo

Language: Python | MIT | 5200

zeugma_norms

Relatedness norms of ambiguous words using zeugmatic sentences.

Language: HTML | 200

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | MIT | 2984200

prompt_semantics

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Language: Python | MIT | 8300

AusterweilLab.github.io

Lab website. Site files copied to wisc.edu

Language: HTML | 200

Phrase-Detectives-Corpus-2.1.4

Phrase Detectives Corpus 2.1.4

300

Polyseme-Word-Sense-Similarity-Dataset-v1

This is the first version of a Polyseme Word Sense Similarity Dataset collected by Janosch Haber and Massimo Poesio for the DALI Project at Queen Mary University of London.

Apache-2.0 | 200

UniversalAnaphora

An initiative to collect and distribute resources for co-reference resolution in a unified standard.

Language: Python | 2300

devU-api

Auto-grading 4.0 API

Language: TypeScript | 1300

Mask-Language-Model

PyTorch; mask language model; BERT

Language: Python | Apache-2.0 | 6800

rstWeb

Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory

Language: Python | MIT | 4000

gum

Repository for the Georgetown University Multilayer Corpus (GUM)

Language: Python | NOASSERTION | 8600

mucoco

Official Code for the papers: "Controlled Text Generation as Continuous Optimization with Multiple Constraints" and "Gradient-based Constrained Sampling from LMs"

Language: Python | MIT | 5800

github-typo-corpus

GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors

Language: Python | 47700

pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).

Language: Python | GPL-3.0 | 47700