Peter Henderson's repositories

PileOfLaw

A dataset for pretraining language models targeted for legal tasks.

Language:Jupyter NotebookStargazers:108Issues:3Issues:3

DialogDatasets

A repository linking to publicly available dialog datasets. Feel free to send pull requests.

Language:HTMLStargazers:66Issues:0Issues:0
Language:PythonLicense:MITStargazers:12Issues:1Issues:0

ai-deadlines

:alarm_clock: AI conference deadline countdowns

Language:HTMLLicense:MITStargazers:2Issues:1Issues:0

court-scraper

Scrapers for U.S. county court sites.

License:ISCStargazers:1Issues:0Issues:0

FraudDetection

Accounting Fraud Detection Using Machine Learning

Stargazers:1Issues:0Issues:0
Language:CSSStargazers:1Issues:0Issues:0

TARProtocols

Dataset of Discovery Validation Protocols

Language:HTMLLicense:MITStargazers:1Issues:2Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

arxiv-sanity-lite

arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors based on paper abstracts.

License:MITStargazers:0Issues:0Issues:0

auto-stop-tar

When to Stop Reviewing in Technology-Assisted Reviews

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

bertalign

Multilingual sentence alignment using sentence embeddings

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

legalbench

An open science effort to benchmark legal reasoning in foundation models

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

lnraw

Lab Notebook - Source Version

Language:TeXStargazers:0Issues:1Issues:0

orion

Asynchronous Distributed Hyperparameter Optimization.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Parsivar

A Language Processing Toolkit for Persian

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

proselint

A linter for prose.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

pyflann

python bindings for FLANN - Fast Library for Approximate Nearest Neighbors.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

pyJoules

A Python library to capture the energy consumption of code snippets

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pylcs

super fast cpp implementation of longest common subsequence/substring

Language:C++Stargazers:0Issues:1Issues:0

scholarly

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way

Language:PythonLicense:UnlicenseStargazers:0Issues:1Issues:0

sec-edgar-financials

Extract financial data from the SEC's EDGAR database

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SQuAD-explorer

Visually Explore the Stanford Question Answering Dataset

License:MITStargazers:0Issues:0Issues:0

text_characterization_toolkit

A library for computing diverse text characteristics and using them to analyze data sets and models with ease.

License:MITStargazers:0Issues:0Issues:0

transformers

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

yahooquery

Python wrapper for an unofficial Yahoo Finance API

Language:PythonLicense:MITStargazers:0Issues:1Issues:0