Peter Henderson's repositories
DialogDatasets
A repository linking to publicly available dialog datasets. Feel free to send pull requests.
ai-deadlines
⏰ AI conference deadline countdowns
court-scraper
Scrapers for U.S. county court sites.
FraudDetection
Accounting Fraud Detection Using Machine Learning
TARProtocols
Dataset of Discovery Validation Protocols
arxiv-sanity-lite
arxiv-sanity lite: tag arXiv papers of interest and get recommendations of similar papers in a nice UI, using SVMs over TF-IDF feature vectors based on paper abstracts.
auto-stop-tar
When to Stop Reviewing in Technology-Assisted Reviews
bertalign
Multilingual sentence alignment using sentence embeddings
legalbench
An open science effort to benchmark legal reasoning in foundation models
Parsivar
A Language Processing Toolkit for Persian
sec-edgar-financials
Extract financial data from the SEC's EDGAR database
SQuAD-explorer
Visually Explore the Stanford Question Answering Dataset
text_characterization_toolkit
A library for computing diverse text characteristics and using them to analyze data sets and models with ease.
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
yahooquery
Python wrapper for an unofficial Yahoo Finance API