Peter Henderson's repositories
DialogDatasets
A repository linking to publicly available dialog datasets. Feel free to send pull requests.
ai-deadlines
:alarm_clock: AI conference deadline countdowns
court-scraper
Scrapers for U.S. county court sites.
FraudDetection
Accounting Fraud Detection Using Machine Learning
TARProtocols
Dataset of Discovery Validation Protocols
arxiv-sanity-lite
arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors based on paper abstracts.
auto-stop-tar
When to Stop Reviewing in Technology-Assisted Reviews
bertalign
Multilingual sentence alignment using sentence embeddings
gpt_paper_assistant
GPT4 based personalized ArXiv paper assistant bot
legalbench
An open science effort to benchmark legal reasoning in foundation models
lnraw
Lab Notebook - Source Version
Parsivar
A Language Processing Toolkit for Persian
pyflann
python bindings for FLANN - Fast Library for Approximate Nearest Neighbors.
pyJoules
A Python library to capture the energy consumption of code snippets
pylcs
super fast cpp implementation of longest common subsequence/substring
scholarly
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way
sec-edgar-financials
Extract financial data from the SEC's EDGAR database
SQuAD-explorer
Visually Explore the Stanford Question Answering Dataset
text_characterization_toolkit
A library for computing diverse text characteristics and using them to analyze data sets and models with ease.
ThunderAI
ThunderAI is a Thunderbird Addon that uses the capabilities of ChatGPT or Ollama to enhance email management.
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
yahooquery
Python wrapper for an unofficial Yahoo Finance API