MuckRock's repositories
API-examples
A collection of scripts using the MuckRock API.
documentcloud
DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org
documentcloud-frontend
DocumentCloud's front end source code - Please report bugs, issues and feature requests to info@documentcloud.org
documentcloud-addon-workflows
Reusable workflows for DocumentCloud add-ons
documentcloud-custom-metadata-scraper-addon
The custom metadata output addon for documentcloud
documentcloud-whisper-addon
DocumentCloud Add-On that uses OpenAI's Whisper library to transcribe vidoes and upload the transcription to DocumentCloud
compress-pdf-add-on
Given a public Google Drive or Dropbox link to a file or set of files, it will download the file(s), attempt to compress each file, and upload the document(s) to DocumentCloud if the resulting compressed file <500MB
doctr-ocr-add-on
DocumentCloud Add-On that uses the docTR OCR system
document-rotator-addon
DocumentCloud Add-On that allows you to detect pages that need to be rotated in a document and auto-rotate them automatically.
documentcloud-gpt3-playpen-addon
Uses GPT-3.5 Turbo to help analyze, categorize and structure document on DocumentCloud
documentcloud-legal-citation-identification-addon
Pulls legal citations from a document using eyecite
documentcloud-rss-fetcher-addon
Given an RSS feed where each entry's <link> element points to a document, upload those documents to DocumentCloud.
Extract-Tag-AddOn
Given a document or set of documents on DocumentCloud.org, extracts the text of the document between the start and end parameters and creates a tag for that text on DocumentCloud
google-translate-addon
DocumentCloud Add-On that uses the Google Translate API to translate documents page by page.
gpt4-vision-addon
DocumentCloud Add-On that uses GPT-4 Vision to pull tabular data from documents in CSV or JSON format
Internet-Archive-Export-Add-On
This add-on allows you to back up DocumentCloud documents that you select or query to be archived to the Internet Archive.
OCR-Tagger
DocumentCloud Add-On that tags documents with the OCR engine used on them, if any
pdf-splitter-add-on
DocumentCloud Add-On that splits a DocumentCloud document on a designated page and creates two new documents
Reflow-Add-On
A DocumentCloud Add-On that uses K2pdfopt to optimize documents for mobile eReaders and smartphones
schedulable-gpt-35
DocumentCloud Add-On to run GPT 3.5 Turbo in batches on your documents on a scheduled basis
Site-Snapshot
DocumentCloud Add-On that uses pdfkit to take a snapshot of a site and upload the PDF to DocumentCloud