brancengregory / oscn

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

README

Rails API only backend for scraping and creating a Postgres database of OSCN data. Leverages the oscn_scraper gem to scrape data and return as json.

Getting started

TODO - Create sample .env file, Instructions for booting up

Configurations

You can configure the following ENV variables:

COUNTIES

Comma separated string of the county names. The application uses the several of the DISTRICT COURT REPORTS to determine what cases to update and what new cases have been filed. Therefore, only the following counties are available:

Adair,Canadian,Cleveland,Comanche,Ellis,Garfield,Logan,Oklahoma,Payne,Pushmataha,Roger Mills,Rogers,Tulsa

Example

COUNTIES=Tulsa,Oklahoma

CASE_TYPES_ABBREVIATION

Comma separated string of Case type abbreviations:

Example

CASE_TYPES_ABBREVIATION=CF,CM,TR,TRI,AM,CPC,DTR # default

OSCN_THROTTLE

Number of requests to send to OSCN per minute

OSCN_THROTTLE=120

OSCN_CONCURRENCY=10 # default

Number of threads to run concurrently

OSCN_CONCURRENCY=120 # default

TODO ENVs

MAX_REQUESTS, MEDIUM_PRIORITY, LOW_PRIORITY, DAYS_AGO, DAYS_AHEAD

Scraping Methodology

High Priority Cases - Any case that has appear on the docket in the past 7 days will be scraped nightly

Medium Priority Cases - Any open case (closed_on = nil). Scrapes the oldest first.

Low Priority Cases - Closed cases that likely will not be updated as often.

About


Languages

Language:Ruby 99.9%Language:HTML 0.1%Language:Procfile 0.0%