lesleyjanenorton / oscn

Python utilities for scraping OSCN cases.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

OSCN Query utilities

This is a collection of python utilities to scrape case information from the Oklahoma Supreme Court Network.

Contents

oscn > This is python package to provide an api for retrieving and parsing case records.

examples > Source HTML files for testing

scripts > Python scripts showing use of the oscn package

  • example.py: demostrates use of the request Case and Caselist
  • retrieve-counts.py: saves a list of all counts for a list of counties and years
  • find-counts.py: saves a list of counts passing a test for a list of counties and years
  • soup_test.py: a stub for testing parsing attempts using BeautifulSoup
  • parse_test.py: a stub for developing using saved examples

OSCN package

oscn.request

  • Case: Returns a single case.
  • CaseList: Returns an iterator for retrieving cases for a county and year.

oscn.parse

Parsers accept the html of an OSCN case page and return python objects.

  • counts: returns of list of counts found in a case
  • judge: returns a string of the judge's name
  • parties: returns a list of objects that look like this: {'name': 'Some Name', 'type': 'Description'}
  • docket: returns a list of rows in a docket

Install

  1. Create and activate a Python 3.6 virtual env
  2. git clone git@github.com:codefortulsa/oscn.git
  3. cd oscn
  4. pip install -e .

Usage

import oscn

Request a single case:

oscn.request.Case(county='tulsa', year='2018', number=84)

or request a list of case to iterate:

oscn.request.CaseList(county='adair', year='2016')

Run test scripts

  • python scripts/{file.py}

About

Python utilities for scraping OSCN cases.


Languages

Language:Python 100.0%