eltonlaw / diehard

Test for randomness in data.

Home Page: http://diehard.readthedocs.io/en/latest/

Diehard

Tests for randomness. Inspired by Robert G. Brown's Dieharder random number test suite and the original Diehard tests.

Read The Docs: https://diehard.readthedocs.io/en/latest/

Introduction

From Wikipedia, the "diehard tests are a battery of statistical tests for measuring the quality of a random number generator. They were developed by George Marsaglia over several years and first published in 1995 on a CD-ROM of random numbers."

As the Wikipedia article shows, the output of each test varies wildly, and I couldn't figure out a good way to aggregate them, so for the most part I've left them as is.
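To give a feel for what a single statistical test for randomness reports, here is a minimal sketch of a monobit frequency check using only the Python standard library. It is not one of the Diehard tests and it is not this package's API; the function name `monobit_p_value` and the sample inputs are hypothetical, chosen only for illustration.

```python
import math
import random


def monobit_p_value(bits):
    """Return the two-sided p-value of a monobit frequency check.

    Hypothetical helper for illustration; not part of the diehard package.
    """
    n = len(bits)
    if n == 0:
        raise ValueError("need at least one bit")
    # Map 0 -> -1 and 1 -> +1, then sum; a fair source should stay near 0.
    s = sum(1 if b else -1 for b in bits)
    # For fair, independent bits, s / sqrt(n) is approximately standard normal.
    s_obs = abs(s) / math.sqrt(n)
    return math.erfc(s_obs / math.sqrt(2))


rng = random.Random(0)
random_bits = [rng.getrandbits(1) for _ in range(10_000)]
biased_bits = [1] * 6_000 + [0] * 4_000  # 60% ones: clearly not fair

print(monobit_p_value(random_bits))
print(monobit_p_value(biased_bits))
```

A very small p-value suggests the bits are unlikely to come from a fair source. Each test in a battery reports its own statistic on its own scale, which is part of why combining results into one score is not straightforward.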

This project was originally motivated by something I heard in an audiobook (maybe Naked Statistics?). My memory is hazy, but it went something along the lines of this: a blogger/statistician had calculated the statistical probabilities of some public data and come to the conclusion that the numbers weren't statistically likely to be organic; someone had fudged them. After a fuss was kicked up and an investigation was started, it was proven that they were right: one of the people in charge of recording or publishing the data had thrown in some random numbers.

Not only is this rather vindicating (statistics got to kick some ass in an easily understood and public way), but it also gave me a renewed sense of hope for the ability of an individual to create change in the world. This package was conceived with the idea that it would provide the tools needed to do something like the above on a wider scale, autonomously.

Or at least that's the end goal.

License: Apache License 2.0


Languages

Python 88.8%, Dockerfile 6.5%, Makefile 4.7%