flesler / satoshi-data

Curated dataset of Satoshi Nakamoto responding to users

Home Page:https://flesler.github.io/satoshi-data/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Satoshi-data

Curated dataset of Satoshi Nakamoto responding to users

What is this?

This project curates part of the dataset collected by NakamotoInstitute for nakamotoinstitute.org. Particularly the posts.json and emails.json.

The data is cleaned up, merged and re-assembled into Q&As answered by Satoshi Nakamoto. The Q&A data is used by the API powering satoshi-ama to answer user questions.

In addition, the final JSON includes several questions generated by GPT-4, they are all answered by literal quotes from Satoshi Nakamoto.

I aimed to keep the original dataset intact, so all modifications are recorded in inputs/overrides.json and even those don't change the original text, just re-arrange it. It still needs some more cleaning up but I won't be doing that in the short-term.

You can view the dataset as JSON here and visually here: https://flesler.github.io/satoshi-data/

Setup

npm install

Regenerate the JSON file

npm run start

Preview the JSON file locally

npm run serve

More about the dataset

The output is a JSON file, with an array of objects. They are sorted chronologically. Many have a type which is one of:

  1. gpt-4: Those generated with GPT-4
  2. favorite: The ones I personally liked the most
  3. ignore: The ones I personally didn't find useful for my particular needs

About

Curated dataset of Satoshi Nakamoto responding to users

https://flesler.github.io/satoshi-data/


Languages

Language:TypeScript 100.0%