Sefaria / Sefaria-Data

Source files, scripts and data imported to Sefaria.

Home Page:http://www.sefaria.org

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Sefaria-Data

This repo contains source data, parsing scripts, data files and logs for data projects going into Sefaria. This is the messy input that is processed to become the Sefaria Library.

If you're looking to download Sefaria's texts or links, please see Sefaria-Export. Exported data has a uniform structure.

For Sefaria source code see Sefaria-Project.

Contents

  • book structures - scripts to create schemas for new books
  • sources/ - original digital files that were manipulated to produce our data, along with scripts used in parsing.
  • Match Logs - logs from commentary/text matching scripts
  • misc/ - misc small data files about texts

About

Source files, scripts and data imported to Sefaria.

http://www.sefaria.org


Languages

Language:HTML 95.6%Language:Rich Text Format 2.6%Language:Roff 1.5%Language:Python 0.3%Language:Jupyter Notebook 0.0%Language:CSS 0.0%Language:Pkl 0.0%Language:C# 0.0%Language:JavaScript 0.0%Language:Perl 0.0%Language:C 0.0%Language:Shell 0.0%Language:Dockerfile 0.0%