IShengFang / Source-Han-Dataset

This is a numpy array dataset with Source Han Sans and Serif

Source Han Dataset

tldr:This is a numpy array dataset with Source Han Sans and Serif for Python.

Font Version

Source Han Sans 1.004
Source Han Serif 1.001

Data Structure

There is a tuple in each pickle file and two list in each tuple.
The first list contain unicode easten asian ideograph (aka Hanzi, Kanji, Hanja, Chinese characters, whatever you want to say), and the second list contain 2D numpy array of each easten asian ideographs.
All numpy array are aready normalized.
This dataset have below size dataset.
- 512
  - only have Korean version
- 256
- 128
- 64

About

This is a numpy array dataset with Source Han Sans and Serif

Languages

Language:Python 100.0%