Source Han Dataset
tldr: This is a NumPy array dataset of the Source Han Sans and Source Han Serif fonts, for use with Python.
Font Version
- Source Han Sans 1.004
- Source Han Serif 1.001
Data Structure
- Each pickle file contains one tuple, and each tuple contains two lists.
- The first list contains Unicode East Asian ideographs (a.k.a. Hanzi, Kanji, Hanja, or Chinese characters, whichever name you prefer), and the second list contains a 2D NumPy array for each of those ideographs.
- All NumPy arrays are already normalized.
- The dataset is available in the following sizes:
- 512 (Korean version only)
- 256
- 128
- 64
- 512
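
The tuple layout described under Data Structure can be read roughly as follows. This is a minimal sketch, not part of the dataset itself: the function names `load_source_han` and `build_lookup` are hypothetical, and it assumes that the two lists are parallel (the i-th array belongs to the i-th character) and that each character is stored as a string.

```python
import pickle

import numpy as np


def load_source_han(path):
    """Load one pickle file and return (characters, images).

    Assumes the file holds a tuple of two lists, as described above.
    """
    # Only unpickle files you trust: pickle can execute arbitrary code.
    with open(path, "rb") as f:
        characters, images = pickle.load(f)
    # Assumption: the lists are parallel, so their lengths must match.
    if len(characters) != len(images):
        raise ValueError("character and image lists differ in length")
    return characters, images


def build_lookup(characters, images):
    """Map each character to its 2D NumPy array (assumes hashable characters)."""
    return {ch: np.asarray(img) for ch, img in zip(characters, images)}
```

With a real file, usage would look like `chars, imgs = load_source_han("path/to/file.pickle")`, where the path is a placeholder for whichever pickle file you downloaded. Building the dictionary once makes it cheap to fetch the array for a given character afterwards.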