It is a repository that contains several scripts to fetch japanese celebrities' name, avatar and introduction from Internet and to generate a Anki deck finally.
You can go to the release page to get the japanese_celebrity.apkg
directly without running scripts in this repository.
git clone https://github.com/masakichi/japanese_celebrity.git
cd japanese_celebrity
pipenv install
pipenv shell
Run scrapy crawl pasonica -o celebrity.json
, after it finished, you will get a JSON file called celebrity.json
Run python clean_data.py
, after it finished, you will get a JSON file called celebrity_clean.json
(will be used for downloading images), and a CSV called celebrity.csv
(will be imported to Anki)
-
build go file
download_images.go
by runninggo build download_images.go
, binnary filedownload_images
will be generated. -
make
images
folder by runningmkdir images
. -
run
./download_images
then you will get all avatars in images folder.