- GCP Function will fetch the genre index page
- It will then parse the text content for top genres IDs
- For each genre ID, publish that ID to the
genres
GCP topic - This function can be triggered from an automated scheduler such as cron, either through HTTP or event-driven
- GCP Function will recursively fetch each paginated page for each letter of a given genre ID
- It will parse the text content for podcast IDs
- For each podcast ID, publish that ID to the
podcasts
GCP topic!!
- GCP Function will fetch podcast JSON for a given podcast ID
- Will fetch the XML RSS feed given the feed location in the JSON
- For each item in the feed, parse write the metadata to a CloudSQL table
- For each item in the feed, fetch the audio file and write to file Storage