The IMDB database is a great way to learn about graph databases, but loading it can be tricky because of its size. This project lets you download the IMDB source files and load them into a Cosmos graph database in a few easy steps.
- Download the source code (NOTE: Please review licensing to use the data here: http://www.imdb.com/interfaces)
- Create some Azure Resources
  - Resource Group
  - Storage account
  - Azure Batch account
  - Graph Database
  - NOTE: Place all resources in the same Azure Region if possible
- Fill in the configuration values in the app.config file
  - Mandatory config values
    - StorageAccountConnectionString
    - GraphEndpoint
    - GraphKey
    - BatchAccountName
    - BatchAccountKey
    - BatchAccountUrl
    - GraphDatabase
  - Optional config values
    - All the other config values are optional
- Run IT!
  - Watch the app download the source files from IMDB, parse them, and then kick off an Azure Batch job to load them into Cosmos in parallel.
- Clean up the Azure Batch and Storage accounts
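
For the configuration step, the mandatory values might look roughly like the sketch below. This assumes the project reads them from the standard `appSettings` section of app.config, which is not confirmed here, so check the file that ships with the source code for the actual layout. The key names come from the list above; every value is a placeholder to replace with your own.

```xml
<?xml version="1.0" encoding="utf-8"?>
<configuration>
  <!-- Assumption: the keys live under appSettings; verify against the shipped app.config -->
  <appSettings>
    <!-- Storage account -->
    <add key="StorageAccountConnectionString" value="YOUR_STORAGE_CONNECTION_STRING" />

    <!-- Cosmos graph database -->
    <add key="GraphEndpoint" value="YOUR_GRAPH_ENDPOINT" />
    <add key="GraphKey" value="YOUR_GRAPH_KEY" />
    <add key="GraphDatabase" value="YOUR_GRAPH_DATABASE_NAME" />

    <!-- Azure Batch account -->
    <add key="BatchAccountName" value="YOUR_BATCH_ACCOUNT_NAME" />
    <add key="BatchAccountKey" value="YOUR_BATCH_ACCOUNT_KEY" />
    <add key="BatchAccountUrl" value="YOUR_BATCH_ACCOUNT_URL" />
  </appSettings>
</configuration>
```

The keys and connection string are secrets, so avoid committing a filled-in app.config to source control.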