gitWayneZhang / my_Spark

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Jupyer Notebook using ESRI built-in Spark.

This notebook is borrowed from Mansour's github ESRI Spark notebook to demonstrate the spatial binning of AIS data around the port of Miami using Apache Spark. Mansour's notebook creates the spatila bining inside ArcGIS Pro. What I tried is to use notebook outside ArcGIS Pro.

The AIS broadcast data is in a FileGeodatabase that can be download from here.

I download one month of data for a test on my laptop. It is quickly out of memoery as it has over 50 milion points just for a month data. I clipped to one milion points of data closed to the port. The spark SQL runs well. The result shows below those blue dots are the most densely moving ships closed to the port. You can also do other queries for more analysis based on the feature class built in the memory.

Create a new conda environment and activate it to use this notebook, as follows:

  • Start a Python Command Prompt from Start > ArcGIS.

  • Create Conda environment and clone arcgispro environment:

conda create --yes --name spark_esri --clone arcgispro-py3
activate spark_esri
pip install pyarrow

Spatial Binning Notebook

About


Languages

Language:Jupyter Notebook 98.8%Language:Python 1.2%