Data Science System

Distributed/Big Data Geospatial Processing Tools

Work in progress. I will write about each approach in more detail later.

This is a summary of tools for connecting to Hadoop and running geospatial processing on a large dataset. I am working with a ~100 GB Hive table, which is only a small subset of the original dataset.

  4. Kinetica GPU Database – Graph solver and Match solver
  5. PySpark Python libraries
  6. SpatialHadoop
  7. Alteryx – Using the Connect In-DB tool to connect to Hadoop
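
As a rough illustration of the PySpark route, here is a minimal sketch of one common geospatial step: computing the great-circle (haversine) distance from every row of a Hive table to a reference point. This is not taken from my actual pipeline. The helper `add_distance_column`, the column names `lat` and `lon`, and the output column `dist_km` are all hypothetical placeholders, and the sketch assumes a `SparkSession` created with Hive support. The distance function itself is plain Python, so it can be checked without a cluster.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def add_distance_column(spark, table_name, ref_lat, ref_lon):
    """Hypothetical helper: add a 'dist_km' column to a Hive table.

    Assumes the table has numeric 'lat' and 'lon' columns (placeholder names).
    PySpark is imported here so the module loads without it installed.
    """
    from pyspark.sql import functions as F
    from pyspark.sql.types import DoubleType

    def _dist(lat, lon):
        # Rows with missing coordinates get a null distance.
        if lat is None or lon is None:
            return None
        return haversine_km(lat, lon, ref_lat, ref_lon)

    dist_udf = F.udf(_dist, DoubleType())
    df = spark.table(table_name)
    return df.withColumn("dist_km", dist_udf(F.col("lat"), F.col("lon")))
```

A plain Python UDF like this is easy to write but is usually the slowest option at the 100 GB scale, because every row is serialized between the JVM and Python. It is meant as a starting point, not a tuned solution.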
