Geo data: up-to-date and in multiple formats

Andrell Bower
DataSD
Published in
3 min readNov 30, 2017

Dear readers / followers, we have some exciting news! Our geographic data is now integrated with Poseidon, our data automation system.

Current geo data

It means you now have access to the most current geographic data at all times. Some data sources are updated every month. For others, this process occurs on a daily basis. No matter the schedule of the dataset, our portal will now automatically update along with it. This ensures that as a user of the portal, you will be retrieving the most up-to-date data available.

New formats

Until recently, we were sharing geo data exclusively as shapefiles — one of the most commonly used format for geo data. Nevertheless, there are a few reasons why it should not be the only format of geo data offered on an open data portal:

  • It is not an open standard format.
  • It is actually multiple files. These files need to be zipped, making it inconvenient to share.
  • It is not a practical format for web developers. Few web mapping javascript libraries support shapefiles natively, which usually leads to data conversion.
  • It is not the most practical format for non-technical users. It requires software installation to be visualized / used for data analysis.

For these reasons, we have decided to provide you with additional geo data formats. For now (and we will be adding more) they are: GeoJSON, TopoJSON, and Geobuf.

  • GeoJSON: This is the most common alternative to shapefiles. GeoJSON is an open standard format based on the JSON specification. It is supported by most desktop and web based mapping/GIS solutions. Geo data in this format is also human-readable. This means it can be opened with a text editor, which make it very easy to use. The only drawback with GeoJSON when compared to shapefile is size. This is a consequence of GeoJSON’s lack of topology support.
  • TopoJSON: This format is an extension of GeoJSON that encodes topology. As a result, TopoJSON files that contain line and polygon geometries are significantly smaller than both GeoJSON and shapefiles.
  • Geobuf: Geobuf is a relatively new format introduced by Mapbox. Geobuf provides lossless compression of GeoJSON data into protocol buffers. It is fast to decode and very light, making it attractive for developers who want to use large geo datasets in their web applications. Here is a comparative table to demonstrate the compression power of the Geobuf specification on our data:

So, you now have access to the most current geo data in multiple formats and we are pretty excited about it. What will you do with it? Let us know at data@sandiego.gov and stay tuned for more announcements!

Originally written by Arnaud Vedy and published at https://data.sandiego.gov

--

--