Querying your data
The GeoSpock database enables you to run SQL queries on your ingested data, and has optimized a subset of SQL functions that are used frequently in the analysis of geo-spatial data. These optimized functions use the indexes generated when your source input data was ingested.
For the best performance and to make sure that you are taking advantage of the benefits of the indexed data, you should follow the guidelines for creating optimized queries. Refer to Using the GeoSpock DB Presto connector for more information.
Auto-scaling SQL cluster behavior
As you run queries, the Geospock database automatically adds SQL clusters. You should be aware that the Geospock database will not use the new SQL clusters for queries that are already in the queue at time the scaling up happens. The Geospock database scales down the SQL clusters once it has completed all the queries in the queue.
Optimized functions include:
Custom GeoSpock functions include:
The are a number of ways that you can access the GeoSpock database by:
- running queries from a Business Intelligence (BI) tool, such as Tableau, Quicksight or Power BI; see Using Business Intelligence (BI) tools with the GeoSpock database
- using the Presto CLI to run queries on your ingested data; see Running GeoSpock database queries from the Presto CLI
In your data analysis, to find events that occur within a specific time range or at a certain time, use the GeoSpock database's optimized time queries by specifying a:
- single time window
- time interval
As part of your data analysis, you may want to focus on a specific region or postcode area to answer specific questions about your dataset. The GeoSpock database provides the following optimized geometry functions:
- Simple bounding box
- Points within/intersects with a geometry shape
- Line locate point
- Recorded geometries intersect with a query geometry shape