The GeoSpock database
The GeoSpock database is a cloud-based solution that processes big data and makes it available for analysis, enabling you to find patterns and trends in your data. It comprises:
- a command line interface (CLI) to enable you to manage your data, and the user accounts
- components that ingest and index your source input data
- an SQL interface that gives you access to your data for further analysis
- an architecture that enables you to integrate your existing analysis tools
Being installed in the cloud, you can add GeoSpock database components and resources as you need them to fit your analysis needs. See The GeoSpock database architecture for further details about its components.
Process
The first step to using the GeoSpock database is to load your data into it. You ingest each source of data using a schema that describes the source data field types (Creating a data source description for a dataset). As new data becomes available, you can add it to the dataset (Incremental ingests).
Ingesting your data into the GeoSpock database (Ingesting source input data) creates a set of indexes and stored data optimized for big data searches and queries that you can use to explore and analyze your data using the GeoSpock database tools and APIs. This is data is then available as a dataset for you to analyze and query (Using datasets).
The data management process is driven programmatically through a command line tool, the GeoSpock CLI, that enables you ingest your data and manage your datasets (The GeoSpock CLI).
Query
Once you've ingested your data, you can run both temporal and geospatial functions on your data GeoSpock database (Querying your data). A number of SQL functions have been optimized for use with the GeoSpock database. Refer to GeoSpock database optimized SQL functions for a list of these functions. The are a number of ways that you can access the GeoSpock database by:
- using the Presto CLI to run queries on your ingested data; see Running GeoSpock database queries from the Presto CLI
- running queries from a Business Intelligence (BI) tool, such as Tableau, Quicksight or Power BI; see Using Business Intelligence (BI) tools with the GeoSpock database
Integrate
You can integrate your existing Business Intelligence (BI) tools and visualization tools into the GeoSpock database enabling you to use the database in your existing workflows for data analysis (Integrating your third party tools).