Source input data formats

The GeoSpock ingestor supports the following data formats:

All field values must be string, numeric or boolean.

If your source input data is in a different format, you must convert it to one of the supported ingest formats before ingesting it. The ingestor ignores incorrectly formatted source input data.

Example content

Say, for example, you have a row of data containing a device ID, a latitude, a longitude, calories and a Unix timestamp:

Value            Example value      Type
Device ID        2aadb-99d-97943    string
Latitude         42.32365           numeric
Longitude        44.538375          numeric
Calories         12.5               numeric
Unix timestamp   1041037198         numeric

Each of the supported data formats has an example that shows you how to format this row of data.
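Before formatting a row, it can help to confirm that every field value is a string, a number or a boolean. The following is a minimal Python sketch of such a check, not part of the GeoSpock tooling. The field names in `example_row` and the helper names `is_supported_value` and `unsupported_fields` are assumptions made for illustration; only the values come from the example row above.

```python
from numbers import Real

# The example row from this page as a plain Python dict.
# The field names are hypothetical; only the values come from the example.
example_row = {
    "device_id": "2aadb-99d-97943",
    "latitude": 42.32365,
    "longitude": 44.538375,
    "calories": 12.5,
    "timestamp": 1041037198,
}


def is_supported_value(value):
    """Return True if value is a string, a number or a boolean."""
    # bool is a subclass of int in Python, so Real already covers it;
    # it is listed explicitly for readability.
    return isinstance(value, (str, bool, Real))


def unsupported_fields(row):
    """Return the names of fields whose values are not string, numeric or boolean."""
    return [name for name, value in row.items() if not is_supported_value(value)]
```

An empty list from `unsupported_fields(example_row)` means that every value in the row has one of the three permitted types. Nested values such as lists, or missing values such as `None`, are reported by field name.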

File compression

The files may be uncompressed, or compressed with:

  • bzip2 (with the .bz2 suffix)
  • lzo (with the .lzo suffix)
  • gzip (with the .gz suffix)
  • Snappy (with the .snappy suffix)

The ingestor does not support split archives, so make sure that each data file is small enough to be compressed as a single archive; for further guidance, see the documentation about file size.

For compressed data files, each file name must end with the file extension that matches its compression format, so that the ingestor can process the data correctly.
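As an illustration of the suffix rule, the sketch below uses only the Python standard library to compress a file with gzip or bzip2 and append the matching suffix. It also maps a file name back to its compression format. The names `SUFFIX_TO_FORMAT`, `compression_from_name` and `compress_file` are hypothetical. The sketch assumes that suffix matching is case-sensitive, which this page does not state. lzo and Snappy are left out of `compress_file` because the standard library has no codec for them.

```python
import bz2
import gzip
import shutil
from pathlib import Path

# Suffixes listed on this page, mapped to their compression format.
SUFFIX_TO_FORMAT = {
    ".bz2": "bzip2",
    ".lzo": "lzo",
    ".gz": "gzip",
    ".snappy": "Snappy",
}


def compression_from_name(filename):
    """Return the compression format implied by the suffix, or None if there is none."""
    # Assumption: the match is exact and case-sensitive.
    return SUFFIX_TO_FORMAT.get(Path(filename).suffix)


def compress_file(source, fmt="gzip"):
    """Compress source as one archive and return the new path with the matching suffix."""
    openers = {"gzip": (gzip.open, ".gz"), "bzip2": (bz2.open, ".bz2")}
    opener, suffix = openers[fmt]
    source = Path(source)
    # Append the suffix to the full name, e.g. data.csv becomes data.csv.gz.
    target = source.with_name(source.name + suffix)
    with open(source, "rb") as src, opener(target, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return target
```

For example, `compress_file("data.csv", "bzip2")` writes `data.csv.bz2` next to the original file, and `compression_from_name("data.csv")` returns `None` because `.csv` is not a compression suffix.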