Data validation

Data validation#

SigTech has a curated verification system that allows our engineers to ensure data quality.

Our data cleansing process is broken down into two steps:

Validation: an iterative process where we select checks to apply to datasets and resolve any errors that arise.
Cleaning: corrections and deletions are loaded in. The data is then stored to a separate storage facility.

Errors that surface following validation:

Accepted: Accepted with individual errors.
Amended/corrected: On the basis of an error, the data has been corrected. Example: An anomaly.
Deleted: Data that shouldn’t be included. Example: an accidental entry.

Tip: a fully validated timeseries or instrument is marked with a .

Note: default is clean data, but you have the option to select raw data.