-
Notifications
You must be signed in to change notification settings - Fork 2
Closed
Description
As part of the terrautils implementation we've implemented a process to "clean" or standardize the metadata from the LemnaTec Field Scanalyzer. The "raw" metadata is now stored as a JSON file on ROGER and Clowder, but the Clowder metadata endpoint contains the cleaned metadata, which is now also used downstream by extractors.
Examples are available in the Data Release Trial space for the flirIR and stereoTop sensors.
FLIR:
- UI: https://terraref.ncsa.illinois.edu/clowder/datasets/59b062d94f0ca12ea0c32d04
- API: https://terraref.ncsa.illinois.edu/clowder/api/datasets/59b062d94f0ca12ea0c32d04/metadata.jsonld
StereoTop:
- UI: https://terraref.ncsa.illinois.edu/clowder/datasets/59b062d94f0ca12ea0c32d04
- API: https://terraref.ncsa.illinois.edu/clowder/api/datasets/59b062d94f0ca12ea0c32d04/metadata.jsonld
As noted in the PR, the primary goal was to clean up many of the inconsistencies in the raw metadata (i.e., Time v time v timestamp v Timestamp).
A few things to note:
- We discussed adding the sensor position in place of the gantry "position_m", but this has not been implemented
- Sensor fixed metadata is linked via URL
- I've added a dataset to capture the Gantry fixed metadata (https://terraref.ncsa.illinois.edu/clowder/api/datasets/5873eac64f0cad7d81349b0b/metadata.jsonld), which is linked in the metadata
- site_metadata was intended to get each of the plots associated with the image. As noted in 9/6 meeting, this is currently based on the centroid and should probably be removed or updated to make this clear.
- spatial_metadata was introduced to capture the bounding box and centroid. We've discussed changing to geoJSON and changing the current bounding box from format returned from terrautils.spatial _get_bounding_box_with_formula function
- spatial_metadata is intended to handle cases where there are multiple devices per sensor (e.g. left/right, east/west). For single-device sensors, the sensor name is currently used.
Completion criteria:
- Metadata reviewed
- Changes made for 2017 release
- Issue(s) created for 2018 release
- Changes identified and implemented in extractors as needed
- Extractors tested, tagged/versioned/deployed