For example, for a datasource called example if I have:
"dimensionsSpec": {
"dimensions": [
"SYMBOL",
{
"type": "float",
"name": "FIELD1"
},
{
"type": "double",
"name": "FIELD2"
}]
}
Then querying this datasource, where that FIELD1 and FIELD2 column contain the same data, but one is stored using double, I find that:
SELECT SYMBOL,FIELD1 from example WHERE __time = TIMESTAMP '2019-07-26 00:00:00'
SELECT SYMBOL,FIELD2 from example WHERE __time = TIMESTAMP '2019-07-26 00:00:00'
Now changing druid.generic.useDefaultValueForNull=true and restarting druid, when I load the data again into a new datasource example2, I find that that the double query is no longer slow and completes in about the same time as any other dimension query.
For example, for a datasource called example if I have:
"dimensionsSpec": {
"dimensions": [
"SYMBOL",
{
"type": "float",
"name": "FIELD1"
},
{
"type": "double",
"name": "FIELD2"
}]
}
Then querying this datasource, where that FIELD1 and FIELD2 column contain the same data, but one is stored using double, I find that:
SELECT SYMBOL,FIELD1 from example WHERE __time = TIMESTAMP '2019-07-26 00:00:00'
SELECT SYMBOL,FIELD2 from example WHERE __time = TIMESTAMP '2019-07-26 00:00:00'
Now changing druid.generic.useDefaultValueForNull=true and restarting druid, when I load the data again into a new datasource example2, I find that that the double query is no longer slow and completes in about the same time as any other dimension query.