Skip to content

node_cpu_seconds_total values are not monotonically increasing #1686

@venkatbvc

Description

@venkatbvc

Host operating system: output of uname -a

Linux ddebvnf-oame-1 3.10.0-1062.7.1.el7.x86_64 #1 SMP Wed Nov 13 08:44:42 EST 2019 x86_64 x86_64 x86_64 GNU/Linux

node_exporter version: output of node_exporter --version

node_exporter, version 0.17.0 (branch: HEAD, revision: f6f6194)
build user: root@322511e06ced
build date: 20181130-15:51:33
go version: go1.11.2

node_exporter command line flags

node_exporter --collector.systemd
--collector.systemd.unit-whitelist=^(grafana|prometheus|node_exporter|rabbitmq-server|asprom|gmond|gmetad|mariadb.|ntpd|httpd|jaeger|metrics|gen3gppxml|alertmanager|etcd|alarmagtd|keepalived|zabbix.).service$
--collector.textfile.directory=/opt/node_exporter/metrics

Are you running node_exporter in Docker?

NO

What did you do that produced an error?

nothing. Node exporter is running and prometheus is scrapping the metrics. scrape interval is 5s.
when a graph is plotted for node_cpu_seconds_total, we saw a huge spike. Following is the query used: rate(node_cpu_seconds_total{cpu="6",instance="osc1deacsdme1-oame-0",job="System",mode="iowait"}[2m])

What did you expect to see?

There should not be any huge spikes. and we saw a dip in node_cpu_seconds_total values.

What did you see instead?

There is a huge spike on 9th of March at 00:26:30 . as there is dip in node_cpu_seconds_total values.

following is the data in prometheus:
curl -g 'http://localhost:9090/api/v1/query?query=node_cpu_seconds_total{cpu="6",instance="osc1deacsdme1-oame-0",job="System",mode="iowait"}[2m]&time=1583693790'
{"status":"success","data":{"resultType":"matrix","result":[{"metric":{"name":"node_cpu_seconds_total","cpu":"6","instance":"osc1deacsdme1-oame-0","job":"System","mode":"iowait"},
"values":[[1583693670.227,"62176.51"],[1583693675.227,"62176.77"],[1583693680.227,"62176.98"],[1583693685.227,"62176.99"],[1583693690.227,"62176.99"],[1583693695.227,"62177.03"],
[1583693700.227,"62177.08"],[1583693705.228,"62177.08"],[1583693710.227,"62177.09"],[1583693715.227,"62177.09"],[1583693720.227,"62177.09"],[1583693725.227,"62177.09"],
[1583693730.227,"62177.09"],[1583693735.227,"62177.09"],[1583693740.227,"62177.09"],[1583693745.227,"62177.09"],[1583693750.227,"62177.09"],[1583693755.227,"62177.09"],
[1583693760.227,"62177.09"],[1583693765.227,"62177.09"],[1583693770.227,"62177.09"],[1583693775.227,"62177.24"],[1583693780.227,"62177.2"],[1583693785.227,"62177.2"]]}]}}

would like to know why there is a dip in counter value.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions