Update grafana tikv dashboard doc#3042
Conversation
| This section shows images of the service panels on the TiKV dashboard. | ||
|
|
||
| ### Cluster | ||
| # The metrics description of TiKV |
There was a problem hiding this comment.
Please also change Line 2 and Line 3 accordingly.
| # The metrics description of TiKV | |
| # Description of TiKV Monitoring Metrics |
|
|
||
| If you use TiUP to deploy the TiDB cluster, the monitoring system (Prometheus/Grafana) is deployed at the same time. For more information, see [Overview of the Monitoring Framework](/tidb-monitoring-framework.md). | ||
|
|
||
| The Grafana dashboard is divided into a series of sub dashboards which include Overview, PD, TiDB, TiKV, Node\_exporter, and so on. A lot of metrics are there to help you diagnose. |
There was a problem hiding this comment.
| The Grafana dashboard is divided into a series of sub dashboards which include Overview, PD, TiDB, TiKV, Node\_exporter, and so on. A lot of metrics are there to help you diagnose. | |
| The Grafana dashboard is divided into a series of sub dashboards which include Overview, PD, TiDB, TiKV, Node_exporter, and so on. A lot of metrics are there to help you diagnose. |
|
|
||
| The Grafana dashboard is divided into a series of sub dashboards which include Overview, PD, TiDB, TiKV, Node\_exporter, and so on. A lot of metrics are there to help you diagnose. | ||
|
|
||
| You can get an overview of the component TiKV status from the TiKV dashboard, where the key metrics are displayed. According to the [Performance Map](https://asktug.com/_/tidb-performance-map/#/), you can check whether the status of the cluster is as expected. |
There was a problem hiding this comment.
| You can get an overview of the component TiKV status from the TiKV dashboard, where the key metrics are displayed. According to the [Performance Map](https://asktug.com/_/tidb-performance-map/#/), you can check whether the status of the cluster is as expected. | |
| You can get an overview of the component TiKV status from the **TiKV-Details** dashboard, where the key metrics are displayed. According to the [Performance Map](https://asktug.com/_/tidb-performance-map/#/), you can check whether the status of the cluster is as expected. |
|
|
||
| You can get an overview of the component TiKV status from the TiKV dashboard, where the key metrics are displayed. According to the [Performance Map](https://asktug.com/_/tidb-performance-map/#/), you can check whether the status of the cluster is as expected. | ||
|
|
||
| This document provides a detailed description of these key metrics. |
There was a problem hiding this comment.
| This document provides a detailed description of these key metrics. | |
| This document provides a detailed description of these key metrics on the **TiKV-Details** dashboard. |
| - IO utilization:The I/O utilization per TiKV instance | ||
| - MBps:The total bytes of read and write in each TiKV instance | ||
| - QPS:The QPS per command in each TiKV instance | ||
| - Errps:The total number of gRPC message failures |
There was a problem hiding this comment.
| - Errps:The total number of gRPC message failures | |
| - Errps:The rate of gRPC message failures |
| - Channel full:The number of Channel Full errors per TiKV instance. It should be `0` in normal case. | ||
| - Active written leaders:The number of leaders being written on each TiKV instance | ||
| - Approximate Region size:The approximate Region size | ||
| - Approximate Region size Histogram:The histogram of approximate Region size |
There was a problem hiding this comment.
| - Approximate Region size Histogram:The histogram of approximate Region size | |
| - Approximate Region size Histogram:The histogram of each approximate Region size |
| - Flush messages per server:The number of Raft messages flushed by each TiKV instance | ||
| - Receive messages per server:The number of Raft messages received by each TiKV instance | ||
| - Messages:The number of Raft messages sent per type | ||
| - Vote:The number of Vote messages sent in Raft |
There was a problem hiding this comment.
| - Vote:The number of Vote messages sent in Raft | |
| - Vote:The number of Vote messages sent in Raft per second |
| - Receive messages per server:The number of Raft messages received by each TiKV instance | ||
| - Messages:The number of Raft messages sent per type | ||
| - Vote:The number of Vote messages sent in Raft | ||
| - Raft dropped messages:The number of dropped Raft messages per type |
There was a problem hiding this comment.
The Chinese version says ops, which means operations per second. So please check and align.
| - Raft dropped messages:The number of dropped Raft messages per type | |
| - Raft dropped messages:The number of dropped Raft messages per type per second |
| - Raft apply proposals per ready:The number of Raft proposals of all Regions per ready handled bucket | ||
| - Raft read/write proposals:The number of proposals per type | ||
| - Raft read proposals per server:The number of read proposals made by each TiKV instance | ||
| - Raft write proposals per server:The number of write proposals made by each TiKV instance | ||
| - Propose wait duration:The wait time of each proposal | ||
| - Propose wait duration per server:The wait time of each proposal per TiKV instance | ||
| - Apply wait duration:The apply time of each proposal | ||
| - Apply wait duration per server:The apply time of each proposal per TiKV instance | ||
| - Raft log speed:The rate at which peers propose logs |
There was a problem hiding this comment.
@Win-Man This part is not consistent with the Chinese version. Please check and align.
| ## 面板常见参数的解释 | ||
|
|
||
| ### gRPC 消息类型 | ||
|
|
||
| 1. 使用事务型接口的命令: |
There was a problem hiding this comment.
Please translate the Chinese text.
| - kv_gc:The command of GC | ||
| - kv_delete_range:The command of deleting a continuous piece of data from TiKV | ||
|
|
||
| 2. 非事务型的裸命令: |
|
@TomShawn Thanks for your review, I've fixed what you commented. |
|
@TomShawn PTAL |
|
@TomShawn, @hunterlxt, PTAL. |
|
@Win-Man, please update your pull request. |
| 1. Transactional API: | ||
|
|
||
| - kv_get:The command of getting the latest version of data specified by ts | ||
| - kv_scan:The command of scanning a continuous piece of data |
| - kv_scan_lock:The command of scanning all locks with a version number before `max_version` to clean up expired transactions | ||
| - kv_resolve_lock:The command of committing or rollback the transaction lock, according to the transaction status. | ||
| - kv_gc:The command of GC | ||
| - kv_delete_range:The command of deleting a continuous piece of data from TiKV |
|
|
||
| - raw_get:The command of getting the value of key | ||
| - raw_batch_get:The command of getting the value of batch keys | ||
| - raw_scan:The command of scanning a continuous piece of data |
| - raw_batch_put:The command of writing a batch of key/value pairs | ||
| - raw_delete:The command of deleting a key/value pair | ||
| - raw_batch_delete:The command of a batch of key/value pairs | ||
| - raw_delete_range:The command of deleting a continuous interval |
There was a problem hiding this comment.
ditto, use data rather than inerval
|
@Win-Man, please update your pull request. |
|
@TomShawn, @hunterlxt, PTAL. |
…into tikv-monitor-metrics
|
@TomShawn, @hunterlxt, PTAL. |
|
@Win-Man, please update your pull request. |
|
@Win-Man, please update your pull request. |
Co-authored-by: TomShawn <41534398+TomShawn@users.noreply.github.com>
Co-authored-by: TomShawn <41534398+TomShawn@users.noreply.github.com>
|
@TomShawn, @hunterlxt, PTAL. |
|
@Win-Man, please update your pull request. |
|
@TomShawn, @hunterlxt, PTAL. |
|
@TomShawn, @hunterlxt, PTAL. |
|
@Win-Man, please update your pull request. |
|
@Win-Man, please update your pull request. |
|
@hunterlxt PTAL |
|
@hunterlxt,Thanks for your review. However, LGTM is restricted to Reviewers or higher roles.See the corresponding SIG page for more information. Related SIGs: docs(slack). |
Signed-off-by: ti-srebot <ti-srebot@pingcap.com>
|
cherry pick to release-4.0 in PR #3333 |
Signed-off-by: ti-srebot <ti-srebot@pingcap.com> Co-authored-by: Win-Man <825895587@qq.com>
What is changed, added or deleted? (Required)
Which TiDB version(s) do your changes apply to? (Required)
What is the related PR or file link(s)?