Conversation
* Update CHANGELOG * Update VERSION
|
Wifi collector is Linux only for now, by the way. Stoked! |
|
Also, thoughts on enabling more collectors by default? I can speak for the wifi and mountstats collectors, at least, being useful to have enabled by default. If the machine isn't using WiFi or NFS, neither will report any metrics. |
|
I don't object, as long as they behave well when nothing is enabled on the node. |
|
There is still #216 open which we wanted to get it. I'm flying out tomorrow, so won't have time this week. If you think we should get this out now, we can post-pone it IMO. |
|
Yea, I wanted to see that get in, but there's been no progress on #390 in 2 weeks. I'm not sure it's worth waiting for. |
|
Would it be possible to share with us why ntp is considered out of scope? Is it because it relies on the ntpd software being installed and therefore should get a specific exporter? If yes, will the current code be reused to provide a ntpd_exporter? |
|
@jcberthon yes, that is the reason. Right now nothing is being removed, and we'd like to only do so once alternatives are available. Would you like to take the code and throw together a standalone exporter? |
|
The reason we decided ntp was out of scope is because it functions as a blackbox probe. The collector does a real-time NTP probe against an external server. This could be very high traffic if someone were to point a large number of servers at I personally found this probe to be very useful as an additional check against ntpd or other such time syncing client software running on servers, it did produce a lot of jitter, and a lot of extra packets to our NTP server pools in production. A typical NTP client only sends one probe every ~15 minutes per server, not every 15-30sec like a It's also useful for nodes that are not running NTP clients for whatever reason. There are a few ways we can replace this functionality.
As for monitoring ntpd and other ntp clients, this is something we could easily add as a textfile helper tool. This would export the real metrics provided by NTP client software running on nodes. I have already written a couple, but they're currently not open source. I will attempt to re-implement them and publish them sometime soon. Maybe in Python this time instead of shell. 😄 |
|
THere is already a node_system_clock_milliseconds metric. Not sure how it's named. I think node_time or something. |
|
@discordianfish Seems like |
|
@SuperQ Right but how would you figure out the precise timestamp of the scrape? The best I came up with was just time() - node_time which might be off by the scrape interval anyway. |
|
@brian-brazil was talking about a specific function to compare a sample values with their collection timestamps at FOSDEM. Not implemented yet. |
|
Ah, yes something like that would be great. |
|
@matthiasr "would like" yes, but with 4 very young kids at home in Winter and a full time job, I have very limited time available for that. In addition, I haven't yet installed prometheus (no time) but I was looking into it to see if I could use it to monitor my Raspberry Pi NTP server, hence my initial interest ;-) |
|
If you want to monitor an NTP server, you definitely want the NTP metrics helper script, not the ntp collector plugin. See: #458 |
|
@SuperQ thanks for the hint. However, IMHO, It is better to use I'm also monitoring the frequency and sys_jitter returned value of For monitoring this, I'm using a simple script at the moment which loop around that command: Note: as one can see, I'm using the telegraf line protocol with an influxdb database, and grafana for display. But I want to investigate alternatives, hence my interest in prometheus. |
|
@jcberthon, That's super useful information! I have filed #462 to add additional metrics. |
Changes:
NOTE: We are deprecating several collectors in this release.
gmond- Out of scope.megacli- Requires forking, moved to textfile collection.ntp- Out of scope.