Add solana-watchtower program

This commit is contained in:
Michael Vines
2019-12-11 17:05:10 -07:00
parent dd54fff978
commit 2db28cae41
6 changed files with 171 additions and 0 deletions

16
watchtower/README.md Normal file
View File

@ -0,0 +1,16 @@
The `solana-watchtower` program is used to monitor the health of a cluster. It
periodically polls the cluster over an RPC API to confirm that the transaction
count is advancing, new blockhashes are available, and no validators are
delinquent. Results are reported as InfluxDB metrics.
### Metrics
#### `watchtower-sanity`
On every iteration this data point will be emitted indicating the overall result
using a boolean `ok` field.
#### `watchtower-sanity-failure`
On failure this data point contains details about the specific test that failed via
the following fields:
* `test`: name of the sanity test that failed
* `err`: exact sanity failure message