226 Commits

Author SHA1 Message Date
Michael Vines
f0fe089013 Adapt testnet-deploy metric datapoint names to {,bootnode-}fullnode 2018-12-06 08:04:33 -08:00
Michael Vines
a6312ba98f Switch snap to bootstrap-fullnode/fullnode naming 2018-12-05 18:59:43 -08:00
Michael Vines
04a0652614 Generalize net/ from leader/validator to bootstrap-fullnode/fullnode 2018-12-05 17:11:16 -08:00
Michael Vines
5d80edd969 Properly check for failure (can't rely on set -e here) 2018-12-05 13:26:06 -08:00
Michael Vines
33a5d5fe93 Enable debug builds by default for better backtraces 2018-11-17 10:52:08 -08:00
Michael Vines
d96a6b42a5 Move drone into its own crate 2018-11-16 20:42:21 -08:00
carllin
cf95708c18 Set drone address to always be the initial network entry point (#1847)
* Set drone address to always be the initial network entry point, so that even when leaders rotate the client can still find the drone

* Extract drone address as a separate argument to bench-tps

* Add drone port to client.sh instead of setting it in bench-tps

* Add drone entrypoint to scripts

* Fix build error
2018-11-16 19:56:26 -08:00
Sathish
c973de1d76 Decouple log and metrics rate (#1839)
Use separate env for log and metrics rate.

Set default log level to WARN if unset.
2018-11-15 22:27:16 -08:00
Michael Vines
83fc3c10cf Setup CUDA env for local builds 2018-11-15 08:00:52 -08:00
Michael Vines
017c281eaf Remove CUDA support from Snap 2018-11-12 20:31:16 -08:00
Michael Vines
c5b1bc1128 Remove obsolete update-default-cuda.sh 2018-11-12 20:31:16 -08:00
Michael Vines
9e7b9487b0 perf-libs now drives setting CUDA_HOME 2018-11-12 18:49:15 -08:00
Michael Vines
851e012c6c Upgrade EC2 image to 18.04 with CUDA 9.2 and 10 2018-11-12 15:17:34 -08:00
Michael Vines
7f76403d0a Clean ~/solana during network start to avoid tripping over leftover files 2018-11-12 15:09:14 -08:00
Michael Vines
7ee4dec3f1 Upgrade GCE GPU image to 18.04 2018-11-12 12:18:50 -08:00
Michael Vines
c07d09c011 Add net/scp.sh for easier file transfer to/from network nodes 2018-11-12 11:48:53 -08:00
Michael Vines
3466f139a4 set -e shuffling 2018-11-11 16:24:36 -08:00
Michael Vines
def7d156f6 codemod --extensions sh '#!/usr/bin/env bash -e' '#!/usr/bin/env bash\nset -e' 2018-11-11 16:24:36 -08:00
Michael Vines
33aab094ef codemod --extensions sh '#!/bin/bash' '#!/usr/bin/env bash' 2018-11-11 16:24:36 -08:00
Michael Vines
cf6f344ccc Add CUDA_HOME env var to permit overriding the CUDA install location 2018-11-11 16:24:18 -08:00
Michael Vines
49014393e1 Be less fancy for bash 4.4 compat 2018-11-10 18:05:55 -08:00
Michael Vines
818d03c835 Bump earlyoom version 2018-11-10 15:56:17 -08:00
Michael Vines
b8261d7d83 Determine network version for tar and local deploys 2018-11-08 22:02:42 -08:00
Michael Vines
51ed48941b Continue if docker0 is not present 2018-11-07 19:33:20 -08:00
Michael Vines
87ac549689 Work around AWS key management limitation 2018-11-07 18:48:27 -08:00
Michael Vines
f8f11b7f50 Remove docker0 interface if present 2018-11-07 18:23:24 -08:00
Michael Vines
82f914e0dc Work around AWS boot check weirdness 2018-11-07 15:46:04 -08:00
Michael Vines
9359cc69d5 Invert gpu check 2018-11-07 14:44:40 -08:00
Michael Vines
b02b636b36 Support local tarball deploys 2018-11-07 14:44:40 -08:00
Michael Vines
a537154c28 Remove all cuda dependencies from release tarball beyond solana-fullnode-cuda 2018-11-07 14:44:40 -08:00
Michael Vines
16d23292dc Improve error messages 2018-11-07 10:35:10 -08:00
Michael Vines
2ef8ebe111 AWS AMIs are region specific 2018-11-07 10:05:58 -08:00
Michael Vines
f8673931b8 Increase boot timeout 2018-11-07 08:32:15 -08:00
Michael Vines
dd4fb7aa90 Add AWS-based nets 2018-11-07 07:47:39 -08:00
Michael Vines
c4bc331663 Add support for using a release tar 2018-11-07 07:47:39 -08:00
Michael Vines
cd18a1b7db t 2018-11-06 14:08:47 -08:00
Michael Vines
6aac096c77 Add timeout to prevent a stuck ssh 2018-11-06 14:08:28 -08:00
Michael Vines
7b58bd621a Remove node check from client start-up
If the network loses a validator or two, it's the job of the sanity
check to detect this not the bench clients
2018-11-06 13:57:06 -08:00
Michael Vines
1a7830f460 Set imageName if G 2018-11-05 13:33:42 -08:00
Michael Vines
8041461a07 Bump EC2 validator machine type 2018-11-05 08:47:51 -08:00
Michael Vines
eae9372a5d Upgrade GCP CPU-based testnet to 18.04 2018-11-04 19:18:47 -08:00
Michael Vines
f3b04894b9 Try harder to snap download 2018-11-03 00:29:13 +00:00
Pankaj Garg
85869552e0 Update testnet scripts to use release tar ball (#1660)
* Update testnet scripts to use release tar ball

* use curl instead of s3cmd
2018-10-30 18:05:38 -07:00
Pankaj Garg
3cc78d3a41 Added a new remote node configuration script to set rmem/wmem (#1647)
* Added a new remote node configuration script to set rmem/wmem

* Update common.sh for rmem/wmem configuration
2018-10-30 09:17:35 -07:00
Pankaj Garg
fbde9bb731 Run bench-tps for longer duration in testnet (#1638)
- Increased to 2+ hours
2018-10-29 15:03:08 -07:00
Pankaj Garg
7abd456d45 Increase rmem and wmem for remote nodes in testnet (#1635) 2018-10-29 13:04:54 -07:00
Michael Vines
489894cb32 Mention logs more 2018-10-27 08:49:52 -07:00
Pankaj Garg
dfde83bdce Wildcard early OOM deb package revision (#1554) 2018-10-19 14:17:19 -07:00
Pankaj Garg
30c79fd40d Change validator node machine type (#1537)
- The current nodes are using lower RAM compared to leader/clients
2018-10-17 17:16:50 -07:00
Pankaj Garg
32fc0cd7e9 Fix bug introduced during RUST_LOG escaping (#1507)
* Fix bug introduced during RUST_LOG escaping
- remote node configuration should not be quoted

* shellcheck disable SC2090
2018-10-15 16:49:22 -07:00