solana

Author	SHA1	Message	Date
Tao Zhu	db85d659b9	Cost model 1.7 (#20188 ) * Cost Model to limit transactions which are not parallelizeable (#16694) * * Add following to banking_stage: 1. CostModel as immutable ref shared between threads, to provide estimated cost for transactions. 2. CostTracker which is shared between threads, tracks transaction costs for each block. * replace hard coded program ID with id() calls * Add Account Access Cost as part of TransactionCost. Account Access cost are weighted differently between read and write, signed and non-signed. * Establish instruction_execution_cost_table, add function to update or insert instruction cost, unit tested. It is read-only for now; it allows Replay to insert realtime instruction execution costs to the table. * add test for cost_tracker atomically try_add operation, serves as safety guard for future changes * check cost against local copy of cost_tracker, return transactions that would exceed limit as unprocessed transaction to be buffered; only apply bank processed transactions cost to tracker; * bencher to new banking_stage with max cost limit to allow cost model being hit consistently during bench iterations * replay stage feed back program cost (#17731) * replay stage feeds back realtime per-program execution cost to cost model; * program cost execution table is initialized into empty table, no longer populated with hardcoded numbers; * changed cost unit to microsecond, using value collected from mainnet; * add ExecuteCostTable with fixed capacity for security concern, when its limit is reached, programs with old age AND less occurrence will be pushed out to make room for new programs. 
* investigate system performance test degradation (#17919) * Add stats and counter around cost model ops, mainly: - calculate transaction cost - check transaction can fit in a block - update block cost tracker after transactions are added to block - replay_stage to update/insert execution cost to table * Change mutex on cost_tracker to RwLock * removed cloning cost_tracker for local use, as the metrics show clone is very expensive. * acquire and hold locks for block of TXs, instead of acquire and release per transaction; * remove redundant would_fit check from cost_tracker update execution path * refactor cost checking with less frequent lock acquiring * avoid many Transaction_cost heap allocation when calculate cost, which is in the hot path - executed per transaction. * create hashmap with new_capacity to reduce runtime heap realloc. * code review changes: categorize stats, replace explicit drop calls, concisely initiate to default * address potential deadlock by acquiring locks one at a time * Persist cost table to blockstore (#18123) * Add `ProgramCosts` Column Family to blockstore, implement LedgerColumn; add `delete_cf` to Rocks * Add ProgramCosts to compaction excluding list alongside TransactionStatusIndex in one place: `excludes_from_compaction()` * Write cost table to blockstore after `replay_stage` replayed active banks; add stats to measure persist time * Deletes program from `ProgramCosts` in blockstore when they are removed from cost_table in memory * Only try to persist to blockstore when cost_table is changed. * Restore cost table during validator startup * Offload `cost_model` related operations from replay main thread to dedicated service thread, add channel to send execute_timings between these threads; * Move `cost_update_service` to its own module; replay_stage is now decoupled from cost_model. 
* log warning when channel send fails (#18391) * Aggregate cost_model into cost_tracker (#18374) * * aggregate cost_model into cost_tracker, decouple it from banking_stage to prevent accidental deadlock. * Simplified code, removed unused functions * review fixes * update ledger tool to restore cost table from blockstore (#18489) * update ledger tool to restore cost model from blockstore when compute-slot-cost * Move initialize_cost_table into cost_model, so the function can be tested and shared between validator and ledger-tool * refactor and simplify a test * manually fix merge conflicts * Per-program id timings (#17554) * more manual fixing * solve a merge conflict * featurize cost model * more merge fix * cost model uses compute_unit to replace microsecond as cost unit (#18934) * Reject blocks for costs above the max block cost (#18994) * Update block max cost limit to fix performance regression (#19276) * replace function with const var for better readability (#19285) * Add few more metrics data points (#19624) * periodically report sigverify_stage stats (#19674) * manual merge * cost model nits (#18528) * Accumulate consumed units (#18714) * tx wide compute budget (#18631) * more manual merge * ignore zerorize drop security * - update const cost values with data collected by #19627 - update cost calculation to closely proposed fee schedule #16984 * add transaction cost histogram metrics (#20350) * rebase to 1.7.15 * add tx count and thread id to stats (#20451) each stat reports and resets when slot changes * remove cost_model feature_set * ignore vote transactions from cost model Co-authored-by: sakridge <sakridge@gmail.com> Co-authored-by: Jeff Biseda <jbiseda@gmail.com> Co-authored-by: Jack May <jack@solana.com>	2021-10-06 15:55:29 -06:00
Trent Nelson	a4df784e82	Bump version to 1.8.0	2021-10-06 15:48:23 -06:00
mergify[bot]	414674eba1	Fix dos data-type for non-gossip mode (#20465 ) (#20478 ) (cherry picked from commit `b178f3f2d3`) Co-authored-by: sakridge <sakridge@gmail.com>	2021-10-06 19:00:34 +00:00
Justin Starry	d922971ec6	Optimize stakes cache and rewards at epoch boundaries (backport #20432 ) (#20472 ) * Optimize stakes cache and rewards at epoch boundaries (backport #20432) * fix conflicts	2021-10-06 16:15:27 +00:00
mergify[bot]	95ac00d30a	Make rewards tracer async friendly (backport #20452 ) (#20456 ) * Make rewards tracer async friendly (#20452) (cherry picked from commit `250a8503fe`) # Conflicts: # Cargo.lock # ledger-tool/Cargo.toml # runtime/src/bank.rs * fix conflicts Co-authored-by: Justin Starry <justin@solana.com>	2021-10-06 11:20:50 +00:00
mergify[bot]	1ca4f7d110	Install openssl for travisci windows builds (#20420 ) (#20458 ) (cherry picked from commit `df73d8e8a1`) Co-authored-by: Tyera Eulberg <teulberg@gmail.com>	2021-10-05 22:30:23 -06:00
mergify[bot]	8999f07ed2	Remove nodejs (#20399 ) (#20433 ) (cherry picked from commit `6df0ce5457`) Co-authored-by: sakridge <sakridge@gmail.com>	2021-10-05 08:56:57 +00:00
mergify[bot]	9f4f8fc9e9	Add struct and convenience methods to track stake activation status (backport #20392 ) (#20425 ) * Add struct and convenience methods to track stake activation status (#20392) * Add struct and convenience methods to track stake activation status * fix nits * rename (cherry picked from commit `0ddb34a0b4`) # Conflicts: # runtime/src/stakes.rs * resolve conflicts Co-authored-by: Justin Starry <justin@solana.com>	2021-10-05 04:33:30 +00:00
Michael Vines	00b03897e1	Default --rpc-bind-address to 127.0.0.1 when --private-rpc is provided and --bind-address is not (cherry picked from commit `221343e849`)	2021-10-04 16:58:46 -07:00
mergify[bot]	6181df68cf	Staking docs: link to overview (#20426 ) (cherry picked from commit `2d5b471c09`) Co-authored-by: Ted Robertson <10043369+tredondo@users.noreply.github.com>	2021-10-04 23:22:21 +00:00
mergify[bot]	1588b00f2c	fix syntax error in bash_profile (#20386 ) if there is no newline at the end of the file, this export is glued to the rest of the code and generates a syntax error like this ```bash if [ -f ~/.git-completion.bash ]; then . ~/.git-completion.bash fiexport PATH="/Users/user/.local/share/solana/install/active_release/bin:$PATH" ``` (cherry picked from commit `87c0d8d9e7`) Co-authored-by: OleG <emptystamp@gmail.com>	2021-10-02 04:50:39 +00:00
mergify[bot]	ef306aa7cb	Deploy error is buffer is too small (#20358 ) (#20362 ) * Deploy error is buffer is too small * missing file (cherry picked from commit `de8331eeaf`) # Conflicts: # cli/tests/fixtures/noop.so Co-authored-by: Jack May <jack@solana.com>	2021-10-01 05:25:11 +00:00
mergify[bot]	e718f4b04a	terminology.md: remove CBC block and unneeded filename (#20269 ) (#20349 ) (cherry picked from commit `a7f2d9f55f`) Co-authored-by: Ted Robertson <10043369+tredondo@users.noreply.github.com>	2021-09-30 23:19:12 +00:00
mergify[bot]	51593a882b	Properly enable unprefixed_malloc_on_supported_platforms in tikv-jemallocator (#20351 ) (#20354 ) Trivial typo fix. Fixes: `4bf6d0c4d7` ("adds unprefixed_malloc_on_supported_platforms to jemalloc (#20317)") (cherry picked from commit `8ae88632cb`) Co-authored-by: Ivan Mironov <mironov.ivan@gmail.com>	2021-09-30 20:26:11 +00:00
mergify[bot]	1c15cc6e9a	add unchecked invokes (#20313 ) (#20337 ) (cherry picked from commit `8188c1dd59`) Co-authored-by: Jack May <jack@solana.com>	2021-09-30 17:05:51 +00:00
Tyera Eulberg	734b380cdb	Bump version to v1.7.15 (#20338 )	2021-09-30 10:51:34 -06:00
mergify[bot]	9cc26b3b00	cli: Stop topping up buffer balance (#20181 ) (#20312 ) (cherry picked from commit `53a810dbad`) Co-authored-by: Justin Starry <justin@solana.com>	2021-09-30 12:31:12 -04:00
mergify[bot]	ef5a0e842c	stake-accounts.md: fix grammar, link Solana Explorer (#20270 ) (#20274 ) (cherry picked from commit `f24fff8495`) Co-authored-by: Ted Robertson <10043369+tredondo@users.noreply.github.com>	2021-09-29 22:57:00 -06:00
Tyera Eulberg	5bdb824267	Remove original feature gating (#20334 ) v1.7.14	2021-09-29 22:36:51 -06:00
sakridge	474f2bcdf4	Prune sigverify queue (#20315 )	2021-09-30 05:40:48 +02:00
Tyera Eulberg	2302211963	Remove files (#20332 )	2021-09-30 02:25:24 +00:00
mergify[bot]	8178db52a5	Add transaction mode to dos (#20191 ) (#20329 ) (cherry picked from commit `94a1a57106`) Co-authored-by: sakridge <sakridge@gmail.com>	2021-09-29 23:53:15 +00:00
mergify[bot]	5d8429d953	adds unprefixed_malloc_on_supported_platforms to jemalloc (#20317 ) (#20325 ) Without this feature jemalloc is used only for Rust code but not for bundled C/C++ libraries (like rocksdb). https://github.com/solana-labs/solana/issues/14366#issuecomment-930404992 (cherry picked from commit `4bf6d0c4d7`) Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-29 22:49:47 +00:00
sakridge	fec15f69f4	Increment 1.7 version (#20316 )	2021-09-29 15:37:45 -04:00
sakridge	257ddbeee1	Tpu vote 1.7 (#20187 ) * Add separate vote processing tpu port * Add feature to send to tpu vote port * Add vote rejecting sigverify mode * use packet.meta.is_simple_vote_tx in place of deserialization * consolidate code that identifies vote tx at common path for cpu and gpu * new key for feature set * banking forward tpu vote * add tpu vote port to dockerfile and other review changes * Simplify thread id compare * fix a test; updated cluster_info ABI change Co-authored-by: Tao Zhu <tao@solana.com> v1.7.13	2021-09-29 18:12:58 +02:00
mergify[bot]	47c1730808	uses rayon thread-pool for retransmit-stage parallelization (#19486 ) (#20293 ) (cherry picked from commit `01a7ec8198`) Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-29 14:11:46 +00:00
mergify[bot]	a005a6b816	Restore ability for programs to upgrade themselves (backport #20265 ) (#20295 ) * Restore ability for programs to upgrade themselves (#20265) * Make helper associated fn * Add feature definition * Add handling to preserve program-id write lock when upgradeable loader is present; restore bpf upgrade-self test * Use single feature (cherry picked from commit `2cd9dc99b6`) # Conflicts: # runtime/src/accounts.rs # sdk/program/src/message.rs # sdk/program/src/message/mapped.rs # sdk/program/src/message/sanitized.rs # sdk/src/feature_set.rs * Fix conflicts Co-authored-by: Tyera Eulberg <teulberg@gmail.com> Co-authored-by: Tyera Eulberg <tyera@solana.com>	2021-09-29 01:34:33 +00:00
mergify[bot]	2f2948f998	Extricate RpcCompletedSlotsService from RetransmitStage (backport #18017 ) (#20294 ) * Extricate RpcCompletedSlotsService from RetransmitStage (cherry picked from commit `fa04531c7a`) # Conflicts: # core/src/replay_stage.rs # core/src/retransmit_stage.rs # core/src/tvu.rs # core/src/validator.rs * removes backport merge conflicts Co-authored-by: Michael Vines <mvines@gmail.com> Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-28 18:25:51 +00:00
mergify[bot]	55ccff7604	skips retransmit for shreds with unknown slot leader (backport #19472 ) (#20291 ) * skips retransmit for shreds with unknown slot leader (#19472) Shreds' signatures should be verified before they reach retransmit stage, and if the leader is unknown they should fail signature check. Therefore retransmit-stage can as well expect to know who the slot leader is and otherwise just skip the shred. Blockstore checking signature of recovered shreds before sending them to retransmit stage: https://github.com/solana-labs/solana/blob/4305d4b7b/ledger/src/blockstore.rs#L884-L930 Shred signature verifier: https://github.com/solana-labs/solana/blob/4305d4b7b/core/src/sigverify_shreds.rs#L41-L57 https://github.com/solana-labs/solana/blob/4305d4b7b/ledger/src/sigverify_shreds.rs#L105 (cherry picked from commit `6d9818b8e4`) # Conflicts: # core/src/broadcast_stage/broadcast_duplicates_run.rs # ledger/src/shred.rs * removes backport merge conflicts Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-28 15:26:30 +00:00
mergify[bot]	1bf88556ee	removes Slot from TransmitShreds (backport #19327 ) (#20260 ) * removes Slot from TransmitShreds (#19327) An earlier version of the code was funneling through stakes along with shreds to broadcast: https://github.com/solana-labs/solana/blob/b67ffab37/core/src/broadcast_stage.rs#L127 This was changed to only slots as stakes computation was pushed further down the pipeline in: https://github.com/solana-labs/solana/pull/18971 However shreds themselves embody which slot they belong to. So pairing them with slot is redundant and adds rooms for bugs should they become inconsistent. (cherry picked from commit `1deb4add81`) # Conflicts: # core/benches/cluster_info.rs # core/src/broadcast_stage.rs # core/src/broadcast_stage/broadcast_duplicates_run.rs # core/src/broadcast_stage/fail_entry_verification_broadcast_run.rs # core/src/broadcast_stage/standard_broadcast_run.rs * removes backport merge conflicts Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-28 12:55:01 +00:00
mergify[bot]	4c4f183515	reverts #17542 (#20259 ) (#20273 ) https://github.com/solana-labs/solana/pull/17542 excludes caller's crds values from pull responses. Reverting that commit so that when a (staked) node restarts, it can obtain its crds values before restart from other nodes. (cherry picked from commit `43ed727ba7`) Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-28 12:54:31 +00:00
mergify[bot]	282322cbe8	Add more docs for RpcClient (backport #19771 ) (#20266 ) * Add more docs for RpcClient (#19771) * Add more docs for RpcClient * Use custom mocks in rpc_client examples * Move create_rpc_client_mocks into rpc_client module Signed-off-by: Brian Anderson <andersrb@gmail.com> * Update client/src/rpc_client.rs Co-authored-by: Tyera Eulberg <teulberg@gmail.com> * Update RpcClient docs per review feedback * Consistently link 'commitment level' in RpcClient docs Co-authored-by: Tyera Eulberg <teulberg@gmail.com> (cherry picked from commit `082d5dc5b2`) # Conflicts: # client/src/mock_sender.rs # client/src/rpc_client.rs * Fix conflicts Co-authored-by: Brian Anderson <andersrb@gmail.com> Co-authored-by: Tyera Eulberg <tyera@solana.com>	2021-09-28 07:27:23 +00:00
mergify[bot]	2dc00d0e13	Paper wallet: fix URI scheme (#20233 ) (#20278 ) (cherry picked from commit `38844a7010`) Co-authored-by: Ted Robertson <10043369+tredondo@users.noreply.github.com>	2021-09-28 01:29:42 +00:00
Kirill Fomichev	a90c338982	Rpc: use rust convenient methods (cherry picked from commit `ac79ae6848`)	2021-09-27 13:46:27 -07:00
drbh	36c283026f	fix Borsh typo changes `BORSH_IO_ERROR` from `unkown` to `unknown` error (cherry picked from commit `e94b7984a1`)	2021-09-27 12:30:41 -07:00
mergify[bot]	a1a0c63862	retransmits shreds recovered from erasure codes (backport #19233 ) (#20249 ) * removes packet-count metrics from retransmit stage Working towards sending shreds (instead of packets) to retransmit stage so that shreds recovered from erasure codes are as well retransmitted. Following commit will add these metrics back to window-service, earlier in the pipeline. (cherry picked from commit `bf437b0336`) # Conflicts: # core/src/retransmit_stage.rs * adds packet/shred count stats to window-service Adding back these metrics from the earlier commit which removed them from retransmit stage. (cherry picked from commit `8198a7eae1`) * removes erroneous uses of Arc<...> from retransmit stage (cherry picked from commit `6e413331b5`) # Conflicts: # core/src/retransmit_stage.rs # core/src/tvu.rs * sends shreds (instead of packets) to retransmit stage Working towards channelling through shreds recovered from erasure codes to retransmit stage. (cherry picked from commit `3efccbffab`) # Conflicts: # core/src/retransmit_stage.rs * returns completed-data-set-info from insert_data_shred instead of opaque (u32, u32) which are then converted to CompletedDataSetInfo at the call-site. (cherry picked from commit `3c71670bd9`) # Conflicts: # ledger/src/blockstore.rs * retransmits shreds recovered from erasure codes Shreds recovered from erasure codes have not been received from turbine and have not been retransmitted to other nodes downstream. This results in more repairs across the cluster which is slower. This commit channels through recovered shreds to retransmit stage in order to further broadcast the shreds to downstream nodes in the tree. (cherry picked from commit `7a8807b8bb`) # Conflicts: # core/src/retransmit_stage.rs # core/src/window_service.rs * removes backport merge conflicts Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-27 18:11:37 +00:00
mergify[bot]	e20fdde0a4	Wallet guide: fix grammar (#20228 ) (#20254 ) (cherry picked from commit `f107aa296b`) Co-authored-by: Ted Robertson <10043369+tredondo@users.noreply.github.com>	2021-09-27 16:56:14 +00:00
mergify[bot]	5b52ac8990	Fix grammar in conventions.md (#20236 ) (#20252 ) (cherry picked from commit `af57bd3d48`) Co-authored-by: Ted Robertson <10043369+tredondo@users.noreply.github.com>	2021-09-27 16:39:39 +00:00
mergify[bot]	502ae8b319	removes repeated bank-forks locking in window-service (backport #19210 ) (#20239 ) * removes repeated bank-forks locking in window-service Window service is repeatedly locking bank-forks to look-up working-bank for every single shred: https://github.com/solana-labs/solana/blob/5fde4ee3a/core/src/window_service.rs#L597-L606 This commit updates shred_filter signature in recv_window so that where we already obtain the lock on bank-forks, we can also look-up working-bank once for all packets: https://github.com/solana-labs/solana/blob/5fde4ee3a/core/src/window_service.rs#L256-L277 (cherry picked from commit `d57398a959`) # Conflicts: # core/src/window_service.rs * removes erroneous uses of &Arc<...> from window-service (cherry picked from commit `b64eeb7729`) # Conflicts: # core/src/window_service.rs * removes backport merge conflicts Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-27 13:48:58 +00:00
mergify[bot]	30f0b3cf53	removes raw indexing from streamer (backport #19183 ) (#20237 ) * removes raw indexing from streamer (#19183) Raw indexing is verbose and error-prone. This same code had an indexing bug causing validator nodes panic just a few months ago: https://github.com/solana-labs/solana/commit/482b8c6be (cherry picked from commit `8229a4fbf6`) # Conflicts: # streamer/Cargo.toml * removes backport merge conflicts Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-27 13:48:13 +00:00
mergify[bot]	2975dc5c1a	removes use of public ip addresses from perf tests (backport #19184 ) (#20238 ) * removes use of public ip addresses from system tests Using global IPs causes outbound traffic which costs money: https://github.com/solana-labs/solana/pull/18728#issuecomment-884290209 (cherry picked from commit `bd8f793809`) * removes redundant allow-private-addr from system tests Following https://github.com/solana-labs/solana/pull/19130 if gce.sh create is invoked without -P then --allow-private-addr is implied: https://github.com/solana-labs/solana/blob/4cc1b1504/net/common.sh#L68-L73 Therefore tests only need to specify: USE_PUBLIC_IP_ADDRESSES: "false" (cherry picked from commit `18463aa846`) # Conflicts: # system-test/partition-testcases/gce-5-node-3-partition.yml # system-test/partition-testcases/gce-partition-once-then-stabilize.yml # system-test/partition-testcases/gce-partition-with-offline.yml * removes backport merge conflicts Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-27 01:23:02 +00:00
mergify[bot]	d68377e927	unifies cluster-nodes computation & caching across turbine stages (backport #18971 ) (#20231 ) * sends slots (instead of stakes) through broadcast flow Current broadcast code is computing stakes for each slot before sending them down the channel: https://github.com/solana-labs/solana/blob/049fb0417/core/src/broadcast_stage/standard_broadcast_run.rs#L208-L228 https://github.com/solana-labs/solana/blob/0cf52e206/core/src/broadcast_stage.rs#L342-L349 Since the stakes are a function of epoch the slot belongs to (and so does not necessarily change from one slot to another), forwarding the slot itself would allow better caching downstream. In addition we need to invalidate the cache if the epoch changes (which the current code does not do), and that requires to know which slot (and so epoch) current broadcasted shreds belong to: https://github.com/solana-labs/solana/blob/19bd30262/core/src/broadcast_stage/standard_broadcast_run.rs#L332-L344 (cherry picked from commit `44b11154ca`) # Conflicts: # core/src/broadcast_stage/broadcast_duplicates_run.rs # core/src/broadcast_stage/standard_broadcast_run.rs * implements cluster-nodes cache Cluster nodes are cached keyed by the respective epoch from which stakes are obtained, and so if epoch changes cluster-nodes will be recomputed. A time-to-live eviction policy is enforced to refresh entries in case gossip contact-infos are updated. (cherry picked from commit `ecc1c7957f`) * uses cluster-nodes cache in retransmit stage The new cluster-nodes cache will: * ensure cluster-nodes are recalculated if the epoch (and so the epoch staked nodes) changes. * encapsulate time-to-live eviction policy. 
(cherry picked from commit `30bec3921e`) * uses cluster-nodes cache in broadcast-stage * Current caching mechanism does not update cluster-nodes when the epoch (and so epoch staked nodes) changes: https://github.com/solana-labs/solana/blob/19bd30262/core/src/broadcast_stage/standard_broadcast_run.rs#L332-L344 * Additionally, the cache update has a concurrency bug in which the thread which does compare_and_swap may be blocked when it tries to obtain the write-lock on cache, while other threads will keep running ahead with the outdated cache (since the atomic timestamp is already updated). In the new ClusterNodesCache, entries are keyed by epoch, and so if epoch changes cluster-nodes will be recalculated. The time-to-live eviction policy is also encapsulated and rigidly enforced. (cherry picked from commit `aa32738dd5`) # Conflicts: # core/src/broadcast_stage/broadcast_duplicates_run.rs # core/src/broadcast_stage/fail_entry_verification_broadcast_run.rs # core/src/broadcast_stage/standard_broadcast_run.rs * unifies cluster-nodes computation & caching across turbine stages Broadcast-stage is using epoch_staked_nodes based on the same slot that shreds belong to: https://github.com/solana-labs/solana/blob/049fb0417/core/src/broadcast_stage/standard_broadcast_run.rs#L208-L228 https://github.com/solana-labs/solana/blob/0cf52e206/core/src/broadcast_stage.rs#L342-L349 But retransmit-stage is using bank-epoch of the working-bank: https://github.com/solana-labs/solana/blob/19bd30262/core/src/retransmit_stage.rs#L272-L289 So the two are not consistent at epoch boundaries where some nodes may have a working bank (or similarly a root bank) lagging other nodes. As a result the node which obtains a packet may construct turbine broadcast tree inconsistently with its parent node in the tree and so some packets may fail to reach all nodes in the tree. 
(cherry picked from commit `50d0e830c9`) * adds fallback & metric for when epoch staked-nodes are none (cherry picked from commit `fb69f45f14`) * allows only one thread to update cluster-nodes cache entry for an epoch If two threads simultaneously call into ClusterNodesCache::get for the same epoch, and the cache entry is outdated, then both threads recompute cluster-nodes for the epoch and redundantly overwrite each other. This commit wraps ClusterNodesCache entries in Arc<Mutex<...>>, so that when needed only one thread does the computations to update the entry. (cherry picked from commit `eaf927cf49`) * falls back on working-bank if root-bank::epoch-staked-nodes is none bank.get_leader_schedule_epoch(shred_slot) is one epoch after epoch_schedule.get_epoch(shred_slot). At epoch boundaries, shred is already one epoch after the root-slot. So we need epoch-stakes 2 epochs ahead of the root. But the root bank only has epoch-stakes for one epoch ahead, and as a result looking up epoch staked-nodes from the root-bank fails. To be backward compatible with the current master code, this commit implements a fallback on working-bank if epoch staked-nodes obtained from the root-bank is none. (cherry picked from commit `e4be00fece`) * removes backport merge conflicts Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-26 23:45:42 +00:00
mergify[bot]	cc1a3d6645	recvmmsg IPv6 awareness (#18957 ) (#20232 ) (cherry picked from commit `0b7ed18cfa`) Co-authored-by: Jeff Biseda <jbiseda@gmail.com>	2021-09-26 23:09:21 +00:00
mergify[bot]	e9a993fb59	allows sendmmsg api taking owned values (as well as references) (#18999 ) (#20226 ) Current signature of api in sendmmsg requires a slice of inner references: https://github.com/solana-labs/solana/blob/fe1ee4980/streamer/src/sendmmsg.rs#L130-L152 That forces the call-site to convert owned values to references even though doing so is redundant and adds an extra level of indirection: https://github.com/solana-labs/solana/blob/fe1ee4980/core/src/repair_service.rs#L291 This commit expands the api using AsRef and Borrow traits to allow calling the method with owned values (as well as references like before). (cherry picked from commit `049fb0417f`) Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-26 20:35:13 +00:00
Kirill Fomichev	88177d33fd	Rpc: remove not required clone (cherry picked from commit `9542bae56e`)	2021-09-26 12:50:53 -07:00
mergify[bot]	0ec301f1c3	improves parallelism in window-service recv_window (backport #18446 ) (#20142 ) * sends packets in batches from sigverify-stage (#18446) sigverify-stage is breaking batches to single-item vectors before sending them down the channel: https://github.com/solana-labs/solana/blob/d451363dc/core/src/sigverify_stage.rs#L88-L92 Also simplifying window-service code, reducing number of nested branches. (cherry picked from commit `7d56fa8363`) # Conflicts: # core/src/window_service.rs * removes backport merge conflicts Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-26 18:54:07 +00:00
mergify[bot]	34665571fa	drop outstanding_requests lock before sending repair requests (backport #18893 ) (#20227 ) * drop outstanding_requests lock before sending repair requests (#18893) (cherry picked from commit `9255ae334d`) # Conflicts: # core/src/repair_service.rs * removes backport merge conflicts Co-authored-by: Jeff Biseda <jbiseda@gmail.com> Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-26 18:39:28 +00:00
mergify[bot]	5dd1c2191e	shares cluster-nodes between retransmit threads (backport #18947 ) (#20221 ) * shares cluster-nodes between retransmit threads (#18947) cluster_nodes and last_peer_update are not shared between retransmit threads, as each thread has its own value: https://github.com/solana-labs/solana/blob/65ccfed86/core/src/retransmit_stage.rs#L476-L477 Additionally, with shared references, this code: https://github.com/solana-labs/solana/blob/0167daa11/core/src/retransmit_stage.rs#L315-L328 has a concurrency bug where the thread which does compare_and_swap, updates cluster_nodes much later after other threads have run with outdated cluster_nodes for a while. In particular, the write-lock there may block. (cherry picked from commit `d06dc6c8a6`) # Conflicts: # core/benches/retransmit_stage.rs # core/src/retransmit_stage.rs * removes backport merge conflicts Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-26 16:29:34 +00:00
mergify[bot]	aacb5e58ad	sendmmsg cleanup #18589 (#20175 ) Rationalize usage of sendmmsg(2). Skip packets which failed to send and track failures. (cherry picked from commit `ae5ad5cf9b`) Co-authored-by: Jeff Biseda <jbiseda@gmail.com>	2021-09-25 23:00:00 +00:00
mergify[bot]	8d2dce6f6b	allows private addresses if not public network (#20178 ) (cherry picked from commit `c4f2e5f88c`) # Conflicts: # net/net.sh Co-authored-by: behzad nouri <behzadnouri@gmail.com>	2021-09-25 21:34:24 +00:00


14822 Commits