chore(fix): Hitting unhealthy node 10 times #2819

ivaylogarnev-limechain · 2025-01-23T08:19:41Z

Description:
This PR refactors and fixes the unhealthy node error handling logic in Executable.js. It also adds unit tests that cover the fixed behaviour.

Related issue(s):
#2804

Checklist

Documented (Code comments, README, etc.)
Tested (unit, integration, etc.)

0xivanov · 2025-01-24T10:18:39Z

src/Executable.js

                continue;
            }

+            this._nodeAccountIds.advance();


Some notes:

Previously we always called advance() before the if (!node.isHealthy()) check; now there are two advance() calls. Is this necessary?

Around line 740 in Executable.js, we have client._network.increaseBackoff(node);, which removes the node from the healthy nodes list and sets a new value for _readmitTime. This logic still works as expected, right?

1. Both advance() calls handle different scenarios in the retry/rotation logic.

1.1 The first advance(), inside the health check, runs only when the current node is unhealthy.

1.2 The second advance() is part of the normal node rotation when retrying against different nodes
(tested in AccountInfoMocking.js: "should retry on INTERNAL and retry multiple nodes").

2. Yes, execution still reaches that point, because this code covers the scenario where the node is initially healthy but throws an error after a request is made. If the error is a GrpcServiceError or HttpError, increaseBackoff() is triggered, as expected.

About the first point: this means we advance in both cases. Can we have a single advance() above the if (!node.isHealthy()) check on line 644?

The issue is that a single advance() before the health check effectively skips the health check of the first node. That would break the node health checking, as demonstrated by the failing test "should skip unhealthy node and execute with healthy node".
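
To make the two advance() calls discussed in this thread easier to follow, here is a minimal, self-contained sketch of the rotation pattern. It is not the code from Executable.js: NodeCursor, executeWithRotation, and the isHealthy and send callbacks are hypothetical stand-ins, and backoff, delays, and error classification are left out.

```javascript
// Hypothetical stand-in for the SDK's node id list; only `current` and
// `advance()` are modelled here.
class NodeCursor {
    constructor(ids) {
        this.ids = ids;
        this.index = 0;
    }

    get current() {
        return this.ids[this.index];
    }

    advance() {
        this.index = (this.index + 1) % this.ids.length;
    }
}

// Simplified attempt loop. `isHealthy` and `send` are caller-supplied
// callbacks (assumptions, not SDK APIs); `send` returns true on success.
function executeWithRotation(cursor, isHealthy, send, maxAttempts = 10) {
    const visited = [];

    for (let attempt = 1; attempt <= maxAttempts; attempt += 1) {
        const nodeId = cursor.current;
        visited.push(nodeId);

        if (!isHealthy(nodeId)) {
            // First advance: move past the unhealthy node instead of
            // checking the same node again on the next attempt.
            cursor.advance();
            continue;
        }

        // Second advance: normal rotation, so a retry goes to the next node.
        cursor.advance();

        if (send(nodeId)) {
            return { executedOn: nodeId, visited };
        }
    }

    throw new Error(`No success after ${maxAttempts} attempts`);
}

// Node 0.0.3 is unhealthy, 0.0.4 fails its request, 0.0.5 succeeds.
const result = executeWithRotation(
    new NodeCursor(["0.0.3", "0.0.4", "0.0.5"]),
    (id) => id !== "0.0.3",
    (id) => id === "0.0.5",
);
console.log(result.executedOn, result.visited);
```

In this sketch the request ends up on 0.0.5 after visiting 0.0.3, 0.0.4, and 0.0.5 in that order. The current node is read before any advance(), so the first node's health is always checked.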

0xivanov · 2025-01-24T10:29:50Z

test/unit/AccountInfoMocking.js

+            const responses1 = [
+                { response: ACCOUNT_INFO_QUERY_COST_RESPONSE },
+                { response: ACCOUNT_INFO_QUERY_RESPONSE },
+            ];


We have it("should retry on UNAVAILABLE", async function () {. What is the behaviour there with more than one node?

Test added that covers this case.

ivaylonikolov7

Did you manage to test the new functionality on testnet, where we actually have unhealthy nodes? I know it's hard to write integration tests for this, because you can't easily make a local node unhealthy, and you would then need a second account to execute the transaction on.

ivaylogarnev-limechain · 2025-01-28T08:15:28Z

This functionality is being tested in AccountBalanceIntegrationTest.js under the test case "can connect to testnet with TLS."

Currently, node 5 on the testnet is down, yet the tests still pass. If you explicitly hardcode the nodeAccountId to only use node 5 with:

.setNodeAccountIds([new AccountId(5)])

it will throw the error:
"Network connectivity issue: All nodes are unhealthy. Original node list: 0.0.5"

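The failure mode described above, where the only configured node is down, can be illustrated with a small sketch. pickHealthyNode is a hypothetical helper, not an SDK function. The message text is copied from the error quoted above; the ", " separator for lists with more than one node is an assumption.

```javascript
// Hypothetical helper, not part of the SDK: returns the first healthy node id,
// or throws when every node in the list is unhealthy.
function pickHealthyNode(nodeIds, isHealthy) {
    const healthy = nodeIds.find((id) => isHealthy(id));

    if (healthy === undefined) {
        // Message text taken from the error quoted in this thread; the
        // ", " separator between multiple ids is an assumption.
        throw new Error(
            "Network connectivity issue: All nodes are unhealthy. " +
                `Original node list: ${nodeIds.join(", ")}`,
        );
    }

    return healthy;
}

// Pinning the request to a single node that is down leaves nothing to fall
// back to, so the call fails immediately.
try {
    pickHealthyNode(["0.0.5"], () => false);
} catch (error) {
    console.log(error.message);
}
```

With more than one id in the list, the same helper returns the first healthy node instead of throwing, which matches the observation that the testnet tests still pass while node 5 is down.
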
fix: Refactor the unhealthy logic inside the Executable class and int…

65199bd

…roduces excludeCurrent method in the List class Signed-off-by: ivaylogarnev-limechain <[email protected]>

ivaylogarnev-limechain changed the title ~~fix: Refactor the unhealthy logic inside the Executable class and int…~~ chore(fix): Hitting unhealthy node 10 times Jan 23, 2025

ivaylogarnev-limechain added 5 commits January 23, 2025 15:09

fix: Added nodeAccountIds current node increasement

95dd3a2

Signed-off-by: ivaylogarnev-limechain <[email protected]>

Merge branch 'main' into fix/hitting-unhealthy-node-ten-times

95c1970

refactor: Moved the nodeAccountIds advance outside condition

0b1f20d

Signed-off-by: ivaylogarnev-limechain <[email protected]>

Merge branch 'fix/hitting-unhealthy-node-ten-times' of github.com:has…

aa9080d

…hgraph/hedera-sdk-js into fix/hitting-unhealthy-node-ten-times

refactor: Changed and simplified the logic for the unhealthy nodes an…

926eea5

…d added unit tests Signed-off-by: ivaylogarnev-limechain <[email protected]>

ivaylogarnev-limechain marked this pull request as ready for review January 24, 2025 09:19

ivaylogarnev-limechain requested a review from a team as a code owner January 24, 2025 09:19

ivaylogarnev-limechain self-assigned this Jan 24, 2025

ivaylogarnev-limechain requested review from ivaylonikolov7, 0xivanov and agadzhalov January 24, 2025 09:20

0xivanov reviewed Jan 24, 2025

test: Added multiple nodes UNAVAILABLE behavior test in AccountInfoM…

c51ecec

…ocking Signed-off-by: ivaylogarnev-limechain <[email protected]>

ivaylonikolov7 reviewed Jan 27, 2025

refactor: Changed the mocker response node account id

ef5f979

Signed-off-by: ivaylogarnev-limechain <[email protected]>

ivaylonikolov7 previously approved these changes Jan 28, 2025

fix: Removed unhealthy node from ClientIntegration test suite

680b3f7

Signed-off-by: ivaylogarnev-limechain <[email protected]>

ivaylogarnev-limechain dismissed ivaylonikolov7’s stale review via 680b3f7 January 28, 2025 13:34
