[Access] Add endpoints to execution nodes to support tx result err msgs #1398

AndriiDiachuk · 2023-11-02T13:42:56Z

Description

This pull request introduces a new GRPC API call which gets the error messages of all failed transactions in the block ordered by transaction index.

…chuk/4754-add-endpoints-to-execution-nodes-to-suppport-tx-result-err-msgs

peterargue

Thanks for the PR @AndriiDiachuk!

I think this is a good start, and there's a little more needed for the usecase. I think we will need 2 new endpoints:

GetTransactionErrorMessage -> Takes a transactionID and blockID, and returns the error message string for that tx.
GetTransactionErrorMessagesByBlockID -> similar to what you have. takes a blockID, and returns a set of objects, one for each tx with an error.

I think the GetTransactionErrorMessagesByBlockID response should contain a repeated message that includes the transaction's ID, index and error string.

peterargue · 2023-11-02T16:45:13Z

protobuf/flow/execution/execution.proto

@@ -50,6 +50,11 @@ service ExecutionAPI {
  rpc GetTransactionResultsByBlockID(GetTransactionsByBlockIDRequest)
      returns (GetTransactionResultsResponse);

+  // GetTransactionErrorMessagesByBlockID gets the error messages of all failed transactions in the
+  // block ordered by transaction index
+  rpc GetTransactionErrorMessagesByBlockID(GetTransactionsByBlockIDRequest)


let's create a separate request object for this GetTransactionErrorMessagesByBlockIDRequest

peterargue · 2023-11-03T04:59:48Z

protobuf/flow/execution/execution.proto

@@ -141,6 +146,10 @@ message GetTransactionResultsResponse {
  entities.EventEncodingVersion event_encoding_version = 2;
 }

+message GetTransactionErrorMessagesResponse {
+  repeated string error_messages = 1;


we'll need a way to differentiate between the messages for different errors.

AndriiDiachuk · 2023-11-03T12:41:39Z

Thanks for the PR @AndriiDiachuk!

I think this is a good start, and there's a little more needed for the usecase. I think we will need 2 new endpoints:

GetTransactionErrorMessage -> Takes a transactionID and blockID, and returns the error message string for that tx. GetTransactionErrorMessagesByBlockID -> similar to what you have. takes a blockID, and returns a set of objects, one for each tx with an error.

I think the GetTransactionErrorMessagesByBlockID response should contain a repeated message that includes the transaction's ID, index and error string.

Thanks for response @peterargue. Originally, I wanted to implement a few methods like you have suggested but after taking a closer look on current implementation I have noticed that we sync execution data using execution data sync module which syncs error messages only block by block basis. So I thought that we should mirror this logic and sync error messages for all failed transactions in the block. Regarding the response, I've omitted txid and index since we can derive that data by looking at the transactions in block and filtering failed transactions.

For example, if we consider block txns: [A(success), B(failure), C(success), D(failure), E(failure)].
We will get the following response: [err_msg_B, err_msg_D, err_msg_E],
and we can derive txid and index by applying filter func F(res:LightTransactionResult) bool { return res.Failed }.
Main goal is to reduce the size of the response.

If you are worried about consistency of mapping error messages to the actual light execution results we can add a checksum of involved tx ids or a mechanism similar to signer indices(a bitset of tx indexes that are returned). If you think we can neglect with extra space I can always include full tx ids and indexes.

…chuk/4754-add-endpoints-to-execution-nodes-to-suppport-tx-result-err-msgs

peterargue · 2023-11-15T00:45:49Z

Thanks for response @peterargue. Originally, I wanted to implement a few methods like you have suggested but after taking a closer look on current implementation I have noticed that we sync execution data using execution data sync module which syncs error messages only block by block basis. So I thought that we should mirror this logic and sync error messages for all failed transactions in the block. Regarding the response, I've omitted txid and index since we can derive that data by looking at the transactions in block and filtering failed transactions.

For example, if we consider block txns: [A(success), B(failure), C(success), D(failure), E(failure)]. We will get the following response: [err_msg_B, err_msg_D, err_msg_E], and we can derive txid and index by applying filter func F(res:LightTransactionResult) bool { return res.Failed }. Main goal is to reduce the size of the response.

If you are worried about consistency of mapping error messages to the actual light execution results we can add a checksum of involved tx ids or a mechanism similar to signer indices(a bitset of tx indexes that are returned). If you think we can neglect with extra space I can always include full tx ids and indexes.

So my thinking on this is:

Not every Access node/observer will need all tx error messages for every block. The public nodes, nodes that are used with blockchain indexers and other special usecases may, but the majority will only care about a small subset.
While it may be simplest in an initial version to have every node sync all error messages for every block, this may not scale well. Even if this is how the initial implementation works on ANs, let's still add the endpoint to get the error from a specific tx. I'd rather get the work done now, than have to rescope it later since doing both is minimally more work than just one.
I see your point about the response size. I worry that if there's a bug or execution fork on an EN, it could returns results for a different set of tx, resulting in inaccurate information cached and returned from the AN. While it does add a little overhead (~40 bytes per tx), the overhead is generally much less than the size of the error messages themselves and allows the AN to validate it got errors for all of the expected tx.

…chuk/4754-add-endpoints-to-execution-nodes-to-suppport-tx-result-err-msgs

AndriiDiachuk · 2023-11-15T10:37:55Z

Thanks for response @peterargue. Originally, I wanted to implement a few methods like you have suggested but after taking a closer look on current implementation I have noticed that we sync execution data using execution data sync module which syncs error messages only block by block basis. So I thought that we should mirror this logic and sync error messages for all failed transactions in the block. Regarding the response, I've omitted txid and index since we can derive that data by looking at the transactions in block and filtering failed transactions.
For example, if we consider block txns: [A(success), B(failure), C(success), D(failure), E(failure)]. We will get the following response: [err_msg_B, err_msg_D, err_msg_E], and we can derive txid and index by applying filter func F(res:LightTransactionResult) bool { return res.Failed }. Main goal is to reduce the size of the response.
If you are worried about consistency of mapping error messages to the actual light execution results we can add a checksum of involved tx ids or a mechanism similar to signer indices(a bitset of tx indexes that are returned). If you think we can neglect with extra space I can always include full tx ids and indexes.

So my thinking on this is:

Not every Access node/observer will need all tx error messages for every block. The public nodes, nodes that are used with blockchain indexers and other special usecases may, but the majority will only care about a small subset.

While it may be simplest in an initial version to have every node sync all error messages for every block, this may not scale well. Even if this is how the initial implementation works on ANs, let's still add the endpoint to get the error from a specific tx. I'd rather get the work done now, than have to rescope it later since doing both is minimally more work than just one.

I see your point about the response size. I worry that if there's a bug or execution fork on an EN, it could returns results for a different set of tx, resulting in inaccurate information cached and returned from the AN. While it does add a little overhead (~40 bytes per tx), the overhead is generally much less than the size of the error messages themselves and allows the AN to validate it got errors for all of the expected tx.

@peterargue Made some changes with your suggestions. Waiting for you feedback. Thanks in advance.

AndriiDiachuk added 2 commits November 2, 2023 15:16

Updated protobuf to include new method

9bca3f8

Merge branch 'master' of github.com:AndriiDiachuk/flow into AndriiDia…

9d24923

…chuk/4754-add-endpoints-to-execution-nodes-to-suppport-tx-result-err-msgs

durkmurder requested a review from peterargue November 2, 2023 13:52

peterargue reviewed Nov 3, 2023

View reviewed changes

Merge branch 'master' of github.com:AndriiDiachuk/flow into AndriiDia…

63e3c27

…chuk/4754-add-endpoints-to-execution-nodes-to-suppport-tx-result-err-msgs

AndriiDiachuk added 2 commits November 15, 2023 12:33

Merge branch 'master' of github.com:AndriiDiachuk/flow into AndriiDia…

ce4c42a

…chuk/4754-add-endpoints-to-execution-nodes-to-suppport-tx-result-err-msgs

Added changes due to comments left in PR

ed39603

peterargue approved these changes Nov 17, 2023

View reviewed changes

peterargue requested a review from sideninja November 17, 2023 01:26

peterargue merged commit 19ae56e into onflow:master Nov 17, 2023
1 check passed

AndriiDiachuk mentioned this pull request Nov 21, 2023

[Access] Add endpoints to Execution nodes to support getting Transaction Result error messages onflow/flow-go#5042

Merged

AndriiDiachuk deleted the AndriiDiachuk/4754-add-endpoints-to-execution-nodes-to-suppport-tx-result-err-msgs branch November 21, 2023 12:38

peterargue changed the title ~~Andrii diachuk/4754 add endpoints to execution nodes to suppport tx result err msgs~~ [Access] Add endpoints to execution nodes to support tx result err msgs Nov 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Access] Add endpoints to execution nodes to support tx result err msgs #1398

[Access] Add endpoints to execution nodes to support tx result err msgs #1398

AndriiDiachuk commented Nov 2, 2023

peterargue left a comment

peterargue Nov 2, 2023

peterargue Nov 3, 2023

AndriiDiachuk commented Nov 3, 2023 •

edited

Loading

peterargue commented Nov 15, 2023

AndriiDiachuk commented Nov 15, 2023

[Access] Add endpoints to execution nodes to support tx result err msgs #1398

[Access] Add endpoints to execution nodes to support tx result err msgs #1398

Conversation

AndriiDiachuk commented Nov 2, 2023

Description

peterargue left a comment

Choose a reason for hiding this comment

peterargue Nov 2, 2023

Choose a reason for hiding this comment

peterargue Nov 3, 2023

Choose a reason for hiding this comment

AndriiDiachuk commented Nov 3, 2023 • edited Loading

peterargue commented Nov 15, 2023

AndriiDiachuk commented Nov 15, 2023

AndriiDiachuk commented Nov 3, 2023 •

edited

Loading