Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document cluster restore implementation details of Scylla Manager 3.4 (with L&S) #4051

Open
karol-kokoszka opened this issue Sep 30, 2024 · 1 comment
Labels
documentation Improvements or additions to documentation restore
Milestone

Comments

@karol-kokoszka
Copy link
Collaborator

Scylla Manager 3.4 will include a set of improvements for the restore process conducted with SM.

We identified a few bottlenecks in pre-3.4 versions of the manager that prevented full utilization of cluster nodes during the restore procedure.

For example:

  • Tables were being restored sequentially, meaning only the SSTables of a given table were distributed to the cluster nodes. The batch-size flag for the restore task defines the number of files sent to each node. If the batch size is too large, some nodes may not receive any batches (due to an insufficient number of SSTables for a single table), causing them to remain idle.
  • The default batch-size value of 2 led to suboptimal utilization of Scylla's load and stream feature across shards. We found that a utilization level of ~80% could be reached by fine-tuning this value. A higher batch-size indicates that tables cannot be restored sequentially if we aim to utilize all nodes during the restore process.
  • The default value for the number of transfers (2) resulted in the data download not fully utilizing available bandwidth. ...

We must create documentation detailing the implementation specifics of the restore procedure in Manager 3.4.

@karol-kokoszka karol-kokoszka added documentation Improvements or additions to documentation restore labels Sep 30, 2024
@karol-kokoszka karol-kokoszka added this to the 3.4 milestone Sep 30, 2024
@karol-kokoszka
Copy link
Collaborator Author

Let's proceed with updating README.md from https://github.com/scylladb/scylla-manager/tree/master/pkg/service/restore
and leave the general documentation from https://manager.docs.scylladb.com/stable/restore/index.html without the implementation details.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation restore
Projects
None yet
Development

No branches or pull requests

1 participant