Create subsection "Nested workflow" in the docs #277

agoscinski · 2024-08-27T12:09:26Z

Moves the graph builder, if task and while task examples into this subsection. Also removed the graph_builder.ipynb since it is now generated. Solves issue #195

codecov-commenter · 2024-08-27T12:42:47Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 80.60%. Comparing base (5937b88) to head (3146393).
Report is 84 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #277      +/-   ##
==========================================
+ Coverage   75.75%   80.60%   +4.85%     
==========================================
  Files          70       66       -4     
  Lines        4615     5135     +520     
==========================================
+ Hits         3496     4139     +643     
+ Misses       1119      996     -123

Flag	Coverage Δ
python-3.11	`80.52% <ø> (+4.85%)`	⬆️
python-3.12	`80.52% <ø> (?)`
python-3.9	`80.56% <ø> (+4.82%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

agoscinski · 2024-08-27T13:51:44Z

test_pause_task_after_submit seems a bit unstable it failed already two times in different jobs

superstar54

Looking at the if and while task again, because we now have new If task and while task, thus there are two ways to implement if

if task, dose not need nested workgraph
use graph_builder, thus nested workgraph.

thus it's better to keep if and while separately, and not move to the nested_workgraph.

I suggest at the end of the graph_builder.py, we make a link to show the application in the if and while.

agoscinski · 2024-08-28T14:16:03Z

I dont think the graph_builder example is really about the dynamic part as it just forwards inputs like any task does it. We could make another example that runs different tasks depending on the inputs. Something like

@task.calcfunction()
def add_one(x):
    return x.value+1

@task.calcfunction()
def modulo_five(x):
    return x % 5

@graph_builder(outputs = [...])
def my_modular(i: orm.Int):
    wg = WorkGraph()
    if i.value < 5:
        task = wg.add_task(add_one)
    else:
        task = wg.add_task(modulo_five)
    # need wg.selector to expose for linking?
    return wg

Then one could say that this does not preserve provenance and directly interlude to the If workgraph

agoscinski · 2024-08-29T12:20:39Z

Added an example for a dynamic use case of the graph_builder

superstar54

Hi @agoscinski , Thanks for the work.

I would suggest not creating a dynamic-workflows section, but moving all three notebooks into the howto, so that user can see the dynamic, if and while immediately.

superstar54 · 2024-09-03T08:29:36Z

docs/gallery/howto/dynamic_workflows/autogen/dynamic_graph_builder.py

+@task.graph_builder(outputs=[{"name": "result", "from": "context.out"}])
+def add_modulo(i: Int):
+    wg = WorkGraph()
+    if i.value < 2:
+        task = wg.add_task(add_one, x=i)
+    else:
+        task = wg.add_task(modulo_two, x=i)
+
+    task.set_context({"result": "out"})
+    return wg
+


This is a duplicate as in the if. I would suggest using a for loop inside the graph_builder so that the number of tasks depends on an input value, making the work graph dynamic.

I think we have quite a few for loops in the examples too therefore I don't see duplication as a reason to rather use for loops. I feel this example is more educative because you are changing the type of task, you are in different branch of your program depending on your input which might be more intuitive to be dynamic.

But I can also just add both.

superstar54 · 2024-09-03T08:31:56Z

Then one could say that this does not preserve provenance and directly interlude to the If workgraph

Why ti does not preserve provenance?

agoscinski · 2024-09-09T12:33:00Z

I would suggest not creating a dynamic-workflows section, but moving all three notebooks into the howto, so that user can see the dynamic, if and while immediately.

Okay, but the howto's starting to get messy and need some structure at some point, but I think for now one still can put everything there

agoscinski · 2024-09-09T12:42:17Z

Why ti does not preserve provenance?

I mean more that it does not store provenance as transparently as using the If WorkGraph, but looking at the provenance graph, It is not really capturing the if-then-else logic, but just only some additional information about the condition. But at least in the GUI the if-then-else is capture more transparently.

agoscinski · 2024-09-11T10:46:29Z

I merged now the nested and dynamic example because it seemed strange to see them separate

superstar54

Hi @agoscinski , thanks for the update!

The dynamic_graph_builder.py is not used, I think you want to remove it.

docs/source/howto/index.rst

superstar54 · 2024-09-18T16:55:36Z

docs/gallery/howto/autogen/graph_builder.py

+# Nested workflows
+# ================
+# The `Graph Builder` allow user to create nested workflows from an input.


Better to give an overview: user can create nested WorkGraph in two ways:

Create a Task from the workgraph

Use graph builder.

Also discuss what's the upside and downside of the two approaches, or later in each section.

Have done something similar

superstar54 · 2024-09-18T16:56:10Z

docs/gallery/howto/autogen/graph_builder.py

+# For that use case we need to use the graph builder
+
+# Create a graph builder function


Remove this empty line, otherwise the format will be wrong.

superstar54 · 2024-09-18T18:33:28Z

docs/gallery/howto/autogen/graph_builder.py

+# Suppose we want a WorkGraph which includes another WorkGraph`(x+y)*z` inside it.
+# We can actually add a WorkGraph to another WorkGraph


This belongs to the Create a Task from the workgraph

It is at the beginning now

superstar54 · 2024-09-18T20:05:05Z

docs/gallery/howto/autogen/graph_builder.py

+# However linking the two WorkGraphs will not work
+
+wg = WorkGraph("nested_workgraph")
+add_multiply1 = wg.add_task(add_multiply(x=Int(2), y=Int(3), z=Int(4)))
+
+try:
+    wg.add_task(
+        add_multiply(x=add_multiply1.outputs["multiply.result"], y=Int(3), z=Int(4))
+    )
+except Exception as err:
+    print(err)


Actually, one can link the two WorkGraph tasks. But we need to write the code in a different way:

def add_multiply(x=None, y=None, z=None): wg = WorkGraph() wg.add_task(add, name="add", x=x, y=y) wg.add_task(multiply, name="multiply", x=z) wg.add_link(wg.tasks["add"].outputs[0], wg.tasks["multiply"].inputs["y"]) return wg wg = WorkGraph("nested_workgraph") add_multiply1 = wg.add_task(add_multiply(x=2, y=3, z=4), name="add_multiply1") add_multiply2 = wg.add_task(add_multiply(y=5, z=6), name="add_multiply2") wg.add_link(add_multiply1.outputs["multiply.result"], add_multiply2.inputs["add.x"]) wg.run()

The difference between the above code and a graph_builder task is that in graph_builder, the workgraph is dynamic and only created during execution, while the above workgraph is static, so that we can access the socket directly, e.g., add.x.
Both can used to create nested workgrpah, but graph_builder is a black box, which is the downside. The upside is that it allows dynamic workgraph generation.

Okay used that example and explained a bit

Hm.. it is not properly working

# define add task @task.calcfunction() def add(x, y): return x + y # define multiply task @task.calcfunction() def multiply(x, y): return x * y def add_multiply(x=None, y=None, z=None): wg = WorkGraph() wg.add_task(add, name="add", x=x, y=y) wg.add_task(multiply, name="multiply", x=z) wg.add_link(wg.tasks["add"].outputs[0], wg.tasks["multiply"].inputs["y"]) return wg wg = WorkGraph("nested_workgraph") # Creating a task from the WorkGraph add_multiply1 = wg.add_task(add_multiply(x=Int(2), y=Int(3), z=Int(4))) add_multiply2 = wg.add_task(add_multiply(x=Int(2), y=Int(3))) # link the output of a task to the input of another task wg.add_link(add_multiply1.outputs[0], add_multiply2.inputs["multiply.x"]) wg.to_html() # %% # Run the workgraph wg.run()

Gives

Error: Error in task multiply: Cannot convert value of type <class 'aiida.orm.utils.managers.NodeLinksManager'> to AiiDA type.

Okay now it works!

superstar54 · 2024-09-18T20:09:36Z

docs/gallery/howto/autogen/graph_builder.py

+# Create a Task from the workgraph (Experimental)
+# -----------------------------------------------


Since you mentioned this part at the beginning, so better to move this section before graph_builder.

Is now integrated into the first part

Moves the if task and while task examples into this subsection.

docs/gallery/howto/autogen/graph_builder.py

Co-authored-by: Xing Wang <xingwang1991@gmail.com>

superstar54

LGTM, thanks!

agoscinski marked this pull request as ready for review August 27, 2024 12:19

agoscinski force-pushed the create-docs-nested-workflows branch 2 times, most recently from 94e0a29 to b4ed841 Compare August 27, 2024 12:26

agoscinski force-pushed the create-docs-nested-workflows branch 3 times, most recently from 16e2070 to b059def Compare August 27, 2024 13:34

agoscinski requested a review from superstar54 August 27, 2024 13:50

superstar54 requested changes Aug 28, 2024

View reviewed changes

agoscinski force-pushed the create-docs-nested-workflows branch 3 times, most recently from fb9410e to c344a45 Compare August 28, 2024 14:00

agoscinski force-pushed the create-docs-nested-workflows branch from c344a45 to 7f4145c Compare August 29, 2024 11:49

agoscinski force-pushed the create-docs-nested-workflows branch from a57e316 to b3d0cad Compare August 29, 2024 12:23

agoscinski requested a review from superstar54 September 2, 2024 09:48

superstar54 requested changes Sep 3, 2024

View reviewed changes

agoscinski requested a review from superstar54 September 11, 2024 10:46

superstar54 requested changes Sep 18, 2024

View reviewed changes

agoscinski added 5 commits September 19, 2024 10:53

Create subsection "Nested workflow" in the docs

efc6f70

Moves the if task and while task examples into this subsection.

Change nested_workflows to dynamic_workflows

1937c42

Update gallery header from howtos

bd876b2

Change graph_builder example to focus more on nesting workflows

b5b8fff

Remove graph_builder.ipynb since it is generated by sphinx-gallery

d9de78d

agoscinski added 9 commits September 19, 2024 10:53

General edits to the graph builder example

84b1a81

Integrate dynamic notebook into nested

6a75f42

change back to run

f11529c

remove directory from conf.py

5fcff73

remove dynamic example from index.rst

9e49495

format example

9925cb1

fix typo, improve text

56479aa

fix bug in example

bbc861c

Restructure according to review

fe738a0

agoscinski force-pushed the create-docs-nested-workflows branch from 8e823f3 to fe738a0 Compare September 19, 2024 08:53

agoscinski added 2 commits September 19, 2024 11:41

put back parallel.ipynb

485049d

fix wg

56e53c0

agoscinski commented Sep 19, 2024

View reviewed changes