Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[distGB]fix the problem when graph has few nodes or edges in distributed partition #7824

Open
wants to merge 13 commits into
base: master
Choose a base branch
from

Conversation

CfromBU
Copy link
Collaborator

@CfromBU CfromBU commented Oct 14, 2024

this pr fix the issue related to #7793. When the number of nodes or edges is less than the number of partitions, partition can be created normally.

@dgl-bot
Copy link
Collaborator

dgl-bot commented Oct 14, 2024

To trigger regression tests:

  • @dgl-bot run [instance-type] [which tests] [compare-with-branch];
    For example: @dgl-bot run g4dn.4xlarge all dmlc/master or @dgl-bot run c5.9xlarge kernel,api dmlc/master

@dgl-bot
Copy link
Collaborator

dgl-bot commented Oct 14, 2024

Commit ID: 4efd6d40f22c07f8f5d9d4e7b753063b2ddda620

Build ID: 1

Status: ⚪️ CI test cancelled due to overrun.

Report path: link

Full logs path: link

@dgl-bot
Copy link
Collaborator

dgl-bot commented Oct 14, 2024

Commit ID: f63864cdaf4050310cb324dca1935cab44f930d1

Build ID: 2

Status: ⚪️ CI test cancelled due to overrun.

Report path: link

Full logs path: link

@dgl-bot
Copy link
Collaborator

dgl-bot commented Oct 14, 2024

Commit ID: c6a3096731f3f4e793761f6b17d0c502fcabd2bb

Build ID: 3

Status: ⚪️ CI test cancelled due to overrun.

Report path: link

Full logs path: link

@dgl-bot
Copy link
Collaborator

dgl-bot commented Oct 14, 2024

Commit ID: 156323d4700ca0de74d4f2a4a77b16c6628b3ddf

Build ID: 4

Status: ✅ CI test succeeded.

Report path: link

Full logs path: link

python/dgl/distributed/partition.py Outdated Show resolved Hide resolved
@@ -355,6 +356,33 @@ def _process_partition_gb(
return indptr, indices[sorted_idx], edge_ids[sorted_idx]


def update_node_map(node_map_val, end_ids_per_rank, id_ntypes, prev_last_id):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

copied from partition.py or modified accordingly? please add a comment to clarify it

next_table = pyarrow.Table.from_batches([next_chunk])
src_ids.append(next_table["f0"].to_numpy())
dst_ids.append(next_table["f1"].to_numpy())
if os.path.getsize(edge_file) != 0:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what if size == 0? add a comment here to clarify it.
if getsize() == 0: blabla... continue is better, I think.

@dgl-bot
Copy link
Collaborator

dgl-bot commented Oct 14, 2024

Commit ID: f46d64bb1b50c2426d220585156096d9c15472b5

Build ID: 5

Status: ✅ CI test succeeded.

Report path: link

Full logs path: link

@dgl-bot
Copy link
Collaborator

dgl-bot commented Oct 15, 2024

Commit ID: 7e2b3f76e5794fcd4351a8ced29fb85c3b7117ad

Build ID: 6

Status: ⚪️ CI test cancelled due to overrun.

Report path: link

Full logs path: link

@dgl-bot
Copy link
Collaborator

dgl-bot commented Oct 15, 2024

Commit ID: 7ed9d231cf251edb4386c6ac130d43f4fda38b46

Build ID: 7

Status: ⚪️ CI test cancelled due to overrun.

Report path: link

Full logs path: link

@dgl-bot
Copy link
Collaborator

dgl-bot commented Oct 15, 2024

Commit ID: 3b60ea39b49bdf735ffd4d39a9fcf75e098cac30

Build ID: 8

Status: ✅ CI test succeeded.

Report path: link

Full logs path: link

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Dist] check whether dist partition pipeline works with few num_nodes/edges
3 participants