Invalid predicates #103

Open
kevinschaper opened this issue May 19, 2023 · 2 comments

Comments

@kevinschaper
Collaborator

I'm stripping off invalid predicates in my phenio filtering in monarch-ingest, and found 5128244 edges with predicates that aren't in the Biolink Model. The predicates are: 'biolink:category', 'biolink:inverseOf', 'biolink:subPropertyOf'.

I'm not sure whether these are edges that we want to keep, in which case we need to fix the predicates, or whether they are just properties that stayed as edges in the conversion.
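For anyone wanting to reproduce this kind of tally, here is a minimal sketch. It is not the monarch-ingest code: the edge dictionaries, the `VALID_PREDICATES` set, and the function name are all assumptions made for illustration, and a real check would load the predicate list from the Biolink Model itself.

```python
from collections import Counter

# Hypothetical stand-in for the predicates defined in the Biolink Model.
VALID_PREDICATES = {"biolink:subclass_of", "biolink:related_to"}


def count_invalid_predicates(edges, valid_predicates):
    """Count edges per predicate for predicates not in valid_predicates."""
    return Counter(
        edge["predicate"]
        for edge in edges
        if edge["predicate"] not in valid_predicates
    )


# Toy edges with made-up identifiers, only to exercise the function.
edges = [
    {"subject": "EX:1", "predicate": "biolink:subclass_of", "object": "EX:2"},
    {"subject": "EX:1", "predicate": "biolink:category", "object": "EX:3"},
    {"subject": "EX:2", "predicate": "biolink:category", "object": "EX:3"},
    {"subject": "EX:4", "predicate": "biolink:subPropertyOf", "object": "EX:5"},
]

invalid_counts = count_invalid_predicates(edges, VALID_PREDICATES)
for predicate, count in invalid_counts.most_common():
    print(predicate, count)
```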

@matentzn

What do you mean by "valid"?

category exists: https://github.com/biolink/biolink-model/blob/master/biolink-model.yaml#L507

We should:

  • not filter out every edge that has an invalid predicate, but keep edges that have a valid one (assuming that one edge can have more than one predicate)
  • figure out what the deal is with subPropertyOf, which is probably added by KGX and should probably be added to biolink (instead of this horrible mapping)
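The first suggestion could look roughly like the sketch below. It rests on the same assumption stated in the bullet, that one edge can carry several predicates, and represents that as a hypothetical `predicates` list on each edge; the function name and data are made up, and retained edges are left unmodified because the thread does not say whether their invalid predicates should be stripped.

```python
def keep_edges_with_valid_predicate(edges, valid_predicates):
    """Keep an edge when at least one of its predicates is valid."""
    return [
        edge
        for edge in edges
        if any(p in valid_predicates for p in edge["predicates"])
    ]


# Hypothetical valid set and toy edges with made-up identifiers.
valid = {"biolink:subclass_of"}
multi_edges = [
    {"subject": "EX:1", "object": "EX:2",
     "predicates": ["biolink:subclass_of", "biolink:subPropertyOf"]},
    {"subject": "EX:3", "object": "EX:4",
     "predicates": ["biolink:inverseOf"]},
]

kept = keep_edges_with_valid_predicate(multi_edges, valid)
print(len(kept))
```

The difference from filtering on invalid predicates is the direction of the test: an edge is dropped only when none of its predicates is valid, rather than as soon as one of them is invalid.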

@caufieldjh
Contributor

The majority (maybe even all!) of those 'biolink:category' edges are leftovers from category mapping in the phenio build and serve no purpose in the final graph. They're fully redundant when the categories are also present as node properties. I've added a step to remove these edges during KG-Phenio assembly but, embarrassingly, that step doesn't work correctly.
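As a rough illustration of that redundancy check, and not the actual KG-Phenio assembly step, the sketch below drops a 'biolink:category' edge only when its object is already listed in the subject node's category property. The node and edge dictionary shapes, the function name, and the identifiers are assumptions.

```python
def drop_redundant_category_edges(nodes, edges):
    """Drop category edges already covered by the subject's node property."""
    # Map each node id to the set of categories stored on the node.
    categories = {node["id"]: set(node.get("category", [])) for node in nodes}
    return [
        edge
        for edge in edges
        if not (
            edge["predicate"] == "biolink:category"
            and edge["object"] in categories.get(edge["subject"], set())
        )
    ]


# Toy graph with made-up identifiers.
toy_nodes = [
    {"id": "EX:1", "category": ["biolink:PhenotypicFeature"]},
    {"id": "EX:2", "category": []},
]
toy_edges = [
    # Redundant: the category is already a property of EX:1.
    {"subject": "EX:1", "predicate": "biolink:category",
     "object": "biolink:PhenotypicFeature"},
    # Not redundant: EX:2 has no category property, so the edge is kept.
    {"subject": "EX:2", "predicate": "biolink:category",
     "object": "biolink:Gene"},
    {"subject": "EX:2", "predicate": "biolink:subclass_of", "object": "EX:1"},
]

cleaned = drop_redundant_category_edges(toy_nodes, toy_edges)
print(len(cleaned))
```

Keeping the category edges that are not mirrored on a node is a deliberately cautious choice here; dropping every 'biolink:category' edge would be simpler if they really are all leftovers.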
