Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to handle metadata issues #9

Open
faroit opened this issue Mar 15, 2019 · 3 comments
Open

How to handle metadata issues #9

faroit opened this issue Mar 15, 2019 · 3 comments
Assignees
Labels
help wanted Extra attention is needed

Comments

@faroit
Copy link
Contributor

faroit commented Mar 15, 2019

I guess finding missing links and metadata can be quite some work resulting in some discussion... Can we agree one issue per missing metadata? Maybe we make it more readable by introducing a few nice labels?

e.g.


Issue Title: ACM_MIRUM
label: urlerror
text: 403 erro

@faroit faroit added the help wanted Extra attention is needed label Mar 15, 2019
@ejhumphrey
Copy link
Collaborator

1 - great idea! it is a bunch of work, but once every error has an issue, and we're sure we've covered our bases, I'd be comfortable rattling some cages via the community mailing list. Folks are generally motivated to make their data accessible, since link rot can undermine their citation counts and such.

2 - what if we change broken URLs to point to the dataset's corresponding issue on github? 🤔then the general public could upvote or contribute on the most sought-after datasets.

3 - is there an easy way to at least semi-automate the process of creating this? I'm thinking maybe some light parsing of the build error logs to produce markdown in whatever format we decide for (a) easy + consistent copy-pasting into github, and (b) keeping track of which ones we've created? Could even collaborate through a google doc to share state?

@ejhumphrey
Copy link
Collaborator

gah, quickly realized (2) won't really work, because then that would break our URL testing ... but maybe a slightly smarter URL checker would do, e.g. fail if not found OR github issue at this repository.

@faroit
Copy link
Contributor Author

faroit commented Mar 15, 2019

i like your idea, lets do that.

concerning (3) we can use the github api from python to automatically create issues. I've done that before.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants