-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Investigating issues with parsing Flex feeds #1767
Comments
Based on the investigation on #1770 , it's the missing_required_field, invalid_url, and invalid_timezone that lead to validation errors and make a GTFS file unparsable. |
Moving @qcdyx findings from #1770 here:
We're only planning to modify the logic of |
For clarification, when an unparsable error is triggered, it only affects single file validators for the referred file. In this case only agency.txt validators are affected. |
What's the problem?
Out of the 4 Flex feeds that we have for testing purposes for #1721, 3 have failed to run through the validator without parsing issues.
I took a look at 51 Flex v2 feeds, including ones that don't conform to the official spec yet, for the sake of trying to better understand this problem. 50% fail to fully parse, and all but 1 of the feeds that failed have an issue with parsing stop_times.txt.
Outstanding questions
1KB, 12KB, 2.4MB
No major infra change needed. We need to remove errors like UNKNOWN_COLUMN from UNPARSABLE_ROWS. However, this might not be necessary because we are adding the Flex rules. Explore running validation on feeds with unknown_column notices #1770
This is a critical set of questions to answer before we pursue more work on #1721
The text was updated successfully, but these errors were encountered: