Allow setting seed for frequency offsets #881

ansoncfit · 2023-06-13T16:58:52Z

Addresses #714: with frequency-based routes, users are often distracted by spurious changes in access from random schedule differences between scenarios.

This PR allows "locking" schedules by setting the seed used for randomizing frequency offsets to a deterministic value for each origin (namely, a multiple of fromLat, as is done in the multi-criteria router: see FrequencyRandomOffsets). It also adds a field to the analysis request to activate this mode (see AnalysisRequest). With lockSchedules: true, a given origin will use the same frequency offsets across scenarios, so resulting changes in access should be systematic ones. This PR also includes other minor changes for related tests and documentation.

To test, fetch isochrones repeatedly along an infrequent frequency-based route. Without lockSchedules, the isochrone should "waver." With this PR and lockSchedules: true (or using build v6.9-7-ged8378e, which hardcodes the logic to avoid needing to start a backend server via https://github.com/conveyal/r5/tree/lock-lock-schedules), there should be no variability with repeatedly fetched isochrones.

In documentation, we should encourage users to run analysis without locking schedules first, to get a sense of variability. If certain corridors show large variability, they may want to use phasing. We've discussed other longer-term changes related to optimized schedules, but the approach proposed here is a minimally invasive one that addresses multiple user requests.

Addresses #714

trevorgerhardt · 2023-06-20T03:50:33Z

src/main/java/com/conveyal/r5/profile/FrequencyRandomOffsets.java


-    public FrequencyRandomOffsets(TransitLayer data) {
+    public FrequencyRandomOffsets(TransitLayer data, ProfileRequest request) {


Is the entire request necessary to be passed as a parameter? I would recommend passing either the seed itself or a MersenneTwister.

Assuming we want to allow locking the schedules at all I agree: it would be better to pass in only the seed to deter further coupling with the ProfileRequest. The seed should probably include both lat and lon to avoid correlation between all origins in a row across a grid.

I don't see a clear justification for changing the schedules from one origin to another, but locking them at the per-origin (or per-row) level. This would lead to apparently stable differences between neighboring origins that use the same routes to reach most destinations, which are likely to be misinterpreted.

The effects of knowingly deriving results from one single permutation of the schedules are clearer if that permutation is the same for all origins. In that case the seed could just be derived from the TransitLayer (e.g. its center point), and the parameter can just be a boolean for whether to do this.

abyrd · 2023-08-04T07:21:25Z

In light of recent discussions summarized at #714 (comment) do we still want to add a lockSchedules parameter or just use a custom/experimental worker build if this is ever needed? Have we been able to handle most cases where this was an issue by simply switching to exact-times or phased schedules, or increasing the number of MC draws?

Allow setting seed for frequency offsets

cf9a0af

Addresses #714

ansoncfit requested a review from abyrd June 13, 2023 16:58

trevorgerhardt reviewed Jun 20, 2023

View reviewed changes

ansoncfit marked this pull request as draft October 27, 2023 20:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow setting seed for frequency offsets #881

Allow setting seed for frequency offsets #881

ansoncfit commented Jun 13, 2023 •

edited

Loading

trevorgerhardt Jun 20, 2023

abyrd Aug 4, 2023

abyrd commented Aug 4, 2023


		public FrequencyRandomOffsets(TransitLayer data) {
		public FrequencyRandomOffsets(TransitLayer data, ProfileRequest request) {

Allow setting seed for frequency offsets #881

Are you sure you want to change the base?

Allow setting seed for frequency offsets #881

Conversation

ansoncfit commented Jun 13, 2023 • edited Loading

trevorgerhardt Jun 20, 2023

Choose a reason for hiding this comment

abyrd Aug 4, 2023

Choose a reason for hiding this comment

abyrd commented Aug 4, 2023

ansoncfit commented Jun 13, 2023 •

edited

Loading