[develop] Added an option for RRFS external model files used as ICS and LBCS #1089

Merged 25 commits into ufs-community:develop on Sep 12, 2024

@natalie-perlin natalie-perlin commented May 30, 2024

DESCRIPTION OF CHANGES:

(updated 09/06/2024)

  • An option has been added to use RRFS model output (control) files as initial and lateral boundary conditions (ICS and LBCS).
    RRFS_a data for the test were retrieved from the NODD website (https://registry.opendata.aws/noaa-rrfs/): pressure-level grib2 files from the control directory, containing RRFS forecasts interpolated onto a 3-km regular grid.

  • A new test, grid_RRFS_CONUScompact_25km_ics_RRFS_lbcs_RRFS_suite_RRFS_v1beta, has been added with RRFS input files for the 06/05/2024 event, when tornadoes were reported in Maryland.

RRFS data location:
https://noaa-rrfs-pds.s3.amazonaws.com/rrfs_a/rrfs_a.{yyyymmdd}/{hh}/control/
Files are named rrfs.t{hh}z.prslev.f{fcst_hr:03d}.conus.grib2,
where {yyyymmdd} is the 4-digit year, 2-digit month, and 2-digit day of the forecast cycle, {hh} is the 2-digit hour of the forecast cycle (forecast start), and {fcst_hr:03d} is the 3-digit forecast hour.
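
As a rough illustration of how the two templates above combine, here is a minimal Python sketch. The helper name `rrfs_url` and the constants are hypothetical and are not taken from the SRW App code; only the template strings come from this description.

```python
from datetime import datetime

# Templates copied from the description above.
BUCKET_DIR = "https://noaa-rrfs-pds.s3.amazonaws.com/rrfs_a/rrfs_a.{yyyymmdd}/{hh}/control/"
FILE_NAME = "rrfs.t{hh}z.prslev.f{fcst_hr:03d}.conus.grib2"


def rrfs_url(cycle: str, fcst_hr: int) -> str:
    """Return the URL of one RRFS_a pressure-level file.

    cycle is a YYYYMMDDHH string (the same form as DATE_FIRST_CYCL).
    rrfs_url is a hypothetical helper, not a function from the SRW App.
    """
    start = datetime.strptime(cycle, "%Y%m%d%H")  # also validates the cycle string
    fields = {
        "yyyymmdd": start.strftime("%Y%m%d"),
        "hh": start.strftime("%H"),
        "fcst_hr": fcst_hr,
    }
    # str.format ignores keyword arguments a template does not use.
    return BUCKET_DIR.format(**fields) + FILE_NAME.format(**fields)


print(rrfs_url("2024060517", 0))
```

For the test cycle 2024060517 and forecast hour 0, this should print the URL of rrfs.t17z.prslev.f000.conus.grib2 under rrfs_a.20240605/17/control/.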

The bucket can be browsed at:
https://noaa-rrfs-pds.s3.amazonaws.com/index.html#rrfs_a/

For this PR, the RRFS input data used are interpolated onto a regular 3-km grid; these files need the older sfc_data v1.
A newer UFS_UTILS version and tag will require sfc_data v2, which contains rotated u,v fields or fractional grids. This would allow use of full RRFS input files, i.e. on the native grid with no remapping onto regular grids; these files are ~6 GB each and also require higher versions of packages (g2) that are not present in spack-stack v1.5.1 or v1.6.0.

To work around the limitation of the currently used UFS_UTILS hash, which does not yet support RRFS ICS and LBCS, the scripts exregional_make_ics.sh and exregional_make_lbcs.sh treat the "RRFS" option the same way as "HRRR", i.e.:

"HRRR"|"RRFS")
  external_model="HRRR"
  input_type="grib2"
  ;;

The following needs to be added to the config.yaml file to use the RRFS ICS/LBCS option:

platform:
  EXTRN_MDL_DATA_STORES: aws
task_get_extrn_ics:
  EXTRN_MDL_NAME_ICS: RRFS
  USE_USER_STAGED_EXTRN_FILES: false
  FV3GFS_FILE_FMT_ICS: grib2
task_get_extrn_lbcs:
  EXTRN_MDL_NAME_LBCS: RRFS
  USE_USER_STAGED_EXTRN_FILES: true
  LBC_SPEC_INTVL_HRS: 1
  FV3GFS_FILE_FMT_LBCS: grib2

An example of the config.yaml file used for the test is shown below. RRFS data are accessed from AWS.

metadata:
  description: |-
    This test is to ensure that the workflow running in community mode
    completes successfully on the RRFS_CONUScompact_25km grid using the RRFS_v1beta
    physics suite. It uses RRFS forecasts mapped onto 3-km regular grid (rrfs*.conus.grib2) for
    ICs and LBCs. This test uses old v1 sfc_data, not the v2 fractional grid sfc_data.
user:
  RUN_ENVIR: community
platform:
  EXTRN_MDL_DATA_STORES: aws
workflow:
  CCPP_PHYS_SUITE: FV3_RRFS_v1beta
  PREDEF_GRID_NAME: RRFS_CONUScompact_25km
  DATE_FIRST_CYCL: '2024060517'
  DATE_LAST_CYCL: '2024060517'
  FCST_LEN_HRS: 3
  PREEXISTING_DIR_METHOD: rename
task_get_extrn_ics:
  EXTRN_MDL_NAME_ICS: RRFS
  FV3GFS_FILE_FMT_ICS: grib2
  USE_USER_STAGED_EXTRN_FILES: false
task_get_extrn_lbcs:
  EXTRN_MDL_NAME_LBCS: RRFS
  LBC_SPEC_INTVL_HRS: 1
  FV3GFS_FILE_FMT_LBCS: grib2
  USE_USER_STAGED_EXTRN_FILES: false
task_plot_allvars:
  COMOUT_REF: ""
global:
  DO_ENSEMBLE: true
  NUM_ENS_MEMBERS: 2
rocoto:
  tasks:
    taskgroups: '{{ ["parm/wflow/prep.yaml", "parm/wflow/coldstart.yaml", "parm/wflow/post.yaml", "parm/wflow/plot.yaml"]|include }}'

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

TESTS CONDUCTED:

Conducted a test for the RRFS_CONUScompact_25km grid with ICS and LBCS set to the "RRFS" option, running on the NOAA AWS cloud, in one-, two-, and three-member ensemble experiments.
A new test, config.grid_RRFS_CONUScompact_25km_ics_RRFS_lbcs_RRFS_suite_RRFS_v1beta, has been configured and can be launched on all platforms.
[Update, Sept. 2024:] RRFS ICS/LBCS input data for the test are retrieved from AWS.

These data are also staged in the standard EPIC project locations on NOAA Tier 1 systems:

NOAA Cloud: /contrib/EPIC/UFS_SRW_data/develop/input_model_data/RRFS/
Derecho: /glade/work/epicufsrt/contrib/UFS_SRW_data/develop/input_model_data/RRFS/
Hera: /scratch1/NCEPDEV/nems/role.epic/UFS_SRW_data/develop/input_model_data
Gaea: /gpfs/f5/epic/world-shared/UFS_SRW_data/develop/input_model_data/RRFS/
Jet: /mnt/lfs4/HFIP/hfv3gfs/role.epic/UFS_SRW_data/develop/input_model_data/RRFS/
Orion/Hercules: /work/noaa/epic/role-epic/contrib/UFS_SRW_data/develop/input_model_data/RRFS/

The test directory named after the forecast cycle date stamp, ./2024060517, has 10 files:
rrfs.t17z.prslev.f000.conus.grib2
rrfs.t17z.prslev.f001.conus.grib2
rrfs.t17z.prslev.f002.conus.grib2
rrfs.t17z.prslev.f003.conus.grib2
rrfs.t17z.prslev.f004.conus.grib2
rrfs.t17z.prslev.f005.conus.grib2
rrfs.t17z.prslev.f006.conus.grib2
rrfs.t17z.prslev.f007.conus.grib2
rrfs.t17z.prslev.f008.conus.grib2
rrfs.t17z.prslev.f009.conus.grib2
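
A quick way to check that a staged directory covers a given experiment is sketched below in Python. It assumes, as a simplification, that the workflow needs forecast hour 0 for ICs plus one file every LBC_SPEC_INTVL_HRS up to FCST_LEN_HRS for LBCs; the function `required_files` is hypothetical and not part of the SRW App.

```python
def required_files(cycle_hour, fcst_len_hrs, lbc_intvl_hrs):
    """List the file names assumed to be needed for one cycle.

    Assumption: hour 0 is used for ICs, and every lbc_intvl_hrs up to and
    including fcst_len_hrs is used for LBCs.
    """
    hours = range(0, fcst_len_hrs + 1, lbc_intvl_hrs)
    return [f"rrfs.t{cycle_hour:02d}z.prslev.f{h:03d}.conus.grib2" for h in hours]


# The 10 staged files listed above (f000 through f009).
staged = [f"rrfs.t17z.prslev.f{h:03d}.conus.grib2" for h in range(10)]

# Settings from the test configuration: FCST_LEN_HRS: 3, LBC_SPEC_INTVL_HRS: 1.
needed = required_files(17, fcst_len_hrs=3, lbc_intvl_hrs=1)
missing = sorted(set(needed) - set(staged))

print(needed)
print("missing:", missing)
```

Under that assumption the 3-hour test needs only f000 through f003, so the staged set leaves room for longer forecasts of up to 9 hours.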

  • hera.intel
  • orion.intel
  • hercules.intel
  • cheyenne.intel
  • cheyenne.gnu
  • derecho.intel
  • gaea.intel
  • gaeac5.intel
  • jet.intel
  • wcoss2.intel
  • NOAA Cloud (AWS)
  • Jenkins
  • fundamental test suite
  • comprehensive tests (specify which if a subset was used)

DEPENDENCIES:

DOCUMENTATION:

A new option for "RRFS" used as ICS and LBCS may need to be documented.

ISSUE:

In preparation for RRFS integration tasks, an option to use "RRFS" model files as ICS and LBCS was added.

CHECKLIST

  • My code follows the style guidelines in the Contributor's Guide
  • I have performed a self-review of my own code using the Code Reviewer's Guide
  • I have commented my code, particularly in hard-to-understand areas
  • My changes need updates to the documentation. I have made corresponding changes to the documentation
  • My changes do not require updates to the documentation (explain).
  • My changes generate no new warnings
  • New and existing tests pass with my changes
  • Any dependent changes have been merged and published

LABELS (optional):

A Code Manager needs to add the following labels to this PR:

  • Work In Progress
  • bug
  • enhancement
  • documentation
  • release
  • high priority
  • run_ci
  • run_we2e_fundamental_tests
  • run_we2e_comprehensive_tests
  • Needs Cheyenne test
  • Needs Jet test
  • Needs Hera test
  • Needs Orion test
  • help wanted

CONTRIBUTORS (optional):

@christinaholtNOAA

@MichaelLueken
Collaborator

@natalie-perlin -

I'll move this work to On-Hold until a WE2E test has been added so that we can properly test this new functionality.

@MichaelLueken added the enhancement and DO_NOT_MERGE labels May 30, 2024
@natalie-perlin
Collaborator Author

natalie-perlin commented May 31, 2024

NB: @MichaelLueken - this PR requires the "RRFS" option to be allowed in UFS_UTILS. The current develop branch of ufs-community/UFS_UTILS does have the "RRFS" option enabled, but the version checked out by the SRW does not (a correction needs to be made to allow it).

How should we proceed with this requirement?

@natalie-perlin
Collaborator Author

A newer UFS_UTILS tag that has this option implemented could be used.

@MichaelLueken
Collaborator

@natalie-perlin -

I'll check if updating the version of UFS_UTILS will work in the SRW App. The commit in UFS_UTILS following what is currently in the SRW App's External.cfg file causes the weather model to fail (the weather model is expecting sheleg, while chgres_cube is generating sheleg_ice and sheleg_land, leading to the previously mentioned failure).

I'll go ahead and try updating the UFS_UTILS version to the latest version and see if it works. If it does, then we can move forward with this update. However, if it continues to fail, I will need to open an issue in the UFS_UTILS repository to let them know about the continued failures and see what can be done.

@MichaelLueken
Collaborator

@natalie-perlin -

What version of UFS_UTILS contains the necessary fix so that we can exercise the use of RRFS ICs/LBCs in the SRW App? I can try to update to that version and see what issues appear.

@MichaelLueken
Collaborator

It looks like UFS_UTILS PR #902 includes the necessary changes for chgres_cube to work with RRFS. I'll try a later version of the UFS_UTILS repository, then this one, to see if either will work.

@natalie-perlin
Collaborator Author

> It looks like UFS_UTILS PR #902 includes the necessary changes for chgres_cube to work with RRFS. I'll try a later version of the UFS_UTILS repository, then this one, to see if either will work.

Yes - thank you!! I was having trouble finding the exact time/version when this change was implemented!
The changes required to allow the RRFS option had to be made in two locations in the UFS_UTILS repository, in ./sorc/chgres_cube.fd/program_setup.F90: line 57 and lines 321-322. It looks like the PR you mentioned addresses that:
https://github.com/ufs-community/UFS_UTILS/pull/902/files#diff-6b6d24e7712144952ef83ca8f5e9d56e164fdcab1f7faab27812e91bfd483ba2

@MichaelLueken
Collaborator

@natalie-perlin -

Using the version of UFS_UTILS associated with PR #902 is causing a failure in the fundamental tests:

----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used 
----------------------------------------------------------------------------------------------------
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta_2  COMPLETE               9.31
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v15p2_20240  COMPLETE               8.22
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v17_p8_plot  COMPLETE              15.17
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_HRRR_suite_HRRR_2024060  DEAD                   5.71
grid_SUBCONUS_Ind_3km_ics_HRRR_lbcs_RAP_suite_WoFS_v0_20240603165  COMPLETE              23.23
grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16_2024060316510  COMPLETE              20.14
----------------------------------------------------------------------------------------------------
Total                                                              DEAD                  81.78

The grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_HRRR_suite_HRRR WE2E test failed in run_fcst_mem000 with the following error message:

FATAL from PE 0: NetCDF: Start+count exceeds dimension bound: netcdf_read_data_3d: file:INPUT/sfc_data.nc- variable:tiice

I'll try backing my way through the commits in the UFS_UTILS repository to see which entry is causing issues with tiice.

@natalie-perlin
Collaborator Author

@MichaelLueken - thank you for testing!! Let me look into these errors - they look like a data problem. I might need to stage an additional directory in the EPIC space with data that I thought was not needed... will get back to you!

@natalie-perlin
Collaborator Author

@MichaelLueken - What is the location of your test? I was not able to reproduce this error.

However, I'm also replacing the explicit format statement for RRFS in the config.yaml file with the format given in the code, and making some more changes for this PR that are not yet on GitHub.

Natalie Perlin and others added 2 commits September 7, 2024 00:01
Updating the documentations files per Gillian Petro's suggestions

Co-authored-by: Gillian Petro <96886803+gspetro-NOAA@users.noreply.github.com>
@natalie-perlin
Collaborator Author

@MichaelLueken @mkavulich -
After following the suggestion to rebase the code onto develop, I've also squashed some commits. This reduced the number of changed files to 19 (actually 18) and the number of introduced commits to ~20.
I will re-run the new test and fundamental tests on Orion and AWS to be sure it has been fully cleaned up.

@natalie-perlin
Collaborator Author

A new experiment has successfully completed using the updated code on Orion:

All 1 experiments finished
Calculating core-hour usage and printing final summary
----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used 
----------------------------------------------------------------------------------------------------
grid_RRFS_CONUScompact_25km_ics_RRFS_lbcs_RRFS_suite_RRFS_v1beta_  COMPLETE              51.86
----------------------------------------------------------------------------------------------------
Total                                                              COMPLETE              51.86

Detailed summary written to /work/noaa/epic/nperlin/SRW/expt_dirs/WE2E_summary_20240907004905.txt

All the fundamental tests passed on NOAA AWS (Centos7). One of the tests initially died due to insufficient run time for the "post" tasks; it completed successfully after the run time was increased for these tasks:

Took 1:46:31.913582; will no longer monitor.
All 6 experiments finished
Calculating core-hour usage and printing final summary
----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used 
----------------------------------------------------------------------------------------------------
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta_2  DEAD                 124.38
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v15p2_20240  COMPLETE              17.74
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v17_p8_plot  COMPLETE             101.85
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_HRRR_suite_HRRR_2024090  COMPLETE             321.66
grid_SUBCONUS_Ind_3km_ics_HRRR_lbcs_RAP_suite_WoFS_v0_20240907034  COMPLETE             111.42
grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16_2024090703460  COMPLETE             181.12
----------------------------------------------------------------------------------------------------
Total                                                              DEAD                 858.17

Detailed summary written to /contrib/Natalie.Perlin/SRW/expt_dirs/WE2E_summary_20240907053232.txt

After increasing the run time:

(srw_app) [Natalie.Perlin@NOAA-AWS:/contrib/Natalie.Perlin/SRW/expt_dirs/grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta]$ rocotostat -w FV3LAM_wflow.xml -d FV3LAM_wflow.db -v 10
       CYCLE                    TASK                       JOBID               STATE         EXIT STATUS     TRIES      DURATION
================================================================================================================================
202008100000               make_grid                         197           SUCCEEDED                   0         1          46.0
202008100000               make_orog                         223           SUCCEEDED                   0         1         339.0
202008100000          make_sfc_climo                         229           SUCCEEDED                   0         1         109.0
202008100000           get_extrn_ics                         198           SUCCEEDED                   0         1          39.0
202008100000          get_extrn_lbcs                         199           SUCCEEDED                   0         1          39.0
202008100000         make_ics_mem000                         238           SUCCEEDED                   0         1         254.0
202008100000        make_lbcs_mem000                         239           SUCCEEDED                   0         1         418.0
202008100000         run_fcst_mem000                         251           SUCCEEDED                   0         1        1430.0
202008100000    run_post_mem000_f000                         396           SUCCEEDED                   0         1        1956.0
202008100000    run_post_mem000_f001                         397           SUCCEEDED                   0         1        1970.0
202008100000    run_post_mem000_f002                         398           SUCCEEDED                   0         1        1970.0
202008100000    run_post_mem000_f003                         399           SUCCEEDED                   0         1        1944.0
202008100000    integration_test_mem000                         273           SUCCEEDED                   0         1          16.0
(srw_app) [Natalie.Perlin@NOAA-AWS:/contrib/Natalie.Perlin/SRW/expt_dirs/grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta]$

Launching fundamental tests on Orion now

@natalie-perlin
Collaborator Author

Orion fundamental tests have completed successfully:

All 6 experiments finished
Calculating core-hour usage and printing final summary
----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used 
----------------------------------------------------------------------------------------------------
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta_2  COMPLETE              17.41
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v15p2_20240  COMPLETE              12.40
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v17_p8_plot  COMPLETE              41.46
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_HRRR_suite_HRRR_2024090  COMPLETE              63.53
grid_SUBCONUS_Ind_3km_ics_HRRR_lbcs_RAP_suite_WoFS_v0_20240909091  COMPLETE              38.75
grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16_2024090909101  COMPLETE              81.91
----------------------------------------------------------------------------------------------------
Total                                                              COMPLETE             255.46

Detailed summary written to /work/noaa/epic/nperlin/SRW/expt_dirs/WE2E_summary_20240909101243.txt

doc/UsersGuide/BuildingRunningTesting/RunSRW.rst (outdated)
* Obtained and staged by the user from a different source.
* Pulled from the NOAA High Performance Storage System (:term:`HPSS`) during the workflow execution (requires user access),
* Obtained and staged by the user from a different source, or
* Pulled from the `RRFS data bucket (rrfs_a data) <https://registry.opendata.aws/noaa-rrfs/>`_.
Collaborator

I think this section is more general and the mention of RRFS specifically should not go here.

Collaborator Author

This is included specifically because the new test case for the RRFS is not staged in standard locations as many other WE2E experiments are (data pulled from the SRW App Data Bucket). It is also a ready reference for obtaining readily available initialization data besides the HPSS or other sources ("Obtained and staged by the user from a different source"), for users external to NOAA who may not be aware of actual data sources.
The question of where to get the data to initialize the UFS/SRW model is one of the most frequently asked questions (sometimes sent directly to Maoyi), so it looks like including this info in the "Input and Output" section would be very beneficial to users.

Collaborator

I think this still needs to be more generic. There are several different models that can be pulled by SRW from the NODD buckets. It is not just RRFS.

doc/UsersGuide/Reference/Glossary.rst (outdated)
Collaborator

@MichaelLueken MichaelLueken left a comment

@natalie-perlin -

It looks like data have been removed from the https://registry.opendata.aws/noaa-rrfs/ bucket for 20240605; it now only has data for the 00, 06, 12, and 18 cycle hours. This is causing the new WE2E test to fail:

INFO: Retrieved files will be placed here:
 /work/noaa/epic/mlueken/ufs-srweather-app/expt_dirs/grid_RRFS_CONUScompact_25km_ics_RRFS_lbcs_RRFS_suite_RRFS_v1beta/2024060517/RRFS/for_ICS/mem001

DEBUG: Looking for fhr = 0

DEBUG: Looking for files like ['rrfs.t{hh}z.prslev.f{fcst_hr:03d}.conus.grib2']

DEBUG: They should be here: https://noaa-rrfs-pds.s3.amazonaws.com/rrfs_a/rrfs_a.{yyyymmdd}/{hh}/control/

INFO: Getting file: https://noaa-rrfs-pds.s3.amazonaws.com/rrfs_a/rrfs_a.20240605/17/control/rrfs.t17z.prslev.f000.conus.grib2

DEBUG: Target path: /work/noaa/epic/mlueken/ufs-srweather-app/expt_dirs/grid_RRFS_CONUScompact_25km_ics_RRFS_lbcs_RRFS_suite_RRFS_v1beta/2024060517/RRFS/for_ICS/mem001

DEBUG: Running command:
 wget -q -c -T 15 -t 2 https://noaa-rrfs-pds.s3.amazonaws.com/rrfs_a/rrfs_a.20240605/17/control/rrfs.t17z.prslev.f000.conus.grib2

INFO: Command 'wget -q -c -T 15 -t 2 https://noaa-rrfs-pds.s3.amazonaws.com/rrfs_a/rrfs_a.20240605/17/control/rrfs.t17z.prslev.f000.conus.grib2' returned non-zero exit status 8.

DEBUG: Retrieved status: False

DEBUG: Some files were not retrieved: ['https://noaa-rrfs-pds.s3.amazonaws.com/rrfs_a/rrfs_a.20240605/17/control/rrfs.t17z.prslev.f000.conus.grib2']

If the data have been staged, please change the new test so that it pulls the data staged on disk. If the data haven't been staged, please update DATE_FIRST_CYCL and DATE_LAST_CYCL to use one of the cycle hours still available for the date.
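
For the second option, one possible way to pick a replacement cycle is sketched below in Python. It assumes the bucket keeps only the 00, 06, 12, and 18 cycles (as observed above for 20240605, and assumed here for earlier days as well) and that the most recent retained cycle at or before the requested one is acceptable; `latest_available_cycle` is a hypothetical helper, not SRW App code.

```python
from datetime import datetime, timedelta

# Cycle hours reported above as still present in the bucket for 20240605.
AVAILABLE_HOURS = (0, 6, 12, 18)


def latest_available_cycle(cycle, available_hours=AVAILABLE_HOURS):
    """Return the latest YYYYMMDDHH cycle at or before `cycle` whose hour is retained.

    Assumption: the same set of hours is retained on the preceding day too.
    """
    start = datetime.strptime(cycle, "%Y%m%d%H")
    for back in range(24):  # step back one hour at a time, at most one day
        candidate = start - timedelta(hours=back)
        if candidate.hour in available_hours:
            return candidate.strftime("%Y%m%d%H")
    raise ValueError("no retained cycle hour within the previous 24 hours")


print(latest_available_cycle("2024060517"))
```

For the failing 2024060517 cycle this would suggest 2024060512. Whether that cycle still captures the event of interest is a separate question that the sketch does not address.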

natalie-perlin and others added 2 commits September 9, 2024 17:47
Updating a description for RRFS data available from the AWS S3 NOAA-RRFS bucket

Co-authored-by: Christina Holt <56881914+christinaholtNOAA@users.noreply.github.com>
…S bucket; use staged data for the new test with RRFS as ICS/LBCS
@natalie-perlin
Collaborator Author

After all the updates, a new test grid_RRFS_CONUScompact_25km_ics_RRFS_lbcs_RRFS_suite_RRFS_v1beta has successfully completed on Orion:

(srw_app) [nperlin@Orion:/work/noaa/epic/nperlin/SRW/expt_dirs/grid_RRFS_CONUScompact_25km_ics_RRFS_lbcs_RRFS_suite_RRFS_v1beta]$  rocotostat -w FV3LAM_wflow.xml -d FV3LAM_wflow.db -v 10
       CYCLE                    TASK                       JOBID               STATE         EXIT STATUS     TRIES      DURATION
================================================================================================================================
202406051700               make_grid                    18610470           SUCCEEDED                   0         1          61.0
202406051700               make_orog                    18610479           SUCCEEDED                   0         1          72.0
202406051700          make_sfc_climo                    18610489           SUCCEEDED                   0         1          93.0
202406051700           get_extrn_ics                    18610471           SUCCEEDED                   0         1          51.0
202406051700          get_extrn_lbcs                    18610472           SUCCEEDED                   0         1          48.0
202406051700         make_ics_mem000                    18610499           SUCCEEDED                   0         1         115.0
202406051700        make_lbcs_mem000                    18610500           SUCCEEDED                   0         1         199.0
202406051700         run_fcst_mem000                    18610525           SUCCEEDED                   0         1         861.0
202406051700    run_post_mem000_f000                    18610540           SUCCEEDED                   0         1          41.0
202406051700    run_post_mem000_f001                    18610548           SUCCEEDED                   0         1          61.0
202406051700    run_post_mem000_f002                    18610566           SUCCEEDED                   0         1          60.0
202406051700    run_post_mem000_f003                    18610567           SUCCEEDED                   0         1          60.0
202406051700    plot_allvars_mem000_f000                    18610570           SUCCEEDED                   0         1         295.0
202406051700    plot_allvars_mem000_f001                    18610571           SUCCEEDED                   0         1         298.0
202406051700    plot_allvars_mem000_f002                    18610572           SUCCEEDED                   0         1         295.0
202406051700    plot_allvars_mem000_f003                    18610573           SUCCEEDED                   0         1         295.0
(srw_app) [nperlin@Orion:/work/noaa/epic/nperlin/SRW/expt_dirs/grid_RRFS_CONUScompact_25km_ics_RRFS_lbcs_RRFS_suite_RRFS_v1beta]$

Collaborator

@christinaholtNOAA christinaholtNOAA left a comment

Just one small additional request below. I think this is in good shape for the new feature. Thanks for handling the many moving parts to add the capability.

Collaborator

@MichaelLueken MichaelLueken left a comment

@natalie-perlin -

Thank you so much for working through the concerns I had with the changes in the PR!

The fundamental WE2E tests successfully passed on Orion:

----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used 
----------------------------------------------------------------------------------------------------
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta_2  COMPLETE              16.47
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v15p2_20240  COMPLETE              11.77
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v17_p8_plot  COMPLETE              42.97
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_HRRR_suite_HRRR_2024091  COMPLETE              63.05
grid_SUBCONUS_Ind_3km_ics_HRRR_lbcs_RAP_suite_WoFS_v0_20240910084  COMPLETE              40.20
grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16_2024091008425  COMPLETE              76.82
----------------------------------------------------------------------------------------------------
Total                                                              COMPLETE             251.28

as well as the Orion coverage WE2E tests:

----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used
----------------------------------------------------------------------------------------------------
custom_ESGgrid_SF_1p1km_20240910102634                             COMPLETE             469.23
deactivate_tasks_20240910102636                                    COMPLETE               2.12
get_from_AWS_ics_GEFS_lbcs_GEFS_fmt_grib2_2022040400_ensemble_2me  COMPLETE            1984.34
grid_CONUS_3km_GFDLgrid_ics_FV3GFS_lbcs_FV3GFS_suite_RRFS_v1beta_  COMPLETE            1107.76
grid_RRFS_AK_13km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v16_plot_20240  COMPLETE             418.20
grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_RRFS_v1beta_202409101  COMPLETE              28.81
grid_RRFS_CONUS_3km_ics_FV3GFS_lbcs_FV3GFS_suite_HRRR_20240910102  COMPLETE            1028.28
grid_RRFS_CONUScompact_25km_ics_RRFS_lbcs_RRFS_suite_RRFS_v1beta_  COMPLETE              24.09
grid_RRFS_CONUScompact_13km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v16_  COMPLETE              73.74
grid_RRFS_CONUScompact_3km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v16_2  COMPLETE             808.48
grid_SUBCONUS_Ind_3km_ics_FV3GFS_lbcs_FV3GFS_suite_WoFS_v0_202409  COMPLETE              75.64
2020_CAD_20240910102655                                            COMPLETE              83.09
----------------------------------------------------------------------------------------------------
Total                                                              COMPLETE            6103.78

The new RRFS WE2E test successfully passes.

Approving PR now.

@MichaelLueken added the run_we2e_coverage_tests label Sep 10, 2024
@MichaelLueken
Collaborator

@natalie-perlin -

The tests have been submitted, but I can relaunch the Orion tests and they should finish overnight along with the rest of the automated Jenkins testing. Please go ahead and remove the global: section in the new WE2E test configuration.

…FS_v1beta.yaml

Removing settings for "global" including ensemble size, from  the new test with RRFS as ICS/LBCS
@MichaelLueken
Collaborator

The Derecho and Jet coverage WE2E test suites were closed for running beyond the 8 hour time limit. These coverage tests are now being run manually on both machines. The rest of the Jenkins tests successfully passed.

Once the manual runs on Derecho and Jet are complete, this PR will get merged to develop.

@MichaelLueken
Collaborator

The WE2E coverage tests successfully completed on Derecho:

----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used 
----------------------------------------------------------------------------------------------------
custom_ESGgrid_IndianOcean_6km_20240911072005                      COMPLETE              39.99
grid_RRFS_CONUS_13km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v16_plot_20  COMPLETE              77.84
grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16_2024091107200  COMPLETE              87.18
grid_RRFS_CONUScompact_13km_ics_HRRR_lbcs_RAP_suite_HRRR_20240911  COMPLETE              56.39
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta_2  COMPLETE              26.35
grid_SUBCONUS_Ind_3km_ics_HRRR_lbcs_HRRR_suite_HRRR_2024091107201  COMPLETE              57.39
pregen_grid_orog_sfc_climo_20240911072013                          COMPLETE              24.91
specify_template_filenames_20240911072015                          COMPLETE              26.35
2019_hurricane_barry_20240911072016                                COMPLETE              72.66
2019_memorial_day_heat_wave_20240911072018                         COMPLETE              69.24
2020_denver_radiation_inversion_20240911072019                     COMPLETE              68.96
----------------------------------------------------------------------------------------------------
Total                                                              COMPLETE             607.26

and Jet:

----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used
----------------------------------------------------------------------------------------------------
community_20240911135724                                           COMPLETE              24.32
custom_ESGgrid_20240911135725                                      COMPLETE              30.62
custom_ESGgrid_Great_Lakes_snow_8km_20240911135726                 COMPLETE              26.56
custom_GFDLgrid_20240911135728                                     COMPLETE              14.91
get_from_HPSS_ics_FV3GFS_lbcs_FV3GFS_fmt_nemsio_2021032018_202409  COMPLETE              13.85
get_from_HPSS_ics_FV3GFS_lbcs_FV3GFS_fmt_netcdf_2022060112_48h_20  COMPLETE             104.05
get_from_HPSS_ics_RAP_lbcs_RAP_20240911135731                      COMPLETE              20.61
grid_RRFS_AK_3km_ics_FV3GFS_lbcs_FV3GFS_suite_HRRR_20240911135732  COMPLETE             659.92
grid_RRFS_CONUS_13km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v16_plot_20  COMPLETE              84.44
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v15p2_20240  COMPLETE               8.67
grid_RRFS_CONUS_3km_ics_FV3GFS_lbcs_FV3GFS_suite_RRFS_v1beta_2024  COMPLETE             981.33
----------------------------------------------------------------------------------------------------
Total                                                              COMPLETE            1969.28

Moving forward with merging this work now.

@MichaelLueken MichaelLueken merged commit 26cdad8 into ufs-community:develop Sep 12, 2024
4 of 5 checks passed