An official website of the United States government
blue sky with white clouds

World’s premier ground-based observations facility advancing atmospheric research

Computing Resources

The Atmospheric Radiation Measurement (ARM) Data Center offers computing infrastructure to support the ARM User Facility's next-generation atmospheric model simulations and petascale data storage, and to provide capabilities for users to conduct big-data analytics and machine learning for atmospheric research.

The ARM Data Center provides a co-located data and computing platform that enables users to work with large volumes of ARM data without the need to download them. ARM’s heterogeneous and flexible computing architecture provides resources for conventional physics-based model simulations for data analysis and machine learning.

Users with an active ARM account can request access to ARM’s high performance computing (HPC) facility that integrates a range of computer resources and storage systems.

Open-Source Programming

Scientists working with ARM data are encouraged to share experiences and codes and are invited to engage in community software development and make use of the ARM GitHub account—especially for routine processing activities—to accelerate the collective application of these data.

Coding guidelines have been developed to help streamline the process of integrating principal investigator (PI)-developed algorithms with ARM processes and libraries. The ARM development team has also created an open-source tool called the ARM Data Integrator (ADI) to integrate and transform multiple diverse datastreams and facilitate the creation of datastreams that meet ARM standards.

Other available open-source toolkits include the Python ARM Radar Toolkit (Py-ART) and the Atmospheric data Community Toolkit (ACT). Py-ART, which combines a variety of utilities for processing ARM radar data, is the first ARM open-source project. ACT, a toolkit for working with atmospheric time-series data sets of varying dimensions, has functions for every part of the scientific process: discovery, input/output, quality control, corrections, retrievals, visualization, and analysis. ACT is a community platform for sharing code with the goal of reducing duplication of effort and better connecting the science community and ARM infrastructure.


Scott Collis, Argonne National Lab, led the first Py-ART short course during the 2014 Joint User Principal Investigator Meeting. This course covers an intro to radar data and what it looks like in Py-ART, plotting RHIs, and basic analysis. It is an at-home exercise.

Justin Monroe, from the ARM Data Quality Office at the University of Oklahoma, led a tutorial on ARM netCDF files and how to navigate the ARM Python notebook during the 2014 Joint User Principal Investigator Meeting.

JupyterHub and the ARM Data Workbench

JupyterHub is a popular tool for supporting scientific analysis through notebook-based computational environments. Hosted by the ARM Data Center, JupyterHub provides scalable computing infrastructure designed to meet the needs of a variety of users.

JupyterHub functions as part of the ARM Data Workbench, a new ecosystem for interacting with ARM data. Currently in development, the workbench will provide a set of tools for users to select data, retrieve measurement values, visualize data, perform data analysis, and even create their own data bundles.

Any user with an active ARM account can explore JupyterHub and ARM Data Workbench capabilities. In order to gain access to parallel processing capabilities, save files past a single session, and stage data to JupyterHub from Data Discovery, users must request elevated  access.

Cumulus Cluster

ARM’s Cumulus HPC cluster provides programming ability in ADI, Python, MATLAB, and IDL, as well as specialized software such as Py-ART.

Cumulus is a midrange Dell system that consists of 16,384 processing cores with a 4-petabyte General Parallel File System (GPFS). It is used by ARM infrastructure staff for Large-Eddy Simulation (LES) ARM Symbiotic Simulation and Observation (LASSO) development and operation, radar data processing, large-scale reprocessing, value-added product generation, and data quality analysis.

The Cumulus cluster is also available to users with an active ARM account to conduct ARM-approved computationally intensive science projects that involve working with large volumes of ARM data, apply computationally intensive codes to ARM data sets, or analyze LASSO output data sets. Use of the Cumulus cluster for ARM user science projects requires submission of a short proposal to ARM.  

Propose to Use the Cumulus Cluster

The Cumulus cluster is available to ARM users for their high-performance computing needs. Users can request to access this resource.
The Cumulus cluster is available to ARM users for their high performance computing needs. Users can request to access this resource.

ARM users may request use of the Cumulus cluster by submitting a high performance computing request (HPCR), a short proposal describing the activities that will make use of the Cumulus cluster. All investigators who will need access to the computing resources for a given project must be included as a principal investigator (PI) or co-PI on the request. All investigators must have active ARM user accounts prior to submitting their HPCR proposal. Proposals to use the cluster cannot be submitted until accounts have been created for all PIs and co-PIs.

Proposals to use the ARM HPC resources should focus on science activities that meet one or more of the following criteria:

  • involve large volumes of ARM data, which would be prohibitive to download to other computer systems
  • require parallel processing using computationally intensive code applied to ARM data sets
  • analyze voluminous LASSO outputs.

Unless explicitly requested through a special call for proposals, requests will not be considered that primarily involve running dynamical model simulations, without meeting one or more of the above criteria.

Proposals for the use of Cumulus will be reviewed monthly by the ARM Infrastructure Management Board (IMB). Larger efforts may require additional information and undergo a longer scientific peer review. Computational requests must:

  • clearly indicate the relevance of the proposed computational activities to the ARM mission
  • describe the ARM data sets to be analyzed
  • explain how the proposal meets the criteria above
  • explain why ARM computational cluster resources are the most appropriate computational resources for achieving the science goals.

To submit an HPC proposal, PIs must submit the HPCR form. The request will be included in the next monthly IMB review that occurs after responses from the PI to all requests for clarifying information have been received. Results of the review of the proposal and responses to PIs will be communicated after the monthly review.

Once approved, the ARM Data Center HPC team will communicate with PIs to enable access to Cumulus and provide any technical support needed.

Use of Other DOE Computational Facilities

Projects focusing primarily on model simulations or data analytic applications that do not require the use of large volumes of ARM data or do not apply computationally intensive code to analysis of ARM data may be more suitable for the U.S. Department of Energy’s (DOE) National Energy Research Scientific Computing Center (NERSC) or other DOE leadership computing user facilities, such as the Oak Ridge National Laboratory Leadership Computing Facility (OLCF) and Argonne National Laboratory Leadership Computing Facility (ALCF). Allocations for these computational facilities should be requested directly from the individual facilities.

Executing an ARM HPC Project

All project allocations will be for a duration of 1 year but can be extended through an extension request process. For projects that extend beyond 1 year, PIs are required to submit a status report on an annual basis throughout the duration of the project.

Expectations for HPC Users

  1. Code of Conduct. PIs must review and agree to ARM’s Code of Conduct when requesting access to ARM’s HPC resources.
  2. Project Description. PIs must provide a description of their proposed computational project and its science goals. The project description must clearly show use of a large volume of ARM observational data or LASSO output, or the application of computationally intensive codes to ARM data sets. The project description should also include the computational approach, the planned ARM and non-ARM input data volumes, and all output data volumes required.
  3. Status Reports. If the project extends beyond 1 year, PIs must provide annual status reports to ARM.
  4. Data Submission. HPC projects that create new data products should plan to submit their data to the ARM Data Center as a PI data product. PIs should provide a detailed description of the product and data volume storage needs.
  5. Final Report. A final report for the project is required to be submitted to ARM within 6 months of project completion.
  6. Acknowledgment of ARM Support. Investigators who receive ARM support for their work should use the following acknowledgment in associated publications:

This research was supported by the Atmospheric Radiation Measurement (ARM) User Facility, a U.S. Department of Energy (DOE) Office of Science user facility managed by the Biological and Environmental Research Program.

Closing Out a Project

When closing out a project, the PI is required to submit a brief, final report of the outcome to the ARM Field Campaign Administrator to complete the ARM documentation.

ARM Logo

Follow Us:

Keep up with the Atmospheric Observer

Updates on ARM news, events, and opportunities delivered to your inbox

Subscribe Now

ARM User Profile

ARM welcomes users from all institutions and nations. A free ARM user account is needed to access ARM data.

Atmospheric Radiation Measurement (ARM) | Reviewed October 2024