Recently journal editors have updated their data policy and now require that data relating to the submitted paper should be made available by the authors. From the AGU data policy : "..all data necessary to understand, evaluate, replicate, and build upon the reported research must be made available and accessible whenever possible ..."
. The aim of this change in the policy is to satisfy the principle that someone reading the paper should be able to reproduce your experiment. Again form the AGU policy :
" For the purposes of this policy, data include, but are not limited to, the following:
  • Data used to generate, or be displayed in, figures, graphs, plots, videos, animations, or tables in a paper.
  • New protocols or methods used to generate the data in a paper.
  • New code/computer software used to generate results or analyses reported in the paper.
  • Derived data products reported or described in a paper. "

There can be practical and even copyright limitations to do this, but these can be taken into account and it should not be an impediment to publication if properly documented.
The JGR-Space Physics editor-in-chief has listed some of these challenges and clarified the scope of the policy on his blog. This include references to model data which apply to lots of the ARCCSS data as well.

How to publish

NCI is now providing web services to publish data and metadata. This is a work in progress and when they will be in production mode we'll list them here.
We are working with them to provide a way to publish also smaller datasets through this services.
In the meantime we can create a collection record on Research Data Australia (RDA), a metadata catalogue service provided by ANDS. Once you have a record on RDA the journal will be satisfied that readers can find all the necessary information about your data. The record will also contain information on how to access the data, this will be a direct link to the data itself once the data is published on a repository and a contact to obtain the data in the meantime.

There a few necessary steps to make your data available:
  1. move the data to a new directory under /g/data1/ua8: this disk space started as a storage for ERA Interim, but it is now dedicated to ARCCSS shared datasets;
  2. get the dataset to a good quality level, necessary to share it successfully: metadata in the files should satisfy CF conventions, there should be a READ-ME file or some data description sitting with the data and both directory structure and filenames should be understandable and contains information on the data. It sounds more difficult that normally it is, once you moved the data I'll have a look and tell you if there's anything which needs to be done
  3. Create a data management plan for your collection using the ARCCSS DMPonline tool, if you don't have one already. If you do make sure to have filled in the third phase of the plan which deals with the publishing details. You then should share the plan with me when you're ready (share with I use the dmp to collect the necessary information to create a metadata record on RDA on your behalf. You can look at one of ARCCSS records as an example of the kind of information required.
  4. Think carefully about the license and rights terms, you'll find some options on the form itself, contact me if there are other collaborators involved or any other special terms to be kept into account, the license will be virtually null if either the ARCCSS and/or your university don't hold the copyright.
  5. if you don't have one already to make a ORCID identity which we can list in the record as well. In the future you might be able to reference all your work through a researcher identity, when applying for a grant.
The above steps, don't need to be executed in the listed order, in particular we can start working on the RDA record before your dataset has undergone proper curation. So you can go ahead with submitting the paper, ideally though the data should be ready to be shared before your paper is published.

Managing your data is an essential part of the publishing process for more detailed information go to the Data management induction training

I'm still working on this page and the DMPonline tool, any feedback on both is welcome!
Look also to the other wiki pages under data services dedicated to the tool, data management in general and researcher identities for more information