Crystallographic data

International Data Week and the CODATA General Assembly 2018

Traditionally the IUCr has had a strong presence at CODATA meetings (see previous reports on the IUCr web site), and it was a particular pleasure to see the main international CODATA conference SciDataCon (now part of the International Data Week) return to Africa for the first time since 2010. The meeting took place in Gaborone, Botswana;International Data Week 2018 ran from 5th to 8th November, and was followed by the CODATA General Assembly (9-10 November).

[SKA presentation]Big data in astronomy (the Square Kilometre Array). MeerKAT, SKA-1 and SKA-2 are different stages of development of the project concept, and are expected to attain measurement data rates of 20, 300 and 7500 Gbps, respectively. For comparison, CERN currently generates 40 Gbps of data, while the total video streaming output from Google is 2.2 Gbps.
 

The meeting theme of IDW2018 (http://www.internationaldataweek.org/programme-overview) was The Digital Frontiers of Global Science. It attracted 820 participants from 66 countries with 60% of the attendees being from Africa.

A major role IUCr had within this conference was its organisation with IUPAC of a double session entitled Data interoperability in chemistry, biology, and crystallography: Enabling multidisciplinary solutions to societal challenges. The session organisers were Ian Bruno (Cambridge Crystallographic Data Centre), Jeremy Frey (University of Southampton), Richard Hartshorn (IUPAC), John Helliwell (IUCr) and Leah McEwen (Cornell University).

[Keynote presentation]Keynote address by Fosca Giannotti, Director of Research at the Information Science and Technology Institute "A. Faedo" of the National Research Council, Pisa, Italy.

Through IDW2018 there were two plenary sessions. This commenced with the Opening Event, attended by the President of Botswana. The Plenary themes were: Science and Data in Africa; Data and Health; Science Policy and the Future of Science; Pan African Infrastructure; CODATA and WDS (World Data System) Initiatives; Data Science; Data and Scholarly Communications.

There were multiple parallel sessions on data topics and themes. I attended: Journals and Data FAIR; African Health Data; Libraries and Research Data Management; the European Open Science Cloud; FAIR and Interoperability; Supporting FAIR Data; Data citation; Chemistry Research Data; Data Interoperability; and Data and Scholarly Communications.

Within these, of direct interest to crystallographers was the citing of the CODATA-ICSTI Data Citation Standards and Practices Report, which was very warmly commended in the sessions and in which IUCr had been a participant (Brian McMahon, IUCr Chester). It was well received by the scientific communities and was testimony that scientific data can be properly cited and that recognition of quality scientific data can thereby be gained. Of general interest I highlight the presentation on the radio astronomy Square Kilometre Array, the largest data generating scientific project, which was very interesting and provided a comparison of measured data rates with some other ‘big science’ projects.

The merger of the International Council of Scientific Unions and the International Council for the Social Sciences into the International Council for Science made its presence felt in the IDW2018 programme. Of especial note I felt was the Keynote Lecture of Professor Dr Fosca Gianotti (Italy) in which she stressed the need for data to be both “FACT and FAIR”. The Public would expect no less, i.e. reproducibility, not irreproducibility, of her studies, she said. She is the coordinator of SoBigData, the European research infrastructure on Big Data Analytics and Social Mining, an ecosystem of ten cutting edge European research centres providing an open platform for interdisciplinary data science and data-driven innovation (http://www.sobigdata.eu).

CODATA General Assembly

At the General Assembly there were miscellaneous topics of specific interest to IUCr. I highlight three items.

  1. Crystallography’s track record with preservation of its data and metadata along with its publications is frequently referred to as an exemplar for all science disciplines. A CODATA Task Group on Data Standards has been established to effect improvement in all science areas towards the ideal of IUCr. Lesley Wyborn, Adjunct Fellow at the National Computational Infrastructure Facility and The Research School of Earth Sciences in Australia (lesley.wyborn@anu.edu.au), Deputy Chair, along with the CODATA President, reaffirmed to me as IUCr Representative the strong role that IUCr has in this effort.
  2. The CODATA VAMAS Nanomaterials Uniform Description project, in which IUCr was a participant (represented by John R. Helliwell, Reinhard Neder and Daniel Chateignier), was very warmly commended as both being well received by the science communities and testimony that scientific data integration towards a common good is possible. The common good in this case is society’s need for guidance on the safety of nanomaterials and, first and foremost, a clear set of descriptors for them.
  3. The CODATA strategic initiative on Integration of Standards with the launch Workshop in Paris (JRH as participant) has led to several subsequent Workshops and the identification of three important themes for CODATA and the International Council for Science to focus on: (i) Infectious diseases, (ii) Resilient cities and (iii) Disaster risks reduction. See http://www.codata.org/strategic-initiatives/commission-on-standards.