Meeting report (IUCr supported)
Interest in data flourishes at AsCA
James Hester of ANSTO kicked off the session by posing the question “What is a dataset?” As crystallographers we use the word “dataset” constantly, but often without pondering its meaning. James walked us through an approach to mapping raw image data files to imgCIF format, thereby providing a protocol for systematic archiving and sharing of raw data. From there, Janet Newman of the CSIRO Collaborative Crystallisation Centre (C3) took us on a tour of C6 (Comparison of Crystallization Conditions @ C3). She highlighted some of the challenges in standardizing crystallization screen data and some of the tools her team is making available to analyse crystallization results.
The next two presentations provided updates and insights into the status of the Cambridge Structural Database and the PDB. Matthew Lightfoot of the CCDC detailed some of the work his team has been doing to understand the quality of datasets deposited with the CCDC and to improve data validation through the deposition workflow. Stephen Burley of the RCSB PDB detailed the efforts of his team to identify and correct ligand refinements in PDB structures. Toward the end of his presentation, Stephen highlighted the impact of the PDB on drug approvals by the US Food and Drug Administration. Continuing with updates from the PDB, Takeshi Kawabata of the Institute for Protein Research, Osaka University, provided a look at what PDBj is doing with data from electron microscopy (EM) studies. Takeshi detailed the EMPIAR archive of raw 2D EM images as well as EM Navigator, which provides a user-friendly interface to the EMDB server.
Finally, the session concluded with a contribution from Brian McMahon of the IUCr entitled “The element of trust: validating and valuing crystallographic data.” Brian’s presentation touched on the utility of the CIF file not only as a means of sharing data but also as an enabler for data checking and validation. This talk underscored the importance of focusing on data-sharing practices and protocols, which stood at the heart of every presentation. Without standardization, we lack the ability to validate the data we are using. And without validation, what can we truly say about the scientific conclusions we draw from these collections of data?
Sessions on data management and archiving provide a forum for concerned researchers to share experiences and agree on standard practices. As members of the structural science community, we have a responsibility to provide the highest quality data achievable from our experimental studies. I would encourage everyone to look through the talks described above on the IUCr website to learn more about ongoing efforts throughout the crystallographic community.