It is a special honor to receive the CODATA Prize on the 50th anniversary of the birth of CODATA. My association began when I attended the First International CODATA Conference in Arnoldshain, Germany in 1968, just two years after CODATA’s founding. That was a remarkable event, probably the first international conference to deal with scientific data issues in a generic sense, and we owe a debt of gratitude to CODATA’s founders. They were all distinguished scientists, highly respected for their own research contributions and service to their governments, and they had the foresight to recognize that science was at the beginning of a major change, where the rapidly increasing production of new data from experiments and observations threatened to swamp the traditional archival publication mechanisms, leading to the possibility that important data would be lost to future generations of scientists.
The six founding fathers of CODATA were:
Although these leaders had worked in different scientific fields, they all understood the prime importance of accuracy and reliability in the data they dealt with, whether that data was to be used in basic scientific research or for industrial applications. Thus data quality became a core objective as CODATA began to develop its programs.
I want to make several comments about these founders, and about the first CODATA Conference itself:
Thus my association with CODATA began, and was to continue for the next 25 years. During those years, I witnessed its evolution into an influential scientific organization and saw the expansion of its universe. In terms of disciplinary scope, CODATA extended first into the biosciences, when we started projects like the Protein Sequence Data Task Group and the Hybridoma Data Bank. Several small projects were started in the geosciences, and more came later. And increasing attention was focused on data of industrial interest, such as properties of engineering materials. Attention to data quality was a feature of all these new activities.
CODATA also expanded in geographical terms, from 7 National Members in 1968 to 22 members at the 1990 Conference – and even more today. I was especially pleased that the Chinese Academies at Beijing and Taipei joined CODATA during my tenure as President. And it is gratifying to see that so many countries from Asia and Africa are now participating in CODATA – it is quite a change from 1968.
And, of course, during these years computer technology has become an integral part of virtually every CODATA activity. I’d like to note that CODATA produced its first computer-based product, called the CODATA Referral Database, in 1990. This was a guide to data sources in different disciplines, distributed on floppy disks with integrated search software for use on a personal computer – and this was before the Internet and Google were invented. I am proud that my wife Bettijoyce, who is here today, was a leader in producing that first CODATA electronic database.
During my years with CODATA I met and worked with hundreds of people who took part in the various CODATA activities. I have warm memories of these colleagues, who deserve the credit for the progress CODATA made in its first quarter-century. There are far too many to mention by name, but I do want to acknowledge one person, Phyllis Glaser, who served as Executive Director for 20 years. Phyllis gave her heart and soul to CODATA and was often the glue that held us together during turbulent times. And her esoteric flat on the Rue Bleue served as a stop-over for many children of CODATA participants as they back-packed through Europe. Those of us who worked with her were saddened when Phyllis passed away in late 2014.
In conclusion, CODATA began at a time when data evaluation and data management were considered rather dull subjects by most scientists. I was Director of the Standard Reference Data program at the National Bureau of Standards at that time, and I had a constant struggle with my management to get money to support data evaluation and to get proper respect for the people who were involved in data work. Fifty years later, it is a very different world. “Big Data” has become a popular buzz word, and many organizations want to get involved. “Data Scientist” is now a recognized profession. Many new organizations have been set up to deal with data issues. It is certainly pleasing to see the increased attention now given to the storage, preservation, and retrieval of scientific and technical data, and to removing the barriers to accessing this data. However, I worry that one thing is not getting proper attention, and that is the need for assurance of data quality. The Internet is a great tool for finding data, but if I Google for the thermal conductivity of magnesium at –50 Celsius, I might find 6 different values on 6 different sites. To justify the investment in massive data archives, it is essential to build into these structures a process for evaluating the quality of the data that goes in – to include a method for selecting the most accurate data and documenting the conditions under which the data were obtained. Data quality was a core objective when CODATA was founded, and is still the first objective in the CODATA charter. I want to conclude my remarks with a plea to everyone involved with data today – please continue the CODATA tradition of recognizing data quality as the highest priority.
The author has no competing interests to declare.