News & Articles

CODATA Data Science Journal: Revised Focus and Scope

The DSJ editorial board has revised the focus and scope of the journal. It is not a big change, but rather one of clarification in a changing world. We primarily want to specify our definition of 'data science' as the classic sense of the science of data practices that advance human understanding and knowledge — the evidence-based study of the socio-technical developments and transformations that affect science policy; the conduct and methods of research; and the data systems, standards, and infrastructure that are integral to research. 

We recognize the contemporary emphasis on data science, which is more concerned with data analytics, statistics, and inference. We embrace this new definition but seek papers that focus specifically on the data concerns of an application in analytics, machine learning, cybersecurity or what have you. We continue to seek papers addressing data stewardship, representation, re-use, policy, education etc.

Most importantly, we seek broad lessons on the science of data. Contributors should generalize the significance of their contribution and demonstrate how their work has wide significance or application beyond their core study. 

We look forward to hearing from you. 

Mark A. Parsons on behalf of the Editorial Board


China GEO-CODATA LODGD supports New Year Flood Impact in Honiara


From the 27th December 2018 the Solomon Islands experienced two weeks of intense rainfall due to the rapid succession of three tropical storms.  Firstly, a tropical depression that later turned into Cyclone Mona crossed the country bringing heavy downpours and high winds. This was followed by another tropical depression that formed nearby and lingered over the Solomon’s due to being sandwiched between Cyclone Penny to the south and another system to the west. A weather station in Honiara/Henderson recorded a total of 350mm of rain during this period. These heavy downpours caused rivers to burst their banks and flash flooding to occur.

On the 6th January the National Emergency Operations Centre (NEOC) reported that an estimated 23,708 people had been affected with homes and crops destroyed.  Flash flooding also contaminated drinking water. This damage was captured both on the ground and through satellite imagery produced by China GEO from the 25th January 2019.

Site Extent

Honiara is situated on the island of Guadalcanal, the largest in the Solomon Islands archipelago. It is volcanic in origin and ranges in topography from tall mountain ranges to low- lying coral atolls. The Guadalcanal Island has three main rivers the Mataniko, Lunga and Tenaru rivers. The China GEO supported satellite imagery from CGSTL was flown over the wider Honiara area, encapsulating all three rivers. The river of interest, the Lunga River and the site extent for analysis is shown in Figure 1.

Damage Scenario

Severe damage to both the natural and built environment occurred due to the continuous intense rainfall over the Honiara area in late December 2018 through to January 2019. The damage to the built environment, mostly consisting of family homes, was more concentrated compared to the damage to crop land and food gardens which was more widespread.  

A large amount of crop land was destroyed as a result of the constant inundation of water, which also caused hillsides to buckle as shown in Figure 2. Root crops and vegetables were destroyed meaning economic loss for majority of the island. Eighty five percent of the community are subsistence farmers therefore, the crops destroyed are their primary source of food. This means that crop damage causes not only an economic loss but a detrimental effect on community life and health through severe food shortages. The cocoa, cassava and fishing industries were also impacted by the flood waters.

Family homes were damaged due to the constant rain and inundation of water, which left many people homeless. The rain and associated wind caused roofs to collapse and landslides to occur. The landslides causing communities to be blocked off. More than 22,000 people had their homes or cropland destroyed. The majority of communities do not have proper water networks, therefore wells and streams are the main source of drinking water. The flooding caused contamination of water ways and increased the likelihood of mosquito-borne diseases and secondary impacts on the community such as diarrhoea.

The flooding also caused damage to infrastructure such as road networks and stopbanks. As seen in Figure 3, the force of the water ripped/bent the corrugated iron holding the rock mesh together. Damage to these key protection measures and roading networks can inhibit aid reaching affected areas and cause future inundation where resilient systems have been damaged.

Analysis of New Year Flood

Responding to requests from CODATA/LODGD, China GEO activated its Disaster Data Response (CDDR) Mechanism to provide high-resolution satellite imagery in support of disaster response planning. Analysis of the New Year Flood was undertaken using 2m resolution satellite provided by China GEO from the 25th January 2019.  The high resolution imagery was used to depict the extent of the flood water, which was indicative of the potential impacts for the surrounding community. The flooding extent was depicted from the satellite imagery using ArcGIS. To understand what the river and water extent was before the New Year’s flooding event, images from Google Earth were used. The comparison between the New Year Flood Extent and the river extent from the 7th January 2018 is shown in Figure 4. The imagery and river extent from the 7th January 2018 acts as a control for how the river is in its “normal” state, therefore the comparison allows for an understanding of the disaster event.

Discussion and Recommendations

High resolution satellite imagery is crucial data to have in response to a disaster. During the early stages of a disaster it is hard to understand the true extent of damage, due to complications in communications and logistics. Some communities can be completely cut off due to various reasons, therefore the earlier satellite imagery is retrieved and analysed, the quicker decisions can be made to reduce further impact.

There are issues around how open and freely available high resolution satellite imagery is after a disaster which can inhibit the response. Having open source, high resolution imagery shortly after a disaster can allow for rapid disaster mapping to occur. This analysis helps communicate where the areas of high risk and importance are to allow for the first responders to assess how they might assist those areas of greatest need and desperation.  A lack of satellite imagery after a disaster represents a lack of visualisation of the situation. A lack of visualisation means a poor understanding of the extent of the disaster and its consequences and therefore external help cannot be as effective in the attempt to send the relevant information or aid.

Rapid procurement of satellite imagery after a disaster is not only crucial for response but it can help for understanding future risk and impact. By visualising the extent of a disaster in the same place at different periods in time, alongside a control (the area in its natural state), trends for that particular area to a particular natural hazard can be understood. For example, Figure 4 shows the comparison between the river in its natural state (2018) and the river in a state of flooding (2019). It is notable that there are particular areas along the river that have burst their banks. If this data was then to be compared against the 2014 flood extent of the Lunga River, some trends may start to appear allowing for areas of vulnerability and future risk to be identified. Identifying these areas also leads to a better understanding of the impacts to the surrounding areas, therefore resilience measures can be put in place.

To ensure better response to future disasters, freely open and available high resolution satellite imagery is crucial. The resolution has to be high to capture the important and necessary features to inform the appropriate response. The imagery has to be freely available to allow for rapid disaster mapping to occur by external agencies, as during the early stages of response there can be other priorities for agencies on the ground. The imagery has to be taken as early as possible to gain a true understanding of the impact. For example, water retreat can occur rapidly, therefore the full extent of the disaster can be lost. Analysing satellite imagery after a disaster is the quickest way of understanding and visualising what has happened on the ground and can identify areas that could be of potential danger. It is crucial to have this information to reduce the impacts and devastation of a disaster.  

Image courtesy: China GEO & CGSTL

DEADLINE TOMORROW - CODATA International Training Workshop on Scientific Big Data and Machine Learning

9-20 September 2019, Beijing, China

Deadline for Applications, 30 May 2019 
Further information and application form:
CODATA China and the Computer Network Information Center, Chinese Academy of Sciences are organizing an international training workshop on "Scientific Big Data and Machine Learning" in Beijing from 9-20 September 2019. Supported by CODATA, CODATA China has successfully held similar training workshops since 2012 and this is the 5th event in the series. The training course is funded by the Chinese Academy of Science and qualified applicants will receive full support for travel to Beijing as well as their subsistence and accommodation when at the workshop.

Participants in the Training Workshop will also attend the CODATA 2019 Beijing Conference: Towards next-generation data-driven science: policies, practices and platforms.  Applicants are encouraged also to submit a paper or poster for the conference.

The deadline for applications is 11:59 pm CST (UTC+8) on 30 May.
Due to advances in information technology, we are witnessing an explosion in digital data through all forms of human activity: much of this data can also contribute to the production of knowledge for all domains of enquiry and across domains, as well as providing essential information for decision-making in response to global challenges such as sustainable development, disaster risk reduction, climate change, the growth of cities, the maintenance of biodiversity etc., etc. In order to meet the many global challenges and to take advantage of the opportunities of the data revolution, it is imperative to develop global skills and capacity in the science of data.

The mission of CODATA, the Committee on Data of the International Science Council, is to support the advancement of science as a global public good by promoting improved research data management and use, and by advancing data science and the enhanced exploitation of data in all areas of scientific research. Building data science capacity in countries with developing and emerging economies (in part through promoting technical training in data science, in data management and in the implications of the data revolution for science) is an important part of CODATA’s strategy.

The training program offered aims to engage participants with a number of facets of data science and data management in the age of the data revolution and Big Data. Topics include, but are not limited to:
  • Basic data science skills (e.g. Introduction to data infrastructures in CAS, data carpentry, data management plan)
  • Machine learning and data driven scientific discovery
  • Scientific data policies, good practices, norms, specifications and standards
  • Selected disciplinary scientific data stewardship exemplars (e.g. geoscience, biology, genomics, astronomy, etc)
A number of activities will be organized involving elite Chinese scientists, in order to promote knowledge sharing and to develop opportunities for future exchanges and collaboration. Participants will benefit also from visits to a number of leading research institutes of the Chinese Academy of Sciences (CAS). During these visits, participants will have the opportunity to learn from the scientific approach, management expertise, knowledge development and practical application which characterize activities at CAS institutes working at the frontiers of research. Furthermore, the program will promote interaction and exchange of knowledge between experts and participants and among participants who will benefit greatly from exchanges with colleagues for a variety of academic and national backgrounds.
How to Apply: 
Apply at to complete the online form and to submit the necessary support documents.  The deadline is 23:59 CST (UTC+8) on 30 May.
You will be asked to complete an application form with personal information and to submit supporting document. Once you have created an account and entered the personal information you will need to click on [Submission] to access the page to upload your supporting document.
The supporting document should comprise your CV, a personal statement (of no more than one side) and references combined as a single PDF file (maximum volume 20MB).

The personal statement should describe your interests and why you want to do the Training Workshop.  You should answer these questions:
  1. What expertise and qualities would you bring as a participant?
  2. What specific topics would you like to cover and how will this help you?
  3. How would you hope to apply the experience gained from the course in your own country? 
Please consult Eligibility and Important Dates
Further information about the training programme and contact details is available on the Training Workshop Website

Call for Applications - CODATA RDA Data Steward School 2019 - Deadline 1 June 2019

The CODATA-RDA Research Data Science Summer School provides training in the foundational skills of Research Data Science. Contemporary research – particularly when addressing the most significant, transdisciplinary research challenges – cannot be done effectively without a range of skills relating to data.

The school is held at the [International Centre for Theoretical Physics]( running from August 5-16. The venue, the Adriatico Guest House, is a delightful self-contained site, overlooking the sea,where there is accommodation, lecture halls, a terminal room and a canteen.


This year the school will be running a parallel strand that is oriented to students who are interested in Data Stewardship.

The curriculum for these students will be

  • Open Science
  • Introduction to Unix Shell
  • Programming for Analysis
  • Git
  • Research Data Management
  • Author Carpentry
  • Introduction to Information Security
  • Metadata & Persistent Identifiers
  • Data discovery and automated DMP’s
  • Repositories
  • Ontologies
  • Linked data

The first week of the school will be for both Early Career Reseachers (ECR’s) and Data Stewards. The seocnd week will break into two parallel strands, one focussing on Data Science techniques for ECR’s and the other for Data Stewards.

The provisional detailed programme can be found here with a breakdown of the shared and separate courses.

Graduate students or ECR’s considering a career in Data Stewardship and Early Career Data Stewards would benefit most from this school.

This school is a part of a planned series of CODATA-RDA schools that will run in Europe as part of the FAIRsFAIR initiative.

There is space for 5 students who wish to take the Data Steward strand. Students wishing to attend this strand will need to cover their accommodation and travel expenses to the school but there are no registration fees. Single rooms at the Adriatico start at €50 per night.

As this is a pilot students attending this strand will be interviewed afterwards to get feedback to improve the curriculum.

Candidates interested in attending should contact the director of the Data Steward strand, Hugh Shanahan before June 1.


The Call for Presentations and Posters for CODATA 2019 Beijing: Towards next-generation data-driven science: policies, practices and platforms, 19-20 September has now been released.  Please submit proposals at by the deadline of 24 June.
Registration is also now open!  The Conference Website now has information for registration, accommodation and visa matters.  Make your arrangements as soon as possible.
CODATA 2019: Towards next-generation data-driven science: policies, practices and platforms
Science globally is being transformed by new digital technologies.  At the same time addressing the major global challenges of the age requires the analysis of vast quantities of heterogeneous data from multiple sources.  In response, many countries, regions and scientific domains have developed Research Infrastructures to assist with the management, stewardship and analysis.  These developments have been stimulated by Open Science policies and practices, both those developed by funders and those that have emerged from communities.  The FAIR principles and supporting practices seek to accelerate this process and unlock the potential of analysis at scale with machines.  This conference provides a significant opportunity to survey and examine these developments from a global perspective.
The conference themes and guidelines for proposals for presentations and posters may be consulted at
The conference will follow a high-level workshop, 17-18 September 2019, on ‘Implementing Open Research Data Policy and Practice’ that will examine such challenges in China and elsewhere in the light of the emergence of data policies and in particular the China State Council’s Notice on ‘Measures for Managing Scientific Data’.
Applications are also open for the CODATA China 2019 Training Workshop: Scientific Big Data and Machine Learning deadline 30 May.
We advise you to register, to make travel, accommodation and visa arrangements as soon as possible.
Practical information about the conference is available at

Registration is now open at - Please register as soon as possible!

We suggest you make your visa application as soon as possible.  Please find useful information at

The conference venue is the Friendship Hotel.  Please find useful information at  Again we suggest you make accommodation arrangements at the Friendship Hotel or nearby as soon as possible.

For any further assistance and information please consult the website and contact details

Read more CODATA news