 Introducing the CODATA-RDA Schools of Research Data Science
Introducing the CODATA-RDA Schools of Research Data Science
The ever-accelerating volume and variety of data being generated is having a huge impact of a wide variety of research disciplines, from the sciences to the humanities: the international, collective ability to create, share and analyse vast quantities of data is having a profound, transformative effect. What can justly be called the ‘Data Revolution’ offers many opportunities coupled with significant challenges. High among these is the need to develop the necessary professions and skills. There is a recognised need for individuals with the combination of skills necessary to optimise use of the new data sets. Researchers and research institutions worldwide recognise the need to develop data skills and we see short courses, continuing professional development and MOOCs providing training in data skills and research data management.
For full information on activities by the CODATA-RDA Schools of Research Data Science, please visit the School’s website at: https://www.datascienceschools.org/.
The Need for Foundational Data Skills in all Disciplines
 Contemporary research – particularly when addressing the most significant, inter-disciplinary research challenges – cannot effectively be done without a range of skills relating to data, including the principles and practice of Open Science and research data management and curation, the use of a range of data platforms and infrastructures, large scale analysis, statistics, visualisation and modelling techniques, software development and annotation, etc, etc.  We define these skills together as ‘Research Data Science’, that is the science of research data: how to look after and use the data that is core to your research.
Contemporary research – particularly when addressing the most significant, inter-disciplinary research challenges – cannot effectively be done without a range of skills relating to data, including the principles and practice of Open Science and research data management and curation, the use of a range of data platforms and infrastructures, large scale analysis, statistics, visualisation and modelling techniques, software development and annotation, etc, etc.  We define these skills together as ‘Research Data Science’, that is the science of research data: how to look after and use the data that is core to your research.
The CODATA-RDA School of Research Data Science has developed a short course, summer school-style curriculum that addresses these training requirements. The course partners Software Carpentry (using the Shell command line and GitHub), Data Carpentry (using R and SQL) and the Digital Curation Centre (research data management and data management plans) and builds on materials developed by these organisations. Also included in the programme are modules on Open Science, ethics, visualisation, machine learning (recommender systems and artificial neural networks) and research computational infrastructures.
The foundational curriculum and school was piloted in August 2016 at the International Centre of Theoretical Physics in Trieste, Italy. In July 2017 it was repeated, with some refinements, and combined with a set of more advanced or discipline specific workshops looking at Extreme Data, the Internet of Things and Bioinformatics. In this way, we are exploring a vision where a foundational programme, suitable for any research discipline, can eventually be combined with more advanced training suitable for specific skills or disciplines.
An International Network of ‘Data Schools’

The significant demand from individuals and institutions means that it is a strategic priority for both CODATA and the Research Data Alliance to build capacity and to develop skills, training young researchers in the principles of Research Data Science. A particular issue is also the needs of young researchers in Lower and Middle Income Countries (LMICs): it is important that Open Data and Open Science benefit research in LMICs and that an unequal ability to exploit these developments does not become another lamentable aspect of the ‘digital divide’. On the contrary, it has been argued that the ‘Data Revolution’ may offer a notable opportunity for reducing that divide in a number of respects.
For these reasons, the vision of the CODATA-RDA Schools of Research Data Science is to develop into an international network which makes it easy for partner organisations and institutions to run the schools in a variety of locations. The annual event at the ICTP in Trieste will serve as a motor for building the network and building expertise and familiarity with the initiative’s mission and objectives. The core materials are made available for reuse and the co-chairs and Working Group team will provide guidance to assist partners in organising the school and identifying instructors and helpers. In 2017, the initiative expanded to include Advanced Workshops and the First School of Research Data Science in São Paulo, Brazil. In 2018 the data schools took place in Trieste, Italy (with the second year of advanced workshops), Kigali, Rwanda and São Paulo, Brazil. A week long winter school also ran in Brisbane, Australia.
Mission and Objectives
The CODATA-RDA Schools of Research Data Science:
- address recognised need for Research Data Science skills across all disciplines;
- follow a recognised curriculum that addresses foundational data skills required by all researchers;
- provide a pathway from a broad foundational course through to more advanced and specialised courses or workshops;
- are reproducible: all materials will be online with Open licences;
- are scalable: emphasis will be placed on training trainers, building partnerships and developing an international network which makes it easy for schools to be run in many locations.
Programmes, reports and documents
- See publications from the Schools team on Zenodo here: https://zenodo.org/communities/codata-rda-research-data-science-summer-school/
- Programme for #DataTrieste 2017, the second CODATA-RDA School of Research Data Science, ICTP, Trieste, July 2017
- Materials from #DataTrieste 2017, the second CODATA-RDA School of Research Data Science, ICTP, Trieste, July 2017
- Programme for #DataSãoPaulo 2017, the third CODATA-RDA School of Research Data Science, ICTP, São Paulo, December 2017
- Materials from #DataSãoPaulo 2017, the third CODATA-RDA School of Research Data Science, ICTP, São Paulo, December 2017
- Short Report on #DataTrieste 2016, the first CODATA-RDA School of Research Data Science
- Programme for #DataTrieste 2016, the first CODATA-RDA School of Research Data Science, ICTP, Trieste, August 2016
- Materials from #DataTrieste 2016, the first CODATA-RDA School of Research Data Science, ICTP, Trieste, August 2016
Upcoming Schools
For information on upcoming Schools events, see https://www.datascienceschools.org/upcomingschools/.
Past Schools
- Data Atlanta 2022: 5 Sept – 28 Oct 2022
- DataTrieste 2022: 11 – 22 July 2022
- South Africa/São Paolo Online & Self-paced 9 May – 15 July 2022
- Trieste Online & Self-paced 6 Sept – 5 Nov 2021
- South Africa 2021 Online and Self-Paced 3 May – 9 July 2021
 CODATA-RDA School of Research Data Science collaborated with the FAIRsFAIR project on a number of training events between 2019 and 2022.  More information on this activity is available at https://fairsfair.eu/codata-rda-summer-schools-data-science-and-cloud-computing-developing-world CODATA-RDA School of Research Data Science collaborated with the FAIRsFAIR project on a number of training events between 2019 and 2022.  More information on this activity is available at https://fairsfair.eu/codata-rda-summer-schools-data-science-and-cloud-computing-developing-world
- DataSaoPaulo 2020: CODATA-RDA School of Research Data Science, ICTP, São Paulo, 3 – 14 Dec 2020
- DataTrieste 2020, ICTP, Trieste, 5 – 16 Aug 2020
- #DataPretoria2020, University of Pretoria, Pretoria, 13 – 24 Jan 2020
- San José 2019
- Data Steward School, ICTP, Trieste 2019
- Addis Ababa 2019
- DataTrieste 2019
- Advanced Workshop 2019
- Research Data Winter School, University of Queensland, Brisbane, Australia, 12 – 15 June 2018
- #DataTrieste 2018, CODATA-RDA School of Research Data Science, ICTP, Trieste, 6 – 17 Aug 2018
- CODATA-RDA Research Data Science Applied Workshops, ICTP, Trieste, 20 – 24 Aug 2018
- #DataKigali 2018, CODATA-RDA School of Research Data Science, ICTP, Kigali, Rwanda, 22 Oct – 2 Nov 2018
- #DataSaoPaulo 2018, CODATA-RDA School of Research Data Science, ICTP, São Paulo, 3 – 14 Dec 2018
- #DataSaoPaulo 2017, CODATA-RDA Research Data Science Summer School, ICTP, São Paulo, 4 – 15 Dec 2017
- CODATA-RDA Research Data Science Applied Workshops, ICTP, Trieste, 24 – 28 July 2017
- #DataTrieste 2017, CODATA-RDA Research Data Science Summer School, ICTP, Trieste, 10 – 21 July 2017
- #DataTrieste 2016, CODATA-RDA Research Data Science Summer School, ICTP, Trieste, 1 – 12 Aug 2016
Convenors and Organisers

Partners

Sponsors
Working Group Co-Chairs
| Marcela Alfaro Córdob Universidad de Costa Rica |  | Louise Bezuidenhout Institute for Science, Innovation and Society, | |
| Raphael Cobe Center for Scientific Computing – |  | Sara El Jadid Ibn Tofail University- Morocco eljadidsara [at] gmail.com |  | 
| Bianca Peterson Centre of Excellence for Pharmaceutical Sciences – North-West University |  | Robert Quick UITS Research Technologies, Indiana University rquick (at) iu.edu |  | 
| Hugh Shanahan Department of Computer Science, Royal Holloway, University of London Hugh.Shanahan (at) rhul.ac.uk | 
Previous Co-Chairs
- Simon Hodson, CODATA.
- Anelda van der Walt, Talarify.
- Andrew Harrison, University of Essex.
- Sarah Jones, Digital Curation Centre.
- Shanmugasundaram Venkataraman (Venkat), Digital Curation Centre.
Page last reviewed: 2022-11-17.
