Building a Community of Data ScientistsAn Explorative Analysis

 

 Lei Liu1,2Hui Zhang1,2Jianhui Li1,2and Runqiang Wang2

 

1Secretariat of Chinese National Committee for CODATA,

2Computer Network Information Center, Chinese Academy of Sciences,

 4 South 4th Street, Zhongguancun, Haidian District, Beijing 100190, China, email: liulei@codata.cn

 

 

Public perception of the new discipline of data science is just in an embryonic stage. Similarly, people rarely mention of data scientists, not to mention building a community of data scientists. Based on an explorative analysis of the definition of a data scientist and a community of data scientists as well, this paper points out that the formation of a community of data scientists depends on the development of data science and data scientists, which is full of potential and hope though faced with obstacles and challenges. On one hand, to build such a community is promising for: 1) the inherent logic result of the sound development of data science as a scientific discipline along with the drive of fast development of S&T, the increasingly important strategic value of S&T data, and frequent in-depth international cooperation and exchanges;2) the persistent promotion from CODATA in conjunction with other interested international organizations; and 3) continuous support and efforts from various countries and regions, institutions and professionals. On the other hand, the process of building such a community is inevitably faced with difficulties and challenges because: 1) the perception, popularization and development of data science as a scientific discipline need a long period of time; and 2) comparatively speaking, there have been very few data scientists up to now.

 

Keywords: data science; discipline; data scientists; community; CODATA