|
1
|
- Steve Rossouw, Bill Anderson
- Co-chairs CODATA Data Archiving Working Group
- CODATA 2002
- 30 September 2002, Montréal, Canada
|
|
2
|
- “Digital resources will not survive or remain accessible by accident.”
- Bernard Smith, European Commission
- ICSTI/ICSU/CODATA Digital
Preservation Workshop
- 15 February 2002, Paris, France
|
|
3
|
- Formed at CODATA 2000 in Baveno, Italy
- Preliminary web site established
- Workshop in Pretoria, S. Africa, May 2002
- Annotated list of primary references
- Preliminary classification of issues
- Task Group proposal focusing on developing countries, and
- Collaboration with ICSTI on internet portal
|
|
4
|
- Primary and secondary references include:
- Workshop reports
- Journals, edited volumes, and standard textbooks
- Published guidelines and handbooks
- Government circulars and reports
- Web sites and portals
- Standards
|
|
5
|
- Data management
- Data policy
- Scientific Data
- Technical issues
- Technical web resources
- Digital preservation (focus on digital libraries, collections, etc.)
|
|
6
|
- Four categories of issues
- Science
- Management
- Policy
- Technical
|
|
7
|
- Discipline specific needs and practices of communities;
- Interdisciplinary and pan-disciplinary values, methods
|
|
8
|
- What are scientific data?
- OAIS model has reference definitions
- Mandates of different archives differ
- Data quality control and assurance
- Selection and appraisal criteria
- Value and relevance of data archived
- Language differences
- Not all data published in one language
- Developing and developed country differences
- Nomenclature / taxonomy
- Differs inside and across communities
- Names and concepts change over time (need to save historical contexts)
- Barriers to preservation
- original data in some fields on paper only
- original data buried in spreadsheets, databases, documents
- Interdisciplinary work can yield pan-disciplinary, unmanaged data
|
|
9
|
- Practices and procedures of individuals, archival institutions, and
communities
|
|
10
|
- What is archiving?
- Relation to other data management functions?
- OAIS model distinguishes issues by
- archive administration
- external and community management
- Advocacy needed to secure funding
- Data management is not science
- Business and organizational models
- economic and cost, public and private
- incentives and dis-incentives for populating and maintaining deposits
- Selection and appraisal criteria and prioritization
- Ownership and control
- Planning and requirements issues
- practices are changing
- local practices differ
- mandates and objectives differ
- what is effective access?
- Applications
- diversity of customers: scientists, politicians, citizens
- Some operational considerations
- size
- diversity: source, formats, documentation
- time horizon for access
- changes in data definitions, formats
- hardware and software obsolescence
|
|
11
|
- Rules, regulations, laws, external to the archive that inform,
constrain, and assist management
|
|
12
|
- National, regional & global perspectives
- Cultural ownership of data & preference for use
- Human data privacy & confidentiality
- Environmental data privacy & security
- Intellectual property: protection, limits & exceptions
- Public vs. private data
- Incentives and dis-incentives for managing archive deposits
- National security
- Institutional roles and policies
- Enabling legislation & controlling authorities
- Freedom of information
- policies, regulations & practices
- access authorization
- Financing and cost recovery policies
- economies of scale
- unfunded mandates
- Rationale for data archiving
- pure research needs
- cultural, economic & political needs
- Policy enforcement mechanisms
- Data rights
- redistribution
- transformation
- derivative product rights
|
|
13
|
- Standards, hardware and software that support data preservation,
archiving, and access functions
- Mostly discipline independent
|
|
14
|
- Scientific data and databases are different from literature
- size and volume differences
- human readability vs. application access
- Diversity of data types and formats, and media types, formats and
standards
- Nomenclature and taxonomy
- issues apply to the technology itself
- Search capabilities
- Who need what and to what ends?
- Metadata: difference between access and preservation (OAIS)
- Preservation issues
- Rapid evolution of technology
- Information buried in software is hard to maintain and access
- Information in proprietary formats and commercial databases
- Directories
- potential user authentication and authorization mechanism
- potential archive and content discovery mechanism
- Standards: OAIS, Open GIS
- continuing work is needed
- Interoperability among archives
|
|
15
|
- “Preservation and Archiving of Scientific and Technical Data in
Developing Countries”
- Improve understanding of S&T data management conditions in
developing countries
- Advance development and adoption of good archiving practices, policies, and tools
- Provide interdisciplinary forums
- Build a comprehensive directory of managers, experts, and archives
|
|
16
|
- Develop and maintain an Internet portal about archiving S&T data and
information
- STI archiving procedures, technologies, standards, and policies
- Archiving projects and activities
- Experts points of contact in all countries
|
|
17
|
- CODATA data archiving activities will pursue opportunities to
- Promote and advance management of S&T data
- Leverage common properties of digital data
- Learn from previous and ongoing experiences with managing growing
collections of digital data
|
|
18
|
|