Some Additional Proteomics Challenges:
• High-throughput crystallography generates large volumes of complex protein structure data
• Small-molecule structure databases are growing to tens of millions of compounds
• 3D and pharmacophore analysis requires efficient storage, indexes, and operators for structure data
• Visualization and computation tools need to be integrated with the DBMS
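To make the "indexes and operators" point concrete, here is a minimal sketch of one possible spatial index: a uniform grid over 3D coordinates with a "within radius" operator. The `GridIndex` class, its method names, the feature labels, and the coordinates are all hypothetical illustrations, not part of any DBMS or toolkit named on these slides.

```python
import math
from collections import defaultdict


class GridIndex:
    """Uniform-grid index over 3D points (hypothetical helper, not a DBMS API)."""

    def __init__(self, cell_size):
        self.cell_size = cell_size
        self.cells = defaultdict(list)  # grid cell key -> [(label, point), ...]

    def _key(self, point):
        # Map a coordinate triple to the integer cell that contains it.
        return tuple(int(math.floor(c / self.cell_size)) for c in point)

    def insert(self, label, point):
        self.cells[self._key(point)].append((label, point))

    def within(self, center, radius):
        """Return sorted labels of all points no farther than `radius` from `center`."""
        reach = int(math.ceil(radius / self.cell_size))
        cx, cy, cz = self._key(center)
        hits = []
        # Only visit cells that can possibly contain a match.
        for dx in range(-reach, reach + 1):
            for dy in range(-reach, reach + 1):
                for dz in range(-reach, reach + 1):
                    for label, p in self.cells.get((cx + dx, cy + dy, cz + dz), ()):
                        if math.dist(center, p) <= radius:
                            hits.append(label)
        return sorted(hits)


# Toy pharmacophore-style features with made-up coordinates (in angstroms).
index = GridIndex(cell_size=5.0)
index.insert("donor", (0.0, 0.0, 0.0))
index.insert("acceptor", (3.0, 4.0, 0.0))
index.insert("ring", (10.0, 10.0, 10.0))

print(index.within((0.0, 0.0, 0.0), 6.0))
```

A grid is only one option; a database might instead use an R-tree or a k-d tree. The point is that distance queries touch a few cells rather than scanning every atom or feature.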
PDB format dates back to the early Protein Data Bank at Brookhaven (founded 1971); myoglobin (structure solved ~1960) was among the first deposits
Public structure data banks are improving storage and serving of structures
Biotechs need improved storage for proprietary structures too
PDB format is now decades old
RCSB has assumed control of the PDB and is promoting the mmCIF ontology:
    Richer data model
    Better suited to relational DB implementation
    Java API (via CORBA) provided for low-level queries
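As a rough illustration of why mmCIF maps well onto a relational schema: each mmCIF category (here `atom_site`) behaves like a table, and each data item (`_atom_site.Cartn_x`, etc.) like a column. The sketch below loads a small `loop_` block into SQLite. The item names are real mmCIF `atom_site` items, but the coordinate values are made up, the helper functions are hypothetical, and the parser assumes simple whitespace-separated values with no quoting.

```python
import sqlite3

# Toy mmCIF fragment; coordinates are illustrative, not from a real entry.
MMCIF_FRAGMENT = """\
loop_
_atom_site.id
_atom_site.label_atom_id
_atom_site.label_comp_id
_atom_site.label_asym_id
_atom_site.label_seq_id
_atom_site.Cartn_x
_atom_site.Cartn_y
_atom_site.Cartn_z
1 N  MET A 1 10.000 20.000 30.000
2 CA MET A 1 11.500 20.500 30.250
3 C  MET A 1 12.000 21.000 31.500
4 N  LYS A 2 13.250 21.500 31.750
"""


def parse_loop(text):
    """Split one loop_ block into column names and rows (no quoted values handled)."""
    columns, rows = [], []
    for line in text.splitlines():
        line = line.strip()
        if not line or line == "loop_":
            continue
        if line.startswith("_"):
            # "_atom_site.Cartn_x" -> column name "Cartn_x"
            columns.append(line.split(".", 1)[1])
        else:
            rows.append(line.split())
    return columns, rows


def load_atom_site(conn, columns, rows):
    """Create an atom_site table with one TEXT column per mmCIF item and fill it."""
    col_sql = ", ".join(f'"{c}" TEXT' for c in columns)
    conn.execute(f"CREATE TABLE atom_site ({col_sql})")
    placeholders = ", ".join("?" for _ in columns)
    conn.executemany(f"INSERT INTO atom_site VALUES ({placeholders})", rows)


conn = sqlite3.connect(":memory:")
columns, rows = parse_loop(MMCIF_FRAGMENT)
load_atom_site(conn, columns, rows)

# Ordinary SQL now works on structure data: atoms per residue type.
per_residue = conn.execute(
    "SELECT label_comp_id, COUNT(*) FROM atom_site "
    "GROUP BY label_comp_id ORDER BY label_comp_id"
).fetchall()
print(per_residue)
```

The legacy PDB format, by contrast, is a fixed-column text layout, so its fields have to be sliced out by character position before they can be loaded into tables.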
Rapid increase in proprietary structures through high-throughput crystallography