PIR-NREF Database
Non-Redundant REFerence Protein Sequence Database
• Comprehensiveness: PIR-PSD, Swiss-Prot, TrEMBL, RefSeq, GenPept, PDB
• Timeliness: Biweekly Updates (~1,000,000 Sequences)
• Non-Redundancy: by Sequence Identity & Taxonomy (Species)
• Source Attribution: Protein IDs and Names from Underlying Databases, Sequence, Taxonomy, Bibliography
• Related Sequences: Identical Sequences from Different Species, Complete Substring, >=95% Sequence Identity
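
The non-redundancy and related-sequence rules above can be illustrated with a minimal sketch. It is not the PIR-NREF pipeline: the tuple layout, the function names, and the sample entries (IDs and sequences) are all hypothetical, and the >=95% identity relationship is left out because it would need a sequence alignment step. The sketch assumes that two entries are merged only when both the sequence and the species match exactly.

```python
from collections import defaultdict

def build_nref(entries):
    """Merge entries that share an identical sequence AND the same species.

    Each entry is a hypothetical (source_db, protein_id, species, sequence) tuple.
    The source database and ID of every merged entry are kept for attribution.
    """
    merged = defaultdict(list)
    for source_db, protein_id, species, sequence in entries:
        merged[(sequence.upper(), species)].append((source_db, protein_id))
    return dict(merged)

def identical_across_species(nref):
    """Return sequences that occur unchanged in more than one species."""
    species_by_seq = defaultdict(set)
    for sequence, species in nref:
        species_by_seq[sequence].add(species)
    return {seq: sp for seq, sp in species_by_seq.items() if len(sp) > 1}

def complete_substrings(nref):
    """Return (shorter, longer) pairs where the shorter sequence is fully contained."""
    sequences = sorted({seq for seq, _ in nref}, key=len)
    pairs = []
    for i, short in enumerate(sequences):
        for long_seq in sequences[i + 1:]:
            if len(short) < len(long_seq) and short in long_seq:
                pairs.append((short, long_seq))
    return pairs

# Made-up sample data, for illustration only.
entries = [
    ("PIR-PSD", "psd_1", "Homo sapiens", "MKTAYIAKQR"),
    ("Swiss-Prot", "sp_1", "Homo sapiens", "MKTAYIAKQR"),
    ("RefSeq", "rs_1", "Mus musculus", "MKTAYIAKQR"),
    ("TrEMBL", "tr_1", "Homo sapiens", "TAYIAK"),
]

nref = build_nref(entries)
shared = identical_across_species(nref)
contained = complete_substrings(nref)
```

In this sample the two Homo sapiens copies collapse into one entry that keeps both source IDs, while the Mus musculus copy stays separate and is reported as a related sequence rather than merged.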
Applications
• Protein Identification: Full-Scale or Species-Based Sequence Analysis and Text Search
• Detection of Annotation Errors
• Development of Protein Name Ontology
FTP Distribution: XML and FASTA Formats