
Impact of the Protein Data Bank Across Scientific Disciplines
Abstract
The Protein Data Bank archive (PDB) was established in 1971 as the 1st open access digital data resource for biology and medicine. Today, the PDB contains >160,000 atomic-level, experimentally-determined 3D biomolecular structures. PDB data are freely and publicly available for download, without restrictions. Each entry contains summary information about the structure and experiment, atomic coordinates, and in most cases, a citation to a corresponding scientific publication. Individually and in bulk, PDB structures can be downloaded and/or analyzed and visualized online using tools at RCSB.org. As such, it is challenging to understand and monitor reuse of data. Citations of the scientific publications describing PDB structures provide one way of understanding which structures are being used, and in which research areas. Our analysis highlights frequently-cited structures and identifies milestone structures that have demonstrated impact across scientific fields.
DOI: https://doi.org/10.5334/dsj-2020-025 | Journal eISSN: 1683-1470
Language: English
Page range: 25 - 25
Submitted on: Dec 10, 2019
Accepted on: May 8, 2020
Published on: Jun 12, 2020
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year
© 2020 Zukang Feng, Natalie Verdiguel, Luigi Di Costanzo, David S. Goodsell, John D. Westbrook, Stephen K. Burley, Christine Zardecki, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.