
Figure 1
Decision tree for the classification of personal data. This diagram guides researchers and depositors in assigning tags to datasets containing personal data based on their conditions for reuse and compliance with the GDPR and Spanish law. The colour-coded tags (blue, green, yellow, orange, purple and red) indicate different legal bases and limitations for the secure storage, access and reuse of the data in research contexts.
Table 1
The blue to red model for tags categorises datasets based on their risk levels. Datasets with no associated risks fall under the blue tag, while increasing risk levels demand stricter data protection measures and more complex safeguards, with the red tag assigned to datasets of the highest sensitivity and risk.
| TAG TYPE | IDENTIFICATION AND AUTHENTICATION | READ AND DOWNLOAD PERMISSIONS |
|---|---|---|
| Blue | Not necessary | Public access without authentication |
| Green | Registration to the repository is required Implementation of access controls (username and password, certificate, second-factor authentication) Assigned roles with privilege differentiation | Access by registered users In the case of downloading documentation, encrypted with a password |
| Yellow | Registration to the repository and approval by the data depositor is required Implementation of access controls (username and password, certificate, second-factor authentication) Assigned roles with privilege differentiation | Registered users can access the data after authorisation of the depositor In the case of downloading documentation, encryption with a password |
| Orange | Registration to the repository and approval by the data depositor is required Implementation of access controls (username and password, certificate, second-factor authentication) Assigned roles with privilege differentiation Validation according to source IP | Registered users can access the data after authorisation of the depositor In the case of downloading documentation, encryption with a password |
| Purple | Registration to the repository and approval by the data depositor is required Implementation of access controls (username and password, certificate, second-factor authentication) Assigned roles with privilege differentiation Validation according to source IP | Registered users can access the data after authorisation of the depositor In the case of downloading documentation, encryption with a password |
| Red | Registration to the repository and approval by the data depositor is required Implementation of access controls (username and password, certificate, second-factor authentication) Assigned roles with privilege differentiation Validation according to source IP | Access to protected data without permission to download |
| TAG TYPE | STORAGE AND TRANSMISSION | KEY STORAGE |
| Blue | Unencrypted | N.A. |
| Green | Storage: simple encryption Transmission: simple encryption Use secure encryption algorithms | Encryption key stored separately from repository data |
| Yellow | Storage: simple encryption Transmission: simple encryption Use secure encryption algorithms | Encryption key stored separately from repository and depositor data |
| Orange | Storage: double encryption Use secure encryption algorithms | One key is stored separately from the data by the repository, and the other key is stored by a trusted third party |
| Purple | Storage: double encryption Use secure encryption algorithms | One key is stored separately from the data by the repository, and the other key is stored by a trusted third party |
| Red | Storage: double encryption Use secure encryption algorithms | One key is stored separately from the data by the repository, and the other key is stored by a trusted third party |

Figure 2
Screenshots of the interactive classification tool developed to operationalise the decision tree. The tool is currently deployed in Catalan for the local research community at the University of Barcelona.
