Figure 1

Match Results for Zhong Lin Wang_
| Match field | Original dataset size | Reduced dataset size | Percent reduction |
|---|---|---|---|
| Source | 4,810 | 3,560 | 26% |
| Affiliation | 4,810 | 4,147 | 14% |
| Web of Science Category | 4,810 | 4,173 | 13% |
| Co-authors | 4,810 | 4,175 | 13% |
| Title | 4,810 | 3,794 | 21% |
| ISSN | 4,810 | 3,555 | 26% |
| Publication Year | 4,810 | 4,349 | 10% |
| Cited References | 4,810 | 4,347 | 10% |
| 4,810 | 3,349 | 30% | |
| All of the above | 4,810 | 2,894 | 40% |
Match Results for Haesun Park_
| Match field | Original dataset size | Reduced dataset size | Percent reduction |
|---|---|---|---|
| Co-authors | 23,298 | 8,527 | 63% |
| Source | 23,298 | 14,174 | 39% |
| Affiliation (Organization Only) | 23,298 | 20,867 | 10% |
| Title | 23,298 | 4,978 | 79% |
| ISSN | 23,298 | 14,762 | 37% |
| 1st Author | 23,298 | 14,791 | 37% |
| ORCID iD | 23,298 | 3,288 | 86% |
| Researcher ID | 23,298 | 2,903 | 88% |
| Web of Science Category | 23,298 | 21,906 | 6% |
| Publication Year | 23,298 | 23,094 | 1% |
| Country | 23,298 | 23,044 | 1% |
| All of the above | 23,298 | 2,319 | 90% |
Comparison of Disambiguation Results for the Three Cases_
| Porter | Wang | Park | |
|---|---|---|---|
| Top 3 list reduction match | Source (34%), Co-authors | Email (30%), Source | Researcher ID (88%), |
| fields | (32%) and ISSN (31%) | (26%), ISSN (26%) | ORCID (86%), Title (79%) |
| Original dataset size | 3,617 | 4,810 | 23,298 |
| Total list reduction | 52% | 40% | 90% |
| Number of true positives lost | 1 | 5 | 0 |
Match Results for Alan Porter_
| Match field | Original dataset size | Reduced dataset size | Percent reduction |
|---|---|---|---|
| Source | 3,617 | 2,377 | 34% |
| Co-authors | 3,617 | 2,465 | 32% |
| Title | 3,617 | 2,826 | 22% |
| ISSN | 3,617 | 2,486 | 31% |
| Publication Year | 3,617 | 2,905 | 20% |
| Affiliation | 3,617 | NA | NA |
| Cited References | 3,617 | NA | NA |
| 3,617 | NA | NA | |
| All of the above | 3,617 | 1,750 | 52% |