Considerations for using race and ethnicity as quantitative variables in medical education research

Paula T. Ross; Tamera Hart-Johnson; Sally A. Santen; Nikki L. Bibler Zaidi

doi:10.1007/S40037-020-00602-3

Figures & Tables

Table 1

Justifications for including race (R) and ethnicity (E) variables in research

Role of the

R & E variable

Purpose of R & E variable

Sample medical education research question using R & E variables

Grouping

To examine similarities or differences between R or E groups and/or subgroups based on a dependent (outcome) variable

Is there a significant difference in medical students’ access to professional mentors by R or E group?

Mediating

To examine whether R or E explains the relationship between an independent (predictor) and dependent (outcome) variable

Is the association between socioeconomic status and students’ perceptions of the medical school learning environment reduced when R or E are considered?

Moderating

To examine whether the strength of the relationship between an independent (predictor) and dependent variable (outcome) varies by R or E groups

Does the relationship between social support and well-being vary by R or E group?

Table 2

Advantages and disadvantages of various data collection methods

Category type

Advantage(s)

Disadvantage(s)

Example

Multiple-response (exclusive)

categories:

Multiple options provided; respondent can only select ONE pre-established category [33]

Maintains original unit(s) of analysis

Provides more complete and accurate data [34]

Aligns data with most statistical analyses [33]

Permits respondents to

self-report identity and

allows researchers to collect rich data [35]

Provides less data per category which increases the risk of error in interpreting outcomes [36]

Forces respondents into discrete category that does not allow for fluid or broad self-identification [37]

Respondent must select one option from White, African American, American Indian, Alaska Native, and Native Hawaiian

Multiple-response (inclusive)

categories:

Multiple options provided; respondent can select MULTIPLE options from pre-established categories [33]

Introduces issues related to comparability of samples across multiple data sets [38]

Forces researcher to decide how individuals fit into certain categories [37]

Counts multiracial respondents as members of each individual racial or ethnic group they select which inflates the number of respondents in denominator [33]

Respondent may select multiple options from White-non-Hispanic, African American, American Indian, Alaska Native, and Native Hawaiian

Combined categories ^a:

Multiple options combined to define new categories

Simplifies statistical analysis, interpretation and presentation of results [39]

Increases cell size when discrete categories are too small [40]

Limits conclusions to broad assumptions and generalizations about respondents within groups [41, 42]

Perpetuates obsolete majority/minority discourse when using certain binary frameworks (e.g., White/non-White) [34]

Uses subjective labels that can perpetuate bias/stereotypes [43]

Increases the risk of a false positive result [44]

Underestimates the extent of variation between groups by not fully accounting for within group variability [26]

Respondent must select either URiM^b [45] or Non-URiM

^aCombined categories also can represent collapsed and dichotomous categories

^bURiM Underrepresented in medicine

Considerations for using race and ethnicity as quantitative variables in medical education research

Figures & Tables

Table 1

Table 2

Paradigm

My account