Have a personal or library account? Click to login
Accounting for Label Shift of Positive Unlabeled Data under Selection Bias Cover

Accounting for Label Shift of Positive Unlabeled Data under Selection Bias

Open Access
|Sep 2025

Abstract

We consider the scenario when two samples of positive unlabeled (PU) data are available and for the second sample the change in prior probability of classes occurs while distributions of predictors in classes remain the same (label shift setting). The selection of positive elements may be object-dependent. We study the properties of the underlying probabilistic structure under the novel augmented PU scenario, proving in particular that label shift occurs also for unlabeled populations. We introduce and investigate an estimator of prior probability for label-shifted population. Furthermore, in this case we construct and analyze behavior of Bayes classifier in this setting. It turns out to be a Bayes classifier for the unlabeled class with a modified threshold. This gives rise to its three empirical counterparts which are compared on benchmark data sets.

DOI: https://doi.org/10.61822/amcs-2025-0036 | Journal eISSN: 2083-8492 | Journal ISSN: 1641-876X
Language: English
Page range: 507 - 517
Submitted on: Dec 2, 2024
Accepted on: May 11, 2025
Published on: Sep 8, 2025
Published by: University of Zielona Góra
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2025 Jan Mielniczuk, Adam Wawrzeńczyk, published by University of Zielona Góra
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.