Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/43777
Title: A novel privacy-preserving probability transductive classifiers from group probabilities based on regression model
Authors: Jiang, Y
Deng, Z
Choi, KS 
Qian, P
Hu, W
Wang, S
Keywords: Classification
Group probability
Privacy preserving
Probability transductive
Regression model
Issue Date: 2015
Publisher: IOS Press
Source: Journal of intelligent and fuzzy systems, 2015, v. 29, no. 2, p. 917-925
Journal: Journal of intelligent and fuzzy systems 
Abstract: Group probability classifier learning is an emerging and promising technique, especially in privacy-preserving data mining. It trains a classifier from a group probability dataset, in which the class label of each sample is unknown but the probability of each class within each given data group is available. Existing work mainly relies on the inverse calibration (IC) strategy to estimate labels for the data in the group probability dataset and then uses classical classification algorithms, such as the support vector machine (SVM), to train the desired classifier. A critical challenge of the existing IC-based methods is the difficulty of designing an ideal IC function for label estimation; the methods are also sensitive to the IC function adopted. To overcome this shortcoming, a novel probability transductive classifier that does not involve IC in the learning procedure is proposed, in which the probability values are used directly as the outputs of the training data for model training. In particular, because the training outputs are continuous real values, an existing classical regression model can readily be adopted to model the group probability classification problem. For a future test sample, the output of the resulting model represents the probability that the sample belongs to the positive class. With a given threshold, the final class label of the test sample can then be obtained. Experimental results on synthetic datasets and real UCI datasets show that the proposed method is more effective than the existing methods.
URI: http://hdl.handle.net/10397/43777
ISSN: 1064-1246 (Print)
1875-8967 (online)
DOI: 10.3233/IFS-151621
Appears in Collections: Journal/Magazine Article
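
The idea summarised in the abstract (use each group's class probability directly as a regression target, then threshold the model output) can be illustrated with a minimal sketch. This is not the authors' model: the abstract does not say which regression model the paper uses, so ordinary least squares on one feature stands in for it here. The synthetic data, the group fractions, the 0.5 threshold, and all function names are assumptions made for illustration only.

```python
import random


def make_group_probability_data(rng, fractions, group_size):
    """Build synthetic groups (hypothetical data, not from the paper).

    Each sample's regression target is the positive-class fraction of its
    group; the true labels are returned only so accuracy can be checked.
    """
    xs, targets, labels = [], [], []
    for frac in fractions:
        n_pos = round(frac * group_size)
        for y in [1] * n_pos + [0] * (group_size - n_pos):
            # Positive samples cluster around +2, negative ones around -2.
            xs.append(rng.gauss(2.0 if y == 1 else -2.0, 1.0))
            targets.append(n_pos / group_size)
            labels.append(y)
    return xs, targets, labels


def fit_least_squares(xs, targets):
    """Ordinary least squares for one feature: target ~ slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_t = sum(targets) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxt = sum((x - mean_x) * (t - mean_t) for x, t in zip(xs, targets))
    slope = sxt / sxx
    return slope, mean_t - slope * mean_x


def predict_label(x, slope, intercept, threshold=0.5):
    """Threshold the regression output to obtain a class label."""
    return 1 if slope * x + intercept >= threshold else 0


rng = random.Random(0)

# Training: only group-level positive fractions are used, never sample labels.
train_fractions = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
train_x, train_t, _ = make_group_probability_data(rng, train_fractions, 50)
slope, intercept = fit_least_squares(train_x, train_t)

# Evaluation on fresh samples whose true labels are known.
test_x, _, test_y = make_group_probability_data(rng, [0.5] * 4, 50)
correct = sum(predict_label(x, slope, intercept) == y for x, y in zip(test_x, test_y))
accuracy = correct / len(test_y)
print(f"slope={slope:.3f} intercept={intercept:.3f} accuracy={accuracy:.3f}")
```

In this sketch the raw regression output is only a rough score rather than a calibrated probability, so the choice of threshold matters; 0.5 is used because the synthetic classes are balanced.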

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.