A differential privacy algorithm DiffPRFs based on random forests was proposed.Exponential mechanism was used to select split point and split attribute in each decision tree building process
and noise was added according to Laplace mechanism.Differential privacy protection requirement was satisfied through overall process.Compared to existed algorithms
the proposed method does not require pre-discretization of continuous attributes which significantly reduces the performance cost of preprocessing in large multi-dimensional dataset.Classification is achieved conveniently and efficiently while maintains the high accuracy.Experimental results demonstrate the effectiveness and superiority of the algorithm compared to other classification algorithms.
关键词
Keywords
references
DWORK C , . Differential privacy [C ] // The 33rd International Colloquium on Automata,Languages and Programming . Berlin:Spinger-Verlag , 2006 : 1 - 12 .
DWORK C . A firm foundation for private data analysis [J ] . Communications of the ACM , 2011 , 54 ( 1 ): 86 - 95 .
ZHANG X J , MENG X F . Differential privacy in data publication and analysis [J ] . Chinese Journal of Computers , 2014 , 37 ( 4 ): 927 - 949 .
DWORK C , MCSHERRY F , NISSIM K , et al . Calibrating noise to sensitivity in private data analysis [C ] // Theory of Cryptography . Springer Berlin Heidelberg , 2006 : 265 - 284 .
MCSHERRY F , TALWAR K . Mechanism design via differential privacy [C ] // Foundations of Computer Science . 2007 : 94 - 103 .
FAN M MENG X F . Data minify:concepts and techniques [M ] . Beijing China Machine Press , 2012 .
BLUM A , DWORK C , MCSHERRY F , et al . Practical privacy:the SuLQ framework [C ] // The 24th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems . ACM , 2005 : 128 - 138 .
FRIEDMAN A , SCHUSTER A . Data mining with differential privacy [C ] // The 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . ACM , 2010 : 493 - 502 .
MOHAMMED N , CHEN R , FUNG B , et al . Differentially private data release for data mining [C ] // The 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . ACM , 2011 : 493 - 501 .
ZHU T , XIONG P , XIANG Y , et al . An effective deferentially private data releasing algorithm for decision tree [C ] // Trust,Security and Privacy in Computing and Communications (TrustCom),2013 12th IEEE International Conference . IEEE , 2013 : 388 - 395 .
PATIL A , SINGH S . Differential private random forest [C ] // Advances in Computing,Communications and Informatics International Conference . IEEE , 2014 : 2623 - 2630 .
LU G Q , ZHANG X J , DING L P , et al . Frequent sequential pattern mining under differential privacy [J ] . Journal of Computer Research and Development , 2015 , 52 ( 12 ): 2789 - 2801 .
QUINLAN J R , . Induction of decision trees [C ] // Readings in Knowledge Acquisition and Learning . San Francisco:Morgan Kaufmann Publishers Inc , 1993 : 349 - 361 .
MCSHERRY F , . Privacy integrated queries:an extensible platform for privacy-preserving data analysis [C ] // The 2009 ACM SIGMOD International Conference on Management of Data . ACM , 2009 : 19 - 30 .