Aiming at the contradiction between the efficiency and privacy of stochastic gradient descent algorithm in distributed computing environment
a stochastic gradient descent algorithm preserving differential privacy based on MapReduce was proposed.Based on the computing framework of MapReduce
the data were allocated randomly to each Map node and the Map tasks were started independently to execute the stochastic gradient descent algorithm.The Reduce tasks were appointed to update the model when the sub-target update models were meeting the update requirements
and to add Laplace random noise to achieve differential privacy protection.Based on the combinatorial features of differential privacy
the results of the algorithm is proved to be able to fulfill ε-differentially private.The experimental results show that the algorithm has obvious efficiency advantage and good data availability.
关键词
Keywords
references
WU F , LI F G , KUMAR A , et al . Bolt-on differential privacy for scalable stochastic gradient descent-based analytics [C ] // The 2017 ACM International Conference on Management of Data . 2017 : 1307 - 1322 .
ABADI M , CHU A , GOODFELLOW I , et al . Deep learning with differential privacy [C ] // The 2016 ACM SIGSAC Conference on Computer and Communications Security . 2016 : 308 - 318 .
ZHAO P , ZHANG T . Stochastic optimization with importance sampling [J ] . Eprint Arxiv , 2015 : 1 - 9 .
SCHMIDT M , ROUX N L , BACH F . Erratum to:minimizing finite sums with the stochastic average gradient [J ] . Mathematical Programming , 2016 , 162 ( 5 ):1.
MU Y , LIU W , LIU X , et al . Stochastic gradient made stable:a manifold propagation approach for large-scale optimization [J ] . IEEE Transactions on Knowledge & Data Engineering , 2015 , 29 ( 2 ): 458 - 471 .
ZINKEVICH M , WEIMER M , SMOLA A J , et al . Parallelized stochastic gradient descent [C ] // The Conference on Neural Information Processing Systems . 2011 : 2595 - 2603 .
CHEN Z H , LAN Y Y , GUO J F , et al . Distributed stochastic gradient descent with discriminative aggregating [J ] . Chinese Journal of Computers , 2015 , 38 ( 10 ): 2054 - 2063 .
ZHAO H , CANNY J F . Communication-efficient distributed stochastic gradient descent with butterfly mixing [D ] . Berkeley,USA:University of California , 2012 .
SONG S , CHAUDHURI K , SARWATE A D . Stochastic gradient descent with differentially private updates [C ] // Global conference on Signal and Information Processing (GlobalSIP) . 2013 : 245 - 248 .
BASSILY R , THAKURTA A . Private empirical risk minimization:Efficient algorithms and tight error bounds [C ] // 2014 IEEE 55th Annual Symposium on Foundations of Computer Science (FOCS) . 2014 : 464 - 473 .
DWORK C , MCSHERRY F , NISSIM K . Calibrating noise to sensitivity in private data analysis [J ] . The VLDB Endowment , 2006 , 7 ( 8 ): 637 - 648 .
DWORK C , ROTH A . The Algorithmic foundations of differential privacy [M ] . Now Publishers Inc , 2014 .
CHAUDHURI K , MONTELEONI C , SARWATE A D . Differentially private empirical risk minimization [J ] . Journal of Machine Learning Research , 2009 , 12 ( 2 ): 1069 - 1109 .
HE X M , WANG X Y , CHEN H H , et al . Study on choosing the parameter ε in differential privacy [J ] . Journal on Communications , 2015 , 36 ( 12 ): 124 - 130 .
MCSHERRY F D . Privacy integrated queries:an extensible platform for privacy-preserving data analysis [J ] . Communication of the ACM , 2010 , 53 ( 9 ): 89 - 97 .