数据与计算发展前沿 ›› 2023, Vol. 5 ›› Issue (5): 74-97.

CSTR: 32002.14.jfdc.CN10-1649/TP.2023.05.007

doi: 10.11871/jfdc.issn.2096-742X.2023.05.007

• 专刊:数据要素安全高效流通的关键技术 • 上一篇    下一篇

本地化差分隐私综述

孙一帆1,2(),张锐1,2,*(),陶杨1,2,高碧柔1,2,秦诗涵1,2,安超1,2   

  1. 1.中国科学院信息工程研究所,信息安全国家重点实验室,北京 100085
    2.中国科学院大学,网络空间安全学院,北京 100049
  • 收稿日期:2023-05-04 出版日期:2023-10-20 发布日期:2023-10-31
  • 通讯作者: 张锐(E-mail: r-zhang@iie.ac.cn
  • 作者简介:孙一帆,中国科学院信息工程研究所,博士研究生,主要研究方向为数据安全、差分隐私。
    本文中负责文献调研、总结归纳、论文写作等。
    SUN Yifan is currently a Ph.D. student at the Institute of Information Engineering, Chinese Academy of Sciences. Her research interests include data security and differential privacy.
    In this paper, she is responsible for literature research, analyzing and summarizing related work, and paper writing.
    E-mail: sunyifan@iie.ac.cn|张锐,中国科学院信息工程研究所,研究员,博士生导师,主要研究方向为密码学与安全协议、后量子密码、数据安全、云计算安全理论与技术和区块链理论与技术。
    本文中负责写作思路指导和论文修订等。
    ZHANG Rui is a research fellow and Ph.D. supervisor at the Institute of Information Engineering, Chinese Academy of Sciences. His research fields include cryptography and security protocols, post-quantum cryptography, data security, cloud computing security theory and technology, and blockchain theory and technology.
    In this paper, he is responsible for guidance of writing ideas and paper revision.
    E-mail: r-zhang@iie.ac.cn
  • 基金资助:
    国家自然科学基金项目(62172411);国家自然科学基金项目(62172404);国家自然科学基金项目(61972094);国家自然科学基金项目(62202458)

A Survey on Local Differential Privacy

SUN Yifan1,2(),ZHANG Rui1,2,*(),TAO Yang1,2,GAO Birou1,2,QIN Shihan1,2,AN Chao1,2   

  1. 1. SKLOIS, Institute of Information Engineering, Beijing 100085, China
    2. School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2023-05-04 Online:2023-10-20 Published:2023-10-31

摘要:

【目的】 本地化差分隐私是优秀的隐私保护模型,能够在数据共享、发布的场景下对群体进行统计分析,保护个人数据隐私。本文围绕本地化差分隐私进行综述,为未来工作提供参考。【文献范围】 本文调研了来自主流会议、期刊的本地化差分隐私领域的论文,并进行了总结归纳。【方法】 本文以数据统计分析任务类型为线索,从基于本地化差分隐私模型的频率估计、均值估计、多维数据统计分析和机器学习4个方面展开调研。本文对相关研究进行了对比分析,对关键问题进行了总结,对现有工作的不足进行了讨论,对未来的研究方向进行了展望。【结果】 本地化差分隐私模型能够在用户数据被采集、分析时,为用户个人数据隐私提供强有力的隐私保护。【局限】 本文以数据统计分析任务类型为线索,未对图数据相关研究进行总结。【结论】 本地化差分隐私作为一种优秀的隐私保护模型,得到学者们的关注后迅速发展,但是仍然面临着诸多问题和挑战,值得进一步研究和探索。

关键词: 本地化差分隐私, 频率估计, 均值估计, 多维数据统计分析, 机器学习

Abstract:

[Objective] This paper systematically introduces local differential privacy and provides a reference for the protection of personal data privacy under data sharing and publishing. [Coverage] This paper investigates and summarizes papers from mainstream conferences and journals in the field of local differential privacy. [Methods] This paper takes the type of statistical data analysis task as a clue and conducts research based on local differential privacy from four aspects, which concludes frequency estimation, mean estimation, multidimensional data statistical analysis, and machine learning. This paper makes a comparative analysis of relevant studies, summarizes key issues, discusses the shortcomings of existing work, and looks forward to future research directions. [Results] The local differential privacy model can provide strong privacy protection for users' personal data privacy when user data is collected and analyzed. [Limitations] This paper takes the type of statistical data analysis task as a clue and does not summarize the research related to graph data. [Conclusions] Local differential privacy, as an excellent privacy-preserving model, has developed rapidly after gaining the attention of scholars. But it still faces many problems and challenges, which are worthy of further research and exploration.

Key words: local differential privacy, frequency estimation, mean estimation, multidimensional data statistical analysis, machine learning