National Natural Science Foundation of China (No.61572034, No.61806006);Natural Science Foundation for Colleges and Universities in Anhui Province (No.KJ2018A0083, No.KJ2014A061);Major Science and Technology Project of Anhui Province (No.18030901025)
We investigate in this paper how to effectively reduce the risk of privacy leakage caused by refactoring attacks when the sensitive attributes and some quasi-identifier attributes are correlated.Firstly
the mutual information theory is used to find the quasi-identifier attributes which have strong dependence on the sensitive attributes in the original dataset
which provides a theoretical basis for accurately perturbing the data attributes.Secondly
for the correlated attributes and the non-correlated attributes
the invariant random response method is applied to perturb a certain data attribute or a combination of data attributes to satisfy the local
ysis of the impact of data perturbations on privacy leakage probability and data utility is also conducted.Finally
the experiment verifies the effectiveness of the proposed algorithm and its ability to process incremental data.The experimental results demonstrate that the algorithm can achieve a better balance between data utility and privacy protection.