Differential Privacy

In machine learning, an anonymization approach to protect any sensitive data (for example, an individual's personal information) included in a model's training set from being exposed. This approach ensures that the model doesn't learn or remember much about a specific individual. This is accomplished by sampling and adding noise during model training to obscure individual data points, mitigating the risk of exposing sensitive training data.

Differential privacy is also used outside of machine learning. For example, data scientists sometimes use differential privacy to protect individual privacy when computing product usage statistics for different demographics.

Real-world uses

Created for this library

1.
A health-tech startup uses differential privacy when sharing aggregated population statistics with researchers so individuals cannot be re-identified.
2.
A bank uses differential privacy in its internal analytics platform to allow business teams to query customer data without seeing individual records.
3.
A government statistics agency publishes census tables with differential privacy noise so small communities are protected against record linkage.

Back to glossary

Differential Privacy

Real-world uses

Related terms

Loading…

Differential Privacy

Real-world uses

Related terms