Question 1

What is a differential privacy budget (epsilon)?

Accepted Answer

The privacy budget epsilon bounds how much information about any individual record can leak through query responses. Formally, adding or removing one record can change the probability of any output by at most a factor of e^epsilon. An epsilon of 0 means perfect privacy (the output does not depend on any individual record), while larger epsilon values allow more information leakage. As a rule of thumb, epsilon values between 0.1 and 1.0 are considered strong, 1.0 to 10.0 moderate, and values above 10.0 weak; these bands are conventions, not formal thresholds. The budget is consumed with each query: under sequential composition, epsilons add up linearly, so 10 queries each with epsilon 1.0 consume a total budget of 10.0.

Question 2

What is the difference between sequential and parallel composition?

Accepted Answer

Sequential composition applies when queries access the same data records. The total privacy loss is the sum of the individual epsilons. If you run 5 queries each with epsilon 0.5, the total is 2.5. Parallel composition applies when each query runs on a disjoint subset of the data, so any single record influences at most one query. The total privacy loss is then the maximum individual epsilon, not the sum. If you run 5 queries on non-overlapping data partitions each with epsilon 0.5, the total is just 0.5. Where the data can be partitioned this way, parallel composition is far cheaper, which is why privacy-efficient analytics systems are built around it.
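The two rules above can be sketched in a few lines of Python. The function names are illustrative and not taken from any particular library.

```python
def sequential_epsilon(epsilons):
    """Total privacy loss when every query touches the same records: the sum."""
    return sum(epsilons)


def parallel_epsilon(epsilons):
    """Total privacy loss when each query touches a disjoint partition: the max."""
    return max(epsilons, default=0.0)


per_query = [0.5] * 5
print(sequential_epsilon(per_query))  # 2.5
print(parallel_epsilon(per_query))    # 0.5
```

Note that the parallel rule only holds if the partitions really are disjoint. If one record can appear in two partitions, fall back to the sequential sum for those queries.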

Question 3

How does advanced composition with Renyi divergence improve privacy budgets?

Accepted Answer

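Advanced composition provides tighter privacy accounting than naive sequential composition, at the cost of a small failure probability delta. It is a distinct result from Renyi DP and concentrated DP, which are related accounting frameworks that often give tighter bounds still. Instead of adding epsilons linearly, it exploits the fact that cumulative privacy loss grows roughly with the square root of the number of queries. For k queries each with epsilon e, advanced composition gives a total of e * sqrt(2k * ln(1/delta)) + k * e * (e^e - 1), which for small e is dominated by the first term, e * sqrt(2k * ln(1/delta)). The ln(1/delta) factor matters: 100 queries with epsilon 0.1 and delta = 1e-5 cost approximately 5.85 under advanced composition instead of 10.0 under sequential composition, roughly a 1.7x improvement. The simpler estimate e * sqrt(k) = 1.0 (a 10x improvement) drops the ln(1/delta) factor and the second term, so it overstates the savings. For a small number of queries the advanced bound can exceed the sequential sum, in which case the sequential sum still applies.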
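To check the arithmetic, the formula can be evaluated directly. This is a minimal sketch that assumes delta = 1e-5, a value chosen here for illustration. The function names are hypothetical.

```python
import math


def advanced_composition_epsilon(eps, k, delta):
    """Total epsilon for k queries of eps each, with failure probability delta."""
    if not 0 < delta < 1:
        raise ValueError("delta must be strictly between 0 and 1")
    sqrt_term = eps * math.sqrt(2 * k * math.log(1 / delta))
    linear_term = k * eps * (math.exp(eps) - 1)
    return sqrt_term + linear_term


def best_epsilon(eps, k, delta):
    """Sequential composition always remains valid, so take the smaller bound."""
    return min(k * eps, advanced_composition_epsilon(eps, k, delta))


# 100 queries at epsilon 0.1, with the assumed delta = 1e-5
print(round(advanced_composition_epsilon(0.1, 100, 1e-5), 2))  # 5.85
# For only 5 queries the sequential sum (0.5) is the tighter bound
print(round(best_epsilon(0.1, 5, 1e-5), 2))                    # 0.5
```

Taking the minimum of the two bounds is a design choice worth keeping: the square-root bound only wins once k is large enough to outweigh the ln(1/delta) factor.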

Question 4

What happens when the privacy budget is exhausted?

Accepted Answer

When the privacy budget is exhausted, no more queries can be answered without violating the privacy guarantee. In a properly implemented system, any query that would push the cumulative epsilon past the maximum budget is rejected. At that point there are three options: refresh the data (if new data is available), relax the privacy parameters (accepting weaker guarantees), or stop answering queries. Budget exhaustion is permanent for a given dataset: you cannot recover spent budget. This is why careful budget allocation across queries is critical for long-running analytics systems.
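The rejection behaviour described above might look like the following sketch. It assumes plain sequential composition, and the class and exception names are hypothetical rather than taken from any real system.

```python
class BudgetExhausted(Exception):
    """Raised when a query would push spending past the maximum budget."""


class PrivacyBudget:
    def __init__(self, max_epsilon):
        self.max_epsilon = max_epsilon
        self.spent = 0.0

    @property
    def remaining(self):
        return self.max_epsilon - self.spent

    def charge(self, epsilon):
        """Record a query's epsilon, or reject it if the budget cannot cover it."""
        if epsilon <= 0:
            raise ValueError("epsilon must be positive")
        # Small tolerance so float rounding does not reject an exact fit.
        if self.spent + epsilon > self.max_epsilon + 1e-12:
            raise BudgetExhausted(
                f"requested {epsilon}, only {self.remaining} remaining"
            )
        self.spent += epsilon  # spent budget is never refunded
        return self.remaining


budget = PrivacyBudget(max_epsilon=1.0)
budget.charge(0.5)
budget.charge(0.5)
try:
    budget.charge(0.5)
except BudgetExhausted as err:
    print("rejected:", err)
```

The check happens before the query runs, so a rejected query spends nothing. There is deliberately no method to reduce `spent`.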

Question 5

How do I choose the right epsilon value for my queries?

Accepted Answer

Epsilon selection depends on how sensitive the data is and on the acceptable risk level. For healthcare and financial PII, use epsilon 0.1 to 1.0 per query. For general analytics with pseudonymized data, epsilon 1.0 to 5.0 is common. For aggregate statistics on non-sensitive data, epsilon 5.0 to 10.0 may be acceptable. For reference, the US Census Bureau used a total epsilon of about 19.6 for the 2020 Census redistricting data, which was controversial. Apple has reported epsilon values of 2 to 8 for telemetry, and Google RAPPOR is cited at epsilon 2 to 3 for Chrome metrics; both use local differential privacy, so their values are not directly comparable to a per-query budget on a central dataset. Start with the lowest epsilon that provides acceptable data utility, then increase only if analysis quality is insufficient.
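One way to judge "acceptable data utility" is to look at how much noise a given epsilon implies. The sketch below assumes the Laplace mechanism on a counting query with sensitivity 1. The answer above does not name a mechanism, so treat this as one illustrative case.

```python
def laplace_scale(sensitivity, epsilon):
    """Noise scale b of the Laplace mechanism: b = sensitivity / epsilon."""
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    return sensitivity / epsilon


# For Laplace noise, the expected absolute error equals the scale b.
for eps in (0.1, 1.0, 5.0, 10.0):
    b = laplace_scale(1.0, eps)
    print(f"epsilon={eps}: expected absolute error of a count is about {b:.1f}")
```

Under these assumptions, a count at epsilon 0.1 is off by about 10 on average, which is negligible for a count in the millions and ruinous for a count of 20. The size of the true answer therefore matters as much as the epsilon itself.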

Differential Privacy Budget Calculator

Understanding Differential Privacy Budgets

Sequential Composition: The Naive Approach

Parallel Composition: Exploiting Data Partitions

Advanced Composition with Renyi Divergence

Budget Allocation Strategies

Real-World Epsilon Values

Compliance and Audit Requirements

Frequently Asked Questions

Michael Lip