Fugu-MT 論文翻訳(概要): Efficient derandomization of differentially private counting queries

論文の概要: Efficient derandomization of differentially private counting queries

arxiv url: http://arxiv.org/abs/2510.16959v2
Date: Tue, 21 Oct 2025 08:25:19 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:11.926907
Title: Efficient derandomization of differentially private counting queries
Title（参考訳）: 微分プライベートカウントクエリの効率的なデランドマイズ
Authors: Surendra Ghentiyala,
Abstract要約: 2020年国勢調査の異なるプライバシーは90テラバイトのランダムネス[GL20]を必要とした。これは、$mathcalP, dots, MathcalP_d$の述語を満たすデータセットにエントリの数を出力するタスクである。彼らはかなり驚くべき事実を示し、$varepsilon-differentially private mechanism for one counting query requires $O(log d)$ bits of in expectation。ここでは、時間的メカニズムを示します。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Differential privacy for the 2020 census required an estimated 90 terabytes of randomness [GL20], an amount which may be prohibitively expensive or entirely infeasible to generate. Motivated by these practical concerns, [CSV25] initiated the study of the randomness complexity of differential privacy, and in particular, the randomness complexity of $d$ counting queries. This is the task of outputting the number of entries in a dataset that satisfy predicates $\mathcal{P}_1, \dots, \mathcal{P}_d$ respectively. They showed the rather surprising fact that though any reasonably accurate, $\varepsilon$-differentially private mechanism for one counting query requires $1-O(\varepsilon)$ bits of randomness in expectation, there exists a fairly accurate mechanism for $d$ counting queries which requires only $O(\log d)$ bits of randomness in expectation. The mechanism of [CSV25] is inefficient (not polynomial time) and relies on a combinatorial object known as rounding schemes. Here, we give a polynomial time mechanism which achieves nearly the same randomness complexity versus accuracy tradeoff as that of [CSV25]. Our construction is based on the following simple observation: after a randomized shift of the answer to each counting query, the answer to many counting queries remains the same regardless of whether we add noise to that coordinate or not. This allows us to forgo the step of adding noise to the result of many counting queries. Our mechanism does not make use of rounding schemes. Therefore, it provides a different -- and, in our opinion, clearer -- insight into the origins of the randomness savings that can be obtained by batching $d$ counting queries.
Abstract（参考訳）: 2020年国勢調査の異なるプライバシーは90テラバイトのランダムネス[GL20]を必要とした。これらの実践的懸念に動機づけられた[CSV25]は、差分プライバシーのランダム性複雑性、特に$d$のクエリのランダム性複雑性の研究を開始した。これは、それぞれ$\mathcal{P}_1, \dots, \mathcal{P}_d$を満たすデータセットにエントリの数を出力するタスクである。彼らは、$\varepsilon$-differentially private mechanism for one counting query requires $1-O(\varepsilon)$ bits of randomness in expectationという驚くべき事実を示した。 CSV25] のメカニズムは非効率(多項式時間ではない)であり、丸めスキームとして知られる組合せ対象に依存している。ここでは[CSV25]とほぼ同じランダム性複雑性と精度トレードオフを実現する多項式時間機構を提案する。我々の構成は以下の単純な観察に基づいており、各カウントクエリに対する解のランダムなシフトの後、その座標にノイズを加えるかどうかに関わらず、多くのカウントクエリに対する解は同じままである。これにより、多くのカウントクエリの結果にノイズを追加するステップを回避できます。私たちのメカニズムは丸めのスキームを使わない。したがって、クエリを$d$のバッチで取得することで得られるランダムな保存の起源に関する洞察を、別の - そして私たちの意見では -- より明確にします。

論文の概要: Efficient derandomization of differentially private counting queries

関連論文リスト