Structured Reinforcement Learning for Incentivized Stochastic Covert Optimization

Jain, Adit; Krishnamurthy, Vikram

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2405

Computer Science > Machine Learning

Title: Structured Reinforcement Learning for Incentivized Stochastic Covert Optimization

Authors: Adit Jain, Vikram Krishnamurthy

(Submitted on 13 May 2024)

Abstract: This paper studies how a stochastic gradient algorithm (SG) can be controlled to hide the estimate of the local stationary point from an eavesdropper. Such problems are of significant interest in distributed optimization settings like federated learning and inventory management. A learner queries a stochastic oracle and incentivizes the oracle to obtain noisy gradient measurements and perform SG. The oracle probabilistically returns either a noisy gradient of the function} or a non-informative measurement, depending on the oracle state and incentive. The learner's query and incentive are visible to an eavesdropper who wishes to estimate the stationary point. This paper formulates the problem of the learner performing covert optimization by dynamically incentivizing the stochastic oracle and obfuscating the eavesdropper as a finite-horizon Markov decision process (MDP). Using conditions for interval-dominance on the cost and transition probability structure, we show that the optimal policy for the MDP has a monotone threshold structure. We propose searching for the optimal stationary policy with the threshold structure using a stochastic approximation algorithm and a multi-armed bandit approach. The effectiveness of our methods is numerically demonstrated on a covert federated learning hate-speech classification task.

Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2405.07415 [cs.LG]
	(or arXiv:2405.07415v1 [cs.LG] for this version)

Submission history

From: Vikram Krishnamurthy [view email]
[v1] Mon, 13 May 2024 01:29:48 GMT (71kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2405.07415

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Computer Science > Machine Learning

Title: Structured Reinforcement Learning for Incentivized Stochastic Covert Optimization

Submission history