Computer Science and Game Theory
New submissions
[ showing up to 2000 entries per page: fewer  more ]
New submissions for Wed, 30 Nov 22
 [1] arXiv:2211.15836 [pdf, ps, other]

Title: On the Envyfree Allocation of ChoresSubjects: Computer Science and Game Theory (cs.GT)
We study the problem of allocating a set of indivisible chores to three agents, among whom two have additive cost functions, in a fair manner. Two fairness notions under consideration are envyfreeness up to any chore (EFX) and a relaxed notion, namely envyfreeness up to transferring any chore (tEFX). In contrast to the case of goods, the case of chores remain relatively unexplored. In particular, our results constructively prove the existence of a tEFX allocation for three agents if two of them have additive cost functions and the ratio of their highest and lowest costs is bounded by two. In addition, if those two cost functions have identical ordering (IDO) on the costs of chores, then an EFX allocation exists even if the condition on the ratio bound is slightly relaxed. Throughout our entire framework, the third agent is unrestricted besides having a monotone cost function.
 [2] arXiv:2211.15936 [pdf, other]

Title: Finding mixedstrategy equilibria of continuousaction games without gradients using randomized policy networksSubjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
We study the problem of computing an approximate Nash equilibrium of continuousaction game without access to gradients. Such game access is common in reinforcement learning settings, where the environment is typically treated as a black box. To tackle this problem, we apply zerothorder optimization techniques that combine smoothed gradient estimators with equilibriumfinding dynamics. We model players' strategies using artificial neural networks. In particular, we use randomized policy networks to model mixed strategies. These take noise in addition to an observation as input and can flexibly represent arbitrary observationdependent, continuousaction distributions. Being able to model such mixed strategies is crucial for tackling continuousaction games that lack purestrategy equilibria. We evaluate the performance of our method using an approximation of the Nash convergence metric from game theory, which measures how much players can benefit from unilaterally changing their strategy. We apply our method to continuous Colonel Blotto games, singleitem and multiitem auctions, and a visibility game. The experiments show that our method can quickly find highquality approximate equilibria. Furthermore, they show that the dimensionality of the input noise is crucial for performance. To our knowledge, this paper is the first to solve general continuousaction games with unrestricted mixed strategies and without any gradient information.
 [3] arXiv:2211.16143 [pdf, other]

Title: Fair Division with Prioritized AgentsComments: 15 pages, 1 figure; accepted in AAAI'23Subjects: Computer Science and Game Theory (cs.GT)
We consider the fair division problem of indivisible items. It is wellknown that an envyfree allocation may not exist, and a relaxed version of envyfreeness, envyfreeness up to one item (EF1), has been widely considered. In an EF1 allocation, an agent may envy others' allocated shares, but only up to one item. In many applications, we may wish to specify a subset of prioritized agents where strict envyfreeness needs to be guaranteed from these agents to the remaining agents, while ensuring the whole allocation is still EF1. Prioritized agents may be those agents who are envious in a previous EF1 allocation, those agents who belong to underrepresented groups, etc. Motivated by this, we propose a new fairness notion named envyfreeness with prioritized agents "EFPrior", and study the existence and the algorithmic aspects for the problem of computing an EFPrior allocation. With additive valuations, the simple roundrobin algorithm is able to compute an EFPrior allocation. In this paper, we mainly focus on general valuations. In particular, we present a polynomialtime algorithm that outputs an EFPrior allocation with most of the items allocated. When all the items need to be allocated, we also present polynomialtime algorithms for some wellmotivated special cases.
 [4] arXiv:2211.16251 [pdf, other]

Title: Utility Maximizer or Value Maximizer: Mechanism Design for Mixed Bidders in Online AdvertisingAuthors: Hongtao Lv, Zhilin Zhang, Zhenzhe Zheng, Jinghan Liu, Chuan Yu, Lei Liu, Lizhen Cui, Fan WuComments: accepted by AAAI2023Subjects: Computer Science and Game Theory (cs.GT)
Digital advertising constitutes one of the main revenue sources for online platforms. In recent years, some advertisers tend to adopt autobidding tools to facilitate advertising performance optimization, making the classical \emph{utility maximizer} model in auction theory not fit well. Some recent studies proposed a new model, called \emph{value maximizer}, for autobidding advertisers with returnoninvestment (ROI) constraints. However, the model of either utility maximizer or value maximizer could only characterize partial advertisers in realworld advertising platforms. In a mixed environment where utility maximizers and value maximizers coexist, the truthful ad auction design would be challenging since bidders could manipulate both their values and affiliated classes, leading to a multiparameter mechanism design problem. In this work, we address this issue by proposing a payment rule which combines the corresponding ones in classical VCG and GSP mechanisms in a novel way. Based on this payment rule, we propose a truthful auction mechanism with an approximation ratio of $2$ on social welfare, which is close to the lower bound of at least $\frac{5}{4}$ that we also prove. The designed auction mechanism is a generalization of VCG for utility maximizers and GSP for value maximizers.
Crosslists for Wed, 30 Nov 22
 [5] arXiv:2211.15792 (crosslist from cs.LG) [pdf, ps, other]

Title: Provably Efficient Modelfree RL in LeaderFollower MDP with Linear Function ApproximationAuthors: Arnob GhoshSubjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Systems and Control (eess.SY)
We consider a multiagent episodic MDP setup where an agent (leader) takes action at each step of the episode followed by another agent (follower). The state evolution and rewards depend on the joint action pair of the leader and the follower. Such type of interactions can find applications in many domains such as smart grids, mechanism design, security, and policymaking. We are interested in how to learn policies for both the players with provable performance guarantee under a bandit feedback setting. We focus on a setup where both the leader and followers are {\em nonmyopic}, i.e., they both seek to maximize their rewards over the entire episode and consider a linear MDP which can model continuous statespace which is very common in many RL applications. We propose a {\em modelfree} RL algorithm and show that $\tilde{\mathcal{O}}(\sqrt{d^3H^3T})$ regret bounds can be achieved for both the leader and the follower, where $d$ is the dimension of the feature mapping, $H$ is the length of the episode, and $T$ is the total number of steps under the bandit feedback information setup. Thus, our result holds even when the number of states becomes infinite. The algorithm relies on {\em novel} adaptation of the LSVIUCB algorithm. Specifically, we replace the standard greedy policy (as the best response) with the softmax policy for both the leader and the follower. This turns out to be key in establishing uniform concentration bound for the value functions. To the best of our knowledge, this is the first sublinear regret bound guarantee for the Markov games with nonmyopic followers with function approximation.
 [6] arXiv:2211.15804 (crosslist from cs.CR) [pdf, other]

Title: Towards faster settlement in HTLCbased CrossChain Atomic SwapsAuthors: Subhra MazumdarComments: Invited Submission (Security and Privacy) to The Fourth IEEE International Conference on Trust, Privacy and Security in Intelligent Systems, and Applications, 2022, 11 pagesSubjects: Cryptography and Security (cs.CR); Computer Science and Game Theory (cs.GT)
Hashed Timelock (HTLC)based atomic swap protocols enable the exchange of coins between two or more parties without relying on a trusted entity. This protocol is like the American call option without premium. It allows the finalization of a deal within a certain period. This puts the swap initiator at liberty to delay before deciding to proceed with the deal. If she finds the deal unprofitable, she just waits for the timeperiod of the contract to elapse. However, the counterparty is at a loss since his assets remain locked in the contract. The best he can do is to predict the initiator's behavior based on the asset's price fluctuation in the future. But it is difficult to predict as cryptocurrencies are quite volatile, and their price fluctuates abruptly. We perform a game theoretic analysis of HTLCbased atomic crosschain swap to predict whether a swap will succeed or not. From the strategic behavior of the players, we infer that this model lacks fairness. We propose Quick Swap, a twoparty protocol based on hashlock and timelock that fosters faster settlement of the swap. The parties are required to lock griefingpremium along with the principal amount. If the party griefs, he ends up paying the griefingpremium. If a party finds a deal unfavorable, he has the provision to cancel the swap. We prove that Quick Swap is more participantfriendly than HTLCbased atomic swap. Our work is the first to propose a protocol to ensure fairness of atomicswap in a cyclic multiparty setting.
 [7] arXiv:2211.15824 (crosslist from cs.RO) [pdf, other]

Title: CLAS: Coordinating MultiRobot Manipulation with Central Latent Action SpacesSubjects: Robotics (cs.RO); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Neural and Evolutionary Computing (cs.NE)
Multirobot manipulation tasks involve various control entities that can be separated into dynamically independent parts. A typical example of such realworld tasks is dualarm manipulation. Learning to naively solve such tasks with reinforcement learning is often unfeasible due to the sample complexity and exploration requirements growing with the dimensionality of the action and state spaces. Instead, we would like to handle such environments as multiagent systems and have several agents control parts of the whole. However, decentralizing the generation of actions requires coordination across agents through a channel limited to information central to the task. This paper proposes an approach to coordinating multirobot manipulation through learned latent action spaces that are shared across different agents. We validate our method in simulated multirobot manipulation tasks and demonstrate improvement over previous baselines in terms of sample efficiency and learning performance.
 [8] arXiv:2211.15837 (crosslist from cs.LG) [pdf, other]

Title: Survey on SelfSupervised Multimodal Representation Learning and Foundation ModelsAuthors: Sushil ThapaSubjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computer Science and Game Theory (cs.GT)
Deep learning has been the subject of growing interest in recent years. Specifically, a specific type called Multimodal learning has shown great promise for solving a wide range of problems in domains such as language, vision, audio, etc. One promising research direction to improve this further has been learning rich and robust lowdimensional data representation of the highdimensional world with the help of largescale datasets present on the internet. Because of its potential to avoid the cost of annotating largescale datasets, selfsupervised learning has been the de facto standard for this task in recent years. This paper summarizes some of the landmark research papers that are directly or indirectly responsible to build the foundation of multimodal selfsupervised learning of representation today. The paper goes over the development of representation learning over the last few years for each modality and how they were combined to get a multimodal agent later.
 [9] arXiv:2211.16275 (crosslist from stat.ML) [pdf, ps, other]

Title: A survey on multiplayer banditsComments: works released after June 2022 are not considered in this surveySubjects: Machine Learning (stat.ML); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
Due mostly to its application to cognitive radio networks, multiplayer bandits gained a lot of interest in the last decade. A considerable progress has been made on its theoretical aspect. However, the current algorithms are far from applicable and many obstacles remain between these theoretical results and a possible implementation of multiplayer bandits algorithms in real cognitive radio networks. This survey contextualizes and organizes the rich multiplayer bandits literature. In light of the existing works, some clear directions for future research appear. We believe that a further study of these different directions might lead to theoretical algorithms adapted to realworld situations.
Replacements for Wed, 30 Nov 22
 [10] arXiv:2204.04186 (replaced) [pdf, other]

Title: The Complexity of InfiniteHorizon GeneralSum Stochastic GamesComments: accepted at ITCS 2023Subjects: Computer Science and Game Theory (cs.GT); Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Optimization and Control (math.OC)
 [11] arXiv:2210.16395 (replaced) [pdf, ps, other]

Title: Ensure Differential Privacy and Convergence Accuracy in Consensus Tracking and Aggregative Games with Coupling ConstraintsAuthors: Yongqiang WangComments: arXiv admin note: text overlap with arXiv:2209.01486Subjects: Computer Science and Game Theory (cs.GT); Cryptography and Security (cs.CR); Optimization and Control (math.OC)
 [12] arXiv:2211.14670 (replaced) [pdf, other]

Title: Mediated Cheap Talk Design (with proofs)Comments: To be presented at AAAI'23Subjects: Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
 [13] arXiv:2202.06949 (replaced) [pdf, ps, other]

Title: Consensus Division in an Arbitrary RatioComments: Accepted to ITCS 2023Subjects: Computational Complexity (cs.CC); Discrete Mathematics (cs.DM); Computer Science and Game Theory (cs.GT)
[ showing up to 2000 entries per page: fewer  more ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, cs, recent, 2211, contact, help (Access key information)