Computer Science and Game Theory
New submissions
[ showing up to 2000 entries per page: fewer  more ]
New submissions for Tue, 7 Dec 21
 [1] arXiv:2112.02271 [pdf, other]

Title: Cooperation, Retaliation and Forgiveness in Revision GamesSubjects: Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Theoretical Economics (econ.TH)
Revision game is a very new model formulating the situation where players can prepare and revise their actions in advance before a deadline when payoffs are realized. We identify the Limited Retaliation (LR) strategy for revision games which sustains a high level of mutual cooperation and is robust to players' occasional mistakes. The LR strategy stipulates that, (1) players first follow a recommended cooperative plan; (2) if anyone deviates from the plan, the LR player retaliates by using the defection action for a limited duration; (3) after the retaliation, the LR player returns to the cooperative plan. The LR strategy has two good features. First, it is vengeful, in the sense that it deters the opponent from noncooperative action by threatening a retaliation. Second, it is forgiving, because it returns to cooperation after a proper retaliation. The vengeful feature makes it constitute a subgame perfect equilibrium, while the forgiving feature makes it tolerate occasional mistakes. These are in clear contrast to the existing strategies for revision games which all assume players are extremely grim and never forgive. Besides its contribution as a new robust and welfareoptimizing equilibrium strategy, our results about LR strategy can also be used to explain how easy cooperation can happen, and why forgiveness emerges in realworld multiagent interactions.
 [2] arXiv:2112.02337 [pdf, other]

Title: A refined consumer behavior model for energy systems: Application to the pricing and energyefficiency problemsComments: Accepted by Applied EnergySubjects: Computer Science and Game Theory (cs.GT)
The sumutility maximization problem is known to be important in the energy systems literature. The conventional assumption to address this problem is that the utility is concave. But for some key applications, such an assumption is not reasonable and does not reflect well the actual behavior of the consumer. To address this issue, the authors pose and address a more general optimization problem, namely by assuming the consumer's utility to be sigmoidal and in a given class of functions. The considered class of functions is very attractive for at least two reasons. First, the classical NPhardness issue associated with sumutility maximization is circumvented. Second, the considered class of functions encompasses wellknown performance metrics used to analyze the problems of pricing and energyefficiency. This allows one to design a new and optimal inclining block rates (IBR) pricing policy which also has the virtue of flattening the power consumption and reducing the peak power. We also show how to maximize the energyefficiency by a lowcomplexity algorithm. When compared to existing policies, simulations fully support the benefit from using the proposed approach.
 [3] arXiv:2112.02932 [pdf, other]

Title: Indian Kidney Exchange Program: A Game Theoretic PerspectiveComments: 30 pages, 25 figures, 6 tablesSubjects: Computer Science and Game Theory (cs.GT)
We propose a ways in which Kidney exchange can be feasibly, economically and efficiently implemented in Indian medical space, named as Indian Kidney Exchange Program(IKEP) along with Indian specific influences on compatibility and final outcomes. Kidney exchange is a boon for those suffering from renal kidney failure and do have a donor with an incompatible kidney (compatible kidney also encouraged for better matches). In such situations the patient, donor pair is matched to another patient, donor pair having the same problem and are compatible to each other. Hospitals put up their patientdonor data. Using the biological data, compatibility scores(or weights) are generated and preferences are formed accordingly. Indian influences on weights, modify the compatibility scores generated and hence, the preferences. The pairs are then allocated using game theoretic matching algorithms for markets without money.
Crosslists for Tue, 7 Dec 21
 [4] arXiv:2112.02746 (crosslist from cs.MA) [pdf, other]

Title: Unfairness Despite Awareness: GroupFair Classification with Strategic AgentsSubjects: Multiagent Systems (cs.MA); Computers and Society (cs.CY); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
The use of algorithmic decision making systems in domains which impact the financial, social, and political wellbeing of people has created a demand for these decision making systems to be "fair" under some accepted notion of equity. This demand has in turn inspired a large body of work focused on the development of fair learning algorithms which are then used in lieu of their conventional counterparts. Most analysis of such fair algorithms proceeds from the assumption that the people affected by the algorithmic decisions are represented as immutable feature vectors. However, strategic agents may possess both the ability and the incentive to manipulate this observed feature vector in order to attain a more favorable outcome. We explore the impact that strategic agent behavior could have on fair classifiers and derive conditions under which this behavior leads to fair classifiers becoming less fair than their conventional counterparts under the same measure of fairness that the fair classifier takes into account. These conditions are related to the the way in which the fair classifier remedies unfairness on the original unmanipulated data: fair classifiers which remedy unfairness by becoming more selective than their conventional counterparts are the ones that become less fair than their counterparts when agents are strategic. We further demonstrate that both the increased selectiveness of the fair classifier, and consequently the loss of fairness, arises when performing fair learning on domains in which the advantaged group is overrepresented in the region near (and on the beneficial side of) the decision boundary of conventional classifiers. Finally, we observe experimentally, using several datasets and learning methods, that this fairness reversal is common, and that our theoretical characterization of the fairness reversal conditions indeed holds in most such cases.
 [5] arXiv:2112.02792 (crosslist from stat.ML) [pdf, other]

Title: Incentive Compatible Pareto Alignment for MultiSource Large GraphsSubjects: Machine Learning (stat.ML); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
In this paper, we focus on learning effective entity matching models over multisource largescale data. For real applications, we relax typical assumptions that data distributions/spaces, or entity identities are shared between sources, and propose a Relaxed Multisource Largescale Entitymatching (RMLE) problem. Challenges of the problem include 1) how to align largescale entities between sources to share information and 2) how to mitigate negative transfer from joint learning multisource data. What's worse, one practical issue is the entanglement between both challenges. Specifically, incorrect alignments may increase negative transfer; while mitigating negative transfer for one source may result in poorly learned representations for other sources and then decrease alignment accuracy. To handle the entangled challenges, we point out that the key is to optimize information sharing first based on Pareto front optimization, by showing that information sharing significantly influences the Pareto front which depicts lower bounds of negative transfer. Consequently, we proposed an Incentive Compatible Pareto Alignment (ICPA) method to first optimize crosssource alignments based on Pareto front optimization, then mitigate negative transfer constrained on the optimized alignments. This mechanism renders each source can learn based on its true preference without worrying about deteriorating representations of other sources. Specifically, the Pareto front optimization encourages minimizing lower bounds of negative transfer, which optimizes whether and which to align. Comprehensive empirical evaluation results on four largescale datasets are provided to demonstrate the effectiveness and superiority of ICPA. Online A/B test results at a search advertising platform also demonstrate the effectiveness of ICPA in production environments.
 [6] arXiv:2112.02856 (crosslist from cs.LG) [pdf, ps, other]

Title: Optimal NoRegret Learning in Strongly Monotone Games with Bandit FeedbackComments: 40 pages, 3 figuresSubjects: Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Optimization and Control (math.OC)
We consider online noregret learning in unknown games with bandit feedback, where each agent only observes its reward at each time  determined by all players' current joint action  rather than its gradient. We focus on the class of smooth and strongly monotone games and study optimal noregret learning therein. Leveraging selfconcordant barrier functions, we first construct an online bandit convex optimization algorithm and show that it achieves the singleagent optimal regret of $\tilde{\Theta}(\sqrt{T})$ under smooth and stronglyconcave payoff functions. We then show that if each agent applies this noregret learning algorithm in strongly monotone games, the joint action converges in \textit{last iterate} to the unique Nash equilibrium at a rate of $\tilde{\Theta}(1/\sqrt{T})$. Prior to our work, the bestknow convergence rate in the same class of games is $O(1/T^{1/3})$ (achieved by a different algorithm), thus leaving open the problem of optimal noregret learning algorithms (since the known lower bound is $\Omega(1/\sqrt{T})$). Our results thus settle this open problem and contribute to the broad landscape of bandit gametheoretical learning by identifying the first doubly optimal bandit learning algorithm, in that it achieves (up to log factors) both optimal regret in the singleagent learning and optimal lastiterate convergence rate in the multiagent learning. We also present results on several simulation studies  Cournot competition, Kelly auctions, and distributed regularized logistic regression  to demonstrate the efficacy of our algorithm.
 [7] arXiv:2112.02884 (crosslist from cs.AI) [pdf, other]

Title: Invitation in Crowdsourcing ContestsSubjects: Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Theoretical Economics (econ.TH)
In a crowdsourcing contest, a requester holding a task posts it to a crowd. People in the crowd then compete with each other to win the rewards. Although in real life, a crowd is usually networked and people influence each other via social ties, existing crowdsourcing contest theories do not aim to answer how interpersonal relationships influence peoples' incentives and behaviors, and thereby affect the crowdsourcing performance. In this work, we novelly take peoples' social ties as a key factor in the modeling and designing of agents' incentives for crowdsourcing contests. We then establish a new contest mechanism by which the requester can impel agents to invite their neighbours to contribute to the task. The mechanism has a simple rule and is very easy for agents to play. According to our equilibrium analysis, in the Bayesian Nash equilibrium agents' behaviors show a vast diversity, capturing that besides the intrinsic ability, the social ties among agents also play a central role for decisionmaking. After that, we design an effective algorithm to automatically compute the Bayesian Nash equilibrium of the invitation crowdsourcing contest and further adapt it to large graphs. Both theoretical and empirical results show that, the invitation crowdsourcing contest can substantially enlarge the number of contributors, whereby the requester can obtain significantly better solutions without a large advertisement expenditure.
 [8] arXiv:2112.03112 (crosslist from physics.socph) [pdf, other]

Title: A Synergy of Institutional Incentives and Networked Structures in Evolutionary Game Dynamics of Multiagent SystemsComments: 6 pages, 3 figuresSubjects: Physics and Society (physics.socph); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Populations and Evolution (qbio.PE)
Understanding the emergence of prosocial behaviours (e.g., cooperation and trust) among selfinterested agents is an important problem in many disciplines. Network structure and institutional incentives (e.g., punishing antisocial agents) are known to promote prosocial behaviours, when acting in isolation, one mechanism being present at a time. Here we study the interplay between these two mechanisms to see whether they are independent, interfering or synergetic. Using evolutionary game theory, we show that punishing antisocial agents and a regular networked structure not only promote prosocial behaviours among agents playing the trust game, but they also interplay with each other, leading to interference or synergy, depending on the game parameters. Synergy emerges on a wider range of parameters than interference does. In this domain, the combination of incentives and networked structure improves the efficiency of incentives, yielding prosocial behaviours at a lower cost than the incentive does alone. This has a significant implication in the promotion of prosocial behaviours in multiagent systems.
 [9] arXiv:2112.03178 (crosslist from cs.AI) [pdf, other]

Title: Player of GamesAuthors: Martin Schmid, Matej Moravcik, Neil Burch, Rudolf Kadlec, Josh Davidson, Kevin Waugh, Nolan Bard, Finbarr Timbers, Marc Lanctot, Zach Holland, Elnaz Davoodi, Alden Christianson, Michael BowlingSubjects: Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
Games have a long history of serving as a benchmark for progress in artificial intelligence. Recently, approaches using search and learning have shown strong performance across a set of perfect information games, and approaches using gametheoretic reasoning and learning have shown strong performance for specific imperfect information poker variants. We introduce Player of Games, a generalpurpose algorithm that unifies previous approaches, combining guided search, selfplay learning, and gametheoretic reasoning. Player of Games is the first algorithm to achieve strong empirical performance in large perfect and imperfect information games  an important step towards truly general algorithms for arbitrary environments. We prove that Player of Games is sound, converging to perfect play as available computation time and approximation capacity increases. Player of Games reaches strong performance in chess and Go, beats the strongest openly available agent in headsup nolimit Texas hold'em poker (Slumbot), and defeats the stateoftheart agent in Scotland Yard, an imperfect information game that illustrates the value of guided search, learning, and gametheoretic reasoning.
Replacements for Tue, 7 Dec 21
 [10] arXiv:2106.03278 (replaced) [pdf, other]

Title: Coordinating Followers to Reach Better Equilibria: EndtoEnd Gradient Descent for Stackelberg GamesSubjects: Computer Science and Game Theory (cs.GT)
 [11] arXiv:2111.13253 (replaced) [pdf, ps, other]

Title: A Simple and Tight Greedy OCRSAuthors: Vasilis LivanosComments: 14 pagesSubjects: Data Structures and Algorithms (cs.DS); Computer Science and Game Theory (cs.GT)
[ showing up to 2000 entries per page: fewer  more ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, cs, recent, 2112, contact, help (Access key information)