[theory students] Theory seminar Thursday (tomorrow) 9:45-10:45am: Why Does My Self-driving Car Keep Crashing? Attacks and Defenses for Reinforcement Learning.

Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

Date:	Thu, 13 Mar 2025 01:49:27 +0000
From:	Sandeep Silwal <silwal@xxxxxxxxxxx>
Subject:	[theory students] Theory seminar Thursday (tomorrow) 9:45-10:45am: Why Does My Self-driving Car Keep Crashing? Attacks and Defenses for Reinforcement Learning.

Hi everyone,

Tomorrow we will have our very on Jeremy talking about reinforcement learning. The location is our usual CS 3310.

Title: Why Does My Self-driving Car Keep Crashing? Attacks and Defenses for Reinforcement Learning.

Abstract: To ensure the usefulness of Reinforcement Learning (RL) in real systems, they must be robust to noise and adversarial attacks. In the subfield of adversarial RL, an external attacker can manipulate the agent's interaction with the environment. Unfortunately, the literature has devised many clever attack approaches, and even the most subtle attacks can lead to disastrous outcomes. In addition, we show that maximally devastating attacks are computable in polynomial time. Fortunately, we show that optimal defenses correspond to an equilibrium of an appropriately defined game. Although these games are NP-hard to approximate in general, we exploit the structure of most attack surfaces to enable efficient solutions via mutual recursion. Moreover, victims can strengthen their defenses by improving their attack detection ability.

See you there!

Best,

Sandeep

[← Prev in Thread]	Current Thread	[Next in Thread→]
[theory students] Theory seminar Thursday (tomorrow) 9:45-10:45am: Why Does My Self-driving Car Keep Crashing? Attacks and Defenses for Reinforcement Learning., Sandeep Silwal <= Re: [theory students] Theory seminar Thursday (tomorrow) 9:45-10:45am: Why Does My Self-driving Car Keep Crashing? Attacks and Defenses for Reinforcement Learning., Sandeep Silwal

Previous by Date:	Re: [theory students] [Theory of Computing Seminar] Theory seminar Thursday (tomorrow) with Cookies! 9:45-10:45am, Sandeep Silwal
Next by Date:	Re: [theory students] Theory seminar Thursday (tomorrow) 9:45-10:45am: Why Does My Self-driving Car Keep Crashing? Attacks and Defenses for Reinforcement Learning., Sandeep Silwal
Previous by Thread:	Re: [theory students] [theory faculty] Theory seminar Thursday (April 10) with cookies! 9:45-10:45am, Sandeep Silwal
Next by Thread:	Re: [theory students] Theory seminar Thursday (tomorrow) 9:45-10:45am: Why Does My Self-driving Car Keep Crashing? Attacks and Defenses for Reinforcement Learning., Sandeep Silwal
Indexes:	[Date] [Thread]

Mailing List Archives

Authenticated access

[theory students] Theory seminar Thursday (tomorrow) 9:45-10:45am: Why Does My Self-driving Car Keep Crashing? Attacks and Defenses for Reinforcement Learning.