Stats and Bytes

🎩 Top 5 Security and AI Reads - Week #5

Web agents transcending API boundaries, network security through foundation models, adversarial unlearning for safety benefits, backdooring RL agents' actions, and open problems in mechanistic interpretability

Josh Collyer
Feb 02, 2025

Welcome to the fifth installment of the Stats and Bytes Top 5 Security and AI Reads weekly newsletter. We're starting with an API-based web agent that challenges traditional browser-centric agents. Next, we'll examine netFound, a promising foundation model designed specifically for network security. We'll then turn to research on the effectiveness of machine unlearning, followed by a comprehensive survey of open problems in mechanistic interpretability from leading researchers in the field. Finally, we'll round things off with UNIDOOR, which demonstrates how backdoor attacks targeting an agent's actions can be implemented in deep reinforcement learning systems.

A highly detailed digital illustration of a neural network architecture being unraveled like a tapestry, with glowing backdoors hidden in its layers, hyperrealistic 4K, intricate circuit patterns, cyberpunk aesthetic, iridescent threads of code weaving through dark portals, reminiscent of Tron legacy, deep learning visualiza…
