Keynote Talk AI Control Hackathon with Tyler Tracy and Buck Shlegeris
Apart - Safe AI
124 views • Streamed 3 weeks ago
Women in AI Safety Hackathon Keynote Talk with Nakshathra Suresh
Apart - Safe AI
98 views • Streamed 1 month ago
Hacktalk with Joshua Landes
Apart - Safe AI
108 views • Streamed 1 month ago
AI interpretability with Myra Deng
Apart - Safe AI
195 views • Streamed 1 month ago
Intro to AI Assurance Technology with Kristian Rönn
Apart - Safe AI
106 views • Streamed 3 months ago
Accelerating AI Safety with Finn Metz
Apart - Safe AI
100 views • Streamed 3 months ago
Can an LLM Hack You? - Benchmarking Cybersecurity - Andrey Anurin
Apart - Safe AI
418 views • 11 months ago
Uncovering Limitations of LLM Memory Editing - Jason Hoelscher-Obermaier
Apart - Safe AI
327 views • 1 year ago
Neuron to Graph - Alex Foote
Apart - Safe AI
126 views • 1 year ago
Do Models Cheat on Tests? - Jacob Haimes
Apart - Safe AI
70 views • 9 months ago
Interpretability Hackathon 0.0 Keynote w/ Neel Nanda
662 views • 2 years ago
Mechanistic Interpretability 1.0 Hackathon - Neel Nanda
578 views • 1 year ago
Can an LLM Hack You? - Benchmarking Cybersecurity - Andrey Anurin
418 views • 11 months ago
Bad LLM Agents - Simon Lermen
333 views • 11 months ago
Uncovering Limitations of LLM Memory Editing - Jason Hoelscher-Obermaier
327 views • 1 year ago
Safety Testing for AGI Systems - Bo Li
307 views • 10 months ago
Engineering a World Designed for Safe Superintelligence
117 views • 12 days ago
Award Ceremony: Reprogramming AI Models Hackathon
99 views • 3 months ago
Researcher Spotlight: Jacob Haimes
111 views • 4 months ago
Jacques Thibodeau: Mastering Cursor
150 views • 4 months ago
Chandler Smith on the Concordia Contest
79 views • 5 months ago
Introduction to Research Augmentation for Alignment - Jacques Thibodeau
111 views • 8 months ago