Apart - Safe AI

👩‍💻 Alignment Jam Talks

Play all

Talks from Apart Research's Alignment Jam hackathons and research sprints.

Keynote Talk AI Control Hackathon with Tyler Tracy and Buck Shlegaris

Apart - Safe AI

124 viewsStreamed 3 weeks ago

Women in AI Safety Hackathon Keynote Talk with Nakshathra Suresh

Apart - Safe AI

98 viewsStreamed 1 month ago

Hacktalk with Joshua Landes

Apart - Safe AI

108 viewsStreamed 1 month ago

AI interpretability with Myra Deng

Apart - Safe AI

195 viewsStreamed 1 month ago

Intro to AI Assurance Technology with Kristian Rönn

Apart - Safe AI

106 viewsStreamed 3 months ago

Accelerating AI Safety with Finn Metz

Apart - Safe AI

100 viewsStreamed 3 months ago

👩‍🔬 Apart Lab Talks

Play all

Talks about the projects that come out of the research labs of the Alignment Jam research workshops.

Can an LLM Hack You? - Benchmarking Cybersecurity - Andrey Anurin

Apart - Safe AI

418 views11 months ago

Uncovering Limitations of LLM Memory Editing - Jason Hoelscher-Obermaier

Apart - Safe AI

327 views1 year ago

Neuron to Graph - Alex Foote

Apart - Safe AI

126 views1 year ago

Do Models Cheat on Tests? - Jacob Haimes

Apart - Safe AI

70 views9 months ago

Videos

Engineering a World Designed for Safe Superintelligence

117 views12 days ago

Award Ceremony: Reprogramming AI Models Hackathon

99 views3 months ago

Researcher Spotlight: Jacob Haimes

111 views4 months ago

Jacques Thibodeau: Mastering Cursor

150 views4 months ago

Chandler Smith on the Concordia Contest

79 views5 months ago

Introduction to Research Augmentation for Alignment - Jacques Thibodeau

111 views8 months ago

Apart - Safe AI

👩‍💻 Alignment Jam Talks

Play all

Keynote Talk AI Control Hackathon with Tyler Tracy and Buck Shlegaris

Women in AI Safety Hackathon Keynote Talk with Nakshathra Suresh

Hacktalk with Joshua Landes

AI interpretability with Myra Deng

Intro to AI Assurance Technology with Kristian Rönn

Accelerating AI Safety with Finn Metz

👩‍🔬 Apart Lab Talks

Play all

Can an LLM Hack You? - Benchmarking Cybersecurity - Andrey Anurin

Uncovering Limitations of LLM Memory Editing - Jason Hoelscher-Obermaier

Neuron to Graph - Alex Foote

Do Models Cheat on Tests? - Jacob Haimes

Popular videos

Interpretability Hackathon 0.0 Keynote w/ Neel Nanda

Mechanistic Interpretability 1.0 Hackathon - Neel Nanda

Can an LLM Hack You? - Benchmarking Cybersecurity - Andrey Anurin

Bad LLM Agents - Simon Lermen

Uncovering Limitations of LLM Memory Editing - Jason Hoelscher-Obermaier

Safety Testing for AGI Systems - Bo Li

Videos

Engineering a World Designed for Safe Superintelligence

Award Ceremony: Reprogramming AI Models Hackathon

Researcher Spotlight: Jacob Haimes

Jacques Thibodeau: Mastering Cursor

Chandler Smith on the Concordia Contest

Introduction to Research Augmentation for Alignment - Jacques Thibodeau