Safety

Interpretability

Understanding what models learn and why they behave as they do

Verification, specification, and provable guarantees for AI systems

What does 'safe' mean, and what are we actually afraid of?

AI and ML basics — gradient descent, backprop, loss functions, and more