Interpretability
Understanding what models learn and why they behave as they do
Understanding what models learn and why they behave as they do
Verification, specification, and provable guarantees for AI systems
What does 'safe' mean, and what are we actually afraid of?
AI and ML basics — gradient descent, backprop, loss functions, and more