AI Governance

Ethical Hacking of AI

The practice of systematically testing AI systems for vulnerabilities, biases, and failure modes with the goal of improving safety and robustness before malicious actors find the same weaknesses.

Why It Matters

Ethical hacking of AI is an emerging discipline that combines traditional security testing with AI-specific attacks like prompt injection and adversarial examples.

Example

A security team probing an AI customer service bot for: prompt injection vulnerabilities, data leakage risks, bias in responses, and harmful output generation.

Think of it like...

Like hiring a locksmith to test your locks — they try everything a burglar would, but they report vulnerabilities to you instead of exploiting them.

Related Terms