Policy example to block harmful content using classifier rule
classifier
rule. This rule helps prevent the AI from using or responding with specific words or phrases that are deemed inappropriate or sensitive. For more details, see our Rules Catalog.
classifier
"hate speech, harassment, sexual content, self-harm
fail
(to flag when a blocked topic is detected)