Policy example to block harmful content using classifier rule
To implement a classifier for topics and names, you can use the classifier rule. This rule helps prevent the AI from using or responding with specific words or phrases that are deemed inappropriate or sensitive. For more details, see our Rules Catalog.
classifier
"hate speech, harassment, sexual content, self-harm"
fail (to flag when a blocked topic is detected)

Here’s an example of a policy to implement a classifier:
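As a rough illustration of what such a policy expresses, the Python sketch below models it as a plain dictionary. The field names (`rule`, `topics`, `on_match`), the `evaluate` helper, and the `"pass"` result are hypothetical and are not taken from the Rules Catalog. The keyword matcher is only a toy stand-in for a real topic classifier.

```python
from typing import Callable, Dict, List

# Hypothetical policy structure: the field names are illustrative
# assumptions, not the product's actual schema.
policy: Dict[str, object] = {
    "rule": "classifier",
    "topics": ["hate speech", "harassment", "sexual content", "self-harm"],
    "on_match": "fail",  # result to report when a blocked topic is detected
}


def keyword_classifier(text: str, topics: List[str]) -> List[str]:
    """Toy stand-in for a real classifier: naive case-insensitive substring match."""
    lowered = text.lower()
    return [topic for topic in topics if topic in lowered]


def evaluate(
    text: str,
    policy: Dict[str, object],
    classify: Callable[[str, List[str]], List[str]] = keyword_classifier,
) -> str:
    """Return the policy's on_match value if any blocked topic is detected.

    Returns "pass" otherwise; "pass" is an assumed counterpart to "fail".
    """
    detected = classify(text, list(policy["topics"]))
    return str(policy["on_match"]) if detected else "pass"


if __name__ == "__main__":
    print(evaluate("This message contains harassment.", policy))
    print(evaluate("What is the weather like today?", policy))
```

A real deployment would pass a model-backed function as `classify`, since substring matching misses paraphrases and flags harmless mentions of a topic.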