metric.uncertainty
rule, a semantic entropy based metric that generate different responses for the same input, and compute the entropy of the responses.
Rule structure:
- type:
metric.uncertainty
- expected:
fail
(to flag when the uncertainty is high) - threshold: Confidence level for uncertainty detection (e.g., 0.8 for 80% confidence)
Required input
system prompt
is provided, and the model
, and other relevant model parameters
is set up on application level.