metric.uncertainty rule, a semantic entropy based metric that generate different responses for the same input, and compute the entropy of the responses.
Rule structure:
- type:
metric.uncertainty - expected:
fail(to flag when the uncertainty is high) - threshold: Confidence level for uncertainty detection (e.g., 0.8 for 80% confidence)
Required input
system prompt is provided, and the model, and other relevant model parameters is set up on application level.
