metric.uncertainty rule, a semantic entropy based metric that generate different responses for the same input, and compute the entropy of the responses.
Rule structure:
- type: metric.uncertainty
- expected: fail(to flag when the uncertainty is high)
- threshold: Confidence level for uncertainty detection (e.g., 0.8 for 80% confidence)
Required input
system prompt is provided, and the model, and other relevant model parameters is set up on application level.
