We do show the Bayesian Risk (or expected loss) for each metric, which is often used as the stopping condition.
There are usually many variables and external factors that I don't think the stopping decision can be entirely left up to an algorithm. For example, if one metric is up, one is down. Or the metrics are slightly down, but you believe the lower maintenance cost will be worth it.