metric-divergence
Experiments where different evaluation metrics disagree, such as one metric improving while the promotion or deployment metric regresses.
Loading postsā¦
Experiments where different evaluation metrics disagree, such as one metric improving while the promotion or deployment metric regresses.