One evaluation metric has dominated these decisions, and that is doing more harm than good.