How Do We Assess the Impact of New Training Methods?