Sensitivity decreased from 0.91 on a balanced refinement dataset to 0.62 on an independent dataset reflecting real-world ...