Objective To explore two methods of sample size estimation in multi-reader multi-case study of radiological diagnostic test and realize them by software. Methods Demonstration programs were conducted in R software using the Van Dyke dataset, calculating combinations of readers and cases using the OR and DBM methods. These serve as pilot test results for multi-reader multi-case studies, providing a reference for parameter settings in subsequent formal experiments. Results When the effect size was 0.044, 6 readers and 247 cases could yield 0.80 power, while with an effect size of 0.088, only 6 readers and 44 cases were needed to reach 80.5% power. The sample sizes calculated using the OR method and the DBM method were consistent, and the same sample size calculation results could be obtained through conversion between the two methods. Conclusion For the estimation of sample size in multi-reader multi-case studies, R software provides a convenient and mature software package for sample size estimation using multi-reader multi-case designs in radiological diagnostic tests, thereby offering a reference for selecting appropriate sample size estimation and statistical analysis methods in radiological diagnostic tests.
Citation:
WAN Huiqin, XIANG Man, PAN Zhemin, QIN Yingyi, HE Qian, HE Jia. Introduction to the sample size estimation methods of multi-reader and multi-case design in radiological diagnostic test and software implementation. Chinese Journal of Evidence-Based Medicine, 2025, 25(5): 555-561. doi: 10.7507/1672-2531.202407148
Copy
Copyright © the editorial department of Chinese Journal of Evidence-Based Medicine of West China Medical Publisher. All rights reserved
1. |
|
2. |
|
3. |
Obuchowski NA, Rockette HE. Hypothesis testing of diagnostic accuracy for multiple readers and multiple tests: an ANOVA approach with dependent observations. Commun Stat Simul Comput. 1995, 24(2): 285-308.
|
4. |
|
5. |
|
6. |
Platisa L, Vansteenkiste E, Goossens B, et al. Optimization of medical imaging display systems: using the channelized hotelling observer for detecting lung nodules - experimental study. Proceedings of SPIE Medical Imaging, 2009.
|
7. |
|
8. |
|
9. |
|
10. |
FDA. Clinical performance assessment: considerations for computer-assisted detection devices applied to radiology images and radiology device applied to radiology images and radiology device data in premarket notification (510(k)) submissions: guidance for Industry and Food and Drug Administration Staff.
|
11. |
国家药品监督管理局医疗器械技术审评中心. 深度学习辅助决策医疗器械软件审评要点(2019年第7号). 2019.
|
12. |
国家药品监督管理局. 乳腺X射线系统注册技术审查指导原则(2021年第42号). 2021.
|
13. |
|
14. |
|
15. |
|
16. |
|
17. |
|
18. |
|
19. |
Lenth RV. Some practical guidelines for effective sample size determination. Am Stat. 2001, 55(3): 187-193.
|
20. |
|
21. |
|
22. |
Efron B, Tibshirani RJ. An introduction to the bootstrap. statistics and applied probability. Chapman & Hall/CRC, 1993.
|
23. |
|
24. |
|
25. |
|
26. |
|
27. |
|
28. |
Van Dyke CW. Cine MRI in the diagnosis of thoracic aortic dissection. 79th RSNA Meetings. 1993.
|
29. |
|
30. |
|
31. |
|
- 1.
- 2.
- 3. Obuchowski NA, Rockette HE. Hypothesis testing of diagnostic accuracy for multiple readers and multiple tests: an ANOVA approach with dependent observations. Commun Stat Simul Comput. 1995, 24(2): 285-308.
- 4.
- 5.
- 6. Platisa L, Vansteenkiste E, Goossens B, et al. Optimization of medical imaging display systems: using the channelized hotelling observer for detecting lung nodules - experimental study. Proceedings of SPIE Medical Imaging, 2009.
- 7.
- 8.
- 9.
- 10. FDA. Clinical performance assessment: considerations for computer-assisted detection devices applied to radiology images and radiology device applied to radiology images and radiology device data in premarket notification (510(k)) submissions: guidance for Industry and Food and Drug Administration Staff.
- 11. 国家药品监督管理局医疗器械技术审评中心. 深度学习辅助决策医疗器械软件审评要点(2019年第7号). 2019.
- 12. 国家药品监督管理局. 乳腺X射线系统注册技术审查指导原则(2021年第42号). 2021.
- 13.
- 14.
- 15.
- 16.
- 17.
- 18.
- 19. Lenth RV. Some practical guidelines for effective sample size determination. Am Stat. 2001, 55(3): 187-193.
- 20.
- 21.
- 22. Efron B, Tibshirani RJ. An introduction to the bootstrap. statistics and applied probability. Chapman & Hall/CRC, 1993.
- 23.
- 24.
- 25.
- 26.
- 27.
- 28. Van Dyke CW. Cine MRI in the diagnosis of thoracic aortic dissection. 79th RSNA Meetings. 1993.
- 29.
- 30.
- 31.