IMR Press / FBL / Volume 27 / Issue 7 / DOI: 10.31083/j.fbl2707198
Open Access Original Research
COVIDx-US: An Open-Access Benchmark Dataset of Ultrasound Imaging Data for AI-Driven COVID-19 Analytics
Show Less
1 Digital Technologies Research Centre, National Research Council Canada, Montreal, QC H3T 2B2, Canada
2 Department of Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada
3 Digital Technologies Research Centre, National Research Council Canada, Ottawa, ON K1K 2E1, Canada
4 Department of Emergency Medicine, McGill University, Montreal, QC H4A 3J1, Canada
5 Oakville Trafalgar Memorial Hospital, McMaster University, Oakville, ON L6M 0L8, Canada
6 Waterloo Artificial Intelligence Institute, Waterloo, ON N2L 3G1, Canada
*Correspondence: ashkan.ebadi@nrc-cnrc.gc.ca (Ashkan Ebadi)
Academic Editor: Xudong Huang
Front. Biosci. (Landmark Ed) 2022, 27(7), 198; https://doi.org/10.31083/j.fbl2707198
Submitted: 1 December 2021 | Revised: 1 June 2022 | Accepted: 6 June 2022 | Published: 24 June 2022
Copyright: © 2022 The Author(s). Published by IMR Press.
This is an open access article under the CC BY 4.0 license.
Abstract

Background: The Coronavirus Disease 2019 (COVID-19) pandemic continues to have a devastating effect on the health and well-being of the global population. Apart from the global health crises, the pandemic has also caused significant economic and financial difficulties and socio-physiological implications. Effective screening, triage, treatment planning, and prognostication of outcome play a key role in controlling the pandemic. Recent studies have highlighted the role of point-of-care ultrasound imaging for COVID-19 screening and prognosis, particularly given that it is non-invasive, globally available, and easy-to-sanitize. COVIDx-US Dataset: Motivated by these attributes and the promise of artificial intelligence tools to aid clinicians, we introduce COVIDx-US, an open-access benchmark dataset of COVID-19 related ultrasound imaging data. The COVIDx-US dataset was curated from multiple data sources and its current version, i.e., v1.5., consists of 173 ultrasound videos and 21,570 processed images across 147 patients with COVID-19 infection, non-COVID-19 infection, other lung diseases/conditions, as well as normal control cases. Conclusions: The COVIDx-US dataset was released as part of a large open-source initiative, the COVID-Net initiative, and will be continuously growing, as more data sources become available. To the best of the authors’ knowledge, COVIDx-US is the first and largest open-access fully-curated benchmark lung ultrasound imaging dataset that contains a standardized and unified lung ultrasound score per video file, providing better interpretation while enabling other research avenues such as severity assessment. In addition, the dataset is reproducible, easy-to-use, and easy-to-scale thanks to the well-documented modular design.

Keywords
ultrasound imaging
curated dataset
open-access
COVID-19
artificial intelligence
Figures
Fig. 1.
Share
Back to top