Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning

dc.contributor.authorKeswani, Rajesh N.
dc.contributor.authorByrd, Daniel
dc.contributor.authorGarcia Vicente, Florencia
dc.contributor.authorHeller, J. Alex
dc.contributor.authorKlug, Matthew
dc.contributor.authorMazumder, Nikhilesh R.
dc.contributor.authorWood, Jordan
dc.contributor.authorYang, Anthony D.
dc.contributor.authorEtemadi, Mozziyar
dc.contributor.departmentSurgery, School of Medicine
dc.date.accessioned2024-10-22T14:17:10Z
dc.date.available2024-10-22T14:17:10Z
dc.date.issued2021
dc.description.abstractBackground and study aims: Storage of full-length endoscopic procedures is becoming increasingly popular. To facilitate large-scale machine learning (ML) focused on clinical outcomes, these videos must be merged with the patient-level data in the electronic health record (EHR). Our aim was to present a method of accurately linking patient-level EHR data with cloud stored colonoscopy videos. Methods: This study was conducted at a single academic medical center. Most procedure videos are automatically uploaded to the cloud server but are identified only by procedure time and procedure room. We developed and then tested an algorithm to match recorded videos with corresponding exams in the EHR based upon procedure time and room and subsequently extract frames of interest. Results: Among 28,611 total colonoscopies performed over the study period, 21,170 colonoscopy videos in 20,420 unique patients (54.2 % male, median age 58) were matched to EHR data. Of 100 randomly sampled videos, appropriate matching was manually confirmed in all. In total, these videos represented 489,721 minutes of colonoscopy performed by 50 endoscopists (median 214 colonoscopies per endoscopist). The most common procedure indications were polyp screening (47.3 %), surveillance (28.9 %) and inflammatory bowel disease (9.4 %). From these videos, we extracted procedure highlights (identified by image capture; mean 8.5 per colonoscopy) and surrounding frames. Conclusions: We report the successful merging of a large database of endoscopy videos stored with limited identifiers to rich patient-level data in a highly accurate manner. This technique facilitates the development of ML algorithms based upon relevant patient outcomes.
dc.eprint.versionFinal published version
dc.identifier.citationKeswani RN, Byrd D, Garcia Vicente F, et al. Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning. Endosc Int Open. 2021;9(2):E233-E238. doi:10.1055/a-1326-1289
dc.identifier.urihttps://hdl.handle.net/1805/44144
dc.language.isoen_US
dc.publisherThieme
dc.relation.isversionof10.1055/a-1326-1289
dc.relation.journalEndoscopy International Open
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.sourcePMC
dc.subjectLarge-scale machine learning (ML)
dc.subjectColonoscopy videos
dc.subjectElectronic health record (EHR)
dc.titleAmalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Keswani2021Amalgamation-CCBYNCND.pdf
Size:
796.61 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.04 KB
Format:
Item-specific license agreed upon to submission
Description: