Reliability and Responsiveness of Endoscopic Disease Activity Assessment in Eosinophilic Esophagitis

Abstract

Background and Aims Endoscopic outcomes have become important measures of eosinophilic esophagitis (EoE) disease activity, including as an endpoint in randomized controlled trials (RCTs). We evaluated the operating properties of endoscopic measures for use in EoE RCTs. Methods Modified Research and Development/University of California Los Angeles (RAND/UCLA) appropriateness methods and a panel of 15 international EoE experts identified endoscopic items/definitions with face validity, which were used in a 2-round voting process to define simplified (all items graded absent/present) and expanded versions (additional grades for edema, furrows, and/or exudates) of the EoE Endoscopic Reference Score (EREFS). Inter- and intra-rater reliability of these instruments (expressed as intraclass correlation coefficients [ICC]), were evaluated using paired endoscopy video assessments of two blinded central readers before and after 8 weeks of proton pump inhibitors, swallowed topical corticosteroids, or dietary elimination. Responsiveness was measured using the standardized effect size (SES). Results The appropriateness of 41 statements relevant to EoE endoscopic activity (endoscopic items, item definitions/grading, and other considerations relevant for endoscopy) was considered. The original and expanded EREFS demonstrated moderate-to-substantial inter-rater reliability (ICCs 0.472-0.736, and 0.469-0.763, respectively) and moderate-to-almost perfect intra-rater reliability (ICCs 0.580-0.828, and 0.581-0.828, respectively). Strictures were least reliably assessed (ICCs 0.072-0.385). The original EREFS was highly responsive (SES 1.126 [95% CI 0.757, 1.534]), although both expanded versions of EREFS, scored based on worst affected area, were numerically most responsive to treatment (expanded furrows, SES 1.229 [95% CI: 0.858, 1.643]; all items expanded, SES 1.252 [95% CI: 0.880, 1.667]). The EREFS and its modifications were not more reliably scored by segment, and also not more responsive when proximal and distal EREFS scores were summed. Conclusions EREFS and its modifications were reliable and responsive, and the original or expanded versions of the EREFS may be preferred in RCTs. Disease activity scored based on the worst affected area optimizes reliability and responsiveness.

Description
item.page.description.tableofcontents
item.page.relation.haspart
Cite As
Ma, C., Bredenoord, A. J., Dellon, E. S., Alexander, J. A., Biedermann, L., Hogan, M., Guizzetti, L., Zou, G., Katzka, D. A., Chehade, M., Falk, G. W., Furuta, G. T., Gupta, S. K., Kagalwalla, A. F., Schoepfer, A. M., Miehlke, S., Moawad, F. J., Peterson, K., Gonsalves, N. P., … Hirano, I. (2022). Reliability and Responsiveness of Endoscopic Disease Activity Assessment in Eosinophilic Esophagitis. Gastrointestinal Endoscopy, S0016510722000827. https://doi.org/10.1016/j.gie.2022.01.014
ISSN
0016-5107
Publisher
Series/Report
Sponsorship
Major
Extent
Identifier
Relation
Journal
Gastrointestinal Endoscopy
Rights
Publisher Policy
Source
Author
Alternative Title
Type
Article
Number
Volume
Conference Dates
Conference Host
Conference Location
Conference Name
Conference Panel
Conference Secretariat Location
Version
Author's manuscript
Full Text Available at
This item is under embargo {{howLong}}