Ma, ChristopherBredenoord, Albert J.Dellon, Evan S.Alexander, Jeffrey A.Biedermann, LucHogan, MalcolmGuizzetti, LeonardoZou, GuangyongKatzka, David A.Chehade, MirnaFalk, Gary W.Furuta, Glenn T.Gupta, Sandeep K.Kagalwalla, Amir F.Schoepfer, Alain M.Miehlke, StephanMoawad, Fouad J.Peterson, KathrynGonsalves, Nirmala P.Straumann, AlexWechsler, Joshua B.Rémillard, JulieShackelton, Lisa M.Almonte, Hector S.Feagan, Brian G.Jairath, VipulHirano, Ikuo2022-03-102022-03-102022Ma, C., Bredenoord, A. J., Dellon, E. S., Alexander, J. A., Biedermann, L., Hogan, M., Guizzetti, L., Zou, G., Katzka, D. A., Chehade, M., Falk, G. W., Furuta, G. T., Gupta, S. K., Kagalwalla, A. F., Schoepfer, A. M., Miehlke, S., Moawad, F. J., Peterson, K., Gonsalves, N. P., … Hirano, I. (2022). Reliability and Responsiveness of Endoscopic Disease Activity Assessment in Eosinophilic Esophagitis. Gastrointestinal Endoscopy, S0016510722000827. https://doi.org/10.1016/j.gie.2022.01.0140016-5107https://hdl.handle.net/1805/28135Background and Aims Endoscopic outcomes have become important measures of eosinophilic esophagitis (EoE) disease activity, including as an endpoint in randomized controlled trials (RCTs). We evaluated the operating properties of endoscopic measures for use in EoE RCTs. Methods Modified Research and Development/University of California Los Angeles (RAND/UCLA) appropriateness methods and a panel of 15 international EoE experts identified endoscopic items/definitions with face validity, which were used in a 2-round voting process to define simplified (all items graded absent/present) and expanded versions (additional grades for edema, furrows, and/or exudates) of the EoE Endoscopic Reference Score (EREFS). Inter- and intra-rater reliability of these instruments (expressed as intraclass correlation coefficients [ICC]), were evaluated using paired endoscopy video assessments of two blinded central readers before and after 8 weeks of proton pump inhibitors, swallowed topical corticosteroids, or dietary elimination. Responsiveness was measured using the standardized effect size (SES). Results The appropriateness of 41 statements relevant to EoE endoscopic activity (endoscopic items, item definitions/grading, and other considerations relevant for endoscopy) was considered. The original and expanded EREFS demonstrated moderate-to-substantial inter-rater reliability (ICCs 0.472-0.736, and 0.469-0.763, respectively) and moderate-to-almost perfect intra-rater reliability (ICCs 0.580-0.828, and 0.581-0.828, respectively). Strictures were least reliably assessed (ICCs 0.072-0.385). The original EREFS was highly responsive (SES 1.126 [95% CI 0.757, 1.534]), although both expanded versions of EREFS, scored based on worst affected area, were numerically most responsive to treatment (expanded furrows, SES 1.229 [95% CI: 0.858, 1.643]; all items expanded, SES 1.252 [95% CI: 0.880, 1.667]). The EREFS and its modifications were not more reliably scored by segment, and also not more responsive when proximal and distal EREFS scores were summed. Conclusions EREFS and its modifications were reliable and responsive, and the original or expanded versions of the EREFS may be preferred in RCTs. Disease activity scored based on the worst affected area optimizes reliability and responsiveness.en-USPublisher Policycentral readingendoscopyesophagogastroduodenoscopyoperating propertiesoutcome measuresReliability and Responsiveness of Endoscopic Disease Activity Assessment in Eosinophilic EsophagitisArticle