A Perceptual Evaluation of Music Real-Time Communication Applications

dc.contributor.author: Goot, Dana Kemack
dc.contributor.author: Chaubey, Harshit
dc.contributor.author: Hsu, Timothy Y.
dc.contributor.author: Deal, William Scott
dc.contributor.department: Music and Arts Technology, School of Engineering and Technology
dc.date.accessioned: 2024-02-14T20:05:04Z
dc.date.available: 2024-02-14T20:05:04Z
dc.date.issued: 2023-04-28
dc.description.abstract: Music Real-time Communication applications (M-RTC) enable music making (musiking) for musicians simultaneously across geographic distance. When used for musiking, M-RTC such as Zoom and JackTrip, require satisfactorily received acoustical perception of the transmitted music to the end user; however, degradation of audio can be a deterrent to using M-RTC for the musician. Specific to the audio quality of M-RTC, we evaluate the quality of the audio, or the Quality of Experience (QoE), of five network music conferencing applications through quantitative perceptual analysis to determine if the results are commensurate with data analysis. The ITU-R BS.1534-3 MUlti Stimulus test with Hidden Reference and Anchor (MUSHRA) analysis is used to evaluate the perceived audio quality of the transmitted audio files in our study and to detect differences between the transmitted audio files and the hidden reference file. A comparison of the signal-to-noise ratio (SNR) and total harmonic distortion (THD) analysis to the MUSHRA analysis shows that the objective metrics may indicate that SNR and THD are factors in perceptual evaluation and may play a role in perceived audio quality; however, the SNR and THD scores do not directly correspond to the MUSHRA analysis and do not adequately represent the preferences of the individual listener. Since the benefits of improved M-RTC continue to be face-to-face communication, face-to-face musiking, reduction in travel costs, and depletion of travel time, further testing with statistical analysis of a larger sample size can provide the additional statistical power necessary to make conclusions to that end.
dc.eprint.version: Final published version
dc.identifier.citation: Goot, D. K., Chaubey, H., Hsu, T. Y., & Deal, W. S. (2023). A Perceptual Evaluation of Music Real-Time Communication Applications. IEEE Access, 11, 46860–46870. https://doi.org/10.1109/ACCESS.2023.3271525
dc.identifier.uri: https://hdl.handle.net/1805/38520
dc.language.iso: en_US
dc.publisher: IEEE
dc.relation.isversionof: 10.1109/ACCESS.2023.3271525
dc.relation.journal: IEEE Access
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.source: Publisher
dc.subject: MUSHRA
dc.subject: music real-time communications (M-RTC)
dc.subject: networked music
dc.subject: perceived audio quality
dc.subject: perceptual evaluation
dc.subject: quality of experience (QoE)
dc.subject: signal to noise ratio (SNR)
dc.subject: telematic
dc.subject: total harmonic distortion (THD)
dc.subject: web RTC
dc.title: A Perceptual Evaluation of Music Real-Time Communication Applications
dc.type: Article
Files
Original bundle
Name: Goot2023APerceptual-CCBYNCND.pdf
Size: 1.51 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.99 KB
Format: Item-specific license agreed upon to submission