Clarity in Motion Phase-1 Perceptual Study of Speech Intelligibility
CIM PhI PercX
Speech Intelligibility and Listening Effort
2 other identifiers
interventional
72
1 country
1
Brief Summary
Participants received a bilateral pure-tone hearing screen administered by the research team. All potential participants who failed the hearing screen were provided with information about its meaning and referral for further audiological testing. Participants who passed the hearing screen and other inclusion criteria were divided into 6 groups, each of which were presented with 144 stimuli equally distributed among processing conditions. Listeners choose a comfortable listening level using supplied headphones and were able to control the rate of presentation. Following a short practice session, listeners were be asked to transcribe each target sentence. The intelligibility of each stimulus was estimated by determining the mean percentage of content words correctly transcribed. After transcription, listeners were asked for two qualitative judgments: (1) the "clarity" of the stimulus, and (2) the "listening effort" involved. The quality of each stimulus was estimated by the median quality judgment, and the effort likewise. Listening sessions were located in a quiet room and presentation was controlled by the Superlab presentation software program. The Stimuli consisted of audio recordings of target spondaic words embedded in a carrier sentence produced by a male and a female native speaker of American English recorded under quiet conditions. Each stimulus presented to the listeners for identification was either unmasked pristine speech or speech that had been processed in one of five ways with different mixtures of noise and sensor movement. The latter are identified as QoS Levels 1-5. Collectively, the estimates of word intelligibility, clarity, and listening effort under the different conditions shed light on the effectiveness with which the tested algorithm preserves listener intelligibility with acceptable effort and quality.
Trial Health
Trial Health Score
Automated assessment based on enrollment pace, timeline, and geographic reach
participants targeted
Target at P75+ for not_applicable healthy
Started Aug 2024
Shorter than P25 for not_applicable healthy
1 active site
Health score is calculated from publicly available data and should be used for screening purposes only.
Trial Relationships
Click on a node to explore related trials.
Study Timeline
Key milestones and dates
Study Start
First participant enrolled
August 1, 2024
CompletedPrimary Completion
Last participant's last visit for primary outcome
August 31, 2024
CompletedStudy Completion
Last participant's last visit for all outcomes
August 31, 2024
CompletedFirst Submitted
Initial submission to the registry
June 5, 2025
CompletedFirst Posted
Study publicly available on registry
June 13, 2025
CompletedJune 25, 2025
June 1, 2025
1 month
June 5, 2025
June 19, 2025
Conditions
Outcome Measures
Primary Outcomes (3)
Target Word Transcription Accuracy
For each audio stimulus of a target word in a carrier phrase, the subject transcribed the target word they heard (by typing that word in a field in the display. Their transcription was later graded as correct or incorrect by a researcher.
The subject transcribed the target word immediately after hearing the carrier phrase.
Listening Effort
Subject-reported effort required to identify word in carrier phrase, reported by moving a slider on a 0% to 100% scale.
Immediately after hearing the carrier phrase.
Naturalness of speech
Subject-reported perceived naturalness of target word in carrier phrase, reported by moving a slider on a 0% to 100% scale.
Immediately after hearing the carrier phrase.
Study Arms (12)
Group 1A
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 1B
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 2A
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 2B
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 3A
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 3B
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 4A
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 4B
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 5A
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 5B
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 6A
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Group 6B
EXPERIMENTALSubjects in this Group listened and responded to a unique set of stimulus/treatment pairs.
Interventions
Speech stimuli recorded using non-moving speakers and mics. No masking sources present. No BSS applied to multi-channel recordings. Very high output QoS values.
Speech stimuli recorded using non-moving speakers and mics. All masking sources present. No speech separation or extraction methods applied to multi-channel recordings. Very low output QoS values.
Speech stimuli recorded using non-moving speakers and mics. All masking sources present. Joint ACES scrubbing of both noise sources applied to multi-channel recordings. Very high output QoS values.
Speech stimuli recorded using linearly moving speech source and stationary masking sources and mics. All masking sources present. Joint ACES scrubbing of both noise sources applied to multi-channel recordings. Moderately high output QoS values.
Mixed speech and noise sources recorded using a stationary speech source, a stationary noise source, and a linearly moving noise source. A valid source hypothesis of the speech source is used to extract the speech source. High output QoS values.
Mixed speech and noise sources recorded using all stationary sources, and a linearly moving microphone (mic 1). Joint ACES scrubbing of both noise sources is used to reduce the response of Mic 1 to a residue of speech. Low output QoS values.
Eligibility Criteria
You may not qualify if:
- No special populations requiring special protections will be utilized in this study. Potential participants who fail the hearing screen will be provided with results and appropriate clinical referral as per the IRB protocol.
Contact the study team to confirm eligibility.
Sponsors & Collaborators
Study Sites (1)
University of Cincinnati
Cincinnati, Ohio, 45221, United States
Study Officials
- PRINCIPAL INVESTIGATOR
Richard S Goldhor, PhD
Speech Technology & Applied Research Corp.
Study Design
- Study Type
- interventional
- Phase
- not applicable
- Allocation
- RANDOMIZED
- Masking
- SINGLE
- Who Masked
- PARTICIPANT
- Purpose
- BASIC SCIENCE
- Intervention Model
- CROSSOVER
- Sponsor Type
- INDUSTRY
- Responsible Party
- SPONSOR
Study Record Dates
First Submitted
June 5, 2025
First Posted
June 13, 2025
Study Start
August 1, 2024
Primary Completion
August 31, 2024
Study Completion
August 31, 2024
Last Updated
June 25, 2025
Record last verified: 2025-06