Effectiveness of Visual Biofeedback in Speech Training of

Effectiveness of Visual Biofeedback in Speech Training of

Effectiveness of Visual Biofeedback in Speech Training of Children with Hearing Impairment Elizabeth Reid, BSLT and Emily Lin, PhD Department of Communication Disorders, University of Canterbury, Christchurch, New Zealand Abstract The effectiveness of spectrograms in speech training of hearing-impaired children was examined and compared to traditional therapy approaches. Subjective and objective analyses suggested that spectrograms were effective in improving particular speech targets. The temporal and spectral properties of speech produced by the subjects were also examined and acoustic cues were identified which were related to the perceived accuracy of their speech productions. These results have the potential to provide clues to the type of compensatory feedback needed in therapy. 100 80 60 % Targets Correct 40 DFC CCR Goldman-Fristoe Probe List 20 0 Introduction Baseline Traditional Training Visual Training Follow-up Percentage of targets correct for subject 1 Minimal change was seen with the Goldman Fristoe recordings, which was likely due to the small number of tokens for each target in the recording. For the probe list, Subject 1 showed no improvement in target processes with traditional training, however a clear trend of improvement was seen with visual training. The Goldman Fristoe recordings for Subjects 2 and 3 showed minimal change in target accuracy scores over the training period, which was likely due to the small number of tokens as well as their high accuracy scores pre-training. 100 80 are accepted by clinicians as an effective treatment tool. Therefore the main objective of this study was to evaluate, using objective measures, the effectiveness of spectrograms compared to traditional speech training approaches for hearing-impaired children. The second objective of the study was to describe the temporal behaviour and formant characteristics of speech produced by hearingimpaired children and examine how the acoustic properties are related to the perceived accuracy of their speech production. The majority of studies describing the speech production of hearing-impaired children has been confined to perceptual analysis of phonetic and phonologic errors and acoustic analyses of temporal aspects of the speech signal. A recent study by Uchanski & Geers (2003) used spectral moment analysis to examine the acoustic energy characteristics of fricatives spoken by hearing-impaired children. Their study provided an interesting basis for further exploration of hearing impaired children's consonant production. 60 Subject 2 Subject 3 % Targets Correct 40 /a/ Measures of VOT displayed a downward trend for all three subjects, indicating reduced VOT over the training period. For subject 1, a reduction in VOT was seen immediately with traditional training, however the trend was variable making comparisons between training approaches difficult. 110 100 Time (ms) 90 1000 1000 0 0 200 120 100 80 40 70 20 400 600 F1 (Hz) 800 1000 1200 Baseline Treatment VOT for subjects 2 & 3 200 180 160 140 120 Length (msec) target /fl/ untreated control /pr/ 100 80 60 40 20 0 Pre-Tx Traditional Therapy Visual Therapy Post-Tx Consonant cluster length for subject 1 Subject 1 showed an increase in consonant cluster length for the trained /fl/ target with traditional training. During the visual training period, the length was maintained at a similar level with a slight drop in length over the period. Measures for the untreated control were variable over the training period suggesting no treatment effect. Final consonant length for subjects 2 showed a positive upward trend over the training period, however improvement were not maintained in the follow-up recording, indicating lack of maintenance. For subject 3, only three measures were taken of final consonant length, which showed a reduction in length, however the small number of recordings is likely to affect reliability. An increase in vowel space was seen for subjects 1 & 2 following the training period, 2800 /i/ Subject 1 Subject 2 while subject 3 showed a slight decrease in 2600 Subject 3 vowel space. The increase for subject 1 was 2400 attributed most to an increase in the range 2200 F2 of F2 productions, while the increase for 2000 subject 2 was due to the increase in the 1800 /a/ range of F1 productions. Calculation of the 1600 /u/ 1400 vowel working space area encompassing /i/, 1200 /a/, and /u/ showed a smaller working space 200 400 600 800 1000 1200 1400 area for incorrect productions than for F1 correct productions. There was a reduction Vowel space pre & post-treatment for each subject in vowel space for subject 3, which may have been due to the small number of recordings taken. 3000 /v/ /b/ /z/ /s/ /f/ /ch/ /k/ /s/ /sh/ /k/ /t/ /p/ /b/ /dg/ /n/ /p/ /m/ /d/ /n//z/ 0 1000 M1 2000 3000 4000 Moment 1 (mean) and Moment 2 (standard deviation) for consonant productions perceived as correct vs incorrect. Vowel space for correct and incorrect vowel productions. Most incorrect consonant productions consistently exhibited lower M1 values than correct consonant productions, which covered a

greater frequency range. All fricatives had M1 values lower than those reported for normal hearing speakers Fry (2001). Iincorrectly produced fricatives exhibited lower M2 values and incorrectly produced plosives higher M2 values than those of their correct counterparts, indicating that incorrectly produced fricatives and plosives tended to deviate from a normal pattern, Discussion 0 Visual Therapy Traditional Therapy Subject 2 Subject 3 VOT (ms) 60 80 Baseline /g/ 2000 /u/ /t/ /th/ /thi/ M2 /a/ /u/ Those vowel productions perceived to be correct (ABS = 194237) had larger vowel spaces compared to those perceived as incorrect (ABS = 125738). 120 /dg/ 3000 20 Percentage of deletion of final consonant /g/ /v/ 1500 Method A total of 180 values (3 pitch levels X 2 vowels X 3 groups X 10 subjects) for each measure were submitted to a one-way Analysis of Variances (ANOVA) to determine whether the three subject groups differed on each measure. 4000 /d/ Correct Productions Incorrect Productions /i/ 2000 F2 (Hz) Voice Onset Time for subject 1 Results Correct productions Incorrect Productions 2500 targets correct for subjects 2 & 3 60 Subjects: 3 subjects (S1=12y; S2=9y; S3=7y) with bilateral moderate-severe sensorineural hearing losses. Instrumentation: Hheadset microphone (AKG C420, Austria), mixer (Eurorack MX602A, Behringer), 12- bit A/D converter (National Instrument DAQCard-AI-16E-4, USA), SCB-68 68-pin shielded connector box, with a low-passed filter (cutoff frequency = 20 KHz), laptop installed with TF32 (Paul Milenkovic, 2000) & PRATT (Boersma & Weenink, 2005). Procedure: Recordings were done in a quiet room with the microphone 5 cm from the mouth. Initial recordings of the Goldman Fristoe Test of Articulation were obtained. Commonly occurring error processes were identified for each subject. Training targets were chosen (S1=Deletion of Final Consonant (DFC) and Consonant Cluster Reduction (CCR); S2=DFC; S3=DFC). Probe lists were developed for each target and were recorded throughout the training period. 30mins treatment sessions were carried out over 12 weeks for subject 1, 4 weeks for subject 2, and 2 weeks for subject 3. Subject 1 received traditional therapy followed by visual therapy; subjects 2 and 3 received visual therapy only. Traditional therapy involved verbal instruction with visual & tactile cues. Visual therapy used spectrogram displays of a correct production which the subjects were required to match using real-time pitch and intensity displays, and then judge their accuracy. (picture ***) Subjective analysis: phonemic transcriptions of each recording. Acoustic analysis: vowel and consonant lengths, F1 and F2, and spectral moments 1(mean, indicating ), 2 (standard deviation, indicating ), 3 (skewness, indicating ) and 4 (kurtosis indicating). Statistical Analysis: 5000 /i/ It is well known that the limitations of a hearing-impaired childs perceptual system can prevent them from perceiving differences in sounds, resulting in speech production that is delayed or disordered (Ruffin-Simon, 1983). To compensate for this lack of access to auditory cues, there has been a substantial increase in the development of real-time visual feedback displays such as spectrograms. Sspectrograms provide a visual representation of the frequency, intensity, and time domains of an acoustic signal (Ertmer & Maki, 2000). Unlike many other visual feedback devices that provide feedback on a single dimension of speech, spectrographic displays can provide many segmental and suprasegmental speech features simultaneously. Spectrographic displays (SDs) provide immediate and objective feedback, allowing a child to compare his/her own speech production with a correct visual template from the clinician (Dagenais, Critz-Crosby, Fletcher & McCutheon, 1994). Despite the growing interest in visual feedback tools, there have been few studies that have objectively examined the effectiveness of such devices. More research on their effectiveness is needed before they 3000 220 Subject 2 Subject 3 200 180 Length (msec) 160 Individually, all three subjects showed positive but different effects of training with spectrograms. The acoustic measures were more sensitive than subjective measures in identifying changes and highlighting differences in training approaches. VOT for all three subjects reduced over the training period. VOT length provides an important cue for the phonemic contrast between voiced plosives and their voiceless counterparts. The distinction requires fast movements of the articulators and good coordination of motor control between the larynx and upper articulators. Therefore the reduction in VOT indicates that visual training has improved all subjects coordination of phonation and articulation, which is likely to result in improved intelligibility. Temporal measures showed an increase in consonant cluster length for the trained target /fl/ for subject 1, but no improvement for the untrained target /pr/. This suggests that subject 1s awareness and production of the two components of the consonant cluster has improved, however further treatment is necessary to facilitate generalisation to other consonant clusters. Subject 2, showed an increase in final consonant length over the training period suggesting an improved awareness and production of final consonants. Conversely, These results suggest that visual training is effective in improving subjects awareness of the targets and their production accuracy. Subject 3 showed a negative decrease in final consonant length. This may be due to the small number of measures taken or the fact that he only received one session of training. Although vowels were not targeted, subjects 1 & 2 showed an increase in vowel space following visual training. This appeared to be largely due to the improved production range of Formant 2 for S1 and Formant 1 for S2. A reduced vowel space area represents a restriction of tongue elevation and front-back tongue movement (Liu Tso & Kuhl, 2005). Therefore the improved vowel space following training suggests that subjects 1 & 2 were producing a greater range of formant frequencies, resulting in greater distinction between vowels. Subject 3 showed a decrease in vowel space following the training period, which may be due to the shorter training period he experienced compared to the other two subjects. A number of acoustic properties were found which differentiated the correct and incorrect speech productions. 140 120 100 Post-Treatment Final consonant length for subjects 2&3 Baseline Treatment Final consonant length for subjects 2 & 3 Vowel Space PreTraining PostTraining Demaris 292025 301551 Blake -2921.17 114829 Jack 114829 148405.5 Perception of vowel accuracy was found to be related to an increased vowel space as well as shorter vowel durations. Researchers (Monsen, 1974; Gulian et al., 1983) have identified vowel prolongation as one of the speech characteristics of the hearing-impaired. In this study, vowel durations for incorrect productions were prolonged compared to those for correct productions. A smaller vowel space was seen for incorrect productions, indicating a more restricted articulation range than for correct productions. This result is similar to Angelocci et al.s (1964) comparison between hearing-impaired and normal-hearing speakers in that the vowel space derived from normal data was larger than that from the abnormal comparison groups. This result suggests that training aimed at the expansion of vowel space could be potentially beneficial to improve the speech intelligibility of hearing-impaired children. Perception of consonant accuracy was most closely related to VOT for plosives, and moment 1 (mean) and moment 3 (skewness) for fricatives, affricates and plosives. Correct plosive consonant productions contained a normal range of VOT measures, however incorrect productions were more variable and many were prolonged outside these ranges. As discussed previously, VOT is an important cue for the voicedvoiceless distinction. These results show that a reduced VOT improves perceptual intelligibility of speech production. M1 values for incorrect consonant productions tended to be much lower than those for correct productions, suggesting that tongue placement was more posterior in incorrect Conclusion productions. Since the M1 measure appeared to be sensitive in Investigation of the effectiveness of spectrographic displays suggested spectrograms can enhance the awareness improve differentiating correct and incorrect consonant productions, it could bethat used in clinical application to provide feedback and in speech thetraining production of particular speech targets that children with hearing impairment would otherwise miss with traditional training. and monitor progress. *** other moments?? Results of the acoustic-perceptual investigation highlighted the usefulness of acoustic analysis in establishing a link between the hearing-impaired childrens production and perceptual deficits and thus providing clues to the type of compensatory feedback needed for aural rehabilitation. Results also emphasize the importance of using acoustic measures in research, as they are able to Acknowledgements: This research is part of a Masters thesis which is currently being completed by the first author and directed by provide more detailed information and more sensitive to changes compared to subjective measures. the second author at the University of Canterbury. Support for this research was provided by the Oticon Foundation New Zealand. % DFC targets correct for subjects 2 &3 % targets correct for subject 1 VOT for subjects 2 & 3 Consonant cluster length Final consonant length VOT Subject 1

Recently Viewed Presentations

  • It's been a Long Nine Months

    It's been a Long Nine Months

    Value of programme to Primary schools. Primary. Timely use of Sports premium in addition with Government wage incentive. Schools can employ Sports Apprentice to provide increased opportunities for participation during lunchtime/after school activities
  • March 26, 2018

    March 26, 2018

    Login to kahoot.it. Teacher will put code on overhead projector. Students will complete handout as play Kahoot. 6th Pre-AP Science. Students will complete TEK 6.12 A/B/D ACP Review on own using strategies and bubble in scan tron for grade. When...
  • The Challenge: To Create More Value in All Negotiations

    The Challenge: To Create More Value in All Negotiations

    It is my favorite story— how the old couple had almost nothing to give but their willingness to be attentive— and for this alone the gods loved them and blessed them. Source: Mary Oliver, White Pine (from the poem "Mockingbird")...
  • Ieee 802.11

    Ieee 802.11

    Wireless Medium Access Control (MAC)(Refer Section 7.3.1 and 7.3.2 in textbook)Slides Adopted from:Romit Roy Choudhury Wireless Networking Lectures University of Illinois at Urbana Champaign
  • Culture and Diversity - Southside High School

    Culture and Diversity - Southside High School

    The Components of Culture. Culture is both learned and shared. All cultures have basic components such as technology, symbols, language, values, and norms. The Components of Culture, cont'd. Technology. Refers to objects and the rules for using them.
  • SURROUND THE LUXURY OF STUNNING SOUND PERFECT PITCH

    SURROUND THE LUXURY OF STUNNING SOUND PERFECT PITCH

    Chromecast built-in* and AirPlay 2 (OTA Q4 '19) Up to 24-bit/96kHz HD audio streaming. 4x HDMI inputs with 4K support. 1x HDMI output with ARC. Dolby Audio and DTS Digital Surround. Bluetooth wireless music streaming * Compatible with Harman Kardon...
  • GCSE Physical Education Healthy active lifestyles & how they ...

    GCSE Physical Education Healthy active lifestyles & how they ...

    GCSE Physical Education Goal Setting Learning Objectives By the end of this lesson pupils should: Define the term SMART goals Apply SMART goals to yourself Understand how SMART goals can work within your PEP SMART goals SMART goals are used...
  • Monday, August 26th - Denton ISD

    Monday, August 26th - Denton ISD

    Monday April 13, 2015. Warm Up: Jot these notes down in Writer's Notebook: The VERB is the most important part of sentence. Expresses action or state of being ("am", "became").. If a word is a verb, it has an "ing"...