Documents in the NTIS Technical Reports collection are the results of federally funded research. They are directly submitted to or collected by NTIS from Federal agencies for permanent accessibility to industry, academia and the public.  Before purchasing from NTIS, you may want to check for free access from (1) the issuing organization's website; (2) the U.S. Government Printing Office's Federal Digital System website; (3) the federal government Internet portal; or (4) a web search conducted using a commercial search engine such as
Accession Number ADA567198
Title Sequential Organization and Room Reverberation for Speech Segregation.
Publication Date Feb 2012
Media Count 22p
Personal Author D. Wang
Abstract Inspired by the perceptual account of auditory scene analysis, significant advances were made in speech segregation in recent years. Despite these advances, two major challenges remained: sequential organization and room reverberation. This project aimed to address these two challenges. Substantial progress has been made along the following directions. First a tandem algorithm was developed that performs pitch tracking and voiced speech segregation iteratively. Second, a multipitch tracking algorithm was proposed for noisy and reverberant speech, which was then used in a novel, supervised learning approach to segregation of voiced speech in reverberant environments. Third, a method was suggested for unvoiced speech segregation by first removing voiced speech and periodic components, and then grouping unvoiced speech segments through analyzing their spectral characteristics. Two algorithms were proposed for sequential organization, an unsupervised clustering algorithm applicable to monaural recordings and a binaural algorithm that integrates monaural and binaural analyses. In addition, speech intelligibility tests were conducted and their results firmly establish the effectiveness of binary masking for improving human speech recognition in noisy backgrounds.
Keywords Acoustics
Auditory perception
Computational audition
Computational auditory scene analysis
Pitch tracking
Room reverberation
Sequential organization
Signal processing
Sound pitch
Speech recognition
Speech segregation

Source Agency Non Paid ADAS
NTIS Subject Category 62 - Computers, Control & Information Theory
46A - Acoustics
45F - Verbal
Corporate Author Ohio State Univ. Research Foundation, Columbus.
Document Type Technical report
Title Note Final performance rept. Feb 2008-Nov 2011.
NTIS Issue Number 1308
Contract Number FA9550-08-1-0155

Science and Technology Highlights

See a sampling of the latest scientific, technical and engineering information from NTIS in the NTIS Technical Reports Newsletter

Acrobat Reader Mobile    Acrobat Reader