This thesis presents progress in new possibilities and addressing disturbing factors (overlapping speech, noise, and reverberation), first, by proposing ideas for a system for the classification of acoustic scenes and a method for acoustic gait-based person identification. Both of them are two relatively new audio recognition tasks. Furthermore, improvements for two established methods (speaker diarization and robust speech recognition) are presented. Together, the proposed modules represent a complete system for auditory scene analysis.
«
This thesis presents progress in new possibilities and addressing disturbing factors (overlapping speech, noise, and reverberation), first, by proposing ideas for a system for the classification of acoustic scenes and a method for acoustic gait-based person identification. Both of them are two relatively new audio recognition tasks. Furthermore, improvements for two established methods (speaker diarization and robust speech recognition) are presented. Together, the proposed modules represent a...
»