Dr Stefan Goetze
Department of Computer Science
Senior Lecturer
Course Director for MSc Computer Science with Speech and Language Processing
Member of the Speech and Hearing (SpandH) research group


Full contact details
Department of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
Stefan Goetze is a Senior Lecturer in the Department of Computer Science. He obtained the degrees 'Dipl.-Ing.' in 2004 and 'Dr.-Ing.' in 2013, both in Electrical/Communication Engineering, from the University of Bremen, Germany.
From 2008 to 2020 he was with the Fraunhofer Institute for Digital Media Technology IDMT in Oldenburg, Germany, where he was first Head of "Audio System Technology for Audiology and Assistive Systems" (2010-2017) and later Head of "Automatic Speech Recognition" as well as Dept. Head of the Department "Hearing, Speech and Audio Technology" (2017-2020).
- Research interests
His research interests include machine learning and signal analysis, enhancement and classification, both for large-scale applications and for resource-limited IoT (Internet of Things) and assistive devices.
- Publications
Journal articles
- Att-TasNet: attending to encodings in time-domain audio speech separation of noisy, reverberant speech mixtures. Frontiers in Signal Processing, 2. View this article in WRRO
- Non-intrusive speech quality prediction using modulation energies and LSTM-network. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(7), 1151-1163. View this article in WRRO
- Joint estimation of reverberation time and early-to-late reverberation ratio from single-channel speech signals. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(2), 255-267. View this article in WRRO
- Intelligente Erkennersysteme für die Pflege. Pflegezeitschrift, 72(1-2), 17-19. View this article in WRRO
- Exploring auditory-inspired acoustic features for room acoustic parameter estimation from monaural speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(10), 1809-1820. View this article in WRRO
- Multi-Channel Speech Enhancement and Amplitude Modulation Analysis for Noise Robust Automatic Speech Recognition. Computer Speech & Language, 46, 558-573.
- Classifier architectures for acoustic scenes and events: implications for DNNs, TDNNs, and perceptual features from DCASE 2016. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(6), 1304-1314. View this article in WRRO
- Instrumental and perceptual evaluation of dereverberation techniques based on robust acoustic multichannel equalization. Journal of the Audio Engineering Society, 65(1/2), 117-129. View this article in WRRO
- Joint beamforming and spectral enhancement for robust automatic speech recognition in reverberant environments. The Journal of the Acoustical Society of America, 139(4), 2224-2225.
- Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech. EURASIP Journal on Advances in Signal Processing, 2015(1).
- Spectro-Temporal Gabor Filterbank Features for Acoustic Event Detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(12), 2198-2208.
- Front-end technologies for robust ASR in reverberant environments—spectral enhancement-based dereverberation and auditory modulation filterbank features. EURASIP Journal on Advances in Signal Processing, 2015(1).
- Reduction of Gaussian, Supergaussian, and Impulsive Noise by Interpolation of the Binary Mask Residual. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(10), 1680-1691.
- Joint estimation of pitch and direction of arrival: improving robustness and accuracy for multi-speaker scenarios. EURASIP Journal on Audio, Speech, and Music Processing, 2014(1).
- Information and communication technologies for promoting and sustaining quality of life, health and self-sufficiency in ageing societies – outcomes of the Lower Saxony Research Network Design of Environments for Ageing (GAL). Informatics for Health and Social Care, 39(3-4), 166-187.
- Regularization for Partial Multichannel Equalization for Speech Dereverberation. IEEE Transactions on Audio, Speech, and Language Processing, 21(9), 1879-1890.
- Acoustic Monitoring and Localization for Social Care. Journal of Computing Science and Engineering, 6(1), 40-50.
- Acoustic user interfaces for ambient-assisted living technologies. Informatics for Health and Social Care, 35(3-4), 125-143.
- The Lower Saxony research network design of environments for ageing: towards interdisciplinary research on information and communication technologies in ageing societies. Informatics for Health and Social Care, 35(3-4), 92-103.
- A study on combining acoustic echo cancelers with impulse response shortening. The Journal of the Acoustical Society of America, 120(5), 3258-3258.
- Speech Quality Assessment for Listening-Room Compensation. Journal of the Audio Engineering Society, 62(6), 386-399.
Chapters
- Computer-Based Adaption of Cooking Recipes Integrated in a Speech Dialogue Assistance System, Ambient Assisted Living (pp. 163-172). Springer International Publishing
- Ambient Voice Control for a Personal Activity and Household Assistant, Ambient Assisted Living (pp. 63-74). Springer Berlin Heidelberg
- Detection and Classification of Acoustic Events for In-Home Care, Ambient Assisted Living (pp. 181-195). Springer Berlin Heidelberg
- Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners, Lecture Notes in Computer Science (pp. 568-575). Springer Berlin Heidelberg
Conference proceedings papers
- ASR-Based, Single-Ended Modeling of Listening Effort - A Tool for TV Sound Engineers. Proceedings of Forum Acusticum (pp 2441-2445). Lyon, France, 7 December 2020 - 11 December 2020.
- Measuring, modelling and predicting perceived reverberation. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 381-385). New Orleans, LA, USA, 5 March 2017 - 9 March 2017. View this article in WRRO
- On DNN posterior probability combination in multi-stream speech recognition for reverberant environments. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 5250-5254). New Orleans, LA, USA, 5 March 2017 - 9 March 2017. View this article in WRRO
- Combination strategy based on relative performance monitoring for multi-stream reverberant speech recognition. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 4870-4874). New Orleans, LA, USA, 5 March 2017 - 9 March 2017. View this article in WRRO
- Performance comparison of real-time single-channel speech dereverberation algorithms. 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA), 1 March 2017 - 3 March 2017.
- Performance comparison of intrusive and non-intrusive instrumental quality measures for enhanced speech. 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), 13 September 2016 - 16 September 2016.
- Perceptual and instrumental evaluation of the perceived level of reverberation. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
- Classification of human cough signals using spectro-temporal Gabor filterbank features. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016.
- Late reverberant spectral variance estimation using acoustic channel equalization. 2015 23rd European Signal Processing Conference (EUSIPCO), 31 August 2015 - 4 September 2015.
- A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015.
- A study on joint beamforming and spectral enhancement for robust speech recognition in reverberant environments. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015.
- A study on speech quality and speech intelligibility measures for quality assessment of single-channel dereverberation algorithms. 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), 8 September 2014 - 11 September 2014.
- Subjective speech quality and speech intelligibility evaluation of single-channel dereverberation algorithms. 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC), 8 September 2014 - 11 September 2014.
- Estimating room acoustic parameters for speech recognizer adaptation and combination in reverberant environments. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 May 2014 - 9 May 2014.
- On the use of spectro-temporal features for the IEEE AASP challenge ‘detection and classification of acoustic scenes and events’. 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 20 October 2013 - 23 October 2013.
- Enhancing Wireless Sensor Networks with Acoustic Sensing Technology: Use Cases, Applications & Experiments. 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing, 20 August 2013 - 23 August 2013.
- A perceptually constrained channel shortening technique for speech dereverberation. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Automatic acoustic siren detection in traffic noise by part-based models. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Blind estimation of reverberation time based on spectro-temporal modulation filtering. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26 May 2013 - 31 May 2013.
- Non-intrusive regularization for least-squares multichannel equalization for speech dereverberation. 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel, 14 November 2012 - 17 November 2012.
- System identification for listening-room compensation by means of acoustic echo cancellation and acoustic echo suppression filters. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 25 March 2012 - 30 March 2012.
- Voice activity detection driven acoustic event classification for monitoring in smart homes. 2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010), 7 November 2010 - 10 November 2010.
- Hands-free telecommunication for elderly persons suffering from hearing deficiencies. The 12th IEEE International Conference on e-Health Networking, Applications and Services, 1 July 2010 - 3 July 2010.
- Quality assessment for listening-room compensation algorithms. 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 14 March 2010 - 19 March 2010.
- Multi-channel listening-room compensation using a decoupled filtered-X LMS algorithm. 2008 42nd Asilomar Conference on Signals, Systems and Computers, 26 October 2008 - 29 October 2008.
- System Identification for Multi-Channel Listening-Room Compensation Using an Acoustic Echo Canceller. 2008 Hands-Free Speech Communication and Microphone Arrays, 6 May 2008 - 8 May 2008.
- Objective perceptual quality assessment for self-steering binaural hearing aid microphone arrays. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 31 March 2008 - 4 April 2008.
- Optimization of Gabor Features for Text-Independent Speaker Identification. 2007 IEEE International Symposium on Circuits and Systems, 27 May 2007 - 30 May 2007.
- Direction of arrival estimation based on the dual delay line approach for binaural hearing aid microphone arrays. 2007 International Symposium on Intelligent Signal Processing and Communication Systems, 28 November 2007 - 1 December 2007.
- Enhanced Partitioned Stereo Residual Echo Estimation. 2006 Fortieth Asilomar Conference on Signals, Systems and Computers, 29 October 2006 - 1 November 2006.
Preprints
- Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation, arXiv. View this article in WRRO
- Joint Estimation of Reverberation Time and Direct-to-Reverberation Ratio from Speech using Auditory-Inspired Features, arXiv.
- Grants
Research Grants
- Participatory co-design of a platform for collecting atypical speech data, Research England, 03/2022 - 07/2022, £19,692, as PI