Dr Stefan Goetze

School of Computer Science

Visiting Professor

Member of the Speech and Hearing (SpandH) research group

s.goetze@sheffield.ac.uk

Regent Court (DCS)

Full contact details

Dr Stefan Goetze
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP

Profile

Stefan Goetze was a Senior Lecturer in the School of Computer Science at Sheffield from 2020 - 2025. He obtained the degree 'Dipl.-Ing' in 2004 and 'Dr.-Ing.' in 2013 in Electrical/Communication Engineering from the University of Bremen, Germany.

From 2008 to 2020 he was with the Fraunhofer-Institute for Digital Media Technology IDMT in Oldenburg, Germany where he was first Head of "Audio System Technology for Audiology and Assistive Systems" (2010-2017) and later Head of "Automatic Speech Recognition" as well as Dept. Head of the Department "Hearing, Speech and Audio Technology" (2017-2020).

Research interests: His research interests include machine learning, signal analysis, enhancement and classification as well for large scale applications as for resource-limited IoT (Internet of Things) and assistive devices.

Publications

Journal articles

Hao L, Goetze S, Alessa T & Hawley MS (2023) Effectiveness of computer-tailored health communication in increasing physical activity in people with or at risk of long-term conditions: systematic review and meta-analysis. Journal of Medical Internet Research, 25(1). View this article in WRRO
Ravenscroft W, Goetze S & Hain T (2022) Att-TasNet: attending to encodings in time-domain audio speech separation of noisy, reverberant speech mixtures. Frontiers in Signal Processing, 2. View this article in WRRO
Cauchi B, Siedenburg K, Santos JF, Falk TH, Doclo S & Goetze S (2019) Non-intrusive speech quality prediction using modulation energies and LSTM-network. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(7), 1151-1163. View this article in WRRO
Cauchi B, Siedenburg K, Santos JF, Falk TH, Doclo S & Goetze S (2019) Non-Intrusive Speech Quality Prediction Using Modulation Energies and LSTM-Network. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27, 1151-1163.
Xiong F, Goetze S, Kollmeier B & Meyer BT (2019) Joint estimation of reverberation time and early-to-late reverberation ratio from single-channel speech signals. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(2), 255-267. View this article in WRRO
Goetze S (2019) Intelligente Erkennersysteme für die Pflege. Pflegezeitschrift, 72(1-2), 17-19. View this article in WRRO
Xiong F, Goetze S, Kollmeier B & Meyer BT (2018) Exploring auditory-inspired acoustic features for room acoustic parameter estimation from monaural speech. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(10), 1809-1820. View this article in WRRO
Moritz N, Adiloğlu K, Anemüller J, Goetze S & Kollmeier B (2017) Multi-Channel Speech Enhancement and Amplitude Modulation Analysis for Noise Robust Automatic Speech Recognition. Computer Speech & Language, 46, 558-573.
Schröder J, Moritz N, Anemüller J, Goetze S & Kollmeier B (2017) Classifier architectures for acoustic scenes and events : implications for DNNs, TDNNs, and perceptual features from DCASE 2016. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(6), 1304-1314. View this article in WRRO
Kodrasi I, Cauchi B, Goetze S & Doclo S (2017) Instrumental and perceptual evaluation of dereverberation techniques based on robust acoustic multichannel equalization. Journal of the Audio Engineering Society, 65(1/2), 117-129. View this article in WRRO
Spriet A, Goetze S & van Waterschoot T (2017) Special Issue on Dereverberation and Reverberation of Audio, Music, and Speech. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 65(1-2), 6-7.
Asmare FM, Xiong F, Bode M, Mayer B & Goetze S (2016) Joint beamforming and spectral enhancement for robust automatic speech recognition in reverberant environments. Journal of the Acoustical Society of America, 139(4_Supplement), 2224-2225.
Schroder J, Goetze S & Anemuller J (2015) Spectro-Temporal Gabor Filterbank Features for Acoustic Event Detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(12), 2198-2208.
Xiong F, Meyer BT, Moritz N, Rehr R, Anemüller J, Gerkmann T, Doclo S & Goetze S (2015) Front-end technologies for robust ASR in reverberant environments—spectral enhancement-based dereverberation and auditory modulation filterbank features. EURASIP Journal on Advances in Signal Processing, 2015(1).
Cauchi B, Kodrasi I, Rehr R, Gerlach S, Jukić A, Gerkmann T, Doclo S & Goetze S (2015) Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech. EURASIP Journal on Advances in Signal Processing, 2015(1).
Ruhland M, Bitzer J, Brandt M & Goetze S (2015) Reduction of Gaussian, Supergaussian, and Impulsive Noise by Interpolation of the Binary Mask Residual. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(10), 1680-1691.
Haux R, Hein A, Kolb G, Künemund H, Eichelberg M, Appell J-E, Appelrath H-J, Bartsch C, Bauer JM, Becker M , Bente P et al (2014) Information and communication technologies for promoting and sustaining quality of life, health and self-sufficiency in ageing societies – outcomes of the Lower Saxony Research NetworkDesign of Environments for Ageing(GAL). Informatics for Health and Social Care, 39(3-4), 166-187.
Gerlach S, Bitzer J, Goetze S & Doclo S (2014) Joint estimation of pitch and direction of arrival: improving robustness and accuracy for multi-speaker scenarios. EURASIP Journal on Audio, Speech, and Music Processing, 2014(1).
Kodrasi I, Goetze S & Doclo S (2013) Regularization for Partial Multichannel Equalization for Speech Dereverberation. IEEE Transactions on Audio, Speech, and Language Processing, 21(9), 1879-1890.
Schröder J, Hollosi D, Goetze S & Bitzer J (2013) Notrufsysteme mit automatischer akustischer Gefahrendetektion. Science^2 - Safety and Security, 1, 12-18.
Goetze S, Schroder J, Gerlach S, Hollosi D, Appell J-E & Wallhoff F (2012) Acoustic Monitoring and Localization for Social Care. Journal of Computing Science and Engineering, 6(1), 40-50.
Goetze S, Moritz N, Appell J-E, Meis M, Bartsch C & Bitzer J (2010) Acoustic user interfaces for ambient-assisted living technologies. Informatics for Health and Social Care, 35(3-4), 125-143.
Haux R, Hein A, Eichelberg M, Appell J-E, Appelrath H-J, Bartsch C, Bisitz T, Bitzer J, Blau M, Boll S , Buschermöhle M et al (2010) The Lower Saxony research networkdesign of environments for ageing: towards interdisciplinary research on information and communication technologies in ageing societies. Informatics for Health and Social Care, 35(3-4), 92-103.
Goetze S, Moritz N, Appell J-E, Meis M, Bartsch C & Bitzer J (2010) Acoustic User Interfaces for Ambient Assisted Living Technologies. Informatics for Health and Social Care, SI Ageing & Technology, 35, 161-179.
Haux R, Hein A, Eichelberg M, Appell J-E, Appelrath H-J, Bartsch C, Bisitz T, Bitzer J, Blau M, Boll S , Buschermöhle M et al (2010) The Lower Saxony Research Network Design of Environments for Ageing (GAL) - Towards Interdisciplinary Research on ICT in Ageing Societies. Informatics for Health and Social Care, SI Ageing & Technology, 35, 92-103.
Goetze S, Kammeyer K-D, Kallinger M & Mertins A (2006) A study on combining acoustic echo cancelers with impulse response shortening. The Journal of the Acoustical Society of America, 120(5_Supplement), 3258-3258.
Goetze S & et al () Speech Quality Assessment for Listening-Room Compensation. Journal of the Audio Engineering Society, 62(6), 386-399.

Book chapters

Wolf KI, Goetze S & Wallhoff F (2017) Computer-Based Adaption of Cooking Recipes Integrated in a Speech Dialogue Assistance System, Advanced Technologies and Societal Change (pp. 163-172). Springer International Publishing
Rennies J, Goetze T & Appell J-E (2012) Innovative Hörunterstützung in Kommunikationssystemen In Schick A, Meis M & Nocke C (Ed.), Beiträge zur psychologischen Akustik, Akustik in Büro und Objekt (pp. in press-in press). Oldenburg: Isensee Verlag.
Hollosi D, Goetze S, Appell JE & Wallhoff F (2011) Acoustic Applications and Technologies for Ambient Assisted Living Scenarios, Ambient Assisted Living (AAL) Forum (pp. 337-342). Lecce, Italy.
Moritz N, Goetze S & Appell J-E (2011) Ambient Voice Control for a Personal Activity and Household Assistant, Ambient Assisted Living (pp. 63-74). Springer Berlin Heidelberg
Rennies J, Goetze S & Appell J-E (2011) Considering Hearing Deficiencies in Human-Computer Interaction In Ziefle M & Röcker C (Ed.), Human-Centered Design of E-Health Technologies: Concepts, Methods and Applications (pp. 180-207). IGI Global
Schröder J, Wabnik S, Hengel PWJV & Goetze S (2011) Detection and Classification of Acoustic Events for In-Home Care (Best-Paper Award) In Wichert R & Eberhardt B (Ed.), Ambient Assisted Living - Advanced Technologies and Societal Change, Springer Lecture Notes in Computer Science (LNCS) (pp. 181-196). Springer Science
Schroeder J, Wabnik S, van Hengel PWJ & Goetze S (2011) Detection and Classification of Acoustic Events for In-Home Care, Ambient Assisted Living (pp. 181-195). Springer Berlin Heidelberg
Rennies J, Goetze S & Appell J-E (2011) Personalized Acoustic Interfaces for Human-Computer Interaction, Advances in Healthcare Information Systems and Administration (pp. 180-207). IGI Global
Rennies J, Albertin E, Goetze S & Appell J-E (2010) Automatic Live Monitoring of Communication Quality for Normal-Hearing and Hearing-Impaired Listeners, Lecture Notes in Computer Science (pp. 568-575). Springer Berlin Heidelberg
Goetze S, Rennies J & Appell J-E (2010) Intelligente Konferenzsysteme für natürliche Freisprechkommunikation In Schick A, Meis M & Nocke C (Ed.), Beiträge zur psychologischen Akustik, Akustik in Büro und Objekt (pp. 249-266). Oldenburg: Isensee Verlag.

Conference proceedings

Leung W-Z, Christensen H & Goetze S (2025) Text-to-dysarthric-speech generation for dysarthric automatic speech recognition: is purely synthetic data enough?. Speech and Computer: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13–15, 2025, Proceedings, Part I(LNAI 16187) (pp 203-216). Szeged, Hungary, 13 October 2025 - 13 October 2025. View this article in WRRO
Close G, Hong K, Hain T & Goetze S (2025) WhiSQA: Non-intrusive speech quality prediction using whisper encoder features. Speech and Computer, Vol. 16187(Part 1) (pp 39-51). Szeged, Hungary, 13 October 2025 - 13 October 2025. View this article in WRRO
Clarke J, Gotoh Y & Goetze S (2025) Ensembling synchronisation-based and face–voice association paradigms for robust active speaker detection in egocentric recordings. Speech and Computer: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13-15, 2025, Proceedings, Part II, Vol. LNAI 16188 (pp 289-301). Szeged, Hungary, 13 October 2025 - 13 October 2025. View this article in WRRO
Mai Y & Goetze S (2025) MetricGAN+KAN: Kolmogorov-Arnold networks in metric-driven speech enhancement systems. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Hyderabad, India, 6 April 2025 - 6 April 2025. View this article in WRRO
Clarke J, Gotoh Y & Goetze S (2025) Speaker embedding informed audiovisual active speaker detection for egocentric recordings. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Hyperabad, India, 6 April 2025 - 6 April 2025. View this article in WRRO
Close G, Hain T & Goetze S (2024) Hallucination in perceptual metric-driven speech enhancement networks. Proceedings of 2024 32nd European Signal Processing Conference (EUSIPCO) (pp 21-25). Lyon, France, 26 August 2024 - 26 August 2024. View this article in WRRO
Sutherland R, Close G, Hain T, Goetze S & Barker J (2024) Using speech foundational models in loss functions for hearing aid speech enhancement. Proceedings of 2024 32nd European Signal Processing Conference (EUSIPCO) (pp 421-425). Lyon, France, 26 August 2024 - 26 August 2024. View this article in WRRO
Ravenscroft W, Close G, Goetze S, Hain T, Soleymanpour M, Chowdhury A & Fuhs MC (2024) Transcription-free fine-tuning of speech separation models for noisy and reverberant multi-speaker automatic speech recognition. Proceedings of Interspeech 2024 (pp 4998-5002). Kos Island, Greece, 1 September 2024 - 1 September 2024. View this article in WRRO
Leung W-Z, Cross M, Ragni A & Goetze S (2024) Training data augmentation for dysarthric automatic speech recognition by text-to-dysarthric-speech synthesis. Proceedings of Interspeech 2024 (pp 2494-2498). Kos island, Greece, 1 September 2024 - 1 September 2024. View this article in WRRO
Shishkin S, Hollosi D, Goetze S & Doclo S (2024) Active learning for sound event classification using Bayesian neural networks with Gaussian variational posterior. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2024 (pp 896-900). Seoul, South Korea, 14 April 2024 - 14 April 2024. View this article in WRRO
Mogridge R, Close G, Sutherland R, Hain T, Barker J, Goetze S & Ragni A (2024) Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models.. ICASSP (pp 306-310)
Close G, Ravenscroft W, Hain T & Goetze S (2024) Multi-CMGAN+/+: leveraging multi-objective speech quality metric prediction for speech enhancement. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2024 (pp 351-355). Seoul, Korea, 14 April 2024 - 14 April 2024. View this article in WRRO
Yusufali H, Moore RK & Goetze S (2024) Refining text input for augmentative and alternative communication (AAC) devices: analysing language model layers for optimisation. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings (pp 12016-12020). Seoul, Korea, Republic of, 14 April 2024 - 14 April 2024. View this article in WRRO
Clarke J, Gotoh Y & Goetze S (2024) Improving audiovisual active speaker detection in egocentric recordings with the data-efficient image transformer. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei, Taiwan, 16 December 2023 - 16 December 2023. View this article in WRRO
Ravenscroft JW, Goetze S & Hain T (2024) On time domain conformer models for monaural speech separation in noisy reverberant acoustic environments. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei, Taiwan, 16 December 2023 - 16 December 2023. View this article in WRRO
Leung W-Z, Cross M, Ragni A & Goetze S (2024) Training Data Augmentation for Dysarthric Automatic Speech Recognition by Text-to-Dysarthric-Speech Synthesis.. INTERSPEECH
Tang C, Zhang H, Loakman T, Yang B, Goetze S & Lin C (2024) CADGE: Context-Aware Dialogue Generation Enhanced with Graph-Structured Knowledge Aggregation. Inlg 2024 17th International Natural Language Generation Conference Proceedings of the Conference (pp 371-383)
Hao L, Goetze S & Hawley M (2023) Message recommendation strategies for tailoring health information to promote physical activities. HCI International 2023 – Late Breaking Papers, Vol. 14055. Copenhagen, Denmark, 23 July 2023 - 23 July 2023. View this article in WRRO
Yusufali H, Goetze S & Moore R (2023) Bridging the communication rate gap: enhancing text input for augmentative and alternative communication (AAC). HCI International 2023 – Late Breaking Papers, Vol. 14055. Copenhagen, Denmark, 23 July 2023 - 23 July 2023. View this article in WRRO
Ravenscroft J, Goetze S & Hain T (2023) On data sampling strategies for training neural network speech separation models. 2023 31st European Signal Processing Conference (EUSIPCO). Helsinki, Finland, 4 September 2023 - 4 September 2023. View this article in WRRO
Close G, Hain T & Goetze S (2023) The effect of spoken language on speech enhancement using self-supervised speech representation loss functions. Proceedings of 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). New Paltz, NY, USA, 22 October 2023 - 22 October 2023. View this article in WRRO
Close GL, Ravenscroft W, Hain T & Goetze S (2023) The University of Sheffield CHiME-7 UDASE challenge speech enhancement system. Proc. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023) (pp 33-38). Dublin, Ireland, 25 August 2023 - 25 August 2023. View this article in WRRO
Mogridge R, Close G, Sutherland R, Goetze S & Ragni A (2023) Pre-Trained Intermediate ASR Features and Human Memory Simulation for Non-Intrusive Speech Intelligibility Prediction in the Clarity Prediction Challenge 2. he 4th Clarity Workshop on Machine Learning Challenges for Hearing Aids (Clarity-2023). https://claritychallenge.org/clarity2023-workshop/results.html, 19 August 2023 - 19 August 2023.
Close G, Hain T & Goetze S (2023) PAMGAN+/-: Improving phase-aware speech enhancement performance via expanded discriminator training. AES Convention Europe 2023: 154th Audio Engineering Society Conference (pp 10656). Espoo, Helsinki, FInland, 13 May 2023 - 13 May 2023. View this article in WRRO
Ellis S, Goetze S & Christensen H (2023) Moving towards non-binary gender Identification via analysis of system errors in binary gender classification. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
Close G, Ravenscroft W, Hain T & Goetze S (2023) Perceive and predict: self-supervised speech representation based loss functions for speech enhancement. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
Ravenscroft W, Goetze S & Hain T (2023) Deformable temporal convolutional networks for monaural noisy reverberant speech separation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Rhodes Island, Greece, 4 June 2023 - 4 June 2023. View this article in WRRO
Ravenscroft W, Goetze S & Hain T (2022) Receptive field analysis of temporal convolutional networks for monaural speech dereverberation. Proceedings of 30th European Signal Processing Conference (EUSIPCO 2022) (pp 80-84). Belgrade, Serbia, 29 August 2022 - 29 August 2022. View this article in WRRO
Close G, Hain T & Goetze S (2022) MetricGAN+/-: increasing robustness of noise reduction on unseen data. Proceedings of 2022 30th European Signal Processing Conference (EUSIPCO) (pp 165-169). Belgrade, Serbia, 29 August 2022 - 29 August 2022. View this article in WRRO
Ravenscroft W, Goetze S & Hain T (2022) Utterance weighted multi-dilation temporal convolutional networks for monaural speech dereverberation. Proceedings of 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). Bamberg, Germany, 5 September 2022 - 5 September 2022. View this article in WRRO
Close G, Hollands S, Hain T & Goetze S (2022) Non-intrusive speech intelligibility estimated by metric prediction for hearing impaired individuals for the clarity prediction challenge 1. Interspeech 2022 - 23rd Annual Conference of the International Speech Communication Association (pp 3483-3487). Incheon, Korea, 18 September 2022 - 18 September 2022. View this article in WRRO
Huber R, Baumgartner H, Goetze S & Rennies-Hochmuth J (2020) ASR-Based, Single-Ended Modeling of Listening Effort - A Tool for TV Sound Engineers. Proceedings of Forum Acusticum (pp 2441-2445). Lyon, France, 7 December 2020 - 11 December 2020.
Huber R, Baumgartner H, Krishnan VN, Goetze S & Rennies J (2020) Single-ended Prediction of Listening Effort for English Speech. DAGA 2020 - 46. Jahrestagung für Akustik (pp 775-777). Hannover, Germany
Gerlach S, Goetze S & Doclo S (2020) 2D audio-visual localization in home environments using a particle filter. Sprachkommunikation 10 ITG Fachtagung (pp 75-78)
Meis M, Bach J-H, Becker A, Bilda K, Erem A, Feith T, Goetze S, Jürs J, Radeloff A, Tuschen L & Tschuschke B (2019) Context and user requirement analyses of a new digital speech therapy system (THERESIAH). Conf. on Implantable Auditory Prosthesis (CIAP). Lake Tahoe, CA, USA
Winneke A, Meis M, Wolf I, Rennies-Hochmuth J & Goetze S (2019) Hearing support to reduce listening effort at work: an EEG study. DAGA 2019 – Proc. 45th Annual Meeting of the Deutsche Gesellschaft für Akustik e.V.. Rostock, Germany
Huber R, Baumgartner H, Rollwage C, Goetze S & Rennies-Hochmuth J (2019) Erfassung der Höranstrengung fertiger TV-Mischungen. DAGA 2019 – Proc. 45th Annual Meeting of the Deutsche Gesellschaft für Akustik e.V.. Rostock, Germany
Huber R, Baumgartner H, Moritz N & Goetze S (2018) Automatische Überwachung der Sprachverständlichkeit im Rundfunkmaterial. 30th Tonmeistertagung – VDT International Convention. Düsseldorf, Germany
Xiong F, Meyer BT, Cauchi B, Jukic A, Doclo S & Goetze S (2017) Performance comparison of real-time single-channel speech dereverberation algorithms. 2017 Hands-free Speech Communications and Microphone Arrays (HSCMA) (pp 126-130), 1 March 2017 - 3 March 2017.
Javed HA, Cauchi B, Doclo S, Naylor PA & Goetze S (2017) Measuring, modelling and predicting perceived reverberation. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 381-385). New Orleans, LA, USA, 5 March 2017 - 5 March 2017. View this article in WRRO
Xiong F, Goetze S & Meyer BT (2017) On DNN posterior probability combination in multi-stream speech recognition for reverberant environments. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 5250-5254). New Orleans, LA, USA, 5 March 2017 - 5 March 2017. View this article in WRRO
Xiong F, Goetze S & Meyer BT (2017) Combination strategy based on relative performance monitoring for multi-stream reverberant speech recognition. Proceedings of 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017) (pp 4870-4874). New Orleans, LA, USA, 5 March 2017 - 5 March 2017. View this article in WRRO
Avila A, Cauchi B, Goetze S, Doclo S & Falk T (2016) Performance comparison of intrusive and non-intrusive instrumental quality measures for enhanced speech. 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC) (pp 1-5), 13 September 2016 - 16 September 2016.
Moritz N, Schröder J, Goetze S, Anemüller J & Kollmeier B (2016) Acoustic Scene Classification using Time-Delay Neural Networks and Amplitude Modulation Filter Bank Features. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016) (pp 70-74). Budapest, Hungary
Schröder J, Anemüller J & Goetze S (2016) Performance comparison of GMM, HMM and DNN based approaches for acoustic event detection within Task 3 of the DCASE 2016 challenge. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016) (pp 80-84). Budapest, Hungary
Cauchi B, Javed H, Gerkmann T, Doclo S, Goetze S & Naylor P (2016) Perceptual and instrumental evaluation of the perceived level of reverberation. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 629-633), 20 March 2016 - 25 March 2016.
Schroder J, Anemuller J & Goetze S (2016) Classification of human cough signals using spectro-temporal Gabor filterbank features. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 6455-6459), 20 March 2016 - 25 March 2016.
Winneke A, Meis M, Wellmann J, Bruns T, Rahner S, Rennies J, Wallhoff F & Goetze S (2016) Messung der Höranstrengung älterer Mitarbeiter eines Callcenters mittels neuroergonomischer Messmethoden / Neuroergonomic assessment of listening effort in older call center employees. Proc. Zukunft Lebensräume Kongress 2016 (pp 327-332). Frankfurt, Germany
Wolf KI, Thalappully R, Goetze S & Wallhoff F (2016) Concept for automated usability evaluation of graphical user interfaces. Proc. Kognitive Systeme: Mensch, Teams, Systeme und Automaten. Bochum, Germany
Cauchi B, Gerkmann T, Doclo S, Naylor PA & Goetze S (2016) Spectrally and spatially informed noise suppression using beamforming and convolutive NMF. 60TH AES INTERNATIONAL CONFERENCE ON DREAMS (DEREVERBERATION AND REVERBERATION OF AUDIO, MUSIC, AND SPEECH)
Cauchi B, Santos JF, Siedenburg K, Falk TH, Naylor PA, Doclo S & Goetze S (2016) Predicting the quality of processed speech by combining modulation-based features and model trees. Speech Communication 12 ITG Fachtagung Sprachkommunikation (pp 180-184)
Moritz N, Gerlach S, Adiloglu K, Anemulle J, Kollmeier B & Goetze S (2015) A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (pp 468-474), 13 December 2015 - 17 December 2015.
Cauchi B, Naylor PA, Gerkmann T, Doclo S & Goetze S (2015) Late reverberant spectral variance estimation using acoustic channel equalization. 2015 23rd European Signal Processing Conference (EUSIPCO) (pp 2481-2485), 31 August 2015 - 4 September 2015.
Xiong F, Goetze S & Meyer BT (2015) Joint estimation of reverberation time and direct-to-reverberation ratio from speech using auditory inspired features. Proc. ACE Challenge Workshop, a satellite event of WASPAA. New Paltz, NY, USA
Xiong F, Meyer BT & Goetze S (2015) A study on joint beamforming and spectral enhancement for robust speech recognition in reverberant environments. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 5043-5047), 19 April 2015 - 24 April 2015.
Wolf KI, Goetze S, Wellmann J, Winneke A & Wallhoff F (2015) Concept of a Nutrition Consultant Application with Context Based Speech Recognition. 4. Interdisziplinärer Workshop Kognitive Systeme 2015, Mensch, Teams, Systeme und Automaten. Bielefeld, Germany
Wolf KI, Goetze S & Wallhoff F (2015) CooCo, what can i cook today? Surprise me. Ceur Workshop Proceedings, Vol. 1520 (pp 233-240)
Goetze S, Warzybok A, Kodrasi I, Jungmann JO, Cauchi B, Rennies J, Habets EAP, Mertins A, Gerkmann T, Doclo S & Kollmeier B (2014) A study on speech quality and speech intelligibility measures for quality assessment of single-channel dereverberation algorithms. 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) (pp 233-237), 8 September 2014 - 11 September 2014.
Warzybok A, Kodrasi I, Jungmann JO, Habets E, Gerkmann T, Mertins A, Doclo S, Kollmeier B & Goetze S (2014) Subjective speech quality and speech intelligibility evaluation of single-channel dereverberation algorithms. 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC) (pp 332-336), 8 September 2014 - 11 September 2014.
Cauchi B, Kodrasi I, Rehr R, Gerlach S, Jukić A, Gerkmann T, Doclo S & Goetze S (2014) Joint Dereverberation and Noise Reduction Using Beamforming and a Single-Channel Speech Enhancement Scheme. Proc. REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. Florence, Italy
F. Xiong BM (2014) Estimating room acoustic parameters for speech recognizer adaptation and combination in reverberant environments. Proc. 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp 5559-5563). Florence, Italy
Xiong F, Moritz N, Rehr R, Anemüller J, Meyer B, Gerkmann T, Doclo S & Goetze S (2014) Robust ASR in reverberant environments using temporal cepstrum smoothing for speech enhancement and an amplitude modulation filterbank for feature extraction. Proc. REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. Florence, Italy
Brandes M, Schröder J, Mertins H-C & Goetze S (2014) Improving acoustic event detection by localization algorithms. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 523-524). Oldenburg, Germany
Deppermann M, Wellmann J, Moritz N & Goetze S (2014) Nutzbarkeit von modellierten Phonemfolgen zur Erkennung von unbekannten Wörtern in phonembasierten Spracherkennern. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 538-539). Oldenburg, Germany
Kranzusch P, Gerlach S, Hollosi D & Goetze S (2014) Influence of a spherical microphone array on a sound source number estimator based upon independent component analysis. Proc. 40th German Annual Conference on Acoustics (DAGA 14). Oldenburg, Germany
Sharma A & Goetze S (2014) A 2-Stage Approach for Joint Noise Reduction and Dereverberation by means of Multi-Channel Equalization and a Noise Processor. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 186-187). Oldenburg, Germany
Tomczyszyn T, Cauchi B, Gerlach S & Goetze S (2014) Room Transfer Function Estimation using Cepstral Smoothing. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 493-494). Oldenburg, Germany
Willuweit C, Wellmann J & Goetze S (2014) PTP Synchronized Isosynchronous Multi-Channel Audio-Streaming over Gigabit-Ethernet based on FPGAs. Proc. 40th German Annual Conference on Acoustics (DAGA 14) (pp 182-183). Oldenburg, Germany
Xiong F, Goetze S & Meyer BT (2014) Estimating room acoustic parameters for speech recognizer adaptation and combination in reverberant environments. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 5522-5526), 4 May 2014 - 9 May 2014.
Hollosi D, Nagy G, Rodigast R, Goetze S & Cousin P (2013) Enhancing Wireless Sensor Networks with Acoustic Sensing Technology: Use Cases, Applications & Experiments. 2013 IEEE International Conference on Green Computing and Communications and IEEE Internet of Things and IEEE Cyber, Physical and Social Computing (pp 335-342), 20 August 2013 - 23 August 2013.
Schroder J, Moritz N, Schadler MR, Cauchi B, Adiloglu K, Anemuller J, Doclo S, Kollmeier B & Goetze S (2013) On the use of spectro-temporal features for the IEEE AASP challenge ‘detection and classification of acoustic scenes and events’. 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (pp 1-4), 20 October 2013 - 23 October 2013.
Uziel S, Elste T, Kattanek W, Hollosi D, Gerlach S & Goetze S (2013) Networked embedded acoustic processing system for smart building applications. Conference on Design and Architectures for Signal and Image Processing Dasip (pp 349-350)
Kodrasi I, Goetze S & Doclo S (2013) A perceptually constrained channel shortening technique for speech dereverberation. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (pp 151-155), 26 May 2013 - 31 May 2013.
Schroder J, Goetze S, Grutzmacher V & Anemuller J (2013) Automatic acoustic siren detection in traffic noise by part-based models. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (pp 493-497), 26 May 2013 - 31 May 2013.
Xiong F, Goetze S & Meyer BT (2013) Blind estimation of reverberation time based on spectro-temporal modulation filtering. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (pp 443-447), 26 May 2013 - 31 May 2013.
Schröder J, Cauchi B, Schädler MR, Moritz N, Adiloglu K, Anemüller J, Doclo S, Kollmeier B & Goetze S (2013) Acoustic Event Detection Using Signal Enhancement and Spectro-temporal Feature Extraction. IEEE AASP Challenge: Detection and Classification of Acoustic Scenes and Events. New Paltz, NY, USA
Wellmann J, Heindorf A & Goetze S (2013) MOBECS - User Requirements for a Mobile Emergency Call System. AAL Forum 2013. Norrköping, Sweden
Moritz N, Schädler MR, Adiloglu K, Meyer BT, Jürgens T, Gerkmann T, Kollmeier B, Doclo S & Goetze S (2013) Noise Robust Distant Automatic Speech Recognition Utilizing NMF based Source Separation and Auditory Feature Extraction. Proc. 2nd International Workshop on Machine Listening in Multisource Environments (CHiME 2013) (pp 1-6). Vancouver, Canada
Rennies J, Schröder J, Hollosi D, Wittorf M, Grützmacher V & Goetze S (2013) Anwendungen akustischer Ereigniserkennung im Automobil. Proc. AmE 2013 - Automotive meets Electronics. Dortmund, Germany
Wellmann J, Heindorf A, Hollosi D, Appell J-E, Wallhoff F & Goetze S (2013) MOBECS - Mobility by Safety: Konzept und Nutzeranforderungen. AAL Kongress 2013 (pp 504-507). Berlin, Germany
Kodrasi I, Doclo S & Goetze S (2012) Non-intrusive regularization for least-squares multichannel equalization for speech dereverberation. 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel (pp 1-5), 14 November 2012 - 17 November 2012.
Xiong F, Appell J-E & Goetze S (2012) System identification for listening-room compensation by means of acoustic echo cancellation and acoustic echo suppression filters. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 525-528), 25 March 2012 - 30 March 2012.
Rehrl T, Troncy R, Bley A, Ihsen S, Scheibl K, Schneider W, Glende S, Goetze S, Kessler J & Wallhoff CHAF (2012) The Ambient Adaptable Living Assistant is Meeting its Users. In Proc. AAL Forum 2012 (pp 629-636). Eindhoven, The Netherlands
Ruhland M & Goetze S (2012) Computational Efficient Noise Reduction for Dialogue Systems in Car Environments based on Binary Time-Frequency Masking and Autoregressive Interpolation. Workshop on Dialog systems that think along - Do they really understand me. Saarbrücken, Germany
Brümmerstedt J, Rennies J, Xiong F, Goetze S & Bitzer J (2012) Objective Methods to Asses Speech Signals Processed by Short-Term Spectral Attenuation. Proc. 38th Annual Convention for Acoustics (DAGA). Darmstadt, Germany
Cauchi B, Goetze S & Doclo S (2012) Reduction of non-stationary noise for a robotic living assistant using sparse non-negative matrix factorization. Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp 28-33)
Gerlach S, Goetze S & Doclo S (2012) 2D audio-visual localization in home environments using a particle filter. Proceedings of 10th ITG Symposium on Speech Communication
Goetze S, Fischer S, Moritz N, Appell JE & Wallhoff F (2012) Multimodal human-machine interaction for service robots in home-care environments. Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp 1-7)
Kodrasi I, Goetze S & Doclo S (2012) Increasing the robustness of acoustic multichannel equalization by means of regularization. International Workshop on Acoustic Signal Enhancement Iwaenc 2012
Ruhland M, Goetze S, Brandt M, Doclo S & Bitzer J (2012) A new approach for reduction of supergaussian noise using autoregressive interpolation and time-frequency masking. International Workshop on Acoustic Signal Enhancement Iwaenc 2012
Goetze S, Xiong F, Jungmann JO, Kallinger M, Kammeyer KD & Mertins A (2011) System identification of equalized room impulse responses by an acoustic echo canceller using proportionate LMS algorithms. 130th Audio Engineering Society Convention 2011, Vol. 2 (pp 1150-1162)
Jungmann JO, Mei T, Goetze S & Mertins A (2011) Room impulse response reshaping by joint optimization of multiple p-norm based criteria. European Signal Processing Conference (pp 1658-1662)
Goetze S, Albertin E, Rennies J, Habets EAP & Kammeyer KD (2011) Speech quality assessment for listening-room compensation. Proceedings of the AES International Conference (pp 11-20)
Gerlach S, Goetze S, Bitzer J & Doclo S (2011) Evaluation of joint position-pitch estimation algorithm for localising multiple speakers in adverse acoustical environments. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
Jungmann JO, Goetze S & Mertins A (2011) Room Impulse Response Reshaping by p-Norm Optimization based on Estimates of Room Impulse Responses. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
Rehr R, Goetze S, Hollosi D, Appell J-E & Bitzer J (2011) Speech / Non-Speech Discrimination for Acoustic Monitoring Considering Privacy Issues. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
Schröder J, Goetze S, Rennies J, Xiong F & Anemüller J (2011) Real-time Room Reverberation Estimation for Online Speech Intelligibility Monitoring. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
Wilksen S, Goetze S, Hollosi D, Appell J-E & Bitzer J (2011) Speech Activity Detection for Activity Monitoring using an Embedded Platform. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
Xiong F, Schneider D, Goetze S, Rohdenburg T & Appell J-E (2011) Hearing-Loss Compensation in a Telephone System. Proc. 37th Annual Convention for Acoustics (DAGA). Düsseldorf, Germany
Moritz N, Goetze S & Appell J-E (2011) Ambiente Sprachsteuerung für einen Pers"’onlichen Aktivitäts- und Haushaltsassistenten. 4. Deutscher AAL-Kongress. Berlin, Germany
Schröder J, Wabnik S, Hengel PWJV & Goetze S (2011) Erkennung und Klassifikation von akustischen Ereignissen zur häuslichen Pflege. 4. Deutscher AAL-Kongress. Berlin, Germany
Hollosi D, Schroder J, Goetze S & Appell J-E (2010) Voice activity detection driven acoustic event classification for monitoring in smart homes. 2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010) (pp 1-5), 7 November 2010 - 10 November 2010.
Albertin E, Rennies J & Goetze S (2010) Objective Quality Measures for Dereverberation Methods based on Room Impulse Response Equalization. Proc. German Annual Conference on Acoustics (DAGA). Berlin, Germany
Goetze S, Albertin E, Kallinger M, Mertins A & Kammeyer K-D (2010) Quality assessment for listening-room compensation algorithms. 2010 IEEE International Conference on Acoustics, Speech and Signal Processing (pp 2450-2453), 14 March 2010 - 19 March 2010.
Goetze S, Xiong F, Rennies J, Rohdenburg T & Appell J-E (2010) Hands-free telecommunication for elderly persons suffering from hearing deficiencies. The 12th IEEE International Conference on e-Health Networking, Applications and Services (pp 209-214), 1 July 2010 - 3 July 2010.
Haux R, Appell J-E, Appelrath H-J, Christian Bartsch TB, Bitzer J, Blau M, Boll S, Buschermöhle M, Büsching F, Eichelberg M , Erdmann B et al (2010) The Lower Saxony Research Network Design of Environments for Ageing (GAL) - Towards Interdisciplinary Research on ICT in Aging Societies. Medizininformatik-Weltkongress Medinfo 2010
Rennies J, Goetze S & Appell JE (2009) How can audio technology improve working conditions?. Change 2009 –Ambient Assisted Working Accessible and assistive ICT in Enterprise Environments, Emden, Germany
Goetze S, Kallinger M, Mertins A & Kammeyer K-D (2009) Estimation of the Optimum System Delay for Speech Dereverberation by Inverse Filtering. International Conference on Acoustics (NAG/DAGA 2009). Rotterdam, The Netherlands
Goetze S, Kallinger M, Mertins A & Kammeyer K-D (2008) Multi-channel listening-room compensation using a decoupled filtered-X LMS algorithm. 2008 42nd Asilomar Conference on Signals, Systems and Computers (pp 811-815), 26 October 2008 - 29 October 2008.
Goetze S, Rohdenburg T, Hohmann V, Kollmeier B & Kammeyer K-D (2008) Direction of Arrival Estimation based on the Dual Delay Line Approach for Binaural Hearing Aid Microphone Arrays. Int. Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) (pp 185-188). Xiamen, China
Rohdenburg T, Goetze S, Hohmann V, Kammeyer K-D & Kollmeier B (2008) Objective perceptual quality assessment for self-steering binaural hearing aid microphone arrays. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing (pp 2449-2452), 31 March 2008 - 4 April 2008.
Goetze S, Kallinger M, Mertins A & Kammeyer K-D (2008) System Identification for Multi-Channel Listening-Room Compensation Using an Acoustic Echo Canceller. 2008 Hands-Free Speech Communication and Microphone Arrays (pp 224-227), 6 May 2008 - 8 May 2008.
Goetze S, Kallinger M, Mertins A & Kammeyer K-D (2008) A Decoupled Filtered-X LMS Algorithm for Listening-Room Compensation. Proc. Int. Workshop on Acoustic Echo and Noise Control (IWAENC). Seattle, USA
Goetze S, Kallinger M, Mertins A & Kammeyer K-D (2008) System Identification for Multi-Channel Listening-Room Compensation using an Acoustic Echo Canceller. Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA) (pp 224-227). Trento, Italy
Goetze S, Kallinger M, Mertins A & Kammeyer K-D (2008) Room Impulse Response Shaping based on Estimates of Room Impulse Responses. German Annual Conference on Acoustics (DAGA) (pp 829-830). Dresden, Germany
Rohdenburg T, Goetze S, Hohmann V, Kammeyer KD & Kollmeier B (2008) Combined source tracking and noise reduction for application in hearing AIDS. Sprachkommunikation 2008 8 ITG Fachtagung
Goetze S, Kallinger M, Mertins A & Kammeyer K-D (2008) System identification for multi-channel listening-room compensation using an acoustic echo canceller. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (pp 225-+)
Mildner V, Goetze S, Karl-Dirk-Kammeyer & Mertins A (2007) Optimization of Gabor Features for Text-Independent Speaker Identification. 2007 IEEE International Symposium on Circuits and Systems (ISCAS) (pp 3932-3935), 27 May 2007 - 30 May 2007.
Goetze S, Kallinger M, Mertins A & Kammeyer K-D (2007) Least Squares Equalizer Design under Consideration of Tail Effects. Proc. German Annual Conference on Acoustics (DAGA) (pp 599-600). Stuttgart, Germany
Goetze S, Rohdenburg T, Hohmann V, Kollmeier B & Kammeyer K-D (2007) Direction of arrival estimation based on the dual delay line approach for binaural hearing aid microphone arrays. 2007 International Symposium on Intelligent Signal Processing and Communication Systems (pp 84-87), 28 November 2007 - 1 December 2007.
Goetze S, Rohdenburg T, Hohmann V, Kollmeier B & Kammeyer K-D (2007) Direction of arrival estimation based on the dual delay line approach for binaural hearing aid microphone arrays. 2007 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, VOLS 1 AND 2 (pp 112-+)
Goetze S, Kammeyer K-D, Kallinger M & Mertins A (2006) Enhanced Partitioned Stereo Residual Echo Estimation. 2006 Fortieth Asilomar Conference on Signals, Systems and Computers (pp 1326-1330), 29 October 2006 - 1 November 2006.
Goetze S, Mildner V & Kammeyer KD (2006) A psychoacoustic noise reduction approach for stereo hands-free systems. Audio Engineering Society 120th Convention Spring Preprints 2006, Vol. 4 (pp 1980-1989)
Mildner V, Goetze S & Kammeyer KD (2006) Multichannel-noise reduction-systems for speaker identification in an automotive environment. Audio Engineering Society 120th Convention Spring Preprints 2006, Vol. 4 (pp 1941-1952)
Xiao Y, Christensen H & Goetze S () Alzheimer’s Dementia Detection Using Perplexity from Paired Large Language Models. Interspeech 2025 (pp 1423-1427)
Ravenscroft W, Goetze S & Hain T () Combining conformer and dual-path-transformer networks for single channel noisy reverberant speech separation. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol. 2024 (pp 11491-11495). Seoul, Korea, 14 April 2024 - 14 April 2024. View this article in WRRO
Goetze S, Kallinger M & Kammeyer K-D () Residual Echo Power Spectral Density Estimation Based on an Optimal Smoothed Misalignment For Acoustic Echo Cancelation. Proc. Int. Workshop on Acoustic Echo and Noise Control (IWAENC-2005) , Eindhoven, The Netherlands (pp 209-212)
Goetze S, Mildner V & Kammeyer K-D () Comparison of Speech Enhancement Systems for Noise Fields in a Car Environment. German 32. Deutsche Jahrestagung für Akustik (DAGA’06) (pp 45-46). Braunschweig, Germany
Mildner V, Goetze S & Kammeyer K-D () Performance of Text-Independent Speaker Identification considering In-Car Acoustics. German 32. Deutsche Jahrestagung für Akustik (DAGA’06) (pp 223-224). Braunschweig, Germany
Mildner V, Goetze S & Kammeyer K-D () Multi-Channel Speech Enhancement using a Psychoacoustic Approach for a Post-Filter. German ITG-Symposium on Speech Communication. Kiel, Germany
Shishkin S, Hollosi D, Doclo S & Goetze S () Active Learning for Sound Event Classification using Monte-Carlo Dropout and PANN Embeddings. Proc. DCASE Workshop. Online, 15 November 2021 - 19 November 2021.

Reports

Close G, Hollands S, Goetze S & Hain T (2022) Clarity Prediction Challenge 1 Entry: Non-intrusive Speech Intelligibility Metric Prediction - Technical Report

Preprints

Clarke J, Gotoh Y & Goetze S (2025) Ensembling Synchronisation-based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric Recordings.
Close G, Hong K, Hain T & Goetze S (2025) WhiSQA: Non-Intrusive Speech Quality Prediction Using Whisper Encoder Features.
Clarke J, Gotoh Y & Goetze S (2025) Face-Voice Association for Audiovisual Active Speaker Detection in Egocentric Recordings.
Xiao Y, Christensen H & Goetze S (2025) Alzheimer's Dementia Detection Using Perplexity from Paired Large Language Models.
Clarke J, Gotoh Y & Goetze S (2025) Speaker Embedding Informed Audiovisual Active Speaker Detection for Egocentric Recordings.
Ravenscroft W, Close G, Goetze S, Hain T, Soleymanpour M, Chowdhury A & Fuhs MC (2024) Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition, arXiv.
Leung W-Z, Cross M, Ragni A & Goetze S (2024) Training Data Augmentation for Dysarthric Automatic Speech Recognition by Text-to-Dysarthric-Speech Synthesis, arXiv.
Close G, Hain T & Goetze S (2024) Hallucination in Perceptual Metric-Driven Speech Enhancement Networks, arXiv.
Mogridge R, Close G, Sutherland R, Hain T, Barker J, Goetze S & Ragni A (2024) Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models, arXiv.
Close G, Ravenscroft W, Hain T & Goetze S (2023) Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement, arXiv.
Ravenscroft W, Goetze S & Hain T (2023) On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments, arXiv.
Close G, Hain T & Goetze S (2023) The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions.
Close G, Hain T & Goetze S (2023) The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions, arXiv.
Zhang H, Tang C, Loakman T, Yang B, Goetze S & Lin C (2023) CADGE: Context-Aware Dialogue Generation Enhanced with Graph-Structured Knowledge Aggregation.
Hao L, Goetze S, Alessa T & Hawley MS (2023) Effectiveness of Computer-Tailored Health Communication in Increasing Physical Activity in People With or at Risk of Long-Term Conditions: Systematic Review and Meta-Analysis (Preprint), JMIR Publications Inc..
Ravenscroft W, Goetze S & Hain T (2022) Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation, arXiv.
Ravenscroft W, Goetze S & Hain T (2022) Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation, arXiv.
Xiong F, Goetze S & Meyer BT (2015) Joint Estimation of Reverberation Time and Direct-to-Reverberation Ratio from Speech using Auditory-Inspired Features, arXiv.
Ravenscroft W, Goetze S & Hain T () Utterance weighted multi-dilation temporal convolutional networks for monaural speech dereverberation. View this article in WRRO

Grants

Research Grants

Participatory co-design of a platform for collecting atypical speech data, Research England, 03/2022 - 07/2022, £19,692, as PI

School of Computer Science

School of Computer Science

Dr Stefan Goetze

Journal articles

Book chapters

Conference proceedings

Reports

Preprints

Research Grants

Links