Dr Ning Ma

School of Computer Science

Lecturer in Medical Computing

NHS Liaison Link

Member of the Pervasive Computing research group

Member of the Speech and Hearing research group

n.ma@sheffield.ac.uk

Full contact details

Dr Ning Ma
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
Profile

Ning is a Lecturer in Medical Computing at the Department of Computer Science, University of Sheffield, and also an Academic Directorate of Medical Imaging and Medical Physics at the Sheffield Teaching Hospitals NHS Foundation Trust. Before that he was a Research Fellow in Computer Science working on health-related research projects. His first degree was in Computer Science from South China University of Technology, and he has a PhD in hearing-inspired automatic speech processing from the University of Sheffield.

Ning’s research interests lie in speech and hearing technologies, machine learning and healthcare. In particular, his research focuses on the development of AI systems that can interpret sounds and low-cost sensor data and extract useful information for screening health issues, such as sleep-disordered breathing and respiratory diseases. He has been PI and Co-PI of several UKRI and HEIF grants on acoustic monitoring of sleep-disordered breathing and cough sound analysis for tuberculosis screening. He is also interested in music AI technology and its link with mental health.

Ning has published 60+ refereed journal and conference papers. He is on the Technical Programme Committee for INTERSPEECH 2023 and 2024 as the Lead Area Chair for Speech, voice, and hearing disorders. He regularly reviews manuscripts and grants for a range of journals and funders.

Ning is an Insigneo Institute Research Theme Co-Director for Healthcare data/AI. He is a member of the British Sleep Society, the British Thoracic Society and IEEE.

Research interests
  • Acoustic monitoring for healthcare, including sleep-disordered breathing and respiratory conditions
  • Multimodal machine learning for health applications
  • Speech and hearing technology
  • Hearing impairment and cochlear implant processing
Publications

Conference proceedings papers

  • Hu Q, Ma N & Brown GJ (2023) Robust binaural sound localisation with temporal attention. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. Rhodes Island, Greece, 4 June 2023 - 4 June 2023.
  • Tu Z, Deadman J, Ma N & Barker J (2022) Auditory-Based Data Augmentation for end-to-end Automatic Speech Recognition. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 23 May 2022 - 27 May 2022.
  • Barker E, Barker J, Gaizauskas R, Ma N & Paramita ML (2022) SNuC: The Sheffield Numbers Spoken Language Corpus. 2022 Language Resources and Evaluation Conference, LREC 2022 (pp 1978-1984)
  • Tu Z, Ma N & Barker J (2021) Optimising hearing aid fittings for speech in noise with a differentiable hearing loss model. Interspeech 2021 (pp 691-695). Brno, Czechia, 30 August 2021 - 30 August 2021.
  • Tu Z, Ma N & Barker J (2021) DHASP: Differentiable Hearing Aid Speech Processing, Vol. 00 (pp 296-300)
  • Ornolfsson I, Dau T, Ma N & May T (2021) Exploiting Non-Negative Matrix Factorization for Binaural Sound Localization in the Presence of Directional Interference. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 6 June 2021 - 11 June 2021.
  • Ma N, Brown GJ & Vecchiotti P (2021) AMI – Creating musical compositions with a coherent long-term structure. AISB Convention 2021: Communication and Conversations
  • Ma N, Brown GJ & Vecchiotti P (2021) AMI – Creating Coherent Musical Composition with Attention. ICMC 2021 - Proceedings of the International Computer Music Conference 2021 (pp 414-418)
  • Romero HE, Ma N, Hill EA & Brown GJ (2020) 0573 Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43(Supplement_1) (pp a219-a220). Philadelphia, PA, USA (online conference), 27 August 2020 - 27 August 2020.
  • Romero HE, Ma N & Brown GJ (2020) Snorer diarisation based on deep neural network embeddings. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Barcelona, Spain (virtual conference), 4 May 2020 - 4 May 2020.
  • Romero HE, Ma N, Hill EA & Brown GJ (2020) Screening for obstructive sleep apnea at home based on deep learning features derived from respiration sounds. Sleep, Vol. 43 (pp A219-A220)
  • Romero H, Ma N, Brown G, Beeston A & Hasan M (2019) Deep learning features for robust detection of acoustic events in sleep-disordered breathing. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019.
  • Vecchiotti P, Ma N, Squartini S & Brown G (2019) End-to-end binaural sound localisation from the raw waveform. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-2019). Brighton, UK, 12 May 2019 - 17 May 2019.
  • Romero HE, Ma N, Brown GJ, Beeston AV & Hasan M (2019) Deep Learning Features for Robust Detection of Acoustic Events in Sleep-disordered Breathing. ICASSP (pp 810-814)
  • Vecchiotti P, Ma N, Squartini S & Brown GJ (2019) End-to-end Binaural Sound Localisation from the Raw Waveform. ICASSP (pp 451-455)
  • Meutzner H, Ma N, Nickel R, Schymura C & Kolossa D (2017) Improving audio-visual speech recognition using deep neural networks with dynamic stream reliability estimates. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 5 March 2017 - 9 March 2017.
  • Guo Y, Wang X, Wu C, Fu Q, Ma N & Brown G (2016) A robust dual-microphone speech source localization algorithm for reverberant environments. Proceedings of INTERSPEECH 2016
  • Zeiler S, Nickel R, Ma N, Brown GJ & Kolossa D (2016) Robust audiovisual speech recognition using noise-adaptive linear discriminant analysis. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2016-May (pp 2797-2801)
  • Ma N, Marxer R, Barker J & Brown GJ (2015) Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015.
  • Ma N, Brown GJ & Gonzalez JA (2015) Exploiting top-down source models to improve binaural localisation of multiple sources in reverberant environments. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 160-164)
  • Ma N, Brown G & May T (2015) Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions. Proceedings of Interspeech 2015 (pp 160-164). Dresden, Germany, 6 September 2015 - 10 September 2015.
  • May T, Ma N & Brown GJ (2015) Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brisbane, 19 April 2015 - 24 April 2015.
  • Ma N, May T, Wierstorf H & Brown GJ (2015) A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19 April 2015 - 24 April 2015.
  • Ma N, May T, Wierstorf H & Brown GJ (2015) A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions. 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp 2699-2703)
  • May T, Ma N & Brown GJ (2015) Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues. 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp 2679-2683)
  • Schymura C, Walther T, Kolossa D, Ma N & Brown GJ (2014) Binaural sound source localisation using a Bayesian-network-based blackboard system and hypothesis-driven feedback. Proceedings of Forum Acusticum, Vol. 2014-January
  • Ma N & Barker J (2013) A fragment-decoding plus missing-data imputation system evaluated on the 2nd CHiME challenge. Proceedings of the 2nd CHiME Workshop on Machine Listening in Multisource Environments (pp 53-58)
  • González JA, Peinado AM, Gómez AM & Ma N (2012) Log-spectral feature reconstruction based on an occlusion model for noise robust speech recognition. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 3 (pp 2629-2632)
  • González JA, Peinado AM, Gómez AM, Ma N & Barker J (2012) Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on (pp 4693-4696). IEEE
  • Ma N & Barker J (2012) Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 3 (pp 2637-2640)
  • Gonzalez JA, Peinado AM, Gomez AM, Ma N & Barker J (2012) Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 25 March 2012 - 30 March 2012.
  • Ma N, Barker J, Christensen H & Green P (2011) Recent advances in fragment-based speech recognition in reverberant multisource environments. Proceedings of ISCA Workshop on Machine Listening in Multisource Environments (pp 68-73)
  • Ma N, Barker J, Christensen H & Green P (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. Proceedings of INTERSPEECH 2011 (pp 1657-1660)
  • Ma N, Barker J, Christensen H & Green P (2011) Incorporating localisation cues in a fragment decoding framework for distant binaural speech recognition. 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 30 May 2011 - 1 June 2011.
  • Ma N, Barker J, Christensen H & Green P (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), Vols 1-5 (pp 1668-1671)
  • Morales-Cordovilla JA, Ma N, Sanchez V, Carmona JL, Peinado AM & Barker J (2011) A pitch based noise estimation technique for robust speech recognition with Missing Data. 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 22 May 2011 - 27 May 2011.
  • Ma N, Barker J, Christensen H & Green P (2010) Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding. ISCA Tutorial and Research Workshop on Statistical And Perceptual Audition
  • Christensen H, Barker J, Ma N & Green P (2010) The CHiME corpus: A resource and a challenge for computational hearing in multisource environments. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010 (pp 1918-1921)
  • Ma N, Bartels C, Bilmes J & Green P (2009) Modelling the prepausal lengthening effect for speech recognition: A dynamic Bayesian network approach. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Taipei
  • Ma N, Bartels CD, Bilmes JA & Green PD (2009) Modelling the prepausal lengthening effect for speech recognition: a dynamic Bayesian network approach. 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 19 April 2009 - 24 April 2009.
  • Christensen H, Ma N, Wrigley SN & Barker J (2009) A speech fragment approach to localising multiple speakers in reverberant environments. 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vols 1-8, Proceedings (pp 4593-4596)
  • Ma N & Green P (2008) A 'speechiness' measure to improve speech decoding in the presence of other sound sources. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1285-1288)
  • Ma N & Green P (2008) A 'speechiness' measure to improve speech decoding in the presence of other sound sources. INTERSPEECH 2008: 9th Annual Conference of the International Speech Communication Association 2008, Vols 1-5 (pp 1285-1288)
  • Christensen H, Ma N, Wrigley SN & Barker J (2007) Integrating pitch and localisation cues at a speech fragment level. INTERSPEECH 2007: 8th Annual Conference of the International Speech Communication Association, Vols 1-4 (pp 2752-2755)
  • Ma N, Barker J & Green P (2007) Applying word duration constraints by using unrolled HMMs. INTERSPEECH 2007: 8th Annual Conference of the International Speech Communication Association, Vols 1-4 (pp 353-356)
  • Barker J, Coy A, Ma N & Cooke M (2006) Recent advances in speech fragment decoding techniques. INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, Vols 1-5 (pp 85-88)
  • Ma N, Green P & Coy A (2006) Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source. INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, Vols 1-5 (pp 669-672)
  • Ma N & Green P (2005) Context-dependent word duration modelling for robust speech recognition. 9th European Conference on Speech Communication and Technology (pp 2609-2612)
  • Xu X, Brown GJ & Ma N (2025) Sound-based sleep staging using pretrained speech foundation models. Proceedings of the 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Copenhagen, Denmark, 14 July 2025 - 14 July 2025.
  • Hughes C, Brown G, Ma N & Dibben N (2024) Acoustic effects of facial feminisation surgery on speech and singing: A case study. Proceedings of Interspeech 2024. Kos Island, Greece, 1 September 2024 - 1 September 2024.
  • Romero H, Ma N, Brown G & Johnson S (2024) SLUMBR: SLeep statUs estiMation from aBdominal Respiratory effort. Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine & Biology Society. Orlando, Florida, 15 July 2024 - 15 July 2024.
  • Romero HE, Ma N, Brown GJ & Johnson S (2023) Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach. INTERSPEECH 2023
  • Tu Z, Ma N & Barker J (2022) Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction. Interspeech 2022
  • Tu Z, Ma N & Barker J (2022) Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners. Interspeech 2022
  • Ma N & Brown GJ (2016) Speech Localisation in a Multitalker Mixture by Humans and Machines. Interspeech 2016
  • Christensen H, Ma N, Wrigley SN & Barker J (2007) Integrating pitch and localisation cues at a speech fragment level. Interspeech 2007
  • Ma N & Green P () Context-dependent word duration modelling for robust speech recognition. INTERSPEECH. Lisbon

Grants
  • Advancing lung health in Zambia through increasing access to integrated and comprehensive screening, diagnosis and management of TB and other chronic respiratory diseases at community and primary care levels, Stop TB Partnership, 10/2024 - 05/2026, £30,364, as PI
  • Home Monitoring of Paediatric Sleep Disordered Breathing with Unobtrusive Sensors, MRC, 05/2024 - 10/2025, £74,822, as PI
  • Advance Acoustic AI Technology for Low-cost Tuberculosis Screening, RCUK, 04/2024 - 09/2025, £113,972, as Co-I
  • Speech and Acoustic Technology for Transgender Voice, Research England, 04/2023 - 06/2023, £5,000, as PI
  • AI-Enabled Cough Sound Analysis for Tuberculosis Screening, EPSRC IAA programme, 03/2023 - 10/2023, £27,434, as PI
  • Monitoring sleep disordered breathing of long-Covid patients at home using acoustic AI Technology, Research England, 01/2022 - 07/2022, £71,222, as PI
  • Artificial Musical Intelligence (AMI): Building Relationships and Identifying Use Cases with Creative Practitioners, Research England, 12/2021 - 06/2023, £19,820, as Co-I
  • SOMNUS: Sleep disOrder MoNitoring by Unobtrusive Sensors, Innovate UK, 07/2021 - 11/2023, £230,649, as Co-I
  • Making Elektra, Research England, 02/2021 - 04/2021, £6,236, as Co-I
  • Brahms: Breathing Resistance Assessment via Home Monitoring of Sleep, Innovate UK, 06/2019 - 02/2021, £109,600, as Co-I
  • MAI: Musical Artificial Intelligence, HEFCE, 02/2019 - 05/2020, £53,408, as Co-I
Professional activities and memberships
  • I am on the Technical Programme Committee for INTERSPEECH 2023 and 2024 as the Lead Area Chair for Speech, voice, and hearing disorders. I regularly review manuscripts and grants for a range of journals and funders.
  • Member of the British Sleep Society
  • Member of the British Thoracic Society
  • Insigneo Institute Research Theme Co-Director for Healthcare data/AI