Professor Philip Green

PhD

Department of Computer Science

Professor of Computer Science

Member of the Speech and Hearing (SpandH) research group

Phil Green profile photo
p.green@sheffield.ac.uk
+44 114 222 1828

Full contact details

Professor Philip Green
Department of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
Profile

Phil Green is a Professor of Computer Science in the Speech and Hearing group which he founded when he came to Sheffield in 1985. His first degree, in Cybernetics was awarded by the University of Reading in 1967 and his PhD, from Keele University, in 1971.

He was head of Computer Science from 2004 to 2008.

Research interests

Professor Green has worked in several areas of speech research, particularly Automatic Speech Recognition, Auditory Scene Analysis and, latterly, Clinical Applications of Speech Technology. He has been involved in research projects worth around £30m and has coordinated 5 International collaborations.

Publications

Journal articles

Chapters

Conference proceedings papers

  • Sehgal S, Cunningham S & Green P (2018) Phase-Based Feature Representations for Improving Recognition of Dysarthric Speech. 2018 IEEE Spoken Language Technology Workshop (SLT), 18 December 2018 - 21 December 2018. RIS download Bibtex download
  • Alharbi S, Hasan M, Simons AJH, Brumfitt S & Green P (2018) A lightly supervised approach to detect stuttering in children's speech. Proceedings of Interspeech 2018 (pp 3433-3437), 2 September 2018 - 6 September 2018. View this article in WRRO RIS download Bibtex download
  • Cheah LA, Gilbert JM, Gonzalez JA, Green PD, R. Ell S, K. Moore R & Holdsworth E (2018) A Wearable Silent Speech Interface based on Magnetic Sensors with Motion-Artefact Removal. Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies, 19 January 2018 - 21 January 2018. RIS download Bibtex download
  • Coy A, Green P, Cunningham S, Christensen H, Atria JJ, Rudzicz F, Malavasi M & Desideri L (2018) Embedding speech technology into intelligent tutoring systems using the CloudCAST speech technology platform. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 10858 LNCS (pp 421-424) RIS download Bibtex download
  • Cunningham S, Green P, Christensen H, Atria JJ, Coy A, Malavasi M, Desideri L & Rudzicz F (2017) Cloud-Based Speech Technology for Assistive Technology Applications (CloudCAST). Studies in Health Technology and Informatics, Vol. 242 (pp 322-329) View this article in WRRO RIS download Bibtex download
  • Alharbi S, Hasan M, Simons AJH, Brumfitt S & Green P (2017) Detecting Stuttering Events in Transcripts of Children’s Speech (pp 217-228) View this article in WRRO RIS download Bibtex download
  • Malavasi M, Turri E, Motolese MR, Marxer R, Farwer J, Christensen H, Desideri L, Tamburini F & Green P (2017) An Innovative Speech-Based Interface to Control AAL and IoT Solutions to Help People with Speech and Motor Disability (pp 269-278) RIS download Bibtex download
  • Nicolao M, Christensen H, Cunningham S, Green P & Hain T (2016) A framework for collecting realistic recordings of dysarthric speech - The homeService corpus. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 1993-1997) View this article in WRRO RIS download Bibtex download
  • Cheah LA, Bai J, Gonzalez JA, Gilbert JM, Ell SR, Green PD & Moore RK (2016) Preliminary Evaluation of a Silent Speech Interface based on Intra-Oral Magnetic Sensing. Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies, 21 February 2016 - 23 February 2016. RIS download Bibtex download
  • Gonzalez JA, Cheah LA, Gilbert JM, Bai J, Ell SR, Green PD & Moore RK (2016) Direct Speech Generation for a Silent Speech Interface based on Permanent Magnet Articulography. Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies, 21 February 2016 - 23 February 2016. RIS download Bibtex download
  • Casanueva I, Hain T, Christensen H, Marxer R & Green P (2015) Knowledge transfer between speakers for personalised dialogue management. Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, September 2015 - September 2015. RIS download Bibtex download
  • Cheah LA, Gilbert JM, Gonzalez JA, Bai J, Ell SR, Fagan MJ, Moore RK, Green PD & Rychenko SI (2015) Integrating User-Centred Design in the Development of a Silent Speech Interface Based on Permanent Magnetic Articulography (pp 324-337) RIS download Bibtex download
  • A. Cheah L, Bai J, A. Gonzalez J, R. Ell S, M. Gilbert J, K. Moore R & D. Green P (2015) A User-centric Design of Permanent Magnetic Articulography based Assistive Speech Technology. Proceedings of the International Conference on Bio-inspired Systems and Signal Processing, 12 January 2015 - 15 January 2015. RIS download Bibtex download
  • Christensen H, Nicolao M, Cunningham S, Green P, Deena S & Hain T (2015) Speech-enabled environmental control in an AAL setting for people with speech disorders: a case study. IET International Conference on Technologies for Active and Assisted Living (TechAAL) RIS download Bibtex download
  • Gonzalez JA, Cheah LA, Bai J, Ell SR, Gilbert JM, Moore RK & Green PD (2014) Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1018-1022) RIS download Bibtex download
  • Christensen H, Casanueva I, Cunningham S, Green P & Hain T (2014) Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. 2014 IEEE Spoken Language Technology Workshop (SLT), 7 December 2014 - 10 December 2014. RIS download Bibtex download
  • Casanueva I, Christensen H, Hain T & Green P (2014) Adaptive speech recognition and dialogue management for users with speech disorders. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1033-1037) RIS download Bibtex download
  • Christensen H, Aniol MB, Bell P, Green P, Hain T, King S & Swietojanski P (2013) Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 3642-3645) RIS download Bibtex download
  • Martínez D, Green P & Christensen H (2013) Dysarthria intelligibility assessment in a factor analysis total variability space. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2133-2137) RIS download Bibtex download
  • Christensen H, Green P & Hain T (2013) Learning speaker-specific pronunciations of disordered speech. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1159-1163) RIS download Bibtex download
  • Hofe R, Bai J, Cheah LA, Ell SR, Gilbert JM, Moore RK & Green PD (2013) Performance of the MVOCA Silent Speech Interface Across Multiple Speakers. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1139-1142) RIS download Bibtex download
  • Martinez D, Green P & Christensen H (2013) Dysarthria Intelligibility Assessment in a Factor Analysis Total Variability Space. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 2132-2136) RIS download Bibtex download
  • Christensen H, Green P & Hain T (2013) Learning speaker-specific pronunciations of disordered speech. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1158-1162) RIS download Bibtex download
  • Christensen H, Cunningham S, Fox C, Green P & Hain T (2012) A comparative study of adaptive, automatic recognition of disordered speech. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 2 (pp 1774-1777) RIS download Bibtex download
  • Ning Ma , Barker J, Christensen H & Green P (2011) Incorporating localisation cues in a fragment decoding framework for distant binaural speech recognition. 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 30 May 2011 - 1 June 2011. RIS download Bibtex download
  • Ma N, Barker J, Christensen H & Green P (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1657-1660) RIS download Bibtex download
  • Hofe R, Ell SR, Fagan MJ, Gilbert JM, Green PD, Moore RK, Rybchenko SI & Assoc ISC (2011) Speech Synthesis Parameter Generation for the Assistive Silent Speech Interface MVOCA. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 3020-+) RIS download Bibtex download
  • Christensen H, Barker J, Ma N & Green P (2010) The CHiME corpus: A resource and a challenge for computational hearing in multisource environments. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010 (pp 1918-1921) RIS download Bibtex download
  • Hofe R, Ell SR, Fagan MJ, Gilbert JM, Green PD, Moore RK, Rybchenko SI & ASSOC ISC (2010) Evaluation of a Silent Speech Interface Based on Magnetic Sensing. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 (pp 246-+) RIS download Bibtex download
  • Ma N, Bartels CD, Bilmes JA & Green PD (2009) Modelling the prepausal lengthening effect for speech recognition: a dynamic Bayesian network approach. 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 19 April 2009 - 24 April 2009. RIS download Bibtex download
  • Creer SM, Cunningham SP, Green PD & Fatema K (2009) Personalizing synthetic voices for people with progressive speech disorders: Judging voice similarity. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1427-1430) RIS download Bibtex download
  • Ma N & Green P (2008) A 'speechiness' measure to improve speech decoding in the presence of other sound sources. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 1285-1288) RIS download Bibtex download
  • Carmichael J, Wan V & Green P (2008) Combining neural network and rule-based systems for dysarthria diagnosis. INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association (pp 2226-2229) RIS download Bibtex download
  • Ma N & Green P (2008) A 'speechiness' measure to improve speech decoding in the presence of other sound sources. INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association (pp 1285-1288) RIS download Bibtex download
  • Ma N, Barker J & Green P (2007) Applying word duration constraints by using unrolled HMMs. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 1 (pp 353-356) RIS download Bibtex download
  • Hawley MS, Enderby P, Green P, Cunningham S & Palmer R (2006) Development of a Voice-Input Voice-Output Communication Aid (VIVOCA) for People with Severe Dysarthria (pp 882-885) RIS download Bibtex download
  • Ma N, Green P & Coy A (2006) Exploiting dendritic autocorrelogram structure to identify spectro-temporal regions dominated by a single sound source. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2 (pp 669-672) RIS download Bibtex download
  • Ma N & Green P (2005) Context-dependent word duration modelling for robust speech recognition. 9th European Conference on Speech Communication and Technology (pp 2609-2612) RIS download Bibtex download
  • Morris AC, Maier V & Green P (2004) From WER and RIL to MER and WIL: Improved evaluation measures for connected speech recognition. 8th International Conference on Spoken Language Processing, ICSLP 2004 (pp 2765-2768) RIS download Bibtex download
  • Carmichael J & Green P (2004) Revisiting dysarthria assessment intelligibility metrics. 8th International Conference on Spoken Language Processing, ICSLP 2004 (pp 485-488) RIS download Bibtex download
  • Green P, Carmichael J, Hatzis A, Enderby P, Hawley M & Parker M (2003) Automatic speech recognition with sparse training data for dysarthric speakers. EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (pp 1189-1192) RIS download Bibtex download
  • Parveen S & Green P (2003) Multitask learning in connectionist robust ASR using recurrent neural networks. EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (pp 1813-1816) RIS download Bibtex download
  • Hatzis A, Green P, Carmichael J, Cunningham S, Palmer R, Parker M & O'Neill P (2003) An integrated toolkit deploying speech technology for computer based speech training with application to dysarthric speakers. EUROSPEECH 2003 - 8th European Conference on Speech Communication and Technology (pp 2213-2216) RIS download Bibtex download
  • Green PD & Parveen S (2002) Speech recognition with missing data using recurrent neural nets. Advances in Neural Information Processing Systems RIS download Bibtex download
  • Barker J, Cooke M & Green P (2001) Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise. EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology (pp 213-216) RIS download Bibtex download
  • Barker J, Josifovski L, Cooke M & Green P (2000) Soft decisions in missing data techniques for robust automatic speech recognition. 6th International Conference on Spoken Language Processing, ICSLP 2000 RIS download Bibtex download
  • Parveen S, Qadeer A & Green P (2000) Speaker recognition with recurrent neural networks. 6th International Conference on Spoken Language Processing, ICSLP 2000 RIS download Bibtex download
  • Morris AC, Josifovski L, Bourlard H, Cooke M & Green P (2000) A neural network for classification with incomplete data: Application to robust ASR. 6th International Conference on Spoken Language Processing, ICSLP 2000 RIS download Bibtex download
  • Alharbi S, Simons AJH, Brumfitt S & Green P () Automatic recognition of children's read speech for stuttering application. WOCCI 2017: 6th International Workshop on Child Computer Interaction View this article in WRRO RIS download Bibtex download
  • Gonzalez JA, Cheah LA, Green PD, Gilbert JM, Ell SR, Moore RK & Holdsworth E () Evaluation of a Silent Speech Interface Based on Magnetic Sensing and Deep Learning for a Phonetically Rich Vocabulary. Interspeech 2017 View this article in WRRO RIS download Bibtex download
  • Casanueva I, Hain T & Green P () Improving Generalisation to New Speakers in Spoken Dialogue State Tracking. Interspeech 2016 View this article in WRRO RIS download Bibtex download
  • Green P, Marxer R, Cunningham S, Christensen H, Rudzicz F, Yancheva M, Coy A, Malavasi M, Desideri L & Tamburini F () CloudCAST — Remote Speech Technology for Speech Professionals. Interspeech 2016 RIS download Bibtex download
  • Morris AC, Cooke MP & Green PD () Some solution to the missing feature problem in data classification, with application to noise robust ASR. Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181) RIS download Bibtex download
  • Cooke M, Morris A & Green P () Missing data techniques for robust speech recognition. 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing RIS download Bibtex download
  • Green PD, Cooke MP & Crawford MD () Auditory scene analysis and hidden Markov model recognition of speech in noise. 1995 International Conference on Acoustics, Speech, and Signal Processing RIS download Bibtex download
Grants

Current grants

  • CLOUDCAST: A cloud-based computational resource for clinical and educational applications of speech technology, LEVERHULME TRUST (THE), 01/2015 to 12/2018, £124,994, as PI
  • Speech Technology for Articulation Rehabilitation, NATIONAL INSTITUTE FOR HEALTH RESEARCH, 02/2015 to 07/2017, £271,158, as Co-PI
  • SRAM: Speech Rehabilitation from Articulator Movement, NATIONAL INSTITUTE FOR HEALTH RESEARCH, 01/2016 to 12/2018, £196,123, as PI
  • CLOUDCAST: A cloud-based computational resource for clinical and educational applications of speech technology, TORONTO REHABILITATION INSTITUTE - UNIVERSITY HEALTH NETWORK, 01/2015 to 12/2018, £27,000, as PI

Previous grants

  • Studentship, NOKIA RESEARCH CENTRE, 01/2000 to 12/2002, £31,300, as PI
  • Speech recognition for people with severe dysarthria, BARNSLEY HOSPITAL NHS FOUNDATION TRUST, 08/2000 to 08/2003, £103,146, as PI
  • Inter-formant intelligibility dips for narrowband speech, EPSRC, 12/2000 to 11/2001, £33,536, as PI
  • HOARSE: Hearing Organisation And Recognition of Speech in Europe, EUROPEAN COMMISSION - FP6/FP7, 09/2002 to 08/2006, £131,793, as PI
  • Multisource decoding for speech in the presence of other sound sources, EPSRC, 07/2002 to 12/2005, £271,630, as PI
  • Studentship, MOTOROLA INCORPORATED, 11/2001 to 10/2004, £42,000, as PI
  • Studentship, MOTOROLA INCORPORATED, 11/2001 to 10/2004, £42,000, as PI
  • AMI: Augmented Multi-party Interaction, EUROPEAN COMMISSION - FP6/FP7, 01/2004 to 12/2006, £413,049, as PI
  • VIVOCA: Voice Input Voice Output Communication Aid, BARNSLEY HOSPITAL NHS FOUNDATION TRUST, 01/2005 to 10/2008, £178,529, as Co-PI
  • SPECS, BARNSLEY HOSPITAL NHS FOUNDATION TRUST, 02/2006 to 11/2010, £193,163, as Co-PI
  • Studentship, EPSRC, 01/2005 to 03/2011, £125,000, as PI
  • REdRESS: Recognition and Reconstruction of Speech following Laryngectomy, UNIVERSITY OF HULL, 03/2009 to 02/2012, £59,275, as PI
  • VIVOCA II: Voice Input Voice Output Communication Aid, BARNSLEY HOSPITAL NHS FOUNDATION TRUST, 06/2010 to 05/2013, £213,792, as Co-PI
  • CHIME: Computational Hearing in Multisource Environments, EPSRC, 06/2009 to 05/2012, £326,245, as Co-PI
  • SCALE: Speech Communication with Adaptive LEarning, EUROPEAN COMMISSION - FP6/FP7, 01/2009 to 12/2012, £284,423, as PI
  • Natural Speech Technology, EPSRC, 05/2011 to 07/2016, £1,798,665, as Co-PI
  • DiSARM: Digital Speech Recovery from Articulator Movement, NATIONAL INSTITUTE FOR HEALTH RESEARCH, 10/2011 to 07/2015, £282,898, as PI