Professor Roger K. Moore
BA (Hons), MSc, PhD
School of Computer Science
Professor of Spoken Language Processing
Deputy Head of School
Head of the Speech and Hearing (SpandH) research group
Full contact details
School of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Profile
Prof. Roger K. Moore has over 40 years’ experience in Speech Technology R&D. Although he is an engineer by training, much of his research has been based on insights from human speech perception and production.
As Head of the UK Government's Speech Research Unit from 1985 to 1999, he was responsible for the development of the Aurix range of speech technology products and the subsequent formation of 20/20 Speech Ltd.
Since 2004 he has been Professor of Spoken Language Processing at the University of Sheffield, and also holds Visiting Chairs at Bristol Robotics Laboratory and University College London Psychology & Language Sciences. He was President of the European/International Speech Communication Association from 1997 to 2001, General Chair for INTERSPEECH-2009 and ISCA Distinguished Lecturer during 2014-15.
In 2017 he organised the first international workshop on ‘Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR)’. Prof. Moore is the current Editor-in-Chief of Computer Speech & Language and in 2016 he was awarded the LREC Antonio Zampoli Prize for "Outstanding Contributions to the Advancement of Language Resources & Language Technology Evaluation within Human Language Technologies".
- Research interests
Prof. Moore is currently working on a unified theory of spoken language processing in the general area of ‘Cognitive Informatics’ called ‘PRESENCE’ (PREdictive SENsorimotor Control and Emulation). PRESENCE weaves together accounts from a wide variety of different disciplines concerned with the behaviour of living systems - many of them outside the normal realms of spoken language - and compiles them into a new framework that is intended to breathe life into a new generation of research into spoken language processing.
Prof. Moore is involved in collaborations aimed at Clinical Applications of Speech Technology (particularly for individuals with speaking difficulties) and he is becoming increasingly involved in Creative Applications of Speech Technology through interactions with colleagues from the performing arts.
- Publications
Books
- Biomedical Engineering Systems and Technologies. Springer International Publishing.
- Spoken language system and corpus design.
- Spoken Language Reference Materials. De Gruyter.
- Spoken Language Characterization. De Gruyter.
Journal articles
- Digital capability, open-source use, and interoperability standards within the NHS in England: a survey of healthcare trusts. JMIR Human Factors, 12.
- Freedom comes at a cost?: An exploratory study on affordances’ impact on users’ perception of a social robot. Frontiers in Robotics and AI, 11.
- Vocal interactivity in-and-between humans, animals and robots. Interaction Studies. Social Behaviour and Communication in Biological and Artificial Systems, 24(1), 1-4.
- Using social robots for language learning: are we there yet?. Journal of China Computer-Assisted Language Learning, 3(1), 208-230.
- Is honesty the best policy for mismatched partners? Aligning multi-modal affordances of a social robot: an opinion paper. Frontiers in Virtual Reality.
- Spoken language interaction with robots: Recommendations for future research. Computer Speech & Language, 71.
- Cross-species parallels in babbling: animals and algorithms. Philosophical Transactions of the Royal Society B: Biological Sciences, 376(1836).
- Acceptability and effectiveness of NHS recommended e-therapies for depression, anxiety and stress: A meta-analysis. Journal of Medical Internet Research, 22(10).
- Usability, acceptability and effectiveness of web-based conversational agents to facilitate problem solving in older adults: controlled study. Journal of Medical Internet Research, 22(5).
- E-therapies in England for stress, anxiety or depression: how are apps developed? A survey of NHS e-therapy developers. BMJ Health & Care Informatics, 26(1).
- The effects of robot facial emotional expressions and gender on child-robot interaction in a field study. Connection Science, 30(4), 343-361.
- Toward a needs-based architecture for 'intelligent' communicative agents: speaking with intention. Frontiers in Robotics and AI, 4.
- Direct Speech Reconstruction From Articulatory Sensor Data by Machine Learning. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(12), 2362-2374.
- Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics. Journal of the Acoustical Society of America, 141(3), EL307-EL307.
- E-therapies in England for stress, anxiety or depression: what is being used in the NHS? A survey of mental health services. BMJ Open, 7(1).
- Restoring Speech Following Total Removal of the Larynx. Studies in Health Technology and Informatics, 242, 314-321.
- Vocal Interactivity in-and-between Humans, Animals, and Robots. Frontiers in Robotics and AI, 3.
- A silent speech system based on permanent magnet articulography and direct synthesis. Computer Speech & Language, 39, 67-87.
- Introducing a Pictographic Language for Envisioning a Rich Variety of Enactive Systems with Different Degrees of Complexity. International Journal of Advanced Robotic Systems, 13.
- Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR) (Dagstuhl Seminar 16442). Dagstuhl Reports, 6, 154-194.
- Spoken language processing: Time to look outside?. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 8791, 21-36.
- Discovering the phoneme inventory of an unwritten language: A machine-assisted approach. Speech Communication, 56, 152-166.
- Spoken Language Processing: Where Do We Go from Here?, 119-133.
- A Bayesian explanation of the 'Uncanny Valley' effect and related psychological phenomena. Scientific Reports, 2, 864.
- Generating context-sensitive ECA responses to user barge-in interruptions. Journal on Multimodal User Interfaces, 6(1-2), 13-25.
- Small-Vocabulary Speech Recognition Using a Silent Speech Interface Based on Magnetic Sensing. Speech Communication.
- Generating context-sensitive ECA responses to user barge-in interruptions. Journal on Multimodal User Interfaces, 1-13.
- Towards the detection of social dominance in dialogue. Speech Communication, 53(9-10), 1104-1114.
- A prototype for a conversational companion for reminiscing about images. Computer Speech & Language, 25(2), 140-157.
- Speech as the Perception of Affordances. Ecological Psychology, 22(4), 327-343.
- Computing phonological generalization over real speech exemplars. Journal of Phonetics, 38(4), 540-547.
- Discovering an optimal set of minimally contrasting acoustic speech units: A point of focus for whole-word pattern matching. Proceedings of the 11th Annual Conference of the International Speech Communication Association Interspeech 2010, 310-313.
- Isolated word recognition of silent speech using magnetic implants and sensors. Medical Engineering and Physics, 32(10), 1189-1197.
- An attention-gating recurrent working memory architecture for emergent speech representation. Connection Science, 22(2), 157-175.
- Evaluation of a silent speech interface based on magnetic sensing. Proceedings of the 11th Annual Conference of the International Speech Communication Association Interspeech 2010, 246-249.
- Isolated word recognition of silent speech using magnetic implants and sensors. Medical Engineering and Physics.
- Biomimetic vocal tract modeling: Synthesis of speech articulation. The Journal of the Acoustical Society of America, 125(4_Supplement), 2495-2495.
- Towards an investigation of speech energetics using 'AnTon': an animatronic model of a human tongue and vocal tract. Connection Science, 20(4), 319-336.
- ACORNS - Towards computational modeling of communication and recognition skills. Proceedings of the 6th IEEE International Conference on Cognitive Informatics Icci 2007, 349-356.
- Using linguistic cues for the automatic recognition of personality in conversation and text. Journal of Artificial Intelligence Research, 30, 457-500.
- PRESENCE: A human-inspired architecture for speech-based human-machine interaction. IEEE Transactions on Computers, 56(9), 1176-1188.
- Spoken language processing: Piecing together the puzzle. Speech Communication, 49(5), 418-435.
- 2006 Workshop on Spoken Language Technology. IEEE Transactions on Audio, Speech, and Language Processing, 14(3), 1094-1094.
- Results from a survey of attendees at ASRU 1997 and 2003. 9th European Conference on Speech Communication and Technology, 117-120.
- An investigation into a simulation of episodic memory for automatic speech recognition. 9th European Conference on Speech Communication and Technology, 1245-1248.
- Panel on ubiquitous speech processing. 9th European Conference on Speech Communication and Technology.
- Speech communication: Louis pols special issue. Speech Communication, 47(1-2), 3-6.
- Introduction to the special issue on data mining of speech, audio, and dialog. IEEE Transactions on Speech and Audio Processing, 13(5), 633-634.
- Dictation and Voice Control: Automatic Speech Recognition in the Marketplace. IEE Colloquium Digest(499), 7/4.
- Critique: The potential role of speech production models in automatic speech recognition. The Journal of the Acoustical Society of America, 99(3), 1710-1713.
- Modelling intonation contours at the phrase level using continuous density hidden Markov models. Computer Speech & Language, 8(3), 247-260.
- Editorial. Speech Communication, 9(1), ix-ix.
- Minimally distinct word-pair discrimination using a back-propagation network. Computer Speech & Language, 3(2), 119-131.
- Isolated digit recognition experiments using the multi-layer perceptron. Speech Communication, 7(4), 403-409.
- Speech Recognition Systems and Theories of Speech Perception, 427-441.
- A multilevel approach to pattern processing. Pattern Recognition, 14(1-6), 261-265.
- A Dynamic Programming Algorithm for the Distance Between Two Finite Areas. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-1(1), 86-88.
- Evaluating speech recognizers. IEEE Transactions on Acoustics, Speech, and Signal Processing, 25(2), 178-183.
- Vocal interactivity in crowds, flocks and swarms: implications for voice user interfaces. Proceedings of the 2nd International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR 2019), 94-99.
- Performance of the MVOCA silent speech interface across multiple speakers. Interspeech 2013, 1140-1143.
- Speech synthesis parameter generation for the assistive silent speech interface MVOCA. Interspeech 2011, 3009-3012.
Book chapters
- PCT and beyond: toward a computational framework for ‘intelligent’ communicative systems In Mansell W (Ed.), The Interdisciplinary Handbook of Perceptual Control Theory (pp. 557-582). Academic Press (Elsevier)
- A Structural Approach to Dealing with High Dimensionality Parameter Search Spaces, Lecture Notes in Computer Science (pp. 159-170). Springer International Publishing
- Evaluating ToRCH Structure for Characterizing Robots, Lecture Notes in Computer Science (pp. 319-330). Springer International Publishing
- Voice restoration after laryngectomy based on magnetic sensing of articulator movement and statistical articulation-to-speech conversion (pp. 295-316).
- Towards an intraoral-based silent speech restoration system for post-laryngectomy voice replacement (pp. 22-38).
- Part V: Conclusion, Robots that Talk and Listen (pp. 315-336). DE GRUYTER
- From talking and listening robots to intelligent communicative machines, ROBOTS THAT TALK AND LISTEN: TECHNOLOGY AND SOCIAL IMPACT (pp. 317-335).
- Spoken Language Processing: Time to Look Outside?, Lecture Notes in Computer Science (pp. 21-36). Springer International Publishing
- Interacting with Purpose (and Feeling!): What Neuropsychology and the Performing Arts Can Tell Us About ’Real’ Spoken Language Behaviour, Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems Workshop (pp. 5-5). Springer New York
- Cognitive approaches to spoken language technology In Chen F & Jokinen K (Ed.), Speech Technology: Theory and Applications (pp. 89-103). Springer Verlag
- Speech Recognition In Clark A, Fox C & Lappin S (Ed.), The Handbook of Computational Linguistics and Natural Language Processing (pp. 299-332). Wiley-Blackwell
- Spoken Language Processing by Machine In Gaskell G (Ed.), Oxford Handbook of Psycholinguistics (pp. 723-738). New York: Oxford University Press.
- Affective computing and collaborative networks: Towards emotion-aware interaction (pp. 315-322).
- Isolated Digit Recognition Using the Multi-Layer Perceptron, Recent Advances in Speech Understanding and Dialog Systems (pp. 261-265). Springer Berlin Heidelberg
Conference proceedings
- Adaptive Affordance Design for Social Robots: Tailoring to Role-Specific Preferences. 2025 20th ACM/IEEE International Conference on Human-Robot Interaction (HRI) (pp 580-588), 4 March 2025 - 6 March 2025.
- Refining text input for augmentative and alternative communication (AAC) devices: analysing language model layers for optimisation. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings (pp 12016-12020). Seoul, Korea, Republic of, 14 April 2024 - 14 April 2024.
- Bridging the communication rate gap: enhancing text input for augmentative and alternative communication (AAC). HCI International 2023 – Late Breaking Papers, Vol. 14055. Copenhagen, Denmark, 23 July 2023 - 23 July 2023.
- Progress and prospects for spoken language technology: Results from five sexennial surveys. Proceedings of INTERSPEECH 2023 (pp 401-405). Dublin, Ireland, 20 August 2023 - 20 August 2023.
- Local Minima Drive Communications in Cooperative Interaction. Proceedings of the Aisb Convention 2023 (pp 51-56)
- Incremental Disfluency Detection for Spoken Learner English. Proceedings of the 17th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2022) (pp 272-278), July 2022 - July 2022.
- Investigating deep neural structures and their interpretability in the domain of voice conversion. Interspeech 2021 (pp 806-810). Brno, Czechia, 30 August 2021 - 3 September 2021.
- Using Sampling Techniques and Machine Learning Algorithms to Improve Big Five Personality Traits Recognition from Non-verbal Cues. 2021 National Computing Colleges Conference (NCCC) (pp 1-6), 27 March 2021 - 28 March 2021.
- Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition. Interspeech 2020 (pp 4084-4088). Shanghai, China, 25 October 2020 - 25 October 2020.
- An end-to-end deep neural network for facial emotion classification. 2019 22th International Conference on Information Fusion (FUSION). Ottawa, Canada, 2 July 2019 - 2 July 2019.
- Dual stream spatio-temporal motion fusion with self-attention for action recognition. 2019 22th International Conference on Information Fusion (FUSION). Ottawa, Canada, 2 July 2019 - 2 July 2019.
- Spatio-Temporal Context Modelling for Speech Emotion Classification. 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (pp 853-859), 14 December 2019 - 18 December 2019.
- Learning temporal clusters using capsule routing for speech emotion recognition. Proceedings of Interspeech 2019 (pp 1701-1705). Graz, Austria, 15 September 2019 - 19 September 2019.
- Using Alexa for flashcard-based learning. Proceedings of Interspeech 2019 (pp 1846-1850). Graz, Austria, 15 September 2019 - 19 September 2019.
- On the use/misuse of the term 'phoneme'. Proceedings Interspeech 2019 (pp 2340-2344). Graz, Austria, 15 September 2019 - 19 September 2019.
- Mapping Theoretical and Methodological Perspectives for Understanding Speech Interface Interactions. Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems (pp 1-8)
- Examining Temporal Variations in Recognizing Unspoken Words using EEG Signals. 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC) (pp 976-981). Miyazaki, Japan, 7 October 2018 - 7 October 2018.
- Discriminating between imagined speech and non-speech tasks using EEG. 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) (pp 1952-1955). Honolulu, Hawaii, 18 July 2018 - 18 July 2018.
- American sign language posture understanding with deep neural networks. 2018 21st International Conference on Information Fusion (FUSION) (pp 573-579). UK, 10 July 2018 - 10 July 2018.
- Learning capsules for vehicle logo recognition. 2018 21st International Conference on Information Fusion (FUSION) (pp 565-572). UK, 10 July 2018 - 10 July 2018.
- Towards a comprehensive taxonomy for characterizing robots. Conference proceedings TAROS 2018, Vol. 10965 (pp 381-392). Bristol, UK, 25 July 2018 - 27 July 2018.
- A Wearable Silent Speech Interface based on Magnetic Sensors with Motion-Artefact Removal. Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies, 19 January 2018 - 21 January 2018.
- Creating a voice for MiRo, the world's first commercial biomimetic robot. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 (pp 3419-3420). Stockholm, Sweden, 20 August 2017 - 20 August 2017.
- Evaluation of a Silent Speech Interface based on Magnetic Sensing and Deep Learning for a Phonetically Rich Vocabulary. Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech (pp 3986-3990). Stockholm, 20 August 2017 - 20 August 2017.
- Children's age influences their use of biological and mechanical questions towards a humanoid. Proceedings of the 18th Towards Autonomous Robotic Systems (TAROS) Conference, Vol. 10454 (pp 290-299). University of Surrey, Guildford.
- A biomimetic vocalisation system for MiRo. Biomimetic and Biohybrid Systems. Living Machines 2017, Vol. 10384 (pp 363-374). Stanford, CA, 26 July 2017 - 26 July 2017.
- You made him be alive: Children’s perceptions of animacy in a humanoid robot. Lecture Notes in Computer Science, Vol. 10384 (pp 73-85). Stanford University, California.
- The Sheffield Search and Rescue corpus. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 5840-5844), 5 March 2017 - 9 March 2017.
- PrimEmo: a neural implementation of survival circuits supporting primitive emotions. Proceedings of AISB Annual Convention 2017 (pp 173-180). Bath, UK, 18 April 2017 - 18 April 2017.
- A needs-driven cognitive architecture for future 'intelligent' communicative agents. Proceedings of EUCognition 2016 - "Cognitive Robot Architectures", Vol. 1855 (pp 50-51). Vienna, Austria.
- Interspeech 2017. Interspeech 2017
- Is spoken language all-or-nothing? Implications for future speech-based human-machine interaction. Lecture Notes in Electrical Engineering, Vol. 427 (pp 281-291)
- Brain-computer interface technology for speech recognition: A review. Proceedings of 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) (pp 1-5). Jeju, South Korea, 13 December 2016 - 16 December 2016.
- The EASEL project: Towards educational human-robot symbiotic interaction. Lecture Notes in Computer Science, Vol. 9793 (pp 297-306). Edinburgh, UK, 19 July 2016 - 19 July 2016.
- Towards a synthetic tutor assistant: The EASEL project and its architecture. Lecture Notes in Computer Science, Vol. 9793 (pp 353-364). Edinburgh, UK, 19 July 2016 - 19 July 2016.
- Designing robot personalities for human-robot symbiotic interaction in an educational context. Biomimetic and Biohybrid Systems, Vol. 9793 (pp 413-417). Edinburgh, UK.
- Congratulations, It’s a Boy! Bench-Marking Children’s Perceptions of the Robokind Zeno-R25. Towards Autonomous Robotic Systems, Vol. 9716 (pp 33-39). Sheffield, UK, 28 June 2016 - 28 June 2016.
- Preliminary Evaluation of a Silent Speech Interface based on Intra-Oral Magnetic Sensing. Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies (pp 108-116), 21 February 2016 - 23 February 2016.
- Direct Speech Generation for a Silent Speech Interface based on Permanent Magnet Articulography. Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies (pp 96-105), 21 February 2016 - 23 February 2016.
- Integrating User-Centred Design in the Development of a Silent Speech Interface Based on Permanent Magnetic Articulography (pp 324-337)
- Speech-Based Location Estimation of First Responders in a Simulated Search and Rescue Scenario. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2734-2738). Dresden, Germany, 6 September 2015 - 6 September 2015.
- Children's Age Influences Their Perceptions of a Humanoid Robot as Being Like a Person or Machine. Biomimetic and Biohybrid Systems, Vol. 9222 (pp 348-353). Barcelona, Spain.
- A User-centric Design of Permanent Magnetic Articulography based Assistive Speech Technology. Proceedings of the International Conference on Bio-inspired Systems and Signal Processing (pp 109-116), 12 January 2015 - 15 January 2015.
- Presence of life-like robot expressions influences children's enjoyment of human-robot interactions in the field. Proceedings of the AISB Convention 2015. Canterbury, UK.
- On the use of the 'pure data' programming language for teaching and public outreach in speech processing. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 1498-1499)
- The Uncanny Valley: A Focus on Misaligned Cues (pp 256-265)
- Optimising robot personalities for symbiotic interaction. Biomimetic and Biohybrid Systems, Vol. 8608 (pp 392-395). Milan, Italy.
- A phonetic-contrast motivated adaptation to control the degree-of-articulation on Italian HMM-based synthetic voices. 8th ISCA Workshop on Speech Synthesis Ssw 2013 (pp 107-112)
- Performance of the MVOCA Silent Speech Interface Across Multiple Speakers. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1139-1142)
- C2H: A Computational Model of H&H-based Phonetic Contrast in Synthetic Speech. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 (pp 986-989)
- Establishing some principles of human speech production through two-dimensional computational models. Sapa Scale Conference 2012 (pp 5-10)
- Cross-language phone recognition when the target language phoneme inventory is not known. Interspeech’11. Florence
- Speech Synthesis Parameter Generation for the Assistive Silent Speech Interface MVOCA. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 3020-+)
- Progress and Prospects for Speech Technology: Results from Three Sexennial Surveys. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 (pp 1544-1547)
- Reactive speech synthesis: actively managing phonetic contrast along an H&H continuum. 17th International Congress of Phonetics Sciences (ICPhS). Hong Kong
- Evaluation of a Silent Speech Interface Based on Magnetic Sensing. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4 (pp 246-249)
- Discovering an Optimal Set of Minimally Contrasting Acoustic Speech Units: A Point of Focus for Whole-Word Pattern Matching. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4 (pp 310-313)
- Biomimetic vocal tract modeling: preliminary results of vocalization experiments. Proceedings of Meetings on Acoustics (pp 060004-060004)
- Evolving Spiking Neural Parameters for Behavioral Sequences. ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT II, Vol. 5769 (pp 784-793)
- Finding allophones: An evaluation on consonants in the TIMIT corpus. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 1651-1654)
- Modelling vocabulary growth from birth to young adulthood. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 1727-1730)
- Discovering keywords from cross-modal input: Ecological vs. engineering methods for enhancing acoustic repetitions. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 1171-1174)
- The case for case-based automatic speech recognition. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 3027-3030)
- Modelling Vocabulary Growth from Birth to Young Adulthood. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 1695-1698)
- The Case for Case-Based Automatic Speech Recognition. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 2999-3002)
- Discovering Keywords from Cross-Modal Input: Ecological vs. Engineering Methods for Enhancing Acoustic Repetitions. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 1151-1154)
- Finding Allophones: an Evaluation on Consonants in the TIMIT Corpus. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 1631-1634)
- A Computational Model of Language Acquisition: the Emergence of Words. FUNDAMENTA INFORMATICAE, Vol. 90(3) (pp 229-249)
- A Computational Model of Preverbal Infant Word Learning. Proceedings of Iccm 2009 9th International Conference on Cognitive Modeling (pp 432-433)
- AnTon: an Animatronic Model of a Human Tongue and Vocal Tract. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 (pp 2647-2650)
- Language identification: Insights from the classification of hand annotated phone transcripts. Odyssey 2008 Speaker and Language Recognition Workshop
- AnTon: An animatronic model of a human tongue and vocal tract. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech (pp 2647-2650)
- Animatronic model of a human tongue. ALIFE (pp 775-775)
- Temporal Episodic Memory Model: An Evolution of MINERVA2. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 (pp 2256-2259)
- Towards capturing fine phonetic variation in speech using articulatory features. SPEECH COMMUNICATION, Vol. 49(10-11) (pp 811-826)
- Sound localization through evolutionary learning applied to spiking neural networks. 2007 IEEE Symposium on Foundations of Computational Intelligence, Vols 1 and 2 (pp 350-356)
- Towards a unified theory of Spoken Language Processing. ICCI 2005: FOURTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS - PROCEEDINGS (pp 167-172)
- Speech technology for e-inclusion of people with physical disabilities and disordered speech. 9th European Conference on Speech Communication and Technology (pp 445-448)
- Modelling data entry rates for asr and alternative input methods. 8th International Conference on Spoken Language Processing ICSLP 2004 (pp 2285-2288)
- Spoken language output: Realising the vision. Eurospeech 2003 8th European Conference on Speech Communication and Technology (pp 2909-2912)
- A comparison of the data requirements of automatic speech recognition systems and human listeners. Eurospeech 2003 8th European Conference on Speech Communication and Technology (pp 2581-2584)
- Message from the ISCA president. Eurospeech 2001 Scandinavia 7th European Conference on Speech Communication and Technology (pp i)
- Dictation and voice control. IEE Colloquium Digest, Vol. 499 (pp 7/4)
- Modelling asynchrony in speech using elementary single-signal decomposition. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, Vol. 2 (pp 1247-1250)
- Theory of word frequencies and its application to dialogue move recognition. International Conference on Spoken Language Processing ICSLP Proceedings, Vol. 3 (pp 1880-1883)
- THE APPLICATION OF DYNAMIC PROGRAMMING TECHNIQUES TO NON-WORD BASED TOPIC SPOTTING. 4th European Conference on Speech Communication and Technology Eurospeech 1995 (pp 1355-1358)
- EAGLES SPOKEN LANGUAGE WORKING GROUP: OVERVIEW AND RESULTS. 4th European Conference on Speech Communication and Technology Eurospeech 1995 (pp 841-844)
- WHITHER A THEORY OF SPEECH PATTERN PROCESSING?. 3rd European Conference on Speech Communication and Technology Eurospeech 1993 (pp 43-47)
- MODELLING OF INTONATION CONTOURS AT THE SENTENCE LEVEL USING CHMMS AND THE 1961 O'CONNOR AND ARNOLD SCHEME. 3rd European Conference on Speech Communication and Technology Eurospeech 1993 (pp 785-788)
- SIMULTANEOUS RECOGNITION OF CONCURRENT SPEECH SIGNALS USING HIDDEN MARKOV MODEL DECOMPOSITION. 2nd European Conference on Speech Communication and Technology Eurospeech 1991 (pp 1175-1178)
- IMPROVED SPEECH RECOGNITION USING A REDUCED AUDITORY REPRESENTATION. ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings (pp 75-78)
- Systems for Isolated and Connected Word Recognition (pp 73-143)
- Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition. ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing (pp 5-8), 26 April 1985 - 29 April 1985.
- OVERVIEW OF SPEECH INPUT (pp 25-38)
- TOWARDS AN INTEGRATED DISCRIMINATIVE NETWORK FOR AUTOMATIC SPEECH RECOGNITION.
- AUTOMATIC SPEECH RECOGNITION USING LOCAL TIMESCALE VARIABILITY INFORMATION.
- A Real-Time Parametric General-Purpose Mammalian Vocal Synthesiser. Interspeech 2016 (pp 2636-2640)
- Progress and Prospects for Spoken Language Technology: What Ordinary People Think. Interspeech 2016 (pp 3007-3011)
- Progress and Prospects for Spoken Language Technology: Results from Four Sexennial Surveys. Interspeech 2016 (pp 3012-3016)
- Analysis of phonetic similarity in a silent speech interface based on permanent magnetic articulography. Interspeech 2014 (pp 1018-1022)
- C2h: a computational model of H&h-based phonetic contrast in synthetic speech. Interspeech 2012 (pp 987-990)
- Progress and prospects for speech technology: results from three sexennial surveys. Interspeech 2011 (pp 1533-1536)
- Reactive speech synthesis: actively managing phonetic contrast along an H&H continuum
- Understanding speech understanding. Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181), Vol. 2 (pp 1049-1052)
- A comparison of phoneme decision tree (PDT) and context adaptive phone (CAP) based approaches to vocabulary-independent speech recognition. Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. i (pp I/541-I/544)
- The ARM continuous speech recognition system. International Conference on Acoustics, Speech, and Signal Processing (pp 69-72)
- Hidden Markov model decomposition of speech and noise. International Conference on Acoustics, Speech, and Signal Processing (pp 845-848)
- Noise compensation algorithms for use with hidden Markov model based speech recognition. ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing (pp 481-484)
- The discriminative network: A mechanism for focusing recognition in whole-word pattern matching. ICASSP '83. IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 8 (pp 1041-1044)
- Some techniques for incorporating local timescale variability information into a dynamic time-warping algorithm for automatic speech recognition. ICASSP '83. IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 8 (pp 1037-1040)
- Locally constrained dynamic programming in automatic speech recognition. ICASSP '82. IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 7 (pp 1270-1273)
Presentations
- Get off on the right foot with whom?: How users’ profiles affect their perception and experience with a social robot. London, UK.
- Better curious than smart?: Enhance inclusiveness between mismatched conversational partners: An opinion paper. Hamburg, Germany.
Preprints
- Towards deployment-centric multimodal AI beyond vision and language, arXiv.
- The Influence of Facial Features on the Perceived Trustworthiness of a Social Robot.
- Local Minima Drive Communications in Cooperative Interaction.
- Whither the Priors for (Vocal) Interactivity?
- Investigating Deep Neural Structures and their Interpretability in the Domain of Voice Conversion, arXiv.
- Vocal interactivity in crowds, flocks and swarms: implications for voice user interfaces, PeerJ.
- On the Use/Misuse of the Term 'Phoneme', arXiv.
- A Biomimetic Vocalisation System for MiRo, arXiv.
- Automatic recognition of child speech for robotic applications in noisy environments, arXiv.
- Impact of robot responsiveness and adult involvement on children's social behaviours in human-robot interaction, arXiv.
- Biomedical Engineering Systems and Technologies. Springer International Publishing.
- Grants
-
- EASEL: Expressive Agents for Symbiotic Education and Learning, EU FP7, 11/2013 - 10/2016, £516,297, as PI
- Professional activities and memberships
-
- Chair of Spoken Language Processing in the ‘Speech and Hearing’ research group, Dept. Computer Science, University of Sheffield.
- Editor-in-Chief of ‘Computer Speech & Language’.
- Editorial Board Member for ‘Speech Communication’, ‘Languages’ and the ‘International Journal of Cognitive Informatics and Natural Intelligence’.
- Associate Editor for the ‘Advances in Cognitive Informatics and Natural Intelligence’ (ACINI) Book Series.
- Visiting Professor, Bristol Robotics Laboratory.
- Visiting Professor, Psychology and Language Sciences, University College London.
- Distinguished Lecturer of the International Speech Communication Association, 2014-15.
- Fellow of the International Speech Communication Association since 2008.
- General Chair for INTERSPEECH, Brighton (6th-10th September 2009).
- Chief Scientific Officer of ‘20/20 Speech Ltd.’ (now ‘Aurix Ltd.’) from 1999 to 2004.
- Head of the UK Government’s ‘Speech Research Unit’ (SRU) from 1985 until its privatisation in 1999.
- President of the ‘International Speech Communication Association’ (ISCA) from 1997 to 2001.
- President of the ‘Permanent Council of the International Conferences on Spoken Language Processing’ (PC-ICSLP) from 1996 to 2000.
- Author and co-author of over 150 scientific publications in Speech Technology algorithms, applications and assessment and related areas (h-index = 23).
- Recipient of the 1999 NATO RTO Scientific Achievement Award for “repeated contribution in scientific and technological cooperation”.
- Recipient of the 1994 UK Institute of Acoustics Tyndall Medal for “distinguished work in the field of speech research and technology”.
- Founder Member of the European Speech Communication Association.