Professor Thomas Hain

Professor of Speech and Audio Technology

Telephone: +44 (0) 114 222 1836
Email: t.hain@sheffield.ac.uk

Head of the Speech and Hearing research group
Personal website: staffwww.dcs.shef.ac.uk/people/T.Hain/

Profile

Biography

Thomas Hain obtained the degree 'Dipl.-Ing' in Electrical/Communication Engineering in 1994 from the University of Technology, Vienna. He then joined the Speech Technology Group at Philips Speech Processing, which he left in a senior position. In 1997 he joined the Speech, Vision and Robotics (SVR) Group at the Cambridge University Engineering Department as a Research Associate and PhD student, and he took up a Lectureship in the SVR Group in 2001. In 2004 he joined the Speech and Hearing Group as a Lecturer in Computer Science. He was promoted to Senior Lecturer in 2008 and to Reader in 2011.

Other Professional Activities and Achievements

Research

Thomas' research interests cover many areas in natural language processing, speech, audio and multimedia technology, machine learning, and complex system optimisation and design.

His interests include large vocabulary continuous speech recognition, non-linear methods in speech processing, low bit-rate speech coding, machine learning, multi-modal systems, image classification, microphone arrays, and system and resource optimisation.

Grants

Current grants

  • Acoustic correlates of emotions for automatic recognition, Industrial, 10/2018 to 04/2019, £48,900, as PI
  • Multimedia Analysis for Unsupervised Dubbing In Entertainment (MAUDIE), InnovateUK, 05/2018 to 04/2021, £393,115, as PI
  • VoiceBase Centre, VoiceBase Inc., 04/2018 to 03/2019, £508,000, as PI
  • WFST-based integration of ASR and MT in Spoken Language Translation, Google, 03/2014 to 12/2019, £63,588, as PI

Previous grants

  • Bridge Project, VoiceBase Inc., 09/2017 to 03/2018, £61,200, as PI
  • TUTO II: Reading skills tutoring system, ITSLANGUAGE BV, 08/2017 to 08/2018, £43,478, as PI
  • STATUS IV: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2017 to 10/2017, £60,000, as PI
  • TUTO: Reading skills tutoring system, ITSLANGUAGE BV, 09/2016 to 08/2017, £61,983, as PI
  • STATUS III: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 01/2015 to 07/2016, £78,684, as PI
  • STATUS II: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 11/2013 to 05/2014, £98,982, as PI
  • ItsLanguage, ITSLANGUAGE BV, 11/2012 to 03/2015, £68,333, as PI
  • German System Adaptation, ITSLANGUAGE BV, 11/2012 to 03/2015, £42,373, as PI
  • DocuMeet: Transcription, summarisation and documentation of meetings using advanced speech technologies, indexing and browsing capabilities, European Commission - FP7, 11/2012 to 10/2014, £368,433, as PI
  • STATUS: Speech Technology and Translation Universal Survey, Defence Science and Technology Laboratory, 10/2012 to 08/2013, £73,726, as PI
  • A Joint Model of Spoken Language Translation, Google, 09/2011 to 12/2016, £43,014, as PI
  • Natural Speech Technology, EPSRC, 05/2011 to 07/2016, £1,798,665, as PI
  • Unsupervised Domain Adaptation, CISCO, 11/2010 to 04/2012, £121,745, as PI
  • AMIDA: Augmented Multi-party Interaction with Distance Access, European Commission - FP6, 10/2006 to 12/2009, £467,074, as PI
  • AMIDA: Augmented Multi-party Interaction with Distance Access, European Commission - FP6, 10/2006 to 12/2009, £345,350, as PI