Books

  • Young SJ, Evermann G, Gales MJF, Hain T, Kershaw D, Moore GL, Odell JJ, Ollason D, Povey D, Valtchev V & Woodland PC (2004) The HTK Book. Cambridge, England: Cambridge University Engineering Department.

Journal articles

Chapters

  • Hain T & Garner PN (2012) Speech Recognition In Carletta J, Renals S & Bourlard H (Ed.), Multimodal Signal Processing: Human Interactions in Meetings (pp. 56-83). Cambridge: Cambridge University Press.
  • Moore D, Dines J, Doss MM, Vepa J, Cheng O & Hain T (2006) Juicer: A weighted finite-state transducer speech decoder (pp. 285-296).
  • Carletta J, Ashby S, Bourban S, Guillemot M, Kronenthal M, Lathoud G, Lincoln M, McCowan I, Hain T, Kraaij W, Post W, Kadlec J, Wellner P, Flynn M & Reidsma D (2005) The AMI Meeting Corpus: A Pre-announcement, Machine Learning for Multimodal Interaction, Lecture Notes in Computer Science (pp. 28-39). Edinburgh: Springer.
  • Moore RK (2003) Speech recognition In Frawley W & Bright W (Ed.), International encyclopedia of linguistics

Conference proceedings papers

  • Milner R & Hain T (2017) DNN approach to speaker diarisation using speaker channels. Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing (pp 4925-4929) View this article in WRRO
  • Errattahi R, Hannani AE, Ouahmane H & Hain T (2017) Automatic speech recognition errors detection using supervised learning techniques. Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA View this article in WRRO
  • Wu C, Ng RWM, Torralba OS & Hain T (2017) Analysing Acoustic Model Changes for Active Learning in Automatic Speech Recognition. International Conference on Systems, Signals and Image Processing (IWSSIP) View this article in WRRO
  • Loweimi E, Barker J & Hain T (2017) Statistical normalisation of phase-based feature representation for robust speech recognition. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp 5310-5314) View this article in WRRO
  • Saz O, Doulaty M, Deena S, Milner R, Ng RWM, Hasan M, Liu Y & Hain T (2016) The 2015 Sheffield system for transcription of Multi-Genre Broadcast media. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings (pp 624-631) View this article in WRRO
  • Olcoz J, Saz O & Hain T (2016) Error correction in lightly supervised alignment of broadcast subtitles. Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech) View this article in WRRO
  • Hain T, Christian J, Saz O, Deena S, Hasan M, Ng RWM, Milner R, Doulaty M & Liu Y (2016) webASR 2 - Improved cloud based speech technology. Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech) (pp 1613-1617) View this article in WRRO
  • casanueva I, Hain T, Nicolao M & Green P (2016) Using phone features to improve dialogue state tracking generalisation to unseen states. Proceeding of SIGDIAL 2016 View this article in WRRO
  • Ng R, Hain T & Chettri B (2016) Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting. Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting (pp 2939-2943), 9 September 2016 - 12 September 2016. View this article in WRRO
  • Loweimi E, Barker J & Hain T (2016) Use of generalised nonlinearity in Vector Taylor Series noise compensation for robust speech recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 3798-3802) View this article in WRRO
  • Ng W, Nicolao M, Saz O, Hasan M, Chettri B, Doulaty M, Lee T & Hain T (2016) The Sheffield language recognition system in NIST LRE 2015. Proceedings of The Speaker and Language Recognition Workshop Odyssey 2016 View this article in WRRO
  • Nicolao M, Christensen H, Cunningham S, Green P & Hain T (2016) A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus. Proceedings of LREC 2016, 24 May 2016 - 27 May 2016. View this article in WRRO
  • Ng RWM, Shah K, Specia L & Hain T (2016) Groupwise learning for ASR k-best list reranking in spoken language translation. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2016-May (pp 6120-6124) View this article in WRRO
  • Milner R & Hain T (2016) Segment-oriented evaluation of speaker diarisation performance. Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Shanghai, China, 20 March 2016 - 25 March 2016. View this article in WRRO
  • Doulaty M, Saz O, Ng RWM & Hain T (2016) Automatic Genre and Show Identification of Broadcast Media. Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech) View this article in WRRO
  • Al-Shareef S & Hain T (2016) Colloquialising modern standard Arabic text for improved speech recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 1345-1349) View this article in WRRO
  • Deena S, Hasan M, Doulaty M, Saz O & Hain T (2016) Combining feature and model-based adaptation of RNNLMs for multi-genre broadcast speech recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 2343-2347) View this article in WRRO
  • Milner R & Hain T (2016) DNN-based speaker clustering for speaker diarisation. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 2185-2189) View this article in WRRO
  • Casanueva I, Hain T & Green P (2016) Improving generalisation to new speakers in spoken dialogue state tracking. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 2726-2730) View this article in WRRO
  • Liu Y, Fox C, Hasan M & Hain T (2016) The Sheffield Wargame Corpus - Day two and day three. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 3833-3837) View this article in WRRO
  • Milner R, Saz O, Deena S, Doulaty M, Ng R & Hain T (2015) The 2015 Sheffield System for Longitudinal Diarisation of Broadcast Media. Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) View this article in WRRO
  • Loweimi E, Barker J & Hain T (2015) Source-filter separation of speech signal in the phase domain. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 598-602) View this article in WRRO
  • Doulaty M, Saz O, Ng RWM & Hain T (2015) Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation. Proc. of ASRU View this article in WRRO
  • Bell P, Gales M, Hain T, Kilgour J, Lanchantin P, Liu A, McParland A, Renals S, Saz O, Wester M & Woodland P (2015) The MGB Challenge: Evaluating Multi-genre Broadcast Media Recognition. Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) View this article in WRRO
  • I Casanueva PG (2015) Knowledge Transfer Between Speakers for Personalised Dialogue Management. Proceedings of 16th Annual SIGdial meeting on discourse and dialogue
  • Liu Y, Karanasou P & Hain T (2015) An Investigation into Speaker Informed DNN Front-end for LVCSR. 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015. Brisbane, Australia, 19 April 2015 - 24 April 2015. View this article in WRRO
  • Nicolao M, Beeston AV & Hain T (2015) Automatic assessment of English learner pronunciation using discriminative classifiers. 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp 5351-5355). Brisbane, Australia, 19 April 2015 - 24 April 2015. View this article in WRRO
  • AlHarbi G, Ng RWM & Hain T (2015) Annotating meta-discourse in academic lectures from different disciplines.. SLaTE (pp 161-166)
  • Saz O, Doulaty M, Deena S, Milner R, Ng RWM, Hasan M, Liu Y & Hain T (2015) The 2015 sheffield system for transcription of Multi-Genre Broadcast media.. ASRU (pp 624-631)
  • Hasan M, Doddipatla R & Hain T (2015) Noise-matched training of CRF based sentence end detection models. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 349-353)
  • Ng RWM, Shah K, Specia L & Hain T (2015) A study on the stability and effectiveness of features in quality estimation for spoken language translation. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 2257-2261)
  • Loweimi E, Doulaty M, Barker J & Hain T (2015) Long-Term statistical feature extraction from speech signal and its application in emotion recognition. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 9449 (pp 173-184)
  • Ng RWM, Shah K, Aziz W, Specia L & Hain T (2015) Quality estimation for asr k-best list rescoring in spoken language translation. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2015-August (pp 5226-5230)
  • Zhang P, Liu Y & Hain T (2014) Semi-Supervised DNN Training in Meeting Recognition. 2014 IEEE Spoken Language Technology Workshop (SLT 2014). South Lake Tahoe, California and Nevada, USA, 7 December 2014 - 10 December 2014. View this article in WRRO
  • Liu Y, Zhang P & Hain T (2014) Using neural network front-ends on far field multiple microphones based speech recognition. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) View this article in WRRO
  • Zhang P, Liu Y & Hain T (2014) Semi-supervised DNN training in meeting recognition. 2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings (pp 141-146)
  • Ng RWM, Doulaty M, Doddipatla R, Aziz W, Shah K, Saz O, Hasan M, AlHarbi G, Specia L & Hain T (2014) The USFD SLT System for IWSLT 2014. IWSLT View this article in WRRO
  • Saz O & Hain T (2014) Using contextual information in Joint Factor Eigenspace MLLR for speech recognition in diverse scenarios. Proceedings of the 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) View this article in WRRO
  • Doddipatla R, Hasan M & Hain T (2014) Speaker Dependent Bottleneck Layer Training forSpeaker Adaptation in Automatic Speech Recognition. Accepted to Interspeech 2014
  • Hasan M, Doddipatla R & Hain T (2014) Multi-pass sentence-end detection of lecture speech. Accepted to Interspeech 2014
  • Fox C & Hain T (2014) Extending Limabeam with discrimination and coarse gradients. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2440-2444)
  • Saz O, Doulaty M & Hain T (2014) Background-Tracking Acoustic Features for Genre Identification of Broadcast Shows. Proceedings of the 2014 Spoken Language Technology (SLT) Workshop (pp 118–123-118–123)
  • Christensen H, Casanueva I, Cunningham S, Green P & Hain T (2014) Automatic Selection of Speakers for Improved Acoustic Modelling : Recognition of Disordered Speech with Sparse Data. Spoken Language Technology Workshop, SLT’14
  • Casanueva I, Christensen H, Hain T & Green P (2014) Adaptive speech recognition and dialogue management for users with speech disorders. Proceedings of Interspeech’14
  • Saz O & Hain T (2013) Asynchronous factorisation of speaker and background with feature transforms in speech recognition. Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech) (pp 1238-1242), 25 August 2013 - 29 August 2013. View this article in WRRO
  • Ng R, Cohn T & Hain T (2013) Adaptation of lecture speech recognition system with machine translation output. Proceedings of the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Vancouver, Canada
  • Saz O & Hain T (2013) Asynchronous Factorisation of Speaker and Background with Feature Transforms in Speech Recognition. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1237-1241)
  • Christensen H, Aniol MB, Bell P, Green P, Hain T, King S & Swietojanski P (2013) Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 3609-3612)
  • Christensen H, Green P & Hain T (2013) Learning speaker-specific pronunciations of disordered speech. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1158-1162)
  • Fox C, Liu Y, Zwyssig E & Hain T (2013) The Sheffield Wargames Corpus. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 (pp 1115-1119)
  • Christensen H, Green P & Hain T (2013) Learning speaker-specific pronunciations of disordered speech. Interspeech’13
  • Christensen H, Aniol MB, Bell P, Green P, Hain T, King S & Swietojanski P (2013) Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech. Interspeech’13
  • Lanchantin P, Bell PJ, Gales MJF, Hain T, Liu X, Long Y, Quinnell J, Renals S, Saz O, Seigel MS, Swietojanski P & Woodland PC (2013) Automatic Transcription of Multi-Genre Media Archives. Proceedings of the First Workshop on Speech, Language and Audio in Multimedia (pp 26–31-26–31) View this article in WRRO
  • Fox C, Liu Y, Zwyssig E & Hain T (2013) The Sheffield Wargames Corpus.. 14th Annual Conference of the International Speech Communication Association (Interspeech 2013). Lyon, France, 25 August 2013 - 29 August 2013. View this article in WRRO
  • Christensen H, Cunningham S, Fox C, Green P & Hain T (2012) A comparative study of adaptive, automatic recognition of disordered speech. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 2 (pp 1774-1777)
  • Tucker R, Fry D, Wan V, Wrigley S & Hain T (2011) Extending Audio Notetaker to Browse WebASR Transcriptions. Interspeech’11
  • Marino D & Hain T (2011) An Analysis of Automatic Speech Recognition with Multiple Microphones. Interspeech’11. Florence
  • Al-Shareef S & Hain T (2011) An Investigation in Speech Recognition for Colloquial Arabic. Interspeech’11
  • Wrigley SN & Hain T (2011) Web-based automatic speech recognition service - webASR. Interspeech’11
  • Wrigley SN & Hain T (2011) Making an automatic speech recognition service freely available on the web. Interspeech’11
  • Kempton T, Moore RK & Hain T (2011) Cross-language phone recognition when the target language phoneme inventory is not known. Interspeech’11. Florence
  • Hain T & Renals S (2010) Meeting Recognition. Tutorial interspeech 2010
  • Hain T, Burget L, Dines J, Garner PN, El Hannani A, Huijbregts M, Karafiat M, Lincoln M & Wan V (2010) The AMIDA 2009 Meeting Transcription System. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4 (pp 358-361)
  • Hain T, Burget L, Dines J, Garner PN, el Hannani A, Huijbregts M, Karafiat M, Lincoln M & Wan V (2010) The AMIDA 2009 Meeting Transcription System. Interspeech’10 (pp 358-361)
  • Garner PN, Dines J, Hain T, El Hannani A, Karafiar M, Korchagin D, Lincoln M, Wan V, Zhang L & ASSOC I-ISC (2009) Real-Time ASR from Meetings. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 (pp 2067-+)
  • Garner PN, Dines J, Hain T, El Hannani A, Karafiát M, Korchagin D, Lincoln M, Wan V & Zhang L (2009) Real-time ASR from meetings. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2119-2122)
  • Renals S, Hain T, Bourlard H & IEEE (2008) Interpretation of multiparty meetings the AMI and AMIDA projects. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (pp 116-+)
  • Renals S, Hain T & Bourlard H (2008) Interpretation of multiparty meetings the AMI and AMIDA projects. 2008 Hands-free Speech Communication and Microphone Arrays, Proceedings, HSCMA 2008 (pp 115-118)
  • Hain T, El Hannani A, Wrigley SN & Wan V (2008) Automatic speech recognition for scientific purposes - webASR. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 (pp 504-507)
  • Wan V, Dines J, El Hannani A & Hain T (2008) BOB: A LEXICON AND PRONUNCIATION DICTIONARY GENERATOR. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS (pp 217-220)
  • Hain T, Burget L, Dines J, Garau G, Karafiat M, van Leeuwen D, Lincoln M & Wan V (2008) The 2007 AMI(DA) system for meeting transcription. MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, Vol. 4625 (pp 414-428)
  • Karafiat M, Burget L, Hain T & Cernocky J (2007) Application of CMLLR in narrow band wide band adapted systems. Interspeech’07 (pp 282-285). Antwerp
  • Gibson M & Hain T (2007) Temporal Masking for Unsupervised Minimum Bayes Risk Speaker Adaptation. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 (pp 1577-1580)
  • Hain T, Burget L, Dines J, Garau G, Karafiat M, Lincoln M, Vepa J & Wan V (2007) The AMI system for the transcription of speech in meetings. 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3 (pp 357-360)
  • Wan V & Hain T (2006) Strategies for language model web-data collection. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13 (pp 1069-1072)
  • Wan V & Hain T (2006) Strategies for language model web-data collection. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol I, Proceedings (pp 1069-1072). Toulouse, FRANCE, 14 May 2006 - 19 May 2006.
  • Dines J, Vepa J & Hain T (2006) The segmentation of multi-channel meeting recordings for automatic speech recognition. INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP, Vol. 3 (pp 1213-1216)
  • Al-Hames M, Hain T, Cernocky J, Schreiber S, Poel M, Muller R, Marcel S, van Leeuwen D, Odobez JM, Ba S, Bourlard H, Cardinaux F, Gatica-Perez D, Janin A, Motlicek P, Reiter S, Renals S, van Rest J, Rienks R, Rigoll G, Smith K, Thean A & Zemcik P (2006) Audio-visual processing in meetings: Seven questions and current AMI answers. Machine Learning for Multimodal Interaction, Vol. 4299 (pp 24-35)
  • Gibson M & Hain T (2006) Hypothesis Spaces For Minimum Bayes Risk Training In Large Vocabulary Speech Recognition. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 2406-2409)
  • Uraga E & Hain T (2006) Automatic Speech Recognition Experiments with Articulatory Data. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 (pp 353-356)
  • Hain T, Burget L, Dines J, Garau G, Karafiat M, Lincoln M, Vepal J & Wan V (2006) The AMI meeting transcription system: Progress and performance. Machine Learning for Multimodal Interaction, Vol. 4299 (pp 419-431)
  • McCowan I, Carletta J, Kraaij W, Ashby S, Bourban S, Flynn M, Guillemot M, Hain T, Kadlec J, Karaiskos V, Kronenthal M, Lathoud G, Lincoln M, Lisowska A, Post W, Reidsma D & Wellner P (2005) The AMI Meeting Corpus. 5th International Conference on Methods and Techniques in Behavioral Research
  • Hain T, Dines J, Garau G, Karafiat M, Moore D, Wan V, Ordelman R & Renals S (2005) Transcription of conference room meetings: An investigation. 9th European Conference on Speech Communication and Technology (pp 1661-1664)
  • Garau G, Renals S & Hain T (2005) Applying vocal tract length normalization to meeting recordings. 9th European Conference on Speech Communication and Technology (pp 265-268)
  • Hain T, Burget L, Dines J, Garau G, Karafiat M, Lincoln M, McCowan I, Moore D, Wan V, Ordelman R & Renals S (2005) The 2005 AMI system for the transcription of speech in meetings. MACHINE LEARNING FOR MULTIMODAL INTERACTION, Vol. 3869 (pp 450-462)
  • Hain T, Burget L, Dines J, McCowan I, Garau G, Karafiat M, Lincoln M, Moore D, Wan V, Ordelman R & Renals S (2005) The development of the AMI system for the transcription of speech in meetings. MACHINE LEARNING FOR MULTIMODAL INTERACTION, Vol. 3869 (pp 344-356)
  • Kim DY, Gales MJF, Chan HY, Woodland PC, Umesh S & Hain T (2004) Progress in Broadcast News English Transcription. EARS STT Technical Meeting 2004. Montreal, Canada
  • Woodland PC, Chan HY, Evermann G, Gales MJF, Hain T, Jia B, Kim DY, Liu X, Mrva D, Sim KC, Tranter SE & Wang L (2004) Cambridge STT Overview. EARS Mid-year Meeting 2004
  • Kim DY, Umesh S, Gales MJF, Hain T & Woodland PC (2004) Using VTLN for Broadcast News Transcription. ICSLP’04. Cambridge University, UK
  • Evermann G, Chan HY, Gales MJF, Hain T, Liu X, Mrva D, Wang L & Woodland PC (2004) Development of the 2003 CU-HTK conversational telephone speech transcription system. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 1
  • Evermann G, Chan HY, Gales MJF, Hain T, Liu X, Mrva D, Wang L & Woodland P (2004) Development of the 2003 CU-HTK Conversational Telephone Speech transcription system. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS (pp 249-252)
  • Hain T (2003) Single Pronunciation Dictionaries - Construction and Performance. EARS STT Technical Meeting 2004
  • Woodland PC, Chan HY, Evermann G, Gales MJF, Hain T, Kim DY, Liu X, Mrva D, Povey D, Tranter SE, Wang L & Yu K (2003) 2003 CU-HTK English CTS Systems. Rich Transcription Workshop 2003s. Boston, Ma
  • Kim DY, Evermann G, Hain T, Mrva D, Tranter SE, Wang L & Woodland PC (2003) 2003 CU-HTK Broadcast News English System Development. Rich Transcription Workshop 2003s
  • Jia B, Sim KC, Gales MJF, Hain T, Liu X, Woodland PC & Yu K (2003) CU-HTK RT-03 Mandarin CTS System. Rich Transcription Workshop 2003
  • Woodland PC, Evermann G, Gales MJF, Hain T, Chan HY, Jia B, Kim DY, Liu X, Mrva D, Povey D, Sim KC, Tomalin M, Tranter SE, Wang L & Yu K (2003) Recent Experiments with HTK Broadcast News and Conversational Telephone Systems. EARS Mid-year meeting 2003
  • Kim DY, Evermann G, Hain T, Mrva D, Tranter SE, Wang L & Woodland P (2003) Recent advances in broadcast news transcription. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03 (pp 105-110)
  • Woodland PC, Evermann G, Gales MJF, Hain T, Liu X, Moore GL, Povey D & Wang L (2002) CU-HTK APRIL 2002 SWITCHBOARD SYSTEM. Rich Transcription Workshop 2002. Vienna, VA
  • Hain T (2002) Implicit Pronunciation Modelling in ASR. ITRW PMLA 2002. Estes Park, Colorado
  • Hain T, Woodland PC, Evermann G & Povey D (2001) New features in the CU-HTK system for transcription of conversational telephone speech. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS (pp 57-60)
  • Hain T, Woodland PC, Evermann G & Povey D (2000) The CU-HTK March 2000 HUB5E Transcription System. Speech Transcription Workshop 2000. College Park, Maryland
  • Hain T & Woodland PC (2000) Modelling sub-phone insertions and deletions in continuous speech recognition. ICSLP 2000
  • Hain T & Woodland PC (1999) Dynamic HMM selection for continuous speech recognition. Eurospeech’99 (pp 1327-1330). Budapest
  • Woodland PC, Hain T, Moore GL, Niesler TR, Povey D, Tuerk A & Whittaker EWD (1999) The 1998 HTK Broadcast News Transcription System: Development and Results. 1999 DARPA Broadcast News Transcription and Understanding Workshop. Herndon, VA
  • Woodland PC, Odell JJ, Hain T, Moore GL, Niesler TR, Tuerk A & Whittaker EWD (1999) Improvements in Accuracy and Speed in the HTK Broadcast News Transcription System. Eurospeech’99
  • Odell JJ, Woodland PC & Hain T (1999) The CUHTK-Entropic 10xRT Broadcast News Transcription System. 1999 DARPA Broadcast News Transcription and Understanding Workshop (pp 271-275). Herndon, VA
  • Hain T & Woodland PC (1999) RECENT EXPERIMENTS WITH THE CU-HTK HUB5 SYSTEM. Hub5 Workshop’99
  • Hain T & Woodland PC (1999) Hidden model sequences. Hub5 Workshop’99
  • Hain T, Woodland PC, Niesler TR & Whittaker EWD (1999) The 1998 HTK system for transcription of conversational telephone speech. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI (pp 57-60)
  • Woodland PC, Hain T, Johnson SE, Niesler TR, Tuerk A, Whittaker EWD & Young SJ (1998) The 1997 HTK Broadcast News Transcription System. 1998 DARPA Broadcast News Transcription and Understanding Workshop (pp 41-48)
  • Hain T & Woodland PC (1998) SEGMENTATION AND CLASSIFICATION OF BROADCAST NEWS AUDIO. ICSLP’98
  • Hain T, Johnson SE, Tuerk A, Woodland PC & Young SJ (1998) Segment Generation and Clustering in the HTK Broadcast News Transcription System. 1998 DARPA Broadcast News Transcription and Understanding Workshop (pp 133-137)
  • Hain T & Woodland PC (1998) CU-HTK Acoustic modeling experiments. Hub5 Workshop 98
  • Woodland PC, Hain T, Johnson SE, Niesler TR, Tuerk A & Young SJ (1998) Experiments in broadcast news transcription. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 (pp 909-912)
  • Huertgen B & Hain T (1994) On the convergence of fractal transforms. ICASSP’94 (pp 561-564)
  • Doulaty Bashkand M, Saz O & Hain T () Unsupervised Domain Discovery Using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dresden, Germany, 6 September 2015 - 10 September 2015. View this article in WRRO
  • Doulaty Bashkand M, Saz O & Hain T () Data-Selective Transfer Learning for Multi-Domain Speech Recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dresden, Germany, 6 September 2015 - 10 September 2015. View this article in WRRO
  • Deena S, Ng RWM, Madhyashtha P, Specia L & Hain T () Exploring the use of Acoustic Embeddings in Neural Machine Translation. Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop View this article in WRRO
  • Loweimi E, Barker & Hain () Robust Source-Filter Separation of Speech Signal in the Phase Domain. Proceedings of the Annual Conference of the International Speech Communication Association
  • Ng W, Kwan A, Lee T & Hain T () ShefCE: A Cantonese-English Bilingual Speech Corpus for Pronunciation Assessment. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings View this article in WRRO

Reports

  • el Hannani A & Hain T (2011) Data Dependence of Speech Decoder Parameters
  • Gibson M & Hain T (2011) Confidence-informed unsupervised Minimum Bayes Risk acoustic model adaptation
  • Hain T, Dines J & McCowan I (2006) Conversational multi-party speech recognition using remote microphones
  • Hain T, Woodland PC, Evermann G, Liu X, Moore GL, Povey D & Wang L (2003) Automatic Transcription of Conversational Telephone Speech. Development of the CU-HTK 2002 System

Other

Theses / Dissertations

  • Hain T (2001) Hidden Model Sequence Models for Automatic Speech Recognition.
  • Hain T (1993) On the Use of Iterated Function Systems for Coding of Grayscale Images.

Datasets