Journal articles

Chapters

  • Cooke M, Barker J & Lecumberri MLG (2013) Crowdsourcing in Speech Perception In Eskenazi M, Levow G-A, Meng H, Parent G & Suendermann D (Ed.), Crowdsourcing for Speech Processing (pp. 137-169). John Wiley and Sons
  • Barker J (2012) Missing Data Techniques: Recognition with Incomplete Spectrograms In Virtanen T, Singh R & Raj B (Ed.), Techniques for Noise Robustness in Automatic Speech Recognition (pp. 371-398). Wiley
  • Barker J (2006) Robust automatic speech recognition In Wang D-L & Brown GJ (Ed.), Computational Auditory Scene Analysis: Principles, Algorithms and Applications (pp. 297-350). Wiley/IEEE Press

Conference proceedings papers

  • Loweimi E, Barker J & Hain T (2016) Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
  • Ma N, Marxer R, Barker J & Brown GJ (2016) Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings (pp 490-495)
  • Abel A, Marxer R, Barker J, Watt R, Whitmer B, Derleth P & Hussain A (2016) A data driven approach to audiovisual speech mapping. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 10023 LNAI (pp 331-342)
  • Lecumberri MLG, Barker J, Marxer R & Cooke M (2016) Language effects in noise-induced word misperceptions. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 640-644)
  • Tóth AM, Cooke M & Barker J (2016) Misperceptions arising from speech-in-babble interactions. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 630-634)
  • Mandel MI & Barker JP (2016) Multichannel spatial clustering for robust far-field automatic speech recognition in mismatched conditions. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 08-12-September-2016 (pp 1991-1995)
  • Loweimi E, Barker J & Hain T (2015) Source-filter separation of speech signal in the phase domain. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 598-602)
  • Foster P, Sigtia S, Krstulovic S, Barker J & Plumbley MD (2015) Chime-home: A dataset for sound source recognition in a domestic environment. 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2015
  • Alghamdi N, Maddock SC, Brown GJ & Barker J (2015) Investigating the Impact of Artificial Enhancement of Lip Visibility on the Intelligibility of Spectrally-Distorted Speech. FAAVSP-2015 (pp 93-98), 11 September 2015 - 13 September 2015.
  • Barker J, Marxer R, Vincent E & Watanabe S (2015) The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015, Scottsdale, AZ, USA, December 13-17, 2015 (pp 504-511)
  • Lin L, Barker J & Brown GJ (2015) The effect of cochlear implant processing on speaker intelligibility: A perceptual study and computer model. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 1566-1570)
  • Marxer R, Cooke M & Barker J (2015) A framework for the evaluation of microscopic intelligibility models. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2015-January (pp 2558-2562)
  • Loweimi E, Doulaty M, Barker J & Hain T (2015) Long-Term statistical feature extraction from speech signal and its application in emotion recognition. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 9449 (pp 173-184)
  • Al Dabel M & Barker J (2014) Speech pre-enhancement using a discriminative microscopic intelligibility model. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp 2068-2072)
  • Ma N & Barker J (2013) A fragment-decoding plus missing-data imputation system evaluated on the 2nd CHiME challenge. Proceedings of the 2nd CHiME Workshop on Machine Listening in Multisource Environments (pp 53-58)
  • Vincent E, Barker J, Watanabe S, Le Roux J, Nesta F & Matassoni M (2013) The second ‘CHiME’ Speech Separation and Recognition Challenge: Datasets, tasks and baselines. Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE
  • Ma N & Barker J (2012) Coupling identification and reconstruction of missing features for noise-robust automatic speech recognition. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, Vol. 3 (pp 2637-2640)
  • González JA, Peinado AM, Gómez AM, Ma N & Barker J (2012) Combining missing-data reconstruction and uncertainty decoding for robust speech recognition. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp 4693-4696)
  • Ma N, Barker J, Christensen H & Green P (2011) Recent advances in fragment-based speech recognition in reverberant multisource environments. Proceedings of ISCA Workshop on Machine Listening in Multisource Environments (pp 68-73)
  • Ma N, Barker J, Christensen H & Green P (2011) Binaural cues for fragment-based speech recognition in reverberant multisource environments. Proceedings of INTERSPEECH 2011 (pp 1657-1660)
  • Ma N, Barker J, Christensen H & Green P (2011) Incorporating localisation cues in a fragment decoding framework for distant binaural speech recognition. IEEE Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA’11) (pp 207-212)
  • Cooke M, Barker J, Garcia Lecumberri ML & Wasilewski K (2011) Crowdsourcing for word recognition in noise. 12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), Vols 1-5 (pp 3056-+)
  • Morales-Cordovilla JA, Ma N, Sánchez V, Carmona JL, Peinado AM & Barker J (2011) A pitch based noise estimation technique for robust speech recognition with missing data. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp 4808-4811)
  • Ma N, Barker J, Christensen H & Green P (2010) Distant microphone speech recognition in a noisy indoor environment: combining soft missing data and speech fragment decoding. ISCA Tutorial and Research Workshop on Statistical And Perceptual Audition
  • Kabir A, Barker J & Giurgiu M (2010) An Approach to Vocal Tract Length Normalization by Robust Formant Estimation. Proceedings of the International Conference on Circuits, Systems and Signals (Recent Advances in Circuits, Systems and Signals) (pp 345-348)
  • Kabir A, Barker J & Giurgiu M (2010) Robust Formant Estimation: Increasing the Reliability by Comparison among three Methods. Proceedings of the International Conference on Circuits, Systems and Signals (Recent Advances in Circuits, Systems and Signals) (pp 341-344)
  • Christensen H & Barker J (2010) Speaker turn tracking with mobile microphones: Combining location and pitch information. European Signal Processing Conference (pp 954-958)
  • Christensen H, Barker J, Ma N & Green P (2010) The CHiME corpus: A resource and a challenge for computational hearing in multisource environments. Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010 (pp 1918-1921)
  • Kabir A, Giurgiu M & Barker J (2010) Robust automatic transcription of English speech corpora. 2010 8th International Conference on Communications, COMM 2010 (pp 79-82)
  • Kabir A, Barker J & Giurgiu M (2010) Integrating Hidden Markov Model and PRAAT: A toolbox for robust automatic speech transcription. Proceedings of SPIE - The International Society for Optical Engineering, Vol. 7745
  • Christensen H & Barker J (2009) Using location cues to track speaker changes from mobile, binaural microphones. INTERSPEECH 2009: 10th Annual Conference of the International Speech Communication Association 2009, Vols 1-5 (pp 124-127)
  • Christensen H, Ma N, Wrigley SN & Barker J (2009) A speech fragment approach to localising multiple speakers in reverberant environments. 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vols 1-8, Proceedings (pp 4593-4596)
  • Arnaud E, Christensen H, Lu Y-C, Barker J, Khalidov V, Hansard ME, Holveck B, Mathieu H, Narasimha R, Taillant E, Forbes F & Horaud R (2008) The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements. ICMI (pp 109-116)
  • Barker J & Shao X (2007) Audio-visual speech fragment decoding. Proceedings of the International Conference on Auditory-Visual Speech Processing (AVSP 2007)
  • Christensen H, Ma N, Wrigley SN & Barker J (2007) Integrating pitch and localisation cues at a speech fragment level. INTERSPEECH 2007: 8th Annual Conference of the International Speech Communication Association, Vols 1-4 (pp 2752-2755)
  • Ma N, Barker J & Green P (2007) Applying word duration constraints by using unrolled HMMs. INTERSPEECH 2007: 8th Annual Conference of the International Speech Communication Association, Vols 1-4 (pp 353-356)
  • Coy A & Barker J (2006) A Multipitch Tracker for Monaural Speech Segmentation. INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, Vols 1-5 (pp 1678-1681)
  • Shao X & Barker J (2006) Audio-Visual Speech Recognition in the Presence of a Competing Speaker. INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, Vols 1-5 (pp 1292-1295)
  • Barker J, Coy A, Ma N & Cooke M (2006) Recent advances in speech fragment decoding techniques. INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, Vols 1-5 (pp 85-88)
  • Brown GJ, Harding S & Barker JP (2006) Speech separation based on the statistics of binaural auditory features. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13 (pp 5807-5810)
  • Brown GJ, Harding S & Barker JP (2006) Speech separation based on the statistics of binaural auditory features. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol V, Proceedings (pp 949-952)
  • Palomäki KJ, Brown GJ & Barker JP (2006) Recognition of reverberant speech using full cepstral features and spectral missing data. 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol I, Proceedings (pp 289-292). Toulouse, France, 14 May 2006 - 19 May 2006.
  • Harding S, Barker J & Brown GJ (2005) Binaural feature selection for missing data speech recognition. 9th European Conference on Speech Communication and Technology (pp 1269-1272)
  • Coy A & Barker J (2005) Soft harmonic masks for recognising speech in the presence of a competing speaker. 9th European Conference on Speech Communication and Technology (pp 2641-2644)
  • Barker J (2005) Tracking facial markers with an adaptive marker collocation model. 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vols 1-5 (pp 665-668)
  • Coy A & Barker J (2005) Recognising speech in the presence of a competing speaker using a 'speech fragment decoder'. 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vols 1-5 (pp 425-428)
  • Harding S, Barker J & Brown GJ (2005) Mask estimation based on sound localisation for missing data speech recognition. 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vols 1-5 (pp 537-540)
  • Brown GJ, Palomäki KJ & Barker J (2004) A Missing Data Approach for Robust Automatic Speech Recognition in the Presence of Reverberation. Proceedings of the 18th International Congress on Acoustics (ICA) (pp 449-452)
  • Barker J, Cooke M & Ellis D (2002) Temporal integration as a consequence of multi-source decoding. Proceedings of the ISCA Workshop on the Temporal Integration in the Perception of Speech (TIPS)
  • Palomäki KJ, Brown GJ & Barker J (2002) Missing data speech recognition in reverberant conditions. 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vols I-IV, Proceedings (pp 65-68)
  • Barker J, Cooke M & Ellis D (2001) Combining bottom-up and top-down constraints for robust ASR: The multisource decoder. Proceedings of Workshop on consistent and reliable acoustic cues for sound analysis (CRAC-01)
  • Morris AC, Barker J & Bourlard H (2001) From Missing Data to Maybe Useful Data: Soft Data Modelling for Noise Robust ASR. Proceedings of the Workshop on Innovation in Speech Processing (WISP 2001)
  • Green P, Barker J, Cooke M & Josifovski L (2001) Handling Missing and Unreliable Information in Speech Recognition. Proceedings of the 8th International Workshop on Artificial Intelligence and Statistics (AISTATS-2001)
  • Barker J, Green P & Cooke M (2001) Linking Auditory Scene Analysis and Robust ASR by Missing Data Techniques. Proceedings of the Workshop on Innovation in Speech Processing (WISP 2001)
  • Barker J, Cooke M & Green PD (2001) Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise. INTERSPEECH (pp 213-217)
  • Brown GJ, Barker J & Wang DL (2001) A neural oscillator sound separator for missing data speech recognition. Proceedings of the International Joint Conference on Neural Networks, Vol. 4 (pp 2907-2912)
  • Barker J, Cooke M & Ellis DPW (2000) Decoding speech in the presence of other sound sources. INTERSPEECH (pp 270-273)
  • Barker J, Josifovski L, Cooke M & Green PD (2000) Soft decisions in missing data techniques for robust automatic speech recognition. INTERSPEECH (pp 373-376)
  • Barker JP & Berthommier F (1999) Evidence of correlation between acoustic and visual features of speech. Proc. ICPhS ’99
  • Barker JP & Berthommier F (1999) Estimation of speech acoustics from visual speech features: A comparison of linear and non-linear models. Proceedings of the ISCA Workshop on Auditory-Visual Speech Processing (AVSP) ’99
  • Barker JP, Berthommier F & Schwartz JL (1998) Is primitive AV coherence an aid to segment the scene? Proceedings of the ISCA Workshop on Auditory-Visual Speech Processing (AVSP) ’98
  • Barker J, Williams G & Renals S (1998) Acoustic confidence measures for segmenting broadcast news. ICSLP
  • Barker J & Cooke M (1997) Modelling the recognition of spectrally reduced speech. EUROSPEECH
  • Rajaravivarma V, Lord E & Barker J (1996) Data compression techniques in image compression for multimedia systems. Southcon Conference Record (pp 624-627)
  • Loweimi E, Barker J & Hain T () Robust Source-Filter Separation of Speech Signal in the Phase Domain. Proceedings of the Annual Conference of the International Speech Communication Association
  • Loweimi E, Barker J & Hain T () Statistical Normalisation of Phase-based Feature Representation For Robust Speech Recognition. Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
  • Barker J & Coy A () Towards Solving the Cocktail Party Problem through Primitive Grouping and Model Combination. Proceedings of Forum Acusticum

Other

  • Christensen H, Barker J, Lu Y-C, Xavier J, Caseiro R & Araújo H (2009) POPeye: Real-time, binaural sound source localisation on an audio-visual robot-head.
  • Christensen H & Barker J (2009) Simultaneous Tracking of Perceiver Movements and Speaker Changes Using Head-Centered, Binaural Data.

Posters

  • Alghamdi N, Maddock S, Brown GJ & Barker J (2015) A comparison of audiovisual and auditory-only training on the perception of spectrally-distorted speech. 18th International Congress of Phonetic Sciences.

Theses / Dissertations

  • Barker J (1998) The relationship between auditory organisation and speech perception: Studies with spectrally reduced speech.