Dr Rosanna Milner
Department of Computer Science
Research Associate
Member of the Speech and Hearing research group
rosanna.milner@sheffield.ac.uk
+44 114 222 1800
+44 114 222 1800
Regent Court (DCS)
Full contact details
Dr Rosanna Milner
Department of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
Department of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Research interests
-
- Speaker diarisation
- Speaker linking
- Speech emotion recognition
- Publications
-
Journal articles
- Lightly supervised alignment of subtitles on multi-genre broadcasts. Multimedia Tools and Applications, 77(23), 30533-30550. View this article in WRRO
Conference proceedings papers
- A Cross-Corpus Study on Speech Emotion Recognition. 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 14 December 2019 - 18 December 2019.
- DNN approach to speaker diarisation using speaker channels. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 5 March 2017 - 9 March 2017. View this article in WRRO
- webASR 2 - Improved cloud based speech technology. Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech) (pp 1613-1617) View this article in WRRO
- Segment-oriented evaluation of speaker diarisation performance. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20 March 2016 - 25 March 2016. View this article in WRRO
- The 2015 sheffield system for transcription of Multi-Genre Broadcast media. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. View this article in WRRO
- The 2015 sheffield system for longitudinal diarisation of broadcast media. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13 December 2015 - 17 December 2015. View this article in WRRO
- Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition. Interspeech 2020
- Empirical Interpretation of Speech Emotion Perception with Attention Based Model for Speech Emotion Recognition. Interspeech 2020
- DNN-Based Speaker Clustering for Speaker Diarisation. Interspeech 2016 View this article in WRRO
Theses / Dissertations
- Lightly supervised alignment of subtitles on multi-genre broadcasts. Multimedia Tools and Applications, 77(23), 30533-30550. View this article in WRRO