Dr Judita Preiss
MA (Cambridge), MPHil (Cambridge), PhD (Cambridge)
Information School
Lecturer in Data Science


Full contact details
Information School
Room 235
Regent Court (IS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
I have a MA Cantab in Mathematics, MPhil in Computer Speech and Language Processing from Engineering and a PhD in Natural Language Processing (Computer Science) all from Cambridge. Natural Language Processing was a way to combine my interest in Mathematics and Languages.
After finishing my PhD, I was an RA at Cambridge in the Natural Language Processing group, working on multiple projects. Between 2008-2010, I was a visiting professor at The Ohio State University, before returning to the UK to undertake a number of research projects in the Natural Language Processing group at the University of Sheffield. The constant need for more and more data fuelled an interest in approaches to gathering data and big data techniques, and I took up a post as a lecturer in Data Science at the University of Salford, which I held from 2017 to 2022.
Alongside my interest in data, I have worked on knowledge transfer to industry and applications of my research to real life settings.
- Research interests
-
I have a great number of interests: my current research topics range from work in the biomedical domain (such as automatic discoveries) with the associated applications in health, through mental health which includes work with social media texts as well as other sources of input, the automatic organization of data and presentation of it to users, to approaches involving multiple languages and automatically detectable differences between cultures.
I am very interested in work which involves text or speech, particularly when large quantities of data are involved. My current areas of PhD topics include:
- mining, and deriving, of knowledge and applications
- social media applications
- automatic arranging of knowledge
- multi-lingual models and the differences between these
- Publications
-
Journal articles
- Validation through a comparison of physical examination and DNA test results: OLFML3 case study. Meta Gene, 27, 100819-100819.
- Is automatic detection of hidden knowledge an anomaly?. BMC Bioinformatics, 20(S10).
- Quantifying and filtering knowledge generated by literature based discovery. BMC Bioinformatics, (Suppl 7):249, 59-67. View this article in WRRO
- The Effect of Word Sense Disambiguation Accuracy on Literature Based Discovery. BMC Medical Informatics and Decision Making, 16(Suppl 1).
- Exploring relation types for literature-based discovery. Journal of the American Medical Informatics Association, 22(5), 987-992. View this article in WRRO
- A detailed comparison of WSD systems: an analysis of the system answers for the S
ENSEVAL -2 English all words task. Natural Language Engineering, 12(3), 209-228. - Probabilistic word sense disambiguation. Computer Speech & Language, 18(3), 319-337.
- Introduction to the special issue on word sense disambiguation. Computer Speech & Language, 18(3), 201-207.
- Predicting the impact of online news articles – is information necessary?. Multimedia Tools and Applications.
Conference proceedings papers
- Avoiding background knowledge: literature based discovery from important information. BMC Bioinformatics, Vol. 23(S9), 22 October 2021 - 22 October 2021.
- View this article in WRRO
- Predicting Informativeness Of Semantic Triples. Proceedings of the Conference Recent Advances in Natural Language Processing - Deep Learning for Natural Language Processing Methods and Applications
- Validation through a comparison of physical examination and DNA test results: OLFML3 case study. Meta Gene, 27, 100819-100819.
- Teaching activities
-
- Leading the Big Data module(INF6032)
- Contributing to Introduction to Programming (INF4002)
- Contributing to Practical Programming for Data Science (INF111)
- Professional activities and memberships
-
As well as being Databricks certified Associate Developer for Apache Spark 3.0 - Python, I am an active member of the Databricks University Alliance. Similarly, I have been involved with Amazon Web Services, where I'm certified SysOps Administrator - Associate as well as being an AWS Academy Educator. I am also a member of the rolling review panel for ACL.