Dr Monica Lestari Paramita
BSc (Indonesia), MSc (Sheffield), PhD (Sheffield)
Information School
Lecturer in Data Science


Full contact details
Information School
Room C227
The Wave
2 Whitham Road
Sheffield
S10 2AH
Profile
I obtained my BSc in Computer Science from the University of Indonesia in 2006 and my MSc in Information Management from the University of Sheffield in 2008. Since 2008, I have worked as a researcher in diverse areas of Information Retrieval and Natural Language Processing.
My research roles have included investigating cross-lingual similarity on the Web, developing systems to support information access (e.g., incorporating voice-based input, visualising bias and transparency in search results), and analysing users' behaviour when interacting with such systems.
I obtained my PhD from the University of Sheffield in 2019, where I developed approaches for identifying cross-lingual similarity in Wikipedia articles.
I joined the Information School as a Lecturer in Data Science in September 2021.
University Responsibilities
- Deputy Programme Coordinator for BSc Data Science
- Researcher Development Lead
Research interests
My research focuses on bias and transparency in information retrieval and on multilingual information access. I am especially interested in investigating how bias-aware search engines should be designed to support users in their search tasks. I am also interested in researching cross-lingual similarity in Wikipedia; this includes creating methods to measure cross-lingual similarity, understanding why dissimilar information exists, and examining how this affects different users (e.g., users in different locations or those speaking different languages).
I would be interested in supervising PhD topics in the following areas:
- bias and transparency in search engines
- multilingual information access, such as multilingual search and cross-lingual similarity in Wikipedia
Publications
Featured publications
Journal articles
- Do you see what I see? Images of the COVID-19 pandemic through the lens of Google. Information Processing & Management.
- Report on the CyCAT winter school on fairness, accountability, transparency and ethics (FATE) in AI. ACM SIGIR Forum, 55(1).
Conference proceedings papers
- Europeana: What Users Search for and Why (pp 207-219)
- Using Section Headings to Compute Cross-Lingual Similarity of Wikipedia Articles (pp 633-639)
- A Comparison of Approaches for Measuring Cross-Lingual Similarity of Wikipedia Articles (pp 424-429)
- Do user preferences and evaluation measures line up? Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval - SIGIR '10, 19 July 2010 - 23 July 2010.
All publications
Journal articles
- Do you see what I see? Images of the COVID-19 pandemic through the lens of Google. Information Processing & Management.
- Report on the CyCAT winter school on fairness, accountability, transparency and ethics (FATE) in AI. ACM SIGIR Forum, 55(1).
- Motivations, understandings and experiences of open-access mega-journal authors: Results of a large-scale survey. Journal of the Association for Information Science and Technology, 70(7), 754-768.
- Extracting bilingual terms from the Web. Terminology, 21(2), 205-236.
Chapters
- Named Entity Recommendations to Enhance Multilingual Retrieval in Europeana.eu, Lecture Notes in Computer Science (pp. 102-112). Springer International Publishing
- Collecting Comparable Corpora, Using Comparable Corpora for Under-Resourced Areas of Machine Translation (pp. 55-87). Springer International Publishing
- Cross-Language Comparability and Its Applications for MT, Using Comparable Corpora for Under-Resourced Areas of Machine Translation (pp. 13-53). Springer International Publishing
- Introduction, Using Comparable Corpora for Under-Resourced Areas of Machine Translation (pp. 1-11). Springer International Publishing
- Appendices, Using Comparable Corpora for Under-Resourced Areas of Machine Translation (pp. 291-323). Springer International Publishing
- Product Classification Using Microdata Annotations, Lecture Notes in Computer Science (pp. 716-732). Springer International Publishing
- Building and Using Comparable Corpora. Springer Berlin Heidelberg
- Methods for Collection and Evaluation of Comparable Documents, Building and Using Comparable Corpora (pp. 93-112). Springer Berlin Heidelberg
- Photographic Image Retrieval, ImageCLEF (pp. 141-162). Springer Berlin Heidelberg
Conference proceedings papers
- The SENSEI Overview of Newspaper Readers’ Comments (pp 758-761)
- Europeana: What Users Search for and Why (pp 207-219)
- Using Section Headings to Compute Cross-Lingual Similarity of Wikipedia Articles (pp 633-639)
- The SENSEI Annotated Corpus: Human Summaries of Reader Comment Conversations in On-line News. Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue (pp 42-52), 13 September 2016 - 15 September 2016.
- Automatic label generation for news comment clusters. Proceedings of the 9th International Natural Language Generation conference, 2016.
- A Graph-Based Approach to Topic Clustering for Online Comments to News. Advances in Information Retrieval (pp 15-29), 20 March 2016 - 23 March 2016.
- Assigning Terms to Domains by Document Classification. Proceedings of the 4th International Workshop on Computational Terminology (Computerm), August 2014.
- A Comparison of Approaches for Measuring Cross-Lingual Similarity of Wikipedia Articles (pp 424-429)
- Do user preferences and evaluation measures line up? Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval - SIGIR '10, 19 July 2010 - 23 July 2010.
- Diversity in Photo Retrieval: Overview of the ImageCLEFPhoto Task 2009 (pp 45-59)
- Generic and Spatial Approaches to Image Search Results Diversification (pp 603-610)
- Multiple approaches to analysing query diversity. Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval - SIGIR '09, 19 July 2009 - 23 July 2009.
- Identifying location in Indonesian documents for geographic information retrieval. Proceedings of the 4th ACM workshop on Geographical information retrieval - GIR '07, 9 November 2007.
Datasets
- Do you see what I see? Images of the COVID-19 pandemic through the lens of Google. Information Processing & Management.
Research group
I am part of the Information Retrieval research group.
Teaching activities
I am the module coordinator for INF113 (Data-Driven Organisations) and contribute to the following modules: INF6027 (Introduction to Data Science) and INF6060 (Information Retrieval). I also supervise MSc Data Science students on their dissertations (INF6000).
Professional activities and memberships
- Associate Fellow of the Higher Education Academy
- Committee Member of the British Computer Society's Information Retrieval Specialist Group (BCS IRSG)
- Co-lead of the Shef.AI Interest Group: "Revolutionising data-driven research in the arts, humanities, and social sciences"