Dr Ziqi Zhang
Information School
Senior Lecturer in Social Media


+44 114 222 2657
Full contact details
Information School
Room 323a
Regent Court (IS)
211 Portobello
Sheffield
S1 4DP
- Profile
-
I am currently an academic member of staff of the Information School. Before joining the Information School in 2018, I was a computer science lecturer at the Computing and Technology Department, Nottingham Trent University (2016-18), and a researcher in the Department of Computer Science, University of Sheffield (pre-2016).
My research addresses methods that enable machines to extract human knowledge from text, to represent such knowledge in a structured representation that is understandable and usable by machines. This covers areas of knowledge graphs, natural language processing, text mining, and social media analytics. My work features close collaboration with industry partners, to apply research output to real-world problem solving. For example, I worked with Archaeological Data Services from the University of York to create ArchSearch, an advanced search engine for archaeology grey literature backed by an archaoelogy knowledge graph. I worked with The Klood Ltd to create the first football rumour extraction engine based on Twitter, FootballWhispers.com. And I worked with Vamstar Ltd. to develop a knowledge graph of the healthcare sector supply chain, in order to improve public healthcare procurement. These projects are funded by a range of research councils and private bodies, such EPSRC, AHRC, and InnovateUK.
I am always looking for PhD students in the areas of knowledge graphs, text mining and social media analytics. And I am interested in collaboration with external partners in any capacity.
University responsibilities
- Exams Officer
- Research interests
-
My research addresses methods that enable machines to extract human knowledge from text, to represent such knowledge in a structured representation that is understandable and usable by machines. This ultimately enhances our capability of processing and sense-making of very large scale data, improving decision making. Specifically, this include but is not limited to:
- Knowledge graph research: the automatic creation, augmentation, and mapping of structured ‘databases’ that support machine understandability and reasoning. Knowledge graphs are widely used today by search engines and industry applications. For example, Google uses knowledge graphs to improve its search results; pharmaceutical companies use knowledge graphs to discover unknown chemical compouds that can be used for drug development. Our project ‘Archaeotools’ was an example of developing knowledge graphs for the archaeology domain. This led to the ArchSearch service that currently powers one of the largest archaeology databse in the UK.
- Information Extraction: developing computational methods to automatically transform unstructured, natural language text into structured representation that could support machine understandability and reasoning. This could include the extraction of terms, concepts, named entities, and relations between them from texts. I am particularly interested in the development and adaptation of IE methods in domain specific contexts, such as bibliometrics research, cultural heritage, and the legal domain.
- Social media analysis: the application and adaptation of Information Extraction methods onto social media text analytics, to enable event discovery and monitoring. For example, I worked with Rotherham United Football Club to develop content moderation methods to tackle online hate. I worked with Diabetes.co.uk to develop computational methods for analysing user generated content in its forums. And I worked with a team of researchers to develop the first football rumour extraction engine using Twitter, FootballWhispers.com.
- Semantic Web and Linked Data: the broader range of topics related to the vision of tomorrow’s Web where machine understandable data are put on the Web, shared and reused across application, enterprise, and community boundaries.
- Publications
-
Journal articles
- An exploratory study on utilising the web of linked data for product data mining. SN Computer Science, 4(1).
- Understanding the use of heterogenous data in tackling urban flooding: an integrative literature review. Water, 14(14).
- Utilizing subjectivity level to mitigate identity term bias in toxic comments classification. Online Social Networks and Media, 29, 100205-100205.
- Towards automated analysis of research methods in library and information science. Quantitative Science Studies.
- Towards understanding a football club’s social media network: an exploratory case study of Manchester United. Information Discovery and Delivery, 49(1), 71-83.
- “Less is more”. Online Information Review, 44(1), 213-237.
- Smart information retrieval: Domain knowledge centric optimization approach. IEEE Access, 7, 4167-4183. View this article in WRRO
- A comparison of information sharing behaviours across 379 health conditions on Twitter. International Journal of Public Health, 1-10. View this article in WRRO
- View this article in WRRO
- SemRe-Rank: Improving Automatic Term Extraction by Incorporating Semantic Relatedness with Personalised PageRank. ACM Transactions on Knowledge Discovery from Data, 12(5). View this article in WRRO
- Effective and efficient Semantic Table Interpretation using TableMiner+. Semantic Web, 8(6), 921-957. View this article in WRRO
- An unsupervised data-driven method to discover equivalent relations in large Linked Datasets. Semantic Web, 8(2), 197-223. View this article in WRRO
- Early Steps Towards Web Scale Information Extraction with LODIE. AI Magazine, 36(1), 55-64. View this article in WRRO
- "Linked data as background knowledge for information extraction on the web" by Ziqi Zhang, Anna Lisa Gentile and Isabelle Augenstein with Martin Vesely as coordinator. ACM SIGWEB Newsletter(Summer), 1-9.
- Recent advances in methods of lexical semantic relatedness - a survey. Natural Language Engineering, FirstView, 1-69.
- The Archaeotools project: faceted classification and natural language processing in an archaeological context.. Philos Trans A Math Phys Eng Sci, 367(1897), 2507-2519.
- Social Support in a Diabetes Online Community: A Mixed Methods Content Analysis (Preprint). JMIR Diabetes.
Chapters
- Product Classification Using Microdata Annotations, Lecture Notes in Computer Science (pp. 716-732). Springer International Publishing
- Towards Efficient and Effective Semantic Table Interpretation, The Semantic Web – ISWC 2014 (pp. 487-502). Springer International Publishing
- Combining Diverse Knowledge Based Features for Semantic Relatedness Measures In Brena RF & Guzman-Arenas A (Ed.), Quantitative Semantics and Soft Computing Methods for the Web (pp. 96-117). IGI Global
Conference proceedings papers
- View this article in WRRO
- A Comparative Study of Using Pre-trained Language Models for Toxic Comment Classification. Companion Proceedings of the Web Conference 2021 View this article in WRRO
- Adapted TextRank for Term Extraction: A generic method of improving automatic term extraction algorithms. Procedia Computer Science, Vol. 137 (pp 102-108), 10 September 2018 - 13 September 2018. View this article in WRRO
- Hate speech detection on Twitter : feature engineering v.s. feature selection. The Semantic Web: ESWC 2018 Satellite Events (pp 46-49). Crete, Greece, 3 June 2018 - 7 June 2018. View this article in WRRO
- Detecting Hate Speech on Twitter Using a Convolution-GRU Based Deep Neural Network. Lecture Notes in Computer Science, Vol. 10843 (pp 745-760), 3 June 2018 - 7 June 2018. View this article in WRRO
- Entity deduplication on ScholarlyData. The Semantic Web, Vol. Part 1 (pp 85-100), 28 May 2017 - 1 June 2017. View this article in WRRO
- View this article in WRRO
- The Semantic Web – ISWC 2014
- Learning with Partial Data for Semantic Table Interpretation (pp 607-618)
- Mapping Keywords to Linked Data Resources for Automatic Query Expansion (pp 101-112)
- View this article in WRRO
- An integrated environment for semantic knowledge work. Proceedings of the 20th ACM international conference on Information and knowledge management - CIKM '11, 24 October 2011 - 28 October 2011. View this article in WRRO
- Ontology augmentation: Combining semantic web and text resources. KCAP 2011 - Proceedings of the 2011 Knowledge Capture Conference (pp 9-16)
- Issues in learning an ontology from text. BMC Bioinformatics, Vol. 10(SUPPL. 5) View this article in WRRO
Preprints
- Social Support in a Diabetes Online Community: A Mixed Methods Content Analysis (Preprint), JMIR Publications Inc..
- An exploratory study on utilising the web of linked data for product data mining. SN Computer Science, 4(1).
- Research group
-
Current PhD students
- Amnah Alluqman: computational assistant for online-shopping for visually impaired people
- Daisy Da Moura Semedo: text mining in online health forums
- Omaima Fallatah: knowledge graph matching
- Jessica Fairbairn: computational methods for predicting hate speech propagation
- Jenny Hayes: social activisim on the social media
- Terence Egbelo: knowledge graph completion for drug discovery
- Zhixue Zhao: computational methods for hate speech detection
Past PhD examinations
- Aug 2021: Ruizhe Li , Department of Computer Science, University of Sheffield
- Aug 2020: Jun Zhang, Informatin School, University of Sheffield
- Teaching activities
-
I lead two modules:
- INF Introduction to Data Science covers key concepts and theories related to data science
- INF6024 Researching Social Media covers topics on research methods, data collection and analysis methods and research ethics in the context of social media research
- Professional activities and memberships
-
- External examiner for the Dublin City University
- Member of the Alan Turing Institute’s Knowledge Graph network
- Conference track co-chairs/senior committee members for the Extended Semantic Web Conference, the European Artificial Intelligence Conference
- Guest editor for the Semantic Web Journal, Frontiers in Clinical Diabetes and Healthcare
- Regular reviewer for conferences such as International Conference on Knowledge Engineering and Knowledge Management (EKAW), Internatonal Semantic Web Conference (ISWC), Extended Semantic Web Conference (ESWC), The Web Conference (WWW), Conference on Information and Knowledge Management (CIKM)
- Regular reviewer for journals such as IEEE Transactions on Knowledge and Data Engineering, ACM Transactions on Knowledge Discovery from Data, IOS The Semantic Web Journal, Elsevier Information Processing and Management