Professor Rob Gaizauskas

BA, MA, DPhil

Department of Computer Science

Personal Chair

Deputy Director of Research (operations)

Co-Director of CDT in Speech and Language Technologies

Head of the Natural Language Processing (NLP) research group

r.gaizauskas@sheffield.ac.uk
+44 114 222 1827

Full contact details

Professor Rob Gaizauskas
Department of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP

Profile

Rob Gaizauskas studied Mathematics and Physics at the University of Toronto from 1972 to 1974, then moved to Carleton University in Ottawa, where he received an Honours BA in Philosophy in 1975 and an MA in Philosophy (with distinction) in 1978. Following two years teaching Logic as a temporary lecturer at Carleton, he obtained a Diploma in Information Processing from Algonquin College, Ottawa, in 1981.

He then worked for several software companies in Ottawa, including Domus Software, Nabu Technologies, and Fulcrum Technologies (now part of Hummingbird), before moving to the U.K. in 1985, thanks to a Canadian SSHRC Doctoral Fellowship and British Council ORS award, to study for a DPhil in the School of Cognitive and Computing Sciences (now the Department of Informatics) at the University of Sussex.

He received his MA in Cognitive Studies in 1986 and was awarded his DPhil in 1992. During 1989 he lectured in Artificial Intelligence at Sussex. From 1990 to 1993 he worked as a Research Associate at the University of Sussex.

In 1993 he became a Lecturer in the Natural Language Processing Group of the Department of Computer Science, Sheffield University. He became a Reader in Computer Science in the same group in 1999 and a Professor in 2002.

Research interests

Rob's research interests are in natural language processing, specifically in information extraction from natural language texts, software architectures for natural language processing and evaluation of language processing systems.

Publications

Journal articles

  • Tang Y, Wang JK, Wang X, Gao B, Dellandréa E, Gaizauskas R & Chen L (2017) Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(12), 3045-3058.
  • Gaizauskas RJ, Paramita ML, Barker E, Pinnis M, Aker A & Pahisa M (2015) Extracting bilingual terms from the Web. Terminology, 21(2), 205-236.
  • Preiss J, Stevenson M & Gaizauskas R (2015) Exploring relation types for literature-based discovery. Journal of the American Medical Informatics Association, 22(5), 987-992.
  • Aker A & Gaizauskas R (2015) Generating descriptive multi-document summaries of geo-located entities using entity type models. Journal of the Association for Information Science and Technology, 66(4), 721-738.
  • Aker A, Paramita ML, Pinnis M & Gaizauskas R (2014) Bilingual dictionaries for all EU languages. Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014, 2839-2845.
  • Aker A, Paramita ML, Barker E & Gaizauskas R (2014) Bootstrapping term extractors for multiple languages. Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014, 483-489.
  • Alhelbawy A & Gaizauskas R (2013) Named entity disambiguation using HMMs. Proceedings - 2013 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Workshops, WI-IATW 2013, 3, 159-162.
  • Derczynski L & Gaizauskas R (2013) Information retrieval for temporal bounding. ACM International Conference Proceeding Series, 129-130.
  • Di Fabbrizio G, Aker A & Gaizauskas R (2013) Summarizing On-line Product and Service Reviews Using Aspect Rating Distributions and Language Modeling. IEEE Intelligent Systems.
  • Gaizauskas R, Barker E, Chang C-L, Derczynski L, Phiri M & Peng C (2012) Applying ISO-Space to Healthcare Facility Design Evaluation Reports. Seventh Workshop on Interoperable Semantic Annotation (ISA), Eighth International Conference on Language Resources and Evaluation, 13-20.
  • Aker A, Plaza L & Lloret E (2012) Do humans have conceptual models about Geographic Objects? A user study. Journal of the American Society for Information Science and Technology (JASIST).
  • Aker A, Fan X, Sanderson M & Gaizauskas R (2012) Investigating summarization techniques for geo-tagged image indexing. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7224 LNCS, 472-475.
  • Di Fabbrizio G, Aker A & Gaizauskas R (2011) STARLET: Multi-document summarization of service and product reviews with balanced rating distributions. Proceedings - IEEE International Conference on Data Mining, ICDM, 67-74.
  • Aker A & Gaizauskas R (2011) Understanding the types of information humans associate with geographic objects. International Conference on Information and Knowledge Management, Proceedings, 1929-1932.
  • Fan X, Aker A, Tomko M, Smart P, Sanderson M & Gaizauskas R (2010) Automatic image captioning from the web for GPS photographs. MIR 2010 - Proceedings of the 2010 ACM SIGMM International Conference on Multimedia Information Retrieval, 445-448.
  • Skadiņa I, Aker A, Giouli V, Tufis D, Gaizauskas R, Mieriņa M & Mastropavlos N (2010) A collection of comparable corpora for under-resourced languages. Frontiers in Artificial Intelligence and Applications, 219, 161-168.
  • Aker A & Gaizauskas R (2009) Summary generation for toponym-referenced images using object type language models. International Conference Recent Advances in Natural Language Processing, RANLP, 6-11.
  • Verhagen M, Gaizauskas RJ, Schilder F, Hepple M, Moszkowicz J & Pustejovsky J (2009) The TempEval challenge: identifying temporal relations in text. Lang. Resour. Evaluation, 43, 161-179.
  • Stevenson M, Guo Y, Gaizauskas R & Martinez D (2008) Disambiguation of biomedical text using diverse sources of information. BMC Bioinformatics, 9 Suppl 11, S7.
  • Roberts A, Gaizauskas R, Hepple M & Guo Y (2008) Mining clinical relationships from patient narratives. BMC Bioinformatics, 9 Suppl 11, S3.
  • Roberts A, Gaizauskas R, Hepple M, Davis N, Demetriou G, Guo Y, Kola J, Roberts I, Setzer A, Tapuria A & Wheeldin B (2007) The CLEF corpus: semantic annotation of clinical text. AMIA Annu Symp Proc, 625-629.
  • Harkema H, Roberts I, Gaizauskas R & Hepple M (2005) A web service for biomedical term look-up. Comp Funct Genomics, 6(1-2), 86-93.
  • Baker P, Hardie A, McEnery T, Xiao R, Bontcheva K, Cunningham H, Gaizauskas RJ, Hamza O, Maynard D, Tablan V, Ursu C et al (2004) Corpus Linguistics and South Asian Languages: Corpus Creation and Tool Development. LLC, 19, 509-524.
  • Gaizauskas R, Davis N, Demetriou G, Guo Y & Roberts I (2004) Integrating text mining into distributed bioinformatics workflows: A Web services implementation. Proceedings - 2004 IEEE International Conference on Services Computing, SCC 2004, 145-152.
  • Roberts L & Gaizauskas R (2004) Evaluating passage retrieval approaches for question answering. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2997, 72-84.
  • Gaizauskas R (2003) Recent advances in computational terminology. Computational Linguistics, 29(2), 328-332.
  • Gaizauskas RJ, Demetriou G, Artymiuk PJ & Willett P (2003) Protein Structures and Information Extraction from Biological Texts: The PASTA System. Bioinform., 19, 135-143.
  • Hirschman L & Gaizauskas RJ (2001) Natural language question answering: the view from here. Nat. Lang. Eng., 7, 275-300.
  • Humphreys K, Demetriou G & Gaizauskas R (2000) Two applications of information extraction to biological science journal articles: enzyme interactions and protein structures. Biocomputing, 505-516.
  • Humphreys K, Demetriou G & Gaizauskas R (2000) Bioinformatics applications of information extraction from scientific journal articles. Journal of Information Science, 26(2), 75-85.
  • Humphreys K, Demetriou G & Gaizauskas R (2000) Two applications of information extraction to biological science journal articles: enzyme interactions and protein structures. Pac Symp Biocomput, 505-516.
  • Krotov A, Hepple M, Gaizauskas RJ & Wilks Y (1999) Evaluating two methods for Treebank grammar compaction. Nat. Lang. Eng., 5, 377-394.
  • Gaizauskas RJ (1998) Karen Sparck Jones and Julia Galliers, Evaluating Natural Language Processing Systems: An Analysis and Review. Berlin: Springer-Verlag, 1996. ISBN 3 540 61309 9, Price DM54.00 (paperback), 228 pages. Nat. Lang. Eng., 4, 175-190.
  • Gaizauskas R & Wilks Y (1998) Information extraction: Beyond document retrieval. Journal of Documentation, 54(1), 70-105.
  • Gaizauskas R & Humphreys K (1997) Using a semantic network for information extraction. Natural Language Engineering, 3(2), 147-169.
  • Evans R, Gaizauskas R, Cahill LJ, Walker J, Richardson J & Dixon A (1995) POETIC: A system for gathering and disseminating traffic information. Natural Language Engineering, 1(4), 363-388.
  • Cunningham H, Wilks Y & Gaizauskas RJ () New Methods, Current Trends and Software Infrastructure for NLP. Proceedings of NEMLAP-2.
  • Cunningham H, Gaizauskas RJ & Wilks Y () A General Architecture for Language Engineering (GATE) - a new approach to Language Engineering R&D.

Conference proceedings papers

  • Funk A, Aker A, Barker E, Paramita ML, Hepple M & Gaizauskas R (2017) The SENSEI Overview of Newspaper Readers’ Comments (pp 758-761)
  • Paramita ML, Clough P & Gaizauskas R (2017) Using Section Headings to Compute Cross-Lingual Similarity of Wikipedia Articles (pp 633-639)
  • Tang Y, Wang JK, Gao B, Dellandréa E, Gaizauskas R & Chen L (2016) Large Scale Semi-supervised Object Detection using Visual and Semantic Knowledge Transfer. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp 2119-2128). Las Vegas, Nevada, 26 June 2016 - 1 July 2016.
  • Aker A, Paramita M, Kurtic E, Funk A, Barker E, Hepple M & Gaizauskas R (2016) Automatic Label Generation for News Comment Clusters. Proceedings of the 9th International Natural Language Generation Conference (pp 61-69), 5 September 2016 - 8 September 2016.
  • Gilbert A, Piras L, Wang JK, Yan F, Ramisa A, Dellandrea E, Gaizauskas R, Villegas M & Mikolajczyk K (2016) Overview of the ImageCLEF 2016 Scalable Concept Image Annotation Task. CLEF 2016 Working Notes. Évora, Portugal, 5 September 2016 - 8 September 2016.
  • Wang JK & Gaizauskas R (2016) Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation. Proceedings of the Ninth International Natural Language Generation Conference, 5 September 2016 - 8 September 2016.
  • Barker E, Paramita ML, Aker A, Kurtic E, Hepple M & Gaizauskas R (2016) The SENSEI Annotated Corpus: Human Summaries of Reader Comment Conversations in On-line News. Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue (pp 42-52), 13 September 2016 - 15 September 2016.
  • (2016) Experimental IR Meets Multilinguality, Multimodality, and Interaction
  • Barker E, Paramita M, Funk A, Kurtic E, Aker A, Foster J, Hepple M & Gaizauskas R (2016) What's the issue here?: Task-based evaluation of reader comment summarization systems. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 3094-3101)
  • Wang J & Gaizauskas R (2016) Cross-validating image description datasets and evaluation metrics. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 3059-3066)
  • Funk A, Gaizauskas R & Favre B (2016) A document repository for social media and speech conversations. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp 436-440)
  • Aker A, Paramita M, Kurtic E, Funk A, Barker E, Hepple M & Gaizauskas R (2016) Automatic label generation for news comment clusters. Proceedings of the 9th International Natural Language Generation conference, 2016.
  • Villegas M, Müller H, García Seco de Herrera A, Schaer R, Bromuri S, Gilbert A, Piras L, Wang J, Yan F, Ramisa A, Dellandrea E et al (2016) General Overview of ImageCLEF at the CLEF 2016 Labs (pp 267-285)
  • Villegas M, Mueller H, de Herrera AGS, Schaer R, Bromuri S, Gilbert A, Piras L, Wang J, Yan F, Ramisa A, Dellandrea E et al (2016) General Overview of ImageCLEF at the CLEF 2016 Labs. Experimental IR Meets Multilinguality, Multimodality, and Interaction (CLEF 2016), Vol. 9822 (pp 267-285)
  • Riccardi G, Bechet F, Danieli M, Favre B, Gaizauskas R, Kruschwitz U & Poesio M (2016) The SENSEI Project: Making Sense of Human Conversations (pp 10-33)
  • Aker A, Kurtic E, Balamurali AR, Paramita M, Barker E, Hepple M & Gaizauskas R (2016) A Graph-Based Approach to Topic Clustering for Online Comments to News. Advances in Information Retrieval (pp 15-29), 20 March 2016 - 23 March 2016.
  • Ramisa A, Wang JK, Lu Y, Dellandrea E, Moreno-Noguer F & Gaizauskas R (2015) Combining Geometric, Textual and Visual Features for Predicting Prepositions in Image Descriptions. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (pp 214-220). Lisbon, Portugal, 17 September 2015 - 21 September 2015.
  • Gaizauskas R, Wang J & Ramisa A (2015) Defining Visually Descriptive Language. Proceedings of the Fourth Workshop on Vision and Language, September 2015.
  • Wang J & Gaizauskas R (2015) Generating Image Descriptions with Gold Standard Visual Inputs: Motivation, Evaluation and Baselines. Proceedings of the 15th European Workshop on Natural Language Generation (ENLG), September 2015.
  • Gilbert A, Piras L, Wang JK, Yan F, Dellandrea E, Gaizauskas R, Villegas M & Mikolajczyk K (2015) Overview of the ImageCLEF 2015 Scalable Image Annotation, Localization and Sentence Generation Task. CEUR Workshop Proceedings. Toulouse, France, 8 September 2015 - 11 September 2015.
  • Aker A, Celli F, Funk JA, Kurtic E, Hepple M & Gaizauskas R (2015) Sheffield-Trento System for Sentiment and Argument Structure Enhanced Comment-to-Article Linking in the Online News Domain. MultiLing 2015 in SIGDIAL. Prague, 2 September 2015 - 4 September 2015.
  • Derczynski L & Gaizauskas R (2015) Temporal relation classification using a model of tense and aspect. International Conference Recent Advances in Natural Language Processing, RANLP, Vol. 2015-January (pp 118-122)
  • Aker A, Kurtic E, Hepple M, Gaizauskas R & Di Fabbrizio G (2015) Comment-to-Article Linking in the Online News Domain. Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, September 2015.
  • Wang JK, Yan F, Aker A & Gaizauskas R (2014) A Poodle or a Dog? Evaluating Automatic Image Annotation Using Human Descriptions at Different Levels of Granularity. Proceedings of the Workshop on Vision and Language 2014 (VL'14), in conjunction with the 25th International Conference on Computational Linguistics (COLING 2014). Dublin, 23 August 2014.
  • Alhelbawy A & Gaizauskas R (2014) Collective named entity disambiguation using graph ranking and clique partitioning approaches. Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers (pp 1544-1555)
  • Aker A, Paramita ML, Pinnis M & Gaizauskas R (2014) Bilingual dictionaries for all EU languages. The Ninth International Conference on Language Resources and Evaluation (LREC 2014) (pp 2839-2845). Reykjavik, Iceland, 26 May 2014 - 31 May 2014.
  • Aker A, Paramita ML, Barker E & Gaizauskas R (2014) Bootstrapping Term Extractors for Multiple Languages. The Ninth International Conference on Language Resources and Evaluation (LREC 2014) (pp 483-489). Reykjavik, Iceland, 26 May 2014 - 31 May 2014.
  • Di Fabbrizio G, Stent A & Gaizauskas R (2014) A Hybrid Approach to Multi-document Summarization of Opinions in Reviews. Proceedings of the 8th International Natural Language Generation Conference (INLG), June 2014.
  • Gaizauskas R, Barker E, Paramita ML & Aker A (2014) Assigning Terms to Domains by Document Classification. Proceedings of the 4th International Workshop on Computational Terminology (Computerm), August 2014.
  • Alhelbawy A & Gaizauskas R (2014) Graph Ranking for Collective Named Entity Disambiguation. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), June 2014.
  • Derczynski L & Gaizauskas R (2013) Empirical Validation of Reichenbach’s Tense Framework. International Conference on Computational Semantics. ACL
  • Derczynski L & Gaizauskas R (2013) Temporal Signals Help Label Temporal Relations. Proceedings of the 51st meeting of the Association for Computational Linguistics. ACL
  • Gaizauskas RJ, Aker A & Lestari Paramita M (2013) Extracting bilingual terminologies from comparable corpora. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Sofia, Bulgaria
  • Paramita M, Clough P, Aker A & Gaizauskas R (2012) Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles. Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012). Istanbul, Turkey, 21 May 2012 - 27 May 2012.
  • Skadina I, Aker A, Mastropavlos N, Su F, Tufis D, Verlic M, Vasiljevs A, Babych B, Clough P, Gaizauskas R, Glaros N et al (2012) Collecting and using comparable corpora for statistical machine translation. Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012 (pp 438-445)
  • Llorens H, Derczynski L, Gaizauskas RJ & Saquete E (2012) TIMEN: An Open Temporal Expression Normalisation Resource. LREC (pp 3044-3051)
  • Barker E & Gaizauskas R (2012) Assessing the Comparability of News Texts. LREC 2012 - Eighth International Conference on Language Resources and Evaluation (pp 3996-4003)
  • Aker A, Kanoulas E & Gaizauskas R (2012) A light way to collect comparable corpora from the Web. LREC 2012 - Eighth International Conference on Language Resources and Evaluation (pp 15-20)
  • Alhelbawy A & Gaizauskas R (2012) Named Entity Based Document Similarity with SVM-Based Re-ranking for Entity Linking. Advanced Machine Learning Technologies and Applications, Vol. 322 (pp 379-388)
  • Aker A, Cohn T & Gaizauskas R (2012) Redundancy reduction for multi-document summaries using A* search and discriminative training. Proceedings of the Workshop on Automatic Text Summarization of the Future. Spain
  • Aker A, Kanoulas E & Gaizauskas R (2012) A light way to collect comparable corpora from the Web. Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, Turkey (pp 21-27)
  • Burman A, Jayapal A, Kannan S, Kavilikatta M, Alhelbawy A, Derczynski L & Gaizauskas RJ (2011) USFD at KBP 2011: Entity Linking, Slot Filling and Temporal Bounding. TAC
  • Llorens H, Saquete E, Navarro B & Gaizauskas RJ (2011) Time-Surfer: Time-Based Graphical Access to Document Content. ECIR, Vol. 6611 (pp 767-771)
  • Skadina I, Vasiljevs A, Skadins R, Gaizauskas R, Tufis D & Gornostay T (2010) Analysis and Evaluation of Comparable Corpora for Under Resourced Areas of Machine Translation. LREC 2010 - Seventh International Conference on Language Resources and Evaluation (pp 6-14)
  • Aker A, Cohn T & Gaizauskas R (2010) Multi-document summarization using A* search and discriminative training. Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (pp 482-491)
  • Derczynski L & Gaizauskas RJ (2010) Analysing Temporally Annotated Corpora with CAVaT. LREC
  • Aker A & Gaizauskas R (2010) Model Summaries for Location-related Images. Proc. of the 7th conference on International Language Resources and Evaluation
  • Aker A, Cohn T & Gaizauskas R (2010) Multi-document summarization using A* search and discriminative training. Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP) (pp 482-491). Cambridge, MA, USA
  • Aswani N & Gaizauskas RJ (2010) Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages. LREC
  • Aswani N & Gaizauskas RJ (2010) English-Hindi Transliteration using Multiple Similarity Metrics. LREC
  • Catizone R, Dingli A & Gaizauskas RJ (2010) Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue. LREC
  • Fan X, Aker A, Tomko M, Smart P, Sanderson M & Gaizauskas RJ (2010) Automatic image captioning from the web for GPS photographs. Multimedia Information Retrieval (pp 445-448)
  • Aker A & Gaizauskas R (2010) Generating image descriptions using dependency relational patterns. Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL) (pp 1250-1258)
  • Stevenson M, Guo Y, Al Amri A & Gaizauskas R (2009) Disambiguation of biomedical abbreviations. Proceedings of the Workshop on BioNLP - BioNLP '09, 4 June 2009 - 5 June 2009.
  • Roberts A, Gaizauskas RJ, Hepple M, Demetriou G, Guo Y, Roberts I & Setzer A (2009) Building a semantically annotated corpus of clinical texts. J. Biomed. Informatics, Vol. 42 (pp 950-966)
  • Stevenson M, Guo Y, Gaizauskas R & Martinez D (2008) Knowledge sources for word sense disambiguation of biomedical text. Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing - BioNLP '08, 19 June 2008.
  • Gaizauskas R (2008) Generating image captions using topic focused multi-document summarization. Proceedings of the Workshop on Multi-source Multilingual Information Extraction and Summarization - MMIES '08, 23 August 2008.
  • Aker A & Gaizauskas R (2008) Evaluating automatically generated user-focused multi-document summaries for geo-referenced images. Proceedings of the Workshop on Multi-source Multilingual Information Extraction and Summarization - MMIES '08, 23 August 2008.
  • Shaw R, Solway B, Gaizauskas R & Greenwood MA (2008) Evaluation of automatically reformulated questions in question series. Coling 2008: Proceedings of the 2nd workshop on Information Retrieval for Question Answering - IRQA '08, 24 August 2008.
  • Stevenson M, Guo Y & Gaizauskas R (2008) Acquiring Sense Tagged Examples using Relevance Feedback. Proceedings of the 22nd International Conference on Computational Linguistics (COLING-08). Manchester, UK
  • Roberts A, Gaizauskas R & Hepple M (2008) Extracting clinical relationships from patient narratives. Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing - BioNLP '08, 19 June 2008.
  • Demetriou G, Gaizauskas RJ, Sun H & Roberts A (2008) ANNALIST - ANNotation ALIgnment and Scoring Tool. LREC
  • Stevenson M, Guo Y & Gaizauskas R (2008) Acquiring sense tagged examples using relevance feedback. Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference, Vol. 1 (pp 809-816)
  • Roberts A, Gaizauskas RJ, Hepple M & Guo Y (2008) Combining Terminology Resources and Statistical Methods for Entity Recognition: an Evaluation. LREC
  • Hepple M, Setzer A & Gaizauskas R (2007) USFD. Proceedings of the 4th International Workshop on Semantic Evaluations - SemEval '07, 23 June 2007 - 24 June 2007.
  • Verhagen M, Gaizauskas R, Schilder F, Hepple M, Katz G & Pustejovsky J (2007) SemEval-2007 task 15. Proceedings of the 4th International Workshop on Semantic Evaluations - SemEval '07, 23 June 2007 - 24 June 2007.
  • Hepple M, Setzer A & Gaizauskas R (2007) USFD: Preliminary exploration of features and classifiers for the TempEval-2007 tasks. ACL 2007 - SemEval 2007 - Proceedings of the 4th International Workshop on Semantic Evaluations (pp 438-441)
  • Verhagen M, Gaizauskas R, Schilder F, Hepple M, Katz G & Pustejovsky J (2007) SemEval-2007 task 15: TempEval temporal relation identification. ACL 2007 - SemEval 2007 - Proceedings of the 4th International Workshop on Semantic Evaluations (pp 75-80)
  • Barker E, Higashinaka R, Mairesse F, Gaizauskas R, Walker M & Foster J (2006) Simulating cub reporter dialogues: The collection of naturalistic human-human dialogues for information access to text archives. Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006 (pp 125-130)
  • Saggion H & Gaizauskas R (2006) Language resources for background gathering. Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006 (pp 1318-1321)
  • Davis N, Demetriou G, Gaizauskas RJ, Guo Y & Roberts I (2006) Web Service Architectures for Text Mining: An Exploration of the Issues via an E-Science Demonstrator. Int. J. Web Service Res., Vol. 3 (pp 95-112)
  • Greenwood MA, Stevenson M & Gaizauskas RJ (2006) The University of Sheffield's TREC 2006 Q&A Experiments. TREC, Vol. 500-272
  • Gaizauskas RJ, Harkema H, Hepple M & Setzer A (2006) Task-Oriented Extraction of Temporal Information: The Case of Clinical Narratives. TIME (pp 188-195)
  • Saggion H & Gaizauskas RJ (2006) Experiments in Passage Selection and Answer Identification for Question Answering. FinTAL, Vol. 4139 (pp 291-302)
  • Gaizauskas R, Hepple M, Saggion H, Greenwood MA & Humphreys K (2005) SUPPLE. Proceedings of the Ninth International Workshop on Parsing Technology - Parsing '05, 9 October 2005 - 10 October 2005. RIS download Bibtex download
  • Aswani N & Gaizauskas R (2005) A hybrid approach to align sentences and words in English-Hindi parallel corpora. Proceedings of the ACL Workshop on Building and Using Parallel Texts - ParaText '05, 29 June 2005 - 30 June 2005. RIS download Bibtex download
  • Aswani N & Gaizauskas R (2005) Aligning words in English-Hindi parallel corpora. Proceedings of the ACL Workshop on Building and Using Parallel Texts - ParaText '05, 29 June 2005 - 30 June 2005.
  • Gaizauskas RJ, Hepple M, Saggion H, Greenwood MA & Humphreys K (2005) SUPPLE: A Practical Parser for Natural Language Engineering Applications. IWPT (pp 200-201)
  • Saggion H, Barker E, Gaizauskas R & Foster J (2005) Integrating NLP tools to support information access to news archives. International Conference Recent Advances in Natural Language Processing, RANLP, Vol. 2005-January (pp 452-458)
  • Setzer A, Gaizauskas RJ & Hepple M (2005) The Role of Inference in the Temporal Annotation and Analysis of Text. Lang. Resour. Evaluation, Vol. 39 (pp 243-265)
  • Gaizauskas RJ, Greenwood MA, Harkema H, Hepple M, Saggion H & Sanka A (2005) The University of Sheffield's TREC 2005 Q&A Experiments. TREC, Vol. 500-266
  • Saggion H & Gaizauskas RJ (2005) Experiments on Statistical and Pattern-Based Biographical Summarization. EPIA, Vol. 3808 (pp 611-621)
  • Mitchell B & Gaizauskas RJ (2004) A Labelled Corpus for Prepositional Phrase Attachment. LREC
  • Harkema H, Gaizauskas RJ, Hepple M, Davis N, Guo Y, Roberts A & Roberts I (2004) A Large-Scale Resource for Storing and Recognizing Technical Terminology. LREC
  • Gaizauskas RJ, Hepple M & Greenwood MA (2004) Information retrieval for question answering: a SIGIR 2004 workshop. SIGIR Forum, Vol. 38 (pp 41-44)
  • Guo Y, Harkema H & Gaizauskas RJ (2004) Sheffield University and the TREC 2004 Genomics Track: Query Expansion Using Synonymous Terms. TREC, Vol. 500-261
  • Gaizauskas RJ, Greenwood MA, Hepple M, Roberts I & Saggion H (2004) The University of Sheffield's TREC 2004 QA Experiments. TREC, Vol. 500-261
  • Pustejovsky J, Saurí R, Castaño JM, Radev DR, Gaizauskas RJ, Setzer A, Sundheim B & Katz G (2004) Representing Temporal and Event Knowledge for QA Systems. New Directions in Question Answering (pp 99-112)
  • Saggion H & Gaizauskas RJ (2004) Mining On-line Sources for Definition Knowledge. FLAIRS Conference (pp 61-66)
  • Gaizauskas RJ, Davis N, Demetriou G, Guo Y & Roberts I (2004) Text Mining into Distributed Bioinformatics Workflows: A Web Services Implementation. IEEE SCC (pp 145-152)
  • Roberts I & Gaizauskas RJ (2004) Evaluating Passage Retrieval Approaches for Question Answering. ECIR, Vol. 2997 (pp 72-84)
  • Harmain HM & Gaizauskas RJ (2003) CM-Builder: A Natural Language-Based CASE Tool for Object-Oriented Analysis. Autom. Softw. Eng., Vol. 10 (pp 157-181)
  • Gaizauskas RJ (2003) Recent Advances in Computational Terminology edited by Didier Bourigault, Christian Jacquemin, and Marie-Claude L'Homme. Computational Linguistics, Vol. 29 (pp 328-332)
  • Gaizauskas RJ, Greenwood MA, Hepple M, Roberts I, Saggion H & Sargaison M (2003) The University of Sheffield's TREC 2003 Q&A Experiments. TREC, Vol. 500-255 (pp 782-790)
  • Pustejovsky J, Castaño JM, Ingria R, Saurí R, Gaizauskas RJ, Setzer A, Katz G & Radev DR (2003) TimeML: Robust Specification of Event and Temporal Expressions in Text. New Directions in Question Answering (pp 28-34)
  • Moreau L, Miles S, Goble CA, Greenwood RM, Dialani V, Addis M, Alpdemir MN, Cawley R, Roure DD, Ferris J, Gaizauskas RJ et al (2003) On the Use of Agents in BioInformatics Grid. CCGRID (pp 653-660)
  • Mitchell B & Gaizauskas RJ (2002) A Comparison of Machine Learning Algorithms for Prepositional Phrase Attachment. LREC
  • Baker P, Hardie A, McEnery T, Cunningham H & Gaizauskas RJ (2002) EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation. LREC
  • Clough PD, Gaizauskas RJ & Piao SSL (2002) Building and annotating a corpus for the study of journalistic text reuse. LREC
  • Clough P, Gaizauskas R, Piao SSL & Wilks Y (2002) METER: MEasuring TExt Reuse. 40th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp 152-159)
  • Greenwood MA, Roberts I & Gaizauskas RJ (2002) The University of Sheffield TREC 2002 Q&A System. TREC, Vol. 500-251
  • Ainsworth S, Clarke D & Gaizauskas RJ (2002) Using Edit Distance Algorithms to Compare Alternative Approaches to ITS Authoring. Intelligent Tutoring Systems, Vol. 2363 (pp 873-882)
  • Demetriou G & Gaizauskas RJ (2002) Utilizing text mining results: The Pasta Web System. ACL Workshop on Natural Language Processing in the Biomedical Domain (pp 77-84)
  • Setzer A & Gaizauskas R (2001) A pilot study on annotating temporal relations in text. Proceedings of the workshop on Temporal and spatial information processing, 7 July 2001.
  • Gaizauskas R, Herring P, Oakes M, Beaulieu M, Willett P, Fowkes H & Jonsson A (2001) Intelligent access to text. Proceedings of the first international conference on Human language technology research - HLT '01, 18 March 2001 - 21 March 2001.
  • Scott S & Gaizauskas RJ (2001) QA-LaSIE: A Natural Language Question Answering System. Canadian Conference on AI, Vol. 2056 (pp 172-182)
  • Bontcheva K, Brewster C, Ciravegna F, Cunningham H, Guthrie L, Gaizauskas R & Wilks Y (2001) Using HLT for acquiring, retrieving and publishing knowledge in AKT. Proceedings of the workshop on Human Language Technology and Knowledge Management, 6 July 2001 - 7 July 2001.
  • Gaizauskas RJ, Rodgers PJ & Humphreys K (2001) Visual Tools for Natural Language Processing. J. Vis. Lang. Comput., Vol. 12 (pp 375-412)
  • Oakes MP, Gaizauskas RJ & Fowkes H (2001) A Method Based on the Chi-Square Test for Document Classification. SIGIR (pp 440-441)
  • Setzer A & Gaizauskas RJ (2000) Annotating Events and Temporal Information in Newswire Texts. LREC
  • Demetriou G & Gaizauskas RJ (2000) Automatically Augmenting Terminological Lexicons from Untagged Text. LREC
  • Scott S & Gaizauskas RJ (2000) University of Sheffield TREC-9 Q&A System. TREC, Vol. 500-249
  • Harmain HM & Gaizauskas RJ (2000) CM-Builder: An Automated NL-Based CASE Tool. ASE (pp 45-54)
  • Stevenson M & Gaizauskas RJ (2000) Experiments on Sentence Boundary Detection. ANLP (pp 84-89)
  • Stevenson M & Gaizauskas RJ (2000) Using Corpus-derived Name Lists for Named Entity Recognition. ANLP (pp 290-295)
  • Azzam S, Humphreys K & Gaizauskas R (1999) Using coreference chains for text summarization. Proceedings of the Workshop on Coreference and its Applications - CorefApp '99, 22 June 1999.
  • Krotov A, Hepple M, Gaizauskas RJ & Wilks Y (1999) Compacting the Penn Treebank Grammar. CoRR, Vol. cs.CL/9902001
  • Azzam S, Humphreys K, Gaizauskas RJ & Wilks Y (1999) Using a Language Independent Domain Model for Multilingual Information Extraction. Applied Artificial Intelligence, Vol. 13 (pp 705-724)
  • Humphreys K, Gaizauskas RJ, Hepple M & Sanderson M (1999) University of Sheffield TREC-8 Q&A System. TREC, Vol. 500-246
  • Azzam S, Humphreys K & Gaizauskas RJ (1998) Evaluating a Focus-Based Approach to Anaphora Resolution. CoRR, Vol. cmp-lg/9807001
  • Gaizauskas RJ (1998) Evaluation in language and speech technology. Comput. Speech Lang., Vol. 12 (pp 249-262)
  • Krotov A, Hepple M, Gaizauskas RJ & Wilks Y (1998) Compacting the Penn Treebank Grammar. COLING-ACL (pp 699-703)
  • Azzam S, Humphreys K & Gaizauskas RJ (1998) Evaluating a Focus-Based Approach to Anaphora Resolution. COLING-ACL (pp 74-78)
  • Gaizauskas RJ & Humphreys K (1997) Concepticons vs. Lexicons: An Architecture for Multilingual Information Extraction. SCIE, Vol. 1299 (pp 28-43)
  • Humphreys K, Gaizauskas R & Azzam S (1997) Event coreference for information extraction. Proceedings of a Workshop on Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts - ANARESOLUTION '97, 11 July 1997.
  • Gaizauskas R, Humphreys K, Azzam S & Wilks Y (1997) Concepticons vs. Lexicons: An architecture for multilingual information extraction. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 1299 (pp 29-43)
  • Rodgers P, Gaizauskas R, Humphreys K & Cunningham H (1997) Visual execution and data visualisation in natural language processing. 1997 IEEE Symposium on Visual Languages, Proceedings (pp 338-343)
  • Cunningham H, Humphreys K, Gaizauskas RJ & Wilks Y (1997) Software Infrastructure for Natural Language Processing. CoRR, Vol. cmp-lg/9702005
  • Rodgers PJ, Gaizauskas RJ, Humphreys K & Cunningham H (1997) Visual Execution and Data Visualization in Natural Language Processing. VL (pp 342-347)
  • Gaizauskas RJ & Robertson AM (1997) Coupling information retrieval and information extraction: A new text technology for gathering information from the web. RIAO (pp 356-373)
  • Robertson AM & Gaizauskas RJ (1997) On the Marriage of Information Retrieval and Information Extraction. BCS-IRSG Annual Colloquium on IR Research
  • Cunningham H, Humphreys K, Gaizauskas RJ & Wilks Y (1997) GATE - a General Architecture for Text Engineering. ANLP (pp 29-30)
  • Cunningham H, Humphreys K, Gaizauskas RJ & Wilks Y (1997) Software Infrastructure for Natural Language Processing. ANLP (pp 237-244)
  • Takemoto Y, Wakao T, Yamada H, Gaizauskas R & Wilks Y (1996) NEC corporation and University of Sheffield. Proceedings of a workshop held at Vienna, Virginia, May 6-8, 1996.
  • Cunningham H, Humphreys K, Gaizauskas R & Wilks Y (1996) TIPSTER-compatible projects at Sheffield. Proceedings of a workshop held at Vienna, Virginia, May 6-8, 1996.
  • Gaizauskas R & Humphreys K (1996) XI: A simple prolog-based language for cross-classification and inheritance. Artificial Intelligence: Methodology, Systems, Applications, Vol. 35 (pp 86-95)
  • Cunningham H, Wilks Y & Gaizauskas RJ (1996) GATE-a General Architecture for Text Engineering. COLING (pp 1057-1060)
  • Wakao T, Gaizauskas RJ & Wilks Y (1996) Evaluation of an Algorithm for the Recognition and Classification of Proper Names. COLING (pp 418-423)
  • Gaizauskas R, Cunningham H, Wilks Y, Rodgers P & Humphreys K (1996) GATE: An environment to support research and development in natural language engineering. Eighth IEEE International Conference on Tools with Artificial Intelligence, Proceedings (pp 58-66)
  • Gaizauskas RJ, Humphreys K, Cunningham H & Wilks Y (1995) University of Sheffield: description of the LaSIE system as used for MUC-6. MUC (pp 207-220)
  • Gaizauskas RJ, Cahill LJ & Evans R (1993) Sussex University: description of the Sussex system used for MUC-5. MUC (pp 321-335)
  • Gaizauskas RJ (1991) Deriving Answers to Logical Queries Via Answer Composition. ALPUK (pp 112-134)
  • Evans R, Gaizauskas R & Hartley AF (1990) POETIC - The Portable Extendible Traffic Information Collator. OECD Workshop on Knowledge-Based Expert Systems in Transportation, Vol 1, Vol. 116 (pp 171-184)
  • Burman A, Jayapal A, Kannan S, Kavilikatta M, Alhelbawy A, Derczynski L & Gaizauskas R () USFD at KBP 2011: Entity Linking, Slot Filling and Temporal Bounding
  • Derczynski L & Gaizauskas R (2011) A Corpus-based Study of Temporal Signals. Proceedings of the 6th Conference on Corpus Linguistics, No. 197 (pp 1-8)
  • Derczynski L & Gaizauskas R (2011) An Annotation Scheme for Reichenbach's Verbal Tense Structure. Proc. 6th Joint ACL-ISO Workshop on Interoperable Semantic Annotation (pp 10-17)
  • Derczynski L & Gaizauskas R () Using Signals to Improve Automatic Classification of Temporal Relations
  • Derczynski L & Gaizauskas R (2010) USFD2: Annotating Temporal Expresions and TLINKs for TempEval-2. Proc. 5th International Workshop on Semantic Evaluation (pp 337-340)
  • Derczynski L & Gaizauskas R (2010) Analysing Temporally Annotated Corpora with CAVaT. Proc. LREC (pp 398-404)
  • Derczynski L, Wang J, Gaizauskas R & Greenwood MA (2008) A Data Driven Approach to Query Expansion in Question Answering. Proc. IR4QA Workshop (pp 34-41)
  • Gaizauskas RJ (2010) Generating image descriptions using dependency relational patterns. Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (pp 1250-1258). Uppsala, Sweden, 11 July 2010 - 16 July 2010.

Working papers

  • Crouch R, Gaizauskas R & Netter K () Report of the Study Group on Assessment and Evaluation.

Grants

Current grants

Previous grants

Professional activities

Head of Natural Language Processing (NLP) research group