Professor Hamish Cunningham

Department of Computer Science

Professor of Internet Computing

Member of the Natural Language Processing (NLP) research group

Hamish Cunningham profile photo
h.cunningham@sheffield.ac.uk
+44 114 222 1891

Full contact details

Professor Hamish Cunningham
Department of Computer Science
Regent Court
211 Portobello
Sheffield
S1 4DP
Profile

Prof. Hamish Cunningham is head of the 15 strong GATE team researching language analysis infrastructure, text mining and textual big data processing. He has published some 150 peer-reviewed articles, cited more than 3000 times (Google Scholar). He sits on a number of editorial boards and reviews project proposals for the EC, EPSRC, BBSRC, ESRC and NWO.

Since 1997, he has, singly or jointly, secured and directed 32 research grants worth over £12 million. He was a founding member of the Information Retrieval (IR) Facility along with many of the most influential figures in the IR community world-wide.

Prof. Cunningham was recently the coordinator of the AnnoMarket STREP and the ARCOMEM Integrated Project. His team produces the GATE platform for language and knowledge research, whose users are as diverse as WHO cancer research, OntoText, Matrixware, Generic, Garlik, Spock, Solcara, Fizzback, Innovantage, Astra Zeneca, Merck, Eli Lilly, Ontos, OntoPrise, Thompson, Greenstone, ANC, Perseus, NCSA, AT&T, IBM, British Telecom, Hewlett Packard and thousands of others.

Research interests
  • Language analysis infrastructure, text mining and textual big data processing
  • Physical computing; micro-manufacturing; maker culture; Raspberry Pi
  • Privacy-preserving social media. Crowdfunding.
Publications

Books

  • Cunningham H, Maynard D, Bontcheva K, Tablan V, Aswani N, Roberts I, Gorrell G, Funk A, Roberts A & Damljanovic D (2011) Text Processing with Gate (Version 6). GATE. RIS download Bibtex download

Journal articles

Chapters

Conference proceedings papers

  • Al Mhabis N & Cunningham H (2017) Socio-political perspectives on surveillance and censorship: Implications for on-line privacy in the age of cloud computing. Computing Conference, 2017 RIS download Bibtex download
  • Dimitrov M, Cunningham H, Roberts I, Kostov P, Simov A, Rigaux P & Lippell H (2014) AnnoMarket – Multilingual text analytics at scale on the cloud. The Semantic Web: ESWC 2014 Satellite Events, Vol. 8798 (pp 315-319) RIS download Bibtex download
  • Tablan V, Bontcheva K, Roberts I, Cunningham H & Dimitrov M (2013) AnnoMarket: An Open Cloud Platform for NLP. Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics: System Demonstrations (pp 19-24) RIS download Bibtex download
  • Damljanovic D, Agatonovic M & Cunningham H (2011) FREyA: An Interactive Way of Querying Linked Data Using Natural Language.. ESWC Workshops, Vol. 7117 (pp 125-138) RIS download Bibtex download
  • Damljanovic D, Petrak J, Lupu M, Cunningham H, Carlsson M, Engstrom G & Andersson B (2011) Random Indexing for Finding Similar Nodes within Large RDF Graphs.. ESWC Workshops, Vol. 7117 (pp 156-171) RIS download Bibtex download
  • Damljanovic D, Agatonovic M & Cunningham H (2010) Identification of the question focus: Combining syntactic analysis and ontology-based lookup through the user interaction. Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010 (pp 361-368) RIS download Bibtex download
  • Damljanovic D, Agatonovic M & Cunningham H (2010) Natural Language Interfaces to Ontologies: Combining Syntactic Analysis and Ontology-Based Lookup through the User Interaction.. ESWC (1), Vol. 6088 (pp 106-120) RIS download Bibtex download
  • Cunningham H, Hanbury A & Ruger S (2010) Scaling Up High-Value Retrieval to Medium-Volume Data. ADVANCES IN MULTIDISCIPLINARY RETRIEVAL, Vol. 6107 (pp 1-5) RIS download Bibtex download
  • Johansson M, Li YY, Wakefield J, Greenwood M, Heitz T, Roberts I, Cunningham H, Brennan P, Roberts A & Mckay J (2009) Using Prior Information Attained from the Literature to Improve Ranking in Genome-wide Association Studies. GENETIC EPIDEMIOLOGY, Vol. 33(8) (pp 798-798) RIS download Bibtex download
  • Davis B, Iqbal AA, Funk A, Tablan V, Bontcheva K, Cunningham H & Handschuh S (2008) RoundTrip Ontology Authoring (pp 50-65) RIS download Bibtex download
  • Yankova M, Saggion H & Cunningham H (2008) A framework for identity resolution and merging for multi-source information extraction. Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008 (pp 1367-1372) RIS download Bibtex download
  • Agatonovic M, Aswani N, Bontcheva K, Cunningham H, Heitz T, Li Y, Roberts I & Tablan V (2008) Large-scale, parallel automatic patent annotation.. PaIR (pp 1-8) RIS download Bibtex download
  • Davis B, Iqbal AA, Funk A, Tablan V, Bontcheva K, Cunningham H & Handschuh S (2008) RoundTrip Ontology Authoring. SEMANTIC WEB - ISWC 2008, Vol. 5318 (pp 50-65) RIS download Bibtex download
  • Funk A, Tablan V, Bontcheva K, Cunningham H, Davis B & Handschuh S (2007) CLOnE: Controlled language for ontology editing. SEMANTIC WEB, PROCEEDINGS, Vol. 4825 (pp 142-155) RIS download Bibtex download
  • Li Y, Bontcheva K & Cunningham H (2007) SVM Based Learning System for F-term Patent Classification.. NTCIR RIS download Bibtex download
  • Li Y, Bontcheva K & Cunningham H (2007) Experiments of Opinion Analysis on the Corpora MPQA and NTCIR-6.. NTCIR RIS download Bibtex download
  • Davis B, Handschuh S, Cunningham H & Tablan V (2006) Further use of Controlled Natural Language for Semantic Annotation of Wikis. Proceedings of the 1st Semantic Authoring and Annotation Workshop at ISWC2006. Athens, Georgia, USA RIS download Bibtex download
  • Tablan V, Peters W, Maynard D & Cunningham H (2006) Creating tools for morphological analysis of sumerian. Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006 (pp 1762-1765) RIS download Bibtex download
  • Tablan V, Polajnar T, Cunningham H & Bontcheva K (2006) User-friendly ontology authoring using a controlled language. Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006 (pp 35-40) RIS download Bibtex download
  • Aswani N, Bontcheva K & Cunningham H (2006) Mining information for instance unification. Semantic Web - ISEC 2006, Proceedings, Vol. 4273 (pp 329-342) RIS download Bibtex download
  • Wang T, Li YY, Bontcheva K, Cunningham H & Wang J (2006) Automatic extraction of hierarchical relations from text. SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, Vol. 4011 (pp 215-229) RIS download Bibtex download
  • Li Y, Miao C, Bontcheva K & Cunningham H (2005) Perceptron Learning for Chinese Word Segmentation.. SIGHAN@IJCNLP 2005 RIS download Bibtex download
  • Aswani N, Tablan V, Bontcheva K & Cunningham H (2005) Indexing and querying linguistic metadata and document content. International Conference Recent Advances in Natural Language Processing, RANLP, Vol. 2005-January (pp 74-81) RIS download Bibtex download
  • Li Y, Bontcheva K & Cunningham H (2005) Using uneven margins SVM and Perceptron for information extraction. CoNLL 2005 - Proceedings of the Ninth Conference on Computational Natural Language Learning (pp 72-79) RIS download Bibtex download
  • Li Y, Bontcheva K & Cunningham H (2005) SVM based learning system for information extraction. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 3635 LNAI (pp 319-339) RIS download Bibtex download
  • Dowman M, Tablan V, Cunningham H & Popov B (2005) Web-assisted annotation, semantic indexing and search of television and radio news.. WWW (pp 225-234) RIS download Bibtex download
  • Wang T, Maynard D, Peters W, Bontcheva K & Cunningham H (2005) Extracting a domain ontology from linguistic resource based on relatedness measurements. 2005 IEEE/WIC/ACM International Conference on Web Intelligence, Proceedings (pp 345-351) RIS download Bibtex download
  • Li YY, Bontcheva K & Cunningham H (2005) SVM based learning system for Information Extraction. DETERMINISTIC AND STATISTICAL METHODS IN MACHINE LEARNING, Vol. 3635 (pp 319-339) RIS download Bibtex download
  • Saggion H, Cunningham H, Bontcheva K, Maynard D, Hamza O & Wilks Y (2004) Multimedia indexing through multi-source and multi-language information extraction: the MUMIS project. DATA & KNOWLEDGE ENGINEERING, Vol. 48(2) (pp 247-264) RIS download Bibtex download
  • Maynard D, Bontcheva K & Cunningham H (2004) Automatic language-independent induction of gazetteer lists. Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004 (pp 709-712) RIS download Bibtex download
  • Guthrie L, Basili R, Zanzotto F, Bontcheva K, Cunningham H, Guthrie D, Cui J, Cammisa M, Liu JCC, Martin CF , Haralambiev K et al (2004) Large scale experiments for semantic labeling of noun phrases in raw text. Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004 (pp 811-814) RIS download Bibtex download
  • Dimitrov M, Bontcheva K, Cunningham H & Maynard D (2004) A lightweight approach to coreference resolution for named entities in text. Anaphora Processing, Vol. 263 (pp 97-111) RIS download Bibtex download
  • Wood MM, Lydon SJ, Tablan V, Maynard D & Cunningham H (2004) Using parallel texts to improve recall in botany. Recent Advances in Natural Language Processing III, Vol. 260 (pp 237-246) RIS download Bibtex download
  • Wood MM, Lydon SJ, Tablan V, Maynard D & Cunningham H (2004) Populating a database from parallel texts using ontology-based information extraction. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, Vol. 3136 (pp 254-264) RIS download Bibtex download
  • Maynard D, Yankova M, Aswani N & Cunningham H (2004) Automatic creation and monitoring of semantic metadata in a dynamic knowledge portal. ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, PROCEEDINGS, Vol. 3192 (pp 65-74) RIS download Bibtex download
  • DECLERCK T, CUNNINGHAM H, SAGGION H, KUPER J, REIDSMA D & WITTENBURG P (2003) MUMIS – ADVANCED INFORMATION EXTRACTION FOR MULTIMEDIA INDEXING AND SEARCHING. Digital Media Processing for Multimedia Interactive Services RIS download Bibtex download
  • Saggion H, Kuper J, Cunningham H, Declerck T, Wittenburg P, Puts M, Hoenkamp E, de Jong F & Wilks Y (2003) Event-coreference across multiple, multi-lingual sources in the Mumis project. Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - EACL '03, 12 April 2003 - 17 April 2003. RIS download Bibtex download
  • Manov D, Kiryakov A, Popov B, Bontcheva K, Maynard D & Cunningham H (2003) Experiments with geographic knowledge for information extraction. Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references -, 31 May 2003. RIS download Bibtex download
  • Tablan V, Bontcheva K, Maynard D & Cunningham H (2003) OLLIE. Proceedings of the HLT-NAACL 2003 workshop on Software engineering and architecture of language technology systems - SEALTS '03, 31 May 2003 - 31 May 2003. RIS download Bibtex download
  • Saggion H, Bontcheva K & Cunningham H (2003) Robust generic and query-based summarisation. Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - EACL '03, 12 April 2003 - 17 April 2003. RIS download Bibtex download
  • Maynard D & Cunningham H (2003) Multilingual adaptations of ANNIE, a reusable information extraction tool. Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - EACL '03, 12 April 2003 - 17 April 2003. RIS download Bibtex download
  • Kuper J, Saggion H, Cunningham H, Declerck T, Jong FD, Reidsma D, Wilks Y & Wittenburg P (2003) Intelligent Multimedia Indexing and Retrieval through Multi-source Information Extraction and Merging.. IJCAI (pp 409-414) RIS download Bibtex download
  • Bontcheva K, Maynard D, Tablan V & Cunningham H (2003) GATE: A Unicode-based Infrastructure Supporting Multilingual Information Extraction. Proceedings of Workshop on Information Extraction for Slavonic and other Central and Eastern European Languages (IESL’03). Borovets, Bulgaria RIS download Bibtex download
  • Maynard D, Tablan V & Cunningham H (2003) NE recognition without training data on a language you don’t speak. ACL Workshop on Multilingual and Mixed-language Named Entity Recognition: Combining Statistical and Symbolic Models. Sapporo, Japan RIS download Bibtex download
  • Maynard D, Tablan V, Bontcheva K & Cunningham H (2003) Rapid customization of an information extraction system for a surprise language.. ACM Trans. Asian Lang. Inf. Process., Vol. 2 (pp 295-300) RIS download Bibtex download
  • Cunningham H, Maynard D, Bontcheva K & Tablan V (2002) GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL’02). Philadelphia, USA RIS download Bibtex download
  • Maynard D, Bontcheva K, Saggion H, Cunningham H & Hamza O (2002) Using a text engineering framework to build an extendable and portable IE-based summarisation system. Proceedings of the ACL-02 Workshop on Automatic Summarization -, 11 July 2002 - 12 July 2002. RIS download Bibtex download
  • Bontcheva K, Cunningham H, Tablan V, Maynard D & Hamza O (2002) Using GATE as an environment for teaching NLP. Proceedings of the ACL-02 Workshop on Effective tools and methodologies for teaching natural language processing and computational linguistics -, 7 July 2002 - 7 July 2002. RIS download Bibtex download
  • Pastra K, Maynard D, Hamza O, Cunningham H & Wilks Y (2002) How feasible is the reuse of grammars for Named Entity Recognition?. Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002 (pp 1412-1418) RIS download Bibtex download
  • Saggion H, Cunningham H, Maynard D, Bontcheva K, Hamza O, Ursu C & Wilks Y (2002) Extracting information for automatic indexing of multimedia material. Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002 (pp 669-676) RIS download Bibtex download
  • Tablan V, Ursu C, Bontcheva K, Cunningham H, Maynard D, Hamza O, Mcenery T, Baker P & Leisher M (2002) A unicode-based environment for creation and use of language resources. Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002 (pp 66-71) RIS download Bibtex download
  • Baker P, Hardie A, McEnery T, Cunningham H & Gaizauskas R (2002) EMILLE, A 67-million word corpus of indic languages: Data collection, mark-up and harmonisation. Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002 (pp 819-825) RIS download Bibtex download
  • Bontcheva K, Maynard D, Cunningham H & Saggion H (2002) Using Human Language Technology for Automatic Annotation and Indexing of Digital Library Content (pp 613-625) RIS download Bibtex download
  • Bontcheva K & Cunningham H (2002) Human Language Technology for Automatic Annotation and Indexing of Digital Library Content (pp 658-658) RIS download Bibtex download
  • Cunningham H, Maynard D, Bontcheva K & Tablan V (2002) A framework and graphical development environment for robust NLP tools and applications.. ACL (pp 168-175) RIS download Bibtex download
  • Cunningham H, Maynard D, Bontcheva K & Tablan V (2002) GATE: an architecture for development of robust HLT applications. 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE (pp 168-175) RIS download Bibtex download
  • Saggion H, Cunningham H, Bontcheva K, Maynard D, Ursu C, Hamza O & Wilks Y (2002) Access to multimedia information through multisource and multilanguage information extraction. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, Vol. 2553 (pp 160-171) RIS download Bibtex download
  • Maynard D, Cunningham H, Bontcheva K & Dimitrov M (2002) Adapting a robust multi-genre NE system for automatic content extraction. ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS AND APPLICATIONS, PROCEEDINGS, Vol. 2443 (pp 264-273) RIS download Bibtex download
  • Bontcheva K, Cunningham H, Maynard D, Tablan V & Saggion H (2002) Developing reusable and robust language processing components for information systems using GATE. 13TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS (pp 223-227) RIS download Bibtex download
  • Declerck T, Wittenburg P & Cunningham H (2001) The automatic generation of formal annotations in a multimedia indexing and searching environment. Proceedings of the workshop on Human Language Technology and Knowledge Management -, 6 July 2001 - 7 July 2001. RIS download Bibtex download
  • Cunningham H, Maynard D, Bontcheva K & Tablan V (2001) GATE. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics - ACL '02, 7 July 2002 - 12 July 2002. RIS download Bibtex download
  • Bontcheva K, Brewster C, Ciravegna F, Cunningham H, Guthrie L, Gaizauskas R & Wilks Y (2001) Using HLT for acquiring, retrieving and publishing knowledge in AKT. Proceedings of the workshop on Human Language Technology and Knowledge Management -, 6 July 2001 - 7 July 2001. RIS download Bibtex download
  • Maynard D, Tablan V, Ursu C, Cunningham H & Wilks Y (2001) Named Entity Recognition from Diverse Text Types. Recent Advances in Natural Language Processing 2001 Conference (pp 257-274-257-274). Tzigov Chark, Bulgaria RIS download Bibtex download
  • Cunningham H, Bontcheva K, Tablan V & Wilks Y (2000) Software infrastructure for language resources: A taxonomy of previous work and a requirements analysis. 2nd International Conference on Language Resources and Evaluation, LREC 2000 RIS download Bibtex download
  • Cunningham H, Maynard D, Bontcheva K, Tablan V & Wilks Y (2000) Experience of using GATE for NLP R&D. Proceedings of the Workshop on Using Toolsets and Architectures To Build NLP Systems at COLING-2000. Luxembourg RIS download Bibtex download
  • Stevenson M, Cunningham H & Wilks Y (1998) Sense tagging and language engineering. ECAI 1998: 13TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS (pp 185-189) RIS download Bibtex download
  • Rodgers PJ, Gaizauskas RJ, Humphreys K & Cunningham H (1997) Visual Execution and Data Visualization in Natural Language Processing.. VL (pp 342-347) RIS download Bibtex download
  • Cunningham H, Humphreys K, Gaizauskas R & Wilks Y (1997) GATE. Proceedings of the fifth conference on Applied natural language processing Descriptions of system demonstrations and videos -, 31 March 1997 - 3 April 1997. RIS download Bibtex download
  • Cunningham H, Humphreys K, Gaizauskas R & Wilks Y (1997) Software infrastructure for natural language processing. Proceedings of the fifth conference on Applied natural language processing -, 31 March 1997 - 3 April 1997. RIS download Bibtex download
  • Rodgers P, Gaizauskas R, Humphreys K & Cunningham H (1997) Visual execution and data visualisation in natural language processing. 1997 IEEE SYMPOSIUM ON VISUAL LANGUAGES, PROCEEDINGS (pp 338-343) RIS download Bibtex download
  • Cunningham H, Humphreys K, Gaizauskas R & Wilks Y (1996) TIPSTER-compatible projects at Sheffield. Proceedings of a workshop on held at Vienna, Virginia May 6-8, 1996 -, 6 May 1996 - 8 May 1996. RIS download Bibtex download
  • Cunningham H, Wilks Y & Gaizauskas RJ (1996) GATE. Proceedings of the 16th conference on Computational linguistics -, 5 August 1996 - 9 August 1996. RIS download Bibtex download
  • Gaizauskas R, Cunningham H, Wilks Y, Rodgers P & Humphreys K (1996) GATE: An environment to support research and development in natural language engineering. EIGHTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS (pp 58-66) RIS download Bibtex download
  • Gaizauskas R, Humphreys K, Cunningham H & Wilks Y (1995) University of Sheffield. Proceedings of the 6th conference on Message understanding - MUC6 '95, 6 November 1995 - 8 November 1995. RIS download Bibtex download
  • Cunningham H, Humphreys K, Gaizauskas R & Wilks Y () Software Infrastructure for Natural Language Processing. 5th Conference on Applied Natural Language Processing, 1997 RIS download Bibtex download

Reports

  • Cunningham H, Fuhr N & Stein B (2011) Challenges in Document Mining: Report from Dagstuhl Seminar 11171. RIS download Bibtex download

Software / Code

  • Cunningham H, Tablan V, Bontcheva K, Roberts I, Maynard D, Roberts A & Aswani N (2012) GATE, a General Architecture for Text Engineering. Sheffield, UK: University of Sheffield Retrieved from http://gate.ac.uk/ RIS download Bibtex download
Grants

Current grants

  • Resilient Campus, Resilient City (RC²), HEIF, 02/2019 to 07/2019, £54,000, as PI
  • Project 2, MRC, 10/2018 to 02/2019, £22,744, as PI
  • IRF II, Matrixware Information Services GMBH, 03/2008 to 12/2020, £110,687, as PI
  • IRF IIII, Matrixware Information Services GMBH, 10/2009 to 12/2020, £133,857, as PI
  • IRF V, Matrixware Information Services GMBH, 10/2009 to 12/2020, £36,506, as PI
  • SoBigData Research Infrastructure, EC - H2020, 09/2015 to 08/2019, £649,690, as PI

Previous grants

Professional activities

Research proposal reviewer for

  • EPSRC (the Engineering and Physical Sciences Research Council, UK)
  • the European Commission (FP6 IST, FP7 IST, ERC)
  • BBSRC (the Biotechnology and Biological Sciences Research Council, UK)
  • ESRC (the Economic and Social Research Council, UK)
  • NWO (the Netherlands Organization for Scientific Research)
  • NSERC (Natural Sciences and Engineering Research Council of Canada)
  • IWT (Belgian Institute for the Promotion of Innovation by Science and Technology)

Schools outreach

  • Accredited STEM Ambassador (DBS/CRB certified).
  • Sheffield Cutlers’ Ambassador Scheme Raspberry Pi programme.

Journals

  • Editorial Board member for the journal of Language Resources and Evaluation23.
  • Area Chair for language processing and Editorial Board member for the Journal of Web Semantics (2005-2009).
  • Editor of special issue of the Journal of Natural Language Engineering on Software Architecture for Language Engineering (2004)24.
  • Reviewer for IBM Systems Journal.
  • Reviewer for ACM Transactions on Information Systems (TOIS).
  • Reviewer for the Special Issue of Lingvisticae Investigationes on Named Entities: Recognition, Classification and Use.

Advisory boards and professional associations

  • Member of the Council of Professors and Heads of Computer Science (CPHC).
  • Founding Scientific Board member of the Information Retrieval Facility for large-scale IR experimentation.
  • Advisory group member of the International Internet Preservation Consortium.
  • Technical committee of the European Cultural Heritage Online project.

Standardisation activities

  • Founder member of OASIS/Open standardisation committee on Unstructured Information Management.
  • Principal investigator for Sheffield on LIRICS project for ISO TC37/SC4 standards team.
  • Member of the British Standards Institute committee TS/1 (Language Resources and Terminology).
  • Participant in ISO TC37/SC4 workshop on annotation standards, Pont a Mousson, November 2002.

Conference and workshop organisation

  • Co-chair of the Dagstuhl workshop Challenges in Document Mining, May 2011.
  • General chair of the first Information Retrieval Facility conference (IRFC 2010), Vienna, May 2010.
  • Co-organiser of the New Challenges for NLP Architectures workshop at LREC 2010, Malta, May 2010.
  • Organising committee of the workshop on Crossing Media for Improved Information Access at LREC, Genoa, May 2006.
  • Scientific committee of CASCON 2006, the 16th Annual International Conference of IBM Centers for Advanced Studies, Dublin, October 2006.
  • Co-proposer of Summer Workshop on Language Engineering (chair: Louise Guthrie), CLSP at Johns Hopkins University Baltimore, MA, USA. July 14 to August 22, 2003.
  • Co-chair of workshop on Software Engineering and Architecture of Language Technology Systems (SEALTS) at HLT-NAACL 2003.
  • Co-chair of workshop on Human Language Technology for the Semantic Web and Web Services at International Semantic Web Conference 2003.
  • Programme chair of Workshop on Information Extraction for Slavonic and other Eastern and Central European Languages, RANLP 2003.
  • Organising committee of the LREC-2000 workshop on Meta-Descriptions and Annotation
  • Organising committee of the LREC-2000 workshop on Schemas for Multimodal/Multimedia Language Resources and Data Architectures and Software Support for Large Corpora.
  • Organising committee of the COLING-2000 Workshop on Using Toolsets and Architectures To Build NLP Systems, Centre Universitaire, Luxembourg, 5 August 2000.
  • Co-chair of the EPSRC Workshop on NLP Architectures and Language Resources, Baslow, December 1998.
  • Co-chair of the Distributing and Accessing Language Resources workshop, Granada LREC conference, May 1998.
  • Scholarships, invited lectures and tutorials:
  • ANR Chaire d’Excellence, Internet Memory, Paris, France, 2011-2012.
  • Invited speaker, Text Analytics 2009, Boston, US.
  • Visiting Professor, Université Joseph Fourier, Grenoble, 2009.
  • Invited speaker Discovery Knowledge and Informatics, Amsterdam, April 2007.
  • Panellist on Applications of Memories for Life, British Library symposium, December 2006.
  • Invited speaker at conference on European Digital Cultural Heritage, Salzburg, June 2006.
  • Invited lecturer, EUROLAN 2005, Iasi, Romania.
  • Visiting Scientist, DERI, National University of Ireland at Galway, 2004-2006.
  • Tutorial on Human Language Technology for the Semantic Web at the European Semantic Web Symposium, Heraklion, Crete, May 2004.
  • Invited speaker, ILASH workshop on Human Language Technologies for the Semantic Web: After OWL: Defacto Standards for Semantic Technology, Sheffield, March 2004.
  • Visiting Scholar, Johns Hopkins University Center for Language and Speech Processing, Summer 2003.
  • Invited lecture at IBM TJ Watson laboratory: Software Architecture for Language Engineering. August 2003.
  • Invited tutorial on Named Entity Recognition at RANLP 2003.
  • Invited lecturer on Information Extraction for the EuroLan Summer School, Iasi, Romania, 2001.
  • Invited lecturer for HLT Center of Excellence in Information Society Technologies in 21 century (EC HLT project BIS-21) at the Linguistic Modelling Lab, Bulgarian Academy of Sciences.
  • Invited presentation at British Classification Society workshop on Computer Text Analysis, Feb 2001, Dept. Probability and Statistics, Univ. Sheffield: GATE, A General Architecture for Text Engineering.

Programme committee memberships

  • LREC 2014.
  • 24th International Conference on Computational Linguistics (COLING 2012), Mumbai, India, December 2012.
  • 16th International World Wide Web Conference (WWW2007), Banff, Canada, May 2007.
  • 10th biannual international Congress of the Italian Association for Artificial Intelligence (AI*IA), 2007
  • WWW 2006, the World-Wide Web conference, Edinburgh, UK, May 2006.
  • EACL 2006, the 11th Meeting of the European Chapter of the Association for Computational Linguistics, April 3-7 2006, Italy.
  • Senior Programme Committee of ISWC2006, the Fifth International Semantic Web Conference, Athens, USA, November 5-9, 2006.
  • Web Content Mining with Human Language Technologies workshop at the International Semantic Web Conference, November 2006.
  • The Semantic Desktop and Social Semantic Collaboration Workshop at the International Semantic Web Conference, 6 November 2006, Athens, GA, USA.
  • OntoLex 2006, workshop at LREC 2006, Genoa, Italy.
  • European Semantic Web Conference (ESWC), Budva, Montenegro, June 2006.
  • Workshop on Multi-dimensional Markup in NLP at EACL 2006.
  • Natural Language Processing for Metadata Extraction (NLP4ME 2006), AIMSA, Varna, Bulgaria, September 13-15, 2006.
  • Workshop on Web Content Mining with Human Language Technologies, ISWC, Athens, GA, U.S.A. November 5-9 2006.
  • IJCAI Edinburgh, UK, July/August 2005.
  • Workshop on Multimedia and the Semantic Web, 2nd European Semantic Web Conference, Crete, May / June 2005.
  • RANLP 2005 (Recent Advances in Natural Language Processing), Borovetz, Bulgaria, 2005.
  • Workshop on Text Mining Research, Practice and Opportunities, RANLP 2005.
  • Workshop on End-User Semantic Web Interaction, ISWC 2005, Galway, Ireland.
  • IJCAI workshop on Natural Language Generation and the Semantic Web: Perspective and Challenges, Edinburgh, UK, July/August 2005.
  • EUROLAN 2005, Multilingual Aligned Resources and their use in the context of Knowledge Web, July/August 2005, Cluj-Napoca, Romania.
  • Second European Semantic Web Conference (ESWC), Heraklion, Crete, Greece, May 29 to 1 June, 2005.
  • First International Workshop on Representation and Analysis of Web Space (RAWS-05), Prague-Tocna, September 15-16, 2005.
  • Senior Programme Committee of ISWC2004, the Third International Semantic Web Conference, Hiroshima, Japan, November 7-11, 2004.
  • ACL 2004: 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, July 21-26, 2004.
  • Workshop on the Semantic Web at SIGIR 2004, Sheffield, UK, July 2004.
  • IJCNLP-04 (International Joint Conference on Natural Language Processing), Hainan Island, China, March 22-24, 2004.
  • ESWS (First European Semantic Web Symposium), Heraklion, Greece, May 10-12, 2004.
  • Additional reviewer for COLING 2004.
  • BIS 2004: 7th International Conference on Business Information Systems, Poznan, Poland, April 21-23 2004.
  • ECAI 2004 workshop on Application of Semantic Web Technologies to Web Communities, Valencia, Spain, August 23rd, 2004.
  • ECAI 2004 workshop on Ontology Learning and Population from Text, Valencia, Spain, August, 2004.
  • RDF/RDFS and OWL in Language Technology: 4th Workshop on NLP and XML, ACL-2004, Barcelona, 2004.
  • Workshop on NLP for Multimedia Applications, 16th ESSLI, 16-20 August, Nancy 2004.
  • Workshop on the Semantic Web at SIGIR 2003, Toronto, Canada, July 2003.
  • RANLP 2003 (Recent Advances in Natural Language Processing), Borovetz, Bulgaria, 2003.
  • EACL 2003 workshop on Evaluation Initiatives in Natural Language Processing.
  • EACL 2003 workshop on Language Technology and the Semantic Web (the 3rd Workshop on NLP and XML).
  • Second Workshop on NLP and XML (NLPXML-2002).
  • RANLP 2001 (Recent Advances in Natural Language Processing), Tzigov Chark, Bulgaria, 2003.