Johann Petrak
Department of Computer Science
Research Fellow
Member of the Natural Language Processing research group
johann.petrak@sheffield.ac.uk
+44 114 222 1867
+44 114 222 1867
Regent Court (DCS)
Full contact details
Johann Petrak
Department of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
Department of Computer Science
Regent Court (DCS)
211 Portobello
Sheffield
S1 4DP
- Research interests
-
- Cost-sensitive learning for NLP
- Learning to Rank, Metric Learning
- Entity Linking
- Semantic Similarity
- Publications
-
Books
Edited books
Journal articles
- Classification aware neural topic model for COVID-19 disinformation categorisation. PLoS ONE, 16(2). View this article in WRRO
- Using ontologies to map between research data and policymakers’ presumptions: the experience of the KNOWMAK project. Scientometrics. View this article in WRRO
- Analysis of named entity recognition and linking for tweets. Information Processing and Management, 51(2), 32-49. View this article in WRRO
- GPSDB: a new database for synonyms expansion of gene and protein names.. Bioinformatics, 21(8), 1743-1744.
- Searching for patterns in political event sequences: Experiments with the KEDS database. Cybernetics and Systems, 31(6), 649-668.
- Guest editorial: First-order knowledge discovery in databases. Applied Artificial Intelligence, 12(5), 345-361.
- Knowledge discovery in international conflict databases. Applied Artificial Intelligence, 11(2), 91-118.
Chapters
- Internet Support to Collaboration, Data Mining and Decision Support (pp. 247-259). Springer US
Conference proceedings papers
- Team Bertha von Suttner at SemEval-2019 Task 4: Hyperpartisan News Detection using ELMo Sentence Representation Convolutional Network. Proceedings of the 13th International Workshop on Semantic Evaluation, June 2019 - June 2019.
- View this article in WRRO
- A deep neural network sentence level classification method with context information. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (pp 900-904). Brussels, Belgium, 31 October 2018 - 4 November 2018.
- Adapted TextRank for Term Extraction: A generic method of improving automatic term extraction algorithms. Procedia Computer Science, Vol. 137 (pp 102-108), 10 September 2018 - 13 September 2018. View this article in WRRO
- An Extensible Multilingual Open Source Lemmatizer. Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017 (pp 40-45) View this article in WRRO
- Using @Twitter Conventions to Improve #LOD-Based Named Entity Disambiguation (pp 171-186)
Preprints
- Misogyny classification of German newspaper forum comments, arXiv.
- Classification Aware Neural Topic Model and its Application on a New COVID-19 Disinformation Corpus, arXiv. View this article in WRRO
- Classification aware neural topic model for COVID-19 disinformation categorisation. PLoS ONE, 16(2). View this article in WRRO