Dr Thomas Roelleke

Senior Lecturer
School of Electronic Engineering and Computer Science
Queen Mary University of London
Queen Mary University of London
Research
information retrieval (IR) and probability theory, structured, semantic and knowledge-oriented IR, integration of data management technologies (DB+IR/In-DB IR/ML/AI), generalisations of probabilistic concepts
Interests
My research interest lies is in information retrieval (IR). IR is related to data and information management, database (DB) technology, machine learning (ML) and AI. My research expertise and contributions are in the following areas:1. probabilistic IR models and probability theory
2. structured, semantic and knowledge-oriented retrieval
3. integration of technologies (DB+IR, In-DB IR/ML)
4. modelling of uncertainty in data (probabilistic databases)
5. generalisations of ranking functions and probabilistic reasoning
IR models (ranking functions, e.g. BM25) are rooted in probability and information theory, but apply some magic quantifications and logarithmic expressions to achieve good retrieval quality. My research focuses on explaining model, and achieving mathematical standards. Publications include "IR Models: Foundations and Relationships" (Morgan Claypool book 2013), Harmony Assumptions (Computer Journal 2015), TF-IDF Uncovered, (ACM SIGIR 2008), General Matrix Framework (IP&M Journal), The Probability of Being Informative, (ACM SIGIR 2003), etc. My long-term research aim is finding the undiscovered parts of mathematics that explain the connection between ranking functions and probability theory.
Database-oriented research includes the integration of DB and IR (and ML, and AI), and it is an ongoing research challenge. The areas and methods are closely related, but surprisingly different and separated. My contributions include probabilistic object-relational, logic-based knowledge representations (Retrieval of Complex Objects, and various publications) that are beneficial for solving tasks in the domain of semantic and knowledge-oriented (so-called complex) information management tasks. Under the remit of DB+IR (in recent terminology, In-DB IR/ML), this led to a patented technology: the "Relational Bayes" (VLDB Journal 2008, extended SQL, WHERE ASSUMPTION IS MAX_INFORMATIVE).
Recent publications focus on probabilistic, information-theoretic and structured IR in the context of investigative IR (Journal of Information Systems, 2023), and the Dirichlet-multinomial modelling of recommendation and urgency (Big Data, ML and Intelligent Systems, Frontiers of AI, 2021).
Publications
Publications of specific relevance to the Centre for Multimodal AI2024
Document structure-driven investigative information retrievalKetola T Roelleke T
Information Systems, Elsevier vol. 121, 102315-102315.
01-03-2024
2023
Automatic and Analytical Field Weighting for Structured Document RetrievalKetola T Roelleke T
In Advances in Information Retrieval, Springer Nature 489-503.
01-01-2023
2022
Formal Constraints for Structured Document RetrievalKetola T Roelleke T
Proceedings of the 2022 ACM SIGIR International Conference on Theory of Information Retrieval., 121-126.
23-08-2022
2021
ADOR: A New Medical Dataset for Sentiment-based IRBahrani M
CIKM’21: Fourth Workshop on Knowledge-driven Analytics and Systems Impacting Human Quality of Life. vol. 3052
01-11-2021
Opinion-Aware Retrieval Models Based on Sentiment and Intensity of Lexical FeaturesBahrani M Roelleke T
In Modern Management Based on Big Data II and Machine Learning and Intelligent Systems III, Ios Press
29-10-2021
2020
FDCMBahrani M
Proceedings of the 29th ACM International Conference on Information & Knowledge Management., 1957-1960.
19-10-2020
BM25-FIC: Information content-based field weighting for BM25FKetola T
Ceur Workshop Proceedings. vol. 2741, 79-85.
01-01-2020
2018
A systematic approach to normalization in probabilistic models.Lipani A Roelleke T
Inf Retr Boston, Springer vol. 21 (6), 565-596.
30-06-2018
P/FDMGray PMD
In Encyclopedia of Database Systems, Springer Nature 2643-2644.
01-01-2018
Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) ModelRoelleke T Wang J
In Encyclopedia of Database Systems, Springer Nature 2839-2845.
01-01-2018
2016
Scalable DB+IR Technology: Processing Probabilistic Datalog with HySpiritFrommholz I Roelleke T
Datenbank-Spektrum, Springer Nature vol. 16 (1), 39-48.
26-01-2016
Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) ModelRoelleke T Wang J
In Encyclopedia of Database Systems, Springer Nature 1-7.
01-01-2016
2015
IR meets NLPMilajevs D Sadrzadeh M
Proceedings of the 2015 International Conference on The Theory of Information Retrieval., 231-240.
27-09-2015
Harmony Assumptions in Information Retrieval and Social NetworksRoelleke T Kaltenbrunner A
The Computer Journal, Oxford University Press (OUP) vol. 58 (11), 2982-2999.
14-05-2015
2013
Mathematical Specification and Logic Modelling in the context of IRMartinez-Alvarez M Roelleke T
Proceedings of the 2013 Conference on the Theory of Information Retrieval., 131-132.
29-09-2013
IR ModelsRoelleke T
Proceedings of the 2013 Conference on the Theory of Information Retrieval., 4-4.
29-09-2013
On the modelling of ranking algorithms in probabilistic datalogRoelleke T Bonzanini M
Proceedings of the 7th International Workshop on Ranking in Databases., 1-6.
30-08-2013
Extractive summarisation via sentence removalBonzanini M Martinez-Alvarez M
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval., 893-896.
28-07-2013
Information Retrieval ModelsRoelleke T
, Springer Nature vol. 5 (3), 1-163.
26-07-2013
Document Difficulty Framework for Semi-automatic Text ClassificationMartinez-Alvarez M Bellogin A
Lecture Notes in Computer Science. vol. 8057, 110-121.
01-01-2013
The D2Q2 framework: On the relationship and combination of language modelling and TF-IDFRoelleke T Azzam H Martinez-Alvarez M Lalmas M
Lwa 2013 Lernen Wissen and Adaptivitat Workshop Proceedings., 33-40.
01-01-2013
Information Retrieval Models, Foundations and RelationshipsRoelleke T
01-01-2013
2012
Investigating the use of extractive summarisation in sentiment classificationBonzanini M Martinez-Alvarez M
Ceur Workshop Proceedings. vol. 835, 45-52.
01-12-2012
Opinion summarisation through sentence extractionBonzanini M Martinez-Alvarez M Roelleke T
Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval., 1121-1122.
12-08-2012
IR modelsRoelleke T
Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval., 1187-1188.
12-08-2012
A Schema-driven Approach for Knowledge-oriented Retrieval and Query FormulationAzzam H Yayhaei Roelleke
KEYS 2012, The 3rd International Workshop on Keyword Search and Structured Data Scottsdale, Arizona, USA 20 May 2012., 39-46.
01-01-2012
Semi-automatic Document Classification: Exploiting Document DifficultyMartinez-Alvarez M Yahyaei S
Lecture Notes in Computer Science. vol. 7224, 468-471.
01-01-2012
2011
On the probabilistic logical modelling of quantum and geometrically–inspired IRSmeraldi F Martinez-Alvarez M
Proceeding of the 2nd Italian Information Retrieval Workshop, Milan (Italy).
01-01-2011
A Generic Data Model for Schema-Driven Design in Information Retrieval ApplicationsAzzam H Roelleke T
ADVANCES IN INFORMATION RETRIEVAL THEORY. vol. 6931, 323-326.
01-01-2011
Cross-Lingual Text Fragment Alignment Using Divergence from RandomnessYahyaei S Bonzanini M
STRING PROCESSING AND INFORMATION RETRIEVAL. vol. 7024, 14-25.
01-01-2011
A Descriptive Approach to ClassificationMartinez-Alvarez M
Lecture Notes in Computer Science. vol. 6931, 297-308.
01-01-2011
Large-Scale Logical Retrieval: Technology for Semantic Modelling of Patent SearchAzzam H Klampanos IA
In Current Challenges in Patent Information Retrieval, Springer Nature 181-195.
01-01-2011
Teaching IR: Curricular ConsiderationsBlank D Fuhr N Henrich A Mandl T
In Teaching and Learning in Information Retrieval, Springer Nature 31-46.
01-01-2011
2010
An attribute-based model for semantic retrievalAzzam H
Lwa 2010 Lernen Wissen Und Adaptivitat Learning Knowledge and Adaptivity Workshop Proceedings., 175-182.
01-12-2010
SQRAzzam H Roelleke T
Proceedings of the third workshop on Exploiting semantic annotations in information retrieval., 21-22.
30-10-2010
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): PrefaceGurrin C He Y Kazai G Little S Roelleke T Rüger S
20-05-2010
Modelling Probabilistic Inference Networks and Classification in Probabilistic DatalogMartinez-Alvarez M Roelleke T
Lecture Notes in Computer Science. vol. 6379, 278-291.
01-01-2010
Recent developments in information retrievalGurrin C He Y Kazai G Kruschwitz U Little S Rüger S
Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. vol. 5993 LNCS, 1-9.
01-01-2010
Logic-Based Retrieval: Technology for Content-Oriented and Analytical Querying of Patent DataKlampanos IA Wu HZ Cunningham H Hanbury A Ruger S
ADVANCES IN MULTIDISCIPLINARY RETRIEVAL. vol. 6107, 100-119.
01-01-2010
Recent Developments in Information RetrievalGurrin C He YL Kazai G Kruschwitz U Little S Ruger S Gurrin C He Y et al.
ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS. vol. 5993, 1-9.
01-01-2010
2009
A case for probabilistic logic for scalable patent retrievalKlampanos IA Roelleke T
Proceedings of the 2nd international workshop on Patent information retrieval., 1-8.
06-11-2009
Less Is More: Maximal Marginal Relevance as a Summarisation FeatureForst JF Roelleke T Azzopardi L Kazai G Robertspm S Ruger S Shokouhi M Song D et al.
ADVANCES IN INFORMATION RETRIEVAL THEORY. vol. 5766, 350-353.
01-01-2009
P/FDMGray PMD
In Encyclopedia of Database Systems, Springer Nature 2011-2012.
01-01-2009
Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) ModelRoelleke T Wang J Robertson S
In Encyclopedia of Database Systems, Springer Nature 2156-2160.
01-01-2009
Semi-subsumed Events: A Probabilistic Semantics of the BM25 Term Frequency QuantificationWu HZ Roelleke T Azzopardi L Kazai G Robertspm S Ruger S Shokouhi M Song D et al.
ADVANCES IN INFORMATION RETRIEVAL THEORY. vol. 5766, 375-379.
01-01-2009
2008
DB&IR integrationAmer-Yahia S Hiemstra D Srivastava D Weikum G
Acm Sigir Forum, Association For Computing Machinery (Acm) vol. 42 (2), 84-89.
30-11-2008
DB&IR integrationAmer-Yahia S Hiemstra D Roelleke T Srivastava D
Acm Sigmod Record, Association For Computing Machinery (Acm) vol. 37 (3), 46-49.
30-09-2008
DB&IR Integration: Report on the Dagstuhl Seminar Ranked XML QueryingAmer-Yahia S Hiemstra D Srivastava D
Sigmod Record vol. 37 (3), 46-49.
01-09-2008
TF-IDF Uncovered: A Study of Theories and ProbabilitiesROELLEKE T Wang J
31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval Singapore., 435-432.
01-01-2008
Modelling retrieval models in a probabilistic relational algebra with a new operator: the relational BayesRoelleke T Wu H
Vldb J vol. 17 (1), 5-37.
01-01-2008
DB&IR Integration: Report on the Dagstuhl Seminar Ranked XML QueryingAmer-Yahia S Hiemstra D Roelleke T
Dagstuhl Seminar Proceedings. vol. 8111
01-01-2008
2007
Modelling a summarisation logic in probabilistic datalogForst JF Roelleke T
Lwa 2007 Lernen Wissen Adaptivitat Learning Knowledge and Adaptivity Workshop Proceedings., 221-228.
01-12-2007
TOIS reviewers January 2006 through May 2007Acm Transactions on Information Systems, Association For Computing Machinery (Acm) vol. 25 (4), 15-es.
01-10-2007
2006
A Parallel Derivation of Probabilistic Retrieval ModelsROELLEKE T Wang J
29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, US.
27-08-2006
Solving the enterprise TREC task with probabilistic data modelsForst JF Tombros A
NIST Special Publication.
01-01-2006
A general matrix framework for modelling Information RetrievalRolleke T Kazai G
Information Processing & Management vol. 42 (1), 4-30.
01-01-2006
Context-specific frequencies and discriminativeness for the retrieval of structured documentsWang J Roelleke T Lalmas M MacFarlane A Ruger S Tombros A Tsikrika T Yavlinsky A
ADVANCES IN INFORMATION RETRIEVAL. vol. 3936, 579-582.
01-01-2006
2005
Report on the DB/IR panel at SIGMOD 2005Amer-Yahia S Case P Shanmugasundaram J
Sigmod Record vol. 34 (4), 71-74.
01-12-2005
Relevance Information: A Loss of Entropy but a Gain for IDF?ROELLEKE T
28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, Brazil.
17-08-2005
Building and experimenting with a heterogeneous collectionSzlavik Z Fuhr N Lalmas M Malik S Szlavik Z
ADVANCES IN XML INFORMATION RETRIEVAL. vol. 3493, 349-357.
01-01-2005
The QMUL team with probabilistic SQL at enterprise trackRoelleke T Ashoori E Wu H
NIST Special Publication.
01-01-2005
2004
Third edition of the XML and information retrieval workshop first workshop on integration of IR and DB (WIRD) jointly held at SIGIR'2004, Sheffield, UK, July 29th, 2004Baeza-Yates R Maarek YS de Vries AP
Acm Sigir Forum, Association For Computing Machinery (Acm) vol. 38 (2), 24-30.
01-12-2004
Modelling vague content and structure querying in XML retrieval with a probabilistic object-relational frameworkLalmas M Rolleke T Christiansen H Hacid MS Andreasen T Larsen HL
FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS. vol. 3055, 432-445.
01-01-2004
2003
A Frequency-based and a Poisson-based Definition of the Probability of Being InformativeROELLEKE T
26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada.
31-07-2003
Four-valued knowledge augmentation for structured document retrievalLalmas M Rolleke T
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS. vol. 11 (1), 67-86.
01-02-2003
Abductive retrieval for multimedia information seekingLALMAS M Roelleke T
10th International Conference on Human - Computer Interaction, HCI International, Crete, Greece, vol. 4.
01-01-2003
Intelligent Retrieval of Hypermedia DocumentsLalmas M Rölleke T Fuhr N
In Intelligent Exploration of The Web, Springer Nature 324-344.
01-01-2003
A Frequency-based and a Poisson-based Definition of the Probability of Being InformativeRoelleke T
SIGIR Forum ACM Special Interest Group on Information Retrieval. (SPEC. ISS.), 227-234.
01-01-2003
2002
Using MPEG-7 at the consumer terminal in broadcastingPearmain A Lalmas M Moutogianni E Papworth D Healey P
Eurasip J Appl Sig P vol. 2002 (4), 354-361.
01-04-2002
Four-valued knowledge augmentation for representing structured documentsLalmas M Roelleke T Hacid MS Ras ZW Zighed DA Kodratoff Y
FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS. vol. 2366, 158-166.
01-01-2002
Using MPEG7 at the Consumer Terminal in BroadcastingHealey P LALMAS M Roelleke T Papworth D Moutogianni E
European Association For Signal, Speech and Image Processing Journal of Applied Signal Processing vol. Issue 4, 354-361.
01-01-2002
The accessibility dimension for structured document retrievalRoelleke T Lalmas M Ruthven I Crestani F Girolami M VanRijsbergen CJ
ADVANCES IN INFORMATION REFTRIEVAL. vol. 2291, 284-302.
01-01-2002
Intelligent Hypermedia RetrievalLalmas L ROELLEKE T Fuhr N
In Intelligent Exploration of The Web, Springer-Verlag Group (Physica-Verlag
01-01-2002
Focussed Structured Document RetrievalKazai G Roelleke T
In String Processing and Information Retrieval, Springer Nature 241-247.
01-01-2002
2001
The HySpirit retrieval platformRölleke T Lübeck R
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval.
01-09-2001
Using MPEG-7 at the consumer terminal in broadcastingPearmain A Lalmas M Moutogianni E Papworth D Healey P Rolleke T
WIAMIS 2001 Workshop on Image Analysis for Multimedia Services Tampere, Finland 16 May 2001 - 17 May 2001.
01-01-2001
A model for the representation and focussed retrieval of structured documents based on fuzzy aggregationKazai G Lalmas M
EIGHTH SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS., 123-135.
01-01-2001
Concepts for a graphical user interface for hypermedia retrievalLalmas M Rolleke T Larsen HL Kacprzyk J Zadrozny S Andreasen T Christiansen H
FLEXIBLE QUERY ANSWERING SYSTEMS., 301-314.
01-01-2001
1998
DOLORES: a system for logic-based retrieval of multimedia objectsFuhr N Gövert N Rölleke T
, Association For Computing Machinery (Acm), 257-265.
01-08-1998
DOLORES: A System for Logic-Based Retrieval ObjectsFuhr N Gövert N
SIGIR 1998 Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval., 257-265.
01-08-1998
Querying for facts and content in hypermedia documentsRölleke T
Lecture Notes in Computer Science. vol. 1495, 320-328.
01-01-1998
HySpirit — A probabilistic inference engine for hypermedia retrieval in large databasesFuhr N Rölleke T
Lecture Notes in Computer Science. vol. 1377, 24-38.
01-01-1998
1997
A probabilistic relational algebra for the integration of information retrieval and database systemsFuhr N
Acm Transactions on Information Systems, Association For Computing Machinery (Acm) vol. 15 (1), 32-66.
01-01-1997
1996
Retrieval of complex objects using a four-valued logicRoelleke T
SIGIR Forum ACM Special Interest Group on Information Retrieval., 206-215.
01-12-1996
Ranking-based Processing of SQL QueriesAzzam H Roelleke T Yahyaei S Ounis I Ruthven I de Vries A
CIKM (Conference on Information & Knowledge Management) Glasgow 24 Oct 2011 - 28 Oct 2011., 231-236.
SQR: a Semantic Query Rating SchemeAzzam H Roelleke T
Third CIKM workshop on Exploiting Semantic Annotations in Information Retrieval, ESAIR 2010 Ontario, Candada 26 Oct 2010 - 30 Oct 2010., 21-22.
Semantic-aware Retrieval and Recommendation based on the Dirichlet Compound Language ModelBahrani M Roelleke T
In Research Square
Research Group
PhD Students
- Yufeng Li
Cog Sci
News
No news items found.


