• Slide 1
  • Slide 2
  • slide 3
  • slide 4

Welcome to The Centre for Multimodal AI

The Centre for Multimodal AI consolidates AI research in the School of Electronic Engineering and Computer Science. It builds on the expertise of world-leading academics in the school with emphasis on the development of Machine Learning algorithms, systems and applications for the Analysis and Synthesis of Multimodal Information such as Audio, Images, Videos, and Text, and on the development of AI methodologies in the domains of Games and Decision Support Systems.

The objective of the centre is to contribute to the development of AI methods and systems that will shape the future of our economy and society, striving not only for scientific excellence but also at setting and addressing research challenges for the benefit of our society. This includes challenges around developing AI methods and systems that are Trustworthy, Ethical and Responsible, but also efficient and capable of addressing some of the major challenges in the domains of Health, Education and Digital Economy.

The centre comprises more than 50 academics and 150 researchers, hosted across 6 research entities, namely the Centre for Digital Music, the Computer Vision group, the Multimedia and Vision group, the Computational Linguistics lab, the Game AI group, and the Machine Intelligence and Decision Systems group. Several members of the Centre are Fellows of The Alan Turing Institute and/or of the Digital Environment Research Institute (DERI).

News

Recent Publications

  • Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation
    Bhattacharjee A Pasini M Benetos E
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Barcelona, Spain 4 May 2026 - 8 May 2026
    04-05-2026
  • Domain-invariant representation learning of bird sounds
    Moummad I Serizel R Benetos E Farrugia N
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Barcelona, Spain 4 May 2026 - 8 May 2026
    04-05-2026
  • YuE: Scaling Open Foundation Models for Long-Form Music Generation
    Yuan R Lin H Guo S Zhang G Pan J Zang Y Liu H Liang Y et al.
    14th International Conference on Learning Representations (ICLR) Rio de Janeiro, Brazil 23 Apr 2026 - 27 Apr 2026
    23-04-2026

View more publications »

Recent Grants

  • Intelligent Urban Noise Monitoring
    Lin Wang, Emmanouil Benetos and Andrea Cavallaro
    £31,974 Engineering and Physical Sciences Research Council
    01-06-2026 - 31-10-2026
  • From Prototype to Practice: Evaluating the Real-World Translation of a Hypertension Digital Twin
    Anthony Mathur, Ayesha Ahmed, Xu Chen, Greg Slabaugh and Ajay Gupta
    £45,380 British Heart Foundation
    01-05-2026 - 31-01-2027
  • Multimodal Interaction - controlled generation of human behaviour with diffusion models
    Ioannis Patras
    £102,850 Tavus Inc
    01-05-2026 - 30-04-2029

View more grants »