Dr Emmanouil Benetos

FHEA

Reader (Associate Professor) in Machine Listening
Director of Research for EECS, Deputy Director of the UKRI Centre for Doctoral Training in AI and Music
Communications and Dissemination Lead of The Centre for Multimodal AI

School of Electronic Engineering and Computer Science
Queen Mary University of London

+44 (0)20 7882 6206

emmanouil.benetos@qmul.ac.uk

webspace.eecs.qmul.ac.uk/emmanouil.benetos/

Centre for Multimodal AI
Centre for Human-Centred Computing

Research

Machine listening, Audio processing, Machine learning, Music information retrieval, Multimodal AI

Interests

I am currently Reader (US equivalent: Associate Professor) in Machine Listening and Director of Research at the School of Electronic Engineering and Computer Science of Queen Mary University of London. Within Queen Mary, I am a member of the Centre for Digital Music, the Centre for Multimodal AI, the Centre for Intelligent Sensing, and the Digital Environment Research Institute, and I co-lead the School's Machine Listening Lab.

My main research topic is computational audio analysis, also referred to as machine listening or computer audition, applied to music, urban, everyday and nature sounds. I have been a Royal Academy of Engineering / Leverhulme Trust Research Fellow in resource-efficient machine listening, a Turing Fellow at the Alan Turing Institute, and a Royal Academy of Engineering Research Fellow, and I have been principal investigator and co-investigator for several funded research projects at the intersection of machine learning and audio. I am also Deputy Director of the UKRI Centre for Doctoral Training in Artificial Intelligence and Music (AIM).

In terms of academic service, I am currently president-elect of the International Society for Music Information Retrieval (ISMIR), a member of the IEEE Technical Committee on Audio and Acoustic Signal Processing (AASP TC) and vice-chair of its reviews subcommittee, vice-chair of the EURASIP Acoustic, Speech and Music Signal Processing Technical Area Committee (ASMSP TAC), associate editor for the IEEE Transactions on Audio, Speech, and Language Processing, and associate editor for the Journal on Audio, Speech, and Music Processing.

Publications

2026

Audio-Based Understanding of Audiobook Narration Appeal
Elisha S Beguerisse-Díaz M Benetos E
27th Annual Conference of the International Speech Communication Association (INTERSPEECH) Sydney, Australia 27 Sep 2026 - 1 Oct 2026.
27-09-2026

Velocity Prediction in Automatic Guitar Transcription
Loth J Riley J Dixon S Benetos E
34th European Signal Processing Conference (EUSIPCO) Bruges, Belgium 31 Aug 2026 - 4 Sep 2026.
31-08-2026

CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction
Ma Y Xia H Gao H Chen W Ye Y Yang Y Chang S Ding M et al.

DOI 10.48550/arxiv.2603.00610

QMRO

11-06-2026

Quality Audio Prototyping: a prototype system for unified sound retrieval and procedural generation
Garcia N Bhattacharjee A Mason-Williams G Mason-Williams I Benetos E Reiss J

30-05-2026

Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation
Bhattacharjee A Pasini M Benetos E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Barcelona, Spain 4 May 2026 - 8 May 2026.

DOI 10.1109/ICASSP55912.2026.11460511

QMRO

04-05-2026

Domain-invariant representation learning of bird sounds
Moummad I Serizel R Benetos E Farrugia N
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Barcelona, Spain 4 May 2026 - 8 May 2026.

DOI 10.1109/ICASSP55912.2026.11463533

QMRO

04-05-2026

YuE: Scaling Open Foundation Models for Long-Form Music Generation
Yuan R Lin H Guo S Zhang G Pan J Zang Y Liu H Liang Y et al.
14th International Conference on Learning Representations (ICLR) Rio de Janeiro, Brazil 23 Apr 2026 - 27 Apr 2026.

DOI 10.48550/arxiv.2503.08638

QMRO

23-04-2026

SCRAPL: Scattering Transform with Random Paths for Machine Learning
Mitcheltree C Lostanlen V Benetos E Lagrange M
14th International Conference on Learning Representations (ICLR) Rio de Janeiro, Brazil 23 Apr 2026 - 27 Apr 2026.

DOI 10.48550/arXiv.2602.11145

QMRO

23-04-2026

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs
Li C Chen Y Ji Y Xu J Cui Z Li S Zhang Y Tang J et al.

DOI 10.48550/arxiv.2510.10689

QMRO

05-03-2026

Computational hermeneutics: evaluating generative AI as a cultural technology
Kommers C Ahnert R Antoniak M Benetos E Benford S Bunz M Caramiaux B Concannon S et al.
Frontiers in Artificial Intelligence, Frontiers Media SA vol. 9

DOI 10.3389/frai.2026.1753041

QMRO

26-02-2026

2025

AutoMV: An Automatic Multi-Agent System for Music Video Generation
Tang X Lei X Zhu C Chen S Yuan R Li Y Oh C Zhang G et al.
In arXiv

DOI 10.48550/arxiv.2512.12196

13-12-2025

OmniBench: Towards The Future of Universal Omni-Language Models
Li Y Ma Y Ma Y Yuan R Zhu K Guo H Liang Y Liu J et al.
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025) 2 Dec 2025 - 7 Dec 2025.

DOI 10.48550/arxiv.2409.15272

QMRO

02-12-2025

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Ma Z Ma Y Zhu Y Yang C Chao Y-W Xu R Chen W Chen Y et al.
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025) 2 Dec 2025 - 7 Dec 2025.

DOI 10.48550/arxiv.2505.13032

QMRO

02-12-2025

Velocity2DMs: A contextual modeling approach to dynamics marking prediction in piano performance
Kim H Benetos E Serra X
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 32

DOI 10.1109/LSP.2025.3633579

QMRO

17-11-2025

CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following
Ma Y Li S Yu J Benetos E Maezawa A
26th International Society for Music Information Retrieval Conference (ISMIR) Daejeon, Korea 21 Sep 2025 - 25 Sep 2025.

QMRO

21-09-2025

Universal Music Representations? Evaluating Foundation Models on World Music Corpora
Papaioannou C Benetos E Potamianos A
26th International Society for Music Information Retrieval Conference (ISMIR) Daejeon, Korea 21 Sep 2025 - 25 Sep 2025.

QMRO

21-09-2025

Perceptual errors in music source separation: looking beyond SDR averages
Sarkar S Moomjian V Woods B Benetos E Sandler M
26th International Society for Music Information Retrieval Conference (ISMIR) Daejeon, Korea 21 Sep 2025 - 25 Sep 2025.

QMRO

21-09-2025

Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss
Huang J Sousa F Demirel E Benetos E Gadelha I
Interspeech 2025 Rotterdam, The Netherlands 17 Aug 2025 - 21 Aug 2025.

DOI 10.21437/Interspeech.2025-311

QMRO

17-07-2025

RUMAA: Repeat-Aware Unified Music Audio Analysis for Score-Performance Alignment, Transcription, and Mistake Detection
Chang S Dixon S Benetos E

DOI 10.48550/arxiv.2507.12175

QMRO

16-07-2025

Refining music sample identification with a self-supervised graph neural network
Bhattacharjee A Higgs IM Sandler M Benetos E

DOI 10.48550/arxiv.2506.14684

QMRO

20-06-2025

Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks
Plachouras C Guinot J Fazekas G Quinton E Benetos E Pauwels J

DOI 10.48550/arxiv.2505.06224

QMRO

09-05-2025

Twenty-Five Years of MIR Research: Achievements, Practices, Evaluations, and Future Challenges
Peeters G Rafii Z Fuentes M Duan Z Benetos E
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1-5.

DOI 10.1109/icassp49660.2025.10888947

QMRO

11-04-2025

Learning Music Audio Representations With Limited Data
Plachouras C Benetos E Pauwels J
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1-5.

DOI 10.1109/icassp49660.2025.10887766

QMRO

11-04-2025

Acoustic identification of individual animals based on hierarchical contrastive learning
De Almeida Nolasco IS Stowell D Benetos E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Hyderabad, India 6 Apr 2025 - 11 Apr 2025.

DOI 10.1109/ICASSP49660.2025.10890076

QMRO

06-04-2025

LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging
Singh S Benetos E Phan H Stowell D
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Hyderabad, India 6 Apr 2025 - 11 Apr 2025.

DOI 10.1109/ICASSP49660.2025.10890467

QMRO

06-04-2025

GraFPrint: A GNN-Based Approach for Audio Identification
Singh S Bhattacharjee A Benetos E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Hyderabad, India 6 Apr 2025 - 11 Apr 2025.

DOI 10.1109/ICASSP49660.2025.10888557

QMRO

06-04-2025

Singing to speech conversion with generative flow
Huang J
EURASIP Journal on Audio, Speech, and Music Processing, SpringerOpen vol. 2025

DOI 10.1186/s13636-025-00400-x

QMRO

10-03-2025

Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities
Liang J Liu X Wang W Plumbley M Phan H Benetos E
IEEE Transactions on Audio, Speech and Language Processing vol. 33, 949-961.

DOI 10.1109/TASLPRO.2025.3533375

QMRO

24-01-2025

LC-Protonets: Multi-label Few-shot learning for world music audio tagging.
Papaioannou C Benetos E
IEEE Open Journal of Signal Processing, Institute of Electrical and Electronics Engineers vol. 6, 138-146.

DOI 10.1109/OJSP.2025.3529315

QMRO

13-01-2025

2024

Classification of spontaneous and scripted speech for multilingual audio
Elisha S McDowell A Beguerisse-Díaz M
IEEE Spoken Language Technology Workshop 2024 Macao, China 2 Dec 2024 - 5 Dec 2024., 489-495.

DOI 10.1109/SLT61566.2024.10832309

QMRO

02-12-2024

ComposerX: Multi-Agent Symbolic Music Composition with LLMs
Deng Q Yang Q Yuan R Huang Y Wang Y Liu X Tian Z Pan J et al.
25th International Society for Music Information Retrieval Conference (ISMIR), San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.

DOI 10.48550/arxiv.2404.18081

QMRO

10-11-2024

Can LLMs Reason in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Zhou Z Wu Y Wu Z Zhang X Yuan R Ma Y Xue W
25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.

DOI 10.48550/arxiv.2407.21531

QMRO

10-11-2024

ST-ITO: Controlling Audio Effects for Style Transfer with Inference-Time Optimization
Steinmetz CJ Singh S Comunità M Ibnyahya I Yuan S Benetos E Reiss JD

DOI 10.48550/arxiv.2410.21233

QMRO

28-10-2024

Exploratory analysis of early-life chick calls
Torrisi A De Almeida Nolasco IS Versace E Benetos E
4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR) Kos, Greece 6 Sep 2024.

QMRO

06-09-2024

Foundation Models for Music: A Survey
Ma Y Øland A Ragni A Del Sette BM Saitis C Donahue C Lin C Plachouras C et al.
In arXiv

DOI 10.48550/arxiv.2408.14340

03-09-2024

Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model
Huang J Benetos E
32nd European Signal Processing Conference (EUSIPCO) Lyon, France 26 Aug 2024 - 30 Aug 2024.

DOI 10.23919/EUSIPCO63174.2024.10715045

QMRO

26-08-2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM
Yuan R Lin H Wang Y Tian Z Wu S Shen T Zhang G Wu Y et al.
62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) Bangkok, Thailand 11 Aug 2024 - 16 Aug 2024.

DOI 10.18653/v1/2024.findings-acl.373

QMRO

11-08-2024

MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
Weck B Manco I Benetos E Fazekas G

DOI 10.48550/arxiv.2408.01337

QMRO

02-08-2024

Explaining models relating objects and privacy
Xompero A Bontonou M Arbona J-M Benetos E
3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024 Seattle Convention Center, Seattle WA, USA 18 Jun 2024.

DOI 10.48550/arXiv.2405.01646

QMRO

18-06-2024

MusiLingo: bridging music and text with pre-trained language models for music captioning and query response
Deng Z Ma Y Liu Y Guo R Zhang G Chen W Huang W Benetos E
2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) Mexico City, Mexico 16 Jun 2024 - 21 Jun 2024., 3643-3655.

DOI 10.18653/v1/2024.findings-naacl.231

QMRO

16-06-2024

Globally, songs and instrumental melodies are slower and higher and use more stable pitches than speech: A Registered Report
Ozaki Y Tierney A Pfordresher PQ McBride JM Benetos E Proutskova P Chiba G Liu F et al.
Science Advances, American Association for the Advancement of Science (AAAS) vol. 10 (20)

DOI 10.1126/sciadv.adm9797

QMRO

15-05-2024

WavCraft: audio editing and generation with large language models
Liang J Zhang H Liu H Cao Y Kong Q Liu X Wang W Plumbley MD et al.
ICLR 2024 Workshop on LLM Agents Vienna, Austria 11 May 2024.

DOI 10.48550/arxiv.2403.09527

QMRO

11-05-2024

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Li Y Yuan R Zhang G Ma Y Chen X Yin H Xiao C Lin C et al.
International Conference on Learning Representations (ICLR) Vienna, Austria 7 May 2024 - 11 May 2024.

QMRO

07-05-2024

Learning from taxonomy: multi-label few-shot classification for everyday sound recognition
Liang J Phan QH Benetos E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 771-775.

DOI 10.1109/ICASSP48485.2024.10446908

QMRO

14-04-2024

MERTech: instrument playing technique detection using self-supervised pretrained model with multi-task finetuning
Li D Ma Y Wei W Kong Q Wu Y Che M Xia F Benetos E et al.
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 521-525.

DOI 10.1109/ICASSP48485.2024.10447445

QMRO

14-04-2024

Generalized multi-source inference for text conditioned music diffusion models
Postolache E Mariani G Cosmo L
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 6980-6984.

DOI 10.1109/ICASSP48485.2024.10447122

QMRO

14-04-2024

Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection
Liang J Nolasco I Ghani B Phan H Benetos E Stowell D
32nd European Signal Processing Conference (EUSIPCO 2024) Lyon, France 26 Aug 2024 - 30 Aug 2024.

DOI 10.23919/EUSIPCO63174.2024.10714948

QMRO

27-03-2024

A Data-Driven Analysis of Robust Automatic Piano Transcription
Edwards D Dixon S Benetos E Maezawa A Kusaka Y

DOI 10.48550/arxiv.2402.01424

QMRO

02-02-2024

YourMT3+: Multi-Instrument Music Transcription with Enhanced Transformer Architectures and Cross-Dataset STEM Augmentation
Chang S Benetos E Kirchhoff H Dixon S
2024 IEEE 34th International Workshop on Machine Learning for Signal Processing (MLSP). vol. 00, 1-6.

DOI 10.1109/mlsp58920.2024.10734819

QMRO

25-01-2024

ATGNN: audio tagging graph neural network
Singh S Steinmetz C Benetos E Phan QH Stowell D
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 825-829.

DOI 10.1109/LSP.2024.3352514

QMRO

17-01-2024

2023

Remaining-useful-life prediction and uncertainty quantification using LSTM ensembles for aircraft engines
Deb O Torr P
NeurIPS Workshop on Advancing Neural Network Training (WANT): Computational Efficiency, Scalability, and Resource Optimization New Orleans, USA 16 Dec 2023.

QMRO

16-12-2023

MARBLE: Music Audio Representation Benchmark for Universal Evaluation
Yuan R Ma Y Li Y Zhang G Chen X Yin H Zhuo L Liu Y et al.
37th Conference on Neural Information Processing Systems (NeurIPS) 10 Dec 2023 - 16 Dec 2023.

DOI 10.48550/arxiv.2306.10548

QMRO

10-12-2023

Learning Music Representations with wav2vec 2.0
Ragano A Benetos E
31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) Letterkenny, Ireland 7 Dec 2023.

DOI 10.48550/arxiv.2210.15310

QMRO

07-12-2023

The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation
Manco I Weck B Doh S Won M Zhang Y Bogdanov D Tovstogan P Benetos E

DOI 10.48550/arxiv.2311.10057

QMRO

22-11-2023

From West to East: Who can understand the music of the others better?
Papaioannou C Benetos E Potamianos A
24th International Society for Music Information Retrieval Conference (ISMIR) Milan, Italy 5 Nov 2023 - 9 Nov 2023.

DOI 10.5281/zenodo.10265287

QMRO

05-11-2023

LyricWhiz: Robust Multilingual Lyrics Transcription by Whispering to ChatGPT
Zhuo L Yuan R Pan J Ma Y Li Y Zhang G Liu S Dannenberg R et al.
24th International Society for Music Information Retrieval Conference (ISMIR) Milan, Italy 5 Nov 2023 - 9 Nov 2023.

DOI 10.48550/arxiv.2306.17103

QMRO

05-11-2023

Leveraging Synthetic Data for Improving Chamber Ensemble Separation
Sarkar S Thorpe L Benetos E Sandler M
2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). vol. 00, 1-5.

DOI 10.1109/waspaa58266.2023.10248118

QMRO

25-10-2023

Perceptual Musical Similarity Metric Learning with Graph Neural Networks
Vahidi C Singh S Benetos E Phan H Stowell D Fazekas G Lagrange M
2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). vol. 00, 1-5.

DOI 10.1109/waspaa58266.2023.10248151

QMRO

25-10-2023

On the Effectiveness of Speech Self-supervised Learning for Music
Ma Y Yuan R Li Y Zhang G Chen X Yin H Lin C Benetos E et al.

DOI 10.48550/arxiv.2307.05161

QMRO

11-07-2023

A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality
Ragano A Benetos E Chinen M Becerra H Chandan Karadagur Ananda R
Irish Signals & Systems Conference 2023 Dublin, Ireland 13 Jun 2023 - 14 Jun 2023.

DOI 10.1109/ISSC59246.2023.10162088

QMRO

13-06-2023

Audio Quality Assessment of Vinyl Music Collections Using Self-Supervised Learning
Ragano A Benetos E
2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 4 Jun 2023 - 10 Jun 2023., 1-5.

DOI 10.1109/icassp49357.2023.10096274

QMRO

04-06-2023

Few-shot Class-incremental Audio Classification Using Dynamically Expanded Classifier with Self-attention Modified Prototypes
Li Y Cao W Xie W
IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers vol. 26, 1346-1360.

DOI 10.1109/TMM.2023.3280011

QMRO

25-05-2023

PiJAMA: Piano Jazz with Automatic MIDI Annotations
Edwards D Dixon S Benetos E
Transactions of The International Society For Music Information Retrieval, Ubiquity Press vol. 6 (1), 89-102.

DOI 10.5334/tismir.162

QMRO

01-01-2023

2022

Joint Scattering for Automatic Chick Call Recognition
Wang C Benetos E Wang S Versace E
2022 30th European Signal Processing Conference (EUSIPCO)., 195-199.

DOI 10.23919/eusipco55093.2022.9909738

QMRO

20-12-2022

Large-Scale Pretrained Model for Self-Supervised Music Audio Representation Learning
Li Y Yuan R Zhang G Ma Y Lin C Chen X Ragni A Yin H et al.
DMRN+17: Digital Music Research Network One-day Workshop 2022 London, UK 20 Dec 2022.

QMRO

20-12-2022

Performance MIDI-to-score conversion by neural beat tracking
Liu L Kong Q Morfi G-V Benetos E
23rd International Society for Music Information Retrieval Conference (ISMIR) Bengaluru, India 4 Dec 2022 - 8 Dec 2022.

DOI 10.5281/zenodo.7316682

QMRO

18-12-2022

EnsembleSet: A new high-quality synthesised dataset for chamber ensemble separation
Sarkar S Benetos E Sandler M
23rd International Society for Music Information Retrieval Conference (ISMIR) Bengaluru, India 5 Dec 2022 - 8 Dec 2022.

DOI 10.5281/zenodo.7316740

QMRO

08-12-2022

Contrastive audio-language learning for music
Manco I Benetos E Fazekas G
23rd International Society for Music Information Retrieval Conference (ISMIR) Bengaluru, India 4 Dec 2022 - 8 Dec 2022.

DOI 10.5281/zenodo.7316744

QMRO

04-12-2022

Explaining the decisions of anomalous sound detectors
Mai KT Davies T
7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) Nancy, France 3 Nov 2022 - 4 Nov 2022.

QMRO

03-11-2022

Leveraging label hierarchies for few-shot everyday sound recognition
Liang J Phan QH Benetos E
7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) Nancy, France 3 Nov 2022 - 4 Nov 2022.

QMRO

03-11-2022

Similarities and differences in a cross-linguistic sample of song and speech recordings
Ozaki Y Kuroyanagi J McBride J Proutskova P Tierney A Benetos E
Joint Conference on Language Evolution Kanazawa, Japan 5 Sep 2022 - 8 Sep 2022.

QMRO

05-09-2022

Hypernetworks for sound event detection: a proof-of-concept
Singh S Benetos E Phan QH
30th European Signal Processing Conference (EUSIPCO 2022) Belgrade, Serbia 29 Aug 2022 - 3 Sep 2022., 429-433.

DOI 10.23919/eusipco55093.2022.9909716

QMRO

29-08-2022

Agreement among human and automated estimates of similarity in a global music sample
Daikoku H Ding S Benetos E Wood ALC Shimizono T Sanne US
10th International Workshop on Folk Music Analysis (FMA 2022) Sheffield, UK 14 Jun 2022 - 17 Jun 2022.

QMRO

14-06-2022

Exploring Transformer’s Potential on Automatic Piano Transcription
Ou L Guo Z Benetos E
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 776-780.

DOI 10.1109/icassp43922.2022.9746789

QMRO

27-05-2022

Learning Music Audio Representations Via Weak Language Supervision
Manco I Benetos E Quinton E
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 456-460.

DOI 10.1109/icassp43922.2022.9746996

QMRO

27-05-2022

Improving Lyrics Alignment through Joint Pitch Detection
Huang J Benetos E Ewert S
2022 IEEE International Conference on Acoustics, Speech and Signal Processing Singapore 22 May 2022 - 27 May 2022., 451-455.

DOI 10.1109/ICASSP43922.2022.9746460

QMRO

22-05-2022

Automatic Quality Assessment of Digitized and Restored Sound Archives
Ragano A Benetos E
Journal of The Audio Engineering Society, Audio Engineering Society vol. 70 (4), 252-270.

DOI 10.17743/jaes.2022.0002

QMRO

11-05-2022

Adaptive Scattering Transforms for Playing Technique Recognition
Wang C Benetos E Lostanlen V
IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 30, 1407-1421.

DOI 10.1109/TASLP.2022.3156785

QMRO

07-03-2022

Measuring national mood with music: using machine learning to construct a measure of national valence from audio data
Benetos E Ragano A Sgroi D
Behavior Research Methods, Springer Nature vol. 54 (6), 3085-3092.

DOI 10.3758/s13428-021-01747-7

QMRO

25-02-2022

2021

Comparison of Feature Extraction Methods for Sound-Based Classification of Honey Bee Activity
Terenzi A Nolasco I Benetos E
IEEE Transactions on Audio Speech and Language Processing, Institute of Electrical and Electronics Engineers (IEEE) vol. 30, 112-122.

DOI 10.1109/taslp.2021.3133194

QMRO

07-12-2021

A framework for music similarity and cover song identification
Bodo RPP Benetos E
15th International Symposium on Computer Music Multidisciplinary Research (CMMR) Tokyo, Japan 15 Nov 2021 - 19 Nov 2021., 205-214.

QMRO

15-11-2021

ACPAS: A Dataset of Aligned Classical Piano Audio and Scores for Audio-to-Score Transcription
Liu L Morfi V Benetos E
Late-Breaking Demo Session of the 22nd Int. Society for Music Information Retrieval Conference.

QMRO

11-11-2021

Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes
Vianna Lordelo C Benetos E Dixon S Ahlbäck S
22nd International Society for Music Information Retrieval Conference (ISMIR) 9 Nov 2021 - 12 Nov 2021., 389-395.

QMRO

09-11-2021

Agreement among human and annotated transcriptions of global songs
Ozaki Y McBride J Benetos E Pfordresher PQ Six J T. Tierney A Proutskova P Fukatsu H et al.
22nd International Society for Music Information Retrieval Conference (ISMIR) 9 Nov 2021 - 12 Nov 2021., 500-508.

QMRO

09-11-2021

Detecting Cover Songs with Pitch Class Key-Invariant Networks
O'Hanlon K Benetos E Dixon S
2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP). vol. 00, 1-6.

DOI 10.1109/mlsp52302.2021.9596389

QMRO

28-10-2021

Humanities and engineering perspectives on music transcription
Holzapfel A Benetos E Widdess R
Digital Scholarship in The Humanities, Oxford University Press (OUP) vol. 37 (3), 747-764.

DOI 10.1093/llc/fqab074

QMRO

23-10-2021

Vocal Harmony Separation Using Time-Domain Neural Networks
Sarkar S Benetos E Sandler M
Interspeech 2021., 3515-3519.

DOI 10.21437/interspeech.2021-1531

QMRO

30-08-2021

An Evaluation of Data Augmentation Methods for Sound Scene Geotagging
Bear HL Morfi V
Interspeech 2021., 581-585.

DOI 10.21437/interspeech.2021-1837

QMRO

30-08-2021

Violinist identification based on vibrato features
Zhao Y Wang C Fazekas G Benetos E Sandler M
2021 29th European Signal Processing Conference (EUSIPCO). vol. 00, 381-385.

DOI 10.23919/eusipco54536.2021.9616197

QMRO

27-08-2021

Revisiting the onsets and frames model with additive attention
Cheuk KW Luo Y-J Benetos E Herremans D
International Joint Conference on Neural Networks (IJCNN) 18 Jul 2021 - 22 Jul 2021.

DOI 10.1109/IJCNN52387.2021.9533407

QMRO

18-07-2021

More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations
Ragano A Benetos E Hines A
2021 13th International Conference on Quality of Multimedia Experience (QoMEX). vol. 00, 103-108.

DOI 10.1109/qomex51781.2021.9465410

QMRO

17-06-2021

Prototypical Networks for Domain Adaptation in Acoustic Scene Classification
Singh S Bear H Benetos E
IEEE International Conference on Acoustics, Speech and Signal Processing Toronto, Canada 6 Jun 2021 - 11 Jun 2021.

DOI 10.1109/ICASSP39728.2021.9414876

QMRO

06-06-2021

Joint multi-pitch detection and score transcription for polyphonic piano music
Liu L Morfi G-V Benetos E
IEEE International Conference on Acoustics, Speech and Signal Processing Toronto, Canada 6 Jun 2021 - 11 Jun 2021.

DOI 10.1109/ICASSP39728.2021.9413601

QMRO

06-06-2021

Anomalous behaviour in loss-gradient based interpretability methods
Subramanian V Gururani S Benetos E Sandler M
RobustML workshop paper at ICLR 2021.

QMRO

07-05-2021

MusCaps: Generating Captions for Music Audio
Manco I Benetos E Fazekas G

DOI 10.48550/arxiv.2104.11984

QMRO

24-04-2021

The effect of spectrogram reconstructions on automatic music transcription: an alternative approach to improve transcription accuracy
Cheuk KW Benetos E Luo Y Herremans D
25th International Conference on Pattern Recognition (ICPR2020) Milan, Italy 10 Jan 2021 - 15 Jan 2021., 9091-9098.

DOI 10.1109/ICPR48806.2021.9412155

QMRO

10-01-2021

From Audio to Music Notation
Liu L Benetos E
In Handbook of Artificial Intelligence For Music, Springer Nature 693-714.

DOI 10.1007/978-3-030-72116-9_24

QMRO

01-01-2021

2020

Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation
Lordelo C Benetos E Dixon S Ahlbäck S Ohlsson P
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers (IEEE) vol. 28, 81-85.

DOI 10.1109/lsp.2020.3045915

QMRO

18-12-2020

Joint Piano-roll and Score Transcription for Polyphonic Piano Music
Liu L Morfi G-V Benetos E
DMRN+15: Digital Music Research Network One-day Workshop London, UK 15 Dec 2020.

QMRO

15-12-2020

Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark
Chettri B Benetos E Sturm BLT
IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 28, 3018-3028.

DOI 10.1109/TASLP.2020.3036777

QMRO

09-11-2020

Subband modeling for spoofing detection in automatic speaker verification
Chettri B Kinnunen T
Odyssey 2020: The Speaker and Language Recognition Workshop Tokyo, Japan 1 Nov 2020 - 5 Nov 2020., 341-348.

DOI 10.21437/Odyssey.2020-48

QMRO

01-11-2020

Memory Controlled Sequential Self Attention for Sound Recognition
Pankajakshan A Bear H Benetos E
21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) Shanghai, China 25 Oct 2020 - 29 Oct 2020.

DOI 10.21437/Interspeech.2020-1953

QMRO

25-10-2020

Development of a Speech Quality Database Under Uncontrolled Conditions
Ragano A Benetos E
21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) Shanghai, China 25 Oct 2020 - 29 Oct 2020.

DOI 10.21437/Interspeech.2020-1899

QMRO

25-10-2020

Reliable Local Explanations for Machine Listening
Mishra S Benetos E Sturm BLT Dixon S
2020 International Joint Conference on Neural Networks (IJCNN). vol. 00, 1-8.

DOI 10.1109/ijcnn48605.2020.9207444

QMRO

24-07-2020

Audio impairment recognition using a correlation-based feature representation
Ragano A Benetos E
12th International Conference on Quality of Multimedia Experience (QoMEX) Athlone, Ireland 26 May 2020 - 28 May 2020.

DOI 10.1109/QoMEX48832.2020.9123111

QMRO

26-05-2020

Modeling Plate and Spring Reverberation Using a DSP-Informed Deep Neural Network
Ramírez MAM Benetos E Reiss JD
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 241-245.

DOI 10.1109/icassp40776.2020.9053093

QMRO

08-05-2020

A Study on the Transferability of Adversarial Attacks in Sound Event Classification
Subramanian V Pankajakshan A Benetos E Xu N McDonald S Sandler M
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 301-305.

DOI 10.1109/icassp40776.2020.9054445

QMRO

08-05-2020

Playing Technique Recognition by Joint Time–Frequency Scattering
Wang C Lostanlen V Benetos E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020., 881-885.

DOI 10.1109/ICASSP40776.2020.9053474

QMRO

04-05-2020

A-CRNN: a domain adaptation model for sound event detection
Wei W Zhu H Benetos E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020., 276-280.

DOI 10.1109/ICASSP40776.2020.9054248

QMRO

04-05-2020

Musical Features for Automatic Music Transcription Evaluation
Ycart A Liu L Benetos E

QMRO

15-04-2020

Learning and Evaluation Methodologies for Polyphonic Music Sequence Prediction with LSTMs
Ycart A Benetos E
IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 28 (1), 1328-1341.

DOI 10.1109/TASLP.2020.2987130

QMRO

14-04-2020

Deep Generative Variational Autoencoding for Replay Spoof Detection in Automatic Speaker Verification
Chettri B Kinnunen T
Computer Speech and Language, Elsevier vol. 63

DOI 10.1016/j.csl.2020.101092

QMRO

19-03-2020

Deep Learning for Black-Box Modeling of Audio Effects
Martínez Ramírez MA Benetos E Reiss JD
Applied Sciences, MDPI vol. 10 (2)

DOI 10.3390/app10020638

QMRO

16-01-2020

Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription
Ycart A Liu L Benetos E Pearce MT
Transactions of The International Society For Music Information Retrieval, Ubiquity Press vol. 3 (1), 68-81.

DOI 10.5334/tismir.57

QMRO

01-01-2020

2019

Automatic Music Accompaniment with a Chroma-based Music Data Representation
Liu L Benetos E
DMRN+14: Digital Music Research Network One-day Workshop.

QMRO

17-12-2019

Adaptive Time–Frequency Scattering for Periodic Modulation Recognition in Music Signals
Wang C Benetos E Lostanlen V
International Society for Music Information Retrieval Conference Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019., 809-815.

QMRO

04-11-2019

Automatic music transcription and ethnomusicology: a user study
Holzapfel A
20th conference of the International Society for Music Information Retrieval (ISMIR) Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019., 678-684.

QMRO

04-11-2019

Blending acoustic and language model predictions for automatic music transcription
Ycart A McLeod A Benetos E
20th conference of the International Society for Music Information Retrieval (ISMIR) Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019., 454-461.

QMRO

04-11-2019

A Comparative Study of Neural Models for Polyphonic Music Sequence Transduction
Ycart A Stoller D
20th conference of the International Society for Music Information Retrieval (ISMIR) Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019., 470-477.

QMRO

04-11-2019

CBF-periDB: A Chinese Bamboo Flute Dataset for Periodic Modulation Analysis
Wang C Benetos E
International Society for Music Information Retrieval Conference Late-Breaking Demo Session Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019.

QMRO

04-11-2019

Onsets, activity, and events: a multi-task approach for polyphonic sound event modelling
Pankajakshan A Benetos E
4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) New York, USA 25 Oct 2019 - 26 Oct 2019., 174-178.

DOI 10.33682/sm6r-8p49

QMRO

25-10-2019

Audio tagging using a linear noise modelling layer
Singh S Pankajakshan A
4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) New York, USA 25 Oct 2019 - 26 Oct 2019., 234-238.

DOI 10.33682/zyc0-jw35

QMRO

25-10-2019

Investigating Kernel Shapes and Skip Connections for Deep Learning-Based Harmonic-Percussive Separation
Lordelo C Benetos E Dixon S Ahlbäck S
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)., 40-44.

DOI 10.1109/waspaa.2019.8937079

QMRO

23-10-2019

Polyphonic sound event and sound activity detection: a multi-task approach
Pankajakshan A Bear H
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2019 - 23 Oct 2019., 318-322.

DOI 10.1109/WASPAA.2019.8937193

QMRO

20-10-2019

City classification from multiple real-world sound scenes
Bear H Heittola T Mesaros A Virtanen T
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2019 - 23 Oct 2019., 11-15.

DOI 10.1109/WASPAA.2019.8937271

QMRO

20-10-2019

Ensemble Models for Spoofing Detection in Automatic Speaker Verification
Chettri B Stoller D Morfi V Martinez Ramirez M
20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) Graz, Austria 15 Sep 2019 - 19 Sep 2019., 1018-1022.

DOI 10.21437/Interspeech.2019-2505

QMRO

15-09-2019

Towards joint sound scene and polyphonic sound event recognition
Bear H Nolasco I
20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) Graz, Austria 15 Sep 2019 - 19 Sep 2019., 4594-4598.

DOI 10.21437/Interspeech.2019-2169

QMRO

15-09-2019

A general-purpose deep learning approach to model time-varying audio effects
Martinez Ramirez M Benetos E Reiss J
International Conference on Digital Audio Effects (DAFx-19) Birmingham, UK 2 Sep 2019 - 6 Sep 2019.

QMRO

02-09-2019

Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF
Zhou Q Feng Z
Sensors, MDPI AG vol. 19 (14)

DOI 10.3390/s19143206

QMRO

20-07-2019

Adversarial Attacks in Sound Event Classification.
Subramanian V Benetos E Xu N McDonald S Sandler MB

04-07-2019

Optimal Neural Network Feature Selection for Spatial-Temporal Forecasting
Covas E
Chaos, AIP Publishing vol. 29 (6)

DOI 10.1063/1.5095060

QMRO

20-06-2019

Adapting the Quality of Experience Framework for Audio Archive Evaluation
Ragano A Benetos E
11th International Conference on Quality of Multimedia Experience Berlin, Germany 5 Jun 2019 - 7 Jun 2019.

DOI 10.1109/QoMEX.2019.8743302

QMRO

05-06-2019

HMM-based Glissando Detection for Recordings of Chinese Bamboo Flute
Wang C Benetos E Meng X
Sound and Music Computing Conference Malaga, Spain 28 May 2019 - 31 May 2019., 545-550.

QMRO

28-05-2019

SubSpectralNet - Using sub-spectrogram based convolutional neural networks for acoustic scene classification
Phaye SSR Benetos E Wang Y
IEEE International Conference on Acoustics, Speech, and Signal Processing Brighton, UK 12 May 2019 - 17 May 2019.

DOI 10.1109/ICASSP.2019.8683288

QMRO

12-05-2019

Automatic Transcription of Diatonic Harmonica Recordings
Lins F Johann M Benetos E
IEEE International Conference on Acoustics, Speech, and Signal Processing Brighton, UK 12 May 2019 - 17 May 2019.

DOI 10.1109/ICASSP.2019.8682334

QMRO

12-05-2019

GAN-based Generation and Automatic Selection of Explanations for Neural Networks
Mishra S Stoller D Benetos E Sturm B Dixon S
SafeML ICLR 2019 Workshop New Orleans, USA 6 May 2019.

QMRO

06-05-2019

Audio-based identification of beehive states
Nolasco I Terenzi A Cecchi S Orcioni S
IEEE International Conference on Acoustics, Speech, and Signal Processing Brighton, UK 12 May 2019 - 17 May 2019.

DOI 10.1109/ICASSP.2019.8682981

QMRO

12-02-2019

Automatic Music Transcription
Benetos E Dixon S Duan Z Ewert S
IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers (IEEE) vol. 36 (1), 20-30.

DOI 10.1109/msp.2018.2869928

QMRO

01-01-2019

Robustness of Adversarial Attacks in Sound Event Classification
Subramanian V Benetos E Sandler MB
Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)., 239-243.

DOI 10.33682/sp9n-qk06

QMRO

01-01-2019

2018

Analysing the predictions of a CNN-based replay spoofing detection system
Chettri B Mishra S Sturm B Benetos E
2018 IEEE Workshop on Spoken Language Technology Athens, Greece 18 Dec 2018 - 21 Dec 2018., 92-97.

DOI 10.1109/SLT.2018.8639666

QMRO

18-12-2018

An extensible cluster-graph taxonomy for open set sound scene analysis
Bear H
Workshop on Detection and Classification of Acoustic Scenes and Events Surrey, UK 19 Nov 2018 - 20 Nov 2018.

QMRO

19-11-2018

To bee or not to bee: Investigating machine learning approaches for beehive sound recognition
Nolasco I Benetos E
2018 Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2018) Surrey, UK 19 Nov 2018 - 20 Nov 2018.

QMRO

19-11-2018

A-MAPS: Augmented MAPS Dataset with Rhythm and Key Annotations
Ycart A
19th International Society for Music Information Retrieval Conference Late-Breaking Demos Session Paris 23 Sep 2018 - 27 Sep 2018.

QMRO

23-09-2018

Towards HMM-based glissando detection for recordings of Chinese bamboo flute
Wang C Benetos E Meng X
International Society for Music Information Retrieval Conference Late-Breaking Demos Session Paris, France 23 Sep 2018 - 27 Sep 2018.

QMRO

23-09-2018

Analysing replay spoofing countermeasure performance under varied conditions
Chettri B Sturm BLT Benetos E
IEEE International Workshop on Machine Learning for Signal Processing Aalborg, Denmark 17 Sep 2018 - 20 Sep 2018.

DOI 10.1109/MLSP.2018.8516968

QMRO

17-09-2018

A Study On Convolutional Neural Network Based End-To-End Replay Anti-Spoofing
Chettri B Mishra S Sturm BL

QMRO

22-05-2018

Polyphonic Music Sequence Transduction with Meter-Constrained LSTM Networks
Ycart A
IEEE International Conference on Acoustics, Speech and Signal Processing Calgary, Canada 15 Apr 2018 - 20 Apr 2018., 386-390.

DOI 10.1109/ICASSP.2018.8462128

QMRO

15-04-2018

Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization
Nakamura E Benetos E Yoshii K Dixon S
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)., 101-105.

DOI 10.1109/icassp.2018.8461914

QMRO

15-04-2018

A Supervised Classification Approach for Note Tracking in Polyphonic Piano Transcription
Valero-Mas JJ Benetos E Iñesta JM
Journal of New Music Research, Taylor & Francis (Routledge) vol. 47 (3), 249-263.

DOI 10.1080/09298215.2018.1451546

QMRO

26-03-2018

Speaker recognition with hybrid features from a deep belief network
Ali H Tran SN d'Avila Garcez AS
Neural Computing and Applications, Springer Verlag (Germany) vol. 29 (6), 13-19.

DOI 10.1007/s00521-016-2501-7

QMRO

01-03-2018

Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge
Mesaros A Heittola T Benetos E Foster P Lagrange M
IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 26 (2), 379-393.

DOI 10.1109/TASLP.2017.2778423

QMRO

01-02-2018

A review of manual and computational approaches for the study of world music corpora
Panteli M Benetos E Dixon S
Journal of New Music Research, Taylor & Francis vol. 47 (2), 176-189.

DOI 10.1080/09298215.2017.1418896

QMRO

08-01-2018

Approaches to complex sound scene analysis
Benetos E Stowell D Plumbley M Virtanen T Plumbley M Ellis D
In Computational Analysis of Sound Scenes and Events, Springer International Publishing 215-242.

DOI 10.1007/978-3-319-63450-0

01-01-2018

Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France, September 23-27, 2018
ISMIR.
01-01-2018

2017

A computational study on outliers in world music
Panteli M Benetos E Dixon S
PLOS ONE, Public Library of Science (PLOS) vol. 12 (12)

DOI 10.1371/journal.pone.0189399

QMRO

18-12-2017

Automatic Transcription of Polyphonic Vocal Music
McLeod A Steedman M Benetos E
Applied Sciences, MDPI AG vol. 7 (12)

DOI 10.3390/app7121285

QMRO

11-12-2017

Multi-pitch detection and voice assignment for a cappella recordings of multiple singers
Schramm R McLeod A Benetos E
18th International Society for Music Information Retrieval Conference (ISMIR 2017) Suzhou, China 23 Oct 2017 - 27 Oct 2017., 552-559.

QMRO

23-10-2017

A study on LSTM networks for polyphonic music sequence modelling
Ycart A Benetos E
18th International Society for Music Information Retrieval Conference (ISMIR 2017) Suzhou, China 23 Oct 2017 - 27 Oct 2017., 421-427.

QMRO

23-10-2017

Sound Event Detection in Synthetic Audio: Analysis of the DCASE 2016 Task Results
Lafay G Lagrange M
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017) New Paltz, NY, USA 15 Oct 2017 - 18 Oct 2017., 11-15.

DOI 10.1109/WASPAA.2017.8169985

QMRO

15-10-2017

Neural Music Language Models: investigating the training process
Ycart A Benetos E
International Conference of Students of Systematic Musicology.

QMRO

13-09-2017

Polyphonic note and instrument tracking using linear dynamical systems
Benetos E
2017 AES International Conference on Semantic Audio Erlangen, Germany 22 Jun 2017 - 24 Jun 2017.

DOI 10.17743/aesconf.2017.978-1-942220-15-2

QMRO

22-06-2017

Assessing the Relevance of Onset Information for Note Tracking in Piano Music Transcription
Valero-Mas JJ Benetos E
2017 AES International Conference on Semantic Audio Erlangen, Germany 22 Jun 2017 - 24 Jun 2017.

DOI 10.17743/aesconf.2017.978-1-942220-15-2

QMRO

22-06-2017

Automatic Transcription of a Cappella Recordings from Multiple Singers
Schramm R
2017 AES International Conference on Semantic Audio Erlangen, Germany 22 Jun 2017 - 24 Jun 2017.

DOI 10.17743/aesconf.2017.978-1-942220-15-2

QMRO

22-06-2017

On-bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts
Stowell D Benetos E Gill LF
IEEE/ACM Transactions on Audio, Speech and Language Processing, IEEE vol. 25 (6), 1193-1206.

DOI 10.1109/TASLP.2017.2690565

QMRO

23-05-2017

Polyphonic Sound Event Tracking using Linear Dynamical Systems
Benetos E Lafay G Plumbley MD
IEEE/ACM Transactions on Audio, Speech and Language Processing, IEEE vol. 25 (6), 1266-1277.

DOI 10.1109/TASLP.2017.2690576

QMRO

23-05-2017

On the Memory Properties of Recurrent Neural Models
Russell AJ Benetos E
International Joint Conference on Neural Networks (IJCNN 2017) Anchorage, Alaska, USA 14 May 2017 - 19 May 2017., 2596-2603.

DOI 10.1109/IJCNN.2017.7966173

QMRO

14-05-2017

The Digital Music Lab: A Big Data Infrastructure for Digital Musicology
Abdallah S Benetos E Gold N Hargreaves S
ACM Journal on Computing and Cultural Heritage, ACM vol. 10 (1)

DOI 10.1145/2983918

QMRO

01-01-2017

2016

Towards a Music Language Model for Audio Analysis
Ycart A Benetos E
DMRN+11: Digital Music Research Network One-day Workshop 2016 Centre for Digital Music, Queen Mary University of London 20 Dec 2016.

QMRO

20-12-2016

Automatic Transcription of Vocal Quartets
Benetos E
DMRN+11: Digital Music Research Network One-day Workshop 2016 Centre for Digital Music, Queen Mary University of London 20 Dec 2016.

QMRO

20-12-2016

Classification-based Note Tracking for Automatic Music Transcription
Valero-Mas JJ Benetos E
9th International Workshop on Machine Learning and Music Riva del Garda, Italy 23 Sep 2016., 61-65.

QMRO

23-09-2016

Digital Music Lab: A Framework for Analysing Big Music Data
Abdallah S Gold N Hargreaves S Weyde T Wolff D
24th European Signal Processing Conference Budapest, Hungary 29 Aug 2016 - 2 Sep 2016., 1118-1122.

DOI 10.1109/EUSIPCO.2016.7760422

QMRO

29-08-2016

The Sousta corpus: Beat-informed automatic transcription of traditional dance tunes
Holzapfel A Benetos E
17th International Society for Music Information Retrieval Conference New York, USA 7 Aug 2016 - 11 Aug 2016., 531-537.

QMRO

07-08-2016

Learning a feature space for similarity in world music
Panteli M Benetos E Dixon S
17th International Society for Music Information Retrieval Conference New York, USA 7 Aug 2016 - 11 Aug 2016., 538-544.

QMRO

07-08-2016

An attack/decay model for piano transcription
Cheng T Mauch M Benetos E Dixon S
17th International Society for Music Information Retrieval Conference New York, USA 7 Aug 2016 - 11 Aug 2016., 584-590.

QMRO

07-08-2016

A morphological model for simulating acoustic scenes and its application to sound event detection
Lafay G Lagrange M Rossignol M Benetos E
IEEE/ACM Transactions on Audio, Speech, and Language Processing, IEEE vol. 24 (10), 1854-1864.

DOI 10.1109/TASLP.2016.2587218

QMRO

01-07-2016

Automatic detection of outliers in world music collections
Panteli M Benetos E Dixon S
Fourth International Conference on Analytical Approaches to World Music (AAWM 2016) New York, USA 8 Jun 2016 - 11 Jun 2016.

QMRO

08-06-2016

Detection of Overlapping Acoustic Events Using a Temporally-Constrained Probabilistic Model
Benetos E Lafay G Lagrange M
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)., 6450-6454.

DOI 10.1109/icassp.2016.7472919

QMRO

01-03-2016

An End-to-End Neural Network for Polyphonic Piano Music Transcription
Sigtia S Benetos E Dixon S
IEEE Transactions on Audio Speech and Language Processing, Institute of Electrical and Electronics Engineers (IEEE) vol. 24 (5), 927-939.

DOI 10.1109/taslp.2016.2533858

QMRO

23-02-2016

2015

An efficient temporally-constrained probabilistic model for multiple-instrument music transcription
Benetos E
16th International Society for Music Information Retrieval Conference (ISMIR) Malaga, Spain 26 Oct 2015 - 30 Oct 2015., 701-707.

QMRO

26-10-2015

Automatic transcription of Turkish microtonal music
Benetos E Holzapfel A
Journal of the Acoustical Society of America, Acoustical Society of America vol. 138 (4), 2118-2130.

DOI 10.1121/1.4930187

QMRO

14-10-2015

Detection and Classification of Acoustic Scenes and Events
Stowell D Giannoulis D Benetos E Lagrange M Plumbley MD
IEEE Transactions on Multimedia vol. 17 (10), 1733-1746.

DOI 10.1109/TMM.2015.2428998

QMRO

01-10-2015

Alternate level clustering for drum transcription
Rossignol M Lagrange M Lafay G
23rd European Signal Processing Conference (EUSIPCO) Nice, France 31 Aug 2015 - 4 Sep 2015., 2068-2072.

DOI 10.1109/EUSIPCO.2015.7362739

QMRO

31-08-2015

Automatic transcription and pitch analysis of the British Library World & Traditional Music Collection
Abdallah S Alencar-Brayner A Benetos E Cottrell S Dykes J Gold N Kachkaev A Tidhar D
5th International Workshop on Folk Music Analysis Paris, France 10 Jun 2015 - 12 Jun 2015., 10-12.

QMRO

10-06-2015

A Hybrid Recurrent Neural Network for Music Transcription
Sigtia S Benetos E Boulanger-Lewandowski N Weyde T Garcez ASD Dixon S
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)., 2061-2065.

DOI 10.1109/icassp.2015.7178333

QMRO

01-04-2015

2014

Template Adaptation for Improving Automatic Music Transcription
Benetos E Badeau R Weyde T
15th International Society for Music Information Retrieval Conference (ISMIR) Taipei, Taiwan 27 Oct 2014 - 31 Oct 2014., 175-180.

QMRO

27-10-2014

The temperament police
Tidhar D Dixon S Benetos E Weyde T
Early Music, Oxford University Press (OUP) vol. 42 (4), 579-590.

DOI 10.1093/em/cau101

QMRO

11-10-2014

Big Data for Musicology
Weyde T Cottrell S Dykes J Benetos E Wolff D Tidhar D Kachkaev A Plumbley M et al.
Proceedings of the 1st International Workshop on Digital Libraries for Musicology., 1-3.

DOI 10.1145/2660168.2660187

QMRO

12-09-2014

Incremental dataset definition for large scale musicological research
Wolff D Tidhar D Benetos E Dumon E Cherla S Page K Fields B
1st International Digital Libraries for Musicology workshop London, UK 12 Sep 2014.

DOI 10.1145/2660168.2660176

QMRO

12-09-2014

Learning motion-difference features using Gaussian restricted Boltzmann machines for efficient human action recognition
Tran S Benetos E d'Avila Garcez A
2014 International Joint Conference on Neural Networks (IJCNN) Beijing, China 6 Jul 2014 - 11 Jul 2014., 2123-2129.

DOI 10.1109/IJCNN.2014.6889945

QMRO

06-07-2014

Incorporating pitch class profiles for improving automatic transcription of Turkish makam music
Benetos E Holzapfel A Holzapfel A
4th International Workshop on Folk Music Analysis Istanbul, Turkey 12 Jun 2014 - 13 Jun 2014., 15-20.

QMRO

13-06-2014

Improving instrument recognition in polyphonic music through system integration
Giannoulis D Benetos E Klapuri A
IEEE International Conference on Acoustics, Speech, and Signal Processing Florence, Italy 4 May 2014 - 9 May 2014., 5259-5263.

DOI 10.1109/ICASSP.2014.6854599

QMRO

04-05-2014

Improving automatic music transcription through key detection
Benetos E Weyde T
AES 53rd International Conference on Semantic Audio London, UK 27 Jan 2014 - 29 Jan 2014.

QMRO

29-01-2014

RNN-based Music Language Models for Improving Automatic Music Transcription
Sigtia S Benetos E Cherla S Weyde T Garcez A Dixon S
15th International Society for Music Information Retrieval Conference., 53-58.

QMRO

01-01-2014

Automatic Transcription Of Pitched And Unpitched Sounds From Polyphonic Music
Benetos E Ewert S Weyde T
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)., 3131-3135.

DOI 10.1109/ICASSP.2014.6854172

QMRO

01-01-2014

The DML Research Project: Digital Music Lab - Analysing Big Music Data
Barthet M Benetos E Cottrell S Dixon S Dykes J Gold N Mahey M Plumbley MD et al.

01-01-2014

2013

Automatic transcription of Turkish makam music
Benetos E Holzapfel A
14th International Society for Music Information Retrieval Conference Curitiba, PR, Brazil 4 Nov 2013 - 8 Nov 2013., 355-360.

QMRO

08-11-2013

Explicit duration hidden Markov models for multiple-instrument polyphonic music transcription
Benetos E Weyde T
14th International Society for Music Information Retrieval Conference Curitiba, PR, Brazil 4 Nov 2013 - 8 Nov 2013., 269-274.

QMRO

08-11-2013

A machine learning approach to voice separation in lute tablature
de Valk R Weyde T Britto AS Gouyon F Dixon S
14th International Society for Music Information Retrieval Conference Curitiba, PR, Brazil 4 Nov 2013 - 8 Nov 2013., 555-560.

QMRO

04-11-2013

Detection and Classification of Acoustic Scenes and Events: An IEEE AASP Challenge
Giannoulis D Benetos E Stowell D Rossignol M Lagrange M Plumbley MD
2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics., 1-4.

DOI 10.1109/waspaa.2013.6701819

QMRO

01-10-2013

A database and challenge for acoustic scene classification and event detection
Giannoulis D Stowell D Benetos E Rossignol M Lagrange M Plumbley MD
21st European Signal Processing Conference Marrakech, Morocco.
01-09-2013

An efficient shift-invariant model for polyphonic music transcription
Benetos E Cherla S
6th International Workshop on Machine Learning and Music Prague, Czech Republic.

QMRO

01-09-2013

Automatic music transcription: challenges and future directions
Benetos E Dixon S Giannoulis D Kirchhoff H Klapuri A
Journal of Intelligent Information Systems, Springer Nature vol. 41 (3), 407-434.

DOI 10.1007/s10844-013-0258-3

QMRO

25-07-2013

Roadmap for Music Information ReSearch
Serra X Magas M Benetos E Chudy M Dixon S Flexer A Gomez E Gouyon F et al.

02-05-2013

Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model
Benetos E Dixon S
Journal of the Acoustical Society of America vol. 133 (3), 1727-1741.

DOI 10.1121/1.4790351

01-03-2013

2012

Automatic Music Transcription: Breaking the Glass Ceiling
Benetos E Dixon S Giannoulis D Kirchhoff H Klapuri A
13th International Society for Music Information Retrieval Conference (ISMIR 2012) Porto, Portugal 8 Oct 2012 - 12 Oct 2012., 379-384.
12-10-2012

Score-Informed Transcription for Automatic Piano Tutoring
Benetos E Klapuri A Dixon S
2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)., 2153-2157.
01-01-2012

A Shift-Invariant Latent Variable Model for Automatic Music Transcription
Benetos E Dixon S
Computer Music Journal vol. 36 (4), 81-94.

DOI 10.1162/COMJ_a_00146

01-01-2012

Temporally-Constrained Convolutive Probabilistic Latent Component Analysis for Multi-pitch Detection
Benetos E Dixon S
Lecture Notes in Computer Science. vol. 7191, 364-371.

DOI 10.1007/978-3-642-28551-6_45

01-01-2012

Characterisation of acoustic scenes using a temporally-constrained shift-invariant model
Benetos E Lagrange M Dixon S
15th International Conference on Digital Audio Effects, DAFx 2012 Proceedings.
01-01-2012

Sit-stand and stand-sit transitions in older adults and patients with Parkinson’s disease: event detection based on motion sensors versus force plates
Zijlstra A Mancini M Lindemann U Chiari L
Journal of Neuroengineering and Rehabilitation, Springer Nature vol. 9 (1)

DOI 10.1186/1743-0003-9-75

01-01-2012

2011

Joint multi-pitch detection using harmonic envelope estimation for polyphonic music transcription
Benetos E Dixon S
IEEE Journal on Selected Topics in Signal Processing vol. 5 (6), 1111-1123.

DOI 10.1109/JSTSP.2011.2162394

01-10-2011

Multiple-instrument polyphonic music transcription using a convolutive probabilistic model
Benetos E Dixon S
8th Sound and Music Computing Conference Padova, Italy 6 Jul 2011 - 9 Jul 2011., 19-24.
01-07-2011

Automatically detecting key modulations in J.S. Bach chorale recordings
Mearns L Benetos E Dixon S
8th Sound and Music Computing Conference., 25-32.
01-07-2011

Polyphonic music transcription using note onset and offset detection
Benetos E Dixon S
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings., 37-40.

DOI 10.1109/ICASSP.2011.5946322

01-01-2011

A Temporally-Constrained Convolutive Probabilistic Model for Pitch Detection
Benetos E Dixon S
2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)., 133-136.

DOI 10.1109/ASPAA.2011.6082270

01-01-2011

The temperament police: The truth, the ground truth, and nothing but the truth
Dixon S Tidhar D Benetos E
12th International Society for Music Information Retrieval Conference Miami, Florida, USA 24 Oct 2011 - 28 Oct 2011., 281-286.
01-01-2011

2010

Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution
Benetos E Dixon S
ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition., 13-18.
01-09-2010

Auditory Spectrum-Based Pitched Instrument Onset Detection
Benetos E Stylianou Y
IEEE Transactions on Audio Speech and Language Processing, Institute of Electrical and Electronics Engineers (IEEE) vol. 18 (8), 1968-1977.

DOI 10.1109/tasl.2010.2040785

19-01-2010

Non-Negative Tensor Factorization Applied to Music Genre Classification
Benetos E Kotropoulos C
IEEE Transactions on Audio Speech and Language Processing, Institute of Electrical and Electronics Engineers (IEEE) vol. 18 (8), 1955-1967.

DOI 10.1109/tasl.2010.2040784

19-01-2010

Improving music genre classification using automatically induced harmony rules
Anglade A Benetos E Mauch M Dixon S
Journal of New Music Research vol. 39 (4), 349-361.

DOI 10.1080/09298215.2010.525654

01-01-2010

2009

Pitched instrument onset detection based on auditory spectra
Benetos E Holzapfel A
Proceedings of the 10th International Society for Music Information Retrieval Conference, ISMIR 2009., 105-110.
01-01-2009

2008

A tensor-based approach for automatic music genre classification
Benetos E Kotropoulos C
16th European Signal Processing Conference.
01-08-2008

MUSCLE movie-database: a multimodal corpus with rich annotation for dialogue and saliency detection
Spachos D Zlantintsi A Moschou V Antonopoulos P Benetos E Kotti M Tzimouli K Kotropoulos C et al.
6th Language Resources and Evaluation Conference., 16-19.
01-05-2008

Music Genre Classification: A Multilinear Approach.
Panagakis I Benetos E Kotropoulos C Bello JP Chew E Turnbull D
ISMIR., 583-588.
01-01-2008

Movie analysis with emphasis to dialogue and action scene detection
Benetos E Siatras S Kotropoulos C Nikolaidis N
In Multimodal Processing and Interaction, Springer 157-177.

DOI 10.1007/978-0-387-76316-3_7

01-01-2008

2007

A neural network approach to audio-assisted movie dialogue detection
Kotti M Benetos E
Neurocomputing, Elsevier vol. 71 (1-3), 157-166.

DOI 10.1016/j.neucom.2007.08.006

01-12-2007

Systematic comparison of BIC-based speaker segmentation systems
Moschou V Benetos E Kotropoulos C
2007 IEEE 9th Workshop on Multimedia Signal Processing., 66-69.

DOI 10.1109/mmsp.2007.4412819

01-10-2007

Neural network-based movie dialogue detection
Kotti M Benetos E
10th International Conference on Engineering Applications of Neural Networks.
01-08-2007

Large scale musical instrument identification
Benetos E Kotti M Kotropoulos C
4th Sound and Music Computing Conference., 283-286.
01-07-2007

2006

Applying Supervised Classifiers Based on Non-negative Matrix Factorization to Musical Instrument Classification
Benetos E Kotropoulos C
2006 IEEE International Conference on Multimedia and Expo., 2105-2108.

DOI 10.1109/icme.2006.262650

01-07-2006

Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches
Kotti M Martins LGPM Cardoso JS
2006 IEEE International Conference on Multimedia and Expo., 1101-1104.

DOI 10.1109/icme.2006.262727

01-07-2006

Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme
Kotti M Benetos E
2006 IEEE International Symposium on Circuits and Systems., 4 pp.

DOI 10.1109/iscas.2006.1692970

01-01-2006

Musical instrument classification using non-negative matrix factorization algorithms
Benetos E Kotropoulos C
2006 IEEE International Symposium on Circuits and Systems., 4 pp.

DOI 10.1109/iscas.2006.1692967

01-01-2006

Musical instrument classification using non-negative matrix factorization algorithms and subset feature selection
Benetos E Kotti M
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. vol. 5
01-01-2006

Testing supervised classifiers based on non-negative matrix factorization to musical instrument classification
Benetos E Kotropoulos C Lidy T
European Signal Processing Conference.
01-01-2006

2005

Comparison of subspace analysis-based and statistical model-based algorithms for musical instrument classification
Benetos E Kotti M Kotropoulos C Burred JJ Eisenberg G Sikora T
2nd Workshop On Immersive Communication And Broadcast Systems.
01-10-2005

Melodic expectation as an elicitor of music-evoked chills
de Fleurian R Clemente A Benetos E Pearce MT
In bioRxiv

DOI 10.1101/2024.10.02.616280

From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems
35th IEEE International Workshop on Machine Learning for Signal Processing.

DOI 10.1109/MLSP62443.2025.11204254

QMRO

Sound Matching with a Differentiable Karplus-Strong Algorithm
Tablas De Paula P Marttila D Díaz R Román I Benetos E Reiss JD
The 29th International Conference on Digital Audio Effects Cambridge, MA, USA 1 Sep 2026 - 4 Sep 2026.

Adapting Language-Audio Models as Few-Shot Audio Learners
Liang J Benetos E Phan H
INTERSPEECH 2023.

DOI 10.21437/Interspeech.2023-1082

QMRO

MuPT: A Generative Symbolic Music Pretrained Transformer
Qu X Bai Y Ma Y Zhou Z Lo KM Liu J Yuan R Min L et al.
The Thirteenth International Conference on Learning Representations Singapore 23 Apr 2025 - 28 Apr 2025.

DOI 10.48550/arxiv.2404.06393

QMRO

Does synchronised singing enhance social bonding more than speaking does? A global experimental Stage 1 Registered Report
Savage PE Ampiah-Bonney A Arabadjiev A Arhine A Ariza JF Bamford JS Barbosa BS Beck A-K et al.
In PsyArXiv

DOI 10.31234/osf.io/pv3m9

Grants

Grants of specific relevance to the Centre for Multimodal AI

Intelligent Urban Noise Monitoring
Lin Wang, Emmanouil Benetos and Andrea Cavallaro
£31,974 Engineering and Physical Sciences Research Council
01-06-2026 - 31-10-2026

Large Language Models for Multimodal Music Understanding and Ethical Audio Generation
Emmanouil Benetos
£40,314 Google LLC
01-10-2025 - 30-09-2027

Style classification of podcasts using audio
Emmanouil Benetos
£33,000 Spotify Ltd
01-03-2024 - 28-02-2027

UKRI Centre for Doctoral Training in Artificial Intelligence and Music
Simon Dixon, Emmanouil Benetos, Nicholas Bryan-Kinns, Mark Sandler, Andrew McPherson, Mathieu Barthet, George Fazekas, Ekaterina Ivanova, Anna Xambo Sedo and Charalampos Saitis
£6,522,646 Engineering and Physical Sciences Research Council
01-07-2019 - 31-08-2028

Neural Fingerprinting Optimization
Emmanouil Benetos
£20,157 Sound Patrol
15-01-2026 - 15-07-2026

Improved Bioacoustic Encoding
Johan Pauwels and Emmanouil Benetos
£25,000 Earth Species Project
17-11-2025 - 17-05-2026

Self supervision in Audio Fingerprinting
Emmanouil Benetos
£1,854 The Alan Turing Institute
01-10-2025 - 30-06-2026

Online Speech Enhancement in Scenarios with Low Direct-to-Reverberant-Ratio
Emmanouil Benetos and Aidan Hogg
£65,621 L-Acoustics UK Limited
01-09-2024 - 28-02-2025

Enhancing lyrics transcription with open-source architectures and fine-tuning techniques
Emmanouil Benetos
£11,700 Moises Systems, Inc.
07-07-2024 - 06-10-2024

Project Maestro - AI Musical Analysis Platform
Emmanouil Benetos and Simon Dixon
£166,349 Innovate UK
01-07-2024 - 30-06-2026

Music Performance Assessment and Feedback (MPAF)
Simon Dixon and Emmanouil Benetos
£250,000 Associated Board of the Royal Schools of Music
24-06-2024 - 23-12-2025

Resource-efficient machine listening
Emmanouil Benetos
£52,455 Royal Academy of Engineering
01-10-2023 - 31-07-2024

Deep learning technologies for multi-instrument automatic music transcription
Emmanouil Benetos and Simon Dixon
£252,000 Huawei Technologies
15-02-2022 - 14-11-2023

Towards complete music transcription: converting performance MIDI to quantized MIDI
Emmanouil Benetos
£8,125 TikTok ByteDance UK (TikTok Information Technologies UK Limited)
04-10-2021 - 04-01-2022

Graph Networks for Explainable Artificial Intelligence
Andrea Cavallaro and Emmanouil Benetos
£293,434 Engineering and Physical Sciences Research Council
01-08-2021 - 31-12-2024

Unsupervised detection of sound events for complex audio
Emmanouil Benetos
£3,800 Royal Society
30-03-2021 - 29-03-2023

Industry-scale machine listening for music and audio data
Simon Dixon and Emmanouil Benetos
£108,000 Spotify Ltd
14-09-2020 - 31-01-2025

Development of Next Generation Music Recognition Algorithm for Content Monitoring
Simon Dixon and Emmanouil Benetos
£159,918 Innovate UK
01-11-2019 - 30-06-2021

Integrating sound and context recognition for acoustic scene analysis
Emmanouil Benetos
£97,838 Engineering and Physical Sciences Research Council
03-04-2018 - 02-06-2019

A Machine Learning Framework for Audio Analysis and Retrieval
Emmanouil Benetos
£404,470 Royal Academy of Engineering
30-03-2015 - 29-03-2020

News

July 2026

New AI benchmark helps music generation systems better understand human preference

9 July 2026

Researchers at Queen Mary University of London have developed a new benchmark and evaluation system designed to help AI-generated music better align with human preferences, marking an important step towards more creative, controllable, and human-centred music-generation technologies. The study, CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction, introduces a ... [more]

June 2026

Centre for Multimodal AI at ICML 2026

11 June 2026

On 6-11 July, CMAI researchers will participate at the 43rd International Conference on Machine Learning (ICML), taking place in Seoul, South Korea. ICML is an international academic conference in machine learning held annually since 1980. It is the oldest and, along with NeurIPS and ICLR, one of the three primary conferences ... [more]

April 2026

Centre for Multimodal AI at ICASSP 2026

20 April 2026

On 4-8 May 2026, several CMAI researchers will participate at the 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2026). ICASSP is the leading conference in the field of signal processing and the flagship event of the IEEE Signal Processing Society. As in previous years, the Centre for Multimodal AI ... [more]

Centre for Multimodal AI at ICLR 2026

9 April 2026

On 23-27 April, CMAI researchers will participate at the Fourteenth International Conference on Learning Representations (ICLR 2026), taking place in Rio de Janeiro, Brazil. ICLR is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence called representation learning, but generally referred to as deep learning. ... [more]

January 2026

Reimagining music videos with AI: CMAI research breaks new ground

6 January 2026

Yinghao Ma, a PhD candidate in the Centre for Multimodal AI at Queen Mary University of London, has helped develop AutoMV, the first open-source AI system capable of generating complete music videos directly from full-length songs. Music-to-video generation remains a major challenge for generative AI. While recent video models can ... [more]

November 2025

CMAI at NeurIPS 2025

17 November 2025

On 2-7 December, several CMAI researchers will participate at the 39th Annual Conference on Neural Information Processing Systems (NeurIPS 2025), taking place in San Diego. NeurIPS is a prestigious annual academic conference and non-profit foundation that fosters the exchange of research in artificial intelligence (AI), machine learning (ML), and computational neuroscience. ... [more]

October 2025

CMAI PhD student awarded Google PhD Fellowship

23 October 2025

We are extremely proud to announce that Yinghao Ma, PhD student in AI and Music at the Centre for Multimodal AI of QMUL and supervised by Dr Emmanouil Benetos, has been awarded the 2025 Google Fellowship in Machine Perception. A Google spokesperson said: "The student nominations we received this year were ... [more]

CMAI at WASPAA 2025

6 October 2025

On 12-15 October, several CMAI researchers will participate at the 2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, taking place at the Granlibakken Tahoe Resort near Lake Tahoe, in Tahoe City, CA, USA. WASPAA is a premier event in the field of audio signal processing, organised by ... [more]

September 2025

CMAI at ISMIR 2025

8 September 2025

On 21-25 September 2025, several CMAI researchers will participate at the 26th International Society for Music Information Retrieval Conference (ISMIR 2025). ISMIR is the leading conference in the field of music informatics, and is currently the top-cited publication for Music & Musicology (source: Google Scholar). This year ISMIR will take place onsite in ... [more]

CMAI organises AES AIMLA 2025 conference

3 September 2025

The AES International Conference on Artificial Intelligence and Machine Learning for Audio (AIMLA 2025) will be hosted by the Centre for Multimodal AI of Queen Mary University of London and will take place on 8-10 September 2025. Several CMAI members are involved in the organisation of the conference, including but not limited ... [more]

July 2025

CMAI student to join the Alan Turing Institute in 2025-2026

30 July 2025

CMAI PhD student Aditya Bhattacharjee has been awarded an enrichment placement by the Alan Turing Institute, the UK's national institute in artificial intelligence and data science, enabling him to join and interact with institute researchers and its community in the 2025/26 academic year. Aditya is supervised by Dr Emmanouil Benetos and ... [more]

June 2025

CMAI at IJCNN 2025 conference

14 June 2025

On 30 June - 5 July 2025, CMAI researchers will participate at the IEEE International Joint Conference on Neural Networks (IJCNN 2025), the flagship conference of the IEEE Computational Intelligence Society and the International Neural Network Society. The Centre for Multimodal AI will have a strong presence at the conference. The following papers authored/... [more]

April 2025

CMAI at ICLR 2025

14 April 2025

On 24-28 April, CMAI researchers will participate at the Thirteenth International Conference on Learning Representations (ICLR 2025), taking place in Singapore. ICLR is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence called representation learning, but generally referred to as deep learning. CMAI members will ... [more]

March 2025

CMAI at ICASSP 2025

24 March 2025

On 6-11 April 2025, several CMAI researchers will participate at the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2025). ICASSP is the leading conference in the field of signal processing and the flagship event of the IEEE Signal Processing Society. As in previous years, the Centre for Multimodal AI ... [more]

CMAI researchers pioneer AI that can "hear": a breakthrough in multimodal generative AI

20 March 2025

Researchers at the Centre for Multimodal AI have developed a novel approach that enables large language models (LLMs) to "hear" and "understand" sound. Read more at: https://www.qmul.ac.uk/eecs/news-and-events/news/items/eecs-phd-researcher-pioneers-ai-that-can-hear-a-breakthrough-in-multimodal-generative-ai.html

May 2024

Singing researchers uncover cross-cultural patterns in music and language

17 May 2024

Predictable melodies in songs may aid social bonding and group synchronisation, according to researchers. Key points: Over 75 researchers from 46 countries participated in the study, singing traditional songs and speaking in their native languages. The study found that songs tend to have slower rhythms and higher pitches than speech, suggesting that ... [more]
