Dr Emmanouil Benetos

Emmanouil Benetos
FHEA

Reader in Machine Listening
Director of Research, Deputy Director of the UKRI Centre for Doctoral Training in AI and Music

School of Electronic Engineering and Computer Science
Queen Mary University of London
ResearcherID ORCID Scopus Google Scholar LinkedIn X

Research

Machine listening / computer audition, Machine learning for audio and sequential data, Music information retrieval, Multimodal AI, Resource-efficient AI

Interests

I am currently Reader in Machine Listening and Director of Research at the School of Electronic Engineering and Computer Science of Queen Mary University of London. Within Queen Mary, I am member of the Centre for Digital Music, Centre for Multimodal AI, Centre for Intelligent Sensing, and Digital Environment Research Institute, and I co-lead the School's Machine Listening Lab.

My main research topic is computational audio analysis, also referred to as machine listening or computer audition - applied to music, urban, everyday and nature sounds. I have been Royal Academy of Engineering / Leverhulme Trust Research Fellow in resource-efficient machine listening, Turing Fellow at the Alan Turing Institute, Royal Academy of Engineering Research Fellow, and have been principal- and co-investigator for several audio-related funded research projects on topics related to sound scene analysis, music information retrieval, and digital musicology. I am also Deputy Director for the UKRI Centre for Doctoral Training in Artificial Intelligence and Music (AIM).

On academic service, I am currently secretary for the International Society for Music Information Retrieval (ISMIR), member of the IEEE Technical Committee on Audio and Acoustic Signal Processing (AASP TC), member of the EURASIP Acoustic, Speech and Music Signal Processing Technical Area Committee (ASMSP TAC), associate editor for the IEEE/ACM Transactions on Audio, Speech, and Language Processing, and associate editor for the EURASIP Journal on Audio, Speech, and Music Processing.

Publications

Relevant PublicationPublications of specific relevance to the Centre for Multimodal AI

2024

Relevant PublicationDeng Q, Yang Q, Yuan R, Huang Y, Wang Y, Liu X, Tian Z, Pan J, Zhang G, Lin H, Li Y, Ma Y, Fu J, Lin C, Benetos E, Wang W, Xia G, Xue W and Guo Y (2024). ComposerX: Multi-Agent Symbolic Music Composition with LLMs. 25th International Society for Music Information Retrieval Conference (ISMIR), San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024
10-11-2024
bullet iconSteinmetz C, Singh S, Comunit� M, Ibnyahya I, Yuan S, Benetos E and Reiss J (2024). ST-ITO: Controlling audio effects for style transfer with inference-time optimization. 25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024
10-11-2024
Relevant PublicationZhou Z, Wu Y, Wu Z, Zhang X, Yuan R, Ma Y, Wang L, Benetos E, Xue W and Guo Y (2024). Can LLMs Reason in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation. 25th International Society for Music Information Retrieval Conference (ISMIR) San Franscisco, CA, USA 10 Nov 2024 - 14 Nov 2024
10-11-2024
Relevant PublicationWeck B, Manco I, Benetos E, QUINTON E, Fazekas G and Bogdanov D (2024). MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models. 25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024
10-11-2024
Relevant PublicationChang SK, Benetos E, KIRCHHOFF H and Dixon S (2024). ˜YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation. IEEE International Workshop on Machine Learning for Signal Processing (MLSP) London, UK 22 Sep 2024 - 25 Sep 2024
22-09-2024
bullet iconTorrisi A, De Almeida Nolasco IS, Versace E and Benetos E (2024). Exploratory analysis of early-life chick calls. 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR) Kos, Greece 6 Sep 2024
06-09-2024
Relevant PublicationHuang J and Benetos E (2024). Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model. 32nd European Signal Processing Conference (EUSIPCO) Lyon, France 26 Aug 2024 - 30 Aug 2024
26-08-2024
Relevant PublicationYuan R, Lin H, Wang Y, Tian Z, Wu S, Shen T, Zhang G, Wu Y, Liu C, Zhou Z, Ma Z, Xue L, Wang Z, Liu Q, Zheng T, Li Y, Ma Y, Liang Y, Chi X, Liu R, et al. (2024). ChatMusician: Understanding and Generating Music Intrinsically with LLM. 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) Bangkok, Thailand 11 Aug 2024 - 16 Aug 2024
11-08-2024
Relevant PublicationXompero A, Bontonou M, Arbona J-M, Benetos E and Cavallaro A (2024). Explaining models relating objects and privacy. 3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024 Seattle Convention Center, Seattle WA, USA 18 Jun 2024
18-06-2024
Relevant PublicationDeng Z, Ma Y, Liu Y, Guo R, Zhang G, Chen W, Huang W and Benetos E (2024). MusiLingo: bridging music and text with pre-trained language models for music captioning and query response. 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) Mexico City, Mexico 16 Jun 2024 - 21 Jun 2024
16-06-2024
bullet iconOzaki Y, Tierney A, Pfordresher PQ, McBride JM, Benetos E, Proutskova P, Chiba G, Liu F, Jacoby N, Purdy SC, Opondo P, Fitch WT, Hegde S, Rocamora M, Thorne R, Nweke F, Sadaphal DP, Sadaphal PM, Hadavi S, Fujii S, et al. (2024). Globally, songs and instrumental melodies are slower and higher and use more stable pitches than speech: A Registered Report. Science Advances, American Association for the Advancement of Science (AAAS) vol. 10 (20) 
15-05-2024
Relevant PublicationLiang J, Zhang H, Liu H, Cao Y, Kong Q, Liu X, Wang W, Plumbley MD, Phan H and Benetos E (2024). WavCraft: audio editing and generation with large language models. ICLR 2024 Workshop on LLM Agents Vienna, Austria 11 May 2024
11-05-2024
Relevant PublicationLi Y, Yuan R, Zhang G, Ma Y, Chen X, Yin H, Xiao C, Lin C, Ragni A, Benetos E, Gyenge N, Dannenberg R, Liu R, Chen W, Xia G, Shi Y, Huang W, Wang Z, Guo Y and Fu J (2024). MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training. International Conference on Learning Representations (ICLR) Vienna, Austria 7 May 2024 - 11 May 2024
07-05-2024
bullet iconLiang J, Phan QH and Benetos E (2024). Learning from taxonomy: multi-label few-shot classification for everyday sound recognition. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024
14-04-2024
Relevant PublicationLi D, Ma Y, Wei W, KONG Q, Wu Y, Che M, Xia F, Benetos E and Li W (2024). MERTech: instrument playing technique detection using self-supervised pretrained model with multi-task finetuning. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024
14-04-2024
Relevant PublicationPostolache E, Mariani G, Cosmo L, Benetos E and Rodola E (2024). Generalized multi-source inference for text conditioned music diffusion models. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024
14-04-2024
bullet iconLiang J, Nolasco I, Ghani B, Phan H, Benetos E and Stowell D (2024). Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection. 32nd European Signal Processing Conference (EUSIPCO 2024) Lyon, France 26 Aug 2024 - 30 Aug 2024
27-03-2024
Relevant PublicationEDWARDS D, Dixon S, Benetos E, Maezawa A and Kusaka Y (2024). A Data-Driven Analysis of Robust Automatic Piano Transcription. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 681-685.  
08-02-2024
Relevant PublicationSingh S, Steinmetz C, Benetos E, Phan QH and Stowell D (2024). ATGNN: audio tagging graph neural network. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 825-829.  
17-01-2024

2023

Relevant PublicationManco I, Weck B, Doh S, Won M, Zhang Y, Bodganov D, Wu Y, Chen K, Tovstogan P, Benetos E, Quinton E, Fazekas G and Nam J (2023). The Song Describer dataset: a corpus of audio captions for music-and-language evaluation. NeurIPS Machine Learning for Audio Workshop New Orleans, USA 16 Dec 2023
16-12-2023
Relevant PublicationDeb O, Benetos E and Torr P (2023). Remaining-useful-life prediction and uncertainty quantification using LSTM ensembles for aircraft engines. NeurIPS Workshop on Advancing Neural Network Training (WANT): Computational Efficiency, Scalability, and Resource Optimization New Orleans, USA 16 Dec 2023
16-12-2023
Relevant PublicationYuan R, Ma Y, Li Y, Zhang G, Chen X, Yin H, Zhuo L, Liu Y, Huang J, Tian Z, Deng B, Wang N, Benetos E, Ragni A, Gyenge N, Dannenberg R, Chen W, Xia G, Xue W, Liu S, et al. (2023). MARBLE: Music Audio Representation Benchmark for Universal Evaluation. 37th Conference on Neural Information Processing Systems (NeurIPS) 10 Dec 2023 - 16 Dec 2023
10-12-2023
bullet iconRagano A, Benetos E and Hines A (2023). Learning Music Representations with wav2vec 2.0. 2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS)
08-12-2023
Relevant PublicationRagano A, Benetos E and Hines A (2023). Learning Music Representations with wav2vec 2.0. 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) Letterkenny, Ireland 7 Dec 2023
07-12-2023
Relevant PublicationPapaioannou C, Benetos E and Potamianos A (2023). From West to East: Who can understand the music of the others better? 24th International Society for Music Information Retrieval Conference (ISMIR) Milan, Italy 5 Nov 2023 - 9 Nov 2023
05-11-2023
Relevant PublicationMa Y, Yuan R, Li Y, Zhang G, Chen X, Yin H, Lin C, Benetos E, Ragni A, Gyenge N, Liu R, Xia G, Dannenberg R, Guo Y and Fu J (2023). On the effectiveness of speech self-supervised learning for music. 24th International Society for Music Information Retrieval Conference (ISMIR) Milan, Italy 5 Nov 2023 - 9 Nov 2023
05-11-2023
Relevant PublicationZhuo L, Yuan R, Pan J, Ma Y, Li Y, Zhang G, Liu S, Dannenberg R, Fu J, Lin C, Benetos E, Chen W, Xue W and Guo Y (2023). LyricWhiz: Robust Multilingual Lyrics Transcription by Whispering to ChatGPT. 24th International Society for Music Information Retrieval Conference (ISMIR) Milan, Italy 5 Nov 2023 - 9 Nov 2023
05-11-2023
Relevant PublicationSarkar S, Thorpe L, Benetos E and Sandler M (2023). Leveraging synthetic data for improving chamber ensemble separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) New Paltz, New York, USA 22 Oct 2023 - 25 Oct 2023
22-10-2023
Relevant PublicationVahidi C, Singh S, Benetos E, Phan QH, Stowell D, Fazekas G and Lagrange M (2023). Perceptual musical similarity metric learning with graph neural networks. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) New Paltz, NY, USA 22 Oct 2023 - 25 Oct 2023
22-10-2023
Relevant PublicationEdwards D, Dixon S and Benetos E (2023). PiJAMA: Piano Jazz with Automatic MIDI Annotations. Transactions of the International Society for Music Information Retrieval, Ubiquity Press vol. 6 (1), 89-102.  
15-09-2023
Relevant PublicationLiang J, Liu X, Liu H, Phan H, Benetos E, Plumbley M and Wang W (2023). Adapting Language-Audio Models as Few-Shot Audio Learners. 24th Annual Conference of the International Speech Communication Association (INTERSPEECH) Dublin, Ireland 20 Aug 2023 - 24 Aug 2023
20-08-2023
Relevant PublicationRagano A, Benetos E, Chinen M, Becerra H, Chandan Karadagur Ananda R, Skoglund J and Hines A (2023). A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality. Irish Signals & Systems Conference 2023 Dublin, Ireland 13 Jun 2023 - 14 Jun 2023
13-06-2023
Relevant PublicationRagano A, Benetos E and Hines A (2023). Audio Quality Assessment of Vinyl Music Collections Using Self-Supervised Learning. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 4 Jun 2023 - 10 Jun 2023
04-06-2023
Relevant PublicationLi Y, Cao W, Xie W, Li J and Benetos E (2023). Few-shot Class-incremental Audio Classification Using Dynamically Expanded Classifier with Self-attention Modified Prototypes. IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers vol. 26, 1346-1360.  
25-05-2023

2022

Relevant PublicationLi Y, Yuan R, Zhang G, Ma Y, Lin C, Chen X, Ragni A, Yin H, Hu Z, He H, Benetos E, Gyenge N, Liu R and Fu J (2022). Large-Scale Pretrained Model for Self-Supervised Music Audio Representation Learning. DMRN+17: Digital Music Research Network One-day Workshop 2022 London, UK 20 Dec 2022
20-12-2022
Relevant PublicationLiu L, KONG Q, Morfi G-V and Benetos E (2022). Performance MIDI-to-score conversion by neural beat tracking. 23rd International Society for Music Information Retrieval Conference (ISMIR) Bengaluru, India 4 Dec 2022 - 8 Dec 2022
18-12-2022
Relevant PublicationSarkar S, Benetos E and Sandler M (2022). EnsembleSet: A new high-quality synthesised dataset for chamber ensemble separation. 23rd International Society for Music Information Retrieval Conference (ISMIR) Bengaluru, India 5 Dec 2022 - 8 Dec 2022
08-12-2022
Relevant PublicationManco I, Benetos E, QUINTON E and Fazekas G (2022). Contrastive audio-language learning for music. 23rd International Society for Music Information Retrieval Conference (ISMIR) Bengaluru, India 4 Dec 2022 - 8 Dec 2022
04-12-2022
Relevant PublicationLiang J, Phan QH and Benetos E (2022). Leveraging label hierarchies for few-shot everyday sound recognition. 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) Nancy, France 3 Nov 2022 - 4 Nov 2022
03-11-2022
Relevant PublicationMai KT, Davies T, Griffi LD and Benetos E (2022). Explaining the decisions of anomalous sound detectors. 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) Nancy, France 3 Nov 2022 - 4 Nov 2022
03-11-2022
Relevant PublicationOzaki Y, Kuroyanagi J, McBride J, Proutskova P, Tierney A, Pfordresher P, Benetos E, Liu F and Savage PE (2022). Similarities and differences in a cross-linguistic sample of song and speech recordings. Joint Conference on Language Evolution Kanazawa, Japan 5 Sep 2022 - 8 Sep 2022
05-09-2022
Relevant PublicationWang C, Benetos E, Versace E and Wang S (2022). Joint Scattering for Automatic Chick Call Recognition. 30th European Signal Processing Conference Belgrade, Serbia 29 Aug 2022 - 2 Sep 2022
29-08-2022
Relevant PublicationSingh S, Benetos E and Phan QH (2022). Hypernetworks for sound event detection: a proof-of-concept. 30th European Signal Processing Conference (EUSIPCO 2022) Belgrade, Serbia 29 Aug 2022 - 3 Sep 2022
29-08-2022
Relevant PublicationDaikoku H, Ding S, Benetos E, Wood ALC, Shimizono T, Sanne US, Fujii S and Savage PE (2022). Agreement among human and automated estimates of similarity in a global music sample. 10th International Workshop on Folk Music Analysis (FMA 2022) Sheffield, UK 14 Jun 2022 - 17 Jun 2022
14-06-2022
Relevant PublicationHuang J, Benetos E and Ewert S (2022). Improving lyrics Alignment through Joint Pitch Detection. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing Singapore 22 May 2022 - 27 May 2022
22-05-2022
Relevant PublicationManco I, Benetos E, Quinton E and Fazekas G (2022). Learning music audio representations via weak language supervision. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing Singapore 22 May 2022 - 27 May 2022
22-05-2022
Relevant PublicationOu L, Guo Z, Benetos E, Han J and Wang Y (2022). Exploring transformer's potential on automatic piano transcription. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Singapore 7 May 2022 - 13 May 2022
07-05-2022
Relevant PublicationRagano A, Benetos E and Hines A (2022). Automatic Quality Assessment of Digitized and Restored Sound Archives. Journal of the Audio Engineering Society, Audio Engineering Society vol. 70 (4), 252-270.  
01-04-2022
Relevant PublicationWang C, Benetos E, Lostanlen V and Chew E (2022). Adaptive Scattering Transforms for Playing Technique Recognition. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 30, 1407-1421.  
07-03-2022
Relevant PublicationBenetos E, Ragano A, Sgroi D and Tuckwell A (2022). Measuring national mood with music: using machine learning to construct a measure of national valence from audio data. Behavior Research Methods, Springer (part of Springer Nature) 
25-02-2022
Relevant PublicationTerenzi A, Ortolani N, De Almeida Nolasco I, Benetos E and Cecchi S (2022). Comparison of feature extraction methods for sound-based classification of honey bee activity. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 30, 112-122.  
01-01-2022

2021

bullet iconBodo RPP, Benetos E and Queiroz M (2021). A framework for music similarity and cover song identification. 15th International Symposium on Computer Music Multidisciplinary Research (CMMR) Tokyo, Japan 15 Nov 2021 - 19 Nov 2021
15-11-2021
bullet iconLiu L, Morfi V and Benetos E (2021). ACPAS: A Dataset of Aligned Classical Piano Audio and Scores for Audio-to-Score Transcription. Late-Breaking Demo Session of the 22nd Int. Society for Music Information Retrieval Conference
11-11-2021
bullet iconVianna Lordelo C, Benetos E, Dixon S and Ahlbäck S (2021). Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes. 22nd International Society for Music Information Retrieval Conference (ISMIR) 9 Nov 2021 - 12 Nov 2021
09-11-2021
bullet iconOzaki Y, McBride J, Benetos E, Pfordresher PQ, Six J, T. Tierney A, Proutskova P, Sakai E, Kondo H, Fukatsu H, Fujii S and Savage PE (2021). Agreement among human and annotated transcriptions of global songs. 22nd International Society for Music Information Retrieval Conference (ISMIR) 9 Nov 2021 - 12 Nov 2021
09-11-2021
bullet iconO'Hanlon K, Benetos E and Dixon S (2021). Detecting cover songs with pitch class key-invariant networks. IEEE International Workshop on Machine Learning for Signal Processing (MLSP) Gold Coast, Queensland, Australia 25 Oct 2021 - 28 Oct 2021
25-10-2021
bullet iconHolzapfel A, Benetos E, Killick A and Widdess R (2021). Humanities and Engineering Perspectives on Music Transcription. Digital Scholarship in the Humanities, Oxford University Press (OUP) 
23-10-2021
Relevant PublicationSarkar S, Benetos E and Sandler M (2021). Vocal Harmony Separation using Time-domain Neural Networks. 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH) Brno, Czech Republic 30 Aug 2021 - 3 Sep 2021
30-08-2021
bullet iconBear H, Morfi V and Benetos E (2021). An evaluation of data augmentation methods for sound scene geotagging. 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH) Brno, Czech Republic 30 Aug 2021 - 3 Sep 2021
30-08-2021
Relevant PublicationZhao Y, Wang C, Fazekas G, Benetos E and Sandler M (2021). Violinist identification based on vibrato features. 29th European Signal Processing Conference (EUSIPCO) 23 Aug 2021 - 27 Aug 2021
23-08-2021
bullet iconCheuk KW, Luo Y-J, Benetos E and Herremans D (2021). Revisiting the onsets and frames model with additive attention. International Joint Conference on Neural Networks (IJCNN) 18 Jul 2021 - 22 Jul 2021
18-07-2021
bullet iconManco I, Benetos E, Quinton E and Fazekas G (2021). MusCaps: generating captions for music audio. International Joint Conference on Neural Networks (IJCNN) 18 Jul 2021 - 22 Jul 2021
18-07-2021
bullet iconLiu L and Benetos E (2021). From Audio to Music Notation. Handbook of Artificial Intelligence for Music , Editors: Miranda ER. 693-714.  
03-07-2021
bullet iconRagano A, Benetos E and Hines A (2021). More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations. 13th International Conference on Quality of Multimedia Experience (QoMEX) 14 Jun 2021 - 17 Jun 2021
14-06-2021
bullet iconSingh S, Bear H and Benetos E (2021). Prototypical Networks for Domain Adaptation in Acoustic Scene Classification. IEEE International Conference on Acoustics, Speech and Signal Processing Toronto, Canada 6 Jun 2021 - 11 Jun 2021
06-06-2021
bullet iconLiu L, Morfi G-V and Benetos E (2021). Joint multi-pitch detection and score transcription for polyphonic piano music. IEEE International Conference on Acoustics, Speech and Signal Processing Toronto, Canada 6 Jun 2021 - 11 Jun 2021
06-06-2021
Relevant PublicationSubramanian V, Gururani S, Benetos E and Sandler M (2021). Anomalous behaviour in loss-gradient based interpretability methods. RobustML workshop paper at ICLR 2021
07-05-2021
bullet iconCheuk KW, Benetos E, Luo Y and Herremans D (2021). The effect of spectrogram reconstructions on automatic music transcription: an alternative approach to improve transcription accuracy. 25th International Conference on Pattern Recognition (ICPR2020) Milan, Italy 10 Jan 2021 - 15 Jan 2021
10-01-2021
bullet iconLordelo C, Benetos E, Dixon S and Ahlbäck S (2021). PITCH-INFORMED INSTRUMENT ASSIGNMENT USING A DEEP CONVOLUTIONAL NETWORK WITH MULTIPLE KERNEL SHAPES. 
01-01-2021

2020

bullet iconVianna Lordelo C, Benetos E, Dixon S, Ahlbäck S and Ohlsson P (2020). Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 28, 81-85.  
18-12-2020
bullet iconLiu L, Morfi G-V and Benetos E (2020). Joint Piano-roll and Score Transcription for Polyphonic Piano Music. DMRN+15: Digital Music Research Network One-day Workshop London, UK 15 Dec 2020
15-12-2020
bullet iconChettri B, Benetos E and Sturm BLT (2020). Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 28, 3018-3028.  
09-11-2020
bullet iconChettri B, Kinnunen T and Benetos E (2020). Subband modeling for spoofing detection in automatic speaker verification. Odyssey 2020: The Speaker and Language Recognition Workshop Tokyo, Japan 1 Nov 2020 - 5 Nov 2020
01-11-2020
bullet iconRagano A, Benetos E and Hines A (2020). Development of a Speech Quality Database Under Uncontrolled Conditions. 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) Shanghai, China 25 Oct 2020 - 29 Oct 2020
25-10-2020
bullet iconPankajakshan A, Bear H, Subramanian V and Benetos E (2020). Memory Controlled Sequential Self Attention for Sound Recognition. 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) Shanghai, China 25 Oct 2020 - 29 Oct 2020
25-10-2020
bullet iconMISHRA S, Benetos E, Sturm B and Dixon S (2020). Reliable Local Explanations for Machine Listening. International Joint Conference on Neural Networks (IJCNN) Glasgow, UK 19 Jul 2020 - 24 Jul 2020
19-07-2020
bullet iconYcart A, Liu L, Benetos E and Pearce M (2020). Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription. Transactions of the International Society for Music Information Retrieval, Ubiquity Press vol. 3 (1), 68-81.  
12-06-2020
bullet iconRagano A, Benetos E and Hines A (2020). Audio impairment recognition using a correlation-based feature representation. 12th International Conference on Quality of Multimedia Experience (QoMEX) Athlone, Ireland 26 May 2020 - 28 May 2020
26-05-2020
bullet iconWang C, Lostanlen V, Benetos E and Chew E (2020). Playing Technique Recognition by Joint Time–Frequency Scattering. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020
04-05-2020
bullet iconWei W, Zhu H, Benetos E and Wang Y (2020). A-CRNN: a domain adaptation model for sound event detection. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020
04-05-2020
Relevant PublicationMartinez Ramirez M, Benetos E and Reiss J (2020). Modeling plate and spring reverberation using a DSP-informed deep neural network. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020
04-05-2020
Relevant PublicationSUBRAMANIAN V, Pankajakshan A, Benetos E, Xu N, McDonald S and Sandler M (2020). A Study on the Transferability of Adversarial Attacks in Sound Event Classification. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020
04-05-2020
bullet iconYcart A, Liu L, Benetos E and Pearce MT (2020). Musical Features for Automatic Music Transcription Evaluation. 
15-04-2020
bullet iconYcart A and Benetos E (2020). Learning and Evaluation Methodologies for Polyphonic Music Sequence Prediction with LSTMs. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 28 (1), 1328-1341.  
14-04-2020
bullet iconChettri B, Kinnunen T and Benetos E (2020). Deep Generative Variational Autoencoding for Replay Spoof Detection in Automatic Speaker Verification. Computer Speech and Language, Elsevier vol. 63 
19-03-2020
Relevant PublicationMartinez Ramirez M, Benetos E and Reiss J (2020). Deep Learning for Black-Box Modeling of Audio Effects. Applied Sciences, MDPI AG vol. 10 (2) 
16-01-2020

2019

bullet iconLiu L and Benetos E (2019). Automatic Music Accompaniment with a Chroma-based Music Data Representation. DMRN+14: Digital Music Research Network One-day Workshop
17-12-2019
bullet iconHolzapfel A and Benetos E (2019). Automatic music transcription and ethnomusicology: a user study. 20th conference of the International Society for Music Information Retrieval (ISMIR) Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019
04-11-2019
bullet iconWang C, Benetos E, Lostanlen V and Chew E (2019). Adaptive Time–Frequency Scattering for Periodic Modulation Recognition in Music Signals. International Society for Music Information Retrieval Conference Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019
04-11-2019
bullet iconYcart A, McLeod A, Benetos E and Yoshii K (2019). Blending acoustic and language model predictions for automatic music transcription. 20th conference of the International Society for Music Information Retrieval (ISMIR) Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019
04-11-2019
bullet iconYcart A, Stoller D and Benetos E (2019). A Comparative Study of Neural Models for Polyphonic Music Sequence Transduction. 20th conference of the International Society for Music Information Retrieval (ISMIR) Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019
04-11-2019
bullet iconWang C, Benetos E and Chew E (2019). CBF-periDB: A Chinese Bamboo Flute Dataset for Periodic Modulation Analysis. International Society for Music Information Retrieval Conference Late-Breaking Demo Session Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019
04-11-2019
Relevant PublicationSUBRAMANIAN V, Benetos E and Sandler M (2019). Robustness of Adversarial Attacks in Sound Event Classification. 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) New York, USA 25 Oct 2019 - 26 Oct 2019
25-10-2019
bullet iconPankajakshan A, Bear H and Benetos E (2019). Onsets, activity, and events: a multi-task approach for polyphonic sound event modelling. 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) New York, USA 25 Oct 2019 - 26 Oct 2019
25-10-2019
bullet iconSingh S, Pankajakshan A and Benetos E (2019). Audio tagging using a linear noise modelling layer. 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) New York, USA 25 Oct 2019 - 26 Oct 2019
25-10-2019
bullet iconVianna Lordelo C, Benetos E, Dixon S and Ahlbäck S (2019). Investigating kernel shapes and skip connections for deep learning-based harmonic-percussive separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2019 - 23 Oct 2019
20-10-2019
bullet iconPankajakshan A, Bear H and Benetos E (2019). Polyphonic sound event and sound activity detection: a multi-task approach. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2019 - 23 Oct 2019
20-10-2019
bullet iconBear H, Heittola T, Mesaros A, Benetos E and Virtanen T (2019). City classification from multiple real-world sound scenes. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2019 - 23 Oct 2019
20-10-2019
bullet iconChettri B, Stoller D, Morfi V, Martinez Ramirez M, Benetos E and Sturm B (2019). Ensemble Models for Spoofing Detection in Automatic Speaker Verification. 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) Graz, Austria 15 Jul 2019 - 19 Sep 2019
15-09-2019
bullet iconBear H, Nolasco I and Benetos E (2019). Towards joint sound scene and polyphonic sound event recognition. 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) Graz, Austria 15 Sep 2019 - 19 Sep 2019
15-09-2019
Relevant PublicationMartinez Ramirez M, Benetos E and Reiss J (2019). A general-purpose deep learning approach to model time-varying audio effects. International Conference on Digital Audio Effects (DAFx-19) Birmingham, UK 2 Sep 2019 - 6 Sep 2019
02-09-2019
bullet iconZhou Q, Feng Z and Benetos E (2019). Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF. Sensors, MDPI AG vol. 19 (14) 
20-07-2019
bullet iconCovas E and Benetos E (2019). Optimal Neural Network Feature Selection for Spatial-Temporal Forecasting. Chaos, AIP Publishing vol. 29 (6) 
20-06-2019
bullet iconRagano A, BENETOS E and Hines A (2019). Adapting the Quality of Experience Framework for Audio Archive Evaluation. 11th International Conference on Quality of Multimedia Experience Berlin, Germany 5 Jun 2019 - 7 Jun 2019
05-06-2019
bullet iconWANG C, BENETOS E, MENG X and CHEW E (2019). HMM-based Glissando Detection for Recordings of Chinese Bamboo Flute. Sound and Music Computing Conference Malaga, Spain 28 May 2019 - 31 May 2019
28-05-2019
bullet iconLins F, Johann M, BENETOS E and Schramm R (2019). Automatic Transcription of Diatonic Harmonica Recordings. IEEE International Conference on Acoustics, Speech, and Signal Processing Brighton, UK 12 May 2019 - 17 May 2019
12-05-2019
bullet iconPhaye SSR, BENETOS E and Wang Y (2019). SubSpectralNet - Using sub-spectrogram based convolutional neural networks for acoustic scene classification. IEEE International Conference on Acoustics, Speech, and Signal Processing Brighton, UK 12 May 2019 - 17 May 2019
12-05-2019
bullet iconMISHRA S, STOLLER D, BENETOS E, STURM B and DIXON S (2019). GAN-based Generation and Automatic Selection of Explanations for Neural Networks. SafeML ICLR 2019 Workshop New Orleans, USA 6 May 2019
06-05-2019
bullet iconNolasco I, Terenzi A, Cecchi S, Orcioni S, BEAR H and BENETOS E (2019). Audio-based identification of beehive states. IEEE International Conference on Acoustics, Speech, and Signal Processing Brighton, UK 12 May 2019 - 17 May 2019
12-02-2019
bullet iconBENETOS E, DIXON S, Duan Z and EWERT S (2019). Automatic Music Transcription: An Overview. IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers vol. 36 (1), 20-30.  
01-01-2019

2018

bullet iconCHETTRI B, MISHRA S, STURM B and BENETOS E (2018). Analysing the predictions of a CNN-based replay spoofing detection system. 2018 IEEE Workshop on Spoken Language Technology Athens, Greece 18 Dec 2018 - 21 Dec 2018
18-12-2018
bullet iconBEAR H and BENETOS E (2018). An extensible cluster-graph taxonomy for open set sound scene analysis. Workshop on Detection and Classification of Acoustic Scenes and Events Surrey, UK 19 Nov 2018 - 20 Nov 2018
19-11-2018
bullet iconNolasco I and BENETOS E (2018). To bee or not to bee: Investigating machine learning approaches for beehive sound recognition. 2018 Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2018) Surrey, UK 19 Nov 2018 - 20 Nov 2018
19-11-2018
bullet iconYCART A and BENETOS E (2018). A-MAPS: Augmented MAPS Dataset with Rhythm and Key Annotations. 19th International Society for Music Information Retrieval Conference Late-Breaking Demos Session Paris 23 Sep 2018 - 27 Sep 2018
23-09-2018
bullet iconWANG C, BENETOS E, MENG X and CHEW E (2018). Towards HMM-based glissando detection for recordings of Chinese bamboo flute. International Society for Music Information Retrieval Conference Late-Breaking Demos Session Paris, France 23 Sep 2018 - 27 Sep 2018
23-09-2018
bullet iconCHETTRI B, STURM BLT and BENETOS E (2018). Analysing replay spoofing countermeasure performance under varied conditions. IEEE International Workshop on Machine Learning for Signal Processing Aalborg, Denmark 17 Sep 2018 - 20 Sep 2018
17-09-2018
bullet iconChettri B, Mishra S, Sturm BL and Benetos E (2018). A Study On Convolutional Neural Network Based End-To-End Replay Anti-Spoofing. 
22-05-2018
bullet iconYCART A and BENETOS E (2018). Polyphonic Music Sequence Transduction with Meter-Constrained LSTM Networks. IEEE International Conference on Acoustics, Speech and Signal Processing Calgary, Canada 15 Apr 2018 - 20 Apr 2018
15-04-2018
bullet iconNakamura E, BENETOS E, Yoshii K and DIXON S (2018). Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization. IEEE International Conference on Acoustics, Speech and Signal Processing Calgary, Canada 15 Apr 2018 - 20 Apr 2018
15-04-2018
bullet iconValero-Mas JJ, BENETOS E and Iñesta JM (2018). A Supervised Classification Approach for Note Tracking in Polyphonic Piano Transcription. Journal of New Music Research, Taylor & Francis (Routledge) vol. 47 (3), 249-263.  
26-03-2018
bullet iconAli H, Tran SN, Benetos E and d'Avila Garcez AS (2018). Speaker recognition with hybrid features from a deep belief network. Neural Computing and Applications, Springer Verlag (Germany) vol. 29 (6), 13-19.  
01-03-2018
bullet iconMesaros A, Heittola T, Benetos E, Foster P, Lagrange M, Virtanen T and Plumbley M (2018). Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 26 (2), 379-393.  
01-02-2018
bullet iconPANTELI M, BENETOS E and DIXON S (2018). A review of manual and computational approaches for the study of world music corpora. Journal of New Music Research, Taylor & Francis (Routledge) vol. 47 (2), 176-189.  
08-01-2018
bullet iconBENETOS E, STOWELL D and PLUMBLEY M (2018). Approaches to complex sound scene analysis. Computational Analysis of Sound Scenes and Events , Editors: Virtanen T, PLUMBLEY M and Ellis D. 215-242.  
01-01-2018
bullet icon (2018). Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France, September 23-27, 2018., Editors: Gómez E, Hu X, Humphrey E and Benetos E. 
01-01-2018

2017

bullet iconPANTELI M, BENETOS E and DIXON S (2017). A computational study on outliers in world music. PLoS ONE, Public Library of Science (PLoS) vol. 12 (12), 1-28.  
18-12-2017
bullet iconMcLeod A, Schramm R, Steedman M and BENETOS E (2017). Automatic Transcription of Polyphonic Vocal Music. Applied Sciences, MDPI AG vol. 7 (12) 
11-12-2017
bullet iconYcart A and Benetos E (2017). A study on LSTM networks for polyphonic music sequence modelling. 18th International Society for Music Information Retrieval Conference (ISMIR 2017) Suzhou, China 23 Oct 2017 - 27 Oct 2017
23-10-2017
bullet iconSchramm R, McLeod A, Steedman M and Benetos E (2017). Multi-pitch detection and voice assignment for a cappella recordings of multiple singers. 18th International Society for Music Information Retrieval Conference (ISMIR 2017) Suzhou, China 23 Oct 2017 - 27 Oct 2017
23-10-2017
bullet iconLafay G, Benetos E and Lagrange M (2017). Sound Event Detection in Synthetic Audio: Analysis of the DCASE 2016 Task Results. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017) New Paltz, NY, USA 15 Oct 2017 - 18 Oct 2017
15-10-2017
bullet iconYCART A and BENETOS E (2017). Neural Music Language Models: investigating the training process. International Conference of Students of Systematic Musicology
13-09-2017
bullet iconBenetos E (2017). Polyphonic note and instrument tracking using linear dynamical systems. 2017 AES International Conference on Semantic Audio Erlangen, Germany 22 Jun 2017 - 24 Jun 2017
22-06-2017
bullet iconValero-Mas JJ, Benetos E and Iñesta JM (2017). Assessing the Relevance of Onset Information for Note Tracking in Piano Music Transcription. 2017 AES International Conference on Semantic Audio Erlangen, Germany 22 Jun 2017 - 24 Jun 2017
22-06-2017
bullet iconSchramm R and Benetos E (2017). Automatic Transcription of a Cappella Recordings from Multiple Singers. 2017 AES International Conference on Semantic Audio Erlangen, Germany 22 Jun 2017 - 24 Jun 2017
22-06-2017
bullet iconStowell D, Benetos E and Gill LF (2017). On-Bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts. IEEE/ACM Trans. Audio, Speech & Language Processing vol. 25 (6), 1193-1206.  
23-05-2017
bullet iconBenetos E, Lafay G, Lagrange M and Plumbley MD (2017). Polyphonic Sound Event Tracking using Linear Dynamical Systems. IEEE/ACM Transactions on Audio, Speech and Language Processing, IEEE vol. 25 (6), 1266-1277.  
23-05-2017
bullet iconRussell AJ, Benetos E and d'Avila Garcez AS (2017). On the Memory Properties of Recurrent Neural Models. International Joint Conference on Neural Networks (IJCNN 2017) Anchorage, Alaska, USA 14 May 2017 - 19 May 2017
14-05-2017
bullet iconAbdallah S, Benetos E, Gold N, Hargreaves S, Weyde T and Wolff D (2017). The Digital Music Lab: A Big Data Infrastructure for Digital Musicology. ACM Journal on Computing and Cultural Heritage, ACM vol. 10 (1) 
01-01-2017

2016

bullet iconYCART A and Benetos E (2016). Towards a Music Language Model for Audio Analysis. DMRN+11: Digital Music Research Network One-day Workshop 2016 Centre for Digital Music, Queen Mary University of London 20 Dec 2016
20-12-2016
bullet iconBENETOS E and Schramm R (2016). Automatic Transcription of Vocal Quartets. DMRN+11: Digital Music Research Network One-day Workshop 2016 Centre for Digital Music, Queen Mary University of London 20 Dec 2016
20-12-2016
bullet iconValero-Mas JJ, Benetos E and Iñesta JM (2016). Classification-based Note Tracking for Automatic Music Transcription. 9th International Workshop on Machine Learning and Music Riva del Garda, Italy 23 Sep 2016
23-09-2016
bullet iconAbdallah S, Benetos E, Gold N, Hargreaves S, Weyde T and Wolff D (2016). Digital Music Lab: A Framework for Analysing Big Music Data. 24th European Signal Processing Conference Budapest, Hungary 29 Aug 2016 - 2 Sep 2016
29-08-2016
bullet iconHolzapfel A and Benetos E (2016). The Sousta corpus: Beat-informed automatic transcription of traditional dance tunes. 17th International Society for Music Information Retrieval Conference New York, USA 7 Aug 2016 - 11 Aug 2016
07-08-2016
bullet iconPanteli M, Benetos E and Dixon S (2016). Learning a feature space for similarity in world music. 17th International Society for Music Information Retrieval Conference New York, USA 7 Aug 2016 - 11 Aug 2016
07-08-2016
bullet iconCheng T, Mauch M, Benetos E and Dixon S (2016). An attack/decay model for piano transcription. 17th International Society for Music Information Retrieval Conference New York, USA 7 Aug 2016 - 11 Aug 2016
07-08-2016
bullet iconLafay G, Lagrange M, Rossignol M, Benetos E and Roebel A (2016). A morphological model for simulating acoustic scenes and its application to sound event detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, IEEE vol. 24 (10), 1854-1864.  
01-07-2016
bullet iconPanteli M, Benetos E and Dixon S (2016). Automatic detection of outliers in world music collections. Fourth International Conference on Analytical Approaches to World Music (AAWM 2016) New York, USA 8 Jun 2016 - 11 Jun 2016
08-06-2016
bullet iconBenetos E, Lafay G, Lagrange M and Plumbley MD (2016). Detection of overlapping acoustic events using a temporally-constrained probabilistic model. IEEE International Conference on Acoustics, Speech, and Signal Processing Shanghai, China 20 Mar 2016 - 25 Mar 2016
20-03-2016
bullet iconSigtia S, Benetos E and Dixon S (2016). An End-to-End Neural Network for Polyphonic Piano Music Transcription. IEEE/ACM Transactions on Audio, Speech, and Language Processing, IEEE vol. 24 (5), 927-939.  
23-02-2016

2015

bullet iconBenetos E and Weyde T (2015). An efficient temporally-constrained probabilistic model for multiple-instrument music transcription., Editors: Wiering F and Müller M. 16th International Society for Music Information Retrieval Conference (ISMIR) Malaga, Spain 26 Oct 2015 - 30 Oct 2015
26-10-2015
bullet iconBENETOS E and Holzapfel A (2015). Automatic transcription of Turkish microtonal music. Journal of the Acoustical Society of America, Acoustical Society of America vol. 138 (4), 2118-2130.  
14-10-2015
bullet iconStowell D, Giannoulis D, Benetos E, Lagrange M and Plumbley MD (2015). Detection and Classification of Acoustic Scenes and Events. IEEE Transactions on Multimedia vol. 17 (10), 1733-1746.  
01-10-2015
bullet iconRossignol M, Lagrange M, Lafay G and BENETOS E (2015). Alternate level clustering for drum transcription. 23rd European Signal Processing Conference (EUSIPCO) Nice, France 31 Aug 2015 - 4 Sep 2015
31-08-2015
bullet iconAbdallah S, Alencar-Brayner A, BENETOS E, Cottrell S, Dykes J, Gold N, Kachkaev A, Mahey M, Tidhar D, Tovell A, Weyde T and Wolff D (2015). Automatic transcription and pitch analysis of the British Library World & Traditional Music Collection. 5th International Workshop on Folk Music Analysis Paris, France 10 Jun 2015 - 12 Jun 2015
10-06-2015
bullet iconSigtia S, Benetos E, Boulanger-Lewandowski N, Weyde T, Garcez ASDA and Dixon S (2015). A Hybrid Recurrent Neural Network for Music Transcription. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Brisbane, Australia 19 Apr 2015 - 24 Apr 2015
19-04-2015

2014

bullet iconBenetos E, Badeau R, Weyde T and Richard G (2014). Template Adaptation for Improving Automatic Music Transcription., Editors: Wang H-M, Yang Y-H and Lee JH. 15th International Society for Music Information Retrieval Conference (ISMIR) Taipei, Taiwan 27 Oct 2014 - 31 Oct 2014
27-10-2014
bullet iconTidhar D, Dixon S, Benetos E and Weyde T (2014). The temperament police. Early Music, Oxford University Press vol. 42 (4), 579-590.  
11-10-2014
bullet iconWeyde T, Cottrell S, Dykes J, Benetos E, Wolff D, Tidhar D, Gold N, Abdallah S, Plumbley M, Dixon S, Barthet M, Mahey M, Tovell A and Alancar-Brayner A (2014). Big Data for Musicology., Editors: Page K and Fields B. 1st International Digital Libraries for Musicology workshop London, UK 12 Sep 2014
12-09-2014
bullet iconWolff D, Tidhar D, Benetos E, Dumon E, Cherla S and Weyde T (2014). Incremental dataset definition for large scale musicological research., Editors: Page K and Fields B. 1st International Digital Libraries for Musicology workshop London, UK 12 Sep 2014
12-09-2014
bullet iconTran S, Benetos E and d Avila Garcez A (2014). Learning motion-difference features using Gaussian restricted Boltzmann machines for efficient human action recognition. 2014 International Joint Conference on Neural Networks (IJCNN) Beijing, China 6 Jul 2014 - 11 Jul 2014
06-07-2014
bullet iconBenetos E and Holzapfel A (2014). Incorporating pitch class profiles for improving automatic transcription of Turkish makam music., Editors: Holzapfel A. 4th International Workshop on Folk Music Analysis Istanbul, Turkey 12 Jun 2014 - 13 Jun 2014
13-06-2014
bullet iconGiannoulis D, Benetos E, Klapuri A and Plumbley M (2014). Improving instrument recognition in polyphonic music through system integration. IEEE International Conference on Acoustics, Speech, and Signal Processing Florence, Italy 4 May 2014 - 9 May 2014
04-05-2014
bullet iconBenetos E, Jansson A and Weyde T (2014). Improving automatic music transcription through key detection., Editors: Dittmar C, Fazekas G and Ewert S. AES 53rd International Conference on Semantic Audio London, UK 27 Jan 2014 - 29 Jan 2014
29-01-2014
bullet iconSigtia S, Benetos E, Cherla S, Weyde T, Garcez A and Dixon S (2014). RNN-based Music Language Models for Improving Automatic Music Transcription. 
01-01-2014
bullet iconBenetos E, Ewert S and Weyde T (2014). Automatic Transcription Of Pitched And Unpitched Sounds From Polyphonic Music. 
01-01-2014
bullet iconBARTHET M, Benetos E, Cottrell S, Dixon S, Dykes J, Gold N, Mahey M, Plumbley MD, Tidhar D, Weyde T and Wolff D (2014). The DML Research Project: Digital Music Lab - Analysing Big Music Data. 
01-01-2014

2013

bullet iconBenetos E, Dixon S, Giannoulis D, Kirchhoff H and Klapuri A (2013). Automatic music transcription: Challenges and future directions. Journal of Intelligent Information Systems vol. 41 (3), 407-434.  
01-12-2013
bullet iconBenetos E and Holzapfel A (2013). Automatic transcription of Turkish makam music. 14th International Society for Music Information Retrieval Conference Curitiba, PR, Brazil 4 Nov 2013 - 8 Nov 2013
08-11-2013
bullet iconBenetos E and Weyde T (2013). Explicit duration hidden Markov models for multiple-instrument polyphonic music transcription. 14th International Society for Music Information Retrieval Conference Curitiba, PR, Brazil 4 Nov 2013 - 8 Nov 2013
08-11-2013
bullet iconde Valk R, Weyde T and Benetos E (2013). A machine learning approach to voice separation in lute tablature., Editors: Britto AS, Gouyon F and Dixon S. 14th International Society for Music Information Retrieval Conference Curitiba, PR, Brazil 4 Nov 2013 - 8 Nov 2013
04-11-2013
bullet iconGiannoulis D, Benetos E, Stowell D, Rossignol M, Lagrange M and Plumbley M (2013). Detection and classification of acoustic scenes and events: an IEEE AASP challenge. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2013 - 23 Oct 2013
23-10-2013
bullet iconGiannoulis D, Stowell D, Benetos E, Rossignol M, Lagrange M and Plumbley MD (2013). A database and challenge for acoustic scene classification and event detection. 21st European Signal Processing Conference Marrakech, Morocco
01-09-2013
bullet iconBenetos E, Cherla S and Weyde T (2013). An efficient shift-invariant model for polyphonic music transcription. 6th International Workshop on Machine Learning and Music Prague, Czech Republic
01-09-2013
bullet iconSerra X, Magas M, Benetos E, Chudy M, Dixon S, Flexer A, Gomez E, Gouyon F, Herrera P, Jorda S, Paytuvi O, Peeters G, Schlüter J, Vinet H and Widmer G (2013). Roadmap for Music Information ReSearch., Editors: Peeters G. 
02-05-2013
bullet iconBenetos E and Dixon S (2013). Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model. Journal of the Acoustical Society of America vol. 133 (3), 1727-1741.  
01-03-2013

2012

bullet iconBENETOS E (2012). Automatic transcription of polyphonic music exploiting temporal evolution., Editors: Dixon S. 
31-12-2012
bullet iconBENETOS E, Dixon S, Giannoulis D, Kirchhoff H and Klapuri A (2012). Automatic Music Transcription: Breaking the Glass Ceiling., Editors: Gouyon F, Herrera P, Martins LG and Müller M. 13th International Society for Music Information Retrieval Conference (ISMIR 2012) Porto, Portugal 8 Oct 2012 - 12 Oct 2012
12-10-2012
bullet iconBenetos E and Dixon S (2012). Temporally-constrained convolutive probabilistic latent component analysis for multi-pitch detection. 
01-03-2012
bullet iconBenetos E, Klapuri A and Dixon S (2012). SCORE-INFORMED TRANSCRIPTION FOR AUTOMATIC PIANO TUTORING. 
01-01-2012
bullet iconBenetos E and Dixon S (2012). A Shift-Invariant Latent Variable Model for Automatic Music Transcription. COMPUTER MUSIC JOURNAL vol. 36 (4), 81-94.  
01-01-2012
bullet iconBenetos E, Lagrange M and Dixon S (2012). Characterisation of acoustic scenes using a temporally-constrained shift-invariant model. 
01-01-2012

2011

bullet iconBenetos E and Dixon S (2011). Joint multi-pitch detection using harmonic envelope estimation for polyphonic music transcription. IEEE Journal on Selected Topics in Signal Processing vol. 5 (6), 1111-1123.  
01-10-2011
bullet iconBenetos E and Dixon S (2011). Multiple-instrument polyphonic music transcription using a convolutive probabilistic model. 8th Sound and Music Computing Conference Padova, Italy 6 Jul 2011 - 9 Jul 2011
01-07-2011
bullet iconMearns L, Benetos E and Dixon S (2011). Automatically detecting key modulations in J.S. Bach chorale recordings. 
01-07-2011
bullet iconBenetos E and Dixon S (2011). Polyphonic music transcription using note onset and offset detection. 
01-01-2011
bullet iconBenetos E and Dixon S (2011). A TEMPORALLY-CONSTRAINED CONVOLUTIVE PROBABILISTIC MODEL FOR PITCH DETECTION. 
01-01-2011
bullet iconDixon S, Tidhar D and Benetos E (2011). The temperament police: The truth, the ground truth, and nothing but the truth. 12th International Society for Music Information Retrieval Conference Miami, Florida, USA 24 Oct 2011 - 28 Oct 2011
01-01-2011

2010

bullet iconBenetos E and Dixon S (2010). Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution. 
01-09-2010
bullet iconBenetos E and Stylianou Y (2010). Auditory spectrum-based pitched instrument onset detection. IEEE Transactions on Audio, Speech and Language Processing vol. 18 (8), 1968-1977.  
01-01-2010
bullet iconBenetos E and Kotropoulos C (2010). Non-negative tensor factorization applied to music genre classification. IEEE Transactions on Audio, Speech and Language Processing vol. 18 (8), 1955-1967.  
01-01-2010
bullet iconAnglade A, Benetos E, Mauch M and Dixon S (2010). Improving music genre classification using automatically induced harmony rules. Journal of New Music Research vol. 39 (4), 349-361.  
01-01-2010

2009

bullet iconBenetos E, Holzapfel A and Stylianou Y (2009). Pitched instrument onset detection based on auditory spectra. 
01-01-2009

2008

bullet iconBenetos E and Kotropoulos C (2008). A tensor-based approach for automatic music genre classification. 
01-08-2008
bullet iconSpachos D, Zlantintsi A, Moschou V, Antonopoulos P, Benetos E, Kotti M, Tzimouli K, Kotropoulos C, Nikolaidis N, Maragos P and Pitas I (2008). MUSCLE movie-database: a multimodal corpus with rich annotation for dialogue and saliency detection. 
01-05-2008
bullet iconPanagakis I, Benetos E and Kotropoulos C (2008). Music Genre Classification: A Multilinear Approach., Editors: Bello JP, Chew E and Turnbull D. 
01-01-2008
bullet iconKotti M, Benetos E and Kotropoulos C (2008). Computationally efficient and robust BIC-based speaker segmentation. IEEE Transactions on Audio, Speech and Language Processing vol. 16 (5), 920-933.  
01-01-2008
bullet iconBENETOS E, Siatras S, Kotropoulos C, Nikolaidis N and Pitas I (2008). Movie analysis with emphasis to dialogue and action scene detection. Multimodal Processing and Interaction , Editors: Maragos PA, Potamianos A and Gros P. 157-177.  
01-01-2008

2007

bullet iconKotti M, Benetos E and Kotropoulos C (2007). Neural network-based movie dialogue detection. 
01-08-2007
bullet iconBenetos E, Kotti M and Kotropoulos C (2007). Large scale musical instrument identification. 
01-07-2007
bullet iconMoschou V, Kotti M, Benetos E and Kotropoulos C (2007). Systematic comparison of BIC-based speaker segmentation systems. 
01-01-2007
bullet iconKotti M, Benetos E, Kotropoulos C and Pitas I (2007). A neural network approach to audio-assisted movie dialogue detection. Neurocomputing vol. 71, 157-166.  
01-01-2007

2006

bullet iconBenetos E, Kotti M and Kotropoulos C (2006). Applying Supervised Classifiers Based on Non-negative Matrix Factorization to Musical Instrument Classification. 
01-01-2006
bullet iconKotti M, Benetos E and Kotropoulos C (2006). Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme. 
01-01-2006
bullet iconKotti M, Martins LPM, Benetos E, Cardoso JS and Kotropoulos C (2006). Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches. 
01-01-2006
bullet iconBenetos E, Kotti M and Kotropoulos C (2006). Musical instrument classification using non-negative matrix factorization algorithms. 
01-01-2006
bullet iconBenetos E, Kotti M and Kotropoulos C (2006). Musical instrument classification using non-negative matrix factorization algorithms and subset feature selection. 
01-01-2006
bullet iconBenetos E, Kotropoulos C, Lidy T and Rauber A (2006). Testing supervised classifiers based on non-negative matrix factorization to musical instrument classification. 
01-01-2006

2005

bullet iconBenetos E, Kotti M, Kotropoulos C, Burred JJ, Eisenberg G, Haller M and Sikora T (2005). Comparison of subspace analysis-based and statistical model-based algorithms for musical instrument classification. 
01-10-2005

Grants of specific relevance to the Centre for Multimodal AI

solid heart iconGrants of specific relevance to the Centre for Multimodal AI
bullet iconUKRI Centre for Doctoral Training in Artificial Intelligence and Music
Simon Dixon, Emmanouil Benetos, Nicholas Bryan-Kinns, Mark Sandler, Andrew McPherson, Mathieu Barthet, Gyorgy Fazekas, Ekaterina Ivanova, Anna Xambo Sedo and Charalampos Saitis
£6,519,934 EPSRC Engineering and Physical Sciences Research Council (01-07-2019 - 31-08-2028)
bullet iconProject Maestro - Ai Musical Analysis Platform
Emmanouil Benetos and Simon Dixon
£166,349 Innovate UK (01-07-2024 - 30-06-2026)
bullet iconSpotify PhD project - Style classification of podcasts using audio
Emmanouil Benetos
£33,000 Spotify Ltd (01-03-2024 - 28-02-2026)
bullet iconOnline Speech Enhancement in Scenarios with Low Direct-to-Reverberant-Ratio
Emmanouil Benetos and Aidan Hogg
£65,621 L-ACOUSTICS UK LIMITED (01-09-2024 - 28-02-2025)
bullet iconIndustry-scale machine listening for music and audio data”
Simon Dixon and Emmanouil Benetos
£108,000 Spotify Ltd (14-09-2020 - 31-01-2025)
bullet iconGraph Networks for Explainable Artificial Intelligence
Andrea Cavallaro and Emmanouil Benetos
£293,434 EPSRC Engineering and Physical Sciences Research Council (01-08-2021 - 31-12-2024)