Dr Emmanouil Benetos
FHEA
Reader in Machine Listening
Director of Research, Deputy Director of the UKRI Centre for Doctoral Training in AI and Music
School of Electronic Engineering and Computer Science
Queen Mary University of London
Research
Machine listening / computer audition, Machine learning for audio and sequential data, Music information retrieval, Multimodal AI, Resource-efficient AI
Interests
I am currently Reader in Machine Listening and Director of Research at the School of Electronic Engineering and Computer Science of Queen Mary University of London. Within Queen Mary, I am a member of the Centre for Digital Music, Centre for Multimodal AI, Centre for Intelligent Sensing, and Digital Environment Research Institute, and I co-lead the School's Machine Listening Lab.
My main research topic is computational audio analysis, also referred to as machine listening or computer audition, applied to music, urban, everyday and nature sounds. I have been Royal Academy of Engineering / Leverhulme Trust Research Fellow in resource-efficient machine listening, Turing Fellow at the Alan Turing Institute, Royal Academy of Engineering Research Fellow, and have been principal investigator and co-investigator for several funded audio-related research projects on sound scene analysis, music information retrieval, and digital musicology. I am also Deputy Director for the UKRI Centre for Doctoral Training in Artificial Intelligence and Music (AIM).
On academic service, I am currently secretary for the International Society for Music Information Retrieval (ISMIR), member of the IEEE Technical Committee on Audio and Acoustic Signal Processing (AASP TC), member of the EURASIP Acoustic, Speech and Music Signal Processing Technical Area Committee (ASMSP TAC), associate editor for the IEEE/ACM Transactions on Audio, Speech, and Language Processing, and associate editor for the EURASIP Journal on Audio, Speech, and Music Processing.
Publications
Publications of specific relevance to the Centre for Multimodal AI
2024
Deng Q, Yang Q, Yuan R, Huang Y, Wang Y, Liu X, Tian Z, Pan J, Zhang G, Lin H, Li Y, Ma Y, Fu J, Lin C, Benetos E, Wang W, Xia G, Xue W and Guo Y (2024). ComposerX: Multi-Agent Symbolic Music Composition with LLMs. 25th International Society for Music Information Retrieval Conference (ISMIR), San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.
10-11-2024
Steinmetz C, Singh S, Comunità M, Ibnyahya I, Yuan S, Benetos E and Reiss J (2024). ST-ITO: Controlling audio effects for style transfer with inference-time optimization. 25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.
10-11-2024
Zhou Z, Wu Y, Wu Z, Zhang X, Yuan R, Ma Y, Wang L, Benetos E, Xue W and Guo Y (2024). Can LLMs Reason in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation. 25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.
10-11-2024
Weck B, Manco I, Benetos E, Quinton E, Fazekas G and Bogdanov D (2024). MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models. 25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.
10-11-2024
Chang SK, Benetos E, Kirchhoff H and Dixon S (2024). YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation. IEEE International Workshop on Machine Learning for Signal Processing (MLSP) London, UK 22 Sep 2024 - 25 Sep 2024.
22-09-2024
Torrisi A, De Almeida Nolasco IS, Versace E and Benetos E (2024). Exploratory analysis of early-life chick calls. 4th International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR) Kos, Greece 6 Sep 2024.
06-09-2024
Huang J and Benetos E (2024). Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model. 32nd European Signal Processing Conference (EUSIPCO) Lyon, France 26 Aug 2024 - 30 Aug 2024.
26-08-2024
Yuan R, Lin H, Wang Y, Tian Z, Wu S, Shen T, Zhang G, Wu Y, Liu C, Zhou Z, Ma Z, Xue L, Wang Z, Liu Q, Zheng T, Li Y, Ma Y, Liang Y, Chi X, Liu R, et al. (2024). ChatMusician: Understanding and Generating Music Intrinsically with LLM. 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) Bangkok, Thailand 11 Aug 2024 - 16 Aug 2024.
11-08-2024
Xompero A, Bontonou M, Arbona J-M, Benetos E and Cavallaro A (2024). Explaining models relating objects and privacy. 3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024 Seattle Convention Center, Seattle WA, USA 18 Jun 2024.
18-06-2024
Deng Z, Ma Y, Liu Y, Guo R, Zhang G, Chen W, Huang W and Benetos E (2024). MusiLingo: bridging music and text with pre-trained language models for music captioning and query response. 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) Mexico City, Mexico 16 Jun 2024 - 21 Jun 2024.
16-06-2024
Ozaki Y, Tierney A, Pfordresher PQ, McBride JM, Benetos E, Proutskova P, Chiba G, Liu F, Jacoby N, Purdy SC, Opondo P, Fitch WT, Hegde S, Rocamora M, Thorne R, Nweke F, Sadaphal DP, Sadaphal PM, Hadavi S, Fujii S, et al. (2024). Globally, songs and instrumental melodies are slower and higher and use more stable pitches than speech: A Registered Report. Science Advances, American Association for the Advancement of Science (AAAS) vol. 10 (20)
15-05-2024
Liang J, Zhang H, Liu H, Cao Y, Kong Q, Liu X, Wang W, Plumbley MD, Phan H and Benetos E (2024). WavCraft: audio editing and generation with large language models. ICLR 2024 Workshop on LLM Agents Vienna, Austria 11 May 2024.
11-05-2024
Li Y, Yuan R, Zhang G, Ma Y, Chen X, Yin H, Xiao C, Lin C, Ragni A, Benetos E, Gyenge N, Dannenberg R, Liu R, Chen W, Xia G, Shi Y, Huang W, Wang Z, Guo Y and Fu J (2024). MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training. International Conference on Learning Representations (ICLR) Vienna, Austria 7 May 2024 - 11 May 2024.
07-05-2024
Liang J, Phan QH and Benetos E (2024). Learning from taxonomy: multi-label few-shot classification for everyday sound recognition. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024.
14-04-2024
Li D, Ma Y, Wei W, Kong Q, Wu Y, Che M, Xia F, Benetos E and Li W (2024). MERTech: instrument playing technique detection using self-supervised pretrained model with multi-task finetuning. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024.
14-04-2024
Postolache E, Mariani G, Cosmo L, Benetos E and Rodola E (2024). Generalized multi-source inference for text conditioned music diffusion models. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024.
14-04-2024
Liang J, Nolasco I, Ghani B, Phan H, Benetos E and Stowell D (2024). Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection. 32nd European Signal Processing Conference (EUSIPCO 2024) Lyon, France 26 Aug 2024 - 30 Aug 2024.
27-03-2024
Edwards D, Dixon S, Benetos E, Maezawa A and Kusaka Y (2024). A Data-Driven Analysis of Robust Automatic Piano Transcription. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 681-685.
08-02-2024
Singh S, Steinmetz C, Benetos E, Phan QH and Stowell D (2024). ATGNN: audio tagging graph neural network. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 825-829.
17-01-2024
2023
Manco I, Weck B, Doh S, Won M, Zhang Y, Bogdanov D, Wu Y, Chen K, Tovstogan P, Benetos E, Quinton E, Fazekas G and Nam J (2023). The Song Describer dataset: a corpus of audio captions for music-and-language evaluation. NeurIPS Machine Learning for Audio Workshop New Orleans, USA 16 Dec 2023.
16-12-2023
Deb O, Benetos E and Torr P (2023). Remaining-useful-life prediction and uncertainty quantification using LSTM ensembles for aircraft engines. NeurIPS Workshop on Advancing Neural Network Training (WANT): Computational Efficiency, Scalability, and Resource Optimization New Orleans, USA 16 Dec 2023.
16-12-2023
Yuan R, Ma Y, Li Y, Zhang G, Chen X, Yin H, Zhuo L, Liu Y, Huang J, Tian Z, Deng B, Wang N, Benetos E, Ragni A, Gyenge N, Dannenberg R, Chen W, Xia G, Xue W, Liu S, et al. (2023). MARBLE: Music Audio Representation Benchmark for Universal Evaluation. 37th Conference on Neural Information Processing Systems (NeurIPS) 10 Dec 2023 - 16 Dec 2023.
10-12-2023
Ragano A, Benetos E and Hines A (2023). Learning Music Representations with wav2vec 2.0. 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) Letterkenny, Ireland 7 Dec 2023.
07-12-2023
Papaioannou C, Benetos E and Potamianos A (2023). From West to East: Who can understand the music of the others better? 24th International Society for Music Information Retrieval Conference (ISMIR) Milan, Italy 5 Nov 2023 - 9 Nov 2023.
05-11-2023
Ma Y, Yuan R, Li Y, Zhang G, Chen X, Yin H, Lin C, Benetos E, Ragni A, Gyenge N, Liu R, Xia G, Dannenberg R, Guo Y and Fu J (2023). On the effectiveness of speech self-supervised learning for music. 24th International Society for Music Information Retrieval Conference (ISMIR) Milan, Italy 5 Nov 2023 - 9 Nov 2023.
05-11-2023
Zhuo L, Yuan R, Pan J, Ma Y, Li Y, Zhang G, Liu S, Dannenberg R, Fu J, Lin C, Benetos E, Chen W, Xue W and Guo Y (2023). LyricWhiz: Robust Multilingual Lyrics Transcription by Whispering to ChatGPT. 24th International Society for Music Information Retrieval Conference (ISMIR) Milan, Italy 5 Nov 2023 - 9 Nov 2023.
05-11-2023
Sarkar S, Thorpe L, Benetos E and Sandler M (2023). Leveraging synthetic data for improving chamber ensemble separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) New Paltz, New York, USA 22 Oct 2023 - 25 Oct 2023.
22-10-2023
Vahidi C, Singh S, Benetos E, Phan QH, Stowell D, Fazekas G and Lagrange M (2023). Perceptual musical similarity metric learning with graph neural networks. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) New Paltz, NY, USA 22 Oct 2023 - 25 Oct 2023.
22-10-2023
Edwards D, Dixon S and Benetos E (2023). PiJAMA: Piano Jazz with Automatic MIDI Annotations. Transactions of the International Society for Music Information Retrieval, Ubiquity Press vol. 6 (1), 89-102.
15-09-2023
Liang J, Liu X, Liu H, Phan H, Benetos E, Plumbley M and Wang W (2023). Adapting Language-Audio Models as Few-Shot Audio Learners. 24th Annual Conference of the International Speech Communication Association (INTERSPEECH) Dublin, Ireland 20 Aug 2023 - 24 Aug 2023.
20-08-2023
Ragano A, Benetos E, Chinen M, Becerra H, Chandan Karadagur Ananda R, Skoglund J and Hines A (2023). A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality. Irish Signals & Systems Conference 2023 Dublin, Ireland 13 Jun 2023 - 14 Jun 2023.
13-06-2023
Ragano A, Benetos E and Hines A (2023). Audio Quality Assessment of Vinyl Music Collections Using Self-Supervised Learning. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 4 Jun 2023 - 10 Jun 2023.
04-06-2023
Li Y, Cao W, Xie W, Li J and Benetos E (2023). Few-shot Class-incremental Audio Classification Using Dynamically Expanded Classifier with Self-attention Modified Prototypes. IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers vol. 26, 1346-1360.
25-05-2023
2022
Li Y, Yuan R, Zhang G, Ma Y, Lin C, Chen X, Ragni A, Yin H, Hu Z, He H, Benetos E, Gyenge N, Liu R and Fu J (2022). Large-Scale Pretrained Model for Self-Supervised Music Audio Representation Learning. DMRN+17: Digital Music Research Network One-day Workshop 2022 London, UK 20 Dec 2022.
20-12-2022
Liu L, Kong Q, Morfi G-V and Benetos E (2022). Performance MIDI-to-score conversion by neural beat tracking. 23rd International Society for Music Information Retrieval Conference (ISMIR) Bengaluru, India 4 Dec 2022 - 8 Dec 2022.
18-12-2022
Sarkar S, Benetos E and Sandler M (2022). EnsembleSet: A new high-quality synthesised dataset for chamber ensemble separation. 23rd International Society for Music Information Retrieval Conference (ISMIR) Bengaluru, India 5 Dec 2022 - 8 Dec 2022.
08-12-2022
Manco I, Benetos E, Quinton E and Fazekas G (2022). Contrastive audio-language learning for music. 23rd International Society for Music Information Retrieval Conference (ISMIR) Bengaluru, India 4 Dec 2022 - 8 Dec 2022.
04-12-2022
Liang J, Phan QH and Benetos E (2022). Leveraging label hierarchies for few-shot everyday sound recognition. 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) Nancy, France 3 Nov 2022 - 4 Nov 2022.
03-11-2022
Mai KT, Davies T, Griffin LD and Benetos E (2022). Explaining the decisions of anomalous sound detectors. 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE) Nancy, France 3 Nov 2022 - 4 Nov 2022.
03-11-2022
Ozaki Y, Kuroyanagi J, McBride J, Proutskova P, Tierney A, Pfordresher P, Benetos E, Liu F and Savage PE (2022). Similarities and differences in a cross-linguistic sample of song and speech recordings. Joint Conference on Language Evolution Kanazawa, Japan 5 Sep 2022 - 8 Sep 2022.
05-09-2022
Wang C, Benetos E, Versace E and Wang S (2022). Joint Scattering for Automatic Chick Call Recognition. 30th European Signal Processing Conference Belgrade, Serbia 29 Aug 2022 - 2 Sep 2022.
29-08-2022
Singh S, Benetos E and Phan QH (2022). Hypernetworks for sound event detection: a proof-of-concept. 30th European Signal Processing Conference (EUSIPCO 2022) Belgrade, Serbia 29 Aug 2022 - 3 Sep 2022.
29-08-2022
Daikoku H, Ding S, Benetos E, Wood ALC, Shimizono T, Sanne US, Fujii S and Savage PE (2022). Agreement among human and automated estimates of similarity in a global music sample. 10th International Workshop on Folk Music Analysis (FMA 2022) Sheffield, UK 14 Jun 2022 - 17 Jun 2022.
14-06-2022
Huang J, Benetos E and Ewert S (2022). Improving Lyrics Alignment through Joint Pitch Detection. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing Singapore 22 May 2022 - 27 May 2022.
22-05-2022
Manco I, Benetos E, Quinton E and Fazekas G (2022). Learning music audio representations via weak language supervision. 2022 IEEE International Conference on Acoustics, Speech and Signal Processing Singapore 22 May 2022 - 27 May 2022.
22-05-2022
Ou L, Guo Z, Benetos E, Han J and Wang Y (2022). Exploring transformer's potential on automatic piano transcription. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Singapore 7 May 2022 - 13 May 2022.
07-05-2022
Ragano A, Benetos E and Hines A (2022). Automatic Quality Assessment of Digitized and Restored Sound Archives. Journal of the Audio Engineering Society, Audio Engineering Society vol. 70 (4), 252-270.
01-04-2022
Wang C, Benetos E, Lostanlen V and Chew E (2022). Adaptive Scattering Transforms for Playing Technique Recognition. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 30, 1407-1421.
07-03-2022
Benetos E, Ragano A, Sgroi D and Tuckwell A (2022). Measuring national mood with music: using machine learning to construct a measure of national valence from audio data. Behavior Research Methods, Springer (part of Springer Nature)
25-02-2022
Terenzi A, Ortolani N, De Almeida Nolasco I, Benetos E and Cecchi S (2022). Comparison of feature extraction methods for sound-based classification of honey bee activity. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 30, 112-122.
01-01-2022
2021
Bodo RPP, Benetos E and Queiroz M (2021). A framework for music similarity and cover song identification. 15th International Symposium on Computer Music Multidisciplinary Research (CMMR) Tokyo, Japan 15 Nov 2021 - 19 Nov 2021.
15-11-2021
Liu L, Morfi V and Benetos E (2021). ACPAS: A Dataset of Aligned Classical Piano Audio and Scores for Audio-to-Score Transcription. Late-Breaking Demo Session of the 22nd Int. Society for Music Information Retrieval Conference.
11-11-2021
Vianna Lordelo C, Benetos E, Dixon S and Ahlbäck S (2021). Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes. 22nd International Society for Music Information Retrieval Conference (ISMIR) 9 Nov 2021 - 12 Nov 2021.
09-11-2021
Ozaki Y, McBride J, Benetos E, Pfordresher PQ, Six J, Tierney A, Proutskova P, Sakai E, Kondo H, Fukatsu H, Fujii S and Savage PE (2021). Agreement among human and annotated transcriptions of global songs. 22nd International Society for Music Information Retrieval Conference (ISMIR) 9 Nov 2021 - 12 Nov 2021.
09-11-2021
O'Hanlon K, Benetos E and Dixon S (2021). Detecting cover songs with pitch class key-invariant networks. IEEE International Workshop on Machine Learning for Signal Processing (MLSP) Gold Coast, Queensland, Australia 25 Oct 2021 - 28 Oct 2021.
25-10-2021
Holzapfel A, Benetos E, Killick A and Widdess R (2021). Humanities and Engineering Perspectives on Music Transcription. Digital Scholarship in the Humanities, Oxford University Press (OUP)
23-10-2021
Sarkar S, Benetos E and Sandler M (2021). Vocal Harmony Separation using Time-domain Neural Networks. 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH) Brno, Czech Republic 30 Aug 2021 - 3 Sep 2021.
30-08-2021
Bear H, Morfi V and Benetos E (2021). An evaluation of data augmentation methods for sound scene geotagging. 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH) Brno, Czech Republic 30 Aug 2021 - 3 Sep 2021.
30-08-2021
Zhao Y, Wang C, Fazekas G, Benetos E and Sandler M (2021). Violinist identification based on vibrato features. 29th European Signal Processing Conference (EUSIPCO) 23 Aug 2021 - 27 Aug 2021.
23-08-2021
Cheuk KW, Luo Y-J, Benetos E and Herremans D (2021). Revisiting the onsets and frames model with additive attention. International Joint Conference on Neural Networks (IJCNN) 18 Jul 2021 - 22 Jul 2021.
18-07-2021
Manco I, Benetos E, Quinton E and Fazekas G (2021). MusCaps: generating captions for music audio. International Joint Conference on Neural Networks (IJCNN) 18 Jul 2021 - 22 Jul 2021.
18-07-2021
Liu L and Benetos E (2021). From Audio to Music Notation. Handbook of Artificial Intelligence for Music, Editors: Miranda ER. 693-714.
03-07-2021
Ragano A, Benetos E and Hines A (2021). More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations. 13th International Conference on Quality of Multimedia Experience (QoMEX) 14 Jun 2021 - 17 Jun 2021.
14-06-2021
Singh S, Bear H and Benetos E (2021). Prototypical Networks for Domain Adaptation in Acoustic Scene Classification. IEEE International Conference on Acoustics, Speech and Signal Processing Toronto, Canada 6 Jun 2021 - 11 Jun 2021.
06-06-2021
Liu L, Morfi G-V and Benetos E (2021). Joint multi-pitch detection and score transcription for polyphonic piano music. IEEE International Conference on Acoustics, Speech and Signal Processing Toronto, Canada 6 Jun 2021 - 11 Jun 2021.
06-06-2021
Subramanian V, Gururani S, Benetos E and Sandler M (2021). Anomalous behaviour in loss-gradient based interpretability methods. RobustML workshop paper at ICLR 2021.
07-05-2021
Cheuk KW, Benetos E, Luo Y and Herremans D (2021). The effect of spectrogram reconstructions on automatic music transcription: an alternative approach to improve transcription accuracy. 25th International Conference on Pattern Recognition (ICPR2020) Milan, Italy 10 Jan 2021 - 15 Jan 2021.
10-01-2021
Lordelo C, Benetos E, Dixon S and Ahlbäck S (2021). Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes.
01-01-2021
2020
Vianna Lordelo C, Benetos E, Dixon S, Ahlbäck S and Ohlsson P (2020). Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 28, 81-85.
18-12-2020
Liu L, Morfi G-V and Benetos E (2020). Joint Piano-roll and Score Transcription for Polyphonic Piano Music. DMRN+15: Digital Music Research Network One-day Workshop London, UK 15 Dec 2020.
15-12-2020
Chettri B, Benetos E and Sturm BLT (2020). Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 28, 3018-3028.
09-11-2020
Chettri B, Kinnunen T and Benetos E (2020). Subband modeling for spoofing detection in automatic speaker verification. Odyssey 2020: The Speaker and Language Recognition Workshop Tokyo, Japan 1 Nov 2020 - 5 Nov 2020.
01-11-2020
Ragano A, Benetos E and Hines A (2020). Development of a Speech Quality Database Under Uncontrolled Conditions. 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) Shanghai, China 25 Oct 2020 - 29 Oct 2020.
25-10-2020
Pankajakshan A, Bear H, Subramanian V and Benetos E (2020). Memory Controlled Sequential Self Attention for Sound Recognition. 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) Shanghai, China 25 Oct 2020 - 29 Oct 2020.
25-10-2020
Mishra S, Benetos E, Sturm B and Dixon S (2020). Reliable Local Explanations for Machine Listening. International Joint Conference on Neural Networks (IJCNN) Glasgow, UK 19 Jul 2020 - 24 Jul 2020.
19-07-2020
Ycart A, Liu L, Benetos E and Pearce M (2020). Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription. Transactions of the International Society for Music Information Retrieval, Ubiquity Press vol. 3 (1), 68-81.
12-06-2020
Ragano A, Benetos E and Hines A (2020). Audio impairment recognition using a correlation-based feature representation. 12th International Conference on Quality of Multimedia Experience (QoMEX) Athlone, Ireland 26 May 2020 - 28 May 2020.
26-05-2020
Wang C, Lostanlen V, Benetos E and Chew E (2020). Playing Technique Recognition by Joint Time–Frequency Scattering. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020.
04-05-2020
Wei W, Zhu H, Benetos E and Wang Y (2020). A-CRNN: a domain adaptation model for sound event detection. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020.
04-05-2020
Martinez Ramirez M, Benetos E and Reiss J (2020). Modeling plate and spring reverberation using a DSP-informed deep neural network. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020.
04-05-2020
Subramanian V, Pankajakshan A, Benetos E, Xu N, McDonald S and Sandler M (2020). A Study on the Transferability of Adversarial Attacks in Sound Event Classification. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020) Barcelona, Spain 4 May 2020 - 8 May 2020.
04-05-2020
Ycart A, Liu L, Benetos E and Pearce MT (2020). Musical Features for Automatic Music Transcription Evaluation.
15-04-2020
Ycart A and Benetos E (2020). Learning and Evaluation Methodologies for Polyphonic Music Sequence Prediction with LSTMs. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 28 (1), 1328-1341.
14-04-2020
Chettri B, Kinnunen T and Benetos E (2020). Deep Generative Variational Autoencoding for Replay Spoof Detection in Automatic Speaker Verification. Computer Speech and Language, Elsevier vol. 63
19-03-2020
Martinez Ramirez M, Benetos E and Reiss J (2020). Deep Learning for Black-Box Modeling of Audio Effects. Applied Sciences, MDPI AG vol. 10 (2)
16-01-2020
2019
Liu L and Benetos E (2019). Automatic Music Accompaniment with a Chroma-based Music Data Representation. DMRN+14: Digital Music Research Network One-day Workshop.
17-12-2019
Holzapfel A and Benetos E (2019). Automatic music transcription and ethnomusicology: a user study. 20th conference of the International Society for Music Information Retrieval (ISMIR) Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019.
04-11-2019
Wang C, Benetos E, Lostanlen V and Chew E (2019). Adaptive Time–Frequency Scattering for Periodic Modulation Recognition in Music Signals. International Society for Music Information Retrieval Conference Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019.
04-11-2019
Ycart A, McLeod A, Benetos E and Yoshii K (2019). Blending acoustic and language model predictions for automatic music transcription. 20th conference of the International Society for Music Information Retrieval (ISMIR) Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019.
04-11-2019
Ycart A, Stoller D and Benetos E (2019). A Comparative Study of Neural Models for Polyphonic Music Sequence Transduction. 20th conference of the International Society for Music Information Retrieval (ISMIR) Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019.
04-11-2019
Wang C, Benetos E and Chew E (2019). CBF-periDB: A Chinese Bamboo Flute Dataset for Periodic Modulation Analysis. International Society for Music Information Retrieval Conference Late-Breaking Demo Session Delft, The Netherlands 4 Nov 2019 - 8 Nov 2019.
04-11-2019
Subramanian V, Benetos E and Sandler M (2019). Robustness of Adversarial Attacks in Sound Event Classification. 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) New York, USA 25 Oct 2019 - 26 Oct 2019.
25-10-2019
Pankajakshan A, Bear H and Benetos E (2019). Onsets, activity, and events: a multi-task approach for polyphonic sound event modelling. 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) New York, USA 25 Oct 2019 - 26 Oct 2019.
25-10-2019
Singh S, Pankajakshan A and Benetos E (2019). Audio tagging using a linear noise modelling layer. 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019) New York, USA 25 Oct 2019 - 26 Oct 2019.
25-10-2019
Vianna Lordelo C, Benetos E, Dixon S and Ahlbäck S (2019). Investigating kernel shapes and skip connections for deep learning-based harmonic-percussive separation. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2019 - 23 Oct 2019.
20-10-2019
Pankajakshan A, Bear H and Benetos E (2019). Polyphonic sound event and sound activity detection: a multi-task approach. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2019 - 23 Oct 2019.
20-10-2019
Bear H, Heittola T, Mesaros A, Benetos E and Virtanen T (2019). City classification from multiple real-world sound scenes. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2019 - 23 Oct 2019.
20-10-2019
Chettri B, Stoller D, Morfi V, Martinez Ramirez M, Benetos E and Sturm B (2019). Ensemble Models for Spoofing Detection in Automatic Speaker Verification. 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) Graz, Austria 15 Sep 2019 - 19 Sep 2019.
15-09-2019
Bear H, Nolasco I and Benetos E (2019). Towards joint sound scene and polyphonic sound event recognition. 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) Graz, Austria 15 Sep 2019 - 19 Sep 2019.
15-09-2019
Martinez Ramirez M, Benetos E and Reiss J (2019). A general-purpose deep learning approach to model time-varying audio effects. International Conference on Digital Audio Effects (DAFx-19) Birmingham, UK 2 Sep 2019 - 6 Sep 2019.
02-09-2019
Zhou Q, Feng Z and Benetos E (2019). Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF. Sensors, MDPI AG vol. 19 (14)
20-07-2019
Covas E and Benetos E (2019). Optimal Neural Network Feature Selection for Spatial-Temporal Forecasting. Chaos, AIP Publishing vol. 29 (6)
20-06-2019
Ragano A, Benetos E and Hines A (2019). Adapting the Quality of Experience Framework for Audio Archive Evaluation. 11th International Conference on Quality of Multimedia Experience Berlin, Germany 5 Jun 2019 - 7 Jun 2019.
05-06-2019
Wang C, Benetos E, Meng X and Chew E (2019). HMM-based Glissando Detection for Recordings of Chinese Bamboo Flute. Sound and Music Computing Conference Malaga, Spain 28 May 2019 - 31 May 2019.
28-05-2019
Lins F, Johann M, Benetos E and Schramm R (2019). Automatic Transcription of Diatonic Harmonica Recordings. IEEE International Conference on Acoustics, Speech, and Signal Processing Brighton, UK 12 May 2019 - 17 May 2019.
12-05-2019
Phaye SSR, Benetos E and Wang Y (2019). SubSpectralNet - Using sub-spectrogram based convolutional neural networks for acoustic scene classification. IEEE International Conference on Acoustics, Speech, and Signal Processing Brighton, UK 12 May 2019 - 17 May 2019.
12-05-2019
Mishra S, Stoller D, Benetos E, Sturm B and Dixon S (2019). GAN-based Generation and Automatic Selection of Explanations for Neural Networks. SafeML ICLR 2019 Workshop New Orleans, USA 6 May 2019.
06-05-2019
Nolasco I, Terenzi A, Cecchi S, Orcioni S, Bear H and Benetos E (2019). Audio-based identification of beehive states. IEEE International Conference on Acoustics, Speech, and Signal Processing Brighton, UK 12 May 2019 - 17 May 2019.
12-02-2019
Benetos E, Dixon S, Duan Z and Ewert S (2019). Automatic Music Transcription: An Overview. IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers vol. 36 (1), 20-30.
01-01-2019
2018
Chettri B, Mishra S, Sturm B and Benetos E (2018). Analysing the predictions of a CNN-based replay spoofing detection system. 2018 IEEE Workshop on Spoken Language Technology Athens, Greece 18 Dec 2018 - 21 Dec 2018.
18-12-2018
Bear H and Benetos E (2018). An extensible cluster-graph taxonomy for open set sound scene analysis. Workshop on Detection and Classification of Acoustic Scenes and Events Surrey, UK 19 Nov 2018 - 20 Nov 2018.
19-11-2018
Nolasco I and Benetos E (2018). To bee or not to bee: Investigating machine learning approaches for beehive sound recognition. 2018 Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2018) Surrey, UK 19 Nov 2018 - 20 Nov 2018.
19-11-2018
YCART A and BENETOS E (2018). A-MAPS: Augmented MAPS Dataset with Rhythm and Key Annotations. 19th International Society for Music Information Retrieval Conference Late-Breaking Demos Session Paris, France 23 Sep 2018 - 27 Sep 2018.
23-09-2018
WANG C, BENETOS E, MENG X and CHEW E (2018). Towards HMM-based glissando detection for recordings of Chinese bamboo flute. International Society for Music Information Retrieval Conference Late-Breaking Demos Session Paris, France 23 Sep 2018 - 27 Sep 2018.
23-09-2018
CHETTRI B, STURM BLT and BENETOS E (2018). Analysing replay spoofing countermeasure performance under varied conditions. IEEE International Workshop on Machine Learning for Signal Processing Aalborg, Denmark 17 Sep 2018 - 20 Sep 2018.
17-09-2018
Chettri B, Mishra S, Sturm BL and Benetos E (2018). A Study On Convolutional Neural Network Based End-To-End Replay Anti-Spoofing.
22-05-2018
YCART A and BENETOS E (2018). Polyphonic Music Sequence Transduction with Meter-Constrained LSTM Networks. IEEE International Conference on Acoustics, Speech and Signal Processing Calgary, Canada 15 Apr 2018 - 20 Apr 2018.
15-04-2018
Nakamura E, BENETOS E, Yoshii K and DIXON S (2018). Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization. IEEE International Conference on Acoustics, Speech and Signal Processing Calgary, Canada 15 Apr 2018 - 20 Apr 2018.
15-04-2018
Valero-Mas JJ, BENETOS E and Iñesta JM (2018). A Supervised Classification Approach for Note Tracking in Polyphonic Piano Transcription. Journal of New Music Research, Taylor & Francis (Routledge) vol. 47 (3), 249-263.
26-03-2018
Ali H, Tran SN, Benetos E and d'Avila Garcez AS (2018). Speaker recognition with hybrid features from a deep belief network. Neural Computing and Applications, Springer Verlag (Germany) vol. 29 (6), 13-19.
01-03-2018
Mesaros A, Heittola T, Benetos E, Foster P, Lagrange M, Virtanen T and Plumbley M (2018). Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers vol. 26 (2), 379-393.
01-02-2018
PANTELI M, BENETOS E and DIXON S (2018). A review of manual and computational approaches for the study of world music corpora. Journal of New Music Research, Taylor & Francis (Routledge) vol. 47 (2), 176-189.
08-01-2018
BENETOS E, STOWELL D and PLUMBLEY M (2018). Approaches to complex sound scene analysis. Computational Analysis of Sound Scenes and Events, Editors: Virtanen T, PLUMBLEY M and Ellis D. 215-242.
01-01-2018
(2018). Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France, September 23-27, 2018. Editors: Gómez E, Hu X, Humphrey E and Benetos E.
01-01-2018
2017
PANTELI M, BENETOS E and DIXON S (2017). A computational study on outliers in world music. PLoS ONE, Public Library of Science (PLoS) vol. 12 (12), 1-28.
18-12-2017
McLeod A, Schramm R, Steedman M and BENETOS E (2017). Automatic Transcription of Polyphonic Vocal Music. Applied Sciences, MDPI AG vol. 7 (12)
11-12-2017
Ycart A and Benetos E (2017). A study on LSTM networks for polyphonic music sequence modelling. 18th International Society for Music Information Retrieval Conference (ISMIR 2017) Suzhou, China 23 Oct 2017 - 27 Oct 2017.
23-10-2017
Schramm R, McLeod A, Steedman M and Benetos E (2017). Multi-pitch detection and voice assignment for a cappella recordings of multiple singers. 18th International Society for Music Information Retrieval Conference (ISMIR 2017) Suzhou, China 23 Oct 2017 - 27 Oct 2017.
23-10-2017
Lafay G, Benetos E and Lagrange M (2017). Sound Event Detection in Synthetic Audio: Analysis of the DCASE 2016 Task Results. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2017) New Paltz, NY, USA 15 Oct 2017 - 18 Oct 2017.
15-10-2017
YCART A and BENETOS E (2017). Neural Music Language Models: investigating the training process. International Conference of Students of Systematic Musicology.
13-09-2017
Benetos E (2017). Polyphonic note and instrument tracking using linear dynamical systems. 2017 AES International Conference on Semantic Audio Erlangen, Germany 22 Jun 2017 - 24 Jun 2017.
22-06-2017
Valero-Mas JJ, Benetos E and Iñesta JM (2017). Assessing the Relevance of Onset Information for Note Tracking in Piano Music Transcription. 2017 AES International Conference on Semantic Audio Erlangen, Germany 22 Jun 2017 - 24 Jun 2017.
22-06-2017
Schramm R and Benetos E (2017). Automatic Transcription of a Cappella Recordings from Multiple Singers. 2017 AES International Conference on Semantic Audio Erlangen, Germany 22 Jun 2017 - 24 Jun 2017.
22-06-2017
Stowell D, Benetos E and Gill LF (2017). On-Bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts. IEEE/ACM Transactions on Audio, Speech and Language Processing vol. 25 (6), 1193-1206.
23-05-2017
Benetos E, Lafay G, Lagrange M and Plumbley MD (2017). Polyphonic Sound Event Tracking using Linear Dynamical Systems. IEEE/ACM Transactions on Audio, Speech and Language Processing, IEEE vol. 25 (6), 1266-1277.
23-05-2017
Russell AJ, Benetos E and d'Avila Garcez AS (2017). On the Memory Properties of Recurrent Neural Models. International Joint Conference on Neural Networks (IJCNN 2017) Anchorage, Alaska, USA 14 May 2017 - 19 May 2017.
14-05-2017
Abdallah S, Benetos E, Gold N, Hargreaves S, Weyde T and Wolff D (2017). The Digital Music Lab: A Big Data Infrastructure for Digital Musicology. ACM Journal on Computing and Cultural Heritage, ACM vol. 10 (1)
01-01-2017
2016
YCART A and Benetos E (2016). Towards a Music Language Model for Audio Analysis. DMRN+11: Digital Music Research Network One-day Workshop 2016 Centre for Digital Music, Queen Mary University of London 20 Dec 2016.
20-12-2016
BENETOS E and Schramm R (2016). Automatic Transcription of Vocal Quartets. DMRN+11: Digital Music Research Network One-day Workshop 2016 Centre for Digital Music, Queen Mary University of London 20 Dec 2016.
20-12-2016
Valero-Mas JJ, Benetos E and Iñesta JM (2016). Classification-based Note Tracking for Automatic Music Transcription. 9th International Workshop on Machine Learning and Music Riva del Garda, Italy 23 Sep 2016.
23-09-2016
Abdallah S, Benetos E, Gold N, Hargreaves S, Weyde T and Wolff D (2016). Digital Music Lab: A Framework for Analysing Big Music Data. 24th European Signal Processing Conference Budapest, Hungary 29 Aug 2016 - 2 Sep 2016.
29-08-2016
Holzapfel A and Benetos E (2016). The Sousta corpus: Beat-informed automatic transcription of traditional dance tunes. 17th International Society for Music Information Retrieval Conference New York, USA 7 Aug 2016 - 11 Aug 2016.
07-08-2016
Panteli M, Benetos E and Dixon S (2016). Learning a feature space for similarity in world music. 17th International Society for Music Information Retrieval Conference New York, USA 7 Aug 2016 - 11 Aug 2016.
07-08-2016
Cheng T, Mauch M, Benetos E and Dixon S (2016). An attack/decay model for piano transcription. 17th International Society for Music Information Retrieval Conference New York, USA 7 Aug 2016 - 11 Aug 2016.
07-08-2016
Lafay G, Lagrange M, Rossignol M, Benetos E and Roebel A (2016). A morphological model for simulating acoustic scenes and its application to sound event detection. IEEE/ACM Transactions on Audio, Speech, and Language Processing, IEEE vol. 24 (10), 1854-1864.
01-07-2016
Panteli M, Benetos E and Dixon S (2016). Automatic detection of outliers in world music collections. Fourth International Conference on Analytical Approaches to World Music (AAWM 2016) New York, USA 8 Jun 2016 - 11 Jun 2016.
08-06-2016
Benetos E, Lafay G, Lagrange M and Plumbley MD (2016). Detection of overlapping acoustic events using a temporally-constrained probabilistic model. IEEE International Conference on Acoustics, Speech, and Signal Processing Shanghai, China 20 Mar 2016 - 25 Mar 2016.
20-03-2016
Sigtia S, Benetos E and Dixon S (2016). An End-to-End Neural Network for Polyphonic Piano Music Transcription. IEEE/ACM Transactions on Audio, Speech, and Language Processing, IEEE vol. 24 (5), 927-939.
23-02-2016
2015
Benetos E and Weyde T (2015). An efficient temporally-constrained probabilistic model for multiple-instrument music transcription. Editors: Wiering F and Müller M. 16th International Society for Music Information Retrieval Conference (ISMIR) Malaga, Spain 26 Oct 2015 - 30 Oct 2015.
26-10-2015
BENETOS E and Holzapfel A (2015). Automatic transcription of Turkish microtonal music. Journal of the Acoustical Society of America, Acoustical Society of America vol. 138 (4), 2118-2130.
14-10-2015
Stowell D, Giannoulis D, Benetos E, Lagrange M and Plumbley MD (2015). Detection and Classification of Acoustic Scenes and Events. IEEE Transactions on Multimedia vol. 17 (10), 1733-1746.
01-10-2015
Rossignol M, Lagrange M, Lafay G and BENETOS E (2015). Alternate level clustering for drum transcription. 23rd European Signal Processing Conference (EUSIPCO) Nice, France 31 Aug 2015 - 4 Sep 2015.
31-08-2015
Abdallah S, Alencar-Brayner A, BENETOS E, Cottrell S, Dykes J, Gold N, Kachkaev A, Mahey M, Tidhar D, Tovell A, Weyde T and Wolff D (2015). Automatic transcription and pitch analysis of the British Library World & Traditional Music Collection. 5th International Workshop on Folk Music Analysis Paris, France 10 Jun 2015 - 12 Jun 2015.
10-06-2015
Sigtia S, Benetos E, Boulanger-Lewandowski N, Weyde T, Garcez ASDA and Dixon S (2015). A Hybrid Recurrent Neural Network for Music Transcription. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Brisbane, Australia 19 Apr 2015 - 24 Apr 2015.
19-04-2015
2014
Benetos E, Badeau R, Weyde T and Richard G (2014). Template Adaptation for Improving Automatic Music Transcription. Editors: Wang H-M, Yang Y-H and Lee JH. 15th International Society for Music Information Retrieval Conference (ISMIR) Taipei, Taiwan 27 Oct 2014 - 31 Oct 2014.
27-10-2014
Tidhar D, Dixon S, Benetos E and Weyde T (2014). The temperament police. Early Music, Oxford University Press vol. 42 (4), 579-590.
11-10-2014
Weyde T, Cottrell S, Dykes J, Benetos E, Wolff D, Tidhar D, Gold N, Abdallah S, Plumbley M, Dixon S, Barthet M, Mahey M, Tovell A and Alencar-Brayner A (2014). Big Data for Musicology. Editors: Page K and Fields B. 1st International Digital Libraries for Musicology workshop London, UK 12 Sep 2014.
12-09-2014
Wolff D, Tidhar D, Benetos E, Dumon E, Cherla S and Weyde T (2014). Incremental dataset definition for large scale musicological research. Editors: Page K and Fields B. 1st International Digital Libraries for Musicology workshop London, UK 12 Sep 2014.
12-09-2014
Tran S, Benetos E and d'Avila Garcez A (2014). Learning motion-difference features using Gaussian restricted Boltzmann machines for efficient human action recognition. 2014 International Joint Conference on Neural Networks (IJCNN) Beijing, China 6 Jul 2014 - 11 Jul 2014.
06-07-2014
Benetos E and Holzapfel A (2014). Incorporating pitch class profiles for improving automatic transcription of Turkish makam music. Editors: Holzapfel A. 4th International Workshop on Folk Music Analysis Istanbul, Turkey 12 Jun 2014 - 13 Jun 2014.
13-06-2014
Giannoulis D, Benetos E, Klapuri A and Plumbley M (2014). Improving instrument recognition in polyphonic music through system integration. IEEE International Conference on Acoustics, Speech, and Signal Processing Florence, Italy 4 May 2014 - 9 May 2014.
04-05-2014
Benetos E, Jansson A and Weyde T (2014). Improving automatic music transcription through key detection. Editors: Dittmar C, Fazekas G and Ewert S. AES 53rd International Conference on Semantic Audio London, UK 27 Jan 2014 - 29 Jan 2014.
29-01-2014
Sigtia S, Benetos E, Cherla S, Weyde T, Garcez A and Dixon S (2014). RNN-based Music Language Models for Improving Automatic Music Transcription.
01-01-2014
Benetos E, Ewert S and Weyde T (2014). Automatic Transcription Of Pitched And Unpitched Sounds From Polyphonic Music.
01-01-2014
BARTHET M, Benetos E, Cottrell S, Dixon S, Dykes J, Gold N, Mahey M, Plumbley MD, Tidhar D, Weyde T and Wolff D (2014). The DML Research Project: Digital Music Lab - Analysing Big Music Data.
01-01-2014
2013
Benetos E, Dixon S, Giannoulis D, Kirchhoff H and Klapuri A (2013). Automatic music transcription: Challenges and future directions. Journal of Intelligent Information Systems vol. 41 (3), 407-434.
01-12-2013
Benetos E and Holzapfel A (2013). Automatic transcription of Turkish makam music. 14th International Society for Music Information Retrieval Conference Curitiba, PR, Brazil 4 Nov 2013 - 8 Nov 2013.
08-11-2013
Benetos E and Weyde T (2013). Explicit duration hidden Markov models for multiple-instrument polyphonic music transcription. 14th International Society for Music Information Retrieval Conference Curitiba, PR, Brazil 4 Nov 2013 - 8 Nov 2013.
08-11-2013
de Valk R, Weyde T and Benetos E (2013). A machine learning approach to voice separation in lute tablature. Editors: Britto AS, Gouyon F and Dixon S. 14th International Society for Music Information Retrieval Conference Curitiba, PR, Brazil 4 Nov 2013 - 8 Nov 2013.
04-11-2013
Giannoulis D, Benetos E, Stowell D, Rossignol M, Lagrange M and Plumbley M (2013). Detection and classification of acoustic scenes and events: an IEEE AASP challenge. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics New Paltz, NY, USA 20 Oct 2013 - 23 Oct 2013.
23-10-2013
Giannoulis D, Stowell D, Benetos E, Rossignol M, Lagrange M and Plumbley MD (2013). A database and challenge for acoustic scene classification and event detection. 21st European Signal Processing Conference Marrakech, Morocco.
01-09-2013
Benetos E, Cherla S and Weyde T (2013). An efficient shift-invariant model for polyphonic music transcription. 6th International Workshop on Machine Learning and Music Prague, Czech Republic.
01-09-2013
Serra X, Magas M, Benetos E, Chudy M, Dixon S, Flexer A, Gomez E, Gouyon F, Herrera P, Jorda S, Paytuvi O, Peeters G, Schlüter J, Vinet H and Widmer G (2013). Roadmap for Music Information ReSearch. Editors: Peeters G.
02-05-2013
Benetos E and Dixon S (2013). Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model. Journal of the Acoustical Society of America vol. 133 (3), 1727-1741.
01-03-2013
2012
BENETOS E (2012). Automatic transcription of polyphonic music exploiting temporal evolution. Editors: Dixon S.
31-12-2012
BENETOS E, Dixon S, Giannoulis D, Kirchhoff H and Klapuri A (2012). Automatic Music Transcription: Breaking the Glass Ceiling. Editors: Gouyon F, Herrera P, Martins LG and Müller M. 13th International Society for Music Information Retrieval Conference (ISMIR 2012) Porto, Portugal 8 Oct 2012 - 12 Oct 2012.
12-10-2012
Benetos E and Dixon S (2012). Temporally-constrained convolutive probabilistic latent component analysis for multi-pitch detection.
01-03-2012
Benetos E, Klapuri A and Dixon S (2012). Score-Informed Transcription for Automatic Piano Tutoring.
01-01-2012
Benetos E and Dixon S (2012). A Shift-Invariant Latent Variable Model for Automatic Music Transcription. Computer Music Journal vol. 36 (4), 81-94.
01-01-2012
Benetos E, Lagrange M and Dixon S (2012). Characterisation of acoustic scenes using a temporally-constrained shift-invariant model.
01-01-2012
2011
Benetos E and Dixon S (2011). Joint multi-pitch detection using harmonic envelope estimation for polyphonic music transcription. IEEE Journal on Selected Topics in Signal Processing vol. 5 (6), 1111-1123.
01-10-2011
Benetos E and Dixon S (2011). Multiple-instrument polyphonic music transcription using a convolutive probabilistic model. 8th Sound and Music Computing Conference Padova, Italy 6 Jul 2011 - 9 Jul 2011.
01-07-2011
Mearns L, Benetos E and Dixon S (2011). Automatically detecting key modulations in J.S. Bach chorale recordings.
01-07-2011
Benetos E and Dixon S (2011). Polyphonic music transcription using note onset and offset detection.
01-01-2011
Benetos E and Dixon S (2011). A Temporally-Constrained Convolutive Probabilistic Model for Pitch Detection.
01-01-2011
Dixon S, Tidhar D and Benetos E (2011). The temperament police: The truth, the ground truth, and nothing but the truth. 12th International Society for Music Information Retrieval Conference Miami, Florida, USA 24 Oct 2011 - 28 Oct 2011.
01-01-2011
2010
Benetos E and Dixon S (2010). Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution.
01-09-2010
Benetos E and Stylianou Y (2010). Auditory spectrum-based pitched instrument onset detection. IEEE Transactions on Audio, Speech and Language Processing vol. 18 (8), 1968-1977.
01-01-2010
Benetos E and Kotropoulos C (2010). Non-negative tensor factorization applied to music genre classification. IEEE Transactions on Audio, Speech and Language Processing vol. 18 (8), 1955-1967.
01-01-2010
Anglade A, Benetos E, Mauch M and Dixon S (2010). Improving music genre classification using automatically induced harmony rules. Journal of New Music Research vol. 39 (4), 349-361.
01-01-2010
2009
Benetos E, Holzapfel A and Stylianou Y (2009). Pitched instrument onset detection based on auditory spectra.
01-01-2009
2008
Benetos E and Kotropoulos C (2008). A tensor-based approach for automatic music genre classification.
01-08-2008
Spachos D, Zlatintsi A, Moschou V, Antonopoulos P, Benetos E, Kotti M, Tzimouli K, Kotropoulos C, Nikolaidis N, Maragos P and Pitas I (2008). MUSCLE movie-database: a multimodal corpus with rich annotation for dialogue and saliency detection.
01-05-2008
Panagakis I, Benetos E and Kotropoulos C (2008). Music Genre Classification: A Multilinear Approach. Editors: Bello JP, Chew E and Turnbull D.
01-01-2008
Kotti M, Benetos E and Kotropoulos C (2008). Computationally efficient and robust BIC-based speaker segmentation. IEEE Transactions on Audio, Speech and Language Processing vol. 16 (5), 920-933.
01-01-2008
BENETOS E, Siatras S, Kotropoulos C, Nikolaidis N and Pitas I (2008). Movie analysis with emphasis to dialogue and action scene detection. Multimodal Processing and Interaction, Editors: Maragos PA, Potamianos A and Gros P. 157-177.
01-01-2008
2007
Kotti M, Benetos E and Kotropoulos C (2007). Neural network-based movie dialogue detection.
01-08-2007
Benetos E, Kotti M and Kotropoulos C (2007). Large scale musical instrument identification.
01-07-2007
Moschou V, Kotti M, Benetos E and Kotropoulos C (2007). Systematic comparison of BIC-based speaker segmentation systems.
01-01-2007
Kotti M, Benetos E, Kotropoulos C and Pitas I (2007). A neural network approach to audio-assisted movie dialogue detection. Neurocomputing vol. 71, 157-166.
01-01-2007
2006
Benetos E, Kotti M and Kotropoulos C (2006). Applying Supervised Classifiers Based on Non-negative Matrix Factorization to Musical Instrument Classification.
01-01-2006
Kotti M, Benetos E and Kotropoulos C (2006). Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme.
01-01-2006
Kotti M, Martins LPM, Benetos E, Cardoso JS and Kotropoulos C (2006). Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches.
01-01-2006
Benetos E, Kotti M and Kotropoulos C (2006). Musical instrument classification using non-negative matrix factorization algorithms.
01-01-2006
Benetos E, Kotti M and Kotropoulos C (2006). Musical instrument classification using non-negative matrix factorization algorithms and subset feature selection.
01-01-2006
Benetos E, Kotropoulos C, Lidy T and Rauber A (2006). Testing supervised classifiers based on non-negative matrix factorization to musical instrument classification.
01-01-2006
2005
Benetos E, Kotti M, Kotropoulos C, Burred JJ, Eisenberg G, Haller M and Sikora T (2005). Comparison of subspace analysis-based and statistical model-based algorithms for musical instrument classification.
01-10-2005
Grants of specific relevance to the Centre for Multimodal AI
UKRI Centre for Doctoral Training in Artificial Intelligence and Music
Simon Dixon, Emmanouil Benetos, Nicholas Bryan-Kinns, Mark Sandler, Andrew McPherson, Mathieu Barthet, Gyorgy Fazekas, Ekaterina Ivanova, Anna Xambo Sedo and Charalampos Saitis
£6,519,934 EPSRC Engineering and Physical Sciences Research Council (01-07-2019 - 31-08-2028)
Project Maestro - AI Musical Analysis Platform
Emmanouil Benetos and Simon Dixon
£166,349 Innovate UK (01-07-2024 - 30-06-2026)
Spotify PhD project - Style classification of podcasts using audio
Emmanouil Benetos
£33,000 Spotify Ltd (01-03-2024 - 28-02-2026)
Online Speech Enhancement in Scenarios with Low Direct-to-Reverberant-Ratio
Emmanouil Benetos and Aidan Hogg
£65,621 L-ACOUSTICS UK LIMITED (01-09-2024 - 28-02-2025)
Industry-scale machine listening for music and audio data
Simon Dixon and Emmanouil Benetos
£108,000 Spotify Ltd (14-09-2020 - 31-01-2025)
Graph Networks for Explainable Artificial Intelligence
Andrea Cavallaro and Emmanouil Benetos
£293,434 EPSRC Engineering and Physical Sciences Research Council (01-08-2021 - 31-12-2024)