Faculty of Science and Engineering - Research

Dr Iran Roman

Lecturer

School of Electronic Engineering and Computer Science
Queen Mary University of London

i.roman@qmul.ac.uk

Centre for Multimodal AI
Centre for Fundamentals of AI and Computational Theory
Centre for Human-Centred Computing

Google Scholar

Research

Theoretical neuroscience, Machine Perception, Artificial Intelligence

Interests

Iran R. Roman is a Lecturer at the School of Electronic Engineering and Computer Science of Queen Mary University of London. Within Queen Mary, he is a member of the Centre for Multimodal AI, Centre for Digital Music, Centre for Human-Centred Computing, the Computer Vision group, and the Cognitive Science group.

His research area is machine perception, with the goal of creating algorithms that allow computers to perceive environments as living agents do. To this end, Iran has developed algorithms that leverage multimodal signals to sense, identify, and track objects in the real world. These algorithms draw inspiration from the neural mechanisms that allow living organisms to carry out similar tasks. Iran’s work has found applications in products at companies such as Apple, Tesla, Raytheon/BBN, and Plantronics. His research has been funded by the US National Science Foundation (NSF), the US Defense Advanced Research Projects Agency (DARPA), and the Howard Hughes Medical Institute (HHMI).

On the academic service side, he serves as a reviewer for IEEE ICASSP, IEEE MLSP, ISMIR, eLife, and Music & Science. Iran is also a volunteer professor for the National Autonomous University of Mexico and the organizer of the annual Deep Learning for Music Information Retrieval workshop at the Center for Computer Research in Music and Acoustics at Stanford University.

Publications

2026

LLMs can read music, but struggle to hear it. An evaluation of core music perception tasks
Carone B Roman I Ripollés P
Proceedings of Machine Learning Research

QMRO

26-01-2026

2025

AudibleLight (RC): A Controllable, End-to-End API for Soundscape Synthesis Across Ray-Traced & Real-World Measured Acoustics
DMRN+20 Digital Music Research Network One-day Workshop 2025, King’s College London (Bush House), London, UK, 16 Dec 2025.

QMRO

15-12-2025

Towards Real-Time, Stable Mapping from Multimodal Sensing to Interpretable Timbre Axes
DMRN+20 Digital Music Research Network One-day Workshop 2025, King’s College London (Bush House), London, UK, 16 Dec 2025.
15-12-2025

Relevant Publication

Evaluating Multimodal Large Language Models on Core Music Perception Tasks
Carone B Roman Guzman I Ripollés P
39th Conference on Neural Information Processing Systems (NeurIPS 2025) Workshop: AI for Music.

QMRO

07-12-2025

Relevant Publication

Latent Acoustic Mapping for Direction of Arrival Estimation: A Self-Supervised Approach
Roman A Roman I Bello J
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

QMRO

12-10-2025

Relevant Publication

Vibe Sorcery: Integrating Emotion Recognition with Generative Music for Playlist Curation
Urrego-Gómez I Colton S Roman I
International Society for Music Information Retrieval: 1st Workshop on Large Language Models for Music & Audio (LLM4MA).
20-09-2025

Relevant Publication

Decoding Melodic Acoustic Features from Neural Data
Bozilovic Z Roman I
AES International Conference on Artificial Intelligence and Machine Learning for Audio.

QMRO

08-09-2025

Relevant Publication

Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds
Chang A Li Y Roman I Poeppel D
Interspeech.

QMRO

17-08-2025

Relevant Publication

Combining Recurrent & Bayesian Models for Action Anticipation with Multiple Cues
Zimokha M Jamone L Roman I
Cognitive Computational Neuroscience.

QMRO

12-08-2025

Relevant Publication

Toward Affective Empathy in AI: Encoding Internal Representations of “Artificial Pain”
Wang A Roman I
Cognitive Computational Neuroscience.

QMRO

12-08-2025

Relevant Publication

Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware
Pedroza H Abreu W Corey R Roman I
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025.

DOI 10.1109/ICASSP49660.2025.10887996

QMRO

06-04-2025

Relevant Publication

Musical neurodynamics
Harding EE Kim JC Demos A Roman I Tichko P
Nature Reviews Neuroscience, Nature Research

DOI 10.1038/s41583-025-00915-4

QMRO

18-03-2025

Relevant Publication

Perceptually-Guided Acoustic Foveation
Peng X Chen K Roman I
2025 IEEE Conference Virtual Reality and 3D User Interfaces (VR). vol. 00, 450-460.

DOI 10.1109/vr59515.2025.00069

QMRO

12-03-2025

Relevant Publication

Design and Implementation of the Transparent, Interpretable, and Multimodal (TIM) AR Personal Assistant
McGowan E Rulff J Castelo S Wu G Chen S Roman IR Dias FF Qian J
IEEE Computer Graphics and Applications, Institute of Electrical and Electronics Engineers (IEEE) vol. 45 (1), 28-42.

DOI 10.1109/mcg.2025.3549696

QMRO

01-01-2025

2024

Relevant Publication

HuBar: A Visual Analytics Tool to Explore Human Behavior Based on fNIRS in AR Guidance Systems
Castelo S Rulff J Solunke P McGowan E Wu G Roman I Lopez R Sun Q et al.
IEEE Transactions on Visualization and Computer Graphics, Institute of Electrical and Electronics Engineers (IEEE) vol. 31 (1), 119-129.

DOI 10.1109/tvcg.2024.3456388

QMRO

25-11-2024

Relevant Publication

Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling
Pedroza H Abreu W Corey R
27th International Conference on Digital Audio Effects (DAFx24).

QMRO

03-09-2024

Relevant Publication

Robust DoA Estimation from Deep Acoustic Imaging
Roman AS Roman IR
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1321-1325.

DOI 10.1109/icassp48485.2024.10447883

QMRO

19-04-2024

Relevant Publication

Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms
Roman IR Ick C Roman AS McFee B
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1221-1225.

DOI 10.1109/icassp48485.2024.10446118

QMRO

19-04-2024

2023

: Visualization of AI-Assisted Task Guidance in AR
Castelo S Rulff J McGowan E Steers B Wu G Chen S Roman I Brewer E et al.
IEEE Transactions on Visualization and Computer Graphics, Institute of Electrical and Electronics Engineers (IEEE) vol. 30 (1), 1313-1323.

DOI 10.1109/tvcg.2023.3327396

QMRO

02-11-2023

Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions
Kushwaha SS Roman IR Fuentes M
2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). vol. 00, 1-5.

DOI 10.1109/waspaa58266.2023.10248194

QMRO

25-10-2023

Exploring Approaches to Multi-Task Automatic Synthesizer Programming
Faronbi D Roman I
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1-5.

DOI 10.1109/icassp49357.2023.10095540

QMRO

10-06-2023

Hebbian learning with elasticity explains how the spontaneous motor tempo affects music performance synchronization
Roman IR Roman AS Kim JC Large EW
PLOS Computational Biology, Public Library of Science (PLOS) vol. 19 (6)

DOI 10.1371/journal.pcbi.1011154

QMRO

07-06-2023

Relevant Publication

F0 analysis of Ghanaian pop singing reveals progressive alignment with equal temperament over the past three decades: a case study
Roman Guzman I
20th Sound and Music Computing Conference, SMC 2023.

QMRO

01-06-2023

Dynamic models for musical rhythm perception and coordination
Large EW Roman I Kim JC Cannon J Trainor LJ
Frontiers in Computational Neuroscience, Frontiers vol. 17

DOI 10.3389/fncom.2023.1151895

QMRO

17-05-2023

2022

Reconstructing room scales with a single sound for augmented reality displays
Liang BS Liang AS Roman I Weiss T Sun Q
Journal of Information Display, Taylor & Francis vol. 24 (1), 1-12.

DOI 10.1080/15980316.2022.2145377

QMRO

15-11-2022

Relevant Publication

Analyzing the effect of equal-angle spatial discretization on sound event localization and detection
Roman Guzman I
Detection and Classification of Acoustic Scenes and Events 2022.

QMRO

03-11-2022

2021

Relevant Publication

micarraylib: Software for Reproducible Aggregation, Standardization, and Signal Processing of Microphone Array Datasets.
Roman Guzman I Bello J
Detection and Classification of Acoustic Scenes and Events 2021.

QMRO

15-11-2021

Analyzing Pitch Content In Traditional Ghanaian Seperewa Songs
Walls K Roman I Van Ert K Harper C Adu-Gilmore L
1st Latin American Music Information Retrieval (LAMIR) workshop.

DOI 10.5281/zenodo.14908040

QMRO

The MUSE Benchmark: Probing Music Perception and Auditory Relational Reasoning in Audio LLMs
ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4 May 2026 - 8 May 2026.

QMRO