Dr Aidan Hogg
PhD, DIC, MEng, ACGI, AFHEA
Lecturer in Computer Science
School of Electronic Engineering and Computer Science
Queen Mary University of London
Research
spatial acoustics and immersive audio, statistical signal processing, spatial hearing and hearing aid technologies, acoustical virtual and augmented reality, speech and audio processing
Interests
My current research focuses on using deep learning to capture head-related transfer functions and, more generally, spatial acoustics and immersive audio. Other research interests include speaker diarization and statistical signal processing for audio applications. More information about my current research can be found at www.aidanhogg.uk.
Publications
Publications of specific relevance to the Centre for Multimodal AI
2024
Hu X, Picinali L, Li J and Hogg A (2024). HRTF spatial upsampling in the spherical harmonics domain employing a generative adversarial network. International Conference on Digital Audio Effects (DAFx) 3 Sep 2024 - 7 Sep 2024.
03-09-2024
Hogg AOT, Jenkins M, Liu H, Squires I, Cooper SJ and Picinali L (2024). HRTF Upsampling With a Generative Adversarial Network Using a Gnomonic Equiangular Projection. IEEE/ACM Transactions on Audio Speech and Language Processing, Institute of Electrical and Electronics Engineers (IEEE) vol. 32, 2085-2099.
11-03-2024
Hogg A, Liu H, Jenkins M and Picinali L (2024). Exploring the impact of transfer learning on GAN-based HRTF upsampling. Proceedings of the 10th Convention of the European Acoustics Association Forum Acusticum 2023.
17-01-2024
2023
Engel I, Daugintis R, Vicente T, Hogg AOT, Pauwels J, Tournier AJ and Picinali L (2023). The SONICOM HRTF Dataset. Journal of the Audio Engineering Society, Audio Engineering Society vol. 71 (5), 241-253.
09-05-2023
2022
McKnight SW, Hogg AOT, Neo VW and Naylor PA (2022). Studying Human-Based Speaker Diarization and Comparing to State-of-the-Art Systems. 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).
10-11-2022
Neo VW, Weiss S, McKnight SW, Hogg AOT and Naylor PA (2022). Polynomial Eigenvalue Decomposition-Based Target Speaker Voice Activity Detection in the Presence of Competing Talkers. 2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
08-01-2022
2021
McKnight S, Hogg A, Neo V and Naylor P (2021). A Study of Salient Modulation Domain Features for Speaker Identification. 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).
14-12-2021
Hogg AOT, Neo VW, Weiss S, Evers C and Naylor PA (2021). A Polynomial Eigenvalue Decomposition MUSIC Approach for Broadband Sound Source Localization. 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).
20-10-2021
Hogg AOT, Evers C and Naylor PA (2021). Multichannel Overlapping Speaker Segmentation Using Multiple Hypothesis Tracking Of Acoustic And Spatial Features. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
11-06-2021
Hogg AOT, Evers C, Moore AH and Naylor PA (2021). Overlapping Speaker Segmentation Using Multiple Hypothesis Tracking of Fundamental Frequency. IEEE/ACM Transactions on Audio Speech and Language Processing, Institute of Electrical and Electronics Engineers (IEEE) vol. 29, 1479-1490.
18-03-2021
McKnight SW, Hogg AOT and Naylor PA (2021). Analysis of Phonetic Dependence of Segmentation Errors in Speaker Diarization. 2020 28th European Signal Processing Conference (EUSIPCO).
21-01-2021
2019
Hogg AOT, Evers C and Naylor PA (2019). Multiple Hypothesis Tracking for Overlapping Speaker Segmentation. 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).
23-10-2019
Hogg AOT, Evers C and Naylor PA (2019). Speaker Change Detection Using Fundamental Frequency with Application to Multi-talker Segmentation. ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
17-05-2019
Sharma D, Hogg AOT, Wang Y, Nour-Eldin A and Naylor PA (2019). Non-Intrusive POLQA Estimation of Speech Quality using Recurrent Neural Networks. 2019 27th European Signal Processing Conference (EUSIPCO).
06-01-2019
Grants
Grants of specific relevance to the Centre for Multimodal AI
Online Speech Enhancement in Scenarios with Low Direct-to-Reverberant-Ratio
Emmanouil Benetos and Aidan Hogg
£65,621 L-ACOUSTICS UK LIMITED (01-09-2024 - 28-02-2025)