Faculty of Science and Engineering - Research

Dr Georgios Tzimiropoulos

Georgios Tzimiropoulos

Senior Lecturer

School of Electronic Engineering and Computer Science
Queen Mary University of London

g.tzimiropoulos@qmul.ac.uk

www.qmul.ac.uk/eecs/people/profiles/tzimiropoulosgeorgios.html

Centre for Multimodal AI

Research
Publications
Grants
Research Group
News

Research

Computer Vision, Deep Learning

Interests

My research interests are mainly in the problems of image & video recognition, detection and tracking, pose estimation, image & video generation, 3D reconstruction and super-resolution, with humans and their actions being the focal point of my research. I have approached these problems mainly using tools from Mathematical Optimization and Machine Learning. My current focus is on Compute & Data Efficient Deep Learning and its application to video recognition.

Publications

2025

Compress & Cache: Vision token compression for efficient generation and retrieval
Bulat A Ouali Y Tzimiropoulos G
NeurIPS 2025. The Thirty-Ninth Annual Conference on Neural Information Processing Systems..

QMRO

29-10-2025

FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
Yang H Bulat A Hadji I Pham HX Zhu X Tzimiropoulos G Martinez B
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). vol. 00, 2459-2468.

DOI 10.1109/cvpr52734.2025.00235

QMRO

17-06-2025

VladVA: Discriminative Fine-tuning of LVLMs
Ouali Y Bulat A Xenos A Zaganidis A Metaxas IM Martinez B Tzimiropoulos G
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). vol. 00, 4101-4111.

DOI 10.1109/cvpr52734.2025.00388

QMRO

17-06-2025

Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning
Hadji I Noroozi M Escorcia V Zaganidis A Martinez B Tzimiropoulos G
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). vol. 00, 12789-12798.

DOI 10.1109/cvpr52734.2025.01193

QMRO

17-06-2025

Vision-Free Retrieval: Rethinking Multimodal Search with Textual Scene Descriptions
Ntinou I Xenos A Ouali Y Bulat A Tzimiropoulos G
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing., 14057-14073.

DOI 10.18653/v1/2025.emnlp-main.709

QMRO

01-01-2025

2024

Memsvd: Long-Range Temporal Structure Capturing Using Incremental SVD
Ntinou I Sanchez E
2024 IEEE International Conference on Image Processing (ICIP). vol. 00, 458-464.

DOI 10.1109/icip51287.2024.10647706

30-10-2024

MobileQuant: Mobile-friendly Quantization for On-device Language Models
Tan F Lee R Dudziak Ł Hu SX Bhattacharya S Hospedales T Tzimiropoulos G Martinez B
In Arxiv

DOI 10.48550/arxiv.2408.13933

04-10-2024

Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Maniadis Metaxas I Tzimiropoulos G Patras I
European Conference on Computer Vision 2024 29 Sep 2024 - 4 Oct 2024.

QMRO

29-09-2024

Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Metaxas IM Tzimiropoulos G Patras I
In Arxiv

DOI 10.48550/arxiv.2407.11168

15-07-2024

Relevant Publication

QBB: Quantization with Binary Bases for LLMs
Bulat A Ouali Y Tzimiropoulos G
Advances in Neural Information Processing Systems 37., 3209-3228.

DOI 10.52202/079017-0105

QMRO

01-01-2024

Relevant Publication

CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition
Patras I Song S Sun Z Tzimiropoulos G
Advances in Neural Information Processing Systems 37., 35612-35638.

DOI 10.52202/079017-1123

QMRO

01-01-2024

Relevant Publication

Efficient Vision-Language pre-training via domain-specific learning for human activities
Bulat A Ouali Y Guerrero R Martinez B
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing., 7978-8000.

DOI 10.18653/v1/2024.emnlp-main.454

QMRO

01-01-2024

2023

Relevant Publication

Language-Aware Soft Prompting: Text-to-Text Optimization for Few- and Zero-Shot Adaptation of V &L Models
Bulat A Tzimiropoulos G
International Journal of Computer Vision, Springer Nature vol. 132 (4), 1108-1125.

DOI 10.1007/s11263-023-01904-9

QMRO

25-10-2023

Black Box Few-Shot Adaptation for Vision-Language models
Ouali Y Bulat A Matinez B
2023 IEEE/CVF International Conference on Computer Vision (ICCV). vol. 00, 15488-15500.

DOI 10.1109/iccv51070.2023.01424

QMRO

06-10-2023

Relevant Publication

FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Bulat A Guerrero R Tzimiropoulos G
2023 IEEE/CVF International Conference on Computer Vision (ICCV). vol. 00, 11759-11768.

DOI 10.1109/iccv51070.2023.01083

QMRO

06-10-2023

Relevant Publication

ReGen: A good Generative zero-shot video classifier should be Rewarded
Bulat A Martinez B
2023 IEEE/CVF International Conference on Computer Vision (ICCV). vol. 00, 13477-13487.

DOI 10.1109/iccv51070.2023.01244

QMRO

06-10-2023

Relevant Publication

HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces
Bounareli S Tzelepis C Argyriou V Patras I Tzimiropoulos G
2023 IEEE/CVF International Conference on Computer Vision (ICCV). vol. 00, 7115-7125.

DOI 10.1109/iccv51070.2023.00657

QMRO

06-10-2023

Relevant Publication

Bayesian Prompt Learning for Image-Language Model Generalization
Derakhshani MM Sanchez E Bulat A Da Costa VGT Martinez B
2023 IEEE/CVF International Conference on Computer Vision (ICCV). vol. 00, 15191-15200.

DOI 10.1109/iccv51070.2023.01398

QMRO

06-10-2023

Relevant Publication

LASP: Text-to-Text Optimization for Language-Aware Soft Prompting of Vision & Language Models
Bulat A
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). vol. 00, 23232-23241.

DOI 10.1109/cvpr52729.2023.02225

QMRO

24-06-2023

Relevant Publication

From Keypoints to Object Landmarks via Self-Training Correspondence: A Novel Approach to Unsupervised Landmark Discovery
Mallis D Sanchez E Bell M
IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers (IEEE) vol. 45 (7), 8390-8404.

DOI 10.1109/tpami.2023.3234212

QMRO

05-06-2023

2022

Relevant Publication

Part-based Face Recognition with Vision Transformers
Sun Z
British Machine Vision Conference.

QMRO

21-11-2022

Relevant Publication

Finding Directions in GAN’s Latent Space for Neural Face Reenactment
Bounareli S Tzimiropoulos G
British Machine Vision Conference.

QMRO

21-11-2022

Relevant Publication

EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers
Pan J Bulat A Tan F Zhu X Li H
Lecture Notes in Computer Science. vol. 13671, 294-311.

DOI 10.1007/978-3-031-20083-0_18

QMRO

01-01-2022

Relevant Publication

Pre-training Strategies and Datasets for Facial Representation Learning
Bulat A Cheng S Yang J Garbett A
Lecture Notes in Computer Science. vol. 13673, 107-125.

DOI 10.1007/978-3-031-19778-9_7

QMRO

01-01-2022

2021

Space-time Mixing Attention for Video Transformer
Bulat A Perez-Rua J-M Tzimiropoulos G
Thirty-fifth Conference on Neural Information Processing Systems.

QMRO

06-12-2021

Relevant Publication

WarpedGANSpace: Finding non-linear RBF paths in GAN latent space
Tzelepis C Tzimiropoulos G Patras I
2021 IEEE/CVF International Conference on Computer Vision (ICCV). vol. 00, 6373-6382.

DOI 10.1109/iccv48922.2021.00633

QMRO

17-10-2021

Bit-Mixer: Mixed-precision networks with runtime bit-width selection
Bulat A
2021 IEEE/CVF International Conference on Computer Vision (ICCV). vol. 00, 5168-5177.

DOI 10.1109/iccv48922.2021.00514

QMRO

17-10-2021

Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition
Sanchez E Valstar M Tzimiropoulos G
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). vol. 00, 9070-9080.

DOI 10.1109/cvpr46437.2021.00896

QMRO

25-06-2021

Knowledge distillation via softmax regression representation learning
Yang J Martinez B Bulat A
International Conference on Learning Representations (ICLR).

QMRO

04-05-2021

High-Capacity Expert Binary Networks
Bulat A Tzimiropoulos G
International Conference on Learning Representations (ICLR).

QMRO

04-05-2021

Self-supervised Learning of Person-specific Facial Dynamics for Automatic Personality Recognition
Song S Sanchez E Tzimiropoulos G Shen L Valstar M
IEEE Transactions on Affective Computing, Institute of Electrical and Electronics Engineers

DOI 10.1109/TAFFC.2021.3064601

QMRO

09-03-2021

A Transfer Learning approach to Heatmap Regression for Action Unit intensity estimation
Ntinou IN Sanchez E Bulat A Tzimiropoulos G
IEEE Transactions on Affective Computing

DOI 10.1109/TAFFC.2021.3061605

QMRO

23-02-2021

2020

Unsupervised Learning of Object Landmarks via Self-Training Correspondence
Dimitrios M Enrique S
Advances in Neural Information Processing Systems (NeurIPS) 6 Dec 2020 - 12 Dec 2020.

QMRO

06-12-2020

AnimaWeb: A Large-Scale Hierarchical Dataset of Annotated Animal Faces
Khan MH Khan S Shahabuddin M Khan FS
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). vol. 00, 6937-6946.

DOI 10.1109/cvpr42600.2020.00697

QMRO

13-06-2020

FAN-Face: a Simple Orthogonal Improvement to Deep Face Recognition
Yang J Bulat A
Proceedings of The Aaai Conference on Artificial Intelligence, Association For The Advancement of Artificial Intelligence (Aaai) vol. 34 (07), 12621-12628.

DOI 10.1609/aaai.v34i07.6953

03-04-2020

BATS: Binary ArchitecTure Search
Bulat A Martinez B Tzimiropoulos G
Lecture Notes in Computer Science. vol. 12368, 309-325.

DOI 10.1007/978-3-030-58592-1_19

QMRO

01-01-2020

2019

T-Net: Parametrizing Fully Convolutional Nets with a Single High-Order Tensor
Kossaifi J Bulat A Tzimiropoulos G Pantic M
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). vol. 00, 7814-7823.

DOI 10.1109/cvpr.2019.00801

20-06-2019

2018

Hierarchical Binary CNNs for Landmark Localization with Limited Resources
Bulat A Tzimiropoulos G
IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers (IEEE) vol. 42 (2), 343-356.

DOI 10.1109/tpami.2018.2866051

QMRO

23-08-2018

2017

Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression
Jackson AS Bulat A Tzimiropoulos G
2017 IEEE International Conference on Computer Vision (ICCV)., 1031-1039.

DOI 10.1109/iccv.2017.117

QMRO

01-10-2017

How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks)
Bulat A Tzimiropoulos G
2017 IEEE International Conference on Computer Vision (ICCV)., 1021-1030.

DOI 10.1109/iccv.2017.116

QMRO

01-10-2017

A Functional Regression Approach to Facial Landmark Tracking
Sanchez-Lozano E Tzimiropoulos G Martinez B De la Torre F
IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers (IEEE) vol. 40 (9), 2037-2050.

DOI 10.1109/tpami.2017.2745568

QMRO

29-08-2017

2016

Convolutional aggregation of local evidence for large pose face alignment
Bulat A
Procedings of the British Machine Vision Conference 2016., 86.1-86.12.

DOI 10.5244/c.30.86

01-01-2016

Human Pose Estimation via Convolutional Part Heatmap Regression
Bulat A
Lecture Notes in Computer Science. vol. 9911, 717-732.

DOI 10.1007/978-3-319-46478-7_44

QMRO

01-01-2016

2014

Gauss-Newton Deformable Part Models for Face Alignment in-the-Wild
Tzimiropoulos G Pantic M
2014 IEEE Conference on Computer Vision and Pattern Recognition., 1851-1858.

DOI 10.1109/cvpr.2014.239

QMRO

01-06-2014

Relevant Publication

A Simple Baseline for Knowledge-Based Visual Question Answering
Xenos A Stafylakis T Patras I Tzimiropoulos G
Empirical Methods in Natural Language Processing 6 Dec 2023 - 10 Dec 2023.

QMRO