Dr Arkaitz Zubiaga

Senior Lecturer
Director of Graduate Studies (Research)

School of Electronic Engineering and Computer Science
Queen Mary University of London

Research

computational social science, natural language processing, social data science, social media mining

Interests

My research lies at the intersection of Computational Social Science and Natural Language Processing (NLP). I'm broadly interested in advancing NLP methods for mining social media and online data, and in furthering our understanding of human behaviour through the study of social media. My recent research focuses on tackling problems on the Web and social media that can harm individuals or society at large, such as hate speech, misinformation, inequality, biases and other forms of online harm.

Publications

2024

Halitaj A and Zubiaga A (2024). Providing Citations to Support Fact-Checking: Contextualizing Detection of Sentences Needing Citation on Small Wikipedias. Natural Language Processing Journal, Elsevier vol. 8, 100093-100093.
01-09-2024
Sánchez-Corcuera R, Zubiaga A and Almeida A (2024). Early Detection and Prevention of Malicious User Behavior on Twitter Using Deep Learning Techniques. IEEE Transactions on Computational Social Systems, Institute of Electrical and Electronics Engineers (IEEE) vol. 11 (5), 6649-6661.
12-07-2024
Zeng X, La Barbera D, Roitero K, Zubiaga A and Mizzaro S (2024). Combining Large Language Models and Crowdsourcing for Hybrid Human-AI Misinformation Detection. Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval
10-07-2024
Panchendrarajan R and Zubiaga A (2024). Claim detection for automated fact-checking: A survey on monolingual, multilingual and cross-lingual research. Natural Language Processing Journal, Elsevier vol. 7
01-06-2024
Zsisku E, Zubiaga A and Dubossarsky H (2024). Hate Speech Detection and Reclaimed Language: Mitigating False Positives and Compounded Discrimination. ACM Web Science Conference
21-05-2024
Panchendrarajan R and Zubiaga A (2024). Synergizing machine learning & symbolic methods: A survey on hybrid approaches to natural language processing. Expert Systems with Applications, Elsevier vol. 251, 124097-124097.
27-04-2024
Gkoumas D, Wang B, Tsakalidis A, Wolters M, Purver M, Zubiaga A and Liakata M (2024). A longitudinal multi-modal dataset for dementia monitoring and diagnosis. Language Resources and Evaluation, Springer Nature, 1-20.
30-03-2024
Zubiaga A and Rosso P (2024). Special issue on analysis and mining of social media data. PeerJ Computer Science, PeerJ vol. 10
29-02-2024
Zubiaga A (2024). Natural language processing in the era of large language models. Frontiers in Artificial Intelligence, Frontiers vol. 6
12-01-2024
Li Y, Panchendrarajan R and Zubiaga A (2024). FactFinders at CheckThat! 2024: Refining Check-worthy Statement Detection with LLMs through Data Pruning.
01-01-2024
Alkhalifa R, Borkakoty H, Deveaud R, El-Ebshihy A, Espinosa-Anke L, Fink T, Galuščáková P, Gonzalez-Saez G, Goeuriot L, Iommi D, Liakata M, Madabushi HT, Medina-Alias P, Mulhem P, Piroi F, Popel M and Zubiaga A (2024). Extended overview of the CLEF 2024 LongEval Lab on Longitudinal Evaluation of Model Performance.
01-01-2024
Alkhalifa R, Borkakoty H, Deveaud R, El-Ebshihy A, Espinosa-Anke L, Fink T, Galuščáková P, Gonzalez-Saez G, Goeuriot L, Iommi D, Liakata M, Madabushi HT, Medina-Alias P, Mulhem P, Piroi F, Popel M and Zubiaga A (2024). Overview of the CLEF 2024 LongEval Lab on Longitudinal Evaluation of Model Performance.
01-01-2024
Alkhalifa R, Borkakoty H, Deveaud R, El-Ebshihy A, Espinosa-Anke L, Fink T, Gonzalez-Saez G, Galuščáková P, Goeuriot L, Iommi D, Liakata M, Madabushi HT, Medina-Alias P, Mulhem P, Piroi F, Popel M, Servan C and Zubiaga A (2024). LongEval: Longitudinal Evaluation of Model Performance at CLEF 2024. Advances in Information Retrieval, 60-66.
01-01-2024
Zeng X and Zubiaga A (2024). MAPLE: Micro Analysis of Pairwise Language Evolution for Few-Shot Claim Verification.
01-01-2024

2023

Kochkina E, Hossain T, Logan RL, Arana-Catania M, Procter R, Zubiaga A, Singh S, He Y and Liakata M (2023). Evaluating the generalisability of neural rumour verification models. Information Processing and Management, Elsevier vol. 60 (1)
26-10-2023
Alharthi R, Shekhar R and Zubiaga A (2023). Target-Oriented Investigation of Online Abusive Attacks: A Dataset and Analysis. IEEE Access, Institute of Electrical and Electronics Engineers
23-06-2023
Yi P and Zubiaga A (2023). Session-based cyberbullying detection in social media: A survey. Online Social Networks and Media, Elsevier vol. 36
17-06-2023
Yin W, Agarwal V, Jiang A, Zubiaga A and Sastry N (2023). AnnoBERT: Effectively Representing Multiple Annotators’ Label Choices to Improve Hate Speech Detection. International AAAI Conference on Web and Social Media
02-06-2023
Jiang A and Zubiaga A (2023). SexWEs: Domain-Aware Word Embeddings via Cross-Lingual Semantic Specialisation for Chinese Sexism Detection in Social Media. Seventeenth International AAAI Conference on Web and Social Media
02-06-2023
Abumansour AS and Zubiaga A (2023). Check-worthy claim detection across topics for automated fact-checking. PeerJ Computer Science, PeerJ vol. 9
15-05-2023
Yi P and Zubiaga A (2023). Learning like human annotators: Cyberbullying detection in lengthy social media sessions. ACM Web Conference 2023
30-04-2023
Khiabani PJ and Zubiaga A (2023). Few-Shot Learning for Cross-Target Stance Detection by Aggregating Multimodal Embeddings. IEEE Transactions on Computational Social Systems, Institute of Electrical and Electronics Engineers
10-04-2023
Godoy D, Tommasel A and Zubiaga A (2023). Special issue on intelligent systems for tackling online harms. Personal and Ubiquitous Computing vol. 27 (1), 1-3.
17-03-2023
Alkhalifa R, Bilal I, Borkakoty H, Camacho-Collados J, Deveaud R, El-Ebshihy A, Espinosa-Anke L, Gonzalez-Saez G, Galuščáková P, Goeuriot L, Kochkina E, Liakata M, Loureiro D, Tayyar Madabushi H, Mulhem P, Piroi F, Popel M, Servan C and Zubiaga A (2023). LongEval: Longitudinal Evaluation of Model Performance at CLEF 2023. Advances in Information Retrieval, 499-505.
01-01-2023
Zhao R, Arana-Catania M, Zhu L, Kochkina E, Gui L, Zubiaga A, Procter R, Liakata M and He Y (2023). PANACEA: An Automated Misinformation Detection System on COVID-19. The 17th Conference of the European Chapter of the Association for Computational
01-01-2023
Zeng X and Zubiaga A (2023). Active PETs: Active Data Annotation Prioritisation for Few-Shot Claim Verification with Pattern Exploiting Training. Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2023
01-01-2023
Alkhalifa R, Bilal I, Borkakoty H, Camacho-Collados J, Deveaud R, El-Ebshihy A, Espinosa-Anke L, Gonzalez-Saez G, Galuščáková P, Goeuriot L, Kochkina E, Liakata M, Loureiro D, Mulhem P, Piroi F, Popel M, Servan C, Tayyar Madabushi H and Zubiaga A (2023). Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance. Experimental IR Meets Multilinguality, Multimodality, and Interaction, 440-458.
01-01-2023
Alkhalifa R, Bilal I, Borkakoty H, Camacho-Collados J, Deveaud R, El-Ebshihy A, Espinosa-Anke L, Gonzalez-Saez G, Galuščáková P, Goeuriot L, Kochkina E, Liakata M, Loureiro D, Mulhem P, Piroi F, Popel M, Servan C, Madabushi HT and Zubiaga A (2023). Extended Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance.
01-01-2023

2022

Guo X, Ma J and Zubiaga A (2022). Cluster-based deep ensemble learning for emotion classification in Internet memes. Journal of Information Science, SAGE Publications
27-12-2022
Alkhalifa R, Kochkina E and Zubiaga A (2022). Building for tomorrow: Assessing the temporal persistence of text classifiers. Information Processing and Management, Elsevier vol. 60 (2)
05-12-2022
Zeng X and Zubiaga A (2022). Aggregating pairwise semantic differences for few-shot claim verification. PeerJ Computer Science, PeerJ vol. 8
25-10-2022
Corcuera RS, Zubiaga A and Almeida A (2022). Achieving Participatory Smart Cities by Making Social Networks Safer. 2022 7th International Conference on Smart and Sustainable Technologies (SpliTech)
08-07-2022
Zia HB, Castro I, Zubiaga A and Tyson G (2022). Improving Zero-Shot Cross-Lingual Hate Speech Detection with Pseudo-Label Fine-Tuning of Transformer Language Models.
31-05-2022
Yi P and Zubiaga A (2022). Cyberbullying Detection across Social Media Platforms via Platform-Aware Adversarial Encoding. Sixteenth International AAAI Conference on Web and Social Media
31-05-2022
Yin W and Zubiaga A (2022). Hidden behind the obvious: Misleading keywords and implicitly abusive language on social media. Online Social Networks and Media, Elsevier vol. 30
23-05-2022
Alkhalifa R and Zubiaga A (2022). Capturing stance dynamics in social media: open challenges and research directions. International Journal of Digital Humanities, Springer Nature vol. 3 (1-3), 115-135.
08-03-2022
Zubiaga A, Vidgen B, Fernandez M and Sastry N (2022). Editorial for Special Issue on Detecting, Understanding and Countering Online Harms. Online Social Networks and Media vol. 27
01-01-2022
Arana-Catania M, Kochkina E, Zubiaga A, Liakata M, Procter R and He Y (2022). Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims. Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
01-01-2022
Zhai W, Feng M, Zubiaga A and Liu B (2022). HIT&QMUL at SemEval-2022 Task 9: Label-Enclosed Generative Question Answering (LEG-QA). Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
01-01-2022
Pisarevskaya D and Zubiaga A (2022). Team dina at SemEval-2022 Task 8: Pre-trained Language Models as Baselines for Semantic Similarity. Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
01-01-2022
Guo X, Ma J and Zubiaga A (2022). NUAA-QMUL-AIIT at Memotion 3: Multi-modal Fusion with Squeeze-and-Excitation for Internet Meme Emotion Analysis.
01-01-2022

2021

Jiang A, Yang X, Liu Y and Zubiaga A (2021). SWSR: A Chinese dataset and lexicon for online sexism detection. Online Social Networks and Media vol. 27
14-12-2021
Alkhalifa R, Kochkina E and Zubiaga A (2021). Opinions are Made to be Changed. Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks
19-10-2021
Ashraf N, Zubiaga A and Gelbukh A (2021). Abusive language detection in youtube comments leveraging replies as conversational context. PeerJ Computer Science, PeerJ vol. 7
08-10-2021
Tommasel A, Godoy D and Zubiaga A (2021). Second Workshop on Online Misinformation- And Harm-Aware Recommender Systems: Preface. Second Workshop on Online Misinformation and Harm-Aware Recommender Systems
02-10-2021
Zeng X, Abumansour AS and Zubiaga A (2021). Automated fact-checking: A survey. Language and Linguistics Compass vol. 15 (10)
01-10-2021
Amjad M, Ashraf N, Zhila A, Sidorov G, Zubiaga A and Gelbukh A (2021). Threatening Language Detection and Target Identification in Urdu Tweets. IEEE Access vol. 9, 128302-128313.
16-09-2021
Tommasel A, Godoy D and Zubiaga A (2021). OHARS: Second Workshop on Online Misinformation- and Harm-Aware Recommender Systems. Fifteenth ACM Conference on Recommender Systems
13-09-2021
Yi P and Zubiaga A (2021). Weakly Supervised Cross-platform Teenager Detection with Adversarial BERT. 32nd ACM Conference on Hypertext and Social Media
30-08-2021
Jiang A and Zubiaga A (2021). Cross-lingual Capsule Network for Hate Speech Detection in Social Media. 32nd ACM Conference on Hypertext and Social Media
30-08-2021
Sanchez-Corcuera R, Zubiaga A and Almeida A (2021). Analysing the Existence of Organisation Specific Languages on Twitter. IEEE Access vol. 9, 1-1.
15-08-2021
Abumansour AS and Zubiaga A (2021). QMUL-SDS at CheckThat! 2021: Enriching pre-trained language models for the estimation of check-worthiness of Arabic tweets. CLEF
02-08-2021
Jiang A and Zubiaga A (2021). QMUL-SDS at EXIST: Leveraging pre-trained semantics and lexical features for multilingual sexism detection in social networks. IberLEF
02-08-2021
Procter R, Arana-Catania M, van Lier F-A, Tkachenko N, He Y, Zubiaga A and Liakata M (2021). Citizen Participation and Machine Learning for a Better Democracy. Digital Government Research and Practice vol. 2 (3), 1-22.
01-07-2021
Yin W and Zubiaga A (2021). Towards generalisable hate speech detection: a review on obstacles and solutions. PeerJ Computer Science, PeerJ vol. 7
17-06-2021
Fröhling L and Zubiaga A (2021). Feature-based detection of automated language models: tackling GPT-2, GPT-3 and Grover. PeerJ Computer Science, PeerJ vol. 7
06-04-2021
Konstantinovskiy L, Price O, Babakar M and Zubiaga A (2021). Toward Automated Factchecking. Digital Threats Research and Practice vol. 2 (2), 1-16.
01-04-2021
Cortiz D and Zubiaga A (2021). Ethical and technical challenges of AI in tackling hate speech. The International Review of Information Ethics vol. 29
30-03-2021
Vashistha N and Zubiaga A (2021). Online multilingual hate speech detection: Experimenting with hindi and english social media. Information (Switzerland) vol. 12 (1), 1-16.
01-01-2021
Zeng X and Zubiaga A (2021). QMUL-SDS at SCIVER: Step-by-Step Binary Classification for Scientific Claim Verification. Proceedings of the Second Workshop on Scholarly Document Processing
01-01-2021
van der Goot R, Ramponi A, Zubiaga A, Plank B, Muller B, Roncal ISV, Ljubešić N, Çetinoğlu Ö, Mahendra R, Çolakoğlu T, Baldwin T, Caselli T and Sidorenko W (2021). MultiLexNorm: A Shared Task on Multilingual Lexical Normalization. Workshop on Noisy User-Generated Text
01-01-2021

2020

Alkhalifa R and Zubiaga A (2020). QMUL-SDS @ SardiStance2020: Leveraging network interactions to boost performance on stance detection using knowledge graphs. SardiStance 2020
01-12-2020
Zubiaga A (2020). Exploiting Class Labels to Boost Performance on Embedding-based Text Classification. 29th ACM International Conference on Information & Knowledge Management
19-10-2020
Tommasel A, Godoy D and Zubiaga A (2020). Workshop on Online Misinformation- and Harm-Aware Recommender Systems. Fourteenth ACM Conference on Recommender Systems
22-09-2020
Lathiya S, Dhobi JS, Zubiaga A, Liakata M and Procter R (2020). Birds of a feather check together: Leveraging homophily for sequential rumour detection. Online Social Networks and Media, Elsevier vol. 19
08-09-2020
Zubiaga A and Jiang A (2020). Early Detection of Social Media Hoaxes at Scale. ACM Transactions on the Web, Association for Computing Machinery (ACM) vol. 14 (4), 1-23.
18-08-2020
Alkhalifa R, Yoong T, Kochkina E, Zubiaga A and Liakata M (2020). QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions.
01-01-2020
Gamallo P, Garcia M, Martín-Rodilla P, Pereira-Fariña M, Real L, Tonelli S, Quaresma P, Vieira R, Dias G, Oostdijk N, Villavicencio A, Vilares J, Ramisch C, Coheur L, Pardo T, Zubiaga A, Alonso MA, Claro D, Ferro MV and Gonzalez-Perez C (2020). Preface.
01-01-2020
Alkhalifa R, Tsakalidis A, Zubiaga A and Liakata M (2020). QMUL-SDS @ DIACR-Ita: Evaluating Unsupervised Diachronic Lexical Semantics Classification in Italian.
01-01-2020
Tommasel A, Godoy D and Zubiaga A (2020). Workshop on online misinformation- And harm-aware recommender systems: Preface.
01-01-2020
Derczynski L and Zubiaga A (2020). Detection and Resolution of Rumors and Misinformation with NLP. Proceedings of the 28th International Conference on Computational Linguistics: Tutorial Abstracts
01-01-2020
Guo X, Ma J and Zubiaga A (2020). NUAA-QMUL at SemEval-2020 Task 8: Utilizing BERT and DenseNet for Internet Meme Emotion Analysis. Proceedings of the Fourteenth Workshop on Semantic Evaluation
01-01-2020

2019

Zubiaga A, Wang B, Liakata M and Procter R (2019). Political Homophily in Independence Movements: Analyzing and Classifying Social Media Users by National Identity. IEEE Intelligent Systems, Institute of Electrical and Electronics Engineers vol. 34 (6), 34-42.
01-11-2019
Zubiaga A (2019). Mining social media for newsgathering: A review. Online Social Networks and Media, Elsevier vol. 13
13-09-2019
Zubiaga A, Heravi B, An J and Kwak H (2019). Social media mining for journalism. Online Information Review vol. 43 (1), 2-6.
11-02-2019
Lukasik M, Bontcheva K, Cohn T, Zubiaga A, Liakata M and Procter R (2019). Gaussian processes for rumour stance classification in social media. ACM Transactions on Information Systems vol. 37 (2)
01-02-2019
Jiang A and Zubiaga A (2019). Leveraging aspect phrase embeddings for cross-domain review rating prediction. PeerJ Computer Science, PeerJ vol. 2019 (10)
01-01-2019
Gorrell G, Kochkina E, Liakata M, Aker A, Zubiaga A, Bontcheva K and Derczynski L (2019). SemEval-2019 Task 7: RumourEval, Determining Rumour Veracity and Support for Rumours. Proceedings of the 13th International Workshop on Semantic Evaluation
01-01-2019
Gorrell G, Kochkina E, Liakata M, Aker A, Zubiaga A, Bontcheva K and Derczynski L (2019). RumourEval 2019: Determining rumour veracity and support for rumours.
01-01-2019

2018

Zubiaga A, Procter R and Maple C (2018). A longitudinal analysis of the public perception of the opportunities and challenges of the Internet of Things. PLoS One vol. 13 (12), e0209472-e0209472.
20-12-2018
Spina D, Zubiaga A, Sheth A and Strohmaier M (2018). Processing social media in real-time. Information Processing and Management vol. 56 (3), 1081-1083.
07-07-2018
Zubiaga A (2018). A longitudinal assessment of the persistence of twitter datasets. Journal of the Association for Information Science and Technology vol. 69 (8), 974-984.
14-05-2018
Zubiaga A, Kochkina E, Liakata M, Procter R, Lukasik M, Bontcheva K, Cohn T and Augenstein I (2018). Discourse-aware rumour stance classification in social media using sequential classifiers. Information Processing and Management vol. 54 (2), 273-290.
01-03-2018
Zubiaga A, Aker A, Bontcheva K, Liakata M and Procter R (2018). Detection and resolution of rumours in social media: A survey. ACM Computing Surveys vol. 51 (2)
01-02-2018
Tolmie P, Procter R, Rouncefield M, Liakata M and Zubiaga A (2018). Microblog Analysis as a Program of Work. ACM Transactions on Social Computing, Association for Computing Machinery (ACM) vol. 1 (1), 1-40.
18-01-2018
Kochkina E, Liakata M and Zubiaga A (2018). All-in-one: Multi-task learning for rumour verification.
01-01-2018
Anke LE, Declerck T, Gromann D, Schockaert S, Christodoulopoulos C, Adrian K, Tuan LA, Ballesteros M, Camacho-Collados J, Cámara EM, Casamayor G, Grachten M, Garcia-Casulla D, Del Río JG, Helcl J, Osenova P, Riedl M, Roller S, Ronzano F, Santus E, et al. (2018). Preface.
01-01-2018

2017

Tkachenko N, Zubiaga A and Procter R (2017). WISC at mediaeval 2017: Multimedia satellite task. MediaEval
13-09-2017
Zubiaga A, Voss A, Procter R, Liakata M, Wang B and Tsakalidis A (2017). Towards Real-Time, Country-Level Location Classification of Worldwide Tweets. IEEE Transactions on Knowledge and Data Engineering vol. 29 (9), 2053-2066.
01-09-2017
Zubiaga A, Liakata M and Procter R (2017). Exploiting context for rumour detection in social media. International Conference on Social Informatics
01-09-2017
Montalvo S, Martínez R, Fresno V, Agustín D, Zubiaga A and Berendsen R (2017). Overview of the M-WePNaD Task: Multilingual web person name disambiguation at IberEval 2017. IberEval
01-09-2017
Wang B, Liakata M, Zubiaga A and Procter R (2017). A hierarchical topic modelling approach for tweet clustering. International Conference on Social Informatics
01-09-2017
Aker A, Zubiaga A, Bontcheva K, Kolliakou A, Procter R and Liakata M (2017). Stance classification in out-of-domain rumours: A case study around mental health disorders. International Conference on Social Informatics
01-09-2017
García-Plaza AP, Fresno V, Unanue RM and Zubiaga A (2017). Using Fuzzy Logic to Leverage HTML Markup for Web Page Representation. IEEE Transactions on Fuzzy Systems vol. 25 (4), 919-933.
01-08-2017
Tolmie P, Procter R, Randall DW, Rouncefield M, Burger C, Hoi GWS, Zubiaga A and Liakata M (2017). Supporting the Use of User Generated Content in Journalistic Practice. Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems
02-05-2017
Wang B, Liakata M, Zubiaga A and Procter R (2017). TDParse: Multi-target-specific sentiment recognition on Twitter. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
01-01-2017
Derczynski L, Bontcheva K, Liakata M, Procter R, Wong Sak Hoi G and Zubiaga A (2017). SemEval-2017 Task 8: RumourEval: Determining rumour veracity and support for rumours. Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)
01-01-2017
Wang B, Liakata M, Tsakalidis A, Kolaitis SG, Papadopoulos S, Apostolidis L, Zubiaga A, Procter R and Kompatsiaris Y (2017). TOTEMSS: Topic-based, Temporal Sentiment Summarisation for Twitter.
01-01-2017

2016

An J, Crandall DJ, Fedorov R, Fiesler C, Giglietto F, Heravi B, Pater J, Pelechrinis K, Quercia D, Weller K and Zubiaga A (2016). Reports of the workshops held at the 2016 International AAAI Conference on Web and Social Media. AI Magazine vol. 37 (4), 89-93.
01-12-2016
Zubiaga A and Mac Namee B (2016). Graphical Perception of Value Distributions: An Evaluation of Non-Expert Viewers' Data Literacy. The Journal of Community Informatics, University of Waterloo vol. 12 (3)
09-08-2016
Zubiaga A (2016). Euskahaldun: euskararen aldeko martxa baten sare sozialetako islaren bilketa eta analisia. EKAIA Euskal Herriko Unibertsitateko Zientzi eta Teknologi Aldizkaria, UPV/EHU Press (29), 139-154.
18-03-2016
Zubiaga A, Liakata M, Procter R, Hoi GWS and Tolmie P (2016). Analysing How People Orient to and Spread Rumours in Social Media by Looking at Conversational Threads. PLOS ONE, Public Library of Science (PLoS) vol. 11 (3)
04-03-2016
Vicente IS, Alegría I, Aranberri N, España-Bonet C, Gamallo P, Oliveira HG, Martínez E, Toral A and Zubiaga A (2016). TweeTMT: A parallel microblog corpus.
01-01-2016
Zubiaga A, Kochkina E, Liakata M, Procter R and Lukasik M (2016). Stance classification in rumours as a sequential task exploiting the tree structure of social media conversations.
01-01-2016
Lukasik M, Srijith PK, Vu D, Bontcheva K, Zubiaga A and Cohn T (2016). Hawkes Processes for Continuous Time Sequence Classification: an Application to Rumour Stance Classification in Twitter. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
01-01-2016
Wang B, Liakata M, Zubiaga A, Procter R and Jensen E (2016). SMILE: Twitter emotion classification using domain adaptation.
01-01-2016

2015

Zubiaga A, Vicente IS, Gamallo P, Pichel JR, Alegria I, Aranberri N, Ezeiza A and Fresno V (2015). TweetLID: a benchmark for tweet language identification. Language Resources and Evaluation, Springer vol. 50 (4), 729-766.
26-09-2015
Alegria I, Aranberri N, Comas PR, Fresno V, Gamallo P, Padró L, San Vicente I, Turmo J and Zubiaga A (2015). TweetNorm: a benchmark for lexical normalization of Spanish tweets. Language Resources and Evaluation, Springer Nature vol. 49 (4), 883-905.
15-08-2015
Zubiaga A, Liakata M, Procter R, Bontcheva K and Tolmie P (2015). Crowdsourcing the Annotation of Rumourous Conversations in Social Media. Proceedings of the 24th International Conference on World Wide Web
18-05-2015
Fresno V, Zubiaga A, Ji H and Martínez R (2015). Exploiting geolocation, user and temporal information for monitoring natural hazards on twitter. Procesamiento del Lenguaje Natural vol. 54, 85-92.
01-03-2015
Zubiaga A, Liakata M, Procter R, Bontcheva K and Tolmie P (2015). Towards detecting rumours in social media.
01-01-2015
Alegria I, Aranberri N, España-Bonet C, Gamallo P, Oliveira HG, Martínez E, San Vicente I, Toral A and Zubiaga A (2015). Overview of TweetMT: A shared task on machine translation of tweets at SEPLN 2015.
01-01-2015
Wang B, Zubiaga A, Liakata M and Procter R (2015). Making the most of tweet-inherent features for social spam detection on twitter.
01-01-2015
Townsend R, Tsakalidis A, Zhou Y, Wang B, Liakata M, Zubiaga A, Cristea A and Procter R (2015). WarwickDCS: From Phrase-Based to Target-Specific Sentiment Recognition. Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015)
01-01-2015

2014

Diakopoulos N and Zubiaga A (2014). Newsworthiness and Network Gatekeeping on Twitter: The Role of Social Deviance.
16-05-2014
Zubiaga A, Spina D, Martínez R and Fresno V (2014). Real-time classification of Twitter trends. Journal of the Association for Information Science and Technology, Wiley vol. 66 (3), 462-473.
09-05-2014
Zubiaga A and Ji H (2014). Tweet, but verify: epistemic study of information verification on Twitter. Social Network Analysis and Mining, Springer Nature vol. 4 (1)
25-03-2014
Alegria I, Aranberri N, Comas PR, Fresno V, Gamallo P, Padró L, Vicente IS, Turmo J and Zubiaga A (2014). TweetNorm es corpus: An annotated corpus for Spanish microtext normalization.
01-01-2014
Alegria I, Cabezon U, Fernandez de Betono U, Labaka G, Mayor A, Sarasola K and Zubiaga A (2014). Wikipedia and Machine Translation: killing two birds with one stone.
01-01-2014
Zubiaga A, Vicente IS, Gamallo P, Pichel JR, Alegria I, Aranberri N, Ezeiza A and Fresno V (2014). Overview of TweetLID: Tweet language identification at SEPLN 2014.
01-01-2014

2013

García-Plaza AP, Zubiaga A, Fresno V and Martínez R (2013). Tag cloud reorganization: Finding groups of related tags on delicious. Social Media Mining and Social Network Analysis: Emerging Research, 140-155.
01-12-2013
Zubiaga A, Fresno V, Martinez R and Garcia-Plaza AP (2013). Harnessing Folksonomies to Produce a Social Classification of Resources. IEEE Transactions on Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers (IEEE) vol. 25 (8), 1801-1813.
27-06-2013
Zubiaga A and Ji H (2013). Harnessing web page directories for large-scale classification of tweets. Proceedings of the 22nd International Conference on World Wide Web
13-05-2013
Zubiaga A (2013). Newspaper editors vs the crowd. Proceedings of the 22nd International Conference on World Wide Web
13-05-2013
Zubiaga A, Spina D, de Rijke M and Strohmaier M (2013). Session details: RAMSS'13 workshop. Proceedings of the 22nd International Conference on World Wide Web
13-05-2013
Zubiaga A, Ji H and Knight K (2013). Curating and contextualizing Twitter stories to assist with social newsgathering. Proceedings of the 2013 international conference on Intelligent user interfaces
19-03-2013
Archambault D, Bouwmeester R, Cabulea C, Daly EM, Di Lorenzo G, de Rijke M, Harrigan M, Kandogan E, Muller M, Naaman M, Quercia D, Spina D, Strohmaier M and Zubiaga A (2013). Reports on the Workshops Held at the Sixth International AAAI Conference on Weblogs and Social Media. AI Magazine, Wiley vol. 34 (1), 101-103.
01-03-2013
Alegria I, Aranberri N, Fresno V, Gamallo P, Padró L, Vicente IS, Turmo J and Zubiaga A (2013). Tweet normalization workshop at SEPLN 2013: An overview.
01-01-2013
Zubiaga A, Spina D, De Rijke M and Strohmaier M (2013). Ramss workshop chairs' welcome.
01-01-2013
Alegria I, Cabezon U, de Betoño UF, Labaka G, Mayor A, Sarasola K and Zubiaga A (2013). Reciprocal Enrichment Between Basque Wikipedia and Machine Translation. The People’s Web Meets NLP, 101-118.
01-01-2013

2012

Cassidy T, Ji H, Ratinov L, Zubiaga A and Huang H (2012). Analysis and enhancement of wikification for microblogs with context expansion.
01-12-2012
Huang H, Zubiaga A, Ji H, Deng H, Wang D, Le H, Abdelzaher T, Han J, Leung A, Hancock J and Voss C (2012). Tweet ranking based on heterogeneous networks.
01-12-2012
Zubiaga A, Spina D, De Rijke M, Strohmaier M and Naaman M (2012). AAAI Workshop - Technical Report: Preface.
01-12-2012
García-Plaza AP, Zubiaga A, Fresno V and Martínez R (2012). Reorganizing clouds: A study on tag clustering and evaluation. Expert Systems with Applications, Elsevier vol. 39 (10), 9483-9493.
01-08-2012
Zubiaga A (2012). Harnessing folksonomies for resource classification by Arkaitz Zubiaga with Danielle H. Lee as coordinator. ACM SIGWEB Newsletter, Association for Computing Machinery (ACM) vol. 2012 (Summer), 1-2.
01-07-2012
Zubiaga A, Spina D, Amigó E and Gonzalo J (2012). Towards real-time summarization of scheduled events from twitter streams. Proceedings of the 23rd ACM conference on Hypertext and social media
25-06-2012

2011

Zubiaga A, Spina D, Fresno V and Martínez R (2011). Classifying trending topics. Proceedings of the 20th ACM international conference on Information and knowledge management
24-10-2011
Zubiaga A, Körner C and Strohmaier M (2011). Tags vs shelves. Proceedings of the 22nd ACM conference on Hypertext and hypermedia
06-06-2011
Zubiaga A, Martínez R and Fresno V (2011). Analyzing Tag Distributions in Folksonomies for Resource Classification.
01-01-2011
Zubiaga A, Martínez R and Fresno V (2011). Augmenting web page classifiers with social annotations. Procesamiento del Lenguaje Natural vol. 47, 189-196.
01-01-2011

2009

Zubiaga A, Martínez R and Fresno V (2009). Getting the most out of social annotations for web page classification. Proceedings of the 9th ACM symposium on Document engineering
16-09-2009
Zubiaga A, García-Plaza AP, Fresno V and Martínez R (2009). Content-based Clustering for Tag Cloud Visualization. 2009 International Conference on Advances in Social Network Analysis and Mining
01-07-2009
Zubiaga A, Fresno V and Martínez R (2009). Is unlabeled data suitable for multiclass SVM-based web page classification? Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing - SemiSupLearn '09
01-01-2009

Grants

CTP: Target/Biomarker selection using systems networks and decision theory (P. C. Siu)
Arkaitz Zubiaga and Emma Tjong
£125,917 BBSRC Biotechnology and Biological Sciences Research Council (01-10-2023 - 30-09-2027)
Jacky (Pui Chung) Siu CTP Studentship - Exscientia
Arkaitz Zubiaga
£8,600 EXSCIENTIA LIMITED (01-10-2023 - 30-09-2027)
HYBRIDS 2 - MSCA DN 2021 - PI: Zubiaga
Arkaitz Zubiaga
£265,251 EPSRC - EU Scheme (01-01-2023 - 31-12-2026)
BBSRC AIDD CTP CASE Award: Yuan Liang
Massimo Poesio and Arkaitz Zubiaga
£8,600 EXSCIENTIA LIMITED (01-12-2022 - 30-11-2026)
Understanding Transitions when Tackling Online Harms
Gareth Tyson, Ignacio De Castro Arribas and Arkaitz Zubiaga
£339,177 EPSRC Engineering and Physical Sciences Research Council (01-04-2022 - 15-09-2025)