Dr. Corin Yeats

Computational Biologist


Works across a wide range of genomics and sequence-based research both theoretical (e.g. protein function evolution) and for applied software development (e.g. antimicrobial resistance prediction). Currently focussed on the application of genomics analysis software in the public health context.

Learn more about my work

After completing a PhD in protein sequence domain analysis as part of the Pfam group at the Sanger Institute (2005), I continued in the structural world in the CATH-Gene3D group at UCL. Here I gained experience in a range of modern computational biology practices: from the large-scale via development of ontologies and web-based tools for linking primary databases through to the detailed analysis of individual proteins.

I then joined David Aanensen at Imperial College as part of the nascent CGPS team - and then onto the BDI at Oxford University - to apply this experience to building genomics analysis tools for public health researchers and decision makers. In particular I feel the rapidly growing volumes of genome-phenotype data for antimicrobial resistance is leading to a transformation in the possibilities for modelling and prediction.


Yeats C, Finn R, Bateman A "The PASTA domain: a beta-lactam-binding domain" (2002) Trends in biochemical sciences 27 (9): 438-440

Bateman A, Lachlan C et al "The Pfam protein families database." (2004) Nucleic acids research 32 (suppl_1):D138-D141

Pain A, Renauld H et al "Genome of the host-cell transforming parasite Theileria annulata compared with T. parva" (2005) Science 309 (5731), 131-133

Bentley S, Maiwald M et al "Sequencing and analysis of the genome of the Whipple's disease bacterium Tropheryma whipplei" (2004) Proceedings of the National Academy of Sciences 101 (39):14240-14245

Hunter S, Apweiler R et al "InterPro: the integrative protein signature database" (2009) Nucleic acids research 37 (suppl_1):D211-D215

Aanensen D, Feil E et al "Whole-genome sequencing for routine pathogen surveillance in public health: a population snapshot of invasive Staphylococcus aureus in Europe" (2016) MBio 7 (3):e00444-16

Gladstone R, Lo S et al "Visualizing variation within Global Pneumococcal Sequence Clusters (GPSCs) and country population snapshots to contextualize pneumococcal isolates" (2020) Microbial Genomics mgen000357

Google Scholar Link: https://scholar.google.com/citations?user=TUMUF9gAAAAJ&hl=en

Our Research Alliances and Funders


Centers for Disease
Control and Prevention


European Centre for Disease Prevention and Control


Food and Agriculture Organisation
of the United Nations


National Institutes
for Health


National Institute for
Health Research


World Health Organisation

Big Data Institute logoOxford University logoWellcome Trust logo
Wellcome Sanger Institute logoWellcome Genome Campus logo

Ⓒ 2019 Centre for Genomic Pathogen Surveillance