Nanobodies® galore in Utrecht

At the end of September, I had the opportunity to present at the 4th Single-Domain Antibody (sdAb/VHH) Conference hosted in the city of Utrecht. The sdAb conference is a biennial event, and was held for the first time in Bonn (2019), then in Brussels (2021) and Paris (2023), before coming to the Netherlands this year.

This was the first time I’d attended a VHH-focused conference, and I was taken aback at just how large the community is; the Jaarbeurs ‘Supernova’ event hall was completely sold out, with over 400 researchers in attendance (pictures below courtesy of the organisers). The buzz reflects the ever-growing interest in sdAbs as tools to discover new fundamental biology, as vectors for diagnosing disease, and as prophylactic or curative therapeutics. Almost every disease indication was represented at the conference, from anticancer and antiviral sdAbs to antivenom sdAbs (both for use in lateral flow tests to diagnose the snake that bit you, and as quick ‘epipen’-like therapeutics accessible even in the most remote parts of the world).

Continue reading →

Using Node-RED as a front-end to your software

Node-RED is an open-source visual programming tool that lets you wire together hardware (such as sensors), APIs (such as REST/POST) and custom functions. However, its custom functions aren’t limited to the JavaScript you write; they can also be containers!

This can provide an intuitive front-end to otherwise difficult software. For example, you’ve written your magnum opus, you’ve even documented it (though no-one will ever read it) and, to ensure maximum compatibility for the widest possible audience, you’ve containerised it. But it’s still a command-line driven application. Using Node-RED, you can make it accessible to an inexperienced audience.

Out of the box, Node-RED is quite pretty: you can string together nodes to perform useful functions. In this case, the flow monitors a log file; if the log doesn’t grow, something has gone wrong, so it emails me to take a look.
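To make the monitoring logic concrete, here is a minimal plain-Python sketch of the check such a flow performs. It is not Node-RED code and not the flow from this post: the function names (`log_has_grown`, `check_log`) and the `alert` callback are hypothetical, and the alert simply collects messages where a real setup would send an email.

```python
import os
import tempfile


def log_has_grown(path, last_size):
    """Return (grew, current_size) for the log file at `path`."""
    current_size = os.path.getsize(path)
    return current_size > last_size, current_size


def check_log(path, last_size, alert):
    """Call `alert` with a message if the log has not grown; return its size."""
    grew, size = log_has_grown(path, last_size)
    if not grew:
        alert(f"{path} has not grown since the last check ({size} bytes)")
    return size


# Demo: the first check sees new content, the second sees a stalled log.
alerts = []  # stand-in for an email node; collects alert messages
with tempfile.TemporaryDirectory() as tmp:
    log_path = os.path.join(tmp, "job.log")
    with open(log_path, "w") as fh:
        fh.write("started\n")
    size = check_log(log_path, 0, alerts.append)     # log grew: no alert
    size = check_log(log_path, size, alerts.append)  # no growth: one alert
print(alerts)
```

In a real deployment the check would run on a timer, which is what an inject node set to repeat does in Node-RED.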

Continue reading →

3 Reasons Why You Should Play Chess Instead of Doing Your Work

Academia rewards deep thinking, the long grind, and productivity (or the illusion of it). Chess also rewards all three, except you don’t have to write any papers or produce anything of value. Here are three reasons why you should play chess instead of doing your work.

Continue reading →

Fragment-to-Lead Successes in 2023

Back in 2021, I highlighted the annual fragment-to-lead (F2L) success stories from 2019 [Blog post] [Paper]. This is one of my favourite annual publications, and I’m delighted to see that it’s still going strong. In this post, I’ll discuss the 2023 edition that was published at the start of 2025 [Paper].

Continue reading →

Are you addicted to dopamine?

Ever since the pandemic my attachment to screens and media has slowly crept up on me, and I suspect that’s the case for many of us. It hit me when I panicked after leaving my flat without headphones, thinking “how could I ever walk around with just my thoughts?” I decided to significantly reduce my technology usage, and I keep getting the sense that I’m experiencing some kind of withdrawal from the constant media and dopamine hits. I was curious about just what’s going on, and how bad it is.

What does dopamine actually do and is “dopamine addiction” scientifically accurate?

Continue reading →

Human Learning in the age of Machine Learning

Oxford University has recently announced that its students will receive free access to a professional-level subscription of ChatGPT Education. This decision is more than just a perk; it’s a signal. One of the world’s leading universities is openly acknowledging that generative AI will be central to the academic experience of its students. But what does this mean for learning? For education? For scholarship itself?

To frame this question, it is worth beginning with a macro view: Mary Meeker’s AI Trends Report (2025) argues that AI is accelerating the transformation of knowledge work, pushing tasks once reserved for experts into more automated or semi-automated regimes. In her framing, AI is less a standalone innovation than a “meta-technology” that amplifies other domains.

Continue reading →

Getting In the Flow – How to Flow (Match)

Introduction

In the world of computational structural biology you might have heard of diffusion models as the current big thing in generative modelling. Diffusion models are great, primarily because they look cool when you visualise the denoising process that generates a protein structure (check out the RFdiffusion Colab notebook), but also because they are state of the art at generating diverse and designable protein backbone structures.

Diffusion models originally emerged from computer vision, and a lot of work has since been built up around their application to macromolecules. Especially exciting is their harmonious union with geometric deep learning in the case of SE(3) equivariance (see FrameDiff). I don’t know about you, but I get particularly excited about geometric deep learning, mostly because it involves objectively dope words like “manifold” and “Riemannian” or, better yet, “Riemannian manifolds”. Woah! (See Bronstein’s Geometric Deep Learning for more fun vocabulary to add to your vernacular, like “geodesic”.)

But we’re getting sidetracked. Diffusion models are to score-based generative models what squares are to rectangles: every diffusion model is a score-based model, with the added clause that diffusion refers explicitly to a time-dependent score function, typically learned via a denoising process. Check out Jakub Tomczak’s blog for more on diffusion and score-based generative models. Flow matching, although technically different from score-based generative models, also learns a transformation between a simple prior (typically Gaussian) and the data, but it is generally faster and is not constrained to discrete time steps (or even to Gaussian priors). So the big question is, how does one flow match?
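As a taste of the answer, here is a minimal 1D sketch of one common formulation, conditional flow matching with straight-line paths. This is an assumption on my part rather than necessarily the recipe the full post follows, and every name in it (`interpolate`, `target_velocity`, `flow_matching_loss`, `euler_sample`, `oracle`) is hypothetical. A velocity model is regressed onto the straight-line velocity between a prior sample and a data sample, and new samples are drawn by integrating that velocity field from t = 0 to t = 1.

```python
import random


def interpolate(x0, x1, t):
    """Point at time t on the straight line from prior sample x0 to data x1."""
    return (1.0 - t) * x0 + t * x1


def target_velocity(x0, x1):
    """Velocity of the straight-line path; it is constant in t."""
    return x1 - x0


def flow_matching_loss(velocity_model, data, n_samples=1000, rng=None):
    """Monte Carlo estimate of the squared-error regression loss."""
    rng = rng or random.Random(0)
    total = 0.0
    for _ in range(n_samples):
        x0 = rng.gauss(0.0, 1.0)   # sample from the Gaussian prior
        x1 = rng.choice(data)      # sample from the data
        t = rng.random()           # continuous time in [0, 1)
        x_t = interpolate(x0, x1, t)
        err = velocity_model(x_t, t) - target_velocity(x0, x1)
        total += err * err
    return total / n_samples


def euler_sample(velocity_model, x0, n_steps=100):
    """Integrate dx/dt = v(x, t) from t = 0 to t = 1 with Euler steps."""
    x = x0
    for i in range(n_steps):
        t = i / n_steps
        x += velocity_model(x, t) / n_steps
    return x


# Toy check: if the data is a single point, the ideal velocity field is known
# in closed form, so no training is needed to see the pieces fit together.
TARGET = 3.0


def oracle(x, t):
    return (TARGET - x) / (1.0 - t)


loss = flow_matching_loss(oracle, [TARGET])
sample = euler_sample(oracle, x0=-1.2)
print(loss, sample)
```

In practice the oracle is replaced by a neural network trained to minimise this loss, and the number of integration steps is chosen freely at sampling time.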

Continue reading →

Is attention all you need for protein folding?

Researchers from Apple have released SimpleFold, a protein structure prediction model which uses exclusively standard Transformer layers. The results seem to show that SimpleFold is a little less accurate than methods such as AlphaFold2, but much faster and easier to integrate into standard LLM-like workflows. SimpleFold also shows very good scaling performance, in line with other Transformer models like ESM2. So what is powering this seemingly simple development?

Continue reading →

Extracting 3D Pharmacophore Points with RDKit

Pharmacophores are simplified representations of the key interactions ligands make with proteins, such as hydrogen bonds, charge interactions, and aromatic contacts. Think of them as the essential “bumps and grooves” on a key that allow it to fit its lock (the protein). These maps can be derived from ligands or protein–ligand complexes and are powerful tools for virtual screening and generative models. Here, we’ll see how to extract 3D pharmacophore points from a ligand using RDKit.
(Code adapted from Dr. Ruben Sanchez.)

Why pharmacophore “points”?

RDKit represents each pharmacophore feature (donor, acceptor, aromatic, etc.) as a point in 3D space, located at the feature center. These points capture the essential interaction motifs of a ligand without requiring the full atomic detail.
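To illustrate what “located at the feature center” means, here is a small pure-Python sketch that does not call RDKit. It assumes the center is the unweighted mean of the positions of the atoms making up a feature; RDKit’s own feature definitions may weight atoms differently. The coordinates are hypothetical (an idealised benzene ring plus a single oxygen), and `feature_point` is an illustrative helper, not an RDKit function.

```python
import math


def feature_point(atom_coords):
    """Unweighted centroid of the atoms that make up one feature."""
    n = len(atom_coords)
    return tuple(sum(coord[i] for coord in atom_coords) / n for i in range(3))


# Hypothetical coordinates: six ring carbons on a circle of radius 1.39 Å,
# plus one oxygen acting as a single-atom donor feature.
ring = [
    (1.39 * math.cos(k * math.pi / 3), 1.39 * math.sin(k * math.pi / 3), 0.0)
    for k in range(6)
]
hydroxyl_oxygen = [(2.75, 0.0, 0.0)]

points = {
    "Aromatic": feature_point(ring),          # lands at the ring centre
    "Donor": feature_point(hydroxyl_oxygen),  # single atom: its own position
}
print(points)
```

Six ring atoms collapse to a single aromatic point, which is how a pharmacophore keeps the interaction motif while discarding atomic detail.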

Continue reading →

Oxford Protein Informatics Group

or "OPIG" to friends
