Computational Antibody Papers

Title / Key points

2025-02-17
YAbS: The Antibody Society's antibody therapeutics database
- databases
- New (old :) ) therapeutic antibody database, several times larger than what is available from other sources.
- Includes over 2,900 investigational antibody candidates and more than 450 approved or late-stage molecules.
- It tracks molecular format, target antigen, development status, clinical history, and company data, along with antibody isotype, conjugation status, and mechanism of action.
- Analysis highlights a rise in bispecifics, ADCs, and immunoconjugates, with most clinical-stage antibodies targeting cancer and originating from China or the U.S.
- The data are collected from public sources beyond INN lists, including company websites, press releases, clinical trial registries, regulatory agencies, and literature reports.
2025-02-17
IgGM: A Generative Model for Functional Antibody and Nanobody Design
- protein design
- generative methods
- Novel method to design antibodies de novo.
- Architecturally, it is a mix of language models, diffusion and structure prediction methods.
- Training follows a diffusion-style noising scheme: first the structure is perturbed and the model learns to recover it, then the same is done for sequences.
- After these two steps the model is distilled into a consistency model. This results in a model that can get the final coordinates/sequence in a single step rather than iterative denoising.
- Method achieves comparable accuracy to many methods out there, such as DiffAb, dyMEAN and others.
- On docking, the best performance is on the order of 4 Å iRMSD when using an AlphaFold3 antibody model, so some challenges still remain.
- No wetlab validation.
2025-02-03
Assessment and incorporation of in vitro correlates to pharmacokinetic outcomes in antibody developability workflows
- developability
- Nice developability dataset with associated computational modeling.
- A total of 334 antibodies were initially characterized, with a subset of 43 antibodies selected for in vivo pharmacokinetic (PK) assessment. These data points included high-throughput developability assays and various physicochemical measurements.
- A multivariate regression model, using Partial Least Squares (PLS) regression, was developed. This model combined multiple in vitro measures (nonspecific interactions, self-association, and FcRn binding) to predict in vivo clearance, significantly improving PK correlation over individual assays.
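
The idea of combining several in vitro readouts into one latent score can be sketched with a one-component PLS1 fit. This is a minimal illustration, not the authors' model: the function names, the three-column assay layout and every number below are made up, and the features are assumed to already be on comparable scales.

```python
from math import sqrt


def mean(values):
    return sum(values) / len(values)


def fit_pls1_one_component(X, y):
    """Fit a single-latent-variable PLS1 model (X: list of rows, y: list of targets)."""
    n_features = len(X[0])
    x_means = [mean([row[j] for row in X]) for j in range(n_features)]
    y_mean = mean(y)
    # Center features and target.
    Xc = [[row[j] - x_means[j] for j in range(n_features)] for row in X]
    yc = [v - y_mean for v in y]

    # Weight vector: direction in assay space with maximal covariance with y.
    w = [sum(Xc[i][j] * yc[i] for i in range(len(X))) for j in range(n_features)]
    norm = sqrt(sum(v * v for v in w))
    w = [v / norm for v in w]

    # Scores: projection of each antibody onto the latent component.
    t = [sum(row[j] * w[j] for j in range(n_features)) for row in Xc]
    # Ordinary least-squares regression of the centered target on the scores.
    slope = sum(ti * yi for ti, yi in zip(t, yc)) / sum(ti * ti for ti in t)
    return {"x_means": x_means, "y_mean": y_mean, "weights": w, "slope": slope}


def predict(model, row):
    score = sum(
        (row[j] - model["x_means"][j]) * model["weights"][j] for j in range(len(row))
    )
    return model["y_mean"] + model["slope"] * score


# Synthetic toy data (NOT from the paper). Hypothetical columns:
# nonspecific-interaction score, self-association score, FcRn-binding score.
X = [
    [0.1, 0.2, 0.3],
    [0.4, 0.5, 0.2],
    [0.8, 0.7, 0.9],
    [0.3, 0.1, 0.4],
    [0.9, 0.8, 0.6],
]
y = [3.0, 4.5, 9.0, 3.5, 8.5]  # made-up clearance values

model = fit_pls1_one_component(X, y)
print(model["weights"], predict(model, [0.5, 0.5, 0.5]))
```

With more components, or with real assay data, a library implementation and proper cross-validation would be the sensible choice; the sketch only shows how individual assays are weighted into one predictor.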
2025-02-03
Benchmarking Inverse Folding Models for Antibody CDR Sequence Design
- generative methods
- protein design
- nanobodies
- Benchmarking of structure-conditioned sequence design (inverse folding) methods.
- ESM-IF, LM-Design, ProteinMPNN and AntiFold were benchmarked.
- On sequence recovery, AntiFold beats others on antibodies, but LM-Design is better when VHHs are considered.
- AntiFold makes minimal use of the antigen information.
- ESM-IF and ProteinMPNN have some weak correlation with affinity data.
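
Sequence recovery, the headline metric above, is the fraction of designed positions that match the native residue. A minimal sketch follows; the function name and the CDR-like example strings are made up for illustration, and the benchmark's exact protocol (which positions are counted, how samples are aggregated) is not reproduced here.

```python
def sequence_recovery(native, designed):
    """Fraction of positions where the designed residue equals the native one."""
    if len(native) != len(designed):
        raise ValueError("native and designed sequences must have equal length")
    if not native:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(native, designed))
    return matches / len(native)


# Hypothetical CDR-like strings, not taken from the paper.
native_cdr = "ARDYYGSSYFDY"
designed_cdr = "ARDYWGSSYFDV"
print(sequence_recovery(native_cdr, designed_cdr))
```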
2025-02-03
De novo design of epitope-specific antibodies against soluble and multipass membrane proteins with high specificity, developability, and function
- binding prediction
- generative methods
- Novel method to design antibodies in silico with experimental validation.
- The actual computational method is not disclosed.
- The computational method takes the target sequence/structure and constraints on where the antibody should bind. The antibody structure and sequence are then produced.
- Method can generate binders with nanomolar affinity.
- The main interesting take-away is test-time compute. By feeding the answers of the model back to itself, it produces better binders and does not compromise on diversity of the designs.
2025-02-03
Clinical antibody ADA
- developability
- clinical trials
- Authors study 171 Roche clinical studies representing 28 drugs for their ADA incidence.
- Authors demonstrate that ADA is highly context-specific with non-trivial inter-drug variation and factors such as disease or mode of action impacting the incidence.
- They train a random forest model on T-cell epitope predictions alone and an extended model that adds non-epitope features. The extended model performs better than the one that is solely sequence-based.
2025-01-21
Antibody affinity engineering using antibody repertoire data and machine learning
- binding prediction
- ngs
- Novel experimental/computational workflow that demonstrates how little data might be needed to develop antibody affinity predictors.
- Mice were immunized with hen egg white lysozyme and, via a computational procedure of clustering with known binders, 35 antibodies were selected and characterized together with their affinities.
- These 35 antibodies were used to train the methods: Gaussian Process (GP) models with Matern and RBF kernels, Kernel Ridge Regression (KRR), Random Forest (RF) and Linear Regression (used as a baseline).
- Seed sequences were point- or double-mutated and their affinity predicted using the GP model (which performed best). Eight mutants predicted to span the whole range of affinities were selected for experimental testing, and they showed very good agreement with the predictions.
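
The mutate-then-select step can be sketched as follows. This is an illustration under assumptions, not the authors' code: the function names are hypothetical, all 20 amino acids are allowed at every position, and the scores fed to the selection step would in practice come from the trained affinity predictor rather than from the dummy values used in the usage lines.

```python
from itertools import combinations, product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def point_mutants(seed):
    """Yield every sequence that differs from the seed at exactly one position."""
    for pos, wild_type in enumerate(seed):
        for aa in AMINO_ACIDS:
            if aa != wild_type:
                yield seed[:pos] + aa + seed[pos + 1:]


def double_mutants(seed):
    """Yield every sequence that differs from the seed at exactly two positions."""
    for i, j in combinations(range(len(seed)), 2):
        for a, b in product(AMINO_ACIDS, repeat=2):
            if a != seed[i] and b != seed[j]:
                yield seed[:i] + a + seed[i + 1:j] + b + seed[j + 1:]


def select_spanning(scored, k):
    """Pick k (sequence, score) pairs evenly spaced across the ranked scores."""
    ranked = sorted(scored, key=lambda item: item[1])
    if k >= len(ranked):
        return ranked
    if k == 1:
        return [ranked[0]]
    indices = [round(i * (len(ranked) - 1) / (k - 1)) for i in range(k)]
    return [ranked[i] for i in indices]


seed = "ARDY"  # made-up seed fragment
singles = list(point_mutants(seed))
doubles = list(double_mutants(seed))

# Dummy scores standing in for predicted affinities (placeholder only).
dummy_scored = [(seq, float(rank)) for rank, seq in enumerate(singles)]
picked = select_spanning(dummy_scored, 8)
print(len(singles), len(doubles), [seq for seq, _ in picked])
```

Picking evenly spaced ranks is one simple way to cover the whole predicted range; the paper may have used a different selection rule.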
2025-01-21
PROPERMAB: an integrative framework for in silico prediction of antibody developability using machine learning
- developability
- Computational framework to calculate descriptors correlating with certain developability features for early antibody screening.
- The framework calculates a number of sequence and structural descriptors.
- The correlations were demonstrated to bring value on HIC and viscosity datasets.
- Exact calculation of the descriptors takes time, so the authors showed that it is possible to train an ML model to predict the descriptors directly from sequence.
2025-01-21
Identifying biophysical assays and in silico properties that enrich for slow clearance in clinical-stage therapeutic antibodies
- clinical trials
- databases
- Computational analysis of PK (clearance) of biologics based on a dataset collated for this publication.
- Authors collated a set of 64 therapeutic antibodies and their clearances.
- They defined fast clearance as more than 5.4 mL/day/kg. 48 antibodies fell below this threshold and 16 above.
- They tested whether any single computationally calculated property (e.g. isoelectric point etc.) determines fast vs slow clearance.
- No single computational property was a good discriminator.
- They constructed a random forest model and showed that the polyspecificity reagent (PSR) score, which is an in vitro property, and the isoelectric point, which can be calculated computationally, are the strongest discriminators according to the model.
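
A quick way to check whether a single property separates fast from slow clearers is to label antibodies with the 5.4 mL/day/kg cutoff and compute a rank-based ROC AUC for that property. The sketch below is an assumption about how such a check could look, not the authors' analysis; the function names and all values are made up.

```python
FAST_CLEARANCE_THRESHOLD = 5.4  # mL/day/kg, cutoff quoted in the summary above


def label_fast(clearances, threshold=FAST_CLEARANCE_THRESHOLD):
    """True for antibodies whose clearance is strictly above the threshold."""
    return [c > threshold for c in clearances]


def roc_auc(scores, labels):
    """Probability that a fast clearer scores higher than a slow one (ties count half)."""
    fast = [s for s, is_fast in zip(scores, labels) if is_fast]
    slow = [s for s, is_fast in zip(scores, labels) if not is_fast]
    if not fast or not slow:
        raise ValueError("need at least one fast and one slow antibody")
    wins = sum((f > s) + 0.5 * (f == s) for f in fast for s in slow)
    return wins / (len(fast) * len(slow))


# Synthetic toy values, NOT data from the paper.
clearances = [2.1, 3.0, 4.8, 5.4, 6.2, 9.5]
toy_pi = [7.9, 8.1, 8.8, 8.3, 8.6, 9.1]  # hypothetical isoelectric points

labels = label_fast(clearances)
auc = roc_auc(toy_pi, labels)
print(labels, auc)
```

An AUC near 0.5 would mean the property carries little signal on its own, which is the situation the summary describes for individual computed properties.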
2025-01-21
Structure-based charge calculations for predicting isoelectric point, viscosity, clearance, and profiling antibody therapeutics
- developability
- Authors revisit computational calculations from sequence and structure to filter out clinical stage therapeutics as an alternative/refinement to the popular TAP metrics.
- Authors explain how the FvCSP charge asymmetry calculated in TAP might not be the ideal formulation.
- They introduce FV_CHML, which, as opposed to FvCSP, is a difference between the net charges.
- Of the several computational metrics employed they show that the FV_CHML metric captures most of the clinical stage therapeutics.
- They analyse the effect of the isotype, demonstrating that for accurate pI calculations the constant region should be modeled, and not only the Fv.
- They propose four descriptors that appear to show a good degree of separation of natural vs clinical antibodies and some correlation with the experimental values: (1) Patch_cdr_hyd: hydrophobicity of CDRs, not the same as in TAP; (2) ens_charge_Fv: in lieu of PPC and PNC from TAP; (3) Cdr_len: separates repertoire from clinical antibodies; (4) Fv_chml: in lieu of FvCSP from TAP.
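
To make the difference between the two charge-asymmetry formulations concrete, here is a deliberately crude sequence-only sketch. Several things are assumptions: the paper computes charges from structure, whereas this counts K/R as +1 and D/E as -1 and ignores histidine, termini and pKa shifts; FV_CHML is taken as heavy-chain minus light-chain net charge; and FvCSP is taken as the product of the two net charges, which is how we understand the TAP definition. The sequences are made-up fragments.

```python
POSITIVE = set("KR")
NEGATIVE = set("DE")


def net_charge(sequence):
    """Crude integer net charge: +1 per K/R, -1 per D/E (simplifying assumption)."""
    return sum(aa in POSITIVE for aa in sequence) - sum(aa in NEGATIVE for aa in sequence)


def fv_chml(vh, vl):
    """Charge difference: heavy-chain net charge minus light-chain net charge (assumed sign)."""
    return net_charge(vh) - net_charge(vl)


def fv_csp(vh, vl):
    """Charge symmetry as a product of the two net charges (assumed TAP-style definition)."""
    return net_charge(vh) * net_charge(vl)


# Hypothetical variable-domain fragments, not real antibodies.
toy_vh = "EVQLKRSGK"
toy_vl = "DIQMTQSPED"

print(net_charge(toy_vh), net_charge(toy_vl))
print(fv_chml(toy_vh, toy_vl), fv_csp(toy_vh, toy_vl))
```

One visible consequence of the product form is that it collapses to zero whenever either chain is neutral, regardless of how charged the other chain is, while the difference still reports that imbalance.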