News

Machine learning customises CRISPR-Cas PAM recognition

Researchers Profluent Bio have developed Protein2PAM, a deep learning model that predicts and engineers the protospacer-adjacent motif (PAM) specificity of CRISPR-Cas proteins without iterative laboratory screening. Using this approach, they computationally evolved Nme1Cas9 variants with dramatically broadened PAM specificity and up to 50-fold higher cleavage activity than the wild-type enzyme.

By: Gorm Palmgren - Feb. 2, 2026
News

CMN Intelligence - The World’s Most Comprehensive Intelligence Platform for CRISPR-Genomic Medicine and Gene-Editing Clinical Development

Providing market intelligence, data infrastructure, analytics, and reporting services for the global gene-editing sector. Read more...

The team built a training dataset of over 45,000 CRISPR-Cas PAMs by mining 26.2 terabases of microbial genomic data, and then trained transformer-based models to predict PAM specificity directly from Cas protein sequences across the types I, II, and V systems. The models achieved prediction accuracies ranging from 0.868 to 0.955 and successfully identified PAMs for 92% of tested operons – nearly threefold more than existing spacer-based methods. Using in silico mutagenesis, Protein2PAM pinpointed PAM-interacting residues in Cas9 without requiring structural information, with 80% of high-effect mutations occurring at residues that form hydrogen bonds with the PAM DNA.

The researchers then used Markov chain Monte Carlo optimisation to computationally evolve Nme1Cas9 towards specific target PAMs. Experimental validation using high-throughput PAM determination assays confirmed that engineered variants recognised substantially broader PAM repertoires – for example, NmeN4G.1 cleaved 42 different N4G PAMs compared to just seven for wild-type Nme1Cas9, whilst maintaining comparable on-target editing efficiency to SpCas9 in human cells. The machine learning framework enables single-step customisation of PAM specificity without the need for laboratory-derived training data, potentially expanding the targetable sequences for personalised genome-editing applications.

The study was led by Stephen Nayfach and Ali Madani at Profluent Bio in Emeryville, California. It was published in Nature Biotechnology on 2 February 2026.

To get more CRISPR Medicine News delivered to your inbox, sign up to the free weekly CMN Newsletter here.

Tags

HashtagArticleHashtagCMN BriefsHashtagNewsHashtagCancerHashtagCRISPR ScreensHashtagAstraZenecaHashtagRepare Therapeutics

CLINICAL TRIALS
Non-small Cell Lung Cancer, NSCLC, (NCT06097962)
Sponsors:
Base Therapeutics (Shanghai) Co., Ltd.
Indicator
IND Enabling
Phase I
Phase II
Phase III
Gastric Cancer and Colorectal Cancer, CRC, (NCT07166263)
Sponsors:
Base Therapeutics (Shanghai) Co., Ltd.  
Indicator
IND Enabling
Phase I
Phase II
Phase III
Relapsed or Refractory Acute Myeloid Leukemia, AML, (NCT06541444)
Sponsors:
Base Therapeutics (Shanghai) Co., Ltd.
Indicator
IND Enabling
Phase I
Phase II
Phase III
View all clinical trials
close
Search CRISPR Medicine