Proteins, one of the smallest building blocks of life on Earth, hold promise for answering some of biology's biggest questions. Consisting of amino acids strung together into peptide chains, these molecules perform much of the work inside living cells. While they execute life's most essential functions with apparent ease, decoding their precise sequence and structure has long been one of biology's hardest challenges.
Now, a team led by bioengineers at Stanford University has developed a novel approach to visualize proteins at unprecedented scale and sensitivity. The work, published March 18 in Nature Biotechnology, describes a chemical process that allows researchers to use existing, fast, and low-cost DNA sequencing platforms to decode proteins.
"In nature, proteins are made from DNA. Over the past two decades, our society has created amazing technologies to sequence a lot of DNA really quickly and inexpensively," said H. Tom Soh, the W. M. Keck Foundation Professor of Electrical Engineering and professor of bioengineering and of radiology in the schools of Engineering and Medicine and senior author of the study. "But unfortunately, we haven't really made similar progress for sequencing proteins. In this work, we created a technology that can convert protein sequences back into DNA sequences. It's kind of like running the natural process – in reverse – so that we can leverage powerful DNA sequencing technology that is already available."
The mysteries of proteins
Proteins contain biological information that DNA cannot fully reveal. DNA acts as an instruction manual, carrying the code needed to build cellular machinery. Proteins are that machinery, folding into intricate shapes that determine how cells grow, communicate, respond to immune threats, and malfunction in disease. Despite their central role, scientists have lacked a protein sequencing tool that can match the speed of modern DNA sequencing.
Sequencing proteins has remained notoriously difficult because they are built from 20 different amino acids – far more than the four bases that make up DNA. Those amino acids are also nearly three times smaller than nucleic acids, making them far harder for technologies to reliably detect and distinguish.
What really is different is how much from the same sample we can see. With mass spectrometry, you're shooting 1 billion to 10 billion protein molecules and see, typically, a million molecules out of it. With our method, you can potentially see 1,000 times that amount."
Liwei Zheng, research engineer at Stanford and first author of the study
Zheng hopes to see more rare types of proteins brought to light using this new sequencing.
Binding (and finding) the solution
The team's new chemistry tackles this problem by tagging individual amino acids within a peptide using DNA barcodes specific to each peptide molecule. Antibodies and cycle barcodes, known as synthetic DNA, then help encode the peptide's identity and mark its position, respectively. Once encoded, routine DNA sequencing can reveal the identity and location of the amino acid within the peptide chain.
"This is the breakthrough," said Soh. "We can sequence individual proteins at a single molecule level, requiring very little sample that can get us to individual cells."
This sensitivity could enable studies of cellular diversity at nanoscopic resolution, shedding light on why seemingly identical cells behave differently in response to diseases like cancer. It also opens up the possibility of studying rare proteins that are difficult to detect in bulk samples, including molecules that may drive disease progression but exist only in tiny quantities.
Soh points to immunotherapy as one example: treatments like CAR-T cell therapy have revolutionized cancer care by modifying patients' own immune cells to recognize and fight back against cancer cells. A big unknown is why it only works for certain types of cancers and certain types of cells.
By isolating immune cells that positively respond to treatments with other cells that don't, using a single-protein level analysis, scientists can more clearly understand why certain treatments work and how they can be enhanced to maximize efficiency across individual patients.
From bench to button
The breakthrough has already sparked commercial interest and is licensed with the intent of turning the laboratory process into a user-friendly instrument. The goal is to eventually enable researchers to "put in a sample, press a button, and have it go," said Soh, much like pressing a button to run a DNA sequencer today.
While the method is still in its early stages and will require optimization before widespread adoption, its ability to potentially scale billions of protein molecules from thousands of cells sets it apart from previous attempts at direct protein sequencing.
"Once you convert everything to DNA, you can think about all the naturally evolved machinery that can manipulate DNA – lengthening DNA, copying DNA – all these things will become possible for processing protein sequences," said Zheng.
If successful, the team's "reverse translation" approach could become a new cornerstone of molecular biology, giving scientists a long-missing tool to read the molecules that do life's work, one cell at a time.
Source:
Journal reference:
Zheng, L., et al. (2026). Single-molecule peptide sequencing through reverse translation of peptides into DNA. Nature Biotechnology. DOI: 10.1038/s41587-026-03061-z. https://www.nature.com/articles/s41587-026-03061-z