News

The music of proteins made audible

It's done through a computer program that learns from Chopin
Peng Zhang Yuzong Chen
By Peng Zhang and Yuzong Chen
Nov. 20, 2021

With the right computer program, proteins become pleasant music.

Chopin-445x219.jpg
Training an algorithm to play proteins like Chopin can produce more melodious songs.

There are many surprising analogies between proteins, the basic building blocks of life, and musical notation. These analogies can be used not only to help advance research, but also to make the complexity of proteins accessible to the public.

We’re computational biologists who believe that hearing the sound of life at the molecular level could help inspire people to learn more about biology and the computational sciences. While creating music based on proteins isn’t new, different musical styles and composition algorithms had yet to be explored. So we led a team of high school students and other scholars to figure out how to create classical music from proteins.

The musical analogies of proteins

Proteins are structured like folded chains. These chains are composed of small units of 20 possible amino acids, each labeled by a letter of the alphabet.

Protein-structure-445x776.jpg
Aspects of potein structure can be analogous to musical notation.

A protein chain can be represented as a string of these alphabetic letters, very much like a string of music notes in alphabetical notation.

Protein chains can also fold into wavy and curved patterns with ups, downs, turns and loops. Likewise, music consists of sound waves of higher and lower pitches, with changing tempos and repeating motifs.

Protein-to-music algorithms can thus map the structural and physiochemical features of a string of amino acids onto the musical features of a string of notes.

Enhancing the musicality of protein mapping

Protein-to-music mapping can be fine-tuned by basing it on the features of a specific music style. This enhances musicality, or the melodiousness of the song, when converting amino acid properties, such as sequence patterns and variations, into analogous musical properties, like pitch, note lengths and chords.

For our study, we specifically selected 19th-century Romantic period classical piano music, which includes composers like Chopin and Schubert, as a guide because it typically spans a wide range of notes with more complex features such as chromaticism, like playing both white and black keys on a piano in order of pitch, and chords. Music from this period also tends to have lighter and more graceful and emotive melodies. Songs are usually homophonic, meaning they follow a central melody with accompaniment. These features allowed us to test out a greater range of notes in our protein-to-music mapping algorithm. In this case, we chose to analyze features of Chopin’s “Fantaisie-Impromptu” to guide our development of the program.

To test the algorithm, we applied it to 18 proteins that play a key role in various biological functions. Each amino acid in the protein is mapped to a particular note based on how frequently they appear in the protein, and other aspects of their biochemistry correspond with other aspects of the music. A larger-sized amino acid, for instance, would have a shorter note length, and vice versa.

The resulting music is complex, with notable variations in pitch, loudness and rhythm. Because the algorithm was completely based on the amino acid sequence and no two proteins share the same amino acid sequence, each protein will produce a distinct song. This also means that there are variations in musicality across the different pieces, and interesting patterns can emerge.

For example, music generated from the receptor protein that binds to the hormone and neurotransmitter oxytocin has some recurring motifs due to the repetition of certain small sequences of amino acids.

OXTR protein music. Zhang et al., CC BY-NC-ND3.28 MB (download)

 

OXTR-890x443.jpg
OXTR, or the oxytocin receptor, has repeating sequences of amino acids.

On the other hand, music generated from tumor antigen p53, a protein that prevents cancer formation, is highly chromatic, producing particularly fascinating phrases where the music sounds almost toccata-like, a style that often features fast and virtuoso technique.

TP53 protein music. Zhang et al., CC BY-NC-ND2.12 MB (download)

 

TP53-890x443.jpg
TP53, or tumor protein p53, produces chromatic music.

By guiding analysis of amino acid properties through specific music styles, protein music can sound much more pleasant to the ear. This can be further developed and applied to a wider variety of music styles, including pop and jazz.

Protein music is an example of how combining the biological and computational sciences can produce beautiful works of art. Our hope is that this work will encourage researchers to compose protein music of different styles and inspire the public to learn about the basic building blocks of life.

This study was collaboratively developed with Nicole Tay, Fanxi Liu, Chaoxin Wang and Hui Zhang.

This article is republished from The Conversation under a Creative Commons license. Read the original article.

The Conversation

Enjoy reading ASBMB Today?

Become a member to receive the print edition four times a year and the digital edition monthly.

Learn more
Peng Zhang
Peng Zhang

Peng Zhang is a postdoctoral researcher in computational biology at the Rockefeller University.

Yuzong Chen
Yuzong Chen

Yuzong Chen is a professor of pharmacy at the National University of Singapore.

Get the latest from ASBMB Today

Enter your email address, and we’ll send you a weekly email with recent articles, interviews and more.

Latest in Science

Science highlights or most popular articles

How signals shape DNA via gene regulation
Journal News

How signals shape DNA via gene regulation

Aug. 19, 2025

A new chromatin isolation technique reveals how signaling pathways reshape DNA-bound proteins, offering insight into potential targets for precision therapies. Read more about this recent MCP paper.

A game changer in cancer kinase target profiling
Journal News

A game changer in cancer kinase target profiling

Aug. 19, 2025

A new phosphonate-tagging method improves kinase inhibitor profiling, revealing off-target effects and paving the way for safer, more precise cancer therapies tailored to individual patients. Read more about this recent MCP paper.

How scientists identified a new neuromuscular disease
Feature

How scientists identified a new neuromuscular disease

Aug. 14, 2025

NIH researchers discover Morimoto–Ryu–Malicdan syndrome, after finding shared symptoms and RFC4 gene variants in nine patients, offering hope for faster diagnosis and future treatments.

Unraveling cancer’s spaghetti proteins
Profile

Unraveling cancer’s spaghetti proteins

Aug. 13, 2025

MOSAIC scholar Katie Dunleavy investigates how Aurora kinase A shields oncogene c-MYC from degradation, using cutting-edge techniques to uncover new strategies targeting “undruggable” molecules.

How HCMV hijacks host cells — and beyond
Profile

How HCMV hijacks host cells — and beyond

Aug. 12, 2025

Ileana Cristea, an ASBMB Breakthroughs webinar speaker, presented her research on how viruses reprogram cell structure and metabolism to enhance infection and how these mechanisms might link viral infections to cancer and other diseases.

Understanding the lipid link to gene expression in the nucleus
Profile

Understanding the lipid link to gene expression in the nucleus

Aug. 11, 2025

Ray Blind, an ASBMB Breakthroughs speaker, presented his research on how lipids and sugars in the cell nucleus are involved in signaling and gene expression and how these pathways could be targeted to identify therapeutics for diseases like cancer.