In this article, we show how a new mathematical model of the genetic code can be exploited for investigating the almost periodic properties of DNA and mRNA protein-coding sequences. We present the main mathematical features of the model and highlight its connections with both number theory and group theory. The group theoretic framework presents interesting analogies with the theory of crystals. Moreover, we exploit the information provided by dichotomic classes, binary variables naturally derived from the mathematical model, in order to build statistical classifiers for retrieving and predicting the normal reading frame used by the ribosome in protein synthesis. The results show that coding sequences possess a local informational structure that can be related to frame synchronization processes. The information for retrieving the normal reading frame, which implies the existence of short-range correlations and almost periodic structures related to the organization of codons, offers an interesting analogy with the properties of quasi-crystals. From a theoretical point of view, our results might contribute to clarifying the relation between biological information and shape in nucleic acids and proteins. Also, from the point of view of applications, we present new promising tools for designing efficient algorithms for frame synchronization, which plays a crucial role in faithful synthesis of proteins.

DNA, dichotomic classes and frame synchronization: a quasi-crystal framework

GIANNERINI, SIMONE;
2012-01-01

Abstract

In this article, we show how a new mathematical model of the genetic code can be exploited for investigating the almost periodic properties of DNA and mRNA protein-coding sequences. We present the main mathematical features of the model and highlight its connections with both number theory and group theory. The group theoretic framework presents interesting analogies with the theory of crystals. Moreover, we exploit the information provided by dichotomic classes, binary variables naturally derived from the mathematical model, in order to build statistical classifiers for retrieving and predicting the normal reading frame used by the ribosome in protein synthesis. The results show that coding sequences possess a local informational structure that can be related to frame synchronization processes. The information for retrieving the normal reading frame, which implies the existence of short-range correlations and almost periodic structures related to the organization of codons, offers an interesting analogy with the properties of quasi-crystals. From a theoretical point of view, our results might contribute to clarifying the relation between biological information and shape in nucleic acids and proteins. Also, from the point of view of applications, we present new promising tools for designing efficient algorithms for frame synchronization, which plays a crucial role in faithful synthesis of proteins.
File in questo prodotto:
File Dimensione Formato  
Giannerini_Gonzalez_Rosa_PTRSA_2012.pdf

non disponibili

Licenza: Non pubblico
Dimensione 491.12 kB
Formato Adobe PDF
491.12 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11390/1293420
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 16
  • ???jsp.display-item.citation.isi??? 15
social impact