G+C-rich tract in 5' end of human introns

J Mol Biol. 1992 Sep 5;227(1):108-13. doi: 10.1016/0022-2836(92)90685-d.

Abstract

Analysis of an artificial neural network trained to classify DNA as coding or non-coding revealed compositional differences between sequence parts translated into protein and those that were not. The 5' end of human introns was found to have a base composition that was non-random to an extent matching the non-randomness in the 3' end that contains the polypyrimidine tract. The prevailing nucleotides in the initial 50 nucleotides of human introns are guanine and cytosine, the trinucleotide GGG was found to occur almost four times as frequently as it would in sequences with a uniform distribution of the nucleotides. The initial part of terminal exons and their associated terminal introns were shown to have a very special base composition deviating strongly from the normal picture in other exons and introns.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Composition
  • Base Sequence
  • Genes*
  • Humans
  • Introns*
  • Molecular Sequence Data
  • Neural Networks, Computer