Transcription of DNA to RNA to protein
Overview of gene structure and expression
Protein coding genes
Protein coding genes have a coding region flanked by untranslated regions and may be split into exons and introns.
The boundaries of a protein-encoding gene are defined as the points at which transcription begins and ends. The core of the gene is the coding region, which contains the nucleotide sequence that is eventually translated into the sequence of amino acids in the protein.
The coding region begins with the initiation codon, which is normally ATG. It ends with one of three termination codons: TAA, TAG or TGA. On either side of the coding region are DNA sequences that are transcribed but are not translated. These untranslated regions or non-coding regions often contain regulatory elements that control protein synthesis.
Sourced from the Wellcome Trust
See Also: Gene Analysis Project |