On 2014 Oct 18, Pavel Baranov commented:
What is the difference between Open Reading Frame (ORF) and Coding Sequence (CDS)?
Thank you for the reply. I think the disagreement lies in our understanding of what Open Reading Frame is.
A simple and effective definition of ORF is a sequence of codons not interrupted with stop codons: nucleotide sequence is open for reading in one of the three (for RNA) or six (dsRNA) frames. ORF is an abstract notion, it can be found in any sequence. Both protein-codng and non-coding sequences have ORFs.
Coding Sequence (CDS) is the part of RNA that encodes protein. CDS often, but not always (exceptions are ribosomal frameshifting, stop codon readthrough, etc.), is located within a single ORF. A single ORF may contain more than one Coding Region if, for example, translation begins at different start codons within the same ORF.
From your reply I presume that you suggest to define ORF as a sequence of codons from start to stop (like CDS). But the problem with this detention is that it is unclear what should we consider as a start of ORF. AUG? But not all AUGs are starts and not all starts are AUGs. Also there are no starts in non-coding RNAs, but there are ORFs in non-coding sequences.
Besides if we use this definition, all eukaryotic mRNAs coding for multiple protein isoforms would need to be described as bi- or even polycistronic.
I hope that this discussion brings some clarity to the terminology used or at least it draws attention to a potential confusion when terms such as ORF, CDS and cistron are not explicitly defined.
This comment, imported by Hypothesis from PubMed Commons, is licensed under CC BY.