Choose your level
This page is written in layers. Pick your preferred depth and reading style. (Nothing is tracked; this only changes what your browser shows.)
Tip: the expandable sections below work even if you ignore the dropdowns.
How a 4-letter nucleotide alphabet becomes a 20–amino-acid chemistry — and why that mapping is the information bottleneck for origins-of-life models.
This page is written in layers. Pick your preferred depth and reading style. (Nothing is tracked; this only changes what your browser shows.)
Tip: the expandable sections below work even if you ignore the dropdowns.
Life stores instructions in nucleic acids (RNA/DNA) using four symbols (A, C, G, U/T). Cells then convert that one-dimensional sequence into three-dimensional proteins using a translation system.
The genetic code is a mapping between codons (triplets of nucleotides) and amino acids. But in real cells, that mapping is not “data” floating in space — it is enforced by machinery.
RNA uses four letters (A, C, G, U). Reading three at a time yields 64 codons. Those codons map to 20 amino acids plus stop signals.
The code is degenerate: different codons can encode the same amino acid. That means two nucleotide strings can differ while producing the same amino acid sequence — or differ slightly and produce a big change, depending on where substitutions land.
A classic example is Leucine, which has six codons: UUA, UUG, CUU, CUC, CUA, CUG.
In cells, the mapping is enforced by a coordinated set of components:
This is why “a code exists” is not enough. A code must be executed and policed fast enough to matter before entropy destroys the system.
Information flow is effectively one-way in biology: nucleic acid → nucleic acid and nucleic acid → protein. But protein → nucleic acid (reverse-engineering exact sequence information from a protein) is not available as a general mechanism.
Translation assumes a stable partition of the sequence into triplets. If the ribosome slips a base (frameshift), all downstream codons change. Hybrid or unstable parsing is typically fatal to reliable synthesis.
This is one reason “partially working” translation often fails: it is not just missing pieces; it is a system that must cohere on a single convention.
At a conceptual level, translation is a rule-based mapping from codons to amino acids plus start/stop control. In cells, the rule is not a table in the abstract — it is embodied in molecules and kinetics.
(If you want, we can include your Java switch example here as an optional “code analogy” box.)
If this page clarified the mapping, the next question is how that mapping is executed by a living system. That’s the focus of Phase 2 (translation).