nitrogen-containing biological compounds that form nucleosides
Base pairing: two base pairs are produced by four nucleotide monomers, nucleobases are in blue. Guanine (G) is paired with cytosine (C) via three hydrogen bonds, in red. Adenine (A) is paired with uracil (U) via two hydrogen bonds, in red.
Nucleotide bases (also nucleobases, nitrogenous bases) are nitrogen-containing biological compounds that form nucleosides, which, in turn, are components of nucleotides, with all of these monomers constituting the basic building blocks of nucleic acids. The ability of nucleobases to form base pairs and to stack one upon another leads directly to long-chain helical structures such as deoxyribonucleic acid (DNA). Five nucleobases—adenine (A), cytosine (C), guanine (G), thymine (T), and uracil (U)—are called primary or canonical. They function as the fundamental units of the genetic code, with the bases A, C, G and T being found in DNA while A, C, G and U are found in RNA. Thymine and uracil are distinguished by merely the presence or absence of a methyl group on the fifth carbon (C5) of these heterocyclic six-membered rings. In addition, some viruses have aminoadenine (Z) instead of adenine. It differs in having an extra amine group, creating a more stable bond to thymine.
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).