thumb|300px|This example of a naturally occurring pseudoknot is found in the RNA component of human telomerase. Sequence from Chen and Greider (2005). thumb|300px|Three dimensional structure of almost the same pseudoknot from telomerase RNA. (A) sticks (B) backbone. The pdb-file is based on . Colors: A U C G __NOTOC__ A pseudoknot is a nucleic acid secondary structure containing at least two stem-loop structures in which half of one stem is intercalated between the two halves of another stem. The pseudoknot was first recognized in the turnip yellow mosaic virus in 1982. Pseudoknots fold into k
thumb|300px|This example of a naturally occurring pseudoknot is found in the RNA component of human telomerase. Sequence from Chen and Greider (2005). thumb|300px|Three dimensional structure of almost the same pseudoknot from telomerase RNA. (A) sticks (B) backbone. The pdb-file is based on . Colors: A U C G __NOTOC__ A pseudoknot is a nucleic acid secondary structure containing at least two stem-loop structures in which half of one stem is intercalated between the two halves of another stem. The pseudoknot was first recognized in the turnip yellow mosaic virus in 1982. Pseudoknots fold into knot-shaped three-dimensional conformations but are not true topological knots. These structures are categorized as cross (X) topology within the circuit topology framework, which, in contrast to knot theory, is a contact-based approach.
==Prediction and identification== The structural configuration of pseudoknots does not lend itself well to bio-computational detection due to its context-sensitivity or "overlapping" nature. The base pairing in pseudoknots is not well nested; that is, base pairs occur that "overlap" one another in sequence position. This makes the presence of pseudoknots in RNA sequences more difficult to predict by the standard method of dynamic programming, which use a recursive scoring system to identify paired stems and consequently, most cannot detect non-nested base pairs. The newer method of stochastic context-free grammars suffers from the same problem. Thus, popular secondary structure prediction methods like Mfold and Pfold will not predict pseudoknot structures present in a query sequence; they will only identify the more stable of the two pseudoknot stems.
Discovered by embedding cosine similarity (sentence-transformers MiniLM, 384-dim).