Kozak序列

科扎克共有序列, 或Kozak 序列 ，是在真核生物的mRNA中共有的(gcc)gccRccAUGG序列。它在翻譯過程的啟動中扮演了重要角色。^[1]，得名自揭示了其重要性的瑪麗蓮·科扎克。

序列標識為(gcc)gccRccAUGG, 源自科扎克對大範圍的樣本（共約699種）^[2]的歸納和分析，其中：

小寫字母表示此位置處最常見的鹼基（仍然可以變化）;
大寫字母表示高度保守的鹼基，例如「AUGG」序列是不變的或很少，甚至根本沒有變化，唯一的例外是IUPAC的模糊碼^[3]。「R」這表明嘌呤（A或G）總是在這個位置上（科扎克聲稱A更常見）
括號內的序列((gcc))重要性不明。

科扎克的研究局限於脊椎動物的一部分（例如人類，牛，貓，狗，雞，豚鼠，倉鼠，小鼠，豬，兔，羊，非洲爪蟾）。

簡介

這個序列在mRNA分子上被核糖體識別為翻譯起始位點，即蛋白質是由此開始被該mRNA分子編碼的。核糖體需要此序列，或一個可能的可變形式（見下文）來啟動翻譯。Kozak序列不應與核糖體結合位點（RBS）相混淆，後者一般為信使RNA的5'端帽或內部核糖體進入位點（IRES）。

在體內(In vivo)，不同的mRNA上這段區域往往不完全匹配，由一個給定的mRNA合成蛋白質的量取決於Kozak序列的強度。^[4] 這個序列中的一些核苷酸比其它的更重要：AUG是最重要的，因為它是實際的起始密碼子，在對應蛋白質的N-末端編碼一個甲硫氨酸。（很少地，CUG被用作起始密碼子，編碼亮氨酸，而不是典型的蛋氨酸作為起始。）「AUG」的A被定為1號位的話，對於一個「強」的Kozak序列，核苷酸在相對於1號核苷酸的4號位（即G所在的位置）和-3號位（即R的位點，沒有號碼0的位置）必須匹配一般的Kozak序列。「適中」強度的Kozak序列只有上述兩個中的一個匹配，而「弱」的兩個都不匹配。在-1和-2的cc並不那麼保守，但有助於提升整體強度。^[5]有證據表明，在-6位的G在翻譯起始中也是重要。 ^[1]

體內有這些類型的Kozak序列的例子，他們很可能是演變而來的基因調控的又一機制。Lmx1b （頁面存檔備份，存於互聯網檔案館）是「弱」Kozak序列基因的一個例子。^[6]從這樣的位點起始的翻譯，mRNA序列需要有額外的特性，以便使核糖體識別起始密碼子。

變異

研究顯示β-珠蛋白基因（β+45;人類）在-6位G—>C的變異表現打亂血液和生物合成功能的表型。這是首個被發現的人類Kozak序列變異。它在一個意大利東南部的地中海貧血家庭中被發現。^[1]

共有序列的可變形式

(gcc)gccRccAUGG
       AGNNAUGN
        ANNAUGG
        ACCAUGG
     GACACCAUGG

不同真核生物中的類Kozak序列
生物類群	門	共有序列
脊椎動物		`gccRccATGG`^[2]
果蠅 (Drosophila spp.)	節肢動物門	`cAAaATG`^[7]
釀酒酵母 (Saccharomyces cerevisiae)	子囊菌門	`aAaAaAATGTCt`^[8]
粘菌 (Dictyostelium discoideum)	變形蟲門	`aaaAAAATGRna`^[9]
纖毛蟲	纖毛蟲門	`nTaAAAATGRct`^[9]
瘧原蟲 (Plasmodium spp.)	頂復門	`taaAAAATGAan`^[9]
弓形蟲 (Toxoplasma gondii)	頂復門	`gncAaaATGg`^[10]
Trypanosomatidae	眼蟲門	`nnnAnnATGnC`^[9]
陸生植物		`AACAATGGC`^[11]

參見

夏因-達爾加諾序列, 原核生物的核糖體結合位點。

參考

^ ^1.0 ^1.1 ^1.2 De Angioletti M, Lacerra G, Sabato V, Carestia C. Beta+45 G --> C: a novel silent beta-thalassaemia mutation, the first in the Kozak sequence. Br J Haematol. 2004, 124 (2): 224–31. PMID 14687034. doi:10.1046/j.1365-2141.2003.04754.x.
^ ^2.0 ^2.1 Kozak M. An analysis of 5'-noncoding sequences from 699 vertebrate messenger RNAs. Nucleic Acids Res. October 1987, 15 (20): 8125–8148 [2014-02-05]. PMC 306349  . PMID 3313277. doi:10.1093/nar/15.20.8125. （原始內容存檔於2019-09-15）.
^ Nomenclature for Incompletely Specified Bases in Nucleic Acid Sequences （頁面存檔備份，存於互聯網檔案館）, NC-IUB, 1984.
^ Kozak M. Point mutations close to the AUG initiator codon affect the efficiency of translation of rat preproinsulin in vivo. Nature. 1984, 308 (5956): 241–246 [2014-02-05]. PMID 6700727. doi:10.1038/308241a0. （原始內容存檔於2017-05-06）.
^ Kozak M. Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes. Cell. 1986, 44 (2): 283–92 [2014-02-05]. PMID 3943125. doi:10.1016/0092-8674(86)90762-2. （原始內容存檔於2017-05-06）.
^ Dunston JA, Hamlington JD, Zaveri J; et al. The human LMX1B gene: transcription unit, promoter, and pathogenic mutations. Genomics. September 2004, 84 (3): 565–76. PMID 15498463. doi:10.1016/j.ygeno.2004.06.002.
^ Cavener DR. Comparison of the consensus sequence flanking translational start sites in Drosophila and vertebrates. Nucleic Acids Res. February 1987, 15 (4): 1353–61 [2014-02-05]. PMC 340553  . PMID 3822832. doi:10.1093/nar/15.4.1353. （原始內容存檔於2019-09-15）.
^ Hamilton R, Watanabe CK, de Boer HA. Compilation and comparison of the sequence context around the AUG startcodons in Saccharomyces cerevisiae mRNAs. Nucleic Acids Res. April 1987, 15 (8): 3581–93 [2014-02-05]. PMC 340751  . PMID 3554144. doi:10.1093/nar/15.8.3581. （原始內容存檔於2019-09-15）.
^ ^9.0 ^9.1 ^9.2 ^9.3 Yamauchi K. The sequence flanking translational initiation site in protozoa. Nucleic Acids Res. May 1991, 19 (10): 2715–20 [2014-02-05]. PMC 328191  . PMID 2041747. doi:10.1093/nar/19.10.2715. （原始內容存檔於2019-09-15）.
^ Seeber, F. Consensus sequence of translational initiation sites from Toxoplasma gondii genes. Parasitology Research. 1997, 83 (3): 309–311. PMID 9089733. doi:10.1007/s004360050254.
^ Lütcke HA, Chow KC, Mickel FS, Moss KA, Kern HF, Scheele GA. Selection of AUG initiation codons differs in plants and animals. EMBO J. January 1987, 6 (1): 43–8. PMC 553354  . PMID 3556162.

[Angioletti2004-1] 1.0 ^1.1 ^1.2 De Angioletti M, Lacerra G, Sabato V, Carestia C. Beta+45 G --> C: a novel silent beta-thalassaemia mutation, the first in the Kozak sequence. Br J Haematol. 2004, 124 (2): 224–31. PMID 14687034. doi:10.1046/j.1365-2141.2003.04754.x.

[Kozak1987-2] 2.0 ^2.1 Kozak M. An analysis of 5'-noncoding sequences from 699 vertebrate messenger RNAs. Nucleic Acids Res. October 1987, 15 (20): 8125–8148 [2014-02-05]. PMC 306349  . PMID 3313277. doi:10.1093/nar/15.20.8125. （原始內容存檔於2019-09-15）.

[NCIUB-3] Nomenclature for Incompletely Specified Bases in Nucleic Acid Sequences （頁面存檔備份，存於互聯網檔案館）, NC-IUB, 1984.

[Kozak1984-4] Kozak M. Point mutations close to the AUG initiator codon affect the efficiency of translation of rat preproinsulin in vivo. Nature. 1984, 308 (5956): 241–246 [2014-02-05]. PMID 6700727. doi:10.1038/308241a0. （原始內容存檔於2017-05-06）.

[Kozak1986-5] Kozak M. Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes. Cell. 1986, 44 (2): 283–92 [2014-02-05]. PMID 3943125. doi:10.1016/0092-8674(86)90762-2. （原始內容存檔於2017-05-06）.

[Dunston2004-6] Dunston JA, Hamlington JD, Zaveri J; et al. The human LMX1B gene: transcription unit, promoter, and pathogenic mutations. Genomics. September 2004, 84 (3): 565–76. PMID 15498463. doi:10.1016/j.ygeno.2004.06.002.

[7] Cavener DR. Comparison of the consensus sequence flanking translational start sites in Drosophila and vertebrates. Nucleic Acids Res. February 1987, 15 (4): 1353–61 [2014-02-05]. PMC 340553  . PMID 3822832. doi:10.1093/nar/15.4.1353. （原始內容存檔於2019-09-15）.

[8] Hamilton R, Watanabe CK, de Boer HA. Compilation and comparison of the sequence context around the AUG startcodons in Saccharomyces cerevisiae mRNAs. Nucleic Acids Res. April 1987, 15 (8): 3581–93 [2014-02-05]. PMC 340751  . PMID 3554144. doi:10.1093/nar/15.8.3581. （原始內容存檔於2019-09-15）.

[Yamauchi1991-9] 9.0 ^9.1 ^9.2 ^9.3 Yamauchi K. The sequence flanking translational initiation site in protozoa. Nucleic Acids Res. May 1991, 19 (10): 2715–20 [2014-02-05]. PMC 328191  . PMID 2041747. doi:10.1093/nar/19.10.2715. （原始內容存檔於2019-09-15）.

[10] Seeber, F. Consensus sequence of translational initiation sites from Toxoplasma gondii genes. Parasitology Research. 1997, 83 (3): 309–311. PMID 9089733. doi:10.1007/s004360050254.

[11] Lütcke HA, Chow KC, Mickel FS, Moss KA, Kern HF, Scheele GA. Selection of AUG initiation codons differs in plants and animals. EMBO J. January 1987, 6 (1): 43–8. PMC 553354  . PMID 3556162.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]