GENEMARK PREDICTIONS Sequence: gi|56973315|ref|NC_004721.2| Bacillus cereus ATCC 14579 plasmid pBClin15, complete sequence Sequence file: NC_004721.fna Sequence length: 15274 GC Content: 38.03% Window length: 96 Window step: 12 Threshold value: 0.500 --- Matrix: Heuristic, GC = 38 Matrix author: Borodovsky Laboratory - Georgia Tech, School of Biology, Atlanta, GA, USA Matrix order: 2 List of Open reading frames predicted as CDSs, shown with alternate starts (regions from start to stop codon w/ coding function >0.50) Left Right DNA Coding Avg Start end end Strand Frame Prob Prob -------- -------- ---------- ----- ---- ---- 399 551 direct fr 3 0.58 0.40 532 1035 direct fr 1 0.76 0.70 538 1035 direct fr 1 0.76 0.65 586 1035 direct fr 1 0.79 0.33 616 1035 direct fr 1 0.77 0.05 718 1035 direct fr 1 0.80 0.43 772 1035 direct fr 1 0.80 0.24 1053 1325 direct fr 3 0.58 0.17 1068 1325 direct fr 3 0.60 0.54 1104 1325 direct fr 3 0.69 0.40 1110 1325 direct fr 3 0.73 0.47 1194 1325 direct fr 3 0.63 0.28 1390 2091 direct fr 1 0.73 0.93 1507 2091 direct fr 1 0.75 0.39 1555 2091 direct fr 1 0.75 0.16 1597 2091 direct fr 1 0.76 0.14 1786 2091 direct fr 1 0.79 0.57 2104 4293 direct fr 1 0.80 0.87 2188 4293 direct fr 1 0.82 0.20 2356 4293 direct fr 1 0.83 0.04 2395 4293 direct fr 1 0.83 0.15 2416 4293 direct fr 1 0.84 0.29 4771 5127 direct fr 1 0.70 0.87 4780 5127 direct fr 1 0.72 0.92 4938 5669 direct fr 3 0.53 0.45 5115 5669 direct fr 3 0.65 0.63 5268 5669 direct fr 3 0.55 0.13 5280 5669 direct fr 3 0.55 0.27 5292 5669 direct fr 3 0.56 0.35 5310 5669 direct fr 3 0.55 0.80 5316 5669 direct fr 3 0.55 0.64 5822 6049 direct fr 2 0.50 0.51 6021 6671 direct fr 3 0.76 0.39 6054 6671 direct fr 3 0.81 0.57 6102 6671 direct fr 3 0.82 0.18 6117 6671 direct fr 3 0.82 0.52 6135 6671 direct fr 3 0.82 0.61 6726 6824 direct fr 3 0.53 0.03 6980 8047 direct fr 2 0.64 0.70 7058 8047 direct fr 2 0.67 0.10 7076 8047 direct fr 2 0.67 0.20 7229 8047 direct fr 2 0.70 0.06 8088 8318 direct fr 3 0.55 0.55 8130 8318 direct fr 3 0.69 0.25 8255 8494 direct fr 2 0.54 0.07 8321 8494 direct fr 2 0.70 0.87 8342 8494 direct fr 2 0.67 0.03 8390 8494 direct fr 2 0.53 0.28 8568 8990 direct fr 3 0.73 0.26 8676 8990 direct fr 3 0.86 0.21 8727 8990 direct fr 3 0.83 0.24 8742 8990 direct fr 3 0.82 0.16 9613 10206 direct fr 1 0.72 0.74 9682 10206 direct fr 1 0.77 0.43 9709 10206 direct fr 1 0.76 0.03 9730 10206 direct fr 1 0.77 0.03 9826 10206 direct fr 1 0.83 0.65 9838 10206 direct fr 1 0.83 0.35 10210 10941 direct fr 1 0.70 0.87 10297 10941 direct fr 1 0.75 0.11 10333 10941 direct fr 1 0.74 0.18 10414 10941 direct fr 1 0.73 0.22 10880 11410 direct fr 2 0.60 0.58 10889 11410 direct fr 2 0.62 0.80 11033 11410 direct fr 2 0.70 0.90 11039 11410 direct fr 2 0.70 0.81 11066 11410 direct fr 2 0.68 0.30 11414 12217 direct fr 2 0.68 0.33 11423 12217 direct fr 2 0.68 0.28 11645 12217 direct fr 2 0.82 0.49 11654 12217 direct fr 2 0.83 0.45 11774 12217 direct fr 2 0.80 0.44 12390 14183 direct fr 3 0.69 0.09 12396 14183 direct fr 3 0.69 0.08 12549 14183 direct fr 3 0.68 0.25 14196 14483 direct fr 3 0.63 0.70 14220 14483 direct fr 3 0.69 0.76 14226 14483 direct fr 3 0.72 0.65 14298 14483 direct fr 3 0.68 0.17 14304 14483 direct fr 3 0.68 0.17 14496 15179 direct fr 3 0.79 0.56 14556 15179 direct fr 3 0.85 0.13 14649 15179 direct fr 3 0.84 0.03 14778 15179 direct fr 3 0.83 0.01 14787 15179 direct fr 3 0.83 0.03 List of Regions of interest (regions from stop to stop codon w/ a signal in between) LEnd REnd Strand Frame -------- -------- ----------- ----- 357 551 direct fr 3 499 1035 direct fr 1 1050 1325 direct fr 3 1363 2091 direct fr 1 2089 4293 direct fr 1 4221 4496 direct fr 3 4494 4661 direct fr 3 4765 5127 direct fr 1 4923 5669 direct fr 3 5125 5760 direct fr 1 5720 6049 direct fr 2 6012 6671 direct fr 3 6669 6824 direct fr 3 6803 6976 direct fr 2 6974 8047 direct fr 2 8073 8318 direct fr 3 8249 8494 direct fr 2 8559 8990 direct fr 3 9131 9310 direct fr 2 9289 9600 direct fr 1 9607 10206 direct fr 1 10204 10941 direct fr 1 10859 11410 direct fr 2 11143 11502 complement fr 3 11408 12217 direct fr 2 12207 14183 direct fr 3 14181 14483 direct fr 3 14481 15179 direct fr 3 POSSIBLE SEQUENCE FRAMESHIFTS DETECTED From To Frame Frame At base... ----- ----- ---------- 3 1 5520 +/- 11 bp (direct) -------------------- ABOUT THE MATRIX USED: For details on the model building procedure see: Besemer J. and Borodovsky M. "Heuristic approach to deriving models for gene finding" NAR, 1999, Vol. 27, No. 19, pp. 3911-3920