GENEMARK PREDICTIONS

Sequence: gi|58038467|ref|NC_006675.1| Gluconobacter oxydans 621H plasmid pGOX4, complete sequence
Sequence file: NC_006675.fna
Sequence length: 13223
GC Content: 54.37%
Window length: 96
Window step: 12
Threshold value: 0.500
---
Matrix: Heuristic, GC = 54
Matrix author: Borodovsky Laboratory - Georgia Tech, School of Biology, Atlanta, GA, USA
Matrix order: 2

List of Open reading frames predicted as CDSs, shown with alternate starts
(regions from start to stop codon w/ coding function >0.50)

    Left     Right  DNA         Coding  Avg    Start
     end       end  Strand      Frame   Prob   Prob
--------  --------  ----------  -----   ----   ----
     128      1102  direct      fr 2    0.51   0.25
     143      1102  direct      fr 2    0.51   0.14
     185      1102  direct      fr 2    0.53   0.09
     260      1102  direct      fr 2    0.52   0.11
    2694      3395  direct      fr 3    0.66   0.11
    2718      3395  direct      fr 3    0.68   0.56
    2724      3395  direct      fr 3    0.68   0.68
    2745      3395  direct      fr 3    0.70   0.48
    2835      3395  direct      fr 3    0.69   0.04
    3027      3395  direct      fr 3    0.70   0.64
    3132      3395  direct      fr 3    0.68   0.60
    3392      3667  direct      fr 2    0.50   0.39
    3431      3667  direct      fr 2    0.57   0.09
    3557      3667  direct      fr 2    0.50   0.02
    4095      5174  complement  fr 2    0.72   0.72
    4095      5087  complement  fr 2    0.72   0.53
    4095      5075  complement  fr 2    0.71   0.37
    4095      5000  complement  fr 2    0.70   0.75
    5211      5633  complement  fr 2    0.72   0.31
    5211      5564  complement  fr 2    0.81   0.70
    5211      5534  complement  fr 2    0.80   0.28
    5211      5453  complement  fr 2    0.73   0.46
    6241      7104  direct      fr 1    0.71   0.38
    6289      7104  direct      fr 1    0.75   0.23
    6322      7104  direct      fr 1    0.75   0.27
    6436      7104  direct      fr 1    0.74   0.74
    8182      8811  direct      fr 1    0.60   0.26
    8188      8811  direct      fr 1    0.61   0.32
    8206      8811  direct      fr 1    0.62   0.24
    8347      8811  direct      fr 1    0.64   0.30
    8371      8811  direct      fr 1    0.63   0.11
    8931      9170  direct      fr 3    0.58   0.73
    8943      9170  direct      fr 3    0.57   0.49
    8967      9170  direct      fr 3    0.52   0.08
    9290      9628  complement  fr 1    0.69   0.72
    9290      9619  complement  fr 1    0.69   0.31
    9290      9610  complement  fr 1    0.68   0.35
   10130     10546  complement  fr 1    0.62   0.12
   10130     10507  complement  fr 1    0.68   0.16
   10130     10492  complement  fr 1    0.67   0.20
   10130     10447  complement  fr 1    0.67   0.79
   10822     11475  direct      fr 1    0.76   0.03
   10918     11475  direct      fr 1    0.81   0.38
   10975     11475  direct      fr 1    0.80   0.37
   11062     11475  direct      fr 1    0.76   0.13
   12081     12239  direct      fr 3    0.58   0.30
   12111     12239  direct      fr 3    0.50   0.43
   12224     13054  complement  fr 1    0.53   0.57
   12224     12955  complement  fr 1    0.51   0.00
   12224     12823  complement  fr 1    0.53   0.34
   12224     12775  complement  fr 1    0.52   0.29
   12224     12760  complement  fr 1    0.51   0.23

List of Regions of interest
(regions from stop to stop codon w/ a signal in between)

    LEnd      REnd  Strand       Frame
--------  --------  -----------  -----
     104      1102  direct       fr 2
     476      1045  complement   fr 1
    1182      1346  direct       fr 3
    1696      1878  complement   fr 3
    1745      2269  direct       fr 2
    2667      3395  direct       fr 3
    3371      3667  direct       fr 2
    3512      3733  complement   fr 1
    4044      4220  direct       fr 3
    4095      5213  complement   fr 2
    5211      5639  complement   fr 2
    6235      7104  direct       fr 1
    7117      7698  direct       fr 1
    7674      7943  direct       fr 3
    8116      8811  direct       fr 1
    8784      9128  complement   fr 2
    8871      9170  direct       fr 3
    9290      9712  complement   fr 1
    9607      9894  complement   fr 3
    9652     10017  direct       fr 1
   10130     10555  complement   fr 1
   10726     11475  direct       fr 1
   11530     12081  direct       fr 1
   11874     12239  direct       fr 3
   12224     13072  complement   fr 1
   12293     12520  direct       fr 2

--------------------

ABOUT THE MATRIX USED:
For details on the model building procedure see:
Besemer J. and Borodovsky M.
"Heuristic approach to deriving models for gene finding"
NAR, 1999, Vol. 27, No. 19, pp. 3911-3920
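
NOTE ON READING THE CDS LIST:
In the CDS list above, alternate starts of one open reading frame share a stop codon: the right end on the direct strand, the left end on the complement strand. The Python sketch below groups rows on that basis and keeps, for each ORF, the candidate with the highest start probability. This selection rule is an illustrative assumption, not GeneMark's own procedure, and the names Start, parse_row, group_by_orf and best_start are hypothetical helpers. The sample rows are copied from the list above.

```python
from collections import defaultdict
from typing import NamedTuple


class Start(NamedTuple):
    left: int
    right: int
    strand: str
    frame: int
    avg_prob: float
    start_prob: float


def parse_row(line: str) -> Start:
    # Fields: left end, right end, strand, the literal "fr", frame number,
    # average coding probability, start probability.
    left, right, strand, _fr, frame, avg, start = line.split()
    return Start(int(left), int(right), strand, int(frame), float(avg), float(start))


def group_by_orf(rows):
    # Alternate starts share the stop codon coordinate:
    # right end on the direct strand, left end on the complement strand.
    groups = defaultdict(list)
    for r in rows:
        stop = r.right if r.strand == "direct" else r.left
        groups[(r.strand, stop)].append(r)
    return groups


def best_start(candidates):
    # Assumed heuristic for illustration: highest start probability wins.
    return max(candidates, key=lambda r: r.start_prob)


SAMPLE = """\
4095 5174 complement fr 2 0.72 0.72
4095 5087 complement fr 2 0.72 0.53
4095 5075 complement fr 2 0.71 0.37
4095 5000 complement fr 2 0.70 0.75
8931 9170 direct fr 3 0.58 0.73
8943 9170 direct fr 3 0.57 0.49
8967 9170 direct fr 3 0.52 0.08
"""

rows = [parse_row(line) for line in SAMPLE.splitlines() if line.strip()]
picks = {key: best_start(group) for key, group in group_by_orf(rows).items()}

for (strand, stop), r in sorted(picks.items(), key=lambda kv: kv[1].left):
    length = r.right - r.left + 1  # coordinates are inclusive
    print(r.left, r.right, strand, f"fr {r.frame}", length, r.start_prob)
```

Ranking by average coding probability instead only requires changing the key in best_start to r.avg_prob. The two criteria can disagree: in the 4095 group, the highest start probability belongs to the shortest candidate.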