GENEMARK PREDICTIONS Sequence: gi|145294040|ref|NC_009347.1| Shigella sonnei Ss046 plasmid pSS046_spC, complete sequence Sequence file: NC_009347.fna Sequence length: 2101 GC Content: 47.12% Window length: 96 Window step: 12 Threshold value: 0.500 --- Matrix: Heuristic, GC = 47 Matrix author: Borodovsky Laboratory - Georgia Tech, School of Biology, Atlanta, GA, USA Matrix order: 2 List of Open reading frames predicted as CDSs, shown with alternate starts (regions from start to stop codon w/ coding function >0.50) Left Right DNA Coding Avg Start end end Strand Frame Prob Prob -------- -------- ---------- ----- ---- ---- 703 1668 direct fr 1 0.55 0.29 832 1668 direct fr 1 0.55 0.01 877 1668 direct fr 1 0.56 0.04 967 1668 direct fr 1 0.63 0.68 1009 1668 direct fr 1 0.67 0.01 1096 1668 direct fr 1 0.73 0.55 1117 1668 direct fr 1 0.72 0.19 1171 1668 direct fr 1 0.69 0.03 1672 1938 direct fr 1 0.54 0.68 1741 1938 direct fr 1 0.64 0.39 List of Regions of interest (regions from stop to stop codon w/ a signal in between) LEnd REnd Strand Frame -------- -------- ----------- ----- 655 1668 direct fr 1 1666 1938 direct fr 1 1833 2081 direct fr 3 -------------------- ABOUT THE MATRIX USED: For details on the model building procedure see: Besemer J. and Borodovsky M. "Heuristic approach to deriving models for gene finding" NAR, 1999, Vol. 27, No. 19, pp. 3911-3920