LOCUS       KJ778813                2082 bp    DNA     linear   BCT 29-MAR-2016
DEFINITION  Escherichia coli strain F 8198-41 serotype O57:K-:H- O-antigen gene
            cluster, complete sequence.
ACCESSION   KJ778813
VERSION     KJ778813.1
KEYWORDS    .
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 2082)
  AUTHORS   DebRoy,C., Fratamico,P.M., Yan,X., Baranzoni,G., Liu,Y.,
            Needleman,D.S., Tebbs,R., O'Connell,C.D., Allred,A., Swimley,M.,
            Mwangi,M., Kapur,V., Raygoza Garay,J.A., Roberts,E.L. and Katani,R.
  TITLE     Comparison of O-Antigen Gene Clusters of All O-Serogroups of
            Escherichia coli and Proposal for Adopting a New Nomenclature for
            O-Typing
  JOURNAL   PLoS ONE 11 (1), E0147434 (2016)
   PUBMED   26824864
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 2082)
  AUTHORS   Yan,X., Chen,C.-Y., Fratamico,P.M., Tebbs,R.S., O'Connell,C.D.,
            Baranzoni,G.M., Debroy,C. and Liu,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-MAY-2014) Molecular Characterization of Foodborne
            Pathogens Research Unit, USDA-ARS, 600 East Mermaid Lane, Wyndmoor,
            PA 19038, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: CLC Genomics Workbench v. 7.0
            Coverage              :: >50X
            Sequencing Technology :: IonTorrent
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..2082
                     /organism="Escherichia coli"
                     /mol_type="genomic DNA"
                     /strain="F 8198-41"
                     /serotype="O57:K-:H-"
                     /db_xref="taxon:562"
     misc_feature    1..2082
                     /note="O-antigen gene cluster"
     CDS             complement(201..704)
                     /codon_start=1
                     /transl_table=11
                     /product="transposase"
                     /protein_id="AJE27136.1"
                     /translation="MKELSNPEHDSYAISEKSHGREEIRLHIVCDIPDELIDFTFEWK
                     GLKKLCMAVSFRSIIAEQKKEPEMTVRYYIRSAHLTAEKFATAIRNHWHVENKLHWRL
                     DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASV
                     LAGSGLS"
     CDS             complement(768..1271)
                     /codon_start=1
                     /transl_table=11
                     /product="IS1 transposase"
                     /protein_id="AJE27137.1"
                     /translation="MPGNCPHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMD
                     EQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDG
                     WPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY
                     LNIKHYQ"
ORIGIN      
        1 cgccccatca ctgccatgcc gacgacgccg atctgttgat ttgacattat ttactcctgt
       61 cagggggtgt tcaccgcgat gtatgcgcga cttgaatatg attgagatgt tatctcaagt
      121 tgcacgaatg tgtgtagtgc tgaatttcat taatgcaacg aatacttttc ttcttcctat
      181 accccatcat cagggcaaga ttacgaaagc ccgctccccg caaggactga cgccaggtag
      241 tttctgtcca tcgctgcttt tcgcatctta cgtcttaacc ctgccttgaa taccttatca
      301 ttcgttaaaa tattaatagc gatgtgccgt atccctgaaa ataattctgc tgcatttcct
      361 cttcttattt tgcagtcgtc ttcattcatt accacgtcca gacgccagtg cagcttattc
      421 tccacgtgcc agtgatttcg gatcgctgtg gcgaacttct ctgcggttaa atgagcagaa
      481 cggatataat atcttaccgt catttcgggc tctttctttt gttctgctat tattgaccga
      541 aaggagactg ccatgcataa tttcttcagc cctttccatt caaacgtgaa atcaataagt
      601 tcatcaggga tatcgcaaac aatatgaaga cggatttctt ctctgccgtg actcttttca
      661 ctaattgcgt aactgtcatg ctctggatta cttaattctt tcaacggaaa tttttcctcg
      721 aaggctttat ttagccgccc ctggtttcct ttggtaatga ctccaactta ttgatagtgt
      781 tttatgttca gataatgccc gatgactttg tcatgcagct ccaccgattt tgagaacgac
      841 agcgacttcc gtcccagccg tgccaggtgc tgcctcagat tcaggttatg ccgctcaatt
      901 cgctgcgtat atcgcttgct gattacgtgc agctttccct tcaggcggga ttcatacagc
      961 ggccagccat ccgtcatcca tatcaccacg tcaaagggtg acagcaggct cataagacgc
     1021 cccagcgtcg ccatagtgcg ttcaccgaat acgtgcgcaa caaccgtctt ccggagcctg
     1081 tcatacgcgt aaaacagcca gcgctggcgc gatttagccc cgacatagcc ccactgttcg
     1141 tccatttccg cgcagacgat gacgtcactg cccggctgta tgcgcgaggt taccgactgc
     1201 ggcctgagtt ttttaagtga cgtaaaatcg tgttgaggcc aacgcccata atgcgggcag
     1261 ttgcccggca tccaacgcca ttcatggcca tatcaatgat tttctggtgc gtaccgggtt
     1321 gagaagcggt gtaagtgaac tgcagttgcc atgttttacg gcagtgagag cagagatagc
     1381 gctgatgtcc ggcggtgctt ttgccgttac gcaccacccc gtcagtagct gaacaggagg
     1441 gacagctgat agaaacagaa gccactggag cacctcaaaa acaccatcat acactaaatc
     1501 agtaagttgg cagcatcacc cctgagtatt atttataatg tgacgaacta cagcagaacc
     1561 aataaatcct gcgccaccag taacaagtat tttcacttaa tttattccat attacttcag
     1621 agcatgctgt gaaataagcg gctctcagtt tgattaatag aggtattaat gcacgctacc
     1681 gcccctggct ttacagctac cagagcactg catgcatgcc tacgatgtag cgagcgttac
     1741 ccactcgcgc ttaacccgaa aaattcaaac gctaattgtc ttaccaatcc gccctggaaa
     1801 caaggaaaat cctggaaaac tttgactaaa atcctattgc taactcgttg ttatcctgat
     1861 tgtttatata aaacaacggc aggaaaattc gcaacaaatt actttcacca cgaatcttca
     1921 ctgccgttat aattttctta tcaaccgtta cattcggtca gattttcatt attcgcttaa
     1981 cagcttctca atacctttac ggaacttcgc cccttctttc aggttgcgta gtccatactt
     2041 cacaaatgcc tgcatataac ccattttttt accgcagtcg ta
//