LOCUS       KJ755560                6430 bp    DNA     linear   BCT 29-MAR-2016
DEFINITION  Escherichia coli strain E 110-69 serotype O160:K-:H34 O-antigen
            gene cluster, complete sequence.
ACCESSION   KJ755560
VERSION     KJ755560.1
KEYWORDS    .
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 6430)
  AUTHORS   DebRoy,C., Fratamico,P.M., Yan,X., Baranzoni,G., Liu,Y.,
            Needleman,D.S., Tebbs,R., O'Connell,C.D., Allred,A., Swimley,M.,
            Mwangi,M., Kapur,V., Raygoza Garay,J.A., Roberts,E.L. and Katani,R.
  TITLE     Comparison of O-Antigen Gene Clusters of All O-Serogroups of
            Escherichia coli and Proposal for Adopting a New Nomenclature for
            O-Typing
  JOURNAL   PLoS ONE 11 (1), E0147434 (2016)
   PUBMED   26824864
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 6430)
  AUTHORS   Yan,X., Fratamico,P.M., Tebbs,R.S., O'Connell,C.D., Baranzoni,G.M.,
            Liu,Y. and DebRoy,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-APR-2014) Molecular Characterization of Foodborne
            Pathogens Research Unit, USDA-ARS, 600 East Mermaid Lane, Wyndmoor,
            PA 19038, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: CLC Genomics Workbench v. 7.0
            Coverage              :: >50X
            Sequencing Technology :: IonTorrent
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..6430
                     /organism="Escherichia coli"
                     /mol_type="genomic DNA"
                     /strain="E 110-69"
                     /serotype="O160:K-:H34"
                     /db_xref="taxon:562"
     misc_feature    1..6430
                     /note="O-antigen gene cluster"
     gene            134..1381
                     /gene="wzx"
     CDS             134..1381
                     /gene="wzx"
                     /codon_start=1
                     /transl_table=11
                     /product="O-antigen flippase"
                     /protein_id="AJE24473.1"
                     /translation="MSLKKNVIYLVLMQAVNYIAPLVLVPYLTRILGVEKYGVLGLAI
                     TVSQYLILLTDFGFNFTASRKIAQFKDSKVRVSQIFWTIISAKFLMMIVSFGLIVPFV
                     VFSEKLNPLKWEIFLVSLSVVASVIIPSWLFQGLEKVTVFSGINIFSKILIVPLVFIF
                     VKSKEDLLIACLLQGGVQVFSGIISILYVKYNKIISFKVVRPKLIFIYLKESLSVFLG
                     NLSISLYTLSTPLVLALMGTTYQVGLYSATDRIRGAAIGIFIVVGYAIFPRVSYLFKK
                     NPLEANVLLKKIIFIFSILGCLGGILVYSIADEIVLVAFGNQYLDSAILLKIMAPMFL
                     LIPLSIIMANYLLLPNGFKKEYAKNSVIVCLLHMIYVFPLCKYYGAVGGSYSILISEI
                     ISFILLIFWTIKNNLLKKVFYAR"
     gene            1371..1937
                     /gene="epsJ"
     CDS             1371..1937
                     /gene="epsJ"
                     /codon_start=1
                     /transl_table=11
                     /product="glycosyl transferase"
                     /protein_id="AJE24474.1"
                     /translation="MQDNKSIKLSVIIPCWNCSKYITKTLDSIKYSYYMSRKPILEII
                     LVDDGSTDLTSKIIKAYDFGTRAVNVKYHFQNNAGPSKARNQGIKLAQGRYVTFLDSD
                     DIWSPDYLRIIESIMDKYDSEIIEFNAVRFIEENANLKIHNNYTLVDEEYHGPINERI
                     LSEVFIKSEWYVWARVYKKKTFGRRVLQ"
     CDS             1882..2373
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="AIG62566.1"
                     /translation="MFGLVFTKKKLLEGESFNENITHHEDAEFLPRIYLTAKSISRIT
                     SQLIYYRLTQNSITTKPKISSIKDLTLVCELYYENINGPNDKYYKAAMINCLWGLKRL
                     ILDKGEFSFIKKSRILYYCHIARNSSQLFNMLSWKKRLFIYLPRTYIFLNSLKYKLSK
                     RGA"
     gene            2373..3305
                     /gene="epsH"
     CDS             2373..3305
                     /gene="epsH"
                     /codon_start=1
                     /transl_table=11
                     /product="glycosyl transferase family 2"
                     /protein_id="AJE24475.1"
                     /translation="MEISIVIPLYNKENYIKRTLLSIIDTFGRVYNEKEYEIVIVDDG
                     SKDKSVSVVESINSNVVKLYKQENGGPSKARNNGVNFSDGKYVIFCDADDIILPNYGD
                     YIKYSIDNYPDHDVFVAFSRVLRNSKENYSIPLFDPVDVSIVDDFFYEWDKRKFISAS
                     SICIKKHFFIQNDLYFDETISSGEDLLMFYKAAIKTKYVAYSQPAVLYDKTVDSQLSS
                     NPDLKIGAHTYFLIKVYKTENLSVKDKTAINRIIDKQICFVAVDNLLRKKYANSLNIV
                     LFRKRMLLQPKLAIKLIIASISYKLFTKISSLLR"
     gene            3302..4402
                     /gene="wzy"
     CDS             3302..4402
                     /gene="wzy"
                     /codon_start=1
                     /transl_table=11
                     /product="O-antigen polymerase"
                     /protein_id="AJE24476.1"
                     /translation="MIIENNKVSSHSYLWVLFQSFIFYYKFMLQSWDGDEQIVNVISI
                     ILTLLLLCGTISAFIISNIKEKCIFLIIFIFVALNIIIADNKSVFWMVTAFTFLILFS
                     TLTIKNRIRILVFSFIIAWCFFLPLQIFFSNSYTYIDDRYLRYTFGFLNPNGLGMFLL
                     LLQTLLYYWIWTSIKATIIVKQMITIILGGSIICIIFLSESRTYILLSFLLLILTVIY
                     GFKKFKFSSRLLFIYLLFVMLLQWLSVKGFENYLIFQDMNAYMSGRVWFSYNLLSQMG
                     EPKFFIGSDISLYQPIDFFFISLLYNNGILASLILLYCNYIFLKKLDNSTKYESILAF
                     IFITVSFTEAVYNIPLLNFFFLLLYKKELRFS"
     gene            4399..5418
                     /gene="sacB"
     CDS             4399..5418
                     /gene="sacB"
                     /codon_start=1
                     /transl_table=11
                     /product="capsular polysaccharide phosphotransferase"
                     /protein_id="AJE24477.1"
                     /translation="MKVDAVILWVDGNDPKWQEEYNKYCKPASRIENGVQRYRDWDTL
                     RYVIRGIEYNLPWIDKIHFVTCGQKPSWMVGYHPKLNFVHHNDIFENDTFLPTFNSSA
                     IELNLSRIKGLSERFIYFNDDMLVLKNTPLQRFFVNDLPVDFLIEAFPRRGLLYEKIR
                     SNSTWVSMINNCTSLINRVYHKNKYIQDNRNLYYNMNYGRHVIANVLASPFKQFLAFK
                     HYHHPQAYLKKTLQSVEREFPVEFNLTCKSRFREHDNISQALFRYYQLVTGSFYPCYY
                     NDHACVNVVNKKSASQCIEALHQKRFVCINDEINDDLDDSSILINDIIKELDLILPNK
                     SSFEI"
     gene            5425..6243
                     /gene="wbbD"
     CDS             5425..6243
                     /gene="wbbD"
                     /codon_start=1
                     /transl_table=11
                     /product="UDP-Gal:alpha-D-GlcNAc-diphosphoundecaprenol
                     beta-1,3-galactosyltransferase"
                     /protein_id="AJE24478.1"
                     /translation="MNSFDEFNVLLSLYKNESPDNLDACFQSISTQSLKRFKIILVID
                     GPISSELNEVVGKWKSLLPIKIINLERNVGLGNALNIGLKYCSCDYVFRMDTDDICHP
                     DRFSIQFSYLRKHPDIDLLGGQIVEFHECIEEPNGMRLVPSKYEEILQYCKLKNPFNH
                     MTVVFKRESVLKVGGYKHHLYMEDYNLWLRMISIGCKVENLDDVIVFARTDVNSLMRR
                     RGWQYVKSEWKLALLKIKLRINNPIVSLSVFILRSIPRLLPIMLIRRIYAHNRK"
ORIGIN      
        1 ataactatta atgagcatag tgcactggta gctatagagc caggggcggt agcttgttta
       61 gtgttaggta ttctggtttt aaaagcataa aactctccag tattttcatt gcatacataa
      121 tattattcat taaatgtcat taaaaaaaaa tgttatatat ctagtgttga tgcaagcggt
      181 taattatatt gctcctcttg ttttagtacc atatttaacg cgaatccttg gagtggaaaa
      241 atatggcgtt ctaggtcttg ctattacggt tagccagtac ttaattttgt tgacggattt
      301 tggttttaat tttactgcca gtagaaaaat agctcaattt aaagatagta aagtcagggt
      361 tagtcaaata ttttggacta tcatctcagc aaaattttta atgatgatag tatcattcgg
      421 tttaattgtc ccttttgttg tgttttcaga aaaattaaat cccttaaaat gggaaatatt
      481 tttagtttcg ctatctgtag ttgcgagtgt gattattcca tcttggttgt ttcaggggct
      541 tgaaaaggtt acagtttttt ccggcatcaa tattttttct aagattttga ttgtgcctct
      601 tgtctttatt tttgtgaaat ctaaggagga tttactgatt gcctgcttat tacaaggtgg
      661 agtgcaagta ttttctggaa taatatccat attgtatgtt aagtataata agattatttc
      721 atttaaagtt gtgaggccaa aattaatctt tatatattta aaagagtctc tatcagtatt
      781 tttaggtaat ttatctattt ctttatatac tctcagtacg cccttagttt tagctcttat
      841 ggggacaacg tatcaagttg ggctttatag tgccactgat agaatcagag gggctgcaat
      901 tggtatcttt atagttgttg gatatgcaat tttccccaga gtaagctatt tgtttaaaaa
      961 aaatccctta gaagctaatg ttttgcttaa aaaaataata tttatcttct cgattctggg
     1021 atgtttaggt ggaatacttg tatatagtat tgctgatgag attgttttag ttgcatttgg
     1081 taatcaatat ttagattctg ccattctttt gaaaataatg gcgcctatgt ttttactcat
     1141 tcctctttca ataattatgg caaactactt attattgcca aatggtttta aaaaagagta
     1201 tgcaaaaaat tctgtgatag tatgcttatt gcatatgata tatgttttcc ctctttgtaa
     1261 atactatggt gcagttggcg gtagttattc gattcttatt tcagagataa taagctttat
     1321 tttattaatt ttttggacga taaaaaataa tttattgaag aaggtttttt atgcaagata
     1381 ataaaagtat aaaactaagc gttataatac catgttggaa ctgtagtaaa tatatcacaa
     1441 aaacattgga ttctatcaaa tattcatatt atatgtctag aaagccaatt ttagagatta
     1501 ttttagtgga tgatggctct acagacttaa caagcaaaat aatcaaagcg tatgacttcg
     1561 gaactcgagc ggttaatgtt aaatatcatt ttcaaaataa cgccggtcct agtaaggcac
     1621 gaaatcaagg aattaaatta gcgcaaggta ggtatgtgac tttcttggat tcggatgata
     1681 tatggagtcc agattacctt agaatcatag agtctataat ggacaaatat gattctgaaa
     1741 ttattgaatt taatgccgtt aggtttattg aagaaaatgc caatttaaaa attcataata
     1801 attatacatt agtggacgag gagtatcatg gtcctataaa tgaacgtatt ctttctgagg
     1861 tttttataaa aagcgagtgg tatgtttggg ctcgtgttta caaaaaaaaa acttttggaa
     1921 ggagagtcct tcaatgaaaa tattacacat catgaagatg ctgaatttct accacggatt
     1981 tatcttacag ctaaatcaat cagcagaatt acttcccagt taatttatta tagattaacg
     2041 cagaatagta ttacaacaaa accaaaaatt agtagcatca aagatctcac tttagtatgc
     2101 gaattgtatt atgaaaatat caatgggcca aatgacaaat attataaggc tgcaatgatc
     2161 aactgtctct ggggtttaaa aagactaatc cttgataaag gagaatttag ttttataaaa
     2221 aaaagtagaa tattatatta ctgccatatt gctagaaata gttcacaact ttttaatatg
     2281 ttaagttgga aaaaaagatt atttatttat cttcctcgaa cctatatatt tttaaattcc
     2341 cttaaatata agctgtcaaa acgtggtgca taatggaaat atcaatagta attccactgt
     2401 ataacaaaga gaattatatc aagaggacgt tgctttcaat aattgatact tttgggagag
     2461 tctataacga gaaagagtat gaaatagtca tcgtggatga tgggtcaaaa gataaaagtg
     2521 taagtgtagt agaaagtata aatagtaatg ttgtaaaact ctacaaacaa gaaaatggcg
     2581 gtccttcaaa agcaagaaat aatggtgtaa atttcagtga tggaaaatat gtcatttttt
     2641 gtgatgcaga tgatattatt ttaccaaatt atggtgatta tataaaatat tccattgata
     2701 attatcctga ccatgatgtt tttgttgcat ttagccgtgt tttaagaaac agtaaagaaa
     2761 attatagcat acctcttttt gaccccgtgg atgtttctat tgttgatgat tttttctatg
     2821 agtgggataa gagaaagttt attagtgcat cgtcaatttg tataaaaaaa catttcttta
     2881 tacaaaatga cttgtatttt gatgaaacga tttcatcagg tgaagattta ttgatgtttt
     2941 ataaagctgc gattaaaact aaatatgttg cctatagtca accggcagta ttatatgata
     3001 aaacagtaga ttcgcagtta agttctaatc ctgatttgaa aattggtgca catacttatt
     3061 ttttaataaa agtctacaaa acagagaatt tatcagtcaa agataaaact gcaataaatc
     3121 gaattatcga taaacaaata tgctttgttg cagtggataa tctcttacgt aaaaagtatg
     3181 caaattcgtt aaatatagta ttgttccgca aacgaatgct tttacaacct aaattagcta
     3241 taaaattaat tatagccagt atttcgtata aactttttac taaaatatct agtctactaa
     3301 gatgatcata gagaataata aagtgagttc acatagctat ttatgggttc ttttccaatc
     3361 ctttatattt tattacaaat tcatgctcca aagttgggac ggtgatgaac agattgtaaa
     3421 tgtaatatct atcatcttaa ccttacttct tttatgtggt acgattagtg cttttataat
     3481 ctcaaatatt aaagagaaat gtatattttt aataattttc atttttgttg ccttgaacat
     3541 aattatagca gataacaaat ctgtcttttg gatggtgacg gcgttcactt tcttgatact
     3601 cttttcaacg ttaaccatta aaaatagaat taggatattg gttttctctt ttataattgc
     3661 atggtgtttc tttttaccac tacaaatatt tttctccaat tcctatacat atattgatga
     3721 tcgatattta agatatacat ttggattttt aaatcctaat ggattaggga tgtttctatt
     3781 actgttgcag acacttttat attattggat atggacttca ataaaagcaa caataatcgt
     3841 gaagcaaatg attacaataa ttttgggtgg atctatcata tgcataatat ttttaagtga
     3901 atcaagaaca tatatattac tatccttttt attactgatt ttaactgtaa tatacggatt
     3961 taagaaattt aaattctcat cacgacttct atttatatat ttactgtttg ttatgctttt
     4021 acaatggctt tctgtaaaag gttttgagaa ttacctcata tttcaagata tgaacgcata
     4081 tatgagtggg cgagtctggt tttcctataa tttactaagc caaatgggag aaccgaaatt
     4141 ctttattggt agtgatattt cattgtatca accgatagac ttcttcttta tatcactatt
     4201 atataataat gggatactgg cttcgttaat attgttatac tgtaactata tctttctaaa
     4261 aaagttagac aactctacga aatatgaaag tatattagct tttatattta ttacagtaag
     4321 tttcacagaa gctgtatata atattcctct gttgaatttc ttctttttgc tgttatataa
     4381 aaaggaattg aggttttcat gaaagttgat gcagtaatct tatgggtaga tggtaatgat
     4441 ccaaaatggc aggaagaata caataaatat tgtaagccgg cctctcgaat tgagaatggt
     4501 gtacaaagat atagagattg ggatacacta cgatatgtaa ttcgtggaat agaatataac
     4561 cttccttgga ttgacaaaat acattttgtg acatgtggtc aaaagccaag ttggatggtt
     4621 ggatatcatc caaaactcaa ttttgtacat cataacgata tttttgagaa tgatacattt
     4681 ttacccacat ttaattcaag tgcaatagag ttaaatctgt caaggataaa gggattaagt
     4741 gagaggttta tatattttaa tgatgatatg cttgtgctta aaaacactcc tcttcagagg
     4801 ttttttgtaa atgatttacc ggtggatttt ttaattgaag ctttcccacg gcgtgggtta
     4861 ttgtatgaaa aaatacgttc caactcaact tgggtttcaa tgataaataa ttgcacttcg
     4921 ctcataaata gagtgtatca caagaataaa tatatacagg ataatcgtaa tttatattat
     4981 aacatgaact atggacgaca tgtcattgct aatgtactgg cgtccccttt taagcagttt
     5041 ttggctttta aacactatca tcatcctcaa gcatatctta aaaagacatt acaatccgtt
     5101 gagcgtgagt tcccagtgga gtttaactta acatgcaaat ctagattccg agaacatgat
     5161 aatatctcac aggcgttgtt tagatattac caattggtaa ctggtagttt ttatccgtgc
     5221 tattataatg atcatgcctg tgttaatgta gtaaataaaa aatctgcaag ccagtgtatt
     5281 gaagctcttc atcaaaaacg atttgtgtgt ataaatgatg agattaatga cgacttagat
     5341 gatagttcaa ttttaataaa tgatataatt aaagagttgg atttgattct tccaaataaa
     5401 tcaagttttg agatctaaat taatatgaat tcgtttgatg agtttaatgt gctactatct
     5461 ctttataaaa atgagagccc cgataatctg gatgcgtgtt ttcaaagcat aagcactcaa
     5521 tctttgaaaa gatttaaaat aatacttgtc atagatggtc ctatttcttc tgagttaaat
     5581 gaggttgtcg gaaagtggaa gtcactatta cctattaaaa ttatcaatct cgaacgtaat
     5641 gttggcttag gtaatgcttt aaatataggt ttgaaatatt gcagttgtga ttatgtgttt
     5701 cgtatggata cagatgatat ttgtcatccg gataggtttt ccatacaatt tagttatctg
     5761 aggaaacacc ccgatattga tttgcttggt ggacagattg tagaattcca tgaatgcata
     5821 gaggagccta acggtatgcg gttagtacct tctaaatatg aggaaattct tcagtattgt
     5881 aaacttaaaa acccatttaa tcatatgaca gttgtgttta aaagagagag tgtacttaag
     5941 gtcggtggat ataaacatca tttatatatg gaggattata atctttggct tcgtatgata
     6001 tcaattggtt gcaaagtgga aaatttagat gatgtgattg tttttgcccg aacagacgtg
     6061 aattcattaa tgagacgtag agggtggcaa tatgttaaaa gtgaatggaa attagcactg
     6121 ttaaaaataa agttaagaat aaataatcct attgtttctt tatctgtttt tatcctccga
     6181 tctatacctc gattattacc tattatgttg attcgtagaa tttatgccca caaccgcaag
     6241 tgaatttttt taagtcatat tgagtaattg agtaatagtt ttcctatgta atcaaaatta
     6301 actatcagtt ttattattag ttgggtaata tctctctata tctcaaccag tgcagtcatt
     6361 gcatggtgaa cacccctgac aggagcaaac aatgtcaaag caacagattg gcgtcgtcgg
     6421 tatggcagtg
//