-- dump date 20110930_031352
-- class Genbank::CDS
-- table cds_note
-- id note
CBI99516.1 This CDS may be transcribed separately or form part of a pseudogene with the downstream CDS
CBI99571.1 3-isopropylmalate isomerase subunit
CBI99632.1 putative fimbrial protein
CBI99635.1 putative fimbrial protein
CBI99636.1 predicted outer membrane usher protein
CBI99637.1 predicted periplasmic pilin chaperone
CBI99638.1 predicted fimbrial-like adhesin protein
CBI99639.1 2-amino-4-hydroxy-6-hydroxymethyldihyropteridine pyrophosphokinase
ETEC_0212 Similar to Escherichia coli B IS1 transposase. UniProt:A2ULP6 (125 aa) fasta scores: E()=1.1e-46, 98.400% id in 125 aa, and to Shigella boydii serotype 4 (strain sb227). insb UniProt:Q321L1 (EMBL:CP000036 (167 aa) fasta scores: E()=2.9e-47, 99.200% id in 125 aa. This CDS contains a stop codon near the N-terminus
CBI99722.1 putative cation efflux system protein
CBI99728.1 Similar to N-terminus to codon 272 of Escherichia coli pcoA copper resistance protein A precursor. UniProt:Q47452 (605 aa) fasta scores: E()=1.9e-106, 100.000% id in 270 aa, and to N-terminus to codon 272 of Escherichia coli B. copper-resistance protein, copA family precursor. copper-resistance protein, copa family precursor. UniProt:A2UFX0 (EMBL:AAWW01000008 (605 aa) fasta scores: E()=1.9e-106, 100.000% id in 270 aa. This CDS may be transcribed separately or may form part of a pseudogene with the downstream CDS
CBI99729.1 Similar to C-terminus from codon 280 of Escherichia coli pcoa copper resistance protein a precursor. UniProt:Q47452 (605 aa) fasta scores: E()=2.8e-125, 100.000% id in 327 aa. This CDS may be transcribed separately or may form part of a pseudogene with the upstream CDS
ETEC_0240 Similar to N-terminus of Escherichia coli o1:k1/apec. isec8 transposase. UniProt:Q19N68 (EMBL:DQ517526 (511 aa) fasta scores: E()=7.8e-123, 88.127% id in 379 aa, and to N-terminus of Escherichia coli. putative uncharacterized protein. UniProt:Q6KD23 (EMBL:AJ586888 (512 aa) fasta scores: E()=2e-124, 92.643% id in 367 aa
ETEC_0241 Similar to C-terminus of Escherichia coli RhsE protein. UniProt:P24211 (682 aa) fasta scores: E()=5.9e-45, 71.875% id in 160 aa, and to C-terminus of Shigella sonnei (strain ss046). rhs core protein with extension. UniProt:Q3Z5C1 (EMBL:CP000038 (1402 aa) fasta scores: E()=3.1e-126, 97.853% id in 326 aa
CBI99751.1 putative exported protein
CBI99796.1 putative phage capsid protein
CBI99812.1 Similar to C-terminus of Escherichia coli. StfE UniProt:P33227 (EMBL:X01805 (166 aa) fasta scores: E()=5.7e-28, 58.791% id in 182 aa, and to C-terminus of bacteriophage sfv (shigella flexneri bacteriophage v). orf21 UniProt:O22004 (EMBL:U82619 (216 aa) fasta scores: E()=2.5e-34, 92.593% id in 108 aa, and to entire protein of Shigella flexneri. hypothetical bacteriophage protein. UniProt:Q83M73 (EMBL:AE005674 (216 aa) fasta scores: E()=4.6e-34, 92.593% id in 108 aa
CBI99817.1 no significant database hits
ETEC_0315 Similar, but truncated at the N-terminus, to Shigella flexneri. TnpF UniProt:Q9XCH9 (EMBL:AF141323 (263 aa) fasta scores: E()=4e-15, 71.212% id in 66 aa, and to N-terminus of Shigella sonnei (strain ss046). IS2 orf2. UniProt:Q3YZL3 (EMBL:CP000038 (117 aa) fasta scores: E()=4.6e-26, 98.571% id in 70 aa
CBI99832.1 no significant database hits
CBI99882.1 Similar to Escherichia coli (strain K12). YahG UniProt:P77221 (EMBL:U73857 (472 aa) fasta scores: E()=6.4e-37, 100.000% id in 106 aa. The CDS may be transcribed separately or may form part of a pseudogene with the downstream CDS
CBI99883.1 Similar to C-terminus of Escherichia coli (strain k12). yahg UniProt:P77221 (EMBL:U73857 (472 aa) fasta scores: E()=2.5e-135, 99.725% id in 363 aa. The CDS may be transcribed separately or may form part of a pseudogene with the upstream CDS
CBJ00062.1 tRNA 2-selenouridine synthase, selenophosphate-dependent
ETEC_0584 This CDS has been interrupted by an insertion element protein
CBJ00101.1 Similar to N-terminus of Escherichia coli B. yd repeat protein. UniProt:A2UJ45 (EMBL:AAWW01000021 (1505 aa) fasta scores: E()=0, 98.800% id in 1417 aa
CBJ00217.1 two-component response regulator
ETEC_0772 This CDS has been interrupted by an insertion sequence transposase
CBJ00306.1 Similar to Bacteriophage lambda O replication protein O. UniProt:P03688 (333 aa) fasta scores: E()=1.9e-45, 91.111% id in 135 aa, and to bacteriophage lambda. O UniProt:A2SY76 (EMBL:DQ372056 (157 aa) fasta scores: E()=9.9e-46, 91.111% id in 135 aa. This CDS may be truncated at the C-terminus and may be transcribed separately or form part of a pseudogene with the downstream CDS
CBJ00307.1 Similar to C-terminus of stx2 converting phage i. o UniProt:Q776H0 (EMBL:AP004402 (312 aa) fasta scores: E()=9.4e-56, 100.000% id in 124 aa. This CDS may be transcribed separately or may form part of a pseudogene with the upstream CDS
CBJ00312.1 putative protein ninx
CBJ00321.1 putative phage DNA methylase
CBJ00346.1 phage tail fiber assembly protein
ETEC_0947 Similar to C-terminus of Escherichia coli o1:k1/apec. macb1 UniProt:A1A9B7 (EMBL:CP000468 (648 aa) fasta scores: E()=2.1e-137, 100.000% id in 389 aa
CBJ00516.1 predicted outer membrane usher protein
CBJ00521.1 predicted fimbrial-like adhesin protein
CBJ00522.1 predicted fimbrial-like adhesin protein
CBJ00523.1 predicted periplasmic pilini chaperone
CBJ00558.1 phosphoanhydride phosphorylase
CBJ00733.1 Similar, but truncated at the N-terminus, to Escherichia coli o6:k15:h31 (strain 536/upec). major coat protein. UniProt:Q0TIP6 (EMBL:CP000247 (341 aa) fasta scores: E()=1e-108, 99.315% id in 292 aa
CBJ00753.1 spermidine/putrescine ABC transporter, permease protein
CBJ00771.1 Similar to ) escherichia coli. insb. UniProt:Q5I3M5 (EMBL:AY857617 (151 aa) fasta scores: E()=4.9e-44, 99.048% id in 105 aa, and to ) shigella boydii serotype 4 (strain sb227). insb UniProt:Q31YA9 (EMBL:CP000036 (123 aa) fasta scores: E()=4.1e-44, 99.048% id in 105 aa
CBJ00772.1 Similar, but truncated at the N-terminus, to Escherichia coli E24377A putative uncharacterized protein. putative uncharacterized protein. UniProt:A7ZKT3 (82 aa) fasta scores: E()=8e-21, 100.000% id in 60 aa
CBJ00945.1 Similar to C-terminus of Escherichia coli (strain k12). abga UniProt:P77357 (EMBL:U00096 (436 aa) fasta scores: E()=2.8e-122, 100.000% id in 326 aa. This CDS may be transcribed separately or may form part of a pseudogene with the upstream CDS
CBJ00946.1 This CDS may be transcribed separately or may form part of a pseudogene with the downstream CDS
CBJ00979.1 Similar to C-terminus of Salmonella typhi. BigB. UniProt:Q9X687 (EMBL:AF133185 (739 aa) fasta scores: E()=3.4e-164, 72.320% id in 737 aa, and to entire protein of escherichia coli. ydba UniProt:P33666 (EMBL:U00096 (2003 aa) fasta scores: E()=0, 86.517% id in 2314 aa
ETEC_1502A Similar, but truncated at the N and C termini, to Escherichia coli E24377A transposase, is605 family. UniProt:A7ZMR8 (315 aa) fasta scores: E()=5.5e-40, 96.154% id in 104 aa
CBJ01029.1 putative glutathione S-transferase
CBJ01045.1 Similar to Escherichia coli yddj uncharacterized protein yddj. UniProt:P76122 (111 aa) fasta scores: E()=3.2e-41, 99.099% id in 111 aa, and similar, but truncated at the N-terminus, to Escherichia coli b. putative uncharacterized protein. UniProt:A2UD33 (EMBL:AAWW01000001 (443 aa) fasta scores: E()=2.9e-41, 100.000% id in 111 aa
CBJ01046.1 Similar, but truncated at the C-terminus to Escherichia coli B putative uncharacterized protein. UniProt:A2UD33 (443 aa) fasta scores: E()=3e-117, 99.685% id in 317 aa, and similar to entire protein of Escherichia coli yddk uncharacterized protein yddk. uncharacterized protein yddk. UniProt:P76123 (318 aa) fasta scores: E()=9.5e-118, 99.686% id in 318 aa
CBJ01049.1 Note the read-through TGA selenocystiene stop codon
CBJ01064.1 Similar to C-terminus of Escherichia coli B diguanylate cyclase. diguanylate cyclase. UniProt:A2UD54 (460 aa) fasta scores: E()=2.9e-127, 100.000% id in 347 aa
ETEC_1576 Similar to C-terminus of Escherichia coli (strain K12) fimd outer membrane usher protein fimd precursor. UniProt:P30130 (878 aa) fasta scores: E()=8.2e-36, 59.006% id in 161 aa, and to C-terminus of Escherichia coli (strain k12). ydet UniProt:P76137 (EMBL:U00096 (382 aa) fasta scores: E()=2.3e-60, 100.000% id in 161 aa
CBJ01096.1 Similar, but exended at the N-terminus, to shigella boydii serotype 4 (strain sb227). putative uncharacterized protein. UniProt:Q320K9 (EMBL:CP000036 (315 aa) fasta scores: E()=3.6e-112, 99.048% id in 315 aa, and similar, but truncated at the N-terminus, to Escherichia coli E24377A diguanylate cyclase (ggdef) domain protein. UniProt:A7ZLX8 (472 aa) fasta scores: E()=3.8e-128, 99.437% id in 355 aa. This CDS may form part of a pseudogene with the upstream CDS, or may be transcribed separately starting at codon 41
CBJ01097.1 Similar to Escherichia coli e24377a. diguanylate cyclase (ggdef) domain protein. UniProt:A7ZLX8 (EMBL:CP000800 (472 aa) fasta scores: E()=1.5e-42, 98.291% id in 117 aa. This CDS may be transcribed separately or may form part of a frameshift pseudogene with the downstream CDS
ETEC_1740 This CDS possesses an internal stop codon
ETEC_1752 Similar to Escherichia coli ArpB ankyrin repeat protein b. UniProt:P76205 (632 aa) fasta scores: E()=0, 98.418% id in 632 aa
CBJ01459.1 no significant database hits
CBJ01460.1 no significant database hits
CBJ01462.1 no significant database hits
CBJ01464.1 no significant database hits
CBJ01465.1 no significant database hits
CBJ01489.1 no significant database hits
CBJ01508.1 Similar to Bacteriophage P2 h probable tail fiber protein (gph). UniProt:P26700 (669 aa) fasta scores: E()=4.2e-63, 49.746% id in 788 aa, and C-terminus is similar to Shigella sonnei. bv' UniProt:Q53813 (EMBL:D00660 (318 aa) fasta scores: E()=2.3e-122, 95.597% id in 318 aa
CBJ01513.1 Similar to Escherichia coli B phage p2 gpu family protein. UniProt:A2UF07 (161 aa) fasta scores: E()=3.2e-27, 48.148% id in 162 aa, and to ) escherichia coli o157:h7. putative tail protein. UniProt:Q7AD22 (EMBL:BA000007 (162 aa) fasta scores: E()=7.5e-62, 98.765% id in 162 aa
CBJ01520.1 similar to C-terminus of these CDS
CBJ01521.1 bacteriophage regulatory protein. bacteriophage regulatory protein
ETEC_2029 Similar to N-terminus of Escherichia coli. FliZ UniProt:P52627 (EMBL:U18539 (183 aa) fasta scores: E()=5.1e-43, 99.074% id in 108 aa, and to N-terminus of Escherichia coli B. phage integrase domain protein sam domain protein. UniProt:A2UMZ3 (EMBL:AAWW01000050 (183 aa) fasta scores: E()=5.1e-43, 99.074% id in 108 aa
CBJ01586.1 yersiniabactin siderophore biosynthetic protein
CBJ01591.1 Similar to codons 745 to 1235 of Escherichia coli. EaeH. UniProt:Q3I510 (EMBL:DQ109813 (1418 aa) fasta scores: E()=1.2e-08, 26.233% id in 507 aa, and to entire protein of Escherichia coli o6:k15:h31 (strain 536/upec). putative uncharacterized protein. UniProt:Q0TGH9 (EMBL:CP000247 (489 aa) fasta scores: E()=3.8e-156, 99.796% id in 489 aa. This CDS may be transcribed separately or may form part of a pseudogene with the downstream CDS
CBJ01592.1 Similar to codons 1321 to 1470 of Escherichia coli O157:H7 putative invasin. UniProt:Q7A8L6 (1579 aa) fasta scores: E()=9.9e-27, 58.278% id in 151 aa, and to entire protein of Escherichia coli (strain uti89/upec). putative uncharacterized protein. UniProt:Q1RAF1 (EMBL:CP000243 (158 aa) fasta scores: E()=2.1e-55, 99.367% id in 158 aa. This CDS may be transcribed separately or as part of a pseudogene with the upstream CDS.
ETEC_2092 Similar to Escherichia coli o6:k15:h31 (strain 536/upec). putative uncharacterized protein. UniProt:Q0TGH7 (EMBL:CP000247 (350 aa) fasta scores: E()=3.8e-75, 96.474% id in 312 aa. An insertion sequence has inserted into the protein, there is also another IS element at the C-terminus of the protein
CBJ01611.1 no significant database hits
CBJ01615.1 no significant database hits
ETEC_2149 Similar to E.coli yeeA; with=UniProt:P33011 (EMBL:U00096;); Escherichia coli.; yeeA; Inner membrane protein yeeA.; length=352; id 93.750%; ungapped id 97.059%; E()=7.1e-81; 352 aa overlap; query 1-340; subject 1-352. This CDS has been interrupted by an insertion sequence/element
CBJ01657.1 exonuclease i. exonuclease i
CBJ01676.1 gluconate-6-phosphate dehydrogenase, decarboxylating
CBJ01688.1 predicted subunit with GalU
CBJ01729.1 conserved protein
CBJ01732.1 xylulose kinase
ETEC_2253 Similar to Escherichia coli MolR molybdate metabolism regulator. UniProt:P33345 (1264 aa) fasta scores: E()=1.7e-125, 96.524% id in 633 aa, and to ) escherichia coli b. wgr domain protein. UniProt:A2UE17 (EMBL:AAWW01000003 (1264 aa) fasta scores: E()=7.2e-128, 97.156% id in 633 aa. This CDS has been interrupted by an insertion sequence
ETEC_2258 Similar to C-terminus of Shigella dysenteriae serotype 1 (strain sd197). putative uncharacterized protein. UniProt:Q32EK2 (EMBL:CP000034 (323 aa) fasta scores: E()=2.5e-78, 96.585% id in 205 aa
CBJ01887.1 putative molybdopterin-binding protein
CBJ01908.1 Similar to N-terminus of Escherichia coli. elad UniProt:Q47013 (EMBL:U58768 (402 aa) fasta scores: E()=6.9e-95, 99.606% id in 254 aa
ETEC_2441 Similar to N-terminus of Escherichia coli B. putative transposase, yhga family protein. UniProt:A2UDI6 (EMBL:AAWW01000002 (296 aa) fasta scores: E()=2e-86, 100.000% id in 220 aa, and to N-terminus of Escherichia coli. yfci UniProt:P77768 (EMBL:U00096 (296 aa) fasta scores: E()=2e-86, 100.000% id in 220 aa
CBJ01996.1 probable oxalyl-CoA decarboxylase
CBJ02120.1 no significant database hits
CBJ02124.1 Similar to N-terminus of Escherichia coli. eliminase. UniProt:P77039 (EMBL:X96495 (820 aa) fasta scores: E()=1.1e-33, 67.470% id in 166 aa, and to N-terminus of Phage PhiV10. putative tail fiber. UniProt:Q286Z3 (EMBL:DQ126339 (875 aa) fasta scores: E()=2.2e-69, 74.812% id in 266 aa.
CBJ02125.1 no significant database hits
CBJ02127.1 no significant database hits
CBJ02129.1 Similar to codons 138 to 195 of Bacillus weihenstephanensis kbab4. ABC transporter EcsB. UniProt:Q2AUV5 (EMBL:AAOY01000011 (403 aa) fasta scores: E()=3.8, 37.288% id in 59 aa
CBJ02157.1 Similar to Escherichia coli dnaT primosomal protein 1 (primosomal protein i). UniProt:P0A8J2 (179 aa) fasta scores: E()=5.5e-11, 41.284% id in 109 aa, and to ) phage phiv10. putative primosomal protein. UniProt:Q286X4 (EMBL:DQ126339 (275 aa) fasta scores: E()=1.1e-110, 98.545% id in 275 aa
CBJ02163.1 Similar to C-terminus of Escherichia coli. rece UniProt:P15032 (EMBL:U00096 (866 aa) fasta scores: E()=6.6e-51, 48.507% id in 268 aa, and to N-terminus of Phage PhiV10. putative recet. UniProt:Q286Y0 (EMBL:DQ126339 (586 aa) fasta scores: E()=8.1e-110, 97.015% id in 268 aa, and to entire protein of Burkholderia multivorans atcc 17616. exonuclease viii, 5'-> 3' specific dsdna exonuclease. UniProt:A0UIW7 (EMBL:AAVB01000005 (270 aa) fasta scores: E()=1e-22, 33.692% id in 279 aa
CBJ02164.1 Similar to escherichia coli. enterohemolysin 1. UniProt:Q06917 (EMBL:X70047 (267 aa) fasta scores: E()=2.7e-50, 73.684% id in 285 aa, and to C-terminus of phage phiv10. putative recet. UniProt:Q286Y0 (EMBL:DQ126339 (586 aa) fasta scores: E()=3.8e-120, 99.042% id in 313 aa
CBJ02193.1 putative L-cysteine desulfurase
CBJ02227.1 Similar to C-terminus of Escherichia coli PinE DNA-invertase from lambdoid prophage e14. UniProt:P03014 (184 aa) fasta scores: E()=6.8e-22, 87.324% id in 71 aa, and to C-terminus of Escherichia coli o157:h7. DNA-invertase. UniProt:Q7AHF0 (EMBL:BA000007 (184 aa) fasta scores: E()=9.6e-23, 90.141% id in 71 aa. This CDS may be transcribed separately or may form part of a pseudogene with the upstream CDS
CBJ02228.1 Similar to N-terminus of Escherichia coli PinE DNA-invertase from lambdoid prophage e14. UniProt:P03014 (184 aa) fasta scores: E()=1.9e-33, 90.476% id in 105 aa, and similar to N-terminus of Escherichia coli B. resolvase, N-terminal domain. UniProt:A2UF01 (EMBL:AAWW01000005 (188 aa) fasta scores: E()=2.8e-35, 96.190% id in 105 aa. This CDS may be transcribed separately or may form part of a pseudogene with the downstream CDS
CBJ02236.1 Similar to codons 80 to 130 of alpha proteobacterium htcc2255. diguanylate cyclase (ggdef domain). UniProt:Q0F5P4 (EMBL:AATR01000025 (331 aa) fasta scores: E()=4, 41.176% id in 51 aa
CBJ02259.1 putative transmembrane phage holin protein
CBJ02262.1 Codons 160 to 510 is similar to codons 360 to 710 of Staphylococcus aureus. similar to D. nodosus VapE. UniProt:Q8VLX1 (EMBL:U93688 (477 aa) fasta scores: E()=8.6e-19, 30.319% id in 376 aa, and to C-terminus is similar to C-terminus of acyrthosiphon pisum secondary endosymbiont phage 2. p3 UniProt:Q3LZR9 (EMBL:DQ092612 (547 aa) fasta scores: E()=3.9e-113, 72.439% id in 410 aa
CBJ02276.1 Similar to N-terminus of Bacteriophage apse-1. 41 UniProt:Q9T1Q7 (EMBL:AF157835 (460 aa) fasta scores: E()=7.5e-85, 72.340% id in 282 aa
CBJ02277.1 Similar to C-terminus from codon 180 of shigella dysenteriae serotype 1 (strain sd197). hypothetical bacteriophage protein. UniProt:Q32CW4 (EMBL:CP000034 (311 aa) fasta scores: E()=4.9e-54, 99.254% id in 134 aa. This CDS may be transcribed on its own or may form part of a pseudogene with the upstream CDS.
CBJ02279.1 putative ferredoxin
CBJ02331.1 integrase
CBJ02335.1 From codon 40 to 140 similar to N-terminus to codon 105 of Cryptosporidium hominis. putative uncharacterized protein. UniProt:Q5CJI4 (EMBL:AAEL01000125 (119 aa) fasta scores: E()=5.3, 33.333% id in 105 aa
CBJ02341.1 Similar, but truncated at the N-terminus, to Erwinia carotovora subsp. atroseptica (Pectobacterium atrosepticum) putative uncharacterized protein. UniProt:Q6D8B0 (140 aa) fasta scores: E()=2.4e-10, 32.039% id in 103 aa
CBJ02345.1 Similar, truncated at the N-terminus, to Yersinia enterocolitica serotype O:8/biotype 1B (strain 8081). putative integrase/recombinase. UniProt:A1JKG5 (EMBL:AM286415 (452 aa) fasta scores: E()=1.9e-46, 37.783% id in 442 aa, and similar to entire protein of Enterobacter sp. 638. phage integrase family protein. UniProt:A4WCY6 (EMBL:CP000653 (400 aa) fasta scores: E()=3.6e-103, 74.874% id in 398 aa
CBJ02350.1 Similar to codons 40 to 100 of Escherichia coli. PinE UniProt:P03014 (EMBL:K00676 (184 aa) fasta scores: E()=8.1e-11, 69.643% id in 56 aa, and to entire protein of Escherichia coli. PinH UniProt:P76611 (EMBL:U00096 (79 aa) fasta scores: E()=4.3e-35, 100.000% id in 79 aa
ETEC_2851 Similar to Clostridium acetobutylicum AmyA alpha-amylase precursor (ec 3.2.1.1) (1,4-alpha-d-glucan glucanohydrolase). UniProt:P23671 (760 aa) fasta scores: E()=0.019, 26.126% id in 111 aa, and to Escherichia coli. YgaQ UniProt:P76616 (EMBL:U00096 (750 aa) fasta scores: E()=8.5e-90, 99.561% id in 228 aa
CBJ02448.1 predicted protein
CBJ02449.1 predicted protein
ETEC_2952 This CDS has been interrupted by a complex insertion
CBJ02482.1 putative flavodoxin
ETEC_3042 Similar to Escherichia coli. ygek UniProt:Q46791 (EMBL:U28375 (210 aa) fasta scores: E()=4.2e-59, 97.619% id in 168 aa. Stop codon at codon 52
CBJ02586.1 RF-2 carries a +1 frameshift at codon 26. This is a regulatory feature of this gene
CBJ02596.1 6-phospho-beta-glucosidase A
ETEC_3120 Similar, with a frameshift mutation, to Escherichia coli B. hypothetical protein. UniProt:A2UCU6 (EMBL:AAWW01000001 (237 aa) fasta scores: E()=4.8e-54, 92.857% id in 252 aa
CBJ02667.1 Similar to N-terminus of Escherichia coli. cdib UniProt:Q3YL97 (EMBL:DQ100454 (588 aa) fasta scores: E()=6e-114, 88.338% id in 343 aa, and to ) escherichia coli. hypothetical protein. UniProt:Q1RPM2 (EMBL:AM236323 (588 aa) fasta scores: E()=3.9e-115, 88.338% id in 343 aa
CBJ02687.1 transposase
CBJ02689.1 Similar, but truncated at the N-terminus, to Escherichia coli ulac ascorbate-specific phosphotransferase enzyme iia component (ec 2.7.1.-) (pts system ascorbate-specific eiia component). UniProt:P69820 (154 aa) fasta scores: E()=2.7e-07, 37.805% id in 82 aa, and similar, but truncated at the N-terminus, to Escherichia coli. orf10 UniProt:Q9AI22 (EMBL:AF286670 (147 aa) fasta scores: E()=2.7e-27, 100.000% id in 82 aa
ETEC_3191 Similar, although with an insertion sequence transposed into the protein, to Erwinia carotovora subsp. atroseptica (pectobacterium atrosepticum). putative membrane protein. UniProt:Q6D0M3 (EMBL:BX950851 (419 aa) fasta scores: E()=6.1e-118, 83.886% id in 422 aa. It is faintly possible that this CDS, although interrupted could be transcribed as two separate CDS
CBJ02694.1 Similar to N-terminus of Escherichia coli. transposase orfa, isec8. UniProt:Q3ZU26 (EMBL:AY258503 (139 aa) fasta scores: E()=7.6e-17, 57.778% id in 90 aa, and to N-terminus of Escherichia coli. orf7 UniProt:Q9AI25 (EMBL:AF286670 (94 aa) fasta scores: E()=3.5e-35, 97.872% id in 94 aa
ETEC_3216 this CDS has been disrupted by an IS element insertion
CBJ02813.1 3,4-dihydroxy-2-butanone-4-phosphate synthase
CBJ02817.1 predicted outer membrane usher protein
CBJ02818.1 predicted periplasmic pilin chaperone
CBJ02909.1 putative galactosamine-6-phosphate isomerase
CBJ02914.1 putative methyltransferase
ETEC_3481 Similar to Escherichia coli. yhcg UniProt:P45423 (EMBL:U18997 (375 aa) fasta scores: E()=2.3e-101, 100.000% id in 247 aa
CBJ03056.1 Similar, but truncated at the N-terminus, to Escherichia coli RpsE 30S ribosomal protein S5. UniProt:P0A7W1 (167 aa) fasta scores: E()=5.7e-34, 100.000% id in 107 aa
CBJ03083.1 probable general secretion pathway protein i precursor
CBJ03089.1 bacterioferritin, iron storage and detoxification protein
CBJ03091.1 C-terminus from codon 350 is similar to Metarhizium anisopliae. chi11 UniProt:O74199 (EMBL:AF036320 (522 aa) fasta scores: E()=2e-171, 87.961% id in 515 aa, and similar to entire sequence of Escherichia coli. ChiA UniProt:P13656 (EMBL:U18997 (897 aa) fasta scores: E()=0, 100.000% id in 897 aa
CBJ03123.1 putative fructoselysine transporter
CBJ03190.1 putative acetyltransferase
CBJ03233.1 Similar, but truncated at the N-terminus, to Escherichia coli yhhi h repeat-associated protein yhhi (orf-h). h repeat-associated protein yhhi (orf-h). UniProt:P28912 (378 aa) fasta scores: E()=5.1e-135, 100.000% id in 347 aa, and similar, but truncated at the N-terminus, to Escherichia coli B transposase, is4 family protein. UniProt:A2ULP9 (378 aa) fasta scores: E()=1e-134, 99.712% id in 347 aa
ETEC_3784 Similar to Escherichia coli B serine transporter. UniProt:A2UJ81 (423 aa) fasta scores: E()=1.6e-141, 97.430% id in 428 aa, and to shigella flexneri serotype 5b (strain 8401). yhjv UniProt:Q0SZD3 (EMBL:CP000266 (444 aa) fasta scores: E()=2.2e-149, 96.882% id in 449 aa. Putative frameshift mutation around codon 120. These CDS could potentially be transcribed as two separate CDS.
CBJ03304.1 Similar, but truncated at the N-terminus, to Escherichia coli Insk putative transposase for insertion sequence element IS150. UniProt:P19769 (283 aa) fasta scores: E()=5e-75, 100.000% id in 201 aa, and similar, but truncated at the N-terminus, to shigella flexneri serotype 5b (strain 8401). is150 orf2. UniProt:Q0SZE9 (EMBL:CP000266 (283 aa) fasta scores: E()=5e-75, 100.000% id in 201 aa
CBJ03312.1 Similar, but truncated at the N-terminus, to Escherichia coli. XylF UniProt:P37387 (EMBL:U00039 (330 aa) fasta scores: E()=8.1e-60, 99.441% id in 179 aa, and similar, but truncated the N-terminus, to Shigella flexneri. xylf UniProt:Q83PQ9 (EMBL:AE005674 (330 aa) fasta scores: E()=5.4e-60, 100.000% id in 179 aa
CBJ03345.1 unnamed protein product; Similar, but extended at the C-terminus, to Escherichia coli. yibv UniProt:A5A625 (EMBL:U00096 (111 aa) fasta scores: E()=2e-35, 98.165% id in 109 aa
CBJ03371.1 lipopolysaccharide heptosyltransferase 1
CBJ03377.1 UDP-galactose:(glucosyl) LPS alpha-1,3-galactosyltransferase
CBJ03411.1 required for maximal secretion of the heat-labile enterotoxin.
CBJ03421.1 Similar to C-terminus of Aeromonas salmonicida (strain a449). hypothetical protein. UniProt:A4SIH7 (EMBL:CP000644 (196 aa) fasta scores: E()=3.3e-06, 66.667% id in 36 aa
ETEC_3921 Similar to C-terminus of Escherichia coli o6:k15:h31 (strain 536/upec). putative phage integrase. UniProt:Q0TBE3 (EMBL:CP000247 (393 aa) fasta scores: E()=4.5e-08, 63.415% id in 41 aa
CBJ03430.1 Similar to codons 110 to 140 of Silicibacter pomeroyi. xanthine/uracil permease family protein. UniProt:Q5LV28 (EMBL:CP000031 (470 aa) fasta scores: E()=6.7, 43.750% id in 32 aa
CBJ03431.1 N-terminus is similar to Vibrio cholerae 1587. hypothetical protein. UniProt:A2P676 (EMBL:AAUR01000019 (109 aa) fasta scores: E()=0.46, 35.556% id in 45 aa
CBJ03432.1 Similar to N-terminus to codon 120 of Serratia proteamaculans 568. uncharacterised conserved protein ucp028301. UniProt:A0ISR6 (EMBL:AAUN01000012 (165 aa) fasta scores: E()=2.3e-29, 74.138% id in 116 aa
CBJ03455.1 Similar to N-terminus to codon 310 of Serratia marcescens. TraG1 UniProt:P95799 (EMBL:U60283 (563 aa) fasta scores: E()=0.00082, 21.382% id in 304 aa, and to entire protein of Escherichia coli. hypothetical protein. UniProt:Q8VQQ6 (EMBL:AF453442 (280 aa) fasta scores: E()=3.2e-107, 94.604% id in 278 aa
CBJ03669.1 Note the selenocystiene TGA read-through stop codon
CBJ03791.1 putative phosphate starvation-inducible membrane protein
CBJ03817.1 hypothetical phage protein
CBJ03877.1 This CDS may be transcribed separately or may form part of a pseudogene with the downstream CDS
CBJ03878.1 This CDS may be transcribed separately or may form part of a pseudogene with the upstream CDS
CBJ03896.1 Note the selenocystiene TGA read-through stop codon
CBJ03936.1 alpha-galactosidase, NAD(P)-binding
CBJ03940.1 putative membrane protein
CBJ03983.1 Similar to Yersinia pestis. mcrb UniProt:Q74PU0 (EMBL:AE017042 (687 aa) fasta scores: E()=2.1e-41, 34.443% id in 691 aa, and to C-terminus of Vibrio splendidus 12b01. GTPase subunit of restriction endonuclease-like. UniProt:A3UV37 (EMBL:AAMR01000025 (829 aa) fasta scores: E()=1e-102, 46.020% id in 691 aa
CBJ03985.1 Similar to Bifidobacterium longum hsdm hsdm. UniProt:Q8CY44 (855 aa) fasta scores: E()=3.2e-10, 70.909% id in 55 aa, and to ) yersinia pseudotuberculosis. putative type i restriction-modification system, methyltransferase subunit (n-6 dna methylase) (ec 2.1.1.72). UniProt:Q66F04 (EMBL:BX936398 (863 aa) fasta scores: E()=1.7e-15, 94.118% id in 51 aa
CBJ03986.1 no significant database hits
CBJ03990.1 integrase
ETEC_4538 Similar to C-terminus of Escherichia coli O157:H7 UlaG probable l-ascorbate-6-phosphate lactonase UlaG (ec 3.1.1.-) (l-ascorbate utilization protein g). UniProt:Q8XDJ6 (354 aa) fasta scores: E()=2e-77, 100.000% id in 192 aa
CBJ04093.1 Similar but truncated at the N-terminus to Escherichia coli (strain k12). cybC UniProt:P0ABE7 (EMBL:S74736 (128 aa) fasta scores: E()=3.1e-32, 98.000% id in 100 aa, and similar but truncated at the N-terminus to Escherichia coli b. transcriptional regulator, fis family precursor. UniProt:A2UGY4 (EMBL:AAWW01000011 (128 aa) fasta scores: E()=1.3e-32, 100.000% id in 100 aa
CBJ04135.1 major type 1 subunit fimbrin (pilin)
CBJ04137.1 fimbrial chaperone
CBJ04138.1 outer membrane usher protein, type 1 fimbrial synthesis
CBJ04191.1 This CDS overlaps the opposite strand CDS by over 200 bp
CBJ04192.1 Similar to Escherichia coli (strain uti89/upec). putative uncharacterized protein. UniProt:Q1R855 (EMBL:CP000243 (167 aa) fasta scores: E()=4.5e-64, 94.545% id in 165 aa. This CDS overlaps the CDS on the opposite strand by more than 200 bp
ETEC_4691 Similar to C-terminus of Escherichia coli o1:k1/apec. putative uncharacterized protein. UniProt:A1AEI4 (EMBL:CP000468 (265 aa) fasta scores: E()=4.5e-79, 95.169% id in 207 aa
CBJ04218.1 putative transposase
CBJ04219.1 putative IS element transposase
ETEC_4726 Similar to C-terminus of Escherichia coli (strain k12). stfr UniProt:P76072 (EMBL:U00096 (1120 aa) fasta scores: E()=1.8e-114, 90.208% id in 337 aa
CBJ04236.1 peptide chain release factor RF-3
CBJ04370.1 similar to modulators of H-NS activity
CBJ04390.1 hypothetical protein TrbD
CBJ04405.1 hypothetical protein
CBJ04420.1 site of translational frameshifting that fuses INsA and InsB
CBJ04422.1 site of translational frameshifting that fuses INsA and InsB
CBJ04436.1 site of translational frameshifting that fuses INsA and InsB
ETEC_p948_0060 IS element transposase (pseudogene)
CBJ04457.1 Operon characterised in PMID: 16552055. EtpC described as having similarity to glycosyltranferases; may be involved in post-translational modification of EtpA
CBJ04458.1 Operon characterised in PMID: 16552055. This version shorter than previously sequenced version due to internal deletion of one repeat unit (4 vs 5 x 228 aa)
CBJ04459.1 Operon characterised in PMID: 16552055. Probably responsible for secretion of EtpA
CBJ04461.1 unnamed protein product; putative IS element transposase
ETEC_p948_0650 deletion/fusion protein between putative methylase and psiB