RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8682
(1956 letters)
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains 34
amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in
a variety of organisms including bacteria, cyanobacteria,
yeast, fungi, plants, and humans in various subcellular
locations; involved in a variety of functions including
protein-protein interactions, but common features in the
interaction partners have not been defined; involved in
chaperone, cell-cycle, transciption, and protein
transport complexes; the number of TPR motifs varies
among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats
generate a right-handed helical structure with an
amphipathic channel that is thought to accomodate an
alpha-helix of a target protein; it has been proposed
that TPR proteins preferably interact with WD-40 repeat
proteins, but in many instances several TPR-proteins seem
to aggregate to multi-protein complexes; examples of
TPR-proteins include, Cdc16p, Cdc23p and Cdc27p
components of the cyclosome/APC, the Pex5p/Pas10p
receptor for peroxisomal targeting signals, the Tom70p
co-receptor for mitochondrial targeting signals, Ser/Thr
phosphatase 5C and the p110 subunit of O-GlcNAc
transferase; three copies of the repeat are present here.
Length = 100
Score = 83.2 bits (206), Expect = 8e-19
Identities = 32/93 (34%), Positives = 52/93 (55%)
Query: 1767 NGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQL 1826
LG L+ +YD+A++ + AL++ PD++ + L A+ + EEA+E Y AL+L
Sbjct: 4 LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63
Query: 1827 SPGFVRARYNLGITCVHLGANTQAVEHFLTALN 1859
P +A YNLG+ LG +A+E + AL
Sbjct: 64 DPDNAKAYYNLGLAYYKLGKYEEALEAYEKALE 96
Score = 66.6 bits (163), Expect = 5e-13
Identities = 24/93 (25%), Positives = 44/93 (47%), Gaps = 3/93 (3%)
Query: 1736 LSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD 1795
+ + L Y A P + D L + +Y++A++ + AL++ PD
Sbjct: 10 YYKLGDYDEALEYYEKALELDPD---NADAYYNLAAAYYKLGKYEEALEDYEKALELDPD 66
Query: 1796 DSRLWNRLGASLANGNRPEEAVEAYHTALQLSP 1828
+++ + LG + + EEA+EAY AL+L P
Sbjct: 67 NAKAYYNLGLAYYKLGKYEEALEAYEKALELDP 99
Score = 64.7 bits (158), Expect = 2e-12
Identities = 21/89 (23%), Positives = 42/89 (47%)
Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
LG + GD A+ Y E A + +PDNA+ + +L + + + +A+ K L +
Sbjct: 4 LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL 63
Query: 1669 EPKNLEALMAISICFTNEACLHDALDTLK 1697
+P N +A + + + +AL+ +
Sbjct: 64 DPDNAKAYYNLGLAYYKLGKYEEALEAYE 92
Score = 48.5 bits (116), Expect = 1e-06
Identities = 20/62 (32%), Positives = 26/62 (41%)
Query: 1799 LWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTAL 1858
LG +EA+E Y AL+L P A YNL LG +A+E + AL
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61
Query: 1859 NQ 1860
Sbjct: 62 EL 63
Score = 44.7 bits (106), Expect = 2e-05
Identities = 16/63 (25%), Positives = 32/63 (50%)
Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
+ L + G A+ E A + +PDNA+ + +LG++ + + +A+ A K L +
Sbjct: 38 YNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97
Query: 1669 EPK 1671
+P
Sbjct: 98 DPN 100
Score = 41.6 bits (98), Expect = 2e-04
Identities = 24/109 (22%), Positives = 45/109 (41%), Gaps = 15/109 (13%)
Query: 736 NGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELLVYLFSSLDFQPPKQKISDPDE-LRDY 794
LG L+ +YD+A++ + AL++ PD + L ++ K+ +E L DY
Sbjct: 4 LNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAY------YKLGKYEEALEDY 57
Query: 795 QHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSIYERALDVD 843
KA + + + + + A YE+AL++D
Sbjct: 58 ----EKA----LELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELD 98
Score = 40.8 bits (96), Expect = 5e-04
Identities = 13/72 (18%), Positives = 27/72 (37%), Gaps = 3/72 (4%)
Query: 705 LSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD 764
+ + L Y A P + D L + +Y++A++ + AL++ PD
Sbjct: 10 YYKLGDYDEALEYYEKALELDPD---NADAYYNLAAAYYKLGKYEEALEDYEKALELDPD 66
Query: 765 FTELLVYLFSSL 776
+ L +
Sbjct: 67 NAKAYYNLGLAY 78
Score = 34.3 bits (79), Expect = 0.11
Identities = 22/156 (14%), Positives = 46/156 (29%), Gaps = 56/156 (35%)
Query: 1640 EVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISICFTNEACLHDALDTLKDK 1699
E L+LG + +A+ K L ++P N +A ++ + +AL+
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALE----- 55
Query: 1700 IRPGQESNPRPSAYKADALPSKLTRLANHTLTFRSPLSSREIHQQVLSLYLNAARQCPSQ 1759
Y A P
Sbjct: 56 ------------------------------------------------DYEKALELDPD- 66
Query: 1760 SIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD 1795
+ LG+ + +Y++A++ + AL++ P+
Sbjct: 67 --NAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100
Score = 32.7 bits (75), Expect = 0.32
Identities = 13/83 (15%), Positives = 27/83 (32%)
Query: 815 NWIKYAQWEESQKQVDRARSIYERALDVDHRNITLWLKYTELEMRNRQVNHARNLWDRAV 874
+ D A YE+AL++D N + + + A +++A+
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61
Query: 875 TILPRANQFWYKYTYMEEMLENV 897
+ P + +Y L
Sbjct: 62 ELDPDNAKAYYNLGLAYYKLGKY 84
Score = 31.2 bits (71), Expect = 1.2
Identities = 11/60 (18%), Positives = 26/60 (43%), Gaps = 3/60 (5%)
Query: 705 LSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD 764
+++ L Y A P + LG+ + +Y++A++ + AL++ P+
Sbjct: 44 YYKLGKYEEALEDYEKALELDPD---NAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 62.7 bits (153), Expect = 5e-12
Identities = 19/68 (27%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 1762 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLAN-GNRPEEAVEAY 1820
+ + LG +YD+A++ + AL++ PD++ + L + G EEA+E
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDL 61
Query: 1821 HTALQLSP 1828
AL+L P
Sbjct: 62 EKALELDP 69
Score = 55.4 bits (134), Expect = 2e-09
Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 1/69 (1%)
Query: 1603 QNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAE-NEQDPQAIAA 1661
N LG + GD AI E A + +PDNAE + +L ++ + + +A+
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60
Query: 1662 LSKCLSIEP 1670
L K L ++P
Sbjct: 61 LEKALELDP 69
Score = 50.0 bits (120), Expect = 1e-07
Identities = 23/66 (34%), Positives = 34/66 (51%), Gaps = 1/66 (1%)
Query: 1795 DDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGAN-TQAVEH 1853
D++ LG +L +EA+EAY AL+L P A YNL + + LG + +A+E
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALED 60
Query: 1854 FLTALN 1859
AL
Sbjct: 61 LEKALE 66
Score = 40.0 bits (94), Expect = 5e-04
Identities = 16/47 (34%), Positives = 26/47 (55%)
Query: 1637 DNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISICF 1683
DNAE +LG +L + +AI A K L ++P N EA +++ +
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAY 47
Score = 33.8 bits (78), Expect = 0.074
Identities = 11/42 (26%), Positives = 20/42 (47%)
Query: 731 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELLVYL 772
+ + LG +YD+A++ + AL++ PD E L
Sbjct: 2 NAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNL 43
Score = 33.4 bits (77), Expect = 0.097
Identities = 10/54 (18%), Positives = 22/54 (40%), Gaps = 4/54 (7%)
Query: 711 HQQVLSLYLNAARQCPSQSIDPDVQNGLG-VLFNLSDEYDKAVDCFRSALQVRP 763
+ + + Y A P + + L L +Y++A++ AL++ P
Sbjct: 19 YDEAIEAYEKALELDPD---NAEAYYNLALAYLKLGKDYEEALEDLEKALELDP 69
Score = 32.7 bits (75), Expect = 0.18
Identities = 11/64 (17%), Positives = 20/64 (31%), Gaps = 1/64 (1%)
Query: 816 WIKYAQWEESQKQVDRARSIYERALDVDHRNITLWLKYTE-LEMRNRQVNHARNLWDRAV 874
D A YE+AL++D N + + A ++A+
Sbjct: 6 LKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKAL 65
Query: 875 TILP 878
+ P
Sbjct: 66 ELDP 69
Score = 30.0 bits (68), Expect = 1.8
Identities = 14/50 (28%), Positives = 29/50 (58%), Gaps = 4/50 (8%)
Query: 1111 LRHLEAAAKQEPDNAEVWLSLGISLAENEQLLGEFWEHNLGNMQQAFRMD 1160
+ E A + +PDNAE + +L ++ + LG+ +E L ++++A +D
Sbjct: 23 IEAYEKALELDPDNAEAYYNLALAYLK----LGKDYEEALEDLEKALELD 68
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 68.3 bits (165), Expect = 7e-12
Identities = 59/254 (23%), Positives = 103/254 (40%), Gaps = 21/254 (8%)
Query: 1578 FSDFSRNQRSSVSMHEYTFAEDNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQE-- 1635
+ ++ + E L L+ G L A+ LE A + E
Sbjct: 32 LLELLGELAEALELLEEALELLPNSDLAGLLLLLALALLKLGRLEEALELLEKALELELL 91
Query: 1636 PDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISI-CFTNEACLHDALD 1694
P+ AE L+LG+ L + +A+ L K L+++P A +++ +AL+
Sbjct: 92 PNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALE 151
Query: 1695 TLKDKIRPGQESNPRPSAYKADALPSKLTRLANHTLTFRSPLSSREIHQQVLSLYLNAAR 1754
+ KA L +L LA L + L + +++ L L A +
Sbjct: 152 LYE----------------KALELDPELNELAEALLALGALLEALGRYEEALELLEKALK 195
Query: 1755 QCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPE 1814
P D + LG+L+ +Y++A++ + AL++ PD++ L L R E
Sbjct: 196 LNPD--DDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYE 253
Query: 1815 EAVEAYHTALQLSP 1828
EA+EA AL+L P
Sbjct: 254 EALEALEKALELDP 267
Score = 56.8 bits (135), Expect = 3e-08
Identities = 30/178 (16%), Positives = 61/178 (34%), Gaps = 12/178 (6%)
Query: 718 YLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELLVYLFSSLD 777
L A + LG L+ L +Y++A++ + AL++ P+ EL L +
Sbjct: 117 LLEKALALDPDPDLAEALLALGALYEL-GDYEEALELYEKALELDPELNELAEALLALGA 175
Query: 778 FQPPKQKISDPDELRDYQHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSIYE 837
+ + +E + KA + N + + + + A YE
Sbjct: 176 LL---EALGRYEEALEL---LEKALKLNPDDDAEA---LLNLGLLYLKLGKYEEALEYYE 226
Query: 838 RALDVDHRNITLWLKYTELEMRNRQVNHARNLWDRAVTILPRANQF--WYKYTYMEEM 893
+AL++D N L + + A ++A+ + P E +
Sbjct: 227 KALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPDLYNLGLALLLLLAEAL 284
Score = 45.2 bits (105), Expect = 2e-04
Identities = 25/86 (29%), Positives = 41/86 (47%)
Query: 1598 EDNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQ 1657
+ NP + LG L+ G A+ Y E A + +PDNAE +L + L E + +
Sbjct: 195 KLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLLLELGRYEE 254
Query: 1658 AIAALSKCLSIEPKNLEALMAISICF 1683
A+ AL K L ++P +A+ +
Sbjct: 255 ALEALEKALELDPDLYNLGLALLLLL 280
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only if
a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 65.1 bits (159), Expect = 3e-10
Identities = 69/299 (23%), Positives = 109/299 (36%), Gaps = 50/299 (16%)
Query: 1584 NQRSSVSMHEYTFAEDNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWL 1643
N+ +V+ E + Q A AL Q L +G L A+ L AA PD+ E WL
Sbjct: 548 NEEEAVAWLEKAAELNP--QEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWL 605
Query: 1644 SLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISICFTNEACLHDALDTLKDKIRPG 1703
LG + +A+++ K L+++P + AL L DA +K+
Sbjct: 606 MLGRAQLAAGDLNKAVSSFKKLLALQPDSALAL----------LLLADAYAVMKN----- 650
Query: 1704 QESNPRPSAYKADALPSKLTRLANHTLTFRSPLSSREIHQQVL-SLYLNAARQCPSQSID 1762
A +L L ++T Q L L L A R ++ I
Sbjct: 651 -------YAKAITSLKRALELKPDNTEA-----------QIGLAQLLLAAKRTESAKKIA 692
Query: 1763 PDVQNG----------LGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNR 1812
+Q G L+ +Y A+ +R AL+ P A LA+GN
Sbjct: 693 KSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSSQNAIKLHRALLASGNT 752
Query: 1813 PEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQ---AATHDGL 1868
EAV+ L+ P R L + +A++H+ T + + A + L
Sbjct: 753 A-EAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNL 810
Score = 62.8 bits (153), Expect = 2e-09
Identities = 59/273 (21%), Positives = 92/273 (33%), Gaps = 43/273 (15%)
Query: 1603 QNETNAFAL---GQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGIS-LAENEQDPQA 1658
+ + AL G+ L GD A YL A + +P+NA LGIS L++ + +A
Sbjct: 358 LDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAAARTQLGISKLSQGDPS-EA 416
Query: 1659 IAALSKCLSIEPKNLEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADAL 1718
IA L ++P+ A + + + + AL K K+ Q N
Sbjct: 417 IADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAK-KLEKKQPDNASLHNLLGAIY 475
Query: 1719 PSKLTRLANHTLTFRSPLSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLS-- 1776
L ++ + + A SI+PD NL+
Sbjct: 476 -----------------LGKGDLAKAREA--FEKAL-----SIEPD---FFPAAANLARI 508
Query: 1777 ----DEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVR 1832
D A+ F L + P + R L EEAV A +L+P +
Sbjct: 509 DIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIE 568
Query: 1833 ARYNLGITCVHLGANTQAVEHFLTALNQQAATH 1865
L + G +A L LN+ A
Sbjct: 569 PALALAQYYLGKGQLKKA----LAILNEAADAA 597
Score = 57.0 bits (138), Expect = 8e-08
Identities = 56/250 (22%), Positives = 91/250 (36%), Gaps = 28/250 (11%)
Query: 1608 AFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLS 1667
+ ++ + A L+ A K P+ L G S + QA L++ L
Sbjct: 264 HYLKALVDFQKKNYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILK 323
Query: 1668 IEPKNLEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAY-------------- 1713
P + +A ++ + +A+ TL + +P A
Sbjct: 324 YAPNSHQARRLLASIQLRLGRVDEAIATLSPAL----GLDPDDPAALSLLGEAYLALGDF 379
Query: 1714 -KADALPSKLTRL-----ANHTLTFRSPLSSREIHQQVLSLYLNAARQCPSQSIDPDVQN 1767
KA +K T L A T S LS + + + L A Q + D+
Sbjct: 380 EKAAEYLAKATELDPENAAARTQLGISKLSQGDPSEAIADL--ETAAQLDPELGRADLLL 437
Query: 1768 GLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLS 1827
L L S ++DKA+ + + +PD++ L N LGA +A EA+ AL +
Sbjct: 438 ILSYL--RSGQFDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIE 495
Query: 1828 PGFVRARYNL 1837
P F A NL
Sbjct: 496 PDFFPAAANL 505
Score = 56.2 bits (136), Expect = 2e-07
Identities = 53/259 (20%), Positives = 88/259 (33%), Gaps = 32/259 (12%)
Query: 1612 GQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI--E 1669
+ L++ +AI+ L+ A +++P++AE LG A L K LS+
Sbjct: 29 AKSYLQKNKYKAAIIQLKNALQKDPNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYP 88
Query: 1670 PKNLEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPSKLTRLANHT 1729
+ L+A + + LD L PG+ A + L
Sbjct: 89 KNQVLPLLARAYLLQGKF--QQVLDEL-----PGKTLLDDEGAAEL------LALRGLAY 135
Query: 1730 LTFRSPLSSREIHQQVLSLYLNAARQCPSQSIDP---DVQNGLGVLFNLSDEYDKAVDCF 1786
L ++ Y A +IDP + GL L + +D+A
Sbjct: 136 LGLGQLELAQ-------KSYEQA------LAIDPRSLYAKLGLAQLALAENRFDEARALI 182
Query: 1787 RSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGA 1846
L P + G L + E A+ AY A+ L P + L + G
Sbjct: 183 DEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNIAVLLALATILIEAGE 242
Query: 1847 NTQAVEHFLTALNQQAATH 1865
+A +H AL ++A
Sbjct: 243 FEEAEKH-ADALLKKAPNS 260
Score = 53.9 bits (130), Expect = 8e-07
Identities = 61/274 (22%), Positives = 101/274 (36%), Gaps = 36/274 (13%)
Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIE 1669
LG+ +L GDL A+ + +PD+A L L + A + +AI +L + L ++
Sbjct: 606 MLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELK 665
Query: 1670 PKNLEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPSKLTRL---- 1725
P N EA + ++ A K Q+ +P KA L
Sbjct: 666 PDNTEAQIGLAQLLLAAKRTESAKKIAKSL----QKQHP-----KAALGFELEGDLYLRQ 716
Query: 1726 --------ANHTLTFRSPLSSREIHQQVLSLYLNAARQCP--------SQSIDPD---VQ 1766
A R+P S I L L A+ P+ ++
Sbjct: 717 KDYPAAIQAYRKALKRAPSSQNAIK---LHRALLASGNTAEAVKTLEAWLKTHPNDAVLR 773
Query: 1767 NGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQL 1826
L L+ +YDKA+ +++ ++ PD++ + N L P A+E AL+L
Sbjct: 774 TALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLAWLYLELKDP-RALEYAERALKL 832
Query: 1827 SPGFVRARYNLGITCVHLGANTQAVEHFLTALNQ 1860
+P LG V G +A+ A+N
Sbjct: 833 APNIPAILDTLGWLLVEKGEADRALPLLRKAVNI 866
Score = 52.4 bits (126), Expect = 2e-06
Identities = 65/277 (23%), Positives = 95/277 (34%), Gaps = 40/277 (14%)
Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
L Q L + A ++ +P N + L G L A+AA K +++
Sbjct: 163 LGLAQLALAENRFDEARALIDEVLTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIAL 222
Query: 1669 EPKN-----LEALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPS--- 1720
P N A + I EA H D L K P P A+ AL
Sbjct: 223 RPNNIAVLLALATILIEAGEFEEAEKH--ADALL-KKAPNS-----PLAHYLKALVDFQK 274
Query: 1721 -----------KLTRLANHTLTFRSPLSSREIHQQVLSL---YLNAARQCPSQSIDPDVQ 1766
+ A L + E L YLN + P+
Sbjct: 275 KNYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILK-----YAPNSH 329
Query: 1767 NGLGVLFNL---SDEYDKAVDCFRSALQVRPDDSRLWNRLG-ASLANGNRPEEAVEAYHT 1822
+L ++ D+A+ AL + PDD + LG A LA G+ E+A E
Sbjct: 330 QARRLLASIQLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGD-FEKAAEYLAK 388
Query: 1823 ALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALN 1859
A +L P AR LGI+ + G ++A+ TA
Sbjct: 389 ATELDPENAAARTQLGISKLSQGDPSEAIADLETAAQ 425
Score = 50.5 bits (121), Expect = 1e-05
Identities = 56/218 (25%), Positives = 86/218 (39%), Gaps = 32/218 (14%)
Query: 1616 LRQGDLPSAILYLEAAAKQEPDNAEVWLSLG-ISLAENEQDPQAIAALSKCLSIEPKNLE 1674
LR G A+ + K++PDNA + LG I L + + +A A K LSIEP
Sbjct: 442 LRSGQFDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLA-KAREAFEKALSIEPDFFP 500
Query: 1675 ALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPSKLTRLANH--TLTF 1732
A N A + QE NP + + + + LT + +
Sbjct: 501 AA-------ANLARIDI------------QEGNPDDAIQRFEKV---LTIDPKNLRAILA 538
Query: 1733 RSPLSSREIH-QQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQ 1791
+ L R + ++ ++ AA P Q I+P + L + + KA+ A
Sbjct: 539 LAGLYLRTGNEEEAVAWLEKAAELNP-QEIEPALA--LAQYYLGKGQLKKALAILNEAAD 595
Query: 1792 VRPDDSRLWNRLG-ASLANGNRPEEAVEAYHTALQLSP 1828
PD W LG A LA G+ +AV ++ L L P
Sbjct: 596 AAPDSPEAWLMLGRAQLAAGDL-NKAVSSFKKLLALQP 632
Score = 47.8 bits (114), Expect = 7e-05
Identities = 51/225 (22%), Positives = 84/225 (37%), Gaps = 34/225 (15%)
Query: 1615 KLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLE 1674
+LR G + AI L A +PD+ LG + +A L+K ++P+N
Sbjct: 339 QLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATELDPENAA 398
Query: 1675 ALMAISICFTNEACLHDALDTLKDKIRPGQESNPRPSAYKADALPSKLTRLANHTLTFRS 1734
A + I K+ G S A L +L R L S
Sbjct: 399 ARTQLGIS----------------KLSQGDPSEAIADLETAAQLDPELGRA--DLLLILS 440
Query: 1735 PLSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRP 1794
L S + + L+ A + Q + + N LG ++ + KA + F AL + P
Sbjct: 441 YLRSGQF-DKALAA---AKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEP 496
Query: 1795 DD--SRLWNRLGASLAN----GNRPEEAVEAYHTALQLSPGFVRA 1833
D + A+LA P++A++ + L + P +RA
Sbjct: 497 DFFPAA------ANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRA 535
Score = 41.6 bits (98), Expect = 0.005
Identities = 17/68 (25%), Positives = 28/68 (41%)
Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIE 1669
G L G L A E A +P + L L + +A A + + L+ +
Sbjct: 130 LRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTAD 189
Query: 1670 PKNLEALM 1677
P N++AL+
Sbjct: 190 PGNVDALL 197
Score = 39.7 bits (93), Expect = 0.018
Identities = 16/59 (27%), Positives = 27/59 (45%)
Query: 1616 LRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLE 1674
+ P A+ Y E A K P+ + +LG L E + +A+ L K ++I P+
Sbjct: 814 YLELKDPRALEYAERALKLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAA 872
Score = 32.4 bits (74), Expect = 3.2
Identities = 31/164 (18%), Positives = 61/164 (37%), Gaps = 13/164 (7%)
Query: 721 AARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELLVYLFSSLDFQP 780
A + Q + + N LG ++ + KA + F AL + PDF L + +D Q
Sbjct: 454 AKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANL-ARIDIQE 512
Query: 781 PKQKISDPDELRDYQHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSIYERAL 840
+PD+ FE + + + + A + A + E+A
Sbjct: 513 -----GNPDDAIQR-------FEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAA 560
Query: 841 DVDHRNITLWLKYTELEMRNRQVNHARNLWDRAVTILPRANQFW 884
+++ + I L + + Q+ A + + A P + + W
Sbjct: 561 ELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAW 604
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 54.2 bits (131), Expect = 4e-09
Identities = 23/64 (35%), Positives = 32/64 (50%)
Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
AL + LR GD A+ LEAA + P AE L LG +L + +A A L L+
Sbjct: 1 LALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAA 60
Query: 1669 EPKN 1672
+P +
Sbjct: 61 DPDD 64
Score = 47.3 bits (113), Expect = 1e-06
Identities = 15/52 (28%), Positives = 21/52 (40%)
Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPG 1829
+YD+A+ +AL P + LG +L R EA AL P
Sbjct: 12 DYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPD 63
Score = 43.8 bits (104), Expect = 2e-05
Identities = 14/57 (24%), Positives = 19/57 (33%)
Query: 1802 RLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTAL 1858
L + +EA+ A AL P A LG + G +A AL
Sbjct: 2 ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAAL 58
Score = 33.4 bits (77), Expect = 0.10
Identities = 12/55 (21%), Positives = 17/55 (30%)
Query: 826 QKQVDRARSIYERALDVDHRNITLWLKYTELEMRNRQVNHARNLWDRAVTILPRA 880
D A + E AL L E +R ++ A L A+ P
Sbjct: 10 AGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADPDD 64
>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR repeats
[Intracellular trafficking and secretion].
Length = 257
Score = 55.1 bits (133), Expect = 8e-08
Identities = 26/79 (32%), Positives = 38/79 (48%), Gaps = 2/79 (2%)
Query: 1762 DPDVQNGLGV-LFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAY 1820
D ++ G + + +AV R A ++ P D WN LGA+L R +EA AY
Sbjct: 99 DRELLAAQGKNQIRNGN-FGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAY 157
Query: 1821 HTALQLSPGFVRARYNLGI 1839
AL+L+P NLG+
Sbjct: 158 RQALELAPNEPSIANNLGM 176
Score = 42.8 bits (101), Expect = 0.001
Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 3/90 (3%)
Query: 1593 EYTFAEDNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAEN 1652
A P E A G+ ++R G+ A+ L AA+ P + E W LG +L +
Sbjct: 89 LQKSAIAYPKDREL-LAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQL 147
Query: 1653 EQDPQAIAALSKCLSIEPKNLEAL--MAIS 1680
+ +A A + L + P + +S
Sbjct: 148 GRFDEARRAYRQALELAPNEPSIANNLGMS 177
Score = 37.8 bits (88), Expect = 0.034
Identities = 22/97 (22%), Positives = 34/97 (35%), Gaps = 16/97 (16%)
Query: 1602 MQNETNAFALGQEKLRQGDLPSAI-----LYLEAAAKQ-----------EPDNAEVWLSL 1645
Q + A ALG LR + S LYL A P + E+ +
Sbjct: 47 RQTQGAAAALGAAVLRNPEDLSIAKLATALYLRGDADSSLAVLQKSAIAYPKDRELLAAQ 106
Query: 1646 GISLAENEQDPQAIAALSKCLSIEPKNLEALMAISIC 1682
G + N +A++ L K + P + EA +
Sbjct: 107 GKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAA 143
Score = 34.3 bits (79), Expect = 0.56
Identities = 23/105 (21%), Positives = 32/105 (30%), Gaps = 12/105 (11%)
Query: 1787 RSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGA 1846
+ + P D L G + EAV A +L+P A LG LG
Sbjct: 90 QKSAIAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGR 149
Query: 1847 NTQAVEHFLTALNQQAATHDGLTPHGLEPRAVKEMSDSIWYSLRG 1891
+A + AL EP + + LRG
Sbjct: 150 FDEARRAYRQALELAPN----------EPSIANNL--GMSLLLRG 182
>gnl|CDD|214642 smart00386, HAT, HAT (Half-A-TPR) repeats. Present in several
RNA-binding proteins. Structurally and sequentially
thought to be similar to TPRs.
Length = 33
Score = 46.0 bits (110), Expect = 2e-06
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 827 KQVDRARSIYERALDVDHRNITLWLKYTELEMR 859
++RAR IYERAL+ +++ LWLKY E E R
Sbjct: 1 GDIERARKIYERALEKFPKSVELWLKYAEFEER 33
Score = 36.8 bits (86), Expect = 0.002
Identities = 11/32 (34%), Positives = 17/32 (53%)
Query: 794 YQHRKRKAFEDNIRKNRMVISNWIKYAQWEES 825
R RK +E + K + W+KYA++EE
Sbjct: 2 DIERARKIYERALEKFPKSVELWLKYAEFEER 33
Score = 34.4 bits (80), Expect = 0.021
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 864 NHARNLWDRAVTILPRANQFWYKYTYMEE 892
AR +++RA+ P++ + W KY EE
Sbjct: 4 ERARKIYERALEKFPKSVELWLKYAEFEE 32
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 43.9 bits (104), Expect = 2e-05
Identities = 22/73 (30%), Positives = 30/73 (41%), Gaps = 7/73 (9%)
Query: 1793 RPDDSRLWNRLGASLANGNRPEEAVEAYHTALQL-------SPGFVRARYNLGITCVHLG 1845
PD + N L L +EA+E AL+L P RA NL + LG
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60
Query: 1846 ANTQAVEHFLTAL 1858
+A+E+ AL
Sbjct: 61 DYDEALEYLEKAL 73
Score = 38.9 bits (91), Expect = 0.002
Identities = 17/72 (23%), Positives = 29/72 (40%), Gaps = 7/72 (9%)
Query: 1764 DVQNGLGVLFNLSDEYDKAVDCFRSALQVR-------PDDSRLWNRLGASLANGNRPEEA 1816
N L ++ +YD+A++ AL++ P+ +R N L +EA
Sbjct: 6 AALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEA 65
Query: 1817 VEAYHTALQLSP 1828
+E AL L
Sbjct: 66 LEYLEKALALRE 77
Score = 33.1 bits (76), Expect = 0.16
Identities = 15/70 (21%), Positives = 23/70 (32%), Gaps = 7/70 (10%)
Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQE-------PDNAEVWLSLGISLAENEQDPQAIAA 1661
L R GD A+ LE A + P+ A +L +A+
Sbjct: 9 NNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEALEY 68
Query: 1662 LSKCLSIEPK 1671
L K L++
Sbjct: 69 LEKALALREA 78
>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
modification, protein turnover, chaperones].
Length = 287
Score = 44.3 bits (105), Expect = 4e-04
Identities = 21/70 (30%), Positives = 34/70 (48%), Gaps = 3/70 (4%)
Query: 1611 LGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISL---AENEQDPQAIAALSKCLS 1667
LG+ + G A+L A + DN E+ L L +L A + +A A L + L+
Sbjct: 162 LGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALLRQALA 221
Query: 1668 IEPKNLEALM 1677
++P N+ AL
Sbjct: 222 LDPANIRALS 231
Score = 40.1 bits (94), Expect = 0.008
Identities = 20/95 (21%), Positives = 34/95 (35%), Gaps = 11/95 (11%)
Query: 1734 SPLSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVR 1793
++Q V + A R + P E + + + LQ
Sbjct: 104 LSYLLLGLYQAVGAPEQPADRLADPLAQPPA-----------EQEMEALIARLETHLQQN 152
Query: 1794 PDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSP 1828
P D+ W+ LG + R +A+ AY AL+L+
Sbjct: 153 PGDAEGWDLLGRAYMALGRASDALLAYRNALRLAG 187
Score = 36.6 bits (85), Expect = 0.095
Identities = 18/80 (22%), Positives = 31/80 (38%), Gaps = 8/80 (10%)
Query: 1601 PMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLG-ISLAENEQDPQAI 1659
+ + A ++++ I LE +Q P +AE W LG +A A+
Sbjct: 124 RLADP-LAQPPAEQEMEA-----LIARLETHLQQNPGDAEGWDLLGRAYMALGRAS-DAL 176
Query: 1660 AALSKCLSIEPKNLEALMAI 1679
A L + N E L+ +
Sbjct: 177 LAYRNALRLAGDNPEILLGL 196
Score = 34.7 bits (80), Expect = 0.41
Identities = 16/65 (24%), Positives = 26/65 (40%), Gaps = 3/65 (4%)
Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASL---ANGNRPEEAVEAYHTALQLSPGFVRAR 1834
A+ +R+AL++ D+ + L +L A +A AL L P +RA
Sbjct: 171 RASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALLRQALALDPANIRAL 230
Query: 1835 YNLGI 1839
L
Sbjct: 231 SLLAF 235
Score = 30.8 bits (70), Expect = 6.9
Identities = 14/62 (22%), Positives = 19/62 (30%)
Query: 1618 QGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALM 1677
Q A L A +P N L + E +A AA L + P +
Sbjct: 206 QQMTAKARALLRQALALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPADDPRRS 265
Query: 1678 AI 1679
I
Sbjct: 266 LI 267
>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
PilW. Members of this family are designated PilF in ref
(PMID:8973346) and PilW in ref (PMID:15612916). This
outer membrane protein is required both for pilus
stability and for pilus function such as adherence to
human cells. Members of this family contain copies of the
TPR (tetratricopeptide repeat) domain.
Length = 234
Score = 43.5 bits (103), Expect = 5e-04
Identities = 20/87 (22%), Positives = 32/87 (36%)
Query: 1779 YDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLG 1838
+ A + AL+ PDD + L E+A +++ AL L+P N G
Sbjct: 47 LEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYG 106
Query: 1839 ITCVHLGANTQAVEHFLTALNQQAATH 1865
G QA++ F A+
Sbjct: 107 TFLCQQGKYEQAMQQFEQAIEDPLYPQ 133
Score = 41.9 bits (99), Expect = 0.001
Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 4/104 (3%)
Query: 1762 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYH 1821
D L + + E +KA D FR AL + P++ + N G L + E+A
Sbjct: 64 DYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQA-MQQF 122
Query: 1822 TALQLSPGFV---RARYNLGITCVHLGANTQAVEHFLTALNQQA 1862
P + R+ N G+ + G +A ++ AL
Sbjct: 123 EQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDP 166
Score = 40.0 bits (94), Expect = 0.006
Identities = 18/67 (26%), Positives = 33/67 (49%)
Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIE 1669
L L QGDL A L+ A + +PD+ +L+L + + + +A + + L++
Sbjct: 36 QLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLN 95
Query: 1670 PKNLEAL 1676
P N + L
Sbjct: 96 PNNGDVL 102
Score = 38.9 bits (91), Expect = 0.016
Identities = 33/150 (22%), Positives = 51/150 (34%), Gaps = 27/150 (18%)
Query: 731 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELL----VYLFSSLDFQPPKQK-- 784
D L + + E +KA D FR AL + P+ ++L +L ++ Q+
Sbjct: 64 DYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQAMQQFE 123
Query: 785 --ISDPDELRDYQHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSIYERALDV 842
I DP Y R + E+ D+A RAL +
Sbjct: 124 QAIEDP----LYPQPAR-SLENAGLCAL--------------KAGDFDKAEKYLTRALQI 164
Query: 843 DHRNITLWLKYTELEMRNRQVNHARNLWDR 872
D + L+ EL Q AR +R
Sbjct: 165 DPQRPESLLELAELYYLRGQYKDARAYLER 194
Score = 30.8 bits (70), Expect = 6.8
Identities = 17/61 (27%), Positives = 23/61 (37%)
Query: 1616 LRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEA 1675
L+ GD A YL A + +P E L L Q A A L + + E+
Sbjct: 146 LKAGDFDKAEKYLTRALQIDPQRPESLLELAELYYLRGQYKDARAYLERYQQTYNQTAES 205
Query: 1676 L 1676
L
Sbjct: 206 L 206
Score = 30.0 bits (68), Expect = 9.5
Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 2/62 (3%)
Query: 1617 RQGDLPSAILYLEAAAKQE--PDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLE 1674
+QG A+ E A + P A + G+ + +A L++ L I+P+ E
Sbjct: 111 QQGKYEQAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPE 170
Query: 1675 AL 1676
+L
Sbjct: 171 SL 172
>gnl|CDD|111114 pfam02184, HAT, HAT (Half-A-TPR) repeat. The HAT (Half A TPR)
repeat is found in several RNA processing proteins.
Length = 32
Score = 38.7 bits (90), Expect = 5e-04
Identities = 14/33 (42%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 827 KQVDRARSIYERALDVDHRNITLWLKYTELEMR 859
K++DRAR IYER + V H + W+K+ E
Sbjct: 1 KEIDRARGIYERFVHV-HPEVQNWIKWARFEEE 32
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 38.2 bits (90), Expect = 9e-04
Identities = 8/34 (23%), Positives = 17/34 (50%)
Query: 732 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDF 765
LG + +YD+A++ + AL++ P+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 37.8 bits (89), Expect = 0.001
Identities = 8/34 (23%), Positives = 18/34 (52%)
Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDD 1796
LG + +YD+A++ + AL++ P++
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 37.4 bits (88), Expect = 0.001
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 1797 SRLWNRLGASLANGNRPEEAVEAYHTALQLSPGF 1830
++ LG + + +EA+E Y AL+L+P
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
Score = 33.2 bits (77), Expect = 0.045
Identities = 8/34 (23%), Positives = 16/34 (47%)
Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKN 1672
A+ +LG + + + +A+ K L + P N
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry includes
outlying Tetratricopeptide-like repeats (TPR) that are
not matched by pfam00515.
Length = 34
Score = 36.3 bits (85), Expect = 0.004
Identities = 12/33 (36%), Positives = 15/33 (45%)
Query: 1798 RLWNRLGASLANGNRPEEAVEAYHTALQLSPGF 1830
LG + EEA+EAY AL+L P
Sbjct: 2 EALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
Score = 32.1 bits (74), Expect = 0.14
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKN 1672
AE +LG++ + +A+ A K L ++P N
Sbjct: 1 AEALYNLGLAYYKLGDYEEALEAYEKALELDPNN 34
Score = 29.4 bits (67), Expect = 1.0
Identities = 9/30 (30%), Positives = 15/30 (50%)
Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDN 1638
+ LG + GD A+ E A + +P+N
Sbjct: 5 YNLGLAYYKLGDYEEALEAYEKALELDPNN 34
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4 or
more copies in proteins. Contain a minimum of 34 amino
acids each and self-associate via a "knobs and holes"
mechanism.
Length = 34
Score = 36.3 bits (85), Expect = 0.004
Identities = 8/34 (23%), Positives = 19/34 (55%)
Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDD 1796
+ LG + +YD+A++ + AL++ P++
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 36.3 bits (85), Expect = 0.004
Identities = 8/34 (23%), Positives = 18/34 (52%)
Query: 732 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDF 765
+ LG + +YD+A++ + AL++ P+
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 34.3 bits (80), Expect = 0.018
Identities = 10/34 (29%), Positives = 15/34 (44%)
Query: 1797 SRLWNRLGASLANGNRPEEAVEAYHTALQLSPGF 1830
+ LG + +EA+E Y AL+L P
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
Score = 32.8 bits (76), Expect = 0.067
Identities = 9/34 (26%), Positives = 16/34 (47%)
Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKN 1672
AE +LG + + +A+ K L ++P N
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34
>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat.
Length = 44
Score = 36.0 bits (83), Expect = 0.007
Identities = 14/37 (37%), Positives = 18/37 (48%)
Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLG 1646
AL + L GDL A+ L A +PD+ E L L
Sbjct: 6 ALARALLALGDLDEALALLRRALALDPDDPEALLLLA 42
Score = 34.8 bits (80), Expect = 0.016
Identities = 9/41 (21%), Positives = 17/41 (41%)
Query: 816 WIKYAQWEESQKQVDRARSIYERALDVDHRNITLWLKYTEL 856
+ A+ + +D A ++ RAL +D + L L
Sbjct: 4 LLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44
Score = 34.1 bits (78), Expect = 0.029
Identities = 10/41 (24%), Positives = 13/41 (31%)
Query: 1798 RLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLG 1838
L +L +EA+ AL L P A L
Sbjct: 2 AALLALARALLALGDLDEALALLRRALALDPDDPEALLLLA 42
Score = 33.3 bits (76), Expect = 0.070
Identities = 11/42 (26%), Positives = 15/42 (35%)
Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLG 1804
P L + D+A+ R AL + PDD L
Sbjct: 1 PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLA 42
Score = 31.0 bits (70), Expect = 0.44
Identities = 11/40 (27%), Positives = 21/40 (52%)
Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMA 1678
L+L +L +A+A L + L+++P + EAL+
Sbjct: 1 PAALLALARALLALGDLDEALALLRRALALDPDDPEALLL 40
>gnl|CDD|227518 COG5191, COG5191, Uncharacterized conserved protein, contains HAT
(Half-A-TPR) repeat [General function prediction only].
Length = 435
Score = 40.7 bits (95), Expect = 0.008
Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 7/117 (5%)
Query: 782 KQKISDPDELRDYQHRKRKAFE----DNIRKNRMVISNWIKYAQWEESQKQVDRARSIYE 837
++K+ D ++K FE N N I W +YA + +K ++I+
Sbjct: 74 RKKVGKKASFSDMSIPQKKIFELYRSTNKFFNDPKI--WSQYAAYVIKKKMYGEMKNIFA 131
Query: 838 RALDVDHRNITLWLKYTELEMR-NRQVNHARNLWDRAVTILPRANQFWYKYTYMEEM 893
L N+ LW+ E+ + +R ++ + + + R+ + W +Y ME M
Sbjct: 132 ECLTKHPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRMNSRSPRIWIEYFRMELM 188
>gnl|CDD|236875 PRK11189, PRK11189, lipoprotein NlpI; Provisional.
Length = 296
Score = 39.9 bits (94), Expect = 0.011
Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 2/57 (3%)
Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPD-DSRLWNRLGASLANGNRPEEAVE 1818
D N LG+ + +D A + F S L++ P + NR G +L G R E A +
Sbjct: 98 ADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNR-GIALYYGGRYELAQD 153
Score = 34.9 bits (81), Expect = 0.35
Identities = 11/39 (28%), Positives = 19/39 (48%), Gaps = 2/39 (5%)
Query: 732 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRP--DFTEL 768
D N LG+ + +D A + F S L++ P ++ L
Sbjct: 98 ADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYL 136
Score = 32.9 bits (76), Expect = 1.4
Identities = 23/70 (32%), Positives = 33/70 (47%)
Query: 1786 FRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLG 1845
F AL +RPD + +N LG L + A EA+ + L+L P + A N GI + G
Sbjct: 87 FSQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGG 146
Query: 1846 ANTQAVEHFL 1855
A + L
Sbjct: 147 RYELAQDDLL 156
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 34.5 bits (80), Expect = 0.016
Identities = 8/34 (23%), Positives = 19/34 (55%)
Query: 1785 CFRSALQVRPDDSRLWNRLGASLANGNRPEEAVE 1818
+ AL++ P+++ + L L N + +EA++
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
Score = 32.9 bits (76), Expect = 0.065
Identities = 10/34 (29%), Positives = 18/34 (52%)
Query: 1627 YLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIA 1660
E A + +P+NAE + +L + L Q +A+
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
Score = 29.9 bits (68), Expect = 0.79
Identities = 9/29 (31%), Positives = 15/29 (51%)
Query: 1113 HLEAAAKQEPDNAEVWLSLGISLAENEQL 1141
E A + +P+NAE + +L + L Q
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQY 29
>gnl|CDD|198111 smart01043, BTAD, Bacterial transcriptional activator domain. Found
in the DNRI/REDD/AFSR family of regulators. This region
of AFSR along with the C terminal region is capable of
independently directing actinorhodin production. This
family contains TPR repeats.
Length = 145
Score = 36.5 bits (85), Expect = 0.042
Identities = 13/49 (26%), Positives = 21/49 (42%)
Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQL 1826
+++A+ L + P RL L +L R EA+ AY +L
Sbjct: 76 RHEEALALLERLLALDPLRERLHRLLMRALYRAGRRAEALRAYRRLRRL 124
>gnl|CDD|233924 TIGR02552, LcrH_SycD, type III secretion low calcium response
chaperone LcrH/SycD. Genes in this family are found in
type III secretion operons. LcrH, from Yersinia is
believed to have a regulatory function in the low-calcium
response of the secretion system. The same protein is
also known as SycD (SYC = Specific Yop Chaperone) for its
chaperone role. In Pseudomonas, where the homolog is
known as PcrH, the chaperone role has been demonstrated
and the regulatory role appears to be absent. ScyD/LcrH
contains three central tetratricopeptide-like repeats
that are predicted to fold into an all-alpha-helical
array.
Length = 135
Score = 36.1 bits (84), Expect = 0.044
Identities = 23/81 (28%), Positives = 35/81 (43%)
Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNL 1837
YD+A+ F+ P +SR W L A EEA++AY A L P R ++
Sbjct: 32 RYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHA 91
Query: 1838 GITCVHLGANTQAVEHFLTAL 1858
+ LG A++ A+
Sbjct: 92 AECLLALGEPESALKALDLAI 112
Score = 36.1 bits (84), Expect = 0.049
Identities = 20/84 (23%), Positives = 31/84 (36%), Gaps = 1/84 (1%)
Query: 1635 EPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALMAISICFTNEACLHDALD 1694
+P N+ WL L ++ +AI A + +++P + + C AL
Sbjct: 47 DPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAAECLLALGEPESALK 106
Query: 1695 TLKDKIRPGQESNPRPSAYKADAL 1718
L I E NP S K A
Sbjct: 107 ALDLAIEICGE-NPEYSELKERAE 129
Score = 34.2 bits (79), Expect = 0.18
Identities = 17/62 (27%), Positives = 29/62 (46%)
Query: 1768 GLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLS 1827
GL + EY++A+D + A + PDD R + L PE A++A A+++
Sbjct: 56 GLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAAECLLALGEPESALKALDLAIEIC 115
Query: 1828 PG 1829
Sbjct: 116 GE 117
>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 250
Score = 37.4 bits (87), Expect = 0.049
Identities = 30/110 (27%), Positives = 44/110 (40%), Gaps = 5/110 (4%)
Query: 1751 NAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANG 1810
A PS + E D A + +R AL + P++ + N GA L
Sbjct: 60 KALEHDPS---YYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQ 116
Query: 1811 NRPEEAVEAYHTALQLSPGFVRARY--NLGITCVHLGANTQAVEHFLTAL 1858
RPEEA++ + AL + NLG+ + G QA E+ AL
Sbjct: 117 GRPEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRAL 166
Score = 33.2 bits (76), Expect = 1.1
Identities = 27/157 (17%), Positives = 47/157 (29%), Gaps = 22/157 (14%)
Query: 720 NAARQCPSQSIDPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDFTELL----VYLFSS 775
A PS + E D A + +R AL + P+ ++L +L +
Sbjct: 60 KALEHDPS---YYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQ 116
Query: 776 LDFQPPKQKISDPDELRDYQHRKRKAFEDNIRKNRMVISNWIKYAQWEESQKQVDRARSI 835
+ Q+ Y E+ Q D+A
Sbjct: 117 GRPEEAMQQFERALADPAYGEPSD-TLEN---------LGLCAL-----KAGQFDQAEEY 161
Query: 836 YERALDVDHRNITLWLKYTELEMRNRQVNHARNLWDR 872
+RAL++D + L+ L + AR +R
Sbjct: 162 LKRALELDPQFPPALLELARLHYKAGDYAPARLYLER 198
Score = 30.9 bits (70), Expect = 6.7
Identities = 21/70 (30%), Positives = 28/70 (40%), Gaps = 6/70 (8%)
Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGI---SLAENEQDPQAIAALSKCL 1666
L L+QGD A LE A + +P L L EN+ A + K L
Sbjct: 40 QLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGEND---LADESYRKAL 96
Query: 1667 SIEPKNLEAL 1676
S+ P N + L
Sbjct: 97 SLAPNNGDVL 106
>gnl|CDD|188247 TIGR02795, tol_pal_ybgF, tol-pal system protein YbgF. Members of
this protein family are the product of one of seven genes
regularly clustered in operons to encode the proteins of
the tol-pal system, which is critical for maintaining the
integrity of the bacterial outer membrane. The gene for
this periplasmic protein has been designated orf2 and
ybgF. All members of the seed alignment were from unique
tol-pal gene regions from completed bacterial genomes.
The architecture of this protein is a signal sequence, a
low-complexity region usually rich in Asn and Gln, a
well-conserved region with tandem repeats that resemble
the tetratricopeptide (TPR) repeat, involved in
protein-protein interaction.
Length = 117
Score = 35.3 bits (82), Expect = 0.057
Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 4/76 (5%)
Query: 1607 NA-FALGQEKLRQGDLPSAILYLEAAAKQEPDNA---EVWLSLGISLAENEQDPQAIAAL 1662
NA + LG+ QGD A A K+ P + + L LG+SL E +A A L
Sbjct: 38 NAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQELGDKEKAKATL 97
Query: 1663 SKCLSIEPKNLEALMA 1678
+ + P + A +A
Sbjct: 98 QQVIKRYPGSSAAKLA 113
>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
Length = 389
Score = 37.1 bits (87), Expect = 0.091
Identities = 16/52 (30%), Positives = 22/52 (42%)
Query: 1611 LGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAAL 1662
L Q+ L +GDL +A L+ A +P + LG AI AL
Sbjct: 186 LAQQALARGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEAL 237
Score = 35.6 bits (83), Expect = 0.25
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 739 GVLFNLSDEYDKAVDCFRSALQVRPDFTEL 768
G+ F L+++ DKA+D F L+V P+ EL
Sbjct: 42 GLNFLLNEQPDKAIDLFIEMLKVDPETVEL 71
Score = 33.2 bits (77), Expect = 1.2
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 1770 GVLFNLSDEYDKAVDCFRSALQVRPD 1795
G+ F L+++ DKA+D F L+V P+
Sbjct: 42 GLNFLLNEQPDKAIDLFIEMLKVDPE 67
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 37.0 bits (86), Expect = 0.10
Identities = 19/74 (25%), Positives = 33/74 (44%)
Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRARYNL 1837
+YD+A+ + + +PD+ G L N+ +EA+E AL L P + NL
Sbjct: 321 QYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQLNL 380
Query: 1838 GITCVHLGANTQAV 1851
+ G +A+
Sbjct: 381 AQALLKGGKPQEAI 394
Score = 34.7 bits (80), Expect = 0.50
Identities = 19/74 (25%), Positives = 34/74 (45%), Gaps = 4/74 (5%)
Query: 1599 DNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQA 1658
DNP E G L AI L+ A +P++ + L+L +L + + +A
Sbjct: 338 DNPYYLE----LAGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEA 393
Query: 1659 IAALSKCLSIEPKN 1672
I L++ L +P++
Sbjct: 394 IRILNRYLFNDPED 407
Score = 32.4 bits (74), Expect = 3.2
Identities = 14/51 (27%), Positives = 21/51 (41%)
Query: 1611 LGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAA 1661
L Q L+ G AI L +P++ W L + AE +A+ A
Sbjct: 380 LAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLA 430
Score = 30.8 bits (70), Expect = 8.8
Identities = 15/62 (24%), Positives = 25/62 (40%)
Query: 1609 FALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSI 1668
+ + G A+ L+ +PDN G L E + +AI L K L++
Sbjct: 310 YGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALAL 369
Query: 1669 EP 1670
+P
Sbjct: 370 DP 371
>gnl|CDD|227124 COG4785, NlpI, Lipoprotein NlpI, contains TPR repeats [General
function prediction only].
Length = 297
Score = 36.7 bits (85), Expect = 0.11
Identities = 27/86 (31%), Positives = 40/86 (46%)
Query: 1770 GVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPG 1829
GVL++ A + F AL +RPD ++N LG L + A EA+ + L+L P
Sbjct: 72 GVLYDSLGLRALARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPT 131
Query: 1830 FVRARYNLGITCVHLGANTQAVEHFL 1855
+ A N GI + G A + L
Sbjct: 132 YNYAHLNRGIALYYGGRYKLAQDDLL 157
Score = 32.1 bits (73), Expect = 2.4
Identities = 19/58 (32%), Positives = 31/58 (53%), Gaps = 4/58 (6%)
Query: 1763 PDVQNGLGVLFNLSDEYDKAVDCFRSALQVRP--DDSRLWNRLGASLANGNRPEEAVE 1818
P+V N LG+ + +D A + F S L++ P + + L NR G +L G R + A +
Sbjct: 99 PEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHL-NR-GIALYYGGRYKLAQD 154
>gnl|CDD|234406 TIGR03939, PGA_TPR_OMP, poly-beta-1,6 N-acetyl-D-glucosamine export
porin PgaA. Members of this protein family are the
poly-beta-1,6 N-acetyl-D-glucosamine (PGA) export porin
PgaA of Gram-negative bacteria. There is no counterpart
in the poly-beta-1,6 N-acetyl-D-glucosamine biosynthesis
systems of Gram-positive bacteria such as Staphylococcus
epidermidis. The PGA polysaccharide adhesin is a critical
determinant of biofilm formation. The conserved
C-terminal domain of this outer membrane protein is
preceded by a variable number of TPR repeats.
Length = 800
Score = 36.6 bits (85), Expect = 0.16
Identities = 14/69 (20%), Positives = 32/69 (46%)
Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIE 1669
AL + + A+ +++P+N + L L ++LA+ +D +A+ L + ++
Sbjct: 91 ALAKAYRNEKQWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDAEALKYLKEYVARF 150
Query: 1670 PKNLEALMA 1678
P + A
Sbjct: 151 PTDAARYEA 159
Score = 30.8 bits (70), Expect = 9.9
Identities = 16/84 (19%), Positives = 27/84 (32%), Gaps = 4/84 (4%)
Query: 825 SQKQVDRARSIYERALDVDHRNITLWLKY--TELEMRNRQVNHARNLWDRAVTILPRANQ 882
++KQ D+A +Y + L D N L T + A V P
Sbjct: 98 NEKQWDKALELYRKLLQRDPNNPDGLLGLALTLADAGKDA--EALKYLKEYVARFPTDAA 155
Query: 883 FWYKYTYMEEMLENVAGKLFVFHR 906
+ Y+ E+ L + +
Sbjct: 156 RYEALAYVLRAAEDHLDALQAWQQ 179
>gnl|CDD|235419 PRK05339, PRK05339, PEP synthetase regulatory protein; Provisional.
Length = 269
Score = 35.8 bits (84), Expect = 0.17
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 14/72 (19%)
Query: 788 PDELRDYQHRKRKAF-----ED---NIRKNRMVISNWIKYAQWEESQKQVDRARSIYERA 839
P+EL + +K F + IRK R+ +YA E+ ++++ A ++ R
Sbjct: 182 PEEL--FPIDPKKIFGLTIDPERLIEIRKERLPNLGLSRYASLEQCREELAEAERLFRRE 239
Query: 840 ----LDVDHRNI 847
+DV +++I
Sbjct: 240 GIPVIDVTNKSI 251
>gnl|CDD|205550 pfam13371, TPR_9, Tetratricopeptide repeat.
Length = 73
Score = 31.8 bits (73), Expect = 0.38
Identities = 15/63 (23%), Positives = 24/63 (38%)
Query: 1616 LRQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEA 1675
LR+ D A+ +E PD+ G+ A+ A+A L L + P +A
Sbjct: 6 LREEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLEYFLELCPDAPDA 65
Query: 1676 LMA 1678
Sbjct: 66 ERI 68
Score = 31.0 bits (71), Expect = 0.72
Identities = 11/53 (20%), Positives = 21/53 (39%)
Query: 1776 SDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSP 1828
+++++A+ L + PDD G A + A+ L+L P
Sbjct: 8 EEDFERALAVVERLLLLAPDDPYERRDRGLLYAQLGCFQAALADLEYFLELCP 60
>gnl|CDD|224643 COG1729, COG1729, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 262
Score = 34.3 bits (79), Expect = 0.49
Identities = 21/76 (27%), Positives = 30/76 (39%), Gaps = 4/76 (5%)
Query: 1607 NA-FALGQEKLRQGDLPSAILYLEAAAKQEPDNA---EVWLSLGISLAENEQDPQAIAAL 1662
NA + LG+ QGD A K P + + L LG+SL +A A L
Sbjct: 179 NAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTDEACATL 238
Query: 1663 SKCLSIEPKNLEALMA 1678
+ + P A +A
Sbjct: 239 QQVIKRYPGTDAAKLA 254
>gnl|CDD|203781 pfam07849, DUF1641, Protein of unknown function (DUF1641).
Archaeal and bacterial hypothetical proteins are found
in this family, with the region in question being
approximately 40 residues long.
Length = 42
Score = 30.5 bits (70), Expect = 0.55
Identities = 10/14 (71%), Positives = 11/14 (78%)
Query: 731 DPDVQNGLGVLFNL 744
DPDVQ GLG L +L
Sbjct: 22 DPDVQRGLGFLLSL 35
Score = 30.5 bits (70), Expect = 0.55
Identities = 10/14 (71%), Positives = 11/14 (78%)
Query: 1762 DPDVQNGLGVLFNL 1775
DPDVQ GLG L +L
Sbjct: 22 DPDVQRGLGFLLSL 35
>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat.
Length = 34
Score = 30.1 bits (69), Expect = 0.55
Identities = 8/34 (23%), Positives = 13/34 (38%)
Query: 1639 AEVWLSLGISLAENEQDPQAIAALSKCLSIEPKN 1672
A + LG + +A K L ++P N
Sbjct: 1 ARAYYLLGQIYLQLGDYEEAKEYYEKALELDPNN 34
>gnl|CDD|185096 PRK15174, PRK15174, Vi polysaccharide export protein VexE;
Provisional.
Length = 656
Score = 34.3 bits (78), Expect = 0.73
Identities = 27/242 (11%), Positives = 61/242 (25%), Gaps = 34/242 (14%)
Query: 1618 QGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSKCLSIEPKNLEALM 1677
+ + + E L + L +++Q + N +
Sbjct: 89 SSQPDAVLQVVNKLLAVNVCQPEDVLLVASVLLKSKQYATVADLAEQAWLAFSGNS-QIF 147
Query: 1678 AISICFTNEAC-LHDALDTLKD----KIRPGQE----------------SNPRPSAYKAD 1716
A+ + A+ + G + +
Sbjct: 148 ALHLRTLVLMDKELQAISLARTQAQEVPPRGDMIATCLSFLNKSRLPEDHDLARALLPFF 207
Query: 1717 ALPSKLTRLANHTLTFRSPLSSREIHQQVLSLYLNAARQCPSQSIDPDVQNGLGVL-FNL 1775
AL + A + + + Q L + ++ LG+ +
Sbjct: 208 ALERQ--ESAGLAVDTLCAVGKYQEAIQTGESALARGLDGAA------LRRSLGLAYYQS 259
Query: 1776 SDEYDKAVDC---FRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVR 1832
+ + +R ALQ D+ R+ +L + E+A+ +L P
Sbjct: 260 GRSREAKLQAAEHWRHALQFNSDNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPDLPY 319
Query: 1833 AR 1834
R
Sbjct: 320 VR 321
>gnl|CDD|220793 pfam10516, SHNi-TPR, SHNi-TPR. SHNi-TPR family members contain a
reiterated sequence motif that is an interrupted form of
TPR repeat.
Length = 38
Score = 30.1 bits (69), Expect = 0.77
Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Query: 1639 AEVWLSLG-ISLAENEQDPQAIAALSKCLSIEPKNLEA 1675
A+V+ LG ISL ENE PQA+ L K L + + L
Sbjct: 1 ADVYDLLGEISL-ENENFPQAVEDLRKALELREQLLPP 37
>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis [Coenzyme
metabolism].
Length = 400
Score = 33.5 bits (77), Expect = 1.2
Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 1/65 (1%)
Query: 1762 DPDVQNGLGVLFNLSDEYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYH 1821
DP + + LG L + + KA + +AL++RP S + L +L PEEA +
Sbjct: 327 DPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASD-YAELADALDQLGEPEEAEQVRR 385
Query: 1822 TALQL 1826
AL L
Sbjct: 386 EALLL 390
>gnl|CDD|227438 COG5107, RNA14, Pre-mRNA 3'-end processing (cleavage and
polyadenylation) factor [RNA processing and
modification].
Length = 660
Score = 33.8 bits (77), Expect = 1.2
Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 816 WIKYAQWEESQKQVDRARSIYERALDVDHRNIT-LWLKYTELE 857
+ +WEE Q+ +D+ R+ Y RAL N+ LW Y E
Sbjct: 165 IEELGKWEEQQR-IDKIRNGYMRALQTPMGNLEKLWKDYENFE 206
>gnl|CDD|183140 PRK11447, PRK11447, cellulose synthase subunit BcsC; Provisional.
Length = 1157
Score = 33.9 bits (78), Expect = 1.3
Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 2/38 (5%)
Query: 1610 ALGQEKLRQGDLPSAILYLEAAAKQEPD--NAEVWLSL 1645
ALGQ +QGD A+ E A +P N + W SL
Sbjct: 308 ALGQAYSQQGDRARAVAQFEKALALDPHSSNRDKWESL 345
Score = 32.7 bits (75), Expect = 2.6
Identities = 17/70 (24%), Positives = 25/70 (35%)
Query: 1804 GASLANGNRPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQAA 1863
+L N + +A E L L PG V Y L G +QA Q+
Sbjct: 468 AEALENQGKWAQAAELQRQRLALDPGSVWLTYRLAQDLRQAGQRSQADALMRRLAQQKPN 527
Query: 1864 THDGLTPHGL 1873
+ + +GL
Sbjct: 528 DPEQVYAYGL 537
Score = 32.0 bits (73), Expect = 4.3
Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 3/66 (4%)
Query: 1608 AFALGQEKLRQ---GDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSK 1664
AF + L G AI L+ A + P ++E +LG + ++ +A+A K
Sbjct: 269 AFRARAQGLAAVDSGQGGKAIPELQQAVRANPKDSEALGALGQAYSQQGDRARAVAQFEK 328
Query: 1665 CLSIEP 1670
L+++P
Sbjct: 329 ALALDP 334
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 33.4 bits (76), Expect = 1.3
Identities = 32/162 (19%), Positives = 58/162 (35%), Gaps = 24/162 (14%)
Query: 1714 KADALPSKLTRLANHTLTFRSPLS------SREIHQQVLSLYLNAARQCPSQSIDPDVQN 1767
K + LPS +T + N+ +FR S E+ ++ + L + P D
Sbjct: 254 KPENLPS-VTFVGNYLQSFRPKPRPAGLEDSNELDEETGNGQLQLGLKSPESKADE---- 308
Query: 1768 GLGVLFNLSDEYDKAVDCFRSAL---QVRPDDSRLWNRLGASLANGNRPEEAVEAYHTAL 1824
Y++A F AL ++ ++ N G + EA+ ++
Sbjct: 309 ----------SYEEAARAFEKALDLGKLGEKEAIALNLRGTFKCLKGKHLEALADLSKSI 358
Query: 1825 QLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQAATHD 1866
+L P ++ + LG +A E F AL + D
Sbjct: 359 ELDPRVTQSYIKRASMNLELGDPDKAEEDFDKALKLNSEDPD 400
>gnl|CDD|217642 pfam03618, Kinase-PPPase, Kinase/pyrophosphorylase. This family of
regulatory proteins has ADP-dependent kinase and
inorganic phosphate-dependent pyrophosphorylase
activity.
Length = 255
Score = 32.8 bits (76), Expect = 1.3
Identities = 17/74 (22%), Positives = 36/74 (48%), Gaps = 16/74 (21%)
Query: 788 PDELRDYQHRKRKAF------E--DNIRKNRMVISNWI--KYAQWEESQKQVDRARSIYE 837
P+EL ++ ++K F E IR+ R+ +YA E+ ++++ A ++
Sbjct: 176 PEEL--FEVDRKKVFGLTIDPERLHEIRRERLPSLGLDESRYASLEQCREELAYAERLFR 233
Query: 838 RA----LDVDHRNI 847
R +DV +++I
Sbjct: 234 RLGIPVIDVTNKSI 247
>gnl|CDD|214817 smart00777, Mad3_BUB1_I, Mad3/BUB1 hoMad3/BUB1 homology region 1.
Proteins containing this domain are checkpoint proteins
involved in cell division. This region has been shown to
be essential for the binding of the binding of BUB1 and
MAD3 to CDC20p.
Length = 124
Score = 31.0 bits (71), Expect = 1.8
Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 22/89 (24%)
Query: 798 KRKAFEDNIR---KNRMVISNWIKYAQW-EESQKQVDRA---RSIYERALDV-------- 842
+R+AFE ++ + + W++Y +W EE+ Q + ++ ER +
Sbjct: 3 QRQAFEAELQDLYEGDDPLDLWLRYIKWTEENYPQGGKESGLLTLLERCIRYFEDDERYK 62
Query: 843 -DHRNITLWLKYTELEMRNRQVNHARNLW 870
D R + +WLKY E + R L+
Sbjct: 63 NDPRYLKIWLKYAE------YCDEPRELF 85
>gnl|CDD|179448 PRK02603, PRK02603, photosystem I assembly protein Ycf3; Provisional.
Length = 172
Score = 32.0 bits (73), Expect = 1.9
Identities = 21/76 (27%), Positives = 38/76 (50%), Gaps = 5/76 (6%)
Query: 1778 EYDKAVDCFRSALQVRPDDSR----LWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRA 1833
EY +A++ + AL++ D + L+N +NG ++A+E YH AL+L+P A
Sbjct: 50 EYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEH-DKALEYYHQALELNPKQPSA 108
Query: 1834 RYNLGITCVHLGANTQ 1849
N+ + G +
Sbjct: 109 LNNIAVIYHKRGEKAE 124
>gnl|CDD|182792 PRK10866, PRK10866, outer membrane biogenesis protein BamD;
Provisional.
Length = 243
Score = 32.1 bits (73), Expect = 2.7
Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 6/75 (8%)
Query: 1599 DNPMQNETNAFALGQEKLRQGDLPSAILYLEAAAKQEP---DNAEVWLSLGISLAENEQD 1655
DNP +E +A Q+KL+ G+ AI LEA + P + +V L L + +N
Sbjct: 29 DNP-PSEI--YATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADL 85
Query: 1656 PQAIAALSKCLSIEP 1670
P A AA+ + + + P
Sbjct: 86 PLAQAAIDRFIRLNP 100
>gnl|CDD|176974 CHL00033, ycf3, photosystem I assembly protein Ycf3.
Length = 168
Score = 31.1 bits (71), Expect = 2.9
Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 9/95 (9%)
Query: 1778 EYDKAVDCFRSALQVRPDD---SRLWNRLGASLANGNRPEEAVEAYHTALQLSPGFVRAR 1834
EY +A+ + A+++ D S + +G + +A+E Y AL+ +P +A
Sbjct: 50 EYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQAL 109
Query: 1835 YNLGITCVHLGANTQAVE--HFLTA--LNQQAATH 1865
N+ + C + G QA+E A QAA +
Sbjct: 110 NNMAVICHYRG--EQAIEQGDSEIAEAWFDQAAEY 142
>gnl|CDD|225463 COG2911, COG2911, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 1278
Score = 32.3 bits (74), Expect = 3.1
Identities = 18/80 (22%), Positives = 28/80 (35%), Gaps = 21/80 (26%)
Query: 1490 GPLFSNGVASINQQPVPNTNGNLHSLDRTLAVSK---LFKGPII-----------GTPST 1535
G L NG ++ G + + L + K LF GP+ +T
Sbjct: 1082 GNLGLNGQINL-------RRGRYRAYGQDLTIRKGTLLFSGPLDQPYLNIEAIRNPDSTT 1134
Query: 1536 TSGVSTSGTVDKPEPALWNE 1555
V +GT D P ++E
Sbjct: 1135 AGAVRVTGTADSPRITFFSE 1154
>gnl|CDD|184854 PRK14852, PRK14852, hypothetical protein; Provisional.
Length = 989
Score = 32.4 bits (73), Expect = 3.1
Identities = 20/70 (28%), Positives = 28/70 (40%)
Query: 1544 TVDKPEPALWNELNSHWKEMTESLGSGESLPHQWFSDFSRNQRSSVSMHEYTFAEDNPMQ 1603
TVD P AL +L+ E+ E G E F+ F+R E D P++
Sbjct: 187 TVDAPAVALRIDLHEAMDELQEKYGDEEDFETNLFAFFARMNSDETETSENPVRRDRPLE 246
Query: 1604 NETNAFALGQ 1613
T + L Q
Sbjct: 247 PYTAYYLLQQ 256
>gnl|CDD|215568 PLN03088, PLN03088, SGT1, suppressor of G2 allele of SKP1;
Provisional.
Length = 356
Score = 32.1 bits (73), Expect = 3.4
Identities = 20/72 (27%), Positives = 39/72 (54%), Gaps = 2/72 (2%)
Query: 1775 LSDEYDKAVDCFRSALQVRPDDSRLW-NRLGASLANGNRPEEAVEAYHTALQLSPGFVRA 1833
+ D++ AVD + A+ + P+++ L+ +R A++ GN E +A + A++L P +A
Sbjct: 14 VDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADA-NKAIELDPSLAKA 72
Query: 1834 RYNLGITCVHLG 1845
G C+ L
Sbjct: 73 YLRKGTACMKLE 84
>gnl|CDD|218234 pfam04733, Coatomer_E, Coatomer epsilon subunit. This family
represents the epsilon subunit of the coatomer complex,
which is involved in the regulation of intracellular
protein trafficking between the endoplasmic reticulum and
the Golgi complex.
Length = 288
Score = 31.7 bits (72), Expect = 3.7
Identities = 15/55 (27%), Positives = 19/55 (34%)
Query: 1812 RPEEAVEAYHTALQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQAATHD 1866
R EEA AL NL + +HLG + L+ L TH
Sbjct: 214 RYEEAESLLKEALDKDAKDPETLINLVVCALHLGKPAEVSNRNLSQLKLSHPTHP 268
>gnl|CDD|234074 TIGR02966, phoR_proteo, phosphate regulon sensor kinase PhoR.
Members of this protein family are the regulatory
histidine kinase PhoR associated with the phosphate ABC
transporter in most Proteobacteria. Related proteins from
Gram-positive organisms are not included in this model.
The phoR gene usually is adjacent to the response
regulator phoB gene (TIGR02154) [Signal transduction,
Two-component systems].
Length = 333
Score = 31.4 bits (72), Expect = 4.6
Identities = 22/97 (22%), Positives = 40/97 (41%), Gaps = 13/97 (13%)
Query: 1135 LAENEQLLGEFWEHNLGNM-QQAFRMDNLLAEMRD---IEAASQRPKPQQSAPISLLAR- 1189
LA+ E W L M +Q+ RM +L+ ++ +E+A+ + + +LL
Sbjct: 139 LADGPDEDPEEWNRALEIMLEQSQRMQSLVEDLLTLSRLESAASPLEDEPVDMPALLDHL 198
Query: 1190 QDDAREWANQFLEAGSHFEHTPPETIWDNAPIMRGSE 1226
+D+A EA S ++ D + G E
Sbjct: 199 RDEA--------EALSQGKNHQITFEIDGGVDVLGDE 227
>gnl|CDD|234531 TIGR04282, glyco_like_cofC, transferase 1,
rSAM/selenodomain-associated. Members of this protein
family show strongly correlated phylogenetic
distribution, and in most cases co-clustering, with an
unusual radical SAM enzyme (TIGR04167) whose C-terminal
pfam12345 domain often contains a selenocysteine residue.
Other members of the conserved gene neighborhood include
another putative glycosyltransferase, an
alkylhydroperoxidase family protein (TIGR04169), and a
phosphoesterase family protein (TIGR04168). The cassette
is likely to be biosynthetic but its exact function is
unknown [Unknown function, Enzymes of unknown
specificity].
Length = 189
Score = 30.9 bits (71), Expect = 4.9
Identities = 17/60 (28%), Positives = 23/60 (38%), Gaps = 8/60 (13%)
Query: 1107 QSFLLRHLEAAAKQ-EPDNAEVWLSLGISLAENEQLLGEFWEH------NLGN-MQQAFR 1158
LL H A AK + E+ S A + LG E +LG M +AF+
Sbjct: 30 YRRLLEHTLAVAKSLAGVDVELAYSPDGPHAAFDDWLGPGVELRLQGGGDLGERMARAFQ 89
>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
subunit; Provisional.
Length = 987
Score = 31.8 bits (72), Expect = 5.2
Identities = 13/48 (27%), Positives = 28/48 (58%)
Query: 1617 RQGDLPSAILYLEAAAKQEPDNAEVWLSLGISLAENEQDPQAIAALSK 1664
++ ++P+A+ L AA + EP+N+ +LG +L ++ Q+ L +
Sbjct: 621 QRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLER 668
>gnl|CDD|213348 cd12788, RasGAP_plexin_D1, Ras-GTPase Activating Domain of
plexin-D1. Plexins form a conserved family of
transmembrane receptors for semaphorins and may be the
ancestors of semaphorins. Plexins are divided into four
types (A-D) according to sequence similarity. Plexin-D1
has been identified as the receptor of Sema3E. It binds
to Sema3E directly with high affinity. Sema3E is
implicated in axonal path finding and inhibition of
developmental and postischemic angiogenesis. Plexin-D1
is broadly expressed on tumor vessels and tumor cells in
a number of different types of human tumors. The
Plexin-D1 and Sema3E interaction inhibits tumor growth
but promotes invasiveness and metastasis. Plexins
contain a C-terminal RasGAP domain, which functions as
an enhancer of the hydrolysis of GTP that is bound to
Ras-GTPases. Plexins display GAP activity towards the
Ras homolog Rap. Although the Rho (Ras homolog) GTPases
are most closely related to members of the Ras family,
RhoGAP and RasGAP show no sequence homology at their
amino acid level. RasGTPases function as molecular
switches in a large number of of signaling pathways.
When bound to GTP they are in the on state and when
bound to GDP they are in the off state. The RasGAP
domain speeds up the hydrolysis of GTP in Ras-like
proteins acting as a negative regulator.
Length = 419
Score = 31.5 bits (71), Expect = 5.4
Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 753 DCFRSALQVRPDFTELLV-YLFSSLDFQPPKQKISDPDEL 791
D F++ L + D L V Y F L+ Q K+ I+DPD L
Sbjct: 215 DLFQAILSIPEDRPPLAVKYFFDFLEEQAEKRGITDPDTL 254
>gnl|CDD|234243 TIGR03523, GldN, gliding motility associated protien GldN. Members
of this protein family are exclusive to the
Bacteroidetes phylum (previously
Cytophaga-Flavobacteria-Bacteroides). GldN is a protein
linked to a type of rapid surface gliding motility found
in certain Bacteroidetes, such as Flavobacterium
johnsoniae and Cytophaga hutchinsonii. Knockouts of GldN
abolish the gliding phenotype. Gliding motility appears
closely linked to chitin utilization in the model
species Flavobacterium johnsoniae. Bacteroidetes with
members of this protein also include those which are not
believed to express the gliding phenotype, such as
Prevotella intermedia and Porphyromonas gingivales.
Length = 280
Score = 31.3 bits (71), Expect = 5.5
Identities = 16/76 (21%), Positives = 26/76 (34%), Gaps = 2/76 (2%)
Query: 579 NLPLFVNFPKHVPNARRSVNKQFYITANAENRYKVYDDEYLCRVLEATVFTPAKFIQEET 638
NLPL+ +P RRS+ + + +VY D+ + T +F
Sbjct: 73 NLPLY--YPTDPVGDRRSLFDYIFKAIVEGSIIEVYADDGFEIFTDTEPITLQEFQDRFY 130
Query: 639 LESPHFEYFDITGIDP 654
L + D I
Sbjct: 131 LYYEPSDRLDGDSIFS 146
>gnl|CDD|233834 TIGR02358, thia_cytX, putative hydroxymethylpyrimidine transporter
CytX. On the basis of a phylogenomic study of thiamine
biosythetic, salvage, and transporter genes and a highly
conserved RNA element THI, this protein family has been
identified as a probable transporter of
hydroxymethylpyrimidine (HMP), the phosphorylated (by
ThiD) form of which gets joined (by ThiE) to
hydroxyethylthiazole phosphate to make thiamine phosphate
[Transport and binding proteins, Nucleosides, purines and
pyrimidines, Biosynthesis of cofactors, prosthetic
groups, and carriers, Thiamine].
Length = 386
Score = 31.3 bits (71), Expect = 5.5
Identities = 24/111 (21%), Positives = 41/111 (36%), Gaps = 19/111 (17%)
Query: 910 SQVTLWLGALTLSLEGLGSNLQSWVRFLARDVGECCCGQVVSTVDSEVGGLSSKGLIEGT 969
+WL ++ L + + V F D+ E G +S +E
Sbjct: 131 PLAFVWLNNWSVWLLLIATLWLLVVVFSKADLSEL----WNRA-----GDMSFAVAVELV 181
Query: 970 VTTP-SYLPEIQEKSRQDSSRKPVVPRT-------NLIC--LGLSSSRATG 1010
+ P S+LP I + +R + + V T + LGL+ + ATG
Sbjct: 182 IAMPLSWLPLIADYTRFARNPRHVFLGTVLGYFIGSCWMYFLGLAVTLATG 232
>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
[Carbohydrate transport and metabolism].
Length = 389
Score = 31.2 bits (71), Expect = 5.5
Identities = 23/100 (23%), Positives = 38/100 (38%), Gaps = 5/100 (5%)
Query: 1769 LGVLFNLSDEYDKAVDCFRSALQVRPDDSRL-----WNRLGASLANGNRPEEAVEAYHTA 1823
L ++ + E++KA+D +++ R+ + L + + A E A
Sbjct: 147 LLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKA 206
Query: 1824 LQLSPGFVRARYNLGITCVHLGANTQAVEHFLTALNQQAA 1863
LQ VRA LG + G +AVE L Q
Sbjct: 207 LQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNPE 246
>gnl|CDD|227310 COG4976, COG4976, Predicted methyltransferase (contains TPR repeat)
[General function prediction only].
Length = 287
Score = 30.9 bits (70), Expect = 6.2
Identities = 14/51 (27%), Positives = 24/51 (47%)
Query: 1778 EYDKAVDCFRSALQVRPDDSRLWNRLGASLANGNRPEEAVEAYHTALQLSP 1828
+ + A + + AL++ P+ + W RLG + A AY L+L P
Sbjct: 10 DAEAAAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDP 60
>gnl|CDD|221954 pfam13170, DUF4003, Protein of unknown function (DUF4003). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 327 and 345 amino
acids in length.
Length = 297
Score = 30.7 bits (70), Expect = 6.9
Identities = 15/83 (18%), Positives = 37/83 (44%), Gaps = 8/83 (9%)
Query: 768 LLVYLFSSLDFQPPKQKISDPDELRDYQHRKRKAFEDNIRKNRMVISNWI--KYAQWEES 825
+ + F+ P++ I + ++ Y K + F+ R + ++ I +Y + E+
Sbjct: 61 YSLASLLDIHFEDPEEAIQEILDI--YDKLKEEGFK---RSEYLYLAALIILEYREKEDY 115
Query: 826 QKQVDRARSIYERALDVDHRNIT 848
+ ++R + IY+ H +T
Sbjct: 116 DEIINRTKEIYKLMKK-KHPFLT 137
>gnl|CDD|190425 pfam02786, CPSase_L_D2, Carbamoyl-phosphate synthase L chain, ATP
binding domain. Carbamoyl-phosphate synthase catalyzes
the ATP-dependent synthesis of carbamyl-phosphate from
glutamine or ammonia and bicarbonate. This important
enzyme initiates both the urea cycle and the biosynthesis
of arginine and/or pyrimidines. The carbamoyl-phosphate
synthase (CPS) enzyme in prokaryotes is a heterodimer of
a small and large chain. The small chain promotes the
hydrolysis of glutamine to ammonia, which is used by the
large chain to synthesise carbamoyl phosphate. See
pfam00988. The small chain has a GATase domain in the
carboxyl terminus. See pfam00117. The ATP binding domain
(this one) has an ATP-grasp fold.
Length = 211
Score = 30.3 bits (69), Expect = 7.6
Identities = 14/58 (24%), Positives = 21/58 (36%)
Query: 1492 LFSNGVASINQQPVPNTNGNLHSLDRTLAVSKLFKGPIIGTPSTTSGVSTSGTVDKPE 1549
LF + VP T G + + + LA +K P+I + G G E
Sbjct: 4 LFKAAMKEAGVPTVPGTAGPVETEEEALAAAKEIGYPVIIKAAFGGGGLGMGIARNEE 61
>gnl|CDD|177845 PLN02194, PLN02194, cytochrome-c oxidase.
Length = 265
Score = 30.4 bits (68), Expect = 8.8
Identities = 16/35 (45%), Positives = 20/35 (57%), Gaps = 2/35 (5%)
Query: 1408 WFSDFSRNQRSSVSMHVSLVFDLRPRYGPILFCVS 1442
W+ D R S++ H + V L PRYG ILF VS
Sbjct: 60 WWRDVLR--ESTLEGHHTKVVQLGPRYGSILFIVS 92
Score = 30.4 bits (68), Expect = 9.8
Identities = 14/26 (53%), Positives = 15/26 (57%)
Query: 269 ESAFLGQNASVFDLPPRYGPILFCVS 294
ES G + V L PRYG ILF VS
Sbjct: 67 ESTLEGHHTKVVQLGPRYGSILFIVS 92
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.132 0.399
Gapped
Lambda K H
0.267 0.0733 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 97,779,808
Number of extensions: 9517912
Number of successful extensions: 7776
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7716
Number of HSP's successfully gapped: 166
Length of query: 1956
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1845
Effective length of database: 6,014,308
Effective search space: 11096398260
Effective search space used: 11096398260
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (29.6 bits)