RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9209
(349 letters)
>gnl|CDD|238112 cd00189, TPR, Tetratricopeptide repeat domain; typically contains
34 amino acids
[WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-
X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found
in a variety of organisms including bacteria,
cyanobacteria, yeast, fungi, plants, and humans in
various subcellular locations; involved in a variety of
functions including protein-protein interactions, but
common features in the interaction partners have not
been defined; involved in chaperone, cell-cycle,
transciption, and protein transport complexes; the
number of TPR motifs varies among proteins (1,3-11,13
15,16,19); 5-6 tandem repeats generate a right-handed
helical structure with an amphipathic channel that is
thought to accomodate an alpha-helix of a target
protein; it has been proposed that TPR proteins
preferably interact with WD-40 repeat proteins, but in
many instances several TPR-proteins seem to aggregate to
multi-protein complexes; examples of TPR-proteins
include, Cdc16p, Cdc23p and Cdc27p components of the
cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal
targeting signals, the Tom70p co-receptor for
mitochondrial targeting signals, Ser/Thr phosphatase 5C
and the p110 subunit of O-GlcNAc transferase; three
copies of the repeat are present here.
Length = 100
Score = 57.4 bits (139), Expect = 1e-10
Identities = 23/84 (27%), Positives = 46/84 (54%)
Query: 93 SQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQ 152
+A+ Y+++AL+L+P+ + L Y ++ A++ Y +A+E++ + +A+Y LG
Sbjct: 17 DEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGL 76
Query: 153 TYEILRLPYYGLYYYKQAHMVRPN 176
Y L L Y++A + PN
Sbjct: 77 AYYKLGKYEEALEAYEKALELDPN 100
Score = 55.8 bits (135), Expect = 5e-10
Identities = 25/96 (26%), Positives = 49/96 (51%)
Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAH 171
LG+ Y ++ + + A++ Y +A+E++ + A+Y L Y L L Y++A
Sbjct: 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKAL 61
Query: 172 MVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAI 207
+ P++ + LG A+ K K EA++ Y K+ +
Sbjct: 62 ELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALEL 97
Score = 35.8 bits (83), Expect = 0.004
Identities = 22/137 (16%), Positives = 47/137 (34%), Gaps = 46/137 (33%)
Query: 4 KLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLDTAYKCAQ 63
LY KL + + A + + + + A A+ LA + K + A +
Sbjct: 8 NLYYKLGDYDEALEYYEKALELDPDNAD---------AYYNLAAAYYKLGKYEEALED-- 56
Query: 64 KCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEM 123
+++AL+L+P+ + LG Y ++
Sbjct: 57 -----------------------------------YEKALELDPDNAKAYYNLGLAYYKL 81
Query: 124 KNTNAAIQCYRQAIEIN 140
A++ Y +A+E++
Sbjct: 82 GKYEEALEAYEKALELD 98
Score = 31.6 bits (72), Expect = 0.13
Identities = 18/83 (21%), Positives = 33/83 (39%), Gaps = 9/83 (10%)
Query: 213 KALFKLAKLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLD 272
+AL L LY KL + + A + + + + A A+ LA + K +
Sbjct: 1 EALLNLGNLYYKLGDYDEALEYYEKALELDPDNAD---------AYYNLAAAYYKLGKYE 51
Query: 273 TAYKCAQKCLQHEETAEEAKSLL 295
A + +K L+ + +A L
Sbjct: 52 EALEDYEKALELDPDNAKAYYNL 74
>gnl|CDD|223533 COG0457, NrfG, FOG: TPR repeat [General function prediction only].
Length = 291
Score = 44.5 bits (103), Expect = 4e-05
Identities = 40/204 (19%), Positives = 73/204 (35%), Gaps = 6/204 (2%)
Query: 4 KLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLDTAYKCAQ 63
L L E + L+ A L + A + +
Sbjct: 60 GLLLLLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLE 119
Query: 64 KCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNP---NYLGVWTLLGHEY 120
K L + + A++LL A D +A+ +++AL+L+P LG
Sbjct: 120 KALALDPDPDLAEALLALGALYELGD--YEEALELYEKALELDPELNELAEALLALGALL 177
Query: 121 MEMKNTNAAIQCYRQAIEIN-NLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPR 179
+ A++ +A+++N + D A LG Y L L YY++A + P++
Sbjct: 178 EALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAE 237
Query: 180 MLTALGEAFEKQEKISEAMKCYNK 203
L L + + EA++ K
Sbjct: 238 ALYNLALLLLELGRYEEALEALEK 261
Score = 29.0 bits (63), Expect = 3.5
Identities = 17/101 (16%), Positives = 40/101 (39%), Gaps = 2/101 (1%)
Query: 40 FAFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYF 99
A L + A + +K L+ + L + + +A+ Y+
Sbjct: 168 EALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGK--YEEALEYY 225
Query: 100 QRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
++AL+L+P+ L +E+ A++ +A+E++
Sbjct: 226 EKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELD 266
>gnl|CDD|222112 pfam13414, TPR_11, TPR repeat.
Length = 69
Score = 39.2 bits (92), Expect = 2e-04
Identities = 11/44 (25%), Positives = 23/44 (52%)
Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYE 155
LG+ ++ + + AI+ Y +A+E++ + A+Y L Y
Sbjct: 5 ALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYL 48
Score = 37.7 bits (88), Expect = 5e-04
Identities = 11/48 (22%), Positives = 30/48 (62%), Gaps = 1/48 (2%)
Query: 94 QAVLYFQRALKLNPNYLGVWTLLGHEYMEM-KNTNAAIQCYRQAIEIN 140
+A+ +++AL+L+P+ + L Y+++ K+ A++ +A+E++
Sbjct: 21 EAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEALEDLEKALELD 68
Score = 33.4 bits (77), Expect = 0.016
Identities = 16/57 (28%), Positives = 26/57 (45%)
Query: 176 NDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAA 232
++ L LG A K EA++ Y K+ + + +A + LA Y KL + A
Sbjct: 1 DNAEALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYEEA 57
Score = 29.2 bits (66), Expect = 0.61
Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 2/67 (2%)
Query: 41 AFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQ 100
A K L N K + D A + +K L+ + EA L K D +A+ +
Sbjct: 5 ALKNLGNALFKLGDYDEAIEAYEKALELDPDNAEAYYNLALAYLKLGKDYE--EALEDLE 62
Query: 101 RALKLNP 107
+AL+L+P
Sbjct: 63 KALELDP 69
>gnl|CDD|234059 TIGR02917, PEP_TPR_lipo, putative PEP-CTERM system TPR-repeat
lipoprotein. This protein family occurs in strictly
within a subset of Gram-negative bacterial species with
the proposed PEP-CTERM/exosortase system, analogous to
the LPXTG/sortase system common in Gram-positive
bacteria. This protein occurs in a species if and only
if a transmembrane histidine kinase (TIGR02916) and a
DNA-binding response regulator (TIGR02915) also occur.
The present of tetratricopeptide repeats (TPR) suggests
protein-protein interaction, possibly for the regulation
of PEP-CTERM protein expression, since many PEP-CTERM
proteins in these genomes are preceded by a proposed DNA
binding site for the response regulator.
Length = 899
Score = 42.8 bits (101), Expect = 2e-04
Identities = 46/197 (23%), Positives = 77/197 (39%), Gaps = 18/197 (9%)
Query: 41 AFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIA-QKRQPDESSSQAVLYF 99
A LA+ + N A ++ L+ + EA+ L + ++ + A
Sbjct: 637 ALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTES----AKKIA 692
Query: 100 QRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLD------YRAWYGLGQT 153
+ K +P + L G Y+ K+ AAIQ YR+A++ +RA G T
Sbjct: 693 KSLQKQHPKAALGFELEGDLYLRQKDYPAAIQAYRKALKRAPSSQNAIKLHRALLASGNT 752
Query: 154 YEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGK 213
E ++ + PND + TAL E + Q+ +A+K Y +
Sbjct: 753 AEAVKT-------LEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYQTVVKKAPDNAV 805
Query: 214 ALFKLAKLYDKLNETEA 230
L LA LY +L + A
Sbjct: 806 VLNNLAWLYLELKDPRA 822
Score = 41.2 bits (97), Expect = 8e-04
Identities = 48/269 (17%), Positives = 102/269 (37%), Gaps = 12/269 (4%)
Query: 39 FFAFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLY 98
F A LA ++ N D A + +K L + A L + + + +AV +
Sbjct: 499 FPAAANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTG---NEEEAVAW 555
Query: 99 FQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILR 158
++A +LNP + L Y+ A+ +A + AW LG+
Sbjct: 556 LEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAG 615
Query: 159 LPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKL 218
+ +K+ ++P+ L L +A+ + ++A+ ++ + + +A L
Sbjct: 616 DLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPDNTEAQIGL 675
Query: 219 AKLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLDTAYKCA 278
A+L TE+A + + A F+ + +L+ + A +
Sbjct: 676 AQLLLAAKRTESAKKIAKSLQKQHPKAA---------LGFELEGDLYLRQKDYPAAIQAY 726
Query: 279 QKCLQHEETAEEAKSLLRSIAQKRQPDES 307
+K L+ +++ A L R++ E+
Sbjct: 727 RKALKRAPSSQNAIKLHRALLASGNTAEA 755
Score = 38.5 bits (90), Expect = 0.004
Identities = 39/196 (19%), Positives = 74/196 (37%), Gaps = 5/196 (2%)
Query: 41 AFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIA-QKRQPDESSSQAVLYF 99
A + LA+ L+ +D A L + A SLL ++ A Y
Sbjct: 331 ARRLLASIQLRLGRVDEAIATLSPALGLDPDDPAALSLLGEAYLALGDFEK----AAEYL 386
Query: 100 QRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRL 159
+A +L+P T LG + + + AI A +++ RA L +Y
Sbjct: 387 AKATELDPENAAARTQLGISKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQ 446
Query: 160 PYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLA 219
L K+ +P++ + LG + + +++A + + K+ +I A LA
Sbjct: 447 FDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLA 506
Query: 220 KLYDKLNETEAAADLF 235
++ + + A F
Sbjct: 507 RIDIQEGNPDDAIQRF 522
Score = 37.8 bits (88), Expect = 0.009
Identities = 33/183 (18%), Positives = 67/183 (36%), Gaps = 9/183 (4%)
Query: 92 SSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLG 151
S+A+ + A +L+P LL Y+ + A+ ++ + + LG
Sbjct: 413 PSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALAAAKKLEKKQPDNASLHNLLG 472
Query: 152 QTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDAD 211
Y +++A + P+ L ++ +A++ + K I +
Sbjct: 473 AIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDPKN 532
Query: 212 GKALFKLAKLYDKLNETEAAADLFMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNL 271
+A+ LA LY + E A ++E ++L+ P LA ++L L
Sbjct: 533 LRAILALAGLYLRTGNEEEAVA-WLEKAAELNPQEIEPALA--------LAQYYLGKGQL 583
Query: 272 DTA 274
A
Sbjct: 584 KKA 586
Score = 35.4 bits (82), Expect = 0.047
Identities = 61/302 (20%), Positives = 110/302 (36%), Gaps = 15/302 (4%)
Query: 21 EFVSKLDTFAAPPDKTCGFFAFKYLAN-HHLKANNLDTAYKCAQKCLQHEETAEEAKSLL 79
+ +T + LA + NL+ AY+ + L++ + +A+ LL
Sbjct: 276 NYEDARETLQDALKSAPEYLPALLLAGASEYQLGNLEQAYQYLNQILKYAPNSHQARRLL 335
Query: 80 RSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEI 139
SI Q R + A L AL L+P+ +LLG Y+ + + A + +A E+
Sbjct: 336 ASI-QLRLGRVDEAIATL--SPALGLDPDDPAALSLLGEAYLALGDFEKAAEYLAKATEL 392
Query: 140 NNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMK 199
+ + A LG + P + + A + P R L ++ + + +A+
Sbjct: 393 DPENAAARTQLGISKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGQFDKALA 452
Query: 200 CYNKSRAIGDADGKALFKLAKLYDKLNETEAA-ADLFMEFVSKLDTFAAPPDKTCGFFAF 258
K A L + + A A E ++ PD F A
Sbjct: 453 AAKKLEK--KQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIE-----PDF---FPAA 502
Query: 259 KYLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQPMECAVVL 318
LA ++ N D A + +K L + A L + + +E + +E A L
Sbjct: 503 ANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAEL 562
Query: 319 DP 320
+P
Sbjct: 563 NP 564
Score = 32.0 bits (73), Expect = 0.50
Identities = 21/98 (21%), Positives = 37/98 (37%), Gaps = 5/98 (5%)
Query: 44 YLANHHLKANNLDTAYKCAQKCLQHEETAEEAK-SLLRSIAQKRQPDESSSQAVLYFQRA 102
+L L+ A K ++ L + + AK L + + + DE A
Sbjct: 130 LRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDE----ARALIDEV 185
Query: 103 LKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
L +P + L G + + N A+ YR+AI +
Sbjct: 186 LTADPGNVDALLLKGDLLLSLGNIELALAAYRKAIALR 223
Score = 31.6 bits (72), Expect = 0.75
Identities = 26/111 (23%), Positives = 43/111 (38%), Gaps = 4/111 (3%)
Query: 44 YLANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRAL 103
LA +L + D A K Q ++ + L + + + +A+ Y +RAL
Sbjct: 775 ALAELYLAQKDYDKAIKHYQTVVKKAPDNAVVLNNLAWLYLELKD----PRALEYAERAL 830
Query: 104 KLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTY 154
KL PN + LG +E + A+ R+A+ I Y L
Sbjct: 831 KLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAIRYHLALAL 881
Score = 30.4 bits (69), Expect = 1.7
Identities = 42/175 (24%), Positives = 71/175 (40%), Gaps = 24/175 (13%)
Query: 69 EETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNA 128
EE E AKS L QK + A++ + AL+ +PN LLG Y+ + + A
Sbjct: 23 EELIEAAKSYL----QKNKYKA----AIIQLKNALQKDPNDAEARFLLGKIYLALGDYAA 74
Query: 129 AIQCYRQAIE-------INNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRML 181
A + R+A+ + L RA+ G+ ++L + + +L
Sbjct: 75 AEKELRKALSLGYPKNQVLPLLARAYLLQGKFQQVLD------ELPGKTLLDDEGAAELL 128
Query: 182 TALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKL---YDKLNETEAAAD 233
G A+ ++ A K Y ++ AI A LA+L ++ +E A D
Sbjct: 129 ALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALID 183
Score = 29.3 bits (66), Expect = 4.1
Identities = 27/133 (20%), Positives = 46/133 (34%), Gaps = 9/133 (6%)
Query: 175 PNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADL 234
PND LG+ + + A K K+ ++G + L LA+ Y + + D
Sbjct: 53 PNDAEARFLLGKIYLALGDYAAAEKELRKALSLGYPKNQVLPLLARAYLLQGKFQQVLDE 112
Query: 235 FMEFVSKLDTFAAPPDKTCGFFAFKYLANHHLKANNLDTAYKCAQKCLQHEETAEEAK-S 293
D AA +L L+ A K ++ L + + AK
Sbjct: 113 LPGKTLLDDEGAAE--------LLALRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLG 164
Query: 294 LLRSIAQKRQPDE 306
L + + + DE
Sbjct: 165 LAQLALAENRFDE 177
>gnl|CDD|201277 pfam00515, TPR_1, Tetratricopeptide repeat.
Length = 34
Score = 37.0 bits (87), Expect = 4e-04
Identities = 8/29 (27%), Positives = 18/29 (62%)
Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
LG+ Y+++ + A++ Y +A+E+N
Sbjct: 3 ALYNLGNAYLKLGKYDEALEYYEKALELN 31
Score = 25.8 bits (58), Expect = 3.5
Identities = 8/16 (50%), Positives = 14/16 (87%)
Query: 94 QAVLYFQRALKLNPNY 109
+A+ Y+++AL+LNPN
Sbjct: 19 EALEYYEKALELNPNN 34
Score = 25.5 bits (57), Expect = 4.5
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 178 PRMLTALGEAFEKQEKISEAMKCYNK 203
+ L LG A+ K K EA++ Y K
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEK 26
Score = 25.1 bits (56), Expect = 6.3
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 144 YRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPND 177
+A Y LG Y L L YY++A + PN+
Sbjct: 1 AKALYNLGNAYLKLGKYDEALEYYEKALELNPNN 34
>gnl|CDD|197478 smart00028, TPR, Tetratricopeptide repeats. Repeats present in 4
or more copies in proteins. Contain a minimum of 34
amino acids each and self-associate via a "knobs and
holes" mechanism.
Length = 34
Score = 36.3 bits (85), Expect = 8e-04
Identities = 7/29 (24%), Positives = 19/29 (65%)
Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
LG+ Y+++ + + A++ Y +A+E++
Sbjct: 3 ALYNLGNAYLKLGDYDEALEYYEKALELD 31
Score = 26.6 bits (60), Expect = 2.0
Identities = 9/30 (30%), Positives = 14/30 (46%)
Query: 178 PRMLTALGEAFEKQEKISEAMKCYNKSRAI 207
L LG A+ K EA++ Y K+ +
Sbjct: 1 AEALYNLGNAYLKLGDYDEALEYYEKALEL 30
>gnl|CDD|131573 TIGR02521, type_IV_pilW, type IV pilus biogenesis/stability protein
PilW. Members of this family are designated PilF in ref
(PMID:8973346) and PilW in ref (PMID:15612916). This
outer membrane protein is required both for pilus
stability and for pilus function such as adherence to
human cells. Members of this family contain copies of
the TPR (tetratricopeptide repeat) domain.
Length = 234
Score = 40.0 bits (94), Expect = 0.001
Identities = 44/192 (22%), Positives = 75/192 (39%), Gaps = 9/192 (4%)
Query: 45 LANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALK 104
LA +L+ +L+ A + K L+H+ A L Q+ E +A F+RAL
Sbjct: 37 LALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELE---KAEDSFRRALT 93
Query: 105 LNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLD--YRAWYGLGQTYEILRLPYY 162
LNPN V G + A+Q + QAIE R+ G L+ +
Sbjct: 94 LNPNNGDVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLENAGLCA--LKAGDF 151
Query: 163 GLY--YYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAK 220
Y +A + P P L L E + + + +A + + + ++L+ +
Sbjct: 152 DKAEKYLTRALQIDPQRPESLLELAELYYLRGQYKDARAYLERYQQTYNQTAESLWLGIR 211
Query: 221 LYDKLNETEAAA 232
+ L + AA
Sbjct: 212 IARALGDVAAAQ 223
Score = 39.6 bits (93), Expect = 0.001
Identities = 26/151 (17%), Positives = 50/151 (33%), Gaps = 18/151 (11%)
Query: 100 QRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLG-------- 151
+AL+ +P+ + L Y ++ A +R+A+ +N + G
Sbjct: 55 DKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGK 114
Query: 152 --QTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGD 209
Q + Y + A R L G K +A K ++ I
Sbjct: 115 YEQAMQQFEQAIEDPLYPQPA--------RSLENAGLCALKAGDFDKAEKYLTRALQIDP 166
Query: 210 ADGKALFKLAKLYDKLNETEAAADLFMEFVS 240
++L +LA+LY + + A +
Sbjct: 167 QRPESLLELAELYYLRGQYKDARAYLERYQQ 197
>gnl|CDD|205362 pfam13181, TPR_8, Tetratricopeptide repeat.
Length = 34
Score = 35.5 bits (83), Expect = 0.001
Identities = 8/30 (26%), Positives = 18/30 (60%)
Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEINN 141
+ LLG Y+++ + A + Y +A+E++
Sbjct: 3 AYYLLGQIYLQLGDYEEAKEYYEKALELDP 32
>gnl|CDD|222123 pfam13432, TPR_16, Tetratricopeptide repeat.
Length = 65
Score = 36.5 bits (85), Expect = 0.001
Identities = 11/63 (17%), Positives = 19/63 (30%)
Query: 116 LGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRP 175
L + + + A+ A+ L A LG+ + A P
Sbjct: 3 LARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALAADP 62
Query: 176 NDP 178
+DP
Sbjct: 63 DDP 65
Score = 28.4 bits (64), Expect = 1.1
Identities = 12/58 (20%), Positives = 20/58 (34%)
Query: 183 ALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADLFMEFVS 240
AL A + EA+ + A +AL L + + AA L ++
Sbjct: 2 ALARAALRAGDYDEALAALEAALARYPLAAEALLLLGEALLRQGRLAEAAALLRAALA 59
>gnl|CDD|226687 COG4235, COG4235, Cytochrome c biogenesis factor [Posttranslational
modification, protein turnover, chaperones].
Length = 287
Score = 39.3 bits (92), Expect = 0.002
Identities = 32/139 (23%), Positives = 52/139 (37%), Gaps = 29/139 (20%)
Query: 76 KSLLRSIAQKRQP--DESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCY 133
+ R QP ++ + + L+ NP W LLG YM + + A+ Y
Sbjct: 120 QPADRLADPLAQPPAEQEMEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAY 179
Query: 134 RQAIEI--NNLDYRAWYGLGQTYEILRLPYYGLYY-------------YKQAHMVRPNDP 178
R A+ + +N + GL + LYY +QA + P +
Sbjct: 180 RNALRLAGDNPEI--LLGLAEA----------LYYQAGQQMTAKARALLRQALALDPANI 227
Query: 179 RMLTALGEAFEKQEKISEA 197
R L+ L A +Q +EA
Sbjct: 228 RALSLLAFAAFEQGDYAEA 246
>gnl|CDD|233013 TIGR00540, TPR_hemY_coli, heme biosynthesis-associated TPR protein.
Members of this protein family are uncharacterized
tetratricopeptide repeat (TPR) proteins invariably found
in heme biosynthesis gene clusters. The absence of any
invariant residues other than Ala argues against this
protein serving as an enzyme per se. The gene symbol
hemY assigned in E. coli is unfortunate in that an
unrelated protein, protoporphyrinogen oxidase (HemG in
E. coli) is designated HemY in Bacillus subtilis
[Unknown function, General].
Length = 367
Score = 37.7 bits (88), Expect = 0.007
Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 174 RPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAAD 233
P+D +L ALG +Q+ +A S ++ +A +LA+L+++L +TEAAA
Sbjct: 304 HPDDALLLLALGRLCLRQQLWGKAQSYLEASLSL-APTEEAHLELAQLFEQLGDTEAAAQ 362
Query: 234 LF 235
+
Sbjct: 363 HY 364
>gnl|CDD|227343 COG5010, TadD, Flp pilus assembly protein TadD, contains TPR
repeats [Intracellular trafficking and secretion].
Length = 257
Score = 37.0 bits (86), Expect = 0.009
Identities = 21/81 (25%), Positives = 35/81 (43%), Gaps = 16/81 (19%)
Query: 72 AEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQ 131
A + K+ +R + + +AV ++A +L P W LLG ++ + A +
Sbjct: 104 AAQGKNQIR--------NGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARR 155
Query: 132 CYRQAIE--------INNLDY 144
YRQA+E NNL
Sbjct: 156 AYRQALELAPNEPSIANNLGM 176
Score = 31.6 bits (72), Expect = 0.51
Identities = 22/117 (18%), Positives = 41/117 (35%), Gaps = 1/117 (0%)
Query: 121 MEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRM 180
+ M+ T A A+ N D L + L +++ + P D +
Sbjct: 44 LAMRQTQGAAAALGAAVLRNPEDLSI-AKLATALYLRGDADSSLAVLQKSAIAYPKDREL 102
Query: 181 LTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADLFME 237
L A G+ + EA+ K+ + D +A L D+L + A + +
Sbjct: 103 LAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQ 159
Score = 30.1 bits (68), Expect = 1.7
Identities = 12/64 (18%), Positives = 22/64 (34%)
Query: 174 RPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAAD 233
P D LG A ++ + EA + Y ++ + + L + E A
Sbjct: 130 APTDWEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAET 189
Query: 234 LFME 237
L +
Sbjct: 190 LLLP 193
>gnl|CDD|205602 pfam13424, TPR_12, Tetratricopeptide repeat.
Length = 78
Score = 33.5 bits (77), Expect = 0.019
Identities = 14/66 (21%), Positives = 26/66 (39%), Gaps = 7/66 (10%)
Query: 179 RMLTALGEAFEKQEKISEAMKCYNKSRAI----GDADGK---ALFKLAKLYDKLNETEAA 231
L L + EA++ K+ + G+ + AL LA+LY L + + A
Sbjct: 6 AALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEA 65
Query: 232 ADLFME 237
+ +
Sbjct: 66 LEYLEK 71
Score = 30.0 bits (68), Expect = 0.38
Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 7/78 (8%)
Query: 106 NPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEI------NNLDY-RAWYGLGQTYEILR 158
+P+ L + + + A++ +A+E+ ++ + RA L + Y L
Sbjct: 1 HPDLAAALNNLALVLRRLGDYDEALELLEKALELARELGEDHPETARALNNLARLYLALG 60
Query: 159 LPYYGLYYYKQAHMVRPN 176
L Y ++A +R
Sbjct: 61 DYDEALEYLEKALALREA 78
Score = 27.3 bits (61), Expect = 3.4
Identities = 9/53 (16%), Positives = 24/53 (45%), Gaps = 7/53 (13%)
Query: 94 QAVLYFQRALKL-------NPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEI 139
+A+ ++AL+L +P L Y+ + + + A++ +A+ +
Sbjct: 23 EALELLEKALELARELGEDHPETARALNNLARLYLALGDYDEALEYLEKALAL 75
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 35.8 bits (83), Expect = 0.033
Identities = 15/76 (19%), Positives = 29/76 (38%), Gaps = 5/76 (6%)
Query: 82 IAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINN 141
+ + + A+ ++AL L+PN + L ++ AI+ + + +
Sbjct: 351 LEANKAKE-----AIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDP 405
Query: 142 LDYRAWYGLGQTYEIL 157
D W L Q Y L
Sbjct: 406 EDPNGWDLLAQAYAEL 421
Score = 31.6 bits (72), Expect = 0.66
Identities = 20/129 (15%), Positives = 41/129 (31%)
Query: 69 EETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNA 128
+ + + A + +A+ Q + P+ L G +E
Sbjct: 299 KRSKRGGLAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKE 358
Query: 129 AIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAF 188
AI+ ++A+ ++ L Q P + + P DP L +A+
Sbjct: 359 AIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAY 418
Query: 189 EKQEKISEA 197
+ +EA
Sbjct: 419 AELGNRAEA 427
>gnl|CDD|236983 PRK11788, PRK11788, tetratricopeptide repeat protein; Provisional.
Length = 389
Score = 35.6 bits (83), Expect = 0.038
Identities = 14/63 (22%), Positives = 27/63 (42%), Gaps = 1/63 (1%)
Query: 82 IAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEINN 141
+AQ+ A ++AL +P + LLG + + AAI+ + +E +
Sbjct: 186 LAQQALARGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALER-VEEQD 244
Query: 142 LDY 144
+Y
Sbjct: 245 PEY 247
>gnl|CDD|205609 pfam13431, TPR_17, Tetratricopeptide repeat.
Length = 34
Score = 31.4 bits (72), Expect = 0.042
Identities = 8/34 (23%), Positives = 18/34 (52%)
Query: 98 YFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQ 131
+++AL+L+PN + L + + + A+Q
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLLNLGQYDEALQ 34
Score = 27.5 bits (62), Expect = 0.95
Identities = 6/24 (25%), Positives = 12/24 (50%)
Query: 132 CYRQAIEINNLDYRAWYGLGQTYE 155
Y +A+E++ + A+Y L
Sbjct: 1 LYEKALELDPNNAEAYYNLALLLL 24
>gnl|CDD|183140 PRK11447, PRK11447, cellulose synthase subunit BcsC; Provisional.
Length = 1157
Score = 35.4 bits (82), Expect = 0.052
Identities = 30/133 (22%), Positives = 51/133 (38%), Gaps = 28/133 (21%)
Query: 94 QAVLYFQRALKLNPN--YLGVWTLLGHEYMEMKNTN--AAIQCYRQAIEINNLDYRAWYG 149
+AV F++AL L+P+ W L +K IQ A++ NNL
Sbjct: 321 RAVAQFEKALALDPHSSNRDKWESL------LKVNRYWLLIQQGDAALKANNLA------ 368
Query: 150 LGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGD 209
+ Y+QA V D + LG+ ++ + A + Y ++ +
Sbjct: 369 QAERL------------YQQARQVDNTDSYAVLGLGDVAMARKDYAAAERYYQQALRMDP 416
Query: 210 ADGKALFKLAKLY 222
+ A+ LA LY
Sbjct: 417 GNTNAVRGLANLY 429
>gnl|CDD|225605 COG3063, PilF, Tfp pilus assembly protein PilF [Cell motility and
secretion / Intracellular trafficking and secretion].
Length = 250
Score = 33.6 bits (77), Expect = 0.10
Identities = 29/149 (19%), Positives = 58/149 (38%), Gaps = 21/149 (14%)
Query: 94 QAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAI-------EINNLDYRA 146
QA ++AL+ +P+Y + H Y ++ + A + YR+A+ ++ N +Y A
Sbjct: 53 QAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLN-NYGA 111
Query: 147 W-YGLG---QTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQEKISEAMKCYN 202
+ G + + Y + + L LG K + +A +
Sbjct: 112 FLCAQGRPEEAMQQFERALADPAYGEPS--------DTLENLGLCALKAGQFDQAEEYLK 163
Query: 203 KSRAIGDADGKALFKLAKL-YDKLNETEA 230
++ + AL +LA+L Y + A
Sbjct: 164 RALELDPQFPPALLELARLHYKAGDYAPA 192
Score = 33.2 bits (76), Expect = 0.13
Identities = 19/84 (22%), Positives = 34/84 (40%)
Query: 116 LGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRP 175
L Y++ + A + +A+E + Y A Y+ L Y++A + P
Sbjct: 41 LALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAP 100
Query: 176 NDPRMLTALGEAFEKQEKISEAMK 199
N+ +L G Q + EAM+
Sbjct: 101 NNGDVLNNYGAFLCAQGRPEEAMQ 124
Score = 30.5 bits (69), Expect = 1.1
Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 3/64 (4%)
Query: 45 LANHHLKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALK 104
LA +L+ + A K +K L+H+ + A + QK ++ + + +++AL
Sbjct: 41 LALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADE---SYRKALS 97
Query: 105 LNPN 108
L PN
Sbjct: 98 LAPN 101
>gnl|CDD|221956 pfam13174, TPR_6, Tetratricopeptide repeat.
Length = 33
Score = 29.8 bits (68), Expect = 0.16
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 213 KALFKLAKLYDKLNETEAAADLFMEFVSK 241
AL+KLA Y KL +T+ A + + +
Sbjct: 1 DALYKLALAYLKLGDTDEAKEALERLLKR 29
>gnl|CDD|236624 PRK09782, PRK09782, bacteriophage N4 receptor, outer membrane
subunit; Provisional.
Length = 987
Score = 33.0 bits (75), Expect = 0.25
Identities = 49/235 (20%), Positives = 89/235 (37%), Gaps = 41/235 (17%)
Query: 34 DKTCGFFAFKYLANHHLKANNLDTAYKCAQKC----------LQHEETAEEAKSLLRSIA 83
D A++ ++ H + +L A AQ Q E+ +L +
Sbjct: 524 DYATALAAWQKISLHDMSNEDLLAAANTAQAAGNGAARDRWLQQAEQRGLGDNALYWWLH 583
Query: 84 QKRQPDESSSQAVLYFQRALKLNPN---YLGVWTLLGHEYMEMKNTNAAIQCYRQAIEI- 139
+R A+ R+L + P+ Y+ T+ Y + N AA+ R A+E+
Sbjct: 584 AQRYIPGQPELALNDLTRSLNIAPSANAYVARATI----YRQRHNVPAAVSDLRAALELE 639
Query: 140 -NNLDYRAWYGL--------GQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEK 190
NN +Y+A G Q+ E+L ++AH P+DP ++ L ++
Sbjct: 640 PNNSNYQAALGYALWDSGDIAQSREML----------ERAHKGLPDDPALIRQLAYVNQR 689
Query: 191 QEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADLFMEFVSKLDTF 245
+ ++ + I D D +AL + L + N+ E V + TF
Sbjct: 690 LDDMAATQ--HYARLVIDDIDNQAL--ITPLTPEQNQQRFNFRRLHEEVGRRWTF 740
>gnl|CDD|226428 COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Posttranslational modification, protein
turnover, chaperones].
Length = 620
Score = 32.4 bits (74), Expect = 0.32
Identities = 27/127 (21%), Positives = 46/127 (36%), Gaps = 9/127 (7%)
Query: 77 SLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTN---AAIQCY 133
+ SI P S+ A L + L +NP L A I
Sbjct: 70 AAFLSILLA--PLADSTLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEI 127
Query: 134 RQAIEINNLDYRA----WYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFE 189
+ + +N ++ +Y LG+ ++L ++A + P PR+L AL A +
Sbjct: 128 AEWLSPDNAEFLGHLIRFYQLGRYLKLLGRTAEAELALERAVDLLPKYPRVLGALMTARQ 187
Query: 190 KQEKISE 196
+Q E
Sbjct: 188 EQCSWPE 194
>gnl|CDD|232953 TIGR00392, ileS, isoleucyl-tRNA synthetase. The isoleucyl tRNA
synthetase (IleS) is a class I amino acyl-tRNA ligase
and is particularly closely related to the valyl tRNA
synthetase. This model may recognize IleS from every
species, including eukaryotic cytosolic and
mitochondrial forms [Protein synthesis, tRNA
aminoacylation].
Length = 861
Score = 32.7 bits (75), Expect = 0.36
Identities = 38/153 (24%), Positives = 49/153 (32%), Gaps = 41/153 (26%)
Query: 73 EEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQC 132
EE + R A K Q +E Q FQR LGVW + Y M + Q
Sbjct: 111 EEFREKCREFALK-QIEEQREQ----FQR--------LGVWGDWENPYKTMDPSYEESQ- 156
Query: 133 YRQAIEINNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRPNDPRMLTALGEAFEKQE 192
W+ + +E GL Y + PR TAL EA E +
Sbjct: 157 --------------WWLFKEAHE------KGLLY--RGLKPVYWSPRCRTALAEA-EVEY 193
Query: 193 KISEAMK----CYNKSRAIGDADGKALFKLAKL 221
K + Y K D KL+ L
Sbjct: 194 KENYKDVKDPSIYVKFPVKKDKKTYLKVKLSSL 226
>gnl|CDD|225504 COG2956, COG2956, Predicted N-acetylglucosaminyl transferase
[Carbohydrate transport and metabolism].
Length = 389
Score = 32.4 bits (74), Expect = 0.37
Identities = 10/59 (16%), Positives = 25/59 (42%)
Query: 82 IAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
+AQ+ +A ++AL+ + + +LG + + A++ + +E N
Sbjct: 186 LAQQALASSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQN 244
>gnl|CDD|239108 cd02430, PTH2, Peptidyl-tRNA hydrolase, type 2 (PTH2).
Peptidyl-tRNA hydrolase (PTH) activity releases tRNA
from the premature translation termination product
peptidyl-tRNA, therefore allowing the tRNA and peptide
to be reused in protein synthesis. PTH2 is present in
archaea and eukaryotes.
Length = 115
Score = 30.6 bits (70), Expect = 0.38
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 128 AAIQCYRQAIEINNLDYRAWYGLGQ 152
AA+ Y++A++ N RAW GQ
Sbjct: 25 AALGAYKKAMKSNPELLRAWEREGQ 49
>gnl|CDD|191825 pfam07719, TPR_2, Tetratricopeptide repeat. This Pfam entry
includes outlying Tetratricopeptide-like repeats (TPR)
that are not matched by pfam00515.
Length = 34
Score = 28.6 bits (65), Expect = 0.39
Identities = 7/29 (24%), Positives = 16/29 (55%)
Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEIN 140
LG Y ++ + A++ Y +A+E++
Sbjct: 3 ALYNLGLAYYKLGDYEEALEAYEKALELD 31
>gnl|CDD|225613 COG3071, HemY, Uncharacterized enzyme of heme biosynthesis
[Coenzyme metabolism].
Length = 400
Score = 31.6 bits (72), Expect = 0.59
Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 1/63 (1%)
Query: 175 PNDPRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKLYDKLNETEAAADL 234
P DP +L+ LG K + +A + + + + +LA D+L E E A +
Sbjct: 325 PEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSA-SDYAELADALDQLGEPEEAEQV 383
Query: 235 FME 237
E
Sbjct: 384 RRE 386
>gnl|CDD|190182 pfam01981, PTH2, Peptidyl-tRNA hydrolase PTH2. Peptidyl-tRNA
hydrolases are enzymes that release tRNAs from
peptidyl-tRNA during translation.
Length = 116
Score = 30.2 bits (69), Expect = 0.60
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 128 AAIQCYRQAIEINNLDYRAWYGLGQTYEILR 158
AA+ Y++A++ N +AW GQ +L+
Sbjct: 26 AAVGAYKKALKANPELLKAWLREGQKKIVLK 56
>gnl|CDD|222121 pfam13428, TPR_14, Tetratricopeptide repeat.
Length = 44
Score = 28.3 bits (63), Expect = 0.73
Identities = 7/41 (17%), Positives = 17/41 (41%)
Query: 112 VWTLLGHEYMEMKNTNAAIQCYRQAIEINNLDYRAWYGLGQ 152
L + + + + A+ R+A+ ++ D A L +
Sbjct: 3 ALLALARALLALGDLDEALALLRRALALDPDDPEALLLLAR 43
Score = 26.0 bits (57), Expect = 4.8
Identities = 14/44 (31%), Positives = 21/44 (47%)
Query: 178 PRMLTALGEAFEKQEKISEAMKCYNKSRAIGDADGKALFKLAKL 221
P L AL A + EA+ ++ A+ D +AL LA+L
Sbjct: 1 PAALLALARALLALGDLDEALALLRRALALDPDDPEALLLLARL 44
>gnl|CDD|219851 pfam08456, Vmethyltransf_C, Viral methyltransferase C-terminal.
This domain is found to the C-terminus of the viral
methyltransferase domain (pfam01660) in
single-stranded-RNA positive-strand viruses with no DNA
stage in the Virgaviridae family.
Length = 208
Score = 30.5 bits (69), Expect = 0.94
Identities = 11/58 (18%), Positives = 18/58 (31%)
Query: 273 TAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQPMECAVVLDPVPPSTRSSKF 330
+K +EE E K + R I + E + L + R + F
Sbjct: 84 DFEDLDEKIRLYEEYERERKRISRKIVSDKLEVSEDGSASESSSDLKELRKFRRRASF 141
>gnl|CDD|226456 COG3947, COG3947, Response regulator containing CheY-like receiver
and SARP domains [Signal transduction mechanisms].
Length = 361
Score = 30.2 bits (68), Expect = 1.6
Identities = 21/88 (23%), Positives = 36/88 (40%), Gaps = 3/88 (3%)
Query: 71 TAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAI 130
T +E K L+ P+ A +R +L LG + Y+E N AI
Sbjct: 243 TIDELKELVGQYKGDYLPEADYPWAEDERERLEQLYMKLLG---KVARAYLEAGKPNEAI 299
Query: 131 QCYRQAIEINNLDYRAWYGLGQTYEILR 158
Q +++A+ ++ L + GL + L
Sbjct: 300 QLHQRALTLDPLSEQDNKGLMASLATLG 327
>gnl|CDD|182415 PRK10370, PRK10370, formate-dependent nitrite reductase complex
subunit NrfG; Provisional.
Length = 198
Score = 29.5 bits (66), Expect = 2.2
Identities = 23/97 (23%), Positives = 38/97 (39%), Gaps = 14/97 (14%)
Query: 79 LRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGHEYMEMKNTNAAIQCYRQAIE 138
L A ++ P+ + Q ++ NP W LLG Y+ + + A+ YRQA++
Sbjct: 46 LHQFASQQTPEAQ----LQALQDKIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQ 101
Query: 139 INNLDYRAWYGLGQTYEILRLPYYGLYYYKQAHMVRP 175
+ G+ E+ LYY HM
Sbjct: 102 LR----------GENAELYAALATVLYYQAGQHMTPQ 128
>gnl|CDD|216950 pfam02259, FAT, FAT domain. The FAT domain is named after FRAP,
ATM and TRRAP.
Length = 350
Score = 29.3 bits (66), Expect = 2.9
Identities = 7/29 (24%), Positives = 15/29 (51%)
Query: 127 NAAIQCYRQAIEINNLDYRAWYGLGQTYE 155
+ +Q YR A++ ++ Y+AW+
Sbjct: 274 DEILQAYRTAVQFDDQWYKAWHSWALANF 302
>gnl|CDD|235280 PRK04322, PRK04322, peptidyl-tRNA hydrolase; Provisional.
Length = 113
Score = 27.9 bits (63), Expect = 3.1
Identities = 8/25 (32%), Positives = 11/25 (44%)
Query: 128 AAIQCYRQAIEINNLDYRAWYGLGQ 152
AA+ Y +A + N W GQ
Sbjct: 23 AAVSAYEKADKSNREWLEEWLNEGQ 47
>gnl|CDD|214513 smart00093, SERPIN, SERine Proteinase INhibitors.
Length = 359
Score = 29.1 bits (66), Expect = 3.2
Identities = 13/61 (21%), Positives = 24/61 (39%), Gaps = 9/61 (14%)
Query: 50 LKANNLDTAYKCAQKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNY 109
L N +T+ + QH LL + + E + L+ ++LKL ++
Sbjct: 46 LGFNLTETSEADIHQGFQH---------LLHLLNRPDSQLELKTANALFVDKSLKLKDSF 96
Query: 110 L 110
L
Sbjct: 97 L 97
>gnl|CDD|218970 pfam06278, DUF1032, Protein of unknown function (DUF1032). This
family consists of several conserved eukaryotic proteins
of unknown function.
Length = 565
Score = 28.8 bits (64), Expect = 5.7
Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 3/47 (6%)
Query: 301 KRQPDESSSQPMECAVVLDPVPPSTRSSKFPANPAYPFATSDPDDDA 347
++ P + QPME + PVP S + P PA A +++A
Sbjct: 165 QKDPSVAEEQPMEVSDCGSPVPVLQFSQEEPEGPA---ANGGYEEEA 208
>gnl|CDD|224971 COG2060, KdpA, K+-transporting ATPase, A chain [Inorganic ion
transport and metabolism].
Length = 560
Score = 28.4 bits (64), Expect = 6.3
Identities = 13/36 (36%), Positives = 17/36 (47%), Gaps = 3/36 (8%)
Query: 95 AVLYFQRALKLNPNYLGV--WTLLGHEYME-MKNTN 127
A+L Q L LNP L W L + + + NTN
Sbjct: 82 ALLMLQGWLPLNPQGLPGLSWDLAFNTAVSFVTNTN 117
>gnl|CDD|219284 pfam07064, RIC1, RIC1. RIC1 has been identified in yeast as a
Golgi protein involved in retrograde transport to the
cis-Golgi network. It forms a heterodimer with Rgp1 and
functions as a guanyl-nucleotide exchange factor.
Length = 251
Score = 28.1 bits (63), Expect = 6.6
Identities = 13/56 (23%), Positives = 19/56 (33%), Gaps = 6/56 (10%)
Query: 63 QKCLQHEETAEEAKSLLRSIAQKRQPDESSSQAVLYFQRALKLNPNYLGVWTLLGH 118
++CL+ + A S L + SS AV AL+ W L
Sbjct: 185 EECLELGRL-KTAASYLLVLQNLEGSASSSDCAVRLLVLALE-----EKNWELCKE 234
>gnl|CDD|189027 cd09857, PIN_EXO1, PIN domain of Exonuclease-1, a
structure-specific, divalent-metal-ion dependent, 5'
nuclease and homologs. Exonuclease-1 (EXO1) is involved
in multiple, eukaryotic DNA metabolic pathways,
including DNA replication processes (5' flap DNA
endonuclease activity and double stranded DNA
5'-exonuclease activity), DNA repair processes (DNA
mismatch repair (MMR) and post-replication repair
(PRR)), recombination, and telomere integrity. EXO1
functions in the MMS2 error-free branch of the PRR
pathway in the maintenance and repair of stalled
replication forks. Studies also suggest that EXO1 plays
both structural and catalytic roles during MMR-mediated
mutation avoidance. EXO1 belongs to the FEN1-EXO1-like
family of structure-specific, 5' nucleases. These
nucleases contain a PIN (PilT N terminus) domain with a
helical arch/clamp region (I domain) of variable length
(approximately 43 residues in EXO1 PIN domains) and a
H3TH (helix-3-turn-helix) domain, an atypical
helix-hairpin-helix-2-like region. Both the H3TH domain
(not included here) and the helical arch/clamp region
are involved in DNA binding. Nucleases within this group
also have a carboxylate-rich active site that is
involved in binding essential divalent metal ion
cofactors (Mg2+/Mn2+). EXO1 nucleases also have
C-terminal Mlh1- and Msh2-binding domains which allow
interaction with MMR and PRR proteins, respectively.
Length = 210
Score = 27.5 bits (62), Expect = 8.0
Identities = 6/25 (24%), Positives = 14/25 (56%)
Query: 116 LGHEYMEMKNTNAAIQCYRQAIEIN 140
E + N + A +C+++A++I
Sbjct: 102 KALELLREGNRSEARECFQRAVDIT 126
>gnl|CDD|152569 pfam12134, PRP8_domainIV, PRP8 domain IV core. This domain is
found in eukaryotes, and is about 20 amino acids in
length. It is found associated with pfam10597,
pfam10596, pfam10598, pfam08083, pfam08082, pfam01398,
pfam08084. There is a conserved LILR sequence motif.
The domain is a selenomethionine domain in a subunit of
the spliceosome. The function of PRP8 domain IV is
believed to be interaction with the splicosomal core.
Length = 231
Score = 27.7 bits (62), Expect = 9.2
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 3/29 (10%)
Query: 63 QKCLQH---EETAEEAKSLLRSIAQKRQP 88
QK L +TAEE +L+RS+ ++ QP
Sbjct: 71 QKRLSQLAKWKTAEEVAALIRSLPKEEQP 99
Score = 27.7 bits (62), Expect = 9.2
Identities = 12/29 (41%), Positives = 18/29 (62%), Gaps = 3/29 (10%)
Query: 279 QKCLQH---EETAEEAKSLLRSIAQKRQP 304
QK L +TAEE +L+RS+ ++ QP
Sbjct: 71 QKRLSQLAKWKTAEEVAALIRSLPKEEQP 99
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 27.8 bits (63), Expect = 9.3
Identities = 9/19 (47%), Positives = 12/19 (63%), Gaps = 2/19 (10%)
Query: 325 TRSSKFPANPAYPFATSDP 343
T+S+ AN YPF T +P
Sbjct: 19 TKSNAEAAN--YPFCTIEP 35
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.131 0.388
Gapped
Lambda K H
0.267 0.0733 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,349,684
Number of extensions: 1602918
Number of successful extensions: 1834
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1781
Number of HSP's successfully gapped: 120
Length of query: 349
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 251
Effective length of database: 6,590,910
Effective search space: 1654318410
Effective search space used: 1654318410
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.5 bits)