RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12594
(542 letters)
>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
40 amino acid long tandemly repeated sequence motif
first identified in the Drosophila segment polarity gene
armadillo; these repeats were also found in the
mammalian armadillo homolog beta-catenin, the junctional
plaque protein plakoglobin, the adenomatous polyposis
coli (APC) tumor suppressor protein, and a number of
other proteins. ARM has been implicated in mediating
protein-protein interactions, but no common features
among the target proteins recognized by the ARM repeats
have been identified; related to the HEAT domain; three
consecutive copies of the repeat are represented by this
alignment model.
Length = 120
Score = 55.8 bits (135), Expect = 1e-09
Identities = 28/119 (23%), Positives = 56/119 (47%), Gaps = 1/119 (0%)
Query: 161 ERIVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNN 220
E ++Q G LP LV +L+S +++ AA+AL + + + ++ + +P L++ L +
Sbjct: 1 EAVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60
Query: 221 DEVNVKADAFLCLADIAKDSPSLASTIAD-GATTLAVEHLTYPHGLVQKNAATLLRDMC 278
++ V A L ++A + + G V L + +QKNA L ++
Sbjct: 61 EDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLA 119
Score = 55.4 bits (134), Expect = 2e-09
Identities = 30/117 (25%), Positives = 58/117 (49%)
Query: 121 LIEDKDLDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQ 180
+I+ L +++S L D + + AA AL L+ N + +V+ G LP LVQ+L S
Sbjct: 3 VIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSED 62
Query: 181 LSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLADIA 237
+ AA +AL + + + +P L+ L++ +++ +A L+++A
Sbjct: 63 EEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLA 119
Score = 44.6 bits (106), Expect = 1e-05
Identities = 29/120 (24%), Positives = 52/120 (43%), Gaps = 6/120 (5%)
Query: 287 LVVLAGGLDGLLRLIDCNGGYNSLPAIMALGYMSAQSEQIAEQIIELQGVGSLQTILQCP 346
V+ AGGL L+ L+ + A AL +SA + + ++E G+ +L +L+
Sbjct: 2 AVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSE 61
Query: 347 HKEQLQVAAAWTLGQIGKHSVDNVTALCEENVLSTLLSKITQTCVSDNLKSKCKNALNIL 406
E++ AA W L + DN + E + L++ + + + KNA L
Sbjct: 62 D-EEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVN-----LLDSSNEDIQKNATGAL 115
Score = 43.8 bits (104), Expect = 2e-05
Identities = 22/119 (18%), Positives = 50/119 (42%), Gaps = 1/119 (0%)
Query: 78 QEVLERNLVALVLKDLTKQN-KHYKKTALHMLTCVAQQDERCAGLIEDKDLDSIISSLHD 136
+ V++ + ++ L+ + ++ A + A ++ ++E L +++ L
Sbjct: 1 EAVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60
Query: 137 FDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDGIV 195
D + A AL LA + +++ G +P LV +L S ++ A AL +
Sbjct: 61 EDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLA 119
Score = 32.7 bits (75), Expect = 0.14
Identities = 24/102 (23%), Positives = 42/102 (41%), Gaps = 3/102 (2%)
Query: 4 KSVKNSIDHYQRARLCFVQTMSAFADKPSTIHFLKDLNVVDLIVPLLADKVPSIQHAALA 63
+ +S ++ QR + +SA + I + + + +V LL + + AAL
Sbjct: 14 SLLSSSDENVQREAAWALSNLSAGNNDN--IQAVVEAGGLPALVQLLKSEDEEVVKAALW 71
Query: 64 TLGKLMCAKHVDIRQEVLERNLVALVLKDLTKQNKHYKKTAL 105
L L A D + VLE V ++ L N+ +K A
Sbjct: 72 ALRNL-AAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNAT 112
>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
trafficking and secretion].
Length = 526
Score = 56.1 bits (135), Expect = 3e-08
Identities = 52/237 (21%), Positives = 99/237 (41%), Gaps = 5/237 (2%)
Query: 152 LAKDNKHFAERIVQCGALPPLVQILASPQLS-LKHAAAFALDGIVKHSPELAESASDCDV 210
L+K+ + ++ G +P V+ + Q L+ AA+AL I + + + D
Sbjct: 99 LSKETSPPIQPVIDAGVVPRFVEFMDEIQRDMLQFEAAWALTNIASGTTQQTKVVVDAGA 158
Query: 211 IPLLIKALNNDEVNVKADAFLCLADIAKDSPSLASTIAD-GA--TTLAVEHLTYPHGLVQ 267
+PL I+ L++ E +V+ A L +IA DS + GA L + + H +
Sbjct: 159 VPLFIQLLSSTEDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGLLLSSAIHISML 218
Query: 268 KNAATLLRDMCKHNLRLANLVVLAGGLDGLLRLIDCNGGYNSLPAIMALGYMSAQSEQIA 327
+NA L ++C+ + ++ L L +LI + A A+ Y+S +
Sbjct: 219 RNATWTLSNLCRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDACWAISYLSDGPNEKI 278
Query: 328 EQIIELQGVGSLQTILQCPHKEQLQVAAAWTLGQIGKHSVDNVTALCEENVLSTLLS 384
+ ++++ G L +L ++Q A ++G I S D + L S
Sbjct: 279 QAVLDVGIPGRLVELLSHE-SAKIQTPALRSVGNIVTGSDDQTQVIINCGALKAFRS 334
Score = 51.0 bits (122), Expect = 1e-06
Identities = 51/260 (19%), Positives = 105/260 (40%), Gaps = 46/260 (17%)
Query: 131 ISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILAS--PQLSLKHAAA 188
I L + D R A ALG +A D++ + ++QCGAL PL+ +L S +S+ A
Sbjct: 163 IQLLSSTEDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGLLLSSAIHISMLRNAT 222
Query: 189 FALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLADIAKDSPSLASTIA 248
+ L + + + ++ +P+L K + + + V DA ++ ++ +
Sbjct: 223 WTLSNLCRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDACWAISYLSDGPNEKIQAVL 282
Query: 249 D-GATTLAVEHLTYPHGLVQKNAATLLRDMCKHNLRLANLVVLAGGLDGLLRLIDCNGGY 307
D G VE L++ +Q
Sbjct: 283 DVGIPGRLVELLSHESAKIQT--------------------------------------- 303
Query: 308 NSLPAIMALGYMSAQSEQIAEQIIELQGVGSLQTILQCPHKEQLQVAAAWTLGQIGKHSV 367
PA+ ++G + S+ + II + + +++L P KE ++ A WT+ I +
Sbjct: 304 ---PALRSVGNIVTGSDDQTQVIINCGALKAFRSLLSSP-KENIRKEACWTISNITAGNT 359
Query: 368 DNVTALCEENVLSTLLSKIT 387
+ + A+ + N++ L+ ++
Sbjct: 360 EQIQAVIDANLIPPLIHLLS 379
Score = 42.2 bits (99), Expect = 6e-04
Identities = 20/67 (29%), Positives = 38/67 (56%)
Query: 163 IVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDE 222
I+ CGAL +L+SP+ +++ A + + I + E ++ D ++IP LI L++ E
Sbjct: 323 IINCGALKAFRSLLSSPKENIRKEACWTISNITAGNTEQIQAVIDANLIPPLIHLLSSAE 382
Query: 223 VNVKADA 229
+K +A
Sbjct: 383 YKIKKEA 389
Score = 35.3 bits (81), Expect = 0.087
Identities = 50/282 (17%), Positives = 108/282 (38%), Gaps = 20/282 (7%)
Query: 35 HFLKDLNVVDLIVPLLADKVPSIQHAALATLGKLMCAKHVDIRQEVLERNLVALVLKDLT 94
+ D V L + LL+ ++ A+ LG + R VL+ + +L L
Sbjct: 151 KVVVDAGAVPLFIQLLSSTEDDVREQAVWALGNI-AGDSEGCRDYVLQCGALEPLLGLLL 209
Query: 95 KQNKHYK--KTALHMLTCVAQQDERCAGLIEDKDLDSIISSL-------HDFDPDTRANA 145
H + A L+ + C G D +I +L + DP+ +A
Sbjct: 210 SSAIHISMLRNATWTLSNL------CRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDA 263
Query: 146 ALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESA 205
A+ L+ + ++ G LV++L+ ++ A ++ IV S + +
Sbjct: 264 CWAISYLSDGPNEKIQAVLDVGIPGRLVELLSHESAKIQTPALRSVGNIVTGSDDQTQVI 323
Query: 206 SDCDVIPLLIKALNNDEVNVKADAFLCLADI-AKDSPSLASTIADGATTLAVEHLTYPHG 264
+C + L++ + N++ +A +++I A ++ + + I + L+
Sbjct: 324 INCGALKAFRSLLSSPKENIRKEACWTISNITAGNTEQIQAVIDANLIPPLIHLLSSAEY 383
Query: 265 LVQKNAATLLRDMCKHNLRLANLV---VLAGGLDGLLRLIDC 303
++K A + + L +++ V G + L L+D
Sbjct: 384 KIKKEACWAISNATSGGLNRPDIIRYLVSQGFIKPLCDLLDV 425
>gnl|CDD|215629 PLN03200, PLN03200, cellulose synthase-interactive protein;
Provisional.
Length = 2102
Score = 46.2 bits (110), Expect = 5e-05
Identities = 50/205 (24%), Positives = 87/205 (42%), Gaps = 12/205 (5%)
Query: 49 LLADKVPSIQHAALATLGKLM-CAKHVDIRQEVLERNLVALVLKDLTKQNKHYKKTALHM 107
LL + P Q A TL KL+ A I Q L AL+L DL + H H+
Sbjct: 538 LLKNGGPKGQEIAAKTLTKLVRTADAATISQ------LTALLLGDLPESKVHVLDVLGHV 591
Query: 108 LTCVAQQDERCAGLIEDKDLDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCG 167
L+ + +D G + L ++I L +T+ AA L + + E +
Sbjct: 592 LSVASLEDLVREGSAANDALRTLIQLLSSSKEETQEKAASVLADIFSSRQDLCESLATDE 651
Query: 168 ALPPLVQILASPQLSLKHAAAFALDGI---VKHSPELAESASDCDVIPLLIKALNNDEVN 224
+ P +++L + ++ +A AL + +K + +++ +A D I LIK + +
Sbjct: 652 IINPCIKLLTNNTEAVATQSARALAALSRSIKENRKVSYAAE--DAIKPLIKLAKSSSIE 709
Query: 225 VKADAFLCLADIAKDSPSLASTIAD 249
V A LA++ D A +A+
Sbjct: 710 VAEQAVCALANLLSDPEVAAEALAE 734
Score = 43.2 bits (102), Expect = 4e-04
Identities = 27/97 (27%), Positives = 50/97 (51%), Gaps = 1/97 (1%)
Query: 122 IEDKDLDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASP-Q 180
+E +D ++ L + D +ANAA L L + +++ GA+ L+++L +
Sbjct: 186 LEAGGVDILVKLLSSGNSDAQANAASLLARLMMAFESSISKVLDAGAVKQLLKLLGQGNE 245
Query: 181 LSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKA 217
+S++ AA AL+ + S E ++ +D IP LI A
Sbjct: 246 VSVRAEAAGALEALSSQSKEAKQAIADAGGIPALINA 282
Score = 37.4 bits (87), Expect = 0.022
Identities = 63/250 (25%), Positives = 102/250 (40%), Gaps = 49/250 (19%)
Query: 131 ISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAA-- 188
+S L + NAA LG L K+ +++ G +PPL+ +L S + AAA
Sbjct: 64 VSLLRSGTLGAKVNAAAVLGVLCKEED-LRVKVLLGGCIPPLLSLLKSGSAEAQKAAAEA 122
Query: 189 -----------------FALDGIVKHSPELAE-----SASDCDVIPLLIKALNNDEVNVK 226
F+ +G+V P L + + D V LL AL N
Sbjct: 123 IYAVSSGGLSDHVGSKIFSTEGVV---PSLWDQLQPGNKQDKVVEGLLTGALRN------ 173
Query: 227 ADAFLCLADIAKDSPSLASTIADGATTLAVEHLTYPHGLVQKNAATLLRDMCKHNLRLAN 286
LC ++T+ G + V+ L+ + Q NAA+LL + +
Sbjct: 174 ----LC----GSTDGFWSATLEAGGVDILVKLLSSGNSDAQANAASLLARLMMAFESSIS 225
Query: 287 LVVLAGGLDGLLRLIDCNGGYNSLP----AIMALGYMSAQSEQIAEQIIELQGVGSLQTI 342
V+ AG + LL+L+ G N + A AL +S+QS++ + I + G+ +L
Sbjct: 226 KVLDAGAVKQLLKLL---GQGNEVSVRAEAAGALEALSSQSKEAKQAIADAGGIPALINA 282
Query: 343 LQCPHKEQLQ 352
P KE +Q
Sbjct: 283 TVAPSKEFMQ 292
Score = 30.1 bits (68), Expect = 3.9
Identities = 21/85 (24%), Positives = 36/85 (42%), Gaps = 2/85 (2%)
Query: 163 IVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDE 222
I G +PPLVQ+L + K +A L + HS ++ +P L+ L N
Sbjct: 484 ITAAGGIPPLVQLLETGSQKAKEDSATVLWNLCCHSEDIRACVESAGAVPALLWLLKNGG 543
Query: 223 VNVKADAFLCLADIAKDSPSLASTI 247
+ A L + + + + +TI
Sbjct: 544 PKGQEIAAKTLTKLVRTADA--ATI 566
Score = 30.1 bits (68), Expect = 4.1
Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 127 LDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHA 186
++ +++ L R +AA AL L +H + + A+ PLV++L + S +HA
Sbjct: 1234 VNQLVAVLRLGSRSARYSAARALQELF-SAEHIRDSELARQAVQPLVEMLNTGSESEQHA 1292
Query: 187 AAFAL 191
A AL
Sbjct: 1293 AIGAL 1297
>gnl|CDD|224331 COG1413, COG1413, FOG: HEAT repeat [Energy production and
conversion].
Length = 335
Score = 43.4 bits (102), Expect = 2e-04
Identities = 57/236 (24%), Positives = 90/236 (38%), Gaps = 33/236 (13%)
Query: 37 LKDLNVVDLIVPLLADKVPSIQHAALATLGKLMCAKHVDIRQEVLERNLVALV------- 89
L V L+ LL+D+ P ++ AA LG+L + V E+LE + V
Sbjct: 70 LGSEEAVPLLRELLSDEDPRVRDAAADALGELGDPEAVPPLVELLENDENEGVRAAAARA 129
Query: 90 ------------LKDLTKQNKHYKKTALHMLTCVAQQDERCAGLIEDKDLDSI---ISSL 134
L + + A + + L E D ++I I L
Sbjct: 130 LGKLGDERALDPLLEALQDEDSGSAAAALDAALLDVRAAAAEALGELGDPEAIPLLIELL 189
Query: 135 HDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALD-- 192
D D D R AA ALG L +N A+ LV+ L+ L ++ AA AL
Sbjct: 190 EDEDADVRRAAASALGQLGSENVEAADL---------LVKALSDESLEVRKAALLALGEI 240
Query: 193 GIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLADIAKDSPSLASTIA 248
G + LA++ D DVI L+ A +++ A L + ++ ++ A
Sbjct: 241 GDEEAVDALAKALEDEDVILALLAAAALGALDLAEAALPLLLLLIDEANAVRLEAA 296
Score = 37.2 bits (86), Expect = 0.016
Identities = 63/289 (21%), Positives = 108/289 (37%), Gaps = 54/289 (18%)
Query: 92 DLTKQNKHYKKTALHMLTCVAQQDERCAGLIEDKDLDSIISSLHDFDPDTRANAALALGG 151
+ AL L + A + D ++ L D D R +AA+ALG
Sbjct: 14 LVLLSYGSEAAAALQALAELDDLILELA----PEAADELLKLLEDEDLLVRLSAAVALGE 69
Query: 152 LAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVI 211
L +E A+P L ++L+ ++ AAA AL G + D + +
Sbjct: 70 LG------SEE-----AVPLLRELLSDEDPRVRDAAADAL-GELG----------DPEAV 107
Query: 212 PLLIKALNNDE-VNVKADAFLCLADIAKDSPSLASTIA---DGATTLAVEHLTYPHGLVQ 267
P L++ L NDE V+A A L + D +L + D + A L V+
Sbjct: 108 PPLVELLENDENEGVRAAAARALGKL-GDERALDPLLEALQDEDSGSAAAALDAALLDVR 166
Query: 268 KNAATLLRDMCKHNLRLANLVVLAGGLDGLLRLIDCNGGYNSLPAIMALGYMSAQSEQIA 327
AA L ++ + L L L+ + A ALG + +++ + A
Sbjct: 167 AAAAEALGEL--GDPEAIPL--LIELLEDEDADVRR-------AAASALGQLGSENVEAA 215
Query: 328 EQIIELQGVGSLQTILQCPHKEQLQVAAAWTLGQIGKHSVDNVTALCEE 376
+ +++ +++ AA LG+IG + V AL +
Sbjct: 216 DLLVKALS----------DESLEVRKAALLALGEIG--DEEAVDALAKA 252
Score = 32.6 bits (74), Expect = 0.44
Identities = 48/205 (23%), Positives = 82/205 (40%), Gaps = 35/205 (17%)
Query: 42 VVDLIVPLLADKVPSIQHAALATLGKLMCAKHVDIRQEVLERNLVALVLKDLTKQNKHYK 101
D ++ LL D+ ++ +A LG+L E V L+ + L+ ++ +
Sbjct: 44 AADELLKLLEDEDLLVRLSAAVALGEL----------GSEEA--VPLLRELLSDEDPRVR 91
Query: 102 KTALHMLTCVAQQDERCAGLIEDKDLDSIISSL-HDFDPDTRANAALALGGLAKDNKHFA 160
A L L + + + ++ L +D + RA AA ALG L
Sbjct: 92 DAAADAL----------GELGDPEAVPPLVELLENDENEGVRAAAARALGKL------GD 135
Query: 161 ERIVQCGALPPLVQILASP-QLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALN 219
ER AL PL++ L S A AL + + E D + IPLLI+ L
Sbjct: 136 ER-----ALDPLLEALQDEDSGSAAAALDAALLDVRAAAAEALGELGDPEAIPLLIELLE 190
Query: 220 NDEVNVKADAFLCLADIAKDSPSLA 244
+++ +V+ A L + ++ A
Sbjct: 191 DEDADVRRAAASALGQLGSENVEAA 215
>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats. This family includes multiple
HEAT repeats.
Length = 88
Score = 36.5 bits (85), Expect = 0.003
Identities = 29/92 (31%), Positives = 42/92 (45%), Gaps = 22/92 (23%)
Query: 134 LHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDG 193
L D DP+ RA AA ALG L ALP L+++L P ++ AAA AL G
Sbjct: 9 LSDPDPEVRAAAARALGELGDPE-----------ALPALLELLKDPDPEVRRAAAEAL-G 56
Query: 194 IVKHSPELAESASDCDVIPLLIKALNNDEVNV 225
+ + +P L++ L +D+ V
Sbjct: 57 KLGDP----------EALPALLELLQDDDDAV 78
Score = 31.2 bits (71), Expect = 0.23
Identities = 25/77 (32%), Positives = 36/77 (46%), Gaps = 13/77 (16%)
Query: 117 RCAGLIEDKD-LDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQI 175
R G + D + L +++ L D DP+ R AA ALG L ALP L+++
Sbjct: 22 RALGELGDPEALPALLELLKDPDPEVRRAAAEALGKLGDPE-----------ALPALLEL 70
Query: 176 LASPQ-LSLKHAAAFAL 191
L ++ AAA AL
Sbjct: 71 LQDDDDAVVRAAAASAL 87
Score = 27.7 bits (62), Expect = 4.5
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 37 LKDLNVVDLIVPLLADKVPSIQHAALATLGKL 68
L D + ++ LL D P ++ AA LGKL
Sbjct: 27 LGDPEALPALLELLKDPDPEVRRAAAEALGKL 58
Score = 27.3 bits (61), Expect = 5.8
Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 12/69 (17%)
Query: 169 LPPLVQILA-SPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKA 227
L L++ L P ++ AAA AL E + +P L++ L + + V+
Sbjct: 1 LEALLEALLSDPDPEVRAAAARAL----------GELGDP-EALPALLELLKDPDPEVRR 49
Query: 228 DAFLCLADI 236
A L +
Sbjct: 50 AAAEALGKL 58
>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat. The HEAT repeat family is
related to armadillo/beta-catenin-like repeats (see
pfam00514). These EZ repeats are found in subunits of
cyanobacterial phycocyanin lyase and other proteins and
probably carry out a scaffolding role.
Length = 55
Score = 34.7 bits (80), Expect = 0.008
Identities = 19/53 (35%), Positives = 25/53 (47%), Gaps = 1/53 (1%)
Query: 139 PDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFAL 191
+ R AALALG LA V LP L+ +L ++ AAA+AL
Sbjct: 1 WEVREAAALALGALAGGGPELLRPAVP-ELLPALLPLLKDDDDEVREAAAWAL 52
Score = 34.0 bits (78), Expect = 0.011
Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
Query: 182 SLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLADI 236
++ AAA AL + PEL A +++P L+ L +D+ V+ A L I
Sbjct: 2 EVREAAALALGALAGGGPELLRPAVP-ELLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 28.6 bits (64), Expect = 1.1
Identities = 13/51 (25%), Positives = 29/51 (56%), Gaps = 2/51 (3%)
Query: 312 AIMALGYMSAQSEQIAEQIIELQGVGSLQTILQCPHKEQLQVAAAWTLGQI 362
A +ALG ++ ++ + + + +L +L+ ++++ AAAW LG+I
Sbjct: 7 AALALGALAGGGPELLRPAVP-ELLPALLPLLKDDD-DEVREAAAWALGRI 55
Score = 28.2 bits (63), Expect = 1.5
Identities = 13/50 (26%), Positives = 21/50 (42%)
Query: 101 KKTALHMLTCVAQQDERCAGLIEDKDLDSIISSLHDFDPDTRANAALALG 150
++ A L +A + L +++ L D D + R AA ALG
Sbjct: 4 REAAALALGALAGGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALG 53
>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat. Approx. 40
amino acid repeat. Tandem repeats form super-helix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. CAUTION: This family does
not contain all known armadillo repeats.
Length = 41
Score = 31.7 bits (73), Expect = 0.054
Identities = 14/36 (38%), Positives = 25/36 (69%)
Query: 156 NKHFAERIVQCGALPPLVQILASPQLSLKHAAAFAL 191
+ + +++ GA+PPLVQ+L+SP ++ AA+AL
Sbjct: 1 SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWAL 36
Score = 28.2 bits (64), Expect = 0.86
Identities = 9/38 (23%), Positives = 14/38 (36%)
Query: 31 PSTIHFLKDLNVVDLIVPLLADKVPSIQHAALATLGKL 68
P + + V +V LL+ +Q A L L
Sbjct: 2 PENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNL 39
>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats. Approx. 40
amino acid repeat. Tandem repeats form superhelix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. Involved in transducing
the Wingless/Wnt signal. In plakoglobin arm repeats bind
alpha-catenin and N-cadherin.
Length = 41
Score = 29.3 bits (67), Expect = 0.34
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 163 IVQCGALPPLVQILASPQLSLKHAAAFAL 191
+V G LP LV++L S + AA+AL
Sbjct: 8 VVDAGGLPALVELLKSEDEEVVKEAAWAL 36
>gnl|CDD|237512 PRK13800, PRK13800, putative oxidoreductase/HEAT repeat-containing
protein; Provisional.
Length = 897
Score = 32.9 bits (75), Expect = 0.46
Identities = 26/108 (24%), Positives = 42/108 (38%), Gaps = 21/108 (19%)
Query: 127 LDSIISSLHDFDPDTRANAALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHA 186
D++ + D DP RA A AL L A L + ++
Sbjct: 777 GDAVRALTGDPDPLVRAAALAALAELGCPPDDVA----------AATAALRASAWQVRQG 826
Query: 187 AAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLCLA 234
AA AL G + ++A +P L++AL + ++V+ A L L
Sbjct: 827 AARALAGA---AADVA--------VPALVEALTDPHLDVRKAAVLALT 863
>gnl|CDD|221393 pfam12044, Metallopep, Putative peptidase family. This family of
proteins is functionally uncharacterized. However, it
does contain an HEXXH motif characteristic of
metallopeptidases. This protein is found in bacteria and
eukaryotes. Proteins in this family are typically
between 625 to 773 amino acids in length.
Length = 419
Score = 32.3 bits (74), Expect = 0.68
Identities = 17/60 (28%), Positives = 22/60 (36%), Gaps = 4/60 (6%)
Query: 487 VSPHYLED--LINRLDQ--FNPQKSSNLPDCKDSGSGNPSGSAQSLQRSEASTTSTASIY 542
P +D +RLD F S LP S + S S + TS+A IY
Sbjct: 360 SRPILPKDECTWHRLDALRFRYHPSFRLPVDGLSAKSDDSPSVYPVGNGGVFITSSAGIY 419
>gnl|CDD|216598 pfam01602, Adaptin_N, Adaptin N terminal region. This family
consists of the N terminal region of various alpha, beta
and gamma subunits of the AP-1, AP-2 and AP-3 adaptor
protein complexes. The adaptor protein (AP) complexes
are involved in the formation of clathrin-coated pits
and vesicles. The N-terminal region of the various
adaptor proteins (APs) is constant by comparison to the
C-terminal which is variable within members of the AP-2
family; and it has been proposed that this constant
region interacts with another uniform component of the
coated vesicles.
Length = 522
Score = 31.8 bits (73), Expect = 0.87
Identities = 21/122 (17%), Positives = 46/122 (37%), Gaps = 4/122 (3%)
Query: 168 ALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKA 227
P + ++L ++ AA A+ + + P+L ++P L + L++ + V +
Sbjct: 114 LAPDIKKLLVDRDPYVRKKAALAILKLYRKDPDLVRD----FLVPELKELLSDKDPGVVS 169
Query: 228 DAFLCLADIAKDSPSLASTIADGATTLAVEHLTYPHGLVQKNAATLLRDMCKHNLRLANL 287
A L +I K+ + + LT + +Q LL + R
Sbjct: 170 AAVALLYEIRKNDRLYLNKLLPLLVRRLCNLLTVCNPWLQVKILRLLTRYAPQDPREPKE 229
Query: 288 VV 289
++
Sbjct: 230 LL 231
>gnl|CDD|170047 PRK09687, PRK09687, putative lyase; Provisional.
Length = 280
Score = 30.5 bits (69), Expect = 1.8
Identities = 33/154 (21%), Positives = 49/154 (31%), Gaps = 21/154 (13%)
Query: 33 TIHFLKDLNVVDLIVPLLADKVPSIQHAALATLGKL-MCAKHVD----IRQEVLERNLVA 87
+ +V L + L + K P + L +L M + D I + + A
Sbjct: 46 VLQLRGGQDVFRLAIELCSSKNPIERDIGADILSQLGMAKRCQDNVFNILNNLALEDKSA 105
Query: 88 LVLKDL-------TKQNKHYKKTALHMLTCVA---QQDERCA---GLIEDKD---LDSII 131
V K+N Y + A + R A L D + +I
Sbjct: 106 CVRASAINATGHRCKKNPLYSPKIVEQSQITAFDKSTNVRFAVAFALSVINDEAAIPLLI 165
Query: 132 SSLHDFDPDTRANAALALGGLAKDNKHFAERIVQ 165
+ L D + D R AA AL DN E V
Sbjct: 166 NLLKDPNGDVRNWAAFALNSNKYDNPDIREAFVA 199
>gnl|CDD|202500 pfam02985, HEAT, HEAT repeat. The HEAT repeat family is related
to armadillo/beta-catenin-like repeats (see pfam00514).
Length = 31
Score = 26.3 bits (59), Expect = 3.2
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 42 VVDLIVPLLADKVPSIQHAALATLGKLM 69
++ L++ LL D P ++ AA LG L
Sbjct: 1 LLPLLLELLNDPDPEVREAAAEALGALA 28
Score = 25.6 bits (57), Expect = 6.2
Identities = 9/29 (31%), Positives = 18/29 (62%)
Query: 210 VIPLLIKALNNDEVNVKADAFLCLADIAK 238
++PLL++ LN+ + V+ A L +A+
Sbjct: 1 LLPLLLELLNDPDPEVREAAAEALGALAE 29
>gnl|CDD|129779 TIGR00696, wecG_tagA_cpsF, bacterial polymer biosynthesis proteins,
WecB/TagA/CpsF family. The WecG member of this
superfamily, believed to be
UDP-N-acetyl-D-mannosaminuronic acid transferase, plays
a role in enterobacterial common antigen (eca) synthesis
in Escherichia coli. Another family member, the Bacillus
subtilis TagA protein, is involved in the biosynthesis
of the cell wall polymer poly(glycerol phosphate). The
third family member, CpsF, CMP-N-acetylneuraminic acid
synthetase has a role in the capsular polysaccharide
biosynthesis pathway [Cell envelope, Biosynthesis and
degradation of surface polysaccharides and
lipopolysaccharides].
Length = 177
Score = 29.4 bits (66), Expect = 3.5
Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 2/105 (1%)
Query: 247 IADGATTLAVEHLTYPHGLVQKNAATLLRDMCKHNLRLANLVVLAGGLDGLLRLIDCN-- 304
DG + L + L+ ++C+ + + L GG +L+ +
Sbjct: 10 TPDGIGVVWGLKLLGYPQQSRVAGPDLMEELCQRAGKEKLPIFLYGGKPDVLQQLKVKLI 69
Query: 305 GGYNSLPAIMALGYMSAQSEQIAEQIIELQGVGSLQTILQCPHKE 349
Y L + A G + + + A I G G + L CP +E
Sbjct: 70 KEYPKLKIVGAFGPLEPEERKAALAKIARSGAGIVFVGLGCPKQE 114
>gnl|CDD|218626 pfam05536, Neurochondrin, Neurochondrin. This family contains
several eukaryotic neurochondrin proteins. Neurochondrin
induces hydroxyapatite resorptive activity in bone
marrow cells resistant to bafilomycin A1, an inhibitor
of macrophage- and osteoclast-mediated resorption.
Expression of the gene is localised to chondrocyte,
osteoblast, and osteocyte in the bone and to the
hippocampus and Purkinje cell layer of cerebellum in the
brain.
Length = 542
Score = 29.7 bits (67), Expect = 4.5
Identities = 35/161 (21%), Positives = 60/161 (37%), Gaps = 23/161 (14%)
Query: 116 ERCAGLIEDKDLDSIISSL---------HDFDPDTRANAALALGGLAKDNKHFAERIVQC 166
E+C L+ ++L D + TR A+G F +R++
Sbjct: 8 EKCLSLLRGARDTEKFAALLLVTKLVKAGDINSKTRRLIFEAVG------FKFLDRLLLT 61
Query: 167 GALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNN-DEVNV 225
G+ P L + A F + PELA + + IP L++ L ++++
Sbjct: 62 GSGPSGCPRHVYLDLGVTILACFCSE------PELARHSQVVNRIPTLLECLLKSSDLSM 115
Query: 226 KADAFLCLADIAKDSPSLASTIADGA-TTLAVEHLTYPHGL 265
D + CL+ IA + +A GA L + GL
Sbjct: 116 VEDCYECLSAIAATPRGAKALLALGAVPALCQAYSNGSFGL 156
>gnl|CDD|128837 smart00567, EZ_HEAT, E-Z type HEAT repeats. Present in subunits of
cyanobacterial phycocyanin lyase, and other proteins.
Probable scaffolding role.
Length = 30
Score = 25.8 bits (58), Expect = 4.7
Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 11/41 (26%)
Query: 181 LSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNND 221
++H AAFAL G + + +P LIKAL ++
Sbjct: 1 PLVRHEAAFAL-GQLGDE----------EAVPALIKALEDE 30
>gnl|CDD|215936 pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain. This
family is involved in biosynthesis of guanosine
nucleotide. Members of this family contain a TIM barrel
structure. In the inosine monophosphate dehydrogenases 2
CBS domains pfam00571 are inserted in the TIM barrel.
This family is a member of the common phosphate binding
site TIM barrel family.
Length = 467
Score = 29.5 bits (67), Expect = 4.8
Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 1/52 (1%)
Query: 38 KDLNVVDLIVPLLADKVPSIQHAALATLGKLMCAKHVDIRQEVLERNLVALV 89
D +V LI +K +A+ G+L+ V R++ LER ALV
Sbjct: 182 DDGELVGLITRKDIEKARDYPNASKDAQGRLLVGAAVGTREDDLER-AEALV 232
>gnl|CDD|227427 COG5096, COG5096, Vesicle coat complex, various subunits
[Intracellular trafficking and secretion].
Length = 757
Score = 29.7 bits (67), Expect = 4.8
Identities = 26/124 (20%), Positives = 51/124 (41%), Gaps = 13/124 (10%)
Query: 116 ERCAGLIEDKDLDSIISSLHDF---DPDTRANAALALGGLAKDNKHFAERIVQCGALPPL 172
ER A L + L ++ + D + + R A L L + E + + P+
Sbjct: 80 ERYAKLKPELALLAVNTIQKDLQDPNEEIRGFALRTLSLL-RVK----ELLGN--IIDPI 132
Query: 173 VQILASPQLSLKHAAAFALDGIVKHSPELAESASDCDVIPLLIKALNNDEVNVKADAFLC 232
++L P ++ AA A+ + + +L D +L + + + + V A+A
Sbjct: 133 KKLLTDPHAYVRKTAALAVAKLYRLDKDLYHELGLID---ILKELVADSDPIVIANALAS 189
Query: 233 LADI 236
LA+I
Sbjct: 190 LAEI 193
Score = 29.3 bits (66), Expect = 5.9
Identities = 29/119 (24%), Positives = 48/119 (40%), Gaps = 12/119 (10%)
Query: 85 LVALVLKDLTKQNKHYKKTALHMLTCVAQQDERCAGLIEDKDLDSIISSLHDFDPDTRAN 144
V + KDL N+ + AL L+ + ++ + +D I L D R
Sbjct: 93 AVNTIQKDLQDPNEEIRGFALRTLSLLRVKE------LLGNIIDPIKKLLTDPHAYVRKT 146
Query: 145 AALALGGLAKDNKHFAERIVQCGALPPLVQILASPQLSLKHAAAFALDGIVKHSPELAE 203
AALA+ L + +K + G + L +++A + A +L I PELA
Sbjct: 147 AALAVAKLYRLDKD---LYHELGLIDILKELVADSDPIVIANALASLAEID---PELAH 199
>gnl|CDD|236718 PRK10574, PRK10574, putative major pilin subunit; Provisional.
Length = 146
Score = 27.7 bits (62), Expect = 8.9
Identities = 14/39 (35%), Positives = 21/39 (53%), Gaps = 2/39 (5%)
Query: 302 DCNGGYNSLPAIMALGYMSAQSEQIAEQIIELQGVGSLQ 340
C+ G N +P+ Y+SA S +A+ +I L G SL
Sbjct: 65 TCDAGSNGIPSPTTSRYVSAMS--VAKGVITLTGQESLN 101
>gnl|CDD|99876 cd06066, H2MP_NAD-link-bidir, Endopeptidases that belong to the
bidirectional NAD-linked hydrogenase group. This group
of endopeptidases are highly specific carboxyl-terminal
protease (HoxW protease) which releases a 24-amino-acid
peptide from HoxH prior to progression of subunit
assembly. These bidirectional hydrogenases are
heteropentamers encoded by the hox (hydrogen oxidation)
genes, in which complex HoxEFU shows the diaphorase
activity, and HoxYH constitutes the NiFe-hydrogenase.
Length = 139
Score = 27.6 bits (62), Expect = 9.0
Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 2/33 (6%)
Query: 158 HFAERIVQCGALPPLVQILASPQLSLKHAAAFA 190
AERI + L P V++LA QL+ + A A
Sbjct: 18 AVAERIEEW--LLPGVEVLAVHQLTPELAEDLA 48
>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
Length = 307
Score = 28.5 bits (64), Expect = 9.8
Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 4/46 (8%)
Query: 319 MSAQSEQIAEQ----IIELQGVGSLQTILQCPHKEQLQVAAAWTLG 360
M+AQ+E +A Q I L G+G L ++Q P L AA +T G
Sbjct: 117 MAAQTELMAAQRENGISMLGGIGCLPLLIQMPFFSALYFAAQYTKG 162
>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
Length = 131
Score = 27.6 bits (61), Expect = 10.0
Identities = 11/35 (31%), Positives = 18/35 (51%), Gaps = 1/35 (2%)
Query: 147 LALGGLAKDNKHFAERIVQCGALPPL-VQILASPQ 180
L AK++ A RI++ + + VQ+ A PQ
Sbjct: 9 DELKDQAKESNKIASRILKIKGIKRIVVQLNAVPQ 43
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.132 0.382
Gapped
Lambda K H
0.267 0.0885 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,635,861
Number of extensions: 2614693
Number of successful extensions: 2346
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2318
Number of HSP's successfully gapped: 71
Length of query: 542
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 440
Effective length of database: 6,413,494
Effective search space: 2821937360
Effective search space used: 2821937360
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (27.0 bits)