RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10729
(363 letters)
>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
trafficking and secretion].
Length = 526
Score = 325 bits (835), Expect = e-108
Identities = 154/293 (52%), Positives = 192/293 (65%), Gaps = 2/293 (0%)
Query: 5 ATRSARKMLSKERHPPIDELIEAGVVPICVELLDDENPNT-QFEATWALTNIASGTSEQT 63
A RK+LSKE PPI +I+AGVVP VE +D+ + QFEA WALTNIASGT++QT
Sbjct: 91 AVYKFRKLLSKETSPPIQPVIDAGVVPRFVEFMDEIQRDMLQFEAAWALTNIASGTTQQT 150
Query: 64 MTVINANAIPKFLQLLSSPHLNLAEQATWALGNIAGDGARARDLLLGLGTMPQILALVQP 123
V++A A+P F+QLLSS ++ EQA WALGNIAGD RD +L G + +L L+
Sbjct: 151 KVVVDAGAVPLFIQLLSSTEDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGLLLS 210
Query: 124 NTPT-TFLRNIVWAISNLCRNKNPAPDFEKIKICLPLLNQLIHTTDVETLSDICWALSYI 182
+ + LRN W +SNLCR KNP PD+ I LP+L +LI++ D E L D CWA+SY+
Sbjct: 211 SAIHISMLRNATWTLSNLCRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDACWAISYL 270
Query: 183 SDGHNDKIQAVVDTGVVPRLVELLDSEETTILTAALRTVGNIATGNDHQTDCVIQAGGLQ 242
SDG N+KIQAV+D G+ RLVELL E I T ALR+VGNI TG+D QT +I G L+
Sbjct: 271 SDGPNEKIQAVLDVGIPGRLVELLSHESAKIQTPALRSVGNIVTGSDDQTQVIINCGALK 330
Query: 243 KMKKLLSSSRVNIVKEAAWTISNITAGNSRQIDHVIQEGLDNLLEIALQVGEL 295
+ LLSS + NI KEA WTISNITAGN+ QI VI L L L E
Sbjct: 331 AFRSLLSSPKENIRKEACWTISNITAGNTEQIQAVIDANLIPPLIHLLSSAEY 383
Score = 30.2 bits (68), Expect = 1.7
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 21 IDELIEAGVVPICVELLDDENPNTQFEATWALTNIASG 58
I +I+A ++P + LL + EA WA++N SG
Sbjct: 362 IQAVIDANLIPPLIHLLSSAEYKIKKEACWAISNATSG 399
>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
40 amino acid long tandemly repeated sequence motif
first identified in the Drosophila segment polarity gene
armadillo; these repeats were also found in the
mammalian armadillo homolog beta-catenin, the junctional
plaque protein plakoglobin, the adenomatous polyposis
coli (APC) tumor suppressor protein, and a number of
other proteins. ARM has been implicated in mediating
protein-protein interactions, but no common features
among the target proteins recognized by the ARM repeats
have been identified; related to the HEAT domain; three
consecutive copies of the repeat are represented by this
alignment model.
Length = 120
Score = 108 bits (271), Expect = 7e-29
Identities = 38/121 (31%), Positives = 66/121 (54%), Gaps = 1/121 (0%)
Query: 22 DELIEAGVVPICVELLDDENPNTQFEATWALTNIASGTSEQTMTVINANAIPKFLQLLSS 81
+ +I+AG +P V LL + N Q EA WAL+N+++G ++ V+ A +P +QLL S
Sbjct: 1 EAVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60
Query: 82 PHLNLAEQATWALGNIAGDGARARDLLLGLGTMPQILALVQPNTPTTFLRNIVWAISNLC 141
+ + A WAL N+A + ++L G +P+++ L+ + +N A+SNL
Sbjct: 61 EDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSS-NEDIQKNATGALSNLA 119
Query: 142 R 142
Sbjct: 120 S 120
Score = 106 bits (267), Expect = 2e-28
Identities = 46/113 (40%), Positives = 69/113 (61%)
Query: 156 CLPLLNQLIHTTDVETLSDICWALSYISDGHNDKIQAVVDTGVVPRLVELLDSEETTILT 215
LP L L+ ++D + WALS +S G+ND IQAVV+ G +P LV+LL SE+ ++
Sbjct: 8 GLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVK 67
Query: 216 AALRTVGNIATGNDHQTDCVIQAGGLQKMKKLLSSSRVNIVKEAAWTISNITA 268
AAL + N+A G + V++AGG+ K+ LL SS +I K A +SN+ +
Sbjct: 68 AALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLAS 120
Score = 83.9 bits (208), Expect = 5e-20
Identities = 39/141 (27%), Positives = 61/141 (43%), Gaps = 25/141 (17%)
Query: 191 QAVVDTGVVPRLVELLDSEETTILTAALRTVGNIATGNDHQTDCVIQAGGLQKMKKLLSS 250
+AV+ G +P LV LL S + + A + N++ GN+ V++AGGL + +LL S
Sbjct: 1 EAVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60
Query: 251 SRVNIVKEAAWTISNITAGNSRQIDHVIQEGLDNLLEIALQVGELNQMALLVEESGGLEK 310
+VK A W + N+ AG L+V E+GG+ K
Sbjct: 61 EDEEVVKAALWALRNLAAGPEDN-------------------------KLIVLEAGGVPK 95
Query: 311 LEALQHHENETVYQKCYKIIS 331
L L NE + + +S
Sbjct: 96 LVNLLDSSNEDIQKNATGALS 116
Score = 80.1 bits (198), Expect = 2e-18
Identities = 31/86 (36%), Positives = 49/86 (56%)
Query: 13 LSKERHPPIDELIEAGVVPICVELLDDENPNTQFEATWALTNIASGTSEQTMTVINANAI 72
LS + I ++EAG +P V+LL E+ A WAL N+A+G + + V+ A +
Sbjct: 34 LSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALWALRNLAAGPEDNKLIVLEAGGV 93
Query: 73 PKFLQLLSSPHLNLAEQATWALGNIA 98
PK + LL S + ++ + AT AL N+A
Sbjct: 94 PKLVNLLDSSNEDIQKNATGALSNLA 119
Score = 38.1 bits (89), Expect = 0.001
Identities = 18/37 (48%), Positives = 23/37 (62%)
Query: 21 IDELIEAGVVPICVELLDDENPNTQFEATWALTNIAS 57
++EAG VP V LLD N + Q AT AL+N+AS
Sbjct: 84 KLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLAS 120
>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat. Approx. 40
amino acid repeat. Tandem repeats form super-helix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. CAUTION: This family
does not contain all known armadillo repeats.
Length = 41
Score = 52.5 bits (127), Expect = 1e-09
Identities = 19/40 (47%), Positives = 25/40 (62%)
Query: 59 TSEQTMTVINANAIPKFLQLLSSPHLNLAEQATWALGNIA 98
+ E VI A A+P +QLLSSP + E+A WAL N+A
Sbjct: 1 SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNLA 40
Score = 50.9 bits (123), Expect = 6e-09
Identities = 17/39 (43%), Positives = 23/39 (58%)
Query: 19 PPIDELIEAGVVPICVELLDDENPNTQFEATWALTNIAS 57
+IEAG VP V+LL + Q EA WAL+N+A+
Sbjct: 3 ENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNLAA 41
Score = 46.7 bits (112), Expect = 2e-07
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 228 NDHQTDCVIQAGGLQKMKKLLSSSRVNIVKEAAWTISNITA 268
+ VI+AG + + +LLSS + +EAAW +SN+ A
Sbjct: 1 SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNLAA 41
Score = 45.5 bits (109), Expect = 4e-07
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 186 HNDKIQAVVDTGVVPRLVELLDSEETTILTAALRTVGNIAT 226
+ QAV++ G VP LV+LL S + + A + N+A
Sbjct: 1 SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNLAA 41
Score = 35.1 bits (82), Expect = 0.002
Identities = 10/29 (34%), Positives = 16/29 (55%)
Query: 156 CLPLLNQLIHTTDVETLSDICWALSYISD 184
+P L QL+ + D E + WALS ++
Sbjct: 13 AVPPLVQLLSSPDEEVQEEAAWALSNLAA 41
>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats. Approx. 40
amino acid repeat. Tandem repeats form superhelix of
helices that is proposed to mediate interaction of
beta-catenin with its ligands. Involved in transducing
the Wingless/Wnt signal. In plakoglobin arm repeats
bind alpha-catenin and N-cadherin.
Length = 41
Score = 51.3 bits (124), Expect = 5e-09
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 59 TSEQTMTVINANAIPKFLQLLSSPHLNLAEQATWALGNIA 98
E V++A +P ++LL S + ++A WAL N++
Sbjct: 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLS 40
Score = 46.3 bits (111), Expect = 3e-07
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 17 RHPPIDELIEAGVVPICVELLDDENPNTQFEATWALTNIAS 57
+++AG +P VELL E+ EA WAL+N++S
Sbjct: 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Score = 44.7 bits (107), Expect = 9e-07
Identities = 17/41 (41%), Positives = 26/41 (63%)
Query: 228 NDHQTDCVIQAGGLQKMKKLLSSSRVNIVKEAAWTISNITA 268
+D V+ AGGL + +LL S +VKEAAW +SN+++
Sbjct: 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Score = 44.0 bits (105), Expect = 2e-06
Identities = 16/40 (40%), Positives = 25/40 (62%)
Query: 186 HNDKIQAVVDTGVVPRLVELLDSEETTILTAALRTVGNIA 225
++ QAVVD G +P LVELL SE+ ++ A + N++
Sbjct: 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLS 40
Score = 30.5 bits (70), Expect = 0.093
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 156 CLPLLNQLIHTTDVETLSDICWALSYISD 184
LP L +L+ + D E + + WALS +S
Sbjct: 13 GLPALVELLKSEDEEVVKEAAWALSNLSS 41
>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats. This family includes multiple
HEAT repeats.
Length = 88
Score = 40.8 bits (96), Expect = 7e-05
Identities = 24/91 (26%), Positives = 35/91 (38%), Gaps = 21/91 (23%)
Query: 21 IDELIEAGVVPICVELLDDENPNTQFEATWALTNIASGTSEQTMTVINANAIPKFLQLLS 80
++ L+EA LL D +P + A AL + + A+P L+LL
Sbjct: 1 LEALLEA--------LLSDPDPEVRAAAARALGELG-----------DPEALPALLELLK 41
Query: 81 SPHLNLAEQATWALGNIAGDGARARDLLLGL 111
P + A ALG + A LL L
Sbjct: 42 DPDPEVRRAAAEALGKLGDP--EALPALLEL 70
Score = 34.2 bits (79), Expect = 0.013
Identities = 22/103 (21%), Positives = 35/103 (33%), Gaps = 22/103 (21%)
Query: 158 PLLNQLIHTTDVETLSDICWALSYISDGHNDKIQAVVDTGVVPRLVELLDSEETTILTAA 217
LL L+ D E + AL + D +P L+ELL + + AA
Sbjct: 3 ALLEALLSDPDPEVRAAAARALGELGD-----------PEALPALLELLKDPDPEVRRAA 51
Query: 218 LRTVGNIATGNDHQTDCVIQAGGLQKMKKLLSSSRVNIVKEAA 260
+G + L + +LL +V+ AA
Sbjct: 52 AEALGKLGDPEA-----------LPALLELLQDDDDAVVRAAA 83
Score = 33.9 bits (78), Expect = 0.020
Identities = 19/74 (25%), Positives = 28/74 (37%), Gaps = 12/74 (16%)
Query: 23 ELIEAGVVPICVELLDDENPNTQFEATWALTNIASGTSEQTMTVINANAIPKFLQLL-SS 81
EL + +P +ELL D +P + A AL + A+P L+LL
Sbjct: 26 ELGDPEALPALLELLKDPDPEVRRAAAEALGKLGD-----------PEALPALLELLQDD 74
Query: 82 PHLNLAEQATWALG 95
+ A AL
Sbjct: 75 DDAVVRAAAASALA 88
>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat. The HEAT repeat family is
related to armadillo/beta-catenin-like repeats (see
pfam00514). These EZ repeats are found in subunits of
cyanobacterial phycocyanin lyase and other proteins and
probably carry out a scaffolding role.
Length = 55
Score = 39.7 bits (93), Expect = 8e-05
Identities = 17/56 (30%), Positives = 22/56 (39%), Gaps = 1/56 (1%)
Query: 42 PNTQFEATWALTNIASGTSEQTMTVINANAIPKFLQLLSSPHLNLAEQATWALGNI 97
+ A AL +A G E + +P L LL + E A WALG I
Sbjct: 1 WEVREAAALALGALAGGGPE-LLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 29.3 bits (66), Expect = 0.32
Identities = 12/49 (24%), Positives = 24/49 (48%), Gaps = 1/49 (2%)
Query: 176 CWALSYISDGHNDKIQAVVDTGVVPRLVELLDSEETTILTAALRTVGNI 224
AL ++ G + ++ V ++P L+ LL ++ + AA +G I
Sbjct: 8 ALALGALAGGGPELLRPAVPE-LLPALLPLLKDDDDEVREAAAWALGRI 55
Score = 27.4 bits (61), Expect = 1.6
Identities = 12/51 (23%), Positives = 19/51 (37%), Gaps = 1/51 (1%)
Query: 216 AALRTVGNIATGNDHQTDCVIQAGGLQKMKKLLSSSRVNIVKEAAWTISNI 266
AA +G +A G + L + LL + + AAW + I
Sbjct: 6 AAALALGALAGGGPELLRPAVPEL-LPALLPLLKDDDDEVREAAAWALGRI 55
Score = 25.5 bits (56), Expect = 9.8
Identities = 14/56 (25%), Positives = 22/56 (39%), Gaps = 8/56 (14%)
Query: 88 EQATWALGNIAGDGARARDLLLGLGTMPQILALVQP--NTPTTFLR-NIVWAISNL 140
E A ALG +AG G + P++L + P +R WA+ +
Sbjct: 5 EAAALALGALAGGGPELLRPAV-----PELLPALLPLLKDDDDEVREAAAWALGRI 55
>gnl|CDD|215629 PLN03200, PLN03200, cellulose synthase-interactive protein;
Provisional.
Length = 2102
Score = 37.4 bits (87), Expect = 0.011
Identities = 28/101 (27%), Positives = 47/101 (46%), Gaps = 2/101 (1%)
Query: 25 IEAGVVPICVELLDDENPNTQFEATWALTNIASGTSEQTMTVINANAIPKFLQLLSSPH- 83
+EAG V I V+LL N + Q A L + V++A A+ + L+LL +
Sbjct: 186 LEAGGVDILVKLLSSGNSDAQANAASLLARLMMAFESSISKVLDAGAVKQLLKLLGQGNE 245
Query: 84 LNLAEQATWALGNIAGDGARARDLLLGLGTMPQ-ILALVQP 123
+++ +A AL ++ A+ + G +P I A V P
Sbjct: 246 VSVRAEAAGALEALSSQSKEAKQAIADAGGIPALINATVAP 286
Score = 33.2 bits (76), Expect = 0.30
Identities = 53/216 (24%), Positives = 94/216 (43%), Gaps = 30/216 (13%)
Query: 114 MPQILALVQPNTPTTFLRNIVWAISNLCRNKNPAPDFEKIKI----CLPLLNQLIHTTDV 169
MP +++L++ T + N + LC+ ++ ++K+ C+P L L+ +
Sbjct: 60 MPLLVSLLRSGTLGAKV-NAAAVLGVLCKEEDL-----RVKVLLGGCIPPLLSLLKSGSA 113
Query: 170 ETLSDICWALSYISDGHND-----KIQAVVDTGVVPRLVELLDSEETT------ILTAAL 218
E A+ +S G KI GVVP L + L +LT AL
Sbjct: 114 EAQKAAAEAIYAVSSGGLSDHVGSKI--FSTEGVVPSLWDQLQPGNKQDKVVEGLLTGAL 171
Query: 219 RTVGNIATGNDHQTDCVIQAGGLQKMKKLLSSSRVNIVKEAAWTISNITAGNSRQIDHVI 278
R N+ D ++AGG+ + KLLSS + AA ++ + I V+
Sbjct: 172 R---NLCGSTDGFWSATLEAGGVDILVKLLSSGNSDAQANAASLLARLMMAFESSISKVL 228
Query: 279 QEG-LDNLLEIALQVGELNQMALLVEESGGLEKLEA 313
G + LL++ +G+ N++++ E +G LE L +
Sbjct: 229 DAGAVKQLLKL---LGQGNEVSVRAEAAGALEALSS 261
Score = 32.0 bits (73), Expect = 0.64
Identities = 29/101 (28%), Positives = 46/101 (45%), Gaps = 12/101 (11%)
Query: 9 ARKMLSKERHPPIDELIEAGVVPICVELLDDENP-NTQFEATWALTNIASGTSEQTMTVI 67
AR M++ E I ++++AG V ++LL N + + EA AL ++S + E +
Sbjct: 214 ARLMMAFES--SISKVLDAGAVKQLLKLLGQGNEVSVRAEAAGALEALSSQSKEAKQAIA 271
Query: 68 NANAIPKFLQLLSSPHLN---------LAEQATWALGNIAG 99
+A IP + +P L E A AL NI G
Sbjct: 272 DAGGIPALINATVAPSKEFMQGEFAQALQENAMGALANICG 312
Score = 31.6 bits (72), Expect = 0.76
Identities = 21/88 (23%), Positives = 41/88 (46%), Gaps = 1/88 (1%)
Query: 22 DELIEAGVVPICVELLDDENPNTQFEATWALTNIASGTSE-QTMTVINANAIPKFLQLLS 80
+ L ++ C++LL + ++ AL ++ E + ++ +AI ++L
Sbjct: 645 ESLATDEIINPCIKLLTNNTEAVATQSARALAALSRSIKENRKVSYAAEDAIKPLIKLAK 704
Query: 81 SPHLNLAEQATWALGNIAGDGARARDLL 108
S + +AEQA AL N+ D A + L
Sbjct: 705 SSSIEVAEQAVCALANLLSDPEVAAEAL 732
>gnl|CDD|224331 COG1413, COG1413, FOG: HEAT repeat [Energy production and
conversion].
Length = 335
Score = 33.4 bits (76), Expect = 0.17
Identities = 17/69 (24%), Positives = 28/69 (40%), Gaps = 11/69 (15%)
Query: 29 VVPICVELLDDENPNTQFEATWALTNIASGTSEQTMTVINANAIPKFLQLLSSPHLNLAE 88
++LL+DE+ + A AL + S A+P +LLS + +
Sbjct: 44 AADELLKLLEDEDLLVRLSAAVALGELGS-----------EEAVPLLRELLSDEDPRVRD 92
Query: 89 QATWALGNI 97
A ALG +
Sbjct: 93 AAADALGEL 101
>gnl|CDD|202500 pfam02985, HEAT, HEAT repeat. The HEAT repeat family is related
to armadillo/beta-catenin-like repeats (see pfam00514).
Length = 31
Score = 29.4 bits (67), Expect = 0.19
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 29 VVPICVELLDDENPNTQFEATWALTNIAS 57
++P+ +ELL+D +P + A AL +A
Sbjct: 1 LLPLLLELLNDPDPEVREAAAEALGALAE 29
Score = 25.9 bits (58), Expect = 3.0
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 72 IPKFLQLLSSPHLNLAEQATWALGNIA 98
+P L+LL+ P + E A ALG +A
Sbjct: 2 LPLLLELLNDPDPEVREAAAEALGALA 28
>gnl|CDD|147776 pfam05804, KAP, Kinesin-associated protein (KAP). This family
consists of several eukaryotic kinesin-associated (KAP)
proteins. Kinesins are intracellular multimeric
transport motor proteins that move cellular cargo on
microtubule tracks. It has been shown that the sea
urchin KRP85/95 holoenzyme associates with a KAP115
non-motor protein, forming a heterotrimeric complex in
vitro, called the Kinesin-II.
Length = 708
Score = 33.3 bits (76), Expect = 0.19
Identities = 32/151 (21%), Positives = 62/151 (41%), Gaps = 18/151 (11%)
Query: 157 LPLLNQLIHTTDVETLSDICWALSYISDGHNDKIQAVVDTGVVPRLVELLDSEETTILTA 216
L L H + + + LS+ + + G++P+LV LLD++ +
Sbjct: 332 LLKLFPCQHEDLLNITLRLLFNLSFDTGLRPKMVNG----GLLPKLVSLLDNDNHHGI-- 385
Query: 217 ALRTVGNIATGND-----HQTDCVIQAGGLQKMKKLLSSSRVNIVKEAAWTISNITAGNS 271
AL + +++ + TDC+ MK +L + + E N+ N
Sbjct: 386 ALCVLYHLSCDDKAKSMFAYTDCIPML-----MKMVLEGTGERVDLELIALCINLAL-NK 439
Query: 272 RQIDHVIQ-EGLDNLLEIALQVGELNQMALL 301
R + + +GLD L+E AL+ + M ++
Sbjct: 440 RNAQLICEGQGLDLLMERALKFRDPLLMKMI 470
>gnl|CDD|227540 COG5215, KAP95, Karyopherin (importin) beta [Intracellular
trafficking and secretion].
Length = 858
Score = 29.9 bits (67), Expect = 2.7
Identities = 30/135 (22%), Positives = 42/135 (31%), Gaps = 26/135 (19%)
Query: 48 ATWALTNIASGTSEQTMTVINANAIPKFLQLLSSPHLNLAEQATWALGNIAGDGARARD- 106
A A ++ G E +T I A+P +S L + W G IA A
Sbjct: 386 AVMAFGSVMHGPCEDCLTKIVPQALPGIENEMSDSCLWVKSTTAWCFGAIADHVAMIISP 445
Query: 107 ----------LLLGLGTMPQILALVQPNTPTTFLRNIVWAISNLCRNKNPAPDFEK---I 153
L+GL P N W NL + A +
Sbjct: 446 CGHLVLEVSASLIGLMDCPFRSI------------NCSWRKENLVDHIAKAVREVESFLA 493
Query: 154 KICLPLLNQLIHTTD 168
K L +LN L+ T+
Sbjct: 494 KFYLAILNALVKGTE 508
>gnl|CDD|238159 cd00256, VATPase_H, VATPase_H, regulatory vacuolar ATP synthase
subunit H (Vma13p); activation component of the
peripheral V1 complex of V-ATPase, a heteromultimeric
enzyme which uses ATP to actively transport protons
into organelles and extracellular compartments. The
topology is that of a superhelical spiral, in part the
geometry is similar to superhelices composed of
armadillo repeat motifs, as found in importins for
example.
Length = 429
Score = 29.3 bits (66), Expect = 3.9
Identities = 35/212 (16%), Positives = 77/212 (36%), Gaps = 49/212 (23%)
Query: 104 ARDLLLGLGTMPQILALVQPNTPTTFLRNIVWAISNLCRNKNPAPDFEKIKICLPLL-NQ 162
A ++L L + + +++ +T +R ++ NL K L ++ +
Sbjct: 220 AAEVLKRLSLIQDLSDILKESTKEKVIRIVLAIFRNLI--SKRVDREVKKTAALQMVQCK 277
Query: 163 LIHTTDVETLS-------DICWALSYISDGHNDKIQ---------AVVDTG--------- 197
++ T +++L D+ L ++++ + +Q + + +G
Sbjct: 278 VLKT--LQSLEQRKYDDEDLTDDLKFLTEELKNSVQDLSSFDEYKSELRSGRLHWSPVHK 335
Query: 198 ------------------VVPRLVELLD-SEETTILTAALRTVGNIATGNDHQTDCVIQA 238
++ L+ LL+ S + IL A +G D V Q
Sbjct: 336 SEKFWRENADRLNEKNYELLKILIHLLETSVDPIILAVACHDIGEYVRHYPRGKDVVEQL 395
Query: 239 GGLQKMKKLLSSSRVNIVKEAAWTISNITAGN 270
GG Q++ +LL+ N+ EA + + N
Sbjct: 396 GGKQRVMRLLNHEDPNVRYEALLAVQKLMVHN 427
>gnl|CDD|143386 cd00141, NT_POLXc, Nucleotidyltransferase (NT) domain of family X
DNA Polymerases. X family polymerases fill in short
gaps during DNA repair. They are relatively inaccurate
enzymes and play roles in base excision repair, in
non-homologous end joining (NHEJ) which acts mainly to
repair damage due to ionizing radiation, and in V(D)J
recombination. This family includes eukaryotic Pol beta,
Pol lambda, Pol mu, and terminal deoxyribonucleotidyl
transferase (TdT). Pol beta and Pol lambda are primarily
DNA template-dependent polymerases. TdT is a DNA
template-independent polymerase. Pol mu has both
template dependent and template independent activities.
This subgroup belongs to the Pol beta-like NT
superfamily. In the majority of enzymes in this
superfamily, two carboxylates, Dx[D/E], together with a
third more distal carboxylate, coordinate two divalent
metal cations involved in a two-metal ion mechanism of
nucleotide addition. These three carboxylate residues
are fairly well conserved in this family.
Length = 307
Score = 29.1 bits (66), Expect = 4.1
Identities = 12/50 (24%), Positives = 24/50 (48%), Gaps = 1/50 (2%)
Query: 165 HTTDVETLSDICWALSYISDGHNDKIQAVVDTGVVPRLVELLDSEETTIL 214
+E+L + L I +KI+ +++TG + +L EL + +L
Sbjct: 37 LPEPIESLEEAK-KLPGIGKKIAEKIEEILETGKLRKLEELREDVPPGLL 85
>gnl|CDD|216598 pfam01602, Adaptin_N, Adaptin N terminal region. This family
consists of the N terminal region of various alpha, beta
and gamma subunits of the AP-1, AP-2 and AP-3 adaptor
protein complexes. The adaptor protein (AP) complexes
are involved in the formation of clathrin-coated pits
and vesicles. The N-terminal region of the various
adaptor proteins (APs) is constant by comparison to the
C-terminal which is variable within members of the AP-2
family; and it has been proposed that this constant
region interacts with another uniform component of the
coated vesicles.
Length = 522
Score = 29.1 bits (66), Expect = 4.7
Identities = 45/274 (16%), Positives = 83/274 (30%), Gaps = 76/274 (27%)
Query: 23 ELIEAGVVPICVELLDDENPNTQFEATWALTNIASGTSEQTMTVINANAIPKFLQLLSSP 82
+L+ +VP ELL D++P A L I + + K L LL
Sbjct: 146 DLVRDFLVPELKELLSDKDPGVVSAAVALLYEIRK---------NDRLYLNKLLPLLV-- 194
Query: 83 HLNLAEQATWALGNIAGDGARARDLLLGLGTMPQILALVQPNTPTTFLRNIVWAISNLCR 142
R L L L + I+ ++
Sbjct: 195 ----------------------RRLCNLLTVCNPWLQVK-----------ILRLLTRYAP 221
Query: 143 NKNPAPDFEKIKICLPLLNQLIHTTD------VETLSDICWALSYISDGHNDKIQAVVDT 196
P ++ +LN L ++ + V+T+ + D + I V
Sbjct: 222 QDPREPK----ELLEDILNLLQNSNNAVLYEAVKTIIHL--------DPEPELIVLAV-- 267
Query: 197 GVVPRLVELLDSEETTILTAALRTVGNIATGNDHQ-TDCVIQAGGLQKMKKLLSSSRVNI 255
L LL S + + ALR + I + + + L + ++I
Sbjct: 268 ---NALGRLLSSPDENLRYVALRNLNKILEKHPPAVQHLDL------IIFCLKTDDDISI 318
Query: 256 VKEAAWTISNITAGNSRQIDHVIQEGLDNLLEIA 289
A + + + + +++E L + EIA
Sbjct: 319 RLRALDLLYKL--VDESNVKEIVKELLKYVSEIA 350
>gnl|CDD|173953 cd08194, Fe-ADH6, Iron-containing alcohol dehydrogenases-like.
Iron-containing alcohol dehydrogenase-like. Proteins of
this family have not been characterized. Their specific
function is unknown. The protein structure represents a
dehydroquinate synthase-like fold and belongs to the
alcohol dehydrogenase-like superfamily. They are
distinct from other alcohol dehydrogenases which contain
different protein domains. Alcohol dehydrogenase
catalyzes the reduction of acetaldehyde to alcohol with
NADP as cofactor. Its activity requires iron ions.
Length = 375
Score = 28.0 bits (63), Expect = 8.3
Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 10/61 (16%)
Query: 188 DKIQAVVDTGVVPRLVELLDSE--ETTILTAALR--TVGNIATG----NDHQTDCVIQAG 239
DK+ +V G+V +L + L E E+ I + T ++ G + D +I G
Sbjct: 31 DKV--MVKLGLVDKLTDSLKKEGIESAIFDDVVSEPTDESVEEGVKLAKEGGCDVIIALG 88
Query: 240 G 240
G
Sbjct: 89 G 89
>gnl|CDD|133161 cd06097, Aspergillopepsin_like, Aspergillopepsin_like, aspartic
proteases of fungal origin. The members of this family
are aspartic proteases of fungal origin, including
aspergillopepsin, rhizopuspepsin, endothiapepsin, and
rodosporapepsin. The various fungal species in this
family may be the most economically important genus of
fungi. They may serve as virulence factors or as
industrial aids. For example, Aspergillopepsin from A.
fumigatus is involved in invasive aspergillosis owing to
its elastolytic activity and Aspergillopepsins from the
mold A. saitoi are used in fermentation industry.
Aspartic proteinases are a group of proteolytic enzymes
in which the scissile peptide bond is attacked by a
nucleophilic water molecule activated by two aspartic
residues in a DT(S)G motif at the active site. They have
a similar fold composed of two beta-barrel domains.
Between the N-terminal and C-terminal domains, each of
which contributes one catalytic aspartic residue, there
is an extended active-site cleft capable of interacting
with multiple residues of a substrate. Although members
of the aspartic protease family of enzymes have very
similar three-dimensional structures and catalytic
mechanisms, each has unique substrate specificity. The
members of this family has an optimal acidic pH (5.5)
and cleaves protein substrates with similar specificity
to that of porcine pepsin A, preferring hydrophobic
residues at P1 and P1' in the cleave site. This family
of aspartate proteases is classified by MEROPS as the
peptidase family A1 (pepsin A, clan AA).
Length = 278
Score = 27.7 bits (62), Expect = 8.7
Identities = 14/55 (25%), Positives = 19/55 (34%), Gaps = 10/55 (18%)
Query: 106 DLLLGLGTMPQILALVQPNTPTTFLRNI-------VWAISNLCRNKNPAPDFEKI 153
D LLGL + VQP TF N ++ +L + F I
Sbjct: 107 DGLLGLA--FSSINTVQPPKQKTFFENALSSLDAPLFTA-DLRKAAPGFYTFGYI 158
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.132 0.380
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,267,690
Number of extensions: 1750446
Number of successful extensions: 1603
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1564
Number of HSP's successfully gapped: 59
Length of query: 363
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 265
Effective length of database: 6,590,910
Effective search space: 1746591150
Effective search space used: 1746591150
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (26.8 bits)