RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6493
(1321 letters)
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 154 bits (391), Expect = 1e-42
Identities = 89/206 (43%), Positives = 138/206 (66%), Gaps = 1/206 (0%)
Query: 520 AVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKKHQLVEADIQAHDDRIKDMNGQADSLI 579
+L+ WL E E LL+S D G DL SV+ L+KKH+ +EA++ AH++R++ +N + LI
Sbjct: 8 DADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLI 67
Query: 580 DSGQFDASSIQEKRQSINERYERIKNLAAHRQARLNEANTLHQFFRDIADEESWIKEKKL 639
+ G DA IQE+ + +N+R+E ++ LA R+ RL EA L QFFRD D E W++EK+
Sbjct: 68 EEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEA 127
Query: 640 LVGSDDYGRDLTGVQNLKKKHKRLEAELASHQPAIQNVQETGEKLMDVSNLGV-PEIEQR 698
+ S+D G+DL V+ L KKHK LE EL +H+P ++++ E E+L++ + EIE++
Sbjct: 128 ALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEK 187
Query: 699 LKLLNQAWSELKQLAANRGQKLDESL 724
L+ LN+ W EL +LA R +KL+E+L
Sbjct: 188 LEELNERWEELLELAEERQKKLEEAL 213
Score = 145 bits (368), Expect = 2e-39
Identities = 79/210 (37%), Positives = 127/210 (60%), Gaps = 1/210 (0%)
Query: 620 LHQFFRDIADEESWIKEKKLLVGSDDYGRDLTGVQNLKKKHKRLEAELASHQPAIQNVQE 679
L QF RD + E+W+ EK+ L+ S DYG DL V+ L KKH+ LEAELA+H+ ++ + E
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE 61
Query: 680 TGEKLMDVSNLGVPEIEQRLKLLNQAWSELKQLAANRGQKLDESLTYQHFLAKVEEEEAW 739
GE+L++ + EI++RL+ LNQ W EL++LA R Q+L+E+L Q F ++ E W
Sbjct: 62 LGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQW 121
Query: 740 ISEKQQLLSVEDYGDTMAAVQGLLKKHDAFETDFSVHRDRCADICSAGNKLIEAKNHHA- 798
+ EK+ L+ ED G + +V+ LLKKH E + H R + +L+E + A
Sbjct: 122 LEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDAD 181
Query: 799 DSITQRCQQLQLKLDNLMALATKRKTKLMD 828
+ I ++ ++L + + L+ LA +R+ KL +
Sbjct: 182 EEIEEKLEELNERWEELLELAEERQKKLEE 211
Score = 141 bits (356), Expect = 8e-38
Identities = 72/213 (33%), Positives = 128/213 (60%), Gaps = 1/213 (0%)
Query: 88 EVQRFHRDVDETKDWIQEKDEALNNNDLGKDLRSVQALQRKHEGLERDLAALGDKIRQLD 147
++Q+F RD DE + W+ EK+E L++ D G DL SV+AL +KHE LE +LAA +++ L+
Sbjct: 1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALN 60
Query: 148 ETANRLMQTHPETAEQTYAKQKEINEEWTQLTAKANTRKEKLLDSYDLQRFLSDYRDLMS 207
E +L++ AE+ + +E+N+ W +L A R+++L ++ DLQ+F D DL
Sbjct: 61 ELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQ 120
Query: 208 WINSMMGLVSSDELANDVTGAEALLERHQEHRTEIDARTGTFQAFDLFGQQLLQSGHYAS 267
W+ ++S++L D+ E LL++H+E E++A ++ + ++LL+ GH +
Sbjct: 121 WLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180
Query: 268 V-EIQDKLGNLAEAREDLEKAWIARRMQLDQCL 299
EI++KL L E E+L + R+ +L++ L
Sbjct: 181 DEEIEEKLEELNERWEELLELAEERQKKLEEAL 213
Score = 139 bits (352), Expect = 2e-37
Identities = 70/212 (33%), Positives = 129/212 (60%), Gaps = 1/212 (0%)
Query: 194 DLQRFLSDYRDLMSWINSMMGLVSSDELANDVTGAEALLERHQEHRTEIDARTGTFQAFD 253
LQ+FL D +L +W++ L+SS + +D+ EALL++H+ E+ A +A +
Sbjct: 1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALN 60
Query: 254 LFGQQLLQSGHYASVEIQDKLGNLAEAREDLEKAWIARRMQLDQCLELQLFYRDCEQAEN 313
G+QL++ GH + EIQ++L L + E+L + RR +L++ L+LQ F+RD + E
Sbjct: 61 ELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQ 120
Query: 314 WMSAREAFLNAEEVDSKTDNVEALIKKHEDFDKAINAHEEKIGALQTLADQLIAADHYAA 373
W+ +EA L +E++ ++VE L+KKH++ ++ + AHE ++ +L LA++L+ H A
Sbjct: 121 WLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDA 180
Query: 374 KP-IDDKRKQVLDRWRLLKEALIEKRSRLGES 404
I++K +++ +RW L E E++ +L E+
Sbjct: 181 DEEIEEKLEELNERWEELLELAEERQKKLEEA 212
Score = 136 bits (343), Expect = 4e-36
Identities = 76/214 (35%), Positives = 131/214 (61%), Gaps = 2/214 (0%)
Query: 300 ELQLFYRDCEQAENWMSAREAFLNAEEVDSKTDNVEALIKKHEDFDKAINAHEEKIGALQ 359
+LQ F RD ++ E W+S +E L++ + ++VEAL+KKHE + + AHEE++ AL
Sbjct: 1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALN 60
Query: 360 TLADQLIAADHYAAKPIDDKRKQVLDRWRLLKEALIEKRSRLGESQTLQQFSRDADEMEN 419
L +QLI H A+ I ++ +++ RW L+E E+R RL E+ LQQF RDAD++E
Sbjct: 61 ELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQ 120
Query: 420 WIAEKL-QLATEESYKDPANIQSKHQKHQAFEAELAANADRIQSVLAMGQNLIDKRQCVG 478
W+ EK LA+E+ KD +++ +KH+ E EL A+ R++S+ + + L+++
Sbjct: 121 WLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHP-D 179
Query: 479 SEEAVQARLASIADQWEFLTQKTTEKSLKLKEAN 512
++E ++ +L + ++WE L + E+ KL+EA
Sbjct: 180 ADEEIEEKLEELNERWEELLELAEERQKKLEEAL 213
Score = 132 bits (333), Expect = 9e-35
Identities = 85/214 (39%), Positives = 136/214 (63%), Gaps = 4/214 (1%)
Query: 407 LQQFSRDADEMENWIAEKLQLATEESY-KDPANIQSKHQKHQAFEAELAANADRIQSVLA 465
LQQF RDADE+E W++EK +L + Y D ++++ +KH+A EAELAA+ +R++++
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE 61
Query: 466 MGQNLIDKRQCVGSEEAVQARLASIADQWEFLTQKTTEKSLKLKEANKQRTYIAAVKDLD 525
+G+ LI++ E +Q RL + +WE L + E+ +L+EA + + DL+
Sbjct: 62 LGEQLIEEGHP--DAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLE 119
Query: 526 FWLGEVESLLTSEDSGKDLASVQNLIKKHQLVEADIQAHDDRIKDMNGQADSLIDSGQFD 585
WL E E+ L SED GKDL SV+ L+KKH+ +E +++AH+ R+K +N A+ L++ G D
Sbjct: 120 QWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPD 179
Query: 586 AS-SIQEKRQSINERYERIKNLAAHRQARLNEAN 618
A I+EK + +NER+E + LA RQ +L EA
Sbjct: 180 ADEEIEEKLEELNERWEELLELAEERQKKLEEAL 213
Score = 127 bits (320), Expect = 6e-33
Identities = 74/192 (38%), Positives = 121/192 (63%), Gaps = 2/192 (1%)
Query: 3 AQVQDVGEDLEQVEVMQKKFDDFQSDLKANEVRLAEMNEIAMQLMSLGQTEAALKIQTQL 62
D G+DLE VE + KK + +++L A+E R+ +NE+ QL+ G A +IQ +L
Sbjct: 23 LSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGH-PDAEEIQERL 81
Query: 63 QDLNQKWTSLQQLTAERATQLGSAHEVQRFHRDVDETKDWIQEKDEALNNNDLGKDLRSV 122
++LNQ+W L++L ER +L A ++Q+F RD D+ + W++EK+ AL + DLGKDL SV
Sbjct: 82 EELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESV 141
Query: 123 QALQRKHEGLERDLAALGDKIRQLDETANRLMQT-HPETAEQTYAKQKEINEEWTQLTAK 181
+ L +KH+ LE +L A +++ L+E A L++ HP+ E+ K +E+NE W +L
Sbjct: 142 EELLKKHKELEEELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLEELNERWEELLEL 201
Query: 182 ANTRKEKLLDSY 193
A R++KL ++
Sbjct: 202 AEERQKKLEEAL 213
Score = 118 bits (297), Expect = 6e-30
Identities = 71/209 (33%), Positives = 122/209 (58%), Gaps = 2/209 (0%)
Query: 726 YQHFLAKVEEEEAWISEKQQLLSVEDYGDTMAAVQGLLKKHDAFETDFSVHRDRCADICS 785
Q FL +E EAW+SEK++LLS DYGD + +V+ LLKKH+A E + + H +R +
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE 61
Query: 786 AGNKLIEAKNHHADSITQRCQQLQLKLDNLMALATKRKTKLMDNSAYLQFMWKADVVESW 845
G +LIE + A+ I +R ++L + + L LA +R+ +L + QF AD +E W
Sbjct: 62 LGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQW 121
Query: 846 IADKETHVKSEEYGRDLSTVQTLLTKQETFDAGLHAFEHEGIQNITTLKDQLVASNH-DQ 904
+ +KE + SE+ G+DL +V+ LL K + + L A E ++++ L ++L+ H D
Sbjct: 122 LEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPR-LKSLNELAEELLEEGHPDA 180
Query: 905 TPAIVKRHGDVIARWQKLLGDSNARKQRL 933
I ++ ++ RW++LL + R+++L
Sbjct: 181 DEEIEEKLEELNERWEELLELAEERQKKL 209
Score = 91.0 bits (226), Expect = 2e-20
Identities = 55/220 (25%), Positives = 102/220 (46%), Gaps = 10/220 (4%)
Query: 832 YLQFMWKADVVESWIADKETHVKSEEYGRDLSTVQTLLTKQETFDAGLHAFEHEGIQNIT 891
QF+ AD +E+W+++KE + S +YG DL +V+ LL K E +A L A E E ++ +
Sbjct: 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHE-ERVEALN 60
Query: 892 TLKDQLVASNHDQTPAIVKRHGDVIARWQKLLGDSNARKQRLLRMQEQFRQIEDLYLTFA 951
L +QL+ H I +R ++ RW++L + R+QRL + + D
Sbjct: 61 ELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRD------ 114
Query: 952 KKASSFNSWFENAEEDLTDPVRCNSIEEIRALREAHAQFQASLSSAQADFEALAALDQQ- 1010
A W E E L +E + L + H + + L + + ++L L ++
Sbjct: 115 --ADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEEL 172
Query: 1011 IKSFNVGPNPYTWFTMEALEDTWRNLQKIIKERDIELAKE 1050
++ + + +E L + W L ++ +ER +L +
Sbjct: 173 LEEGHPDADEEIEEKLEELNERWEELLELAEERQKKLEEA 212
Score = 69.0 bits (169), Expect = 7e-13
Identities = 48/221 (21%), Positives = 83/221 (37%), Gaps = 14/221 (6%)
Query: 946 LYLTFAKKASSFNSWFENAEEDLTDPVRCNSIEEIRALREAHAQFQASLSSAQADFEALA 1005
F + A +W EE L+ + +E + AL + H +A L++ + EAL
Sbjct: 1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALN 60
Query: 1006 ALDQQIKSFNVGPNPYTWFTMEALEDTWRNLQKIIKERDIELAKEATRQDENDALRKEFA 1065
L +Q+ +E L W L+++ +ER L + Q
Sbjct: 61 ELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADD--- 117
Query: 1066 KHANAFHQWLTET-----RTSMMEGTGSLEQQLEAIKRKAAEVRSRRSDLKKIEDLGAIL 1120
QWL E + + S+E+ L+ K E+ + LK + +L L
Sbjct: 118 -----LEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEEL 172
Query: 1121 EEHLILDN-RYTEHSTVGLAQQWDQLDQLGMRMQHNLEQQI 1160
E D E L ++W++L +L Q LE+ +
Sbjct: 173 LEEGHPDADEEIEEKLEELNERWEELLELAEERQKKLEEAL 213
>gnl|CDD|219990 pfam08726, efhand_Ca_insen, Ca2+ insensitive EF hand. EF hands are
helix-loop-helix binding motifs involved in the
regulation of many cellular processes. EF hands usually
bind to Ca2+ ions which causes a major conformational
change that allows the protein to interact with its
designated targets. This domain corresponds to an EF hand
which has partially or entirely lost its calcium-binding
properties. The calcium insensitive EF hand is still able
to mediate protein-protein recognition.
Length = 69
Score = 99.3 bits (248), Expect = 4e-25
Identities = 32/71 (45%), Positives = 45/71 (63%), Gaps = 2/71 (2%)
Query: 1250 NVQSSEEIENAFHAIAASDRPYVTKEELYANLTKEMADYCVERMKPYVDPKTERGIPGAL 1309
+ ++E++E +F A+A +PYVT+E+L LT E A+YC+ RM PY P R +PGA
Sbjct: 1 DTDTAEQVEQSFRALA-EGKPYVTEEDLRRALTPEQAEYCIARMPPYSGPDG-RSVPGAY 58
Query: 1310 DYIEFTRTLFQ 1320
DYI F LF
Sbjct: 59 DYISFMEALFG 69
>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat. Spectrin repeat-domains are
found in several proteins involved in cytoskeletal
structure. These include spectrin, alpha-actinin and
dystrophin. The sequence repeat used in this family is
taken from the structural repeat in reference. The
spectrin domain- repeat forms a three helix bundle. The
second helix is interrupted by proline in some
sequences. The repeats are defined by a characteristic
tryptophan (W) residue at position 17 in helix A and a
leucine (L) at 2 residues from the carboxyl end of helix
C. Although the domain occurs in ultiple repeats along
sequences, the domains are actually stable on their own
- ie they act, biophysically, like domains rather than
repeats that along function when aggregated.
Length = 105
Score = 95.1 bits (237), Expect = 3e-23
Identities = 55/103 (53%), Positives = 73/103 (70%)
Query: 620 LHQFFRDIADEESWIKEKKLLVGSDDYGRDLTGVQNLKKKHKRLEAELASHQPAIQNVQE 679
L QFFRD D ESWI+EK+ L+ S+DYG+DL VQ L KKHK LEAELA+HQ ++ + E
Sbjct: 3 LQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNE 62
Query: 680 TGEKLMDVSNLGVPEIEQRLKLLNQAWSELKQLAANRGQKLDE 722
EKL+ + EI++RL+ LN+ W +L +LAA R QKL+E
Sbjct: 63 LAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 90.1 bits (224), Expect = 2e-21
Identities = 44/105 (41%), Positives = 67/105 (63%)
Query: 87 HEVQRFHRDVDETKDWIQEKDEALNNNDLGKDLRSVQALQRKHEGLERDLAALGDKIRQL 146
+Q+F RD D+ + WI+EK+ L++ D GKDL SVQAL +KH+ LE +LAA D++ L
Sbjct: 1 LLLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEAL 60
Query: 147 DETANRLMQTHPETAEQTYAKQKEINEEWTQLTAKANTRKEKLLD 191
+E A +L+ +E+ + +E+NE W QL A RK+KL +
Sbjct: 61 NELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 81.6 bits (202), Expect = 2e-18
Identities = 38/105 (36%), Positives = 70/105 (66%)
Query: 299 LELQLFYRDCEQAENWMSAREAFLNAEEVDSKTDNVEALIKKHEDFDKAINAHEEKIGAL 358
L LQ F+RD + E+W+ +EA L++E+ ++V+AL+KKH+ + + AH++++ AL
Sbjct: 1 LLLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEAL 60
Query: 359 QTLADQLIAADHYAAKPIDDKRKQVLDRWRLLKEALIEKRSRLGE 403
LA++LIA HYA++ I ++ +++ +RW L E E++ +L E
Sbjct: 61 NELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 80.4 bits (199), Expect = 5e-18
Identities = 45/105 (42%), Positives = 70/105 (66%)
Query: 512 NKQRTYIAAVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKKHQLVEADIQAHDDRIKDM 571
+ + DL+ W+ E E+LL+SED GKDL SVQ L+KKH+ +EA++ AH DR++ +
Sbjct: 1 LLLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEAL 60
Query: 572 NGQADSLIDSGQFDASSIQEKRQSINERYERIKNLAAHRQARLNE 616
N A+ LI G + + IQE+ + +NER+E++ LAA R+ +L E
Sbjct: 61 NELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 77.7 bits (192), Expect = 4e-17
Identities = 36/105 (34%), Positives = 58/105 (55%)
Query: 193 YDLQRFLSDYRDLMSWINSMMGLVSSDELANDVTGAEALLERHQEHRTEIDARTGTFQAF 252
LQ+F D DL SWI L+SS++ D+ +ALL++H+ E+ A +A
Sbjct: 1 LLLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEAL 60
Query: 253 DLFGQQLLQSGHYASVEIQDKLGNLAEAREDLEKAWIARRMQLDQ 297
+ ++L+ GHYAS EIQ++L L E E L + R+ +L++
Sbjct: 61 NELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 73.5 bits (181), Expect = 1e-15
Identities = 43/105 (40%), Positives = 68/105 (64%), Gaps = 3/105 (2%)
Query: 407 LQQFSRDADEMENWIAEKLQLATEESY-KDPANIQSKHQKHQAFEAELAANADRIQSVLA 465
LQQF RDAD++E+WI EK L + E Y KD ++Q+ +KH+A EAELAA+ DR++++
Sbjct: 3 LQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNE 62
Query: 466 MGQNLIDKRQCVGSEEAVQARLASIADQWEFLTQKTTEKSLKLKE 510
+ + LI + + E +Q RL + ++WE L + E+ KL+E
Sbjct: 63 LAEKLIAEG--HYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 71.2 bits (175), Expect = 8e-15
Identities = 38/104 (36%), Positives = 62/104 (59%), Gaps = 1/104 (0%)
Query: 832 YLQFMWKADVVESWIADKETHVKSEEYGRDLSTVQTLLTKQETFDAGLHAFEHEGIQNIT 891
QF AD +ESWI +KE + SE+YG+DL +VQ LL K + +A L A + + ++ +
Sbjct: 3 LQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQ-DRVEALN 61
Query: 892 TLKDQLVASNHDQTPAIVKRHGDVIARWQKLLGDSNARKQRLLR 935
L ++L+A H + I +R ++ RW++LL + RKQ+L
Sbjct: 62 ELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
Score = 70.0 bits (172), Expect = 2e-14
Identities = 39/101 (38%), Positives = 60/101 (59%)
Query: 726 YQHFLAKVEEEEAWISEKQQLLSVEDYGDTMAAVQGLLKKHDAFETDFSVHRDRCADICS 785
Q F ++ E+WI EK+ LLS EDYG + +VQ LLKKH A E + + H+DR +
Sbjct: 3 LQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNE 62
Query: 786 AGNKLIEAKNHHADSITQRCQQLQLKLDNLMALATKRKTKL 826
KLI ++ ++ I +R ++L + + L+ LA +RK KL
Sbjct: 63 LAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKL 103
Score = 58.1 bits (141), Expect = 4e-10
Identities = 26/102 (25%), Positives = 45/102 (44%)
Query: 946 LYLTFAKKASSFNSWFENAEEDLTDPVRCNSIEEIRALREAHAQFQASLSSAQADFEALA 1005
L F + A SW E E L+ +E ++AL + H +A L++ Q EAL
Sbjct: 2 LLQQFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALN 61
Query: 1006 ALDQQIKSFNVGPNPYTWFTMEALEDTWRNLQKIIKERDIEL 1047
L +++ + + +E L + W L ++ ER +L
Sbjct: 62 ELAEKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKL 103
Score = 53.1 bits (128), Expect = 2e-08
Identities = 29/78 (37%), Positives = 53/78 (67%), Gaps = 1/78 (1%)
Query: 6 QDVGEDLEQVEVMQKKFDDFQSDLKANEVRLAEMNEIAMQLMSLGQTEAALKIQTQLQDL 65
+D G+DLE V+ + KK +++L A++ R+ +NE+A +L++ G A+ +IQ +L++L
Sbjct: 27 EDYGKDLESVQALLKKHKALEAELAAHQDRVEALNELAEKLIAEG-HYASEEIQERLEEL 85
Query: 66 NQKWTSLQQLTAERATQL 83
N++W L +L AER +L
Sbjct: 86 NERWEQLLELAAERKQKL 103
Score = 49.2 bits (118), Expect = 5e-07
Identities = 18/101 (17%), Positives = 41/101 (40%), Gaps = 5/101 (4%)
Query: 1063 EFAKHANAFHQWLTETRTSMMEG-----TGSLEQQLEAIKRKAAEVRSRRSDLKKIEDLG 1117
+F + A+ W+ E + S++ L+ K AE+ + + ++ + +L
Sbjct: 5 QFFRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNELA 64
Query: 1118 AILEEHLILDNRYTEHSTVGLAQQWDQLDQLGMRMQHNLEQ 1158
L + + L ++W+QL +L + LE+
Sbjct: 65 EKLIAEGHYASEEIQERLEELNERWEQLLELAAERKQKLEE 105
>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats.
Length = 101
Score = 92.8 bits (231), Expect = 2e-22
Identities = 45/101 (44%), Positives = 69/101 (68%)
Query: 621 HQFFRDIADEESWIKEKKLLVGSDDYGRDLTGVQNLKKKHKRLEAELASHQPAIQNVQET 680
QF RD + E+W++EK+ L+ S+D G+DL V+ L KKH+ EAEL +H+ ++ + E
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 681 GEKLMDVSNLGVPEIEQRLKLLNQAWSELKQLAANRGQKLD 721
GE+L++ + EIE+RL+ LN+ W ELK+LA R QKL+
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 86.2 bits (214), Expect = 4e-20
Identities = 40/100 (40%), Positives = 64/100 (64%)
Query: 90 QRFHRDVDETKDWIQEKDEALNNNDLGKDLRSVQALQRKHEGLERDLAALGDKIRQLDET 149
Q+F RD DE + W++EK++ L + DLGKDL SV+AL +KHE E +L A +++ L+E
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 150 ANRLMQTHPETAEQTYAKQKEINEEWTQLTAKANTRKEKL 189
+L++ AE+ + +E+NE W +L A R++KL
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKL 100
Score = 82.4 bits (204), Expect = 9e-19
Identities = 42/101 (41%), Positives = 68/101 (67%)
Query: 515 RTYIAAVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKKHQLVEADIQAHDDRIKDMNGQ 574
+ ++ +L+ WL E E LL SED GKDL SV+ L+KKH+ EA+++AH++R++ +N
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 575 ADSLIDSGQFDASSIQEKRQSINERYERIKNLAAHRQARLN 615
+ LI+ G DA I+E+ + +NER+E +K LA R+ +L
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 80.8 bits (200), Expect = 4e-18
Identities = 39/100 (39%), Positives = 64/100 (64%)
Query: 302 QLFYRDCEQAENWMSAREAFLNAEEVDSKTDNVEALIKKHEDFDKAINAHEEKIGALQTL 361
Q F RD ++ E W+ +E L +E++ ++VEAL+KKHE F+ + AHEE++ AL L
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 362 ADQLIAADHYAAKPIDDKRKQVLDRWRLLKEALIEKRSRL 401
+QLI H A+ I+++ +++ +RW LKE E+R +L
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKL 100
Score = 78.5 bits (194), Expect = 2e-17
Identities = 40/100 (40%), Positives = 61/100 (61%)
Query: 727 QHFLAKVEEEEAWISEKQQLLSVEDYGDTMAAVQGLLKKHDAFETDFSVHRDRCADICSA 786
Q FL +E EAW+ EK+QLL+ ED G + +V+ LLKKH+AFE + H +R +
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 787 GNKLIEAKNHHADSITQRCQQLQLKLDNLMALATKRKTKL 826
G +LIE + A+ I +R ++L + + L LA +R+ KL
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKL 100
Score = 77.4 bits (191), Expect = 5e-17
Identities = 33/101 (32%), Positives = 61/101 (60%)
Query: 196 QRFLSDYRDLMSWINSMMGLVSSDELANDVTGAEALLERHQEHRTEIDARTGTFQAFDLF 255
Q+FL D +L +W+ L++S++L D+ EALL++H+ E++A +A +
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 256 GQQLLQSGHYASVEIQDKLGNLAEAREDLEKAWIARRMQLD 296
G+QL++ GH + EI+++L L E E+L++ RR +L+
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 70.1 bits (172), Expect = 2e-14
Identities = 40/103 (38%), Positives = 68/103 (66%), Gaps = 3/103 (2%)
Query: 408 QQFSRDADEMENWIAEKLQ-LATEESYKDPANIQSKHQKHQAFEAELAANADRIQSVLAM 466
QQF RDADE+E W+ EK Q LA+E+ KD ++++ +KH+AFEAEL A+ +R++++ +
Sbjct: 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNEL 60
Query: 467 GQNLIDKRQCVGSEEAVQARLASIADQWEFLTQKTTEKSLKLK 509
G+ LI++ E ++ RL + ++WE L + E+ KL+
Sbjct: 61 GEQLIEEGH--PDAEEIEERLEELNERWEELKELAEERRQKLE 101
Score = 68.5 bits (168), Expect = 7e-14
Identities = 36/100 (36%), Positives = 60/100 (60%), Gaps = 1/100 (1%)
Query: 834 QFMWKADVVESWIADKETHVKSEEYGRDLSTVQTLLTKQETFDAGLHAFEHEGIQNITTL 893
QF+ AD +E+W+ +KE + SE+ G+DL +V+ LL K E F+A L A E E ++ + L
Sbjct: 2 QFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHE-ERVEALNEL 60
Query: 894 KDQLVASNHDQTPAIVKRHGDVIARWQKLLGDSNARKQRL 933
+QL+ H I +R ++ RW++L + R+Q+L
Sbjct: 61 GEQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKL 100
Score = 59.7 bits (145), Expect = 9e-11
Identities = 30/78 (38%), Positives = 54/78 (69%), Gaps = 1/78 (1%)
Query: 6 QDVGEDLEQVEVMQKKFDDFQSDLKANEVRLAEMNEIAMQLMSLGQTEAALKIQTQLQDL 65
+D+G+DLE VE + KK + F+++L+A+E R+ +NE+ QL+ G A +I+ +L++L
Sbjct: 24 EDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEG-HPDAEEIEERLEEL 82
Query: 66 NQKWTSLQQLTAERATQL 83
N++W L++L ER +L
Sbjct: 83 NERWEELKELAEERRQKL 100
Score = 46.9 bits (112), Expect = 2e-06
Identities = 25/98 (25%), Positives = 43/98 (43%)
Query: 950 FAKKASSFNSWFENAEEDLTDPVRCNSIEEIRALREAHAQFQASLSSAQADFEALAALDQ 1009
F + A +W E E+ L +E + AL + H F+A L + + EAL L +
Sbjct: 3 FLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGE 62
Query: 1010 QIKSFNVGPNPYTWFTMEALEDTWRNLQKIIKERDIEL 1047
Q+ +E L + W L+++ +ER +L
Sbjct: 63 QLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKL 100
Score = 29.6 bits (67), Expect = 3.3
Identities = 24/100 (24%), Positives = 44/100 (44%), Gaps = 5/100 (5%)
Query: 1063 EFAKHANAFHQWLTETRTSMM-EGTGSLEQQLEAIKRK----AAEVRSRRSDLKKIEDLG 1117
+F + A+ WL E + E G + +EA+ +K AE+ + ++ + +LG
Sbjct: 2 QFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELG 61
Query: 1118 AILEEHLILDNRYTEHSTVGLAQQWDQLDQLGMRMQHNLE 1157
L E D E L ++W++L +L + LE
Sbjct: 62 EQLIEEGHPDAEEIEERLEELNERWEELKELAEERRQKLE 101
>gnl|CDD|238008 cd00051, EFh, EF-hand, calcium binding motif; A diverse superfamily
of calcium sensors and calcium signal modulators; most
examples in this alignment model have 2 active canonical
EF hands. Ca2+ binding induces a conformational change in
the EF-hand motif, leading to the activation or
inactivation of target proteins. EF-hands tend to occur
in pairs or higher copy numbers.
Length = 63
Score = 63.3 bits (155), Expect = 1e-12
Identities = 21/69 (30%), Positives = 36/69 (52%), Gaps = 7/69 (10%)
Query: 1175 EFSMMFKHFDKDKSGKLNQTEFKSCLRALGYDLPMVEEGQPDPEFEAILDLVDPNRDGHV 1234
E F+ FDKD G ++ E K+ L++LG EG + E + ++ VD + DG +
Sbjct: 1 ELREAFRLFDKDGDGTISADELKAALKSLG-------EGLSEEEIDEMIREVDKDGDGKI 53
Query: 1235 SLQEYMAFM 1243
+E++ M
Sbjct: 54 DFEEFLELM 62
Score = 31.4 bits (72), Expect = 0.35
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 1174 KEFSMMFKHFDKDKSGKLNQTEF 1196
+E M + DKD GK++ EF
Sbjct: 36 EEIDEMIREVDKDGDGKIDFEEF 58
>gnl|CDD|222177 pfam13499, EF_hand_5, EF-hand domain pair.
Length = 60
Score = 48.9 bits (117), Expect = 2e-07
Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 3/63 (4%)
Query: 1181 KHFDKDKSGKLNQTEFKSCLRALGYDLPMVEEGQPDPEFEAILDLVDPNRDGHVSLQEYM 1240
K DKD G ++ E + L+ALG L + + + EA + +D + DG +S +E++
Sbjct: 1 KLLDKDGDGYIDVEELRKLLKALGLKLT---DEEVEELIEADFNEIDKDGDGRISFEEFL 57
Query: 1241 AFM 1243
M
Sbjct: 58 EAM 60
>gnl|CDD|227455 COG5126, FRQ1, Ca2+-binding protein (EF-Hand superfamily) [Signal
transduction mechanisms / Cytoskeleton / Cell division
and chromosome partitioning / General function prediction
only].
Length = 160
Score = 46.9 bits (112), Expect = 8e-06
Identities = 35/162 (21%), Positives = 71/162 (43%), Gaps = 16/162 (9%)
Query: 1158 QQIQARNQSGVSEDALKEFSMMFKHFDKDKSGKLNQTEFKSCLRALGYDLPMVEEGQPDP 1217
+ + ++E+ ++E F+ FD+D G +++ E LR+LG++ +
Sbjct: 4 KISDLLTFTQLTEEQIQELKEAFQLFDRDSDGLIDRNELGKILRSLGFNP-------SEA 56
Query: 1218 EFEAILDLVDPNRDGHVSLQEYMAFMISKETENVQSSEEIENAFHAIAASDRPYVTKEEL 1277
E + + +D + V E++ M S + + EE+ AF Y++ EL
Sbjct: 57 EINKLFEEIDAGNET-VDFPEFLTVM-SVKLKRGDKEEELREAFKLFDKDHDGYISIGEL 114
Query: 1278 Y---ANLTKEMADYCVERMKPYVDPKTERGIPGALDYIEFTR 1316
+L + ++D VE++ D + G +DY EF +
Sbjct: 115 RRVLKSLGERLSDEEVEKLLKEYDEDGD----GEIDYEEFKK 152
Score = 43.5 bits (103), Expect = 1e-04
Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 7/79 (8%)
Query: 1171 DALKEFSMMFKHFDKDKSGKLNQTEFKSCLRALGYDLPMVEEGQPDPEFEAILDLVDPNR 1230
D +E FK FDKD G ++ E + L++LG E D E E +L D +
Sbjct: 89 DKEEELREAFKLFDKDHDGYISIGELRRVLKSLG-------ERLSDEEVEKLLKEYDEDG 141
Query: 1231 DGHVSLQEYMAFMISKETE 1249
DG + +E+ + T
Sbjct: 142 DGEIDYEEFKKLIKDSPTI 160
>gnl|CDD|185504 PTZ00184, PTZ00184, calmodulin; Provisional.
Length = 149
Score = 45.1 bits (107), Expect = 3e-05
Identities = 38/157 (24%), Positives = 71/157 (45%), Gaps = 15/157 (9%)
Query: 1168 VSEDALKEFSMMFKHFDKDKSGKLNQTEFKSCLRALGYDLPMVEEGQPDPEFEAILDLVD 1227
++E+ + EF F FDKD G + E + +R+LG + + E + +++ VD
Sbjct: 5 LTEEQIAEFKEAFSLFDKDGDGTITTKELGTVMRSLG-------QNPTEAELQDMINEVD 57
Query: 1228 PNRDGHVSLQEYMAFMISKETENVQSSEEIENAFHAIAASDRPYVTKEEL---YANLTKE 1284
+ +G + E++ M K + S EEI+ AF +++ EL NL ++
Sbjct: 58 ADGNGTIDFPEFLTLMARKMKDT-DSEEEIKEAFKVFDRDGNGFISAAELRHVMTNLGEK 116
Query: 1285 MADYCVERMKPYVDPKTERGIPGALDYIEFTRTLFQN 1321
+ D V+ M D + G ++Y EF + +
Sbjct: 117 LTDEEVDEMIREADVDGD----GQINYEEFVKMMMSK 149
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 46.6 bits (111), Expect = 1e-04
Identities = 98/654 (14%), Positives = 237/654 (36%), Gaps = 87/654 (13%)
Query: 10 EDLEQVEVMQKKFDDFQSDLKANEVRLAEMNEIAMQLMSLGQTEAALKIQTQLQDLNQKW 69
E E++ +++K ++ + +L++ E L E+ ++ + E +++T + Q
Sbjct: 334 ELAEELAELEEKLEELKEELESLEAELEELEA-ELEELESRLEELEEQLETLRSKVAQL- 391
Query: 70 TSLQQLTAERATQLGSAHEVQRFHRDVDETKDWIQEKDEALNNNDLGKDLRSVQALQRKH 129
Q+ + ++R + + I+E + L +L + LQ +
Sbjct: 392 --ELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAEL-------KELQAEL 442
Query: 130 EGLERDLAALGDKIRQLDETANRLMQTHPETAEQTYAKQKEINEEWTQLTAKANTRKEKL 189
E LE +L L +++ +L+E L + E + A ++E+ + +L + +E L
Sbjct: 443 EELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDS-LERLQENL 501
Query: 190 LDSYDLQRFLSDYRDLMSWINSMMGLVSS-------DELANDVTGAEALL---------- 232
+ + L + S ++ ++G++S E A + L
Sbjct: 502 EGFSEGVKALLKNQ---SGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVVENLNAA 558
Query: 233 ERHQEHRTEIDARTGTFQAFDLFGQQLLQSGHYASVEIQDK----LGNLAEAREDLEKA- 287
++ + + TF D +Q ++ + +L + L KA
Sbjct: 559 KKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKAL 618
Query: 288 --WIARRM---QLDQCLELQLFYRDCEQAENWMS---------------AREAFLNAEEV 327
+ + LD LEL R ++ + + + E
Sbjct: 619 SYLLGGVLVVDDLDNALELAKKLR---PGYRIVTLDGDLVRPGGVITGGSAKTNSSILER 675
Query: 328 DSKTDNVEALIKKHEDFDKAINAHEEKIGALQTLADQLIAADHYAAKPIDDKRKQVLDRW 387
+ + +E I++ E+ I E+ + L+ ++L K +++ +Q+
Sbjct: 676 RREIEELEEKIEELEE---KIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732
Query: 388 RLLKEALIEKRSRLGESQTLQQFSRDADEMENWIAEKLQLATEESYKDPANIQSKHQKHQ 447
+ L E L + + + + E+L+ A EE + A I+ + +
Sbjct: 733 KDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIE 792
Query: 448 AFEAELAANADRIQSVLAMGQNLIDKRQCVGSE-EAVQARLASIADQWEFLTQKTTEKSL 506
+ EL A + + + A L ++ + E+++ R+A+ + E L ++ E S
Sbjct: 793 QLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSE 852
Query: 507 KLKEANKQRTYIAAVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKKH----QLVEADIQ 562
++ + + E E + AS++ + + + +++
Sbjct: 853 DIESLAAEIE--------ELEELIEELESELEALLNERASLEEALALLRSELEELSEELR 904
Query: 563 AHDDRIKDMNGQADSLIDSGQFDASSIQEKRQSINERYERIKNLAAHRQARLNE 616
+ + ++ + + L +EK + R E ++ + Q RL+E
Sbjct: 905 ELESKRSELRRELEEL-----------REKLAQLELRLEGLEVRIDNLQERLSE 947
Score = 42.4 bits (100), Expect = 0.002
Identities = 60/345 (17%), Positives = 130/345 (37%), Gaps = 33/345 (9%)
Query: 121 SVQALQRKHEGLERDLAALGDKIRQLDETANRLMQTHPETAEQTYAKQKEINEEWTQLTA 180
S+ +R+ E LE + L +KI +L++ L + E E+ +KE+ E Q+
Sbjct: 671 SILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQI-- 728
Query: 181 KANTRKEKLLDSYDLQRFLSDYRDLMSWINSMMGLVSSDELANDVTGAEALLERHQEHRT 240
+ RK+ ++++ L + + +EL + AE L +
Sbjct: 729 -SALRKDLARLEAEVEQLEERIAQLSKELTEL--EAEIEELEERLEEAEEELAEAEAEIE 785
Query: 241 EIDARTGTFQAFDLFGQQLLQSGHYASVEIQDKLGNLAEAREDLEKAWIARRMQLDQCLE 300
E++A+ ++ L++ A E++ +L L E +L + ++
Sbjct: 786 ELEAQIEQL-------KEELKALREALDELRAELTLLNEEAANLRER--LESLERRIAAT 836
Query: 301 LQLFYRDCEQAENWMSAREAFLNAEEVDSKTDNVEALIKKHEDFDKAINAHEEKIGALQT 360
+ EQ E +S L A E++ + +E L + E + EE + L++
Sbjct: 837 ERRLEDLEEQIEE-LSEDIESLAA-EIEELEELIEELESELEALLNERASLEEALALLRS 894
Query: 361 LADQLIAADHYAAKPIDDKRKQVLDRWRLLKEALIEKRSRLGESQTLQQFSRDADEMENW 420
++L + + + + L+ L E R +L + + + +E
Sbjct: 895 ELEELSEE-------LRELESKRSE----LRRELEELREKLAQLELRLE------GLEVR 937
Query: 421 IAEKLQLATEESYKDPANIQSKHQKHQAFEAELAANADRIQSVLA 465
I + +EE ++ K + E E R+++ +
Sbjct: 938 IDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIK 982
Score = 38.5 bits (90), Expect = 0.028
Identities = 135/830 (16%), Positives = 287/830 (34%), Gaps = 128/830 (15%)
Query: 10 EDLEQVE--VMQKKFDDFQSDLKANEVRLAEMNEIAMQLMSLGQTEAALKIQTQLQDLNQ 67
+L ++E ++ + ++ + +L+ + L E E E ++Q + L +
Sbjct: 220 AELRELELALLVLRLEELREELEELQEELKEAEEE--------LEELTAELQELEEKLEE 271
Query: 68 KWTSLQQLTAERATQLGSAHEVQRFHRDVDETKDWIQEKDEALNNNDLGKDLRSVQALQR 127
+ +L E + + +++ K ++E+ L + + + +
Sbjct: 272 LRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESK 331
Query: 128 KHEGLERDLAALGDKIRQLDETANRLMQTHPETAEQTYAKQKEINEEWTQLTAKANTRKE 187
E L +LA L +K+ +L E L A+ +E+ E +L ++ +E
Sbjct: 332 LDE-LAEELAELEEKLEELKEELESLE-----------AELEELEAELEELESRLEELEE 379
Query: 188 KLLDSYDLQRFLSDYRDLMSWINSMMGLVSSDELANDVTGAEALLERHQEHRTEIDARTG 247
+ L+ S L I S L N++ EA LER ++ R +
Sbjct: 380 Q------LETLRSKVAQLELQIAS---------LNNEIERLEARLERLEDRRERLQQEIE 424
Query: 248 TFQAFDLFGQQLLQSGHYASVEIQDKLGNLAEAREDLEKAWIARRMQLDQCLELQLFYRD 307
+ +E + L L E E LE+A R +L+ E +
Sbjct: 425 ELLKKLEEAELKELQAELEELEEE--LEELQEELERLEEALEELREELE---EAEQALDA 479
Query: 308 CEQAENWMSAREAFLNA--EEVDSKTDNVEALIKKHEDFDKAINAHEEKIGALQTLADQL 365
E+ + AR L E ++ ++ V+AL+K G L L++ +
Sbjct: 480 AERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLS----------GILGVLSELI 529
Query: 366 IAADHYAA-----------KPIDDKRKQVLDRWRLLKEALIEKRSRLGESQTLQQFSRDA 414
+ Y A + + LK+ + + + L +
Sbjct: 530 SVDEGYEAAIEAALGGRLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQ-G 588
Query: 415 DEMENWIAEKLQLATEESYKDPAN--IQSKHQKHQAFEAELA--------ANADRIQSVL 464
++ E L E + A ++ + +A L NA + L
Sbjct: 589 NDRE-------ILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKL 641
Query: 465 AMGQNLIDK---------RQCVGSEEAVQARLA---SIADQW---EFLTQKTTEKSLKLK 509
G ++ GS + + L I + E L +K E L
Sbjct: 642 RPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALA 701
Query: 510 EANKQRTYIA-AVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKKHQLVEADIQAHDDRI 568
E K+ + ++ L L E+ +++ + L + + +E I +
Sbjct: 702 ELRKELEELEEELEQLRKELEELSRQISALR-----KDLARLEAEVEQLEERIAQLSKEL 756
Query: 569 KDMNGQADSL---IDSGQFDASSIQEKRQSINERYERIKNLAAHRQARLNEANTLHQFFR 625
++ + + L ++ + + + + + + + + E++K + L+E
Sbjct: 757 TELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLN 816
Query: 626 DIADEESWIKEKKLLVGSDDYGRDLTGVQNLKKKHKRLEAELASHQPAIQNVQETGEKLM 685
+ A ++L +++L+++ + L ++ S I+ ++E E+L
Sbjct: 817 EEAAN----LRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELE 872
Query: 686 D----VSNLGVPEIEQRLKLLN----QAWSELKQLAANRGQKLDESLTYQHFLAKVEEE- 736
+ N +E+ L LL + EL++L + R + E + LA++E
Sbjct: 873 SELEALLNE-RASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRL 931
Query: 737 ---EAWISEKQQLLSVEDYGDTMAAVQGLLKKHDAFETDFSVHRDRCADI 783
E I Q+ LS E+Y T+ + + E D R R +
Sbjct: 932 EGLEVRIDNLQERLS-EEYSLTL---EEAEALENKIEDDEEEARRRLKRL 977
Score = 31.2 bits (71), Expect = 4.9
Identities = 42/248 (16%), Positives = 89/248 (35%), Gaps = 16/248 (6%)
Query: 928 ARKQRLLRMQEQFRQIEDLYLTFAKKASSFNSWFENAEEDLTDPVRCNSIEEIRALREAH 987
R++ + ++E+ ++E+ K + E EE+L +R E R +
Sbjct: 674 ERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQ-LRKELEELSRQISALR 732
Query: 988 AQFQASLSSAQADFEALAALDQQIKSFNVGPNPYTWFTMEALEDTWRNLQKIIKERDIEL 1047
+ + E +A L +++ E LE+ L + E +
Sbjct: 733 KDLARLEAEVEQLEERIAQLSKELTELEAEIEELE----ERLEEAEEELAEAEAEIEELE 788
Query: 1048 AKEATRQDENDALRKEFAKHANAFHQWLTETRTSMMEGTGSLEQQLEAIKRKAAEVRSRR 1107
A+ ++E ALR+ + A L E ++ E SLE+++ A +R+ ++ +
Sbjct: 789 AQIEQLKEELKALREALDE-LRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847
Query: 1108 SDLKKIEDLGAILEEHLILDNRYTEHSTVGLAQQWDQLDQLGMRMQHNLEQQIQARNQSG 1167
+L E + E + + +L+ L E R++
Sbjct: 848 EEL----------SEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELE 897
Query: 1168 VSEDALKE 1175
+ L+E
Sbjct: 898 ELSEELRE 905
>gnl|CDD|197492 smart00054, EFh, EF-hand, calcium binding motif. EF-hands are
calcium-binding motifs that occur at least in pairs.
Links between disease states and genes encoding EF-hands,
particularly the S100 subclass, are emerging. Each motif
consists of a 12 residue loop flanked on either side by a
12 residue alpha-helix. EF-hands undergo a conformational
change unpon binding calcium ions.
Length = 29
Score = 38.5 bits (91), Expect = 4e-04
Identities = 14/29 (48%), Positives = 18/29 (62%)
Query: 1175 EFSMMFKHFDKDKSGKLNQTEFKSCLRAL 1203
E F+ FDKD GK++ EFK L+AL
Sbjct: 1 ELKEAFRLFDKDGDGKIDFEEFKDLLKAL 29
Score = 31.2 bits (72), Expect = 0.18
Identities = 6/29 (20%), Positives = 13/29 (44%)
Query: 1218 EFEAILDLVDPNRDGHVSLQEYMAFMISK 1246
E + L D + DG + +E+ + +
Sbjct: 1 ELKEAFRLFDKDGDGKIDFEEFKDLLKAL 29
>gnl|CDD|185503 PTZ00183, PTZ00183, centrin; Provisional.
Length = 158
Score = 42.0 bits (99), Expect = 4e-04
Identities = 29/102 (28%), Positives = 47/102 (46%), Gaps = 8/102 (7%)
Query: 1160 IQARNQSGVSEDALKEFSMMFKHFDKDKSGKLNQTEFKSCLRALGYDLPMVEEGQPDPEF 1219
+ + G++ED KE F FD D SG ++ E K +R+LG++ E
Sbjct: 3 KRRSERPGLTEDQKKEIREAFDLFDTDGSGTIDPKELKVAMRSLGFE-------PKKEEI 55
Query: 1220 EAILDLVDPNRDGHVSLQEYMAFMISKETENVQSSEEIENAF 1261
+ ++ VD + G + +E++ M K E EEI AF
Sbjct: 56 KQMIADVDKDGSGKIDFEEFLDIMTKKLGER-DPREEILKAF 96
Score = 32.4 bits (74), Expect = 0.88
Identities = 23/73 (31%), Positives = 35/73 (47%), Gaps = 7/73 (9%)
Query: 1171 DALKEFSMMFKHFDKDKSGKLNQTEFKSCLRALGYDLPMVEEGQPDPEFEAILDLVDPNR 1230
D +E F+ FD DK+GK++ K + LG E D E + ++D D N
Sbjct: 87 DPREEILKAFRLFDDDKTGKISLKNLKRVAKELG-------ETITDEELQEMIDEADRNG 139
Query: 1231 DGHVSLQEYMAFM 1243
DG +S +E+ M
Sbjct: 140 DGEISEEEFYRIM 152
>gnl|CDD|200946 pfam00036, efhand, EF hand. The EF-hands can be divided into two
classes: signaling proteins and buffering/transport
proteins. The first group is the largest and includes the
most well-known members of the family such as calmodulin,
troponin C and S100B. These proteins typically undergo a
calcium-dependent conformational change which opens a
target binding site. The latter group is represented by
calbindin D9k and do not undergo calcium dependent
conformational changes.
Length = 29
Score = 38.2 bits (90), Expect = 6e-04
Identities = 14/29 (48%), Positives = 17/29 (58%)
Query: 1175 EFSMMFKHFDKDKSGKLNQTEFKSCLRAL 1203
E FK FDKD GK++ EFK L+ L
Sbjct: 1 ELKEAFKEFDKDGDGKISFEEFKELLKKL 29
Score = 30.1 bits (69), Expect = 0.37
Identities = 6/26 (23%), Positives = 12/26 (46%)
Query: 1218 EFEAILDLVDPNRDGHVSLQEYMAFM 1243
E + D + DG +S +E+ +
Sbjct: 1 ELKEAFKEFDKDGDGKISFEEFKELL 26
>gnl|CDD|205583 pfam13405, EF_hand_4, EF-hand domain.
Length = 30
Score = 37.6 bits (88), Expect = 8e-04
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 1175 EFSMMFKHFDKDKSGKLNQTEFKSCLRALG 1204
E FK FDKD G ++ E + LR+LG
Sbjct: 1 ELREAFKLFDKDGDGYISAEELRKALRSLG 30
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 41.6 bits (98), Expect = 0.003
Identities = 60/293 (20%), Positives = 118/293 (40%), Gaps = 34/293 (11%)
Query: 117 KDLRSVQALQRKHEGLERDLAALGDKIRQLDETANRLMQTHPETAEQTYAKQKEINEEWT 176
+ +Q L+ + EGL+R+L++L ++R+++ + L Q + + + +KEI
Sbjct: 671 SEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIE---- 726
Query: 177 QLTAKANTRKEKLLDSYDLQRFLSDYRDLMSWINSMMGLVSSDELANDVTGAEALLERHQ 236
QL + KE+L +L+ LS + + S + EL + E L + +
Sbjct: 727 QLEQEEEKLKERL---EELEEDLSSLEQEIENVKSEL-----KELEARIEELEEDLHKLE 778
Query: 237 EHRTEIDARTGTFQAFDLFGQQLLQSGHYASVEIQDKLGNLAEAREDLEKAWIARRMQLD 296
E +++AR ++ E+ L E + L + + +
Sbjct: 779 EALNDLEARLS-----HSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEK 833
Query: 297 QCLELQLFYRDCEQAENWMSAREAFLNA--EEVDSKTDNVEA----LIKKHEDFDKAINA 350
+ ELQ D ++ + LN EE++ + + +EA L + D K +
Sbjct: 834 EIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDE 893
Query: 351 HEEKIGALQTLADQLIAADHYAAKPIDDKRKQVLDRWRLLKEALIEKRSRLGE 403
E ++ L+ ++L A I+ KRK++ + LK L L E
Sbjct: 894 LEAQLRELERKIEELEAQ-------IEKKRKRLSE----LKAKLEALEEELSE 935
Score = 32.7 bits (75), Expect = 1.8
Identities = 57/349 (16%), Positives = 128/349 (36%), Gaps = 44/349 (12%)
Query: 388 RLLKEALIEKRSRLGESQTLQQFSRDADEMENWIAEKLQLATEESYKDPANIQSKHQKHQ 447
R L E R L Q DA I ++++ +E K ++ +
Sbjct: 688 RELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLS 747
Query: 448 AFEAELAANADRIQSVLAMGQNLIDKRQCVGSEEAVQARLASIADQWEFLTQKTTEKSLK 507
+ E E+ ++ + A + L + EEA+ A ++ + +
Sbjct: 748 SLEQEIENVKSELKELEARIEEL--EEDLHKLEEALNDLEARLSH--SRIPEIQ------ 797
Query: 508 LKEANKQRTYIAAVKDLDFWLGEVESLLTSEDSGKDLAS--VQNLIKKHQLVEADIQAHD 565
E +K V ++ L E+E L K+ +Q L ++ ++ I++ +
Sbjct: 798 -AELSKLEEE---VSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIE 853
Query: 566 DRIKDMNGQADSLIDSGQFDASSIQEKRQSINERYERIKNLAAHRQARLNEANTLHQFFR 625
I+++NG+ + L ++E ++ + R+ +L R + L +
Sbjct: 854 KEIENLNGKKEEL-------EEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIE 906
Query: 626 DIADEESWIKEKKLLVGSDDYGRDLTGVQNLKKKHKRLEAELASHQPA------------ 673
++ + I++K+ + ++ L+++ +E +
Sbjct: 907 ELEAQ---IEKKRKRL-----SELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQA 958
Query: 674 -IQNVQETGEKLMDVSNLGVPEIEQRLKLLNQAWSELKQLAANRGQKLD 721
+Q V+E L V+ L + E E+ LK L++ + +L R L+
Sbjct: 959 ELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILE 1007
>gnl|CDD|222407 pfam13833, EF_hand_6, EF-hand domain pair.
Length = 53
Score = 36.3 bits (85), Expect = 0.004
Identities = 13/60 (21%), Positives = 24/60 (40%), Gaps = 7/60 (11%)
Query: 1187 KSGKLNQTEFKSCLRALGYDLPMVEEGQPDPEFEAILDLVDPNRDGHVSLQEYMAFMISK 1246
+ G + + E K L LG L + E + + D + DG +S +E+ +
Sbjct: 1 EKGLITREELKRALALLGISLS-------EEEVDILFREFDTDGDGKISFEEFCVLLQRL 53
Score = 35.2 bits (82), Expect = 0.010
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 1175 EFSMMFKHFDKDKSGKLNQTEFKSCLRAL 1203
E ++F+ FD D GK++ EF L+ L
Sbjct: 25 EVDILFREFDTDGDGKISFEEFCVLLQRL 53
>gnl|CDD|238131 cd00213, S-100, S-100: S-100 domain, which represents the largest
family within the superfamily of proteins carrying the
Ca-binding EF-hand motif. Note that this S-100 hierarchy
contains only S-100 EF-hand domains, other EF-hands have
been modeled separately. S100 proteins are expressed
exclusively in vertebrates, and are implicated in
intracellular and extracellular regulatory activities.
Intracellularly, S100 proteins act as Ca-signaling or
Ca-buffering proteins. The most unusual characteristic of
certain S100 proteins is their occurrence in
extracellular space, where they act in a cytokine-like
manner through RAGE, the receptor for advanced glycation
products. Structural data suggest that many S100 members
exist within cells as homo- or heterodimers and even
oligomers; oligomerization contributes to their
functional diversification. Upon binding calcium, most
S100 proteins change conformation to a more open
structure exposing a hydrophobic cleft. This hydrophobic
surface represents the interaction site of S100 proteins
with their target proteins. There is experimental
evidence showing that many S100 proteins have multiple
binding partners with diverse mode of interaction with
different targets. In addition to S100 proteins (such as
S100A1,-3,-4,-6,-7,-10,-11,and -13), this group includes
the ''fused'' gene family, a group of calcium binding
S100-related proteins. The ''fused'' gene family includes
multifunctional epidermal differentiation proteins -
profilaggrin, trichohyalin, repetin, hornerin, and
cornulin; functionally these proteins are associated with
keratin intermediate filaments and partially crosslinked
to the cell envelope. These ''fused'' gene proteins
contain N-terminal sequence with two Ca-binding EF-hands
motif, which may be associated with calcium signaling in
epidermal cells and autoprocessing in a calcium-dependent
manner. In contrast to S100 proteins, "fused" gene family
proteins contain an extraordinary high number of almost
perfect peptide repeats with regular array of polar and
charged residues similar to many known cell envelope
proteins.
Length = 88
Score = 36.7 bits (86), Expect = 0.008
Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 5/63 (7%)
Query: 1184 DKDKSGKLNQTEFKSCLRALGYDLPMVEEGQPDPE-FEAILDLVDPNRDGHVSLQEYMAF 1242
+ DK L++ E K L +LP + Q DPE + I+ +D N+DG V QE++
Sbjct: 21 EGDK-DTLSKKELKELLET---ELPNFLKNQKDPEAVDKIMKDLDVNKDGKVDFQEFLVL 76
Query: 1243 MIS 1245
+
Sbjct: 77 IGK 79
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 39.7 bits (93), Expect = 0.013
Identities = 51/356 (14%), Positives = 148/356 (41%), Gaps = 28/356 (7%)
Query: 335 EALIKKHEDFDKAINAHEEKIGALQTLADQLIAADHYAAKPIDDKRKQVLDRWRLLKEAL 394
+ L ++ + + + EE++ +L+ L +D +++ + L+ L
Sbjct: 670 KELEEELAELEAQLEKLEEELKSLKNELRSL-----------EDLLEELRRQLEELERQL 718
Query: 395 IEKRSRL-GESQTLQQFSRDADEMENWIAEKLQLATEESYKDPANIQSKHQKHQAFEAEL 453
E + L + L+Q +E+E + E L+ EE + ++ + + + A+L
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEE-LEEELEELQERLEELEEELESLEEALAKL 777
Query: 454 AANADRIQSVLAMGQNLIDKRQCVGSEEAVQARLASIADQWEFLTQKTTEKSLKLKEANK 513
+ ++ Q +++ + E + RL ++ + E L Q+ +++E +
Sbjct: 778 KEEIEELEEKRQALQEELEELE--EELEEAERRLDALERELESLEQRRERLEQEIEELEE 835
Query: 514 QR-TYIAAVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKKHQLVEADIQAHDDRIKDMN 572
+ + +L+ L E+E L E+ ++L ++ ++ +E +++ ++ +++
Sbjct: 836 EIEELEEKLDELEEELEELEKEL--EELKEELEELEAEKEE---LEDELKELEEEKEELE 890
Query: 573 GQADSLIDSGQFDASSIQEKRQSINERYERIKNLAAHRQARLNEANTLHQFFRDIADEES 632
+ L I++ R+ + E +++ L E ++ + E
Sbjct: 891 EELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELER- 949
Query: 633 WIKEKKLLVGSDDYG----RDLTGVQNLKKKHKRLEAELASHQPAIQNVQETGEKL 684
+ ++L + G R + + ++++++ L+++ + A + + E E+L
Sbjct: 950 --EIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEEL 1003
Score = 35.1 bits (81), Expect = 0.29
Identities = 49/317 (15%), Positives = 109/317 (34%), Gaps = 15/317 (4%)
Query: 54 AALKIQTQLQDLNQKWTSLQQLTAERATQLGSAHEVQRFHRDVDETKDWIQEKDEALNNN 113
+ ++ L + ++L A E++ ++ +D ++E L
Sbjct: 655 GSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEEL 714
Query: 114 DLGKDLRSVQALQRKHEGLERDLAALGDKIRQLDETANRLMQTHPETAEQTYAKQKEINE 173
+ R ++ L+R+ LE +L L ++ +L+E L + E E+ ++E+
Sbjct: 715 E-----RQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELES 769
Query: 174 EWTQLTAKANTRKEKLLDSYDLQRFLSDYRDLMSWINSMMGLVSSDELANDVTGAEALLE 233
L +E LQ L + + + + D L ++ E E
Sbjct: 770 LEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRL-----DALERELESLEQRRE 824
Query: 234 RHQEHRTEIDARTGTFQAFDLFGQQLLQSGHYASVEIQDKLGNLAEAREDLEKAWIARRM 293
R ++ E++ + ++ L+ E++++L L +E+LE
Sbjct: 825 RLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEE 884
Query: 294 QLDQCLELQLFYRDC-EQAENWMSAREAFLNAEEVDSKTDNVEALI----KKHEDFDKAI 348
+ ++ E + + + L E + VE + E D
Sbjct: 885 EKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLE 944
Query: 349 NAHEEKIGALQTLADQL 365
E +I L+ + L
Sbjct: 945 TELEREIERLEEEIEAL 961
Score = 33.1 bits (76), Expect = 1.3
Identities = 40/246 (16%), Positives = 107/246 (43%), Gaps = 18/246 (7%)
Query: 269 EIQDKLGNLAEAREDLEKAWIARRMQLDQC-LELQLFYRDCEQAENWMSAREAFLNAEEV 327
++++L +L + E+L + QL++ EL + EQ ++ + E EE+
Sbjct: 692 SLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEE--ELEEL 749
Query: 328 DSKTDNVEALIKKHEDFDKAINAHEEKIGALQTLADQLIAADHYAAKPIDDKRKQVLDRW 387
+ + + ++ +++ E+ + + EE + L+ ++L + +++ +++ +
Sbjct: 750 EEELEELQERLEELEE---ELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAE 806
Query: 388 RLLKEALIEKRSRLGESQTLQQFSRDA-DEMENWIAEK--LQLATEESYKDPANIQSKHQ 444
R L E S + L+Q + +E+E + L+ EE K+ ++ + +
Sbjct: 807 RRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELE 866
Query: 445 KHQAFEAELAANADRIQSVLAMGQNLIDKRQCVGSEEAVQARLASIADQWEFLTQKTTEK 504
+ +A + EL ++ + + + +++ LA + ++ E L ++ E
Sbjct: 867 ELEAEKEELEDELKELEEEKEELEEELRE---------LESELAELKEEIEKLRERLEEL 917
Query: 505 SLKLKE 510
KL+
Sbjct: 918 EAKLER 923
Score = 32.4 bits (74), Expect = 1.9
Identities = 66/360 (18%), Positives = 124/360 (34%), Gaps = 36/360 (10%)
Query: 127 RKHEGLERDLAALGDKIRQLDETANRLMQTHPETAEQTYAKQKEINEEWTQLTAKANTRK 186
RK E ER L + + +L++ L E + +Q E E + +L A+
Sbjct: 173 RKEE-AERKLERTEENLERLEDLLEEL-----EKQLEKLERQAEKAERYQELKAELR-EL 225
Query: 187 EKLLDSYDLQRFLSDYRDLMSWINSMMGLVSSDELANDVTGAEALLERHQEHRTEIDART 246
E L L+ + +L EL+ E L E +E EI+
Sbjct: 226 ELALLLAKLKELRKELEELEE------------ELSRLEEELEELQEELEEAEKEIEELK 273
Query: 247 GTFQAFDLFGQQLLQSGHYASVEIQDKLGNLAEAREDLEKAWIARRMQLDQCLELQLFYR 306
+ ++L + EI++ G ++ RE LE+ ++ EL+
Sbjct: 274 SELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELK---- 329
Query: 307 DCEQAENWMSAREAFLN-AEEVDSKTDNVEALIKKHED-FDKAINAHEEKIGALQTLADQ 364
E+ E E EE++ +E ++ E+ + EE AL+ +
Sbjct: 330 --EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAE 387
Query: 365 LIAADHYAAKPIDDKRKQV------LDRWRLLKEALIEKRSRLGESQTLQQFSRDADEME 418
L A +++ ++++ L+R E L E+ L Q +
Sbjct: 388 LEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEE---L 444
Query: 419 NWIAEKLQLATEESYKDPANIQSKHQKHQAFEAELAANADRIQSVLAMGQNLIDKRQCVG 478
N E+L+ EE ++ + + Q L +++ L + Q V
Sbjct: 445 NEELEELEEQLEELRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAEQRASQGVR 504
Score = 32.4 bits (74), Expect = 2.4
Identities = 44/279 (15%), Positives = 110/279 (39%), Gaps = 34/279 (12%)
Query: 500 KTTEKSLKLKEANKQRTYIAAVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKKHQLVEA 559
+ E+ +LK ++ + L E+E L E+ + ++ L ++ + E
Sbjct: 210 EKAERYQELKAELRELELALLLAKLKELRKELEELE--EELSRLEEELEELQEELEEAEK 267
Query: 560 DIQAHDDRIKDMNGQADSL------------------------IDSGQFDASSIQEKRQS 595
+I+ ++++ + + L ++ + + ++E+ +
Sbjct: 268 EIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEE 327
Query: 596 INERYERIKNLAAHRQARLNEANTLHQFFRDIADEESWIKEKKLLVGSDDYGRDLTGVQN 655
+ E+ E +K R+ L E L + +E E+KL ++ ++
Sbjct: 328 LKEKIEALKEELEERETLLEELEQLLAELEEAKEEL----EEKLSALLEELEELFEALRE 383
Query: 656 ----LKKKHKRLEAELASHQPAIQNVQETGEKLMDVSNLGVPEIEQRLKLLNQAWSELKQ 711
L+ + + EL + I++++E E+L + E+++ L + +EL++
Sbjct: 384 ELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEE 443
Query: 712 LAANRGQKLDESLTYQHFLAKVEEEEAWISEKQQLLSVE 750
L + ++ + L ++E E A + E+ Q L E
Sbjct: 444 LNEELEELEEQLEELRDRLKELERELAELQEELQRLEKE 482
Score = 32.0 bits (73), Expect = 2.5
Identities = 49/282 (17%), Positives = 125/282 (44%), Gaps = 15/282 (5%)
Query: 433 YKDPANIQSKHQKHQAFEAELAANADRIQSVLAMGQNLIDK-RQCVGSEEAVQARLASIA 491
++ + ++ ++ + E ELA +++ + ++L ++ R E ++ +L +
Sbjct: 656 SRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELE 715
Query: 492 DQWEFLTQKTTEKSLKLKEANKQRT--------YIAAVKDLDFWLGEVESLLTS-EDSGK 542
Q E L ++ +L++ + +++L L E+E L S E++
Sbjct: 716 RQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALA 775
Query: 543 DL-ASVQNLIKKHQLVEADIQAHDDRIKDMNGQADSLIDSGQFDASSIQEKRQSINERYE 601
L ++ L +K Q ++ +++ ++ +++ + D+L + + Q I E E
Sbjct: 776 KLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEE 835
Query: 602 RIKNLAAHRQARLNEANTLHQFFRDIADEESWIKEKKLLVGSDDYGRDLTGVQNLKKKHK 661
I+ L E L + ++ +E ++ +K + D+ + L+++ +
Sbjct: 836 EIEELEEKLDELEEELEELEKELEELKEELEELEAEKEEL-EDELKELEEEKEELEEELR 894
Query: 662 RLEAELASHQPAIQNVQE---TGEKLMDVSNLGVPEIEQRLK 700
LE+ELA + I+ ++E E ++ + +PE+E+ L+
Sbjct: 895 ELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELE 936
>gnl|CDD|238009 cd00052, EH, Eps15 homology domain; found in proteins implicated in
endocytosis, vesicle transport, and signal transduction.
The alignment contains a pair of EF-hand motifs,
typically one of them is canonical and binds to Ca2+,
while the other may not bind to Ca2+. A hydrophobic
binding pocket is formed by residues from both EF-hand
motifs. The EH domain binds to proteins containing NPF
(class I), [WF]W or SWG (class II), or H[TS]F (class III)
sequence motifs.
Length = 67
Score = 34.1 bits (79), Expect = 0.033
Identities = 17/64 (26%), Positives = 28/64 (43%), Gaps = 9/64 (14%)
Query: 1176 FSMMFKHFDKDKSGKLNQTEFKSCLRALGYDLPMVEEGQPDPEFEAILDLVDPNRDGHVS 1235
+ +F+ D D G ++ E + L G LP Q I DL D ++DG +
Sbjct: 1 YDQIFRSLDPDGDGLISGDEARPFLGKSG--LPRSVLAQ-------IWDLADTDKDGKLD 51
Query: 1236 LQEY 1239
+E+
Sbjct: 52 KEEF 55
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 34.4 bits (79), Expect = 0.54
Identities = 92/592 (15%), Positives = 216/592 (36%), Gaps = 65/592 (10%)
Query: 95 DVDETKDWIQEKDEALNNNDLGKDLRSVQALQRKHEGLERDLAALGDKIRQLDETANRLM 154
D+++ + ++E+ + L + ++ + + L+++ E LE LA L ++ +L+E RL+
Sbjct: 200 DIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLL 259
Query: 155 QTHPETAEQTYAKQKEINE---EWTQLTAKANTRKEKLLDSYDLQRFLSDYRDLMSWINS 211
+ E +++E+ E +L K +E + +L+ L R L
Sbjct: 260 EIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRAL------ 313
Query: 212 MMGLVSSDELANDVTGAEALLERHQEHRTEIDARTGTFQAFDLFGQQLLQSGHYASVEIQ 271
L +EL + E LE+ +E ++++ +LL+ E++
Sbjct: 314 ---LEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLK---ELE 367
Query: 272 DKLGNLAEAREDLEKAWIARRMQLDQCLELQLFYRDCEQAENWMSAREAFLNAEE--VDS 329
++L L + E + + + E + + + L E ++
Sbjct: 368 ERLEELEKELEKALERLKQLEEAIQELKEEL------AELSAALEEIQEELEELEKELEE 421
Query: 330 KTDNVEALIKKHEDFDKAINAHEEKIGALQTLADQ-----LIAADHYAAKPIDDKRKQVL 384
+E L ++ + ++ IN E K + LA + + + L
Sbjct: 422 LERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYEL 481
Query: 385 DRWRLLKEALIEK--RSRLGESQTLQQFSRDADEMENWIAEKLQLATEESYKD-----PA 437
+ L +E EK E + L++ R+ +E + E + EE +
Sbjct: 482 ELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENL 541
Query: 438 NIQSKHQKHQA----FEAELAANADRIQSVLAMGQNLIDKRQCVGSEEAVQARLASIADQ 493
+ + K + + EL DR+Q + + + L R E ++ RL + +
Sbjct: 542 LEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKK 601
Query: 494 WEFLTQKTTEKSLKLKEANKQRTYIAAVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKK 553
+ L ++ ++ L+ E++ ++L S +
Sbjct: 602 LKELEERLSQLEELLQSLELSEAENEL-----------------EEAEEELESELEKLNL 644
Query: 554 HQLVEADIQAHDDRIKDMNGQADSLIDSGQFDASSIQEKRQSINERYERIKNLAAHRQAR 613
+E +QA + +++ + ++ I + + I+ + Q + E ++ L +
Sbjct: 645 QAELEELLQAALEELEEKVEELEAEI---RRELQRIENEEQLEEKL-EELEQLEEELEQL 700
Query: 614 LNEANTLHQFFRDIADEESWIKEKKLLVGSDDYGRDLTGVQNLKKKHKRLEA 665
E L + +I ++ +K + ++ L+K + LE
Sbjct: 701 REELEELLKKLGEIEQLIEELESRK-----AELEELKKELEKLEKALELLEE 747
>gnl|CDD|205383 pfam13202, EF_hand_3, EF hand.
Length = 25
Score = 28.8 bits (66), Expect = 1.0
Identities = 6/25 (24%), Positives = 11/25 (44%)
Query: 1219 FEAILDLVDPNRDGHVSLQEYMAFM 1243
+ + D N DG +S +E +
Sbjct: 1 LKDLFRQFDTNGDGKISKEELKRLL 25
Score = 27.6 bits (63), Expect = 2.4
Identities = 8/22 (36%), Positives = 14/22 (63%)
Query: 1179 MFKHFDKDKSGKLNQTEFKSCL 1200
+F+ FD + GK+++ E K L
Sbjct: 4 LFRQFDTNGDGKISKEELKRLL 25
>gnl|CDD|132791 cd06881, PX_SNX15_like, The phosphoinositide binding Phox Homology
domain of Sorting Nexin 15-like proteins. The PX domain
is a phosphoinositide (PI) binding module present in
many proteins with diverse functions such as cell
signaling, vesicular trafficking, protein sorting, and
lipid modification, among others. Members of this
subfamily have similarity to sorting nexin 15 (SNX15),
which contains an N-terminal PX domain and a C-terminal
Microtubule Interacting and Trafficking (MIT) domain.
SNXs make up the largest group among PX domain
containing proteins. They are involved in regulating
membrane traffic and protein sorting in the endosomal
system. The PX domain of SNXs binds PIs and targets the
protein to PI-enriched membranes. SNX15 plays a role in
protein trafficking processes in the endocytic pathway
and the trans-Golgi network. The PX domain of SNX15
interacts with the PDGF receptor and is responsible for
the membrane association of the protein. Other members
of this subfamily contain an additional C-terminal
kinase domain, similar to human RPK118, which binds
sphingosine kinase and the antioxidant peroxiredoxin-3
(PRDX3). RPK118 may be involved in the transport of
proteins such as PRDX3 from the cytoplasm to its site of
function in the mitochondria.
Length = 117
Score = 31.1 bits (71), Expect = 1.2
Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 5/44 (11%)
Query: 582 GQFDASSIQEKRQSINERYERIKNLAAHRQARLNEANTLHQFFR 625
G+FDA+ I+E+RQ+I E + + N A Q+ QFF
Sbjct: 77 GRFDAAVIEERRQAILELLDFVGNHPALYQSS-----AFQQFFE 115
>gnl|CDD|130078 TIGR01005, eps_transp_fam, exopolysaccharide transport protein
family. The model describes the exopolysaccharide
transport protein family in bacteria. The transport
protein is part of a large genetic locus which is
associated with exopolysaccharide (EPS) biosynthesis.
Detailed molecular characterization and gene fusion
analysis revealed atleast seven gene products are
involved in the overall regulation, which among other
things, include exopolysaccharide biosynthesis, property
of conferring virulence and exopolysaccharide export
[Transport and binding proteins, Carbohydrates, organic
alcohols, and acids].
Length = 754
Score = 33.1 bits (75), Expect = 1.4
Identities = 41/205 (20%), Positives = 70/205 (34%), Gaps = 39/205 (19%)
Query: 33 EVRLAEMNEIAMQLMS-LGQTEAALKIQTQLQDLNQKWTSLQQLTAERATQLGSAHEVQR 91
+ R AE A + S L A QL +LN +L+ RA + +
Sbjct: 209 QSRDAEAEVAAYRAQSDLLMGNNATLATQQLAELNT------ELSRARANRAAAEGTADS 262
Query: 92 FHRDVDETKDWIQEKD---EALNNNDLGKDLRSVQALQRKHE---------------GLE 133
+ + + L DL + LR QA R +
Sbjct: 263 VKKALQNGGSLDVLPEVLSSQLKLEDLIQRLRERQAELRATIADLSTTMLANHPRVVAAK 322
Query: 134 RDLAALGDKIRQLDETANRLMQTHPETAEQTYAKQKEINEEWTQLTAKANTRKEKLLDSY 193
LA L +IR ++ ++ A+ A++ ++ + QL A + E+ +D
Sbjct: 323 SSLADLDAQIRSE---LQKITKSLLMQADAAQARESQLVSDVNQLKAASAQAGEQQVDLD 379
Query: 194 DLQR-----------FLSDYRDLMS 207
LQR +L++YR S
Sbjct: 380 ALQRDAAAKRQLYESYLTNYRQAAS 404
>gnl|CDD|213544 TIGR00636, PduO_Nterm, ATP:cob(I)alamin adenosyltransferase. This
model represents as ATP:cob(I)alamin adenosyltransferase
family corresponding to the N-terminal half of
Salmonella PduO, a 1,2-propanediol utilization protein
that probably is bifunctional. PduO represents one of at
least three families of ATP:corrinoid
adenosyltransferase: others are CobA (which partially
complements PduO) and EutT. It was not clear originally
whether ATP:cob(I)alamin adenosyltransferase activity
resides in the N-terminal region of PduO, modeled here,
but this has now become clear from the characterization
of MeaD from Methylobacterium extorquens [Biosynthesis
of cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 171
Score = 31.2 bits (71), Expect = 2.1
Identities = 22/98 (22%), Positives = 42/98 (42%), Gaps = 16/98 (16%)
Query: 521 VKDLDFWLGEVESLLTSEDSGKDLASVQNLI-----------KKHQLVEADIQAHDDRIK 569
+ +L+ ++G SLL ED +DL +QN + ++ E D++ ++RI
Sbjct: 31 LDELNSFIGVALSLLKWEDLKEDLERIQNDLFDIGGDLATPGDTKKITEEDVKWLEERID 90
Query: 570 DMNGQADSL----IDSGQFDASSIQEKRQSINERYERI 603
+ L + G A+ + R ++ R ER
Sbjct: 91 QYRKELPPLKLFVLPGGTPAAAFLHVAR-TVARRAERR 127
>gnl|CDD|240150 cd05023, S-100A11, S-100A11: S-100A11 domain found in proteins
similar to S100A11. S100A11 is a member of the S-100
domain family within EF-hand Ca2+-binding proteins
superfamily. Note that the S-100 hierarchy, to which this
S-100A11 group belongs, contains only S-100 EF-hand
domains, other EF-hands have been modeled separately.
S100 proteins exhibit unique patterns of tissue- and cell
type-specific expression and have been implicated in the
Ca2+-dependent regulation of diverse physiological
processes, including cell cycle regulation,
differentiation, growth, and metabolic control . S100
proteins have also been associated with a variety of
pathological events, including neoplastic transformation
and neurodegenerative diseases such as Alzheimer's,
usually via over expression of the protein. S100A11 is
expressed in smooth muscle and other tissues and involves
in calcium-dependent membrane aggregation, which is
important for cell vesiculation . As is the case for many
other S100 proteins, S100A11 is homodimer, which is able
to form a heterodimer with S100B through subunit
exchange. Ca2+ binding to S100A11 results in a
conformational change in the protein, exposing a
hydrophobic surface that interacts with target proteins.
In addition to binding to annexin A1 and A6 S100A11 also
interacts with actin and transglutaminase.
Length = 89
Score = 29.7 bits (67), Expect = 2.1
Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 5/62 (8%)
Query: 1181 KHFDKD-KSGKLNQTEFKSCLRALGYDLPMVEEGQPDPE-FEAILDLVDPNRDGHVSLQE 1238
K+ KD S +L++TEF S + +L + Q DP + ++ +D N DG + QE
Sbjct: 17 KYAGKDGDSYQLSKTEFLSFMNT---ELASFTKNQKDPGVLDRMMKKLDLNSDGQLDFQE 73
Query: 1239 YM 1240
++
Sbjct: 74 FL 75
>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
with the myosin-like domain [Function unknown].
Length = 499
Score = 32.3 bits (73), Expect = 2.2
Identities = 34/212 (16%), Positives = 72/212 (33%), Gaps = 22/212 (10%)
Query: 4 QVQDVGEDLEQVEVMQKKFDD----FQSDLKANEVRLAEMN-EIAMQLMSLGQ-TEAALK 57
++ + E ++ + + +A LA +A L + T+ A
Sbjct: 89 ELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQD 148
Query: 58 IQTQLQDLNQKWTSL----QQLTAERATQLGSAHEVQRFHRDVDETKDWIQEKDEALNNN 113
+QT+L+ L ++ L Q L A + SA +++ D+ I+++ + L
Sbjct: 149 LQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATR 208
Query: 114 DLGKDLRSVQALQRKHEGLERDLAALGDKIRQLDETANRLMQTHPETAEQTYAKQKEINE 173
+ QA + I+Q D ++ Q AEQ +++++
Sbjct: 209 A-----NAAQARTEELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAEQIRERERQLQR 263
Query: 174 EWTQLTAKANTRKEKLLDSYDLQRFLSDYRDL 205
T + L+ + Y L
Sbjct: 264 LETAQARLEQEVAQ-------LEAYYQAYVRL 288
>gnl|CDD|215520 PLN02964, PLN02964, phosphatidylserine decarboxylase.
Length = 644
Score = 32.1 bits (73), Expect = 2.3
Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
Query: 1222 ILDLVDPNRDGHVSLQEYMAFMISKETENVQSSEEIENAFHAIAASDRPYVTKEELYANL 1281
IL +VD + DG +S E+ + K N+ ++ + E F A + VT +EL A L
Sbjct: 184 ILAIVDYDEDGQLSFSEFSDLI--KAFGNLVAANKKEELFKAADLNGDGVVTIDELAALL 241
Query: 1282 TKE 1284
+
Sbjct: 242 ALQ 244
>gnl|CDD|238062 cd00114, LIGANc, NAD+ dependent DNA ligase adenylation domain. DNA
ligases catalyze the crucial step of joining the breaks
in duplex DNA during DNA replication, repair and
recombination, utilizing either ATP or NAD(+) as a
cofactor, but using the same basic reaction mechanism.
The enzyme reacts with the cofactor to form a
phosphoamide-linked AMP with the amino group of a
conserved Lysine in the KXDG motif, and subsequently
transfers it to the DNA substrate to yield adenylated
DNA. This alignment contains members of the NAD+
dependent subfamily only.
Length = 307
Score = 31.4 bits (72), Expect = 2.8
Identities = 19/76 (25%), Positives = 27/76 (35%), Gaps = 14/76 (18%)
Query: 1200 LRALGYDLPMVEEGQPDPEFEAILDLVDPNRDGHVSLQEYMAFMISKETENVQSSEEIEN 1259
LR Y L E P + EA+ L + F +S ET ++ EE+
Sbjct: 211 LRFFIYGLGEAEGLGPKTQSEALAFLKE------------WGFPVSPETRLCKNIEEVLA 258
Query: 1260 AFHAIAA--SDRPYVT 1273
+ I A PY
Sbjct: 259 FYDEIEAKRDSLPYEI 274
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 31.6 bits (72), Expect = 2.8
Identities = 25/129 (19%), Positives = 48/129 (37%), Gaps = 12/129 (9%)
Query: 337 LIKKHEDFDKAINAH-EEKIGALQTLADQLIAADHYAAKPIDDKRKQVLDRWRLLKEALI 395
L+ ++E+ + + E LA L+ + K ++ Q+ R LKE +
Sbjct: 32 LLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKK---ERLNQIRARISQLKEEIE 88
Query: 396 EKRSRLGESQ--------TLQQFSRDADEMENWIAEKLQLATEESYKDPANIQSKHQKHQ 447
+KR R+ E + L S ++ EKLQ + + + S + +
Sbjct: 89 QKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNALHSLLAEKR 148
Query: 448 AFEAELAAN 456
+F A
Sbjct: 149 SFLCRELAK 157
>gnl|CDD|202014 pfam01858, RB_A, Retinoblastoma-associated protein A domain. This
domain has the cyclin fold as predicted.
Length = 192
Score = 30.8 bits (70), Expect = 3.0
Identities = 32/185 (17%), Positives = 60/185 (32%), Gaps = 37/185 (20%)
Query: 1073 QWLTETRTSMM-EGTGSLEQQLEAIKRKAAEVRSRRSDLKKIEDLGAILEEHLILDNRYT 1131
L E +S+ + +L Q L + R E + +K+++ +G + E +
Sbjct: 11 SQLREVLSSLSDAPSETLLQYLNSCDRNPTE-----AIIKRVKIIGEEIFETFAEAEDQS 65
Query: 1132 EHSTVGLAQQWDQLDQLGMRMQHNLEQQIQARNQSGVSEDALKEFSMMFKHFDKDKSGKL 1191
+ Q +L +++ + + + I + + D D S L
Sbjct: 66 -PKEIA-----SQRFKLALKLYYRVLESILKAEEKRLH--------------DMDLSNLL 105
Query: 1192 NQTEFKSCLRALGYDLPMVEEGQPDPEFEAILDLVDPNRDGHVSLQEYMAFMISKETENV 1251
Q F L A +L + D F IL++ AF K E+
Sbjct: 106 EQEAFHRSLLACCLELVLATYKTTDLSFPWILEVFG-----------ITAFDFYKVIESF 154
Query: 1252 QSSEE 1256
E
Sbjct: 155 IRHEG 159
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 32.0 bits (73), Expect = 3.3
Identities = 114/684 (16%), Positives = 212/684 (30%), Gaps = 145/684 (21%)
Query: 55 ALKIQTQLQDLNQKWTSLQQLTAE-RATQLGSAHEVQRF-------HRDVDETKDWIQEK 106
K+Q + L LQ L E A + A E + + + +D ++E
Sbjct: 248 IEKLQEDFEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEA 307
Query: 107 DEALN------NNDLGKDLRSVQALQRKHEG-LERDLAALGDKIRQLDETANRLMQTHPE 159
+ LN N L D ++ L+ + + D+ L + QL + L +
Sbjct: 308 RDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQADLDQLPSIRSELEEVEAR 367
Query: 160 TAEQTYAKQKEINEEWTQLTAKANTRKEKLLDSYDLQRFLSDYRDLMSWINSMMGLVSSD 219
T K +++ ++ +L K + E+ L+ + +
Sbjct: 368 LDALT-GKHQDVQRKYERLKQKIKEQLERDLEKNN------------------------E 402
Query: 220 ELANDVTGAEALLERHQEHRTEIDARTGTFQAFDLFGQQLLQSGHYASVEIQDKLGNLAE 279
LA + +E QA + QL Q +E ++ L
Sbjct: 403 RLAAIREEKDRQKAAIEED----------LQALE---SQLRQQLEAGKLEFNEEEYELEL 449
Query: 280 AREDLEKAWIARRMQLDQCLELQLFYRDCEQAENWMSAREAFLNAEEVDSKTDNVEALIK 339
L++ + ++ +L++ E+A+ EE + NVE L
Sbjct: 450 RLGRLKQRLDSATATPEELEQLEINDEALEKAQ------------EEQEQAEANVEQLQS 497
Query: 340 KHE----DFDKAINAH-------EEKIGALQTLADQLIAAD----HYAAKPIDDKRK--- 381
+ D+A+ A + AL L QL H+ +
Sbjct: 498 ELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQAGSLLHFLRNEAPGWEESIG 557
Query: 382 QVLDRWRLLKEALIEKRSRLGESQTLQQFSRDAD--------EMENWIAEKLQLATEESY 433
+V+ L + L + +S TL S D E + E+LQ A E
Sbjct: 558 KVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDYAANETELRERLQQAEEA-- 615
Query: 434 KDPANIQSKHQKHQAFEAEL-----------AANADRIQSVLAMGQNLI--------DKR 474
+QS K + E +L A A+ ++ +L K
Sbjct: 616 -----LQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKD 670
Query: 475 QCVGSEEAVQARLASIADQWEFLTQ--KTTEKSLKLKEANKQRTYIAAVKDLDFWLGEVE 532
+ E A+ R Q L K + + + + + VE
Sbjct: 671 KL---ELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVE 727
Query: 533 SLLTSEDSGKDLASVQNLIKKHQLVEADIQAHDDRIKDMNGQADSLIDSGQFDASSIQEK 592
L + L +EA R+K++ Q D + S D ++++E
Sbjct: 728 GELD--------NQLAQLSA---AIEAARTQAKARLKELKKQYDRELASLDVDPNTVKEL 776
Query: 593 RQSINERYERIKNLAAHRQARLNEANTLHQFFRDIADEESWIKEKKLLVGSDDYGRDLTG 652
++ I E I+ +A R E F + E+W+ L +
Sbjct: 777 KRQIEELETTIERIAVRRP----EVREYRAFMQ-----ETWLHRDSLREERPNLA---IQ 824
Query: 653 VQNLKKKHKRLEAELASHQPAIQN 676
++ L+ + L+ EL +
Sbjct: 825 LRELESSAEELQQELTRLIKDTKL 848
Score = 30.8 bits (70), Expect = 7.1
Identities = 84/488 (17%), Positives = 178/488 (36%), Gaps = 74/488 (15%)
Query: 10 EDLEQVEVMQKKFDDFQSDLKANEVRLAEMNEIAMQLMSLGQTEAALKIQTQLQDLNQKW 69
ED EQ+ ++ + +L A+E RLAE E + EA +++ QL+ L +
Sbjct: 253 EDFEQLLSLELRLQHLHGELVADEERLAEEQEE--------RQEAKNRLRQQLRTLEDQL 304
Query: 70 TSLQQLTAERATQLGSA-HEVQRFHRDVD--ETKDWIQEKDEALNNNDLGKDLRSVQALQ 126
++ E +L +A ++ +++ E + E + L DL + +++
Sbjct: 305 ---KEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADI---EQLQADLDQLPSIR 358
Query: 127 RKHEGLERDLAALGDKIRQLDETANRLMQ-------THPETAEQTYAKQKE--------I 171
+ E +E L AL K + + RL Q E + A +E I
Sbjct: 359 SELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAI 418
Query: 172 NEEWTQLTAKANTRKEKLLDSYDLQRFLSDYRDLMSWINSMM-GLVSSDELANDVTGAEA 230
E+ L ++ + E ++ + + + + + + ++ E + +
Sbjct: 419 EEDLQALESQLRQQLEAGKLEFNEEEY--ELELRLGRLKQRLDSATATPEELEQLEINDE 476
Query: 231 LLERHQEHRTEIDARTGTFQAFDLFGQQLLQSGHYASVEIQDKLGNLAEAREDLEK---- 286
LE+ QE + + +A Q+ ++ A + +L L +A ++LE
Sbjct: 477 ALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSP 536
Query: 287 --------------------AWIARRMQLDQCLELQLFYRDCEQAENWMSAREAFLNAEE 326
+ L++ +L + ++ L +
Sbjct: 537 QAGSLLHFLRNEAPGWEESIGKVISPELLERT-DLDPQLVEGSDSDTLYGVSLD-LQRLD 594
Query: 327 VDSKTDNVEALIKKHEDFDKAINAHEEKIGALQTLADQLIAADHYAAKPIDDKRKQVLDR 386
V N L ++ + ++A+ + K + Q A ++++++ +
Sbjct: 595 VPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQ-------ANAELEEQKRAEAEA 647
Query: 387 WRLLKEALIEKRSRLGESQTLQQFSRDADEMENWIAEKLQLATEESYKDPANIQSKHQKH 446
LK+A R L Q QQ D++E IAE+ Q A + + A ++ ++
Sbjct: 648 RTALKQA----RLDLQRLQNEQQ--SLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQ 701
Query: 447 QAFEAELA 454
QAF L
Sbjct: 702 QAFLEALK 709
>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
partitioning [Cell division and chromosome partitioning].
Length = 1480
Score = 31.8 bits (72), Expect = 3.7
Identities = 117/646 (18%), Positives = 221/646 (34%), Gaps = 73/646 (11%)
Query: 410 FSRDADEMENWIAEKLQLATEESYKDPANIQSKHQKHQAFEAELAANADRIQSVLAMGQN 469
+ R A+E + + L+ E Y + ++ +H ELA + A Q
Sbjct: 274 YMRHANERRVHLDQALEF-RRELYTSRQQLAAEQYRHVDMSRELAELNGAEGDLEADYQA 332
Query: 470 LIDKRQCVGSEEAVQARLASIADQWEFLTQKTTEKSLKLKEANKQRT-----YIAAVKDL 524
D V + Q ++ E LT + E++ ++EAN+++ AA ++
Sbjct: 333 ASDHLNLVQTALRQQEKIERYQADLEELTIRLEEQNEVVEEANERQEENEARAEAAELEV 392
Query: 525 DFWLGEVESLLTSEDSGKDLASVQNLIKKHQLVEADIQAHDDRIKDMNGQADSLIDSGQF 584
D E++S L D+ + + + Q + A +R K++ D DS +
Sbjct: 393 D----ELKSQLADYQQALDVQQTRAI-QYQQAIAA-----LERAKELCHLPDLTADSAEE 442
Query: 585 DASSIQEKRQSINERYERIKNLAAHRQARLNEANTLHQFFRDIADE----ESWIKEKKLL 640
+ Q K + E+ ++ + QA ++ +Q IA E E+W ++LL
Sbjct: 443 WLETFQAKEEEATEKLLSLEQKMSMAQAAHSQFEQAYQLVVAIAGELARSEAWDVARELL 502
Query: 641 VGSDDYGRDLTGVQNLKKKHKRLEAELASHQPAIQNVQETGEKLMDVSNLGVPEIEQRLK 700
D VQ L+ + LE L Q A + + + ++ + E +
Sbjct: 503 REGPDQRHLAEQVQPLRMRLSELEQRLRQQQSAERLLADFCKRQGKNLDAEELEALHQEL 562
Query: 701 --LLNQAWSELKQLAANRG---QKLDE-SLTYQHFLAKVEEEEAWIS--------EKQQL 746
L+ + R Q+ ++ Q + + W++ +Q
Sbjct: 563 EALIESLSDSVSNAREQRMALRQEQEQLQSRIQSLMQRA---PVWLAAQNALEQLSEQSG 619
Query: 747 LSVEDYGDTMAAVQGLLKKHDAFETDFSVHRDRCADICSAGNKLIEAKNHHADSITQRCQ 806
D D +Q LL++ E + +V RD +A ++ IE + S QR
Sbjct: 620 EEFTDSQDVTEYMQQLLER----EREATVERDELGARKNALDEEIERLSQPGGSEDQRLN 675
Query: 807 QLQLKLDNLMALATKRKTKLMDNSAYLQFMW----KADVVESWIADKETHVKSEEYGRDL 862
L + ++ L+ ++++ Y ++ A VV KE + DL
Sbjct: 676 ALAERFGGVL-LSEIYDDVTIEDAPYFSALYGPSRHAIVVPDLSQVKEHLEGLTDCPEDL 734
Query: 863 STVQTLLTKQETFDAGLHAFEHEGIQNITTLKDQLVASNHDQTP------------AIVK 910
++ + D+ E E + Q S + P ++
Sbjct: 735 YLIEGDPQSFD--DSVFSVDELEKAVVVKIADRQWRYSRFPEIPLFGRAAREQRLESLHA 792
Query: 911 RHGDVIARWQKLLGDSNARKQRLLRMQEQFRQIEDLYLTFAKKASSFNSWFENAEEDLTD 970
+ R L D Q+ R+ + F + +L A FE E
Sbjct: 793 ERDVLSERHATLSFDV----QKTQRLHQAFSRFIGSHLAVA---------FEADPEAEIR 839
Query: 971 PVRCNSIEEIRALREAHAQFQASLSSAQADFEALAALDQQIKSFNV 1016
+ E RAL Q E + AL++ I N+
Sbjct: 840 QLNSRRNELERALSNHENDNQQQRIQFDQAKEGVTALNRLIPQLNL 885
>gnl|CDD|220663 pfam10266, Strumpellin, Hereditary spastic paraplegia protein
strumpellin. This is a family of proteins conserved
from plants to humans, in which two closely situated
point mutations in the human protein lead to the
condition of hereditary spastic paraplegia. Strumpellin
contains one known domain called a spectrin repeat that
consists of three alpha-helices of a characteristic
length wrapped in a left-handed coiled coil. The
spectrin proteins have multiple copies of this repeat,
which can then form multimers in the cell. Spectrin
associates with the cell membrane via spectrin repeats
in the ankyrin protein. The spectrin repeat is a
structural platform for cytoskeletal protein assemblies.
Length = 1077
Score = 31.5 bits (72), Expect = 4.2
Identities = 19/58 (32%), Positives = 31/58 (53%), Gaps = 3/58 (5%)
Query: 706 WSELKQLAANRGQKLDESLTYQHFLAKVEEEE---AWISEKQQLLSVEDYGDTMAAVQ 760
W E K+ A R Q+L E + L+KV+++E W +E + + DY D+ AA +
Sbjct: 410 WEESKKEAVERMQELAEFFSGTKPLSKVKKDENLQKWFAEISKEIEKLDYDDSTAAGR 467
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 31.2 bits (70), Expect = 4.3
Identities = 34/188 (18%), Positives = 71/188 (37%), Gaps = 10/188 (5%)
Query: 932 RLLRMQEQFRQIEDLYLTFAKKA--SSFNSWFENAEEDLTDPVRCNSIEEIRALREAHAQ 989
LL + + + + A +SW EN +E+ +P+ ++++ + A+
Sbjct: 211 DLLEKDMKAESVSVVLKDEKELARQERVSSW-ENFKEEPGEPLSRPALKK----EKQGAE 265
Query: 990 FQASLSSAQADFEALAALDQQIKSFNVGPNPYTWFTMEALEDTWRNLQKIIKERDIELAK 1049
+ ++ D + AA + + K + G +E E +Q + E +
Sbjct: 266 EEGEEGMSEEDLDVGAA-EIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKE 324
Query: 1050 EATRQDENDALRKEFAKHANAFHQWLTETRTSMMEGTGSLEQQ--LEAIKRKAAEVRSRR 1107
E +EN+ + A N + + E R M T + Q+ + E+ +R
Sbjct: 325 EDDENEENERHTELLADELNELEKGIEEKRRQMESATNPILQKRFESQLNVLLKELELKR 384
Query: 1108 SDLKKIED 1115
L+ ED
Sbjct: 385 KQLEMEED 392
>gnl|CDD|205692 pfam13514, AAA_27, AAA domain. This domain is found in a number of
double-strand DNA break proteins. This domain contains a
P-loop motif.
Length = 1118
Score = 31.3 bits (71), Expect = 4.4
Identities = 50/359 (13%), Positives = 110/359 (30%), Gaps = 52/359 (14%)
Query: 350 AHEEKIGALQTLADQLIAADHYAAKPIDDKRKQVLDRWRL---------LKEALIEKRSR 400
A+EE + + LAD+L+ + ++++ R RL L+ AL R
Sbjct: 531 AYEEAVESADQLADRLLREAQLVGELQSLRQQEEAARRRLEQLEKELEVLELALAALREA 590
Query: 401 LGESQTLQQFSRDADEMENWIAE--KLQLATEESYKDPANIQSKHQ---KHQAFEAELAA 455
EME+W+AE + +K A + + + +A
Sbjct: 591 WQAQWAAAGLPLTPAEMEDWLAERATAREQVRAYFKARAELDALLDRRARLRAALRAALK 650
Query: 456 NADRIQSVLAMGQNLIDKRQCVGSEEAVQARLASIADQWEFLTQKTTEKSLKLKEANKQR 515
+ + + L RQ + E AR AS+ ++ + E++L+ E
Sbjct: 651 AVAIVLPGEELAELLELARQLLEEAEKQAARKASLDER-----LRDAERALEEAEERHDE 705
Query: 516 TYIAAVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKKHQLVEADIQAHDDRIKDMNGQA 575
A + W + + A + ++ Q ++ +QA DD
Sbjct: 706 AQSALEAWQEQWYDALLEAGLGGRASPAGA--LDALELLQNIKEKLQAADD--------- 754
Query: 576 DSLIDSGQFDASSIQEKRQSINERYERIKNLAAHRQARLNEANTLHQFFRDIADEESWIK 635
++++ ++ R + ++ +
Sbjct: 755 -------------LRQRIAAMERDLARFEEEVEA---------LAEAVAPEMLGTPADET 792
Query: 636 EKKLLVGSDDYGRDLTGVQNLKKKHKRLEAELASHQPAIQNVQETGEKLMDVSNLGVPE 694
+ L + L ++ + E E++ A+ + L+ + E
Sbjct: 793 ARALKQRLKRARDTAAAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIE 851
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 31.2 bits (71), Expect = 4.4
Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 2/80 (2%)
Query: 984 REAHAQFQASLSSAQADFEALAALDQQIKSFNVGPNPYTWFTMEALEDTWRNLQKIIKER 1043
RE L + E LA +D++I + E L + L + E
Sbjct: 387 RELQDAKSQLLKELRELEEELAEVDKKIST-IPSEEQIAQLL-EELGEAQNELFRSEAEI 444
Query: 1044 DIELAKEATRQDENDALRKE 1063
+ L + T ++ +ALRK
Sbjct: 445 EELLRQLETLKEAIEALRKT 464
>gnl|CDD|217680 pfam03702, UPF0075, Uncharacterized protein family (UPF0075). The
proteins is this family are about 370 amino acids long
and have no known function.
Length = 365
Score = 30.8 bits (70), Expect = 5.0
Identities = 32/128 (25%), Positives = 51/128 (39%), Gaps = 15/128 (11%)
Query: 536 TSEDSGKDLASVQNLIKKHQLVEADIQAHDDRIKD-----MNGQADSLIDSGQFD----- 585
TS D G D A V K +L+ + ++ G A +L G+ D
Sbjct: 10 TSLD-GVDAALVDIGDAKIELLASHSSPMPASLRQKLLDLCQGGATTLSRLGELDHQLGL 68
Query: 586 --ASSIQEKRQSINERYERIKNLAAHRQARLNEANTLHQFFRDIADEESWIKEKK-LLVG 642
A ++ E Q N + +I+ + H Q +E N F I D + I E+ + V
Sbjct: 69 LFADAVNELLQKQNLKPSQIRAIGCHGQTVRHEPNGRFPFTMQIGDP-NLIAERTGITVV 127
Query: 643 SDDYGRDL 650
+D RD+
Sbjct: 128 ADFRRRDV 135
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 31.1 bits (71), Expect = 5.1
Identities = 69/390 (17%), Positives = 133/390 (34%), Gaps = 60/390 (15%)
Query: 332 DNVEALIKKHEDFDKAINAHEEKIGALQTLADQLIAADHYAAKPIDDK--RKQVLDRWRL 389
+N+E + E+ K + EEKI L+ ++ A DK +K
Sbjct: 89 ENIEIE-AQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSN 147
Query: 390 LKEALIEKRSRLGESQTLQQFSRDADEMENWIAEKLQLATEESYKDPANIQSKHQKHQAF 449
L EAL G + + E+++ I L + E K K
Sbjct: 148 LSEAL------KGLNYKKNFKEKLLKELKSVILNASSLLSLEELKAKIKTLFSSNKP--- 198
Query: 450 EAELAANADRIQSVLAMGQNLIDKRQCVGSE----EAVQARLASIA------------DQ 493
ELA + + Q I ++ +GS + L + D
Sbjct: 199 --ELALLTLSVIDFDEIEQAEILEKSIIGSSDVPISELINNLGNSDWVKEGLEYHEEGDT 256
Query: 494 WEFLTQKTTEKSLKLKEANKQRTYIAAVKDLDFWLGEVESLLTSEDSGKDLASVQNLIKK 553
F Q TE+ EA+ Y ++ L+ + + ES + L S+ + K+
Sbjct: 257 CPFCQQTITEERKAELEAHFDEEYQELIEQLEELIDKYESHIEKALEE--LESILDTEKE 314
Query: 554 HQLVEADIQAHDDRIKDMNGQADSLID---------SGQFDASSIQEKRQSINERY---- 600
+ + D++ ++ + + + S + SI + +SIN+
Sbjct: 315 NSEFKLDVEELKALLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAIN 374
Query: 601 -------ERIKNLAAHRQA-----RLNEANTLHQFFRDIADEESWIKEKKLLVGSDDYGR 648
E+I NL + L+ L + E+ ++ K + + +
Sbjct: 375 ELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLE--KAINSLEKEIK 432
Query: 649 DLTG-VQNLKKKHKRLEAELASHQPAIQNV 677
L ++ L+K+ K LE +L + +P +
Sbjct: 433 QLEAEIKALEKEIKELEKQLTNIEPTADEI 462
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 30.8 bits (70), Expect = 5.6
Identities = 49/295 (16%), Positives = 114/295 (38%), Gaps = 35/295 (11%)
Query: 430 EESYKDPANIQSKHQKHQAFEAELAANADRIQSVLAMGQNLIDKRQCVGSEEAVQARLAS 489
EE + I+ + ++ + E +L ++ VL LI ++ + ++ +L
Sbjct: 455 EEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKK 514
Query: 490 IADQWEFLTQKTTEKSLKLKEANKQRTYIAAVKDLDFWLGEVESLLTSEDSGKDLASVQN 549
E L +K E ++ K + GE++SL K+L ++
Sbjct: 515 Y--NLEELEKKAEEYEKLKEKLIKLK-------------GEIKSLK------KELEKLEE 553
Query: 550 LIKKHQLVEADIQAHDDRIKDMNGQADSLIDSGQFDASSIQEKRQSINERYERIKNLAAH 609
L KK +E + ++ + ++ + + L + ++ + Y +K+
Sbjct: 554 LKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKE 613
Query: 610 RQARLNE----ANTLHQFFRDIADEES-------WIKEKKLLVGSDDYGRDLTGVQNLKK 658
+ E L + F ++A+ E ++E + ++Y L +
Sbjct: 614 LEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSR 673
Query: 659 KHKRLEAELASHQPAIQNVQETGEKLMDVSNLG-VPEIEQRLKLLNQAWSELKQL 712
+ L AEL + + +++T EKL + L + ++ L+ L +A +++L
Sbjct: 674 ELAGLRAELEELEKRREEIKKTLEKLKE--ELEEREKAKKELEKLEKALERVEEL 726
>gnl|CDD|218731 pfam05752, Calici_MSP, Calicivirus minor structural protein. This
family consists of minor structural proteins largely
from human calicivirus isolates. Human calicivirus
causes gastroenteritis. The function of this family is
unknown.
Length = 167
Score = 29.7 bits (67), Expect = 5.8
Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 17/82 (20%)
Query: 405 QTLQQFSRDADEMENWIAEKLQLATEESYKDPANIQSKHQKHQAFEAELAAN--ADRIQS 462
+ L Q ++ ++W+ ++ L ++ Q +L+ N A R+QS
Sbjct: 29 RQLAQLAKQNQLQQDWMNKQEALQ---------------RRGQDLSRDLSVNGPALRVQS 73
Query: 463 VLAMGQNLIDKRQCVGSEEAVQ 484
+ G + + R+ GS E V
Sbjct: 74 AVDAGFDPVSARRLAGSGERVI 95
>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
family. This model represents a relatively
well-conserved region near the C-terminus of the tape
measure protein of a lambda and related phage. This
protein, which controls phage tail length, is typically
about 1000 residues in length. Both low-complexity
sequence and insertion/deletion events appear common in
this family. Mutational studies suggest a ruler or
template role in the determination of phage tail length.
Similar behavior is attributed to proteins from
distantly related or unrelated families in other phage
[Mobile and extrachromosomal element functions, Prophage
functions].
Length = 332
Score = 30.6 bits (69), Expect = 5.8
Identities = 24/137 (17%), Positives = 41/137 (29%), Gaps = 14/137 (10%)
Query: 71 SLQQLTAERATQLGSAHEVQRFHRDVDETKDWIQEK--DEALNNNDLGKDLRSVQALQRK 128
L Q A+R + + + + DE I+ + E L + +A
Sbjct: 5 LLTQQIADRKLKKLNTADEKSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEATASI 64
Query: 129 HEGLERDLAALGDKIRQLDETANRLMQTHPETAEQTYAKQKEIN-----------EEWTQ 177
+R L G +Q + RL Q +Q K + E+
Sbjct: 65 RAQNKRQLDRFGLGDKQRERLDARL-QIDRTFRKQQRDLNKAMTAKGLAGSDLYKEQLAA 123
Query: 178 LTAKANTRKEKLLDSYD 194
+ A N +L Y
Sbjct: 124 IKAALNEALAELHAYYA 140
>gnl|CDD|132821 cd07288, PX_SNX15, The phosphoinositide binding Phox Homology
domain of Sorting Nexin 15. The PX domain is a
phosphoinositide (PI) binding module present in many
proteins with diverse functions. Sorting nexins (SNXs)
make up the largest group among PX domain containing
proteins. They are involved in regulating membrane
traffic and protein sorting in the endosomal system. The
PX domain of SNXs binds PIs and targets the protein to
PI-enriched membranes. SNXs differ from each other in
PI-binding specificity and affinity, and the presence of
other protein-protein interaction domains, which help
determine subcellular localization and specific function
in the endocytic pathway. SNX15 contains an N-terminal
PX domain and a C-terminal Microtubule Interacting and
Trafficking (MIT) domain. It plays a role in protein
trafficking processes in the endocytic pathway and the
trans-Golgi network. The PX domain of SNX15 interacts
with the PDGF receptor and is responsible for the
membrane association of the protein.
Length = 118
Score = 29.2 bits (65), Expect = 6.2
Identities = 14/45 (31%), Positives = 23/45 (51%), Gaps = 5/45 (11%)
Query: 582 GQFDASSIQEKRQSINERYERIKNLAAHRQARLNEANTLHQFFRD 626
G+F+A+ I+E+R + N+ A L + L +FFRD
Sbjct: 78 GRFEAAVIEERRNAAEAMLLFTVNIPA-----LYNSPQLKEFFRD 117
>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 835
Score = 30.8 bits (70), Expect = 6.7
Identities = 27/131 (20%), Positives = 44/131 (33%), Gaps = 11/131 (8%)
Query: 59 QTQLQDLNQKWTSLQQLTAERATQLGSAHEVQRFHRDVDETKDWIQEKDEALNN-----N 113
+ LQ L + S +Q AE A + + + + +ALN N
Sbjct: 28 EAALQLLQEAVNSKRQEEAEPAAEEAELQA-----ELIQQELAINDQLSQALNQQTERLN 82
Query: 114 DLGKDLRSVQALQRKHEGLERDLAALGDKIR-QLDETANRLMQTHPETAEQTYAKQKEIN 172
L D R + L + R + +R L + L Q P +Q E+
Sbjct: 83 ALASDDRQLANLLLQLLQSSRTIREQIAVLRGSLLLSRILLQQLGPLPEAGQPQEQFEVT 142
Query: 173 EEWTQLTAKAN 183
+E L A+
Sbjct: 143 QERDALQAEKA 153
>gnl|CDD|182791 PRK10865, PRK10865, protein disaggregation chaperone; Provisional.
Length = 857
Score = 30.6 bits (69), Expect = 8.0
Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 39/162 (24%)
Query: 574 QADSLIDSGQFDASSIQ----EKRQSINERYERIKNLAAHRQA-----------RLNEAN 618
+A LID ASSI+ K + ++ RI L +QA RL+ N
Sbjct: 389 KAIDLIDEA---ASSIRMQIDSKPEELDRLDRRIIQLKLEQQALMKESDEASKKRLDMLN 445
Query: 619 T-LHQFFRDIAD-EESWIKEKKLLVGSDDYGRDLTGVQNLKKKHKRLEAELASHQPAIQN 676
L R ++ EE W EK L+G Q +K AEL + AI+
Sbjct: 446 EELSDKERQYSELEEEWKAEKA----------SLSGTQTIK-------AELEQAKIAIEQ 488
Query: 677 VQETGE--KLMDVSNLGVPEIEQRLKLLNQAWSELKQLAANR 716
+ G+ ++ ++ +PE+E++L Q + +L N+
Sbjct: 489 ARRVGDLARMSELQYGKIPELEKQLAAATQLEGKTMRLLRNK 530
>gnl|CDD|153078 cd07931, eukaryotic_phosphagen_kinases, Phosphagen (guanidino)
kinases mostly found in eukaryotes. Phosphagen
(guanidino) kinases are enzymes that transphosphorylate
a high energy phosphoguanidino compound, like
phosphocreatine (PCr) in the case of creatine kinase
(CK) or phosphoarginine in the case of arginine kinase,
which is used as an energy-storage and -transport
metabolite, to ADP, thereby creating ATP. The substrate
binding site is located in the cleft between the N and
C-terminal domains, but most of the catalytic residues
are found in the larger C-terminal domain. In higher
eukaryotes, CK exists in tissue-specific (muscle,
brain), as well as compartment-specific (mitochondrial
and cytosolic) isoforms. They are either coupled to
glycolysis (cytosolic form) or oxidative phosphorylation
(mitochondrial form). Besides CK and AK, the most
studied members of this family are also other phosphagen
kinases with different substrate specificities, like
glycocyamine kinase (GK), lombricine kinase (LK),
taurocyamine kinase (TK) and hypotaurocyamine kinase
(HTK).
Length = 338
Score = 29.9 bits (68), Expect = 8.3
Identities = 13/45 (28%), Positives = 17/45 (37%), Gaps = 2/45 (4%)
Query: 10 EDLEQVE-VMQKKFDDFQSDLKANEVRLAEM-NEIAMQLMSLGQT 52
E Q+E +M + DLK L EM E QL+
Sbjct: 124 EQRRQIERLMVSALSSLEGDLKGTYYSLTEMTEEQQQQLIDDHFL 168
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 30.3 bits (69), Expect = 8.5
Identities = 57/253 (22%), Positives = 100/253 (39%), Gaps = 45/253 (17%)
Query: 19 QKKFDDFQSDLKANEVRLAEMNEIA-----MQLMSLGQTEAA----LKIQTQLQDLNQKW 69
Q+K + +Q+DL+ E RL E NE+ Q + + EAA ++++QL D Q
Sbjct: 347 QEKIERYQADLEELEERLEEQNEVVEEADEQQEENEARAEAAEEEVDELKSQLADYQQAL 406
Query: 70 TSLQQLTAERATQLGSAHEVQRFHR----------DVDETKDWI-----QEKDEALNNND 114
Q RA Q A VQ R D +DW+ +E++
Sbjct: 407 DVQQT----RAIQYQQA--VQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEELLS 460
Query: 115 LGKDLRSVQALQRKHEGLERDLAALGDKIRQLD--ETANRLMQTHPETAEQTYAKQKEIN 172
L + L QA + E + + + ++ + + + A L++ E Q+ +
Sbjct: 461 LEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLRE--------QRHLA 512
Query: 173 EEWTQLTAKANTRKEKLLDSYDLQRFLSDYRDLMSWINSMMGLVSSDELANDVTGAEALL 232
E+ QL + + +++L +R L+++ + L DEL EA L
Sbjct: 513 EQLQQLRMRLSELEQRLRQQQRAERLLAEFCKRLG-----KNLDDEDELEQLQEELEARL 567
Query: 233 ERHQEHRTEIDAR 245
E E +E R
Sbjct: 568 ESLSESVSEARER 580
>gnl|CDD|181087 PRK07701, flgL, flagellar hook-associated protein FlgL; Validated.
Length = 298
Score = 30.0 bits (68), Expect = 8.6
Identities = 16/63 (25%), Positives = 33/63 (52%), Gaps = 3/63 (4%)
Query: 88 EVQRFHRDVDETKDWIQEKDEALNN-NDLGKDLRSVQALQRKHEGL-ERDLAALGDKIRQ 145
EV+++ ++ + K W++ + AL+ D+ + R A+Q + + D A+ +I Q
Sbjct: 58 EVEQYQKNASDAKSWLENTESALDQATDILQRARE-LAVQAANGTNTQTDRQAIAQEIEQ 116
Query: 146 LDE 148
L E
Sbjct: 117 LKE 119
>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 557
Score = 30.3 bits (69), Expect = 9.3
Identities = 21/86 (24%), Positives = 36/86 (41%), Gaps = 9/86 (10%)
Query: 5 VQDVGEDLEQVEVMQKKFDDFQSDLKANEVRLAEMNEIAMQLMSL-----GQTEAAL--- 56
+ + E L ++E ++ + +L+ + RL E+ E L SL E L
Sbjct: 268 AELLEEALYELEEASEELRAYLDELEFDPNRLEEVEERLFALKSLARKYGVTIEDLLEYL 327
Query: 57 -KIQTQLQDLNQKWTSLQQLTAERAT 81
KI+ +L L+ SL+ L E
Sbjct: 328 DKIKEELAQLDNSEESLEALEKEVKK 353
>gnl|CDD|184940 PRK14977, PRK14977, bifunctional DNA-directed RNA polymerase A'/A''
subunit; Provisional.
Length = 1321
Score = 30.4 bits (68), Expect = 9.8
Identities = 26/85 (30%), Positives = 41/85 (48%), Gaps = 7/85 (8%)
Query: 1035 NLQKIIKERDIELAKEATRQDENDALRKEFAKHANAFHQWLTETRTSMMEGTGSLEQQLE 1094
NL++II+++ IE + +DE + L KE+ K NA L + + G E +LE
Sbjct: 904 NLERIIEKQKIEDRGKGASKDEIEELAKEYTKTFNAN---LPKLLADAIHGAELKEDELE 960
Query: 1095 AIKRKAAEVRSRRSDLKKIEDLGAI 1119
AI + E + K+E AI
Sbjct: 961 AICAEGKEGFEK----AKVEPGQAI 981
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.128 0.358
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 67,064,171
Number of extensions: 6759562
Number of successful extensions: 7311
Number of sequences better than 10.0: 1
Number of HSP's gapped: 7062
Number of HSP's successfully gapped: 362
Length of query: 1321
Length of database: 10,937,602
Length adjustment: 108
Effective length of query: 1213
Effective length of database: 6,147,370
Effective search space: 7456759810
Effective search space used: 7456759810
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 65 (28.7 bits)