RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10604
(225 letters)
>gnl|CDD|238125 cd00204, ANK, ankyrin repeats; ankyrin repeats mediate
protein-protein interactions in very diverse families of
proteins. The number of ANK repeats in a protein can
range from 2 to over 20 (ankyrins, for example). ANK
repeats may occur in combinations with other types of
domains. The structural repeat unit contains two
antiparallel helices and a beta-hairpin, repeats are
stacked in a superhelical arrangement; this alignment
contains 4 consecutive repeats.
Length = 126
Score = 83.2 bits (206), Expect = 2e-20
Identities = 47/126 (37%), Positives = 64/126 (50%), Gaps = 19/126 (15%)
Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDI 128
T AA GHL+ ++ LLE G D+N + T AA+ GHL+ ++ LLE G D+
Sbjct: 8 RTPLHLAASNGHLEVVKLLLENGADVNAKDNDG-RTPLHLAAKNGHLEIVKLLLEKGADV 66
Query: 129 NTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNT 188
N + +G LHLAA++G+ VVK LLK G V+A K G T
Sbjct: 67 NARD------------------KDGNTPLHLAARNGNLDVVKLLLKHGADVNARDKDGRT 108
Query: 189 ALHIAA 194
LH+AA
Sbjct: 109 PLHLAA 114
Score = 55.5 bits (134), Expect = 4e-10
Identities = 22/44 (50%), Positives = 27/44 (61%)
Query: 151 NNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAA 194
+G LHLAA +GH VVK LL+ G V+A G T LH+AA
Sbjct: 5 EDGRTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAA 48
>gnl|CDD|205076 pfam12796, Ank_2, Ankyrin repeats (3 copies).
Length = 91
Score = 65.7 bits (161), Expect = 3e-14
Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 21/110 (19%)
Query: 72 FLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTS 131
AA+ G+L+ ++ LLE G D+N + AAR G+L+ ++ LLE G D+N
Sbjct: 1 LHLAAKNGNLELVKLLLEKGADVNLGDTDTA---LHLAARNGNLEIVKLLLEHGADVNAK 57
Query: 132 NATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDA 181
+ +G ALHLAA++G+ +VK LL+ G ++
Sbjct: 58 D------------------KDGNTALHLAARNGNLEIVKLLLEHGADINL 89
Score = 63.4 bits (155), Expect = 2e-13
Identities = 33/89 (37%), Positives = 48/89 (53%), Gaps = 20/89 (22%)
Query: 106 FLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGH 165
AA+ G+L+ ++ LLE G D+N + + ALHLAA++G+
Sbjct: 1 LHLAAKNGNLELVKLLLEKGADVNLGDTDT--------------------ALHLAARNGN 40
Query: 166 TTVVKALLKGGCKVDASTKKGNTALHIAA 194
+VK LL+ G V+A K GNTALH+AA
Sbjct: 41 LEIVKLLLEHGADVNAKDKDGNTALHLAA 69
Score = 48.4 bits (116), Expect = 6e-08
Identities = 25/58 (43%), Positives = 37/58 (63%), Gaps = 1/58 (1%)
Query: 75 AARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSN 132
AAR G+L+ ++ LLE G D+N + NT+ AAR G+L+ ++ LLE G DIN +
Sbjct: 35 AARNGNLEIVKLLLEHGADVNAKDKDG-NTALHLAARNGNLEIVKLLLEHGADINLKD 91
Score = 29.1 bits (66), Expect = 0.60
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINTSN 98
NT+ AAR G+L+ ++ LLE G DIN +
Sbjct: 62 NTALHLAARNGNLEIVKLLLEHGADINLKD 91
>gnl|CDD|215625 PLN03192, PLN03192, Voltage-dependent potassium channel;
Provisional.
Length = 823
Score = 51.4 bits (123), Expect = 1e-07
Identities = 46/165 (27%), Positives = 71/165 (43%), Gaps = 17/165 (10%)
Query: 67 DPNTSF--LRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEA 124
DPN + L A G+ L ELL+A +D + ++ T AA G+ D + LL+
Sbjct: 522 DPNMASNLLTVASTGNAALLEELLKAKLDPDIGDSKG-RTPLHIAASKGYEDCVLVLLKH 580
Query: 125 GVDINTS---------NATSKKLDNQLNI--HQKNIKN--NGLNALHLAAKDGHTTVVKA 171
+++ NA S K I H +I + + L AAK T +K
Sbjct: 581 ACNVHIRDANGNTALWNAISAKHHKIFRILYHFASISDPHAAGDLLCTAAKRNDLTAMKE 640
Query: 172 LLKGGCKVDASTKKGNTALHIAALDSLVTDIASSAITDATSLPCS 216
LLK G VD+ +G TAL +A + V D+ I + + +
Sbjct: 641 LLKQGLNVDSEDHQGATALQVAMAEDHV-DMVRLLIMNGADVDKA 684
Score = 46.4 bits (110), Expect = 5e-06
Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)
Query: 95 NTSNASDPNTSF--LRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIKNN 152
N DPN + L A G+ L ELL+A +D + ++
Sbjct: 516 NGGEHDDPNMASNLLTVASTGNAALLEELLKAKLDPDIGDSK------------------ 557
Query: 153 GLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTAL 190
G LH+AA G+ V LLK C V GNTAL
Sbjct: 558 GRTPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTAL 595
>gnl|CDD|222277 pfam13637, Ank_4, Ankyrin repeats (many copies).
Length = 54
Score = 45.7 bits (109), Expect = 3e-07
Identities = 21/43 (48%), Positives = 27/43 (62%)
Query: 153 GLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAAL 195
G ALH AA G +VK LL+ G ++ + + GNTALHIAA
Sbjct: 1 GRTALHKAAISGRLELVKYLLEKGVDINRTDEDGNTALHIAAE 43
Score = 43.4 bits (103), Expect = 2e-06
Identities = 22/54 (40%), Positives = 34/54 (62%), Gaps = 1/54 (1%)
Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELL 122
T+ +AA +G L+ ++ LLE GVDIN ++ NT+ AA G+L+ L+ LL
Sbjct: 2 RTALHKAAISGRLELVKYLLEKGVDINRTDEDG-NTALHIAAENGNLEVLKLLL 54
Score = 42.6 bits (101), Expect = 4e-06
Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 18/71 (25%)
Query: 103 NTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAK 162
T+ +AA +G L+ ++ LLE GVDIN ++ +G ALH+AA+
Sbjct: 2 RTALHKAAISGRLELVKYLLEKGVDINRTD------------------EDGNTALHIAAE 43
Query: 163 DGHTTVVKALL 173
+G+ V+K LL
Sbjct: 44 NGNLEVLKLLL 54
>gnl|CDD|223738 COG0666, Arp, FOG: Ankyrin repeat [General function prediction
only].
Length = 235
Score = 48.3 bits (114), Expect = 6e-07
Identities = 38/140 (27%), Positives = 60/140 (42%), Gaps = 21/140 (15%)
Query: 66 SDPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHL-----DKLRE 120
D AA G ++ LL +G D+N +A D +T AA G+ + +
Sbjct: 71 LDGRLPLHSAASKGDDKIVKLLLASGADVNAKDA-DGDTPLHLAALNGNPPEGNIEVAKL 129
Query: 121 LLEAGVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVD 180
LLEAG D++ +N + +G LH AA +G +V+ LL+ G +
Sbjct: 130 LLEAGADLDVNNLRDE---------------DGNTPLHWAALNGDADIVELLLEAGADPN 174
Query: 181 ASTKKGNTALHIAALDSLVT 200
+ G TAL AA + +
Sbjct: 175 SRNSYGVTALDPAAKNGRIE 194
Score = 37.5 bits (86), Expect = 0.002
Identities = 35/101 (34%), Positives = 48/101 (47%), Gaps = 20/101 (19%)
Query: 78 AGHLDKLRELLEAGVDINTSNASDP--NTSFLRAARAGHLDKLRELLEAGVDINTSNATS 135
G+++ + LLEAG D++ +N D NT AA G D + LLEAG D N+ N
Sbjct: 121 EGNIEVAKLLLEAGADLDVNNLRDEDGNTPLHWAALNGDADIVELLLEAGADPNSRN--- 177
Query: 136 KKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGG 176
+ G+ AL AAK+G +VK LL G
Sbjct: 178 ---------------SYGVTALDPAAKNGRIELVKLLLDKG 203
Score = 32.9 bits (74), Expect = 0.11
Identities = 22/62 (35%), Positives = 30/62 (48%)
Query: 134 TSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIA 193
+ L ++ H +G LH AA G +VK LL G V+A G+T LH+A
Sbjct: 54 SELLLKLIVDRHLAARDLDGRLPLHSAASKGDDKIVKLLLASGADVNAKDADGDTPLHLA 113
Query: 194 AL 195
AL
Sbjct: 114 AL 115
>gnl|CDD|206028 pfam13857, Ank_5, Ankyrin repeats (many copies).
Length = 56
Score = 44.3 bits (105), Expect = 1e-06
Identities = 19/43 (44%), Positives = 24/43 (55%)
Query: 151 NNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIA 193
NG LHLAAK G +V+ LLK G ++ G TAL +A
Sbjct: 14 GNGNTPLHLAAKYGALELVQWLLKPGVDLNLRDSDGLTALDLA 56
Score = 36.9 bits (86), Expect = 4e-04
Identities = 22/74 (29%), Positives = 33/74 (44%), Gaps = 18/74 (24%)
Query: 87 LLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQ 146
LLE G + + NT AA+ G L+ ++ LL+ GVD+N +
Sbjct: 1 LLEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDLNLRD-------------- 46
Query: 147 KNIKNNGLNALHLA 160
++GL AL LA
Sbjct: 47 ----SDGLTALDLA 56
Score = 30.0 bits (68), Expect = 0.12
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINTSNAS 100
NT AA+ G L+ ++ LL+ GVD+N ++
Sbjct: 17 NTPLHLAAKYGALELVQWLLKPGVDLNLRDSD 48
>gnl|CDD|222984 PHA03100, PHA03100, ankyrin repeat protein; Provisional.
Length = 422
Score = 45.0 bits (107), Expect = 1e-05
Identities = 35/119 (29%), Positives = 53/119 (44%), Gaps = 5/119 (4%)
Query: 79 GHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDK--LRELLEAGVDINTSNATSK 136
+ LL+ G ++N N SD + +D L+ L++ GVDIN N +
Sbjct: 119 NSYSIVEYLLDNGANVNIKN-SDGENLLHLYLESNKIDLKILKLLIDKGVDINAKNRVNY 177
Query: 137 KLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAAL 195
L + I+ K++ LH A + + VK LL G + K G+T LHIA L
Sbjct: 178 LLSYGVPINIKDVYGF--TPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAIL 234
Score = 37.3 bits (87), Expect = 0.004
Identities = 29/123 (23%), Positives = 48/123 (39%), Gaps = 22/123 (17%)
Query: 93 DINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIK-- 150
D+N + P A A ++D ++ LL+ G DIN+S + + L+ + N+
Sbjct: 26 DLNDYSYKKPVLPLYLAKEARNIDVVKILLDNGADINSSTKNNSTPLHYLSNIKYNLTDV 85
Query: 151 ------------------NNGLNALHLAA--KDGHTTVVKALLKGGCKVDASTKKGNTAL 190
NNG+ L A K ++V+ LL G V+ G L
Sbjct: 86 KEIVKLLLEYGANVNAPDNNGITPLLYAISKKSNSYSIVEYLLDNGANVNIKNSDGENLL 145
Query: 191 HIA 193
H+
Sbjct: 146 HLY 148
Score = 33.5 bits (77), Expect = 0.070
Identities = 21/95 (22%), Positives = 35/95 (36%), Gaps = 19/95 (20%)
Query: 82 DKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQ 141
+++ LL GV IN + T A + + ++ LL+ G + N N
Sbjct: 173 NRVNYLLSYGVPINIKDVYG-FTPLHYAVYNNNPEFVKYLLDLGANPNLVN--------- 222
Query: 142 LNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGG 176
G LH+A + + + K LL G
Sbjct: 223 ---------KYGDTPLHIAILNNNKEIFKLLLNNG 248
Score = 30.4 bits (69), Expect = 0.78
Identities = 19/81 (23%), Positives = 29/81 (35%), Gaps = 2/81 (2%)
Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDI 128
T A + + ++ LL+ G + N N +T A + + + LL G I
Sbjct: 193 FTPLHYAVYNNNPEFVKYLLDLGANPNLVN-KYGDTPLHIAILNNNKEIFKLLLNNGPSI 251
Query: 129 NTSNATSKKL-DNQLNIHQKN 148
T T D LN K
Sbjct: 252 KTIIETLLYFKDKDLNTITKI 272
>gnl|CDD|222980 PHA03095, PHA03095, ankyrin-like protein; Provisional.
Length = 471
Score = 44.6 bits (106), Expect = 2e-05
Identities = 28/111 (25%), Positives = 40/111 (36%), Gaps = 21/111 (18%)
Query: 82 DKLRELLEAGVDINTSNASDPNTSFLRAARAGHLD--KLRELLEAGVDINTSNATSKKLD 139
+REL+ AG D ++ NT A + LL AG+ IN N
Sbjct: 203 RIVRELIRAGCDPAATDMLG-NTPLHSMATGSSCKRSLVLPLLIAGISINARN------- 254
Query: 140 NQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTAL 190
G LH AA + + L+ G ++A + GNT L
Sbjct: 255 -----------RYGQTPLHYAAVFNNPRACRRLIALGADINAVSSDGNTPL 294
Score = 43.5 bits (103), Expect = 4e-05
Identities = 33/129 (25%), Positives = 51/129 (39%), Gaps = 20/129 (15%)
Query: 66 SDPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAG 125
P +L + D +R LLEAG D+N L A LD ++ L++AG
Sbjct: 48 KTPLHLYLHYSSEKVKDIVRLLLEAGADVNAPERCGFTPLHLYLYNATTLDVIKLLIKAG 107
Query: 126 VDINTSNATSKKLDNQLNIHQKNIKNNGLNALH--LAAKDGHTTVVKALLKGGCKVDAST 183
D+N + G LH L+ + + V++ LL+ G V+A
Sbjct: 108 ADVNAKD------------------KVGRTPLHVYLSGFNINPKVIRLLLRKGADVNALD 149
Query: 184 KKGNTALHI 192
G T L +
Sbjct: 150 LYGMTPLAV 158
Score = 41.9 bits (99), Expect = 1e-04
Identities = 30/116 (25%), Positives = 52/116 (44%), Gaps = 19/116 (16%)
Query: 84 LRELLEAGVDINTSNASD--PNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQ 141
+R LL G D+N + P L++ A +++ LR L++AG D+ + + +
Sbjct: 135 IRLLLRKGADVNALDLYGMTPLAVLLKSRNA-NVELLRLLIDAGADVYAVDD---RFRSL 190
Query: 142 LNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAALDS 197
L+ H ++ K +V+ L++ GC A+ GNT LH A S
Sbjct: 191 LHHHLQSFKPR-------------ARIVRELIRAGCDPAATDMLGNTPLHSMATGS 233
Score = 26.9 bits (60), Expect = 9.4
Identities = 18/77 (23%), Positives = 32/77 (41%), Gaps = 15/77 (19%)
Query: 115 LDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLK 174
++++R LL AG D+N + L+ + + +V+ LL+
Sbjct: 27 VEEVRRLLAAGADVN---------------FRGEYGKTPLHLYLHYSSEKVKDIVRLLLE 71
Query: 175 GGCKVDASTKKGNTALH 191
G V+A + G T LH
Sbjct: 72 AGADVNAPERCGFTPLH 88
>gnl|CDD|165205 PHA02874, PHA02874, ankyrin repeat protein; Provisional.
Length = 434
Score = 44.2 bits (104), Expect = 2e-05
Identities = 36/129 (27%), Positives = 54/129 (41%), Gaps = 6/129 (4%)
Query: 70 TSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDIN 129
T + A R+G + ++ G DIN N P+ L A + G D ++ L++ GVD +
Sbjct: 37 TPLIDAIRSGDAKIVELFIKHGADINHINTKIPH-PLLTAIKIGAHDIIKLLIDNGVDTS 95
Query: 130 TSNATSKKLDNQLNIHQKNIKNNGLNA-----LHLAAKDGHTTVVKALLKGGCKVDASTK 184
+ D I I N +A LH A K G +K L + G V+
Sbjct: 96 ILPIPCIEKDMIKTILDCGIDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDD 155
Query: 185 KGNTALHIA 193
G +HIA
Sbjct: 156 NGCYPIHIA 164
Score = 41.9 bits (98), Expect = 1e-04
Identities = 34/134 (25%), Positives = 57/134 (42%), Gaps = 26/134 (19%)
Query: 82 DKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQ 141
D ++ +L+ G+D+N +A + T A + G L+ ++ L E G D+N + +
Sbjct: 105 DMIKTILDCGIDVNIKDA-ELKTFLHYAIKKGDLESIKMLFEYGADVNI-----EDDNGC 158
Query: 142 LNIH-------------------QKNIKNNGLNA-LHLAAKDGHTTVVKALLKGGCKVDA 181
IH N+K+N + LH AA+ G +K L+ G +
Sbjct: 159 YPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYACIKLLIDHGNHIMN 218
Query: 182 STKKGNTALHIAAL 195
K G T LH A +
Sbjct: 219 KCKNGFTPLHNAII 232
>gnl|CDD|200936 pfam00023, Ank, Ankyrin repeat. Ankyrins are multifunctional
adaptors that link specific proteins to the
membrane-associated, spectrin- actin cytoskeleton. This
repeat-domain is a 'membrane-binding' domain of up to 24
repeated units, and it mediates most of the protein's
binding activities. Repeats 13-24 are especially active,
with known sites of interaction for the Na/K ATPase,
Cl/HCO(3) anion exchanger, voltage-gated sodium channel,
clathrin heavy chain and L1 family cell adhesion
molecules. The ANK repeats are found to form a
contiguous spiral stack such that ion transporters like
the anion exchanger associate in a large central cavity
formed by the ANK repeat spiral, while clathrin and cell
adhesion molecules associate with specific regions
outside this cavity.
Length = 33
Score = 39.8 bits (94), Expect = 3e-05
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 152 NGLNALHLAAKDGHTTVVKALLKGGCKVDASTK 184
+G LHLAA++GH VVK LL+ G V+A K
Sbjct: 1 DGNTPLHLAARNGHLEVVKLLLEAGADVNARDK 33
Score = 28.3 bits (64), Expect = 0.28
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINTSN 98
NT AAR GHL+ ++ LLEAG D+N +
Sbjct: 3 NTPLHLAARNGHLEVVKLLLEAGADVNARD 32
Score = 28.3 bits (64), Expect = 0.28
Identities = 15/30 (50%), Positives = 20/30 (66%)
Query: 103 NTSFLRAARAGHLDKLRELLEAGVDINTSN 132
NT AAR GHL+ ++ LLEAG D+N +
Sbjct: 3 NTPLHLAARNGHLEVVKLLLEAGADVNARD 32
>gnl|CDD|165207 PHA02876, PHA02876, ankyrin repeat protein; Provisional.
Length = 682
Score = 40.4 bits (94), Expect = 5e-04
Identities = 37/142 (26%), Positives = 61/142 (42%), Gaps = 17/142 (11%)
Query: 69 NTSFLRAARAGHLDKL-RELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVD 127
NT A++A L +L +LLE G D+N N +L A + +R L+ G D
Sbjct: 274 NTPLHHASQAPSLSRLVPKLLERGADVNAKNIKGETPLYLMAKNGYDTENIRTLIMLGAD 333
Query: 128 INTSNATSKKLDNQLNIHQKN------IKNNGLNA----------LHLAAKDGHTTVVKA 171
+N ++ +Q + +N + G N +H AA + ++
Sbjct: 334 VNAADRLYITPLHQASTLDRNKDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINT 393
Query: 172 LLKGGCKVDASTKKGNTALHIA 193
LL G ++A ++K TALH A
Sbjct: 394 LLDYGADIEALSQKIGTALHFA 415
Score = 39.3 bits (91), Expect = 0.001
Identities = 40/126 (31%), Positives = 61/126 (48%), Gaps = 21/126 (16%)
Query: 71 SFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKL-RELLEAGVDIN 129
S L+A R L+ L +AG +N+ + NT A++A L +L +LLE G D+N
Sbjct: 243 SLLKAIRNEDLETSLLLYDAGFSVNSIDDCK-NTPLHHASQAPSLSRLVPKLLERGADVN 301
Query: 130 TSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTV-VKALLKGGCKVDASTKKGNT 188
KNIK G L+L AK+G+ T ++ L+ G V+A+ + T
Sbjct: 302 A----------------KNIK--GETPLYLMAKNGYDTENIRTLIMLGADVNAADRLYIT 343
Query: 189 ALHIAA 194
LH A+
Sbjct: 344 PLHQAS 349
Score = 27.3 bits (60), Expect = 8.8
Identities = 31/132 (23%), Positives = 62/132 (46%), Gaps = 13/132 (9%)
Query: 75 AARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNAT 134
AA G+ + LL G D+N A D + A + ++D ++ +++ +IN ++ +
Sbjct: 185 AAERGNAKMVNLLLSYGADVNII-ALDDLSVLECAVDSKNIDTIKAIIDNRSNINKNDLS 243
Query: 135 ------SKKLDNQLNIHQKNIKNNGLN-----ALHLAAKDGH-TTVVKALLKGGCKVDAS 182
++ L+ L ++ N ++ LH A++ + +V LL+ G V+A
Sbjct: 244 LLKAIRNEDLETSLLLYDAGFSVNSIDDCKNTPLHHASQAPSLSRLVPKLLERGADVNAK 303
Query: 183 TKKGNTALHIAA 194
KG T L++ A
Sbjct: 304 NIKGETPLYLMA 315
>gnl|CDD|222939 PHA02878, PHA02878, ankyrin repeat protein; Provisional.
Length = 477
Score = 39.9 bits (93), Expect = 6e-04
Identities = 31/110 (28%), Positives = 43/110 (39%), Gaps = 18/110 (16%)
Query: 84 LRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLN 143
+ LL G DIN + NT+ A LL G ++N + T+
Sbjct: 150 TKLLLSYGADINMKDRHKGNTALHYATENKDQRLTELLLSYGANVNIPDKTNN------- 202
Query: 144 IHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIA 193
+ LH A K + +V LL+ G DA K GNT LHI+
Sbjct: 203 -----------SPLHHAVKHYNKPIVHILLENGASTDARDKCGNTPLHIS 241
Score = 27.9 bits (62), Expect = 4.5
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 12/56 (21%)
Query: 82 DKLRELLEAGVDINTSNASDPNTSFLRAARAGHL-----DKLRELLEAGVDINTSN 132
D L+ LLE GVD+N S++ A H KL+ LLE G DIN+ N
Sbjct: 249 DILKLLLEHGVDVNAK-------SYILGLTALHSSIKSERKLKLLLEYGADINSLN 297
>gnl|CDD|197603 smart00248, ANK, ankyrin repeats. Ankyrin repeats are about 33
amino acids long and occur in at least four consecutive
copies. They are involved in protein-protein
interactions. The core of the repeat seems to be an
helix-loop-helix structure.
Length = 30
Score = 32.6 bits (75), Expect = 0.008
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 152 NGLNALHLAAKDGHTTVVKALLKGGCKVDA 181
+G LHLAA++G+ VVK LL G ++A
Sbjct: 1 DGRTPLHLAAENGNLEVVKLLLDKGADINA 30
Score = 27.6 bits (62), Expect = 0.49
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 69 NTSFLRAARAGHLDKLRELLEAGVDIN 95
T AA G+L+ ++ LL+ G DIN
Sbjct: 3 RTPLHLAAENGNLEVVKLLLDKGADIN 29
Score = 27.6 bits (62), Expect = 0.49
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 103 NTSFLRAARAGHLDKLRELLEAGVDIN 129
T AA G+L+ ++ LL+ G DIN
Sbjct: 3 RTPLHLAAENGNLEVVKLLLDKGADIN 29
>gnl|CDD|205784 pfam13606, Ank_3, Ankyrin repeat. Ankyrins are multifunctional
adaptors that link specific proteins to the
membrane-associated, spectrin- actin cytoskeleton. This
repeat-domain is a 'membrane-binding' domain of up to 24
repeated units, and it mediates most of the protein's
binding activities.
Length = 30
Score = 32.6 bits (75), Expect = 0.009
Identities = 13/30 (43%), Positives = 21/30 (70%)
Query: 152 NGLNALHLAAKDGHTTVVKALLKGGCKVDA 181
+G LHLAA++G+ +VK LL+ G ++A
Sbjct: 1 DGNTPLHLAARNGNLELVKLLLEHGADINA 30
Score = 31.5 bits (72), Expect = 0.022
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINT 96
NT AAR G+L+ ++ LLE G DIN
Sbjct: 3 NTPLHLAARNGNLELVKLLLEHGADINA 30
Score = 31.5 bits (72), Expect = 0.022
Identities = 14/28 (50%), Positives = 18/28 (64%)
Query: 103 NTSFLRAARAGHLDKLRELLEAGVDINT 130
NT AAR G+L+ ++ LLE G DIN
Sbjct: 3 NTPLHLAARNGNLELVKLLLEHGADINA 30
>gnl|CDD|165206 PHA02875, PHA02875, ankyrin repeat protein; Provisional.
Length = 413
Score = 36.1 bits (83), Expect = 0.010
Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 17/156 (10%)
Query: 65 YSDPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEA 124
Y D + A G + + ELL+ G + D T A LD ++ L+
Sbjct: 65 YPDIESELHDAVEEGDVKAVEELLDLGKFADDVFYKDGMTPLHLATILKKLDIMKLLIAR 124
Query: 125 GVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKD----------------GHTTV 168
G D + N T K L + +IK L H A D G +
Sbjct: 125 GADPDIPN-TDKFSPLHLAVMMGDIKGIELLIDHKACLDIEDCCGCTPLIIAMAKGDIAI 183
Query: 169 VKALLKGGCKVDASTKKGNTALHIAALDSLVTDIAS 204
K LL G +D K G A A+++ DI
Sbjct: 184 CKMLLDSGANIDYFGKNGCVAALCYAIENNKIDIVR 219
Score = 30.3 bits (68), Expect = 0.85
Identities = 18/78 (23%), Positives = 33/78 (42%), Gaps = 5/78 (6%)
Query: 52 HNAVISLEPCYDCYSDPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAAR 111
H A + +E C C T + A G + + LL++G +I+ + + A
Sbjct: 157 HKACLDIEDCCGC-----TPLIIAMAKGDIAICKMLLDSGANIDYFGKNGCVAALCYAIE 211
Query: 112 AGHLDKLRELLEAGVDIN 129
+D +R ++ G D N
Sbjct: 212 NNKIDIVRLFIKRGADCN 229
>gnl|CDD|140343 PTZ00322, PTZ00322,
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase;
Provisional.
Length = 664
Score = 35.6 bits (82), Expect = 0.017
Identities = 16/42 (38%), Positives = 22/42 (52%)
Query: 152 NGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIA 193
+G LH+A +GH VV+ LL+ G K G T L +A
Sbjct: 114 DGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELA 155
Score = 28.3 bits (63), Expect = 3.3
Identities = 15/43 (34%), Positives = 18/43 (41%), Gaps = 1/43 (2%)
Query: 158 HLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAALDSLVT 200
LAA G + LL GG + G T LHIA + V
Sbjct: 88 QLAAS-GDAVGARILLTGGADPNCRDYDGRTPLHIACANGHVQ 129
>gnl|CDD|222954 PHA02989, PHA02989, ankyrin repeat protein; Provisional.
Length = 494
Score = 34.3 bits (79), Expect = 0.038
Identities = 33/136 (24%), Positives = 55/136 (40%), Gaps = 24/136 (17%)
Query: 82 DKLRELLEAGVDINTSNA----SDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKK 137
D ++ LL GV++ + + N + ++ L++ GV+I T+N S+
Sbjct: 161 DVIKILLSFGVNLFEKTSLYGLTPMNIYLRNDIDVISIKVIKYLIKKGVNIETNNNGSES 220
Query: 138 -----LDNQLNIHQKNIK-----------NN----GLNALHLAAKDGHTTVVKALLKGGC 177
LDN + +K K N G N L ++AK + LLK G
Sbjct: 221 VLESFLDNNKILSKKEFKVLNFILKYIKINKKDKKGFNPLLISAKVDNYEAFNYLLKLGD 280
Query: 178 KVDASTKKGNTALHIA 193
+ +K G+T L A
Sbjct: 281 DIYNVSKDGDTVLTYA 296
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell
cycle progression and biogenesis of 60S ribosomal
subunits. The Schistosoma mansoni Mak16 has been shown
to target protein transport to the nucleolus.
Length = 97
Score = 32.5 bits (74), Expect = 0.040
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 4 YNVDLGAFGTMLEKKEKEEERKEEEEEEEKN 34
YN+ AF LE +E EE +EEEEEEE+
Sbjct: 44 YNISQSAFKKALEAEESEENDEEEEEEEEEE 74
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 33.8 bits (77), Expect = 0.078
Identities = 16/37 (43%), Positives = 25/37 (67%)
Query: 16 EKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEAKIH 52
E++E+EEE +EEEEEEE+N + + RQ+ I+
Sbjct: 872 EEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAIY 908
Score = 31.1 bits (70), Expect = 0.54
Identities = 12/18 (66%), Positives = 17/18 (94%)
Query: 16 EKKEKEEERKEEEEEEEK 33
E++E+EEE +EEEEEEE+
Sbjct: 869 EEEEEEEEEEEEEEEEEE 886
Score = 30.7 bits (69), Expect = 0.66
Identities = 12/18 (66%), Positives = 17/18 (94%)
Query: 16 EKKEKEEERKEEEEEEEK 33
E++E+EEE +EEEEEEE+
Sbjct: 871 EEEEEEEEEEEEEEEEEE 888
Score = 30.7 bits (69), Expect = 0.70
Identities = 12/18 (66%), Positives = 17/18 (94%)
Query: 16 EKKEKEEERKEEEEEEEK 33
E++E+EEE +EEEEEEE+
Sbjct: 870 EEEEEEEEEEEEEEEEEE 887
Score = 30.7 bits (69), Expect = 0.74
Identities = 12/18 (66%), Positives = 17/18 (94%)
Query: 16 EKKEKEEERKEEEEEEEK 33
E++E+EEE +EEEEEEE+
Sbjct: 864 EEEEEEEEEEEEEEEEEE 881
Score = 30.7 bits (69), Expect = 0.77
Identities = 12/18 (66%), Positives = 17/18 (94%)
Query: 16 EKKEKEEERKEEEEEEEK 33
E++E+EEE +EEEEEEE+
Sbjct: 867 EEEEEEEEEEEEEEEEEE 884
Score = 30.7 bits (69), Expect = 0.78
Identities = 12/18 (66%), Positives = 17/18 (94%)
Query: 16 EKKEKEEERKEEEEEEEK 33
E++E+EEE +EEEEEEE+
Sbjct: 866 EEEEEEEEEEEEEEEEEE 883
Score = 30.3 bits (68), Expect = 0.92
Identities = 12/18 (66%), Positives = 17/18 (94%)
Query: 16 EKKEKEEERKEEEEEEEK 33
E++E+EEE +EEEEEEE+
Sbjct: 868 EEEEEEEEEEEEEEEEEE 885
Score = 30.0 bits (67), Expect = 1.2
Identities = 12/18 (66%), Positives = 17/18 (94%)
Query: 16 EKKEKEEERKEEEEEEEK 33
E++E+EEE +EEEEEEE+
Sbjct: 865 EEEEEEEEEEEEEEEEEE 882
Score = 28.8 bits (64), Expect = 2.5
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 6 VDLGAFGTMLEKKEKEEERKEEEEEEEK 33
VD G + +E+EEE +EEEEEEE+
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEE 879
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 28.7 bits (64), Expect = 0.91
Identities = 11/19 (57%), Positives = 16/19 (84%)
Query: 17 KKEKEEERKEEEEEEEKNI 35
+ K+EE+KEEEEEEE ++
Sbjct: 88 AEAKKEEKKEEEEEEEDDL 106
>gnl|CDD|227032 COG4688, COG4688, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 665
Score = 30.0 bits (67), Expect = 1.1
Identities = 14/65 (21%), Positives = 23/65 (35%), Gaps = 2/65 (3%)
Query: 80 HLDKLRELLEAGVDIN-TSNASDPNTSFLRAARAGHLDKLRE-LLEAGVDINTSNATSKK 137
HL L L A ++ A+ P A LD +R+ L A + +
Sbjct: 596 HLTSLERKLVAKLEHAQARPATTPGRDNQIAHAQVRLDGVRKLLTTADGERPFPDGPDLS 655
Query: 138 LDNQL 142
+ + L
Sbjct: 656 MPSPL 660
>gnl|CDD|239647 cd03675, Nudix_Hydrolase_2, Contains a crystal structure of the
Nudix hydrolase from Nitrosomonas europaea, which has an
unknown function. In general, members of the Nudix
hydrolase superfamily catalyze the hydrolysis of
NUcleoside DIphosphates linked to other moieties, X.
Enzymes belonging to this superfamily require a divalent
cation, such as Mg2+ or Mn2+, for their activity. They
also contain a highly conserved 23-residue nudix motif
(GX5EX7REUXEEXGU, where U = I, L or V), which forms a
structural motif that functions as a metal binding and
catalytic site. Substrates of nudix hydrolases include
intact and oxidatively damaged nucleoside triphosphates,
dinucleoside polyphosphates, nucleotide-sugars and
dinucleotide enzymes. These substrates are metabolites
or cell signaling molecules that require regulation
during different stages of the cell cycle or during
periods of stress. In general, the role of the nudix
hydrolase is to sanitize the nucleotide pools and to
maintain cell viability, thereby serving as surveillance
& "house-cleaning" enzymes. Substrate specificity is
used to define families within the superfamily.
Differences in substrate specificity are determined by
the N-terminal extension or by residues in variable loop
regions. Mechanistically, substrate hydrolysis occurs by
a nucleophilic substitution reaction, with variation in
the numbers and roles of divalent cations required.
Length = 134
Score = 28.7 bits (65), Expect = 1.3
Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 17/57 (29%)
Query: 78 AGHLDK--------LRELLE---------AGVDINTSNASDPNTSFLRAARAGHLDK 117
AGHL+ +RE LE A + I A D +T++LR A A L +
Sbjct: 30 AGHLEPGESLIEAAVRETLEETGWHVEPTALLGIYQWTAPDSDTTYLRFAFAAELLE 86
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 28.0 bits (63), Expect = 1.5
Identities = 10/17 (58%), Positives = 15/17 (88%)
Query: 16 EKKEKEEERKEEEEEEE 32
+K+E+EEE ++EE EEE
Sbjct: 80 KKEEEEEEEEKEESEEE 96
Score = 27.6 bits (62), Expect = 2.1
Identities = 12/17 (70%), Positives = 15/17 (88%)
Query: 16 EKKEKEEERKEEEEEEE 32
EKKE+EEE +E+EE EE
Sbjct: 79 EKKEEEEEEEEKEESEE 95
Score = 27.2 bits (61), Expect = 2.8
Identities = 11/22 (50%), Positives = 18/22 (81%)
Query: 16 EKKEKEEERKEEEEEEEKNISA 37
E+K++EEE +EE+EE E+ +A
Sbjct: 78 EEKKEEEEEEEEKEESEEEAAA 99
Score = 26.1 bits (58), Expect = 6.9
Identities = 10/18 (55%), Positives = 14/18 (77%)
Query: 16 EKKEKEEERKEEEEEEEK 33
+EK+EE +EEEE+EE
Sbjct: 76 AAEEKKEEEEEEEEKEES 93
>gnl|CDD|181642 PRK09082, PRK09082, methionine aminotransferase; Validated.
Length = 386
Score = 29.5 bits (67), Expect = 1.6
Identities = 21/88 (23%), Positives = 35/88 (39%), Gaps = 22/88 (25%)
Query: 55 VISLEPCYDCYSDPNTSFLRAARAGHL-----------DKLRELLEAG---VDINTSNAS 100
VI +P YD Y+ P L RA + + + + +NT +
Sbjct: 118 VIVFDPSYDSYA-PAIE-LAGGRAVRVALQPPDFRVDWQRFAAAISPRTRLIILNTPH-- 173
Query: 101 DPNTSFLRAARAGHLDKLRELLEAGVDI 128
+P+ + AA + L +L+ AG DI
Sbjct: 174 NPSGTVWSAA---DMRALWQLI-AGTDI 197
>gnl|CDD|214750 smart00633, Glyco_10, Glycosyl hydrolase family 10.
Length = 263
Score = 28.7 bits (65), Expect = 2.0
Identities = 22/100 (22%), Positives = 37/100 (37%), Gaps = 20/100 (20%)
Query: 64 CYSDPNTSFLRAARAGHLDKLRELLEAGVDIN------TSNASDPNTSFLRAARAGHLDK 117
Y+D NT A R + +++L GV I+ + PN + +RAA
Sbjct: 126 FYNDYNTEEPNAKRQAIYELVKKLKAKGVPIDGIGLQSHLSLGGPNIAEIRAA------- 178
Query: 118 LRELLEAGV-------DINTSNATSKKLDNQLNIHQKNIK 150
L G+ DI+ + L Q +++ K
Sbjct: 179 LDRFASLGLEIWITELDISGPPNPEENLQAQAADYEEVFK 218
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 28.8 bits (65), Expect = 2.0
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 15 LEKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEAK 50
LE+ E+EEE K +EEE E+ + A + R + K
Sbjct: 119 LEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQK 154
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 28.8 bits (64), Expect = 2.1
Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 2/35 (5%)
Query: 16 EKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEAK 50
KK++EE RK EEEE++ A K R+E +
Sbjct: 219 LKKKREERRKVLEEEEQRRKQE--EADRKSREEEE 251
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 28.0 bits (63), Expect = 2.4
Identities = 11/23 (47%), Positives = 16/23 (69%)
Query: 14 MLEKKEKEEERKEEEEEEEKNIS 36
E+KE+E+E KE +EE+EK
Sbjct: 17 KREEKEREKEEKERKEEKEKEWG 39
Score = 26.5 bits (59), Expect = 7.1
Identities = 8/21 (38%), Positives = 17/21 (80%)
Query: 15 LEKKEKEEERKEEEEEEEKNI 35
+E+K +E+ER++EE+E ++
Sbjct: 14 IEEKREEKEREKEEKERKEEK 34
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 28.7 bits (65), Expect = 2.6
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 16 EKKEKEEERKEEEEEEE 32
E++E+EEE +EE EEE
Sbjct: 304 EEEEEEEEEEEEPSEEE 320
Score = 27.9 bits (63), Expect = 4.0
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 16 EKKEKEEERKEEEEEEE 32
E++E+EEE +EEE EE
Sbjct: 303 EEEEEEEEEEEEEPSEE 319
Score = 27.5 bits (62), Expect = 5.3
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 16 EKKEKEEERKEEEEEEEKNISA 37
E++E+EEE +EEEE E+ +A
Sbjct: 302 EEEEEEEEEEEEEEPSEEEAAA 323
Score = 27.1 bits (61), Expect = 7.4
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 16 EKKEKEEERKEEEEEEEK 33
+EEE +EEEEEEE+
Sbjct: 298 AAAAEEEEEEEEEEEEEE 315
Score = 26.8 bits (60), Expect = 9.1
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 16 EKKEKEEERKEEEEEEEK 33
+E+EEE +EEEEEE
Sbjct: 300 AAEEEEEEEEEEEEEEPS 317
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 28.9 bits (64), Expect = 2.6
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 17 KKEKEEERKEEEEEEEKNISAI 38
K K ++RK +EEEEEK + I
Sbjct: 526 NKSKNKKRKVDEEEEEKKLKMI 547
>gnl|CDD|222931 PHA02798, PHA02798, ankyrin-like protein; Provisional.
Length = 489
Score = 28.6 bits (64), Expect = 2.8
Identities = 36/138 (26%), Positives = 57/138 (41%), Gaps = 25/138 (18%)
Query: 81 LDKLRELLEAGVDINTSNASDPNTS----FLRAARAGHLDKLRELLEAGVDINTSNATSK 136
++ ++ LLE GVDINT N + + F D L+ ++ G IN N + K
Sbjct: 161 IEIIKLLLEKGVDINTHNNKEKYDTLHCYFKYNIDRIDADILKLFVDNGFIINKENKSHK 220
Query: 137 K--LD--NQLNIHQKNIKNN-----------------GLNALHLAAKDGHTTVVKALLKG 175
K ++ N L K K N G N L+ + + + + LL+
Sbjct: 221 KKFMEYLNSLLYDNKRFKKNILDFIFSYIDINQVDELGFNPLYYSVSHNNRKIFEYLLQL 280
Query: 176 GCKVDASTKKGNTALHIA 193
G ++ T+ GNT L A
Sbjct: 281 GGDINIITELGNTCLFTA 298
>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2. This family of
mediator complex subunit 2 proteins is conserved in
fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts
with and phosphorylates Med2. Post-translational
modifications of Mediator subunits are important for
regulation of gene expression.
Length = 99
Score = 27.4 bits (61), Expect = 2.8
Identities = 12/21 (57%), Positives = 18/21 (85%)
Query: 13 TMLEKKEKEEERKEEEEEEEK 33
+KK++EEERK++EEEE+K
Sbjct: 79 VENKKKQEEEERKKKEEEEKK 99
>gnl|CDD|181127 PRK07807, PRK07807, inosine 5-monophosphate dehydrogenase;
Validated.
Length = 479
Score = 28.7 bits (65), Expect = 2.8
Identities = 24/70 (34%), Positives = 32/70 (45%), Gaps = 22/70 (31%)
Query: 75 AARAGHLDKLRELLEAGVD---INTSNASDPNT-SFLRAARAGHL------------DKL 118
AA+A R LLEAGVD ++T++ LRA RA +
Sbjct: 229 AAKA------RALLEAGVDVLVVDTAHGHQEKMLEALRAVRALDPGVPIVAGNVVTAEGT 282
Query: 119 RELLEAGVDI 128
R+L+EAG DI
Sbjct: 283 RDLVEAGADI 292
>gnl|CDD|233161 TIGR00870, trp, transient-receptor-potential calcium channel
protein. The Transient Receptor Potential Ca2+ Channel
(TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also
been called the store-operated calcium channel (SOC)
family. The prototypical members include the Drosophila
retinal proteinsTRP and TRPL (Montell and Rubin, 1989;
Hardie and Minke, 1993). SOC members of the family
mediate the entry of extracellular Ca2+ into cells in
responseto depletion of intracellular Ca2+ stores
(Clapham, 1996) and agonist stimulated production of
inositol-1,4,5 trisphosphate (IP3). One member of the
TRP-CCfamily, mammalian Htrp3, has been shown to form a
tight complex with the IP3 receptor (TC #1.A.3.2.1).
This interaction is apparently required for IP3
tostimulate Ca2+ release via Htrp3. The vanilloid
receptor subtype 1 (VR1), which is the receptor for
capsaicin (the ?hot? ingredient in chili peppers) and
servesas a heat-activated ion channel in the pain
pathway (Caterina et al., 1997), is also a member of
this family. The stretch-inhibitable non-selective
cation channel(SIC) is identical to the vanilloid
receptor throughout all of its first 700 residues, but
it exhibits a different sequence in its last 100
residues. VR1 and SICtransport monovalent cations as
well as Ca2+. VR1 is about 10x more permeable to Ca2+
than to monovalent ions. Ca2+ overload probably causes
cell deathafter chronic exposure to capsaicin.
(McCleskey and Gold, 1999) [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 743
Score = 28.5 bits (64), Expect = 2.9
Identities = 14/62 (22%), Positives = 27/62 (43%), Gaps = 5/62 (8%)
Query: 135 SKKLDNQLNIHQKNIKNNGLNALHLAAKDG-HTTVVKALLKGGCKVDASTKKGNTALHIA 193
+ L+ ++ G +AL +AA + + + + LL C+ G+T LH
Sbjct: 34 YRDLEEPKKLNINCPDRLGRSALFVAAIENENLELTELLLNLSCRGAV----GDTLLHAI 89
Query: 194 AL 195
+L
Sbjct: 90 SL 91
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.5 bits (64), Expect = 2.9
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 15 LEKKEKEEERKEEEEEEEKNISA 37
LE+K+K+E KE EE E + A
Sbjct: 108 LEEKQKQEREKEREEAELRQRLA 130
Score = 27.0 bits (60), Expect = 7.6
Identities = 5/21 (23%), Positives = 16/21 (76%)
Query: 14 MLEKKEKEEERKEEEEEEEKN 34
L+K +++++++EEE +++
Sbjct: 188 ELKKLKQQQQKREEERRKQRK 208
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 27.3 bits (61), Expect = 3.0
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 16 EKKEKEEERKEEEEEEEKNISAIG 39
++E+EEE +EEEEEEE A+
Sbjct: 75 AEEEEEEEEEEEEEEEESEEEAMA 98
Score = 26.9 bits (60), Expect = 3.5
Identities = 12/24 (50%), Positives = 20/24 (83%)
Query: 16 EKKEKEEERKEEEEEEEKNISAIG 39
E++E+EEE +EEEE EE+ ++ +G
Sbjct: 78 EEEEEEEEEEEEEESEEEAMAGLG 101
Score = 25.8 bits (57), Expect = 9.3
Identities = 12/24 (50%), Positives = 18/24 (75%)
Query: 16 EKKEKEEERKEEEEEEEKNISAIG 39
E++E+EEE +EEEEEE + + G
Sbjct: 76 EEEEEEEEEEEEEEEESEEEAMAG 99
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 28.2 bits (63), Expect = 3.2
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 15 LEKKEKEEERKEEEEEEEK 33
E+KE+EEE K EEE+ +
Sbjct: 159 AEEKEREEEEKAAEEEKAR 177
Score = 27.0 bits (60), Expect = 8.0
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 16 EKKEKEEERKEEEEEEEKN 34
EK+ +EEE+ EEE+ +
Sbjct: 161 EKEREEEEKAAEEEKAREE 179
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 28.5 bits (64), Expect = 3.2
Identities = 11/19 (57%), Positives = 18/19 (94%)
Query: 16 EKKEKEEERKEEEEEEEKN 34
E+K++EEE+++EEEEEE+
Sbjct: 345 EQKDEEEEQEDEEEEEEEE 363
Score = 27.7 bits (62), Expect = 6.1
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 16 EKKEKEEERKEEEEEEEKNISAIGP 40
E+++++EE +EEEEE E+ GP
Sbjct: 350 EEEQEDEEEEEEEEEPEEPEPEEGP 374
Score = 26.9 bits (60), Expect = 9.3
Identities = 11/21 (52%), Positives = 15/21 (71%)
Query: 14 MLEKKEKEEERKEEEEEEEKN 34
+K E+EE+ EEEEEEE+
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEE 364
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 26.8 bits (60), Expect = 3.3
Identities = 7/17 (41%), Positives = 14/17 (82%)
Query: 16 EKKEKEEERKEEEEEEE 32
+E+++E +EEEEE++
Sbjct: 65 AAEEEKKEEEEEEEEDD 81
Score = 25.7 bits (57), Expect = 8.8
Identities = 9/17 (52%), Positives = 13/17 (76%)
Query: 16 EKKEKEEERKEEEEEEE 32
+EE+++EEEEEEE
Sbjct: 63 AAAAEEEKKEEEEEEEE 79
Score = 25.7 bits (57), Expect = 9.3
Identities = 8/17 (47%), Positives = 13/17 (76%)
Query: 16 EKKEKEEERKEEEEEEE 32
E+E++ +EEEEEE+
Sbjct: 64 AAAEEEKKEEEEEEEED 80
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 28.2 bits (63), Expect = 3.5
Identities = 12/21 (57%), Positives = 17/21 (80%)
Query: 14 MLEKKEKEEERKEEEEEEEKN 34
MLEKK KE E ++ +EE+EK+
Sbjct: 155 MLEKKLKELEAEDVDEEDEKD 175
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 27.4 bits (61), Expect = 3.7
Identities = 12/28 (42%), Positives = 18/28 (64%), Gaps = 2/28 (7%)
Query: 9 GAFGTMLE--KKEKEEERKEEEEEEEKN 34
A+ LE +K +EEE++ EE EEE +
Sbjct: 93 SAYKKQLEENRKLEEEEKEREELEEEND 120
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 28.2 bits (63), Expect = 3.9
Identities = 11/32 (34%), Positives = 20/32 (62%), Gaps = 2/32 (6%)
Query: 10 AFGTMLEK--KEKEEERKEEEEEEEKNISAIG 39
+MLE + +++E +EEEEE+E I ++
Sbjct: 168 DVNSMLEALFRREKKEEEEEEEEDEALIKSLS 199
>gnl|CDD|132724 cd02737, RNAP_IV_NRPD1_C, Largest subunit (NRPD1) of Higher plant
RNA polymerase IV, C-terminal domain. Higher plants
have five multi-subunit nuclear RNA polymerases: RNAP I,
RNAP II and RNAP III, which are essential for viability;
plus the two isoforms of the non-essential polymerase
RNAP IV (IVa and IVb), which specialize in small
RNA-mediated gene silencing pathways. RNAP IVa and/or
RNAP IVb might be involved in RNA-directed DNA
methylation of endogenous repetitive elements, silencing
of transgenes, regulation of flowering-time genes,
inducible regulation of adjacent gene pairs, and
spreading of mobile silencing signals. NRPD1a is the
largest subunit of RNAP IVa, whereas NRPD1b is the
largest subunit of RNAP IVb. The full subunit
compositions of RNAP IVa and RNAP IVb are not known, nor
are their templates or enzymatic products. However, it
has been shown that RNAP IVa and, to a lesser extent,
RNAP IVb are crucial for several RNA-mediated gene
silencing phenomena.
Length = 381
Score = 28.2 bits (63), Expect = 4.1
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 64 CYSDPNTSFLRAARAGHLDKLRELLEA 90
C+S P FL+AA+ G D L +L+A
Sbjct: 336 CFSSPIKCFLKAAKKGASDSLSGVLDA 362
>gnl|CDD|236731 PRK10658, PRK10658, putative alpha-glucosidase; Provisional.
Length = 665
Score = 28.3 bits (64), Expect = 4.2
Identities = 9/19 (47%), Positives = 12/19 (63%)
Query: 75 AARAGHLDKLRELLEAGVD 93
A + DKL+ LL+ GVD
Sbjct: 392 DACKWYADKLKGLLDMGVD 410
Score = 28.3 bits (64), Expect = 4.2
Identities = 9/19 (47%), Positives = 12/19 (63%)
Query: 109 AARAGHLDKLRELLEAGVD 127
A + DKL+ LL+ GVD
Sbjct: 392 DACKWYADKLKGLLDMGVD 410
>gnl|CDD|215936 pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain. This
family is involved in biosynthesis of guanosine
nucleotide. Members of this family contain a TIM barrel
structure. In the inosine monophosphate dehydrogenases 2
CBS domains pfam00571 are inserted in the TIM barrel.
This family is a member of the common phosphate binding
site TIM barrel family.
Length = 467
Score = 28.0 bits (63), Expect = 4.2
Identities = 15/69 (21%), Positives = 29/69 (42%), Gaps = 16/69 (23%)
Query: 76 ARAGHLDKLRELLEAGVD---INTSNASDPNT-SFLRAARAGHLDK------------LR 119
R L++ L+EAGVD I++++ ++ + + + R
Sbjct: 220 TREDDLERAEALVEAGVDVIVIDSAHGHSEYVLEMIKWIKKKYPELDVIAGNVVTAEAAR 279
Query: 120 ELLEAGVDI 128
EL++AG D
Sbjct: 280 ELIDAGADA 288
>gnl|CDD|110514 pfam01517, HDV_ag, Hepatitis delta virus delta antigen. The
hepatitis delta virus (HDV) encodes a single protein,
the hepatitis delta antigen (HDAg). The central region
of this protein has been shown to bind RNA. Several
interactions are also mediated by a coiled-coil region
at the N terminus of the protein.
Length = 194
Score = 27.5 bits (61), Expect = 4.2
Identities = 14/35 (40%), Positives = 21/35 (60%)
Query: 8 LGAFGTMLEKKEKEEERKEEEEEEEKNISAIGPAV 42
L + G L ++E+EE R+ EE+EE+ GP V
Sbjct: 114 LSSGGKHLSREEEEELRRLTEEDEERERRVAGPRV 148
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 27.2 bits (61), Expect = 4.3
Identities = 9/18 (50%), Positives = 12/18 (66%)
Query: 16 EKKEKEEERKEEEEEEEK 33
EK+ E+ +KEEE E K
Sbjct: 93 EKEALEQAKKEEELREAK 110
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this
family (Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 26.8 bits (60), Expect = 4.4
Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
Query: 16 EKKEKEEERKEEEEEEEKN--ISAIGPAVTKLRQEAKIHNAVISLE 59
E+KE+E+ ++EEE + KN I + K+++ A + A +S E
Sbjct: 2 ERKEEEKAQREEELKRLKNLKREEIEEKLEKIKKVAGLRGADLSEE 47
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 28.1 bits (63), Expect = 4.7
Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 6/50 (12%)
Query: 16 EKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEA----KI--HNAVISLE 59
+E+EEE +EEEEEEE + +++ A KI NA I E
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGGIKIILKNAKIYAE 452
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 27.6 bits (62), Expect = 4.7
Identities = 8/22 (36%), Positives = 15/22 (68%)
Query: 16 EKKEKEEERKEEEEEEEKNISA 37
+++KEE++KEE E + +S
Sbjct: 284 AQEKKEEKKKEEREAKLAKLSP 305
>gnl|CDD|227798 COG5511, COG5511, Bacteriophage capsid protein [General function
prediction only].
Length = 492
Score = 27.7 bits (62), Expect = 5.0
Identities = 12/33 (36%), Positives = 13/33 (39%), Gaps = 7/33 (21%)
Query: 88 LEAGVDINTSNASDPNTSF-------LRAARAG 113
L G DI + DPN LRA AG
Sbjct: 299 LNPGEDIKLQTSQDPNNGLESFRQGQLRAIAAG 331
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 28.0 bits (63), Expect = 5.0
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 15 LEKKEKEEERKEEEEEEEKN 34
L KK+++ E+K E+EE EK
Sbjct: 414 LRKKQRKAEKKAEKEEAEKA 433
>gnl|CDD|218458 pfam05136, Phage_portal_2, Phage portal protein, lambda family.
This protein forms a hole, or portal, that enables DNA
passage during packaging and ejection. It also forms the
junction between the phage capsid and the tail proteins.
Length = 352
Score = 27.6 bits (62), Expect = 5.6
Identities = 13/33 (39%), Positives = 14/33 (42%), Gaps = 7/33 (21%)
Query: 88 LEAGVDINTSNASDPNTSF-------LRAARAG 113
L G DI SN P +F LRA AG
Sbjct: 284 LPPGEDIKFSNPDRPGNNFEEFVKAQLRAIAAG 316
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more
internally with limited reactivity in the C-terminal
domains, while P2 proteins seem to be more externally
located and are more likely to interact with other
cellular components. In lower eukaryotes, P1 and P2 are
further subdivided into P1A, P1B, P2A, and P2B, which
form P1A/P2B and P1B/P2A heterodimers. Some plant
species have a third P-protein, called P3, which is not
homologous to P1 and P2. In humans, P1 and P2 are
strongly autoimmunogenic. They play a significant role
in the etiology and pathogenesis of systemic lupus
erythema (SLE). In addition, the ribosome-inactivating
protein trichosanthin (TCS) interacts with human P0,
P1, and P2, with its primary binding site located in
the C-terminal region of P2. TCS inactivates the
ribosome by depurinating a specific adenine in the
sarcin-ricin loop of 28S rRNA.
Length = 103
Score = 26.5 bits (59), Expect = 5.6
Identities = 10/17 (58%), Positives = 13/17 (76%)
Query: 16 EKKEKEEERKEEEEEEE 32
+ K+EE+KEEEEEE
Sbjct: 78 AAEAKKEEKKEEEEEES 94
>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved
region at the N-terminal end of a family of proteins
referred to as CIRs (CBF1-interacting co-repressors).
CBF1 (centromere-binding factor 1) acts as a
transcription factor that causes repression by binding
specifically to GTGGGAA motifs in responsive promoters,
and it requires CIR as a co-repressor. CIR binds to
histone deacetylase and to SAP30 and serves as a linker
between CBF1 and the histone deacetylase complex.
Length = 37
Score = 24.9 bits (55), Expect = 5.6
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 14 MLEKKEKEEERKEEEEEEE 32
E+K EE++K EE +E
Sbjct: 15 EAEQKALEEQKKIEELRKE 33
Score = 24.5 bits (54), Expect = 8.5
Identities = 9/17 (52%), Positives = 14/17 (82%)
Query: 15 LEKKEKEEERKEEEEEE 31
LE+++K EE ++E EEE
Sbjct: 21 LEEQKKIEELRKEIEEE 37
>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
Ribosomal proteins P1 and P2 are the eukaryotic proteins
that are functionally equivalent to bacterial L7/L12.
L12p is the archaeal homolog. Unlike other ribosomal
proteins, the archaeal L12p and eukaryotic P1 and P2 do
not share sequence similarity with their bacterial
counterparts. They are part of the ribosomal stalk
(called the L7/L12 stalk in bacteria), along with 28S
rRNA and the proteins L11 and P0 in eukaryotes (23S
rRNA, L11, and L10e in archaea). In bacterial ribosomes,
L7/L12 homodimers bind the extended C-terminal helix of
L10 to anchor the L7/L12 molecules to the ribosome.
Eukaryotic P1/P2 heterodimers and archaeal L12p
homodimers are believed to bind the L10 equivalent
proteins, eukaryotic P0 and archaeal L10e, in a similar
fashion. P1 and P2 (L12p, L7/L12) are the only proteins
in the ribosome to occur as multimers, always appearing
as sets of dimers. Recent data indicate that most
archaeal species contain six copies of L12p (three
homodimers), while eukaryotes have two copies each of P1
and P2 (two heterodimers). Bacteria may have four or six
copies (two or three homodimers), depending on the
species. As in bacteria, the stalk is crucial for
binding of initiation, elongation, and release factors
in eukaryotes and archaea.
Length = 105
Score = 26.5 bits (58), Expect = 6.1
Identities = 14/24 (58%), Positives = 14/24 (58%)
Query: 16 EKKEKEEERKEEEEEEEKNISAIG 39
E EK EE KEEEEEEE G
Sbjct: 80 EPAEKAEEAKEEEEEEEDEDFGFG 103
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for
centrosome separation and maintenance of spindle
bi-polarity. TPX2 is a microtubule-associated protein
that mediates the binding of the C-terminal domain of
Xklp2 to microtubules. It is phosphorylated during
mitosis in a microtubule-dependent way.
Length = 57
Score = 25.4 bits (56), Expect = 6.2
Identities = 12/35 (34%), Positives = 18/35 (51%), Gaps = 7/35 (20%)
Query: 14 MLEKKEKEEERKEEEEEEEKNISAIGPAVTKLRQE 48
+ EKEE ++EEEE+ I +LR+E
Sbjct: 22 KALEAEKEEAEARQKEEEEEAIK-------QLRKE 49
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 26.3 bits (58), Expect = 6.3
Identities = 10/24 (41%), Positives = 19/24 (79%)
Query: 16 EKKEKEEERKEEEEEEEKNISAIG 39
++++EE++KEEE+EEE+ + G
Sbjct: 78 AEEKEEEKKKEEEKEEEEEEALAG 101
>gnl|CDD|238200 cd00326, alpha_CA, Carbonic anhydrase alpha (vertebrate-like)
group. Carbonic anhydrases (CAs) are zinc-containing
enzymes that catalyze the reversible hydration of carbon
dioxide in a two-step mechanism: a nucleophilic attack
of a zinc-bound hydroxide ion on carbon dioxide,
followed by the regeneration of the active site by
ionization of the zinc-bound water molecule and removal
of a proton from the active site. They are ubiquitous
enzymes involved in fundamental processes like
photosynthesis, respiration, pH homeostasis and ion
transport. There are three evolutionary distinct groups
- alpha, beta and gamma carbonic anhydrases - which show
no significant sequence identity or structural
similarity. Most alpha CAs are monomeric enzymes. The
zinc ion is complexed by three histidine residues and a
fourth conserved histidine plays a potential role in
proton transfer.
Length = 227
Score = 27.2 bits (61), Expect = 6.5
Identities = 8/33 (24%), Positives = 11/33 (33%), Gaps = 5/33 (15%)
Query: 126 VDINTSNATSKKLDNQLNIH-----QKNIKNNG 153
++I TS LN + NNG
Sbjct: 7 INIVTSAVVYDPSLPPLNFDYYPTTSLTLVNNG 39
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 90 amino acids in
length. There are two completely conserved L residues
that may be functionally important.
Length = 70
Score = 25.8 bits (57), Expect = 6.7
Identities = 12/22 (54%), Positives = 17/22 (77%), Gaps = 1/22 (4%)
Query: 16 EKKEKEEERKEEE-EEEEKNIS 36
+++E+EE RK EE +E EKN S
Sbjct: 35 KREEEEEARKREERKEREKNKS 56
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 27.5 bits (61), Expect = 6.7
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 15 LEKKEKEEERKEEEEEEEK 33
+E+ +EEE +EEEEEEE
Sbjct: 398 VERWAEEEEEEEEEEEEEA 416
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 26.2 bits (58), Expect = 6.8
Identities = 12/17 (70%), Positives = 12/17 (70%)
Query: 16 EKKEKEEERKEEEEEEE 32
E E EEE KEEE EEE
Sbjct: 82 EADEAEEEEKEEEAEEE 98
>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit. This
family represents the eukaryotic vacuolar (H+)-ATPase
(V-ATPase) G subunit. V-ATPases generate an acidic
environment in several intracellular compartments.
Correspondingly, they are found as membrane-attached
proteins in several organelles. They are also found in
the plasma membranes of some specialised cells.
V-ATPases consist of peripheral (V1) and membrane
integral (V0) heteromultimeric complexes. The G subunit
is part of the V1 subunit, but is also thought to be
strongly attached to the V0 complex. It may be involved
in the coupling of ATP degradation to H+ translocation.
Length = 105
Score = 26.4 bits (59), Expect = 6.9
Identities = 11/38 (28%), Positives = 19/38 (50%), Gaps = 4/38 (10%)
Query: 17 KKEKEEERKEEEEE----EEKNISAIGPAVTKLRQEAK 50
+KE EE R + E E E ++ + G K+ +E +
Sbjct: 39 EKEIEEYRAQREAEFKEFEAEHSGSRGELEKKIEKETE 76
>gnl|CDD|236321 PRK08645, PRK08645, bifunctional homocysteine
S-methyltransferase/5,10-methylenetetrahydrofolate
reductase protein; Reviewed.
Length = 612
Score = 27.5 bits (62), Expect = 7.1
Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 8/47 (17%)
Query: 85 RELLEAGVDI---NTSNASDPNTSFLRAARAGHLDKLRELLEAGVDI 128
RE +EAG D+ NT A+ ++ R G DK++E+ A V +
Sbjct: 50 REYIEAGADVIQTNTFGAN-----RIKLKRYGLEDKVKEINRAAVRL 91
>gnl|CDD|238640 cd01315, L-HYD_ALN, L-Hydantoinases (L-HYDs) and Allantoinase
(ALN); L-Hydantoinases are a member of the
dihydropyrimidinase family, which catalyzes the
reversible hydrolytic ring opening of dihydropyrimidines
and hydantoins (five-membered cyclic diamides used in
biotechnology). But L-HYDs differ by having an L-enantio
specificity and by lacking activity on possible natural
substrates such as dihydropyrimidines. Allantoinase
catalyzes the hydrolytic cleavage of the five-member
ring of allantoin (5-ureidohydantoin) to form allantoic
acid.
Length = 447
Score = 27.3 bits (61), Expect = 7.5
Identities = 10/14 (71%), Positives = 12/14 (85%)
Query: 79 GHLDKLRELLEAGV 92
G+LD+LR L EAGV
Sbjct: 128 GNLDQLRPLDEAGV 141
Score = 27.3 bits (61), Expect = 7.5
Identities = 10/14 (71%), Positives = 12/14 (85%)
Query: 113 GHLDKLRELLEAGV 126
G+LD+LR L EAGV
Sbjct: 128 GNLDQLRPLDEAGV 141
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 27.2 bits (60), Expect = 7.6
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 14 MLEKKEKEEERKEEEEEEEKNISAIG 39
MLE+ ++++E +E EEEEE + +
Sbjct: 331 MLEETQEQQEEEENEEEEENDTVKLS 356
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 27.3 bits (61), Expect = 7.6
Identities = 7/17 (41%), Positives = 7/17 (41%)
Query: 16 EKKEKEEERKEEEEEEE 32
EEEEEEE
Sbjct: 285 AAAAAAAAPAEEEEEEE 301
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 27.0 bits (60), Expect = 7.7
Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 2/45 (4%)
Query: 15 LEKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEAKIHNAVISLE 59
L +K+KEEE + E E + P +L+Q+AK S E
Sbjct: 118 LAQKQKEEEERRVERRRELGLE--DPEQLRLKQKAKEEQKAESEE 160
>gnl|CDD|215568 PLN03088, PLN03088, SGT1, suppressor of G2 allele of SKP1;
Provisional.
Length = 356
Score = 27.1 bits (60), Expect = 7.8
Identities = 23/78 (29%), Positives = 32/78 (41%), Gaps = 6/78 (7%)
Query: 67 DPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTS--FLRAARA----GHLDKLRE 120
DPN + L A RA KL EA D N + DP+ + +LR A +
Sbjct: 32 DPNNAELYADRAQANIKLGNFTEAVADANKAIELDPSLAKAYLRKGTACMKLEEYQTAKA 91
Query: 121 LLEAGVDINTSNATSKKL 138
LE G + ++ KL
Sbjct: 92 ALEKGASLAPGDSRFTKL 109
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.
This family includes the HABP4 family of
hyaluronan-binding proteins, and the PAI-1 mRNA-binding
protein, PAI-RBP1. HABP4 has been observed to bind
hyaluronan (a glucosaminoglycan), but it is not known
whether this is its primary role in vivo. It has also
been observed to bind RNA, but with a lower affinity
than that for hyaluronan. PAI-1 mRNA-binding protein
specifically binds the mRNA of type-1 plasminogen
activator inhibitor (PAI-1), and is thought to be
involved in regulation of mRNA stability. However, in
both cases, the sequence motifs predicted to be
important for ligand binding are not conserved
throughout the family, so it is not known whether
members of this family share a common function.
Length = 106
Score = 26.2 bits (58), Expect = 7.9
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 16 EKKEKEEERKEEEEEEEKNIS 36
KE E +E EEEE+K ++
Sbjct: 60 ANKEGVVEEEEVEEEEDKEMT 80
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes,
and is typically between 212 and 238 amino acids in
length. The family is found in association with
pfam01805. There are two completely conserved residues
(W and H) that may be functionally important. PRP21 is
required for assembly of the prespliceosome and it
interacts with U2 snRNP and/or pre-mRNA in the
prespliceosome. This family also contains proteins
similar to PRP21, such as the mammalian SF3a. SF3a also
interacts with U2 snRNP from the prespliceosome,
converting it to its active form.
Length = 230
Score = 27.0 bits (60), Expect = 7.9
Identities = 11/16 (68%), Positives = 16/16 (100%)
Query: 18 KEKEEERKEEEEEEEK 33
++KEE++KEEEEEEE+
Sbjct: 33 QKKEEQKKEEEEEEER 48
>gnl|CDD|225815 COG3276, SelB, Selenocysteine-specific translation elongation
factor [Translation, ribosomal structure and
biogenesis].
Length = 447
Score = 27.0 bits (60), Expect = 8.5
Identities = 25/104 (24%), Positives = 35/104 (33%), Gaps = 23/104 (22%)
Query: 103 NTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNI-----HQKNIKN--NGLN 155
T+ L+A G D+L E + G+ I+ K D + H I N GL
Sbjct: 13 KTTLLKALTGGVTDRLPEEKKRGITIDLGFYYRKLEDGVMGFIDVPGHPDFISNLLAGLG 72
Query: 156 ALHLAAKDGHTTVVKALLKGGCKVDASTK-KGNTALHIAALDSL 198
+ A +V V A T H+ LD L
Sbjct: 73 GIDYA------LLV---------VAADEGLMAQTGEHLLILDLL 101
Score = 27.0 bits (60), Expect = 8.5
Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 6/61 (9%)
Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINTSNAS----DPNTSFLRAARAGHLDKLRELLEA 124
T+ L+A G D+L E + G+ I+ D F+ GH D + LL
Sbjct: 13 KTTLLKALTGGVTDRLPEEKKRGITIDLGFYYRKLEDGVMGFIDV--PGHPDFISNLLAG 70
Query: 125 G 125
Sbjct: 71 L 71
>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved
region at the N-terminal end of a family of proteins
referred to as CIRs (CBF1-interacting co-repressors).
CBF1 (centromere-binding factor 1) acts as a
transcription factor that causes repression by binding
specifically to GTGGGAA motifs in responsive promoters,
and it requires CIR as a co-repressor. CIR binds to
histone deacetylase and to SAP30 and serves as a linker
between CBF1 and the histone deacetylase complex.
Length = 37
Score = 24.4 bits (54), Expect = 9.6
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 14 MLEKKEKEEERKEEEEEEE 32
E+K +EE++K EE +E
Sbjct: 15 KAEQKAEEEKKKIEERRKE 33
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.127 0.345
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,076,277
Number of extensions: 1043159
Number of successful extensions: 3751
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3393
Number of HSP's successfully gapped: 255
Length of query: 225
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 132
Effective length of database: 6,812,680
Effective search space: 899273760
Effective search space used: 899273760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.6 bits)