RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10604
         (225 letters)



>gnl|CDD|238125 cd00204, ANK, ankyrin repeats;  ankyrin repeats mediate
           protein-protein interactions in very diverse families of
           proteins. The number of ANK repeats in a protein can
           range from 2 to over 20 (ankyrins, for example). ANK
           repeats may occur in combinations with other types of
           domains. The structural repeat unit contains two
           antiparallel helices and a beta-hairpin, repeats are
           stacked in a superhelical arrangement; this alignment
           contains 4 consecutive repeats.
          Length = 126

 Score = 83.2 bits (206), Expect = 2e-20
 Identities = 47/126 (37%), Positives = 64/126 (50%), Gaps = 19/126 (15%)

Query: 69  NTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDI 128
            T    AA  GHL+ ++ LLE G D+N  +     T    AA+ GHL+ ++ LLE G D+
Sbjct: 8   RTPLHLAASNGHLEVVKLLLENGADVNAKDNDG-RTPLHLAAKNGHLEIVKLLLEKGADV 66

Query: 129 NTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNT 188
           N  +                   +G   LHLAA++G+  VVK LLK G  V+A  K G T
Sbjct: 67  NARD------------------KDGNTPLHLAARNGNLDVVKLLLKHGADVNARDKDGRT 108

Query: 189 ALHIAA 194
            LH+AA
Sbjct: 109 PLHLAA 114



 Score = 55.5 bits (134), Expect = 4e-10
 Identities = 22/44 (50%), Positives = 27/44 (61%)

Query: 151 NNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAA 194
            +G   LHLAA +GH  VVK LL+ G  V+A    G T LH+AA
Sbjct: 5   EDGRTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAA 48


>gnl|CDD|205076 pfam12796, Ank_2, Ankyrin repeats (3 copies). 
          Length = 91

 Score = 65.7 bits (161), Expect = 3e-14
 Identities = 34/110 (30%), Positives = 55/110 (50%), Gaps = 21/110 (19%)

Query: 72  FLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTS 131
              AA+ G+L+ ++ LLE G D+N  +          AAR G+L+ ++ LLE G D+N  
Sbjct: 1   LHLAAKNGNLELVKLLLEKGADVNLGDTDTA---LHLAARNGNLEIVKLLLEHGADVNAK 57

Query: 132 NATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDA 181
           +                   +G  ALHLAA++G+  +VK LL+ G  ++ 
Sbjct: 58  D------------------KDGNTALHLAARNGNLEIVKLLLEHGADINL 89



 Score = 63.4 bits (155), Expect = 2e-13
 Identities = 33/89 (37%), Positives = 48/89 (53%), Gaps = 20/89 (22%)

Query: 106 FLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGH 165
              AA+ G+L+ ++ LLE G D+N  +  +                    ALHLAA++G+
Sbjct: 1   LHLAAKNGNLELVKLLLEKGADVNLGDTDT--------------------ALHLAARNGN 40

Query: 166 TTVVKALLKGGCKVDASTKKGNTALHIAA 194
             +VK LL+ G  V+A  K GNTALH+AA
Sbjct: 41  LEIVKLLLEHGADVNAKDKDGNTALHLAA 69



 Score = 48.4 bits (116), Expect = 6e-08
 Identities = 25/58 (43%), Positives = 37/58 (63%), Gaps = 1/58 (1%)

Query: 75  AARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSN 132
           AAR G+L+ ++ LLE G D+N  +    NT+   AAR G+L+ ++ LLE G DIN  +
Sbjct: 35  AARNGNLEIVKLLLEHGADVNAKDKDG-NTALHLAARNGNLEIVKLLLEHGADINLKD 91



 Score = 29.1 bits (66), Expect = 0.60
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINTSN 98
          NT+   AAR G+L+ ++ LLE G DIN  +
Sbjct: 62 NTALHLAARNGNLEIVKLLLEHGADINLKD 91


>gnl|CDD|215625 PLN03192, PLN03192, Voltage-dependent potassium channel;
           Provisional.
          Length = 823

 Score = 51.4 bits (123), Expect = 1e-07
 Identities = 46/165 (27%), Positives = 71/165 (43%), Gaps = 17/165 (10%)

Query: 67  DPNTSF--LRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEA 124
           DPN +   L  A  G+   L ELL+A +D +  ++    T    AA  G+ D +  LL+ 
Sbjct: 522 DPNMASNLLTVASTGNAALLEELLKAKLDPDIGDSKG-RTPLHIAASKGYEDCVLVLLKH 580

Query: 125 GVDINTS---------NATSKKLDNQLNI--HQKNIKN--NGLNALHLAAKDGHTTVVKA 171
             +++           NA S K      I  H  +I +     + L  AAK    T +K 
Sbjct: 581 ACNVHIRDANGNTALWNAISAKHHKIFRILYHFASISDPHAAGDLLCTAAKRNDLTAMKE 640

Query: 172 LLKGGCKVDASTKKGNTALHIAALDSLVTDIASSAITDATSLPCS 216
           LLK G  VD+   +G TAL +A  +  V D+    I +   +  +
Sbjct: 641 LLKQGLNVDSEDHQGATALQVAMAEDHV-DMVRLLIMNGADVDKA 684



 Score = 46.4 bits (110), Expect = 5e-06
 Identities = 30/98 (30%), Positives = 39/98 (39%), Gaps = 20/98 (20%)

Query: 95  NTSNASDPNTSF--LRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIKNN 152
           N     DPN +   L  A  G+   L ELL+A +D +  ++                   
Sbjct: 516 NGGEHDDPNMASNLLTVASTGNAALLEELLKAKLDPDIGDSK------------------ 557

Query: 153 GLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTAL 190
           G   LH+AA  G+   V  LLK  C V      GNTAL
Sbjct: 558 GRTPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTAL 595


>gnl|CDD|222277 pfam13637, Ank_4, Ankyrin repeats (many copies). 
          Length = 54

 Score = 45.7 bits (109), Expect = 3e-07
 Identities = 21/43 (48%), Positives = 27/43 (62%)

Query: 153 GLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAAL 195
           G  ALH AA  G   +VK LL+ G  ++ + + GNTALHIAA 
Sbjct: 1   GRTALHKAAISGRLELVKYLLEKGVDINRTDEDGNTALHIAAE 43



 Score = 43.4 bits (103), Expect = 2e-06
 Identities = 22/54 (40%), Positives = 34/54 (62%), Gaps = 1/54 (1%)

Query: 69  NTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELL 122
            T+  +AA +G L+ ++ LLE GVDIN ++    NT+   AA  G+L+ L+ LL
Sbjct: 2   RTALHKAAISGRLELVKYLLEKGVDINRTDEDG-NTALHIAAENGNLEVLKLLL 54



 Score = 42.6 bits (101), Expect = 4e-06
 Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 18/71 (25%)

Query: 103 NTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAK 162
            T+  +AA +G L+ ++ LLE GVDIN ++                   +G  ALH+AA+
Sbjct: 2   RTALHKAAISGRLELVKYLLEKGVDINRTD------------------EDGNTALHIAAE 43

Query: 163 DGHTTVVKALL 173
           +G+  V+K LL
Sbjct: 44  NGNLEVLKLLL 54


>gnl|CDD|223738 COG0666, Arp, FOG: Ankyrin repeat [General function prediction
           only].
          Length = 235

 Score = 48.3 bits (114), Expect = 6e-07
 Identities = 38/140 (27%), Positives = 60/140 (42%), Gaps = 21/140 (15%)

Query: 66  SDPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHL-----DKLRE 120
            D       AA  G    ++ LL +G D+N  +A D +T    AA  G+      +  + 
Sbjct: 71  LDGRLPLHSAASKGDDKIVKLLLASGADVNAKDA-DGDTPLHLAALNGNPPEGNIEVAKL 129

Query: 121 LLEAGVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVD 180
           LLEAG D++ +N   +               +G   LH AA +G   +V+ LL+ G   +
Sbjct: 130 LLEAGADLDVNNLRDE---------------DGNTPLHWAALNGDADIVELLLEAGADPN 174

Query: 181 ASTKKGNTALHIAALDSLVT 200
           +    G TAL  AA +  + 
Sbjct: 175 SRNSYGVTALDPAAKNGRIE 194



 Score = 37.5 bits (86), Expect = 0.002
 Identities = 35/101 (34%), Positives = 48/101 (47%), Gaps = 20/101 (19%)

Query: 78  AGHLDKLRELLEAGVDINTSNASDP--NTSFLRAARAGHLDKLRELLEAGVDINTSNATS 135
            G+++  + LLEAG D++ +N  D   NT    AA  G  D +  LLEAG D N+ N   
Sbjct: 121 EGNIEVAKLLLEAGADLDVNNLRDEDGNTPLHWAALNGDADIVELLLEAGADPNSRN--- 177

Query: 136 KKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGG 176
                          + G+ AL  AAK+G   +VK LL  G
Sbjct: 178 ---------------SYGVTALDPAAKNGRIELVKLLLDKG 203



 Score = 32.9 bits (74), Expect = 0.11
 Identities = 22/62 (35%), Positives = 30/62 (48%)

Query: 134 TSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIA 193
           +   L   ++ H      +G   LH AA  G   +VK LL  G  V+A    G+T LH+A
Sbjct: 54  SELLLKLIVDRHLAARDLDGRLPLHSAASKGDDKIVKLLLASGADVNAKDADGDTPLHLA 113

Query: 194 AL 195
           AL
Sbjct: 114 AL 115


>gnl|CDD|206028 pfam13857, Ank_5, Ankyrin repeats (many copies). 
          Length = 56

 Score = 44.3 bits (105), Expect = 1e-06
 Identities = 19/43 (44%), Positives = 24/43 (55%)

Query: 151 NNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIA 193
            NG   LHLAAK G   +V+ LLK G  ++     G TAL +A
Sbjct: 14  GNGNTPLHLAAKYGALELVQWLLKPGVDLNLRDSDGLTALDLA 56



 Score = 36.9 bits (86), Expect = 4e-04
 Identities = 22/74 (29%), Positives = 33/74 (44%), Gaps = 18/74 (24%)

Query: 87  LLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQ 146
           LLE G     +   + NT    AA+ G L+ ++ LL+ GVD+N  +              
Sbjct: 1   LLEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDLNLRD-------------- 46

Query: 147 KNIKNNGLNALHLA 160
               ++GL AL LA
Sbjct: 47  ----SDGLTALDLA 56



 Score = 30.0 bits (68), Expect = 0.12
 Identities = 12/32 (37%), Positives = 20/32 (62%)

Query: 69  NTSFLRAARAGHLDKLRELLEAGVDINTSNAS 100
           NT    AA+ G L+ ++ LL+ GVD+N  ++ 
Sbjct: 17  NTPLHLAAKYGALELVQWLLKPGVDLNLRDSD 48


>gnl|CDD|222984 PHA03100, PHA03100, ankyrin repeat protein; Provisional.
          Length = 422

 Score = 45.0 bits (107), Expect = 1e-05
 Identities = 35/119 (29%), Positives = 53/119 (44%), Gaps = 5/119 (4%)

Query: 79  GHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDK--LRELLEAGVDINTSNATSK 136
                +  LL+ G ++N  N SD          +  +D   L+ L++ GVDIN  N  + 
Sbjct: 119 NSYSIVEYLLDNGANVNIKN-SDGENLLHLYLESNKIDLKILKLLIDKGVDINAKNRVNY 177

Query: 137 KLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAAL 195
            L   + I+ K++       LH A  + +   VK LL  G   +   K G+T LHIA L
Sbjct: 178 LLSYGVPINIKDVYGF--TPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAIL 234



 Score = 37.3 bits (87), Expect = 0.004
 Identities = 29/123 (23%), Positives = 48/123 (39%), Gaps = 22/123 (17%)

Query: 93  DINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIK-- 150
           D+N  +   P      A  A ++D ++ LL+ G DIN+S   +    + L+  + N+   
Sbjct: 26  DLNDYSYKKPVLPLYLAKEARNIDVVKILLDNGADINSSTKNNSTPLHYLSNIKYNLTDV 85

Query: 151 ------------------NNGLNALHLAA--KDGHTTVVKALLKGGCKVDASTKKGNTAL 190
                             NNG+  L  A   K    ++V+ LL  G  V+     G   L
Sbjct: 86  KEIVKLLLEYGANVNAPDNNGITPLLYAISKKSNSYSIVEYLLDNGANVNIKNSDGENLL 145

Query: 191 HIA 193
           H+ 
Sbjct: 146 HLY 148



 Score = 33.5 bits (77), Expect = 0.070
 Identities = 21/95 (22%), Positives = 35/95 (36%), Gaps = 19/95 (20%)

Query: 82  DKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQ 141
           +++  LL  GV IN  +     T    A    + + ++ LL+ G + N  N         
Sbjct: 173 NRVNYLLSYGVPINIKDVYG-FTPLHYAVYNNNPEFVKYLLDLGANPNLVN--------- 222

Query: 142 LNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGG 176
                      G   LH+A  + +  + K LL  G
Sbjct: 223 ---------KYGDTPLHIAILNNNKEIFKLLLNNG 248



 Score = 30.4 bits (69), Expect = 0.78
 Identities = 19/81 (23%), Positives = 29/81 (35%), Gaps = 2/81 (2%)

Query: 69  NTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDI 128
            T    A    + + ++ LL+ G + N  N    +T    A    + +  + LL  G  I
Sbjct: 193 FTPLHYAVYNNNPEFVKYLLDLGANPNLVN-KYGDTPLHIAILNNNKEIFKLLLNNGPSI 251

Query: 129 NTSNATSKKL-DNQLNIHQKN 148
            T   T     D  LN   K 
Sbjct: 252 KTIIETLLYFKDKDLNTITKI 272


>gnl|CDD|222980 PHA03095, PHA03095, ankyrin-like protein; Provisional.
          Length = 471

 Score = 44.6 bits (106), Expect = 2e-05
 Identities = 28/111 (25%), Positives = 40/111 (36%), Gaps = 21/111 (18%)

Query: 82  DKLRELLEAGVDINTSNASDPNTSFLRAARAGHLD--KLRELLEAGVDINTSNATSKKLD 139
             +REL+ AG D   ++    NT     A         +  LL AG+ IN  N       
Sbjct: 203 RIVRELIRAGCDPAATDMLG-NTPLHSMATGSSCKRSLVLPLLIAGISINARN------- 254

Query: 140 NQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTAL 190
                        G   LH AA   +    + L+  G  ++A +  GNT L
Sbjct: 255 -----------RYGQTPLHYAAVFNNPRACRRLIALGADINAVSSDGNTPL 294



 Score = 43.5 bits (103), Expect = 4e-05
 Identities = 33/129 (25%), Positives = 51/129 (39%), Gaps = 20/129 (15%)

Query: 66  SDPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAG 125
             P   +L  +     D +R LLEAG D+N           L    A  LD ++ L++AG
Sbjct: 48  KTPLHLYLHYSSEKVKDIVRLLLEAGADVNAPERCGFTPLHLYLYNATTLDVIKLLIKAG 107

Query: 126 VDINTSNATSKKLDNQLNIHQKNIKNNGLNALH--LAAKDGHTTVVKALLKGGCKVDAST 183
            D+N  +                    G   LH  L+  + +  V++ LL+ G  V+A  
Sbjct: 108 ADVNAKD------------------KVGRTPLHVYLSGFNINPKVIRLLLRKGADVNALD 149

Query: 184 KKGNTALHI 192
             G T L +
Sbjct: 150 LYGMTPLAV 158



 Score = 41.9 bits (99), Expect = 1e-04
 Identities = 30/116 (25%), Positives = 52/116 (44%), Gaps = 19/116 (16%)

Query: 84  LRELLEAGVDINTSNASD--PNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQ 141
           +R LL  G D+N  +     P    L++  A +++ LR L++AG D+   +    +  + 
Sbjct: 135 IRLLLRKGADVNALDLYGMTPLAVLLKSRNA-NVELLRLLIDAGADVYAVDD---RFRSL 190

Query: 142 LNIHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAALDS 197
           L+ H ++ K                 +V+ L++ GC   A+   GNT LH  A  S
Sbjct: 191 LHHHLQSFKPR-------------ARIVRELIRAGCDPAATDMLGNTPLHSMATGS 233



 Score = 26.9 bits (60), Expect = 9.4
 Identities = 18/77 (23%), Positives = 32/77 (41%), Gaps = 15/77 (19%)

Query: 115 LDKLRELLEAGVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTVVKALLK 174
           ++++R LL AG D+N                +       L+     + +    +V+ LL+
Sbjct: 27  VEEVRRLLAAGADVN---------------FRGEYGKTPLHLYLHYSSEKVKDIVRLLLE 71

Query: 175 GGCKVDASTKKGNTALH 191
            G  V+A  + G T LH
Sbjct: 72  AGADVNAPERCGFTPLH 88


>gnl|CDD|165205 PHA02874, PHA02874, ankyrin repeat protein; Provisional.
          Length = 434

 Score = 44.2 bits (104), Expect = 2e-05
 Identities = 36/129 (27%), Positives = 54/129 (41%), Gaps = 6/129 (4%)

Query: 70  TSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDIN 129
           T  + A R+G    +   ++ G DIN  N   P+   L A + G  D ++ L++ GVD +
Sbjct: 37  TPLIDAIRSGDAKIVELFIKHGADINHINTKIPH-PLLTAIKIGAHDIIKLLIDNGVDTS 95

Query: 130 TSNATSKKLDNQLNIHQKNIKNNGLNA-----LHLAAKDGHTTVVKALLKGGCKVDASTK 184
                  + D    I    I  N  +A     LH A K G    +K L + G  V+    
Sbjct: 96  ILPIPCIEKDMIKTILDCGIDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDD 155

Query: 185 KGNTALHIA 193
            G   +HIA
Sbjct: 156 NGCYPIHIA 164



 Score = 41.9 bits (98), Expect = 1e-04
 Identities = 34/134 (25%), Positives = 57/134 (42%), Gaps = 26/134 (19%)

Query: 82  DKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQ 141
           D ++ +L+ G+D+N  +A +  T    A + G L+ ++ L E G D+N      +  +  
Sbjct: 105 DMIKTILDCGIDVNIKDA-ELKTFLHYAIKKGDLESIKMLFEYGADVNI-----EDDNGC 158

Query: 142 LNIH-------------------QKNIKNNGLNA-LHLAAKDGHTTVVKALLKGGCKVDA 181
             IH                     N+K+N   + LH AA+ G    +K L+  G  +  
Sbjct: 159 YPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYACIKLLIDHGNHIMN 218

Query: 182 STKKGNTALHIAAL 195
             K G T LH A +
Sbjct: 219 KCKNGFTPLHNAII 232


>gnl|CDD|200936 pfam00023, Ank, Ankyrin repeat.  Ankyrins are multifunctional
           adaptors that link specific proteins to the
           membrane-associated, spectrin- actin cytoskeleton. This
           repeat-domain is a 'membrane-binding' domain of up to 24
           repeated units, and it mediates most of the protein's
           binding activities. Repeats 13-24 are especially active,
           with known sites of interaction for the Na/K ATPase,
           Cl/HCO(3) anion exchanger, voltage-gated sodium channel,
           clathrin heavy chain and L1 family cell adhesion
           molecules. The ANK repeats are found to form a
           contiguous spiral stack such that ion transporters like
           the anion exchanger associate in a large central cavity
           formed by the ANK repeat spiral, while clathrin and cell
           adhesion molecules associate with specific regions
           outside this cavity.
          Length = 33

 Score = 39.8 bits (94), Expect = 3e-05
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 152 NGLNALHLAAKDGHTTVVKALLKGGCKVDASTK 184
           +G   LHLAA++GH  VVK LL+ G  V+A  K
Sbjct: 1   DGNTPLHLAARNGHLEVVKLLLEAGADVNARDK 33



 Score = 28.3 bits (64), Expect = 0.28
 Identities = 15/30 (50%), Positives = 20/30 (66%)

Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINTSN 98
          NT    AAR GHL+ ++ LLEAG D+N  +
Sbjct: 3  NTPLHLAARNGHLEVVKLLLEAGADVNARD 32



 Score = 28.3 bits (64), Expect = 0.28
 Identities = 15/30 (50%), Positives = 20/30 (66%)

Query: 103 NTSFLRAARAGHLDKLRELLEAGVDINTSN 132
           NT    AAR GHL+ ++ LLEAG D+N  +
Sbjct: 3   NTPLHLAARNGHLEVVKLLLEAGADVNARD 32


>gnl|CDD|165207 PHA02876, PHA02876, ankyrin repeat protein; Provisional.
          Length = 682

 Score = 40.4 bits (94), Expect = 5e-04
 Identities = 37/142 (26%), Positives = 61/142 (42%), Gaps = 17/142 (11%)

Query: 69  NTSFLRAARAGHLDKL-RELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVD 127
           NT    A++A  L +L  +LLE G D+N  N       +L A      + +R L+  G D
Sbjct: 274 NTPLHHASQAPSLSRLVPKLLERGADVNAKNIKGETPLYLMAKNGYDTENIRTLIMLGAD 333

Query: 128 INTSNATSKKLDNQLNIHQKN------IKNNGLNA----------LHLAAKDGHTTVVKA 171
           +N ++       +Q +   +N      +   G N           +H AA   +  ++  
Sbjct: 334 VNAADRLYITPLHQASTLDRNKDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINT 393

Query: 172 LLKGGCKVDASTKKGNTALHIA 193
           LL  G  ++A ++K  TALH A
Sbjct: 394 LLDYGADIEALSQKIGTALHFA 415



 Score = 39.3 bits (91), Expect = 0.001
 Identities = 40/126 (31%), Positives = 61/126 (48%), Gaps = 21/126 (16%)

Query: 71  SFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKL-RELLEAGVDIN 129
           S L+A R   L+    L +AG  +N+ +    NT    A++A  L +L  +LLE G D+N
Sbjct: 243 SLLKAIRNEDLETSLLLYDAGFSVNSIDDCK-NTPLHHASQAPSLSRLVPKLLERGADVN 301

Query: 130 TSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKDGHTTV-VKALLKGGCKVDASTKKGNT 188
                            KNIK  G   L+L AK+G+ T  ++ L+  G  V+A+ +   T
Sbjct: 302 A----------------KNIK--GETPLYLMAKNGYDTENIRTLIMLGADVNAADRLYIT 343

Query: 189 ALHIAA 194
            LH A+
Sbjct: 344 PLHQAS 349



 Score = 27.3 bits (60), Expect = 8.8
 Identities = 31/132 (23%), Positives = 62/132 (46%), Gaps = 13/132 (9%)

Query: 75  AARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNAT 134
           AA  G+   +  LL  G D+N   A D  +    A  + ++D ++ +++   +IN ++ +
Sbjct: 185 AAERGNAKMVNLLLSYGADVNII-ALDDLSVLECAVDSKNIDTIKAIIDNRSNINKNDLS 243

Query: 135 ------SKKLDNQLNIHQKNIKNNGLN-----ALHLAAKDGH-TTVVKALLKGGCKVDAS 182
                 ++ L+  L ++      N ++      LH A++    + +V  LL+ G  V+A 
Sbjct: 244 LLKAIRNEDLETSLLLYDAGFSVNSIDDCKNTPLHHASQAPSLSRLVPKLLERGADVNAK 303

Query: 183 TKKGNTALHIAA 194
             KG T L++ A
Sbjct: 304 NIKGETPLYLMA 315


>gnl|CDD|222939 PHA02878, PHA02878, ankyrin repeat protein; Provisional.
          Length = 477

 Score = 39.9 bits (93), Expect = 6e-04
 Identities = 31/110 (28%), Positives = 43/110 (39%), Gaps = 18/110 (16%)

Query: 84  LRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLN 143
            + LL  G DIN  +    NT+   A           LL  G ++N  + T+        
Sbjct: 150 TKLLLSYGADINMKDRHKGNTALHYATENKDQRLTELLLSYGANVNIPDKTNN------- 202

Query: 144 IHQKNIKNNGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIA 193
                      + LH A K  +  +V  LL+ G   DA  K GNT LHI+
Sbjct: 203 -----------SPLHHAVKHYNKPIVHILLENGASTDARDKCGNTPLHIS 241



 Score = 27.9 bits (62), Expect = 4.5
 Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 12/56 (21%)

Query: 82  DKLRELLEAGVDINTSNASDPNTSFLRAARAGHL-----DKLRELLEAGVDINTSN 132
           D L+ LLE GVD+N         S++    A H       KL+ LLE G DIN+ N
Sbjct: 249 DILKLLLEHGVDVNAK-------SYILGLTALHSSIKSERKLKLLLEYGADINSLN 297


>gnl|CDD|197603 smart00248, ANK, ankyrin repeats.  Ankyrin repeats are about 33
           amino acids long and occur in at least four consecutive
           copies. They are involved in protein-protein
           interactions. The core of the repeat seems to be an
           helix-loop-helix structure.
          Length = 30

 Score = 32.6 bits (75), Expect = 0.008
 Identities = 14/30 (46%), Positives = 20/30 (66%)

Query: 152 NGLNALHLAAKDGHTTVVKALLKGGCKVDA 181
           +G   LHLAA++G+  VVK LL  G  ++A
Sbjct: 1   DGRTPLHLAAENGNLEVVKLLLDKGADINA 30



 Score = 27.6 bits (62), Expect = 0.49
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 69 NTSFLRAARAGHLDKLRELLEAGVDIN 95
           T    AA  G+L+ ++ LL+ G DIN
Sbjct: 3  RTPLHLAAENGNLEVVKLLLDKGADIN 29



 Score = 27.6 bits (62), Expect = 0.49
 Identities = 11/27 (40%), Positives = 16/27 (59%)

Query: 103 NTSFLRAARAGHLDKLRELLEAGVDIN 129
            T    AA  G+L+ ++ LL+ G DIN
Sbjct: 3   RTPLHLAAENGNLEVVKLLLDKGADIN 29


>gnl|CDD|205784 pfam13606, Ank_3, Ankyrin repeat.  Ankyrins are multifunctional
           adaptors that link specific proteins to the
           membrane-associated, spectrin- actin cytoskeleton. This
           repeat-domain is a 'membrane-binding' domain of up to 24
           repeated units, and it mediates most of the protein's
           binding activities.
          Length = 30

 Score = 32.6 bits (75), Expect = 0.009
 Identities = 13/30 (43%), Positives = 21/30 (70%)

Query: 152 NGLNALHLAAKDGHTTVVKALLKGGCKVDA 181
           +G   LHLAA++G+  +VK LL+ G  ++A
Sbjct: 1   DGNTPLHLAARNGNLELVKLLLEHGADINA 30



 Score = 31.5 bits (72), Expect = 0.022
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 69 NTSFLRAARAGHLDKLRELLEAGVDINT 96
          NT    AAR G+L+ ++ LLE G DIN 
Sbjct: 3  NTPLHLAARNGNLELVKLLLEHGADINA 30



 Score = 31.5 bits (72), Expect = 0.022
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 103 NTSFLRAARAGHLDKLRELLEAGVDINT 130
           NT    AAR G+L+ ++ LLE G DIN 
Sbjct: 3   NTPLHLAARNGNLELVKLLLEHGADINA 30


>gnl|CDD|165206 PHA02875, PHA02875, ankyrin repeat protein; Provisional.
          Length = 413

 Score = 36.1 bits (83), Expect = 0.010
 Identities = 38/156 (24%), Positives = 54/156 (34%), Gaps = 17/156 (10%)

Query: 65  YSDPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAARAGHLDKLRELLEA 124
           Y D  +    A   G +  + ELL+ G   +     D  T    A     LD ++ L+  
Sbjct: 65  YPDIESELHDAVEEGDVKAVEELLDLGKFADDVFYKDGMTPLHLATILKKLDIMKLLIAR 124

Query: 125 GVDINTSNATSKKLDNQLNIHQKNIKNNGLNALHLAAKD----------------GHTTV 168
           G D +  N T K     L +   +IK   L   H A  D                G   +
Sbjct: 125 GADPDIPN-TDKFSPLHLAVMMGDIKGIELLIDHKACLDIEDCCGCTPLIIAMAKGDIAI 183

Query: 169 VKALLKGGCKVDASTKKGNTALHIAALDSLVTDIAS 204
            K LL  G  +D   K G  A    A+++   DI  
Sbjct: 184 CKMLLDSGANIDYFGKNGCVAALCYAIENNKIDIVR 219



 Score = 30.3 bits (68), Expect = 0.85
 Identities = 18/78 (23%), Positives = 33/78 (42%), Gaps = 5/78 (6%)

Query: 52  HNAVISLEPCYDCYSDPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTSFLRAAR 111
           H A + +E C  C     T  + A   G +   + LL++G +I+    +    +   A  
Sbjct: 157 HKACLDIEDCCGC-----TPLIIAMAKGDIAICKMLLDSGANIDYFGKNGCVAALCYAIE 211

Query: 112 AGHLDKLRELLEAGVDIN 129
              +D +R  ++ G D N
Sbjct: 212 NNKIDIVRLFIKRGADCN 229


>gnl|CDD|140343 PTZ00322, PTZ00322,
           6-phosphofructo-2-kinase/fructose-2,6-biphosphatase;
           Provisional.
          Length = 664

 Score = 35.6 bits (82), Expect = 0.017
 Identities = 16/42 (38%), Positives = 22/42 (52%)

Query: 152 NGLNALHLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIA 193
           +G   LH+A  +GH  VV+ LL+ G       K G T L +A
Sbjct: 114 DGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELA 155



 Score = 28.3 bits (63), Expect = 3.3
 Identities = 15/43 (34%), Positives = 18/43 (41%), Gaps = 1/43 (2%)

Query: 158 HLAAKDGHTTVVKALLKGGCKVDASTKKGNTALHIAALDSLVT 200
            LAA  G     + LL GG   +     G T LHIA  +  V 
Sbjct: 88  QLAAS-GDAVGARILLTGGADPNCRDYDGRTPLHIACANGHVQ 129


>gnl|CDD|222954 PHA02989, PHA02989, ankyrin repeat protein; Provisional.
          Length = 494

 Score = 34.3 bits (79), Expect = 0.038
 Identities = 33/136 (24%), Positives = 55/136 (40%), Gaps = 24/136 (17%)

Query: 82  DKLRELLEAGVDINTSNA----SDPNTSFLRAARAGHLDKLRELLEAGVDINTSNATSKK 137
           D ++ LL  GV++    +    +  N           +  ++ L++ GV+I T+N  S+ 
Sbjct: 161 DVIKILLSFGVNLFEKTSLYGLTPMNIYLRNDIDVISIKVIKYLIKKGVNIETNNNGSES 220

Query: 138 -----LDNQLNIHQKNIK-----------NN----GLNALHLAAKDGHTTVVKALLKGGC 177
                LDN   + +K  K           N     G N L ++AK  +      LLK G 
Sbjct: 221 VLESFLDNNKILSKKEFKVLNFILKYIKINKKDKKGFNPLLISAKVDNYEAFNYLLKLGD 280

Query: 178 KVDASTKKGNTALHIA 193
            +   +K G+T L  A
Sbjct: 281 DIYNVSKDGDTVLTYA 296


>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region.  The precise
          function of this eukaryotic protein family is unknown.
          The yeast orthologues have been implicated in cell
          cycle progression and biogenesis of 60S ribosomal
          subunits. The Schistosoma mansoni Mak16 has been shown
          to target protein transport to the nucleolus.
          Length = 97

 Score = 32.5 bits (74), Expect = 0.040
 Identities = 16/31 (51%), Positives = 20/31 (64%)

Query: 4  YNVDLGAFGTMLEKKEKEEERKEEEEEEEKN 34
          YN+   AF   LE +E EE  +EEEEEEE+ 
Sbjct: 44 YNISQSAFKKALEAEESEENDEEEEEEEEEE 74


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 33.8 bits (77), Expect = 0.078
 Identities = 16/37 (43%), Positives = 25/37 (67%)

Query: 16  EKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEAKIH 52
           E++E+EEE +EEEEEEE+N   +     + RQ+  I+
Sbjct: 872 EEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAIY 908



 Score = 31.1 bits (70), Expect = 0.54
 Identities = 12/18 (66%), Positives = 17/18 (94%)

Query: 16  EKKEKEEERKEEEEEEEK 33
           E++E+EEE +EEEEEEE+
Sbjct: 869 EEEEEEEEEEEEEEEEEE 886



 Score = 30.7 bits (69), Expect = 0.66
 Identities = 12/18 (66%), Positives = 17/18 (94%)

Query: 16  EKKEKEEERKEEEEEEEK 33
           E++E+EEE +EEEEEEE+
Sbjct: 871 EEEEEEEEEEEEEEEEEE 888



 Score = 30.7 bits (69), Expect = 0.70
 Identities = 12/18 (66%), Positives = 17/18 (94%)

Query: 16  EKKEKEEERKEEEEEEEK 33
           E++E+EEE +EEEEEEE+
Sbjct: 870 EEEEEEEEEEEEEEEEEE 887



 Score = 30.7 bits (69), Expect = 0.74
 Identities = 12/18 (66%), Positives = 17/18 (94%)

Query: 16  EKKEKEEERKEEEEEEEK 33
           E++E+EEE +EEEEEEE+
Sbjct: 864 EEEEEEEEEEEEEEEEEE 881



 Score = 30.7 bits (69), Expect = 0.77
 Identities = 12/18 (66%), Positives = 17/18 (94%)

Query: 16  EKKEKEEERKEEEEEEEK 33
           E++E+EEE +EEEEEEE+
Sbjct: 867 EEEEEEEEEEEEEEEEEE 884



 Score = 30.7 bits (69), Expect = 0.78
 Identities = 12/18 (66%), Positives = 17/18 (94%)

Query: 16  EKKEKEEERKEEEEEEEK 33
           E++E+EEE +EEEEEEE+
Sbjct: 866 EEEEEEEEEEEEEEEEEE 883



 Score = 30.3 bits (68), Expect = 0.92
 Identities = 12/18 (66%), Positives = 17/18 (94%)

Query: 16  EKKEKEEERKEEEEEEEK 33
           E++E+EEE +EEEEEEE+
Sbjct: 868 EEEEEEEEEEEEEEEEEE 885



 Score = 30.0 bits (67), Expect = 1.2
 Identities = 12/18 (66%), Positives = 17/18 (94%)

Query: 16  EKKEKEEERKEEEEEEEK 33
           E++E+EEE +EEEEEEE+
Sbjct: 865 EEEEEEEEEEEEEEEEEE 882



 Score = 28.8 bits (64), Expect = 2.5
 Identities = 14/28 (50%), Positives = 19/28 (67%)

Query: 6   VDLGAFGTMLEKKEKEEERKEEEEEEEK 33
           VD G      + +E+EEE +EEEEEEE+
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEE 879


>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
          Length = 112

 Score = 28.7 bits (64), Expect = 0.91
 Identities = 11/19 (57%), Positives = 16/19 (84%)

Query: 17  KKEKEEERKEEEEEEEKNI 35
            + K+EE+KEEEEEEE ++
Sbjct: 88  AEAKKEEKKEEEEEEEDDL 106


>gnl|CDD|227032 COG4688, COG4688, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 665

 Score = 30.0 bits (67), Expect = 1.1
 Identities = 14/65 (21%), Positives = 23/65 (35%), Gaps = 2/65 (3%)

Query: 80  HLDKLRELLEAGVDIN-TSNASDPNTSFLRAARAGHLDKLRE-LLEAGVDINTSNATSKK 137
           HL  L   L A ++      A+ P      A     LD +R+ L  A  +    +     
Sbjct: 596 HLTSLERKLVAKLEHAQARPATTPGRDNQIAHAQVRLDGVRKLLTTADGERPFPDGPDLS 655

Query: 138 LDNQL 142
           + + L
Sbjct: 656 MPSPL 660


>gnl|CDD|239647 cd03675, Nudix_Hydrolase_2, Contains a crystal structure of the
           Nudix hydrolase from Nitrosomonas europaea, which has an
           unknown function. In general, members of the Nudix
           hydrolase superfamily catalyze the hydrolysis of
           NUcleoside DIphosphates linked to other moieties, X.
           Enzymes belonging to this superfamily require a divalent
           cation, such as Mg2+ or Mn2+, for their activity. They
           also contain a highly conserved 23-residue nudix motif
           (GX5EX7REUXEEXGU, where U = I, L or V), which forms a
           structural motif that functions as a metal binding and
           catalytic site. Substrates of nudix hydrolases include
           intact and oxidatively damaged nucleoside triphosphates,
           dinucleoside polyphosphates, nucleotide-sugars and
           dinucleotide enzymes. These substrates are metabolites
           or cell signaling molecules that require regulation
           during different stages of the cell cycle or during
           periods of stress. In general, the role of the nudix
           hydrolase is to sanitize the nucleotide pools and to
           maintain cell viability, thereby serving as surveillance
           & "house-cleaning" enzymes. Substrate specificity is
           used to define families within the superfamily.
           Differences in substrate specificity are determined by
           the N-terminal extension or by residues in variable loop
           regions. Mechanistically, substrate hydrolysis occurs by
           a nucleophilic substitution reaction, with variation in
           the numbers and roles of divalent cations required.
          Length = 134

 Score = 28.7 bits (65), Expect = 1.3
 Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 17/57 (29%)

Query: 78  AGHLDK--------LRELLE---------AGVDINTSNASDPNTSFLRAARAGHLDK 117
           AGHL+         +RE LE         A + I    A D +T++LR A A  L +
Sbjct: 30  AGHLEPGESLIEAAVRETLEETGWHVEPTALLGIYQWTAPDSDTTYLRFAFAAELLE 86


>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
          Length = 106

 Score = 28.0 bits (63), Expect = 1.5
 Identities = 10/17 (58%), Positives = 15/17 (88%)

Query: 16 EKKEKEEERKEEEEEEE 32
          +K+E+EEE ++EE EEE
Sbjct: 80 KKEEEEEEEEKEESEEE 96



 Score = 27.6 bits (62), Expect = 2.1
 Identities = 12/17 (70%), Positives = 15/17 (88%)

Query: 16 EKKEKEEERKEEEEEEE 32
          EKKE+EEE +E+EE EE
Sbjct: 79 EKKEEEEEEEEKEESEE 95



 Score = 27.2 bits (61), Expect = 2.8
 Identities = 11/22 (50%), Positives = 18/22 (81%)

Query: 16 EKKEKEEERKEEEEEEEKNISA 37
          E+K++EEE +EE+EE E+  +A
Sbjct: 78 EEKKEEEEEEEEKEESEEEAAA 99



 Score = 26.1 bits (58), Expect = 6.9
 Identities = 10/18 (55%), Positives = 14/18 (77%)

Query: 16 EKKEKEEERKEEEEEEEK 33
            +EK+EE +EEEE+EE 
Sbjct: 76 AAEEKKEEEEEEEEKEES 93


>gnl|CDD|181642 PRK09082, PRK09082, methionine aminotransferase; Validated.
          Length = 386

 Score = 29.5 bits (67), Expect = 1.6
 Identities = 21/88 (23%), Positives = 35/88 (39%), Gaps = 22/88 (25%)

Query: 55  VISLEPCYDCYSDPNTSFLRAARAGHL-----------DKLRELLEAG---VDINTSNAS 100
           VI  +P YD Y+ P    L   RA  +            +    +      + +NT +  
Sbjct: 118 VIVFDPSYDSYA-PAIE-LAGGRAVRVALQPPDFRVDWQRFAAAISPRTRLIILNTPH-- 173

Query: 101 DPNTSFLRAARAGHLDKLRELLEAGVDI 128
           +P+ +   AA    +  L +L+ AG DI
Sbjct: 174 NPSGTVWSAA---DMRALWQLI-AGTDI 197


>gnl|CDD|214750 smart00633, Glyco_10, Glycosyl hydrolase family 10. 
          Length = 263

 Score = 28.7 bits (65), Expect = 2.0
 Identities = 22/100 (22%), Positives = 37/100 (37%), Gaps = 20/100 (20%)

Query: 64  CYSDPNTSFLRAARAGHLDKLRELLEAGVDIN------TSNASDPNTSFLRAARAGHLDK 117
            Y+D NT    A R    + +++L   GV I+        +   PN + +RAA       
Sbjct: 126 FYNDYNTEEPNAKRQAIYELVKKLKAKGVPIDGIGLQSHLSLGGPNIAEIRAA------- 178

Query: 118 LRELLEAGV-------DINTSNATSKKLDNQLNIHQKNIK 150
           L      G+       DI+      + L  Q   +++  K
Sbjct: 179 LDRFASLGLEIWITELDISGPPNPEENLQAQAADYEEVFK 218


>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
           (TAF4) is one of several TAFs that bind TBP and is
           involved in forming Transcription Factor IID (TFIID)
           complex.  The TATA Binding Protein (TBP) Associated
           Factor 4 (TAF4) is one of several TAFs that bind TBP and
           are involved in forming the Transcription Factor IID
           (TFIID) complex. TFIID is one of seven General
           Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
           TFIIF, and TFIID) that are involved in accurate
           initiation of transcription by RNA polymerase II in
           eukaryote. TFIID plays an important role in the
           recognition of promoter DNA and assembly of the
           pre-initiation complex. TFIID complex is composed of the
           TBP and at least 13 TAFs. TAFs from various species were
           originally named by their predicted molecular weight or
           their electrophoretic mobility in polyacrylamide gels. A
           new, unified nomenclature for the pol II TAFs has been
           suggested to show the relationship between TAF orthologs
           and paralogs. Several hypotheses are proposed for TAFs
           functions such as serving as activator-binding sites,
           core-promoter recognition or a role in essential
           catalytic activity. Each TAF, with the help of a
           specific activator, is required only for the expression
           of subset of genes and is not universally involved for
           transcription as are GTFs. In yeast and human cells,
           TAFs have been found as components of other complexes
           besides TFIID.   Several TAFs interact via histone-fold
           (HFD) motifs; HFD is the interaction motif involved in
           heterodimerization of the core histones and their
           assembly into nucleosome octamers. The minimal HFD
           contains three alpha-helices linked by two loops and is
           found in core histones, TAFS and many other
           transcription factors. TFIID has a histone octamer-like
           substructure. TAF4 domain interacts with TAF12 and makes
           a novel histone-like heterodimer that binds DNA and has
           a core promoter function of a subset of genes.
          Length = 212

 Score = 28.8 bits (65), Expect = 2.0
 Identities = 14/36 (38%), Positives = 21/36 (58%)

Query: 15  LEKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEAK 50
           LE+ E+EEE K +EEE E+ + A      + R + K
Sbjct: 119 LEQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQK 154


>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon. 
          Length = 431

 Score = 28.8 bits (64), Expect = 2.1
 Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 2/35 (5%)

Query: 16  EKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEAK 50
            KK++EE RK  EEEE++       A  K R+E +
Sbjct: 219 LKKKREERRKVLEEEEQRRKQE--EADRKSREEEE 251


>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex.  This
          entry is characterized by proteins with alternating
          conserved and low-complexity regions. Bud13 together
          with Snu17p and a newly identified factor,
          Pml1p/Ylr016c, form a novel trimeric complex. called
          The RES complex, pre-mRNA retention and splicing
          complex. Subunits of this complex are not essential for
          viability of yeasts but they are required for efficient
          splicing in vitro and in vivo. Furthermore,
          inactivation of this complex causes pre-mRNA leakage
          from the nucleus. Bud13 contains a unique,
          phylogenetically conserved C-terminal region of unknown
          function.
          Length = 141

 Score = 28.0 bits (63), Expect = 2.4
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 14 MLEKKEKEEERKEEEEEEEKNIS 36
            E+KE+E+E KE +EE+EK   
Sbjct: 17 KREEKEREKEEKERKEEKEKEWG 39



 Score = 26.5 bits (59), Expect = 7.1
 Identities = 8/21 (38%), Positives = 17/21 (80%)

Query: 15 LEKKEKEEERKEEEEEEEKNI 35
          +E+K +E+ER++EE+E ++  
Sbjct: 14 IEEKREEKEREKEEKERKEEK 34


>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
          Length = 330

 Score = 28.7 bits (65), Expect = 2.6
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 16  EKKEKEEERKEEEEEEE 32
           E++E+EEE +EE  EEE
Sbjct: 304 EEEEEEEEEEEEPSEEE 320



 Score = 27.9 bits (63), Expect = 4.0
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 16  EKKEKEEERKEEEEEEE 32
           E++E+EEE +EEE  EE
Sbjct: 303 EEEEEEEEEEEEEPSEE 319



 Score = 27.5 bits (62), Expect = 5.3
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 16  EKKEKEEERKEEEEEEEKNISA 37
           E++E+EEE +EEEE  E+  +A
Sbjct: 302 EEEEEEEEEEEEEEPSEEEAAA 323



 Score = 27.1 bits (61), Expect = 7.4
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 16  EKKEKEEERKEEEEEEEK 33
               +EEE +EEEEEEE+
Sbjct: 298 AAAAEEEEEEEEEEEEEE 315



 Score = 26.8 bits (60), Expect = 9.1
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 16  EKKEKEEERKEEEEEEEK 33
             +E+EEE +EEEEEE  
Sbjct: 300 AAEEEEEEEEEEEEEEPS 317


>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
           subunit [Translation, ribosomal structure and
           biogenesis].
          Length = 591

 Score = 28.9 bits (64), Expect = 2.6
 Identities = 11/22 (50%), Positives = 15/22 (68%)

Query: 17  KKEKEEERKEEEEEEEKNISAI 38
            K K ++RK +EEEEEK +  I
Sbjct: 526 NKSKNKKRKVDEEEEEKKLKMI 547


>gnl|CDD|222931 PHA02798, PHA02798, ankyrin-like protein; Provisional.
          Length = 489

 Score = 28.6 bits (64), Expect = 2.8
 Identities = 36/138 (26%), Positives = 57/138 (41%), Gaps = 25/138 (18%)

Query: 81  LDKLRELLEAGVDINTSNASDPNTS----FLRAARAGHLDKLRELLEAGVDINTSNATSK 136
           ++ ++ LLE GVDINT N  +   +    F         D L+  ++ G  IN  N + K
Sbjct: 161 IEIIKLLLEKGVDINTHNNKEKYDTLHCYFKYNIDRIDADILKLFVDNGFIINKENKSHK 220

Query: 137 K--LD--NQLNIHQKNIKNN-----------------GLNALHLAAKDGHTTVVKALLKG 175
           K  ++  N L    K  K N                 G N L+ +    +  + + LL+ 
Sbjct: 221 KKFMEYLNSLLYDNKRFKKNILDFIFSYIDINQVDELGFNPLYYSVSHNNRKIFEYLLQL 280

Query: 176 GCKVDASTKKGNTALHIA 193
           G  ++  T+ GNT L  A
Sbjct: 281 GGDINIITELGNTCLFTA 298


>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2.  This family of
          mediator complex subunit 2 proteins is conserved in
          fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts
          with and phosphorylates Med2. Post-translational
          modifications of Mediator subunits are important for
          regulation of gene expression.
          Length = 99

 Score = 27.4 bits (61), Expect = 2.8
 Identities = 12/21 (57%), Positives = 18/21 (85%)

Query: 13 TMLEKKEKEEERKEEEEEEEK 33
             +KK++EEERK++EEEE+K
Sbjct: 79 VENKKKQEEEERKKKEEEEKK 99


>gnl|CDD|181127 PRK07807, PRK07807, inosine 5-monophosphate dehydrogenase;
           Validated.
          Length = 479

 Score = 28.7 bits (65), Expect = 2.8
 Identities = 24/70 (34%), Positives = 32/70 (45%), Gaps = 22/70 (31%)

Query: 75  AARAGHLDKLRELLEAGVD---INTSNASDPNT-SFLRAARAGHL------------DKL 118
           AA+A      R LLEAGVD   ++T++         LRA RA               +  
Sbjct: 229 AAKA------RALLEAGVDVLVVDTAHGHQEKMLEALRAVRALDPGVPIVAGNVVTAEGT 282

Query: 119 RELLEAGVDI 128
           R+L+EAG DI
Sbjct: 283 RDLVEAGADI 292


>gnl|CDD|233161 TIGR00870, trp, transient-receptor-potential calcium channel
           protein.  The Transient Receptor Potential Ca2+ Channel
           (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also
           been called the store-operated calcium channel (SOC)
           family. The prototypical members include the Drosophila
           retinal proteinsTRP and TRPL (Montell and Rubin, 1989;
           Hardie and Minke, 1993). SOC members of the family
           mediate the entry of extracellular Ca2+ into cells in
           responseto depletion of intracellular Ca2+ stores
           (Clapham, 1996) and agonist stimulated production of
           inositol-1,4,5 trisphosphate (IP3). One member of the
           TRP-CCfamily, mammalian Htrp3, has been shown to form a
           tight complex with the IP3 receptor (TC #1.A.3.2.1).
           This interaction is apparently required for IP3
           tostimulate Ca2+ release via Htrp3. The vanilloid
           receptor subtype 1 (VR1), which is the receptor for
           capsaicin (the ?hot? ingredient in chili peppers) and
           servesas a heat-activated ion channel in the pain
           pathway (Caterina et al., 1997), is also a member of
           this family. The stretch-inhibitable non-selective
           cation channel(SIC) is identical to the vanilloid
           receptor throughout all of its first 700 residues, but
           it exhibits a different sequence in its last 100
           residues. VR1 and SICtransport monovalent cations as
           well as Ca2+. VR1 is about 10x more permeable to Ca2+
           than to monovalent ions. Ca2+ overload probably causes
           cell deathafter chronic exposure to capsaicin.
           (McCleskey and Gold, 1999) [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 743

 Score = 28.5 bits (64), Expect = 2.9
 Identities = 14/62 (22%), Positives = 27/62 (43%), Gaps = 5/62 (8%)

Query: 135 SKKLDNQLNIHQKNIKNNGLNALHLAAKDG-HTTVVKALLKGGCKVDASTKKGNTALHIA 193
            + L+    ++       G +AL +AA +  +  + + LL   C+       G+T LH  
Sbjct: 34  YRDLEEPKKLNINCPDRLGRSALFVAAIENENLELTELLLNLSCRGAV----GDTLLHAI 89

Query: 194 AL 195
           +L
Sbjct: 90  SL 91


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 28.5 bits (64), Expect = 2.9
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 15  LEKKEKEEERKEEEEEEEKNISA 37
           LE+K+K+E  KE EE E +   A
Sbjct: 108 LEEKQKQEREKEREEAELRQRLA 130



 Score = 27.0 bits (60), Expect = 7.6
 Identities = 5/21 (23%), Positives = 16/21 (76%)

Query: 14  MLEKKEKEEERKEEEEEEEKN 34
            L+K +++++++EEE  +++ 
Sbjct: 188 ELKKLKQQQQKREEERRKQRK 208


>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P.  This model
          represents the L12P protein of the large (50S) subunit
          of the archaeal ribosome.
          Length = 105

 Score = 27.3 bits (61), Expect = 3.0
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query: 16 EKKEKEEERKEEEEEEEKNISAIG 39
           ++E+EEE +EEEEEEE    A+ 
Sbjct: 75 AEEEEEEEEEEEEEEEESEEEAMA 98



 Score = 26.9 bits (60), Expect = 3.5
 Identities = 12/24 (50%), Positives = 20/24 (83%)

Query: 16  EKKEKEEERKEEEEEEEKNISAIG 39
           E++E+EEE +EEEE EE+ ++ +G
Sbjct: 78  EEEEEEEEEEEEEESEEEAMAGLG 101



 Score = 25.8 bits (57), Expect = 9.3
 Identities = 12/24 (50%), Positives = 18/24 (75%)

Query: 16 EKKEKEEERKEEEEEEEKNISAIG 39
          E++E+EEE +EEEEEE +  +  G
Sbjct: 76 EEEEEEEEEEEEEEEESEEEAMAG 99


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 28.2 bits (63), Expect = 3.2
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 15  LEKKEKEEERKEEEEEEEK 33
            E+KE+EEE K  EEE+ +
Sbjct: 159 AEEKEREEEEKAAEEEKAR 177



 Score = 27.0 bits (60), Expect = 8.0
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 16  EKKEKEEERKEEEEEEEKN 34
           EK+ +EEE+  EEE+  + 
Sbjct: 161 EKEREEEEKAAEEEKAREE 179


>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein.  This
           family includes the radial spoke head proteins RSP4 and
           RSP6 from Chlamydomonas reinhardtii, and several
           eukaryotic homologues, including mammalian RSHL1, the
           protein product of a familial ciliary dyskinesia
           candidate gene.
          Length = 481

 Score = 28.5 bits (64), Expect = 3.2
 Identities = 11/19 (57%), Positives = 18/19 (94%)

Query: 16  EKKEKEEERKEEEEEEEKN 34
           E+K++EEE+++EEEEEE+ 
Sbjct: 345 EQKDEEEEQEDEEEEEEEE 363



 Score = 27.7 bits (62), Expect = 6.1
 Identities = 11/25 (44%), Positives = 18/25 (72%)

Query: 16  EKKEKEEERKEEEEEEEKNISAIGP 40
           E+++++EE +EEEEE E+     GP
Sbjct: 350 EEEQEDEEEEEEEEEPEEPEPEEGP 374



 Score = 26.9 bits (60), Expect = 9.3
 Identities = 11/21 (52%), Positives = 15/21 (71%)

Query: 14  MLEKKEKEEERKEEEEEEEKN 34
             +K E+EE+  EEEEEEE+ 
Sbjct: 344 PEQKDEEEEQEDEEEEEEEEE 364


>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein.  This
          family includes archaebacterial L12, eukaryotic P0, P1
          and P2.
          Length = 88

 Score = 26.8 bits (60), Expect = 3.3
 Identities = 7/17 (41%), Positives = 14/17 (82%)

Query: 16 EKKEKEEERKEEEEEEE 32
            +E+++E +EEEEE++
Sbjct: 65 AAEEEKKEEEEEEEEDD 81



 Score = 25.7 bits (57), Expect = 8.8
 Identities = 9/17 (52%), Positives = 13/17 (76%)

Query: 16 EKKEKEEERKEEEEEEE 32
              +EE+++EEEEEEE
Sbjct: 63 AAAAEEEKKEEEEEEEE 79



 Score = 25.7 bits (57), Expect = 9.3
 Identities = 8/17 (47%), Positives = 13/17 (76%)

Query: 16 EKKEKEEERKEEEEEEE 32
             E+E++ +EEEEEE+
Sbjct: 64 AAAEEEKKEEEEEEEED 80


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 28.2 bits (63), Expect = 3.5
 Identities = 12/21 (57%), Positives = 17/21 (80%)

Query: 14  MLEKKEKEEERKEEEEEEEKN 34
           MLEKK KE E ++ +EE+EK+
Sbjct: 155 MLEKKLKELEAEDVDEEDEKD 175


>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
           (DUF2040).  This entry is a conserved domain of
           approximately 130 residues of proteins conserved from
           fungi to humans. The proteins do contain a coiled-coil
           domain, but the function is unknown.
          Length = 128

 Score = 27.4 bits (61), Expect = 3.7
 Identities = 12/28 (42%), Positives = 18/28 (64%), Gaps = 2/28 (7%)

Query: 9   GAFGTMLE--KKEKEEERKEEEEEEEKN 34
            A+   LE  +K +EEE++ EE EEE +
Sbjct: 93  SAYKKQLEENRKLEEEEKEREELEEEND 120


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 28.2 bits (63), Expect = 3.9
 Identities = 11/32 (34%), Positives = 20/32 (62%), Gaps = 2/32 (6%)

Query: 10  AFGTMLEK--KEKEEERKEEEEEEEKNISAIG 39
              +MLE   + +++E +EEEEE+E  I ++ 
Sbjct: 168 DVNSMLEALFRREKKEEEEEEEEDEALIKSLS 199


>gnl|CDD|132724 cd02737, RNAP_IV_NRPD1_C, Largest subunit (NRPD1) of Higher plant
           RNA polymerase IV, C-terminal domain.  Higher plants
           have five multi-subunit nuclear RNA polymerases: RNAP I,
           RNAP II and RNAP III, which are essential for viability;
           plus the two isoforms of the non-essential polymerase
           RNAP IV (IVa and IVb), which specialize in small
           RNA-mediated gene silencing pathways. RNAP IVa and/or
           RNAP IVb might be involved in RNA-directed DNA
           methylation of endogenous repetitive elements, silencing
           of transgenes, regulation of flowering-time genes,
           inducible regulation of adjacent gene pairs, and
           spreading of mobile silencing signals. NRPD1a is the
           largest subunit of RNAP IVa, whereas NRPD1b is the
           largest subunit of RNAP IVb. The full subunit
           compositions of RNAP IVa and RNAP IVb are not known, nor
           are their templates or enzymatic products. However, it
           has been shown that RNAP IVa and, to a lesser extent,
           RNAP IVb are crucial for several RNA-mediated gene
           silencing phenomena.
          Length = 381

 Score = 28.2 bits (63), Expect = 4.1
 Identities = 12/27 (44%), Positives = 17/27 (62%)

Query: 64  CYSDPNTSFLRAARAGHLDKLRELLEA 90
           C+S P   FL+AA+ G  D L  +L+A
Sbjct: 336 CFSSPIKCFLKAAKKGASDSLSGVLDA 362


>gnl|CDD|236731 PRK10658, PRK10658, putative alpha-glucosidase; Provisional.
          Length = 665

 Score = 28.3 bits (64), Expect = 4.2
 Identities = 9/19 (47%), Positives = 12/19 (63%)

Query: 75  AARAGHLDKLRELLEAGVD 93
            A   + DKL+ LL+ GVD
Sbjct: 392 DACKWYADKLKGLLDMGVD 410



 Score = 28.3 bits (64), Expect = 4.2
 Identities = 9/19 (47%), Positives = 12/19 (63%)

Query: 109 AARAGHLDKLRELLEAGVD 127
            A   + DKL+ LL+ GVD
Sbjct: 392 DACKWYADKLKGLLDMGVD 410


>gnl|CDD|215936 pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain.  This
           family is involved in biosynthesis of guanosine
           nucleotide. Members of this family contain a TIM barrel
           structure. In the inosine monophosphate dehydrogenases 2
           CBS domains pfam00571 are inserted in the TIM barrel.
           This family is a member of the common phosphate binding
           site TIM barrel family.
          Length = 467

 Score = 28.0 bits (63), Expect = 4.2
 Identities = 15/69 (21%), Positives = 29/69 (42%), Gaps = 16/69 (23%)

Query: 76  ARAGHLDKLRELLEAGVD---INTSNASDPNT-SFLRAARAGHLDK------------LR 119
            R   L++   L+EAGVD   I++++         ++  +  + +              R
Sbjct: 220 TREDDLERAEALVEAGVDVIVIDSAHGHSEYVLEMIKWIKKKYPELDVIAGNVVTAEAAR 279

Query: 120 ELLEAGVDI 128
           EL++AG D 
Sbjct: 280 ELIDAGADA 288


>gnl|CDD|110514 pfam01517, HDV_ag, Hepatitis delta virus delta antigen.  The
           hepatitis delta virus (HDV) encodes a single protein,
           the hepatitis delta antigen (HDAg). The central region
           of this protein has been shown to bind RNA. Several
           interactions are also mediated by a coiled-coil region
           at the N terminus of the protein.
          Length = 194

 Score = 27.5 bits (61), Expect = 4.2
 Identities = 14/35 (40%), Positives = 21/35 (60%)

Query: 8   LGAFGTMLEKKEKEEERKEEEEEEEKNISAIGPAV 42
           L + G  L ++E+EE R+  EE+EE+     GP V
Sbjct: 114 LSSGGKHLSREEEEELRRLTEEDEERERRVAGPRV 148


>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain.  DBINO is a DNA-binding
           domain found on global transcription activator SNF2L1
           proteins and chromatin re-modelling proteins.
          Length = 140

 Score = 27.2 bits (61), Expect = 4.3
 Identities = 9/18 (50%), Positives = 12/18 (66%)

Query: 16  EKKEKEEERKEEEEEEEK 33
           EK+  E+ +KEEE  E K
Sbjct: 93  EKEALEQAKKEEELREAK 110


>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family.  The yeast member of this
          family (Kri1p) is found to be required for 40S ribosome
          biogenesis in the nucleolus.
          Length = 99

 Score = 26.8 bits (60), Expect = 4.4
 Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 16 EKKEKEEERKEEEEEEEKN--ISAIGPAVTKLRQEAKIHNAVISLE 59
          E+KE+E+ ++EEE +  KN     I   + K+++ A +  A +S E
Sbjct: 2  ERKEEEKAQREEELKRLKNLKREEIEEKLEKIKKVAGLRGADLSEE 47


>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
           subunit beta; Reviewed.
          Length = 463

 Score = 28.1 bits (63), Expect = 4.7
 Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 6/50 (12%)

Query: 16  EKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEA----KI--HNAVISLE 59
             +E+EEE +EEEEEEE     +     +++  A    KI   NA I  E
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGGIKIILKNAKIYAE 452


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 27.6 bits (62), Expect = 4.7
 Identities = 8/22 (36%), Positives = 15/22 (68%)

Query: 16  EKKEKEEERKEEEEEEEKNISA 37
            +++KEE++KEE E +   +S 
Sbjct: 284 AQEKKEEKKKEEREAKLAKLSP 305


>gnl|CDD|227798 COG5511, COG5511, Bacteriophage capsid protein [General function
           prediction only].
          Length = 492

 Score = 27.7 bits (62), Expect = 5.0
 Identities = 12/33 (36%), Positives = 13/33 (39%), Gaps = 7/33 (21%)

Query: 88  LEAGVDINTSNASDPNTSF-------LRAARAG 113
           L  G DI    + DPN          LRA  AG
Sbjct: 299 LNPGEDIKLQTSQDPNNGLESFRQGQLRAIAAG 331


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 28.0 bits (63), Expect = 5.0
 Identities = 10/20 (50%), Positives = 15/20 (75%)

Query: 15  LEKKEKEEERKEEEEEEEKN 34
           L KK+++ E+K E+EE EK 
Sbjct: 414 LRKKQRKAEKKAEKEEAEKA 433


>gnl|CDD|218458 pfam05136, Phage_portal_2, Phage portal protein, lambda family.
           This protein forms a hole, or portal, that enables DNA
           passage during packaging and ejection. It also forms the
           junction between the phage capsid and the tail proteins.
          Length = 352

 Score = 27.6 bits (62), Expect = 5.6
 Identities = 13/33 (39%), Positives = 14/33 (42%), Gaps = 7/33 (21%)

Query: 88  LEAGVDINTSNASDPNTSF-------LRAARAG 113
           L  G DI  SN   P  +F       LRA  AG
Sbjct: 284 LPPGEDIKFSNPDRPGNNFEEFVKAQLRAIAAG 316


>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
          represents the eukaryotic large ribosomal protein P1.
          Eukaryotic P1 and P2 are functionally equivalent to the
          bacterial protein L7/L12, but are not homologous to
          L7/L12. P1 is located in the L12 stalk, with proteins
          P2, P0, L11, and 28S rRNA. P1 and P2 are the only
          proteins in the ribosome to occur as multimers, always
          appearing as sets of heterodimers. Recent data indicate
          that eukaryotes have four copies (two heterodimers),
          while most archaeal species contain six copies of L12p
          (three homodimers) and bacteria may have four or six
          copies (two or three homodimers), depending on the
          species. Experiments using S. cerevisiae P1 and P2
          indicate that P1 proteins are positioned more
          internally with limited reactivity in the C-terminal
          domains, while P2 proteins seem to be more externally
          located and are more likely to interact with other
          cellular components. In lower eukaryotes, P1 and P2 are
          further subdivided into P1A, P1B, P2A, and P2B, which
          form P1A/P2B and P1B/P2A heterodimers. Some plant
          species have a third P-protein, called P3, which is not
          homologous to P1 and P2. In humans, P1 and P2 are
          strongly autoimmunogenic. They play a significant role
          in the etiology and pathogenesis of systemic lupus
          erythema (SLE). In addition, the ribosome-inactivating
          protein trichosanthin (TCS) interacts with human P0,
          P1, and P2, with its primary binding site located in
          the C-terminal region of P2. TCS inactivates the
          ribosome by depurinating a specific adenine in the
          sarcin-ricin loop of 28S rRNA.
          Length = 103

 Score = 26.5 bits (59), Expect = 5.6
 Identities = 10/17 (58%), Positives = 13/17 (76%)

Query: 16 EKKEKEEERKEEEEEEE 32
            + K+EE+KEEEEEE 
Sbjct: 78 AAEAKKEEKKEEEEEES 94


>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
          co-repressor CIR.  This is a 45 residue conserved
          region at the N-terminal end of a family of proteins
          referred to as CIRs (CBF1-interacting co-repressors).
          CBF1 (centromere-binding factor 1) acts as a
          transcription factor that causes repression by binding
          specifically to GTGGGAA motifs in responsive promoters,
          and it requires CIR as a co-repressor. CIR binds to
          histone deacetylase and to SAP30 and serves as a linker
          between CBF1 and the histone deacetylase complex.
          Length = 37

 Score = 24.9 bits (55), Expect = 5.6
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query: 14 MLEKKEKEEERKEEEEEEE 32
            E+K  EE++K EE  +E
Sbjct: 15 EAEQKALEEQKKIEELRKE 33



 Score = 24.5 bits (54), Expect = 8.5
 Identities = 9/17 (52%), Positives = 14/17 (82%)

Query: 15 LEKKEKEEERKEEEEEE 31
          LE+++K EE ++E EEE
Sbjct: 21 LEEQKKIEELRKEIEEE 37


>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
           Ribosomal proteins P1 and P2 are the eukaryotic proteins
           that are functionally equivalent to bacterial L7/L12.
           L12p is the archaeal homolog. Unlike other ribosomal
           proteins, the archaeal L12p and eukaryotic P1 and P2 do
           not share sequence similarity with their bacterial
           counterparts. They are part of the ribosomal stalk
           (called the L7/L12 stalk in bacteria), along with 28S
           rRNA and the proteins L11 and P0 in eukaryotes (23S
           rRNA, L11, and L10e in archaea). In bacterial ribosomes,
           L7/L12 homodimers bind the extended C-terminal helix of
           L10 to anchor the L7/L12 molecules to the ribosome.
           Eukaryotic P1/P2 heterodimers and archaeal L12p
           homodimers are believed to bind the L10 equivalent
           proteins, eukaryotic P0 and archaeal L10e, in a similar
           fashion. P1 and P2 (L12p, L7/L12) are the only proteins
           in the ribosome to occur as multimers, always appearing
           as sets of dimers. Recent data indicate that most
           archaeal species contain six copies of L12p (three
           homodimers), while eukaryotes have two copies each of P1
           and P2 (two heterodimers). Bacteria may have four or six
           copies (two or three homodimers), depending on the
           species. As in bacteria, the stalk is crucial for
           binding of initiation, elongation, and release factors
           in eukaryotes and archaea.
          Length = 105

 Score = 26.5 bits (58), Expect = 6.1
 Identities = 14/24 (58%), Positives = 14/24 (58%)

Query: 16  EKKEKEEERKEEEEEEEKNISAIG 39
           E  EK EE KEEEEEEE      G
Sbjct: 80  EPAEKAEEAKEEEEEEEDEDFGFG 103


>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2).  This family
          represents a conserved region approximately 60 residues
          long within the eukaryotic targeting protein for Xklp2
          (TPX2). Xklp2 is a kinesin-like protein localised on
          centrosomes throughout the cell cycle and on spindle
          pole microtubules during metaphase. In Xenopus, it has
          been shown that Xklp2 protein is required for
          centrosome separation and maintenance of spindle
          bi-polarity. TPX2 is a microtubule-associated protein
          that mediates the binding of the C-terminal domain of
          Xklp2 to microtubules. It is phosphorylated during
          mitosis in a microtubule-dependent way.
          Length = 57

 Score = 25.4 bits (56), Expect = 6.2
 Identities = 12/35 (34%), Positives = 18/35 (51%), Gaps = 7/35 (20%)

Query: 14 MLEKKEKEEERKEEEEEEEKNISAIGPAVTKLRQE 48
             + EKEE    ++EEEE+ I        +LR+E
Sbjct: 22 KALEAEKEEAEARQKEEEEEAIK-------QLRKE 49


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 26.3 bits (58), Expect = 6.3
 Identities = 10/24 (41%), Positives = 19/24 (79%)

Query: 16  EKKEKEEERKEEEEEEEKNISAIG 39
            ++++EE++KEEE+EEE+  +  G
Sbjct: 78  AEEKEEEKKKEEEKEEEEEEALAG 101


>gnl|CDD|238200 cd00326, alpha_CA, Carbonic anhydrase alpha (vertebrate-like)
           group. Carbonic anhydrases (CAs) are zinc-containing
           enzymes that catalyze the reversible hydration of carbon
           dioxide in a two-step mechanism: a nucleophilic attack
           of a zinc-bound hydroxide ion on carbon dioxide,
           followed by the regeneration of the active site by
           ionization of the zinc-bound water molecule and removal
           of a proton from the active site. They are ubiquitous
           enzymes involved in fundamental processes like
           photosynthesis, respiration, pH homeostasis and ion
           transport. There are three evolutionary distinct groups
           - alpha, beta and gamma carbonic anhydrases - which show
           no significant sequence identity or structural
           similarity. Most alpha CAs are monomeric enzymes. The
           zinc ion is complexed by three histidine residues and a
           fourth conserved histidine plays a potential role in
           proton transfer.
          Length = 227

 Score = 27.2 bits (61), Expect = 6.5
 Identities = 8/33 (24%), Positives = 11/33 (33%), Gaps = 5/33 (15%)

Query: 126 VDINTSNATSKKLDNQLNIH-----QKNIKNNG 153
           ++I TS          LN          + NNG
Sbjct: 7   INIVTSAVVYDPSLPPLNFDYYPTTSLTLVNNG 39


>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria. Proteins
          in this family are approximately 90 amino acids in
          length. There are two completely conserved L residues
          that may be functionally important.
          Length = 70

 Score = 25.8 bits (57), Expect = 6.7
 Identities = 12/22 (54%), Positives = 17/22 (77%), Gaps = 1/22 (4%)

Query: 16 EKKEKEEERKEEE-EEEEKNIS 36
          +++E+EE RK EE +E EKN S
Sbjct: 35 KREEEEEARKREERKEREKNKS 56


>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
           [Energy production and conversion].
          Length = 470

 Score = 27.5 bits (61), Expect = 6.7
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query: 15  LEKKEKEEERKEEEEEEEK 33
           +E+  +EEE +EEEEEEE 
Sbjct: 398 VERWAEEEEEEEEEEEEEA 416


>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
          [Translation, ribosomal structure and biogenesis].
          Length = 109

 Score = 26.2 bits (58), Expect = 6.8
 Identities = 12/17 (70%), Positives = 12/17 (70%)

Query: 16 EKKEKEEERKEEEEEEE 32
          E  E EEE KEEE EEE
Sbjct: 82 EADEAEEEEKEEEAEEE 98


>gnl|CDD|146016 pfam03179, V-ATPase_G, Vacuolar (H+)-ATPase G subunit.  This
          family represents the eukaryotic vacuolar (H+)-ATPase
          (V-ATPase) G subunit. V-ATPases generate an acidic
          environment in several intracellular compartments.
          Correspondingly, they are found as membrane-attached
          proteins in several organelles. They are also found in
          the plasma membranes of some specialised cells.
          V-ATPases consist of peripheral (V1) and membrane
          integral (V0) heteromultimeric complexes. The G subunit
          is part of the V1 subunit, but is also thought to be
          strongly attached to the V0 complex. It may be involved
          in the coupling of ATP degradation to H+ translocation.
          Length = 105

 Score = 26.4 bits (59), Expect = 6.9
 Identities = 11/38 (28%), Positives = 19/38 (50%), Gaps = 4/38 (10%)

Query: 17 KKEKEEERKEEEEE----EEKNISAIGPAVTKLRQEAK 50
          +KE EE R + E E    E ++  + G    K+ +E +
Sbjct: 39 EKEIEEYRAQREAEFKEFEAEHSGSRGELEKKIEKETE 76


>gnl|CDD|236321 PRK08645, PRK08645, bifunctional homocysteine
           S-methyltransferase/5,10-methylenetetrahydrofolate
           reductase protein; Reviewed.
          Length = 612

 Score = 27.5 bits (62), Expect = 7.1
 Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 8/47 (17%)

Query: 85  RELLEAGVDI---NTSNASDPNTSFLRAARAGHLDKLRELLEAGVDI 128
           RE +EAG D+   NT  A+      ++  R G  DK++E+  A V +
Sbjct: 50  REYIEAGADVIQTNTFGAN-----RIKLKRYGLEDKVKEINRAAVRL 91


>gnl|CDD|238640 cd01315, L-HYD_ALN, L-Hydantoinases (L-HYDs) and Allantoinase
           (ALN); L-Hydantoinases are a member of the
           dihydropyrimidinase family, which catalyzes the
           reversible hydrolytic ring opening of dihydropyrimidines
           and hydantoins (five-membered cyclic diamides used in
           biotechnology). But L-HYDs differ by having an L-enantio
           specificity and by lacking activity on possible natural
           substrates such as dihydropyrimidines. Allantoinase
           catalyzes the hydrolytic cleavage of the five-member
           ring of allantoin (5-ureidohydantoin) to form allantoic
           acid.
          Length = 447

 Score = 27.3 bits (61), Expect = 7.5
 Identities = 10/14 (71%), Positives = 12/14 (85%)

Query: 79  GHLDKLRELLEAGV 92
           G+LD+LR L EAGV
Sbjct: 128 GNLDQLRPLDEAGV 141



 Score = 27.3 bits (61), Expect = 7.5
 Identities = 10/14 (71%), Positives = 12/14 (85%)

Query: 113 GHLDKLRELLEAGV 126
           G+LD+LR L EAGV
Sbjct: 128 GNLDQLRPLDEAGV 141


>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
          Length = 356

 Score = 27.2 bits (60), Expect = 7.6
 Identities = 10/26 (38%), Positives = 18/26 (69%)

Query: 14  MLEKKEKEEERKEEEEEEEKNISAIG 39
           MLE+ ++++E +E EEEEE +   + 
Sbjct: 331 MLEETQEQQEEEENEEEEENDTVKLS 356


>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
          Length = 310

 Score = 27.3 bits (61), Expect = 7.6
 Identities = 7/17 (41%), Positives = 7/17 (41%)

Query: 16  EKKEKEEERKEEEEEEE 32
                     EEEEEEE
Sbjct: 285 AAAAAAAAPAEEEEEEE 301


>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
           TAF4 family.  This region of similarity is found in
           Transcription initiation factor TFIID component TAF4.
          Length = 255

 Score = 27.0 bits (60), Expect = 7.7
 Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 2/45 (4%)

Query: 15  LEKKEKEEERKEEEEEEEKNISAIGPAVTKLRQEAKIHNAVISLE 59
           L +K+KEEE +  E   E  +    P   +L+Q+AK      S E
Sbjct: 118 LAQKQKEEEERRVERRRELGLE--DPEQLRLKQKAKEEQKAESEE 160


>gnl|CDD|215568 PLN03088, PLN03088, SGT1,  suppressor of G2 allele of SKP1;
           Provisional.
          Length = 356

 Score = 27.1 bits (60), Expect = 7.8
 Identities = 23/78 (29%), Positives = 32/78 (41%), Gaps = 6/78 (7%)

Query: 67  DPNTSFLRAARAGHLDKLRELLEAGVDINTSNASDPNTS--FLRAARA----GHLDKLRE 120
           DPN + L A RA    KL    EA  D N +   DP+ +  +LR   A          + 
Sbjct: 32  DPNNAELYADRAQANIKLGNFTEAVADANKAIELDPSLAKAYLRKGTACMKLEEYQTAKA 91

Query: 121 LLEAGVDINTSNATSKKL 138
            LE G  +   ++   KL
Sbjct: 92  ALEKGASLAPGDSRFTKL 109


>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.
          This family includes the HABP4 family of
          hyaluronan-binding proteins, and the PAI-1 mRNA-binding
          protein, PAI-RBP1. HABP4 has been observed to bind
          hyaluronan (a glucosaminoglycan), but it is not known
          whether this is its primary role in vivo. It has also
          been observed to bind RNA, but with a lower affinity
          than that for hyaluronan. PAI-1 mRNA-binding protein
          specifically binds the mRNA of type-1 plasminogen
          activator inhibitor (PAI-1), and is thought to be
          involved in regulation of mRNA stability. However, in
          both cases, the sequence motifs predicted to be
          important for ligand binding are not conserved
          throughout the family, so it is not known whether
          members of this family share a common function.
          Length = 106

 Score = 26.2 bits (58), Expect = 7.9
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 16 EKKEKEEERKEEEEEEEKNIS 36
            KE   E +E EEEE+K ++
Sbjct: 60 ANKEGVVEEEEVEEEEDKEMT 80


>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
          protein.  This domain family is found in eukaryotes,
          and is typically between 212 and 238 amino acids in
          length. The family is found in association with
          pfam01805. There are two completely conserved residues
          (W and H) that may be functionally important. PRP21 is
          required for assembly of the prespliceosome and it
          interacts with U2 snRNP and/or pre-mRNA in the
          prespliceosome. This family also contains proteins
          similar to PRP21, such as the mammalian SF3a. SF3a also
          interacts with U2 snRNP from the prespliceosome,
          converting it to its active form.
          Length = 230

 Score = 27.0 bits (60), Expect = 7.9
 Identities = 11/16 (68%), Positives = 16/16 (100%)

Query: 18 KEKEEERKEEEEEEEK 33
          ++KEE++KEEEEEEE+
Sbjct: 33 QKKEEQKKEEEEEEER 48


>gnl|CDD|225815 COG3276, SelB, Selenocysteine-specific translation elongation
           factor [Translation, ribosomal structure and
           biogenesis].
          Length = 447

 Score = 27.0 bits (60), Expect = 8.5
 Identities = 25/104 (24%), Positives = 35/104 (33%), Gaps = 23/104 (22%)

Query: 103 NTSFLRAARAGHLDKLRELLEAGVDINTSNATSKKLDNQLNI-----HQKNIKN--NGLN 155
            T+ L+A   G  D+L E  + G+ I+      K  D  +       H   I N   GL 
Sbjct: 13  KTTLLKALTGGVTDRLPEEKKRGITIDLGFYYRKLEDGVMGFIDVPGHPDFISNLLAGLG 72

Query: 156 ALHLAAKDGHTTVVKALLKGGCKVDASTK-KGNTALHIAALDSL 198
            +  A       +V         V A       T  H+  LD L
Sbjct: 73  GIDYA------LLV---------VAADEGLMAQTGEHLLILDLL 101



 Score = 27.0 bits (60), Expect = 8.5
 Identities = 16/61 (26%), Positives = 24/61 (39%), Gaps = 6/61 (9%)

Query: 69  NTSFLRAARAGHLDKLRELLEAGVDINTSNAS----DPNTSFLRAARAGHLDKLRELLEA 124
            T+ L+A   G  D+L E  + G+ I+         D    F+     GH D +  LL  
Sbjct: 13  KTTLLKALTGGVTDRLPEEKKRGITIDLGFYYRKLEDGVMGFIDV--PGHPDFISNLLAG 70

Query: 125 G 125
            
Sbjct: 71  L 71


>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
          co-repressor CIR.  This is a 45 residue conserved
          region at the N-terminal end of a family of proteins
          referred to as CIRs (CBF1-interacting co-repressors).
          CBF1 (centromere-binding factor 1) acts as a
          transcription factor that causes repression by binding
          specifically to GTGGGAA motifs in responsive promoters,
          and it requires CIR as a co-repressor. CIR binds to
          histone deacetylase and to SAP30 and serves as a linker
          between CBF1 and the histone deacetylase complex.
          Length = 37

 Score = 24.4 bits (54), Expect = 9.6
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 14 MLEKKEKEEERKEEEEEEE 32
            E+K +EE++K EE  +E
Sbjct: 15 KAEQKAEEEKKKIEERRKE 33


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.127    0.345 

Gapped
Lambda     K      H
   0.267   0.0812    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,076,277
Number of extensions: 1043159
Number of successful extensions: 3751
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3393
Number of HSP's successfully gapped: 255
Length of query: 225
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 132
Effective length of database: 6,812,680
Effective search space: 899273760
Effective search space used: 899273760
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 57 (25.6 bits)