RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5490
         (352 letters)



>gnl|CDD|238125 cd00204, ANK, ankyrin repeats;  ankyrin repeats mediate
           protein-protein interactions in very diverse families of
           proteins. The number of ANK repeats in a protein can
           range from 2 to over 20 (ankyrins, for example). ANK
           repeats may occur in combinations with other types of
           domains. The structural repeat unit contains two
           antiparallel helices and a beta-hairpin, repeats are
           stacked in a superhelical arrangement; this alignment
           contains 4 consecutive repeats.
          Length = 126

 Score =  124 bits (313), Expect = 4e-35
 Identities = 54/120 (45%), Positives = 77/120 (64%), Gaps = 4/120 (3%)

Query: 63  ESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKIN 122
            +PL LAA  GH+++  LL+E GA++   +++G TPL  AA+ GH E+V +LL+KGA +N
Sbjct: 8   RTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAAKNGHLEIVKLLLEKGADVN 67

Query: 123 AHTEETQETALTLACCGGFLDVADFLLKNGANIELGAS---TPLMEAAQEGHLELVRYLL 179
           A  ++   T L LA   G LDV   LLK+GA++        TPL  AA+ GHLE+V+ LL
Sbjct: 68  A-RDKDGNTPLHLAARNGNLDVVKLLLKHGADVNARDKDGRTPLHLAAKNGHLEVVKLLL 126



 Score =  117 bits (296), Expect = 1e-32
 Identities = 49/123 (39%), Positives = 76/123 (61%), Gaps = 4/123 (3%)

Query: 27  ESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGA 86
            +PL LAA  GH+++  LL+E GA++   N   +   +PL LAA  GH+++  LL+E+GA
Sbjct: 8   RTPLHLAASNGHLEVVKLLLENGADV---NAKDNDGRTPLHLAAKNGHLEIVKLLLEKGA 64

Query: 87  NIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVAD 146
           ++   + +G TPL  AAR G+ ++V +LL+ GA +NA  ++   T L LA   G L+V  
Sbjct: 65  DVNARDKDGNTPLHLAARNGNLDVVKLLLKHGADVNARDKD-GRTPLHLAAKNGHLEVVK 123

Query: 147 FLL 149
            LL
Sbjct: 124 LLL 126



 Score =  100 bits (250), Expect = 7e-26
 Identities = 38/97 (39%), Positives = 56/97 (57%), Gaps = 4/97 (4%)

Query: 90  EVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLL 149
             +++G TPL  AA  GH E+V +LL+ GA +NA   +   T L LA   G L++   LL
Sbjct: 2   ARDEDGRTPLHLAASNGHLEVVKLLLENGADVNAKDND-GRTPLHLAAKNGHLEIVKLLL 60

Query: 150 KNGANIELGAS---TPLMEAAQEGHLELVRYLLDSGA 183
           + GA++        TPL  AA+ G+L++V+ LL  GA
Sbjct: 61  EKGADVNARDKDGNTPLHLAARNGNLDVVKLLLKHGA 97



 Score = 96.7 bits (241), Expect = 1e-24
 Identities = 44/118 (37%), Positives = 65/118 (55%), Gaps = 16/118 (13%)

Query: 7   ESPLTLAACGGH-------------VNMPTDSFESPLTLAACGGHVDLALLLIERGANIE 53
            +PL LAA  GH             VN   +   +PL LAA  GH+++  LL+E+GA+  
Sbjct: 8   RTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAAKNGHLEIVKLLLEKGAD-- 65

Query: 54  EVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMV 111
            VN       +PL LAA  G++D+  LL++ GA++   + +G TPL  AA+ GH E+V
Sbjct: 66  -VNARDKDGNTPLHLAARNGNLDVVKLLLKHGADVNARDKDGRTPLHLAAKNGHLEVV 122



 Score = 58.5 bits (142), Expect = 7e-11
 Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 16/88 (18%)

Query: 8   SPLTLAACGGH-------------VNMPTDSFESPLTLAACGGHVDLALLLIERGANIEE 54
           +PL LAA  GH             VN       +PL LAA  G++D+  LL++ GA++  
Sbjct: 42  TPLHLAAKNGHLEIVKLLLEKGADVNARDKDGNTPLHLAARNGNLDVVKLLLKHGADV-- 99

Query: 55  VNMPTDSFESPLTLAACGGHVDLALLLI 82
            N       +PL LAA  GH+++  LL+
Sbjct: 100 -NARDKDGRTPLHLAAKNGHLEVVKLLL 126


>gnl|CDD|205076 pfam12796, Ank_2, Ankyrin repeats (3 copies). 
          Length = 91

 Score = 93.1 bits (232), Expect = 1e-23
 Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 5/94 (5%)

Query: 30  LTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIE 89
           L LAA  G+++L  LL+E+GA++   +  T      L LAA  G++++  LL+E GA++ 
Sbjct: 1   LHLAAKNGNLELVKLLLEKGADVNLGDTDT-----ALHLAARNGNLEIVKLLLEHGADVN 55

Query: 90  EVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
             + +G T L  AAR G+ E+V +LL+ GA IN 
Sbjct: 56  AKDKDGNTALHLAARNGNLEIVKLLLEHGADINL 89



 Score = 90.8 bits (226), Expect = 7e-23
 Identities = 35/88 (39%), Positives = 53/88 (60%), Gaps = 6/88 (6%)

Query: 99  LMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIELG 158
           L  AA+ G+ E+V +LL+KGA +N    +T   AL LA   G L++   LL++GA++   
Sbjct: 1   LHLAAKNGNLELVKLLLEKGADVNLGDTDT---ALHLAARNGNLEIVKLLLEHGADVNAK 57

Query: 159 AS---TPLMEAAQEGHLELVRYLLDSGA 183
                T L  AA+ G+LE+V+ LL+ GA
Sbjct: 58  DKDGNTALHLAARNGNLEIVKLLLEHGA 85



 Score = 85.4 bits (212), Expect = 6e-21
 Identities = 38/92 (41%), Positives = 58/92 (63%), Gaps = 3/92 (3%)

Query: 66  LTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHT 125
           L LAA  G+++L  LL+E+GA++   +    T L  AAR G+ E+V +LL+ GA +NA  
Sbjct: 1   LHLAAKNGNLELVKLLLEKGADVNLGDT--DTALHLAARNGNLEIVKLLLEHGADVNA-K 57

Query: 126 EETQETALTLACCGGFLDVADFLLKNGANIEL 157
           ++   TAL LA   G L++   LL++GA+I L
Sbjct: 58  DKDGNTALHLAARNGNLEIVKLLLEHGADINL 89



 Score = 60.3 bits (147), Expect = 8e-12
 Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 3/65 (4%)

Query: 28 SPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGAN 87
          + L LAA  G++++  LL+E GA++   N       + L LAA  G++++  LL+E GA+
Sbjct: 30 TALHLAARNGNLEIVKLLLEHGADV---NAKDKDGNTALHLAARNGNLEIVKLLLEHGAD 86

Query: 88 IEEVN 92
          I   +
Sbjct: 87 INLKD 91



 Score = 43.8 bits (104), Expect = 5e-06
 Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 13/62 (20%)

Query: 8  SPLTLAACGGH-------------VNMPTDSFESPLTLAACGGHVDLALLLIERGANIEE 54
          + L LAA  G+             VN       + L LAA  G++++  LL+E GA+I  
Sbjct: 30 TALHLAARNGNLEIVKLLLEHGADVNAKDKDGNTALHLAARNGNLEIVKLLLEHGADINL 89

Query: 55 VN 56
           +
Sbjct: 90 KD 91


>gnl|CDD|222984 PHA03100, PHA03100, ankyrin repeat protein; Provisional.
          Length = 422

 Score = 84.7 bits (210), Expect = 3e-18
 Identities = 45/168 (26%), Positives = 68/168 (40%), Gaps = 32/168 (19%)

Query: 44  LLIERGANIEEVNMPTDSFESPLTLAACGGHV-----DLALLLIERGANIEEVNDEGYTP 98
           +L++ GA+I   N  T +  +PL   +   +      ++  LL+E GAN+   ++ G TP
Sbjct: 53  ILLDNGADI---NSSTKNNSTPLHYLSNIKYNLTDVKEIVKLLLEYGANVNAPDNNGITP 109

Query: 99  LMEAARE--GHEEMVCVLLQKGAKINAHTEETQETALTLA--CCGGFLDVADFLLKNGAN 154
           L+ A  +      +V  LL  GA +N       E  L L        L +   L+  G +
Sbjct: 110 LLYAISKKSNSYSIVEYLLDNGANVNIKN-SDGENLLHLYLESNKIDLKILKLLIDKGVD 168

Query: 155 I----------ELGA---------STPLMEAAQEGHLELVRYLLDSGA 183
           I            G           TPL  A    + E V+YLLD GA
Sbjct: 169 INAKNRVNYLLSYGVPINIKDVYGFTPLHYAVYNNNPEFVKYLLDLGA 216



 Score = 75.1 bits (185), Expect = 5e-15
 Identities = 42/161 (26%), Positives = 67/161 (41%), Gaps = 24/161 (14%)

Query: 16  GGHVNMPTDSFESPLTLAACG--GHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGG 73
           G +VN P ++  +PL  A         +   L++ GAN+   N+     E+ L L     
Sbjct: 96  GANVNAPDNNGITPLLYAISKKSNSYSIVEYLLDNGANV---NIKNSDGENLLHLYLESN 152

Query: 74  HVDLAL--LLIERGAN----------------IEEVNDEGYTPLMEAAREGHEEMVCVLL 115
            +DL +  LLI++G +                I   +  G+TPL  A    + E V  LL
Sbjct: 153 KIDLKILKLLIDKGVDINAKNRVNYLLSYGVPINIKDVYGFTPLHYAVYNNNPEFVKYLL 212

Query: 116 QKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIE 156
             GA  N    +  +T L +A      ++   LL NG +I+
Sbjct: 213 DLGANPNL-VNKYGDTPLHIAILNNNKEIFKLLLNNGPSIK 252



 Score = 59.3 bits (144), Expect = 8e-10
 Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 15/123 (12%)

Query: 16  GGHVNMPTDSFESPLTLAACGGHVDLAL--LLIERGANIE---EVNM------PTDS--- 61
           G +VN+     E+ L L      +DL +  LLI++G +I     VN       P +    
Sbjct: 131 GANVNIKNSDGENLLHLYLESNKIDLKILKLLIDKGVDINAKNRVNYLLSYGVPINIKDV 190

Query: 62  -FESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAK 120
              +PL  A    + +    L++ GAN   VN  G TPL  A    ++E+  +LL  G  
Sbjct: 191 YGFTPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKLLLNNGPS 250

Query: 121 INA 123
           I  
Sbjct: 251 IKT 253



 Score = 39.6 bits (93), Expect = 0.002
 Identities = 21/110 (19%), Positives = 42/110 (38%), Gaps = 21/110 (19%)

Query: 16  GGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHV 75
           G  +N+      +PL  A    + +    L++ GAN    N+     ++PL +A    + 
Sbjct: 182 GVPINIKDVYGFTPLHYAVYNNNPEFVKYLLDLGANP---NLVNKYGDTPLHIAILNNNK 238

Query: 76  DLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHT 125
           ++  LL+  G +I+ + +                   +L  K   +N  T
Sbjct: 239 EIFKLLLNNGPSIKTIIE------------------TLLYFKDKDLNTIT 270


>gnl|CDD|223738 COG0666, Arp, FOG: Ankyrin repeat [General function prediction
           only].
          Length = 235

 Score = 70.6 bits (172), Expect = 4e-14
 Identities = 46/138 (33%), Positives = 65/138 (47%), Gaps = 10/138 (7%)

Query: 64  SPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHE-----EMVCVLLQKG 118
            PL  AA  G   +  LL+  GA++   + +G TPL  AA  G+      E+  +LL+ G
Sbjct: 75  LPLHSAASKGDDKIVKLLLASGADVNAKDADGDTPLHLAALNGNPPEGNIEVAKLLLEAG 134

Query: 119 A--KINAHTEETQETALTLACCGGFLDVADFLLKNGANIELGAS---TPLMEAAQEGHLE 173
           A   +N   +E   T L  A   G  D+ + LL+ GA+     S   T L  AA+ G +E
Sbjct: 135 ADLDVNNLRDEDGNTPLHWAALNGDADIVELLLEAGADPNSRNSYGVTALDPAAKNGRIE 194

Query: 174 LVRYLLDSGAQEMQTKHI 191
           LV+ LLD G      K  
Sbjct: 195 LVKLLLDKGLHLSLLKFN 212



 Score = 62.5 bits (151), Expect = 2e-11
 Identities = 42/171 (24%), Positives = 70/171 (40%), Gaps = 12/171 (7%)

Query: 24  DSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIE 83
               S L L        L + L+   +                        +   LL + 
Sbjct: 2   KPSLSALLLINKCFLDLLLVALLLLLSLDLSNPSDKKLNLYLELALLPAASLSELLLKLI 61

Query: 84  RGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLAC-----C 138
              ++   + +G  PL  AA +G +++V +LL  GA +NA  +   +T L LA       
Sbjct: 62  VDRHLAARDLDGRLPLHSAASKGDDKIVKLLLASGADVNA-KDADGDTPLHLAALNGNPP 120

Query: 139 GGFLDVADFLLKNGANIELGAS------TPLMEAAQEGHLELVRYLLDSGA 183
            G ++VA  LL+ GA++++         TPL  AA  G  ++V  LL++GA
Sbjct: 121 EGNIEVAKLLLEAGADLDVNNLRDEDGNTPLHWAALNGDADIVELLLEAGA 171



 Score = 60.6 bits (146), Expect = 1e-10
 Identities = 42/112 (37%), Positives = 61/112 (54%), Gaps = 5/112 (4%)

Query: 12  LAACGGHVNMPTDSFESPLTLAA-----CGGHVDLALLLIERGANIEEVNMPTDSFESPL 66
           L A G  VN      ++PL LAA       G++++A LL+E GA+++  N+  +   +PL
Sbjct: 92  LLASGADVNAKDADGDTPLHLAALNGNPPEGNIEVAKLLLEAGADLDVNNLRDEDGNTPL 151

Query: 67  TLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKG 118
             AA  G  D+  LL+E GA+    N  G T L  AA+ G  E+V +LL KG
Sbjct: 152 HWAALNGDADIVELLLEAGADPNSRNSYGVTALDPAAKNGRIELVKLLLDKG 203



 Score = 59.1 bits (142), Expect = 3e-10
 Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 12/158 (7%)

Query: 1   MPTDSFESPLTLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTD 60
           +P  S    L       H+         PL  AA  G   +  LL+  GA++   +    
Sbjct: 48  LPAASLSELLLKLIVDRHLAARDLDGRLPLHSAASKGDDKIVKLLLASGADVNAKDAD-- 105

Query: 61  SFESPLTLAA-----CGGHVDLALLLIERGA---NIEEVNDEGYTPLMEAAREGHEEMVC 112
             ++PL LAA       G++++A LL+E GA        +++G TPL  AA  G  ++V 
Sbjct: 106 -GDTPLHLAALNGNPPEGNIEVAKLLLEAGADLDVNNLRDEDGNTPLHWAALNGDADIVE 164

Query: 113 VLLQKGAKINAHTEETQETALTLACCGGFLDVADFLLK 150
           +LL+ GA  N+       TAL  A   G +++   LL 
Sbjct: 165 LLLEAGADPNSRNSYGV-TALDPAAKNGRIELVKLLLD 201


>gnl|CDD|165206 PHA02875, PHA02875, ankyrin repeat protein; Provisional.
          Length = 413

 Score = 59.2 bits (143), Expect = 8e-10
 Identities = 43/127 (33%), Positives = 63/127 (49%), Gaps = 3/127 (2%)

Query: 28  SPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGAN 87
           +PL LA     +D+  LLI RGA+ +  N  TD F SPL LA   G +    LLI+  A 
Sbjct: 104 TPLHLATILKKLDIMKLLIARGADPDIPN--TDKF-SPLHLAVMMGDIKGIELLIDHKAC 160

Query: 88  IEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADF 147
           ++  +  G TPL+ A  +G   +  +LL  GA I+   +     AL  A     +D+   
Sbjct: 161 LDIEDCCGCTPLIIAMAKGDIAICKMLLDSGANIDYFGKNGCVAALCYAIENNKIDIVRL 220

Query: 148 LLKNGAN 154
            +K GA+
Sbjct: 221 FIKRGAD 227



 Score = 57.7 bits (139), Expect = 3e-09
 Identities = 48/161 (29%), Positives = 79/161 (49%), Gaps = 10/161 (6%)

Query: 28  SPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGAN 87
           SP+ LA      +   LL++ GA I +V  P    ES L  A   G V     L++ G  
Sbjct: 37  SPIKLAMKFRDSEAIKLLMKHGA-IPDVKYP--DIESELHDAVEEGDVKAVEELLDLGKF 93

Query: 88  IEEV-NDEGYTPLMEAAREGHEEMVCVLLQKGAKIN-AHTEETQETALTLACCGGFLDVA 145
            ++V   +G TPL  A      +++ +L+ +GA  +  +T++   + L LA   G +   
Sbjct: 94  ADDVFYKDGMTPLHLATILKKLDIMKLLIARGADPDIPNTDKF--SPLHLAVMMGDIKGI 151

Query: 146 DFLLKNGA--NIELG-ASTPLMEAAQEGHLELVRYLLDSGA 183
           + L+ + A  +IE     TPL+ A  +G + + + LLDSGA
Sbjct: 152 ELLIDHKACLDIEDCCGCTPLIIAMAKGDIAICKMLLDSGA 192



 Score = 42.7 bits (100), Expect = 2e-04
 Identities = 35/128 (27%), Positives = 57/128 (44%), Gaps = 8/128 (6%)

Query: 11  TLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGA--NIEEVNMPTDSFESPLTL 68
            L A G   ++P     SPL LA   G +    LLI+  A  +IE+    T     PL +
Sbjct: 120 LLIARGADPDIPNTDKFSPLHLAVMMGDIKGIELLIDHKACLDIEDCCGCT-----PLII 174

Query: 69  AACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHE-EMVCVLLQKGAKINAHTEE 127
           A   G + +  +L++ GANI+     G    +  A E ++ ++V + +++GA  N     
Sbjct: 175 AMAKGDIAICKMLLDSGANIDYFGKNGCVAALCYAIENNKIDIVRLFIKRGADCNIMFMI 234

Query: 128 TQETALTL 135
             E    L
Sbjct: 235 EGEECTIL 242



 Score = 29.2 bits (65), Expect = 4.0
 Identities = 28/115 (24%), Positives = 51/115 (44%), Gaps = 5/115 (4%)

Query: 73  GHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETA 132
           G +D+A  L++ G N      +G +P+  A +    E + +L++ GA  +    +  E+ 
Sbjct: 13  GELDIARRLLDIGINPNFEIYDGISPIKLAMKFRDSEAIKLLMKHGAIPDVKYPDI-ESE 71

Query: 133 LTLACCGGFLDVADFLLKNGANIE----LGASTPLMEAAQEGHLELVRYLLDSGA 183
           L  A   G +   + LL  G   +        TPL  A     L++++ L+  GA
Sbjct: 72  LHDAVEEGDVKAVEELLDLGKFADDVFYKDGMTPLHLATILKKLDIMKLLIARGA 126


>gnl|CDD|222277 pfam13637, Ank_4, Ankyrin repeats (many copies). 
          Length = 54

 Score = 53.0 bits (128), Expect = 1e-09
 Identities = 18/52 (34%), Positives = 31/52 (59%)

Query: 64  SPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLL 115
           + L  AA  G ++L   L+E+G +I   +++G T L  AA  G+ E++ +LL
Sbjct: 3   TALHKAAISGRLELVKYLLEKGVDINRTDEDGNTALHIAAENGNLEVLKLLL 54



 Score = 53.0 bits (128), Expect = 1e-09
 Identities = 24/55 (43%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 95  GYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLL 149
           G T L +AA  G  E+V  LL+KG  IN   E+   TAL +A   G L+V   LL
Sbjct: 1   GRTALHKAAISGRLELVKYLLEKGVDINRTDED-GNTALHIAAENGNLEVLKLLL 54



 Score = 43.0 bits (102), Expect = 4e-06
 Identities = 16/55 (29%), Positives = 30/55 (54%), Gaps = 3/55 (5%)

Query: 28 SPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLI 82
          + L  AA  G ++L   L+E+G +I   N   +   + L +AA  G++++  LL+
Sbjct: 3  TALHKAAISGRLELVKYLLEKGVDI---NRTDEDGNTALHIAAENGNLEVLKLLL 54



 Score = 35.3 bits (82), Expect = 0.003
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 161 TPLMEAAQEGHLELVRYLLDSGAQ 184
           T L +AA  G LELV+YLL+ G  
Sbjct: 3   TALHKAAISGRLELVKYLLEKGVD 26


>gnl|CDD|222939 PHA02878, PHA02878, ankyrin repeat protein; Provisional.
          Length = 477

 Score = 54.5 bits (131), Expect = 4e-08
 Identities = 43/156 (27%), Positives = 69/156 (44%), Gaps = 9/156 (5%)

Query: 4   DSFESPLT--LAACGGHVNMPT-DSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTD 60
           D  E+ +T  L + G  +NM       + L  A       L  LL+  GAN   VN+P  
Sbjct: 143 DIIEAEITKLLLSYGADINMKDRHKGNTALHYATENKDQRLTELLLSYGAN---VNIPDK 199

Query: 61  SFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPL-MEAAREGHEEMVCVLLQKGA 119
           +  SPL  A    +  +  +L+E GA+ +  +  G TPL +        +++ +LL+ G 
Sbjct: 200 TNNSPLHHAVKHYNKPIVHILLENGASTDARDKCGNTPLHISVGYCKDYDILKLLLEHGV 259

Query: 120 KINAHTEETQETALTLACCGGFLDVADFLLKNGANI 155
            +NA +     TAL  +           LL+ GA+I
Sbjct: 260 DVNAKSYILGLTALHSSIKSE--RKLKLLLEYGADI 293



 Score = 48.3 bits (115), Expect = 3e-06
 Identities = 40/163 (24%), Positives = 72/163 (44%), Gaps = 22/163 (13%)

Query: 44  LLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAA 103
           LL+  GA+I   +    +  + L  A       L  LL+  GAN+   +    +PL  A 
Sbjct: 152 LLLSYGADINMKDRHKGN--TALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAV 209

Query: 104 REGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFL---DVADFLLKNGANIELGAS 160
           +  ++ +V +LL+ GA  +   ++   T L ++   G+    D+   LL++G ++   + 
Sbjct: 210 KHYNKPIVHILLENGASTD-ARDKCGNTPLHISV--GYCKDYDILKLLLEHGVDVNAKSY 266

Query: 161 ----TPLMEAAQ-EGHLELVRYLLDSGAQEMQTKHIDILEFKK 198
               T L  + + E  L+L   LL+ GA       I+ L   K
Sbjct: 267 ILGLTALHSSIKSERKLKL---LLEYGAD------INSLNSYK 300



 Score = 41.4 bits (97), Expect = 5e-04
 Identities = 26/104 (25%), Positives = 47/104 (45%), Gaps = 9/104 (8%)

Query: 83  ERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFL 142
            +  ++  ++ +    ++EA      E+  +LL  GA IN        TAL  A      
Sbjct: 128 IQTIDLVYIDKKSKDDIIEA------EITKLLLSYGADINMKDRHKGNTALHYATENKDQ 181

Query: 143 DVADFLLKNGANI---ELGASTPLMEAAQEGHLELVRYLLDSGA 183
            + + LL  GAN+   +   ++PL  A +  +  +V  LL++GA
Sbjct: 182 RLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGA 225



 Score = 38.3 bits (89), Expect = 0.004
 Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 18/128 (14%)

Query: 16  GGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHV 75
           G +VN+P  +  SPL  A    +  +  +L+E GA+ +  +   +   +PL ++   G+ 
Sbjct: 191 GANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGASTDARDKCGN---TPLHISV--GYC 245

Query: 76  ---DLALLLIERGANIEEVNDE----GYTPLMEAAREGHEEMVCVLLQKGAKINAHTEET 128
              D+  LL+E G +   VN +    G T L  + +   E  + +LL+ GA IN+     
Sbjct: 246 KDYDILKLLLEHGVD---VNAKSYILGLTALHSSIKS--ERKLKLLLEYGADINSLNSY- 299

Query: 129 QETALTLA 136
           + T L+ A
Sbjct: 300 KLTPLSSA 307


>gnl|CDD|165207 PHA02876, PHA02876, ankyrin repeat protein; Provisional.
          Length = 682

 Score = 53.9 bits (129), Expect = 7e-08
 Identities = 56/176 (31%), Positives = 86/176 (48%), Gaps = 13/176 (7%)

Query: 16  GGHVNMPTDSFESPLTLAACGGH-VDLALLLIERGANIEEVNMPTDSFESPLTLAAC-GG 73
           G  VN      E+PL L A  G+  +    LI  GA   +VN     + +PL  A+    
Sbjct: 297 GADVNAKNIKGETPLYLMAKNGYDTENIRTLIMLGA---DVNAADRLYITPLHQASTLDR 353

Query: 74  HVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETAL 133
           + D+ + L+E GAN+   +    TP+  AA   +  ++  LL  GA I A +++   TAL
Sbjct: 354 NKDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEALSQKIG-TAL 412

Query: 134 TLACCGG--FLDVADFLLKNGANIELG---ASTPLMEAAQEG-HLELVRYLLDSGA 183
             A CG   ++ V   L+  GAN+       STPL  A ++   L+++  LLD+GA
Sbjct: 413 HFALCGTNPYMSVKT-LIDRGANVNSKNKDLSTPLHYACKKNCKLDVIEMLLDNGA 467



 Score = 47.4 bits (112), Expect = 9e-06
 Identities = 45/152 (29%), Positives = 76/152 (50%), Gaps = 10/152 (6%)

Query: 39  VDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVD-LALLLIERGANIEEVNDEGYT 97
           ++ +LLL + G ++  ++   D   +PL  A+    +  L   L+ERGA++   N +G T
Sbjct: 253 LETSLLLYDAGFSVNSID---DCKNTPLHHASQAPSLSRLVPKLLERGADVNAKNIKGET 309

Query: 98  PLMEAAREGHE-EMVCVLLQKGAKINAHTEETQETALTLACC-GGFLDVADFLLKNGANI 155
           PL   A+ G++ E +  L+  GA +NA  +    T L  A       D+   LL+ GAN+
Sbjct: 310 PLYLMAKNGYDTENIRTLIMLGADVNA-ADRLYITPLHQASTLDRNKDIVITLLELGANV 368

Query: 156 ---ELGASTPLMEAAQEGHLELVRYLLDSGAQ 184
              +    TP+  AA   ++ ++  LLD GA 
Sbjct: 369 NARDYCDKTPIHYAAVRNNVVIINTLLDYGAD 400



 Score = 46.6 bits (110), Expect = 2e-05
 Identities = 41/148 (27%), Positives = 74/148 (50%), Gaps = 8/148 (5%)

Query: 10  LTLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLA 69
           +TL   G +VN      ++P+  AA   +V +   L++ GA+IE +   +    + L  A
Sbjct: 359 ITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEAL---SQKIGTALHFA 415

Query: 70  ACGGHVDLAL-LLIERGANIEEVNDEGYTPLMEAAREGHE-EMVCVLLQKGAKINAHTEE 127
            CG +  +++  LI+RGAN+   N +  TPL  A ++  + +++ +LL  GA +NA   +
Sbjct: 416 LCGTNPYMSVKTLIDRGANVNSKNKDLSTPLHYACKKNCKLDVIEMLLDNGADVNAINIQ 475

Query: 128 TQETALTLACCGGFLDVADFLLKNGANI 155
            Q   L       +  + + LL  GA +
Sbjct: 476 NQYPLLIAL---EYHGIVNILLHYGAEL 500



 Score = 38.5 bits (89), Expect = 0.005
 Identities = 30/106 (28%), Positives = 51/106 (48%), Gaps = 2/106 (1%)

Query: 77  LALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLA 136
           +A +L+E GA++   +    TP+  AA  G+ +MV +LL  GA +N    +   + L  A
Sbjct: 160 IAEMLLEGGADVNAKDIYCITPIHYAAERGNAKMVNLLLSYGADVNIIALDDL-SVLECA 218

Query: 137 CCGGFLDVADFLLKNGANIELGASTPLMEAAQEGHLELVRYLLDSG 182
                +D    ++ N +NI       L++A +   LE    L D+G
Sbjct: 219 VDSKNIDTIKAIIDNRSNINKN-DLSLLKAIRNEDLETSLLLYDAG 263



 Score = 30.8 bits (69), Expect = 1.4
 Identities = 41/160 (25%), Positives = 72/160 (45%), Gaps = 23/160 (14%)

Query: 41  LALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLM 100
           +A +L+E GA   +VN       +P+  AA  G+  +  LL+  GA++  +  +  + L 
Sbjct: 160 IAEMLLEGGA---DVNAKDIYCITPIHYAAERGNAKMVNLLLSYGADVNIIALDDLSVLE 216

Query: 101 EAAREGHEEMVCVLLQKGAKIN--------AHTEETQETALTLACCGGFLDVADFLLKNG 152
            A    + + +  ++   + IN        A   E  ET+L L   G  ++         
Sbjct: 217 CAVDSKNIDTIKAIIDNRSNINKNDLSLLKAIRNEDLETSLLLYDAGFSVN--------- 267

Query: 153 ANIELGASTPLMEAAQEGHL-ELVRYLLDSGAQEMQTKHI 191
            +I+   +TPL  A+Q   L  LV  LL+ GA ++  K+I
Sbjct: 268 -SIDDCKNTPLHHASQAPSLSRLVPKLLERGA-DVNAKNI 305



 Score = 30.0 bits (67), Expect = 2.2
 Identities = 16/45 (35%), Positives = 28/45 (62%), Gaps = 3/45 (6%)

Query: 142 LDVADFLLKNGANI---ELGASTPLMEAAQEGHLELVRYLLDSGA 183
           L +A+ LL+ GA++   ++   TP+  AA+ G+ ++V  LL  GA
Sbjct: 158 LLIAEMLLEGGADVNAKDIYCITPIHYAAERGNAKMVNLLLSYGA 202


>gnl|CDD|215625 PLN03192, PLN03192, Voltage-dependent potassium channel;
           Provisional.
          Length = 823

 Score = 53.3 bits (128), Expect = 1e-07
 Identities = 34/123 (27%), Positives = 55/123 (44%), Gaps = 6/123 (4%)

Query: 64  SPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
           +PL +AA  G+ D  L+L++   N+   +  G T L  A    H ++  +L    +  + 
Sbjct: 560 TPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWNAISAKHHKIFRILYHFASISDP 619

Query: 124 HTEETQETALTLACCGGFLDVADFLLKNGANIEL---GASTPLMEAAQEGHLELVRYLLD 180
           H        L  A     L     LLK G N++      +T L  A  E H+++VR L+ 
Sbjct: 620 H---AAGDLLCTAAKRNDLTAMKELLKQGLNVDSEDHQGATALQVAMAEDHVDMVRLLIM 676

Query: 181 SGA 183
           +GA
Sbjct: 677 NGA 679



 Score = 49.9 bits (119), Expect = 1e-06
 Identities = 48/198 (24%), Positives = 79/198 (39%), Gaps = 22/198 (11%)

Query: 28  SPLTLAACGGHVDLALLLIERGAN--IEEVNMPTDSFESPLTLAACGGHVDLALLLIE-- 83
           +PL +AA  G+ D  L+L++   N  I + N  T  +      A    H  +  +L    
Sbjct: 560 TPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWN-----AISAKHHKIFRILYHFA 614

Query: 84  RGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQ-ETALTLACCGGFL 142
             ++     D     L  AA+      +  LL++G  +N  +E+ Q  TAL +A     +
Sbjct: 615 SISDPHAAGD----LLCTAAKRNDLTAMKELLKQG--LNVDSEDHQGATALQVAMAEDHV 668

Query: 143 DVADFLLKNGANI------ELGASTPLMEAAQEGHLELVRYLLDSGAQEMQTKHIDILEF 196
           D+   L+ NGA++      +  + T L E  Q+  L     ++DS   +      D    
Sbjct: 669 DMVRLLIMNGADVDKANTDDDFSPTELRELLQKRELGHSITIVDSVPADEPDLGRDGGSR 728

Query: 197 KKRKQEESKAAFCRGSES 214
             R Q  S    CR   S
Sbjct: 729 PGRLQGTSSDNQCRPRVS 746


>gnl|CDD|200936 pfam00023, Ank, Ankyrin repeat.  Ankyrins are multifunctional
           adaptors that link specific proteins to the
           membrane-associated, spectrin- actin cytoskeleton. This
           repeat-domain is a 'membrane-binding' domain of up to 24
           repeated units, and it mediates most of the protein's
           binding activities. Repeats 13-24 are especially active,
           with known sites of interaction for the Na/K ATPase,
           Cl/HCO(3) anion exchanger, voltage-gated sodium channel,
           clathrin heavy chain and L1 family cell adhesion
           molecules. The ANK repeats are found to form a
           contiguous spiral stack such that ion transporters like
           the anion exchanger associate in a large central cavity
           formed by the ANK repeat spiral, while clathrin and cell
           adhesion molecules associate with specific regions
           outside this cavity.
          Length = 33

 Score = 47.2 bits (113), Expect = 1e-07
 Identities = 17/33 (51%), Positives = 23/33 (69%)

Query: 94  EGYTPLMEAAREGHEEMVCVLLQKGAKINAHTE 126
           +G TPL  AAR GH E+V +LL+ GA +NA  +
Sbjct: 1   DGNTPLHLAARNGHLEVVKLLLEAGADVNARDK 33



 Score = 40.2 bits (95), Expect = 3e-05
 Identities = 14/23 (60%), Positives = 19/23 (82%)

Query: 161 TPLMEAAQEGHLELVRYLLDSGA 183
           TPL  AA+ GHLE+V+ LL++GA
Sbjct: 4   TPLHLAARNGHLEVVKLLLEAGA 26



 Score = 37.9 bits (89), Expect = 2e-04
 Identities = 12/31 (38%), Positives = 20/31 (64%)

Query: 63 ESPLTLAACGGHVDLALLLIERGANIEEVND 93
           +PL LAA  GH+++  LL+E GA++   + 
Sbjct: 3  NTPLHLAARNGHLEVVKLLLEAGADVNARDK 33



 Score = 37.1 bits (87), Expect = 3e-04
 Identities = 12/30 (40%), Positives = 20/30 (66%)

Query: 27 ESPLTLAACGGHVDLALLLIERGANIEEVN 56
           +PL LAA  GH+++  LL+E GA++   +
Sbjct: 3  NTPLHLAARNGHLEVVKLLLEAGADVNARD 32



 Score = 35.6 bits (83), Expect = 0.001
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query: 130 ETALTLACCGGFLDVADFLLKNGANI 155
            T L LA   G L+V   LL+ GA++
Sbjct: 3   NTPLHLAARNGHLEVVKLLLEAGADV 28


>gnl|CDD|222931 PHA02798, PHA02798, ankyrin-like protein; Provisional.
          Length = 489

 Score = 52.5 bits (126), Expect = 1e-07
 Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 17/126 (13%)

Query: 39  VDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHV---DLALLLIERGANIEEVNDEG 95
           +D+  +LIE GA+I   N      E+PL      G++   ++ L +IE GA+   ++ +G
Sbjct: 89  LDIVKILIENGADI---NKKNSDGETPLYCLLSNGYINNLEILLFMIENGADTTLLDKDG 145

Query: 96  YTPLMEAAREGHE---EMVCVLLQKGAKINAHTEETQETALTLACCGGF------LDVAD 146
           +T L    +  H    E++ +LL+KG  IN H  + +    TL C   +       D+  
Sbjct: 146 FTMLQVYLQSNHHIDIEIIKLLLEKGVDINTHNNKEKYD--TLHCYFKYNIDRIDADILK 203

Query: 147 FLLKNG 152
             + NG
Sbjct: 204 LFVDNG 209



 Score = 41.4 bits (97), Expect = 5e-04
 Identities = 42/155 (27%), Positives = 65/155 (41%), Gaps = 45/155 (29%)

Query: 39  VDLALLLIERGANIEEVNMPTDSFESPL-TLAA----CGGHVDLALLLIERGANIEEVND 93
            D+  L I  GAN   VN   + + +PL T+ +        +D+  +LIE GA+I + N 
Sbjct: 51  TDIVKLFINLGAN---VNGLDNEYSTPLCTILSNIKDYKHMLDIVKILIENGADINKKNS 107

Query: 94  EGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGA 153
           +G TPL              LL  G  IN                   L++  F+++NGA
Sbjct: 108 DGETPL------------YCLLSNGY-INN------------------LEILLFMIENGA 136

Query: 154 NIELGAS---TPLMEAAQEGH---LELVRYLLDSG 182
           +  L      T L    Q  H   +E+++ LL+ G
Sbjct: 137 DTTLLDKDGFTMLQVYLQSNHHIDIEIIKLLLEKG 171



 Score = 31.7 bits (72), Expect = 0.67
 Identities = 22/91 (24%), Positives = 37/91 (40%), Gaps = 3/91 (3%)

Query: 39  VDLALLLIERGANIEEVNMPTDS--FESPLTLAACGGHVDLALL-LIERGANIEEVNDEG 95
            D+  L ++ G  I + N        E   +L          +L  I    +I +V++ G
Sbjct: 199 ADILKLFVDNGFIINKENKSHKKKFMEYLNSLLYDNKRFKKNILDFIFSYIDINQVDELG 258

Query: 96  YTPLMEAAREGHEEMVCVLLQKGAKINAHTE 126
           + PL  +    + ++   LLQ G  IN  TE
Sbjct: 259 FNPLYYSVSHNNRKIFEYLLQLGGDINIITE 289


>gnl|CDD|165205 PHA02874, PHA02874, ankyrin repeat protein; Provisional.
          Length = 434

 Score = 50.0 bits (119), Expect = 8e-07
 Identities = 47/196 (23%), Positives = 78/196 (39%), Gaps = 28/196 (14%)

Query: 16  GGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHV 75
           G  +N+  D   +PL  A   G   +  L I+ GA+I  +N  T      LT    G H 
Sbjct: 25  GNCINISVDETTTPLIDAIRSGDAKIVELFIKHGADINHIN--TKIPHPLLTAIKIGAHD 82

Query: 76  DLALLLI----------------------ERGANIEEVNDEGYTPLMEAAREGHEEMVCV 113
            + LL+                       + G ++   + E  T L  A ++G  E + +
Sbjct: 83  IIKLLIDNGVDTSILPIPCIEKDMIKTILDCGIDVNIKDAELKTFLHYAIKKGDLESIKM 142

Query: 114 LLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIELG---ASTPLMEAAQEG 170
           L + GA +N   ++     + +A    F D+   LL+ GA   +      +PL  AA+ G
Sbjct: 143 LFEYGADVNIE-DDNGCYPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYG 201

Query: 171 HLELVRYLLDSGAQEM 186
               ++ L+D G   M
Sbjct: 202 DYACIKLLIDHGNHIM 217



 Score = 44.6 bits (105), Expect = 5e-05
 Identities = 49/155 (31%), Positives = 68/155 (43%), Gaps = 11/155 (7%)

Query: 6   FESPLTLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESP 65
            ES   L   G  VN+  D+   P+ +A      D+  LL+E+GA     N+  ++ ESP
Sbjct: 137 LESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAYA---NVKDNNGESP 193

Query: 66  LTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHT 125
           L  AA  G      LLI+ G +I      G+TPL  A    H      LL   A IN   
Sbjct: 194 LHNAAEYGDYACIKLLIDHGNHIMNKCKNGFTPLHNAII--HNRSAIELLINNASIND-Q 250

Query: 126 EETQETALTLAC---CGGFLDVADFLLKNGANIEL 157
           +    T L  A    C   +D+ D LL + A+I +
Sbjct: 251 DIDGSTPLHHAINPPCD--IDIIDILLYHKADISI 283



 Score = 40.3 bits (94), Expect = 0.001
 Identities = 44/170 (25%), Positives = 70/170 (41%), Gaps = 27/170 (15%)

Query: 11  TLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAA 70
           T+  CG  VN+     ++ L  A   G ++   +L E GA   +VN+  D+   P+ +A 
Sbjct: 109 TILDCGIDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGA---DVNIEDDNGCYPIHIAI 165

Query: 71  CGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQE 130
                D+  LL+E+GA     ++ G +PL  AA  G    + +L+  G  I         
Sbjct: 166 KHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYACIKLLIDHGNHI--------- 216

Query: 131 TALTLACCGGFLDVADFLLKNGANIEL------------GASTPLMEAAQ 168
                 C  GF  + + ++ N + IEL              STPL  A  
Sbjct: 217 ---MNKCKNGFTPLHNAIIHNRSAIELLINNASINDQDIDGSTPLHHAIN 263



 Score = 32.2 bits (73), Expect = 0.35
 Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 5/99 (5%)

Query: 4   DSFESPLTLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFE 63
           + F+    L   G + N+  ++ ESPL  AA  G      LLI+ G +I  +N   + F 
Sbjct: 168 NFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYACIKLLIDHGNHI--MNKCKNGF- 224

Query: 64  SPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEA 102
           +PL  A       + LL+     N ++++  G TPL  A
Sbjct: 225 TPLHNAIIHNRSAIELLINNASINDQDID--GSTPLHHA 261


>gnl|CDD|197603 smart00248, ANK, ankyrin repeats.  Ankyrin repeats are about 33
           amino acids long and occur in at least four consecutive
           copies. They are involved in protein-protein
           interactions. The core of the repeat seems to be an
           helix-loop-helix structure.
          Length = 30

 Score = 43.7 bits (104), Expect = 2e-06
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 94  EGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
           +G TPL  AA  G+ E+V +LL KGA INA
Sbjct: 1   DGRTPLHLAAENGNLEVVKLLLDKGADINA 30



 Score = 37.6 bits (88), Expect = 3e-04
 Identities = 14/23 (60%), Positives = 18/23 (78%)

Query: 161 TPLMEAAQEGHLELVRYLLDSGA 183
           TPL  AA+ G+LE+V+ LLD GA
Sbjct: 4   TPLHLAAENGNLEVVKLLLDKGA 26



 Score = 34.5 bits (80), Expect = 0.003
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 28 SPLTLAACGGHVDLALLLIERGANIEE 54
          +PL LAA  G++++  LL+++GA+I  
Sbjct: 4  TPLHLAAENGNLEVVKLLLDKGADINA 30



 Score = 34.5 bits (80), Expect = 0.003
 Identities = 11/27 (40%), Positives = 20/27 (74%)

Query: 64 SPLTLAACGGHVDLALLLIERGANIEE 90
          +PL LAA  G++++  LL+++GA+I  
Sbjct: 4  TPLHLAAENGNLEVVKLLLDKGADINA 30



 Score = 34.5 bits (80), Expect = 0.003
 Identities = 12/27 (44%), Positives = 14/27 (51%)

Query: 131 TALTLACCGGFLDVADFLLKNGANIEL 157
           T L LA   G L+V   LL  GA+I  
Sbjct: 4   TPLHLAAENGNLEVVKLLLDKGADINA 30


>gnl|CDD|205784 pfam13606, Ank_3, Ankyrin repeat.  Ankyrins are multifunctional
           adaptors that link specific proteins to the
           membrane-associated, spectrin- actin cytoskeleton. This
           repeat-domain is a 'membrane-binding' domain of up to 24
           repeated units, and it mediates most of the protein's
           binding activities.
          Length = 30

 Score = 43.0 bits (102), Expect = 3e-06
 Identities = 17/30 (56%), Positives = 22/30 (73%)

Query: 94  EGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
           +G TPL  AAR G+ E+V +LL+ GA INA
Sbjct: 1   DGNTPLHLAARNGNLELVKLLLEHGADINA 30



 Score = 36.1 bits (84), Expect = 0.001
 Identities = 14/23 (60%), Positives = 18/23 (78%)

Query: 161 TPLMEAAQEGHLELVRYLLDSGA 183
           TPL  AA+ G+LELV+ LL+ GA
Sbjct: 4   TPLHLAARNGNLELVKLLLEHGA 26



 Score = 33.8 bits (78), Expect = 0.005
 Identities = 11/26 (42%), Positives = 16/26 (61%)

Query: 130 ETALTLACCGGFLDVADFLLKNGANI 155
            T L LA   G L++   LL++GA+I
Sbjct: 3   NTPLHLAARNGNLELVKLLLEHGADI 28



 Score = 32.6 bits (75), Expect = 0.015
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 27 ESPLTLAACGGHVDLALLLIERGANI 52
           +PL LAA  G+++L  LL+E GA+I
Sbjct: 3  NTPLHLAARNGNLELVKLLLEHGADI 28



 Score = 32.6 bits (75), Expect = 0.015
 Identities = 13/26 (50%), Positives = 19/26 (73%)

Query: 63 ESPLTLAACGGHVDLALLLIERGANI 88
           +PL LAA  G+++L  LL+E GA+I
Sbjct: 3  NTPLHLAARNGNLELVKLLLEHGADI 28


>gnl|CDD|222980 PHA03095, PHA03095, ankyrin-like protein; Provisional.
          Length = 471

 Score = 47.7 bits (114), Expect = 5e-06
 Identities = 44/161 (27%), Positives = 72/161 (44%), Gaps = 35/161 (21%)

Query: 44  LLIERGANIEEVNMPTDSFESPLTL---AACGGHVDLALLLIERGANIEEVNDEGYTPL- 99
            L+  GA+   VN   +  ++PL L    +     D+  LL+E GA++      G+TPL 
Sbjct: 32  RLLAAGAD---VNFRGEYGKTPLHLYLHYSSEKVKDIVRLLLEAGADVNAPERCGFTPLH 88

Query: 100 --MEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGF---LDVADFLLKNGAN 154
             +  A     +++ +L++ GA +NA  ++   T L      GF     V   LL+ GA+
Sbjct: 89  LYLYNATT--LDVIKLLIKAGADVNA-KDKVGRTPLH-VYLSGFNINPKVIRLLLRKGAD 144

Query: 155 IE---LGASTPLMEAAQEGH---------LELVRYLLDSGA 183
           +    L   TPL                 +EL+R L+D+GA
Sbjct: 145 VNALDLYGMTPL-------AVLLKSRNANVELLRLLIDAGA 178



 Score = 43.5 bits (103), Expect = 1e-04
 Identities = 28/94 (29%), Positives = 39/94 (41%), Gaps = 6/94 (6%)

Query: 45  LIERGANIEEVNMPTDSFESPLTLAACGGHVDLALL--LIERGANIEEVNDEGYTPLMEA 102
           LI  G +    +M      +PL   A G     +L+  L+  G +I   N  G TPL  A
Sbjct: 208 LIRAGCDPAATDM---LGNTPLHSMATGSSCKRSLVLPLLIAGISINARNRYGQTPLHYA 264

Query: 103 AREGHEEMVCVLLQKGAKINAHTEETQETALTLA 136
           A   +      L+  GA INA +     T L+L 
Sbjct: 265 AVFNNPRACRRLIALGADINAVS-SDGNTPLSLM 297



 Score = 41.5 bits (98), Expect = 4e-04
 Identities = 25/93 (26%), Positives = 41/93 (44%), Gaps = 6/93 (6%)

Query: 10  LTLAACGGHVNMPTDSFESPLTLAACGGHV-DLALLLIERGANIEEVNMPTDSFESPLTL 68
             L   G  VN P     +PL L        D+  LLI+ GA++   +    +   PL +
Sbjct: 67  RLLLEAGADVNAPERCGFTPLHLYLYNATTLDVIKLLIKAGADVNAKDKVGRT---PLHV 123

Query: 69  AACGG--HVDLALLLIERGANIEEVNDEGYTPL 99
              G   +  +  LL+ +GA++  ++  G TPL
Sbjct: 124 YLSGFNINPKVIRLLLRKGADVNALDLYGMTPL 156



 Score = 33.5 bits (77), Expect = 0.17
 Identities = 29/109 (26%), Positives = 41/109 (37%), Gaps = 5/109 (4%)

Query: 11  TLAACGGHVNMPTDSFESPLTLAACGGHVDLALL--LIERGANIEEVNMPTDSFESPLTL 68
            L   G           +PL   A G     +L+  L+  G +I   N      ++PL  
Sbjct: 207 ELIRAGCDPAATDMLGNTPLHSMATGSSCKRSLVLPLLIAGISINARN---RYGQTPLHY 263

Query: 69  AACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQK 117
           AA   +      LI  GA+I  V+ +G TPL    R  +   V   L K
Sbjct: 264 AAVFNNPRACRRLIALGADINAVSSDGNTPLSLMVRNNNGRAVRAALAK 312


>gnl|CDD|206028 pfam13857, Ank_5, Ankyrin repeats (many copies). 
          Length = 56

 Score = 42.3 bits (100), Expect = 8e-06
 Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 81  LIERG-ANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLA 136
           L+E G  ++   +  G TPL  AA+ G  E+V  LL+ G  +N    +   TAL LA
Sbjct: 1   LLEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDLNLRDSDGL-TALDLA 56



 Score = 36.9 bits (86), Expect = 7e-04
 Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 2/58 (3%)

Query: 45  LIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEA 102
           L+E G    ++N    +  +PL LAA  G ++L   L++ G ++   + +G T L  A
Sbjct: 1   LLEHGP--IDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDLNLRDSDGLTALDLA 56



 Score = 35.8 bits (83), Expect = 0.002
 Identities = 15/44 (34%), Positives = 22/44 (50%)

Query: 114 LLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIEL 157
           LL+ G      T+    T L LA   G L++  +LLK G ++ L
Sbjct: 1   LLEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDLNL 44



 Score = 31.9 bits (73), Expect = 0.042
 Identities = 16/40 (40%), Positives = 23/40 (57%), Gaps = 4/40 (10%)

Query: 148 LLKNGAN----IELGASTPLMEAAQEGHLELVRYLLDSGA 183
           LL++G       +   +TPL  AA+ G LELV++LL  G 
Sbjct: 1   LLEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGV 40



 Score = 30.8 bits (70), Expect = 0.096
 Identities = 15/58 (25%), Positives = 26/58 (44%), Gaps = 3/58 (5%)

Query: 12 LAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLA 69
          L      +N    +  +PL LAA  G ++L   L++ G ++   N+      + L LA
Sbjct: 2  LEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDL---NLRDSDGLTALDLA 56


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 46.6 bits (111), Expect = 9e-06
 Identities = 21/95 (22%), Positives = 31/95 (32%), Gaps = 6/95 (6%)

Query: 257 DQSSVVNVATDST---LSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQN 313
           DQSS            L   T+T   +             ++ +   ++A   +    QN
Sbjct: 148 DQSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQN 207

Query: 314 ASPATQSSNVATQNTPV---TNLNNAVATLKTAVA 345
           A  A   +NV T  TP        +  A L T  A
Sbjct: 208 AVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQA 242



 Score = 37.3 bits (87), Expect = 0.009
 Identities = 15/91 (16%), Positives = 30/91 (32%)

Query: 235 PSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVS 294
            +   S P ++  T +P  +    + V    T+S    V + P+ +          P  +
Sbjct: 156 QNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQA 215

Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVAT 325
                 + A  +       A   T  + V+T
Sbjct: 216 NVDTAATPAPAAPATPDGAAPLPTDQAGVST 246



 Score = 34.2 bits (79), Expect = 0.072
 Identities = 13/85 (15%), Positives = 22/85 (25%), Gaps = 5/85 (5%)

Query: 256 DDQSSVVNVATDSTLSHV-TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNA 314
              ++              T T S + A  T        +   Q  +    S       A
Sbjct: 166 TSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAP----AVDPQQNAVVAPSQANVDTAA 221

Query: 315 SPATQSSNVATQNTPVTNLNNAVAT 339
           +PA  +        P+      V+T
Sbjct: 222 TPAPAAPATPDGAAPLPTDQAGVST 246



 Score = 33.8 bits (78), Expect = 0.11
 Identities = 22/105 (20%), Positives = 35/105 (33%), Gaps = 10/105 (9%)

Query: 211 GSESTP-TVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDST 269
             +S P   S+     TT  P +   +   +    +  T   P     Q++VV  +  + 
Sbjct: 158 SGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANV 217

Query: 270 LSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNA 314
            +  T  P+  A        P         T  A  S+ AA  NA
Sbjct: 218 DTAATPAPAAPATPDGAAPLP---------TDQAGVSTPAADPNA 253



 Score = 33.1 bits (76), Expect = 0.21
 Identities = 16/93 (17%), Positives = 29/93 (31%), Gaps = 4/93 (4%)

Query: 230 PVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPS-TSAATTTQTT 288
           P+ T  +  P+  P       P ++    S    VAT    +      +  + +     T
Sbjct: 163 PLDTSTTTDPATTPAPA---APVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDT 219

Query: 289 SPPVVSAASQNTSSATQSSTAATQNASPATQSS 321
           +     AA      A    T     ++PA   +
Sbjct: 220 AATPAPAAPATPDGAAPLPTDQAGVSTPAADPN 252



 Score = 29.2 bits (66), Expect = 3.4
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 2/69 (2%)

Query: 273 VTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPA-TQSSNVATQNTP-V 330
           +T+    S+A  +Q +   V    S  T  AT  + AA  + +P  +Q+  VAT   P V
Sbjct: 143 ITTMADQSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAV 202

Query: 331 TNLNNAVAT 339
               NAV  
Sbjct: 203 DPQQNAVVA 211


>gnl|CDD|233161 TIGR00870, trp, transient-receptor-potential calcium channel
           protein.  The Transient Receptor Potential Ca2+ Channel
           (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also
           been called the store-operated calcium channel (SOC)
           family. The prototypical members include the Drosophila
           retinal proteinsTRP and TRPL (Montell and Rubin, 1989;
           Hardie and Minke, 1993). SOC members of the family
           mediate the entry of extracellular Ca2+ into cells in
           responseto depletion of intracellular Ca2+ stores
           (Clapham, 1996) and agonist stimulated production of
           inositol-1,4,5 trisphosphate (IP3). One member of the
           TRP-CCfamily, mammalian Htrp3, has been shown to form a
           tight complex with the IP3 receptor (TC #1.A.3.2.1).
           This interaction is apparently required for IP3
           tostimulate Ca2+ release via Htrp3. The vanilloid
           receptor subtype 1 (VR1), which is the receptor for
           capsaicin (the ?hot? ingredient in chili peppers) and
           servesas a heat-activated ion channel in the pain
           pathway (Caterina et al., 1997), is also a member of
           this family. The stretch-inhibitable non-selective
           cation channel(SIC) is identical to the vanilloid
           receptor throughout all of its first 700 residues, but
           it exhibits a different sequence in its last 100
           residues. VR1 and SICtransport monovalent cations as
           well as Ca2+. VR1 is about 10x more permeable to Ca2+
           than to monovalent ions. Ca2+ overload probably causes
           cell deathafter chronic exposure to capsaicin.
           (McCleskey and Gold, 1999) [Transport and binding
           proteins, Cations and iron carrying compounds].
          Length = 743

 Score = 42.8 bits (101), Expect = 2e-04
 Identities = 50/176 (28%), Positives = 72/176 (40%), Gaps = 39/176 (22%)

Query: 23  TDSF---ESPLTLAACGGHVDLALLLIERGANI-------EEVNMP-TDSF---ESPLTL 68
           T  F    + L LAA   + ++  LL+ERGA++         V     DSF   ESPL  
Sbjct: 122 TSEFTPGITALHLAAHRQNYEIVKLLLERGASVPARACGDFFVKSQGVDSFYHGESPLNA 181

Query: 69  AACGGHVDLALLLIERGANIEEVNDEGYTPL----MEAA-REGHEEMVCVLLQKGAKINA 123
           AAC G   +  LL E  A+I   +  G T L    ME   +  +EE+ C +      +  
Sbjct: 182 AACLGSPSIVALLSEDPADILTADSLGNTLLHLLVMENEFKAEYEELSCQMYNFALSLLD 241

Query: 124 HTEETQETALTLACCGGFLDVADFLLKNGANIELGASTPLMEAAQEGHLELVRYLL 179
              +++E  + L               N   +     TPL  AA+EG + L R  L
Sbjct: 242 KLRDSKELEVIL---------------NHQGL-----TPLKLAAKEGRIVLFRLKL 277



 Score = 38.1 bits (89), Expect = 0.006
 Identities = 36/159 (22%), Positives = 59/159 (37%), Gaps = 36/159 (22%)

Query: 27  ESPLTLAACGG-HVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDL--ALLLI- 82
            S L +AA    +++L  LL+                   L  A    +VD   A+LL  
Sbjct: 53  RSALFVAAIENENLELTELLLNLSCRGAV--------GDTLLHAISLEYVDAVEAILLHL 104

Query: 83  ----ERGANIEEVNDE-------GYTPLMEAAREGHEEMVCVLLQKGAKIN--AHTEETQ 129
                +   +E  ND+       G T L  AA   + E+V +LL++GA +   A  +   
Sbjct: 105 LAAFRKSGPLELANDQYTSEFTPGITALHLAAHRQNYEIVKLLLERGASVPARACGDFFV 164

Query: 130 ETA-----------LTLACCGGFLDVADFLLKNGANIEL 157
           ++            L  A C G   +   L ++ A+I  
Sbjct: 165 KSQGVDSFYHGESPLNAAACLGSPSIVALLSEDPADILT 203



 Score = 36.6 bits (85), Expect = 0.020
 Identities = 34/122 (27%), Positives = 53/122 (43%), Gaps = 20/122 (16%)

Query: 23  TDSF---ESPLTLAACGGHVDLALLLIERGANIE-------EV----NMPTDSFESPLTL 68
            DSF   ESPL  AAC G   +  LL E  A+I         +     M  + F++    
Sbjct: 169 VDSFYHGESPLNAAACLGSPSIVALLSEDPADILTADSLGNTLLHLLVMENE-FKAEYEE 227

Query: 69  AACGGHVDLALLLIERGANIEEV----NDEGYTPLMEAAREGHEEMVCVLLQKGAKINAH 124
            +C  + + AL L+++  + +E+    N +G TPL  AA+EG   +  + L    K    
Sbjct: 228 LSCQMY-NFALSLLDKLRDSKELEVILNHQGLTPLKLAAKEGRIVLFRLKLAIKYKQKKF 286

Query: 125 TE 126
             
Sbjct: 287 VA 288


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 41.6 bits (97), Expect = 4e-04
 Identities = 33/120 (27%), Positives = 45/120 (37%), Gaps = 3/120 (2%)

Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
             ST  V ++    ++ P   TEP   PS    SG  + PP S   Q             
Sbjct: 213 LSSTEPVGTQGTTTSSNPEPQTEP--PPSQRGPSGSPQHPP-STTSQDQSTTGDGQEHTQ 269

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
              + P+TS   +  +T+ P  +   Q T   T   TA TQ+ S    SS    Q  P T
Sbjct: 270 RRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTT 329



 Score = 38.5 bits (89), Expect = 0.004
 Identities = 29/113 (25%), Positives = 43/113 (38%), Gaps = 5/113 (4%)

Query: 217 TVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTST 276
           T ++ Q + +T  PV T+ +   S+P    +T+ PP+      S       ST S   ST
Sbjct: 206 TATANQRLSST-EPVGTQGTTTSSNPEP--QTEPPPSQRGPSGS--PQHPPSTTSQDQST 260

Query: 277 PSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
                  T +  +PP  S      S+AT   T   Q     T      TQ+  
Sbjct: 261 TGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGS 313



 Score = 28.1 bits (62), Expect = 8.9
 Identities = 21/126 (16%), Positives = 41/126 (32%), Gaps = 10/126 (7%)

Query: 207 AFCRGSESTPTVSSKQAVKTTFPPVSTEPSG-KPSDP--PNSGKTKEPPNSGDDQSSVVN 263
             CR +     VS  +  KT      T     + S P  P+    +  P +   Q++  N
Sbjct: 151 PKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATAN 210

Query: 264 VATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNV 323
                    ++ST       TT +++P   +    +    + S        S    ++  
Sbjct: 211 QR-------LSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGD 263

Query: 324 ATQNTP 329
             ++T 
Sbjct: 264 GQEHTQ 269


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 40.5 bits (94), Expect = 0.001
 Identities = 27/102 (26%), Positives = 43/102 (42%), Gaps = 2/102 (1%)

Query: 232 STEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPP 291
           +  P    S P  S  +     S D  S+VV  +T     H+ S P+T +   + +TSPP
Sbjct: 563 AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPP 622

Query: 292 V--VSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
              + + S   SS   S   A+   +    S  VA+  + V+
Sbjct: 623 ASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVS 664



 Score = 36.2 bits (83), Expect = 0.023
 Identities = 25/104 (24%), Positives = 44/104 (42%), Gaps = 8/104 (7%)

Query: 211 GSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTL 270
           GS ++    S  +  TTF   S      PS  P +G    PP +    S +V+ +T    
Sbjct: 567 GSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPAT---PSKIVSPSTSPPA 623

Query: 271 SHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNA 314
           SH+ S       TT  +    +  A+++  S  +    A+T+++
Sbjct: 624 SHLGSPS-----TTPSSPESSIKVASTETASPESSIKVASTESS 662


>gnl|CDD|140343 PTZ00322, PTZ00322,
           6-phosphofructo-2-kinase/fructose-2,6-biphosphatase;
           Provisional.
          Length = 664

 Score = 39.9 bits (93), Expect = 0.002
 Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 4/84 (4%)

Query: 32  LAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEV 91
           LAA G  V   +LL   GA+    +       +PL +A   GHV +  +L+E GA+   +
Sbjct: 89  LAASGDAVGARILL-TGGADPNCRDYDG---RTPLHIACANGHVQVVRVLLEFGADPTLL 144

Query: 92  NDEGYTPLMEAAREGHEEMVCVLL 115
           + +G TPL  A   G  E+V +L 
Sbjct: 145 DKDGKTPLELAEENGFREVVQLLS 168



 Score = 39.9 bits (93), Expect = 0.002
 Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 4/109 (3%)

Query: 68  LAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEE 127
           LAA G  V   +LL   GA+    + +G TPL  A   GH ++V VLL+ GA      ++
Sbjct: 89  LAASGDAVGARILL-TGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTL-LDK 146

Query: 128 TQETALTLACCGGFLDVADFLLKNG-ANIELGASTPLME-AAQEGHLEL 174
             +T L LA   GF +V   L ++   + ELGA+        +   LE 
Sbjct: 147 DGKTPLELAEENGFREVVQLLSRHSQCHFELGANAKPDSFTGKPPSLED 195


>gnl|CDD|165212 PHA02884, PHA02884, ankyrin repeat protein; Provisional.
          Length = 300

 Score = 37.7 bits (87), Expect = 0.006
 Identities = 34/110 (30%), Positives = 50/110 (45%), Gaps = 7/110 (6%)

Query: 38  HVDLALLLIERGANIEEVNMPT--DSFESPLTLAA-CGGHVDLALLLIERGANIEEVNDE 94
           + D+   +++ GA+  E   P   +S  +PL  A  C    D A LLI  GA++    +E
Sbjct: 45  YTDIIDAILKLGAD-PEAPFPLSENSKTNPLIYAIDCDND-DAAKLLIRYGADVNRYAEE 102

Query: 95  G-YTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQET-ALTLACCGGFL 142
              TPL  +   G  + + +LL  GA IN  T +      L L  C  FL
Sbjct: 103 AKITPLYISVLHGCLKCLEILLSYGADINIQTNDMVTPIELALMICNNFL 152



 Score = 34.6 bits (79), Expect = 0.065
 Identities = 23/86 (26%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 74  HVDLALLLIERGANIE---EVNDEGYT-PLMEAAREGHEEMVCVLLQKGAKINAHTEETQ 129
           + D+   +++ GA+ E    +++   T PL+ A    +++   +L++ GA +N + EE +
Sbjct: 45  YTDIIDAILKLGADPEAPFPLSENSKTNPLIYAIDCDNDDAAKLLIRYGADVNRYAEEAK 104

Query: 130 ETALTLACCGGFLDVADFLLKNGANI 155
            T L ++   G L   + LL  GA+I
Sbjct: 105 ITPLYISVLHGCLKCLEILLSYGADI 130


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 37.9 bits (88), Expect = 0.007
 Identities = 15/70 (21%), Positives = 28/70 (40%)

Query: 248 TKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSS 307
               P +  +      V +D+T     +    +A T  +  S    +  S  TS AT ++
Sbjct: 32  VTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTT 91

Query: 308 TAATQNASPA 317
           T+  +  +PA
Sbjct: 92  TSEARTVTPA 101



 Score = 36.0 bits (83), Expect = 0.035
 Identities = 21/87 (24%), Positives = 41/87 (47%), Gaps = 5/87 (5%)

Query: 265 ATDSTLSHVTSTPSTSAATTTQT-TSPPVVSAASQ---NTSSATQSSTAATQNASPAT-Q 319
           A++  L+      +T+ AT+T+   + PV S A++   NT +   ++TAA   +S  T +
Sbjct: 20  ASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAE 79

Query: 320 SSNVATQNTPVTNLNNAVATLKTAVAT 346
           +S+  ++ T  T       T      +
Sbjct: 80  TSDPTSEATDTTTSEARTVTPAATETS 106



 Score = 31.7 bits (72), Expect = 0.60
 Identities = 30/138 (21%), Positives = 47/138 (34%), Gaps = 16/138 (11%)

Query: 198 KRKQEESKAAFCRGSESTPTVSS--KQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSG 255
             K   SK+A           +    QA +      +T    + + P  S  T+E  N+ 
Sbjct: 1   MMKHYFSKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNT- 59

Query: 256 DDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNAS 315
                     T    +     PS+S    T  TS P  S A+  T+S  ++ T A    S
Sbjct: 60  ---------ETPVAATTAAEAPSSS---ETAETSDP-TSEATDTTTSEARTVTPAATETS 106

Query: 316 PATQSSNVATQNTPVTNL 333
              +   V  +    T+L
Sbjct: 107 KPVEGQTVDVRILSTTDL 124



 Score = 31.0 bits (70), Expect = 1.2
 Identities = 11/86 (12%), Positives = 29/86 (33%), Gaps = 5/86 (5%)

Query: 259 SSVVNVATDSTLSHVTSTPSTSAATTTQTTSP----PVVSAASQNTSSATQSSTAATQNA 314
           S+      +  ++   +T + +  TT   +               T++A   S++ T   
Sbjct: 21  SNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAET 80

Query: 315 S-PATQSSNVATQNTPVTNLNNAVAT 339
           S P +++++  T             +
Sbjct: 81  SDPTSEATDTTTSEARTVTPAATETS 106



 Score = 29.1 bits (65), Expect = 4.0
 Identities = 12/77 (15%), Positives = 26/77 (33%), Gaps = 1/77 (1%)

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
           H  S  + +      T S P ++ A +   + T +++   +  +P    +     NT   
Sbjct: 4   HYFSKSAVALTLALLTASNPKLAQA-EEIVTTTPATSTEAEQTTPVESDATEEADNTETP 62

Query: 332 NLNNAVATLKTAVATLN 348
                 A   ++  T  
Sbjct: 63  VAATTAAEAPSSSETAE 79


>gnl|CDD|165195 PHA02859, PHA02859, ankyrin repeat protein; Provisional.
          Length = 209

 Score = 36.3 bits (84), Expect = 0.011
 Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 19/92 (20%)

Query: 19  VNMPTDSFESPLTLAACGGHVDLALL--LIERGANIEEVNMPTDSFESPLTLAACGGH-- 74
           VN   D +E+P+        V++ +L  LIE GA++       ++  S L       H  
Sbjct: 44  VNDCNDLYETPIFSCLEKDKVNVEILKFLIENGADVNFK-TRDNNL-SAL------HHYL 95

Query: 75  -------VDLALLLIERGANIEEVNDEGYTPL 99
                   ++  +LI+ G++I E +++G   L
Sbjct: 96  SFNKNVEPEILKILIDSGSSITEEDEDGKNLL 127


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
           family is conserved in fungi but the function is not
           known.
          Length = 509

 Score = 37.2 bits (86), Expect = 0.012
 Identities = 29/110 (26%), Positives = 49/110 (44%), Gaps = 1/110 (0%)

Query: 218 VSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS-HVTST 276
           V S +  K   P     P+  P  PP+S  T  PP+S    +   +    +  S   TST
Sbjct: 67  VLSIKEKKDLEPLALPMPNLAPITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTST 126

Query: 277 PSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQ 326
            S     + Q+++P + S+ +  +SS    ST+  +  SP+  SS+  + 
Sbjct: 127 TSADCNDSEQSSTPNLNSSDTSTSSSGALPSTSVVRGFSPSHISSSYRST 176



 Score = 28.0 bits (62), Expect = 8.6
 Identities = 17/88 (19%), Positives = 32/88 (36%), Gaps = 3/88 (3%)

Query: 247 KTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQS 306
           K  EP        + +   +        S+ ST+  T          + ++  TS+    
Sbjct: 74  KDLEPLALPMPNLAPITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSADCND 133

Query: 307 STAAT---QNASPATQSSNVATQNTPVT 331
           S  ++    N+S  + SS+ A  +T V 
Sbjct: 134 SEQSSTPNLNSSDTSTSSSGALPSTSVV 161


>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
           component of pyruvate dehydrogenase complex.
          Length = 539

 Score = 36.8 bits (85), Expect = 0.017
 Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 21/111 (18%)

Query: 155 IELGASTPLMEAAQEGHLELVRYLLDSGAQEMQTKHI---------DILEFKKRKQEESK 205
           +E   +T  ME  +EG+L   + +   GA+E++   +         DI +FK  K   S 
Sbjct: 149 VETDKATVEMECMEEGYL--AKIVKGDGAKEIKVGEVIAITVEEEEDIGKFKDYKPSSSA 206

Query: 206 AAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGD 256
           A     ++ +P    ++ V+        +P+  P   P + K   PP+SGD
Sbjct: 207 APAAPKAKPSPPPPKEEEVE--------KPASSPE--PKASKPSAPPSSGD 247


>gnl|CDD|165157 PHA02795, PHA02795, ankyrin-like protein; Provisional.
          Length = 437

 Score = 36.5 bits (84), Expect = 0.018
 Identities = 20/55 (36%), Positives = 27/55 (49%)

Query: 52  IEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREG 106
           IE++N       + L  A   G++DL   L+E GAN+  V   GYT L  A   G
Sbjct: 211 IEDINQLDAGGRTLLYRAIYAGYIDLVSWLLENGANVNAVMSNGYTCLDVAVDRG 265


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 36.7 bits (84), Expect = 0.019
 Identities = 33/124 (26%), Positives = 43/124 (34%), Gaps = 2/124 (1%)

Query: 205 KAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNV 264
           K  F +  ++T +V     +    P  +T     PS PP S          D  S     
Sbjct: 419 KVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGPTVST--ADPTSGTPTG 476

Query: 265 ATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVA 324
            T STL   TS  S + + T   TSP         TS  TQ ++      SP      V 
Sbjct: 477 TTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVT 536

Query: 325 TQNT 328
           T  T
Sbjct: 537 TTAT 540



 Score = 31.7 bits (71), Expect = 0.72
 Identities = 27/141 (19%), Positives = 51/141 (36%), Gaps = 6/141 (4%)

Query: 202 EESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSV 261
             +     + S++    S    V       ++ P+G  S    +  T          ++ 
Sbjct: 510 NATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTS--VPNA-TSPQVTEESPVNNT 566

Query: 262 VNVATDSTLSHVTSTPSTSAATTTQ--TTSPPVVSAASQNTSSATQSSTAATQNASPATQ 319
                 S  S +TS  +T    T    T+  P + ++S +T  +  +ST     ++  T 
Sbjct: 567 NTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTG 626

Query: 320 SSNVATQNTPVTNLNNAVATL 340
             N+ T+ TP       V+TL
Sbjct: 627 GENI-TEETPSVPSTTHVSTL 646



 Score = 30.5 bits (68), Expect = 1.8
 Identities = 29/132 (21%), Positives = 44/132 (33%), Gaps = 1/132 (0%)

Query: 200 KQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQS 259
                 A   + +E +P  ++   V T+ P V T           S  T + P       
Sbjct: 546 TTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSH 605

Query: 260 SVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQ 319
           S     + ST   +TS   T     T+ T P V S    +T S        +Q + P   
Sbjct: 606 STPRSNSTSTTPLLTSAHPTGGENITEET-PSVPSTTHVSTLSPGPGPGTTSQVSGPGNS 664

Query: 320 SSNVATQNTPVT 331
           S++       VT
Sbjct: 665 STSRYPGEVHVT 676



 Score = 29.8 bits (66), Expect = 2.4
 Identities = 23/126 (18%), Positives = 48/126 (38%), Gaps = 2/126 (1%)

Query: 214 STPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHV 273
           ++ T ++         P +T P+ + +    +  +  P   G   ++       +++ + 
Sbjct: 493 TSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNA 552

Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNL 333
           TS   T  +    T +P V SA S  TS+ T         +SP +Q   + + +      
Sbjct: 553 TSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTG--SSPTSQQPGIPSSSHSTPRS 610

Query: 334 NNAVAT 339
           N+   T
Sbjct: 611 NSTSTT 616



 Score = 29.8 bits (66), Expect = 2.9
 Identities = 20/148 (13%), Positives = 39/148 (26%)

Query: 202 EESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSV 261
                     +       S    +    P S+  + + +    +        +G +  + 
Sbjct: 573 SAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITE 632

Query: 262 VNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSS 321
              +  ST    T +P     TT+Q + P   S +                  SP+  S 
Sbjct: 633 ETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSG 692

Query: 322 NVATQNTPVTNLNNAVATLKTAVATLNR 349
                 T  +    A +T K    +   
Sbjct: 693 QKTAVPTVTSTGGKANSTTKETSGSTLM 720



 Score = 29.4 bits (65), Expect = 3.9
 Identities = 23/120 (19%), Positives = 44/120 (36%), Gaps = 2/120 (1%)

Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
           S + PT  +     ++  P  T P+ + +    +  +  P  +  + +S     T  T +
Sbjct: 465 STADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPN 524

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSS--ATQSSTAATQNASPATQSSNVATQNTP 329
             + TP     TTT T+ P   ++    TS     +S    T      +  S + +  T 
Sbjct: 525 ATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTT 584


>gnl|CDD|237592 PRK14040, PRK14040, oxaloacetate decarboxylase; Provisional.
          Length = 593

 Score = 35.7 bits (83), Expect = 0.036
 Identities = 14/71 (19%), Positives = 29/71 (40%), Gaps = 2/71 (2%)

Query: 228 FPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQT 287
           F PV    + +P+       ++      + ++ VV V+    +S +  TP+  AA     
Sbjct: 457 FEPVPQAEAAQPAAKAEPAGSETYTVEVEGKAYVVKVSEGGDISQI--TPAAPAAAPAAA 514

Query: 288 TSPPVVSAASQ 298
            +    +AA +
Sbjct: 515 AAAAPAAAAGE 525


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 35.0 bits (81), Expect = 0.052
 Identities = 23/152 (15%), Positives = 45/152 (29%), Gaps = 22/152 (14%)

Query: 201 QEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSG--KTKEPPNSGD-- 256
              +  +    +      +  Q +  + PP+S  P+        +G  + + P N  D  
Sbjct: 96  SAANNTSDGHDASGVKNTAPPQDI--SAPPISPTPTQAAPPQTPNGQQRIELPGNISDAL 153

Query: 257 -DQSSVVNVATDSTLSHVTSTPSTSA--------ATTTQTTSPPVVSAASQNTSSATQSS 307
             Q   VN A+ +   + ++ P+  A               + P           A    
Sbjct: 154 SQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHH 213

Query: 308 TAATQNASPATQSSNVAT-------QNTPVTN 332
             AT    PAT     +         + P ++
Sbjct: 214 KTATVAVPPATSGKPKSGAASARALSSAPASH 245


>gnl|CDD|222954 PHA02989, PHA02989, ankyrin repeat protein; Provisional.
          Length = 494

 Score = 35.1 bits (81), Expect = 0.055
 Identities = 30/113 (26%), Positives = 56/113 (49%), Gaps = 3/113 (2%)

Query: 39  VDLALLLIERGANIEEVNMPTDS-FESPLTLAACGGHVDLALL-LIERGANIEEVNDEGY 96
           + +   LI++G NIE  N  ++S  ES L         +  +L  I +   I + + +G+
Sbjct: 198 IKVIKYLIKKGVNIETNNNGSESVLESFLDNNKILSKKEFKVLNFILKYIKINKKDKKGF 257

Query: 97  TPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLL 149
            PL+ +A+  + E    LL+ G  I   +++  +T LT A   G +D+ + +L
Sbjct: 258 NPLLISAKVDNYEAFNYLLKLGDDIYNVSKDG-DTVLTYAIKHGNIDMLNRIL 309



 Score = 32.8 bits (75), Expect = 0.31
 Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 25/92 (27%)

Query: 80  LLIERGANIEEVNDEGY--TPL------MEAAREGHEEMVCVLLQKGAKINAHTEETQET 131
           LLI+ GA+   VN +GY  TPL       E      +++V +LL+ GA IN    +T   
Sbjct: 55  LLIDNGAD---VNYKGYIETPLCAVLRNREITSNKIKKIVKLLLKFGADINL---KTFNG 108

Query: 132 ALTLACCGGF--------LDVADFLLKNGANI 155
              + C   F         D+  FLL  G N+
Sbjct: 109 VSPIVC---FIYNSNINNCDMLRFLLSKGINV 137


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 35.2 bits (81), Expect = 0.059
 Identities = 27/100 (27%), Positives = 41/100 (41%), Gaps = 17/100 (17%)

Query: 215 TPTVSSKQ---AVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
           T   +S      V  +  P  TE +   S   N+    E        +SVV   T +   
Sbjct: 764 TSITTSGTFTAEVPQS--PTKTETTLSYSAYSNTSILIE-------TTSVVITKTVTQTQ 814

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAAT 311
             T+T S    + TQTTSP   S ++  T+S +Q++T   
Sbjct: 815 --TTTSSP---SPTQTTSPTQTSTSTTTTTSPSQTTTGGG 849



 Score = 34.1 bits (78), Expect = 0.12
 Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 9/105 (8%)

Query: 223 AVKTTFPPVSTEPSGKPSDPPNSGKTK---------EPPNSGDDQSSVVNVATDSTLSHV 273
                F P +     +  DP +   T          E P S     + ++ +  S  S +
Sbjct: 739 GENWGFIPTTKRVKVRIMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSIL 798

Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
             T S     T   T     S +   T+S TQ+ST+ T   SP+ 
Sbjct: 799 IETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQ 843


>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein.  SOG2 proteins in
           Saccharomyces cerevisiae are involved in cell separation
           and cytokinesis.
          Length = 419

 Score = 34.3 bits (79), Expect = 0.086
 Identities = 20/113 (17%), Positives = 39/113 (34%), Gaps = 2/113 (1%)

Query: 214 STPTVSSKQAVKTTFPPVSTEPSGKPS--DPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
            T + SS  A  T         S  P+    P+S        +  +  S + V TD    
Sbjct: 160 VTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTTLESPSNLQVTTDVPPP 219

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVA 324
           +   T  +S  +++   S     A  ++  S   + T+ + + +P +      
Sbjct: 220 YSNGTSRSSTMSSSANLSIISSLATPRSGESFRSTPTSGSSSINPVSGLDEAE 272


>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB.  Isd proteins are
           iron-regulated surface proteins found in Bacillus,
           Staphylococcus and Listeria species and are responsible
           for heme scavenging from hemoproteins. The IsdB protein
           is only observed in Staphylococcus and consists of an
           N-terminal hydrophobic signal sequence, a pair of tandem
           NEAT (NEAr Transporter, pfam05031) domains which confers
           the ability to bind heme and a C-terminal sortase
           processing signal which targets the protein to the cell
           wall. IsdB is believed to make a direct contact with
           methemoglobin facilitating transfer of heme to IsdB. The
           heme is then transferred to other cell wall-bound NEAT
           domain proteins such as IsdA and IsdC.
          Length = 644

 Score = 34.5 bits (78), Expect = 0.096
 Identities = 31/136 (22%), Positives = 58/136 (42%), Gaps = 7/136 (5%)

Query: 205 KAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNV 264
           K AF + +        +Q            PS KP+ PP   ++++  +  DD     +V
Sbjct: 450 KEAFTKANADKTNKKEQQDNSAKKETTPATPS-KPTTPPVEKESQKQDSQKDDNKQSPSV 508

Query: 265 ATD---STLSHVTSTPSTSAA---TTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
             +   S+ S    TP+T  A     + +T+P  V + +QN +  T +S+  T++    +
Sbjct: 509 EKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTS 568

Query: 319 QSSNVATQNTPVTNLN 334
             S+ A  + P+   N
Sbjct: 569 AGSSEAKDSAPLQKAN 584


>gnl|CDD|240026 cd04669, Nudix_Hydrolase_11, Members of the Nudix hydrolase
           superfamily catalyze the hydrolysis of NUcleoside
           DIphosphates linked to other moieties, X. Enzymes
           belonging to this superfamily require a divalent cation,
           such as Mg2+ or Mn2+, for their activity and contain a
           highly conserved 23-residue nudix motif
           (GX5EX7REUXEEXGU, where U = I, L or V), which functions
           as a metal binding and catalytic site. Substrates of
           nudix hydrolases include intact and oxidatively damaged
           nucleoside triphosphates, dinucleoside polyphosphates,
           nucleotide-sugars and dinucleotide enzymes. These
           substrates are metabolites or cell signaling molecules
           that require regulation during different stages of the
           cell cycle or during periods of stress. In general, the
           role of the nudix hydrolase is to sanitize the
           nucleotide pools and to maintain cell viability, thereby
           serving as surveillance & "house-cleaning" enzymes.
           Substrate specificity is used to define families within
           the superfamily. Differences in substrate specificity
           are determined by the N-terminal extension or by
           residues in variable loop regions. Mechanistically,
           substrate hydrolysis occurs by a nucleophilic
           substitution reaction, with variation in the numbers and
           roles of divalent cations required.
          Length = 121

 Score = 32.0 bits (73), Expect = 0.13
 Identities = 18/55 (32%), Positives = 24/55 (43%), Gaps = 16/55 (29%)

Query: 79  LLLIER-----------GANIEEVNDEGYTPLMEAAREGHEEM-VCVLLQKGAKI 121
           +LLI R           G  IEE    G TP   A RE  EE+ + V +++   I
Sbjct: 14  ILLIRRIKPGKTYYVFPGGGIEE----GETPEEAAKREALEELGLDVRVEEIFLI 64


>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
           Provisional.
          Length = 653

 Score = 33.8 bits (77), Expect = 0.15
 Identities = 26/131 (19%), Positives = 51/131 (38%), Gaps = 10/131 (7%)

Query: 202 EESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPS-GKPSDPPNSGKTKEPPNSGDDQSS 260
           +  +A     S S+   SS + V  T      E   G+ +      K  +P        +
Sbjct: 43  DRMEAGLSSYSSSSDNKSSFEVVSETDSGSEAEAERGRRAGMGGRNKATKPSRR---NKT 99

Query: 261 VVNVATDSTLSHVTSTPST-SAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQ 319
                T   L+   + P+T S+  + + +SPP + + S+    A ++S         ++ 
Sbjct: 100 TQCRPTSLALATAATMPATPSSGKSPKVSSPPSIPSLSEEDEGAERNS-----GGDDSSH 154

Query: 320 SSNVATQNTPV 330
           + N +TQ+ P 
Sbjct: 155 TDNESTQSQPE 165


>gnl|CDD|224343 COG1426, COG1426, Predicted transcriptional regulator contains
           Xre-like HTH domain [Function unknown].
          Length = 284

 Score = 33.2 bits (76), Expect = 0.15
 Identities = 23/98 (23%), Positives = 33/98 (33%), Gaps = 2/98 (2%)

Query: 251 PPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAA 310
                       +VA+ S  S   S  ++S   +  TTS    SA +  T + T  +   
Sbjct: 129 LAQEEPPTLPDQSVASVSQNSQDVSLATSSTTPSEGTTSASPSSATTSFTPTVTAIAPVV 188

Query: 311 TQNASPATQSSNVATQNT-PVTNLN-NAVATLKTAVAT 346
              A P T     A       T      +AT + AV T
Sbjct: 189 APTAKPVTVPKQPAADLAASSTAPAAKEMATGQEAVPT 226


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 33.1 bits (75), Expect = 0.17
 Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 5/73 (6%)

Query: 264 VATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSST----AATQNASPAT- 318
           V T +T +   +TPS++     Q+T P     A Q+  +AT   +    A  Q  +  T 
Sbjct: 20  VITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTP 79

Query: 319 QSSNVATQNTPVT 331
            S+ V T  +P T
Sbjct: 80  SSTKVETPQSPTT 92


>gnl|CDD|240410 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
          Length = 593

 Score = 32.8 bits (75), Expect = 0.24
 Identities = 23/75 (30%), Positives = 34/75 (45%), Gaps = 4/75 (5%)

Query: 247 KTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQS 306
           K  + P     Q+    V T +     T T ++SA T+ Q+ S    S ++ N SS T S
Sbjct: 523 KKSQLPAFVLSQTPEEPVKTKAN----TKTNTSSATTSGQSGSSGSTSNSNSNESSPTMS 578

Query: 307 STAATQNASPATQSS 321
           ST     +S +T  S
Sbjct: 579 STELLNVSSTSTTGS 593



 Score = 28.2 bits (63), Expect = 8.8
 Identities = 11/59 (18%), Positives = 23/59 (38%)

Query: 285 TQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTA 343
           T        +    NTSSAT S  + +  ++  + S+  +   +    LN +  +   +
Sbjct: 535 TPEEPVKTKANTKTNTSSATTSGQSGSSGSTSNSNSNESSPTMSSTELLNVSSTSTTGS 593


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 32.8 bits (75), Expect = 0.26
 Identities = 29/141 (20%), Positives = 44/141 (31%), Gaps = 33/141 (23%)

Query: 192 DILEF-----KKRKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSG 246
           DILEF     +  K +  K         TP    K +  T      +  +     P    
Sbjct: 33  DILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPT-----ESSAASSEEKPAKPR 87

Query: 247 KTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQN------- 299
           K+ E   S   +S                 PST +    +    P   A+          
Sbjct: 88  KSAESTRSSHPKSK---------------APSTESEEEEEPEETPDPIASIGGWWSLWGS 132

Query: 300 -TSSATQSSTAATQNASPATQ 319
            TS+AT +++AA + A  A  
Sbjct: 133 ITSTATSTASAAVKQAEQAVN 153


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
           translational repressor [Translation, ribosomal
           structure and biogenesis].
          Length = 777

 Score = 32.8 bits (75), Expect = 0.32
 Identities = 26/143 (18%), Positives = 53/143 (37%), Gaps = 9/143 (6%)

Query: 211 GSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVAT---D 267
                P+ +S  A  T   P++   + K     +S   ++P  S     S     +    
Sbjct: 42  SFSPIPSKASSSATFTLNLPINNSVNHK--ITSSSSSRRKPSGSWSVAISSSTSGSQSLL 99

Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNAS----PATQSSNV 323
             L   +  PSTS+   + +         + ++ + + S+ ++  N++    P    SN 
Sbjct: 100 MELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSNKLPLPNPNHSNS 159

Query: 324 ATQNTPVTNLNNAVATLKTAVAT 346
           AT N   ++  N  A+  +   T
Sbjct: 160 ATTNQSGSSFINTPASSSSQPLT 182


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 32.5 bits (74), Expect = 0.39
 Identities = 14/90 (15%), Positives = 31/90 (34%), Gaps = 10/90 (11%)

Query: 250 EPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAA----------SQN 299
           E P + +  ++  N  +DS++ + T T S+         +P   +             + 
Sbjct: 137 EQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKP 196

Query: 300 TSSATQSSTAATQNASPATQSSNVATQNTP 329
           T     +S  A+ + +    SS      + 
Sbjct: 197 TQPNQSNSQPASDDTANQKSSSKDNQSMSD 226



 Score = 28.2 bits (63), Expect = 7.7
 Identities = 17/138 (12%), Positives = 36/138 (26%), Gaps = 6/138 (4%)

Query: 197 KKRKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGD 256
           K    E SK    + ++     ++           + + S   S+       K  P +  
Sbjct: 45  KSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNI 104

Query: 257 DQSSVVNVATDSTLSHVTSTP------STSAATTTQTTSPPVVSAASQNTSSATQSSTAA 310
           +Q    N   D+                 S     + +      +   + SS    +   
Sbjct: 105 NQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQ 164

Query: 311 TQNASPATQSSNVATQNT 328
           +     A      ++ NT
Sbjct: 165 SSKQDKADNQKAPSSNNT 182



 Score = 27.8 bits (62), Expect = 10.0
 Identities = 13/95 (13%), Positives = 27/95 (28%), Gaps = 6/95 (6%)

Query: 239 PSDPPNSGKTKEPPNSGDDQSSV--VNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAA 296
                 +        S DD++S    +  TD   ++ TS    +    +   S    ++ 
Sbjct: 32  KDSTAKTTSHDSK-KSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSS---TSD 87

Query: 297 SQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
           S N       +   T      T++      +    
Sbjct: 88  SNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTL 122


>gnl|CDD|233273 TIGR01108, oadA, oxaloacetate decarboxylase alpha subunit.  This
           model describes the bacterial oxaloacetate decarboxylase
           alpha subunit and its equivalents in archaea. The
           oxaloacetate decarboxylase Na+ pump is the paradigm of
           the family of Na+ transport decarboxylases that present
           in bacteria and archaea. It a multi subunit enzyme
           consisting of a peripheral alpha-subunit and integral
           membrane subunits beta and gamma. The energy released by
           the decarboxylation reaction of oxaloacetate is coupled
           to Na+ ion pumping across the membrane [Transport and
           binding proteins, Cations and iron carrying compounds,
           Energy metabolism, Other].
          Length = 582

 Score = 32.5 bits (74), Expect = 0.41
 Identities = 30/139 (21%), Positives = 46/139 (33%), Gaps = 21/139 (15%)

Query: 173 ELVRYLLDSGAQEMQTKHIDIL--EFKKRKQEESKAAFCRGSEST-------PTVSSK-- 221
           EL R +L      +  +  D+L  E  K + E  +A   + S          P V  K  
Sbjct: 378 ELQRKILGDEKPIVDCRPADLLEPELDKLRAEVREAGAEKNSIEDVLTYALFPQVGLKFL 437

Query: 222 --QAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSG------DDQSSVVNVATDSTLSHV 273
             +     F P   E   +       GK +E   SG      + ++ VV V +       
Sbjct: 438 ENRHNPAAFEPKPEEKVIEQEHAQVVGKYEETHASGSYTVEVEGKAFVVKV-SPGGD-VS 495

Query: 274 TSTPSTSAATTTQTTSPPV 292
             T S  A T+  T +   
Sbjct: 496 QITASAPANTSGGTVAAKA 514


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 32.0 bits (72), Expect = 0.47
 Identities = 27/154 (17%), Positives = 49/154 (31%), Gaps = 10/154 (6%)

Query: 189 KHIDILEFKKRKQE-------ESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSD 241
           + I IL   K+ Q        +++A     +     + +   V       S+  +   + 
Sbjct: 257 RRISILVLNKQSQHDIEHENLDNRALDIEKATGLKQIDTHGTVPVAAVTPSSAVTQSSAI 316

Query: 242 PPNSGKTKEP---PNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQ 298
            P+S     P   P+S   QS+    A+   LS     PS      T             
Sbjct: 317 TPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQP 376

Query: 299 NTSSATQSSTAATQNASPATQSSNVATQNTPVTN 332
              S T++  ++T N +        +    P +N
Sbjct: 377 QPMSTTETQQSSTGNITSTANGPTTSLPAAPASN 410


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 31.4 bits (70), Expect = 0.57
 Identities = 26/79 (32%), Positives = 39/79 (49%)

Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQN 327
           +T S   S PST+ +TT  TTS P+ + A  +T++ T +ST  T    P T +++     
Sbjct: 44  TTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVT 103

Query: 328 TPVTNLNNAVATLKTAVAT 346
           T VT  N       T  +T
Sbjct: 104 TKVTAQNITATEAGTGTST 122



 Score = 30.6 bits (68), Expect = 0.88
 Identities = 31/121 (25%), Positives = 48/121 (39%), Gaps = 7/121 (5%)

Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPP-NSGKTKEPPNSGDDQSSVVNVATDSTL 270
           S S P+ +    + TT  P++T      +     S  T   P      +S +NV T  T 
Sbjct: 49  SASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKVTA 108

Query: 271 SHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPV 330
            ++T+T           TS  V S  +  +SS T ++T  T   + A   S+  T N   
Sbjct: 109 QNITAT------EAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATK 162

Query: 331 T 331
           T
Sbjct: 163 T 163



 Score = 29.5 bits (65), Expect = 2.5
 Identities = 29/135 (21%), Positives = 56/135 (41%), Gaps = 6/135 (4%)

Query: 214 STPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHV 273
           ++   S+  A   T     T PS   S P  +  T     S    ++ + ++T++T    
Sbjct: 25  TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAI-LSTNTTTVTS 83

Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNL 333
           T T  T   TT+  ++  V +  +    +AT++ T  +        +SNV T+++  T+ 
Sbjct: 84  TGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGV-----TSNVTTRSSSTTSA 138

Query: 334 NNAVATLKTAVATLN 348
              +    T   TL+
Sbjct: 139 TTRITNATTLAPTLS 153


>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
          Length = 592

 Score = 31.6 bits (71), Expect = 0.60
 Identities = 22/83 (26%), Positives = 31/83 (37%), Gaps = 2/83 (2%)

Query: 233 TEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVAT--DSTLSHVTSTPSTSAATTTQTTSP 290
           T P+ + S    +  TKE P +   +++         +     T TP        Q    
Sbjct: 55  TAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKA 114

Query: 291 PVVSAASQNTSSATQSSTAATQN 313
           P   A   N  +AT SST AT N
Sbjct: 115 PAEPATVSNPDNATSSSTPATYN 137


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
           This model describes a conserved domain found in surface
           proteins of a number of Firmutes. Many members have
           LPXTG C-terminal anchoring motifs and a substantial
           number have the KxYKxGKxW putative sorting signal at the
           N-terminus. The tetracycline resistance plasmid pCF10 in
           Enterococcus faecalis promotes conjugal plasmid transfer
           in response to sex pheromones, but PgrA/Sec10 encoded by
           that plasmid, a member of this family, specifically
           inhibits the ability of cells to receive homologous
           plasmids. The phenomenon is called surface exclusion.
          Length = 356

 Score = 31.2 bits (71), Expect = 0.66
 Identities = 17/54 (31%), Positives = 25/54 (46%)

Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATLN 348
           AA+Q   +  Q   A  Q  +  T  +N+AT    + N    +A  K A+A LN
Sbjct: 292 AAAQAALATAQKELANAQAQALQTAQNNLATAQAALANAEARLAKAKEALANLN 345



 Score = 28.5 bits (64), Expect = 5.0
 Identities = 13/51 (25%), Positives = 23/51 (45%)

Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVA 345
           A +Q  +  T  +  AT  A+ A   + +A     + NLN  +A  + A+ 
Sbjct: 306 ANAQAQALQTAQNNLATAQAALANAEARLAKAKEALANLNADLAKKQAALD 356


>gnl|CDD|165256 PHA02946, PHA02946, ankyin-like protein; Provisional.
          Length = 446

 Score = 31.6 bits (71), Expect = 0.69
 Identities = 16/43 (37%), Positives = 25/43 (58%)

Query: 81  LIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
           L+ RG +  E +D+G  PL  A++  +  +V +LL  GA  NA
Sbjct: 58  LLHRGYSPNETDDDGNYPLHIASKINNNRIVAMLLTHGADPNA 100



 Score = 30.0 bits (67), Expect = 1.8
 Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 6/95 (6%)

Query: 45  LIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAR 104
           L+ RG +  E +   D    PL +A+   +  +  +L+  GA+    + +  TPL   + 
Sbjct: 58  LLHRGYSPNETD---DDGNYPLHIASKINNNRIVAMLLTHGADPNACDKQHKTPLYYLSG 114

Query: 105 EGHE--EMVCVLLQKGAKINAHTEETQETALTLAC 137
              E  E + +L+Q GAKIN   +E     L LAC
Sbjct: 115 TDDEVIERINLLVQYGAKINNSVDEEGCGPL-LAC 148


>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
           Provisional.
          Length = 408

 Score = 31.4 bits (71), Expect = 0.70
 Identities = 23/107 (21%), Positives = 42/107 (39%), Gaps = 4/107 (3%)

Query: 227 TFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQ 286
              PV+  PS        +  TK+ P++  D      +++D     +TS   T+A     
Sbjct: 155 NTTPVADAPSTVLPAEKPTLLTKDMPSAPQD--ETHTLSSDEHEKGLTSAQLTTAQPDDA 212

Query: 287 TTSP--PVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
             +P  P+   A++  + A   ST +   A+ +   +   TQ  P  
Sbjct: 213 PGTPAQPLTPLAAEAQAKAEVISTPSPVTAAASPTITPHQTQPLPTA 259


>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2. 
          Length = 466

 Score = 31.5 bits (72), Expect = 0.73
 Identities = 23/123 (18%), Positives = 40/123 (32%), Gaps = 31/123 (25%)

Query: 211 GSESTPTVSSKQAVKTTFPPVSTEPSGKPSDP---------------------PNSGKTK 249
           G+     V +   V+   PPV  E +  P+DP                     P  G   
Sbjct: 71  GTRPVRVVGTGTPVR---PPVVVESTVGPTDPSIVSLVEESSIIESGAPIPPIPGDGSGF 127

Query: 250 EPPNSGDDQSSVVNVATDSTLSHVTST----PSTSAATTTQTTSPPVVSAASQNTSSATQ 305
               S     ++++V   +   HV+ T    P  +  +  Q   PP  +  S +   + Q
Sbjct: 128 PITTSSTTTPAILDVTPTTRTVHVSRTQYNNPLFTDPSVLQ---PPQPAEVSGHVLVSGQ 184

Query: 306 SST 308
           +  
Sbjct: 185 TIG 187


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 292

 Score = 30.7 bits (69), Expect = 0.92
 Identities = 22/103 (21%), Positives = 39/103 (37%), Gaps = 5/103 (4%)

Query: 246 GKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQ 305
             +     +  ++S  +N AT +        P+++  T+   + PP+   +S  T     
Sbjct: 35  STSSSEAPASAEKSIDLNGATQANAQQPAPGPTSAENTSQDLSLPPI---SSTPTQGQEP 91

Query: 306 SSTAATQNASPATQSSNVATQNTPVTNLNNAVA--TLKTAVAT 346
            +    Q        +N A Q   ++ LNN     TL T  AT
Sbjct: 92  LAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPAT 134



 Score = 29.1 bits (65), Expect = 2.9
 Identities = 28/137 (20%), Positives = 45/137 (32%), Gaps = 25/137 (18%)

Query: 214 STPTVSSKQAVKTTFPPVS-TEPSGKPSDPPNSGKTKEPPNS-------GDDQSSVVNVA 265
             PT +   +   + PP+S T   G+     +  +  E             + S + NVA
Sbjct: 64  PGPTSAENTSQDLSLPPISSTPTQGQEPLAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVA 123

Query: 266 TDSTLSHVTST-----PSTSAATTTQTTSPPVVS------AASQNTSSATQSSTAATQNA 314
             STL    +T      ++         + PV +      A     +    ++ A  Q A
Sbjct: 124 VTSTLPTEPATVAPVRNASVPTAERPAITRPVRAQAVSEPAVEPKAAKTATATEAKVQTA 183

Query: 315 SPATQSSNVATQNTPVT 331
           SPA       T  TP  
Sbjct: 184 SPA------QTPATPPA 194


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 31.0 bits (70), Expect = 0.97
 Identities = 15/86 (17%), Positives = 32/86 (37%), Gaps = 2/86 (2%)

Query: 256 DDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPV--VSAASQNTSSATQSSTAATQN 313
            D   V+     +    V    +T  +  T T SPP    S  S   +SA +  + + ++
Sbjct: 67  KDFVVVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEES 126

Query: 314 ASPATQSSNVATQNTPVTNLNNAVAT 339
           A+     S   +  +  ++  + +  
Sbjct: 127 ATATAPESPSTSVPSSGSDAASTLVV 152



 Score = 30.6 bits (69), Expect = 1.3
 Identities = 15/66 (22%), Positives = 28/66 (42%), Gaps = 3/66 (4%)

Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT-- 331
           T TPS  A+  +  ++ P  SA  + + S   ++  A ++ S +  SS     +T V   
Sbjct: 96  TPTPSPPASPASGMSAAP-ASAVEEKSPSEESATATAPESPSTSVPSSGSDAASTLVVGS 154

Query: 332 NLNNAV 337
                +
Sbjct: 155 ERETTI 160


>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein.  This is a viral
           attachment glycoprotein from region G of metaviruses. It
           is high in serine and threonine suggesting it is highly
           glycosylated.
          Length = 183

 Score = 30.4 bits (68), Expect = 0.97
 Identities = 20/105 (19%), Positives = 37/105 (35%), Gaps = 4/105 (3%)

Query: 232 STEPSGKPSDPPNSGKTKEP----PNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQT 287
           ST P+        +  T  P         D+ +    ++ +  +        +  T+   
Sbjct: 68  STLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQ 127

Query: 288 TSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTN 332
           T P   +AA Q T++   SST      +     S+  TQ+   T+
Sbjct: 128 TPPTTSTAAVQTTTTTNTSSTGKEPTTTSVQPRSSATTQSHEETS 172


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 30.8 bits (70), Expect = 0.97
 Identities = 19/125 (15%), Positives = 32/125 (25%), Gaps = 10/125 (8%)

Query: 228 FPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQT 287
                   S       + G    P    D  S   +    +  S  + TP  S  + T  
Sbjct: 175 EDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNP 234

Query: 288 TSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATL 347
           + PP                      A P   S++      P+  +         A+A  
Sbjct: 235 SPPP----GPAAPPPPPVQQVPPLSTAKPTPPSASATPA--PIGGITLDDD----AIAKA 284

Query: 348 NRHCR 352
            +H +
Sbjct: 285 QKHAK 289


>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
           PatA/PatG family.  This model describes a protease
           domain associated with the maturation of various members
           of the cyanobactin family of ribosomally produced,
           heavily modified bioactive metabolites. Members include
           the PatA protein and C-terminal domain of the PatG
           protein of Prochloron didemni, TenA and a region of TenG
           from Nostoc spongiaeforme var. tenue, etc.
          Length = 602

 Score = 30.8 bits (70), Expect = 1.0
 Identities = 22/109 (20%), Positives = 37/109 (33%), Gaps = 7/109 (6%)

Query: 245 SGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSP---PVVSA---ASQ 298
           +G+T        +Q  V   +       + S+ S          +P   PV +    A+ 
Sbjct: 227 TGETMSEGLVTSEQDGVEEASGCGVQGTIESSTSVIPPGRAAEPAPVSIPVAAPGEGATP 286

Query: 299 NTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATL 347
             +    S+       SPAT     +   TP +   +A   L  A+ TL
Sbjct: 287 AAAQIELSAGVLPNAISPATPVRPASNGVTP-SQAPSAEPQLVYALGTL 334


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 30.9 bits (70), Expect = 1.2
 Identities = 23/115 (20%), Positives = 33/115 (28%), Gaps = 13/115 (11%)

Query: 227 TFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVT---------STP 277
             P  S      P  P  S      P   +  S V   +T S     +         S+ 
Sbjct: 216 FTPHPSLPQPQSPLKPSPSSA---RPQQSESFSDVWPASTQSPREETSAEPLAPASPSSR 272

Query: 278 STSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTN 332
             S A   Q       +   Q   S    S+ +  +  PA  ++   T   P TN
Sbjct: 273 RPSTAQEEQIACSSPQAEPEQGVQSYVPQSSDSRPSCFPAPSTTQ-PTFLPPNTN 326



 Score = 29.8 bits (67), Expect = 2.3
 Identities = 25/135 (18%), Positives = 40/135 (29%), Gaps = 19/135 (14%)

Query: 215 TPTVSSKQAVKTTFPPV-STEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHV 273
           TP      AV           P G+PS    S     PP++    SS+            
Sbjct: 339 TPKQEGGAAVSQNHDGGTVRAPRGRPSGSGQS-----PPSNSPLLSSL------------ 381

Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNL 333
             TPS +A        P VV    ++ +S  Q  T  +       +     +    +   
Sbjct: 382 ADTPSGAAHQPASLLPPAVVQQQLED-ASDKQPPTPGSSLVPQPDEQELGPSVMALLDRD 440

Query: 334 NNAVATLKTAVATLN 348
              +A +   +  L 
Sbjct: 441 QGILALILAPIPGLY 455


>gnl|CDD|234351 TIGR03773, anch_rpt_wall, putative ABC transporter-associated
           repeat protein.  Members of this protein family occur in
           genomes that contain a three-gene ABC transporter operon
           associated with the presence of domain TIGR03769. That
           domain occurs as a single-copy insert in the
           substrate-binding protein, and occurs in two or more
           copies in members of this protein family. Members of
           this family typically are encoded adjacent to the said
           transporter operon and may serve as a substrate
           receptor.
          Length = 513

 Score = 30.7 bits (69), Expect = 1.2
 Identities = 29/173 (16%), Positives = 49/173 (28%), Gaps = 17/173 (9%)

Query: 139 GGFLDVADFLLKNGANIELGASTPLMEAAQEGHLELVRYLLDSGAQEMQTKHIDILEFKK 198
           GG L     L   G + +  +     +A    H+       + G  E+       L    
Sbjct: 98  GGTLGKVTPLFTIGDSSKPAS----FKAEGPTHVHGNWGFTEPGIYELTVTATADLADGG 153

Query: 199 RKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQ 258
            K +        G      +          P      +GKP +  N     +P    D  
Sbjct: 154 AKSKPETYTVVVGKVEVDKID---------PARCATGAGKPQNDANGPAADKP-LFDDP- 202

Query: 259 SSVVNVATDSTLSHVTSTPSTSA--ATTTQTTSPPVVSAASQNTSSATQSSTA 309
           +S V    D +        +     +      +P  V+A     + +T SS A
Sbjct: 203 ASGVQALGDESAFSPGQQATVQIGKSVRLPADAPLGVAAVVVKAAPSTGSSDA 255


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 31.0 bits (70), Expect = 1.2
 Identities = 17/79 (21%), Positives = 24/79 (30%), Gaps = 4/79 (5%)

Query: 208 FCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSD--PPNSGKTKEPPNSGDDQSSVVNVA 265
           F  G+    T     A   + PP    P   P    PP + +    P  G D        
Sbjct: 895 FGPGTGRVATAPGGDA--ASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAE 952

Query: 266 TDSTLSHVTSTPSTSAATT 284
              TL+ + S   +   T 
Sbjct: 953 LRETLADLRSAQRSGDFTA 971



 Score = 29.4 bits (66), Expect = 3.2
 Identities = 25/135 (18%), Positives = 39/135 (28%), Gaps = 38/135 (28%)

Query: 216 PTVSSKQAVKTTFPP----VSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
           PT++  +A+   F P    V+T P G  +  P  G    PP                   
Sbjct: 885 PTLA--EALDQVFGPGTGRVATAPGGDAASAPPPGAG--PP------------------- 921

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
                      TT    +PP               +  A    + A   S  A ++   T
Sbjct: 922 -APPQAVPPPRTTQPPAAPP--------RGPDVPPAAVAELRETLADLRS--AQRSGDFT 970

Query: 332 NLNNAVATLKTAVAT 346
               A+  L+ A+  
Sbjct: 971 AYGAALDRLEKAIDA 985


>gnl|CDD|236782 PRK10871, nlpD, lipoprotein NlpD; Provisional.
          Length = 319

 Score = 30.2 bits (68), Expect = 1.4
 Identities = 25/88 (28%), Positives = 36/88 (40%), Gaps = 3/88 (3%)

Query: 262 VNVATDSTLSHVTSTPST--SAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQ 319
           +NV     + + + TP T  +A T        VV   +QN++ A  S    T + S   Q
Sbjct: 97  LNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQ 156

Query: 320 SSN-VATQNTPVTNLNNAVATLKTAVAT 346
           S+N +   N P      A  T  TA  T
Sbjct: 157 SANKMLPNNKPAATTVTAPVTAPTASTT 184


>gnl|CDD|220944 pfam11018, Cuticle_3, Pupal cuticle protein C1.  Insect cuticles
           are composite structures whose mechanical properties are
           optimised for biological function. The major components
           are the chitin filament system and the cuticular
           proteins, and the cuticle's properties are determined
           largely by the interactions between these two sets of
           molecules. The proteins can be ordered by species.
          Length = 164

 Score = 29.6 bits (66), Expect = 1.7
 Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 1/58 (1%)

Query: 289 SPPVVSAASQNTS-SATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVA 345
           S P V +  QNT  S     T +  + +  T  S+V   +T ++N     A  KTA A
Sbjct: 1   SAPAVGSTHQNTVRSFGGLGTVSHYSKAVDTPYSSVRKSDTRISNNAYQPAYAKTAYA 58


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 30.1 bits (67), Expect = 1.9
 Identities = 18/112 (16%), Positives = 33/112 (29%), Gaps = 7/112 (6%)

Query: 214 STPTVSSKQAVKTTFPPVSTEPSGKPSDP-------PNSGKTKEPPNSGDDQSSVVNVAT 266
             P  +  QA      P     S     P       P + +  +P  +    +S      
Sbjct: 40  PDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPA 99

Query: 267 DSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
            +        P  + A T+   +    + A  + +S      A TQ++ P  
Sbjct: 100 VAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHSPPPF 151


>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 263 and 318 amino acids in length.
           There is a conserved SFT sequence motif. MSA-2 is a
           plasma membrane glycoprotein which can be found in
           Babesia bovis species.
          Length = 201

 Score = 29.4 bits (66), Expect = 2.0
 Identities = 12/45 (26%), Positives = 16/45 (35%)

Query: 285 TQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
                     ++SQ TS+   S  +AT N S      N A Q   
Sbjct: 137 APAEYYSPKHSSSQGTSTTRPSDGSATPNTSAPPTPGNPAAQPEK 181



 Score = 28.6 bits (64), Expect = 3.6
 Identities = 15/56 (26%), Positives = 22/56 (39%), Gaps = 8/56 (14%)

Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
             +P  S++  T TT P      S    SAT +++A     +PA Q    A     
Sbjct: 141 YYSPKHSSSQGTSTTRP------SDG--SATPNTSAPPTPGNPAAQPEKPAETPKG 188


>gnl|CDD|225010 COG2099, CobK, Precorrin-6x reductase [Coenzyme metabolism].
          Length = 257

 Score = 29.6 bits (67), Expect = 2.1
 Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 11/50 (22%)

Query: 139 GGFLDV---ADFLLKNGANIELGASTP--------LMEAAQEGHLELVRY 177
           GGFL     A FL + G ++ + A+ P           AA+E  +  +R 
Sbjct: 50  GGFLGAEGLAAFLREEGIDLLIDATHPYAARISQNAARAAKETGIPYLRL 99


>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
           transduction mechanisms].
          Length = 612

 Score = 29.9 bits (67), Expect = 2.3
 Identities = 23/112 (20%), Positives = 36/112 (32%), Gaps = 17/112 (15%)

Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
           + S+    S   VKT      + P G+ S   N        +   D     +++    L 
Sbjct: 117 TRSSGNNLSNANVKT-----LSAPVGEHSRSNNP----PNLDQNLDTEPESSISQWGELQ 167

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNV 323
              S  + S        S P     S+N  S + +S   T   SP    S +
Sbjct: 168 LNPSGKTLS--------SQPSRKPTSENPKSESDNSKLPTSVNSPLPDKSLL 211


>gnl|CDD|220430 pfam09829, DUF2057, Uncharacterized protein conserved in bacteria
           (DUF2057).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 189

 Score = 29.2 bits (66), Expect = 2.4
 Identities = 10/47 (21%), Positives = 15/47 (31%)

Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLK 341
            A  N  S   ++  A   A     + +VA         N A   L+
Sbjct: 126 LADYNQKSGAAAAVPAPATAVTTAAAVSVAATTASAQTQNEAEQMLQ 172


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 29.8 bits (67), Expect = 2.5
 Identities = 24/113 (21%), Positives = 33/113 (29%), Gaps = 6/113 (5%)

Query: 210 RGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDST 269
             S      S      ++  P    PS  PS P +      P  S    SS    ++ ST
Sbjct: 271 EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRAS-SSSSSSRESSSSST 329

Query: 270 LSHVTSTPSTSAATTTQTTSPPVVSAASQNTS-SATQSSTAATQNASPATQSS 321
               +S+  +S          P  S +       A  SS       S A  S 
Sbjct: 330 ----SSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSP 378



 Score = 29.0 bits (65), Expect = 5.5
 Identities = 21/121 (17%), Positives = 34/121 (28%), Gaps = 4/121 (3%)

Query: 211 GSESTPTVSSKQAVKTTFPPVSTEPS-GKPSDPPNSGKTKEPPNSGDDQSSVVNVATDST 269
           GS + P  SS      T PP S  PS                P       +         
Sbjct: 104 GSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA 163

Query: 270 LSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
               +S  +    ++ + T+    S  +        +  AA     P   S   A+ ++P
Sbjct: 164 SDAASSRQAALPLSSPEETARAPSSPPA---EPPPSTPPAAASPRPPRRSSPISASASSP 220

Query: 330 V 330
            
Sbjct: 221 A 221


>gnl|CDD|235467 PRK05441, murQ, N-acetylmuramic acid-6-phosphate etherase;
           Reviewed.
          Length = 299

 Score = 29.4 bits (67), Expect = 2.6
 Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 9/49 (18%)

Query: 31  TLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLAL 79
            L A  G V LA+++I  G +  E             LA  GG +  AL
Sbjct: 256 ALEAADGSVKLAIVMILTGLDAAEAKA---------LLARHGGFLRKAL 295



 Score = 29.0 bits (66), Expect = 3.8
 Identities = 16/57 (28%), Positives = 26/57 (45%), Gaps = 7/57 (12%)

Query: 39  VDLALL---LIERGANI--EEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEE 90
           VD+      L++R   I  E   +  +  E+   L A  G V LA+++I  G +  E
Sbjct: 225 VDVKATNEKLVDRAVRIVMEATGVSREEAEA--ALEAADGSVKLAIVMILTGLDAAE 279


>gnl|CDD|227502 COG5175, MOT2, Transcriptional repressor [Transcription].
          Length = 480

 Score = 29.6 bits (66), Expect = 2.6
 Identities = 30/177 (16%), Positives = 60/177 (33%), Gaps = 15/177 (8%)

Query: 177 YLLDSGAQEMQTKHIDILEFKKRKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPS 236
           YL + G ++      ++   + +          +      T +++    T     +  PS
Sbjct: 221 YLHEPGPEKDSLTKDELCNSQHKLHGSEVRNKNKKRIHRSTSTAR--YDTDLLNFTGTPS 278

Query: 237 GKPSDPPNSGKTK-----------EPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTT 285
               +     KT             P +  + QS+   V   ++ S    T + S    T
Sbjct: 279 PAAMEAQFKHKTSRVFKAPDKILFPPLDFTNTQSA-TPVTLSNSSSINLPTLNDSLGHHT 337

Query: 286 QTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKT 342
           +TT+    +A S +  S  + S AA +   P     N A     ++N      ++K+
Sbjct: 338 ETTTTTNTNATSHSHGSKKKQSLAAEEYKDPYDALGNAARL-HSLSNYQKRPISIKS 393


>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
          Length = 486

 Score = 29.6 bits (66), Expect = 2.7
 Identities = 16/73 (21%), Positives = 37/73 (50%), Gaps = 3/73 (4%)

Query: 259 SSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
           +S     +D++ + + STP+T   TT+  TSP      + N+++A  +  + TQ + P +
Sbjct: 30  TSSSIENSDNSTAEMQSTPATPTHTTSNLTSP--FGTGTDNSTNANGTE-STTQASQPHS 86

Query: 319 QSSNVATQNTPVT 331
             + +    + ++
Sbjct: 87  HETTITCTKSLIS 99


>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
            synthase PfaA.  Members of the seed for this alignment
            are involved in omega-3 polyunsaturated fatty acid
            biosynthesis, such as the protein PfaA from the
            eicosapentaenoic acid biosynthesis operon in
            Photobacterium profundum strain SS9. PfaA is encoded
            together with PfaB, PfaC, and PfaD, and the functions of
            the individual polypeptides have not yet been described.
            More distant homologs of PfaA, also included with the
            reach of this model, appear to be involved in
            polyketide-like biosynthetic mechanisms of
            polyunsaturated fatty acid biosynthesis, an alternative
            to the more familiar iterated mechanism of chain
            extension and desaturation, and in most cases are encoded
            near genes for homologs of PfaB, PfaC, and/or PfaD.
          Length = 2582

 Score = 30.0 bits (67), Expect = 2.8
 Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 2/59 (3%)

Query: 264  VATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSST--AATQNASPATQS 320
            V T + +  VT + + +A       SPPVVSAA   + +   +    A    A P  QS
Sbjct: 1132 VVTQAPVVQVTISVAPAAPVLPAVVSPPVVSAAPAQSVATAVAMAPVAEVPIAVPVQQS 1190


>gnl|CDD|147982 pfam06112, Herpes_capsid, Gammaherpesvirus capsid protein.  This
           family consists of several Gammaherpesvirus capsid
           proteins. The exact function of this family is unknown.
          Length = 148

 Score = 28.7 bits (64), Expect = 2.9
 Identities = 15/65 (23%), Positives = 29/65 (44%), Gaps = 2/65 (3%)

Query: 265 ATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVA 324
           A   T S + S  + SA++++ +  P   +  S ++ SA  S   +  ++S  + S   A
Sbjct: 80  AGPQTSSSIGS--ALSASSSSASGVPGGANQLSGSSGSALSSGPGSLSSSSSLSGSGAGA 137

Query: 325 TQNTP 329
               P
Sbjct: 138 GDTAP 142


>gnl|CDD|191179 pfam05053, Menin, Menin.  MEN1, the gene responsible for multiple
           endocrine neoplasia type 1, is a tumour suppressor gene
           that encodes a protein called Menin which may be an
           atypical GTPase stimulated by nm23.
          Length = 618

 Score = 29.6 bits (66), Expect = 2.9
 Identities = 23/93 (24%), Positives = 34/93 (36%), Gaps = 9/93 (9%)

Query: 232 STEPSGKPSDPP-------NSGKTKEPPNSGDDQS--SVVNVATDSTLSHVTSTPSTSAA 282
           S EPSG  S  P       NS       N  D +   +    AT ++    TS P   ++
Sbjct: 489 SQEPSGGESPNPELPANNNNSNSNNNNNNGADRKEAAATTGNATTTSNGSGTSVPLPVSS 548

Query: 283 TTTQTTSPPVVSAASQNTSSATQSSTAATQNAS 315
              Q    PV++  S+      +   A   N+S
Sbjct: 549 EPPQHKEGPVITFYSEKMKGMKELLLAEKLNSS 581


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 29.4 bits (66), Expect = 3.1
 Identities = 17/110 (15%), Positives = 27/110 (24%), Gaps = 1/110 (0%)

Query: 224 VKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAAT 283
           ++    PV      KP+    S     P  S   +++             T+TP      
Sbjct: 356 IEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR 415

Query: 284 TTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNL 333
                 P    +A + T +A            PA               L
Sbjct: 416 PVAPPVPHTPESAPKLTRAAIPVDEKPKYTP-PAPPKEEEKALIADGDVL 464


>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
          Length = 432

 Score = 29.3 bits (66), Expect = 3.2
 Identities = 14/60 (23%), Positives = 21/60 (35%), Gaps = 3/60 (5%)

Query: 265 ATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTS---SATQSSTAATQNASPATQSS 321
           A DS L+          A T  T +    + ++   S   +A   +    QN  P  Q S
Sbjct: 158 AYDSPLAVDIGKVEKRTADTNDTATQTTNATSTAGASDLYAAAMRAGLGGQNVDPNGQKS 217


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 27/133 (20%), Positives = 40/133 (30%), Gaps = 9/133 (6%)

Query: 213 ESTPTVSSKQAVKTTFPPVSTE-PSGK-----PSDPPNSGKTKEPPNSGDDQSSVVNVAT 266
           E  PT S    +  T    S E  SGK     P   P   + +E   + + Q   V    
Sbjct: 817 ERYPTQS---PMPLTVACASPEMASGKVWIRYPVVRPQDVQVEEQREAEEVQVQPVVAEV 873

Query: 267 DSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQ 326
               +      +       +    PVV A  Q        +T     A+P T+   V T+
Sbjct: 874 PVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTEQPQVITE 933

Query: 327 NTPVTNLNNAVAT 339
           +        A   
Sbjct: 934 SDVAVAQEVAEHA 946


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 29.4 bits (66), Expect = 3.5
 Identities = 15/80 (18%), Positives = 24/80 (30%), Gaps = 10/80 (12%)

Query: 252 PNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTA-- 309
           P +    S+    +   + S   + PS   + T    +PP VS            STA  
Sbjct: 386 PAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQ 445

Query: 310 --------ATQNASPATQSS 321
                     +    +  SS
Sbjct: 446 AVRPAQFKEEKKIPVSKVSS 465



 Score = 28.6 bits (64), Expect = 5.9
 Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 3/56 (5%)

Query: 284 TTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVAT 339
            TQ  + P  SAA+  + S +QSS AA      A QS+       P  +++   A 
Sbjct: 383 FTQPAAAPQPSAAAAASPSPSQSSAAAQ---PSAPQSATQPAGTPPTVSVDPPAAV 435


>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
           This family includes attachment proteins from
           respiratory synctial virus. Glycoprotein G has not been
           shown to have any neuraminidase or hemagglutinin
           activity. The amino terminus is thought to be
           cytoplasmic, and the carboxyl terminus extracellular.
           The extracellular region contains four completely
           conserved cysteine residues.
          Length = 263

 Score = 28.9 bits (64), Expect = 3.6
 Identities = 19/86 (22%), Positives = 29/86 (33%), Gaps = 3/86 (3%)

Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
             +TPT      ++       T+ +   + P N   T    ++  D ++     T ST  
Sbjct: 62  PTTTPTQQITNQIQNHTSTYLTQHNQLSTSPSNQSTTTPLIHTILDDTT---PGTKSTYQ 118

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAAS 297
           H T        T  QT  PP     S
Sbjct: 119 HTTVGTKGRTTTPAQTNKPPTKPRQS 144


>gnl|CDD|165089 PHA02716, PHA02716, CPXV016; CPX019; EVM010; Provisional.
          Length = 764

 Score = 29.5 bits (66), Expect = 3.7
 Identities = 13/25 (52%), Positives = 16/25 (64%)

Query: 75  VDLALLLIERGANIEEVNDEGYTPL 99
           +D+   LI  GA+I  VN  GYTPL
Sbjct: 379 LDVIQCLISLGADITAVNCLGYTPL 403


>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
           This family represents the C-terminal region of
           merozoite surface protein 1 (MSP1) which are found in a
           number of Plasmodium species. MSP-1 is a 200-kDa protein
           expressed on the surface of the P. vivax merozoite.
           MSP-1 of Plasmodium species is synthesised as a
           high-molecular-weight precursor and then processed into
           several fragments. At the time of red cell invasion by
           the merozoite, only the 19-kDa C-terminal fragment
           (MSP-119), which contains two epidermal growth
           factor-like domains, remains on the surface. Antibodies
           against MSP-119 inhibit merozoite entry into red cells,
           and immunisation with MSP-119 protects monkeys from
           challenging infections. Hence, MSP-119 is considered a
           promising vaccine candidate.
          Length = 574

 Score = 29.1 bits (65), Expect = 3.7
 Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 1/85 (1%)

Query: 253 NSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQ 312
           N+  + +  VN A +     +    +  A  TT  T P   +A S  +     SS +A+ 
Sbjct: 238 NTNPENNPEVNEALEQYKELLPKGTTQEAKVTTVVTPPQADAAPSPLSVRPAGSSGSASG 297

Query: 313 NASPATQSSNVATQNTPVTNLNNAV 337
           +    T  S V       T L   V
Sbjct: 298 STQIPTSGS-VLGPGAAATELQQVV 321


>gnl|CDD|183757 PRK12800, fliF, flagellar MS-ring protein; Reviewed.
          Length = 574

 Score = 29.2 bits (65), Expect = 3.8
 Identities = 13/54 (24%), Positives = 21/54 (38%)

Query: 283 TTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNA 336
           T+T  T P     A+ N+     +  AA    +PA  +   A    P  +  +A
Sbjct: 307 TSTSATGPQGPPGATSNSPGQPPAPAAAGAPGTPAAANGQAAAAAAPTESSKSA 360


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 29.0 bits (64), Expect = 4.3
 Identities = 18/111 (16%), Positives = 40/111 (36%)

Query: 218 VSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTP 277
           ++ KQ   T  P    +     ++   +   KE  N+    +      T    +    T 
Sbjct: 241 LAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTE 300

Query: 278 STSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNT 328
           +   A    T +    +  + NT++   +++  ++N +  T S+     NT
Sbjct: 301 AAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNT 351



 Score = 29.0 bits (64), Expect = 4.6
 Identities = 23/115 (20%), Positives = 45/115 (39%), Gaps = 3/115 (2%)

Query: 200 KQEESKAAFCRGSESTPTV---SSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGD 256
           K E    A     ++ P V   ++     T     +T+    P  P  + K    P++  
Sbjct: 253 KAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNT 312

Query: 257 DQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAAT 311
           + +        +T +  TSTPS +  T T + +    +  +   SS   S+++A+
Sbjct: 313 NANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSAS 367



 Score = 28.6 bits (63), Expect = 5.0
 Identities = 21/118 (17%), Positives = 43/118 (36%), Gaps = 6/118 (5%)

Query: 231 VSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSP 290
           ++ + +   + P    KT+ P      +     V  ++T ++  +T      T  QT   
Sbjct: 241 LAIKQTANTATPKAEVKTEAPAA----EKQAAPVVKENTNTNTATTEKKETTTQQQTAPK 296

Query: 291 PVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATLN 348
               AA    + A  ++T A +  +    ++N    +TP  N N    +     +  N
Sbjct: 297 APTEAA--KPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTN 352


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 28.9 bits (64), Expect = 4.8
 Identities = 17/138 (12%), Positives = 42/138 (30%), Gaps = 7/138 (5%)

Query: 218 VSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSG------DDQSSVVNVATDSTLS 271
           + +K +       ++       +     GK++    S       DD    +    +   +
Sbjct: 70  LENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKANPNEA 129

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQ-NTSSATQSSTAATQNASPATQSSNVATQNTPV 330
                P  +      ++S     A+++  + SA++   + + + S        A+    V
Sbjct: 130 GKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGKASASVAGIV 189

Query: 331 TNLNNAVATLKTAVATLN 348
                A    K  +  L 
Sbjct: 190 GADEEAPPAPKNTLTPLE 207


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 28.7 bits (64), Expect = 4.9
 Identities = 25/115 (21%), Positives = 36/115 (31%), Gaps = 12/115 (10%)

Query: 220 SKQAVKTTFPPVSTEPSGKPSDPP--NSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTP 277
           S     TT     +  + +   PP     K KE   S    +  +         H  S  
Sbjct: 598 SATVTSTTRNGNVSPHTPQDEQPPSGRKRKRKEEVESEPLGNQYL-------RHHNGSEK 650

Query: 278 STSAATTTQTTSPPVVSAASQNTSSATQ---SSTAATQNASPATQSSNVATQNTP 329
               A     TS   V+ +S    + +    +S+  T    P     NVATQ  P
Sbjct: 651 DGLPAPMDPVTSCSPVADSSTRVDTPSHELVTSSPQTPGDPPKKNKVNVATQCDP 705


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 28.2 bits (63), Expect = 5.1
 Identities = 18/75 (24%), Positives = 30/75 (40%), Gaps = 1/75 (1%)

Query: 251 PPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAA 310
                DD+ S   V T +T   +T+T S+ +  TT  ++    S     T + T S +  
Sbjct: 59  GATPSDDEDSEP-VTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSET 117

Query: 311 TQNASPATQSSNVAT 325
               +  T S+   T
Sbjct: 118 DTEEATTTVSTETPT 132



 Score = 27.4 bits (61), Expect = 9.5
 Identities = 16/75 (21%), Positives = 31/75 (41%), Gaps = 7/75 (9%)

Query: 241 DPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNT 300
            PP    T   P++    +S     + +  + VT+T S S   T + T        + +T
Sbjct: 76  TPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEAT-------TTVST 128

Query: 301 SSATQSSTAATQNAS 315
            + T+  ++A  + S
Sbjct: 129 ETPTEGGSSAATDPS 143


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 28.9 bits (64), Expect = 5.5
 Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 2/49 (4%)

Query: 267 DSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNAS 315
           DST S  T+ PS S  TTT  +S P  SA     SS+  + T +T + S
Sbjct: 272 DSTSSPSTTRPSGSTTTTTPASSGP--SAPGGPGSSSRNAVTRSTDSIS 318


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 28.3 bits (63), Expect = 5.5
 Identities = 13/29 (44%), Positives = 19/29 (65%)

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNT 300
           HVT+T +T  ATTT TT+ P  +  S ++
Sbjct: 248 HVTNTTTTHLATTTTTTTSPSFTIPSNSS 276


>gnl|CDD|185214 PRK15314, PRK15314, outer membrane protein RatB; Provisional.
          Length = 2435

 Score = 29.0 bits (64), Expect = 5.5
 Identities = 22/82 (26%), Positives = 38/82 (46%), Gaps = 1/82 (1%)

Query: 252  PNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAAT 311
            PN+G  Q   +N     T S  T+T +       +T++P  ++  S    SA  ++  A 
Sbjct: 2117 PNTGYYQFINLNSGGKGTASSSTATGAQVCLVEPRTSTPASITLTSTAMDSAKNAAVVAK 2176

Query: 312  QNASPATQSSNVATQNTPVTNL 333
             +A P T +   ++ N PV N+
Sbjct: 2177 GSAMPLTVTVKDSSGN-PVANV 2197


>gnl|CDD|173184 PRK14721, flhF, flagellar biosynthesis regulator FlhF; Provisional.
          Length = 420

 Score = 28.4 bits (63), Expect = 6.0
 Identities = 16/59 (27%), Positives = 25/59 (42%)

Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQ 326
           S+L+  T   +T   TT + T+PP    AS            ATQ  +    S+N+  +
Sbjct: 49  SSLTSDTPEEATIPETTVKPTAPPRQKPASGQPQPPAIHKQPATQPPAADIPSANIMQE 107


>gnl|CDD|222925 PHA02743, PHA02743, Viral ankyrin protein; Provisional.
          Length = 166

 Score = 27.9 bits (62), Expect = 6.1
 Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 1/55 (1%)

Query: 113 VLLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIELGASTPLMEAA 167
           +L+  GA INA    T  T L +A      ++A++L +    + LGA     E A
Sbjct: 78  LLVNMGADINARELGTGNTLLHIAASTKNYELAEWLCRQ-LGVNLGAINYQHETA 131


>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
           Provisional.
          Length = 357

 Score = 28.4 bits (62), Expect = 6.1
 Identities = 20/108 (18%), Positives = 34/108 (31%), Gaps = 1/108 (0%)

Query: 212 SESTPTVSSKQAVKTTFPPV-STEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTL 270
           + + P  ++    K   PP  +  P  K + PP          +     +    A  +  
Sbjct: 239 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAA 298

Query: 271 SHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
               +     AA      + P   AA+    +AT  + AA   A  A 
Sbjct: 299 PAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
           family consists of several special lobe-specific silk
           protein SSP160 sequences which appear to be specific to
           Chironomus (Midge) species.
          Length = 758

 Score = 28.6 bits (63), Expect = 6.1
 Identities = 15/57 (26%), Positives = 34/57 (59%)

Query: 254 SGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAA 310
           SG   +S  + ++ ++ S+  ST S ++ T++ +T+    S +S N++S+  +S A+
Sbjct: 100 SGSGSASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSGLTSGAS 156


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 28.4 bits (63), Expect = 6.4
 Identities = 23/103 (22%), Positives = 36/103 (34%), Gaps = 2/103 (1%)

Query: 229 PPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSV--VNVATDSTLSHVTSTPSTSAATTTQ 286
           PP S+  S K  D        +   S    S+V  V  A          +P         
Sbjct: 390 PPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKP 449

Query: 287 TTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
            TSP   +    + S ++ SS  A  + +PAT +++ A     
Sbjct: 450 PTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPA 492


>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966).  Family of
           plant proteins with unknown function.
          Length = 308

 Score = 28.2 bits (63), Expect = 6.4
 Identities = 20/114 (17%), Positives = 39/114 (34%), Gaps = 17/114 (14%)

Query: 210 RGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPP-NSGDDQSSVVNVATDS 268
           +GSE   + SSK   +      S+    + SD P   +      +S D     V      
Sbjct: 84  KGSEILDSSSSKGDPEEA----SSRKLQEESDTPPVNRRANQSWSSSDLAEYKV------ 133

Query: 269 TLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSN 322
                       A  +TQT       ++   ++  ++   +   ++S  + SS+
Sbjct: 134 ------YKAEEPADASTQTDDRRSRDSSEAESTELSREEISPPSSSSSPSSSSS 181


>gnl|CDD|219071 pfam06525, SoxE, Sulfocyanin (SoxE).  This family consists of
           several archaeal sulfocyanin (or blue copper protein)
           sequences from a number of Sulfolobus species.
          Length = 195

 Score = 27.9 bits (62), Expect = 6.9
 Identities = 10/46 (21%), Positives = 22/46 (47%)

Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPA 317
            + S+P++++ T++ TT PP       ++S+ T        +  P 
Sbjct: 29  SMLSSPTSTSTTSSSTTLPPGAITLPYDSSNKTVFIYLVVLSTGPT 74


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 28.2 bits (62), Expect = 7.1
 Identities = 11/56 (19%), Positives = 20/56 (35%), Gaps = 7/56 (12%)

Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
           T+   TSA+   + + P       +   S   S+T A +     T +   +T    
Sbjct: 82  TNEEKTSASKIEKISQPK-----QEEQKSLNISATPAPKQEQSQTTTE--STTPKT 130


>gnl|CDD|224680 COG1766, fliF, Flagellar basal body M-ring protein [Cell motility
           and secretion].
          Length = 545

 Score = 28.1 bits (63), Expect = 7.2
 Identities = 18/70 (25%), Positives = 25/70 (35%), Gaps = 7/70 (10%)

Query: 273 VTSTPSTSAATTTQTTSPPV---VSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
           VT+    SA  +T+    P     +  S+ T ++   S   TQ   P    SN      P
Sbjct: 267 VTAELDFSAKESTEEKYDPNGREPAVRSEQTVTSLSQSGQGTQGGVPGALESNQ----PP 322

Query: 330 VTNLNNAVAT 339
           V    N   T
Sbjct: 323 VAGNQNGKDT 332


>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
          Length = 328

 Score = 28.0 bits (62), Expect = 7.4
 Identities = 17/73 (23%), Positives = 29/73 (39%), Gaps = 2/73 (2%)

Query: 273 VTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTN 332
              T     A T + T P    A+++  ++ +  +   T   +P   +S   T  TP   
Sbjct: 171 TAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAG 230

Query: 333 LNNA--VATLKTA 343
              A  V +LK+A
Sbjct: 231 GKTAGNVGSLKSA 243


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 27.4 bits (61), Expect = 7.4
 Identities = 14/93 (15%), Positives = 27/93 (29%), Gaps = 3/93 (3%)

Query: 197 KKRKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVST--EPSGKPSDPPNSGKT-KEPPN 253
           +K K+ E  +     + S P ++           V+       +P+      +    P  
Sbjct: 44  EKAKKLEQSSDAKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRP 103

Query: 254 SGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQ 286
                  V  V   S+++ VT          T 
Sbjct: 104 LFTAAREVKTVVPVSSVTPVTPPRPLRTGEQTA 136


>gnl|CDD|202716 pfam03646, FlaG, FlaG protein.  Although important for flagella the
           exact function of this protein is unknown.
          Length = 107

 Score = 26.8 bits (60), Expect = 7.5
 Identities = 11/58 (18%), Positives = 23/58 (39%)

Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATLNRHCR 352
           AA  ++ +A +   +A Q  +   + +    +      L  AV  L   + +LN +  
Sbjct: 1   AAPSSSQTAAEKVVSAAQAVAEEAEETAAEEKEVSKEELEEAVEKLNKFLQSLNTNLE 58


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
           This family consists of several brain acid soluble
           protein 1 (BASP1) or neuronal axonal membrane protein
           NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
           calmodulin-binding protein of unknown function.
          Length = 233

 Score = 27.9 bits (61), Expect = 7.5
 Identities = 20/109 (18%), Positives = 48/109 (44%), Gaps = 5/109 (4%)

Query: 230 PVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTS 289
           P ++E S +P++     K +E      +       A  +  +   + P++ +  ++   +
Sbjct: 127 PKASEASSQPAESAAPAKEEEKSKEEGEAKKTEAPAAAAQETKSDAAPASDSKPSSSEAA 186

Query: 290 PPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVA 338
           P     +S+ T +AT++ ++  + ++PA  +  V     P  N +  VA
Sbjct: 187 P-----SSKETPAATEAPSSTAKASAPAAPAEEVKPSEAPAANSDQTVA 230


>gnl|CDD|203841 pfam08030, NAD_binding_6, Ferric reductase NAD binding domain. 
          Length = 149

 Score = 27.3 bits (61), Expect = 7.6
 Identities = 14/70 (20%), Positives = 27/70 (38%), Gaps = 1/70 (1%)

Query: 170 GHLELVRYLLDSGAQEMQTKHIDILEFKKRKQEESKAAFCRGSESTPTVSSKQAVKTTFP 229
             LE  + +L+   +  +  +I+I  +   + E   A+    SE      SK   + +  
Sbjct: 45  SSLEWFKDVLNELEELKELGNIEIHIYLTGEYEAEDASDASDSEQISMFDSKNHEEISGT 104

Query: 230 PVSTEPSGKP 239
            V     G+P
Sbjct: 105 RVEFHF-GRP 113


>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein.  This family of
           trypanosomal proteins resemble vertebrate mucins. The
           protein consists of three regions. The N and C terminii
           are conserved between all members of the family, whereas
           the central region is not well conserved and contains a
           large number of threonine residues which can be
           glycosylated. Indirect evidence suggested that these
           genes might encode the core protein of parasite mucins,
           glycoproteins that were proposed to be involved in the
           interaction with, and invasion of, mammalian host cells.
           This family contains an N-terminal signal peptide.
          Length = 143

 Score = 27.5 bits (60), Expect = 7.7
 Identities = 17/61 (27%), Positives = 31/61 (50%)

Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQN 327
           +T +  T+ P+T+  TTT TT+    +  +  T++ T ++T  T+  S  T +S   T  
Sbjct: 46  TTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTT 105

Query: 328 T 328
            
Sbjct: 106 D 106


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 28.1 bits (63), Expect = 8.0
 Identities = 31/118 (26%), Positives = 43/118 (36%), Gaps = 14/118 (11%)

Query: 211 GSESTPTVSSKQAVKTTF----PPVSTEPSGKPSDPPNSGKTKEPPNSGDDQ---SSVVN 263
           GS S P  +     K ++     P  +   G PS  P       P   GD     +S + 
Sbjct: 175 GSLSGPGTAGTTVAKQSYALPARPDLSGGMGTPSVQP------APAPQGDVLPVSNSTLK 228

Query: 264 VATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSS 321
            + D T + VTS+    A TT                SSA  ++ AA    + AT SS
Sbjct: 229 -SEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAPSSAPATAPAAAAPQAAATSSS 285


>gnl|CDD|235513 PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine
          decarboxylase/phosphopantothenate synthase; Validated.
          Length = 399

 Score = 28.2 bits (64), Expect = 8.1
 Identities = 19/68 (27%), Positives = 30/68 (44%), Gaps = 26/68 (38%)

Query: 39 VDLALLLIERGANIEEVNMPTDS---FESPLTL-AACG----------------GHVDLA 78
          ++L   L + GA++  V   T++   F +PLT  A  G                GH++LA
Sbjct: 23 LELVRRLRKAGADVRVV--MTEAAKKFVTPLTFQALSGNPVSTDLWDPAAEAAMGHIELA 80

Query: 79 ----LLLI 82
              L+LI
Sbjct: 81 KWADLVLI 88


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 27.7 bits (62), Expect = 8.1
 Identities = 11/55 (20%), Positives = 16/55 (29%)

Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSN 322
           ST+             TT    P      +         +   TQ A+P TQ+  
Sbjct: 117 STVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVE 171


>gnl|CDD|236709 PRK10531, PRK10531, acyl-CoA thioesterase; Provisional.
          Length = 422

 Score = 27.8 bits (62), Expect = 8.4
 Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 3/64 (4%)

Query: 264 VATDSTLSHVTSTP---STSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQS 320
           +A   TL+  +STP     S  T   T S P++SA+     +A     +    A+P   S
Sbjct: 13  LAFGVTLTACSSTPPDQRPSDQTAPGTASRPILSASEAKNFTAAHYFASLGPIAAPWNPS 72

Query: 321 SNVA 324
               
Sbjct: 73  PITL 76


>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827).  This
           family contains the human KIAA1549 protein which has
           been found to be fused fused to BRAF gene in many cases
           of pilocytic astrocytomas. The fusion is due mainly to a
           tandem duplication of 2 Mb at 7q34. Although nothing is
           known about the function of KIAA1549 protein, the BRAF
           protein is a well characterized oncoprotein. It is a
           serine/threonine protein kinase which is implicated in
           MAP/ERK signalling, a critical pathway for the
           regulation of cell division, differentiation and
           secretion.
          Length = 684

 Score = 28.0 bits (62), Expect = 8.4
 Identities = 22/91 (24%), Positives = 31/91 (34%), Gaps = 9/91 (9%)

Query: 229 PPVSTEPSGKPSDPPNSGKTKEP------PNSGDDQ--SSVVNVATDSTLSHVTSTPSTS 280
              S  PS  PS       TKE       P SG D+  SS         LS  +S P   
Sbjct: 403 REKSGRPSTTPSVTAQQKPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDPYDR 462

Query: 281 AATTTQTTS-PPVVSAASQNTSSATQSSTAA 310
           ++   Q  +  P+ +         ++   AA
Sbjct: 463 SSGKIQLIAMQPMPAPPVPPRFEPSRDDRAA 493


>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive. 
            Pullulan is an unusual, industrially important
            polysaccharide in which short alpha-1,4 chains
            (maltotriose) are connected in alpha-1,6 linkages.
            Enzymes that cleave alpha-1,6 linkages in pullulan and
            release maltotriose are called pullulanases although
            pullulan itself may not be the natural substrate. In
            contrast, a glycogen debranching enzyme such GlgX,
            homologous to this family, can release glucose at
            alpha,1-6 linkages from glycogen first subjected to limit
            degradation by phosphorylase. Characterized members of
            this family include a surface-located pullulanase from
            Streptococcus pneumoniae (PMID:11083842) and an
            extracellular bifunctional amylase/pullulanase with
            C-terminal pullulanase activity (PMID:8798645).
          Length = 1111

 Score = 28.3 bits (63), Expect = 8.4
 Identities = 13/56 (23%), Positives = 18/56 (32%), Gaps = 4/56 (7%)

Query: 211  GSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKE---PPNSGDDQSSVVN 263
              E TP    +   +   PP       KP D  +    K+   PP S D+      
Sbjct: 1009 APEKTPP-PPEHEPQAPKPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGETPK 1063


>gnl|CDD|132697 TIGR03658, IsdH_HarA, haptoglobin-binding heme uptake protein HarA.
            HarA is a heme-binding NEAT-domain (NEAr Transporter,
           pfam05031) protein which has been shown to bind to the
           haptoglobin-hemoglobin complex in order to extract heme
           from it. HarA has also been reported to bind hemoglobin
           directly. HarA (also known as IsdH) contains three NEAT
           domains as well as a sortase A C-terminal signal for
           localization to the cell wall. The heme bound at the
           third of these NEAT domains has been shown to be
           transferred to the IsdA protein also localized at the
           cell wall, presumably through an additional specific
           protein-protein interaction. Haptoglobin is a hemoglobin
           carrier protein involved in scavenging hemoglobin in the
           blood following red blood cell lysis and targetting it
           to the liver.
          Length = 895

 Score = 28.3 bits (62), Expect = 8.6
 Identities = 21/65 (32%), Positives = 30/65 (46%), Gaps = 4/65 (6%)

Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQN 327
           STL  +TS     AA  T T+      + +QN ++ T      T    PATQ +N A +N
Sbjct: 27  STLFLITSQHQAQAAENTNTSDK---ISENQNNNATTTQPPKDTNQTQPATQPANTA-KN 82

Query: 328 TPVTN 332
            P  +
Sbjct: 83  YPAAD 87


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 28.0 bits (62), Expect = 9.4
 Identities = 16/95 (16%), Positives = 30/95 (31%)

Query: 235 PSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVS 294
             G  + PP    + +         S   +         T  P+     + +TT  P  +
Sbjct: 168 AEGTLAAPPLGEGSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTT 227

Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
            +  +T+    S+T A   A    ++       TP
Sbjct: 228 TSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262


>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675).  The
           members of this family are sequences derived from
           hypothetical plant proteins of unknown function. One
           member of this family is annotated as a putative
           RNA-binding protein, but no evidence was found to
           support this.
          Length = 283

 Score = 27.6 bits (61), Expect = 9.4
 Identities = 29/146 (19%), Positives = 53/146 (36%), Gaps = 22/146 (15%)

Query: 185 EMQTKHIDILEFKKRKQEESKAAFCRGSESTPTVS------------SKQAVKTTFPPVS 232
           EMQ+         KRK+ E +       +   +++            S + V+T  P  +
Sbjct: 77  EMQSLR---RLEAKRKRSEKEYNGVSNGDDMDSINAANGGGSGRDKESSKGVETGSPIAT 133

Query: 233 TEPSGKPS-----DPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQT 287
           +  SG PS     +        +   +   +S V N+ T +  +   + P  +    + T
Sbjct: 134 SSSSGGPSSSGLSELDVGVIGGQSGQTKQVRSPVNNILTGTESTSHGTDPVDAETNKSNT 193

Query: 288 TSPPVVSAASQNTSSATQSSTAATQN 313
           TSP     + +  S  T S   A+ N
Sbjct: 194 TSPT--KRSRRPPSVDTNSGGEASLN 217


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.122    0.340 

Gapped
Lambda     K      H
   0.267   0.0742    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,274,125
Number of extensions: 1487585
Number of successful extensions: 2642
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2228
Number of HSP's successfully gapped: 404
Length of query: 352
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 254
Effective length of database: 6,590,910
Effective search space: 1674091140
Effective search space used: 1674091140
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 59 (26.5 bits)