RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4869
         (2435 letters)



>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score =  235 bits (601), Expect = 2e-69
 Identities = 97/293 (33%), Positives = 157/293 (53%), Gaps = 11/293 (3%)

Query: 358 LEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGECENIGVMSGHTGAVMDLKFSTDG 417
           L+GH G + C  + PDG+ +A+   D  I +W +    E +  + GHTG V D+  S DG
Sbjct: 5   LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLET-GELLRTLKGHTGPVRDVAASADG 63

Query: 418 CHIFTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDP 477
            ++ + S+D+T+ +WDLE G+ ++ + GH+++V+S      G++ ++S S D T+KVWD 
Sbjct: 64  TYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDV 122

Query: 478 RKKNQAVSMNN-TYQVTSVAFNDTAECVLTGGIDNDIKMWDLRTNSVVQKLRGHSDTVTG 536
                  ++   T  V SVAF+     V +   D  IK+WDLRT   V  L GH+  V  
Sbjct: 123 ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS 182

Query: 537 LSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFEKNLLRCAWSVSGLYV 596
           ++ SPDG  +LS++ D T+++WD+       +C+  + GH    E  +   A+S  G  +
Sbjct: 183 VAFSPDGEKLLSSSSDGTIKLWDLS----TGKCLGTLRGH----ENGVNSVAFSPDGYLL 234

Query: 597 TAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPKEPIIMSASSDKTIYL 649
            +GS D  + +WD  T      L GH  SV  + + P    + S S+D TI +
Sbjct: 235 ASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRI 287



 Score =  204 bits (521), Expect = 1e-58
 Identities = 105/349 (30%), Positives = 154/349 (44%), Gaps = 64/349 (18%)

Query: 1078 LYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTH 1137
            L GH   V  +  S D  L+ATGSGD T+KVW L+ G+  ++L  H   V  V       
Sbjct: 5    LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGT 64

Query: 1138 YFFTTSKDGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVK 1197
            Y  + S D  ++ WD +  E + TL       GH   V S+  S D  ++++ S D+T+K
Sbjct: 65   YLASGSSDKTIRLWDLETGECVRTLT------GHTSYVSSVAFSPDGRILSSSSRDKTIK 118

Query: 1198 VWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIVTLHFNPN 1257
            VW ++ G C  +L  H D V  V F          S DG                     
Sbjct: 119  VWDVETGKCLTTLRGHTDWVNSVAF----------SPDGTF------------------- 149

Query: 1258 VYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQV 1317
                    V +   D ++KLW L                 R+ K ++ L   T     +V
Sbjct: 150  --------VASSSQDGTIKLWDL-----------------RTGKCVATLTGHT----GEV 180

Query: 1318 LCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGD 1377
                 SPD + L  S  D T+K++ L T K   +L GH+  V S+  S D  L+A+GS D
Sbjct: 181  NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED 240

Query: 1378 RTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1426
             T++VW L  G+C ++L  H +SVT + + P      + S DG ++ WD
Sbjct: 241  GTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289



 Score =  192 bits (489), Expect = 2e-54
 Identities = 83/254 (32%), Positives = 132/254 (51%), Gaps = 10/254 (3%)

Query: 397 NIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPV 456
               + GHTG V  + FS DG  + T S D T+ VWDLE G+ ++ +KGH+  V      
Sbjct: 1   LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60

Query: 457 RRGQLLIASGSDDCTVKVWDPRKKNQAVSMNN-TYQVTSVAFNDTAECVLTGGIDNDIKM 515
             G  L +  SD  T+++WD        ++   T  V+SVAF+     + +   D  IK+
Sbjct: 61  ADGTYLASGSSDK-TIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKV 119

Query: 516 WDLRTNSVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSG 575
           WD+ T   +  LRGH+D V  ++ SPDG+++ S++ D T+++WD+R      +CV  ++G
Sbjct: 120 WDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLR----TGKCVATLTG 175

Query: 576 HQHNFEKNLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPKE 635
           H       +   A+S  G  + + S+D  + +WD +T +    L GH   VN V F P  
Sbjct: 176 H----TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDG 231

Query: 636 PIIMSASSDKTIYL 649
            ++ S S D TI +
Sbjct: 232 YLLASGSEDGTIRV 245



 Score =  191 bits (486), Expect = 5e-54
 Identities = 91/253 (35%), Positives = 138/253 (54%), Gaps = 11/253 (4%)

Query: 358 LEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGECENIGVMSGHTGAVMDLKFSTDG 417
           L+GH G +       DG Y+AS   D+ I +W +    E +  ++GHT  V  + FS DG
Sbjct: 47  LKGHTGPVRDVAASADGTYLASGSSDKTIRLWDL-ETGECVRTLTGHTSYVSSVAFSPDG 105

Query: 418 CHIFTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDP 477
             + + S D+T+ VWD+E G+ +  ++GH+ +VNS          +AS S D T+K+WD 
Sbjct: 106 RILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVA-FSPDGTFVASSSQDGTIKLWDL 164

Query: 478 RKKN-QAVSMNNTYQVTSVAFNDTAECVLTGGIDNDIKMWDLRTNSVVQKLRGHSDTVTG 536
           R     A    +T +V SVAF+   E +L+   D  IK+WDL T   +  LRGH + V  
Sbjct: 165 RTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224

Query: 537 LSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFEKNLLRCAWSVSGLYV 596
           ++ SPDG  + S + D T+R+WD+R       CV+ +SGH +    ++   AWS  G  +
Sbjct: 225 VAFSPDGYLLASGSEDGTIRVWDLRT----GECVQTLSGHTN----SVTSLAWSPDGKRL 276

Query: 597 TAGSADKCVYIWD 609
            +GSAD  + IWD
Sbjct: 277 ASGSADGTIRIWD 289



 Score =  185 bits (471), Expect = 6e-52
 Identities = 87/320 (27%), Positives = 139/320 (43%), Gaps = 39/320 (12%)

Query: 882  QGHHSEVRALAFSSDNLALVSACA-SQVKIWNRPSLSCLRTIDTGSYALSVC-FVPGDRH 939
            +GH   V  +AFS D   L +      +K+W+  +   LRT+   +  +          +
Sbjct: 6    KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTY 65

Query: 940  VLVGTKDGRLLIVDIGAGEILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTGGGDK 999
            +  G+ D  + + D+  GE +  +  H+  + SVA  PD          +I + +   DK
Sbjct: 66   LASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG---------RI-LSSSSRDK 115

Query: 1000 SVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSL 1059
            ++K+W +E          +     R H              + V     SPD   +A S 
Sbjct: 116  TIKVWDVE--------TGKCLTTLRGH-------------TDWVNSVAFSPDGTFVASSS 154

Query: 1060 LDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKS 1119
             D T+K++ L T K   +L GH   V S+  S D   + + S D T+K+W L  G C  +
Sbjct: 155  QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGT 214

Query: 1120 LLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLD 1179
            L  HE+ V  V F P  +   + S+DG ++ WD    E + T      L GH   V SL 
Sbjct: 215  LRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQT------LSGHTNSVTSLA 268

Query: 1180 MSYDSTLIATGSGDRTVKVW 1199
             S D   +A+GS D T+++W
Sbjct: 269  WSPDGKRLASGSADGTIRIW 288



 Score =  184 bits (469), Expect = 1e-51
 Identities = 82/279 (29%), Positives = 123/279 (44%), Gaps = 36/279 (12%)

Query: 1168 LYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTH 1227
            L GH   V  +  S D  L+ATGSGD T+KVW L+ G+  ++L  H   V  V       
Sbjct: 5    LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGT 64

Query: 1228 YFFTTSKDGRVKQWDADNFERIVTL----------HFNPNVYLPLQIQVVTGGGDKSVKL 1277
            Y  + S D  ++ WD +  E + TL           F+P+  +     + +   DK++K+
Sbjct: 65   YLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRI-----LSSSSRDKTIKV 119

Query: 1278 WQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTT 1337
            W +E          +     R H              + V     SPD   +A S  D T
Sbjct: 120  WDVE--------TGKCLTTLRGH-------------TDWVNSVAFSPDGTFVASSSQDGT 158

Query: 1338 VKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAH 1397
            +K++ L T K   +L GH   V S+  S D   + + S D T+K+W L  G C  +L  H
Sbjct: 159  IKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH 218

Query: 1398 EDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIVTL 1436
            E+ V  V F P  +   + S+DG ++ WD    E + TL
Sbjct: 219  ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL 257



 Score =  136 bits (344), Expect = 5e-35
 Identities = 83/274 (30%), Positives = 131/274 (47%), Gaps = 35/274 (12%)

Query: 882  QGHHSEVRALAFSSDNLALVSACA-SQVKIWNRPSLSCLRTIDTG--SYALSVCFVPGDR 938
            +GH   VR +A S+D   L S  +   +++W+  +  C+RT+ TG  SY  SV F P  R
Sbjct: 48   KGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTL-TGHTSYVSSVAFSPDGR 106

Query: 939  HVLVGTKDGRLLIVDIGAGEILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTGGGD 998
             +   ++D  + + D+  G+ L  +  H+  + SVA  PD             V +   D
Sbjct: 107  ILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDG----------TFVASSSQD 156

Query: 999  KSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVS 1058
             ++KLW L                 R+ K ++ L   T     +V     SPD + L  S
Sbjct: 157  GTIKLWDL-----------------RTGKCVATLTGHT----GEVNSVAFSPDGEKLLSS 195

Query: 1059 LLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHK 1118
              D T+K++ L T K   +L GH+  V S+  S D  L+A+GS D T++VW L  G+C +
Sbjct: 196  SSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQ 255

Query: 1119 SLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1152
            +L  H +SVT + + P      + S DG ++ WD
Sbjct: 256  TLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289



 Score =  130 bits (330), Expect = 4e-33
 Identities = 67/207 (32%), Positives = 99/207 (47%), Gaps = 45/207 (21%)

Query: 354 PIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSV-YGECENIGVMSGHTGAVMDLK 412
            +  L GH   +    + PDG ++ASS  D  I +W +  G+C  +  ++GHTG V  + 
Sbjct: 127 CLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKC--VATLTGHTGEVNSVA 184

Query: 413 FSTDGCHIFTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTV 472
           FS DG  + + S+D T+ +WDL  G+ +  ++GH   VNS         L+ASGS+D T+
Sbjct: 185 FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVA-FSPDGYLLASGSEDGTI 243

Query: 473 KVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVLTGGIDNDIKMWDLRTNSVVQKLRGHSD 532
           +V                                         WDLRT   VQ L GH++
Sbjct: 244 RV-----------------------------------------WDLRTGECVQTLSGHTN 262

Query: 533 TVTGLSLSPDGSYILSNAMDNTVRIWD 559
           +VT L+ SPDG  + S + D T+RIWD
Sbjct: 263 SVTSLAWSPDGKRLASGSADGTIRIWD 289



 Score =  112 bits (281), Expect = 1e-26
 Identities = 56/202 (27%), Positives = 96/202 (47%), Gaps = 13/202 (6%)

Query: 36  GRFLATGASED-VIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDSSQLAVAY 94
           G +LA+G+S+  + +WDL   E              L G    V  ++ + D   L+ + 
Sbjct: 63  GTYLASGSSDKTIRLWDLETGECV-------RT---LTGHTSYVSSVAFSPDGRILSSSS 112

Query: 95  TNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECGLHR 154
            + ++K + ++T   ++T  GH   +  + + P G  +A+ S+D  I LWD+     +  
Sbjct: 113 RDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVAT 172

Query: 155 LSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMRE 214
           L+GH G +  + F S  G   + SS+ D  +K+WD  TG C  T+  H   V  V    +
Sbjct: 173 LTGHTGEVNSVAF-SPDGEKLLSSSS-DGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPD 230

Query: 215 DSYLISGSNDAELKVWNVRDRS 236
              L SGS D  ++VW++R   
Sbjct: 231 GYLLASGSEDGTIRVWDLRTGE 252



 Score =  111 bits (280), Expect = 1e-26
 Identities = 47/163 (28%), Positives = 82/163 (50%), Gaps = 2/163 (1%)

Query: 71  LPGEKHEVCQLSPNHDSSQLAVAYTNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGH 130
           L G    V  ++ + D   LA    +G++K + L+T +++ T  GH   +  +     G 
Sbjct: 5   LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGT 64

Query: 131 RLATGSKDTDIVLWDVVAECGLHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDA 190
            LA+GS D  I LWD+     +  L+GH   ++ + F   P    + SS++D  +K+WD 
Sbjct: 65  YLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFS--PDGRILSSSSRDKTIKVWDV 122

Query: 191 DTGDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKVWNVR 233
           +TG C  T+  H   V  V    + +++ S S D  +K+W++R
Sbjct: 123 ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLR 165



 Score =  109 bits (275), Expect = 8e-26
 Identities = 49/197 (24%), Positives = 94/197 (47%), Gaps = 14/197 (7%)

Query: 37  RFLATGASED--VIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDSSQLAVAY 94
             + + +S D  + +WD+   +    L G  H           V  ++ + D + +A + 
Sbjct: 105 GRILSSSSRDKTIKVWDVETGKCLTTLRG--HTDW--------VNSVAFSPDGTFVASSS 154

Query: 95  TNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECGLHR 154
            +G++K + L T   ++T TGH   +  + + P G +L + S D  I LWD+     L  
Sbjct: 155 QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGT 214

Query: 155 LSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMRE 214
           L GH+  +  + F   P  + + S ++D  +++WD  TG+C +T++ H   V  +    +
Sbjct: 215 LRGHENGVNSVAFS--PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPD 272

Query: 215 DSYLISGSNDAELKVWN 231
              L SGS D  +++W+
Sbjct: 273 GKRLASGSADGTIRIWD 289



 Score =  104 bits (262), Expect = 4e-24
 Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 2/131 (1%)

Query: 110 ISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECGLHRLSGHKGVITDIRFMS 169
             T  GH   +T + + P G  LATGS D  I +WD+     L  L GH G + D+   +
Sbjct: 2   RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA 61

Query: 170 QPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKV 229
                ++ S + D  +++WD +TG+C +T+  H + V  V    +   L S S D  +KV
Sbjct: 62  D--GTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKV 119

Query: 230 WNVRDRSDIDT 240
           W+V     + T
Sbjct: 120 WDVETGKCLTT 130



 Score =  102 bits (255), Expect = 3e-23
 Identities = 50/140 (35%), Positives = 77/140 (55%), Gaps = 1/140 (0%)

Query: 1307 HTRTLKL-EEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMS 1365
              RTLK     V C   SPD KLLA    D T+K++ L+T +   +L GH  PV  +  S
Sbjct: 1    LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60

Query: 1366 YDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQW 1425
             D T +A+GS D+T+++W L+ G+C ++L  H   V+ V F P      ++S+D  +K W
Sbjct: 61   ADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW 120

Query: 1426 DADNFERIVTLHICSCSLNS 1445
            D +  + + TL   +  +NS
Sbjct: 121  DVETGKCLTTLRGHTDWVNS 140



 Score = 74.3 bits (183), Expect = 7e-14
 Identities = 43/158 (27%), Positives = 80/158 (50%), Gaps = 13/158 (8%)

Query: 33  NQEGRFLATGASED-VIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDSSQLA 91
           + +G F+A+ + +  + +WDLR           K  A L  G   EV  ++ + D  +L 
Sbjct: 144 SPDGTFVASSSQDGTIKLWDLR---------TGKCVATL-TGHTGEVNSVAFSPDGEKLL 193

Query: 92  VAYTNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECG 151
            + ++G++K + L T   + T  GH++ +  + + P G+ LA+GS+D  I +WD+     
Sbjct: 194 SSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEC 253

Query: 152 LHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWD 189
           +  LSGH   +T + +   P    + S + D  ++IWD
Sbjct: 254 VQTLSGHTNSVTSLAWS--PDGKRLASGSADGTIRIWD 289



 Score = 65.0 bits (159), Expect = 9e-11
 Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 1/82 (1%)

Query: 352 FAPIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGECENIGVMSGHTGAVMDL 411
              +  L GH   +    + PDG  +AS   D  I +W +    E +  +SGHT +V  L
Sbjct: 209 GKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDL-RTGECVQTLSGHTNSVTSL 267

Query: 412 KFSTDGCHIFTCSTDQTLAVWD 433
            +S DG  + + S D T+ +WD
Sbjct: 268 AWSPDGKRLASGSADGTIRIWD 289



 Score = 58.5 bits (142), Expect = 1e-08
 Identities = 30/123 (24%), Positives = 56/123 (45%), Gaps = 10/123 (8%)

Query: 23  NCNVVFVTLKNQEGRFLATGASEDVIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLS 82
              V  V       + L++ +   + +WDL                  L G ++ V  ++
Sbjct: 177 TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS----------TGKCLGTLRGHENGVNSVA 226

Query: 83  PNHDSSQLAVAYTNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIV 142
            + D   LA    +G+++ + L T + + T +GH +++T + + P G RLA+GS D  I 
Sbjct: 227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIR 286

Query: 143 LWD 145
           +WD
Sbjct: 287 IWD 289



 Score = 32.7 bits (75), Expect = 2.0
 Identities = 12/46 (26%), Positives = 20/46 (43%)

Query: 195 CFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKVWNVRDRSDIDT 240
             +T+  H   V  V    +   L +GS D  +KVW++     + T
Sbjct: 1   LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRT 46


>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease. 
          Length = 213

 Score =  133 bits (338), Expect = 5e-35
 Identities = 71/268 (26%), Positives = 96/268 (35%), Gaps = 89/268 (33%)

Query: 2174 IPETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSV 2233
            +PE+FD RE     K  +  V DQG C SCW                             
Sbjct: 1    LPESFDWRE-----KGAVTPVKDQGQCGSCW----------------------------- 26

Query: 2234 WDKAISAASVMSDRICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAW 2293
               A SA   +  R CI++   V   LS Q L+  C   T  +       C GG    A+
Sbjct: 27   ---AFSAVGALEGRYCIKTGKLV--SLSEQQLV-DC--DTGNNG------CNGGLPDNAF 72

Query: 2294 MYWI-NAGLVDGGDY---GTH-----DVSMGRYIEGIGHAASVMGSSNPE---------- 2334
             Y   N G+V   DY             S  +Y + I     V    N E          
Sbjct: 73   EYIKKNGGIVTESDYPYTAHDGTCKFKKSNSKYAK-IKGYGDV--PYNDEEALQAALAKN 129

Query: 2335 ------VNNFEKVIRLYS--------CEGSINPRYIHSVKIIGWGKSSQNEPYWLCTNSY 2380
                  ++ +E   +LY         C G ++    H+V I+G+G  +   PYW+  NS+
Sbjct: 130  GPVSVAIDAYEDDFQLYKSGVYKHTECSGELD----HAVLIVGYGTEN-GVPYWIVKNSW 184

Query: 2381 NQGWGEQGLFKIRRGVNMCSIEDSVMAG 2408
               WGE G F+I RGVN C I       
Sbjct: 185  GTDWGENGYFRIARGVNECGIASEASYP 212


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score =  137 bits (346), Expect = 7e-34
 Identities = 104/309 (33%), Positives = 159/309 (51%), Gaps = 20/309 (6%)

Query: 349 SNLFAPIMLLEGHGGEIFCSKYHPDGQYIAS-SGYDRQIFIWSVYGECENIGVMSGHTGA 407
           S     I  LEGH   +    + PDG+ +AS S  D  I +W +    + +  ++GHT  
Sbjct: 142 STPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTG-KPLSTLAGHTDP 200

Query: 408 VMDLKFSTDG-CHIFTCSTDQTLAVWDLEKGQRIK-KMKGHSTFVNSCDPVRRGQLLIAS 465
           V  L FS DG   I + S+D T+ +WDL  G+ ++  + GHS  V S         L+AS
Sbjct: 201 VSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSS--FSPDGSLLAS 258

Query: 466 GSDDCTVKVWDPRKKNQAVSMN--NTYQVTSVAFNDTAECVLTGGIDNDIKMWDLRTNSV 523
           GS D T+++WD R  +  +     ++  V SVAF+   + + +G  D  +++WDL T  +
Sbjct: 259 GSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKL 318

Query: 524 VQ--KLRGHSDTVTGLSLSPDGSYILSNAM-DNTVRIWDIRPYVPGERCVKVMSGHQHNF 580
           +    L+GH   V+ LS SPDGS ++S    D T+R+WD+R      + +K + GH    
Sbjct: 319 LSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLR----TGKPLKTLEGH---- 370

Query: 581 EKNLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPKEPIIMS 640
             N+L  ++S  G  V++GS D  V +WD +T  +   L GH   V  + F P    + S
Sbjct: 371 -SNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLAS 429

Query: 641 ASSDKTIYL 649
            SSD TI L
Sbjct: 430 GSSDNTIRL 438



 Score =  135 bits (341), Expect = 3e-33
 Identities = 117/427 (27%), Positives = 178/427 (41%), Gaps = 41/427 (9%)

Query: 1023 SRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLY--G 1080
                 +  L        E+ +     SPD +LL     D T+K++ LD  +  I      
Sbjct: 48   DSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGL 107

Query: 1081 HKLPVLSLDMSY---DSTLIATGSGDRTVKVWGL-DYGDCHKSLLAHEDSVTGVTFVPKT 1136
            H   V  L +S    +S L+A+ S D TVK+W L   G   ++L  H +SVT + F P  
Sbjct: 108  HDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDG 167

Query: 1137 HYFFTTS-KDGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMSYDST-LIATGSGDR 1194
                + S  DG +K WD    + + TL       GH  PV SL  S D   LIA+GS D 
Sbjct: 168  KLLASGSSLDGTIKLWDLRTGKPLSTLA------GHTDPVSSLAFSPDGGLLIASGSSDG 221

Query: 1195 TVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIV---- 1250
            T+++W L  G   +S L+        +F P      + S DG ++ WD  +   ++    
Sbjct: 222  TIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLS 281

Query: 1251 --TLHFNPNVYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHT 1308
              +       + P    + +G  D +V+LW LE   +      +                
Sbjct: 282  GHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGH-------------- 327

Query: 1309 RTLKLEEQVLCARVSPDSKLLAVSL-LDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYD 1367
                 E  V     SPD  LL      D T++++ L T K   +L GH   VLS+  S D
Sbjct: 328  -----EGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHS-NVLSVSFSPD 381

Query: 1368 STLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDA 1427
              ++++GS D TV++W L  G   ++L  H   VT + F P      + S D  ++ WD 
Sbjct: 382  GRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDL 441

Query: 1428 DNFERIV 1434
                + V
Sbjct: 442  KTSLKSV 448



 Score =  129 bits (324), Expect = 4e-31
 Identities = 114/471 (24%), Positives = 194/471 (41%), Gaps = 43/471 (9%)

Query: 102 FSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECGLHRLSGHKGV 161
            S          + +  ++  +     G  L     D+ + L D+        L GH+  
Sbjct: 12  KSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDL----SSLLLRGHEDS 67

Query: 162 ITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTM--AAHLTEVWGVCVMREDSYLI 219
           IT I F   P    ++S + D  +K+WD D G+         H + V  + +   D   I
Sbjct: 68  ITSIAF--SPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSI 125

Query: 220 ---SGSNDAELKVWNVRDRSDIDTEDKDKLSEQLNQLLLSEDEPDLTVSKIEVQIINELK 276
              S S D  +K+W++        +    L      +      PD  +      +   +K
Sbjct: 126 LLASSSLDGTVKLWDLST----PGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIK 181

Query: 277 NLSTGKKKWLQVFRLALCISSITLNIDDFAFGIDTTQELRTRSMKRKNDEVTVYDREKNY 336
                  K L         +  T  +   AF  D    + + S       + ++D     
Sbjct: 182 LWDLRTGKPLSTL------AGHTDPVSSLAFSPDGGLLIASGSSDGT---IRLWDLSTGK 232

Query: 337 KVQKVQKDVGRTSNLFAPIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGECE 396
            ++                  L GH   +  S + PDG  +AS   D  I +W +     
Sbjct: 233 LLRST----------------LSGHSDSV-VSSFSPDGSLLASGSSDGTIRLWDLRSSSS 275

Query: 397 NIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWDLEKGQRIKKMK--GHSTFVNSCD 454
            +  +SGH+ +V+ + FS DG  + + S+D T+ +WDLE G+ +  +   GH   V+S  
Sbjct: 276 LLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLS 335

Query: 455 PVRRGQLLIASGSDDCTVKVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVLTGGIDNDIK 514
               G LL++ GSDD T+++WD R      ++     V SV+F+     V +G  D  ++
Sbjct: 336 FSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTVR 395

Query: 515 MWDLRTNSVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVP 565
           +WDL T S+++ L GH+  VT L  SPDG  + S + DNT+R+WD++  + 
Sbjct: 396 LWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLK 446



 Score =  127 bits (319), Expect = 2e-30
 Identities = 118/431 (27%), Positives = 182/431 (42%), Gaps = 60/431 (13%)

Query: 978  DQFNPNVYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTL 1037
            D      + P    +++G  D ++KLW L+               +    + SL      
Sbjct: 66   DSITSIAFSPDGELLLSGSSDGTIKLWDLD---------------NGEKLIKSLEGLHDS 110

Query: 1038 KLEEQVLCARVSPDSK--LLAVSLLDTTVKIFFLDT-FKFFISLYGHKLPVLSLDMS-YD 1093
             + +    A  SPD    LLA S LD TVK++ L T  K   +L GH   V SL  S   
Sbjct: 111  SVSK---LALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDG 167

Query: 1094 STLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTT-SKDGRVKQWD 1152
              L +  S D T+K+W L  G    +L  H D V+ + F P       + S DG ++ WD
Sbjct: 168  KLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWD 227

Query: 1153 ADNFERIVTLHFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGD-CHKSLL 1211
                + + +     +L GH   V+S   S D +L+A+GS D T+++W L       ++L 
Sbjct: 228  LSTGKLLRS-----TLSGHSDSVVSS-FSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLS 281

Query: 1212 AHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIVTLHFNPN-------VYLPLQI 1264
             H  SV  V F P      + S DG V+ WD +  + + +L    +        + P   
Sbjct: 282  GHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGS 341

Query: 1265 QVVTGGG-DKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVS 1323
             +V+GG  D +++LW L                            +TL+    VL    S
Sbjct: 342  LLVSGGSDDGTIRLWDLRTGK----------------------PLKTLEGHSNVLSVSFS 379

Query: 1324 PDSKLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1383
            PD ++++    D TV+++ L T     +L GH   V SLD S D   +A+GS D T+++W
Sbjct: 380  PDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLW 439

Query: 1384 GLDYGDCHKSL 1394
             L       S 
Sbjct: 440  DLKTSLKSVSF 450



 Score =  126 bits (316), Expect = 4e-30
 Identities = 100/406 (24%), Positives = 168/406 (41%), Gaps = 44/406 (10%)

Query: 852  LLLNNNSLELHSLSLGGSTDSVRHLRSIHAQGHHSEVRALAFSSDNLALVSACASQVKIW 911
            LL  ++   +    L      ++ L  +H         +    +  L   S+    VK+W
Sbjct: 80   LLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLW 139

Query: 912  NRPSLSCLRTIDTGSYA--LSVCFVPGDRHVLVG-TKDGRLLIVDIGAGEILEDIPAHSQ 968
            +  +   L     G      S+ F P  + +  G + DG + + D+  G+ L  +  H+ 
Sbjct: 140  DLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTD 199

Query: 969  ELWSVAMLPDQFNPNVYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKV 1028
             + S+A  PD           + + +G  D +++LW           D  T  + RS   
Sbjct: 200  PVSSLAFSPD---------GGLLIASGSSDGTIRLW-----------DLSTGKLLRS--- 236

Query: 1029 LSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTF-KFFISLYGHKLPVLS 1087
                   TL      + +  SPD  LLA    D T++++ L +      +L GH   VLS
Sbjct: 237  -------TLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLS 289

Query: 1088 LDMSYDSTLIATGSGDRTVKVWGLDYGDC--HKSLLAHEDSVTGVTFVPKTHYFFTT-SK 1144
            +  S D  L+A+GS D TV++W L+ G      +L  HE  V+ ++F P      +  S 
Sbjct: 290  VAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSD 349

Query: 1145 DGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYG 1204
            DG ++ WD    + + TL            VLS+  S D  ++++GS D TV++W L  G
Sbjct: 350  DGTIRLWDLRTGKPLKTLE-------GHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTG 402

Query: 1205 DCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIV 1250
               ++L  H   VT + F P      + S D  ++ WD     + V
Sbjct: 403  SLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSV 448



 Score =  122 bits (305), Expect = 1e-28
 Identities = 117/429 (27%), Positives = 181/429 (42%), Gaps = 45/429 (10%)

Query: 1023 SRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLYGHK 1082
                K+L             +L    S    LL   L D+ V +  L +    + L GH+
Sbjct: 10   ENKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSS----LLLRGHE 65

Query: 1083 LPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLL--AHEDSVTGVTFVPKTHYFF 1140
              + S+  S D  L+ +GS D T+K+W LD G+     L   H+ SV+ +          
Sbjct: 66   DSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSI 125

Query: 1141 ---TTSKDGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMS-YDSTLIATGSGDRTV 1196
               ++S DG VK WD     +++      +L GH   V SL  S     L +  S D T+
Sbjct: 126  LLASSSLDGTVKLWDLSTPGKLIR-----TLEGHSESVTSLAFSPDGKLLASGSSLDGTI 180

Query: 1197 KVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTT-SKDGRVKQWDADNFERIVTLHFN 1255
            K+W L  G    +L  H D V+ + F P       + S DG ++ WD    + + +    
Sbjct: 181  KLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSG 240

Query: 1256 P-----NVYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRT 1310
                  + + P    + +G  D +++LW                D+  S  +L  L   +
Sbjct: 241  HSDSVVSSFSPDGSLLASGSSDGTIRLW----------------DLRSSSSLLRTLSGHS 284

Query: 1311 LKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDT--FKFFISLYGHKLPVLSLDMSYDS 1368
                  VL    SPD KLLA    D TV+++ L+T      ++L GH+ PV SL  S D 
Sbjct: 285  ----SSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDG 340

Query: 1369 TLIATG-SGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDA 1427
            +L+ +G S D T+++W L  G   K+L  H  +V  V+F P      + S DG V+ WD 
Sbjct: 341  SLLVSGGSDDGTIRLWDLRTGKPLKTLEGH-SNVLSVSFSPDGRVVSSGSTDGTVRLWDL 399

Query: 1428 DNFERIVTL 1436
                 +  L
Sbjct: 400  STGSLLRNL 408



 Score =  116 bits (290), Expect = 9e-27
 Identities = 118/486 (24%), Positives = 206/486 (42%), Gaps = 61/486 (12%)

Query: 169 SQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELK 228
           S      +++   D+ V + D  +      +  H   +  +    +   L+SGS+D  +K
Sbjct: 35  SSESGILLLALLSDSLVSLPDLSSL----LLRGHEDSITSIAFSPDGELLLSGSSDGTIK 90

Query: 229 VWNVRDRSD-IDTEDKDKLSEQLNQLLLSEDEPDLTVSKIEVQIINELKNLSTGKKKWLQ 287
           +W++ +    I + +    S      L S D   + ++   +    +L +LST       
Sbjct: 91  LWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLST------- 143

Query: 288 VFRLALCISSITLNIDDFAFGIDTTQELRTRSMKRKNDEVTVYDREKNYKVQKVQKDVGR 347
             +L   +   + ++   AF   +       S    +  + ++D                
Sbjct: 144 PGKLIRTLEGHSESVTSLAF---SPDGKLLASGSSLDGTIKLWDLRTG------------ 188

Query: 348 TSNLFAPIMLLEGHGGEIFCSKYHPDGQ-YIASSGYDRQIFIWSVYGECENIGVMSGHTG 406
                 P+  L GH   +    + PDG   IAS   D  I +W +         +SGH+ 
Sbjct: 189 -----KPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSD 243

Query: 407 AVMDLKFSTDGCHIFTCSTDQTLAVWDLE-KGQRIKKMKGHSTFVNSCDPVRRGQLLIAS 465
           +V+   FS DG  + + S+D T+ +WDL      ++ + GHS+ V S      G+LL AS
Sbjct: 244 SVVSS-FSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLL-AS 301

Query: 466 GSDDCTVKVWDPRKKNQAVSMNNTY---QVTSVAFNDTAECVLTGG-IDNDIKMWDLRTN 521
           GS D TV++WD        S+        V+S++F+     +++GG  D  I++WDLRT 
Sbjct: 302 GSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTG 361

Query: 522 SVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFE 581
             ++ L GHS  V  +S SPDG  + S + D TVR+WD+         ++ + GH     
Sbjct: 362 KPLKTLEGHS-NVLSVSFSPDGRVVSSGSTDGTVRLWDLSTG----SLLRNLDGHTSR-- 414

Query: 582 KNLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPKEPIIMSA 641
             +    +S  G  + +GS+D  + +WD  T            S+  V F P   ++ S 
Sbjct: 415 --VTSLDFSPDGKSLASGSSDNTIRLWDLKT------------SLKSVSFSPDGKVLASK 460

Query: 642 SSDKTI 647
           SSD ++
Sbjct: 461 SSDLSV 466



 Score =  115 bits (289), Expect = 1e-26
 Identities = 111/395 (28%), Positives = 171/395 (43%), Gaps = 40/395 (10%)

Query: 1057 VSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDC 1116
            V    T+ +       K  +    + L +LSL  S    L+     D  V +  L     
Sbjct: 2    VDNSSTSSENKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSSL-- 59

Query: 1117 HKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIV-TLHFFISLYGHKLPV 1175
               L  HEDS+T + F P      + S DG +K WD DN E+++ +L         KL +
Sbjct: 60   --LLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLAL 117

Query: 1176 LSLDMSYDSTLIATGSGDRTVKVWGL-DYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTS- 1233
             S D   +S L+A+ S D TVK+W L   G   ++L  H +SVT + F P      + S 
Sbjct: 118  SSPD--GNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSS 175

Query: 1234 KDGRVKQWDADNFERIVTLHFNPNVYLPL------QIQVVTGGGDKSVKLWQLELVSVNR 1287
             DG +K WD    + + TL  + +    L       + + +G  D +++LW         
Sbjct: 176  LDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLW--------- 226

Query: 1288 EADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTF- 1346
              D  T  + RS          TL      + +  SPD  LLA    D T++++ L +  
Sbjct: 227  --DLSTGKLLRS----------TLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSS 274

Query: 1347 KFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDC--HKSLLAHEDSVTGV 1404
                +L GH   VLS+  S D  L+A+GS D TV++W L+ G      +L  HE  V+ +
Sbjct: 275  SLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSL 334

Query: 1405 TFVPKTHYFFTT-SKDGRVKQWDADNFERIVTLHI 1438
            +F P      +  S DG ++ WD    + + TL  
Sbjct: 335  SFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEG 369



 Score =  100 bits (249), Expect = 1e-21
 Identities = 88/408 (21%), Positives = 164/408 (40%), Gaps = 39/408 (9%)

Query: 37  RFLATGASEDVIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDSSQLAVAYTN 96
             L+  +   + +WDL   EK +      H+         ++   SP+ +S  LA +  +
Sbjct: 79  LLLSGSSDGTIKLWDLDNGEKLIKSLEGLHD-----SSVSKLALSSPDGNSILLASSSLD 133

Query: 97  GSLKTFSLDTTDV-ISTFTGHKSAITVIQYDPLGHRLATGS-KDTDIVLWDVVAECGLHR 154
           G++K + L T    I T  GH  ++T + + P G  LA+GS  D  I LWD+     L  
Sbjct: 134 GTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLST 193

Query: 155 LSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMRE 214
           L+GH   ++ + F S  G   + S + D  +++WD  TG   ++  +  ++        +
Sbjct: 194 LAGHTDPVSSLAF-SPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPD 252

Query: 215 DSYLISGSNDAELKVWNVRDRSDIDTEDKDKLSEQLNQLLLSEDEPDLTVSKIEVQIINE 274
            S L SGS+D  +++W++R  S          S  +  +  S D   L     +  +   
Sbjct: 253 GSLLASGSSDGTIRLWDLRS-SSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTV--R 309

Query: 275 LKNLSTGKKKWLQVFRLALCISSITLNIDDFAFGIDTTQELRTRSMKRKNDEVTVYDREK 334
           L +L TGK       +           +   +F  D +  +   S    +  + ++D   
Sbjct: 310 LWDLETGKLLSSLTLK------GHEGPVSSLSFSPDGSLLVSGGS---DDGTIRLWDLRT 360

Query: 335 NYKVQKVQKDVGRTSNLFAPIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGE 394
              ++ +                       +    + PDG+ ++S   D  + +W +   
Sbjct: 361 GKPLKTL------------------EGHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTG 402

Query: 395 CENIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWDLEKGQRIKK 442
              +  + GHT  V  L FS DG  + + S+D T+ +WDL+   +   
Sbjct: 403 -SLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVS 449



 Score = 97.5 bits (241), Expect = 1e-20
 Identities = 100/439 (22%), Positives = 182/439 (41%), Gaps = 42/439 (9%)

Query: 69  LLLPGEKHEVCQLSPNHDSSQLAVAYTNGSLKTFSLDT-TDVISTFTGH--KSAITVIQY 125
           LLL G +  +  ++ + D   L    ++G++K + LD    +I +  G    S   +   
Sbjct: 59  LLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALS 118

Query: 126 DPLGHRLATGSKDTD--IVLWDVV-AECGLHRLSGHKGVITDIRFMSQPGHHFVVSSAKD 182
            P G+ +   S   D  + LWD+      +  L GH   +T + F    G      S+ D
Sbjct: 119 SPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPD-GKLLASGSSLD 177

Query: 183 TFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLI-SGSNDAELKVWNVRDRSDIDTE 241
             +K+WD  TG    T+A H   V  +    +   LI SGS+D  +++W++     + + 
Sbjct: 178 GTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRST 237

Query: 242 DKDKLSEQLNQLLLSEDEPDLTVSKIEVQIINELKNLSTGKKKWLQVFRLALCISSITLN 301
                   ++    S D   L     +  I   L +L +       +   +  + S+  +
Sbjct: 238 LSGHSDSVVS--SFSPDGSLLASGSSDGTI--RLWDLRSSSSLLRTLSGHSSSVLSVAFS 293

Query: 302 IDDFAFGIDTTQELRTRSMKRKNDEVTVYDREKNYKVQKVQKDVGRTSNLFAPIMLLEGH 361
            D               +    +  V ++D E               +      + L+GH
Sbjct: 294 PDGKLL-----------ASGSSDGTVRLWDLE---------------TGKLLSSLTLKGH 327

Query: 362 GGEIFCSKYHPDGQYIASSG-YDRQIFIWSVYGECENIGVMSGHTGAVMDLKFSTDGCHI 420
            G +    + PDG  + S G  D  I +W      + +  + GH+  V+ + FS DG  +
Sbjct: 328 EGPVSSLSFSPDGSLLVSGGSDDGTIRLWD-LRTGKPLKTLEGHSN-VLSVSFSPDGRVV 385

Query: 421 FTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDPRKK 480
            + STD T+ +WDL  G  ++ + GH++ V S D    G+  +ASGS D T+++WD +  
Sbjct: 386 SSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKS-LASGSSDNTIRLWDLKTS 444

Query: 481 NQAVSMNNTYQVTSVAFND 499
            ++VS +   +V +   +D
Sbjct: 445 LKSVSFSPDGKVLASKSSD 463



 Score = 88.6 bits (218), Expect = 8e-18
 Identities = 84/346 (24%), Positives = 140/346 (40%), Gaps = 42/346 (12%)

Query: 825  TIKTASKTGKIKSVDVILGGGGEIRLALLLNNNSLELHSLSLGG-----STDSVRHLRSI 879
            T+K    +   K +  + G    +       +  L     SL G        + + L ++
Sbjct: 135  TVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTL 194

Query: 880  HAQGHHSEVRALAFSSDNLALVSACAS--QVKIWNRPSLSCLRTIDTGSYALSV-CFVPG 936
               GH   V +LAFS D   L+++ +S   +++W+  +   LR+  +G     V  F P 
Sbjct: 195  A--GHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPD 252

Query: 937  DRHVLVGTKDGRLLIVDIGAG-EILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTG 995
               +  G+ DG + + D+ +   +L  +  HS  + SVA  PD             + +G
Sbjct: 253  GSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKL----------LASG 302

Query: 996  GGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLL 1055
              D +V+LW LE   +      +                     E  V     SPD  LL
Sbjct: 303  SSDGTVRLWDLETGKLLSSLTLKGH-------------------EGPVSSLSFSPDGSLL 343

Query: 1056 AVSL-LDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYG 1114
                  D T++++ L T K   +L GH   VLS+  S D  ++++GS D TV++W L  G
Sbjct: 344  VSGGSDDGTIRLWDLRTGKPLKTLEGHS-NVLSVSFSPDGRVVSSGSTDGTVRLWDLSTG 402

Query: 1115 DCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIV 1160
               ++L  H   VT + F P      + S D  ++ WD     + V
Sbjct: 403  SLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSV 448



 Score = 74.0 bits (180), Expect = 4e-13
 Identities = 58/221 (26%), Positives = 101/221 (45%), Gaps = 16/221 (7%)

Query: 29  VTLKNQEGRFLATGASEDVI-IWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDS 87
           V+  + +G  LA+G+S+  I +WDLR                 L G    V  ++ + D 
Sbjct: 246 VSSFSPDGSLLASGSSDGTIRLWDLR---------SSSSLLRTLSGHSSSVLSVAFSPDG 296

Query: 88  SQLAVAYTNGSLKTFSLDTTDVIS--TFTGHKSAITVIQYDPLGHRLATG-SKDTDIVLW 144
             LA   ++G+++ + L+T  ++S  T  GH+  ++ + + P G  L +G S D  I LW
Sbjct: 297 KLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLW 356

Query: 145 DVVAECGLHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLT 204
           D+     L  L GH  V   + F   P    V S + D  V++WD  TG   + +  H +
Sbjct: 357 DLRTGKPLKTLEGHSNV-LSVSFS--PDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTS 413

Query: 205 EVWGVCVMREDSYLISGSNDAELKVWNVRDRSDIDTEDKDK 245
            V  +    +   L SGS+D  +++W+++      +   D 
Sbjct: 414 RVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPDG 454



 Score = 53.9 bits (128), Expect = 8e-07
 Identities = 44/156 (28%), Positives = 68/156 (43%), Gaps = 7/156 (4%)

Query: 1297 SRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLY--G 1354
                 +  L        E+ +     SPD +LL     D T+K++ LD  +  I      
Sbjct: 48   DSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGL 107

Query: 1355 HKLPVLSLDMSY---DSTLIATGSGDRTVKVWGL-DYGDCHKSLLAHEDSVTGVTFVPKT 1410
            H   V  L +S    +S L+A+ S D TVK+W L   G   ++L  H +SVT + F P  
Sbjct: 108  HDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDG 167

Query: 1411 HYFFTTS-KDGRVKQWDADNFERIVTLHICSCSLNS 1445
                + S  DG +K WD    + + TL   +  ++S
Sbjct: 168  KLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSS 203


>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease. 
          Length = 175

 Score =  122 bits (309), Expect = 1e-31
 Identities = 63/242 (26%), Positives = 89/242 (36%), Gaps = 75/242 (30%)

Query: 2174 IPETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSV 2233
            +PE+FD R++       +  V DQG C SCW                             
Sbjct: 1    LPESFDWRKKG-----AVTPVKDQGQCGSCW----------------------------- 26

Query: 2234 WDKAISAASVMSDRICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAW 2293
               A SA   +  R CI++   V   LS Q L+    +C+          C GG    A+
Sbjct: 27   ---AFSATGALEGRYCIKTGKLV--SLSEQQLV----DCSGGGNCG----CNGGLPDNAF 73

Query: 2294 MYWI-NAGLVDGGDYGTHDVSMGRYIEGIGHAASVMGSSNPEVNNFEK----VIRLYSCE 2348
             Y   N GL     Y         Y   +   AS          +F+     +     C 
Sbjct: 74   EYIKKNGGLETESCY--------PYTGSVAIDAS----------DFQFYKSGIYDHPGCG 115

Query: 2349 GSINPRYIHSVKIIGWGKSSQN-EPYWLCTNSYNQGWGEQGLFKIRRGV-NMCSIEDSVM 2406
               +    H+V I+G+G   +N + YW+  NS+   WGE G F+I RG  N C IE SV 
Sbjct: 116  ---SGTLDHAVLIVGYGTEVENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVA 172

Query: 2407 AG 2408
            + 
Sbjct: 173  SY 174


>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
            cathepsin B and similar proteins, including
            tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
            B is a lysosomal papain-like cysteine peptidase which is
            expressed in all tissues and functions primarily as an
            exopeptidase through its carboxydipeptidyl activity.
            Together with other cathepsins, it is involved in the
            degradation of proteins, proenzyme activation, Ag
            processing, metabolism and apoptosis. Cathepsin B has
            been implicated in a number of human diseases such as
            cancer, rheumatoid arthritis, osteoporosis and
            Alzheimer's disease. The unique carboxydipeptidyl
            activity of cathepsin B is attributed to the presence of
            an occluding loop in its active site which favors the
            binding of the C-termini of substrate proteins. Some
            members of this group do not possess the occluding loop.
            TIN-Ag is an extracellular matrix basement protein which
            was originally identified as a target Ag involved in
            anti-tubular basement membrane antibody-mediated
            interstitial nephritis. It plays a role in renal
            tubulogenesis and is defective in hereditary
            tubulointerstitial disorders. TIN-Ag is exclusively
            expressed in kidney tissues. .
          Length = 236

 Score =  113 bits (285), Expect = 1e-27
 Identities = 47/131 (35%), Positives = 57/131 (43%), Gaps = 41/131 (31%)

Query: 2175 PETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSVW 2234
            PE+FDARE+WP C   IG++ DQG C SCW                              
Sbjct: 1    PESFDAREKWPNCI-SIGEIRDQGNCGSCW------------------------------ 29

Query: 2235 DKAISAASVMSDRICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAWM 2294
              A SA    SDR+CIQS G+   +LS Q L+  C+ C           C GG   AAW 
Sbjct: 30   --AFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGD--------GCNGGYPDAAWK 79

Query: 2295 YWINAGLVDGG 2305
            Y    G+V GG
Sbjct: 80   YLTTTGVVTGG 90



 Score = 95.4 bits (238), Expect = 2e-21
 Identities = 29/52 (55%), Positives = 33/52 (63%), Gaps = 1/52 (1%)

Query: 2357 HSVKIIGWGKSSQNEPYWLCTNSYNQGWGEQGLFKIRRGVNMCSIEDSVMAG 2408
            H+VKIIGWG      PYWL  NS+   WGE G F+I RG N C IE  V+AG
Sbjct: 186  HAVKIIGWG-VENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVAG 236



 Score = 48.4 bits (116), Expect = 2e-05
 Identities = 16/29 (55%), Positives = 18/29 (62%)

Query: 1562 GSKVYYVNNSTTDIQKEIMQHGPVQAKFY 1590
            G   Y V +  TDI KEIM +GPVQA F 
Sbjct: 134  GKSAYSVPSDETDIMKEIMTNGPVQAAFT 162


>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
            nomenclature); composed of cysteine peptidases (CPs)
            similar to papain, including the mammalian CPs
            (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
            is an endopeptidase with specific substrate preferences,
            primarily for bulky hydrophobic or aromatic residues at
            the S2 subsite, a hydrophobic pocket in papain that
            accommodates the P2 sidechain of the substrate (the
            second residue away from the scissile bond). Most members
            of the papain subfamily are endopeptidases. Some
            exceptions to this rule can be explained by specific
            details of the catalytic domains like the occluding loop
            in cathepsin B which confers an additional
            carboxydipeptidyl activity and the mini-chain of
            cathepsin H resulting in an N-terminal exopeptidase
            activity. Papain-like CPs have different functions in
            various organisms. Plant CPs are used to mobilize storage
            proteins in seeds. Parasitic CPs act extracellularly to
            help invade tissues and cells, to hatch or to evade the
            host immune system. Mammalian CPs are primarily lysosomal
            enzymes with the exception of cathepsin W, which is
            retained in the endoplasmic reticulum. They are
            responsible for protein degradation in the lysosome.
            Papain-like CPs are synthesized as inactive proenzymes
            with N-terminal propeptide regions, which are removed
            upon activation. In addition to its inhibitory role, the
            propeptide is required for proper folding of the newly
            synthesized enzyme and its stabilization in denaturing pH
            conditions. Residues within the propeptide region also
            play a role in the transport of the proenzyme to
            lysosomes or acidified vesicles. Also included in this
            subfamily are proteins classified as non-peptidase
            homologs, which lack peptidase activity or have missing
            active site residues.
          Length = 210

 Score = 98.9 bits (247), Expect = 7e-23
 Identities = 58/253 (22%), Positives = 83/253 (32%), Gaps = 72/253 (28%)

Query: 2175 PETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSVW 2234
            PE+ D RE     K  +  V DQG+C SCW                              
Sbjct: 1    PESVDWRE-----KGAVTPVKDQGSCGSCW------------------------------ 25

Query: 2235 DKAISAASVMSDRICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAWM 2294
              A S    +     I++       LS Q L+    +C+          C GG+   A+ 
Sbjct: 26   --AFSTVGALEGAYAIKTG--KLVSLSEQQLV----DCSTSGNNG----CNGGNPDNAFE 73

Query: 2295 YWINAGLVDGGDYGTHDVSMGRYIEGIGHAASVMGSSNPEVNNFEKVIR----------- 2343
            Y  N GL    DY                 A + G SN    + E +             
Sbjct: 74   YVKNGGLASESDYPYTGKDGTCKYNSSKVGAKITGYSNVPPGDEEALKAALANYGPVSVA 133

Query: 2344 -------------LYSCEGSINPRYIHSVKIIGWGKSSQNEPYWLCTNSYNQGWGEQGLF 2390
                         +YS     N    H+V ++G+G +     YW+  NS+   WGE+G  
Sbjct: 134  IDASSSFQFYKGGIYSGPCCSNTNLNHAVLLVGYG-TENGVDYWIVKNSWGTSWGEKGYI 192

Query: 2391 KIRRGVNMCSIED 2403
            +I RG N+C I  
Sbjct: 193  RIARGSNLCGIAS 205


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
            glycoprotein (BLLF1).  This family consists of the BLLF1
            viral late glycoprotein, also termed gp350/220. It is the
            most abundantly expressed glycoprotein in the viral
            envelope of the Herpesviruses and is the major antigen
            responsible for stimulating the production of
            neutralising antibodies in vivo.
          Length = 830

 Score = 96.0 bits (238), Expect = 2e-19
 Identities = 69/316 (21%), Positives = 131/316 (41%), Gaps = 28/316 (8%)

Query: 1854 LAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESE 1913
            L  T  A +      +++F    ++  +V+       +  + TT  P    TT + P + 
Sbjct: 404  LIITRTATNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVP----TTPSLPPAS 459

Query: 1914 STTTSSPESESTTTSSLVSESTTT--SSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
            +  T S    ++ T +  + ST    +SP S +T+ +   +  T   +  + ++ T+   
Sbjct: 460  TGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKT 519

Query: 1972 SESTTTSSPESE-STTTSSLVSESTTTSSPESEST---TTISPVSEST--TTSSPVSEST 2025
            S++   +SP       T++  S  T T+S  + ++   T  SPV+ +     +S  S  T
Sbjct: 520  SDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLT 579

Query: 2026 TTISPESESTTTSS----PASESTTTNNPKSESTTTNNP-------ASESITSSSPASES 2074
            + ++     T +S     P   S++ + P+S ST+T            E+IT  +P+  S
Sbjct: 580  SAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPS 639

Query: 2075 T---TTSSPASESTTTS--SPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
            T   +T SP     TTS  S    S+T+  P     T   P   +T+ S+P+ + T +  
Sbjct: 640  TTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPT 699

Query: 2130 QGVSPHSEKLSANEDP 2145
               +      +  E  
Sbjct: 700  VTSTGGKANSTTKETS 715



 Score = 87.1 bits (215), Expect = 8e-17
 Identities = 64/304 (21%), Positives = 111/304 (36%), Gaps = 12/304 (3%)

Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
             + S ++   S   +  S      +P + S TT         TS        T++  S  
Sbjct: 484  EDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPP 543

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS-E 1993
            T T+S  +   T+     ES   ++     T+  S  + + TT    + S+ TS      
Sbjct: 544  TGTTSVPN--ATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIP 601

Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN--PK 2051
            S++ S+P S ST+T   ++ +  T        T   P +   +T SP     TT+     
Sbjct: 602  SSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGP 661

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
              S+T+  P    +T   P   +T+ S+P+ + T    P   ST   + ++   T+ S  
Sbjct: 662  GNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAV--PTVTSTGGKANSTTKETSGSTL 719

Query: 2112 SESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIP-----NIDH 2166
              ST+  +      T      +      S+   P             A +P     + DH
Sbjct: 720  MASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPVPPTQHPDH 779

Query: 2167 SNQT 2170
            SN +
Sbjct: 780  SNLS 783



 Score = 85.2 bits (210), Expect = 2e-16
 Identities = 66/267 (24%), Positives = 111/267 (41%), Gaps = 15/267 (5%)

Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTN-NPESESTTTSSPESESTTTSSLVSESTTTS 1938
            S     T +S L E+T   SP S +T+   N  S +   ++P + S TT         TS
Sbjct: 471  SGTPTGTTSSTLPEDT---SPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATS 527

Query: 1939 SPESESTTTS---SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
                    T+   SP + +T+  +  S   T  SP + + T     + S  TS++ +   
Sbjct: 528  PTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQH 587

Query: 1996 -TTSSPESE-----STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
             T SSP S+     S++  +P S ST+T+  ++ +  T        T S P++   +T +
Sbjct: 588  GTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLS 647

Query: 2050 PKSESTTTNNPAS--ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
            P     TT+  +    S TS  P     T   P   +T+ S+P+ + T   +  S     
Sbjct: 648  PGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKA 707

Query: 2108 SSPESESTTTSSPASESTTIEEQGVSP 2134
            +S   E++ ++  AS S    E     
Sbjct: 708  NSTTKETSGSTLMASTSPHTNEGAFRT 734



 Score = 82.9 bits (204), Expect = 1e-15
 Identities = 59/271 (21%), Positives = 108/271 (39%), Gaps = 11/271 (4%)

Query: 1859 VAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS 1918
            VA  V D  + II  T  N+ +T      +   + +TT +     +     P   +   +
Sbjct: 394  VANPVADAKTLIITRTATNATTTTHKVVFHK--APDTTKSVIFVYTLVHVEPHKTTAVPT 451

Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
            +P     +T   VS +  TS   + +T+++ PE  S T+ +    ++ T +  S +   +
Sbjct: 452  TPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRT----TSATPNATSPTPAVT 507

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
            +P + S TT         TS        T +  S  T T+S  + ++  ++ ES    T+
Sbjct: 508  TPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTN 567

Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSS--PASESTTTSSPASESTTTSSPASESTT 2096
            +P   S  +    + +T  +   S   +     P+S     S+P S ST+T+   + +  
Sbjct: 568  TPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSH---STPRSNSTSTTPLLTSAHP 624

Query: 2097 TSSPASESTTTSSPESESTTTSSPASESTTI 2127
            T        T S P +   +T SP     T 
Sbjct: 625  TGGENITEETPSVPSTTHVSTLSPGPGPGTT 655



 Score = 80.6 bits (198), Expect = 7e-15
 Identities = 58/262 (22%), Positives = 102/262 (38%), Gaps = 6/262 (2%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENT---TTNSPESESTTTNNPESESTTTSSPESESTTTSS 1929
            TT N +  T   ++     +  T      +  + S  T      + T+     ES   ++
Sbjct: 507  TTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNT 566

Query: 1930 LVSESTTTSSPESESTTTSSPESESTTTSSLVS-ESTTTSSPESESTTTSSPESESTTTS 1988
                 T+  S  + + TT    + S+ TS      S++ S+P S ST+T+   + +  T 
Sbjct: 567  NTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTG 626

Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTS--SPVSESTTTISPESESTTTSSPASESTT 2046
                   T S P +   +T+SP     TTS  S    S+T+  P     T   P   +T+
Sbjct: 627  GENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATS 686

Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
             + P  + T      S    ++S   E++ ++  AS S  T+  A  +T  ++      +
Sbjct: 687  PSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPS 746

Query: 2107 TSSPESESTTTSSPASESTTIE 2128
            TSS      T +SP   +    
Sbjct: 747  TSSKLRPRWTFTSPPVTTKQAT 768



 Score = 79.8 bits (196), Expect = 1e-14
 Identities = 62/257 (24%), Positives = 103/257 (40%), Gaps = 9/257 (3%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTT-SSPESESTTTSSLV 1931
            +   N+ S     T  +  S  T   S    +T+        TTT +SP + +T+  +  
Sbjct: 494  SATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNAT 553

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
            S   T  SP + + T     + S  TS++ +    T S    S T+  P   S++ S+  
Sbjct: 554  SPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGS----SPTSQQPGIPSSSHSTPR 609

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS--SPASESTTTNN 2049
            S ST+T+   + +  T        T S P +   +T+SP     TTS  S    S+T+  
Sbjct: 610  SNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRY 669

Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
            P     T   P   + + S+P+ + T    P   ST   + ++   T+ S    ST+  +
Sbjct: 670  PGEVHVTEGMPNPNATSPSAPSGQKTAV--PTVTSTGGKANSTTKETSGSTLMASTSPHT 727

Query: 2110 PESESTTTSSPASESTT 2126
             E    TT   A+    
Sbjct: 728  NEGAFRTTPYNATTYLP 744



 Score = 72.1 bits (176), Expect = 3e-12
 Identities = 48/239 (20%), Positives = 90/239 (37%), Gaps = 8/239 (3%)

Query: 1870 IIFTTNNNSESTVVMSTLNSLLSENTTTNSPESEST--TTNNPESESTTTSSPESESTTT 1927
            +I  T   +      +++ +  S   T  SP + +      +  S  T+  +     T +
Sbjct: 532  VIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGS 591

Query: 1928 SSLVSES----TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
            S    +     ++ S+P S ST+T+   + +  T        T S P +   +T SP   
Sbjct: 592  SPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPG 651

Query: 1984 STTTSSLVSESTTTSSPESEST--TTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
              TTS +     +++S        T   P   +T+ S+P  + T   +  S     +S  
Sbjct: 652  PGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTT 711

Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
             E++ +    S S  TN  A  +   ++      +TSS      T +SP   +   + P
Sbjct: 712  KETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVP 770



 Score = 64.4 bits (156), Expect = 7e-10
 Identities = 55/246 (22%), Positives = 92/246 (37%), Gaps = 10/246 (4%)

Query: 1845 ITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESES 1904
            +T          T+V  +     +E   +  NN+ + VV S  + L S  TT       S
Sbjct: 535  VTTTATSPPTGTTSVPNATSPQVTEE--SPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSS 592

Query: 1905 TTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1964
             T+  P   S++ S+P S ST+T+ L++ +  T        T S P +   +T S     
Sbjct: 593  PTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGP 652

Query: 1965 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
             TTS       +++S                T   P   +T+  +P  + T   +  S  
Sbjct: 653  GTTSQVSGPGNSSTSRYPGEV--------HVTEGMPNPNATSPSAPSGQKTAVPTVTSTG 704

Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASES 2084
                S   E++ ++  AS S  TN     +T  N       ++SS      T +SP   +
Sbjct: 705  GKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTT 764

Query: 2085 TTTSSP 2090
               + P
Sbjct: 765  KQATVP 770



 Score = 48.2 bits (114), Expect = 6e-05
 Identities = 33/189 (17%), Positives = 68/189 (35%), Gaps = 11/189 (5%)

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
             +      +L+   T T++  +          + TT S     +   + P     TT+ P
Sbjct: 395  ANPVADAKTLIITRTATNATTTTHKVVFHKAPD-TTKSVIFVYTLVHVEP---HKTTAVP 450

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
             + S         + +T +P S + T ++  S     +SP S +T+ +  A+  T   + 
Sbjct: 451  TTPSLPPA-STGPTVSTADPTSGTPTGTTS-STLPEDTSPTSRTTSATPNATSPTPAVTT 508

Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP-----EEFPNEDVFE 2155
             + ++ T+   S++   +SP      +     SP +   S          EE P  +   
Sbjct: 509  PNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNT 568

Query: 2156 HTFAEIPNI 2164
                  P++
Sbjct: 569  PVVTSAPSV 577



 Score = 47.5 bits (112), Expect = 1e-04
 Identities = 45/223 (20%), Positives = 83/223 (37%), Gaps = 13/223 (5%)

Query: 1840 SVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNS 1899
              SP   NN    ++ +    ++      +    ++  S+   + S+ +S    N+T+ +
Sbjct: 559  EESP--VNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTT 616

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS--SPESESTTT 1957
            P    T+ +    E+ T  +P   STT         +T SP     TTS  S    S+T+
Sbjct: 617  P--LLTSAHPTGGENITEETPSVPSTT-------HVSTLSPGPGPGTTSQVSGPGNSSTS 667

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
                    T   P   +T+ S+P  + T   ++ S     +S   E++ +    S S  T
Sbjct: 668  RYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHT 727

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
            +     +T   +      +TSS      T  +P   +     P
Sbjct: 728  NEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVP 770


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 85.6 bits (212), Expect = 3e-16
 Identities = 44/234 (18%), Positives = 81/234 (34%), Gaps = 9/234 (3%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
                +   +     S   SS ++      S   E+    S        S+P + ++    
Sbjct: 150  ASPPAAGASPAAVASDAASSRQAALP--LSSPEETARAPSSPPAEPPPSTPPAAASPRPP 207

Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
              S   + S+         S   ++  +SS  S S ++            P     T  +
Sbjct: 208  RRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPT 267

Query: 2020 PVSESTTTISPESESTTTSSPASES-----TTTNNPKSESTTTNNPASESITSSSPASES 2074
             + E++    P S     SS +S        + ++P S    ++  AS S +SS  +S S
Sbjct: 268  RIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSS 327

Query: 2075 TTTSSPASESTTTSSPASESTT--TSSPASESTTTSSPESESTTTSSPASESTT 2126
            +T+SS  S      SP    +   + S        SSP      + +P+S + +
Sbjct: 328  STSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAAS 381



 Score = 72.9 bits (179), Expect = 2e-12
 Identities = 50/250 (20%), Positives = 87/250 (34%), Gaps = 27/250 (10%)

Query: 1892 SENTTTNSPESESTTTNN---------PESESTTTSSPESE---STTTSSLVSESTTTSS 1939
                  +     S   ++         PE  +   SSP +E   ST  ++        SS
Sbjct: 152  PPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSS 211

Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE----STTTSSLVSEST 1995
            P S S ++ +P    +      + S+ +SS ES S     PE+E         +L +   
Sbjct: 212  PISASASSPAPAPGRSAADDAGASSSDSSSSES-SGCGWGPENECPLPRPAPITLPTRIW 270

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
              S     S+    P S S++            SP    ++  S  + S+   +  S S+
Sbjct: 271  EASGWNGPSSRP-GPASSSSSPRER--------SPSPSPSSPGSGPAPSSPRASSSSSSS 321

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
              ++ +S S +S S    + +     S S + S P       SSP      + +P S + 
Sbjct: 322  RESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP-PPPADPSSPRKRPRPSRAPSSPAA 380

Query: 2116 TTSSPASEST 2125
            +   P     
Sbjct: 381  SAGRPTRRRA 390



 Score = 71.0 bits (174), Expect = 7e-12
 Identities = 55/282 (19%), Positives = 82/282 (29%), Gaps = 42/282 (14%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSSPESES 1954
               P     T        +T +   S     S     S T   P S      T  P S  
Sbjct: 68   PTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP 127

Query: 1955 TTTSSLVSE-------STTTSSPESESTTTSSPESESTTTSS--------LVSESTTTSS 1999
             + +  +SE            +    +   S     S   SS           E+    S
Sbjct: 128  PSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPS 187

Query: 2000 PESESTTTI--------------SPVSESTTTSSPVSESTTTISPESESTTTSSPAS--- 2042
                                   SP+S S ++ +P    +      + S+ +SS  S   
Sbjct: 188  SPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGC 247

Query: 2043 -ESTTTNNPKSESTTTNNPAS--ESITSSSPASESTTTSSPASES-----TTTSSPASES 2094
                    P         P    E+   + P+S     SS +S        + SSP S  
Sbjct: 248  GWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGP 307

Query: 2095 TTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHS 2136
              +S  AS S+++S   S S+T+SS  S        G SP  
Sbjct: 308  APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSR 349



 Score = 63.3 bits (154), Expect = 2e-09
 Identities = 46/241 (19%), Positives = 81/241 (33%), Gaps = 20/241 (8%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            P S S ++ +P    +      + S+ +SS ES S     PE+E            T  +
Sbjct: 212  PISASASSPAPAPGRSAADDAGASSSDSSSSES-SGCGWGPENE---CPLPRPAPITLPT 267

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
               E++  + P S     SS        SS   E     SP    ++  S  + S+   S
Sbjct: 268  RIWEASGWNGPSSRPGPASS--------SSSPRER----SPSPSPSSPGSGPAPSSPRAS 315

Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
              S S+  SS +S S+++ + +  + +     S S + S P   +  +S          S
Sbjct: 316  SSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRP---RPS 372

Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
             A  S   S+         +  +        A+          SP     ++      +P
Sbjct: 373  RAPSSPAASAGRPTRRRARAAVAGRARRRD-ATGRFPAGRPRPSPLDAGAASGAFYARYP 431

Query: 2150 N 2150
             
Sbjct: 432  L 432



 Score = 55.6 bits (134), Expect = 4e-07
 Identities = 40/231 (17%), Positives = 76/231 (32%), Gaps = 17/231 (7%)

Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVS-------ESTTTSSPESESTTTSSP 1950
            +SP S S ++  P    +      + S+ +SS  S       E+       +  T  +  
Sbjct: 210  SSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRI 269

Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
               S       S     SS  S      SP    ++  S  + S+  +S  S S+   S 
Sbjct: 270  WEASGWNGP-SSRPGPASSSSSPR--ERSPSPSPSSPGSGPAPSSPRASSSSSSSRESS- 325

Query: 2011 VSESTTTSSPVSEST-TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
             S ST++SS  S     +  P    + + S        ++P+     +  P+S + ++  
Sbjct: 326  -SSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGR 384

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
            P       ++ A  +    +        +     S   +   S +     P
Sbjct: 385  PTRR-RARAAVAGRARRRDATGR---FPAGRPRPSPLDAGAASGAFYARYP 431



 Score = 53.6 bits (129), Expect = 2e-06
 Identities = 35/189 (18%), Positives = 60/189 (31%), Gaps = 7/189 (3%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
                    P     T      E++  + P S     SS  S S    SP    ++  S  
Sbjct: 250  GPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASS--SSSPRERSPSPSPSSPGSGP 307

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
            + S+  +S  S S+  SS  S S+++ S    + +     S S + S P        SP 
Sbjct: 308  APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP-PPPADPSSPR 366

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
                 + +P S + +   P       ++ A  +   +               S   +  A
Sbjct: 367  KRPRPSRAPSSPAASAGRPTRR-RARAAVAGRARRRDAT---GRFPAGRPRPSPLDAGAA 422

Query: 2072 SESTTTSSP 2080
            S +     P
Sbjct: 423  SGAFYARYP 431



 Score = 53.3 bits (128), Expect = 2e-06
 Identities = 42/244 (17%), Positives = 80/244 (32%), Gaps = 15/244 (6%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
               P +     ++  S S      +S      ++V+ +      E  +     P +E+  
Sbjct: 22   PRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPA 81

Query: 1957 ----TSSLVSESTTT--SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
                ++   S ST    S     S T   P S      +    S    SP  + +  + P
Sbjct: 82   NESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP-PSPAPDLSEMLRP 140

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
            V       +    +         S   SS  + +   ++P+  +   ++P +E   S+ P
Sbjct: 141  VGSPGPPPAASPPAAGASPAAVASDAASSRQA-ALPLSSPEETARAPSSPPAEPPPSTPP 199

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
            A+    +  P   S+  S+ AS       PA   +      + S+ +SS  S       +
Sbjct: 200  AA---ASPRPPRRSSPISASASSPA----PAPGRSAADDAGASSSDSSSSESSGCGWGPE 252

Query: 2131 GVSP 2134
               P
Sbjct: 253  NECP 256



 Score = 51.3 bits (123), Expect = 7e-06
 Identities = 30/175 (17%), Positives = 56/175 (32%), Gaps = 13/175 (7%)

Query: 1884 MSTLNSLLSENTTTNSPESESTTTNN--------PESESTTTSSPESESTTTSSLVSEST 1935
              TL + + E +  N P S     ++        P    ++  S  + S+  +S  S S+
Sbjct: 262  PITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSS 321

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
              SS  S S+++ S    + +     S S + S P   +  +S  +       S    S 
Sbjct: 322  RESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRP---RPSRAPSSP 378

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
              S+         + V+         +           S   +  AS +     P
Sbjct: 379  AASAGRPTRRRARAAVAGRARRRD--ATGRFPAGRPRPSPLDAGAASGAFYARYP 431



 Score = 48.2 bits (115), Expect = 7e-05
 Identities = 45/225 (20%), Positives = 73/225 (32%), Gaps = 21/225 (9%)

Query: 1930 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1989
            LVS+S   ++    +   +    E  T         T +      +T +   S     S 
Sbjct: 43   LVSDSAELAAVTVVAGAAACDRFEPPTGPP--PGPGTEAPANESRSTPTWSLSTLAPASP 100

Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
                S T   P S      +P   S    SP  + +  + P        + +  +   + 
Sbjct: 101  AREGSPTPPGPSSPDPPPPTPPPASPP-PSPAPDLSEMLRPVGSPGPPPAASPPAAGASP 159

Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT--- 2106
                S           +S   A      SSP   +   SSP +E   ++ PA+ S     
Sbjct: 160  AAVASDAA--------SSRQAALPL---SSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208

Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSP----HSEKLSANEDPEE 2147
             SSP S S ++ +PA   +  ++ G S      SE       PE 
Sbjct: 209  RSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPEN 253


>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
          Length = 619

 Score = 82.1 bits (203), Expect = 2e-15
 Identities = 42/301 (13%), Positives = 99/301 (32%), Gaps = 21/301 (6%)

Query: 1881 TVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESE----STTTSSLVSESTT 1936
            T   +  +    ++T   +      + ++  S+ T++   +      ++   +   + +T
Sbjct: 21   TSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFST 80

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
              S  S+S        ++   +++    T     ++ S TT      +  +     E   
Sbjct: 81   IDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPR 140

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
             S   +  +   S  S    T +  S+     + ++ S+  + P++ +   N+PK     
Sbjct: 141  NSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPN 200

Query: 2057 TNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
             +N    S           T +  +S     S   S   +     SE    +  +  S +
Sbjct: 201  QSNSQPAS---------DDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQS 251

Query: 2117 TSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE------HTFAEIPNIDHSNQT 2170
                   S T   Q   P  ++L     P +    DV +        F   P++ +++ +
Sbjct: 252  KKDKTETSNTKNPQ--LPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDS 309

Query: 2171 D 2171
             
Sbjct: 310  G 310



 Score = 79.1 bits (195), Expect = 1e-14
 Identities = 40/258 (15%), Positives = 90/258 (34%), Gaps = 4/258 (1%)

Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1963
             T T+             +++T+  S  S    TS   S   T  +  + ++   +   +
Sbjct: 18   PTLTSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKK 77

Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
             +T  S  S+S        ++   +++    T     ++ S TT+     +  +     E
Sbjct: 78   FSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYE 137

Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
                    +  +  +S +S    T+   S+    +N  + S  ++ P++ +   +SP   
Sbjct: 138  QPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPT 197

Query: 2084 STTTSSPASESTTTS-SPASESTTTSSPESESTTTSSPASESTTIEEQ---GVSPHSEKL 2139
                S+    S  T+   +S     S  +S   +     SE     ++     S   +  
Sbjct: 198  QPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTE 257

Query: 2140 SANEDPEEFPNEDVFEHT 2157
            ++N    + P +D  +H 
Sbjct: 258  TSNTKNPQLPTQDELKHK 275



 Score = 66.7 bits (163), Expect = 1e-10
 Identities = 36/231 (15%), Positives = 84/231 (36%), Gaps = 16/231 (6%)

Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
            +++LV  + T+ +          P+ +ST  ++      +     S+ T++         
Sbjct: 12   STTLVLPTLTSPT-----AYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNN 66

Query: 2017 TSSPVSESTTTISPESESTTTSSPASE-------STTTNNPKSESTTTNNPASESITSSS 2069
             +S    +    S    ST+ S+   +        T  N   +++   +N +  ++  + 
Sbjct: 67   NTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNL 126

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
                S  +      ++  S+  S   + SS  +++ T SS + ++    +P+S +T    
Sbjct: 127  FNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPST 186

Query: 2130 QGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSN-QTDEAIPETFD 2179
                P+S K +        P  D    T  +  +   +   +D A+    D
Sbjct: 187  SNKQPNSPKPTQPNQSNSQPASD---DTANQKSSSKDNQSMSDSALDSILD 234



 Score = 41.3 bits (97), Expect = 0.008
 Identities = 47/292 (16%), Positives = 95/292 (32%), Gaps = 46/292 (15%)

Query: 1765 NSVSPNVTSKILTTDNYSEIIFTTNNNSESTVVMSTLNSLLSENEKLFKPHAKTPGAEFL 1824
             S   N   K  T D+ +     +NN  +                  +K   +T   + L
Sbjct: 68   TSNQDNNDKKFSTIDSSTS---DSNNIIDFI----------------YKNLPQTNINQLL 108

Query: 1825 IQCQYCDFDSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVM 1884
             + +Y D  S   L+        NL            ++  + S+     N+   +    
Sbjct: 109  TKNKYDDNYSLTTLI-------QNLF-----------NLNSDISDYEQPRNSEKSTNDSN 150

Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
               +S +  +T T S + +        S + T  S  ++   +      + + S P S+ 
Sbjct: 151  KNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDD 210

Query: 1945 TT---TSSPESESTTTSSLVS--ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
            T    +SS +++S + S+L S  +  +  + +++    S  + + T TS+  +    T  
Sbjct: 211  TANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTETSNTKNPQLPTQD 270

Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP----ASESTTT 2047
                 +            S+  S S     P   +   S       S+ T  
Sbjct: 271  ELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDSGSFNVVDSKDTRQ 322


>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
            Dipeptidyl Peptidase I (DPPI), an atypical papain-like
            cysteine peptidase with chloride dependency and
            dipeptidyl aminopeptidase activity, resulting from its
            tetrameric structure which limits substrate access. Each
            subunit of the tetramer is composed of three peptides:
            the heavy and light chains, which together adopts the
            papain fold and forms the catalytic domain; and the
            residual propeptide region, which forms a beta barrel and
            points towards the substrate's N-terminus. The subunit
            composition is the result of the unique characteristic of
            procathepsin C maturation involving the cleavage of the
            catalytic domain and the non-autocatalytic excision of an
            activation peptide within its propeptide region. By
            removing N-terminal dipeptide extensions, cathepsin C
            activates granule serine peptidases (granzymes) involved
            in cell-mediated apoptosis, inflammation and tissue
            remodelling. Loss-of-function mutations in cathepsin C
            are associated with Papillon-Lefevre and Haim-Munk
            syndromes, rare diseases characterized by hyperkeratosis
            and early-onset periodontitis. Cathepsin C is widely
            expressed in many tissues with high levels in lung,
            kidney and placenta. It is also highly expressed in
            cytotoxic lymphocytes and mature myeloid cells.
          Length = 243

 Score = 72.8 bits (179), Expect = 1e-13
 Identities = 70/289 (24%), Positives = 92/289 (31%), Gaps = 107/289 (37%)

Query: 2175 PETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSVW 2234
            P++FD  +       V   V +QG C SC+                              
Sbjct: 2    PKSFDWGDVNNGFNYVSP-VRNQGGCGSCY------------------------------ 30

Query: 2235 DKAISAASVMSDRICIQS-----KGQVKPILSPQH-LICS-------------------- 2268
              A ++   +  RI I S      GQ +PILSPQH L CS                    
Sbjct: 31   --AFASVYALEARIMIASNKTDPLGQ-QPILSPQHVLSCSQYSQGCDGGFPFLVGKFAED 87

Query: 2269 ------------------CT----NCTRMHTKT--PMSMCMGGDSAAAWMYW--INAGLV 2302
                              C      C R +      +  C G  +    M W     G +
Sbjct: 88   FGIVTEDYFPYTADDDRPCKASPSECRRYYFSDYNYVGGCYGCTNEDE-MKWEIYRNGPI 146

Query: 2303 DGGDYGTHDVSMGRYIEGIGHAAS---VMGSSNPEVNNFEKVIRLYSCEGSINPRYIHSV 2359
                    D     Y EG+ H      V    N   N FE            N    H+V
Sbjct: 147  VVAFEVYSDFDF--YKEGVYHHTDNDEVSDGDNDNFNPFELT----------N----HAV 190

Query: 2360 KIIGWGKSSQN-EPYWLCTNSYNQGWGEQGLFKIRRGVNMCSIEDSVMA 2407
             ++GWG+     E YW+  NS+   WGE+G FKIRRG N C IE   + 
Sbjct: 191  LLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVF 239


>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only papain-like
            lysosomal cysteine peptidase exhibiting
            carboxymonopeptidase activity. It can also act as a
            carboxydipeptidase, like cathepsin B, but has been shown
            to preferentially cleave substrates through a
            monopeptidyl carboxypeptidase pathway. The propeptide
            region of cathepsin X, the shortest among papain-like
            peptidases, is covalently attached to the active site
            cysteine in the inactive form of the enzyme. Little is
            known about the biological function of cathepsin X. Some
            studies point to a role in early tumorigenesis. A more
            recent study indicates that cathepsin X expression is
            restricted to immune cells suggesting a role in
            phagocytosis and the regulation of the immune response.
          Length = 239

 Score = 72.8 bits (179), Expect = 1e-13
 Identities = 49/209 (23%), Positives = 71/209 (33%), Gaps = 63/209 (30%)

Query: 2232 SVWDKAISAASVMSDRICIQSKGQVKPI-LSPQHLI-CSCTNCTRMHTKTPMSMCMGGDS 2289
            S W  A  + S ++DRI I  KG    + LS Q +I C+               C GGD 
Sbjct: 30   SCW--AHGSTSALADRINIARKGAWPSVYLSVQVVIDCAGGGS-----------CHGGDP 76

Query: 2290 AAAWMYWINAGLVDG--------------------------------------GDYGTHD 2311
               + Y    G+ D                                        DYG+  
Sbjct: 77   GGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGS-- 134

Query: 2312 VS----MGRYIEGIGHAASVMGSSNPEVNNFEKVIRLYSCEGSINPRYIHSVKIIGWGKS 2367
            VS    M   I   G  +  + ++    N    V + Y  +  IN    H + + GWG  
Sbjct: 135  VSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLIN----HIISVAGWGVD 190

Query: 2368 SQNEPYWLCTNSYNQGWGEQGLFKIRRGV 2396
                 YW+  NS+ + WGE+G F+I    
Sbjct: 191  ENGVEYWIVRNSWGEPWGERGWFRIVTSS 219


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 73.6 bits (180), Expect = 9e-13
 Identities = 63/249 (25%), Positives = 118/249 (47%), Gaps = 20/249 (8%)

Query: 373 DGQYIASSGYDRQIFIWSVYGECENI-----------GVMSGHTGAVMDLKFSTDGCHIF 421
           DG++ A++G +++I I+    ECE+I             ++  +        S     + 
Sbjct: 494 DGEFFATAGVNKKIKIF----ECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVA 549

Query: 422 TCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDPRKKN 481
           + + +  + VWD+ + Q + +MK H   V S D       L+ASGSDD +VK+W   +  
Sbjct: 550 SSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGV 609

Query: 482 QAVSMNNTYQVTSVAF-NDTAECVLTGGIDNDIKMWDLRTNSV-VQKLRGHSDTVTGLSL 539
              ++     +  V F +++   +  G  D+ +  +DLR   + +  + GHS TV+ +  
Sbjct: 610 SIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRF 669

Query: 540 SPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFEKNLLRCAWSVSGLYVTAG 599
             D S ++S++ DNT+++WD+   + G     + S   H   KN +    SVS  Y+  G
Sbjct: 670 V-DSSTLVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFV--GLSVSDGYIATG 726

Query: 600 SADKCVYIW 608
           S    V+++
Sbjct: 727 SETNEVFVY 735



 Score = 37.4 bits (86), Expect = 0.11
 Identities = 46/211 (21%), Positives = 95/211 (45%), Gaps = 24/211 (11%)

Query: 994  TGGGDKSVKLWQLE-LVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDS 1052
            T G +K +K+++ E ++   R+      +++   K+  +     +K +            
Sbjct: 500  TAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQ------------ 547

Query: 1053 KLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSY-DSTLIATGSGDRTVKVWGL 1111
              +A S  +  V+++ +   +    +  H+  V S+D S  D TL+A+GS D +VK+W +
Sbjct: 548  --VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSI 605

Query: 1112 DYGDCHKSLLAHEDSVTGVTFVPKTHYFFT-TSKDGRVKQWDADNFERIVTLHFFISLYG 1170
            + G      +  + ++  V F  ++       S D +V  +D  N +  +      ++ G
Sbjct: 606  NQG-VSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLC-----TMIG 659

Query: 1171 HKLPVLSLDMSYDSTLIATGSGDRTVKVWGL 1201
            H   V  +     STL+++ S D T+K+W L
Sbjct: 660  HSKTVSYVRFVDSSTLVSS-STDNTLKLWDL 689



 Score = 35.8 bits (82), Expect = 0.39
 Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 1/66 (1%)

Query: 176 VVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLI-SGSNDAELKVWNVRD 234
           V SS  +  V++WD         M  H   VW +     D  L+ SGS+D  +K+W++  
Sbjct: 548 VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQ 607

Query: 235 RSDIDT 240
              I T
Sbjct: 608 GVSIGT 613



 Score = 33.9 bits (77), Expect = 1.2
 Identities = 27/123 (21%), Positives = 58/123 (47%), Gaps = 16/123 (13%)

Query: 1268 TGGGDKSVKLWQLE-LVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDS 1326
            T G +K +K+++ E ++   R+      +++   K+  +     +K +            
Sbjct: 500  TAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQ------------ 547

Query: 1327 KLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSY-DSTLIATGSGDRTVKVWGL 1385
              +A S  +  V+++ +   +    +  H+  V S+D S  D TL+A+GS D +VK+W +
Sbjct: 548  --VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSI 605

Query: 1386 DYG 1388
            + G
Sbjct: 606  NQG 608



 Score = 33.5 bits (76), Expect = 1.7
 Identities = 22/61 (36%), Positives = 35/61 (57%), Gaps = 7/61 (11%)

Query: 1145 DGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMSY-DSTLIATGSGDRTVKVWGLDY 1203
            +G V+ WD     ++VT      +  H+  V S+D S  D TL+A+GS D +VK+W ++ 
Sbjct: 554  EGVVQVWDVAR-SQLVT-----EMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQ 607

Query: 1204 G 1204
            G
Sbjct: 608  G 608



 Score = 32.4 bits (73), Expect = 4.1
 Identities = 39/149 (26%), Positives = 66/149 (44%), Gaps = 9/149 (6%)

Query: 88  SQLAVAYTNGSLKTFSLDTTDVISTFTGHKSAITVIQY---DPLGHRLATGSKDTDIVLW 144
           SQ+A +   G ++ + +  + +++    H+  +  I Y   DP    LA+GS D  + LW
Sbjct: 546 SQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPT--LLASGSDDGSVKLW 603

Query: 145 DVVAECGLHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDC-FKTMAAHL 203
            +     +  +   K  I  ++F S+ G      SA D  V  +D         TM  H 
Sbjct: 604 SINQGVSIGTIKT-KANICCVQFPSESGRSLAFGSA-DHKVYYYDLRNPKLPLCTMIGHS 661

Query: 204 TEVWGVCVMREDSYLISGSNDAELKVWNV 232
             V  V  + + S L+S S D  LK+W++
Sbjct: 662 KTVSYVRFV-DSSTLVSSSTDNTLKLWDL 689


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
            Provisional.
          Length = 481

 Score = 69.8 bits (170), Expect = 1e-11
 Identities = 44/229 (19%), Positives = 100/229 (43%), Gaps = 17/229 (7%)

Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
               T   + E+TT  +  +  T T   ++   TT +P+   T  + +V ++ TT + +S 
Sbjct: 148  VAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSG 207

Query: 1974 STTTSSPESESTTTSSLVSESTTTSSP--------ESESTTTISPVSESTTTSSPVSEST 2025
             T  +       +   ++S +  +SS           ++  T +P +E  T +    +  
Sbjct: 208  DTIWALSVKYGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQA 267

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP-----ASESTTTSSP 2080
              +  E+ +T T++   + TTT     + T    P   +  + +P     A+++ T ++ 
Sbjct: 268  APVVKENTNTNTATTEKKETTTQ----QQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNT 323

Query: 2081 ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
             + +T TS+P+  + T ++  + + + ++    S+  +S +S S  I E
Sbjct: 324  NTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSASAIIAE 372



 Score = 66.0 bits (160), Expect = 1e-10
 Identities = 43/220 (19%), Positives = 96/220 (43%), Gaps = 17/220 (7%)

Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1952
            E TT  +  +  T T   ++   TT +P+   T  + +V ++ TT + +S  T  +    
Sbjct: 157  ETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWALSVK 216

Query: 1953 ESTTTSSLVSESTTTSSP--------ESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
               +   ++S +  +SS           ++  T++P++E  T +    +       E+ +
Sbjct: 217  YGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENTN 276

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            T T +   + TTT     + T   +P    T  + PA   +T  N  +++ T  N  + +
Sbjct: 277  TNTATTEKKETTTQ----QQTAPKAP----TEAAKPAPAPSTNTN-ANKTNTNTNTNTNN 327

Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
              +S+P+  + T ++  + + + ++    S+  +S +S S
Sbjct: 328  TNTSTPSKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSAS 367



 Score = 54.0 bits (129), Expect = 8e-07
 Identities = 52/229 (22%), Positives = 103/229 (44%), Gaps = 38/229 (16%)

Query: 1947 TSSPESESTTTSSLVSESTTT-SSPESESTT---------TSSPESESTTTSSLVSESTT 1996
            TS+P      T  +  E+TT  ++P +E+ T         T +P+   T  + +V ++ T
Sbjct: 144  TSTP---VAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNAT 200

Query: 1997 TSSPESEST------------------TTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
            T + +S  T                    +S  S        + ++  T +P++E   T 
Sbjct: 201  THAVKSGDTIWALSVKYGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAE-VKTE 259

Query: 2039 SPASESTTTNNPKSESTTTNNPAS---ESITSSSPASESTTTSSPASESTTTSSPASEST 2095
            +PA+E       K E+T TN   +   E+ T    A ++ T ++  + + +T++ A+++ 
Sbjct: 260  APAAEKQAAPVVK-ENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTN 318

Query: 2096 TTSSPASESTTTSSP--ESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
            T ++  + +T TS+P   + + T S+  + S T   QG S ++   SA+
Sbjct: 319  TNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSAS 367



 Score = 51.3 bits (122), Expect = 6e-06
 Identities = 34/189 (17%), Positives = 81/189 (42%), Gaps = 14/189 (7%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSP--------ESE 1943
               T       ++ TT+  +S  T  +       +   ++S +  +SS           +
Sbjct: 186  VAETKETPVVDQNATTHAVKSGDTIWALSVKYGVSVQDIMSWNNLSSSSIYVGQKLAIKQ 245

Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
            +  T++P++E  T +    +       E+ +T T++ E + TTT     + T   +P   
Sbjct: 246  TANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTT----QQQTAPKAPTEA 301

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
            +    +P   + T ++  + +T T +  + ++T S   + +T +N   + +T  N  +S 
Sbjct: 302  AKP--APAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSSN 359

Query: 2064 SITSSSPAS 2072
            + ++SS ++
Sbjct: 360  NNSNSSASA 368



 Score = 45.2 bits (106), Expect = 4e-04
 Identities = 44/206 (21%), Positives = 85/206 (41%), Gaps = 20/206 (9%)

Query: 1967 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
            TS+P      T   + E+TT  +  +  T T   ++   TT +P    T  +  V ++ T
Sbjct: 144  TSTP---VAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNAT 200

Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS--ESTTTSSPASES 2084
            T + +S  T  +       +  +  S     NN +S SI      +  ++  T++P +E 
Sbjct: 201  THAVKSGDTIWALSVKYGVSVQDIMS----WNNLSSSSIYVGQKLAIKQTANTATPKAE- 255

Query: 2085 TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
              T +PA+E      P  +  T ++      T ++   E+TT  +Q  +P +   +A   
Sbjct: 256  VKTEAPAAEKQAA--PVVKENTNTN------TATTEKKETTT--QQQTAPKAPTEAAKPA 305

Query: 2145 PEEFPNEDVFEHTFAEIPNIDHSNQT 2170
            P    N +  +       N +++N +
Sbjct: 306  PAPSTNTNANKTNTNTNTNTNNTNTS 331



 Score = 44.4 bits (104), Expect = 7e-04
 Identities = 29/159 (18%), Positives = 74/159 (46%), Gaps = 14/159 (8%)

Query: 1871 IFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSL 1930
            I + NN S S++ +      + +   T +P++E  T      +       E+ +T T++ 
Sbjct: 224  IMSWNNLSSSSIYVGQ-KLAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATT 282

Query: 1931 VSESTTTSSPESESTTTSSPE-----SESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
              + TTT     + T   +P      + + +T++  +++ T ++  + +T TS+P   + 
Sbjct: 283  EKKETTTQ----QQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTN 338

Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
            T ++    S T ++  + +    S  + +++ S+ ++E+
Sbjct: 339  TNTN----SNTNTNSNTNANQGSSNNNSNSSASAIIAEA 373


>gnl|CDD|217837 pfam04003, Utp12, Dip2/Utp12 Family.  This domain is found at the
            C-terminus of proteins containing WD40 repeats. These
            proteins are part of the U3 ribonucleoprotein the yeast
            protein is called Utp12 or DIP2.
          Length = 109

 Score = 61.8 bits (151), Expect = 3e-11
 Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 1630 SSELEEVLLVLSLSQVTDLLTHLSSLL-DSSHHRCELVIRVAVFLVRIHHGPLTASKELL 1688
             S++E  LL L  S V  LL  L+  L        E ++R   FL+RIH   L ++  LL
Sbjct: 15   PSDIELTLLSLPFSYVLRLLEFLAERLQAERSPHLEFLLRWLKFLLRIHGKYLVSNPNLL 74

Query: 1689 PVLQRLEQLASRRVEEIR 1706
            P L+ L+++  RRV+++R
Sbjct: 75   PQLRSLQKVLRRRVKDLR 92


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
            translational repressor [Translation, ribosomal structure
            and biogenesis].
          Length = 777

 Score = 64.0 bits (156), Expect = 9e-10
 Identities = 51/257 (19%), Positives = 104/257 (40%), Gaps = 22/257 (8%)

Query: 1894 NTTTN-SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1952
            +T  N  P  +S   ++ +S  ++T+S E  +  ++   +  +   S  S S T +    
Sbjct: 4    DTMNNLLPSIKSQLHHSKKSPPSSTTSQELMNGNSTP--NSFSPIPSKASSSATFTLNLP 61

Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
             + + +  ++ S++ S  +   + + +  S ++ + SL+ E           +++ +P +
Sbjct: 62   INNSVNHKITSSSS-SRRKPSGSWSVAISSSTSGSQSLLMEL---------PSSSFNPST 111

Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
             S   S+    ST   +  S  T +SS AS    +N     +   +N A+ + + SS  +
Sbjct: 112  SSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSNKLPLPNPNHSNSATTNQSGSSFIN 171

Query: 2073 ESTTTSSPASESTTTSSPASESTTTS---------SPASESTTTSSPESESTTTSSPASE 2123
               ++SS    +   SS       TS          P+S+S T S+  S S       S 
Sbjct: 172  TPASSSSQPLTNLVVSSIKRFPYLTSLSPFFNYLIDPSSDSATASADTSPSFNPPPNLSP 231

Query: 2124 STTIEEQGVSPHSEKLS 2140
            +       +SP  +  S
Sbjct: 232  NNLFSTSDLSPLPDTQS 248



 Score = 61.3 bits (149), Expect = 5e-09
 Identities = 42/225 (18%), Positives = 88/225 (39%), Gaps = 6/225 (2%)

Query: 1924 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
            S T ++L+    +      +S  +S+   E    +S  +  +   S  S S T +     
Sbjct: 3    SDTMNNLLPSIKSQLHHSKKSPPSSTTSQELMNGNSTPNSFSPIPSKASSSATFTLNLPI 62

Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS---ESTTTISPESESTTTSSP 2040
            + + +  ++ S   SS   + + + S    S+T+ S        +++ +P + S   S+ 
Sbjct: 63   NNSVNHKITSS---SSSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNS 119

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
            A  ST   N  S  T +++ AS    S+     +   S+ A+ + + SS  +   ++SS 
Sbjct: 120  ALSSTQQGNANSSVTLSSSTASSMFNSNKLPLPNPNHSNSATTNQSGSSFINTPASSSSQ 179

Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
               +   SS +     TS     +  I+    S  +   ++    
Sbjct: 180  PLTNLVVSSIKRFPYLTSLSPFFNYLIDPSSDSATASADTSPSFN 224



 Score = 57.1 bits (138), Expect = 1e-07
 Identities = 50/320 (15%), Positives = 99/320 (30%), Gaps = 17/320 (5%)

Query: 1829 YCDFDSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLN 1888
               F    +  S S   T NL I+      +  S           +   S ST    + +
Sbjct: 40   PNSFSPIPSKASSSATFTLNLPINNSVNHKITSSSSSRRKPSGSWSVAISSSTS--GSQS 97

Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
             L+   +++ +P + S   +N    S+T     + S T SS  + S   S+         
Sbjct: 98   LLMELPSSSFNPSTSSRNKSNSA-LSSTQQGNANSSVTLSSSTASSMFNSNKLP------ 150

Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
                 +   S+  + + + SS  +   ++SS    +   SS+      TS     +   I
Sbjct: 151  ---LPNPNHSNSATTNQSGSSFINTPASSSSQPLTNLVVSSIKRFPYLTSLSPFFN-YLI 206

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASEST---TTNNPKSESTTTNNPASESI 2065
             P S+S T S+  S  +    P        S +  S    T +   +    +++  +E  
Sbjct: 207  DPSSDSATASADTS-PSFNPPPNLSPNNLFSTSDLSPLPDTQSVENNIILNSSSSINELT 265

Query: 2066 TSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASEST 2125
            +               S   +  + +S S   S+   +  + +   S  +        S 
Sbjct: 266  SIYGSVPSIRNLRGLNSALVSFLNVSSSSLAFSALNGKEVSPTGSPSTRSFARVLPKSSP 325

Query: 2126 TIEEQGVSPHSEKLSANEDP 2145
                  +         +   
Sbjct: 326  NNLLTEILTTGVNPPQSLPS 345



 Score = 53.6 bits (129), Expect = 1e-06
 Identities = 39/234 (16%), Positives = 86/234 (36%), Gaps = 12/234 (5%)

Query: 1891 LSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1950
              ++  +++   E    N+  +  +   S  S S T +  +  + + +   + S+++   
Sbjct: 20   SKKSPPSSTTSQELMNGNSTPNSFSPIPSKASSSATFTLNLPINNSVNHKITSSSSSRRK 79

Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
             S S + +   S S + S      +++ +P + S   S   S  ++T    + S+ T+S 
Sbjct: 80   PSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKS--NSALSSTQQGNANSSVTLSS 137

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
             + S+  +S            S +T  S  +  +T        S+++    +  ++S   
Sbjct: 138  STASSMFNSNKLPLPNPNHSNSATTNQSGSSFINT------PASSSSQPLTNLVVSSIKR 191

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
                T+ S   +     SS    S T S+  S  +    P        S +  S
Sbjct: 192  FPYLTSLSPFFNYLIDPSSD---SATASADTS-PSFNPPPNLSPNNLFSTSDLS 241


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
            Rho/Rac/Cdc42-like GTPases [Signal transduction
            mechanisms].
          Length = 1175

 Score = 63.8 bits (155), Expect = 1e-09
 Identities = 49/232 (21%), Positives = 87/232 (37%), Gaps = 18/232 (7%)

Query: 1903 ESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPE-SESTTTSSPESESTTTSSLV 1961
              +  N  +++   + S ES           S+ +SSP+  +   ++ P + S + +S  
Sbjct: 43   PISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSS- 101

Query: 1962 SESTTTSSPESESTTTSSPESES-----TTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
                 TSS  S      SP S+S     ++T S         SP  +    + P S +  
Sbjct: 102  -----TSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSSTHG 156

Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT 2076
            T  P+  +    S        S     S  + + +  S       S S TS+  +  S  
Sbjct: 157  THPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPSIR 216

Query: 2077 TSSPASESTTTSSPASEST------TTSSPASESTTTSSPESESTTTSSPAS 2122
             +S  S ++  S P S +       + SS AS  ++  +P S ++   S +S
Sbjct: 217  KNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSSNSEAMSTSS 268



 Score = 63.0 bits (153), Expect = 2e-09
 Identities = 49/228 (21%), Positives = 86/228 (37%), Gaps = 18/228 (7%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS-----LVSESTTTSSPESESTT 1946
            ++   + S ES        +  S+ +SSP+      S+       S +++TSS  S    
Sbjct: 52   NDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGD 111

Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
              SP S+S +     + S+T S  +S     S  +      + L+  S+T  +      T
Sbjct: 112  QFSPASDSLS----FNPSSTQSRKDSGPGDGSPVQKRK---NPLLPSSSTHGTHPPIVFT 164

Query: 2007 --TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
                S        S     S  + S +  S       S S T+N     S   N+  S +
Sbjct: 165  DNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSN 224

Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
               S P S    +++   +  + SS AS  ++  +P+S ++   S  S
Sbjct: 225  SMPSFPHS----STAVLLKRHSGSSGASLISSNITPSSSNSEAMSTSS 268



 Score = 56.4 bits (136), Expect = 2e-07
 Identities = 43/249 (17%), Positives = 87/249 (34%), Gaps = 13/249 (5%)

Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
            +S++  +             ++  +  +  +++   + S ES           S+ +SSP
Sbjct: 22   KSDAFVSKQLLPPRRL-QRKLNPISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSP 80

Query: 1971 E-SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES-----TTTSSPVSES 2024
            +  +   ++ P + S + +S       TSS  S      SP S+S     ++T S     
Sbjct: 81   KLFQRRNSAGPITHSPSATSS------TSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSG 134

Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASES 2084
                SP  +      P+S +  T+ P   +    + A      S     S  + S    S
Sbjct: 135  PGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPS 194

Query: 2085 TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
                   S S T++  +  S   +S  S ++  S P S +  + ++        L ++  
Sbjct: 195  PHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNI 254

Query: 2145 PEEFPNEDV 2153
                 N + 
Sbjct: 255  TPSSSNSEA 263



 Score = 44.9 bits (106), Expect = 7e-04
 Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 6/186 (3%)

Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
              + S    S+ +SL S +    SP S+S + N      ++T S +       S V +  
Sbjct: 91   PITHSPSATSSTSSLNSNDGDQFSPASDSLSFNP-----SSTQSRKDSGPGDGSPVQKRK 145

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
                P S +  T  P   +    S        S  E  S  + S +  S       S S 
Sbjct: 146  NPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSD 205

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN-NPKSES 2054
            T++     S    S  S ++  S P S +   +   S S+  S  +S  T ++ N ++ S
Sbjct: 206  TSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSSNSEAMS 265

Query: 2055 TTTNNP 2060
            T++  P
Sbjct: 266  TSSKRP 271



 Score = 35.3 bits (81), Expect = 0.51
 Identities = 40/201 (19%), Positives = 69/201 (34%), Gaps = 14/201 (6%)

Query: 1759 HHSRDINSVSPNVTSKILTTDNYSEIIFTTNNNSESTVVMSTLNSLLSENEKLFKPHAKT 1818
               +  NS  P   S   T+        T++ NS      S  +  LS N    +    +
Sbjct: 81   KLFQRRNSAGPITHSPSATS-------STSSLNSNDGDQFSPASDSLSFNPSSTQSRKDS 133

Query: 1819 -PGAEFLIQCQYCDFDSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNN 1877
             PG    +Q +      +  L S S + T+  ++      + A +      + I +  + 
Sbjct: 134  GPGDGSPVQKR-----KNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQ 188

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
            S             S + T+N     S   N+  S ++  S P S +       S S+  
Sbjct: 189  SMQLPSPHF-RQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGA 247

Query: 1938 SSPESESTTTSSPESESTTTS 1958
            S   S  T +SS     +T+S
Sbjct: 248  SLISSNITPSSSNSEAMSTSS 268



 Score = 34.5 bits (79), Expect = 0.92
 Identities = 32/169 (18%), Positives = 63/169 (37%), Gaps = 12/169 (7%)

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
               SES  +    +      S  S S       + +    +  S +++T++  S      
Sbjct: 54   IINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGDQF 113

Query: 2069 SPASES-----TTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES----TTTSS 2119
            SPAS+S     ++T S         SP  +      P+S +  T  P   +    +   +
Sbjct: 114  SPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGA 173

Query: 2120 PASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSN 2168
            P + S   E   +   S +L +    ++F + D   + F+  P+I  ++
Sbjct: 174  PNARSRK-EIPSLGSQSMQLPSPHFRQKFSSSD-TSNGFS-YPSIRKNS 219


>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein.  This is a viral
            attachment glycoprotein from region G of metaviruses. It
            is high in serine and threonine suggesting it is highly
            glycosylated.
          Length = 183

 Score = 58.5 bits (141), Expect = 3e-09
 Identities = 36/154 (23%), Positives = 66/154 (42%), Gaps = 8/154 (5%)

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
            TSSP +ES+  +         ++P S+  T  S  S +   ++  S   T  +   ++T 
Sbjct: 32   TSSPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTTN 91

Query: 2037 TSSPASESTT----TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
                    TT    +    +++         +  + +P + ++T +   + +T TSS   
Sbjct: 92   RQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTPPT-TSTAAVQTTTTTNTSSTGK 150

Query: 2093 ESTTTSS-PASESTTTSSPESESTTTSSPASEST 2125
            E TTTS  P S +TT S    E++  +  +S ST
Sbjct: 151  EPTTTSVQPRSSATTQSH--EETSQANPQSSAST 182



 Score = 55.4 bits (133), Expect = 3e-08
 Identities = 42/170 (24%), Positives = 71/170 (41%), Gaps = 12/170 (7%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
             T+ S    S+       ++ TT +   ++   S   ++ +T SS    +T+ S  E+E 
Sbjct: 24   NTSESEHHTSSPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEP 83

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
            T+T        TT+  ++    T+ P S  T T+  V E   T +     T        +
Sbjct: 84   TSTPD------TTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTP-----PTT 132

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            +T +   + +T T S   E TTTS     S TT +   E++  N  +S S
Sbjct: 133  STAAVQTTTTTNTSSTGKEPTTTSVQPRSSATTQS-HEETSQANPQSSAS 181



 Score = 54.6 bits (131), Expect = 5e-08
 Identities = 54/188 (28%), Positives = 79/188 (42%), Gaps = 15/188 (7%)

Query: 1858 AVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTT 1917
            A+ I +I NY+       + SE         S  S+ T T   ++  T   NP S+  T 
Sbjct: 10   ALNIYLIINYA--TQKNTSESEHHTSSPPTES--SKKTPTTPTDNPDT---NPNSQHPTQ 62

Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1977
             S ES +   ++  S   T  +   ++T         TT  S     TT +  E ++T T
Sbjct: 63   QSTESSTLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRT 122

Query: 1978 SSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTT 2037
            +S      TTS+   ++TTT       T T S   E TTTS     S TT S E E++  
Sbjct: 123  TSRTQTPPTTSTAAVQTTTT-------TNTSSTGKEPTTTSVQPRSSATTQSHE-ETSQA 174

Query: 2038 SSPASEST 2045
            +  +S ST
Sbjct: 175  NPQSSAST 182



 Score = 53.9 bits (129), Expect = 1e-07
 Identities = 41/171 (23%), Positives = 74/171 (43%), Gaps = 12/171 (7%)

Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
             T+ S    S+  +    ++ TT +   ++   S   ++ +T SS    +T+ S  E+E 
Sbjct: 24   NTSESEHHTSSPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEP 83

Query: 1985 TTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASES 2044
            T+T        TT+  ++    T  P S  T T+  V E   T      ++ T +P + S
Sbjct: 84   TSTPD------TTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNT----RTTSRTQTPPTTS 133

Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
            T      + + T++     + TS  P S +TT S    E++  +  +S ST
Sbjct: 134  TAAVQTTTTTNTSSTGKEPTTTSVQPRSSATTQSH--EETSQANPQSSAST 182



 Score = 47.7 bits (113), Expect = 1e-05
 Identities = 34/152 (22%), Positives = 61/152 (40%), Gaps = 10/152 (6%)

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
            TSSP +ES+            ++P S+  T  S ES +   ++  S   T      ++T 
Sbjct: 32   TSSPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTT- 90

Query: 2057 TNNPASESITSSSPASES-------TTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
              N        ++P S S                ++ T +P + ST      + + T+S+
Sbjct: 91   --NRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTPPTTSTAAVQTTTTTNTSST 148

Query: 2110 PESESTTTSSPASESTTIEEQGVSPHSEKLSA 2141
             +  +TT+  P S +TT   +  S  + + SA
Sbjct: 149  GKEPTTTSVQPRSSATTQSHEETSQANPQSSA 180


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of AF4
            (Proto-oncogene AF4) and FMR2 (Fragile X E mental
            retardation syndrome) nuclear proteins. These proteins
            have been linked to human diseases such as acute
            lymphoblastic leukaemia and mental retardation. The
            family also contains a Drosophila AF4 protein homologue
            Lilliputian which contains an AT-hook domain. Lilliputian
            represents a novel pair-rule gene that acts in
            cytoskeleton regulation, segmentation and morphogenesis
            in Drosophila.
          Length = 1154

 Score = 62.2 bits (151), Expect = 3e-09
 Identities = 64/317 (20%), Positives = 115/317 (36%), Gaps = 53/317 (16%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
            ++  ++ S    S L   L  +++ +S E ++T      +   +  S   E   +SS   
Sbjct: 342  SSKTSTNSQSGTSMLEDDLKLSSSEDSDEEQATEKPPSRNTPPSAPSSNPEPAASSS--- 398

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS-------------- 1978
              +++SS  SES++ S  ESES+++ S  +E   T+SPE E  +T+              
Sbjct: 399  -GSSSSSSGSESSSGSDSESESSSSDSEENEPPRTASPEPEPPSTNKWQLDNWLNKVNPH 457

Query: 1979 --SPES------------ESTTTSSLVSESTTTSSPESESTTTISPVSESTTT------- 2017
              SP              E               S E    ++        T        
Sbjct: 458  KVSPAESVSSNPPIKQPMEKEGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESG 517

Query: 2018 -----SSPVSESTT---TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
                 S   SE+     T+  +       + A +  T   P+SE  T    +S       
Sbjct: 518  RGKQKSPAQSEAPPQRRTVGKKQPKKPEKASAGDERTGLRPESEPGTLPYGSSVQTPPDR 577

Query: 2070 PASESTTTSSPA--SESTTTSSPASESTTTSSPA-SESTTTSSPESESTTTSSPASES-- 2124
            P + +  +  P+   E  ++  PA+E     SP+     +    E++S+++ SP  ES  
Sbjct: 578  PKAATKGSRKPSPRKEPKSSVPPAAEKRKYKSPSKIVPKSREFIETDSSSSDSPEDESLP 637

Query: 2125 TTIEEQGVSPHSEKLSA 2141
             + +  G +  S K S 
Sbjct: 638  PSSQSPG-NTESSKESC 653



 Score = 60.7 bits (147), Expect = 1e-08
 Identities = 54/262 (20%), Positives = 94/262 (35%), Gaps = 25/262 (9%)

Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
            T +S      T N  + + ++ +S  S+S T  S++ +    SS E      ++ +  S 
Sbjct: 323  TKDSQHVSPGTQNQKQYDPSSKTSTNSQSGT--SMLEDDLKLSSSEDSDEEQATEKPPSR 380

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
             T      S    +  S  +++SS  SES++ S   SES+++ S E+E   T SP  E  
Sbjct: 381  NTPPSAPSSNPEPAASSSGSSSSSSGSESSSGSDSESESSSSDSEENEPPRTASPEPEPP 440

Query: 2016 TT---------------SSPVSESTTTISP-----ESESTTTSSPASESTTTNNPKSEST 2055
            +T                   +ES ++  P     E E    SS +     +  P  +S+
Sbjct: 441  STNKWQLDNWLNKVNPHKVSPAESVSSNPPIKQPMEKEGKVKSSGSQYHPESKEPPPKSS 500

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS---PASESTTTSSPES 2112
            +       +      +      S   SE+        +          A +  T   PES
Sbjct: 501  SKEKRRPRTAQKGPESGRGKQKSPAQSEAPPQRRTVGKKQPKKPEKASAGDERTGLRPES 560

Query: 2113 ESTTTSSPASESTTIEEQGVSP 2134
            E  T    +S  T  +    + 
Sbjct: 561  EPGTLPYGSSVQTPPDRPKAAT 582



 Score = 49.5 bits (118), Expect = 2e-05
 Identities = 58/242 (23%), Positives = 95/242 (39%), Gaps = 23/242 (9%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
             E++S+++ SPE ES   SS    +T +S     S  T    S   + + L  +   +  
Sbjct: 621  IETDSSSSDSPEDESLPPSSQSPGNTESSKESCASLRTPVCRSSVGSQNDLSKDRLLSPM 680

Query: 1970 PESESTTTSSPESESTTTSSLVSESTT---TSSPESESTTTISP-VSESTTTSSPVSEST 2025
             E+E     SP  +S    SL  +      +  P       + P  +E  + S+P  +++
Sbjct: 681  RETE---LLSPLRDSEERYSLWVKIDLDLLSRIPGHPYKKGVPPKPAEKDSLSAPKKQTS 737

Query: 2026 TTIS--PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
             T S    S+         E+    + K      ++  S S +SS   S S   S  +S 
Sbjct: 738  KTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSR 797

Query: 2084 ST------TTSSPASESTTTSS-------PASESTTTSS-PESESTTTSSPASESTTIEE 2129
            +       + SSP S S+              E T++SS P S S+T SS  S ST+   
Sbjct: 798  NKEEEMLPSPSSPLSSSSPKPEHPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHR 857

Query: 2130 QG 2131
            + 
Sbjct: 858  KT 859



 Score = 46.5 bits (110), Expect = 2e-04
 Identities = 52/246 (21%), Positives = 92/246 (37%), Gaps = 17/246 (6%)

Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
            E +S+   + E     + S +    +    E++S+++ SPE ES   SS    +T +S  
Sbjct: 593  EPKSSVPPAAEKRKYKSPSKIV-PKSREFIETDSSSSDSPEDESLPPSSQSPGNTESSKE 651

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI-- 2028
               S  T    S   + + L  +   +   E+E    +SP+ +S    S   +    +  
Sbjct: 652  SCASLRTPVCRSSVGSQNDLSKDRLLSPMRETE---LLSPLRDSEERYSLWVKIDLDLLS 708

Query: 2029 -SPESESTTTSSPA-SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
              P         P  +E  + + PK +++ T         S   +S+         E+  
Sbjct: 709  RIPGHPYKKGVPPKPAEKDSLSAPKKQTSKT--------ASEKSSSKGKRKHKNDEEADK 760

Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPE 2146
              S        SS  S S+++S   S S   S  +S +   EE   SP S   S++  PE
Sbjct: 761  IESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKE-EEMLPSPSSPLSSSSPKPE 819

Query: 2147 EFPNED 2152
                + 
Sbjct: 820  HPSRKR 825



 Score = 44.9 bits (106), Expect = 6e-04
 Identities = 43/211 (20%), Positives = 69/211 (32%), Gaps = 23/211 (10%)

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
             ES    S + ES    S     T            + L   S       S   + +  +
Sbjct: 236  DESPELKSSIEESYGQQS--FGKTMDELKSPAKAKLTKLKIPSQPVEQSYSGDVSCVEEI 293

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASES----TTTNNPKSESTTTNNPA------ 2061
             +  T S P   +      ++E +    P  +S      T N K    ++          
Sbjct: 294  LKEMTHSWPPPLTAIHTPGKTEPSKFPFPTKDSQHVSPGTQNQKQYDPSSKTSTNSQSGT 353

Query: 2062 ---------SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
                     S S  S    +     S     S  +S+P   ++++ S +S S + SS  S
Sbjct: 354  SMLEDDLKLSSSEDSDEEQATEKPPSRNTPPSAPSSNPEPAASSSGSSSSSSGSESSSGS 413

Query: 2113 ESTTTSSPASESTTIEEQGV-SPHSEKLSAN 2142
            +S + SS +S+S   E     SP  E  S N
Sbjct: 414  DSESESS-SSDSEENEPPRTASPEPEPPSTN 443



 Score = 44.1 bits (104), Expect = 0.001
 Identities = 55/274 (20%), Positives = 98/274 (35%), Gaps = 47/274 (17%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS----------- 1948
             E++S+++++PE ES   SS    +T +S     S  T    S   + +           
Sbjct: 621  IETDSSSSDSPEDESLPPSSQSPGNTESSKESCASLRTPVCRSSVGSQNDLSKDRLLSPM 680

Query: 1949 ------SPESESTTTSSLV---------------SESTTTSSPESESTTTSSPESESTTT 1987
                  SP  +S    SL                 +      P  + + ++  +  S T 
Sbjct: 681  RETELLSPLRDSEERYSLWVKIDLDLLSRIPGHPYKKGVPPKPAEKDSLSAPKKQTSKTA 740

Query: 1988 SSLVS-ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE--- 2043
            S   S +       + E+    S        SS  S S+++    S S   S  +S    
Sbjct: 741  SEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKE 800

Query: 2044 ---------STTTNNPKSESTTTNNPASESITSSS--PASESTTTSSPASESTTTSSPAS 2092
                       ++++PK E  +   P  +  TSSS  P S S+T SS  S ST+      
Sbjct: 801  EEMLPSPSSPLSSSSPKPEHPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTE 860

Query: 2093 ESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
               +++S   + ++  +P   S+    P S  ++
Sbjct: 861  GKGSSTSKEHKGSSGDTPNKASSFPVPPLSNGSS 894



 Score = 42.2 bits (99), Expect = 0.004
 Identities = 50/260 (19%), Positives = 96/260 (36%), Gaps = 27/260 (10%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS--- 1948
            +++++++SPE ES   ++    +T +S     S  T    S   + +    +   +    
Sbjct: 623  TDSSSSDSPEDESLPPSSQSPGNTESSKESCASLRTPVCRSSVGSQNDLSKDRLLSPMRE 682

Query: 1949 ----SPESESTTTSSLV---------------SESTTTSSPESESTTTSSPESESTTTSS 1989
                SP  +S    SL                 +      P  + + ++  +  S T S 
Sbjct: 683  TELLSPLRDSEERYSLWVKIDLDLLSRIPGHPYKKGVPPKPAEKDSLSAPKKQTSKTASE 742

Query: 1990 LVS-ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN 2048
              S +       + E+    S        SS  S S+++    S S   S  +S +    
Sbjct: 743  KSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKEEE 802

Query: 2049 --NPKSESTTTNNPASESITSSSPASESTTTSS--PASESTTTSSPASESTTTSSPASES 2104
                 S   ++++P  E  +   P  +  T+SS  P S S+T SS  S ST+        
Sbjct: 803  MLPSPSSPLSSSSPKPEHPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGK 862

Query: 2105 TTTSSPESESTTTSSPASES 2124
             +++S E + ++  +P   S
Sbjct: 863  GSSTSKEHKGSSGDTPNKAS 882



 Score = 35.7 bits (82), Expect = 0.46
 Identities = 32/129 (24%), Positives = 55/129 (42%), Gaps = 6/129 (4%)

Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
            +S  S S+++++  S S   S   S +     L S S+  SS    S     P  +    
Sbjct: 772  SSSCSPSSSSSHHHSSSNKESRKSSRNKEEEMLPSPSSPLSS---SSPKPEHPSRKRPRR 828

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
                S S   S P S S+T SS +S ST+         +++S E + ++  +P   S+  
Sbjct: 829  QEDTSSS---SGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKASSFP 885

Query: 2018 SSPVSESTT 2026
              P+S  ++
Sbjct: 886  VPPLSNGSS 894



 Score = 31.4 bits (71), Expect = 7.3
 Identities = 29/130 (22%), Positives = 53/130 (40%), Gaps = 6/130 (4%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
             E +   S  +S    ++++N    +S    +   E     SP S  +++S      +  
Sbjct: 769  EEKSSSCSPSSSSSHHHSSSNKESRKS----SRNKEEEMLPSPSSPLSSSSPKPEHPSRK 824

Query: 1938 SSPESESTTTSS-PESESTTTSSLVSESTTTSSP-ESESTTTSSPESESTTTSSLVSEST 1995
                 E T++SS P S S+T SS  S ST+     E + ++TS     S+  +   + S 
Sbjct: 825  RPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKASSF 884

Query: 1996 TTSSPESEST 2005
                  + S+
Sbjct: 885  PVPPLSNGSS 894


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 59.5 bits (143), Expect = 3e-09
 Identities = 40/160 (25%), Positives = 77/160 (48%), Gaps = 9/160 (5%)

Query: 1967 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
            TS   + S ++++     T T+++ + S + S P +  +TT++  S   TT++ +S +TT
Sbjct: 20   TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTT 79

Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST-TTSSPASEST 2085
            T+      T+T +  +   TT+N  + + TT   A     + +    ST  TS+  + S+
Sbjct: 80   TV------TSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSS 133

Query: 2086 TTSSPASEST--TTSSPASESTTTSSPESESTTTSSPASE 2123
            +T+S  +  T  TT +P   S  TS+    +    +   E
Sbjct: 134  STTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDE 173



 Score = 56.5 bits (135), Expect = 3e-08
 Identities = 37/158 (23%), Positives = 68/158 (43%), Gaps = 5/158 (3%)

Query: 1914 STTTSSPESESTTTSS--LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
            +++ SS  S    T +  + + S + S P +  +TT +  S   TT++++S +TTT +  
Sbjct: 25   TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST 84

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
              + T     S ++T +     +    +     T T + V+ + TT    S STT+ +  
Sbjct: 85   GTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTR---SSSTTSATTR 141

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
              + TT +P   S  T+N    +        E   S S
Sbjct: 142  ITNATTLAPTLSSKGTSNATKTTAELPTVPDERQPSLS 179



 Score = 54.5 bits (130), Expect = 2e-07
 Identities = 38/155 (24%), Positives = 77/155 (49%), Gaps = 7/155 (4%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
            TS   + S ++++     T T+++ + S + S P +  +TT +  S   TT++++S +TT
Sbjct: 20   TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTT 79

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST-TTNNPKSEST 2055
            T +    + TT++PV    TTS+  + + TT       T T +    ST  T+N  + S+
Sbjct: 80   TVT---STGTTVTPVP---TTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSS 133

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
            +T +  +    +++ A   ++  +  +  TT   P
Sbjct: 134  STTSATTRITNATTLAPTLSSKGTSNATKTTAELP 168



 Score = 53.0 bits (126), Expect = 5e-07
 Identities = 43/154 (27%), Positives = 74/154 (48%), Gaps = 10/154 (6%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
            T+S  S ++  N   + + TT SP +   +T+   +  TTTS+P + +   S+  +  T+
Sbjct: 25   TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTL-TTTSAPITTTAILSTNTTTVTS 83

Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
            T + V+   TTS+  + + TT       T T +     T TS+  + + TT    S STT
Sbjct: 84   TGTTVTPVPTTSNASTINVTTKVTAQNITATEA----GTGTSTGVTSNVTT---RSSSTT 136

Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
            +++    + TT++P   S  TS   +  TT   P
Sbjct: 137  SATTRITNATTLAPTLSSKGTS--NATKTTAELP 168



 Score = 49.1 bits (116), Expect = 9e-06
 Identities = 37/144 (25%), Positives = 66/144 (45%), Gaps = 8/144 (5%)

Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
            TT SP +   +TN   +  TTTS+P     TT++++S +TTT +    + T     S ++
Sbjct: 44   TTPSPSASGPSTNQSTTL-TTTSAP----ITTTAILSTNTTTVTSTGTTVTPVPTTSNAS 98

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
            T +     +    +     T TS+  + + TT    S STT+++    + TT++P   S 
Sbjct: 99   TINVTTKVTAQNITATEAGTGTSTGVTSNVTTR---SSSTTSATTRITNATTLAPTLSSK 155

Query: 2016 TTSSPVSESTTTISPESESTTTSS 2039
             TS+    +    +   E   + S
Sbjct: 156  GTSNATKTTAELPTVPDERQPSLS 179



 Score = 49.1 bits (116), Expect = 9e-06
 Identities = 42/169 (24%), Positives = 74/169 (43%), Gaps = 8/169 (4%)

Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
            E+    TSS  S ++  +   + + TT SP +   +T+   +  TTTS+P + +   S+ 
Sbjct: 19   ETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTL-TTTSAPITTTAILSTN 77

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
             +  T+T + V+   TTS+  + + TT       T T +    ST   S    + TT S 
Sbjct: 78   TTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTS----NVTTRSS 133

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
            ++ S TT   +  + TT  P   S  +S+    +    +   E   + S
Sbjct: 134  STTSATT---RITNATTLAPTLSSKGTSNATKTTAELPTVPDERQPSLS 179



 Score = 47.6 bits (112), Expect = 3e-05
 Identities = 36/159 (22%), Positives = 68/159 (42%), Gaps = 10/159 (6%)

Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
            TS + + S ++++     T T +  + S + S P +  +TT++  S   TT++  S +TT
Sbjct: 20   TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTT 79

Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
            T       T+T    +   T+S+ ++ + TT   A   T T +       TS+  + + T
Sbjct: 80   T------VTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTG----TSTGVTSNVT 129

Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
            T S  + S TT    + +           +   +  E P
Sbjct: 130  TRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELP 168



 Score = 44.9 bits (105), Expect = 2e-04
 Identities = 30/148 (20%), Positives = 65/148 (43%), Gaps = 10/148 (6%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
            T +  + +     T  S  +   +TN   + +TT+    +  TTT+      +T ++ V+
Sbjct: 31   TASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTS----APITTTAI----LSTNTTTVT 82

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
             + TT +P   ++  S+    +  T+  ++ +   +   +  T+  +  S STT+++   
Sbjct: 83   STGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRI 142

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSP 2020
             + TT +P   S  T +  +  TT   P
Sbjct: 143  TNATTLAPTLSSKGTSN--ATKTTAELP 168



 Score = 41.8 bits (97), Expect = 0.002
 Identities = 34/134 (25%), Positives = 60/134 (44%), Gaps = 10/134 (7%)

Query: 1866 NYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESEST 1925
            N S  + TT+    +T ++ST  + ++   TT +P     TT+N  + + TT       T
Sbjct: 56   NQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVP---TTSNASTINVTTKVTAQNIT 112

Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
             T +     T TS+  + + TT    S STT+++    + TT +P   S  TS+    + 
Sbjct: 113  ATEA----GTGTSTGVTSNVTT---RSSSTTSATTRITNATTLAPTLSSKGTSNATKTTA 165

Query: 1986 TTSSLVSESTTTSS 1999
               ++  E   + S
Sbjct: 166  ELPTVPDERQPSLS 179



 Score = 41.0 bits (95), Expect = 0.004
 Identities = 21/104 (20%), Positives = 37/104 (35%)

Query: 1846 TNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESEST 1905
            TN   ++    T   +    N S I  TT   +++           +  T+  +  S ST
Sbjct: 76   TNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSST 135

Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
            T+      + TT +P   S  TS+    +    +   E   + S
Sbjct: 136  TSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQPSLS 179



 Score = 38.7 bits (89), Expect = 0.025
 Identities = 25/98 (25%), Positives = 49/98 (50%), Gaps = 1/98 (1%)

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
            E+    TSS +S ++  N   + + TT +P++   +++   +  TTTS+P + +   S+ 
Sbjct: 19   ETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTL-TTTSAPITTTAILSTN 77

Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
             +  T+T +  +   TTS+  + + TT   A   T  E
Sbjct: 78   TTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATE 115


>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein TopJ.
             This model describes TopJ (MG_200, CbpA), a DnaJ homolog
            and probable assembly protein of the Mycoplasma terminal
            organelle. The terminal organelle is involved in both
            cytadherence and gliding motility [Cellular processes,
            Chemotaxis and motility].
          Length = 871

 Score = 61.8 bits (149), Expect = 4e-09
 Identities = 57/316 (18%), Positives = 97/316 (30%), Gaps = 31/316 (9%)

Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
            S  +  T    + +       + E+      E E   T + E++ T+      E      
Sbjct: 260  SPTLEVTAPKEVEQPLQPEPVDEETVAETKAEEEPQPTQTVETKPTSAPESTVEENL--- 316

Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
            PE     T + +  S T S+   E T    P+         E    T    V E  T  +
Sbjct: 317  PEINQ-PTQAVQPTSETISTTPVEPTDQLKPKEVDQIQ---EELKKTKEIEVEELPTKKN 372

Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPES--ESTTTSSPASESTTTN-------NP 2050
               E         +       + ++     PE   E+  T     E T +N       +P
Sbjct: 373  DLVEIN-----FDDLEELKFELVQTNQEKEPEKAVENWATDYQLDEPTQSNIDWYKQEDP 427

Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE---STTT 2107
            K       + A+  IT  +  S       P+ EST       E+        E   +   
Sbjct: 428  KDLEQLVQDQATLEITEENQISPEPVEEQPSVESTAPEDQVVEAIKEEEELLEQKKAAEF 487

Query: 2108 SSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE-----HTFAEI- 2161
            +    + T T+S        + Q        +  N D     ++  ++       F  I 
Sbjct: 488  AELFGQPTPTTSIEELLNPEQTQPTEFDEIIIENNLDNVSVADDQNYQLKDDNKKFINIS 547

Query: 2162 -PNIDHSNQTDEAIPE 2176
             P I  SN++D+ I +
Sbjct: 548  LPTIVSSNESDDLIYD 563



 Score = 59.8 bits (144), Expect = 2e-08
 Identities = 51/316 (16%), Positives = 92/316 (29%), Gaps = 26/316 (8%)

Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1952
                      +     + E  S+ T    +       L  E       + E+   +  E 
Sbjct: 238  RELEPQDDSEDDYVIPDAEIISSPTLEVTAPKEVEQPLQPEPV-----DEETVAETKAEE 292

Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
            E   T ++  E+  TS+PES +   + PE    T +   +  T +++P  E T  + P  
Sbjct: 293  EPQPTQTV--ETKPTSAPES-TVEENLPEINQPTQAVQPTSETISTTPV-EPTDQLKP-K 347

Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP----ASESITSS 2068
            E       + ++      E  +                K E   TN       +    ++
Sbjct: 348  EVDQIQEELKKTKEIEVEELPTKKNDLVEINFDDLEELKFELVQTNQEKEPEKAVENWAT 407

Query: 2069 SPASESTTTSS--------PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
                +  T S+        P          A+   T  +  S       P  EST     
Sbjct: 408  DYQLDEPTQSNIDWYKQEDPKDLEQLVQDQATLEITEENQISPEPVEEQPSVESTAPEDQ 467

Query: 2121 ASESTTIEEQGVSPHSEKLSANEDPEEFPN---EDVFEHTFAEIPNIDHSNQTDEAIPET 2177
              E+   EE+ +        A    +  P    E++      + P        +  +   
Sbjct: 468  VVEAIKEEEELLEQKKAAEFAELFGQPTPTTSIEELLNPEQTQ-PTEFDEIIIENNLDNV 526

Query: 2178 FDAREEWPQCKDVIGK 2193
              A ++  Q KD   K
Sbjct: 527  SVADDQNYQLKDDNKK 542


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
            conserved protein present in eukaryotes.
          Length = 680

 Score = 60.7 bits (147), Expect = 8e-09
 Identities = 42/214 (19%), Positives = 76/214 (35%), Gaps = 19/214 (8%)

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
            E+ T S  + E +  SS    ++T   +  +++  +   S+S+ + +PE E +       
Sbjct: 205  ENHTLSVTDKEKSEASSK-GLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHH 263

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
             S                +  +   S        TI      +  S P+S ST  +   +
Sbjct: 264  HSHNHQHHS---------IGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETT 314

Query: 2053 ESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
             ++++    S    SS  A  S    S        SSP S S+   S    S++ S  ES
Sbjct: 315  SNSSSAAAGSIGSKSSKSAKHSNRNKSN-------SSPKSHSSANGS--VPSSSVSDNES 365

Query: 2113 ESTTTSSPASESTTIEEQGVSPHSEKLSANEDPE 2146
            +    S  +S +   ++      +     N  PE
Sbjct: 366  KQKRASKSSSGARDSKKDASGMSANGTVENCIPE 399



 Score = 53.0 bits (127), Expect = 2e-06
 Identities = 41/229 (17%), Positives = 85/229 (37%), Gaps = 16/229 (6%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
            P+ E+ T +  + E +  SS    ++T   +  +++  +   S+S+ + +PE E +    
Sbjct: 202  PKEENHTLSVTDKEKSEASSK-GLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGK 260

Query: 1960 LVSESTTTSSPE-SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
                S          +   S        T  ++   +  S P S ST        ++ +S
Sbjct: 261  DHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTN--GSKETTSNSS 318

Query: 2019 SPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS 2078
            S  + S  +          SS +++ +  N   S   + ++      +SS   +ES    
Sbjct: 319  SAAAGSIGS---------KSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQKR 369

Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
            +  S S    S    S  +++   E+     PE++ +T S+       I
Sbjct: 370  ASKSSSGARDSKKDASGMSANGTVENCI---PENKISTPSAIERLEQDI 415



 Score = 48.0 bits (114), Expect = 7e-05
 Identities = 34/166 (20%), Positives = 67/166 (40%), Gaps = 11/166 (6%)

Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
            P+ E+ T S    E +  SS    ++T      +++  +  +S+S+ + +PE E +    
Sbjct: 202  PKEENHTLSVTDKEKSEASSK-GLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGK 260

Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
                S    +        NN  S+   S     E     S  S  +++S+  S+ TT++S
Sbjct: 261  DHHHSHNHQHHSIGI---NNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNS 317

Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
             ++ + +  S  S+S   S+    ++       SP S   +    P
Sbjct: 318  SSAAAGSIGSKSSKSAKHSNRNKSNS-------SPKSHSSANGSVP 356



 Score = 45.3 bits (107), Expect = 4e-04
 Identities = 36/187 (19%), Positives = 71/187 (37%), Gaps = 7/187 (3%)

Query: 1880 STVVMSTLNSLL-SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTS 1938
            S+  +++   L+  +N+  N   S+S+ +  PE E +        S         S   +
Sbjct: 220  SSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQH---HSIGIN 276

Query: 1939 SPESESTTTSSPESE---STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
            +  S+   +     E   + +  S  S S+T  S E+ S ++S+      + SS  ++ +
Sbjct: 277  NHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHS 336

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
              +   S   +  S      ++S   +ES    + +S S    S    S  + N   E+ 
Sbjct: 337  NRNKSNSSPKSHSSANGSVPSSSVSDNESKQKRASKSSSGARDSKKDASGMSANGTVENC 396

Query: 2056 TTNNPAS 2062
               N  S
Sbjct: 397  IPENKIS 403



 Score = 37.6 bits (87), Expect = 0.10
 Identities = 47/240 (19%), Positives = 91/240 (37%), Gaps = 27/240 (11%)

Query: 1787 TTNNNSESTVVMSTLNSLLSENEKLFK-PHAKTPGAEFLIQCQYCDFDSSMNLLSVSPYI 1845
            ++   + +  ++   NS    N  L K  +++TP  E+  + +      +    S+    
Sbjct: 220  SSKGLTSTKELVPVQNS--GGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIG--- 274

Query: 1846 TNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESEST 1905
             NN       +    I VI+N+S       N S  +   +  +   + N++  S  + S 
Sbjct: 275  INNHHSKHADSKLQTIEVIENHS-------NKSRPSSSSTNGSKETTSNSS--SAAAGSI 325

Query: 1906 TTNNPESESTTT-----SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
             + + +S   +      SSP+S S+   S    S++ S  ES+    S   S +  +   
Sbjct: 326  GSKSSKSAKHSNRNKSNSSPKSHSSANGS--VPSSSVSDNESKQKRASKSSSGARDSKKD 383

Query: 1961 VSESTTTSS-----PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
             S  +   +     PE++ +T S+ E        L +E       ESE    IS ++   
Sbjct: 384  ASGMSANGTVENCIPENKISTPSAIERLEQDIKKLQAELQQARQNESELRNQISLLTSLE 443


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
            glycoprotein G. 
          Length = 408

 Score = 59.7 bits (144), Expect = 9e-09
 Identities = 38/207 (18%), Positives = 76/207 (36%), Gaps = 2/207 (0%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            P  +         +   T      S       + +T+ ++      +  +  S+ T  S 
Sbjct: 139  PICQRDYNPRDRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQ 198

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSS-PESESTTTISPVSESTTTSSPVSESTTTI 2028
            P ++   T++     ++T  + ++ TTTSS PE ++    S    S +   P S ++   
Sbjct: 199  PATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQ 258

Query: 2029 SPESESTTTSSPASESTTTNNPKS-ESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
            S   +    +        T+N +S  ST T  P ++   +  P    T T+   S    +
Sbjct: 259  STTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHS 318

Query: 2088 SSPASESTTTSSPASESTTTSSPESES 2114
            S P  ++  T+    +      P+  S
Sbjct: 319  SPPGVQANPTTQNLVDCKELDPPKPNS 345



 Score = 58.9 bits (142), Expect = 2e-08
 Identities = 37/187 (19%), Positives = 70/187 (37%), Gaps = 10/187 (5%)

Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1968
               + +T+ ++      +  +  S+ T  S P ++   T++     ++T  + ++ TTTS
Sbjct: 168  PKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTS 227

Query: 1969 S-PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
            S PE ++    S    S +         +T+S +  STT           + P + +  +
Sbjct: 228  SNPEPQTEPPPSQRGPSGSP----QHPPSTTSQDQ-STTGDGQEHTQRRKTPPATSNRRS 282

Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
                  ST T  P ++   T  P    T T    S    SS P  ++  T+    +    
Sbjct: 283  P----HSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKEL 338

Query: 2088 SSPASES 2094
              P   S
Sbjct: 339  DPPKPNS 345



 Score = 58.5 bits (141), Expect = 2e-08
 Identities = 41/174 (23%), Positives = 68/174 (39%), Gaps = 4/174 (2%)

Query: 1896 TTNSPESESTTTNNP--ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS-PES 1952
            TT+   S  T  ++P   S+ T  S P ++   T++     ++T    ++ TTTSS PE 
Sbjct: 173  TTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEP 232

Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES-ESTTTISPV 2011
            ++    S    S +   P S ++   S   +    +        TS+  S  ST T  P 
Sbjct: 233  QTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPT 292

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
            ++   T  P    T T    S    +S P  ++  T     +    + P   SI
Sbjct: 293  TKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNSI 346



 Score = 55.1 bits (132), Expect = 3e-07
 Identities = 52/295 (17%), Positives = 101/295 (34%), Gaps = 29/295 (9%)

Query: 1851 ISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNP 1910
            ++     A++IS+    + +   T      T   S  N      TTT++    +TTT + 
Sbjct: 39   LTGTTTIALSISISVEQAVLSDCTTYLRNGTTSGSLSNP---TRTTTST----ATTTRDI 91

Query: 1911 ESESTTTSSPESESTTTSSLV-----SESTTT--------------SSPESESTTTSSPE 1951
                TT +  + ES +   +        S                 S P  +        
Sbjct: 92   RGLQTTRTR-KLESCSNVQIAYGDMHDRSNPVLGGIDCLGLLALCESGPICQRDYNPRDR 150

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
             +   T      S       + +T+ ++      +  +  S+ T  S P ++   T +  
Sbjct: 151  PKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATAN 210

Query: 2012 SESTTTSSPVSESTTT-ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
               ++T    ++ TTT  +PE ++    S    S +  +P S ++   +   +    +  
Sbjct: 211  QRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQR 270

Query: 2071 ASESTTTSSPAS-ESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
                  TS+  S  ST T  P ++   T  P    T T+   S    +S P  ++
Sbjct: 271  RKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQA 325



 Score = 53.1 bits (127), Expect = 1e-06
 Identities = 36/175 (20%), Positives = 60/175 (34%), Gaps = 4/175 (2%)

Query: 1954 STTTSSLVSESTTTSSP--ESESTTTSSPES--ESTTTSSLVSESTTTSSPESESTTTIS 2009
            + TTS   S  T  S P   S+ T  S P +    T T++    ST     +  +T++  
Sbjct: 171  AVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNP 230

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
                    S      +    P + S   S+       T   K+   T+N  +  S  +  
Sbjct: 231  EPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPP 290

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
            P ++   T  P    T T+   S    +S P  ++  T+    +      P   S
Sbjct: 291  PTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNS 345



 Score = 49.3 bits (117), Expect = 2e-05
 Identities = 41/177 (23%), Positives = 63/177 (35%), Gaps = 8/177 (4%)

Query: 1991 VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
            V+ S TTS P   S  T    S+ T  S P ++   T +     ++T    ++ TTT   
Sbjct: 172  VTTSKTTSWPTEVSHPTYP--SQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTT--S 227

Query: 2051 KSESTTTNNPASESITSSSPASESTTTS----SPASESTTTSSPASESTTTSSPASESTT 2106
             +    T  P S+   S SP    +TTS    +       T    +   T++  +  ST 
Sbjct: 228  SNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTA 287

Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPN 2163
            T  P ++   T  P    T   + G SP        +      N    +      PN
Sbjct: 288  TPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPN 344



 Score = 43.5 bits (102), Expect = 0.001
 Identities = 25/131 (19%), Positives = 50/131 (38%), Gaps = 2/131 (1%)

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
            S P+ +         +   T      S       + +T+        ++  +  S+ T  
Sbjct: 137  SGPICQRDYNPRDRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQ 196

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSS-PESESTTTSSPASESTTIEEQGVSPHS 2136
            S PA++   T++     ++T    ++ TTTSS PE ++    S    S + +    +  S
Sbjct: 197  SQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTT-S 255

Query: 2137 EKLSANEDPEE 2147
            +  S   D +E
Sbjct: 256  QDQSTTGDGQE 266



 Score = 42.3 bits (99), Expect = 0.002
 Identities = 32/127 (25%), Positives = 51/127 (40%), Gaps = 8/127 (6%)

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE-SITS 2067
            + V+ S TTS P   S  T    S+ T  S PA++   T       ++T    ++ + TS
Sbjct: 170  TAVTTSKTTSWPTEVSHPTY--PSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTS 227

Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
            S+P  ++    S    S +   P S    T+S    STT    E      + PA+ +   
Sbjct: 228  SNPEPQTEPPPSQRGPSGSPQHPPS----TTSQDQ-STTGDGQEHTQRRKTPPATSNRRS 282

Query: 2128 EEQGVSP 2134
                 +P
Sbjct: 283  PHSTATP 289



 Score = 40.4 bits (94), Expect = 0.010
 Identities = 30/161 (18%), Positives = 51/161 (31%), Gaps = 10/161 (6%)

Query: 1994 STTTSSPESESTTTISPV--SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
            + TTS   S  T    P   S+ T  S P        + +   T T++    ST     +
Sbjct: 171  AVTTSKTTSWPTEVSHPTYPSQVTPQSQP--------ATQGHQTATANQRLSSTEPVGTQ 222

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
              +T++N         S      +    P++ S   S+       T    +   T++   
Sbjct: 223  GTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRS 282

Query: 2112 SESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNED 2152
              ST T  P ++          P +   S +  P   P   
Sbjct: 283  PHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGV 323



 Score = 39.3 bits (91), Expect = 0.023
 Identities = 30/130 (23%), Positives = 54/130 (41%), Gaps = 8/130 (6%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
            + +   +S+   + ++ TTT+S     T    P S+   + SP+   +TTS    + +TT
Sbjct: 207  ATANQRLSSTEPVGTQGTTTSSNPEPQTEP--PPSQRGPSGSPQHPPSTTSQ---DQSTT 261

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
               +  +    +P + S   S     ST T  P ++   T  P    T T+   S    +
Sbjct: 262  GDGQEHTQRRKTPPATSNRRSPH---STATPPPTTKRQETGRPTPRPTATTQSGSSPPHS 318

Query: 1998 SSPESESTTT 2007
            S P  ++  T
Sbjct: 319  SPPGVQANPT 328



 Score = 33.1 bits (75), Expect = 1.8
 Identities = 28/157 (17%), Positives = 52/157 (33%), Gaps = 10/157 (6%)

Query: 1888 NSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1947
             +     T T +    ST     +  +T++ +PE ++    S    S +   P     +T
Sbjct: 199  PATQGHQTATANQRLSSTEPVGTQGTTTSS-NPEPQTEPPPSQRGPSGSPQHP----PST 253

Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
            +S +  +T           T   ++   T++     ST T    ++   T  P    T T
Sbjct: 254  TSQDQSTTGDG-----QEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTAT 308

Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASES 2044
                S    +S P  ++  T     +      P   S
Sbjct: 309  TQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNS 345



 Score = 32.3 bits (73), Expect = 3.6
 Identities = 20/104 (19%), Positives = 38/104 (36%), Gaps = 2/104 (1%)

Query: 2084 STTTSSPASESTTTSSPA--SESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA 2141
            + TTS   S  T  S P   S+ T  S P ++   T++     ++ E  G    +   + 
Sbjct: 171  AVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNP 230

Query: 2142 NEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREEWP 2185
                E  P++     +    P+    +Q+     +    R + P
Sbjct: 231  EPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTP 274


>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).  This
            family consists of several brain acid soluble protein 1
            (BASP1) or neuronal axonal membrane protein NAP-22. The
            BASP1 is a neuron enriched Ca(2+)-dependent
            calmodulin-binding protein of unknown function.
          Length = 233

 Score = 56.8 bits (136), Expect = 2e-08
 Identities = 39/201 (19%), Positives = 85/201 (42%), Gaps = 17/201 (8%)

Query: 1938 SSPESESTTTSSPESESTTTSSLVSEST-TTSSPESESTTTSSPESESTTTSSLVSESTT 1996
            ++ E E T   + E+++   ++ V E+       +++ T   + E E    ++   E   
Sbjct: 28   AATEEEGTPKENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGEKEAAAAKEEAP 87

Query: 1997 TSSPESE---STTTISPVSESTTTSSPVSESTTTISPE----SESTTTSSPASESTTTNN 2049
             + PE     +     P   S     P +        E    SE+++  + ++       
Sbjct: 88   KAEPEKTEGAAEAKAEPPKASDPEQEPAAAPGPAAGGEAPKASEASSQPAESAAPAKEEE 147

Query: 2050 PKSE---STTTNNPAS---ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE 2103
               E   +  T  PA+   E+ + ++PAS+S  +SS A+ S+  +  A+E+    S  ++
Sbjct: 148  KSKEEGEAKKTEAPAAAAQETKSDAAPASDSKPSSSEAAPSSKETPAATEA---PSSTAK 204

Query: 2104 STTTSSPESESTTTSSPASES 2124
            ++  ++P  E   + +PA+ S
Sbjct: 205  ASAPAAPAEEVKPSEAPAANS 225



 Score = 53.3 bits (127), Expect = 3e-07
 Identities = 37/201 (18%), Positives = 83/201 (41%), Gaps = 3/201 (1%)

Query: 1909 NPESESTTTSSPESESTTTSSLVSEST-TTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
              E E T   + E+++   ++ V E+       +++ T   + E E    ++   E    
Sbjct: 29   ATEEEGTPKENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGEKEAAAAKEEAPK 88

Query: 1968 SSPE-SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
            + PE +E    +  E    +       +    +   E+       S+   +++P  E   
Sbjct: 89   AEPEKTEGAAEAKAEPPKASDPEQEPAAAPGPAAGGEAPKASEASSQPAESAAPAKEEEK 148

Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
            +   E E+  T +PA+ +  T +  + ++ +   +SE+  SS     +T   S  ++++ 
Sbjct: 149  S-KEEGEAKKTEAPAAAAQETKSDAAPASDSKPSSSEAAPSSKETPAATEAPSSTAKASA 207

Query: 2087 TSSPASESTTTSSPASESTTT 2107
             ++PA E   + +PA+ S  T
Sbjct: 208  PAAPAEEVKPSEAPAANSDQT 228



 Score = 43.7 bits (102), Expect = 5e-04
 Identities = 34/186 (18%), Positives = 74/186 (39%), Gaps = 14/186 (7%)

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
            ++ E E T   + E+++   ++ V E+     P+ ++  T +   E        +     
Sbjct: 28   AATEEEGTPKENEEAQAAAETTEVKEAKE-EKPDKDAQDTANKTEEKEGEKEAAAAKEEA 86

Query: 2028 ISPESESTTTSSPA-SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
               E E T  ++ A +E    ++P+ E      PA+     +  ASE+++  + ++    
Sbjct: 87   PKAEPEKTEGAAEAKAEPPKASDPEQEPAAAPGPAAGG--EAPKASEASSQPAESAAPAK 144

Query: 2087 TSSPASE---STTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
                + E   +  T +PA+ +        E+ + ++PAS+S     +      E  +A E
Sbjct: 145  EEEKSKEEGEAKKTEAPAAAA-------QETKSDAAPASDSKPSSSEAAPSSKETPAATE 197

Query: 2144 DPEEFP 2149
             P    
Sbjct: 198  APSSTA 203


>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 944

 Score = 59.4 bits (144), Expect = 3e-08
 Identities = 53/343 (15%), Positives = 113/343 (32%), Gaps = 49/343 (14%)

Query: 1887 LNSLLSENTTTN----SPESESTTTNNPESESTTTS--SPESESTTTSSLVSEST----- 1935
              + L+E TT      +  +E+    +  +E   T   + + ES   ++L +E       
Sbjct: 407  KKTALTEQTTAQQQVQAANAEAVAEADASAEPADTVEQALDDESELLAALNAEQAVILSQ 466

Query: 1936 ---------------TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
                            ++ PE   +T        + T + V +++ +++  +++T   + 
Sbjct: 467  AQSQGFEASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNY 526

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
             +E T  S+ + E                 V+ S+ + + +S+     +    + + +  
Sbjct: 527  SAEDTLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEA 586

Query: 2041 ASESTTTNNPKSESTTTNNPASESITS---------------SSPASESTTTSSPASEST 2085
               S + +   + +T   + A + I                  SP       SS   +  
Sbjct: 587  QPSSQSLSPISAVTTAAASLADDDILDAVLAARDSLLSDLDALSPKEGDGKKSSADRKPK 646

Query: 2086 TTSS---PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
            T  S   PAS S   SSP +  T+ S         ++  S        G +P    +   
Sbjct: 647  TPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAALASGSAPAPPPVPDP 706

Query: 2143 ED--PEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
             D  P E   E        + PN        E++ +  ++  +
Sbjct: 707  YDRPPWEEAPEVASA---NDGPNNAAEGNLSESVEDASNSELQ 746



 Score = 54.7 bits (132), Expect = 6e-07
 Identities = 32/233 (13%), Positives = 83/233 (35%), Gaps = 21/233 (9%)

Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST-TTSSLVSESTTTSSP 1970
             E  T S+  +      +   +    +  E ++  T    ++     ++  + +   +S 
Sbjct: 377  PEGQTPSALAAAVQAPHANEPQFVNAAPAEKKTALTEQTTAQQQVQAANAEAVAEADASA 436

Query: 1971 ESESTTTSSPESESTTTSSLVSEST--------------------TTSSPESESTTTISP 2010
            E   T   + + ES   ++L +E                       ++ PE   +T    
Sbjct: 437  EPADTVEQALDDESELLAALNAEQAVILSQAQSQGFEASSSLDADNSAVPEQIDSTAEQS 496

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
            V   + T + V +++ + +  +++T   + ++E T  +N   E     + A         
Sbjct: 497  VVNPSVTDTQVDDTSASNNSAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLDAYQDDY 556

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
             + S+ + +  S+    S+    + + +     S + S   + +T  +S A +
Sbjct: 557  VAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSPISAVTTAAASLADD 609



 Score = 37.0 bits (86), Expect = 0.18
 Identities = 23/193 (11%), Positives = 58/193 (30%), Gaps = 13/193 (6%)

Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
            +     SL    T ++   +         +E    ++  +E  T         TT+    
Sbjct: 369  DDPAEISLPEGQTPSALAAAVQAP---HANEPQFVNAAPAEKKT----ALTEQTTAQQQV 421

Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
            ++           +     +      +   ES   ++  +E     S A      +S + 
Sbjct: 422  QAANAEAVAEADASAEPADTVE---QALDDESELLAALNAEQAVILSQAQSQGFEASSSL 478

Query: 2103 ESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIP 2162
            ++  ++ PE   +T        +  + Q     +   S N   +   +++       E  
Sbjct: 479  DADNSAVPEQIDSTAEQSVVNPSVTDTQVDDTSA---SNNSAADNTVDDNYSAEDTLESN 535

Query: 2163 NIDHSNQTDEAIP 2175
             +D  +   ++ P
Sbjct: 536  GLDEGDYAQDSAP 548


>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score = 58.8 bits (142), Expect = 3e-08
 Identities = 46/189 (24%), Positives = 84/189 (44%), Gaps = 12/189 (6%)

Query: 346 GRTSNLFAPIMLLEGHGGEIFCSKYHPDGQ-YIASSGYDRQIFIWSV-YGECENIGVMSG 403
           G T N+  PI+ L+GH  ++    +HP     +AS+G D  + +W V  G+     V+  
Sbjct: 109 GLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVE--VIKC 166

Query: 404 HTGAVMDLKFSTDGCHIFTCSTDQTLAVWDLEKGQRIKKMKGH-STFVNSCDPVRRGQLL 462
           H+  +  L+++ DG  + T S D+ L + D   G  +  ++ H S     C   +R  L+
Sbjct: 167 HSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLI 226

Query: 463 IASG---SDDCTVKVWDPRKKNQAVSMNNTYQVTSVAF----NDTAECVLTGGIDNDIKM 515
           I  G   S    + +WD RK     S  +  Q +++       DT    +    + +I+ 
Sbjct: 227 ITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDTNLLYIGSKGEGNIRC 286

Query: 516 WDLRTNSVV 524
           ++L    + 
Sbjct: 287 FELMNERLT 295



 Score = 54.9 bits (132), Expect = 4e-07
 Identities = 54/215 (25%), Positives = 86/215 (40%), Gaps = 50/215 (23%)

Query: 403 GHTGAVMDLKFST-DGCHIFTCSTDQTLAVWDL-EKGQRIKKMKGHSTFVNSCDPVRRGQ 460
           G  G ++D+ F+  D   +FT S D T+  W + E+G                       
Sbjct: 73  GQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEG----------------------- 109

Query: 461 LLIASGSDDCTVKVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVL-TGGIDNDIKMWDLR 519
             +     D  V +    KK           V  V+F+ +A  VL + G D  + +WD+ 
Sbjct: 110 --LTQNISDPIVHLQGHTKK-----------VGIVSFHPSAMNVLASAGADMVVNVWDVE 156

Query: 520 TNSVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHN 579
               V+ ++ HSD +T L  + DGS + + + D  + I D R          V S   H 
Sbjct: 157 RGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPR------DGTIVSSVEAHA 210

Query: 580 FEKNLLRCAWSV-SGLYVTAG---SADKCVYIWDT 610
             K+  RC W+    L +T G   S  + + +WDT
Sbjct: 211 SAKS-QRCLWAKRKDLIITLGCSKSQQRQIMLWDT 244



 Score = 46.0 bits (109), Expect = 2e-04
 Identities = 44/180 (24%), Positives = 78/180 (43%), Gaps = 17/180 (9%)

Query: 354 PIMLLEGHGGEIFCSKYHP-DGQYIASSGYDRQIFIWSVYGE------CENIGVMSGHTG 406
           PI+L  G  G I    ++P D Q + ++  D  I  W +  E       + I  + GHT 
Sbjct: 69  PILL--GQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTK 126

Query: 407 AVMDLKFSTDGCHIF-TCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIAS 465
            V  + F     ++  +   D  + VWD+E+G+ ++ +K HS  + S +    G LL  +
Sbjct: 127 KVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLL-CT 185

Query: 466 GSDDCTVKVWDPRKKN--QAVSMNNTYQVTSVAFNDTAECVLTGGIDN----DIKMWDLR 519
            S D  + + DPR      +V  + + +     +    + ++T G        I +WD R
Sbjct: 186 TSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTR 245



 Score = 43.3 bits (102), Expect = 0.002
 Identities = 34/128 (26%), Positives = 57/128 (44%), Gaps = 7/128 (5%)

Query: 1120 LLAHEDSVTGVTFVP-KTHYFFTTSKDGRVKQWD--ADNFERIVTLHFFISLYGH--KLP 1174
            LL  E  +  V F P      FT S+DG +  W    +   + ++    + L GH  K+ 
Sbjct: 71   LLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPI-VHLQGHTKKVG 129

Query: 1175 VLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSK 1234
            ++S   S  + L + G+ D  V VW ++ G   + +  H D +T + +        TTSK
Sbjct: 130  IVSFHPSAMNVLASAGA-DMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSK 188

Query: 1235 DGRVKQWD 1242
            D ++   D
Sbjct: 189  DKKLNIID 196



 Score = 39.1 bits (91), Expect = 0.031
 Identities = 33/130 (25%), Positives = 58/130 (44%), Gaps = 13/130 (10%)

Query: 115 GHKSAITVIQYDPL-GHRLATGSKDTDIVLWDVVAECGLHR--------LSGHKGVITDI 165
           G +  I  + ++P    +L T S+D  I+ W +  E GL +        L GH   +  +
Sbjct: 73  GQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEE-GLTQNISDPIVHLQGHTKKVGIV 131

Query: 166 RFMSQPGHHFVVSSA-KDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLISGSND 224
            F   P    V++SA  D  V +WD + G   + +  H  ++  +    + S L + S D
Sbjct: 132 SF--HPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKD 189

Query: 225 AELKVWNVRD 234
            +L + + RD
Sbjct: 190 KKLNIIDPRD 199



 Score = 32.6 bits (74), Expect = 3.2
 Identities = 22/79 (27%), Positives = 38/79 (48%), Gaps = 3/79 (3%)

Query: 1076 ISLYGH--KLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFV 1133
            + L GH  K+ ++S   S  + L + G+ D  V VW ++ G   + +  H D +T + + 
Sbjct: 119  VHLQGHTKKVGIVSFHPSAMNVLASAGA-DMVVNVWDVERGKAVEVIKCHSDQITSLEWN 177

Query: 1134 PKTHYFFTTSKDGRVKQWD 1152
                   TTSKD ++   D
Sbjct: 178  LDGSLLCTTSKDKKLNIID 196



 Score = 32.6 bits (74), Expect = 3.2
 Identities = 22/79 (27%), Positives = 38/79 (48%), Gaps = 3/79 (3%)

Query: 1350 ISLYGH--KLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFV 1407
            + L GH  K+ ++S   S  + L + G+ D  V VW ++ G   + +  H D +T + + 
Sbjct: 119  VHLQGHTKKVGIVSFHPSAMNVLASAGA-DMVVNVWDVERGKAVEVIKCHSDQITSLEWN 177

Query: 1408 PKTHYFFTTSKDGRVKQWD 1426
                   TTSKD ++   D
Sbjct: 178  LDGSLLCTTSKDKKLNIID 196


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the protein
            product of the dentatorubral-pallidoluysian atrophy
            (DRPLA) gene. DRPLA OMIM:125370 is a progressive
            neurodegenerative disorder. It is caused by the expansion
            of a CAG repeat in the DRPLA gene on chromosome 12p. This
            results in an extended polyglutamine region in
            atrophin-1, that is thought to confer toxicity to the
            protein, possibly through altering its interactions with
            other proteins. The expansion of a CAG repeat is also the
            underlying defect in six other neurodegenerative
            disorders, including Huntington's disease. One
            interaction of expanded polyglutamine repeats that is
            thought to be pathogenic is that with the short glutamine
            repeat in the transcriptional coactivator CREB binding
            protein, CBP. This interaction draws CBP away from its
            usual nuclear location to the expanded polyglutamine
            repeat protein aggregates that are characteristic of the
            polyglutamine neurodegenerative disorders. This
            interferes with CBP-mediated transcription and causes
            cytotoxicity.
          Length = 979

 Score = 58.9 bits (142), Expect = 3e-08
 Identities = 51/256 (19%), Positives = 99/256 (38%), Gaps = 20/256 (7%)

Query: 1884 MSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1943
            MSTL S       T SP+  ++ TN  +  S+  +SP + ST+++   +EST   + + +
Sbjct: 14   MSTLRS--GRKKQTASPDGRASPTNE-DQRSSGRNSPSAASTSSNDSKAESTKKPNKKIK 70

Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
               TS  +S     +    E   + + E E  T    +++  +  +  SE       E E
Sbjct: 71   EEATSPLKS-----TKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGE 125

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA---SESTTTNNPKSESTTTNNP 2060
            S       S+S + +   S     I  ++ S++ S P+   +ES + ++ + +      P
Sbjct: 126  S-------SDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDSDSSAQQQLLQPQGP 178

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT--SSPESESTTTS 2118
             S  +   +  + S    +P++++         +     P   S  +  S+P        
Sbjct: 179  PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238

Query: 2119 SPASESTTIEEQGVSP 2134
            SP            SP
Sbjct: 239  SPHPPLQPQTASQQSP 254



 Score = 50.5 bits (120), Expect = 1e-05
 Identities = 44/218 (20%), Positives = 87/218 (39%), Gaps = 7/218 (3%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
            T+SP+  ++ T+  +  S   S   + ST+++  ++EST   + + +   TS L S    
Sbjct: 25   TASPDGRASPTNEDQRSSGRNSP-SAASTSSNDSKAESTKKPNKKIKEEATSPLKSTKRQ 83

Query: 1997 TSSPESESTTTISPVSESTTT---SSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
               P S++       ++ + T   S P S S      E E  ++ S +     +++PK +
Sbjct: 84   REKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPK-D 142

Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
                N  +S SI S    +ES + SS   +      P S      +  + S    +P ++
Sbjct: 143  IDQDNRSSSPSIPSPQD-NESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQ 201

Query: 2114 STTTSSPASESTTIEEQGVSPHSEKLSA-NEDPEEFPN 2150
            +         +    +         +SA +  P+  P+
Sbjct: 202  AVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPS 239



 Score = 47.0 bits (111), Expect = 1e-04
 Identities = 45/242 (18%), Positives = 86/242 (35%), Gaps = 14/242 (5%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
            +T    E  ++ T  PE  +   S  +  S   S   SE       E ES+ + S   E 
Sbjct: 79   STKRQREKPASDTEEPERVTAKKSKTQELSRPNSP--SEGEGEGEGEGESSDSRSVNEEG 136

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
            ++    + +   +SSP    +  S  ++ES + SS   +      P S     + P +  
Sbjct: 137  SSDPKDIDQDNRSSSP----SIPSPQDNESDSDSSAQQQLLQPQGPPSIQ---VPPGAAL 189

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
              ++ P + S   + P+  S   + PA +    +     S  + +P  + + S  P  + 
Sbjct: 190  APSAPPPTPSAQAVPPQG-SPIAAQPAPQPQQPSPLSLISAPSLHP--QRLPSPHPPLQP 246

Query: 2075 TTTS--SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV 2132
             T S  SP   + ++  P S       P   +        +  +++ P        +   
Sbjct: 247  QTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPP 306

Query: 2133 SP 2134
             P
Sbjct: 307  LP 308



 Score = 44.7 bits (105), Expect = 7e-04
 Identities = 42/253 (16%), Positives = 77/253 (30%), Gaps = 19/253 (7%)

Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP-- 1950
            E  T    +++  +  N  SE       E ES+ + S+  E ++      +   +SSP  
Sbjct: 95   ERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSI 154

Query: 1951 ----ESESTTTSSLVSESTTTSSPESE---STTTSSPESESTTTSSLVSESTTTSSPESE 2003
                ++ES + SS   +      P S         +P +   T S+        + P   
Sbjct: 155  PSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSA-------QAVPPQG 207

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
            S     P  +    S     S  ++ P+    +   P    T +         ++     
Sbjct: 208  SPIAAQPAPQPQQPSPLSLISAPSLHPQ-RLPSPHPPLQPQTASQQSPQPPAPSSRHPQS 266

Query: 2064 SITSSSPASESTTTSSPASESTTTSSPAS--ESTTTSSPASESTTTSSPESESTTTSSPA 2121
            S     P         P      +S+P        +  P     + + P S +  + S  
Sbjct: 267  SHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSAL 326

Query: 2122 SESTTIEEQGVSP 2134
                   EQ + P
Sbjct: 327  QPQQPPREQPLPP 339



 Score = 33.5 bits (76), Expect = 1.8
 Identities = 25/146 (17%), Positives = 42/146 (28%), Gaps = 10/146 (6%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            P+S+   +   +    T S  +    +T       +            +S    +     
Sbjct: 410  PQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPP 469

Query: 1970 PESESTTTSSPESESTTTSSL-VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI 2028
                ST  + P + S +        S+   +        I    E      P+ E+    
Sbjct: 470  SLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEE------PLDEAE--- 520

Query: 2029 SPESESTTTSSPASESTTTNNPKSES 2054
             PES      SP+ E T  N P   S
Sbjct: 521  EPESPPPPPRSPSPEPTVVNTPSHAS 546


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General function
            prediction only].
          Length = 872

 Score = 59.1 bits (143), Expect = 3e-08
 Identities = 29/115 (25%), Positives = 52/115 (45%), Gaps = 3/115 (2%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
            +    P  E+                 +  T TS   S + T   P+S + T ++    +
Sbjct: 732  SLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSA 791

Query: 1955 TTTSSLVSESTTTSSPESE---STTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
             + +S++ E+T+    ++     TTTSSP    TT+ +  S STTT++  S++TT
Sbjct: 792  YSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTT 846



 Score = 57.6 bits (139), Expect = 8e-08
 Identities = 27/97 (27%), Positives = 52/97 (53%), Gaps = 9/97 (9%)

Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
             +  T TS   S + T   P+S + T ++    + + +S++ E+T+    ++ + T    
Sbjct: 758  PASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT---- 813

Query: 2011 VSESTTTSSP-VSESTTTISPESESTTTSSPASESTT 2046
                TTTSSP  +++T+     + +TTT+SP S++TT
Sbjct: 814  ---QTTTSSPSPTQTTSPTQTSTSTTTTTSP-SQTTT 846



 Score = 57.6 bits (139), Expect = 9e-08
 Identities = 37/146 (25%), Positives = 61/146 (41%), Gaps = 11/146 (7%)

Query: 1986 TTSSLVSESTTTSSPESESTTTIS----PVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
            TT S  +++  T       T   S    P  E+             I   +  T TS   
Sbjct: 709  TTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITT 768

Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
            S + T   P+S + T    +  + +++S   E+T+     + + T        TTTSSP+
Sbjct: 769  SGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT-------QTTTSSPS 821

Query: 2102 SESTTTSSPESESTTTSSPASESTTI 2127
               TT+ +  S STTT++  S++TT 
Sbjct: 822  PTQTTSPTQTSTSTTTTTSPSQTTTG 847



 Score = 56.4 bits (136), Expect = 2e-07
 Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 7/94 (7%)

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
              T TS   S + T   P+S + T ++L   + + +S   E+T+    ++ + T      
Sbjct: 760  SGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT------ 813

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
              TTTSSP    TT+ +  S STTT++  S++TT
Sbjct: 814  -QTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTT 846



 Score = 54.9 bits (132), Expect = 6e-07
 Identities = 28/98 (28%), Positives = 49/98 (50%), Gaps = 7/98 (7%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
             +  T T+   S + T   P+S + T ++L   + + +S   E+T+    ++ + T    
Sbjct: 758  PASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT---- 813

Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
                TTTSSP    TT+ +  S STTT++  S++TT  
Sbjct: 814  ---QTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGG 848



 Score = 53.7 bits (129), Expect = 1e-06
 Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 3/89 (3%)

Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE---STTTSS 1949
              T T+   S + T   P+S + T ++    + + +S++ E+T+    ++     TTTSS
Sbjct: 760  SGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSS 819

Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTS 1978
            P    TT+ +  S STTT++  S++TT  
Sbjct: 820  PSPTQTTSPTQTSTSTTTTTSPSQTTTGG 848



 Score = 53.3 bits (128), Expect = 2e-06
 Identities = 31/143 (21%), Positives = 59/143 (41%), Gaps = 8/143 (5%)

Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
            S+   +P++     +  V  S     P  E+                 +  T TS   S 
Sbjct: 712  SSEAKNPDTVKIGQALTVYGSLEVF-PAGENWGFIPTTKRVKVRIMDPASGTGTSITTSG 770

Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
            + T   P+S  T T + +S S  +++     TT++      T T +  S  + T     +
Sbjct: 771  TFTAEVPQSP-TKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPT-----Q 824

Query: 2034 STTTSSPASESTTTNNPKSESTT 2056
            +T+ +  ++ +TTT +P S++TT
Sbjct: 825  TTSPTQTSTSTTTTTSP-SQTTT 846



 Score = 51.8 bits (124), Expect = 5e-06
 Identities = 32/145 (22%), Positives = 56/145 (38%), Gaps = 8/145 (5%)

Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
            S+   +P++     +  V  S     P  E+   I             +  T T    S 
Sbjct: 712  SSEAKNPDTVKIGQALTVYGSLEVF-PAGENWGFIPTTKRVKVRIMDPASGTGTSITTSG 770

Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
            + T   P S + T       S  +N       TS       T T +     T++ SP ++
Sbjct: 771  TFTAEVPQSPTKTETTLSY-SAYSNTSILIETTSVVITKTVTQTQTT----TSSPSP-TQ 824

Query: 2094 STTTSSPASESTTTSSPESESTTTS 2118
            +T+ +  ++ +TTT+SP S++TT  
Sbjct: 825  TTSPTQTSTSTTTTTSP-SQTTTGG 848



 Score = 50.2 bits (120), Expect = 1e-05
 Identities = 33/159 (20%), Positives = 58/159 (36%), Gaps = 10/159 (6%)

Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
            P + S   ++  S        V      +   S     +        T+  V        
Sbjct: 700  PYTNSLYKATTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRV---KVRIM 756

Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
              +  T T    S + T   P S + T  +    + + +S   E+T+    K+ + T   
Sbjct: 757  DPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT--- 813

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
                  T+SSP+   TT+ +  S STTT++  S++TT  
Sbjct: 814  ----QTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGG 848



 Score = 47.2 bits (112), Expect = 1e-04
 Identities = 36/151 (23%), Positives = 60/151 (39%), Gaps = 18/151 (11%)

Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT-------ISPVSESTTTS 2018
            TT S E+++  T       T   SL       +        T       I   +  T TS
Sbjct: 709  TTLSSEAKNPDTVKIGQALTVYGSL---EVFPAGENWGFIPTTKRVKVRIMDPASGTGTS 765

Query: 2019 SPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS 2078
               S + T   P+S  T T +  S S  +N      TT+         + +     T++ 
Sbjct: 766  ITTSGTFTAEVPQSP-TKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTT----TSSP 820

Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSS 2109
            SP +++T+ +  ++ +TTT+SP+   TTT  
Sbjct: 821  SP-TQTTSPTQTSTSTTTTTSPS--QTTTGG 848



 Score = 41.8 bits (98), Expect = 0.005
 Identities = 29/135 (21%), Positives = 50/135 (37%), Gaps = 15/135 (11%)

Query: 2006 TTISPVSESTTTSSPVSESTTTIS---PESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
            TT+S  +++  T       T   S     +       P ++          S T      
Sbjct: 709  TTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGT---- 764

Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE---STTTSS 2119
             SIT+S       T   P S + T ++ +  + + +S   E+T+    ++     TTTSS
Sbjct: 765  -SITTSGT----FTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSS 819

Query: 2120 PASESTTIEEQGVSP 2134
            P+   TT   Q  + 
Sbjct: 820  PSPTQTTSPTQTSTS 834



 Score = 41.4 bits (97), Expect = 0.008
 Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 7/77 (9%)

Query: 1873 TTNNNSEST-VVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
             +   +E+T    +  N+ +   TT+       T T       TTTSSP    TT+ +  
Sbjct: 778  QSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT------QTTTSSPSPTQTTSPTQT 831

Query: 1932 SESTTTSSPESESTTTS 1948
            S STTT++  S++TT  
Sbjct: 832  STSTTTTTSPSQTTTGG 848



 Score = 41.0 bits (96), Expect = 0.009
 Identities = 18/74 (24%), Positives = 34/74 (45%), Gaps = 6/74 (8%)

Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
            + LS +  +N+     TT+       T T +  S  + T      +T+ +   + +TTT+
Sbjct: 785  TTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQ-----TTSPTQTSTSTTTTT 839

Query: 1949 SPESESTTTSSLVS 1962
            SP S++TT   +  
Sbjct: 840  SP-SQTTTGGGICG 852


>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210).  This is a
            family of proteins conserved in yeasts. The function is
            not known. The Schizosaccharomyces pombe member is
            SPBC18E5.07 and the Saccharomyces cerevisiae member is
            AIM21.
          Length = 671

 Score = 58.0 bits (140), Expect = 6e-08
 Identities = 50/254 (19%), Positives = 83/254 (32%), Gaps = 21/254 (8%)

Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTS--------------SPESESTTTSSL 1930
            S L+S   ++++ +SP  ES        E    S              SP  E   +   
Sbjct: 284  SRLSSPAPDSSSFSSPSGESGLEEREAEEPILASDEVAKEPAGESPAVSPSFEREKSEKS 343

Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSLVS-ESTTTSSPESESTTTSSPESESTTTSS 1989
              ES   S   S+  +      +    + L   E      PE ES     P +E ++   
Sbjct: 344  RHESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYEPLFPEDESEIAVKPPTEESSRRP 403

Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSP-VSESTTTISPESESTTTSSPASESTTTN 2048
               +    S    E +   S + ++ T S+P       + +PE E++ +SS  S     +
Sbjct: 404  EEEKHRFPSEDVWEDSP--SSLQDTATVSTPSNPPPRASETPEQETSRSSSEVSLDPHQS 461

Query: 2049 NPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS 2108
              KSE        S+    S    E    S       TT     E  ++S   ++    S
Sbjct: 462  ELKSEKKKARPEVSKQRFPSRDVWEDAPESQELV---TTEETPEEVKSSSPGVTKPAIPS 518

Query: 2109 SPESESTTTSSPAS 2122
             P+    T+     
Sbjct: 519  RPKKGKPTSEKRKP 532



 Score = 49.9 bits (119), Expect = 2e-05
 Identities = 48/245 (19%), Positives = 77/245 (31%), Gaps = 29/245 (11%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPES-ESTTTSSPESESTTT 1957
            SP  E   +     ES   S   S+  +    V +    +  E  E      PE ES   
Sbjct: 332  SPSFEREKSEKSRHESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYEPLFPEDESEIA 391

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE--STTTSSPESESTTTISPVSEST 2015
                +E ++    E +    S    E + +S   +   ST ++ P   S T   P  E++
Sbjct: 392  VKPPTEESSRRPEEEKHRFPSEDVWEDSPSSLQDTATVSTPSNPPPRASET---PEQETS 448

Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
             +SS VS        +SE  +    A    +     S     + P S+ + ++    E  
Sbjct: 449  RSSSEVSLD----PHQSELKSEKKKARPEVSKQRFPSRDVWEDAPESQELVTTEETPEEV 504

Query: 2076 TTSSPASESTTT-SSPASESTTTSS-----------------PASESTTTSSPES-ESTT 2116
             +SSP        S P     T+                   PA      +  E+  S  
Sbjct: 505  KSSSPGVTKPAIPSRPKKGKPTSEKRKPPPVPKKPKPQIPARPAKLQKQQAGEEANSSAF 564

Query: 2117 TSSPA 2121
               P 
Sbjct: 565  KPKPR 569



 Score = 49.9 bits (119), Expect = 2e-05
 Identities = 58/267 (21%), Positives = 88/267 (32%), Gaps = 42/267 (15%)

Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
            N    E+ +T S    S        E    ++PE  ++  SSP  +S++ SS   ES   
Sbjct: 247  NKIVRETASTGSGLGTSPEVDGTPEEQVGYTAPEEYTSRLSSPAPDSSSFSSPSGESGLE 306

Query: 1968 SSPESESTTTS---SPESES-------TTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
                 E    S   + E          +       +S   S P+S   +  + +  S   
Sbjct: 307  EREAEEPILASDEVAKEPAGESPAVSPSFEREKSEKSRHESDPKSRENSKPASIYGSVPD 366

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA-----SESITSSSPAS 2072
                   T     E          SE      P  ES+           SE +   SP  
Sbjct: 367  LI---RHTPLEDVEEYEPLFPEDESEIAV-KPPTEESSRRPEEEKHRFPSEDVWEDSP-- 420

Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV 2132
                  S   ++ T S+P++       P   S T   PE E++ +SS  S      E   
Sbjct: 421  ------SSLQDTATVSTPSNP------PPRASET---PEQETSRSSSEVSLDPHQSEL-- 463

Query: 2133 SPHSEKLSANEDP--EEFPNEDVFEHT 2157
               SEK  A  +   + FP+ DV+E  
Sbjct: 464  --KSEKKKARPEVSKQRFPSRDVWEDA 488



 Score = 49.5 bits (118), Expect = 2e-05
 Identities = 47/200 (23%), Positives = 66/200 (33%), Gaps = 27/200 (13%)

Query: 1892 SENTTTNSPESESTTTNNPE-----SESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1946
                    P  ES+     E     SE     SP S   T +     ST ++ P   S T
Sbjct: 387  ESEIAVKPPTEESSRRPEEEKHRFPSEDVWEDSPSSLQDTATV----STPSNPPPRASET 442

Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST- 2005
               PE E++ +SS VS     S  +SE        S+    S  V E     +PES+   
Sbjct: 443  ---PEQETSRSSSEVSLDPHQSELKSEKKKARPEVSKQRFPSRDVWE----DAPESQELV 495

Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
            TT     E  ++S  V++      P+    T+            PK +      PA  + 
Sbjct: 496  TTEETPEEVKSSSPGVTKPAIPSRPKKGKPTSEKRKP-PPVPKKPKPQI-----PARPAK 549

Query: 2066 TSSSPASE----STTTSSPA 2081
                 A E    S     P 
Sbjct: 550  LQKQQAGEEANSSAFKPKPR 569



 Score = 38.8 bits (90), Expect = 0.042
 Identities = 60/328 (18%), Positives = 106/328 (32%), Gaps = 27/328 (8%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSL----VSE 1933
            +E   V     S+   +      E E+             S+P         L       
Sbjct: 28   NEPPDVPQRPPSVTLPSLGEEGAEYEALEEAELSDSHH--STPAQTRNVGEDLKLHAPKP 85

Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS--SPESESTTTSSLV 1991
            S  +SS +++    +  +S+    + L   S+    P  +ST+ S  S  S S+  SS  
Sbjct: 86   SLPSSSAKAKVQAVTRTDSQQAAAAGLGRPSSPEQRPVRKSTSRSLHSVASASSQDSSAS 145

Query: 1992 SESTTTSS--------PESESTTTISPVSESTTTSSPVSESTTTISPES------ESTTT 2037
            S    TSS        PE      + P +      SP   +  ++ P S           
Sbjct: 146  STLRPTSSAVDDEHGIPEIGQRVPMYPNAGDVQAPSPAPYA-NSLPPGSYGLHGHGVFPQ 204

Query: 2038 SSPASEST--TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
                          PK E      PA  +      A  S   +    E+ +T S    S 
Sbjct: 205  EKFEKAWYEKHPEEPKKEEQGEYGPAVGTERPIDWALSSDDLNKIVRETASTGSGLGTSP 264

Query: 2096 TTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE 2155
                   E    ++PE  ++  SSPA +S++          E+  A E      +++V +
Sbjct: 265  EVDGTPEEQVGYTAPEEYTSRLSSPAPDSSSFSSPSGESGLEEREAEEP--ILASDEVAK 322

Query: 2156 HTFAEIPNIDHSNQTDEAIPETFDAREE 2183
                E P +  S + +++     ++  +
Sbjct: 323  EPAGESPAVSPSFEREKSEKSRHESDPK 350



 Score = 36.8 bits (85), Expect = 0.19
 Identities = 39/183 (21%), Positives = 68/183 (37%), Gaps = 17/183 (9%)

Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
            E+ +T S +  S        E     +PE  ++  SSPA +S++ ++P  ES      A 
Sbjct: 252  ETASTGSGLGTSPEVDGTPEEQVGYTAPEEYTSRLSSPAPDSSSFSSPSGESGLEEREAE 311

Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            E I +S   ++     SPA       SP+ E   +     ES     P+S   +  +   
Sbjct: 312  EPILASDEVAKEPAGESPAV------SPSFEREKSEKSRHESD----PKSRENSKPASIY 361

Query: 2123 ESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDARE 2182
             S     +    H+      E    FP ++       + P  + S++  E     F + +
Sbjct: 362  GSVPDLIR----HTPLEDVEEYEPLFPEDE--SEIAVKPPT-EESSRRPEEEKHRFPSED 414

Query: 2183 EWP 2185
             W 
Sbjct: 415  VWE 417



 Score = 32.6 bits (74), Expect = 3.4
 Identities = 33/184 (17%), Positives = 61/184 (33%), Gaps = 16/184 (8%)

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
            P      + SP  +S   S L         P S +         S        E+     
Sbjct: 7    PRRRGDRSVSPNPDSFAPSPLNEPPDVPQRPPSVTL-------PSLGEEGAEYEALEEAE 59

Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT--T 2087
                    S+PA       + K  +   + P+S +   +   + + T S  A+ +     
Sbjct: 60   LSDSHH--STPAQTRNVGEDLKLHAPKPSLPSSSAK--AKVQAVTRTDSQQAAAAGLGRP 115

Query: 2088 SSPASESTTTSSPAS--ESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
            SSP       S+  S     + SS +S +++T  P S S   +E G+    +++    + 
Sbjct: 116  SSPEQRPVRKSTSRSLHSVASASSQDSSASSTLRPTS-SAVDDEHGIPEIGQRVPMYPNA 174

Query: 2146 EEFP 2149
             +  
Sbjct: 175  GDVQ 178


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 50.4 bits (121), Expect = 7e-08
 Identities = 15/40 (37%), Positives = 28/40 (70%)

Query: 520 TNSVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWD 559
           +  +++ L+GH+  VT ++ SPDG Y+ S + D T+++WD
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 44.6 bits (106), Expect = 6e-06
 Identities = 13/37 (35%), Positives = 20/37 (54%)

Query: 354 PIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWS 390
            +  L+GH G +    + PDG+Y+AS   D  I +W 
Sbjct: 4   LLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 44.6 bits (106), Expect = 7e-06
 Identities = 15/38 (39%), Positives = 23/38 (60%)

Query: 396 ENIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWD 433
           E +  + GHTG V  + FS DG ++ + S D T+ +WD
Sbjct: 3   ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 43.8 bits (104), Expect = 1e-05
 Identities = 15/39 (38%), Positives = 23/39 (58%)

Query: 1071 TFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1109
            + +   +L GH  PV S+  S D   +A+GS D T+K+W
Sbjct: 1    SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39



 Score = 43.8 bits (104), Expect = 1e-05
 Identities = 15/39 (38%), Positives = 23/39 (58%)

Query: 1345 TFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1383
            + +   +L GH  PV S+  S D   +A+GS D T+K+W
Sbjct: 1    SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39



 Score = 43.8 bits (104), Expect = 1e-05
 Identities = 15/40 (37%), Positives = 23/40 (57%)

Query: 106 TTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWD 145
           + +++ T  GH   +T + + P G  LA+GS D  I LWD
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 15/33 (45%), Positives = 21/33 (63%)

Query: 1167 SLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1199
            +L GH  PV S+  S D   +A+GS D T+K+W
Sbjct: 7    TLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39



 Score = 41.9 bits (99), Expect = 5e-05
 Identities = 16/39 (41%), Positives = 20/39 (51%)

Query: 1114 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1152
            G+  K+L  H   VT V F P   Y  + S DG +K WD
Sbjct: 2    GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 41.9 bits (99), Expect = 5e-05
 Identities = 16/39 (41%), Positives = 20/39 (51%)

Query: 1204 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1242
            G+  K+L  H   VT V F P   Y  + S DG +K WD
Sbjct: 2    GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 41.9 bits (99), Expect = 5e-05
 Identities = 16/39 (41%), Positives = 20/39 (51%)

Query: 1388 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1426
            G+  K+L  H   VT V F P   Y  + S DG +K WD
Sbjct: 2    GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 40.8 bits (96), Expect = 1e-04
 Identities = 14/38 (36%), Positives = 17/38 (44%)

Query: 612 TRRIAYKLPGHNGSVNDVQFHPKEPIIMSASSDKTIYL 649
           +  +   L GH G V  V F P    + S S D TI L
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKL 38



 Score = 40.4 bits (95), Expect = 2e-04
 Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 436 KGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWD 476
            G+ +K +KGH+  V S      G+ L ASGSDD T+K+WD
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYL-ASGSDDGTIKLWD 40



 Score = 37.3 bits (87), Expect = 0.002
 Identities = 14/40 (35%), Positives = 22/40 (55%)

Query: 192 TGDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKVWN 231
           +G+  KT+  H   V  V    +  YL SGS+D  +K+W+
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 36.5 bits (85), Expect = 0.005
 Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 2/38 (5%)

Query: 152 LHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWD 189
           L  L GH G +T + F   P   ++ S + D  +K+WD
Sbjct: 5   LKTLKGHTGPVTSVAFS--PDGKYLASGSDDGTIKLWD 40



 Score = 33.8 bits (78), Expect = 0.038
 Identities = 12/42 (28%), Positives = 20/42 (47%), Gaps = 4/42 (9%)

Query: 568 RCVKVMSGHQHNFEKNLLRCAWSVSGLYVTAGSADKCVYIWD 609
             +K + GH       +   A+S  G Y+ +GS D  + +WD
Sbjct: 3   ELLKTLKGHTGP----VTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 33.8 bits (78), Expect = 0.049
 Identities = 13/29 (44%), Positives = 18/29 (62%)

Query: 489 TYQVTSVAFNDTAECVLTGGIDNDIKMWD 517
           T  VTSVAF+   + + +G  D  IK+WD
Sbjct: 12  TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 33.1 bits (76), Expect = 0.083
 Identities = 11/32 (34%), Positives = 17/32 (53%), Gaps = 1/32 (3%)

Query: 882 QGHHSEVRALAFSSDNLALVSACA-SQVKIWN 912
           +GH   V ++AFS D   L S      +K+W+
Sbjct: 9   KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 29.2 bits (66), Expect = 2.0
 Identities = 14/48 (29%), Positives = 23/48 (47%), Gaps = 10/48 (20%)

Query: 957  GEILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTGGGDKSVKLW 1004
            GE+L+ +  H+  + SVA  PD             + +G  D ++KLW
Sbjct: 2    GELLKTLKGHTGPVTSVAFSPDGK----------YLASGSDDGTIKLW 39


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 50.0 bits (120), Expect = 7e-08
 Identities = 16/38 (42%), Positives = 27/38 (71%)

Query: 522 SVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWD 559
            +++ L+GH+  VT ++ SPDG+ + S + D TVR+WD
Sbjct: 2   KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 45.8 bits (109), Expect = 2e-06
 Identities = 15/39 (38%), Positives = 22/39 (56%)

Query: 395 CENIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWD 433
            + +  + GHTG V  + FS DG  + + S D T+ VWD
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 44.7 bits (106), Expect = 6e-06
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 1077 SLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1109
            +L GH  PV S+  S D  L+A+GS D TV+VW
Sbjct: 6    TLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 44.7 bits (106), Expect = 6e-06
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 1167 SLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1199
            +L GH  PV S+  S D  L+A+GS D TV+VW
Sbjct: 6    TLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 44.7 bits (106), Expect = 6e-06
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 1351 SLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1383
            +L GH  PV S+  S D  L+A+GS D TV+VW
Sbjct: 6    TLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 43.5 bits (103), Expect = 2e-05
 Identities = 13/37 (35%), Positives = 22/37 (59%)

Query: 109 VISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWD 145
           ++ T  GH   +T + + P G+ LA+GS D  + +WD
Sbjct: 3   LLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 14/39 (35%), Positives = 19/39 (48%)

Query: 1114 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1152
            G   ++L  H   VT V F P  +   + S DG V+ WD
Sbjct: 1    GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 14/39 (35%), Positives = 19/39 (48%)

Query: 1204 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1242
            G   ++L  H   VT V F P  +   + S DG V+ WD
Sbjct: 1    GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 14/39 (35%), Positives = 19/39 (48%)

Query: 1388 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1426
            G   ++L  H   VT V F P  +   + S DG V+ WD
Sbjct: 1    GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 42.7 bits (101), Expect = 3e-05
 Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 354 PIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWS 390
            +  L+GH G +    + PDG  +AS   D  + +W 
Sbjct: 3   LLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 42.3 bits (100), Expect = 4e-05
 Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 1/40 (2%)

Query: 437 GQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWD 476
           G+ ++ +KGH+  V S      G LL ASGSDD TV+VWD
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLL-ASGSDDGTVRVWD 39



 Score = 41.2 bits (97), Expect = 9e-05
 Identities = 12/37 (32%), Positives = 18/37 (48%)

Query: 613 RRIAYKLPGHNGSVNDVQFHPKEPIIMSASSDKTIYL 649
            ++   L GH G V  V F P   ++ S S D T+ +
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRV 37



 Score = 36.2 bits (84), Expect = 0.006
 Identities = 12/39 (30%), Positives = 20/39 (51%)

Query: 193 GDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKVWN 231
           G   +T+  H   V  V    + + L SGS+D  ++VW+
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 36.2 bits (84), Expect = 0.007
 Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 2/38 (5%)

Query: 152 LHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWD 189
           L  L GH G +T + F   P  + + S + D  V++WD
Sbjct: 4   LRTLKGHTGPVTSVAFS--PDGNLLASGSDDGTVRVWD 39



 Score = 34.6 bits (80), Expect = 0.023
 Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 4/43 (9%)

Query: 567 ERCVKVMSGHQHNFEKNLLRCAWSVSGLYVTAGSADKCVYIWD 609
            + ++ + GH       +   A+S  G  + +GS D  V +WD
Sbjct: 1   GKLLRTLKGH----TGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 33.9 bits (78), Expect = 0.035
 Identities = 11/29 (37%), Positives = 17/29 (58%)

Query: 489 TYQVTSVAFNDTAECVLTGGIDNDIKMWD 517
           T  VTSVAF+     + +G  D  +++WD
Sbjct: 11  TGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 33.5 bits (77), Expect = 0.049
 Identities = 11/32 (34%), Positives = 17/32 (53%), Gaps = 1/32 (3%)

Query: 882 QGHHSEVRALAFSSDNLALVSACA-SQVKIWN 912
           +GH   V ++AFS D   L S      V++W+
Sbjct: 8   KGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 30.4 bits (69), Expect = 0.65
 Identities = 12/48 (25%), Positives = 22/48 (45%), Gaps = 10/48 (20%)

Query: 957  GEILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTGGGDKSVKLW 1004
            G++L  +  H+  + SVA  PD             + +G  D +V++W
Sbjct: 1    GKLLRTLKGHTGPVTSVAFSPDGN----------LLASGSDDGTVRVW 38



 Score = 28.1 bits (63), Expect = 5.1
 Identities = 10/28 (35%), Positives = 13/28 (46%)

Query: 1040 EEQVLCARVSPDSKLLAVSLLDTTVKIF 1067
               V     SPD  LLA    D TV+++
Sbjct: 11   TGPVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 28.1 bits (63), Expect = 5.1
 Identities = 10/28 (35%), Positives = 13/28 (46%)

Query: 1314 EEQVLCARVSPDSKLLAVSLLDTTVKIF 1341
               V     SPD  LLA    D TV+++
Sbjct: 11   TGPVTSVAFSPDGNLLASGSDDGTVRVW 38


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
            Fas-binding protein) is thought to play a role in
            apoptosis, but precise role played by Daxx remains to be
            determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 57.2 bits (138), Expect = 1e-07
 Identities = 58/272 (21%), Positives = 76/272 (27%), Gaps = 26/272 (9%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
            S E ES      E E       ESE         E    +   SE     S E +     
Sbjct: 439  SEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEE 498

Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT-- 2016
                     S     S      E +    SS+  ES      + ES    S   ES    
Sbjct: 499  PEEDAERRNSEMAGISRM---SEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEEL 555

Query: 2017 -TSSPVSESTTTISPESESTTTS-------SPASESTTTNNPKSESTTTN---NPASESI 2065
                    S T +   +    T         P   ST+  N  +  T+T    N +  + 
Sbjct: 556  LAEESPLSSHTELEGVATPVETKISSSRKLPPPPVSTSLENDSATVTSTTRNGNVSPHTP 615

Query: 2066 TSSSPASESTTTSSPASESTTTSS----PASESTTTSSPASESTTTSSP--ESESTTTSS 2119
                P S          ES    +      + S     PA     TS       ST   +
Sbjct: 616  QDEQPPSGRKRKRKEEVESEPLGNQYLRHHNGSEKDGLPAPMDPVTSCSPVADSSTRVDT 675

Query: 2120 PASESTTIEEQ--GVSPHSEKLSANE--DPEE 2147
            P+ E  T   Q  G  P   K++     DPEE
Sbjct: 676  PSHELVTSSPQTPGDPPKKNKVNVATQCDPEE 707



 Score = 54.5 bits (131), Expect = 7e-07
 Identities = 45/249 (18%), Positives = 75/249 (30%), Gaps = 20/249 (8%)

Query: 1892 SENTTTNSPESESTTTNNPE---SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
            + + +++  ++ ST+  +P     ES    S E E         E       ESE     
Sbjct: 414  TSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQ-----ESEEEEGE 468

Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
              E E    +   SE     S E +       E          SE    S          
Sbjct: 469  DEEEEEEVEADNGSEEEMEGSSEGD----GDGEEPEEDAERRNSEMAGISRMSEGQQPRG 524

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
            S V   +    P+   +       E +     A ES  +++ + E   T      S +  
Sbjct: 525  SSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVETKISSSRK 584

Query: 2069 -SPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
              P   ST+  + ++  T+T+   + S  T            P S          ES  +
Sbjct: 585  LPPPPVSTSLENDSATVTSTTRNGNVSPHTPQD-------EQPPSGRKRKRKEEVESEPL 637

Query: 2128 EEQGVSPHS 2136
              Q +  H+
Sbjct: 638  GNQYLRHHN 646



 Score = 51.8 bits (124), Expect = 5e-06
 Identities = 51/266 (19%), Positives = 77/266 (28%), Gaps = 25/266 (9%)

Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1961
            + S +++  ++ ST+  SP   S        ES    S E E       E E        
Sbjct: 414  TSSRSSDPSKASSTSGESPSMAS-------QESEEEESVEEEEE-----EEEEEEEEEQE 461

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
            SE       E E    +   SE     S    S      E          SE    S   
Sbjct: 462  SEEEEGEDEEEEEEVEADNGSEEEMEGS----SEGDGDGEEPEEDAERRNSEMAGISRMS 517

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
                   S     +    P    +       E +     A ES  SS    E   T    
Sbjct: 518  EGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVET 577

Query: 2082 SESTTTS-SPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLS 2140
              S++    P   ST+  + ++  T+T+   + S          T  +EQ  S    K  
Sbjct: 578  KISSSRKLPPPPVSTSLENDSATVTSTTRNGNVSP--------HTPQDEQPPSGRKRKRK 629

Query: 2141 ANEDPEEFPNEDVFEHTFAEIPNIDH 2166
               + E   N+ +  H  +E   +  
Sbjct: 630  EEVESEPLGNQYLRHHNGSEKDGLPA 655



 Score = 48.8 bits (116), Expect = 4e-05
 Identities = 49/249 (19%), Positives = 73/249 (29%), Gaps = 18/249 (7%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
            SE     S E +     + E          SE    S +        S     +    P 
Sbjct: 482  SEEEMEGSSEGDG----DGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPL 537

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
               +  +  V E +       ES  +S  E E   T      S++   P         PV
Sbjct: 538  QPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVETKISSSRKLPP-------PPV 590

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN-PASESITSSSP 2070
            S S    S    STT     + + +  +P  E   +   +       + P          
Sbjct: 591  STSLENDSATVTSTT----RNGNVSPHTPQDEQPPSGRKRKRKEEVESEPLGNQYLRHHN 646

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS--SPESESTTTSSPASESTTIE 2128
             SE     +P    T+ S  A  ST   +P+ E  T+S  +P           +     E
Sbjct: 647  GSEKDGLPAPMDPVTSCSPVADSSTRVDTPSHELVTSSPQTPGDPPKKNKVNVATQCDPE 706

Query: 2129 EQGVSPHSE 2137
            E  V   SE
Sbjct: 707  EVIVLSDSE 715



 Score = 44.5 bits (105), Expect = 7e-04
 Identities = 38/211 (18%), Positives = 65/211 (30%), Gaps = 1/211 (0%)

Query: 1859 VAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS 1918
               S +   S +         S    S     L   +       E +       ES  +S
Sbjct: 505  RRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSS 564

Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
              E E   T      S++   P     +TS     +T TS+  + + +  +P+ E   + 
Sbjct: 565  HTELEGVATPVETKISSSRKLPPP-PVSTSLENDSATVTSTTRNGNVSPHTPQDEQPPSG 623

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
                      S    +             +    +  T+ SPV++S+T +   S    TS
Sbjct: 624  RKRKRKEEVESEPLGNQYLRHHNGSEKDGLPAPMDPVTSCSPVADSSTRVDTPSHELVTS 683

Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSS 2069
            SP +        K    T  +P    + S S
Sbjct: 684  SPQTPGDPPKKNKVNVATQCDPEEVIVLSDS 714



 Score = 33.0 bits (75), Expect = 2.6
 Identities = 27/128 (21%), Positives = 42/128 (32%), Gaps = 9/128 (7%)

Query: 2061 ASESITSSSPASESTTTSSP--ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
             + S +S    + ST+  SP  AS+ +       E         E    S  E       
Sbjct: 413  GTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472

Query: 2119 SPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETF 2178
                E+    E+ +   SE     E+PEE       E   +E+  I   ++     P   
Sbjct: 473  EEEVEADNGSEEEMEGSSEGDGDGEEPEED-----AERRNSEMAGISRMSEGQ--QPRGS 525

Query: 2179 DAREEWPQ 2186
              + E PQ
Sbjct: 526  SVQPESPQ 533


>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
          Length = 1560

 Score = 56.6 bits (136), Expect = 2e-07
 Identities = 42/236 (17%), Positives = 92/236 (38%), Gaps = 14/236 (5%)

Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
            P++       P S ++   +  S +  +++ +S +  +++  S    ++   + +  +++
Sbjct: 373  PDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNT 432

Query: 2000 PESESTTTISPVSESTTTSSPVSE---STTTISPESESTTTSSPASESTTTNNPKSESTT 2056
            P +    + +P S    ++ P S    S T  S    S    S A +  +  +   +   
Sbjct: 433  PYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRA 492

Query: 2057 TNNPASESITSSSPASE-------STTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
             N PA+   T++ PA+        ++  +  AS    ++     + TT+ P   +     
Sbjct: 493  ANQPAANLPTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKREDH 552

Query: 2110 PESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNID 2165
            PE  +       S+  ++E    S  SE  S NE+      E+++      I  ID
Sbjct: 553  PEGGTNRQKYEQSDEESVE----SSSSENSSENENEVTDKGEEIYSLLKKTINRID 604



 Score = 51.6 bits (123), Expect = 6e-06
 Identities = 39/224 (17%), Positives = 84/224 (37%), Gaps = 15/224 (6%)

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
            P++       P S ++   +  S +  +++ +S +  + +  S    ++   + +  + +
Sbjct: 373  PDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNT 432

Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS---PASESTT 2086
            P +    +++P S    +N P S    +N P S +  S++P S +    S    A +   
Sbjct: 433  PYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRA 492

Query: 2087 TSSPASESTTTSSPASE-------STTTSSPESESTTTSSPASESTTIEEQGVSPHSEKL 2139
             + PA+   T + PA+        ++  +   S    ++     + T  +       E  
Sbjct: 493  ANQPAANLPTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKRE-- 550

Query: 2140 SANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
               + PE   N   +E +  E      S  + E   E  D  EE
Sbjct: 551  ---DHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTDKGEE 591



 Score = 39.7 bits (92), Expect = 0.026
 Identities = 32/189 (16%), Positives = 70/189 (37%), Gaps = 17/189 (8%)

Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
            +N ++S    S  N+  S    +N   S     N P S +   + P S +  ++   S  
Sbjct: 395  SNAAQSNAAQS--NAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNP 452

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSEST--------TTSSPESESTTTSSPESE--- 1983
              ++ P S +  +++P S +  +S+    S           + P +   T + P +    
Sbjct: 453  PYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPTANQPAANNFH 512

Query: 1984 ----STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
                ++  +   S    ++     + TT  P   +     P   +      +S+  +  S
Sbjct: 513  GAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKREDHPEGGTNRQKYEQSDEESVES 572

Query: 2040 PASESTTTN 2048
             +SE+++ N
Sbjct: 573  SSSENSSEN 581



 Score = 38.9 bits (90), Expect = 0.049
 Identities = 28/196 (14%), Positives = 78/196 (39%), Gaps = 9/196 (4%)

Query: 1888 NSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1947
            N+  +    +N+ +S +  +N   S +  ++   S     ++  S +   + P S +  +
Sbjct: 386  NASYNCAAYSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYS 445

Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
            + P S    ++   S +  +++P S +  +S+ +  S   ++   +    + P +   T 
Sbjct: 446  NPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAY--QHRAANQPAANLPTA 503

Query: 2008 ISPVSE-------STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
              P +        ++  +   S    +      + TT+ P   +   ++P+  +      
Sbjct: 504  NQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKREDHPEGGTNRQKYE 563

Query: 2061 ASESITSSSPASESTT 2076
             S+  +  S +SE+++
Sbjct: 564  QSDEESVESSSSENSS 579



 Score = 34.3 bits (78), Expect = 1.0
 Identities = 40/281 (14%), Positives = 104/281 (37%), Gaps = 22/281 (7%)

Query: 1894 NTTTNSPESESTT-TNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP-E 1951
            N+ T+ P +E+   T +  ++    +S  +ES       S   + +  ++     ++P  
Sbjct: 244  NSATSPPANENNAVTLSCSNDQQRGASSAAESGYAHHRGSNIASHTPNDNIMHAANNPLN 303

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
            + +    + +       +P  +++     E      +  +       SP + S       
Sbjct: 304  NTNDAQRNAIQGDLVRGAPNDKNSFDRGNEK-----TYQIYGGFHDGSPNAASAGAPFNG 358

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
              +      +++    + P++       P S ++      S +  +N   S +  S++  
Sbjct: 359  LGNQADGGHINQ----VHPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGY 414

Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
            S    ++   + +  +++P +    +++P S    ++ P S    +++P S +       
Sbjct: 415  SNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN--- 471

Query: 2132 VSPHSEKLSANEDPEEFPNEDVFEHTFAEIP--NIDHSNQT 2170
             +P S   SA +    +     ++H  A  P  N+  +NQ 
Sbjct: 472  -APPS---SAKDHHSAY--HAAYQHRAANQPAANLPTANQP 506


>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle. 
          Length = 519

 Score = 55.7 bits (133), Expect = 2e-07
 Identities = 46/244 (18%), Positives = 87/244 (35%), Gaps = 9/244 (3%)

Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV-SESTTTSSPESESTTTSSPE 1951
            E    +S E   +    P + + TTS  +++    +S V   ST T  P  +      P+
Sbjct: 87   EEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKNSQVRPASTVTPGPSGKGANLPCPQ 146

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
               +    +  +  + SS E ES +          +S  + ++   S P         PV
Sbjct: 147  KAGSAAVQVGKQEDSESSSEEESDSDGPGAPAQAKSSGKLLQARPASGPAKGPPQKAGPV 206

Query: 2012 SESTTTSSPV--SESTTTIS-PESESTTTSSPASESTTTNNPKSEST----TTNNPASES 2064
            +           SES+   S  E E+    + A        P+++++    T   P S  
Sbjct: 207  ATQVKAERGKEDSESSEESSDSEEEAPAAMTAAQAKPALKTPQTKASPRKGTPITPTSAK 266

Query: 2065 ITSSSPASESTTTSSPASESTTTSSPA-SESTTTSSPASESTTTSSPESESTTTSSPASE 2123
            +      + +   +   +     SSPA +  T      S S+  S  E E T  ++   +
Sbjct: 267  VPPVRVGTPAPRKAGAVTSPACASSPALARGTQRPDEDSSSSEESESEEEGTAPATARGQ 326

Query: 2124 STTI 2127
            + ++
Sbjct: 327  AKSV 330



 Score = 49.1 bits (116), Expect = 3e-05
 Identities = 43/243 (17%), Positives = 82/243 (33%), Gaps = 26/243 (10%)

Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
            +      E++S+   S     T T++  + S   + P  ++   S +   ST T  P  +
Sbjct: 82   AAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKN---SQVRPASTVTPGPSGK 138

Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
                  P+   +    +  +  + SS E ES +          +S  + ++     P   
Sbjct: 139  GANLPCPQKAGSAAVQVGKQEDSESSSEEESDSDGPGAPAQAKSSGKLLQARPASGPAKG 198

Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
                + P +        K +S ++   +S+S   +  A  +           T +SP   
Sbjct: 199  PPQKAGPVATQVKAERGKEDSESSEE-SSDSEEEAPAAMTAAQAKPALKTPQTKASPRKG 257

Query: 2094 STTTSSPASES---TTTSSP-----ESESTTTSSPA--------------SESTTIEEQG 2131
            +  T + A        T +P      +     SSPA              SE +  EE+G
Sbjct: 258  TPITPTSAKVPPVRVGTPAPRKAGAVTSPACASSPALARGTQRPDEDSSSSEESESEEEG 317

Query: 2132 VSP 2134
             +P
Sbjct: 318  TAP 320



 Score = 41.8 bits (97), Expect = 0.005
 Identities = 43/256 (16%), Positives = 82/256 (32%), Gaps = 34/256 (13%)

Query: 1900 PESESTTTNNPESESTTT---SSPESESTTTSSLVSESTTTSSPESESTTTS-------- 1948
            P +       PE +S ++   S  E E     +      +  SP+ ++ +          
Sbjct: 10   PAATQAKAEKPEEDSESSSEDSDSEEEMPAAKNPPQAKPSGKSPQVKAASAPAKESPQKG 69

Query: 1949 ----SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
                +P       +    E   +S  ES+S   +   +  TT+ +        S     S
Sbjct: 70   APPVTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKNSQVRPAS 129

Query: 2005 TTTISPVSESTTTSSPV--------------SESTTTISPESESTTTSSPASESTTTNNP 2050
            T T  P  +      P               SES++    +S+     + A  S      
Sbjct: 130  TVTPGPSGKGANLPCPQKAGSAAVQVGKQEDSESSSEEESDSDGPGAPAQAKSSGKLLQA 189

Query: 2051 KSESTTTNNPASE----SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE-ST 2105
            +  S     P  +    +    +   +  + SS  S  +   +PA+ +   + PA +   
Sbjct: 190  RPASGPAKGPPQKAGPVATQVKAERGKEDSESSEESSDSEEEAPAAMTAAQAKPALKTPQ 249

Query: 2106 TTSSPESESTTTSSPA 2121
            T +SP   +  T + A
Sbjct: 250  TKASPRKGTPITPTSA 265



 Score = 37.9 bits (87), Expect = 0.063
 Identities = 40/218 (18%), Positives = 75/218 (34%), Gaps = 15/218 (6%)

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE-----STTT 1987
            E + +SS +S+S      E E     +      +  SP+ ++ +  + ES        T 
Sbjct: 22   EDSESSSEDSDS------EEEMPAAKNPPQAKPSGKSPQVKAASAPAKESPQKGAPPVTP 75

Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
                  +      E++S+   S     T T++ ++ S     P  ++   S     ST T
Sbjct: 76   GKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKN---SQVRPASTVT 132

Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
              P  +      P      +     +  + SS   ES +    A     +S    ++   
Sbjct: 133  PGPSGKGANLPCPQKAGSAAVQVGKQEDSESSSEEESDSDGPGAPAQAKSSGKLLQARPA 192

Query: 2108 SSPESESTTTSSPASESTTIEE-QGVSPHSEKLSANED 2144
            S P       + P +     E  +  S  SE+ S +E+
Sbjct: 193  SGPAKGPPQKAGPVATQVKAERGKEDSESSEESSDSEE 230



 Score = 36.8 bits (84), Expect = 0.16
 Identities = 43/239 (17%), Positives = 79/239 (33%), Gaps = 28/239 (11%)

Query: 1911 ESESTTTSSPE-SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV---SESTT 1966
             +     SSP  +  T      S S+  S  E E T  ++   ++ +    +   + S  
Sbjct: 283  VTSPACASSPALARGTQRPDEDSSSSEESESEEEGTAPATARGQAKSVGKGLQVKAASVP 342

Query: 1967 TSSPESESTTTSSPESESTTTSSL---VSESTTTSSPESESTTTISPVSESTT----TSS 2019
            T  P  + T    P       + +   V E + +S  ES+S    +  ++  T      +
Sbjct: 343  TKGPLGQGTAPVPPGKTGPAVAQVKAEVQEDSESSEEESDSEEAAATPAQVKTSVKTPQA 402

Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
              + + T   P       S+P          K  S     P   ++ +S+ +     +  
Sbjct: 403  KANPAPTRAPPAK--GAASAPGKVVAAAAQAKQRSPAKVKPPVRTLQNSTVSVRGQRSVP 460

Query: 2080 PASESTTTSSPA-----------SESTTTSSPASESTTTSSPESEST----TTSSPASE 2123
               ++   ++ A           SES+   S + E T      S  T      S+PA E
Sbjct: 461  AVGKAVAAAAQAQPGPVKGTEEDSESSEEESDSEEETPAQIKPSGKTPQVRAASAPAKE 519



 Score = 33.3 bits (75), Expect = 1.6
 Identities = 48/246 (19%), Positives = 84/246 (34%), Gaps = 26/246 (10%)

Query: 1914 STTTSSPE--SESTTTSSLVSESTTTSSPESESTTTSSPESES-------TTTSSLVSES 1964
              T +SP   +  T TS+ V      +    ++   +SP   S       T      S S
Sbjct: 248  PQTKASPRKGTPITPTSAKVPPVRVGTPAPRKAGAVTSPACASSPALARGTQRPDEDSSS 307

Query: 1965 TTTSSPESESTTTSSP--ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV- 2021
            +  S  E E T  ++   +++S      V  ++  +       T   P  ++    + V 
Sbjct: 308  SEESESEEEGTAPATARGQAKSVGKGLQVKAASVPTKGPLGQGTAPVPPGKTGPAVAQVK 367

Query: 2022 ------SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS----PA 2071
                  SES+   S   E+  T +    S  T   K+    T  P ++   S+      A
Sbjct: 368  AEVQEDSESSEEESDSEEAAATPAQVKTSVKTPQAKANPAPTRAPPAKGAASAPGKVVAA 427

Query: 2072 SESTTTSSPAS----ESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
            +      SPA       T  +S  S     S PA      ++ +++         +S + 
Sbjct: 428  AAQAKQRSPAKVKPPVRTLQNSTVSVRGQRSVPAVGKAVAAAAQAQPGPVKGTEEDSESS 487

Query: 2128 EEQGVS 2133
            EE+  S
Sbjct: 488  EEESDS 493


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
            subunit 62; Provisional.
          Length = 576

 Score = 55.3 bits (133), Expect = 3e-07
 Identities = 54/276 (19%), Positives = 92/276 (33%), Gaps = 38/276 (13%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
            +E+T  ++ +  LL++  +   P  ES   + P+   T   +PE+ S             
Sbjct: 307  AETTAPLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEE-------- 358

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
              P         P S  T    L       +SP     ++S   S+S    +  +E    
Sbjct: 359  -EPPQPKAVVPRPLSPYTAYEDL----KPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVV 413

Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
             SP S S               P  E       E++ T   SP +       P S S T 
Sbjct: 414  PSPGSASNV-------------PEVEPAQV---EAKKTRPLSPYARYEDLKPPTSPSPTA 457

Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP--------ASESTTTSS 2109
                S S++S+S       T+ PA+ +T  ++P   +    SP           S + ++
Sbjct: 458  PTGVSPSVSSTSSVPAVPDTA-PATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAA 516

Query: 2110 PESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
            P  +   +S+             +   E+  A   P
Sbjct: 517  PVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKP 552



 Score = 43.0 bits (101), Expect = 0.002
 Identities = 33/172 (19%), Positives = 53/172 (30%), Gaps = 18/172 (10%)

Query: 1994 STTTSSPESESTTTISPV-SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
            S      ES++     PV ++  T  +P         P         P S  T   + K 
Sbjct: 325  SQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEE--PPQPKAVVPRPLSPYTAYEDLKP 382

Query: 2053 ESTTTNNPASESITSSSP----ASESTTTSSPASESTTT---SSPASESTTTSSPASEST 2105
             ++    P S S  SS      A  +     P+  S +      PA      + P S   
Sbjct: 383  PTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYA 442

Query: 2106 --------TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
                    T+ SP + +  + S +S S+       +P +    A   P    
Sbjct: 443  RYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANM 494


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 54.3 bits (131), Expect = 4e-07
 Identities = 40/215 (18%), Positives = 72/215 (33%), Gaps = 39/215 (18%)

Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
            SP    +  SS  + +     L   S+ +S        +S   + +T+     S    T+
Sbjct: 60   SPTEHESQQSSNNAGAEKNIDLSGSSSLSSG-----NQSSPSAANNTSDGHDASGVKNTA 114

Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTT--------IS-----------------PESE 2033
             P+     +  P+S + T ++P               IS                   + 
Sbjct: 115  PPQ---DISAPPISPTPTQAAPPQTPNGQQRIELPGNISDALSQQQGQVNAASQNAQGNT 171

Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
            ST  ++PA+ + +       +  T+    +   +  PA       +    +T    PA+ 
Sbjct: 172  STLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAV------NHHKTATVAVPPATS 225

Query: 2094 STTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
                S  AS    +S+P S  T   S AS S T+ 
Sbjct: 226  GKPKSGAASARALSSAPASHYTLQLSSASRSDTLN 260



 Score = 48.9 bits (117), Expect = 2e-05
 Identities = 41/207 (19%), Positives = 75/207 (36%), Gaps = 17/207 (8%)

Query: 1897 TNSPESESTTTNNPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
            T     +S+     E     + SS  S    +S   + +T+     S    T+ P+    
Sbjct: 62   TEHESQQSSNNAGAEKNIDLSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPPQ---D 118

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSL------VSESTTTSSPESESTTTIS 2009
             ++  +S + T ++P          E     + +L      V+ ++  +   + +  T +
Sbjct: 119  ISAPPISPTPTQAAPPQTPNGQQRIELPGNISDALSQQQGQVNAASQNAQGNTSTLPT-A 177

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
            P + + +  + V  +  T     +   T  PA       N    +T    PA+     S 
Sbjct: 178  PATVAPSKGAKVPATAETHPTPPQKPATKKPAV------NHHKTATVAVPPATSGKPKSG 231

Query: 2070 PASESTTTSSPASESTTTSSPASESTT 2096
             AS    +S+PAS  T   S AS S T
Sbjct: 232  AASARALSSAPASHYTLQLSSASRSDT 258


>gnl|CDD|115650 pfam07010, Endomucin, Endomucin.  This family consists of several
            mammalian endomucin proteins. Endomucin is an early
            endothelial-specific antigen that is also expressed on
            putative hematopoietic progenitor cells.
          Length = 259

 Score = 53.2 bits (127), Expect = 5e-07
 Identities = 47/180 (26%), Positives = 83/180 (46%), Gaps = 5/180 (2%)

Query: 1890 LLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
             L  N+  NS   +    N+  + STT +S  + +T +   V++ TT + P+  +T+   
Sbjct: 11   FLLSNSLCNSEGVKEAANNSLVTTSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELL 70

Query: 1950 PESESTTTSSLVSE----STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
              S  +T +SL +      TTT+      ++TS     + T S+ VS +  +S  ++E+ 
Sbjct: 71   KTSLMSTATSLTTPKHELKTTTTGVRKNESSTSKVTVTNVTLSNAVS-TLQSSQNKTENQ 129

Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
            ++I     S T+      S       S S TT+   S+S  T + K  ST++  P+  SI
Sbjct: 130  SSIRTTEISPTSVLQPDASPKKTGTTSASLTTAETTSQSQDTEDGKIASTSSTTPSYSSI 189



 Score = 45.9 bits (108), Expect = 1e-04
 Identities = 50/182 (27%), Positives = 86/182 (47%), Gaps = 22/182 (12%)

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVS-----ESTTTSSPESESTTTISPVS 2012
            S+ +  S       + S  T+S    S TT + VS     + TT + P+  +T+ +   S
Sbjct: 14   SNSLCNSEGVKEAANNSLVTTSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELLKTS 73

Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
              +T +S      TT   E ++TTT    +ES+T+    +  T +N  A  ++ SS   +
Sbjct: 74   LMSTATSL-----TTPKHELKTTTTGVRKNESSTSKVTVTNVTLSN--AVSTLQSSQNKT 126

Query: 2073 ES-----TTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE-----STTTSSPAS 2122
            E+     TT  SP S     +SP    TT++S  +  TT+ S ++E     ST++++P+ 
Sbjct: 127  ENQSSIRTTEISPTSVLQPDASPKKTGTTSASLTTAETTSQSQDTEDGKIASTSSTTPSY 186

Query: 2123 ES 2124
             S
Sbjct: 187  SS 188



 Score = 45.1 bits (106), Expect = 2e-04
 Identities = 32/139 (23%), Positives = 59/139 (42%), Gaps = 9/139 (6%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
            TT +    T +MST  SL      T       TTT       ++TS     + T S+ VS
Sbjct: 64   TTTSELLKTSLMSTATSL------TTPKHELKTTTTGVRKNESSTSKVTVTNVTLSNAVS 117

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
               T  S ++++   SS  +   + +S++    +     + S + ++ E+ S +  +   
Sbjct: 118  ---TLQSSQNKTENQSSIRTTEISPTSVLQPDASPKKTGTTSASLTTAETTSQSQDTEDG 174

Query: 1993 ESTTTSSPESESTTTISPV 2011
            +  +TSS     ++ I PV
Sbjct: 175  KIASTSSTTPSYSSIILPV 193



 Score = 45.1 bits (106), Expect = 3e-04
 Identities = 52/193 (26%), Positives = 87/193 (45%), Gaps = 18/193 (9%)

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
            S S   S  V E+   S      T+T+     +  T SL + +  T+    + TTTS   
Sbjct: 14   SNSLCNSEGVKEAANNSLVT---TSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSE-- 68

Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
                  +SL+S +T+ ++P+ E  TT + V ++ +++S V+ +  T+S    +  +S   
Sbjct: 69   ---LLKTSLMSTATSLTTPKHELKTTTTGVRKNESSTSKVTVTNVTLSNAVSTLQSSQNK 125

Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
            +E     N  S  TT  +P S     +SP    TT     S S TT+   S+S  T    
Sbjct: 126  TE-----NQSSIRTTEISPTSVLQPDASPKKTGTT-----SASLTTAETTSQSQDTEDGK 175

Query: 2102 SESTTTSSPESES 2114
              ST++++P   S
Sbjct: 176  IASTSSTTPSYSS 188



 Score = 39.7 bits (92), Expect = 0.012
 Identities = 39/156 (25%), Positives = 79/156 (50%), Gaps = 13/156 (8%)

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
            S   +    +SLV+ STT +S  + +T ++  V++ TT + P   +T+ +   S  +T +
Sbjct: 20   SEGVKEAANNSLVTTSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELLKTSLMSTAT 79

Query: 2039 SPAS-----ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
            S  +     ++TTT   K+ES+T+       +T ++    +  ++  +S++ T +  +S 
Sbjct: 80   SLTTPKHELKTTTTGVRKNESSTSK------VTVTNVTLSNAVSTLQSSQNKTENQ-SSI 132

Query: 2094 STTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
             TT  SP S     +SP  ++ TTS+  + + T  +
Sbjct: 133  RTTEISPTSVLQPDASP-KKTGTTSASLTTAETTSQ 167


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins in
            this family for which functions are known are components
            of a multiprotein complex used for targeting nucleotide
            excision repair to specific parts of the genome. In
            humans, Rad23 complexes with the XPC protein. This family
            is based on the phylogenomic analysis of JA Eisen (1999,
            Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
            replication, recombination, and repair].
          Length = 378

 Score = 53.7 bits (129), Expect = 6e-07
 Identities = 43/180 (23%), Positives = 67/180 (37%), Gaps = 37/180 (20%)

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT-SSPVSESTT 2026
            S P++ +   + P +  T      S  T T SP +   + +S    S     SP  ES T
Sbjct: 75   SKPKTGTGKVAPPAATPT------SAPTPTPSPPASPASGMSAAPASAVEEKSPSEESAT 128

Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTT----------------------NNP--AS 2062
              +PES ST+  S  S++ +T    SE  TT                      NNP  A 
Sbjct: 129  ATAPESPSTSVPSSGSDAASTLVVGSERETTIEEIMEMGYEREEVERALRAAFNNPDRAV 188

Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            E + +  P         P     T +S A  + TT +P   S    + +  +   ++ A+
Sbjct: 189  EYLLTGIPED----PEQPEPVQQTAASTA--AATTETPQHGSVFEQAAQGGTEQPATEAA 242



 Score = 51.8 bits (124), Expect = 3e-06
 Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 7/78 (8%)

Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT-SSPASESTTTS 2108
            PK+ +     PA+      +P S  T T SP +   +  S A  S     SP+ ES T +
Sbjct: 77   PKTGTGKVAPPAA------TPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATAT 130

Query: 2109 SPESESTTTSSPASESTT 2126
            +PES ST+  S  S++ +
Sbjct: 131  APESPSTSVPSSGSDAAS 148



 Score = 49.5 bits (118), Expect = 1e-05
 Identities = 40/177 (22%), Positives = 67/177 (37%), Gaps = 25/177 (14%)

Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
            + ++ T       +T TS+P     T S P S ++  S+  + +    SP  ES T ++P
Sbjct: 76   KPKTGTGKVAPPAATPTSAPTP---TPSPPASPASGMSAAPASAVEEKSPSEESATATAP 132

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST--------------- 2025
            ES ST+  S  S++ +T    SE  TTI  + E       V  +                
Sbjct: 133  ESPSTSVPSSGSDAASTLVVGSERETTIEEIMEMGYEREEVERALRAAFNNPDRAVEYLL 192

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS 2082
            T I  + E        + ST        + TT  P   S+   +    +   ++ A+
Sbjct: 193  TGIPEDPEQPEPVQQTAASTA-------AATTETPQHGSVFEQAAQGGTEQPATEAA 242



 Score = 43.0 bits (101), Expect = 0.002
 Identities = 22/75 (29%), Positives = 35/75 (46%), Gaps = 3/75 (4%)

Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
             ++ T       +T TS+P     T S PAS ++  S+  + +    SP  ES T ++P 
Sbjct: 77   PKTGTGKVAPPAATPTSAPTP---TPSPPASPASGMSAAPASAVEEKSPSEESATATAPE 133

Query: 2122 SESTTIEEQGVSPHS 2136
            S ST++   G    S
Sbjct: 134  SPSTSVPSSGSDAAS 148



 Score = 43.0 bits (101), Expect = 0.002
 Identities = 22/79 (27%), Positives = 34/79 (43%), Gaps = 3/79 (3%)

Query: 1882 VVMSTLNSLLSENTTTN--SPESESTTTNNPESESTTTSSPESESTTT-SSLVSESTTTS 1938
            VVM +     +        +P S  T T +P +   +  S    S     S   ES T +
Sbjct: 71   VVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATAT 130

Query: 1939 SPESESTTTSSPESESTTT 1957
            +PES ST+  S  S++ +T
Sbjct: 131  APESPSTSVPSSGSDAAST 149



 Score = 41.4 bits (97), Expect = 0.005
 Identities = 31/181 (17%), Positives = 63/181 (34%), Gaps = 26/181 (14%)

Query: 1900 PESESTTTNNPES--ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
            P++ +     P +   S  T +P   ++  S + +   +    +     + S ES + T 
Sbjct: 77   PKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEK-----SPSEESATATA 131

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTT--------SSLVSESTTTS--SPESESTTT 2007
                S S  +S  ++ ST     E E+T             V  +   +  +P+      
Sbjct: 132  PESPSTSVPSSGSDAASTLVVGSERETTIEEIMEMGYEREEVERALRAAFNNPDRAVEYL 191

Query: 2008 ISPVSESTTTSSPVSE------STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
            ++ + E      PV +      + TT +P+  S    +    +     P +E+    NP 
Sbjct: 192  LTGIPEDPEQPEPVQQTAASTAAATTETPQHGSVFEQAAQGGTE---QPATEAAQGGNPL 248

Query: 2062 S 2062
             
Sbjct: 249  E 249



 Score = 34.5 bits (79), Expect = 0.80
 Identities = 14/69 (20%), Positives = 24/69 (34%), Gaps = 8/69 (11%)

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSE 2137
            S P + +   + PA+      +P S  T T SP +   +  S A  S   E+        
Sbjct: 75   SKPKTGTGKVAPPAA------TPTSAPTPTPSPPASPASGMSAAPASAVEEKS--PSEES 126

Query: 2138 KLSANEDPE 2146
              +   +  
Sbjct: 127  ATATAPESP 135


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
            family consists of several special lobe-specific silk
            protein SSP160 sequences which appear to be specific to
            Chironomus (Midge) species.
          Length = 758

 Score = 54.0 bits (129), Expect = 9e-07
 Identities = 56/271 (20%), Positives = 121/271 (44%), Gaps = 27/271 (9%)

Query: 1880 STVVMSTLN--SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
            S ++ ++ N  +++S N       S S + N   S S+  S+  S STT+++  + S +T
Sbjct: 78   SGIIKASFNLIAMISANIQAIQSGSGSASGN---SSSSANSTSNSNSTTSNNSTTSSNST 134

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
            ++  + +++++S  S  T+ +S+VS   T +    +S+   +    S    +L    + +
Sbjct: 135  TTTSNSTSSSNSTSSGLTSGASVVSLIDTCAWVYQDSSVGIAYLMVSIL--ALFYGQSVS 192

Query: 1998 SSPESE-------STTTISPVSESTTTSSPVSESTTTIS---------PESESTTTSSPA 2041
            + P ++       +  + + V +S    + ++    TI+          + +    +   
Sbjct: 193  APPYADLGIPALPANCSGAGVPQSVQIKAAIAYINITINFINLTGQQFEDLQGPVATDCG 252

Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
              +TT+  P          A E+  + S ++ ST+ S+  S STT S+    STTT++  
Sbjct: 253  CPNTTSVAPLVAEWEAILAALEAFANGSASANSTSNSNSTSNSTTNSN----STTTTNST 308

Query: 2102 SESTTTSSPESESTTTSSPASESTTIEEQGV 2132
            + + +TSS  S +       + + TI  Q +
Sbjct: 309  TSTNSTSSSNSSTIAGCIDIAANFTIALQNL 339



 Score = 46.7 bits (110), Expect = 2e-04
 Identities = 61/250 (24%), Positives = 95/250 (38%), Gaps = 46/250 (18%)

Query: 1850 LISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNN 1909
            LI+M++A   AI      S     + N+S S    S  NS  S N+TT+S  + +TTT+N
Sbjct: 87   LIAMISANIQAIQ-----SGSGSASGNSSSSANSTSNSNSTTSNNSTTSS--NSTTTTSN 139

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPES--------------ESTTTSSPESEST 1955
              S S +TSS  +   +  SL+          S                 + S+P     
Sbjct: 140  STSSSNSTSSGLTSGASVVSLIDTCAWVYQDSSVGIAYLMVSILALFYGQSVSAPPYADL 199

Query: 1956 TTSSLVSESTTTSSPES-----------------ESTTTSSPESESTTTSSLVSESTTTS 1998
               +L +  +    P+S                   T     + +    +     +TT+ 
Sbjct: 200  GIPALPANCSGAGVPQSVQIKAAIAYINITINFINLTGQQFEDLQGPVATDCGCPNTTSV 259

Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
            +P       I    E+    S  + ST+     S ST+ S+  S STTT N    STT+ 
Sbjct: 260  APLVAEWEAILAALEAFANGSASANSTSN----SNSTSNSTTNSNSTTTTN----STTST 311

Query: 2059 NPASESITSS 2068
            N  S S +S+
Sbjct: 312  NSTSSSNSST 321



 Score = 37.4 bits (86), Expect = 0.11
 Identities = 18/69 (26%), Positives = 42/69 (60%)

Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
            I+++  A +S + S+  + S++ +S ++ ++TTS+ ++ S+ +++  S ST++S+  S  
Sbjct: 91   ISANIQAIQSGSGSASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSG 150

Query: 2125 TTIEEQGVS 2133
             T     VS
Sbjct: 151  LTSGASVVS 159



 Score = 37.1 bits (85), Expect = 0.16
 Identities = 27/134 (20%), Positives = 61/134 (45%), Gaps = 2/134 (1%)

Query: 1881 TVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSP 1940
            T   S+L + L+    T +    + + NN E +S+  +  ES     +++++        
Sbjct: 607  TKAESSLTAFLASFNATINATIAAASANNSEVQSSEAACIESSLADAAAILAMFEAAYQN 666

Query: 1941 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 2000
             +   + + P + +TTTSS  + +TTT++  +  TTT++  + +  T  L +   + +  
Sbjct: 667  CTAPGSVTVPAAANTTTSS--TTTTTTTTTTAAPTTTTTKAANAPFTYPLCNLIMSAACS 724

Query: 2001 ESESTTTISPVSES 2014
               +  T   +S +
Sbjct: 725  AGGAGCTYPFISSA 738



 Score = 34.4 bits (78), Expect = 0.86
 Identities = 18/78 (23%), Positives = 38/78 (48%)

Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
            E+    S  + ST+ S+  S STT S+  + + +T+S  S S++ SS ++     ++  +
Sbjct: 274  EAFANGSASANSTSNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSNSSTIAGCIDIAANFT 333

Query: 2003 ESTTTISPVSESTTTSSP 2020
             +   +  +     T +P
Sbjct: 334  IALQNLQALLLQEATCAP 351



 Score = 34.0 bits (77), Expect = 1.2
 Identities = 25/118 (21%), Positives = 58/118 (49%), Gaps = 11/118 (9%)

Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS-------- 1962
            ++ES+ T+   S + T ++ ++ ++  +S E +S+  +  ES     +++++        
Sbjct: 608  KAESSLTAFLASFNATINATIAAASANNS-EVQSSEAACIESSLADAAAILAMFEAAYQN 666

Query: 1963 --ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
                 + + P + +TTTSS  + +TTT++    +TTT +  +  T  +  +  S   S
Sbjct: 667  CTAPGSVTVPAAANTTTSSTTTTTTTTTTAAPTTTTTKAANAPFTYPLCNLIMSAACS 724



 Score = 34.0 bits (77), Expect = 1.4
 Identities = 24/109 (22%), Positives = 46/109 (42%), Gaps = 4/109 (3%)

Query: 1963 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
            E+    S  + ST+ S+  S STT S+  + + +T+S  S S++  S ++     ++  +
Sbjct: 274  EAFANGSASANSTSNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSNSSTIAGCIDIAANFT 333

Query: 2023 ESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
             +   +        T +PA  +    N K        P   + T+S  A
Sbjct: 334  IALQNLQALLLQEATCAPALAA----NAKKSGVRDFGPCKAAKTASGCA 378



 Score = 33.6 bits (76), Expect = 1.6
 Identities = 14/60 (23%), Positives = 35/60 (58%), Gaps = 1/60 (1%)

Query: 1879 ESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTS 1938
            E   +++ L +  + + + NS  + ++T+N+  + S +T++  S ++T S+  S S+T +
Sbjct: 265  EWEAILAALEAFANGSASANSTSNSNSTSNS-TTNSNSTTTTNSTTSTNSTSSSNSSTIA 323



 Score = 31.3 bits (70), Expect = 7.5
 Identities = 29/100 (29%), Positives = 51/100 (51%), Gaps = 9/100 (9%)

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
            ++ES+ T+  AS + T N     +T     A+ S   SS A+   ++ + A+        
Sbjct: 608  KAESSLTAFLASFNATIN-----ATIAAASANNSEVQSSEAACIESSLADAAAILAMFEA 662

Query: 2091 ASESTT----TSSPASESTTTSSPESESTTTSSPASESTT 2126
            A ++ T     + PA+ +TTTSS  + +TTT++ A  +TT
Sbjct: 663  AYQNCTAPGSVTVPAAANTTTSSTTTTTTTTTTAAPTTTT 702


>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
          Length = 1320

 Score = 53.8 bits (129), Expect = 1e-06
 Identities = 45/215 (20%), Positives = 70/215 (32%), Gaps = 19/215 (8%)

Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS----SPESE 1953
             S   + +   + + E   +   E    T          T +     T        P  E
Sbjct: 555  QSIIKQGSEDTDVDMEEAISEQEEKHEITIVDCAEPVRNTQNSLQIDTLDHESSEQPLEE 614

Query: 1954 STTTSSLVSESTTTSSP-ESESTTTSSPESESTT-TSSLVSESTTTSSPE-------SES 2004
                 S VS+  T  SP +      S  +S S +  S+ VS +  ++  E       S S
Sbjct: 615  KNALHSSVSKLNTEESPSKMVEIRPSCQDSVSESGVSTGVSVADESNDSENELVNCASPS 674

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            + +I PV  S    SP    +  I    +S  TSS  + S      + +S   +    E 
Sbjct: 675  SLSIVPVEVSPVLKSPTLSVSPRIRNSRKSLRTSSMLTAS------QKDSEDESKLTPED 728

Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
               S   S    +SS  S   + S  A      +S
Sbjct: 729  AEPSFAKSMKNNSSSALSTQKSKSFLAPTEHLAAS 763



 Score = 46.9 bits (111), Expect = 2e-04
 Identities = 50/225 (22%), Positives = 78/225 (34%), Gaps = 18/225 (8%)

Query: 1928 SSLVSESTTTSSPESESTTTSSPES---ESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
             +  +E         ES  +S  +S   + +  + +  E   +   E    T        
Sbjct: 532  PAGAAEGNNVDMGRVESIHSSDQQSIIKQGSEDTDVDMEEAISEQEEKHEITIVDCAEPV 591

Query: 1985 TTTSSLVSESTTTS----SPESESTTTISPVSESTTTSSP-----VSESTTTISPESEST 2035
              T + +   T        P  E     S VS+  T  SP     +  S      ES  +
Sbjct: 592  RNTQNSLQIDTLDHESSEQPLEEKNALHSSVSKLNTEESPSKMVEIRPSCQDSVSESGVS 651

Query: 2036 TTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
            T  S A ES   N+ ++E     N AS S  S  P   S    SP    +     + +S 
Sbjct: 652  TGVSVADES---NDSENELV---NCASPSSLSIVPVEVSPVLKSPTLSVSPRIRNSRKSL 705

Query: 2096 TTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLS 2140
             TSS  + S   S  ES+ T   +  S + +++    S  S + S
Sbjct: 706  RTSSMLTASQKDSEDESKLTPEDAEPSFAKSMKNNSSSALSTQKS 750



 Score = 36.5 bits (84), Expect = 0.21
 Identities = 35/194 (18%), Positives = 68/194 (35%), Gaps = 27/194 (13%)

Query: 1833 DSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLS 1892
            D+ +++            I+++       +  ++         ++ +     + L+S +S
Sbjct: 564  DTDVDMEEAISEQEEKHEITIVDCAEPVRNTQNSLQIDTLDHESSEQPLEEKNALHSSVS 623

Query: 1893 ENTTTNSPE----------------------SESTTTNNPESESTTTSSPESESTTTSSL 1930
            +  T  SP                       S +  +N+ E+E    +SP S S     +
Sbjct: 624  KLNTEESPSKMVEIRPSCQDSVSESGVSTGVSVADESNDSENELVNCASPSSLSIVPVEV 683

Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1990
               S    SP    +       +S  TSS+++ S   S  E ES  T      S   S  
Sbjct: 684  ---SPVLKSPTLSVSPRIRNSRKSLRTSSMLTASQKDS--EDESKLTPEDAEPSFAKSMK 738

Query: 1991 VSESTTTSSPESES 2004
             + S+  S+ +S+S
Sbjct: 739  NNSSSALSTQKSKS 752


>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
          Length = 693

 Score = 53.4 bits (128), Expect = 1e-06
 Identities = 20/49 (40%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 2357 HSVKIIGWGKSSQN---EPYWLCTNSYNQGWGEQGLFKIRRGVNMCSIE 2402
            H++ ++GWG+   N     YW+  NS+ + WG++G FKI RG N   IE
Sbjct: 620  HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
            heparin sulfate proteoglycans which are implicated in the
            binding of extracellular matrix components and growth
            factors.
          Length = 207

 Score = 50.9 bits (122), Expect = 2e-06
 Identities = 38/144 (26%), Positives = 61/144 (42%), Gaps = 8/144 (5%)

Query: 1929 SLVSESTTTSSPESESTTTSSPESE-STTTSSLVSESTTTSSPESESTTTSS--PESEST 1985
            +L ++    +   +E       + E S      + +        S S  T S   +SE  
Sbjct: 12   ALSAQPALAAQAAAEYPDERYLDEEGSGDDDEFIDDEMDDEYSGSGSGATPSDDEDSEPV 71

Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
            TTS+   + TTTSS  S  TTT S  ++++ T S    +TT  SP SE+ T  +  + ST
Sbjct: 72   TTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT--SP-SETDTEEATTTVST 128

Query: 2046 TTNNPKSESTTTNNPASESITSSS 2069
             T      S  T+   S+++    
Sbjct: 129  ETPTEGGSSAATD--PSKNLLERK 150



 Score = 50.1 bits (120), Expect = 2e-06
 Identities = 31/91 (34%), Positives = 51/91 (56%), Gaps = 3/91 (3%)

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
            S +T +++  SE +T+S+   + TTTSS  S  TTT+S +++++ T S    +TT  SP 
Sbjct: 58   SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT--SP- 114

Query: 2112 SESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
            SE+ T  +  + ST    +G S  +   S N
Sbjct: 115  SETDTEEATTTVSTETPTEGGSSAATDPSKN 145



 Score = 47.0 bits (112), Expect = 3e-05
 Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 4/121 (3%)

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
            S        +      S   S +T +++  SE  TT+    +  T+SS  S  TTT+S +
Sbjct: 38   SGDDDEFIDDEMDDEYSGSGSGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTS 97

Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA-SESTTIEEQGVSPHSEKLS 2140
            ++++ T S    +TT+    SE+ T  +  + ST T +   S + T   + +    E L+
Sbjct: 98   TKTSPTVSTTVTTTTS---PSETDTEEATTTVSTETPTEGGSSAATDPSKNLLERKEVLA 154

Query: 2141 A 2141
            A
Sbjct: 155  A 155



 Score = 47.0 bits (112), Expect = 4e-05
 Identities = 30/89 (33%), Positives = 49/89 (55%), Gaps = 4/89 (4%)

Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1961
            S +T +++ +SE  TTS+   + TTTSS  S  TTT+S  ++++ T S    +TT+    
Sbjct: 58   SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSP--- 114

Query: 1962 SESTTTSSPESESTTTSS-PESESTTTSS 1989
            SE+ T  +  + ST T +   S + T  S
Sbjct: 115  SETDTEEATTTVSTETPTEGGSSAATDPS 143



 Score = 46.7 bits (111), Expect = 4e-05
 Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 3/91 (3%)

Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
            S +T +   +SE  TTS+   + TTTSS  S  TTT+S  ++++ T S    +TT  SP 
Sbjct: 58   SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT--SP- 114

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPES 2002
            SE+ T  +  + ST T +    S  T   ++
Sbjct: 115  SETDTEEATTTVSTETPTEGGSSAATDPSKN 145



 Score = 44.7 bits (106), Expect = 2e-04
 Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 1/94 (1%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
            S  T ++  +SE  TT+    + TTTSS  S  TTT+S  ++++ T S    +TT+ S E
Sbjct: 58   SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPS-E 116

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
            +++   ++ VS  T T    S +T  S    E  
Sbjct: 117  TDTEEATTTVSTETPTEGGSSAATDPSKNLLERK 150



 Score = 44.3 bits (105), Expect = 2e-04
 Identities = 31/104 (29%), Positives = 46/104 (44%), Gaps = 11/104 (10%)

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
            S +T +   +SE  TT +   + TTTSS  S  TTT S  ++++ T S    +TT+  P 
Sbjct: 58   SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTS--P- 114

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
            SE+ T     +         S  T T   +S +T  S    E  
Sbjct: 115  SETDTEEATTT--------VSTETPTEGGSSAATDPSKNLLERK 150



 Score = 43.2 bits (102), Expect = 6e-04
 Identities = 25/86 (29%), Positives = 40/86 (46%)

Query: 2067 SSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
             S   S +T +    SE  TTS+   + TTTSS  S  TTT+S  ++++ T S    +TT
Sbjct: 53   YSGSGSGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT 112

Query: 2127 IEEQGVSPHSEKLSANEDPEEFPNED 2152
               +  +  +    + E P E  +  
Sbjct: 113  SPSETDTEEATTTVSTETPTEGGSSA 138



 Score = 39.3 bits (92), Expect = 0.012
 Identities = 26/86 (30%), Positives = 41/86 (47%), Gaps = 3/86 (3%)

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
            S +T +    SE  TT +   + TTTSS  S  TTT +  ++++ T +    + TS    
Sbjct: 58   SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTS---P 114

Query: 2072 SESTTTSSPASESTTTSSPASESTTT 2097
            SE+ T  +  + ST T +    S  T
Sbjct: 115  SETDTEEATTTVSTETPTEGGSSAAT 140



 Score = 35.1 bits (81), Expect = 0.28
 Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 4/84 (4%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTT----TSSPESESTTT 1947
            SE  TT++   + TTT++  S  TTT+S  ++++ T S    +TT    T + E+ +T +
Sbjct: 68   SEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVS 127

Query: 1948 SSPESESTTTSSLVSESTTTSSPE 1971
            +   +E  ++++           E
Sbjct: 128  TETPTEGGSSAATDPSKNLLERKE 151


>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1).  This family
            consists of several mammalian dentin matrix protein 1
            (DMP1) sequences. The dentin matrix acidic phosphoprotein
            1 (DMP1) gene has been mapped to human chromosome 4q21.
            DMP1 is a bone and teeth specific protein initially
            identified from mineralised dentin. DMP1 is primarily
            localised in the nuclear compartment of undifferentiated
            osteoblasts. In the nucleus, DMP1 acts as a
            transcriptional component for activation of
            osteoblast-specific genes like osteocalcin. During the
            early phase of osteoblast maturation, Ca(2+) surges into
            the nucleus from the cytoplasm, triggering the
            phosphorylation of DMP1 by a nuclear isoform of casein
            kinase II. This phosphorylated DMP1 is then exported out
            into the extracellular matrix, where it regulates
            nucleation of hydroxyapatite. DMP1 is a unique molecule
            that initiates osteoblast differentiation by
            transcription in the nucleus and orchestrates mineralised
            matrix formation extracellularly, at later stages of
            osteoblast maturation. The DMP1 gene has been found to be
            ectopically expressed in lung cancer although the reason
            for this is unknown.
          Length = 514

 Score = 52.7 bits (126), Expect = 2e-06
 Identities = 58/213 (27%), Positives = 96/213 (45%), Gaps = 9/213 (4%)

Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPES-ESTTTNNPESESTTTSSPESESTTTSSLVS 1932
             ++N+      ST N+ LS++   +  ES E +  N  + +S     P SES+  + L S
Sbjct: 284  DDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQDPSSESSQEADLPS 343

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
            +  ++ S E E  + S  ++   TTS    +  + SS E    T SS ES+ST       
Sbjct: 344  QENSSESQE-EVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQST------E 396

Query: 1993 ESTTTSSPESESTTTISPVS-ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
            E   + S ES S++  SP S E   +SS     + + S ES S  + S     +  ++  
Sbjct: 397  EQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSD 456

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASES 2084
            S+ ++ +   S S  S+S + E     +   ES
Sbjct: 457  SQDSSRSKEDSNSTESASSSEEDGQPKNTEIES 489



 Score = 52.0 bits (124), Expect = 3e-06
 Identities = 59/252 (23%), Positives = 102/252 (40%), Gaps = 11/252 (4%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
            E E   +     ES +   P  +    S +  E       +S +    S  +E+   S  
Sbjct: 245  EDEEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQS 304

Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES-----T 2015
               S + S  +SE   +     E    SS  S+     S E+ S +    VSES      
Sbjct: 305  REHSRSESQEDSEENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRGDNPD 364

Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNP-KSESTTTNNPASESITSS----SP 2070
             T+S   +   + S E +S  T S +SES +T     SES  + + + ES  S+    S 
Sbjct: 365  NTTSHSEDQEDSESSEEDSLDTPS-SSESQSTEEQADSESNESLSSSEESPESTEDENSS 423

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
            + E   + S ++ES +  S + + + +    S+S  +S  + +S +T S +S     + +
Sbjct: 424  SQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSRSKEDSNSTESASSSEEDGQPK 483

Query: 2131 GVSPHSEKLSAN 2142
                 S KL+ +
Sbjct: 484  NTEIESRKLTVD 495



 Score = 51.6 bits (123), Expect = 4e-06
 Identities = 62/242 (25%), Positives = 104/242 (42%), Gaps = 20/242 (8%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
             E   +N+ E +S +T     E+   S     S + S    +S    S E +S     P 
Sbjct: 281  GELDDSNTMEVKSDST-----ENAGLSQSREHSRSESQ--EDSEENQSQE-DSQEVQDPS 332

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
            SES+  + L S+  ++ S E E  + S  ++   TTS    +  + SS E    T  S  
Sbjct: 333  SESSQEADLPSQENSSESQE-EVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSS-- 389

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS-ESTTTNNPASESITSSSP 2070
            SES +T           S  +ES ++S  + EST   N  S E   +++ ++ES +  S 
Sbjct: 390  SESQSTEEQAD------SESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQ 443

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPAS--ESTTTSSPESESTTTSSPASESTTIE 2128
            + + + +    S+S  +S    +S +T S +S  E     + E ES   +  A  +  I 
Sbjct: 444  SEQDSRSEEDDSDSQDSSRSKEDSNSTESASSSEEDGQPKNTEIESRKLTVDAYHNKPIG 503

Query: 2129 EQ 2130
            +Q
Sbjct: 504  DQ 505



 Score = 49.3 bits (117), Expect = 2e-05
 Identities = 59/263 (22%), Positives = 102/263 (38%), Gaps = 14/263 (5%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESEST--TTSSLVSESTTTSSPESESTTTSS 1949
            +E+   + PE   +T ++   E       E ES+    S    E   +  PES    T S
Sbjct: 170  NEDEVDSRPEGGDSTQDSESEEHWVGGGSEGESSHGDGSEFDDEGMQSDDPES----TRS 225

Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
                S  +S+ +    +    E +++T  S ES+S    S      +  S E        
Sbjct: 226  ERGNSRMSSAGLKSKESKGEDEEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDD 285

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
              S +    S  +E+         S + S    E +  N  + +S    +P+SES   S 
Sbjct: 286  --SNTMEVKSDSTENAGLSQSREHSRSESQ---EDSEENQSQEDSQEVQDPSSES---SQ 337

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
             A   +  +S  S+    S    ++   ++  SE    S    E +  +  +SES + EE
Sbjct: 338  EADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQSTEE 397

Query: 2130 QGVSPHSEKLSANEDPEEFPNED 2152
            Q  S  +E LS++E+  E   ++
Sbjct: 398  QADSESNESLSSSEESPESTEDE 420



 Score = 47.0 bits (111), Expect = 1e-04
 Identities = 60/260 (23%), Positives = 98/260 (37%), Gaps = 21/260 (8%)

Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE- 1951
            E   ++ PES  +   N    S    S ES+         +++T  S ES+S    S + 
Sbjct: 213  EGMQSDDPESTRSERGNSRMSSAGLKSKESKGEDEE----QASTQDSGESQSVEYPSRKF 268

Query: 1952 ------SESTTTSSLVSESTT-TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
                  SE      L   +T    S  +E+   S     S + S    +S    S E +S
Sbjct: 269  FRKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQ--EDSEENQSQE-DS 325

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
                 P SES   S      +   S ES+    S    ++       SE    +  + E 
Sbjct: 326  QEVQDPSSES---SQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEED 382

Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
               +  +SES +T   A   +  S  +SE +  S+    S++    +S S +T S + ES
Sbjct: 383  SLDTPSSSESQSTEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQES 442

Query: 2125 TTIEEQGVSPHSEKLSANED 2144
             + ++   S   E  S ++D
Sbjct: 443  QSEQD---SRSEEDDSDSQD 459



 Score = 42.0 bits (98), Expect = 0.004
 Identities = 55/258 (21%), Positives = 101/258 (39%), Gaps = 21/258 (8%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
            + E   +++PES    T S    S  +S+ +    +    E +++T  S ES+S    S 
Sbjct: 211  DDEGMQSDDPES----TRSERGNSRMSSAGLKSKESKGEDEEQASTQDSGESQSVEYPSR 266

Query: 1961 VSESTTTSSPE---SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
                 +  S E    E   +++ E +S +T     E+   S     S    S   E +  
Sbjct: 267  KFFRKSRISEEDGRGELDDSNTMEVKSDST-----ENAGLSQSREHSR---SESQEDSEE 318

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
            +    +S     P SES   S  A   +  N+ +S+    +    ++  +++  SE    
Sbjct: 319  NQSQEDSQEVQDPSSES---SQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQED 375

Query: 2078 SSPASESTTTSSPASESTTTSSPA---SESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
            S  + E +  +  +SES +T   A   S  + +SS ES  +T    +S    ++    S 
Sbjct: 376  SESSEEDSLDTPSSSESQSTEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSAST 435

Query: 2135 HSEKLSANEDPEEFPNED 2152
             S    +  + +    ED
Sbjct: 436  ESRSQESQSEQDSRSEED 453



 Score = 41.6 bits (97), Expect = 0.005
 Identities = 50/236 (21%), Positives = 89/236 (37%), Gaps = 14/236 (5%)

Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPE---SESTTTSSPESESTTTSSLVSESTTT 1967
            E +++T  S ES+S    S      +  S E    E   +++ E +S +T     E+   
Sbjct: 247  EEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDDSNTMEVKSDST-----ENAGL 301

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
            S     S + S  +SE   +     +S     P SES+         +  +S  S+    
Sbjct: 302  SQSREHSRSESQEDSEENQSQE---DSQEVQDPSSESS---QEADLPSQENSSESQEEVV 355

Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
                 ++   ++  SE    +    E +     +SES ++   A   +  S  +SE +  
Sbjct: 356  SESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQSTEEQADSESNESLSSSEESPE 415

Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
            S+    S++     S S +T S   ES +     SE    + Q  S   E  ++ E
Sbjct: 416  STEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSRSKEDSNSTE 471



 Score = 34.2 bits (78), Expect = 1.0
 Identities = 60/291 (20%), Positives = 99/291 (34%), Gaps = 41/291 (14%)

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
              S   S  +S  TT SS +S     +S    ++ +   ++E    S PE   +T  S  
Sbjct: 130  GNSRLGSDEDSADTTQSSEDSTPQGENSAQDTTSESRDLDNEDEVDSRPEGGDSTQDSES 189

Query: 1992 SESTTTSSPESEST------------------TTISPVSESTTTSSPVSESTTTISPESE 2033
             E       E ES+                  +T S    S  +S+ +    +    E +
Sbjct: 190  EEHWVGGGSEGESSHGDGSEFDDEGMQSDDPESTRSERGNSRMSSAGLKSKESKGEDEEQ 249

Query: 2034 STTTSSPASESTTTNNPK------------------SESTTTNNPASESITSSSPASEST 2075
            ++T  S  S+S    + K                  S +    + ++E+   S     S 
Sbjct: 250  ASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSR 309

Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
            + S   SE   +   + E    SS +S+     S E+ S +     SES        + H
Sbjct: 310  SESQEDSEENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSH 369

Query: 2136 SEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREEWPQ 2186
            SE    +E  E    ED    T +   +     Q D    E+  + EE P+
Sbjct: 370  SEDQEDSESSE----EDSL-DTPSSSESQSTEEQADSESNESLSSSEESPE 415



 Score = 31.6 bits (71), Expect = 6.6
 Identities = 27/113 (23%), Positives = 51/113 (45%), Gaps = 3/113 (2%)

Query: 1865 DNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESE---STTTSSPE 1921
            ++  E    T ++SES       +S  +E+ +++    EST   N  S+    + ++S E
Sbjct: 377  ESSEEDSLDTPSSSESQSTEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTE 436

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
            S S  + S     +     +S+ ++ S  +S ST ++S   E     + E ES
Sbjct: 437  SRSQESQSEQDSRSEEDDSDSQDSSRSKEDSNSTESASSSEEDGQPKNTEIES 489


>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
            nomenclature), also referred to as the papain family;
            composed of two subfamilies of cysteine peptidases (CPs),
            C1A (papain) and C1B (bleomycin hydrolase). Papain-like
            enzymes are mostly endopeptidases with some exceptions
            like cathepsins B, C, H and X, which are exopeptidases.
            Papain-like CPs have different functions in various
            organisms. Plant CPs are used to mobilize storage
            proteins in seeds while mammalian CPs are primarily
            lysosomal enzymes responsible for protein degradation in
            the lysosome. Papain-like CPs are synthesized as inactive
            proenzymes with N-terminal propeptide regions, which are
            removed upon activation. Bleomycin hydrolase (BH) is a CP
            that detoxifies bleomycin by hydrolysis of an amide
            group. It acts as a carboxypeptidase on its C-terminus to
            convert itself into an aminopeptidase and peptide ligase.
            BH is found in all tissues in mammals as well as in many
            other eukaryotes. It forms a hexameric ring barrel
            structure with the active sites imbedded in the central
            channel. Some members of the C1 family are proteins
            classified as non-peptidase homologs which lack peptidase
            activity or have missing active site residues.
          Length = 223

 Score = 50.6 bits (121), Expect = 3e-06
 Identities = 42/245 (17%), Positives = 67/245 (27%), Gaps = 71/245 (28%)

Query: 2187 CKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSVWDKAISAASVMSD 2246
                +  V +QG+  SCW                                A ++A  +  
Sbjct: 5    RPLRLTPVKNQGSRGSCW--------------------------------AFASAYALES 32

Query: 2247 RICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAW-MYWINAGLVDGG 2305
               I+        LSPQ+L      C           C GG   +A        G+    
Sbjct: 33   AYRIKGGEDEYVDLSPQYLY----ICANDECLGINGSCDGGGPLSALLKLVALKGIPPEE 88

Query: 2306 D--YGTHDVSMGRYIEGIGHAASV-------------------MGSSNPEVNNFEKVIRL 2344
            D  YG          E   +AA V                   +    P V  F+     
Sbjct: 89   DYPYGAESDGEEPKSEAALNAAKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGF 148

Query: 2345 YSCEGSINPRYI------------HSVKIIGWGKS-SQNEPYWLCTNSYNQGWGEQGLFK 2391
               +  I    I            H+V I+G+  +  + +  ++  NS+   WG+ G  +
Sbjct: 149  DRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGR 208

Query: 2392 IRRGV 2396
            I    
Sbjct: 209  ISYED 213


>gnl|CDD|216860 pfam02063, MARCKS, MARCKS family. 
          Length = 296

 Score = 51.4 bits (122), Expect = 3e-06
 Identities = 45/236 (19%), Positives = 78/236 (33%), Gaps = 31/236 (13%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
               E   +     E    +S E +     +  +E  + +  E E+ T S+ ++E   T S
Sbjct: 69   TGKEEAASAAAAEEKEAAASTEPDKEPAEAEPAEPASPAEAEGEAAT-STEKAEDGATPS 127

Query: 1960 LVSES----------------TTTSSPESESTTTSSPESESTTTSSL-VSESTTTSSPES 2002
              SE+                +  S  +++       E+E          E    ++PE+
Sbjct: 128  PSSETPKKKKKRFSFKKSFKLSGFSFKKNKKEAGEGAEAEGAAAEKEGAKEEAAAAAPEA 187

Query: 2003 ESTTTISPVSE----STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
             S    +   E    +        E      PE         A E      P++E     
Sbjct: 188  GSGEEAAAPGEEAGAAGAEGEAGEEPAADAEPEQPEAKPEEAAPEK-----PQAEEAK-- 240

Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
              A E      PA E+   SS A E+      A+ +   ++P  E+ + SSPE+  
Sbjct: 241  -AAEEQKAEEKPAEEAGA-SSAAQEAPAAEQEAAPAEEPAAPPQEACSESSPEAPP 294



 Score = 50.2 bits (119), Expect = 6e-06
 Identities = 45/241 (18%), Positives = 81/241 (33%), Gaps = 29/241 (12%)

Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP---ESESTTTSSL 1960
            S        E   +++   E    +S   +       E+E    +SP   E E+ T++  
Sbjct: 63   SAPAEETGKEEAASAAAAEEKEAAASTEPDK---EPAEAEPAEPASPAEAEGEAATSTE- 118

Query: 1961 VSESTTTSSPESES----------------TTTSSPESESTTTSSLVSESTTTSSPES-E 2003
             +E   T SP SE+                +  S  +++        +E        + E
Sbjct: 119  KAEDGATPSPSSETPKKKKKRFSFKKSFKLSGFSFKKNKKEAGEGAEAEGAAAEKEGAKE 178

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
                 +P + S   ++   E       E E+    +  +E       K E      P +E
Sbjct: 179  EAAAAAPEAGSGEEAAAPGEEAGAAGAEGEAGEEPAADAEPEQPE-AKPEEAAPEKPQAE 237

Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
                +  A E      PA E+   SS A E+      A+ +   ++P  E+ + SSP + 
Sbjct: 238  E---AKAAEEQKAEEKPAEEAGA-SSAAQEAPAAEQEAAPAEEPAAPPQEACSESSPEAP 293

Query: 2124 S 2124
             
Sbjct: 294  P 294



 Score = 45.2 bits (106), Expect = 3e-04
 Identities = 49/253 (19%), Positives = 84/253 (33%), Gaps = 13/253 (5%)

Query: 1903 ESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE-STTTSSLV 1961
            E+ T   P  E+   SSP   +   +  V  +   S   +E+      ++  S       
Sbjct: 12   EAATAERP-GEAAVASSPSKANGQENGHVKVNGDASPAAAEAGAKEELQANGSAPAEETG 70

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
             E   +++   E    +S E +     +  +E  + +  E E+ T+ +  +E   T SP 
Sbjct: 71   KEEAASAAAAEEKEAAASTEPDKEPAEAEPAEPASPAEAEGEAATS-TEKAEDGATPSPS 129

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPK--SESTTTNNPASESITSSSPASESTTTSS 2079
            SE T     +  S   S   S  +   N K   E       A+E   +   A+ +   + 
Sbjct: 130  SE-TPKKKKKRFSFKKSFKLSGFSFKKNKKEAGEGAEAEGAAAEKEGAKEEAAAAAPEAG 188

Query: 2080 PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKL 2139
               E+      A  +        E    + PE         A E    EE        K 
Sbjct: 189  SGEEAAAPGEEAGAAGAEGEAGEEPAADAEPEQPEAKPEEAAPEKPQAEEA-------KA 241

Query: 2140 SANEDPEEFPNED 2152
            +  +  EE P E+
Sbjct: 242  AEEQKAEEKPAEE 254


>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2).  All proteins in
            this family for which functions are known are flap
            endonucleases that generate the 3' incision next to DNA
            damage as part of nucleotide excision repair. This family
            is related to many other flap endonuclease families
            including the fen1 family. This family is based on the
            phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
            Stanford University) [DNA metabolism, DNA replication,
            recombination, and repair].
          Length = 1034

 Score = 52.6 bits (126), Expect = 3e-06
 Identities = 53/289 (18%), Positives = 94/289 (32%), Gaps = 21/289 (7%)

Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
             ++         +   +   T+S  E+      SL+  +T  S   SE T        S 
Sbjct: 457  LSSVNSKPEAVASTKIAREVTSSGHEAVPKAVQSLLLGATNDSPIPSEFTILDRKSELSI 516

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
                 V   ++     S+     +  +E T     +S        + E    +SP+    
Sbjct: 517  --ERTVKPVSSEFGLPSQREDKLAIPTEGTQNLQGIS----DHPEQFEFQNELSPLETKN 570

Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
              S+  S++ T  SP  E  + SS    S   +N       T NP        S A E  
Sbjct: 571  NESNLSSDAETEGSPNPEMPSWSSVTVPSEALDN-----YETTNP--------SNAKEVR 617

Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS-SPASESTTIEEQGVSP 2134
              +    ++T     A     ++    E   +   ESES  +     S S+T+E Q  S 
Sbjct: 618  NFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESESDGSFIEVDSVSSTLELQVPSK 677

Query: 2135 HSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
                  + E+ E      +      EI ++      ++ I    +  ++
Sbjct: 678  SQPTDESEENAEN-KVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKD 725



 Score = 31.4 bits (71), Expect = 7.7
 Identities = 29/133 (21%), Positives = 47/133 (35%), Gaps = 4/133 (3%)

Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE 1933
            + N    +    T+ S   +N  T +P +     N  E+   TT+  ES      S   E
Sbjct: 584  SPNPEMPSWSSVTVPSEALDNYETTNPSNAKEVRNFAETGIQTTNVGESADLLLISNPME 643

Query: 1934 STTTSSPESESTTT--SSPESES-TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1990
                 S E E + +  S  E +S ++T  L   S +  + ESE    +   S        
Sbjct: 644  VEPMES-EKEESESDGSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHRKE 702

Query: 1991 VSESTTTSSPESE 2003
            + +     S E  
Sbjct: 703  IEDLLFDESEEDN 715


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 51.3 bits (122), Expect = 5e-06
 Identities = 29/118 (24%), Positives = 54/118 (45%), Gaps = 4/118 (3%)

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESEST- 2035
             +   S + T SS ++ S+      +   ++++  S +TT +S V+ S+  + P   +  
Sbjct: 302  AAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLP 361

Query: 2036 -TTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
             T + PA+E          +T T   ++ +ITS+  A+  TT+   A  S    SP S
Sbjct: 362  GTVALPAAEPVNMQPQPMSTTETQQSSTGNITST--ANGPTTSLPAAPASNIPVSPTS 417



 Score = 49.3 bits (117), Expect = 2e-05
 Identities = 29/142 (20%), Positives = 60/142 (42%), Gaps = 4/142 (2%)

Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPE 1951
            +N   +  ++      +        +   S + T SS ++ S+    SP    ++ ++  
Sbjct: 278  DNRALDIEKATGLKQIDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQ- 336

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
            S +TT +S V+ S+    P S+ T   +    +    ++  +  +T+  +  ST  I+  
Sbjct: 337  SATTTQASAVALSSAGVLP-SDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITST 395

Query: 2012 SESTTTSSPVS-ESTTTISPES 2032
            +   TTS P +  S   +SP S
Sbjct: 396  ANGPTTSLPAAPASNIPVSPTS 417



 Score = 48.9 bits (116), Expect = 3e-05
 Identities = 25/131 (19%), Positives = 47/131 (35%), Gaps = 11/131 (8%)

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
            +      +P S  T +S++   S    SP         P S +T +++    S   +S  
Sbjct: 298  TVPVAAVTPSSAVTQSSAITPSSAAIPSPAV------IPSSVTTQSATTTQASAVALS-- 349

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
              +    S  +   T   P +E          +  +   ++ + T++  A+  TT+   A
Sbjct: 350  -SAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITST--ANGPTTSLPAA 406

Query: 2092 SESTTTSSPAS 2102
              S    SP S
Sbjct: 407  PASNIPVSPTS 417



 Score = 44.7 bits (105), Expect = 5e-04
 Identities = 31/132 (23%), Positives = 53/132 (40%), Gaps = 9/132 (6%)

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            T    PV+  T +S+    S  T S  +  +    P+S +T +      S       S +
Sbjct: 295  THGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVA---LSSA 351

Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
                S  +   T + PA+E          +T T   ++ + T+++      TTS PA+ +
Sbjct: 352  GVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTA---NGPTTSLPAAPA 408

Query: 2125 TTIEEQGVSPHS 2136
            + I    VSP S
Sbjct: 409  SNIP---VSPTS 417



 Score = 44.7 bits (105), Expect = 5e-04
 Identities = 25/122 (20%), Positives = 46/122 (37%), Gaps = 3/122 (2%)

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
            +      +P S  T +S+    S    S       +S     +TTT +     ++   L 
Sbjct: 298  TVPVAAVTPSSAVTQSSAITPSSAAIPS--PAVIPSSVTTQSATTTQASAVALSSAGVLP 355

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS-ESTTTNNP 2050
            S+ T   +    +   ++   +  +T+     ST  I+  +   TTS PA+  S    +P
Sbjct: 356  SDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIPVSP 415

Query: 2051 KS 2052
             S
Sbjct: 416  TS 417



 Score = 40.1 bits (93), Expect = 0.016
 Identities = 19/95 (20%), Positives = 40/95 (42%), Gaps = 1/95 (1%)

Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
             +       P+S    SS+    S    SPA   ++ ++ ++ +T  S+ A  S      
Sbjct: 297  GTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPS 356

Query: 2111 E-SESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
            + +   T + PA+E   ++ Q +S    + S+  +
Sbjct: 357  DVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGN 391


>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295).  This
            family is conserved in fungi but the function is not
            known.
          Length = 509

 Score = 50.7 bits (121), Expect = 8e-06
 Identities = 49/242 (20%), Positives = 85/242 (35%), Gaps = 26/242 (10%)

Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
            +L   N    +P S   T   P SEST T +P  +    +   +EST+T+S +   +  S
Sbjct: 80   ALPMPNLAPITPPSSEPTPAPPSSESTATRTP--DPNQQALESTESTSTTSADCNDSEQS 137

Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTS----SPESESTTTSSLVSESTTTSSPESES 2004
            S  + +       S  T+TSS  +  +T+     SP   S++       ST   +     
Sbjct: 138  STPNLN-------SSDTSTSSSGALPSTSVVRGFSPSHISSSYR-----STAQLNKAPSP 185

Query: 2005 TTTISPVSESTTTSSPVSE--STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
            T +  P +          +  +  T+   S      S   +  ++ +PK  S     P  
Sbjct: 186  TKSAEPTAAPQAKPELPKKKQAMFTLGGSSGDDDEDS-FEDRMSSQDPKRSSLPKPKPKM 244

Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
              +  S    +S  +     + T +     +  T + P     T+   E    T      
Sbjct: 245  FQLGGSDELGKSLPSLMSPRKKTASFK--EQVVTRTFPER---TSDDDEDAIETEEDDVD 299

Query: 2123 ES 2124
            ES
Sbjct: 300  ES 301



 Score = 49.6 bits (118), Expect = 2e-05
 Identities = 44/187 (23%), Positives = 75/187 (40%), Gaps = 9/187 (4%)

Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPES----ESTTTSSPESESTTTSSLVSESTTTSS 1999
            S +  S  SE        ++S+ T         + + +S         +S   E    S 
Sbjct: 11   SASVDSAASEEAVDIEHHTDSSPTDISRPRIVRQDSCSSRSRGRERHITSDDLEKMVLSI 70

Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT--T 2057
             E +    ++    +    +P S   T   P SEST T +P        + +S STT   
Sbjct: 71   KEKKDLEPLALPMPNLAPITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSAD 130

Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTS---SPASESTTTSSPASESTTTSSPESES 2114
             N + +S T +  +S+++T+SS A  ST+     SP+  S++  S A  +   S  +S  
Sbjct: 131  CNDSEQSSTPNLNSSDTSTSSSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKSAE 190

Query: 2115 TTTSSPA 2121
             T +  A
Sbjct: 191  PTAAPQA 197



 Score = 46.5 bits (110), Expect = 1e-04
 Identities = 46/232 (19%), Positives = 85/232 (36%), Gaps = 16/232 (6%)

Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP---ESEST 1985
            +L   +    +P S   T + P SEST T     +    +   +EST+T+S    +SE +
Sbjct: 80   ALPMPNLAPITPPSSEPTPAPPSSESTATR--TPDPNQQALESTESTSTTSADCNDSEQS 137

Query: 1986 TTSSLVSESTTTSSPESESTTTI----SPVSESTTTSSPVSESTTTI-SPESESTTTSSP 2040
            +T +L S  T+TSS  +  +T++    SP   S++  S    +     +  +E T     
Sbjct: 138  STPNLNSSDTSTSSSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKSAEPTAAPQA 197

Query: 2041 ASESTTTNNPK-----SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
              E             S      +   + ++S  P   S     P       S    +S 
Sbjct: 198  KPELPKKKQAMFTLGGSSGDDDEDSFEDRMSSQDPKRSSLPKPKPKMFQLGGSDELGKSL 257

Query: 2096 TTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEE 2147
             +     + T +   +  + T     S+      +      ++ SA ED ++
Sbjct: 258  PSLMSPRKKTASFKEQVVTRTFPERTSDDDEDAIETEEDDVDE-SAIEDDDD 308



 Score = 44.9 bits (106), Expect = 5e-04
 Identities = 48/242 (19%), Positives = 88/242 (36%), Gaps = 19/242 (7%)

Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
            +S   E    S  E +     +    +    +  S   T + P SEST T +P  +    
Sbjct: 59   TSDDLEKMVLSIKEKKDLEPLALPMPNLAPITPPSSEPTPAPPSSESTATRTP--DPNQQ 116

Query: 1988 SSLVSESTTTSSPESESTTTISPV---SESTTTSSPVSESTTTI----SPESESTTTSSP 2040
            +   +EST+T+S +   +   S     S  T+TSS  +  +T++    SP   S++  S 
Sbjct: 117  ALESTESTSTTSADCNDSEQSSTPNLNSSDTSTSSSGALPSTSVVRGFSPSHISSSYRST 176

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSP-------ASESTTTSSPASESTTTSSPASE 2093
            A  +    +P   +  T  P ++               S          +  ++  P   
Sbjct: 177  AQLNKAP-SPTKSAEPTAAPQAKPELPKKKQAMFTLGGSSGDDDEDSFEDRMSSQDPKRS 235

Query: 2094 STTTSSPASESTTTSSPESESTTTS-SPASESTTIEEQGVS-PHSEKLSANEDPEEFPNE 2151
            S     P       S    +S  +  SP  ++ + +EQ V+    E+ S +++      E
Sbjct: 236  SLPKPKPKMFQLGGSDELGKSLPSLMSPRKKTASFKEQVVTRTFPERTSDDDEDAIETEE 295

Query: 2152 DV 2153
            D 
Sbjct: 296  DD 297



 Score = 43.0 bits (101), Expect = 0.002
 Identities = 50/232 (21%), Positives = 81/232 (34%), Gaps = 32/232 (13%)

Query: 1892 SENTTTNSPESESTTTNNPE--------SESTTTSSP---ESESTTTSSLVSESTTTSSP 1940
            S   T   P SEST T  P+        +EST+T+S    +SE ++T +L S  T+TSS 
Sbjct: 93   SSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSADCNDSEQSSTPNLNSSDTSTSSS 152

Query: 1941 ESESTTTS----SPESESTTTSSLVSESTT---TSSPESESTTTSSPE------------ 1981
             +  +T+     SP   S++  S    +     T S E  +   + PE            
Sbjct: 153  GALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKSAEPTAAPQAKPELPKKKQAMFTLG 212

Query: 1982 -SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
             S          +  ++  P+  S     P       S  + +S  ++    + T +   
Sbjct: 213  GSSGDDDEDSFEDRMSSQDPKRSSLPKPKPKMFQLGGSDELGKSLPSLMSPRKKTASFKE 272

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
               + T     S+         E     S A E     S   +S   S  +S
Sbjct: 273  QVVTRTFPERTSDDDEDAIETEEDDVDES-AIEDDDDDSDWEDSVEESGRSS 323


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
            presumed domain is functionally uncharacterized. This
            domain family is found in bacteria and eukaryotes, and is
            typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 50.2 bits (120), Expect = 9e-06
 Identities = 53/268 (19%), Positives = 76/268 (28%), Gaps = 32/268 (11%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
            SE+T+V                        +      +  S P S S + S+       +
Sbjct: 79   SEATIVRQAKEG-------ERPATPPEARPDEGFVRPSLPSHPRSRSASVSNSKDGDRPS 131

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT- 1996
              P S S T        T  + L S      SP+ +      PE +   +    S ++  
Sbjct: 132  DLPPSPSKTMDPRRWSPTKATWLESALNKPESPKHKPQPPQQPEWKKDLSRLRQSRASVD 191

Query: 1997 ---TSSPESEST----TTISPVSESTTTSSPVSEST------------TTISPESESTTT 2037
               T+S +  +      T  P S S + S                         S   T 
Sbjct: 192  LGRTNSFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTE 251

Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
             S A +   T +PKS         +E    S     S   S  AS    + S  S S   
Sbjct: 252  KSSAPKPRETLDPKSPEKAPPIDTTEEELKSP--EASPKESEEASARKRSPSLLSPSPKA 309

Query: 2098 SSP---ASESTTTSSPESESTTTSSPAS 2122
             SP   AS   +   P S      SP  
Sbjct: 310  ESPKPLASPGKSPRDPLSPRPKPQSPPV 337



 Score = 38.7 bits (90), Expect = 0.034
 Identities = 37/163 (22%), Positives = 59/163 (36%), Gaps = 18/163 (11%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
            TNS +  +            T  P S S + S    +S     P S  +  + PE     
Sbjct: 195  TNSFKEVTPVG------LMRTPPPGSHSKSPS----KSGIPDLPSSRDSEKTKPEKPQQE 244

Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
            TSS+ +E ++   P       S  ++    T+    +S   S  ESE        S    
Sbjct: 245  TSSMDTEKSSAPKPRETLDPKSPEKAPPIDTTEEELKSPEASPKESE------EASARKR 298

Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
            + S +S S    SP+  ++   SP      +  PK +S   N+
Sbjct: 299  SPSLLSPSPKAESPKPLASPGKSP--RDPLSPRPKPQSPPVND 339



 Score = 36.4 bits (84), Expect = 0.22
 Identities = 46/280 (16%), Positives = 85/280 (30%), Gaps = 36/280 (12%)

Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
            S  VS+  +   P   S   S   + ++  +   ++S +       +   SS    S + 
Sbjct: 21   SDSVSKRWSAQLPSGLSRGNSFLSNRNSDAAPSGTDSLSGRPASRLNREPSSRPGSSHSE 80

Query: 1988 SSLVSESTTTSSPESESTTTISPV-SESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
            +++V ++     P +             +  S P S S +  + +     +  P S S T
Sbjct: 81   ATIVRQAKEGERPATPPEARPDEGFVRPSLPSHPRSRSASVSNSKDGDRPSDLPPSPSKT 140

Query: 2047 TNNPKSESTTT--------NNPAS---------------------ESITSSSPASESTTT 2077
             + P+  S T         N P S                     +S  S      ++  
Sbjct: 141  MD-PRRWSPTKATWLESALNKPESPKHKPQPPQQPEWKKDLSRLRQSRASVDLGRTNSFK 199

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSE 2137
                     T  P S S + S         SS +SE T    P  E+++++ +     S 
Sbjct: 200  EVTPVGLMRTPPPGSHSKSPSKSGIPD-LPSSRDSEKTKPEKPQQETSSMDTE---KSSA 255

Query: 2138 KLSANEDPEEFPNEDVFEHTFAEIPNI-DHSNQTDEAIPE 2176
                     + P +     T  E     + S +  E    
Sbjct: 256  PKPRETLDPKSPEKAPPIDTTEEELKSPEASPKESEEASA 295



 Score = 35.6 bits (82), Expect = 0.34
 Identities = 49/256 (19%), Positives = 77/256 (30%), Gaps = 19/256 (7%)

Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
              S     +S  NS  + + T +     ++  N  E  S   SS  SE+T          
Sbjct: 35   GLSRGNSFLSNRNSDAAPSGTDSLSGRPASRLNR-EPSSRPGSSH-SEATIVRQAKEGER 92

Query: 1936 TTSSPESE-------STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
              + PE+         +  S P S S + S+       +  P S S T        T  +
Sbjct: 93   PATPPEARPDEGFVRPSLPSHPRSRSASVSNSKDGDRPSDLPPSPSKTMDPRRWSPTKAT 152

Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN 2048
             L S      SP+ +      P  E     S + +S  ++     ++           T 
Sbjct: 153  WLESALNKPESPKHKPQPPQQP--EWKKDLSRLRQSRASVDLGRTNSFKEVTPVGLMRTP 210

Query: 2049 NPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS 2108
             P S S +          S  P   S+  S            +S  T   S A +   T 
Sbjct: 211  PPGSHSKS-------PSKSGIPDLPSSRDSEKTKPEKPQQETSSMDT-EKSSAPKPRETL 262

Query: 2109 SPESESTTTSSPASES 2124
             P+S         +E 
Sbjct: 263  DPKSPEKAPPIDTTEE 278


>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
          Length = 328

 Score = 49.6 bits (118), Expect = 1e-05
 Identities = 40/243 (16%), Positives = 86/243 (35%), Gaps = 39/243 (16%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            P + S+  ++   +S   +   ++      P   +  T+S E  +  T   VS    +S+
Sbjct: 23   PSTSSSDQTASGEKSIDLAGNATDQANGVQP---APGTTSAEQTAGNTQQDVSLPPISST 79

Query: 1970 PESESTTTSSPESESTT------------------TSSLVSESTTTSSPESESTTTISPV 2011
            P ++  T  + + +                      +++   ST  + P      T++PV
Sbjct: 80   P-TQGQTPVATDGQQRVEVQGDLNNALTQPQNQQQLNNVAVNSTLPTEP-----ATVAPV 133

Query: 2012 ----SESTTTSSPVSESTTTISP-------ESESTTTSSPASESTTTNNPK-SESTTTNN 2059
                +   T  +  +E   T  P       E +    ++          PK +E      
Sbjct: 134  RNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVA 193

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
                   +S+PA + T T++P   ++   + A+ +    +  +  +  S+P S  T   S
Sbjct: 194  STKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTLQLS 253

Query: 2120 PAS 2122
             +S
Sbjct: 254  SSS 256



 Score = 42.2 bits (99), Expect = 0.002
 Identities = 31/218 (14%), Positives = 70/218 (32%), Gaps = 17/218 (7%)

Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
            +P + S+  ++   +S   +   ++      P   +T+       +    SL   S+T +
Sbjct: 22   APSTSSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAEQTAGNTQQDVSLPPISSTPT 81

Query: 1999 SPESESTT--------------TISPVSESTTTSSPVSESTTTISPESESTTTSSPASES 2044
              ++   T               ++        ++    ST    P + +   +  AS  
Sbjct: 82   QGQTPVATDGQQRVEVQGDLNNALTQPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASRQ 141

Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
            T          TT  PA +         ++T  + P  +    +   +E     +     
Sbjct: 142  TAKTQTAERPATTR-PARKQAVIEPKKPQATAKTEP--KPVAQTPKRTEPAAPVASTKAP 198

Query: 2105 TTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
              TS+P  + T T++P   ++  +         K + N
Sbjct: 199  AATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGN 236



 Score = 36.1 bits (83), Expect = 0.22
 Identities = 28/154 (18%), Positives = 52/154 (33%), Gaps = 17/154 (11%)

Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP------ESESTTTSSLV 1961
            NN    ST  + P + +   +   S  T  +       TT         E +    ++  
Sbjct: 115  NNVAVNSTLPTEPATVAPVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQATAKT 174

Query: 1962 SESTTTSSPE-SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
                   +P+ +E     +       TS+   + T T++P      T SP +++T T + 
Sbjct: 175  EPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAP----VQTASP-AQTTATPAA 229

Query: 2021 VSESTTTIS-----PESESTTTSSPASESTTTNN 2049
              ++   +      P S  T   S +S     N 
Sbjct: 230  GGKTAGNVGSLKSAPSSHYTLQLSSSSNYDNLNG 263



 Score = 35.7 bits (82), Expect = 0.28
 Identities = 16/102 (15%), Positives = 35/102 (34%), Gaps = 8/102 (7%)

Query: 1892 SENTTTNSPESEST--------TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1943
            +E   T  P  +           T   E +    +   +E     +       TS+P  +
Sbjct: 148  AERPATTRPARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPK 207

Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
             T T++P   ++   +  + +    +  +  +  S+P S  T
Sbjct: 208  ETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYT 249



 Score = 33.0 bits (75), Expect = 1.9
 Identities = 14/87 (16%), Positives = 30/87 (34%)

Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
             T   E +        +E     +       TS+   + T T++P   ++   +  + + 
Sbjct: 170  ATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAA 229

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPES 1982
               +  +  +  S+P S  T   S  S
Sbjct: 230  GGKTAGNVGSLKSAPSSHYTLQLSSSS 256


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 50.7 bits (121), Expect = 1e-05
 Identities = 34/242 (14%), Positives = 68/242 (28%), Gaps = 23/242 (9%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
                     T   PE      +         +  +  +   SSP       ++  +  + 
Sbjct: 2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695

Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES-- 2014
            TS           P    T   +P +     S+        ++ ++      +P   +  
Sbjct: 2696 TSL-------ADPPPPPPTPEPAPHA---LVSATPLPPGPAAARQASPALPAAPAPPAVP 2745

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
               ++P   +     P     TT+ P + +               PA  S++ S  +  S
Sbjct: 2746 AGPATPGGPARPARPP-----TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800

Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASES--TTTSSPESESTTTSSPASESTTIEEQGV 2132
                 PA       +PA+     +SPA      T++ P +            +      V
Sbjct: 2801 P--WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP--PGPPPPSLPLGGSV 2856

Query: 2133 SP 2134
            +P
Sbjct: 2857 AP 2858



 Score = 44.9 bits (106), Expect = 8e-04
 Identities = 32/287 (11%), Positives = 67/287 (23%), Gaps = 33/287 (11%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
                +P +       P       + P + +   +     +     P   +    +  SES
Sbjct: 2736 PAAPAPPAVPAGPATPGGP-ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSES 2794

Query: 1955 TTTSSLVSE----STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI-- 2008
              +     +         +P +     +SP       +S    +          +  +  
Sbjct: 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854

Query: 2009 -----SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
                   V     + SP ++      P        + +  + +   P  +      P + 
Sbjct: 2855 SVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAP 2914

Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT----------------- 2106
                  P         P         P    TT  + A E +                  
Sbjct: 2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV 2974

Query: 2107 ----TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
                   P       +S     T      VS  +  L+ +E+ +  P
Sbjct: 2975 PRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021



 Score = 42.6 bits (100), Expect = 0.004
 Identities = 26/254 (10%), Positives = 63/254 (24%), Gaps = 13/254 (5%)

Query: 1910 PESESTTTSSPESESTTTSSLVSES---TTTSSPESESTTTSSPESESTTTSSLVSESTT 1966
            PE       S        ++    S       +P +     ++P   +       +    
Sbjct: 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP 2767

Query: 1967 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI---SPVSESTTTSSPVSE 2023
              +P +               +SL     +  SP   +       +P +     +SP   
Sbjct: 2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP 2827

Query: 2024 STTTISPESESTTTSSPASESTTT-------NNPKSESTTTNNPASESITSSSPASESTT 2076
                 S +  +          +                  + +PA++    + P      
Sbjct: 2828 LPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA 2887

Query: 2077 TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHS 2136
              + +  + + + P  +      P +       P+        P        +  ++P +
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT 2947

Query: 2137 EKLSANEDPEEFPN 2150
            +   A E     P 
Sbjct: 2948 DPAGAGEPSGAVPQ 2961



 Score = 41.8 bits (98), Expect = 0.006
 Identities = 28/231 (12%), Positives = 58/231 (25%), Gaps = 11/231 (4%)

Query: 1910 PESESTTTSSPES-ESTTTSSLVSESTTTSSPESESTTTSSPESESTTT---SSLVSEST 1965
            P    T   +P +  S T       +   +SP   +            T    +  +   
Sbjct: 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761

Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
            TT+ P + +   +         +     S + S     S    +    +    +      
Sbjct: 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
             + +      T++ P +       P   S         S+           + SPA++  
Sbjct: 2822 ASPAGPLPPPTSAQPTAPPPPPG-PPPPSLPLGG----SVAPGGDVRRRPPSRSPAAKPA 2876

Query: 2086 TTSSPASESTTTS--SPASESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
              + P          S ++ES      + E               +    P
Sbjct: 2877 APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927



 Score = 40.3 bits (94), Expect = 0.019
 Identities = 28/214 (13%), Positives = 57/214 (26%), Gaps = 13/214 (6%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPESESTTTSSLVSEST 1995
            T  P   +  +++P       +   S +     +P +     ++P   +       +   
Sbjct: 2707 TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPAR-----PARPP 2761

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
            TT+ P + +             + P   S +       S    +    +           
Sbjct: 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
             +        TS+ P +       P       S P   S            + SP ++  
Sbjct: 2822 ASPAGPLPPPTSAQPTA-----PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPA 2876

Query: 2116 TTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
              + P      +    VS  +E  +   D  E P
Sbjct: 2877 APARPPVRR--LARPAVSRSTESFALPPDQPERP 2908


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
            family consists of several Theileria P67 surface
            antigens. A stage specific surface antigen of Theileria
            parva, p67, is the basis for the development of an
            anti-sporozoite vaccine for the control of East Coast
            fever (ECF) in cattle. The antigen has been shown to
            contain five distinct linear peptide sequences recognised
            by sporozoite-neutralising murine monoclonal antibodies.
          Length = 727

 Score = 50.1 bits (119), Expect = 2e-05
 Identities = 57/284 (20%), Positives = 91/284 (32%), Gaps = 25/284 (8%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPES-----ESTTT 1927
            T    S++T V  +  S   ++ T  +P SE   T + +   +  S  +      + T  
Sbjct: 52   TVGALSKATKVWKSAVSSSDDSKTVPTPVSEPNITRSFQEPVSQESEVQDNTEQNQDTKG 111

Query: 1928 SSLVSESTTTSSPESESTTTSSP-ESESTTTSSLVSESTT-TSSPESESTTTS----SPE 1981
            S   SE     S E ++ +TSS     S  T   VS S+  T+S    +T  S       
Sbjct: 112  SKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQSQTGLGAS 171

Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
                     VS+S     P         P          V      +SP           
Sbjct: 172  GSHAQQDPAVSQSGVVGVPGLGVPGVGVPGGGGAGALPGVGVGRAGVSPGVGVGGLGGVP 231

Query: 2042 SESTTTNNPKSESTTTNNPASESITSS--------------SPASESTTTSSPASESTTT 2087
                  +N   E  T ++   +                   S +S STT  S ++ +TT 
Sbjct: 232  GVGILASNTSREGQTQDDQERDGDGRVIEPGVGLPGVRVGDSTSSPSTTRPSGSTTTTTP 291

Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
            +S    +      +S +  T S +S S    SP +      + G
Sbjct: 292  ASSGPSAPGGPGSSSRNAVTRSTDSISGPIPSPGAPRAITGQMG 335



 Score = 43.1 bits (101), Expect = 0.002
 Identities = 39/241 (16%), Positives = 72/241 (29%), Gaps = 21/241 (8%)

Query: 1913 ESTTTSSPESESTTTSSLVSESTTTSSPES-----ESTTTSSPESESTTTSSLVSESTTT 1967
            +S T  +P SE   T S     +  S  +      + T  S  +SE     S   ++ +T
Sbjct: 72   DSKTVPTPVSEPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKST 131

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
            SS + + +  + P   +++ S+       T     +S T +            VS+S   
Sbjct: 132  SSKDGKGSKKTQPGVSTSSGSTTSGTDLNTK----QSQTGLGASGSHAQQDPAVSQSGVV 187

Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
              P         P                  +P                 S+ + E  T 
Sbjct: 188  GVPGLGVPGVGVPGGGGAGALPGVGVGRAGVSPGVGVGGLGGVPGVGILASNTSREGQTQ 247

Query: 2088 SSPASESTTTSS------------PASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
                 +                   ++ S +T+ P   +TTT+  +S  +     G S  
Sbjct: 248  DDQERDGDGRVIEPGVGLPGVRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSR 307

Query: 2136 S 2136
            +
Sbjct: 308  N 308



 Score = 43.1 bits (101), Expect = 0.002
 Identities = 44/193 (22%), Positives = 76/193 (39%), Gaps = 28/193 (14%)

Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
            P  ES  TS P       S LV+  +  + P  +   T    S++T     V +S  +SS
Sbjct: 22   PAGESPRTSKP-------SPLVTLESAITQPSKDPFKTVGALSKATK----VWKSAVSSS 70

Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
               +S T  +PVSE   T S        +S ESE              N  +++ T  + 
Sbjct: 71   --DDSKTVPTPVSEPNITRS----FQEPVSQESE-----------VQDNTEQNQDTKGSK 113

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
              SE     S   ++ +TSS   + +  + P   +++ S+ +     T   ++    + S
Sbjct: 114  TDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQSQTGLGASGS 173

Query: 2120 PASESTTIEEQGV 2132
             A +   + + GV
Sbjct: 174  HAQQDPAVSQSGV 186



 Score = 41.2 bits (96), Expect = 0.008
 Identities = 52/295 (17%), Positives = 86/295 (29%), Gaps = 35/295 (11%)

Query: 1898 NSPESESTTTNNPESESTTTSS---PESESTTTSSLVSEST----TTSSPESESTTTSSP 1950
              P  ES  T+ P    T  S+   P  +   T   +S++T    +  S   +S T  +P
Sbjct: 20   KMPAGESPRTSKPSPLVTLESAITQPSKDPFKTVGALSKATKVWKSAVSSSDDSKTVPTP 79

Query: 1951 ESESTTTSSLVSESTTTSSPES-----ESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
             SE   T S     +  S  +      + T  S  +SE     S   ++ +TSS + + +
Sbjct: 80   VSEPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGS 139

Query: 2006 TTISPVSESTTTSSPVSESTTT------ISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
                P   +++ S+       T      +             S+S     P         
Sbjct: 140  KKTQPGVSTSSGSTTSGTDLNTKQSQTGLGASGSHAQQDPAVSQSGVVGVPGLGVPGVGV 199

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE------ 2113
            P      +           SP                 S+ + E  T    E +      
Sbjct: 200  PGGGGAGALPGVGVGRAGVSPGVGVGGLGGVPGVGILASNTSREGQTQDDQERDGDGRVI 259

Query: 2114 -----------STTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHT 2157
                         +TSSP++   +      +P S   SA   P       V   T
Sbjct: 260  EPGVGLPGVRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRST 314


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 49.4 bits (117), Expect = 2e-05
 Identities = 32/168 (19%), Positives = 64/168 (38%), Gaps = 8/168 (4%)

Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
            TT S+ +E   ++  +   T     ++    T+   S +T   +   E T     A  + 
Sbjct: 24   TTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSATTQAANTRQERTPKLEKAPNTN 83

Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
                  S+    + P  E   S + ++        +  +T +++P ++ TT   P S +T
Sbjct: 84   EEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVTT---PPSTNT 140

Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA--NEDPEEFPNE 2151
                  ++S T  SP  +    +   ++P  E L A   +   EF  +
Sbjct: 141  PQPMQSTKSDTPQSPTIKQAQTD---MTPKYEDLRAYYTKPSFEFEKQ 185



 Score = 41.7 bits (97), Expect = 0.004
 Identities = 31/175 (17%), Positives = 70/175 (40%), Gaps = 26/175 (14%)

Query: 1851 ISMLAATAVAISVIDNYSEIIFTTNNNSES-----TVVMSTLN---------SLLSENTT 1896
            +  +A T++A+ ++   +  + T +  +E         + TL          +  + + T
Sbjct: 3    MRTIAKTSLALGLLTTGAITVTTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSAT 62

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
            T +  +    T   E       +P +    TS+  S+    S P+ E   + +  +    
Sbjct: 63   TQAANTRQERTPKLE------KAPNTNEEKTSA--SKIEKISQPKQEEQKSLNISATPAP 114

Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES-ESTTTISP 2010
                   +T +++P+++ TT   P S +T      ++S T  SP   ++ T ++P
Sbjct: 115  KQEQSQTTTESTTPKTKVTT---PPSTNTPQPMQSTKSDTPQSPTIKQAQTDMTP 166



 Score = 40.5 bits (94), Expect = 0.008
 Identities = 29/144 (20%), Positives = 54/144 (37%), Gaps = 7/144 (4%)

Query: 1946 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
            TT S ++E   ++ +    T  +   +    T+   + S TT +  +    T   E    
Sbjct: 24   TTQSVKAEKIQSTKVDKVPTLKAERLAMINITAG--ANSATTQAANTRQERTPKLEKAPN 81

Query: 2006 TTISPVSES--TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
            T     S S     S P  E   +++  +        +  +T +  PK++ TT   P S 
Sbjct: 82   TNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVTT---PPST 138

Query: 2064 SITSSSPASESTTTSSPASESTTT 2087
            +      +++S T  SP  +   T
Sbjct: 139  NTPQPMQSTKSDTPQSPTIKQAQT 162



 Score = 40.5 bits (94), Expect = 0.010
 Identities = 29/151 (19%), Positives = 58/151 (38%), Gaps = 18/151 (11%)

Query: 1896 TTNSPESESTTTNNPESESTTTS--------SPESESTTTSSLVSESTTTSSPESESTTT 1947
            TT S ++E   +   +   T  +        +  + S TT +  +    T   E    T 
Sbjct: 24   TTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSATTQAANTRQERTPKLEKAPNT- 82

Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT-----TSSPES 2002
                +E  T++S + + +     E +S   S+  +     S   +ESTT     T+ P +
Sbjct: 83   ----NEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVTTPPST 138

Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESE 2033
             +   +      T  S  + ++ T ++P+ E
Sbjct: 139  NTPQPMQSTKSDTPQSPTIKQAQTDMTPKYE 169



 Score = 37.8 bits (87), Expect = 0.058
 Identities = 25/151 (16%), Positives = 51/151 (33%), Gaps = 8/151 (5%)

Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
            TT S+ +E   ++  +   T  +   +    T+   + S TT +  +    T   E    
Sbjct: 24   TTQSVKAEKIQSTKVDKVPTLKAERLAMINITAG--ANSATTQAANTRQERTPKLEKAPN 81

Query: 1986 TTSSLVSES---TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
            T     S S     +   + E  +     + +          TTT S   ++  T+ P++
Sbjct: 82   TNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQS---QTTTESTTPKTKVTTPPST 138

Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPASE 2073
             +          T  +    ++ T  +P  E
Sbjct: 139  NTPQPMQSTKSDTPQSPTIKQAQTDMTPKYE 169


>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
          Length = 348

 Score = 48.9 bits (116), Expect = 2e-05
 Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 4/60 (6%)

Query: 2344 LYSCEGSINPRYIHSVKIIGWGKSSQNEPYWLCTNSYNQGWGEQGLFKIRRGVNMCSIED 2403
            L SC G    +  H V ++G+  + +  PYW+  NS+ + WGE+G  ++  GVN C +  
Sbjct: 278  LTSCIGE---QLNHGVLLVGYNMTGE-VPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTG 333



 Score = 32.0 bits (72), Expect = 4.4
 Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 5/32 (15%)

Query: 2173 AIPETFDAREEWPQCKDVIGKVWDQGACQSCW 2204
            A+P+  D RE     K  +  V +QGAC SCW
Sbjct: 125  AVPDAVDWRE-----KGAVTPVKNQGACGSCW 151


>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein.  SOG2 proteins in
            Saccharomyces cerevisiae are involved in cell separation
            and cytokinesis.
          Length = 419

 Score = 49.0 bits (117), Expect = 3e-05
 Identities = 39/233 (16%), Positives = 80/233 (34%), Gaps = 22/233 (9%)

Query: 1861 ISVIDNYSEII---------FTTNNNSE--STVVMSTLNSLLSENTTTNSPESESTTTNN 1909
            ++ +  +  II         F  N +     T+++    S++      +S          
Sbjct: 98   LTCVSAFRHIISLLRKNLDAFFDNGDVRYIRTLLLMLYGSIMELRNAWSSLGPPLQHRKR 157

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
                ++ +S     +  +  L   S T +     S++  S  + +T  S    + TT   
Sbjct: 158  DAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTTLESPSNLQVTTDVP 217

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
            P   + T+ S    S+   S++S   T  S ES  +T        T+ SS ++  +    
Sbjct: 218  PPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRST-------PTSGSSSINPVSGLDE 270

Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS 2082
             E +           T T+     +       +E  + S  AS ++   +P+ 
Sbjct: 271  AEEDRIDEQLFLKLRTATDM----ALRVLPQLTEQFSKSLIASTTSRNITPSL 319



 Score = 38.2 bits (89), Expect = 0.058
 Identities = 30/152 (19%), Positives = 50/152 (32%), Gaps = 20/152 (13%)

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
              SSL             ++ +S     +  +  L   S T +     S++  S  + +T
Sbjct: 144  AWSSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTT 203

Query: 2016 TTSSPVSESTTTI-SPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
              S    + TT +  P S  T+ SS  S S   +   S +T              P S  
Sbjct: 204  LESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLAT--------------PRSGE 249

Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTT 2106
            +  S+P     T+ S +    +    A E   
Sbjct: 250  SFRSTP-----TSGSSSINPVSGLDEAEEDRI 276



 Score = 37.0 bits (86), Expect = 0.12
 Identities = 29/155 (18%), Positives = 58/155 (37%), Gaps = 17/155 (10%)

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
            + +++ P  +     +  +  ++ I+  +  +    P S + T     S S  + +  + 
Sbjct: 144  AWSSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPT- 202

Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
            ++ S S    +T    P S  T+ SS  S S   S  +S +T   S ES  +T +S    
Sbjct: 203  TLESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATP-RSGESFRSTPTS---- 257

Query: 2124 STTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTF 2158
                        S  ++     +E   + + E  F
Sbjct: 258  -----------GSSSINPVSGLDEAEEDRIDEQLF 281


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase inhibitor.
          Length = 670

 Score = 49.3 bits (117), Expect = 3e-05
 Identities = 34/112 (30%), Positives = 51/112 (45%), Gaps = 5/112 (4%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVS-ESTTTSSPESESTTTSSPESEST 1955
              +P S ++T     + S TT S +S ST  +   S  +    SP +  +   SP + S 
Sbjct: 563  AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPST-SP 621

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
              S L S STT SSPES     S+    + +  S +  ++T SS    S +T
Sbjct: 622  PASHLGSPSTTPSSPESSIKVASTE---TASPESSIKVASTESSVSMVSMST 670



 Score = 48.9 bits (116), Expect = 4e-05
 Identities = 36/122 (29%), Positives = 52/122 (42%), Gaps = 17/122 (13%)

Query: 1907 TNNPESESTTTSSPESESTTTSSLVSESTTTS---SPESESTTTSSPESESTTTSSLVSE 1963
              NP S ++T +   + S TT S  S ST  +   SP +     S P + S   S     
Sbjct: 563  AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPSKIVSP---- 617

Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
              +TS P S         S STT SS  S     S+  +   ++I  V+ + ++ S VS 
Sbjct: 618  --STSPPAS------HLGSPSTTPSSPESSIKVASTETASPESSIK-VASTESSVSMVSM 668

Query: 2024 ST 2025
            ST
Sbjct: 669  ST 670



 Score = 48.5 bits (115), Expect = 4e-05
 Identities = 33/104 (31%), Positives = 48/104 (46%), Gaps = 2/104 (1%)

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
            + ST T S  S +TT SS    +    S    +    SP +  +   SP + S   S L 
Sbjct: 569  TNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPST-SPPASHLG 627

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESEST 2035
            S STT SSPES      +  + S  +S  V+ + +++S  S ST
Sbjct: 628  SPSTTPSSPESSIKVASTE-TASPESSIKVASTESSVSMVSMST 670



 Score = 46.2 bits (109), Expect = 2e-04
 Identities = 35/114 (30%), Positives = 47/114 (41%), Gaps = 12/114 (10%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTS---SPESESTTTSSPESESTTTSSLVSE 1993
              +P S ++T +   + S TT S  S ST  +   SP +     S P + S   S   S 
Sbjct: 563  AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPSKIVSP--ST 619

Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
            S   S   S STT  SP S     S+       T SPES     S+ +S S  +
Sbjct: 620  SPPASHLGSPSTTPSSPESSIKVASTE------TASPESSIKVASTESSVSMVS 667



 Score = 45.9 bits (108), Expect = 3e-04
 Identities = 32/99 (32%), Positives = 44/99 (44%), Gaps = 3/99 (3%)

Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPAS-ESTTTSSPASESTTTSSPASEST 2105
              NP S ++T    A+ S T+ S  S ST  +   S  +    SP +  +   SP S S 
Sbjct: 563  AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSP-STSP 621

Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSE-KLSANE 2143
              S   S STT SSP S       +  SP S  K+++ E
Sbjct: 622  PASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTE 660



 Score = 45.5 bits (107), Expect = 3e-04
 Identities = 39/125 (31%), Positives = 52/125 (41%), Gaps = 14/125 (11%)

Query: 1867 YSEIIFTTNNNSESTVVMSTLNSLLSENTT--TNSPESESTTTNNPESESTTTSSPESES 1924
            Y   +F  N  S ++       S  S NTT  ++SP +    + +P +     S P + S
Sbjct: 557  YIPGLFAGNPGSTNSTPTG---SAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPS 612

Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
               S   S S   S   S STT SSPES           ST T+SPES     S+  S S
Sbjct: 613  KIVSP--STSPPASHLGSPSTTPSSPESSIK------VASTETASPESSIKVASTESSVS 664

Query: 1985 TTTSS 1989
              + S
Sbjct: 665  MVSMS 669



 Score = 45.5 bits (107), Expect = 4e-04
 Identities = 36/106 (33%), Positives = 48/106 (45%), Gaps = 16/106 (15%)

Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPAS--ESITSSSPASEST----TTSSPA- 2081
            +P S ++T +  A+ S TT +  S ST      S       S PA+ S     +TS PA 
Sbjct: 565  NPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPAS 624

Query: 2082 ---SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
               S STT SSP S     S+       T+SPES     S+ +S S
Sbjct: 625  HLGSPSTTPSSPESSIKVASTE------TASPESSIKVASTESSVS 664



 Score = 43.5 bits (102), Expect = 0.002
 Identities = 31/111 (27%), Positives = 45/111 (40%), Gaps = 6/111 (5%)

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
            +P S ++T +   + S TT S +S ST  +   S           + +     S S  +S
Sbjct: 565  NPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPAS 624

Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
               S STT SSP S     S+       T+SP S     S+  S S  + S
Sbjct: 625  HLGSPSTTPSSPESSIKVASTE------TASPESSIKVASTESSVSMVSMS 669



 Score = 43.5 bits (102), Expect = 0.002
 Identities = 35/125 (28%), Positives = 57/125 (45%), Gaps = 20/125 (16%)

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVS--ESTTTSSPVSESTTTISPESES 2034
              +P S ++T +   + S TT S +S ST  ++P +   +    SP +  +  +SP    
Sbjct: 563  AGNPGSTNSTPTGSAASSNTTFSSDSPSTV-VAPSTSPPAGHLGSPPATPSKIVSPS--- 618

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
              TS PAS      +  S STT ++P S    +S+       T+SP S     S+ +S S
Sbjct: 619  --TSPPAS------HLGSPSTTPSSPESSIKVASTE------TASPESSIKVASTESSVS 664

Query: 2095 TTTSS 2099
              + S
Sbjct: 665  MVSMS 669



 Score = 40.1 bits (93), Expect = 0.017
 Identities = 36/118 (30%), Positives = 49/118 (41%), Gaps = 17/118 (14%)

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
            + ST T S  S +TT SS    +    S    +    SP +  +  +SP     +TS P 
Sbjct: 569  TNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSP-----STSPPA 623

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
            S         S STT SSP S        K  ST T +P S    +S+ +S S  + S
Sbjct: 624  S------HLGSPSTTPSSPESSI------KVASTETASPESSIKVASTESSVSMVSMS 669



 Score = 33.1 bits (75), Expect = 2.1
 Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 3/67 (4%)

Query: 2081 ASESTTTSSPASESTTTS--SPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEK 2138
            ++ ST T S AS +TT S  SP++    ++SP +     S PA+ S  +      P S  
Sbjct: 568  STNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPSKIVSPSTSPPASHL 626

Query: 2139 LSANEDP 2145
             S +  P
Sbjct: 627  GSPSTTP 633


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
            2'-phosphodiesterase/3'-nucleotidase precursor protein;
            Reviewed.
          Length = 814

 Score = 49.5 bits (118), Expect = 3e-05
 Identities = 28/114 (24%), Positives = 49/114 (42%), Gaps = 7/114 (6%)

Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
             S+S   ++    + +    A          + ST            S    E+  T +P
Sbjct: 6    FSKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTTP---VESDATEEADNTETP 62

Query: 2081 ASESTTTSSPASESTT-TSSPASESTTTSSPESESTTTSSPASESTT-IEEQGV 2132
             + +T   +P+S  T  TS P SE+T T++  SE+ T +  A+E++  +E Q V
Sbjct: 63   VAATTAAEAPSSSETAETSDPTSEATDTTT--SEARTVTPAATETSKPVEGQTV 114



 Score = 49.1 bits (117), Expect = 3e-05
 Identities = 21/105 (20%), Positives = 40/105 (38%), Gaps = 4/105 (3%)

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
            S+S    +L   + +           ++  + ST            S    E+  T +PV
Sbjct: 7    SKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTT---PVESDATEEADNTETPV 63

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
            + +T   +P S S T  + +  S  T +  SE+ T     +E++ 
Sbjct: 64   AATTAAEAP-SSSETAETSDPTSEATDTTTSEARTVTPAATETSK 107



 Score = 47.9 bits (114), Expect = 7e-05
 Identities = 18/93 (19%), Positives = 34/93 (36%), Gaps = 4/93 (4%)

Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
            + +   L       ++  + ST       ++T   S  +E          +TT +   S 
Sbjct: 19   TASNPKLAQAEEIVTTTPATSTEA----EQTTPVESDATEEADNTETPVAATTAAEAPSS 74

Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTT 2076
            S T       S  T+   SE+ T +  A+E++ 
Sbjct: 75   SETAETSDPTSEATDTTTSEARTVTPAATETSK 107



 Score = 46.8 bits (111), Expect = 2e-04
 Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 7/86 (8%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESE-STTTSSLVSESTTTSSPESESTTTSSP 1950
            +E   T +P + +         +T   S  +E +  T + V+ +T   +P S S T  + 
Sbjct: 28   AEEIVTTTPATSTEAEQ-----TTPVESDATEEADNTETPVAATTAAEAP-SSSETAETS 81

Query: 1951 ESESTTTSSLVSESTTTSSPESESTT 1976
            +  S  T +  SE+ T +   +E++ 
Sbjct: 82   DPTSEATDTTTSEARTVTPAATETSK 107



 Score = 46.4 bits (110), Expect = 2e-04
 Identities = 19/94 (20%), Positives = 40/94 (42%), Gaps = 1/94 (1%)

Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
              T+S    +     + +   T++  E  +   S    E+  T + V+ +T   +P S  
Sbjct: 17   LLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSE 76

Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
            T  +S  + S  T +  SE+ T +   +E++  +
Sbjct: 77   TAETSDPT-SEATDTTTSEARTVTPAATETSKPV 109



 Score = 46.0 bits (109), Expect = 3e-04
 Identities = 29/97 (29%), Positives = 40/97 (41%), Gaps = 12/97 (12%)

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT-TSS 2039
              E  TT+   S     ++P     T      E+  T +PV+ +T   +P S  T  TS 
Sbjct: 28   AEEIVTTTPATSTEAEQTTPVESDAT-----EEADNTETPVAATTAAEAPSSSETAETSD 82

Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTT 2076
            P SE+T T    SE+ T    A    T +S   E  T
Sbjct: 83   PTSEATDTTT--SEARTVTPAA----TETSKPVEGQT 113



 Score = 45.2 bits (107), Expect = 5e-04
 Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 10/102 (9%)

Query: 2016 TTSSPV---SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
            T S+P    +E   T +P + +    +   ES  T   ++++T T   A+ +  + S  S
Sbjct: 19   TASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATE--EADNTETPVAATTAAEAPSS-S 75

Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
            E+  TS P SE+T T++    S   +   + + T+   E ++
Sbjct: 76   ETAETSDPTSEATDTTT----SEARTVTPAATETSKPVEGQT 113



 Score = 45.2 bits (107), Expect = 5e-04
 Identities = 17/87 (19%), Positives = 32/87 (36%), Gaps = 8/87 (9%)

Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
              E  TT    S     ++PV    T      E+  T +P + +T    P S  T     
Sbjct: 28   AEEIVTTTPATSTEAEQTTPVESDAT-----EEADNTETPVAATTAAEAPSSSETAET-- 80

Query: 2061 ASESITSSSPASESTTTSSPASESTTT 2087
             S+  + ++  + S   +   + + T+
Sbjct: 81   -SDPTSEATDTTTSEARTVTPAATETS 106



 Score = 44.8 bits (106), Expect = 6e-04
 Identities = 25/110 (22%), Positives = 41/110 (37%), Gaps = 9/110 (8%)

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
            S+S    +L   + +           ++  + ST            S    E+  T +P 
Sbjct: 7    SKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTT---PVESDATEEADNTETPV 63

Query: 1982 SEST-TTSSLVSESTTTSSPESESTTTI-----SPVSESTTTSSPVSEST 2025
            + +T   +   SE+  TS P SE+T T      +    +T TS PV   T
Sbjct: 64   AATTAAEAPSSSETAETSDPTSEATDTTTSEARTVTPAATETSKPVEGQT 113



 Score = 44.5 bits (105), Expect = 9e-04
 Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 1/92 (1%)

Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
              T+S    +     + +   T++  E  +   S    E+  T + V+ +T   +P S  
Sbjct: 17   LLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSE 76

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTT 2036
            T   S  + S  T +  SE+ T     +E++ 
Sbjct: 77   TAETSDPT-SEATDTTTSEARTVTPAATETSK 107



 Score = 43.7 bits (103), Expect = 0.001
 Identities = 24/89 (26%), Positives = 38/89 (42%), Gaps = 9/89 (10%)

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASE---STTTSSPASESTTTSSPASESTTTS 2108
            +E   T  PA+ +    +     T   S A+E   +T T   A+ +    S  SE+  TS
Sbjct: 28   AEEIVTTTPATSTEAEQT-----TPVESDATEEADNTETPVAATTAAEAPSS-SETAETS 81

Query: 2109 SPESESTTTSSPASESTTIEEQGVSPHSE 2137
             P SE+T T++  + + T      S   E
Sbjct: 82   DPTSEATDTTTSEARTVTPAATETSKPVE 110



 Score = 43.7 bits (103), Expect = 0.002
 Identities = 22/99 (22%), Positives = 41/99 (41%), Gaps = 3/99 (3%)

Query: 1849 LLISMLAATAVAISVIDNYSEIIFTTNNNS-ESTVVMSTLNSLLSENTTTNSPESESTTT 1907
            + +++   TA     +    EI+ TT   S E+       +    E   T +P + +T  
Sbjct: 11   VALTLALLTASN-PKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAA 69

Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1946
              P S  T  +S  + S  T +  SE+ T +   +E++ 
Sbjct: 70   EAPSSSETAETSDPT-SEATDTTTSEARTVTPAATETSK 107



 Score = 41.0 bits (96), Expect = 0.009
 Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 6/64 (9%)

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASE-STTTSSPESESTTTSSPASESTTIEE 2129
             +E   T++PA+ +    +     T   S A+E +  T +P + +T   +P+S  T    
Sbjct: 27   QAEEIVTTTPATSTEAEQT-----TPVESDATEEADNTETPVAATTAAEAPSSSETAETS 81

Query: 2130 QGVS 2133
               S
Sbjct: 82   DPTS 85



 Score = 40.2 bits (94), Expect = 0.016
 Identities = 18/82 (21%), Positives = 35/82 (42%), Gaps = 6/82 (7%)

Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSE-STTTSSPESESTTTSSPESESTTTSSLVSES 1964
                  + ST       ++T   S  +E +  T +P + +T   +P S  T  +S  + S
Sbjct: 31   IVTTTPATSTEAE----QTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPT-S 85

Query: 1965 TTTSSPESESTTTSSPESESTT 1986
              T +  SE+ T +   +E++ 
Sbjct: 86   EATDTTTSEARTVTPAATETSK 107



 Score = 36.4 bits (84), Expect = 0.27
 Identities = 14/80 (17%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
            +E  V  +   S  +E TT    ++     N     + TT++    S++ ++  S+ T+ 
Sbjct: 28   AEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAP-SSSETAETSDPTSE 86

Query: 1938 SSPESESTTTSSPESESTTT 1957
            ++  + S   +   + + T+
Sbjct: 87   ATDTTTSEARTVTPAATETS 106



 Score = 33.7 bits (77), Expect = 1.5
 Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 11/79 (13%)

Query: 2076 TTSSP---ASESTTTSSPASESTTTSSPASESTTTSSPESE-STTTSSPASESTTIEEQG 2131
            T S+P    +E   T++PA+ +    +     T   S  +E +  T +P + +T  E   
Sbjct: 19   TASNPKLAQAEEIVTTTPATSTEAEQT-----TPVESDATEEADNTETPVAATTAAEAP- 72

Query: 2132 VSPHSEKLSANEDPEEFPN 2150
             S      +++   E    
Sbjct: 73   -SSSETAETSDPTSEATDT 90


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 48.8 bits (116), Expect = 4e-05
 Identities = 54/233 (23%), Positives = 88/233 (37%), Gaps = 68/233 (29%)

Query: 415 TDGCHIFTCSTDQTLAVWDLEKGQRIK--------------KMKGHSTFV-----NSCDP 455
            D C I  CS+      W++E G  I               K+KGH++ +     N C  
Sbjct: 29  IDSCGI-ACSSGFVAVPWEVEGGGLIGAIRLENQMRKPPVIKLKGHTSSILDLQFNPC-- 85

Query: 456 VRRGQLLIASGSDDCTVKVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVLTGGIDNDIKM 515
                 ++ASGS+D T++VW+    +++V      Q           C+L          
Sbjct: 86  ---FSEILASGSEDLTIRVWEIPHNDESVKEIKDPQ-----------CIL---------- 121

Query: 516 WDLRTNSVVQKLRGHSDTVTGLSLSPDGSYIL-SNAMDNTVRIWDIRPYVPGERCVKVMS 574
                       +GH   ++ +  +P   YI+ S+  D+ V IWDI      E+      
Sbjct: 122 ------------KGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIE----NEK-----R 160

Query: 575 GHQHNFEKNLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVN 627
             Q N  K L    W++ G  ++     K ++I D   + IA     H+G  N
Sbjct: 161 AFQINMPKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHDGGKN 213



 Score = 43.8 bits (103), Expect = 0.001
 Identities = 41/179 (22%), Positives = 78/179 (43%), Gaps = 18/179 (10%)

Query: 401 MSGHTGAVMDLKFSTDGCHIF-TCSTDQTLAVWDL----EKGQRIKK----MKGHSTFVN 451
           + GHT +++DL+F+     I  + S D T+ VW++    E  + IK     +KGH   ++
Sbjct: 70  LKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKIS 129

Query: 452 SCDPVRRGQLLIASGSDDCTVKVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVLTGGIDN 511
             D       ++ S   D  V +WD   + +A  +N   +++S+ +N     +    +  
Sbjct: 130 IIDWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQINMPKKLSSLKWNIKGNLLSGTCVGK 189

Query: 512 DIKMWDLRTNSVVQKLRGHSDTVTGLSLSPDG-----SYILSNAMD-NTVR---IWDIR 561
            + + D R   +      H       ++  DG     +YILS     N +R   +WD++
Sbjct: 190 HMHIIDPRKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLK 248



 Score = 39.9 bits (93), Expect = 0.018
 Identities = 29/85 (34%), Positives = 47/85 (55%), Gaps = 12/85 (14%)

Query: 115 GHKSAITVIQYDP-LGHRLATGSKDTDIVLWDV------VAECG--LHRLSGHKGVITDI 165
           GH S+I  +Q++P     LA+GS+D  I +W++      V E       L GHK  I+ I
Sbjct: 72  GHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISII 131

Query: 166 RFMSQPGHHFVVSSAK-DTFVKIWD 189
            +   P +++++ S+  D+FV IWD
Sbjct: 132 DW--NPMNYYIMCSSGFDSFVNIWD 154



 Score = 36.9 bits (85), Expect = 0.14
 Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 14/95 (14%)

Query: 1076 ISLYGHKLPVLSLDMSYD---STLIATGSGDRTVKVWGLDYGDCHKS--------LLAHE 1124
            I L GH   +L  D+ ++   S ++A+GS D T++VW + + D            L  H+
Sbjct: 68   IKLKGHTSSIL--DLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHK 125

Query: 1125 DSVTGVTFVPKTHYFF-TTSKDGRVKQWDADNFER 1158
              ++ + + P  +Y   ++  D  V  WD +N +R
Sbjct: 126  KKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160



 Score = 36.9 bits (85), Expect = 0.14
 Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 14/95 (14%)

Query: 1166 ISLYGHKLPVLSLDMSYD---STLIATGSGDRTVKVWGLDYGDCHKS--------LLAHE 1214
            I L GH   +L  D+ ++   S ++A+GS D T++VW + + D            L  H+
Sbjct: 68   IKLKGHTSSIL--DLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHK 125

Query: 1215 DSVTGVTFVPKTHYFF-TTSKDGRVKQWDADNFER 1248
              ++ + + P  +Y   ++  D  V  WD +N +R
Sbjct: 126  KKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160



 Score = 36.9 bits (85), Expect = 0.14
 Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 14/95 (14%)

Query: 1350 ISLYGHKLPVLSLDMSYD---STLIATGSGDRTVKVWGLDYGDCHKS--------LLAHE 1398
            I L GH   +L  D+ ++   S ++A+GS D T++VW + + D            L  H+
Sbjct: 68   IKLKGHTSSIL--DLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHK 125

Query: 1399 DSVTGVTFVPKTHYFF-TTSKDGRVKQWDADNFER 1432
              ++ + + P  +Y   ++  D  V  WD +N +R
Sbjct: 126  KKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160



 Score = 32.2 bits (73), Expect = 4.4
 Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 10/86 (11%)

Query: 618 KLPGHNGSVNDVQFHP-KEPIIMSASSDKTIYLGESPLHCDKAGS-------ILRSGKGR 669
           KL GH  S+ D+QF+P    I+ S S D TI + E P H D++         IL+  K +
Sbjct: 69  KLKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIP-HNDESVKEIKDPQCILKGHKKK 127

Query: 670 VHTMV-NDKHRQILCCHGNDNVVDLF 694
           +  +  N  +  I+C  G D+ V+++
Sbjct: 128 ISIIDWNPMNYYIMCSSGFDSFVNIW 153


>gnl|CDD|118064 pfam09528, Ehrlichia_rpt, Ehrlichia tandem repeat (Ehrlichia_rpt).
            This entry represents 77 residues of an 80 amino acid
            (240 nucleotide) tandem repeat, found in a variable
            number of copies in an immunodominant outer membrane
            protein of Ehrlichia chaffeensis, a tick-borne obligate
            intracellular pathogen.
          Length = 707

 Score = 48.9 bits (115), Expect = 4e-05
 Identities = 54/279 (19%), Positives = 80/279 (28%), Gaps = 18/279 (6%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
             E+   + P   S     PE ++         S   SS   E     + + E    S  E
Sbjct: 234  HESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGETEKEEGIPESHAE 293

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
                    +V     +S P       S  E E      L  +    +  ES  +   + V
Sbjct: 294  DLQPAVDDIVEHP--SSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGESGVSDQPAQV 351

Query: 2012 SESTTT------SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT---NNPA- 2061
             E   +           E     S   +    S P+ E  +    K  S T    +NP  
Sbjct: 352  VEERESEIEEHQGETEKEEGIPESHAEDDEIASDPSIEHFSAEVGKEVSETEKEESNPEV 411

Query: 2062 -----SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
                   ++       ES     PA  S    SP  E+     PA +     S + E   
Sbjct: 412  KAEDLQPAVDGDVAHHESEVGDKPAETSKEEESPEIEA-EDGEPAKDGGIEESHQEEDEI 470

Query: 2117 TSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE 2155
             S P+ E  T E +          + E       E+V E
Sbjct: 471  VSEPSKEEFTAEVKAEDLQPAVDGSVEHSSSEVGEEVSE 509



 Score = 44.6 bits (104), Expect = 8e-04
 Identities = 58/302 (19%), Positives = 90/302 (29%), Gaps = 27/302 (8%)

Query: 1906 TTNNPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST--------- 1955
               NP SE     ++PE ++      V+ES   SS E     + + + ES          
Sbjct: 57   NVGNPSSEVGKEENAPEVKAEDLEPAVAESVEHSSSEVGKEVSETEKEESNPEVKAEDLQ 116

Query: 1956 ---TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
                      ES     P   S    +PE E+        +    S    E    +S  S
Sbjct: 117  PAVDGDIAHHESEVGDKPAKTSKEEENPEIEAEDGEPAKDDGIEESH--QEEDEIVSESS 174

Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
            +   T+   +E        S   ++S    E + T   +S           ++       
Sbjct: 175  KEEFTAEVKAEDLQPAVDGSIEHSSSEVGEEVSKTEKEESNPEVKAEDLQPAVDDDVAHH 234

Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE---------SESTTTSSPASE 2123
            ES     PA  S    +P  ++         S   SS E          E     S A +
Sbjct: 235  ESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGETEKEEGIPESHAED 294

Query: 2124 STTIEEQGVS-PHSEKLSANEDPEEFPNEDVFEHTFAE--IPNIDHSNQTDEAIPETFDA 2180
                 +  V  P SE   A E+  E   E+      AE      D  +   +   +  + 
Sbjct: 295  LQPAVDDIVEHPSSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGESGVSDQPAQVVEE 354

Query: 2181 RE 2182
            RE
Sbjct: 355  RE 356



 Score = 44.3 bits (103), Expect = 8e-04
 Identities = 60/343 (17%), Positives = 98/343 (28%), Gaps = 38/343 (11%)

Query: 1858 AVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTT 1917
            AVA SV  + SE+    +   +           L      +    ES   + P   S   
Sbjct: 82   AVAESVEHSSSEVGKEVSETEKEESNPEVKAEDLQPAVDGDIAHHESEVGDKPAKTSKEE 141

Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSS------PESESTTTSSLVSESTTTSSPE 1971
             +PE E+        +    S  E +   + S       E ++      V  S   SS E
Sbjct: 142  ENPEIEAEDGEPAKDDGIEESHQEEDEIVSESSKEEFTAEVKAEDLQPAVDGSIEHSSSE 201

Query: 1972 SESTTTSSPESEST-----------------TTSSLVSESTTTSSPESESTTTISPVSES 2014
                 + + + ES                     S V +    +S E E T  +      
Sbjct: 202  VGEEVSKTEKEESNPEVKAEDLQPAVDDDVAHHESEVGDKPAETSKE-EETPEVKAEDLQ 260

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE- 2073
                  V  S++ I      T       ES   +   +      +P+SE   +    SE 
Sbjct: 261  PAVDGSVEHSSSEIEEHQGETEKEEGIPESHAEDLQPAVDDIVEHPSSEPFVAEEEVSET 320

Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
                ++P   +      A   +  S   ++       E E     +   E          
Sbjct: 321  EKEENNPEVLAEDLQDAADGESGVSDQPAQVVEERESEIEEHQGETEKEEGIP------E 374

Query: 2134 PHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPE 2176
             H+E      DP         EH  AE+       + +E+ PE
Sbjct: 375  SHAEDDEIASDPS-------IEHFSAEVGKEVSETEKEESNPE 410



 Score = 41.2 bits (95), Expect = 0.007
 Identities = 52/287 (18%), Positives = 88/287 (30%), Gaps = 12/287 (4%)

Query: 1898 NSPESESTTTNNPESESTTTS----SPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
            +  E  S+     E E + T     +PE  +          +  S   ++       E E
Sbjct: 301  DIVEHPSSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGESGVSDQPAQVVEERESEIE 360

Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
                     E     S   +    S P  E   ++ +  E + T   ES        +  
Sbjct: 361  E-HQGETEKEEGIPESHAEDDEIASDPSIEH-FSAEVGKEVSETEKEESNPEVKAEDLQP 418

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
            +        ES     P   S    SP  E+     P  +     +   E    S P+ E
Sbjct: 419  AVDGDVAHHESEVGDKPAETSKEEESPEIEAEDG-EPAKDGGIEESHQEEDEIVSEPSKE 477

Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES--ESTTTSSPASESTTIEEQG 2131
              T    A E    +   S   ++S    E + T   ES  E      P +   ++ E  
Sbjct: 478  EFTAEVKA-EDLQPAVDGSVEHSSSEVGEEVSETEKEESNPEIKAEDLPPAVDDSL-EHS 535

Query: 2132 VSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDH-SNQTDEAIPET 2177
            +    EK+      E  P     +   A   +++H S++  + + ET
Sbjct: 536  IPEVGEKVDEMFAEEFNPEVIAEDLQPAVDGSVEHSSSEVGDKVCET 582



 Score = 40.4 bits (93), Expect = 0.014
 Identities = 50/254 (19%), Positives = 75/254 (29%), Gaps = 12/254 (4%)

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPESESTTTSSL 1990
             ES     P   S    +PE E+        +    +   E E  + SS E  +    + 
Sbjct: 126  HESEVGDKPAKTSKEEENPEIEAEDGEPAKDDGIEESHQEEDEIVSESSKEEFTAEVKAE 185

Query: 1991 VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
              +     S E  S+     VS++    S        + P  +         ES   + P
Sbjct: 186  DLQPAVDGSIEHSSSEVGEEVSKTEKEESNPEVKAEDLQPAVDDDVAHH---ESEVGDKP 242

Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS----PASESTT 2106
               S     P  ++         S   SS   E     +   E    S       +    
Sbjct: 243  AETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGETEKEEGIPESHAEDLQPAVDDI 302

Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNE---DVFEHTFAEIPN 2163
               P SE        SE T  EE      +E L    D E   ++    V E   +EI  
Sbjct: 303  VEHPSSEPFVAEEEVSE-TEKEENNPEVLAEDLQDAADGESGVSDQPAQVVEERESEIEE 361

Query: 2164 IDHSNQTDEAIPET 2177
                 + +E IPE+
Sbjct: 362  HQGETEKEEGIPES 375



 Score = 39.6 bits (91), Expect = 0.022
 Identities = 56/303 (18%), Positives = 94/303 (31%), Gaps = 13/303 (4%)

Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE-STTTSSPESESTTTSSPE 1951
            E  + +S E  +      + +     S E  S+     VS+     S+PE ++       
Sbjct: 168  EIVSESSKEEFTAEVKAEDLQPAVDGSIEHSSSEVGEEVSKTEKEESNPEVKAEDLQPAV 227

Query: 1952 SEST-TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
             +      S V +    +S E E+    + + +     S+   S+     + E T     
Sbjct: 228  DDDVAHHESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGE-TEKEEG 286

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASE-STTTNNPKSESTTTNNPAS-ESITSS 2068
            + ES       +       P SE        SE     NNP+  +    + A  ES  S 
Sbjct: 287  IPESHAEDLQPAVDDIVEHPSSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGESGVSD 346

Query: 2069 SPASESTTTSS--------PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
             PA       S           E     S A +    S P+ E  +    +  S T    
Sbjct: 347  QPAQVVEERESEIEEHQGETEKEEGIPESHAEDDEIASDPSIEHFSAEVGKEVSETEKEE 406

Query: 2121 ASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDA 2180
            ++     E+   +   +      +  + P E   E    EI   D     D  I E+   
Sbjct: 407  SNPEVKAEDLQPAVDGDVAHHESEVGDKPAETSKEEESPEIEAEDGEPAKDGGIEESHQE 466

Query: 2181 REE 2183
             +E
Sbjct: 467  EDE 469



 Score = 38.5 bits (88), Expect = 0.054
 Identities = 53/250 (21%), Positives = 89/250 (35%), Gaps = 27/250 (10%)

Query: 1966 TTSSPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
               +P SE     ++PE ++      V+ES   SS  SE    +S   +  +     +E 
Sbjct: 57   NVGNPSSEVGKEENAPEVKAEDLEPAVAESVEHSS--SEVGKEVSETEKEESNPEVKAED 114

Query: 2025 TTTISP----ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS--------SPAS 2072
                        ES     PA  S    NP+ E+        + I  S        S +S
Sbjct: 115  LQPAVDGDIAHHESEVGDKPAKTSKEEENPEIEAEDGEPAKDDGIEESHQEEDEIVSESS 174

Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTT----SSPESESTTTSSPASESTTIE 2128
            +   T+   +E    +   S   ++S    E + T    S+PE ++        +     
Sbjct: 175  KEEFTAEVKAEDLQPAVDGSIEHSSSEVGEEVSKTEKEESNPEVKAEDLQPAVDDDVAHH 234

Query: 2129 EQGVSPHSEKLSANEDPEEFPNEDV-------FEHTFAEIPNIDHSNQTDEAIPETFDAR 2181
            E  V     + S  E+  E   ED+        EH+ +EI       + +E IPE+  A 
Sbjct: 235  ESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGETEKEEGIPES-HAE 293

Query: 2182 EEWPQCKDVI 2191
            +  P   D++
Sbjct: 294  DLQPAVDDIV 303



 Score = 35.4 bits (80), Expect = 0.42
 Identities = 38/232 (16%), Positives = 63/232 (27%), Gaps = 9/232 (3%)

Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE-STTTSSPES----- 1942
            S + E+      E     ++  + E  +  S E  S      VSE     S+PE      
Sbjct: 357  SEIEEHQGETEKEEGIPESHAEDDEIASDPSIEHFSAEVGKEVSETEKEESNPEVKAEDL 416

Query: 1943 ESTTTSSPESESTTTSSLVSE-STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
            +           +      +E S    SPE E+     P  +     S   E    S P 
Sbjct: 417  QPAVDGDVAHHESEVGDKPAETSKEEESPEIEA-EDGEPAKDGGIEESHQEEDEIVSEPS 475

Query: 2002 SES-TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
             E  T  +            V  S++ +  E   T       E    + P +   +  + 
Sbjct: 476  KEEFTAEVKAEDLQPAVDGSVEHSSSEVGEEVSETEKEESNPEIKAEDLPPAVDDSLEHS 535

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
              E                  +E    +   S   ++S    +   T   E 
Sbjct: 536  IPEVGEKVDEMFAEEFNPEVIAEDLQPAVDGSVEHSSSEVGDKVCETCEEEF 587


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 47.3 bits (112), Expect = 6e-05
 Identities = 24/116 (20%), Positives = 43/116 (37%), Gaps = 5/116 (4%)

Query: 2045 TTTNNPKSESTTTNNPASE-SITSSSPASESTTTSSP----ASESTTTSSPASESTTTSS 2099
             T    K ++T    PA++ + TS   A    T  +     A ++       S  TTT  
Sbjct: 3    ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKK 62

Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE 2155
               +   T S + ES    +   E+ + E    S    K ++    + F +E ++ 
Sbjct: 63   VTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEKIYS 118



 Score = 39.5 bits (92), Expect = 0.014
 Identities = 21/108 (19%), Positives = 40/108 (37%), Gaps = 5/108 (4%)

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
             T  +   ++T    P ++  TTS   +++  T   K+ ST     A++        S  
Sbjct: 3    ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTA--KTTSTKAAKKAAKV---KKTKSVK 57

Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            TTT     +   T S   ES    +   E+ +    E+ +    + + 
Sbjct: 58   TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSK 105



 Score = 37.6 bits (87), Expect = 0.055
 Identities = 22/106 (20%), Positives = 41/106 (38%), Gaps = 11/106 (10%)

Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1979
             E++ TT       +T    P ++  TTS  E+  T  ++  + +               
Sbjct: 2    AETKKTTKKK----TTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAK----VKK 52

Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
             +S  TTT  +  +   T S + ES    +   E+   S+ V E++
Sbjct: 53   TKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEA--VSAEVFEAS 96



 Score = 37.6 bits (87), Expect = 0.058
 Identities = 20/101 (19%), Positives = 41/101 (40%), Gaps = 7/101 (6%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
             T  + + ++T    P ++  TTS  E+  T  ++  + +                +S  
Sbjct: 3    ETKKTTKKKTTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAK----VKKTKSVK 57

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
            TTT  +  +   T S + ES    + + E+   S+ V E++
Sbjct: 58   TTTKKVTVKFEKTESVKKESVAKKTVKKEA--VSAEVFEAS 96



 Score = 36.5 bits (84), Expect = 0.14
 Identities = 20/113 (17%), Positives = 39/113 (34%), Gaps = 9/113 (7%)

Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
             E++ TT       +T    P ++  TTS  E+  T  ++  + +               
Sbjct: 2    AETKKTTKKK----TTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAKVK----K 52

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
              S  TTT     +   T S + ES    +   E+ +    ++ +    N + 
Sbjct: 53   TKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSK 105



 Score = 36.1 bits (83), Expect = 0.21
 Identities = 19/98 (19%), Positives = 36/98 (36%), Gaps = 7/98 (7%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
             E++ TT      ++T    P ++  TTS   +  T  ++  + +           T S 
Sbjct: 2    AETKKTTK----KKTTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAKVKKTKS- 55

Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
             V  +T   + + E T +   ES +  T    + S   
Sbjct: 56   -VKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEV 92



 Score = 36.1 bits (83), Expect = 0.22
 Identities = 17/100 (17%), Positives = 35/100 (35%), Gaps = 11/100 (11%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
             E++ TT      E    +    ++TT+        T  +  +++        ++     
Sbjct: 2    AETKKTTKKKTTEEKKPAA---KKATTSKETAKTKKTAKTTSTKAA------KKAAKVKK 52

Query: 1970 PESESTTTS--SPESESTTTSSLVSESTTTSSPESESTTT 2007
             +S  TTT   + + E T +    S +  T   E+ S   
Sbjct: 53   TKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEV 92



 Score = 34.9 bits (80), Expect = 0.45
 Identities = 18/98 (18%), Positives = 35/98 (35%), Gaps = 5/98 (5%)

Query: 1905 TTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1964
             T    + ++T    P ++  TTS   +  T  ++  + +           T S      
Sbjct: 3    ETKKTTKKKTTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAKVKKTKS----VK 57

Query: 1965 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
            TTT     +   T S + ES    ++  E+ +    E+
Sbjct: 58   TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEA 95



 Score = 34.5 bits (79), Expect = 0.53
 Identities = 19/113 (16%), Positives = 36/113 (31%), Gaps = 11/113 (9%)

Query: 1995 TTTSSPESESTTTISPVSESTTTSS-PVSESTTTISPES----ESTTTSSPASESTTTNN 2049
             T  + + ++T    P ++  TTS        T  +  +    ++       S  TTT  
Sbjct: 3    ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKK 62

Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
               +   T +   ES+   +   E+       S     +S      T+  P  
Sbjct: 63   VTVKFEKTESVKKESVAKKTVKKEAV------SAEVFEASNKLFKNTSKLPKK 109



 Score = 34.5 bits (79), Expect = 0.62
 Identities = 17/95 (17%), Positives = 34/95 (35%), Gaps = 7/95 (7%)

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
             T  + + ++T    P ++  TTS   +  T  ++  + +           T S      
Sbjct: 3    ETKKTTKKKTTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAKVKKTKS----VK 57

Query: 1995 TTTS--SPESESTTTISPVSESTTTSSPVSESTTT 2027
            TTT   + + E T ++   S +  T    + S   
Sbjct: 58   TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEV 92


>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein.  This family of
            trypanosomal proteins resemble vertebrate mucins. The
            protein consists of three regions. The N and C terminii
            are conserved between all members of the family, whereas
            the central region is not well conserved and contains a
            large number of threonine residues which can be
            glycosylated. Indirect evidence suggested that these
            genes might encode the core protein of parasite mucins,
            glycoproteins that were proposed to be involved in the
            interaction with, and invasion of, mammalian host cells.
            This family contains an N-terminal signal peptide.
          Length = 143

 Score = 44.9 bits (105), Expect = 6e-05
 Identities = 27/92 (29%), Positives = 48/92 (52%), Gaps = 4/92 (4%)

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
            V E+    S  + +TTT +P + +TTT++  +  TTT      +TTT    + + T+ +P
Sbjct: 35   VVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTT--TTKTTTTTTTTTTTTTTTEAP 92

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
            +  +TT+ +P   +T T +P+S      S  S
Sbjct: 93   SKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122



 Score = 44.9 bits (105), Expect = 6e-05
 Identities = 22/92 (23%), Positives = 43/92 (46%), Gaps = 2/92 (2%)

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
             +        +  +TTT      +TTT    + +  +++    +TTT++  + +TTT +P
Sbjct: 33   AAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92

Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            +  +TT+ +P   +T T +P S      S  S
Sbjct: 93   SKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122



 Score = 43.7 bits (102), Expect = 2e-04
 Identities = 27/92 (29%), Positives = 47/92 (51%), Gaps = 4/92 (4%)

Query: 1991 VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
            V E+    S  + +TTT +P + +TTT++  +  TTT      +TTT++  + +TTT  P
Sbjct: 35   VVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTT--TTKTTTTTTTTTTTTTTTEAP 92

Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPAS 2082
               +TT+  P +    + +P+S      S  S
Sbjct: 93   SKNTTTSEAPTTT--DTRAPSSIREIDGSLGS 122



 Score = 43.7 bits (102), Expect = 2e-04
 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 4/90 (4%)

Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
            E+    S  + +TTT++P + +TTT +  + +T T++    +TTT    + +TTT  P+ 
Sbjct: 37   EAAEGQSQTTTTTTTTTPPTTTTTTTT--TTTTITTTTTKTTTTTTTTTTTTTTTEAPSK 94

Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPAS 2092
             + TS +P   +T T +P+S      S  S
Sbjct: 95   NTTTSEAPT--TTDTRAPSSIREIDGSLGS 122



 Score = 42.9 bits (100), Expect = 3e-04
 Identities = 22/78 (28%), Positives = 50/78 (64%), Gaps = 2/78 (2%)

Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
               +  +S++TTT++  +  TTT++  + +TT ++  +++TTT++  + +TTT++ E+ S
Sbjct: 36   VEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTT--TTTTTTTTTEAPS 93

Query: 1975 TTTSSPESESTTTSSLVS 1992
              T++ E+ +TT +   S
Sbjct: 94   KNTTTSEAPTTTDTRAPS 111



 Score = 42.5 bits (99), Expect = 4e-04
 Identities = 21/85 (24%), Positives = 44/85 (51%), Gaps = 2/85 (2%)

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
            A      +    E+    +  + + T+++P + +TTT++  +  TTT      +TTT++ 
Sbjct: 25   AQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTT--TTKTTTTTTTT 82

Query: 2101 ASESTTTSSPESESTTTSSPASEST 2125
             + +TTT +P   +TT+ +P +  T
Sbjct: 83   TTTTTTTEAPSKNTTTSEAPTTTDT 107



 Score = 42.5 bits (99), Expect = 4e-04
 Identities = 21/83 (25%), Positives = 43/83 (51%)

Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
             +           +TTT++    +TTT++  + +T T++    +TTT++  + +TTT +P
Sbjct: 33   AAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92

Query: 1971 ESESTTTSSPESESTTTSSLVSE 1993
               +TT+ +P +  T   S + E
Sbjct: 93   SKNTTTSEAPTTTDTRAPSSIRE 115



 Score = 42.5 bits (99), Expect = 4e-04
 Identities = 21/89 (23%), Positives = 45/89 (50%)

Query: 1941 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 2000
             +           +TTT++    +TTT++  + +T T++    +TTT++  + +TTT +P
Sbjct: 33   AAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92

Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTIS 2029
               +TT+ +P +  T   S + E   ++ 
Sbjct: 93   SKNTTTSEAPTTTDTRAPSSIREIDGSLG 121



 Score = 42.5 bits (99), Expect = 5e-04
 Identities = 26/88 (29%), Positives = 48/88 (54%), Gaps = 1/88 (1%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
                  +S++TTT    +  TTT++  + +TT ++  +++TTT++  + +TTT+   S++
Sbjct: 36   VEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKN 95

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPES 1982
            TTTS     +T T +P S      S  S
Sbjct: 96   TTTSE-APTTTDTRAPSSIREIDGSLGS 122



 Score = 42.2 bits (98), Expect = 5e-04
 Identities = 23/92 (25%), Positives = 44/92 (47%), Gaps = 2/92 (2%)

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
             +           +TTT++    +TTT++  + +T T +    +TTT++  + +TTT +P
Sbjct: 33   AAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
               +TT+ +P   +T T  P S      +  S
Sbjct: 93   SKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122



 Score = 42.2 bits (98), Expect = 6e-04
 Identities = 27/92 (29%), Positives = 52/92 (56%), Gaps = 4/92 (4%)

Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
            V E+    S  + +TTT++P + +TTT++  + +T T++    +TTT +  + +TTT +P
Sbjct: 35   VVEAAEGQSQTTTTTTTTTPPTTTTTTTT--TTTTITTTTTKTTTTTTTTTTTTTTTEAP 92

Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKS 2052
               +TT+ +P   +T T +P+S      +  S
Sbjct: 93   SKNTTTSEAP--TTTDTRAPSSIREIDGSLGS 122



 Score = 41.8 bits (97), Expect = 9e-04
 Identities = 30/108 (27%), Positives = 55/108 (50%), Gaps = 9/108 (8%)

Query: 1965 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
            T     + ++    + E +S TT++    +TTT+ P + +TTT    + +T T++    +
Sbjct: 24   TAQGEGQYDAAVVEAAEGQSQTTTT----TTTTTPPTTTTTTT---TTTTTITTTTTKTT 76

Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
            TTT +  + +TTT +P+  +TT+  P   +T T  P+S      S  S
Sbjct: 77   TTTTTTTTTTTTTEAPSKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122



 Score = 40.2 bits (93), Expect = 0.003
 Identities = 31/109 (28%), Positives = 57/109 (52%), Gaps = 11/109 (10%)

Query: 1905 TTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1964
            T     + ++    + E +S TT++    +TTT+ P + +TTT++  + +TTT+      
Sbjct: 24   TAQGEGQYDAAVVEAAEGQSQTTTT----TTTTTPPTTTTTTTTTTTTITTTTT------ 73

Query: 1965 TTTSSPESESTTTSSPESES-TTTSSLVSESTTTSSPESESTTTISPVS 2012
             TT++  + +TTT++ E+ S  TT+S    +T T +P S      S  S
Sbjct: 74   KTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPSSIREIDGSLGS 122



 Score = 40.2 bits (93), Expect = 0.003
 Identities = 31/108 (28%), Positives = 58/108 (53%), Gaps = 9/108 (8%)

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
            T     + ++    + E +S TT++     TTT++P + +TTT++  +  TTT++    +
Sbjct: 24   TAQGEGQYDAAVVEAAEGQSQTTTT-----TTTTTPPTTTTTTTTTTTTITTTTT--KTT 76

Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
            TTT++  + +TTT +P   +TT+ +P   +T T +P S      S  S
Sbjct: 77   TTTTTTTTTTTTTEAPSKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122



 Score = 39.8 bits (92), Expect = 0.004
 Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
                  +    E+    S  + +TTT++P + +TTT++  +  TTT      +TTT++  
Sbjct: 26   QGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTT--TTKTTTTTTTTT 83

Query: 2112 SESTTTSSPASESTTIE 2128
            + +TTT +P+  +TT E
Sbjct: 84   TTTTTTEAPSKNTTTSE 100



 Score = 39.5 bits (91), Expect = 0.005
 Identities = 22/71 (30%), Positives = 44/71 (61%), Gaps = 2/71 (2%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
            S+ TTT +  +  TTT    + +TT ++  +++TTT++  + +TTT++ E+ S  T++ E
Sbjct: 43   SQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTT--TTTTTTTTTEAPSKNTTTSE 100

Query: 1952 SESTTTSSLVS 1962
            + +TT +   S
Sbjct: 101  APTTTDTRAPS 111



 Score = 39.1 bits (90), Expect = 0.007
 Identities = 24/89 (26%), Positives = 47/89 (52%), Gaps = 9/89 (10%)

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
               +   S++TTT      +TTT  P + + T++   + +T T++    +TTT++  + +
Sbjct: 36   VEAAEGQSQTTTT------TTTTTPPTTTTTTTT---TTTTITTTTTKTTTTTTTTTTTT 86

Query: 2095 TTTSSPASESTTTSSPESESTTTSSPASE 2123
            TTT +P+  +TT+ +P +  T   S   E
Sbjct: 87   TTTEAPSKNTTTSEAPTTTDTRAPSSIRE 115



 Score = 33.3 bits (75), Expect = 0.58
 Identities = 24/78 (30%), Positives = 43/78 (55%), Gaps = 1/78 (1%)

Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
            +T  +  +  TTT +  + +TT     +++TTT++  + +TTT+   S++TTTS   + +
Sbjct: 46   TTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPT-T 104

Query: 1945 TTTSSPESESTTTSSLVS 1962
            T T +P S      SL S
Sbjct: 105  TDTRAPSSIREIDGSLGS 122



 Score = 33.3 bits (75), Expect = 0.71
 Identities = 15/55 (27%), Positives = 34/55 (61%)

Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
               +   S++TTT++  +  TTT++  + +TT ++  +++TTT++  + +TT  E
Sbjct: 36   VEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTE 90



 Score = 32.9 bits (74), Expect = 0.75
 Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 1/77 (1%)

Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESES-TTTSSLVSEST 1935
             + +T       +  +  TTT    + + TT    + +TTT++ E+ S  TT+S    +T
Sbjct: 46   TTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTT 105

Query: 1936 TTSSPESESTTTSSPES 1952
             T +P S      S  S
Sbjct: 106  DTRAPSSIREIDGSLGS 122



 Score = 31.0 bits (69), Expect = 4.2
 Identities = 15/56 (26%), Positives = 30/56 (53%)

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
            A       +   E+    S  + +TTT++P + +TTT++  +  TTT++  + +TT
Sbjct: 25   AQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTT 80



 Score = 29.8 bits (66), Expect = 7.9
 Identities = 15/61 (24%), Positives = 27/61 (44%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
             T   + +T   +   +     TTT +  + +TTT  P   +TT+ +P +  T   S + 
Sbjct: 55   PTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPSSIR 114

Query: 1933 E 1933
            E
Sbjct: 115  E 115



 Score = 29.8 bits (66), Expect = 8.9
 Identities = 16/60 (26%), Positives = 29/60 (48%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
            TT   + +T   +T     +   TT +  + +TTT   E+ S  T++ E+ +TT +   S
Sbjct: 52   TTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPS 111


>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612.  This family
            includes several uncharacterized proteins from
            Caenorhabditis elegans.
          Length = 517

 Score = 47.7 bits (112), Expect = 7e-05
 Identities = 52/283 (18%), Positives = 101/283 (35%), Gaps = 17/283 (6%)

Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
            +++  +T +P  E      + ++ +   +PE + T T++P   +     +  +    +  
Sbjct: 163  KTKKASTPAPVEEEIVVKKVANDRSAAPAPEPK-TPTNTPAEPAEQVQEITGKKNKKNKK 221

Query: 1971 ESESTTTSSPESESTTTSS---LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
            +SES  T++P S          +  E    ++P+ +        SES    +    S T 
Sbjct: 222  KSESEATAAPASVEQVVEQPKVVTEEPHQQAAPQEKKNKKNKRKSESENVPAA---SETP 278

Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA---SES 2084
            + P  E   T+ PASE+   N    + + +     E + + +P S+  T           
Sbjct: 279  VEPVVE---TTPPASENQKKNKKDKKKSESEKVVEEPVQAEAPKSKKPTADDNMDFLDFV 335

Query: 2085 TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
            T    P  E   T +   E    +  E+    +++P +     + +     SE     E 
Sbjct: 336  TAKEEPKDEPAETPAAPVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSESEKVTEQ 395

Query: 2145 PEEF----PNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
            P E     P  +               N+ D+   E+  A EE
Sbjct: 396  PVESAPAPPQVEQVVEKTPPASENKKKNKKDKKKSESEKAVEE 438



 Score = 47.4 bits (111), Expect = 8e-05
 Identities = 49/233 (21%), Positives = 90/233 (38%), Gaps = 25/233 (10%)

Query: 1908 NNPESESTTTSSPESESTTTSS---LVSESTTTSSPESESTTTSSPESES----TTTSSL 1960
            N  +SES  T++P S          +  E    ++P+ +    +  +SES      + + 
Sbjct: 219  NKKKSESEATAAPASVEQVVEQPKVVTEEPHQQAAPQEKKNKKNKRKSESENVPAASETP 278

Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES-------------TTT 2007
            V     T+ P SE+   +  + + + +  +V E     +P+S+              T  
Sbjct: 279  VEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQAEAPKSKKPTADDNMDFLDFVTAK 338

Query: 2008 ISPVSE-STTTSSPVSESTTTISPESESTTTSSPASESTTTNNP---KSESTTTNNPASE 2063
              P  E + T ++PV E    +       +T+ PA+E+   N     KSES        E
Sbjct: 339  EEPKDEPAETPAAPVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSESEKVTEQPVE 398

Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
            S  +     +    + PASE+   +    +  + S  A E    ++P S+  T
Sbjct: 399  SAPAPPQVEQVVEKTPPASENKKKNKK-DKKKSESEKAVEEPVQAAPSSKKPT 450



 Score = 43.5 bits (101), Expect = 0.002
 Identities = 37/203 (18%), Positives = 76/203 (37%), Gaps = 20/203 (9%)

Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
            V       + +++  +T +P  E      + ++ +   +PE + T T +P   +      
Sbjct: 153  VKAEKAEKAEKTKKASTPAPVEEEIVVKKVANDRSAAPAPEPK-TPTNTPAEPAEQVQEI 211

Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
              +       +SES  T++PAS       PK     T  P  ++        ++   S  
Sbjct: 212  TGKKNKKNKKKSESEATAAPASVEQVVEQPK---VVTEEPHQQAAPQEKKNKKNKRKSES 268

Query: 2081 ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEK-- 2138
             +    + +P      T+ PASE+   +  + + + +     E      Q  +P S+K  
Sbjct: 269  ENVPAASETPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPV----QAEAPKSKKPT 324

Query: 2139 ----------LSANEDPEEFPNE 2151
                      ++A E+P++ P E
Sbjct: 325  ADDNMDFLDFVTAKEEPKDEPAE 347



 Score = 35.0 bits (79), Expect = 0.62
 Identities = 34/227 (14%), Positives = 76/227 (33%), Gaps = 11/227 (4%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS---LVSESTTTSSPESESTTTS 1948
            SEN   N  + + + +     E     +P+S+  T       +   T    P+ E   T 
Sbjct: 290  SENQKKNKKDKKKSESEKVVEEPVQAEAPKSKKPTADDNMDFLDFVTAKEEPKDEPAETP 349

Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
            +   E    + + +    +++P +      + + +  + S  V+E    S+P        
Sbjct: 350  AAPVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSESEKVTEQPVESAPAP------ 403

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS-ESITS 2067
             P  E     +P +      + + +  + S  A E      P S+  T ++        +
Sbjct: 404  -PQVEQVVEKTPPASENKKKNKKDKKKSESEKAVEEPVQAAPSSKKPTADDNMDFLDFVT 462

Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
            + P    +      + +    + A E T  ++   +       + ES
Sbjct: 463  AKPDKSESAEEHIEAPAIVEPAHADEETAAAAEGKKKNKKDKKKKES 509


>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
            transduction mechanisms].
          Length = 612

 Score = 46.9 bits (111), Expect = 1e-04
 Identities = 42/247 (17%), Positives = 86/247 (34%), Gaps = 9/247 (3%)

Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
            T    P S S T  S+  + +T  +  S S  +S       + ++    S      + ++
Sbjct: 1    TEERPPISRSGTGISMTHDKSTRPNDRSMSNDSSLCGLNQASDANGNEYSPNNKVSKKDT 60

Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESES 2034
             +    ++ S   +          +        +   +          ++    +    S
Sbjct: 61   FSDQLHDALSKEFTLERERDRLQLNKRKYQAIRLQTSTPIVEIFKNNKDAVDPPNHTRSS 120

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS---SPASESTTTSSPA 2091
                S A+  T +      S + N P  +    + P S  +         S  T +S P+
Sbjct: 121  GNNLSNANVKTLSAPVGEHSRSNNPPNLDQNLDTEPESSISQWGELQLNPSGKTLSSQPS 180

Query: 2092 SESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNE 2151
             + T+ + P SES  +  P    T+ +SP  + + ++    +  +E+ S N  P  +P+ 
Sbjct: 181  RKPTSEN-PKSESDNSKLP----TSVNSPLPDKSLLKRTLSNFWAERNSYNWKPLVYPSC 235

Query: 2152 DVFEHTF 2158
               EH F
Sbjct: 236  PS-EHIF 241



 Score = 39.5 bits (92), Expect = 0.026
 Identities = 35/216 (16%), Positives = 64/216 (29%), Gaps = 10/216 (4%)

Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
            N  S+       L+  LS+  T            N            +          ++
Sbjct: 53   NKVSKKDTFSDQLHDALSKEFT--LERERDRLQLNKRKYQAIRLQTSTPIVEIFKNNKDA 110

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV--- 1991
                +    S    S  +  T ++ +   S + + P  +    + PES  +    L    
Sbjct: 111  VDPPNHTRSSGNNLSNANVKTLSAPVGEHSRSNNPPNLDQNLDTEPESSISQWGELQLNP 170

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
            S  T +S P  + T+  +P SES  +  P S ++          T S+  +E  + N   
Sbjct: 171  SGKTLSSQPSRKPTSE-NPKSESDNSKLPTSVNSPLPDKSLLKRTLSNFWAERNSYNWKP 229

Query: 2052 SESTT----TNNPASESITSSSPASESTTTSSPASE 2083
                +         S+ I      S         S+
Sbjct: 230  LVYPSCPSEHIFSDSDVIIREDEPSSLIAFCLSTSD 265



 Score = 33.0 bits (75), Expect = 2.2
 Identities = 25/171 (14%), Positives = 50/171 (29%), Gaps = 17/171 (9%)

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
             P+S S T  S   + +T  +  S S  +S       +  N    S        ++ +  
Sbjct: 5    PPISRSGTGISMTHDKSTRPNDRSMSNDSSLCGLNQASDANGNEYSPNNKVSKKDTFSDQ 64

Query: 2069 SPASESTTTSSPASES-------------TTTSSPASESTTTSSPASESTTTSSPESEST 2115
               + S   +                     TS+P  E    +  A +    +     + 
Sbjct: 65   LHDALSKEFTLERERDRLQLNKRKYQAIRLQTSTPIVEIFKNNKDAVDPPNHTRSSGNNL 124

Query: 2116 TTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDH 2166
            + ++  + S  + E   S +   L  N D E     +     + E+     
Sbjct: 125  SNANVKTLSAPVGEHSRSNNPPNLDQNLDTE----PESSISQWGELQLNPS 171


>gnl|CDD|225828 COG3291, COG3291, FOG: PKD repeat [General function prediction only].
          Length = 297

 Score = 46.0 bits (109), Expect = 2e-04
 Identities = 36/242 (14%), Positives = 81/242 (33%), Gaps = 15/242 (6%)

Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE-SES 1954
            +T +P S      + E+ +        +     + V+ +   ++     T T+  E SE+
Sbjct: 21   STGTPTSWIWDFGDGENSTEQNPIHTYKKVGNYT-VNLTVENAAGSDTETKTNYIEVSEA 79

Query: 1955 TTTSSLVSESTTTSSPESES-TTTSSPESES---------TTTSSLVSESTTTSSPESES 2004
               +   +  T+  +P + + T TS+ E+ S          TTS+  +   T +   + +
Sbjct: 80   PPVADFTANPTSGYAPLTVNFTDTSTNEATSWSWDFGDGGVTTSTEQNPVHTYTDAGTYT 139

Query: 2005 TTTISPVSEST-TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
             T    VS ST + S   ++  T      E     + ++  T         +++ N +S 
Sbjct: 140  VTLT--VSNSTGSDSKTKTDYVTVSEEGIEEAVPEAASTVVTKPLTVSGTESSSGNLSSW 197

Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
                      ++T  +P        +  S    T    ++        + +        +
Sbjct: 198  VYVFEDDKGTNSTVKTPLLGGVIKVTLGSPLPDTVVYPTDKEGKGYYITLTGNGEFSFVD 257

Query: 2124 ST 2125
              
Sbjct: 258  VV 259



 Score = 45.3 bits (107), Expect = 3e-04
 Identities = 44/267 (16%), Positives = 88/267 (32%), Gaps = 27/267 (10%)

Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTS-SL 1930
            FT  +    T  +       +          +        + +   ++     T T+   
Sbjct: 17   FTDGSTGTPTSWIWDFGDGENSTEQNPIHTYKKVGNYT-VNLTVENAAGSDTETKTNYIE 75

Query: 1931 VSESTTTSSPESESTTTSSPESES-TTTSSLVSES---------TTTSSPESESTTTSSP 1980
            VSE+   +   +  T+  +P + + T TS+  + S          TTS+ ++   T +  
Sbjct: 76   VSEAPPVADFTANPTSGYAPLTVNFTDTSTNEATSWSWDFGDGGVTTSTEQNPVHTYTDA 135

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
             + + T +  VS ST + S     T  ++   E    + P + ST    P + S T SS 
Sbjct: 136  GTYTVTLT--VSNSTGSDS--KTKTDYVTVSEEGIEEAVPEAASTVVTKPLTVSGTESSS 191

Query: 2041 ASESTTT--------NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
             + S+           N   ++         ++ S  P +    T     E        +
Sbjct: 192  GNLSSWVYVFEDDKGTNSTVKTPLLGGVIKVTLGSPLPDTVVYPTD---KEGKGYYITLT 248

Query: 2093 ESTTTSSPASESTTTSSPESESTTTSS 2119
             +   S     +   +   SE+ + S 
Sbjct: 249  GNGEFSFVDVVAYVKNGDWSENNSPSE 275



 Score = 41.4 bits (97), Expect = 0.005
 Identities = 38/218 (17%), Positives = 69/218 (31%), Gaps = 25/218 (11%)

Query: 1861 ISVIDNYSEIIFTTNNNSES---TVVMSTLNSLLSEN--------TTTNSPE-SESTTTN 1908
            I V +      FT N  S     TV  +  ++  + +          T S E +   T  
Sbjct: 74   IEVSEAPPVADFTANPTSGYAPLTVNFTDTSTNEATSWSWDFGDGGVTTSTEQNPVHTYT 133

Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1968
            +  + + T +   S  + + +     T   +   E    + PE+ ST  +  ++ S T S
Sbjct: 134  DAGTYTVTLTVSNSTGSDSKT----KTDYVTVSEEGIEEAVPEAASTVVTKPLTVSGTES 189

Query: 1969 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI 2028
            S  + S+     E +  T S       T  +P       ++  S    T    ++     
Sbjct: 190  SSGNLSSWVYVFEDDKGTNS-------TVKTPLLGGVIKVTLGSPLPDTVVYPTDKEGKG 242

Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
               + +        +          S   NN  SE I 
Sbjct: 243  YYITLTGNGEFSFVDVVAYVKNGDWS--ENNSPSEYID 278


>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566).  Family of
            related proteins that is plant specific.
          Length = 313

 Score = 45.7 bits (108), Expect = 2e-04
 Identities = 38/132 (28%), Positives = 54/132 (40%), Gaps = 7/132 (5%)

Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
            + S  S S   SSP S S   +S    ST+ SS         SP S S   ++ +S S+ 
Sbjct: 4    SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNT-SSSSSF 62

Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
              + +  S+ +    S    S S  S S   S   S +T ++S +S      SP+    T
Sbjct: 63   GLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSS------SPSRSRRT 116

Query: 2107 TSSPESESTTTS 2118
            TSS  S     S
Sbjct: 117  TSSDLSSGNGPS 128



 Score = 39.9 bits (93), Expect = 0.013
 Identities = 33/149 (22%), Positives = 49/149 (32%), Gaps = 11/149 (7%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
                S +TS   S     S   S     SS    ++ +S P   +   S   S     +S
Sbjct: 3    ASVSSGSTSGDAS-----SPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTS 57

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT----SSPVSEST 2025
              S S+   S +  S+ +   +S    + S  S S       S +T +    SSP     
Sbjct: 58   --SSSSFGLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRR 115

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSES 2054
            TT S  S     S  +  +      K  S
Sbjct: 116  TTSSDLSSGNGPSVLSFMADVKRGKKGPS 144



 Score = 37.6 bits (87), Expect = 0.064
 Identities = 40/152 (26%), Positives = 52/152 (34%), Gaps = 27/152 (17%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
            + S  S S   SSP S S    S    ST+ SS        +SP S S   ++  S S  
Sbjct: 4    SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFG 63

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
             S     S              S    S+  +SP        S  S S   +   S +T 
Sbjct: 64   LSKQRPSS-------------LSRGRLSSRFVSP--------SRGSPSAAASLNGSLATA 102

Query: 2057 TNNPASESITSSSPASESTTTSSPASESTTTS 2088
            + +       SSSP+    TTSS  S     S
Sbjct: 103  STSG------SSSPSRSRRTTSSDLSSGNGPS 128



 Score = 35.3 bits (81), Expect = 0.32
 Identities = 27/108 (25%), Positives = 42/108 (38%), Gaps = 1/108 (0%)

Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
            ++S  S S   SSP S S    +    ST+ ++        +SP S S   ++ +S S+ 
Sbjct: 4    SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNT-SSSSSF 62

Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
              S    S+ +    S    + S  S S   S   S +T       SP
Sbjct: 63   GLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSP 110



 Score = 33.4 bits (76), Expect = 1.3
 Identities = 21/75 (28%), Positives = 31/75 (41%), Gaps = 1/75 (1%)

Query: 2067 SSSPASESTTTSSPASES-TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASEST 2125
            S S  S S   SSP S S    SS    ++ +S P   +   S P S     +S +S   
Sbjct: 4    SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFG 63

Query: 2126 TIEEQGVSPHSEKLS 2140
              +++  S    +LS
Sbjct: 64   LSKQRPSSLSRGRLS 78



 Score = 33.0 bits (75), Expect = 2.0
 Identities = 34/131 (25%), Positives = 53/131 (40%), Gaps = 15/131 (11%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSS----LVSESTTTSSPESESTTTSSPES 1952
            + S  S S   ++P S S    S    ST+ SS    L + ++  SS  + +T++SS   
Sbjct: 4    SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFG 63

Query: 1953 -----ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
                  S+ +   +S    + S  S S   S   S +T ++S        SSP     TT
Sbjct: 64   LSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGS------SSPSRSRRTT 117

Query: 2008 ISPVSESTTTS 2018
             S +S     S
Sbjct: 118  SSDLSSGNGPS 128



 Score = 31.5 bits (71), Expect = 5.9
 Identities = 29/119 (24%), Positives = 47/119 (39%), Gaps = 2/119 (1%)

Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESES-TTTNNPESESTTTSSPESESTTTSSLVS 1932
             + +S  +     L+S     + ++ P   +   +    S +  TSS  S   +     S
Sbjct: 12   GDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFGLSKQRPSS 71

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPESESTTTSSL 1990
             S    S    S +  SP + ++   SL + ST+ +SSP     TTSS  S     S L
Sbjct: 72   LSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSSDLSSGNGPSVL 130


>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47. 
          Length = 627

 Score = 46.4 bits (109), Expect = 2e-04
 Identities = 51/249 (20%), Positives = 100/249 (40%), Gaps = 11/249 (4%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
            T +++P S S +  +P   ST   +PE         V+++ T    ++   T ++P   +
Sbjct: 240  TPSSTPSSTSASITSPHIPSTNIPTPE------PPPVTKNFTELHTDTIKVTPNTPTITA 293

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
             TT S+      +  P    T T  P       +++ +E  T +  E+  +       E+
Sbjct: 294  QTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPKSPPKPTNFEN 353

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
            TT   P +  + T+   + + T    ++   TT   +  +    +    SI   S + +S
Sbjct: 354  TTIRIPETFESATV---ATNATQKIESTTFATTIGIEEINDNIYSSPKNSIYLKSKSQQS 410

Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTTTSS-PESESTTTSSPASESTTIEEQGVS 2133
            TT  + A  +T      +      +  S +T   +  +    TT   ++E  TI+   V+
Sbjct: 411  TTKFTDAEHTTPILKFTTWQDAARTYMSHNTEVQNMTDRFQRTTLKSSNELPTIQTLSVT 470

Query: 2134 PHSEKLSAN 2142
            P  +KL +N
Sbjct: 471  P-KKKLPSN 478



 Score = 43.3 bits (101), Expect = 0.002
 Identities = 64/279 (22%), Positives = 107/279 (38%), Gaps = 24/279 (8%)

Query: 1839 LSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTN 1898
            +++S +  +NL  S+   T     +  NY+  ++    N+ S  + S            N
Sbjct: 166  MAISKFSNSNLTRSLTPFTP---EIFFNYTSFVYFLLYNTTS-CIPSNDQYFEHSPKPIN 221

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
               S      N +S  TTT S    ST        S + +SP   ST   +PE       
Sbjct: 222  VTTSFGRAIVNFDSILTTTPSSTPSST--------SASITSPHIPSTNIPTPE------P 267

Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
              V+++ T    ++   T ++P   + TT S+      +  P    T T  P       +
Sbjct: 268  PPVTKNFTELHTDTIKVTPNTPTITAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNA 327

Query: 2019 SPVSESTTTISPESESTTTSSPASESTTTNNPKS-ESTTTNNPASESITSSSPASESTTT 2077
            +  +E  T    E+  +       E+TT   P++ ES T    A++ I S++ A   TT 
Sbjct: 328  TIKTEQNTENPTENPKSPPKPTNFENTTIRIPETFESATVATNATQKIESTTFA---TTI 384

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
                      SSP +     S   S+ +TT   ++E TT
Sbjct: 385  GIEEINDNIYSSPKNSIYLKSK--SQQSTTKFTDAEHTT 421



 Score = 42.9 bits (100), Expect = 0.002
 Identities = 60/292 (20%), Positives = 121/292 (41%), Gaps = 25/292 (8%)

Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT------TSSL 1930
            N +S +  +  ++  S + +  SP   ST    PE    T +  E  + T      T ++
Sbjct: 232  NFDSILTTTPSSTPSSTSASITSPHIPSTNIPTPEPPPVTKNFTELHTDTIKVTPNTPTI 291

Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSL-VSESTTTSSPESESTTTSSPESESTTTSS 1989
             +++T +     + +    P    T   +L +  + T  + ++    T +P+S    T+ 
Sbjct: 292  TAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPKSPPKPTN- 350

Query: 1990 LVSESTTTSSPESESTTTISPVS----ESTT--TSSPVSESTTTISPESESTTTSSPASE 2043
               E+TT   PE+  + T++  +    ESTT  T+  + E    I    +++      S+
Sbjct: 351  --FENTTIRIPETFESATVATNATQKIESTTFATTIGIEEINDNIYSSPKNSIYLKSKSQ 408

Query: 2044 STTTNNPKSEST------TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
             +TT    +E T      TT   A+ +  S +   ++ T     +   +++   +  T +
Sbjct: 409  QSTTKFTDAEHTTPILKFTTWQDAARTYMSHNTEVQNMTDRFQRTTLKSSNELPTIQTLS 468

Query: 2098 SSPASE--STTTSSPESESTTTSSPASEST-TIEEQGVSPHSEKLSANEDPE 2146
             +P  +  S  T+  E   T  + P+S S+ +I E    P   ++SA+   E
Sbjct: 469  VTPKKKLPSNVTAKTEVHITNNALPSSNSSHSITEVTEEPKHNRMSASTHEE 520



 Score = 40.2 bits (93), Expect = 0.015
 Identities = 52/220 (23%), Positives = 94/220 (42%), Gaps = 18/220 (8%)

Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTS--SLVSESTTTSSPESESTTTSSPES 1982
            T  S  ++ S  ++S  + S T  +PE     TS    +  +TT+  P ++     SP+ 
Sbjct: 160  TDESLQMAISKFSNSNLTRSLTPFTPEIFFNYTSFVYFLLYNTTSCIPSNDQYFEHSPKP 219

Query: 1983 ESTTTS--------SLVSESTTTSSPESESTTTISPVSESTTTS----SPVSESTTTISP 2030
             + TTS          +  +T +S+P S S +  SP   ST        PV+++ T +  
Sbjct: 220  INVTTSFGRAIVNFDSILTTTPSSTPSSTSASITSPHIPSTNIPTPEPPPVTKNFTELHT 279

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
            ++   T ++P   + TT + K     ++ P      +  P       ++  +E  T +  
Sbjct: 280  DTIKVTPNTPTITAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPT 339

Query: 2091 ASESTTTSSPASESTTTSSPES-ESTTTSSPAS---ESTT 2126
             +  +       E+TT   PE+ ES T ++ A+   ESTT
Sbjct: 340  ENPKSPPKPTNFENTTIRIPETFESATVATNATQKIESTT 379



 Score = 39.4 bits (91), Expect = 0.023
 Identities = 52/270 (19%), Positives = 108/270 (40%), Gaps = 29/270 (10%)

Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS----ESTTTSSPE------- 1941
            ENTT   PE+  + T        T ++ + ESTT ++ +          SSP+       
Sbjct: 352  ENTTIRIPETFESAT------VATNATQKIESTTFATTIGIEEINDNIYSSPKNSIYLKS 405

Query: 1942 -SESTTTSSPESESTT------TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
             S+ +TT   ++E TT      T    + +  + + E ++ T     +   +++ L +  
Sbjct: 406  KSQQSTTKFTDAEHTTPILKFTTWQDAARTYMSHNTEVQNMTDRFQRTTLKSSNELPTIQ 465

Query: 1995 TTTSSPESESTTTISPVSESTTT-----SSPVSESTTTISPESESTTTSSPASESTTTNN 2049
            T + +P+ +  + ++  +E   T     SS  S S T ++ E +    S+   E      
Sbjct: 466  TLSVTPKKKLPSNVTAKTEVHITNNALPSSNSSHSITEVTEEPKHNRMSASTHEEINHTE 525

Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
                +   N   SE  T+      + T  + +S+    +     STT   P + ++  S+
Sbjct: 526  IAQITPILNAHTSEKSTTPQRPFTAETFLTTSSKPAILTWSNLLSTTPKEPLTNTSLRST 585

Query: 2110 PESESTTTSSPASESTTIEEQGVSPHSEKL 2139
                +  T+S  ++S  + +  +S  +  +
Sbjct: 586  DHITTQLTTSNRTQSAKLTKAHISSQTTNI 615



 Score = 39.1 bits (90), Expect = 0.032
 Identities = 51/232 (21%), Positives = 99/232 (42%), Gaps = 18/232 (7%)

Query: 1904 STTTNNPESESTTTSSPESESTTTS--SLVSESTTTSSPESESTTTSSPESESTTTS--- 1958
            S  +N+  + S T  +PE     TS    +  +TT+  P ++     SP+  + TTS   
Sbjct: 169  SKFSNSNLTRSLTPFTPEIFFNYTSFVYFLLYNTTSCIPSNDQYFEHSPKPINVTTSFGR 228

Query: 1959 SLVS-ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
            ++V+ +S  T++P S  ++TS+    S T+  + S +  T  P         PV+++ T 
Sbjct: 229  AIVNFDSILTTTPSSTPSSTSA----SITSPHIPSTNIPTPEP--------PPVTKNFTE 276

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
                +   T  +P   + TT S       ++ P+   T T+ P      +++  +E  T 
Sbjct: 277  LHTDTIKVTPNTPTITAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTE 336

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
            +   +  +       E+TT   P +  + T +  +     S+  + +  IEE
Sbjct: 337  NPTENPKSPPKPTNFENTTIRIPETFESATVATNATQKIESTTFATTIGIEE 388



 Score = 31.7 bits (71), Expect = 6.4
 Identities = 25/96 (26%), Positives = 47/96 (48%), Gaps = 11/96 (11%)

Query: 2054 STTTNNPASESITSSSPASESTTTSSPAS--------ESTTTSSPASESTTTSSPASEST 2105
            +TT+  P+++     SP   + TTS   +         +T +S+P+S S + +SP   ST
Sbjct: 201  NTTSCIPSNDQYFEHSPKPINVTTSFGRAIVNFDSILTTTPSSTPSSTSASITSPHIPST 260

Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA 2141
               +PE    T +     + TI+   V+P++  ++A
Sbjct: 261  NIPTPEPPPVTKNFTELHTDTIK---VTPNTPTITA 293



 Score = 31.7 bits (71), Expect = 6.5
 Identities = 35/156 (22%), Positives = 68/156 (43%), Gaps = 7/156 (4%)

Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
            +N +  T V  T N+L S N++ +  E      +N  S ST      +E    + +++  
Sbjct: 477  SNVTAKTEVHITNNALPSSNSSHSITEVTEEPKHNRMSASTHEEINHTEIAQITPILNAH 536

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
            T+  S   +   T+     +++  ++++ S   S+   E  T +S  S    T+ L    
Sbjct: 537  TSEKSTTPQRPFTAETFLTTSSKPAILTWSNLLSTTPKEPLTNTSLRSTDHITTQL---- 592

Query: 1995 TTTSSPESESTTTISPVSESTTTSSP--VSESTTTI 2028
             TTS+    +  T + +S  TT   P  ++E +T +
Sbjct: 593  -TTSNRTQSAKLTKAHISSQTTNIYPQTITERSTDV 627


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
            family). 
          Length = 619

 Score = 46.4 bits (110), Expect = 2e-04
 Identities = 34/163 (20%), Positives = 51/163 (31%), Gaps = 13/163 (7%)

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
              ST +S P + ST  +     S T S   + + T SS  S+  + IS  + +T   S  
Sbjct: 461  YGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSS-LSQEASLISKSTGNTHKHSTP 519

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
               TT           ++  S                N  + S  SS  +  S +     
Sbjct: 520  RRLTTL------PKLPAASRSSKGNLIRSG------ANGNASSDLSSPGSINSKSPEHSV 567

Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
                        STT    ++ ST          +  SP  ES
Sbjct: 568  PLVRVFDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLSPPKES 610



 Score = 44.1 bits (104), Expect = 0.001
 Identities = 33/178 (18%), Positives = 62/178 (34%), Gaps = 9/178 (5%)

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
              +S   E    S+  S  +T S+  ++   TS+      T S   + + T SSL S+  
Sbjct: 451  NKTSTVMEPPYGSTESSVPSTPSTRRNDRNITSN------TPSLKRTPNLTKSSL-SQEA 503

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
            +  S  + +T   S     TT     + S ++       +  ++  + S  ++     S 
Sbjct: 504  SLISKSTGNTHKHSTPRRLTTLPKLPAASRSS-KGNLIRSG-ANGNASSDLSSPGSINSK 561

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
            +  +              STT    ++ ST          +  SP  ES  T+   + 
Sbjct: 562  SPEHSVPLVRVFDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLSPPKESVATTPRLNS 619



 Score = 40.2 bits (94), Expect = 0.013
 Identities = 36/162 (22%), Positives = 59/162 (36%), Gaps = 9/162 (5%)

Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
            ST+      +T ++ P + ST  N+    S T S   + + T SSL S+  +  S  + +
Sbjct: 454  STVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSSL-SQEASLISKSTGN 512

Query: 1945 TTTSSPESESTTTSSLVSEST--------TTSSPESESTTTSSPESESTTTSSLVSESTT 1996
            T   S     TT   L + S         + ++  + S  +S     S +    V     
Sbjct: 513  THKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEHSVPLVRV 572

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
                   STT     + ST          + +SP  ES  T+
Sbjct: 573  FDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLSPPKESVATT 614



 Score = 40.2 bits (94), Expect = 0.013
 Identities = 35/189 (18%), Positives = 55/189 (29%), Gaps = 24/189 (12%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
                ST    P   ST +S P + ST  +     S T S   + + T SS  S+  +  S
Sbjct: 450  ANKTSTVMEPP-YGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSS-LSQEASLIS 507

Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
              + +T   S     TT     + S                 S     I         ++
Sbjct: 508  KSTGNTHKHSTPRRLTTLPKLPAASR----------------SSKGNLIRS------GAN 545

Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
              + S  +      S +           +     STT    ++ S           +  S
Sbjct: 546  GNASSDLSSPGSINSKSPEHSVPLVRVFDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLS 605

Query: 2080 PASESTTTS 2088
            P  ES  T+
Sbjct: 606  PPKESVATT 614



 Score = 34.8 bits (80), Expect = 0.75
 Identities = 27/129 (20%), Positives = 47/129 (36%), Gaps = 12/129 (9%)

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
            A++++T   P   ST ++ P++ S   +     S T S   + + T SS  S+  +  S 
Sbjct: 450  ANKTSTVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSS-LSQEASLISK 508

Query: 2101 ASESTTTSSPESESTTTSSPA----SESTTIEEQGVSP-HSEKLSANED------PEEFP 2149
            ++ +T   S     TT         S    +   G +   S  LS+             P
Sbjct: 509  STGNTHKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEHSVP 568

Query: 2150 NEDVFEHTF 2158
               VF+   
Sbjct: 569  LVRVFDIHL 577



 Score = 34.1 bits (78), Expect = 1.1
 Identities = 25/141 (17%), Positives = 47/141 (33%), Gaps = 9/141 (6%)

Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
            T + P   ST +S P + ST        S T S   + + T ++  S+  +  + ++ + 
Sbjct: 455  TVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSS-LSQEASLISKSTGNT 513

Query: 2066 TSSSPASESTTTSSPASEST--------TTSSPASESTTTSSPASESTTTSSPESESTTT 2117
               S     TT     + S         + ++  + S  +S  +  S +           
Sbjct: 514  HKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEHSVPLVRVF 573

Query: 2118 SSPASESTTIEEQGVSPHSEK 2138
                  STT         +EK
Sbjct: 574  DIHLRASTTKGRHSTPSTNEK 594


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 45.8 bits (109), Expect = 2e-04
 Identities = 17/115 (14%), Positives = 39/115 (33%), Gaps = 7/115 (6%)

Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
            +++  +S   S+  S+++    P+  STTT      +TT        TT ++  + +  T
Sbjct: 144  TTMADQS---SAELSQNSGQSVPLDTSTTT----DPATTPAPAAPVDTTPTNSQTPAVAT 196

Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
                +     N   + S  +   A+     +    +            +T +   
Sbjct: 197  APAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251



 Score = 45.0 bits (107), Expect = 3e-04
 Identities = 17/101 (16%), Positives = 37/101 (36%), Gaps = 4/101 (3%)

Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
             + +S+++  S P+  STTT   +  +T   +   ++T TN+      T   PA +    
Sbjct: 151  SAELSQNSGQSVPLDTSTTT---DPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDP-QQ 206

Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS 2108
            ++  + S      A+     +    +            +T 
Sbjct: 207  NAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247



 Score = 42.7 bits (101), Expect = 0.002
 Identities = 17/105 (16%), Positives = 36/105 (34%), Gaps = 4/105 (3%)

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
            S+ +S+++    P   STTT    + +         +T TN+      T+ +PA +    
Sbjct: 151  SAELSQNSGQSVPLDTSTTTDPATTPAPAAPVD---TTPTNSQTPAVATAPAPAVDPQQN 207

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            +  A  S      A+     +    +       +    +T +   
Sbjct: 208  AVVAP-SQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251



 Score = 37.3 bits (87), Expect = 0.087
 Identities = 14/98 (14%), Positives = 33/98 (33%), Gaps = 2/98 (2%)

Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTT-TNNPESESTTTSSPESESTTTSSLV-S 1932
             + +    V    ++     TT        TT TN+      T  +P  +    + +  S
Sbjct: 154  LSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPS 213

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
            ++   ++        ++P+  +   +     ST  + P
Sbjct: 214  QANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251



 Score = 35.0 bits (81), Expect = 0.49
 Identities = 29/132 (21%), Positives = 47/132 (35%), Gaps = 12/132 (9%)

Query: 2067 SSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
            SS+  S+++  S P   ST      ++  TT +PA+   TT +        ++PA     
Sbjct: 150  SSAELSQNSGQSVPLDTST-----TTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDP 204

Query: 2127 IEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPE----TFDARE 2182
             +   V+P     + +      P         A +P       T  A P      F A +
Sbjct: 205  QQNAVVAPSQA--NVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNALVMNFTA-D 261

Query: 2183 EWPQCKDVIGKV 2194
             W +  D  GK 
Sbjct: 262  CWLEVTDATGKK 273


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation factor
            (transactivator).  This family includes EBV BRLF1 and
            similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 45.9 bits (109), Expect = 2e-04
 Identities = 46/252 (18%), Positives = 78/252 (30%), Gaps = 24/252 (9%)

Query: 1909 NPESESTTTSSPESE--STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1966
            NPE    T +SP S+    T    + +  +   P     + SS   + + + S V  ++T
Sbjct: 198  NPEEILETRASPLSQFHGFTPHPSLPQPQSPLKP-----SPSSARPQQSESFSDVWPAST 252

Query: 1967 TSSPESESTTTSSPESESTTTSSLVSE--STTTSSPESESTTTISPVSESTTTSSPVSES 2024
             S  E  S    +P S S+   S   E     +S          S V +S+ +      +
Sbjct: 253  QSPREETSAEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYVPQSSDSRPSCFPA 312

Query: 2025 TTTISP--------ESESTTTSSPASESTTTNNPKSESTTTNNP--ASESITSSSPASES 2074
             +T  P        +                               A     S S  S  
Sbjct: 313  PSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAPRGRPSGSGQSPP 372

Query: 2075 TTTSSPASESTTTSSPASESTTTSSPA--SESTTTSSPESESTTTSSPASESTTIEEQGV 2132
            + +   +S + T S  A +  +   PA   +    +S +   T  SS   +     EQ +
Sbjct: 373  SNSPLLSSLADTPSGAAHQPASLLPPAVVQQQLEDASDKQPPTPGSSLVPQPD---EQEL 429

Query: 2133 SPHSEKLSANED 2144
             P    L   + 
Sbjct: 430  GPSVMALLDRDQ 441



 Score = 32.4 bits (74), Expect = 3.5
 Identities = 49/211 (23%), Positives = 77/211 (36%), Gaps = 24/211 (11%)

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSS---PESESTTTISPVSESTTTSSP-VSEST 2025
             + +  T  S E +    + + S   +  S   PE      +  +SE    ++     S 
Sbjct: 139  DDVKLCTQGSAERKRPPHTGIFSGLVSQQSFVLPEP----LLLEISEPGLLAASDADLSE 194

Query: 2026 TTISPESESTTTSSPASE--STTTNNPKSESTTTNNPASESITSSSPA-SESTTTSSPAS 2082
               +PE    T +SP S+    T +    +  +   P   S +S+ P  SES +   PAS
Sbjct: 195  LLQNPEEILETRASPLSQFHGFTPHPSLPQPQS---PLKPSPSSARPQQSESFSDVWPAS 251

Query: 2083 ESTTTSSPASESTTTS-SPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA 2141
                T SP  E++    +PAS S+   S   E     S          Q   P     S+
Sbjct: 252  ----TQSPREETSAEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYVP----QSS 303

Query: 2142 NEDPEEFPNEDVFEHTFAEIPNIDHSNQTDE 2172
            +  P  FP     + TF   PN +   + D 
Sbjct: 304  DSRPSCFPAPSTTQPTFLP-PNTNKKAKRDR 333


>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9.  The KAR9 protein in
            Saccharomyces cerevisiae is a cytoskeletal protein
            required for karyogamy, correct positioning of the
            mitotic spindle and for orientation of cytoplasmic
            microtubules. KAR9 localises at the shmoo tip in mating
            cells and at the tip of the growing bud in anaphase.
          Length = 626

 Score = 46.0 bits (109), Expect = 2e-04
 Identities = 43/282 (15%), Positives = 82/282 (29%), Gaps = 25/282 (8%)

Query: 1863 VIDNYSEIIFTTNNNS--ESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTT---- 1916
            V+D+      ++      +S  V  +  S    +  T S    S+    P          
Sbjct: 360  VVDHVLRDSQSSKIQQIRDSISVSGSDYSNPGSSIDTPSSSPSSSVIMTPPDSGPGSNVS 419

Query: 1917 ----TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 1972
                 +         + L+       +  S      S  S    + +  + ST    P  
Sbjct: 420  SRRVGTPGSKSDRVGAVLLRRMNIKPTLASIPDEKPSNISVFEDSETSPNSSTLLRDPPP 479

Query: 1973 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPES 2032
            +     S    +    + +  + ++  P S      S ++  T +      S+ ++   S
Sbjct: 480  KKCGEESGHLPNNPFFNKLKLTLSSIPPLSPRQ---SIITLPTPSRPASRISSLSLRLGS 536

Query: 2033 ESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
             S +  SP    T  +   +   + N     S++             P   +   + P  
Sbjct: 537  YSGSIVSPPPYPTLVSRKGAAGLSFN----RSVSDIEGERIGRYNLLP---TRIPALPFK 589

Query: 2093 ESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
              +TTSS  S     SS  S +     P S      E  + P
Sbjct: 590  AESTTSSRRS-----SSLPSPTGVIGFPGSVPRFDHENLLPP 626



 Score = 44.9 bits (106), Expect = 6e-04
 Identities = 43/254 (16%), Positives = 73/254 (28%), Gaps = 9/254 (3%)

Query: 1864 IDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESE 1923
            I++ S+ I  T       +  S L+  ++  T       +  +           S     
Sbjct: 315  IESKSKTISKTFTLIYKALEESILDKGVASRTNREL-APKWLSLKTVVDHVLRDSQSSKI 373

Query: 1924 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
                 S+    +  S+P S   T SS  S S   +   S   +  S     T        
Sbjct: 374  QQIRDSISVSGSDYSNPGSSIDTPSSSPSSSVIMTPPDSGPGSNVSSRRVGT---PGSKS 430

Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
                + L+       +  S      S +S    + +  + ST    P  +     S    
Sbjct: 431  DRVGAVLLRRMNIKPTLASIPDEKPSNISVFEDSETSPNSSTLLRDPPPKKCGEESGHLP 490

Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA-SESTTTSSPAS 2102
                NNP          +   ++         T S PAS  ++ S    S S +  SP  
Sbjct: 491  ----NNPFFNKLKLTLSSIPPLSPRQSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPP 546

Query: 2103 ESTTTSSPESESTT 2116
              T  S   +   +
Sbjct: 547  YPTLVSRKGAAGLS 560



 Score = 44.5 bits (105), Expect = 7e-04
 Identities = 57/308 (18%), Positives = 99/308 (32%), Gaps = 38/308 (12%)

Query: 1870 IIFTTNN---NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS-------S 1919
            ++FT  N         V  +L  L +  TT    ++ +T T+  ES+S T S        
Sbjct: 272  LLFTNLNHELQKMLDSVERSLQKLQNNKTTGMHLDNRTTMTDQIESKSKTISKTFTLIYK 331

Query: 1920 PESESTTTSSLVSESTTTSSP--------ESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
               ES     + S +    +P               S          S+    +  S+P 
Sbjct: 332  ALEESILDKGVASRTNRELAPKWLSLKTVVDHVLRDSQSSKIQQIRDSISVSGSDYSNPG 391

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
            S   T SS  S S     +++   +       S    +P S+S    + +      I P 
Sbjct: 392  SSIDTPSSSPSSSV----IMTPPDSGPGSNVSSRRVGTPGSKSDRVGAVLL-RRMNIKPT 446

Query: 2032 SESTTTSSP---ASESTTTNNPKSESTTTNNPASESITSSSPASES---------TTTSS 2079
              S     P   +    +  +P + ST   +P  +     S    +         T +S 
Sbjct: 447  LASIPDEKPSNISVFEDSETSP-NSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLTLSSI 505

Query: 2080 PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKL 2139
            P        S  +  T +   +  S+ +    S S +  SP    T +  +G +  S   
Sbjct: 506  PPLSPR--QSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAGLSFNR 563

Query: 2140 SANEDPEE 2147
            S ++   E
Sbjct: 564  SVSDIEGE 571


>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
          Length = 459

 Score = 45.7 bits (108), Expect = 2e-04
 Identities = 54/220 (24%), Positives = 91/220 (41%), Gaps = 16/220 (7%)

Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
             ++  S+VV + LN L ++N    S    +       S S      E ES+++S + + +
Sbjct: 145  KSDKASSVVSNELNLLKADN----SKPLAALQEKRSSSISPAGYQLEVESSSSSKINNSN 200

Query: 1935 TTTSSPESESTTTSSPES--ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
                S  +  T T + +   +  TTS    ES+TTS   S+       +  +  +++ +S
Sbjct: 201  NNNHSNSNLMTPTPNKDFFLDRFTTSH---ESSTTSCRPSDLVGHFPFQQLNYASNARLS 257

Query: 1993 ESTTTS---SPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
             +   +   S  S+S    S  S S T S      T++  P    T  S   S S  ++N
Sbjct: 258  TNPNPTLWFSQNSKSFEMNSEFSSSMTPSILPPSVTSSFLP----TPMSYKPSISLPSDN 313

Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
            P   S T N   +    + S  S S+  SS + E  + SS
Sbjct: 314  PSIPSFTVNGVRNWEAGAFSNNSNSSNGSSSSIELQSNSS 353



 Score = 31.9 bits (72), Expect = 4.4
 Identities = 41/206 (19%), Positives = 72/206 (34%), Gaps = 19/206 (9%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE-STTTSSPESESTTTSSLVSEST 1995
            T     ++++  S E       +    +       S  S      E ES+++S + + + 
Sbjct: 142  TDDKSDKASSVVSNELNLLKADNSKPLAALQEKRSSSISPAGYQLEVESSSSSKINNSNN 201

Query: 1996 TTSSPESESTTT---------ISPVSESTTTSSPVSESTTTISPES---ESTTTSSPASE 2043
               S  +  T T          +   ES+TTS   S+       +     S    S    
Sbjct: 202  NNHSNSNLMTPTPNKDFFLDRFTTSHESSTTSCRPSDLVGHFPFQQLNYASNARLSTNPN 261

Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTT--SSP----ASESTTTSSPASESTTT 2097
             T   +  S+S   N+  S S+T S      T++   +P     S S  + +P+  S T 
Sbjct: 262  PTLWFSQNSKSFEMNSEFSSSMTPSILPPSVTSSFLPTPMSYKPSISLPSDNPSIPSFTV 321

Query: 2098 SSPASESTTTSSPESESTTTSSPASE 2123
            +   +      S  S S+  SS + E
Sbjct: 322  NGVRNWEAGAFSNNSNSSNGSSSSIE 347


>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.  Members
            of this family are the S-layer-forming halobacterial
            major cell surface glycoprotein. The highest scores below
            model cutoffs are fragmentary paralogs to actual members
            of the family. Modifications include at N-linked and
            O-linked glycosylation, a C-terminal diphytanylglyceryl
            modification, and probable cleavage of the PGF-CTERM
            tail.
          Length = 782

 Score = 46.0 bits (109), Expect = 2e-04
 Identities = 31/151 (20%), Positives = 57/151 (37%), Gaps = 15/151 (9%)

Query: 1869 EIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTS 1928
            E ++       +  V  + N    +NT T           N +  S T  S +  ++   
Sbjct: 623  ESVYNPVEAGGTLEVAGSTNRKPDDNTIT-------VELLNEDDTSVTLESTDEWNSDGQ 675

Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
              V    +     + +      ++       +V E+        ++TT+  P + +T T+
Sbjct: 676  WSVEVDLSDVETGNYTVEADDGDNTDRVNVEVVEET-----ERPDTTTSEDPTTTTTPTT 730

Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTSS 2019
            +   E+T T+ P    TTT  P  E+TT SS
Sbjct: 731  TGPEETTETAEPT---TTTEEPTEETTTGSS 758



 Score = 39.1 bits (91), Expect = 0.031
 Identities = 52/266 (19%), Positives = 97/266 (36%), Gaps = 27/266 (10%)

Query: 1855 AATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNN-PESE 1913
                V++   D + E        ++ TV    L+S            + +T  +     +
Sbjct: 519  NYQEVSVDSDDTFDEEDIDIGGLTQGTVTAHILSSGRDGEIGDTGTSNGATLNDLIGYLD 578

Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
            +    S   E      L +    T+S +   T T       TT  S+ +      + E  
Sbjct: 579  TYAGGSNTGEQIREQILSNTVDDTASDDLIVTETFRLADGLTTIESVYNPVEAGGTLEVA 638

Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
             +T   P+  + T   L  + T   S   EST   +  S+   +   V    + +  E+ 
Sbjct: 639  GSTNRKPDDNTITVELLNEDDT---SVTLESTDEWN--SDGQWS---VEVDLSDV--ETG 688

Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
            + T  +   ++T   N +             +   +   ++TT+  P + +T T++   E
Sbjct: 689  NYTVEADDGDNTDRVNVE-------------VVEETERPDTTTSEDPTTTTTPTTTGPEE 735

Query: 2094 STTTSSPASESTTTSSPESESTTTSS 2119
            +T T+ P   +TTT  P  E+TT SS
Sbjct: 736  TTETAEP---TTTTEEPTEETTTGSS 758



 Score = 39.1 bits (91), Expect = 0.039
 Identities = 37/160 (23%), Positives = 63/160 (39%), Gaps = 16/160 (10%)

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
            T+S +   T T  L    TT  S  +      +     +T   P      TI+ E  +  
Sbjct: 602  TASDDLIVTETFRLADGLTTIESVYNPVEAGGTLEVAGSTNRKP---DDNTITVELLNED 658

Query: 2037 TSSPASESTTTNNPK---SESTTTNNPASESITSSSPASESTTTSS-------PASESTT 2086
             +S   EST   N     S     ++  + + T  +   ++T   +          ++TT
Sbjct: 659  DTSVTLESTDEWNSDGQWSVEVDLSDVETGNYTVEADDGDNTDRVNVEVVEETERPDTTT 718

Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
            +  P + +T T++   E+T T+ P    TTT  P  E+TT
Sbjct: 719  SEDPTTTTTPTTTGPEETTETAEPT---TTTEEPTEETTT 755


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
            binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 45.8 bits (108), Expect = 4e-04
 Identities = 61/249 (24%), Positives = 105/249 (42%), Gaps = 31/249 (12%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
            T SP      +  P   ST  + P S   T  + V +S  T++ +   T  S   +  TT
Sbjct: 191  TPSPLGRMVNSYAP---STFMTMPRSHGITPRTTVKDSEITATYKMLETNPSKRTAGKTT 247

Query: 1957 TSSL--VSESTTTS-SPESESTTTSSPES---ESTTTSSLVSESTTTSSP-----ESEST 2005
             + L  ++++T T  + E E+   +SP S   ++T T+    ES ++++      ++  T
Sbjct: 248  PTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307

Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST-TTNNPKSESTTTN---NPA 2061
            T    V E T      SE   TIS  + S+   + AS +     NP S ++        A
Sbjct: 308  TPQGTVLEHT---PATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASA 364

Query: 2062 SESITSSSPASESTTTSSPASESTTTSS--------PASESTTTSSPASESTTTSSPESE 2113
            +      +P++  +T ++P   +  T+         PA    TT SP+   TT   PE+ 
Sbjct: 365  TFRGLEKNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSPS--LTTALFPEAP 422

Query: 2114 STTTSSPAS 2122
            S + S+   
Sbjct: 423  SPSPSALPP 431



 Score = 41.1 bits (96), Expect = 0.008
 Identities = 63/277 (22%), Positives = 110/277 (39%), Gaps = 35/277 (12%)

Query: 1868 SEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT- 1926
            ++I  TT  N+ S     T      E    ++P + S   N+  S   T+     +S T 
Sbjct: 119  AKITPTTPKNNYSPTAAGT------ERVKEDTPATPSRALNHYIS---TSGRQRVKSYTP 169

Query: 1927 -TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSSPESE 1983
                 V  S+ T + E     T SP      +    + ST  + P S   T  T+  +SE
Sbjct: 170  KPRGEVKSSSPTQTREKVRKYTPSPLGRMVNS---YAPSTFMTMPRSHGITPRTTVKDSE 226

Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTT--SSPVSESTTTISPES---ESTTTS 2038
             T T  ++  + +  +    + T +  ++++T T  +  V E+    SP S   ++T T+
Sbjct: 227  ITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREV-ETDLLTSPRSVVEKNTLTT 285

Query: 2039 SPASESTTTNNPKSESTTTN--NPASESITSSSPASESTTTSSPASESTTTSSPASE--- 2093
                ES ++ N        N   P    +  +   SE   T S  + S+   + AS    
Sbjct: 286  PRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAW 345

Query: 2094 -----STTTSSPA---SESTTTSSPESESTTTSSPAS 2122
                  + TS+PA   + +T     ++ ST  S+PA+
Sbjct: 346  KIRNPLSRTSAPAVRIASATFRGLEKNPSTAPSTPAT 382



 Score = 33.4 bits (76), Expect = 2.2
 Identities = 36/178 (20%), Positives = 59/178 (33%), Gaps = 15/178 (8%)

Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
            +VS     SS E E    ++P +      +  S +          T   +P    TT  N
Sbjct: 74   MVSSDPPKSSSEME-GEMLAPQATVGRDEATPSIAMENTPSPPRRTAKITP----TTPKN 128

Query: 2050 PKSESTTTNNPASESI--TSSSPASESTTTSSPASESTTTSSPASE---STTTSSPASES 2104
              S +        E    T S   +   +TS      + T  P  E   S+ T +     
Sbjct: 129  NYSPTAAGTERVKEDTPATPSRALNHYISTSGRQRVKSYTPKPRGEVKSSSPTQTREKVR 188

Query: 2105 TTTSSPESESTTTSSPASESTTIEEQGVSPH-----SEKLSANEDPEEFPNEDVFEHT 2157
              T SP      + +P++  T     G++P      SE  +  +  E  P++     T
Sbjct: 189  KYTPSPLGRMVNSYAPSTFMTMPRSHGITPRTTVKDSEITATYKMLETNPSKRTAGKT 246


>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
            subunit [Transcription].
          Length = 548

 Score = 45.4 bits (107), Expect = 4e-04
 Identities = 33/197 (16%), Positives = 63/197 (31%), Gaps = 26/197 (13%)

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
            E + +++++   +     +  S S+  SS + E     SP  ++      +S+  TT  P
Sbjct: 199  EIQPSSSNNEAPKEGNNQT--SLSSIRSSKKQER----SPKKKAPQRDVSISDRATT--P 250

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
             +    ++S +  ST T         T    S    +S+  + +T      S     S  
Sbjct: 251  IAPGVESASQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTT----HVSMKKKESEN 306

Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE--------------QGVSPHS 2136
             SE        S      + + +  T ++  +     E               Q +SP  
Sbjct: 307  DSEQQLNFPKDSTDEIRKTIQHDVETNAAFQNPLFNDELKWWLASKRYLTQPLQEMSPSM 366

Query: 2137 EKLSANEDPEEFPNEDV 2153
                 N       + DV
Sbjct: 367  VSTLENSLLNCPDSLDV 383



 Score = 42.0 bits (98), Expect = 0.004
 Identities = 31/182 (17%), Positives = 62/182 (34%), Gaps = 16/182 (8%)

Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1968
              E + +++++   +     +  S S+  SS + E +          + S     +TT  
Sbjct: 197  GCEIQPSSSNNEAPKEGNNQT--SLSSIRSSKKQERSPKKKAPQRDVSISD---RATTPI 251

Query: 1969 SPESESTT---TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
            +P  ES +   +S+P   ST T     +  +     S   T  + VS     S   SE  
Sbjct: 252  APGVESASQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTTHVSMKKKESENDSEQQ 311

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTT------TNNP--ASESITSSSPASESTTT 2077
                 +S      +   +  T    ++            +    ++ +   SP+  ST  
Sbjct: 312  LNFPKDSTDEIRKTIQHDVETNAAFQNPLFNDELKWWLASKRYLTQPLQEMSPSMVSTLE 371

Query: 2078 SS 2079
            +S
Sbjct: 372  NS 373



 Score = 40.8 bits (95), Expect = 0.010
 Identities = 42/228 (18%), Positives = 72/228 (31%), Gaps = 14/228 (6%)

Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESES-TTTNNPESESTTTSSPESESTTTS-- 1928
            +  NN+    +   T+   +      +S  +E+    NN  S S+  SS + E +     
Sbjct: 177  YVENNDDPDFIEYDTIYEDMGCEIQPSSSNNEAPKEGNNQTSLSSIRSSKKQERSPKKKA 236

Query: 1929 -----SLVSESTTTSSPESESTT---TSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
                 S+   +TT  +P  ES +   +S+P   ST T     +  +     S   T ++ 
Sbjct: 237  PQRDVSISDRATTPIAPGVESASQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTTH 296

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
             S     S   SE       +S      +   +  T ++  +        E +    S  
Sbjct: 297  VSMKKKESENDSEQQLNFPKDSTDEIRKTIQHDVETNAAFQNPLFN---DELKWWLASKR 353

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
                       S  +T  N       S    S     + P S    TS
Sbjct: 354  YLTQPLQEMSPSMVSTLENSLLNCPDSLDVDSPICLYTKPLSLPHPTS 401



 Score = 35.4 bits (81), Expect = 0.45
 Identities = 33/189 (17%), Positives = 62/189 (32%), Gaps = 26/189 (13%)

Query: 1923 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1982
            E   +SS        ++  S S+  SS + E +          + S     +TT  +P  
Sbjct: 199  EIQPSSSNNEAPKEGNNQTSLSSIRSSKKQERSPKKKAPQRDVSISD---RATTPIAPGV 255

Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI-SPESESTTT---S 2038
            ES + S      ++T +P S  T         T     +    +T+ +P +  +     S
Sbjct: 256  ESASQS-----ISSTPTPVSTDTPLH------TVKDDSIKFDNSTLGTPTTHVSMKKKES 304

Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSSPASESTT------TSSP--ASESTTTSSP 2090
               SE        S          +  T+++  +           +S    ++     SP
Sbjct: 305  ENDSEQQLNFPKDSTDEIRKTIQHDVETNAAFQNPLFNDELKWWLASKRYLTQPLQEMSP 364

Query: 2091 ASESTTTSS 2099
            +  ST  +S
Sbjct: 365  SMVSTLENS 373


>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp). 
          Length = 305

 Score = 44.7 bits (106), Expect = 4e-04
 Identities = 22/108 (20%), Positives = 41/108 (37%), Gaps = 3/108 (2%)

Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
            S T  +     + T+   + ++  +  V+ ST T +P + +   +S  +   T  +    
Sbjct: 2    SVTELTFSYNLSDTTLFPNATSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTLH 61

Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
              T     S  T +         + SP + +T + SP       SSPA
Sbjct: 62   DVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPTP---VPSSPA 106



 Score = 38.6 bits (90), Expect = 0.032
 Identities = 25/112 (22%), Positives = 47/112 (41%), Gaps = 7/112 (6%)

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
             V+E T + +    S TT+ P + ++      + ST T  P + +    +  +  +T+ +
Sbjct: 2    SVTELTFSYNL---SDTTLFPNA-TSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVT 57

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
                  T  +  S  T + +       T SP + +T + SP   +   SSPA
Sbjct: 58   VTLHDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSP---TPVPSSPA 106



 Score = 38.2 bits (89), Expect = 0.041
 Identities = 20/108 (18%), Positives = 41/108 (37%), Gaps = 3/108 (2%)

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
            S T ++     + T+   + ++  +   + ST T +P + +    +  +   T       
Sbjct: 2    SVTELTFSYNLSDTTLFPNATSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTLH 61

Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
             +T  +  S  T + +       T SP + +T + SP       SSP 
Sbjct: 62   DVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPTP---VPSSPA 106



 Score = 37.0 bits (86), Expect = 0.10
 Identities = 26/109 (23%), Positives = 44/109 (40%), Gaps = 4/109 (3%)

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
              T  +   + S TT  P + S    + V+ ST T +P + +    S  +   T  ++  
Sbjct: 2    SVTELTFSYNLSDTTLFPNATSKGVKT-VTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTL 60

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
               T  +  S  T + +       T SP + +T + SP   +   SSPA
Sbjct: 61   HDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSP---TPVPSSPA 106



 Score = 35.5 bits (82), Expect = 0.32
 Identities = 27/115 (23%), Positives = 50/115 (43%), Gaps = 17/115 (14%)

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
            +E T + +L   S TT  P       +S     T TSS  +++ T ++    S+TT    
Sbjct: 4    TELTFSYNL---SDTTLFP-----NATSKGV-KTVTSSTDTKAPTNTTYRCVSSTTVPMT 54

Query: 1982 SESTTTSSLVSESTTTSSPESESTT-----TISPVSESTTTSSPVSESTTTISPE 2031
            + + T   +  ++  ++   S++ T     T SP + +T + SP   +    SP 
Sbjct: 55   NVTVTLHDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSP---TPVPSSPA 106



 Score = 34.7 bits (80), Expect = 0.56
 Identities = 22/114 (19%), Positives = 47/114 (41%), Gaps = 10/114 (8%)

Query: 1899 SPESESTTTN-NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
            S    + + N +  +     +S     T TSS  +++ T ++    S+TT    + + T 
Sbjct: 2    SVTELTFSYNLSDTTLFPNATSKGV-KTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTL 60

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
              +  ++  ++   S++ T    ++ S TT      +T + SP   +    SP 
Sbjct: 61   HDVTLQAYLSNGTFSKTETRCEADTPSPTTV-----ATPSPSP---TPVPSSPA 106



 Score = 32.8 bits (75), Expect = 1.9
 Identities = 18/103 (17%), Positives = 41/103 (39%), Gaps = 1/103 (0%)

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPES-ESTTTSSLVSESTTTSSPESESTTTISP 2010
            +E T + +L   +   ++      T +S    ++ T ++    S+TT    + + T    
Sbjct: 4    TELTFSYNLSDTTLFPNATSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTLHDV 63

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
              ++  ++   S++ T    ++ S TT +  S S T       
Sbjct: 64   TLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPTPVPSSPA 106



 Score = 31.2 bits (71), Expect = 6.4
 Identities = 29/120 (24%), Positives = 47/120 (39%), Gaps = 17/120 (14%)

Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1961
            +E T + N    S TT  P + S    + V+ ST T +P + +    S  +   T  ++ 
Sbjct: 4    TELTFSYNL---SDTTLFPNATSKGVKT-VTSSTDTKAPTNTTYRCVSSTTVPMTNVTVT 59

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
                T  +  S + T S  E+              T SP + +T + SP   +   SSP 
Sbjct: 60   LHDVTLQAYLS-NGTFSKTETRC---------EADTPSPTTVATPSPSP---TPVPSSPA 106


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
            family is found in eukaryotes; it has several conserved
            tryptophan residues. The function is not known.
          Length = 261

 Score = 43.9 bits (104), Expect = 5e-04
 Identities = 26/78 (33%), Positives = 36/78 (46%), Gaps = 5/78 (6%)

Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
             S S +T SL+S      SP S S  T     EST + S + E    S   S S     P
Sbjct: 1    LSCSDSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEGLKDSESSSNSV----P 55

Query: 1981 ESESTTTSSLVSESTTTS 1998
                ++T+S +S+S+T S
Sbjct: 56   SLSLSSTASSLSDSSTYS 73



 Score = 41.6 bits (98), Expect = 0.003
 Identities = 25/77 (32%), Positives = 35/77 (45%), Gaps = 5/77 (6%)

Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
             S S +T SL+S      SP S S  T     EST + S + E    S   S S     P
Sbjct: 1    LSCSDSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEGLKDSESSSNSV----P 55

Query: 2011 VSESTTTSSPVSESTTT 2027
                ++T+S +S+S+T 
Sbjct: 56   SLSLSSTASSLSDSSTY 72



 Score = 37.0 bits (86), Expect = 0.097
 Identities = 26/79 (32%), Positives = 37/79 (46%), Gaps = 5/79 (6%)

Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
             +  SLLS      SP S S  T +   EST + S   E        SES++ S P    
Sbjct: 5    DSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEG----LKDSESSSNSVPSLSL 59

Query: 1945 TTTSSPESESTTTSSLVSE 1963
            ++T+S  S+S+T S  + E
Sbjct: 60   SSTASSLSDSSTYSRSLKE 78



 Score = 35.9 bits (83), Expect = 0.19
 Identities = 23/77 (29%), Positives = 31/77 (40%), Gaps = 5/77 (6%)

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
             S S +T S  S      S  S S  T     EST + S + E    S   S S     P
Sbjct: 1    LSCSDSTRSLLSPLGNELS-PSSSDETEDCSEESTDSWSDMYEGLKDSESSSNSV----P 55

Query: 2031 ESESTTTSSPASESTTT 2047
                ++T+S  S+S+T 
Sbjct: 56   SLSLSSTASSLSDSSTY 72



 Score = 35.1 bits (81), Expect = 0.37
 Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 5/77 (6%)

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
            S S +T S  S      SP+S S  T +   EST + +   E +  S  +S S     P+
Sbjct: 2    SCSDSTRSLLSPLGNELSPSS-SDETEDCSEESTDSWSDMYEGLKDSESSSNSV----PS 56

Query: 2082 SESTTTSSPASESTTTS 2098
               ++T+S  S+S+T S
Sbjct: 57   LSLSSTASSLSDSSTYS 73



 Score = 33.5 bits (77), Expect = 1.2
 Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 5/78 (6%)

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
             S S +T S  S      +P S S  T + + ES  S S   E    S  +S S     P
Sbjct: 1    LSCSDSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEGLKDSESSSNSV----P 55

Query: 2091 ASESTTTSSPASESTTTS 2108
            +   ++T+S  S+S+T S
Sbjct: 56   SLSLSSTASSLSDSSTYS 73



 Score = 32.4 bits (74), Expect = 2.2
 Identities = 22/78 (28%), Positives = 32/78 (41%), Gaps = 5/78 (6%)

Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
             S S +T S +S      SP S   T      EST + S   E    +   S S     P
Sbjct: 1    LSCSDSTRSLLSPLGNELSPSSSDETE-DCSEESTDSWSDMYEGLKDSESSSNSV----P 55

Query: 2061 ASESITSSSPASESTTTS 2078
            +    +++S  S+S+T S
Sbjct: 56   SLSLSSTASSLSDSSTYS 73



 Score = 32.0 bits (73), Expect = 3.1
 Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 16/97 (16%)

Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTT------TSSPASESTTTSSPESEST 2115
            S S ++ S  S      SP+S S  T   + EST            SES++ S P    +
Sbjct: 2    SCSDSTRSLLSPLGNELSPSS-SDETEDCSEESTDSWSDMYEGLKDSESSSNSVPSLSLS 60

Query: 2116 TTSSPASESTT---------IEEQGVSPHSEKLSANE 2143
            +T+S  S+S+T         +E Q    +   LSA +
Sbjct: 61   STASSLSDSSTYSRSLKEVKLERQAQEAYENWLSAKQ 97



 Score = 32.0 bits (73), Expect = 3.3
 Identities = 26/83 (31%), Positives = 33/83 (39%), Gaps = 4/83 (4%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
             S S +T +  S      SP S S  T     EST + S   E    S   S S  + SL
Sbjct: 1    LSCSDSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEGLKDSESSSNSVPSLSL 59

Query: 1961 VSESTTTSSPESESTTTSSPESE 1983
               S+T SS    ST + S +  
Sbjct: 60   ---SSTASSLSDSSTYSRSLKEV 79


>gnl|CDD|177546 PHA03151, PHA03151, hypothetical protein; Provisional.
          Length = 259

 Score = 44.0 bits (103), Expect = 6e-04
 Identities = 32/198 (16%), Positives = 63/198 (31%), Gaps = 16/198 (8%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTT---------TSSPESESTTTSSPE 1951
            E +ST + N ++ES++       +++ S  V  ST          T      + + S+ +
Sbjct: 42   EDDSTPSENTKAESSSIDEDGLLTSSGSDSVFNSTDYESTPEPSKTPGFSDSNVSDSNND 101

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
             +          S+  SS     T+  S + E+   SS   +    S  +     TI   
Sbjct: 102  KDFDFKPQDEDTSSDDSSAPDFITSLVSSDCEARGLSSSEEDGEPYSKQKMSQPLTIDAK 161

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
            +E  T+         +   E +                 K + + +        +   P 
Sbjct: 162  TEEITSEEDCCVQEDSSDSEEDVVEAFIRQRAQMAGKKKKGKRSIST-------SDDEPP 214

Query: 2072 SESTTTSSPASESTTTSS 2089
             +S         S++T S
Sbjct: 215  RKSRRKRHSHRISSSTDS 232



 Score = 41.3 bits (96), Expect = 0.003
 Identities = 33/199 (16%), Positives = 69/199 (34%), Gaps = 6/199 (3%)

Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPES--ESTTTSSLVSESTTTSSPESESTTT 1977
            P  E  +T S  +++ ++S  E    T+S  +S   ST   S   E + T      + + 
Sbjct: 39   PTDEDDSTPSENTKAESSSIDEDGLLTSSGSDSVFNSTDYEST-PEPSKTPGFSDSNVSD 97

Query: 1978 SSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTT 2037
            S+ + +          S+  SS     T+ +S   E+   SS   +       +     T
Sbjct: 98   SNNDKDFDFKPQDEDTSSDDSSAPDFITSLVSSDCEARGLSSSEEDGEPYSKQKMSQPLT 157

Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS---PASES 2094
                +E  T+         +++   + + +               + + ++S   P  +S
Sbjct: 158  IDAKTEEITSEEDCCVQEDSSDSEEDVVEAFIRQRAQMAGKKKKGKRSISTSDDEPPRKS 217

Query: 2095 TTTSSPASESTTTSSPESE 2113
                     S++T S + E
Sbjct: 218  RRKRHSHRISSSTDSDDEE 236



 Score = 40.9 bits (95), Expect = 0.005
 Identities = 36/217 (16%), Positives = 73/217 (33%), Gaps = 13/217 (5%)

Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
            +P  + T      ++   ++  E+    +SS + +   TSS       ++  ES    + 
Sbjct: 27   APREKLTNVFKFPTDEDDSTPSENTKAESSSIDEDGLLTSSGSDSVFNSTDYESTPEPSK 86

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
            +P    +  S   ++      P+ E T        S+  SS     T+ +S + E+   S
Sbjct: 87   TPGFSDSNVSDSNNDKDFDFKPQDEDT--------SSDDSSAPDFITSLVSSDCEARGLS 138

Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
            S   +    +  K     T +  +E ITS         +S    +               
Sbjct: 139  SSEEDGEPYSKQKMSQPLTIDAKTEEITSEEDCCVQEDSSDSEEDVVEAFIRQRAQMAGK 198

Query: 2099 SPASESTTTSSPE-----SESTTTSSPASESTTIEEQ 2130
                + + ++S +     S     S   S ST  +++
Sbjct: 199  KKKGKRSISTSDDEPPRKSRRKRHSHRISSSTDSDDE 235



 Score = 39.4 bits (91), Expect = 0.016
 Identities = 30/211 (14%), Positives = 66/211 (31%), Gaps = 13/211 (6%)

Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
            ++++ S    +  +S+  +   T+S       + + ES    + +P    +  S   ++ 
Sbjct: 43   DDSTPSENTKAESSSIDEDGLLTSSGSDSVFNSTDYESTPEPSKTPGFSDSNVSDSNNDK 102

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
                 P+ E T++    +    TS          S + E+   SS E +    S      
Sbjct: 103  DFDFKPQDEDTSSDDSSAPDFITS--------LVSSDCEARGLSSSEEDGEPYSKQKMSQ 154

Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP---- 2050
              T   ++E  T+         +S    +       +           + + + +     
Sbjct: 155  PLTIDAKTEEITSEEDCCVQEDSSDSEEDVVEAFIRQRAQMAGKKKKGKRSISTSDDEPP 214

Query: 2051 -KSESTTTNNPASESITSSSPASESTTTSSP 2080
             KS     ++  S S  S         T +P
Sbjct: 215  RKSRRKRHSHRISSSTDSDDEEPRHKMTGTP 245



 Score = 35.9 bits (82), Expect = 0.22
 Identities = 32/149 (21%), Positives = 57/149 (38%), Gaps = 4/149 (2%)

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
            P  E  +T S  +++ ++ S + +   TSS +     + + +S    +  P       S 
Sbjct: 39   PTDEDDSTPSENTKAESS-SIDEDGLLTSSGSDSVFNSTDYESTPEPSKTPGFSDSNVSD 97

Query: 2070 PASESTTTSSPASE--STTTSSPASESTTTSSPASESTTTSSPESESTTTS-SPASESTT 2126
              ++      P  E  S+  SS     T+  S   E+   SS E +    S    S+  T
Sbjct: 98   SNNDKDFDFKPQDEDTSSDDSSAPDFITSLVSSDCEARGLSSSEEDGEPYSKQKMSQPLT 157

Query: 2127 IEEQGVSPHSEKLSANEDPEEFPNEDVFE 2155
            I+ +     SE+    ++      EDV E
Sbjct: 158  IDAKTEEITSEEDCCVQEDSSDSEEDVVE 186


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein; Provisional.
          Length = 417

 Score = 44.6 bits (105), Expect = 6e-04
 Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 5/76 (6%)

Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
            ++ STT SS++  S+T S      TT  + E+    + S++S  T     E  ++ + SP
Sbjct: 2    DNSSTTLSSIILSSSTLSP-----TTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSP 56

Query: 1981 ESESTTTSSLVSESTT 1996
             S S+++SS  S S T
Sbjct: 57   NSNSSSSSSSSSSSIT 72



 Score = 44.2 bits (104), Expect = 6e-04
 Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 5/86 (5%)

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
            ++ STT SS++  S+T S   +   T  + + ES +  S  +E   T  P SE   + SP
Sbjct: 2    DNSSTTLSSIILSSSTLSP--TTFFTIETSMDESKSIISTFTEIIPTEIPTSE---SPSP 56

Query: 2041 ASESTTTNNPKSESTTTNNPASESIT 2066
             S S+++++  S S T +     +IT
Sbjct: 57   NSNSSSSSSSSSSSITYDYEYENNIT 82



 Score = 44.2 bits (104), Expect = 6e-04
 Identities = 28/79 (35%), Positives = 39/79 (49%), Gaps = 11/79 (13%)

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPES---ESTTTISPVSESTTTSSPVSESTTT 2027
            ++ STT SS        SS     TT  + E+   ES + IS  +E   T  P SES + 
Sbjct: 2    DNSSTTLSS-----IILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPS- 55

Query: 2028 ISPESESTTTSSPASESTT 2046
              P S S+++SS +S S T
Sbjct: 56   --PNSNSSSSSSSSSSSIT 72



 Score = 43.1 bits (101), Expect = 0.002
 Identities = 24/79 (30%), Positives = 35/79 (44%), Gaps = 12/79 (15%)

Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
            ++ STT SS        SS     TT  + E+          ES +  S  +E   T  P
Sbjct: 2    DNSSTTLSS-----IILSSSTLSPTTFFTIETSM-------DESKSIISTFTEIIPTEIP 49

Query: 1971 ESESTTTSSPESESTTTSS 1989
             SES + +S  S S+++SS
Sbjct: 50   TSESPSPNSNSSSSSSSSS 68



 Score = 42.3 bits (99), Expect = 0.003
 Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 1/68 (1%)

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
            S+TT+S +  S++T SP +  T   S   ES +  S  +E   T  P SES + N+ +S 
Sbjct: 4    SSTTLSSIILSSSTLSPTTFFTIETS-MDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62

Query: 2064 SITSSSPA 2071
            S +SSS +
Sbjct: 63   SSSSSSSS 70



 Score = 41.5 bits (97), Expect = 0.004
 Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
            S+TT S +  S++T+SP +  T  +S   ES +  +  +E   T  P SES + +S +S 
Sbjct: 4    SSTTLSSIILSSSTLSPTTFFTIETS-MDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62

Query: 2074 STTTSSPA 2081
            S+++SS +
Sbjct: 63   SSSSSSSS 70



 Score = 40.8 bits (95), Expect = 0.009
 Identities = 27/86 (31%), Positives = 37/86 (43%), Gaps = 15/86 (17%)

Query: 1941 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 2000
            ++ STT SS        SS     TT  + E+          ES +  S  +E   T  P
Sbjct: 2    DNSSTTLSS-----IILSSSTLSPTTFFTIETSM-------DESKSIISTFTEIIPTEIP 49

Query: 2001 ESESTTTISPVSESTTTSSPVSESTT 2026
             SES +   P S S+++SS  S S T
Sbjct: 50   TSESPS---PNSNSSSSSSSSSSSIT 72



 Score = 40.4 bits (94), Expect = 0.010
 Identities = 22/68 (32%), Positives = 39/68 (57%), Gaps = 1/68 (1%)

Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
            S+TT+S    S++T SP +  T   +   ES +  +  +E I +  P SES + +S +S 
Sbjct: 4    SSTTLSSIILSSSTLSPTTFFTIETS-MDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62

Query: 2084 STTTSSPA 2091
            S+++SS +
Sbjct: 63   SSSSSSSS 70



 Score = 40.4 bits (94), Expect = 0.012
 Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 15/86 (17%)

Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
            ++ STT SS++  S+T S      TT  + E+          ES +  S  +E   T  P
Sbjct: 2    DNSSTTLSSIILSSSTLSP-----TTFFTIETSM-------DESKSIISTFTEIIPTEIP 49

Query: 2011 VSESTTTSSPVSESTTTISPESESTT 2036
             SE   + SP S S+++ S  S S T
Sbjct: 50   TSE---SPSPNSNSSSSSSSSSSSIT 72



 Score = 40.0 bits (93), Expect = 0.015
 Identities = 24/66 (36%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
            S+TT S    S++T SP +  T  +S+  ES +  S  +E   T  P SES + +S  S 
Sbjct: 4    SSTTLSSIILSSSTLSPTTFFTIETSM-DESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62

Query: 1994 STTTSS 1999
            S+++SS
Sbjct: 63   SSSSSS 68



 Score = 39.2 bits (91), Expect = 0.022
 Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 9/77 (11%)

Query: 2052 SESTTTNNPASESITSSSPASEST--TTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
            + STT     S  I SSS  S +T  T  +   ES +  S  +E   T  P SE   + S
Sbjct: 3    NSSTTL----SSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSE---SPS 55

Query: 2110 PESESTTTSSPASESTT 2126
            P S S+++SS +S S T
Sbjct: 56   PNSNSSSSSSSSSSSIT 72



 Score = 38.8 bits (90), Expect = 0.032
 Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 3/70 (4%)

Query: 2067 SSSPASESTTTSSPASESTTTSSPAS---ESTTTSSPASESTTTSSPESESTTTSSPASE 2123
            +SS    S   SS     TT  +  +   ES +  S  +E   T  P SES + +S +S 
Sbjct: 3    NSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62

Query: 2124 STTIEEQGVS 2133
            S++     ++
Sbjct: 63   SSSSSSSSIT 72



 Score = 38.8 bits (90), Expect = 0.033
 Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 12/95 (12%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVS---ESTTTSSPESESTTTSSPESESTTT 1957
            ++ STT +     S   SS     TT  ++ +   ES +  S  +E   T  P SES + 
Sbjct: 2    DNSSTTLS-----SIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSP 56

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
            +S    ++++SS  S S+ T   E E+  T  L++
Sbjct: 57   NS----NSSSSSSSSSSSITYDYEYENNITYELIN 87



 Score = 38.5 bits (89), Expect = 0.047
 Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPES---ESTTTNNPESESTTTSSPESESTTTSSLV 1931
             +NS +T+    L+S     TT  + E+   ES +  +  +E   T  P SES + +S  
Sbjct: 1    MDNSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNS-- 58

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVS 1962
              ++++SS  S S+ T   E E+  T  L++
Sbjct: 59   --NSSSSSSSSSSSITYDYEYENNITYELIN 87



 Score = 38.1 bits (88), Expect = 0.052
 Identities = 18/70 (25%), Positives = 32/70 (45%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
             +S    S   ++     TT  + E+    + S++S  T     E  ++ + SP S S++
Sbjct: 3    NSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62

Query: 1957 TSSLVSESTT 1966
            +SS  S S T
Sbjct: 63   SSSSSSSSIT 72



 Score = 37.7 bits (87), Expect = 0.083
 Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 5/76 (6%)

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
            ++ STT SS    S+T +     +  T+   S+SI S+   +E   T  P SE   + SP
Sbjct: 2    DNSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIIST--FTEIIPTEIPTSE---SPSP 56

Query: 2091 ASESTTTSSPASESTT 2106
             S S+++SS +S S T
Sbjct: 57   NSNSSSSSSSSSSSIT 72



 Score = 37.3 bits (86), Expect = 0.11
 Identities = 21/66 (31%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE 2103
            S+TT +    S++T +P +   T  +   ES +  S  +E   T  P SES + +S +S 
Sbjct: 4    SSTTLSSIILSSSTLSPTTFF-TIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62

Query: 2104 STTTSS 2109
            S+++SS
Sbjct: 63   SSSSSS 68



 Score = 36.1 bits (83), Expect = 0.20
 Identities = 25/82 (30%), Positives = 36/82 (43%), Gaps = 1/82 (1%)

Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
            S+TT S    S++T SP +  T  +S   ES +  S  +E   T  P SES +      S
Sbjct: 4    SSTTLSSIILSSSTLSPTTFFTIETS-MDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62

Query: 2134 PHSEKLSANEDPEEFPNEDVFE 2155
              S   S+     E+ N   +E
Sbjct: 63   SSSSSSSSITYDYEYENNITYE 84



 Score = 32.7 bits (74), Expect = 2.8
 Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 3/64 (4%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
              ++++ S     T+ + + E+ +  S  +E   T  P SE   + SP S S+++SS  S
Sbjct: 12   ILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSE---SPSPNSNSSSSSSSSS 68

Query: 1933 ESTT 1936
             S T
Sbjct: 69   SSIT 72


>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
            Validated.
          Length = 709

 Score = 44.7 bits (105), Expect = 6e-04
 Identities = 45/200 (22%), Positives = 68/200 (34%), Gaps = 27/200 (13%)

Query: 1993 ESTTTSSPESESTTTI-SPVSESTTTSSPVS-ESTTTISPESESTTTSSPASESTTTNNP 2050
            +S +  + E E+      P  E+ T  +PV   S   +  E +   T+ P S     + P
Sbjct: 378  QSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAMPSEGK---TAGPVSNQENNDVP 434

Query: 2051 -----KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
                   E+ T    A  S  S   ASE+  T      S   ++        S   SE+ 
Sbjct: 435  PWEDAPDEAQTAAGTAQTSAKSIQTASEA-ETPPENQVSKNKAADNETDAPLSEVPSENP 493

Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNID 2165
              ++P  E+  T + A E+      G                FP+ D      AEIP  D
Sbjct: 494  IQATPNDEAVETETFAHEAPAEPFYGYG--------------FPDNDCPPEDGAEIPPPD 539

Query: 2166 --HSNQTDEAIPETFDAREE 2183
              H+   D A     +  E 
Sbjct: 540  WEHAAPADTAGGGADEEAEA 559


>gnl|CDD|240323 PTZ00233, PTZ00233, variable surface protein Vir18; Provisional.
          Length = 509

 Score = 44.2 bits (104), Expect = 8e-04
 Identities = 58/329 (17%), Positives = 111/329 (33%), Gaps = 54/329 (16%)

Query: 1894 NTTTNSPESESTTTNN-PESESTTTSSPES-ESTTTSSLVSESTTTSS---PESESTTTS 1948
                  P  + TT       +   T +P+  ++ + S L    +   S    + +  + +
Sbjct: 104  FPAKKPPLIKPTTQEPCKGGKGCKTETPQRVDTKSQSKLRPVPSKAKSLEIKDPQEQSQN 163

Query: 1949 SPESESTTTSSLV--SESTTTSSPESESTTTSSPES---ESTTTSSLVSESTT--TSSPE 2001
              +++ +   S+V   +S +  SP S  T    P+S     +TTS +    T    +S +
Sbjct: 164  QADAQESNKESVVLQPQSDSMPSPSSIGTEDKEPQSIVNHHSTTSGMGETQTQQLNASGD 223

Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS-----ESTTTNNPKSESTT 2056
            S      S   +  +      ES  T +   E+  TS   +     ++   N+P+++ + 
Sbjct: 224  SPIRELDSSAGDPPSECVSGKESDLTCTSTGENLDTSLFQTNLSSGKTLDANHPETQDSA 283

Query: 2057 TNNPASE---------------SITSSSPASE--------STTTSSPAS-------ESTT 2086
             N    +               S    +P S          T T   ++       E+T 
Sbjct: 284  GNVIEVQTHGDKDIITEAADNLSSLEGTPGSVQLADEDSVDTDTDRGSTGAVASDPENTG 343

Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH-SEKL------ 2139
            T   +SES  ++     S    +    S    +   ES  +E      H SE +      
Sbjct: 344  TEETSSESLVSAPSGDVSNGGITEVDISNDDKAVDGESNGVEISHDQEHDSETICNESTC 403

Query: 2140 SANEDPEEFPNEDVFEHTFAEIPNIDHSN 2168
               ++ E   +        A+I N+  SN
Sbjct: 404  REEQNGELTDDGGDKLDILAQIFNVIQSN 432



 Score = 34.9 bits (80), Expect = 0.52
 Identities = 39/211 (18%), Positives = 75/211 (35%), Gaps = 15/211 (7%)

Query: 1865 DNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESES 1924
            D  SE +    ++   T     L++ L +   TN    ++   N+PE++ +  +  E ++
Sbjct: 235  DPPSECVSGKESDLTCTSTGENLDTSLFQ---TNLSSGKTLDANHPETQDSAGNVIEVQT 291

Query: 1925 TTTSSLVSE-STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
                 +++E +   SS E    +    + +S          T T      ST   + + E
Sbjct: 292  HGDKDIITEAADNLSSLEGTPGSVQLADEDSV--------DTDTDRG---STGAVASDPE 340

Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
            +T T    SES  ++     S   I+ V  S    +   ES        +   + +  +E
Sbjct: 341  NTGTEETSSESLVSAPSGDVSNGGITEVDISNDDKAVDGESNGVEISHDQEHDSETICNE 400

Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASES 2074
            ST       E T       + +       +S
Sbjct: 401  STCREEQNGELTDDGGDKLDILAQIFNVIQS 431


>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
          Length = 548

 Score = 44.1 bits (104), Expect = 8e-04
 Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 2/51 (3%)

Query: 2357 HSVKIIGWGKSSQNEPYWLCTNSY--NQGWGEQGLFKIRRGVNMCSIEDSV 2405
            H+V IIGWG       YWL  + +   + W + G  KI RGVN  +IE  V
Sbjct: 404  HTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESEV 454


>gnl|CDD|224343 COG1426, COG1426, Predicted transcriptional regulator contains
            Xre-like HTH domain [Function unknown].
          Length = 284

 Score = 43.2 bits (102), Expect = 9e-04
 Identities = 31/168 (18%), Positives = 58/168 (34%), Gaps = 21/168 (12%)

Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
            P +    + + VS+++   S  + STT     SE TT++SP S +T        S T + 
Sbjct: 134  PPTLPDQSVASVSQNSQDVSLATSSTTP----SEGTTSASPSSATT--------SFTPTV 181

Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
             A         K  +      A  + +S++PA++   T   A     T+     +    S
Sbjct: 182  TAIAPVVAPTAKPVTVPKQPAADLAASSTAPAAKEMATGQEAVP---TAGSGVTTVAGKS 238

Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEE 2147
             A     T+               +  +   G++   + L+       
Sbjct: 239  AALVINFTAD------CWIEVTDANGKVLFSGLTKKGDSLTLTGKAPY 280



 Score = 42.4 bits (100), Expect = 0.002
 Identities = 27/132 (20%), Positives = 49/132 (37%), Gaps = 17/132 (12%)

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
            P +    + +  S+++   SL + STT S    E TT+ SP S +T+ +  V+     ++
Sbjct: 134  PPTLPDQSVASVSQNSQDVSLATSSTTPS----EGTTSASPSSATTSFTPTVTAIAPVVA 189

Query: 2030 P-ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
            P     T    PA++   +      ST        +   + P + S         +    
Sbjct: 190  PTAKPVTVPKQPAADLAAS------STAPAAKEMATGQEAVPTAGS------GVTTVAGK 237

Query: 2089 SPASESTTTSSP 2100
            S A     T+  
Sbjct: 238  SAALVINFTADC 249



 Score = 35.9 bits (83), Expect = 0.19
 Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 4/75 (5%)

Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
             P +    S +  S+++   S A+ STT S    E TT++SP+S +T+ +   +      
Sbjct: 133  EPPTLPDQSVASVSQNSQDVSLATSSTTPS----EGTTSASPSSATTSFTPTVTAIAPVV 188

Query: 2119 SPASESTTIEEQGVS 2133
            +P ++  T+ +Q  +
Sbjct: 189  APTAKPVTVPKQPAA 203


>gnl|CDD|219927 pfam08601, PAP1, Transcription factor PAP1.  The transcription factor
            Pap1 regulates antioxidant-gene transcription in response
            to H2O2. This region is cysteine rich. Alkylation of
            cysteine residues following treatment with a cysteine
            alkylating agent can mask the accessibility of the
            nuclear exporter Crm1, triggering nuclear accumulation
            and Pap1 dependent transcriptional expression.
          Length = 344

 Score = 43.8 bits (103), Expect = 0.001
 Identities = 28/164 (17%), Positives = 56/164 (34%), Gaps = 8/164 (4%)

Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST-TTNNPKSESTTT 2057
                 +  T +        +S VS     +   +ES  +S   + +  T+ +  + +  +
Sbjct: 31   GKLPGACGTKNCPIPKLAKNSSVSSPVPGLLNSTESNVSSPNNNPNGYTSPSSAAMNNKS 90

Query: 2058 NNPASESITSSSPASESTTTSSP------ASESTTTSSPASESTTTSSPASESTTTSSPE 2111
            NN A +   ++S AS ++            S   ++S   S  +   +    S+  +SPE
Sbjct: 91   NNRAVDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPSSGSDGFTNQLLSSLGTSPE 150

Query: 2112 SESTTTSSPASEST-TIEEQGVSPHSEKLSANEDPEEFPNEDVF 2154
              + +    AS +           +S   SA       P  D  
Sbjct: 151  PSTESPPQLASVNNFAAIRNNAESNSNVPSAASSTPNIPGIDFL 194



 Score = 38.4 bits (89), Expect = 0.039
 Identities = 31/159 (19%), Positives = 60/159 (37%), Gaps = 4/159 (2%)

Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1965
            T N P  +    SS  S      +  S  +  SSP +     +SP S +    S      
Sbjct: 39   TKNCPIPKLAKNSSVSSPVPGLLN--STESNVSSPNNNPNGYTSPSSAAMNNKSNNR--A 94

Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
               S  + + +T+SP    ++ +   S   ++S   S  +   +    S+  +SP   + 
Sbjct: 95   VDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPSSGSDGFTNQLLSSLGTSPEPSTE 154

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            +     S +   +   +  + +N P + S+T N P  + 
Sbjct: 155  SPPQLASVNNFAAIRNNAESNSNVPSAASSTPNIPGIDF 193



 Score = 38.4 bits (89), Expect = 0.042
 Identities = 37/191 (19%), Positives = 74/191 (38%), Gaps = 10/191 (5%)

Query: 1894 NTTTNSPESESTTTNNPESE---STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1950
            N   N   S S   NN             P +  T    +   +  +S         +S 
Sbjct: 5    NQLHNDCSSMSNFNNNNFDFDFPKFCGKLPGACGTKNCPIPKLAKNSSVSSPVPGLLNST 64

Query: 1951 ESESTTTSSLVSESTTTSS------PESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
            ES  ++ ++  +  T+ SS        + +   S+  S ++T S    +S+ T    +++
Sbjct: 65   ESNVSSPNNNPNGYTSPSSAAMNNKSNNRAVDPSANASAASTNSPNGLQSSATQYNSNDN 124

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            +++ SP S S   ++ +  S  T SPE  + +    AS +       +  + +N P++ S
Sbjct: 125  SSSDSPSSGSDGFTNQLLSSLGT-SPEPSTESPPQLASVNNFAAIRNNAESNSNVPSAAS 183

Query: 2065 ITSSSPASEST 2075
             T + P  +  
Sbjct: 184  STPNIPGIDFL 194



 Score = 37.6 bits (87), Expect = 0.076
 Identities = 23/166 (13%), Positives = 64/166 (38%), Gaps = 5/166 (3%)

Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
            P +  T    +   +  +S         +S ES  ++ ++  +  T+  S  + +  + +
Sbjct: 34   PGACGTKNCPIPKLAKNSSVSSPVPGLLNSTESNVSSPNNNPNGYTS-PSSAAMNNKSNN 92

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
               + +  +S  S +    SP    ++ +   S   ++++  S  +        S   +S
Sbjct: 93   RAVDPSANASAASTN----SPNGLQSSATQYNSNDNSSSDSPSSGSDGFTNQLLSSLGTS 148

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
            P   + +    AS +   +   +  + ++ P++ S+T + P  +  
Sbjct: 149  PEPSTESPPQLASVNNFAAIRNNAESNSNVPSAASSTPNIPGIDFL 194



 Score = 37.2 bits (86), Expect = 0.088
 Identities = 30/151 (19%), Positives = 62/151 (41%), Gaps = 11/151 (7%)

Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
            P +  T        +  +S  +      N+ +S  ++ NN  +   TS S A+ +  +++
Sbjct: 34   PGACGTKNCPIPKLAKNSSVSSPVPGLLNSTESNVSSPNNNPN-GYTSPSSAAMNNKSNN 92

Query: 2080 PASESTTTSSPASESTTTSS--PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSE 2137
             A + +  +S AS ++       A++  +  +  S+S ++ S    +  +   G SP   
Sbjct: 93   RAVDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPSSGSDGFTNQLLSSLGTSP--- 149

Query: 2138 KLSANEDPEEFPNEDVFEHTFAEIPNIDHSN 2168
               + E P +  + +     FA I N   SN
Sbjct: 150  -EPSTESPPQLASVN----NFAAIRNNAESN 175



 Score = 34.9 bits (80), Expect = 0.55
 Identities = 19/114 (16%), Positives = 49/114 (42%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
              N N  ++   + +N+  +      S  + + +TN+P    ++ +   S   ++S   S
Sbjct: 72   NNNPNGYTSPSSAAMNNKSNNRAVDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPS 131

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1986
              +   + +  S+  +SPE  + +   L S +   +   +  + ++ P + S+T
Sbjct: 132  SGSDGFTNQLLSSLGTSPEPSTESPPQLASVNNFAAIRNNAESNSNVPSAASST 185



 Score = 34.1 bits (78), Expect = 0.80
 Identities = 24/140 (17%), Positives = 54/140 (38%), Gaps = 7/140 (5%)

Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
            N++ES V     N       T+ S  + +  +NN   + +  +S  S ++      S + 
Sbjct: 62   NSTESNVSSPNNNPNGY---TSPSSAAMNNKSNNRAVDPSANASAASTNSPNGLQSSATQ 118

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
              S+      ++S   S  +   +    S+  +SPE  + +     S +   +   +  +
Sbjct: 119  YNSN----DNSSSDSPSSGSDGFTNQLLSSLGTSPEPSTESPPQLASVNNFAAIRNNAES 174

Query: 1996 TTSSPESESTTTISPVSEST 2015
             ++ P + S+T   P  +  
Sbjct: 175  NSNVPSAASSTPNIPGIDFL 194


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 44.3 bits (105), Expect = 0.001
 Identities = 20/185 (10%), Positives = 46/185 (24%), Gaps = 7/185 (3%)

Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
               +       E+E      +V+E    ++ E   +              +V        
Sbjct: 849  RPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEE 908

Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA-SESTTTNNPKSESTTTN 2058
                 TT    ++       PV+E    I+    +        +E       ++      
Sbjct: 909  VVVVETTHPEVIAA------PVTEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEA 962

Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
               +E + +             A  +    +  +     +       T     + +  T 
Sbjct: 963  AETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVAPAQVPEATVEHNHATAPMTR 1022

Query: 2119 SPASE 2123
            +PA E
Sbjct: 1023 APAPE 1027


>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
            This family includes attachment proteins from respiratory
            synctial virus. Glycoprotein G has not been shown to have
            any neuraminidase or hemagglutinin activity. The amino
            terminus is thought to be cytoplasmic, and the carboxyl
            terminus extracellular. The extracellular region contains
            four completely conserved cysteine residues.
          Length = 263

 Score = 43.1 bits (101), Expect = 0.001
 Identities = 40/210 (19%), Positives = 80/210 (38%), Gaps = 9/210 (4%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            P +  T   + + ++ T++ L   +  ++SP ++STTT    +    T+     +   ++
Sbjct: 62   PTTTPTQQITNQIQNHTSTYLTQHNQLSTSPSNQSTTTPLIHTILDDTTPGTKSTYQHTT 121

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP-VSESTTTSSPVSESTTTI 2028
              ++  TT+  ++    T     +S     P+ +    +   V  S   ++P   S    
Sbjct: 122  VGTKGRTTTPAQTNKPPTKP--RQSNPPEKPQDDFHFEVFNFVPCSICENNPACLSICKR 179

Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
             PE        P    TT    K +  TT        T S  A+    TS P   +T T+
Sbjct: 180  IPE------KKPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTN 233

Query: 2089 SPASESTTTSSPASESTTTSSPESESTTTS 2118
            +   + T  S+  + +   +S      +T+
Sbjct: 234  TTTPQFTPLSTTTTRNPELTSQMETFHSTN 263



 Score = 42.4 bits (99), Expect = 0.002
 Identities = 39/206 (18%), Positives = 75/206 (36%), Gaps = 24/206 (11%)

Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
            +S +    +P +  T   + + ++ T++ L   +  ++SP ++STTT    +    T+  
Sbjct: 53   ISSANHKVTPTTTPTQQITNQIQNHTSTYLTQHNQLSTSPSNQSTTTPLIHTILDDTTPG 112

Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSE-------------------STTTNNPA 2061
               +    +  ++  TT+   +    T   +S                    S   NNPA
Sbjct: 113  TKSTYQHTTVGTKGRTTTPAQTNKPPTKPRQSNPPEKPQDDFHFEVFNFVPCSICENNPA 172

Query: 2062 SESI----TSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
              SI        P    TT  +   +  TT       TT S  A+ +   +S  ++ TT 
Sbjct: 173  CLSICKRIPEKKPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEAT-THHPTSEPTKLTTK 231

Query: 2118 SSPASESTTIEEQGVSPHSEKLSANE 2143
            ++  +   T      + + E  S  E
Sbjct: 232  TNTTTPQFTPLSTTTTRNPELTSQME 257



 Score = 41.2 bits (96), Expect = 0.004
 Identities = 42/232 (18%), Positives = 82/232 (35%), Gaps = 11/232 (4%)

Query: 1854 LAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLS--ENTTTNSPESESTTTNNPE 1911
            L+  A+ IS     + IIF ++ N + T   +    + +  +N T+      +  + +P 
Sbjct: 34   LSILAMIISTSLIIAAIIFISSANHKVTPTTTPTQQITNQIQNHTSTYLTQHNQLSTSPS 93

Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
            ++STTT    +    T+     +   ++  ++  TT+  ++    T     +S     P+
Sbjct: 94   NQSTTTPLIHTILDDTTPGTKSTYQHTTVGTKGRTTTPAQTNKPPTKP--RQSNPPEKPQ 151

Query: 1972 SESTTTSSP-----ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
             +              E+      + +      P    TT  +   +  TT       TT
Sbjct: 152  DDFHFEVFNFVPCSICENNPACLSICKRIPEKKPGKAPTTKPTKKPKPKTTKKDTKTQTT 211

Query: 2027 TISPESESTTTSSPASESTTTN--NPKSESTTTNNPASESITSSSPASESTT 2076
                 +    TS P   +T TN   P+    +T    +  +TS      ST 
Sbjct: 212  KSKEATTHHPTSEPTKLTTKTNTTTPQFTPLSTTTTRNPELTSQMETFHSTN 263


>gnl|CDD|219426 pfam07489, Tir_receptor_C, Translocated intimin receptor (Tir)
            C-terminus.  Intimin and its translocated intimin
            receptor (Tir) are bacterial proteins that mediate
            adhesion between mammalian cells and attaching and
            effacing (A/E) pathogens. A unique and essential feature
            of A/E bacterial pathogens is the formation of actin-rich
            pedestals beneath the intimately adherent bacteria and
            localised destruction of the intestinal brush border. The
            bacterial outer membrane adhesin, intimin, is necessary
            for the production of the A/E lesion and diarrhoea. The
            A/E bacteria translocate their own receptor for intimin,
            Tir, into the membrane of mammalian cells using the type
            III secretion system. The translocated Tir triggers
            additional host signalling events and actin nucleation,
            which are essential for lesion formation. This family
            represents the Tir C-terminal domain which has been
            reported to bind uninfected host cells and beta-1
            integrins although the role of intimin binding to
            integrins is unclear. This intimin C-terminal domain has
            also been shown to be sufficient for Tir recognition.
          Length = 222

 Score = 42.6 bits (100), Expect = 0.001
 Identities = 23/102 (22%), Positives = 46/102 (45%), Gaps = 11/102 (10%)

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE-------SESTTTSSPAS 2122
            P  ++TTT++    +TTT +        ++PA  +T TS  E       S   +T+S   
Sbjct: 56   PVEQTTTTTT---TTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTW 112

Query: 2123 ESTTIEEQGVSPHSE-KLSANEDPEEFPNEDVFEHTFAEIPN 2163
              T+  +   +P+++  +S N+       E +++   A+ P 
Sbjct: 113  SDTSSIDTVDNPYADVGMSRNDSQARNSEEPIYDEVAADSPI 154



 Score = 39.9 bits (93), Expect = 0.007
 Identities = 30/117 (25%), Positives = 52/117 (44%), Gaps = 5/117 (4%)

Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE-SESTTTSSLVSESTTT 1967
            N   E TTT++  + +TTT + V      ++P   +T TS  E + S+  SS  S ++TT
Sbjct: 54   NQPVEQTTTTT--TTTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTT 111

Query: 1968 SSPESESTTTSSPESESTTT--SSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
             S  S   T  +P ++   +   S    S      E  + + I  V +  +  +P +
Sbjct: 112  WSDTSSIDTVDNPYADVGMSRNDSQARNSEEPIYDEVAADSPIYSVIQHFSGDTPDT 168



 Score = 37.6 bits (87), Expect = 0.046
 Identities = 26/99 (26%), Positives = 47/99 (47%), Gaps = 10/99 (10%)

Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA-SESTTTSSPASESTT 2106
            N P  ++TTT    + + T+ +        ++PA  +T TS    + S+  SS AS ++T
Sbjct: 54   NQPVEQTTTTT---TTTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTAST 110

Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
            T S  S   T  +P ++       G+S +  +   +E+P
Sbjct: 111  TWSDTSSIDTVDNPYADV------GMSRNDSQARNSEEP 143



 Score = 34.9 bits (80), Expect = 0.31
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 7/80 (8%)

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT--SSPASESTTTS 2088
            E  +TTT++     TTT +   E+   NN  ++  T +S A E+ ++  SS AS ++TT 
Sbjct: 58   EQTTTTTTT-----TTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTW 112

Query: 2089 SPASESTTTSSPASESTTTS 2108
            S  S   T  +P ++   + 
Sbjct: 113  SDTSSIDTVDNPYADVGMSR 132



 Score = 33.4 bits (76), Expect = 1.1
 Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE-SESTTTSSPASESTTTNNPK 2051
            E TTT++  + +TTT + V      ++P   +T T   E + S+  SS AS ++TT +  
Sbjct: 58   EQTTTTT--TTTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTWSDT 115

Query: 2052 SESTTTNNPASESITSS 2068
            S   T +NP ++   S 
Sbjct: 116  SSIDTVDNPYADVGMSR 132



 Score = 33.0 bits (75), Expect = 1.5
 Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 12/89 (13%)

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
            PV ++TTT++    +TTT +        ++PA  +T T+  +  +++  +         S
Sbjct: 56   PVEQTTTTTT---TTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRS---------S 103

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTS 2098
             AS ++TT S  S   T  +P ++   + 
Sbjct: 104  QASTASTTWSDTSSIDTVDNPYADVGMSR 132



 Score = 32.2 bits (73), Expect = 2.4
 Identities = 21/66 (31%), Positives = 31/66 (46%), Gaps = 2/66 (3%)

Query: 1895 TTTNSPESESTTTNN-PESESTTTSSPE-SESTTTSSLVSESTTTSSPESESTTTSSPES 1952
            TTT     E+   NN P   +T TS  E + S+  SS  S ++TT S  S   T  +P +
Sbjct: 67   TTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTWSDTSSIDTVDNPYA 126

Query: 1953 ESTTTS 1958
            +   + 
Sbjct: 127  DVGMSR 132


>gnl|CDD|217330 pfam03035, RNA_capsid, Calicivirus putative RNA polymerase/capsid
            protein. 
          Length = 226

 Score = 42.3 bits (100), Expect = 0.001
 Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 16/111 (14%)

Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
            T   +P S +TT+ S    S+    P S S++ SS  S+ST ++ L S S ++SS     
Sbjct: 103  TRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSA---- 158

Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESES---TTTISPVSESTTTSSPVS 2022
                     S+ TSS V    +   P       T  ++P S + ++S  VS
Sbjct: 159  ---------SSRTSSWVRSQNSNLEPFMPGALQTAWVTPPSSTASSSGTVS 200



 Score = 41.2 bits (97), Expect = 0.003
 Identities = 30/104 (28%), Positives = 45/104 (43%), Gaps = 16/104 (15%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
            S  TT+ S    S+    P S S++ SS  S+ST ++ L S S ++SS            
Sbjct: 110  SMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSA----------- 158

Query: 1952 SESTTTSSLVSESTTTSSPESES---TTTSSPESESTTTSSLVS 1992
              S+ TSS V    +   P       T   +P S + ++S  VS
Sbjct: 159  --SSRTSSWVRSQNSNLEPFMPGALQTAWVTPPSSTASSSGTVS 200



 Score = 37.3 bits (87), Expect = 0.063
 Identities = 28/101 (27%), Positives = 48/101 (47%), Gaps = 16/101 (15%)

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
            T   +P S +TT+ +    S+    P S S ++SS +S+ST ++  +S S ++SS +S  
Sbjct: 103  TRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSASS-- 160

Query: 2095 TTTSS-------------PASESTTTSSPESESTTTSSPAS 2122
              TSS             P +  T   +P S + ++S   S
Sbjct: 161  -RTSSWVRSQNSNLEPFMPGALQTAWVTPPSSTASSSGTVS 200



 Score = 35.8 bits (83), Expect = 0.18
 Identities = 29/107 (27%), Positives = 48/107 (44%), Gaps = 16/107 (14%)

Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
            T   +P S +TT+ S    S+    P S S++ SS  S+ST ++ L S S ++SS     
Sbjct: 103  TRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSA---- 158

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESES---TTTSSPASESTTTN 2048
                     S+ TSS V    + + P       T   +P S + +++
Sbjct: 159  ---------SSRTSSWVRSQNSNLEPFMPGALQTAWVTPPSSTASSS 196



 Score = 35.0 bits (81), Expect = 0.31
 Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 3/109 (2%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            P S +TT+ S    S+      S S++ SS  S+ST ++   S S ++SS    S+ TSS
Sbjct: 108  PNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSA---SSRTSS 164

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
                  +   P       ++ V+  ++T+S     +T    V +S T +
Sbjct: 165  WVRSQNSNLEPFMPGALQTAWVTPPSSTASSSGTVSTVPKGVLDSWTPA 213



 Score = 33.5 bits (77), Expect = 1.0
 Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 8/105 (7%)

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
            T   +P S +TT+ S    S+    P S S++ ++  S+ST        S +S S +S S
Sbjct: 103  TRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQ---STGLSSSSYSSSSAS 159

Query: 2075 TTTSSPASESTTTSSPASES---TTTSSPASESTTTSSPESESTT 2116
            + TSS      +   P       T   +P   S+T SS  + ST 
Sbjct: 160  SRTSSWVRSQNSNLEPFMPGALQTAWVTPP--SSTASSSGTVSTV 202



 Score = 32.3 bits (74), Expect = 2.6
 Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
            P S + TS S    S+    P S S++ SS +S+ST       +ST  SS    S++ SS
Sbjct: 108  PNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQST-------QSTGLSSSSYSSSSASS 160

Query: 2120 PASESTTIEEQGVSPH 2135
              S     +   + P 
Sbjct: 161  RTSSWVRSQNSNLEPF 176



 Score = 31.9 bits (73), Expect = 3.4
 Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 9/86 (10%)

Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
            ++  +    S  S  ++ +S  ++ST  ++    S++ SS       TSS V    +   
Sbjct: 121  TSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSASS------RTSSWVRSQNSNLE 174

Query: 1940 PESES---TTTSSPESESTTTSSLVS 1962
            P       T   +P S + ++S  VS
Sbjct: 175  PFMPGALQTAWVTPPSSTASSSGTVS 200


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 42.2 bits (99), Expect = 0.002
 Identities = 20/129 (15%), Positives = 46/129 (35%), Gaps = 5/129 (3%)

Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
            T   PV+E       ++     ++  S++ +  +P  ++     PA+E+  +   A    
Sbjct: 63   TCQQPVTEDGAA-GDDAGDGAEATAPSDAGSQASPDDDAQ----PAAEAEAADQSAPPEA 117

Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
            +++S   E+ T     + +    +P   +   +  E  S     P S          +  
Sbjct: 118  SSTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAHV 177

Query: 2136 SEKLSANED 2144
            +  L A  +
Sbjct: 178  AGTLQAARE 186



 Score = 41.8 bits (98), Expect = 0.002
 Identities = 22/147 (14%), Positives = 55/147 (37%), Gaps = 16/147 (10%)

Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
            ++      +  S++ + +SP  ++    + E+E+   S+P   S+T+   ++ +      
Sbjct: 77   DAGDGAEATAPSDAGSQASPDDDAQP--AAEAEAADQSAPPEASSTSATDEAATDPPATA 134

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
            A+    +  P ++  T     S            T ++  A  + T  +           
Sbjct: 135  AARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAGTLQA----------- 183

Query: 2121 ASESTTIEEQGVSPHSEKLSANEDPEE 2147
            A ES     + ++  + + +A +DP  
Sbjct: 184  ARESL---VETLARFARRAAATDDPRR 207



 Score = 41.0 bits (96), Expect = 0.004
 Identities = 26/121 (21%), Positives = 54/121 (44%), Gaps = 5/121 (4%)

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
            S++ + +SP+ ++    +  +E+   S+P   S+T+ + E+ +   ++ A+    T +P 
Sbjct: 88   SDAGSQASPDDDAQP--AAEAEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPT 145

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES--TTTSSPASESTTTSS 2109
            ++  T +   S        + E  T S+P +    T   A ES   T +  A  +  T  
Sbjct: 146  AQPATPDERRSPR-QRPPVSGEPPTPSTPDAHVAGTLQAARESLVETLARFARRAAATDD 204

Query: 2110 P 2110
            P
Sbjct: 205  P 205



 Score = 39.5 bits (92), Expect = 0.013
 Identities = 22/111 (19%), Positives = 44/111 (39%), Gaps = 9/111 (8%)

Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
            T   P +E     + A +   +++P S++ + +SP  ++     PA+E+      A    
Sbjct: 63   TCQQPVTEDGAAGDDAGDGAEATAP-SDAGSQASPDDDAQ----PAAEAEAADQSAPPEA 117

Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEH 2156
            +++S   E+  T  PA+ +      G +P      A  D    P +     
Sbjct: 118  SSTSATDEA-ATDPPATAAA---RDGPTPDPTAQPATPDERRSPRQRPPVS 164



 Score = 38.3 bits (89), Expect = 0.032
 Identities = 24/138 (17%), Positives = 47/138 (34%), Gaps = 9/138 (6%)

Query: 1962 SESTTTSSPESESTTTSSPES-ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
            S++ + +SP+ ++   +  E+ + +      S S T  +      T     +    T  P
Sbjct: 88   SDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATA---AARDGPTPDP 144

Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS--SSPASESTTTS 2078
             ++  T     S            T +      + T    A ES+    +  A  +  T 
Sbjct: 145  TAQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAGTLQA-ARESLVETLARFARRAAATD 203

Query: 2079 SP--ASESTTTSSPASES 2094
             P  A E    +  A+E+
Sbjct: 204  DPRRAREYLEAAREAAEA 221


>gnl|CDD|227651 COG5347, COG5347, GTPase-activating protein that regulates ARFs
            (ADP-ribosylation factors), involved in ARF-mediated
            vesicular transport [Intracellular trafficking and
            secretion].
          Length = 319

 Score = 42.8 bits (101), Expect = 0.002
 Identities = 27/174 (15%), Positives = 67/174 (38%), Gaps = 6/174 (3%)

Query: 1923 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS-SPESESTTTSSPE 1981
            +S++ S   S S +++            ES+S ++S+ +  S         ES  ++  +
Sbjct: 125  DSSSPSDFSSFSASSTRTVDSVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSK 184

Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
              S T++    ++   S   + ++   S     + T S  S++    S ++  +    P 
Sbjct: 185  PRSLTSTKSNKDNLLNSELLTLNSLLSSNSEVGSGTKSR-SDAQEKSSTKATESVKPGPV 243

Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPA----SESTTTSSPASESTTTSSPA 2091
            + S+T++ P +   +         T+          +   +  +  S+ T++ A
Sbjct: 244  NTSSTSSLPPAIKRSPVQQLESFTTTPVYFPVNTPATFDATLKSYYSSLTANIA 297



 Score = 42.1 bits (99), Expect = 0.002
 Identities = 31/173 (17%), Positives = 66/173 (38%), Gaps = 6/173 (3%)

Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS-SLVSESTTTSSPE 2001
            +S++ S   S S +++  V         ES+S ++S+    S      L  ES  ++  +
Sbjct: 125  DSSSPSDFSSFSASSTRTVDSVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSK 184

Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
              S T+     ++   S  ++ ++   S     + T S +     ++  K+  +    P 
Sbjct: 185  PRSLTSTKSNKDNLLNSELLTLNSLLSSNSEVGSGTKSRSDAQEKSST-KATESVKPGPV 243

Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPA----SESTTTSSPASESTTTSSP 2110
            + S TSS P +   +        TTT          +   +  +  S+ T++ 
Sbjct: 244  NTSSTSSLPPAIKRSPVQQLESFTTTPVYFPVNTPATFDATLKSYYSSLTANI 296



 Score = 40.9 bits (96), Expect = 0.007
 Identities = 33/172 (19%), Positives = 66/172 (38%), Gaps = 7/172 (4%)

Query: 1896 TTNSPESESTTTNNPESESTTTSSP-ESESTTTSSLVSESTTTS-SPESESTTTSSPESE 1953
            ++ S  S  + ++    +S       ES+S ++S+ +  S         ES  ++  +  
Sbjct: 127  SSPSDFSSFSASSTRTVDSVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSKPR 186

Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSS--LVSESTTTSSPESESTTTISPV 2011
            S T++    ++   S   + ++  SS     + T S     E ++T + ES     ++  
Sbjct: 187  SLTSTKSNKDNLLNSELLTLNSLLSSNSEVGSGTKSRSDAQEKSSTKATESVKPGPVNTS 246

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPA-SESTTTNNPKSE-STTTNNPA 2061
            S S +    +  S         +T    P  + +T     KS  S+ T N A
Sbjct: 247  STS-SLPPAIKRSPVQQLESFTTTPVYFPVNTPATFDATLKSYYSSLTANIA 297



 Score = 36.7 bits (85), Expect = 0.14
 Identities = 26/141 (18%), Positives = 52/141 (36%), Gaps = 5/141 (3%)

Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
            S+  SL + N   +    ES  +   +  S T++    ++   S L++ ++  SS     
Sbjct: 158  SSSASLGNSNRPDDELNVESFQSTGSKPRSLTSTKSNKDNLLNSELLTLNSLLSSNSEVG 217

Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP---- 2000
            + T S        SS  +  +    P + S+T+S P +   +    +   TTT       
Sbjct: 218  SGTKSRSDAQ-EKSSTKATESVKPGPVNTSSTSSLPPAIKRSPVQQLESFTTTPVYFPVN 276

Query: 2001 ESESTTTISPVSESTTTSSPV 2021
               +         S+ T++  
Sbjct: 277  TPATFDATLKSYYSSLTANIA 297



 Score = 35.9 bits (83), Expect = 0.25
 Identities = 29/157 (18%), Positives = 58/157 (36%), Gaps = 4/157 (2%)

Query: 1973 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS-SPVSESTTTISPE 2031
            +S++ S   S S +++  V         ES+S ++ + +  S         ES  +   +
Sbjct: 125  DSSSPSDFSSFSASSTRTVDSVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSK 184

Query: 2032 SESTTTSSPASESTTTNNPKS-ESTTTNNPASESITSS-SPASESTTTSSPASESTTTSS 2089
              S T++    ++   +   +  S  ++N    S T S S A E ++T +  S      +
Sbjct: 185  PRSLTSTKSNKDNLLNSELLTLNSLLSSNSEVGSGTKSRSDAQEKSSTKATESVKPGPVN 244

Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
             +S S +       S         +T    P +   T
Sbjct: 245  TSSTS-SLPPAIKRSPVQQLESFTTTPVYFPVNTPAT 280


>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
            Provisional.
          Length = 408

 Score = 42.9 bits (101), Expect = 0.002
 Identities = 36/216 (16%), Positives = 64/216 (29%), Gaps = 22/216 (10%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
                 TT      S   +   ++       V E+    + E   ++  +P   +  TS+L
Sbjct: 63   SKGLLTTKGEPLVSDKLADLLAQQANLLIPVDETLPVITDEQSLSSPLTP---ALKTSAL 119

Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLV------SESTTTSSPESESTTTISPVSES 2014
             + S      E     +   + +  + S+L         +T  +   S       P   +
Sbjct: 120  AALSKNAQKDEKADDLS---DEDLASLSALFAMLPGQDNTTPVADAPSTVLPAEKPTLLT 176

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
                S   + T T    S         S   TT  P     T   P +     +   +E 
Sbjct: 177  KDMPSAPQDETHT---LSSDEHEKGLTSAQLTTAQPDDAPGTPAQPLTPLAAEAQAKAEV 233

Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
             +T SP        + A+  T T        T ++P
Sbjct: 234  ISTPSP-------VTAAASPTITPHQTQPLPTAAAP 262



 Score = 39.5 bits (92), Expect = 0.020
 Identities = 34/213 (15%), Positives = 67/213 (31%), Gaps = 11/213 (5%)

Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
                 TT      S   + L+++      P  E+    + E    + SS ++ +  TS+ 
Sbjct: 63   SKGLLTTKGEPLVSDKLADLLAQQANLLIPVDETLPVITDEQ---SLSSPLTPALKTSAL 119

Query: 1971 ESESTTTSSPESESTTTSS-LVSESTTTSSPESESTTTISPVSESTT--TSSPVSESTTT 2027
             + S      E     +   L S S   +    +  TT    + ST      P   +   
Sbjct: 120  AALSKNAQKDEKADDLSDEDLASLSALFAMLPGQDNTTPVADAPSTVLPAEKPTLLTKDM 179

Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
             S   + T T S + E          S        +    +     +   +   +++   
Sbjct: 180  PSAPQDETHTLS-SDEHEKGLT----SAQLTTAQPDDAPGTPAQPLTPLAAEAQAKAEVI 234

Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
            S+P+  +   S   +   T   P + +   S+P
Sbjct: 235  STPSPVTAAASPTITPHQTQPLPTAAAPVLSAP 267



 Score = 33.3 bits (76), Expect = 1.8
 Identities = 26/138 (18%), Positives = 46/138 (33%), Gaps = 13/138 (9%)

Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNN---PESESTTTSSPESESTTTSSLVSE 1933
            + E    +S L ++L     T       +T      P   +    S   + T T   +S 
Sbjct: 136  SDEDLASLSALFAMLPGQDNTTPVADAPSTVLPAEKPTLLTKDMPSAPQDETHT---LSS 192

Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
                    S   TT+ P+    T +  ++     +  ++E  +T SP      T++    
Sbjct: 193  DEHEKGLTSAQLTTAQPDDAPGTPAQPLTPLAAEAQAKAEVISTPSP-----VTAAASP- 246

Query: 1994 STTTSSPESESTTTISPV 2011
             T T        T  +PV
Sbjct: 247  -TITPHQTQPLPTAAAPV 263


>gnl|CDD|218056 pfam04388, Hamartin, Hamartin protein.  This family includes the
            hamartin protein which is thought to function as a tumour
            suppressor. The hamartin protein interacts with the
            tuberin protein pfam03542. Tuberous sclerosis complex
            (TSC) is an autosomal dominant disorder and is
            characterized by the presence of hamartomas in many
            organs, such as brain, skin, heart, lung, and kidney. It
            is caused by mutation either TSC1 or TSC2 tumour
            suppressor gene. TSC1 encodes a protein, hamartin,
            containing two coiled-coil regions, which have been shown
            to mediate binding to tuberin. The TSC2 gene codes for
            tuberin pfam03542. These two proteins function within the
            same pathway(s) regulating cell cycle, cell growth,
            adhesion, and vesicular trafficking.
          Length = 667

 Score = 43.0 bits (101), Expect = 0.002
 Identities = 44/259 (16%), Positives = 84/259 (32%), Gaps = 17/259 (6%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT--TSSLVSEST 1935
            + S       +  L +NT+T+     + T+  P +     +S  ++ +    SSL   +T
Sbjct: 284  NSSPRQALPPSISLPQNTSTSGSLHSAQTSRRPNTTFDKAASSGTKDSLWSPSSLCGMAT 343

Query: 1936 TTSS-PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
              SS   S    + SP   S              + ES  +T+  P   +      +  +
Sbjct: 344  PPSSIGMSPLILSLSPSHLSGRAPGTTGSGKGEPASESTPSTSPPPPGLADDIVRAIFAT 403

Query: 1995 TTTSSPESESTTTIS--PVSESTTTSSPVSESTT------TISPESESTTTSSPASESTT 2046
            ++ S+P  E     S  P          + +S         ++ E    T       S  
Sbjct: 404  SSRSAPRKEELQNESSFPKLVRQENLQNIEKSAEGGILDAAVTEELLKLTNEKDDLGSRG 463

Query: 2047 TNNPKSESTTT----NNPASESITSSSPASESTTTSSPASEST--TTSSPASESTTTSSP 2100
             ++P S  T      N    E + S+     + + S+     +  T          ++  
Sbjct: 464  LDSPFSRDTLLGSQRNKAQPELLVSTPDKGPAESQSAANLRVSWFTPIENPMREEKSAPA 523

Query: 2101 ASESTTTSSPESESTTTSS 2119
            + E   TS  ES  + +  
Sbjct: 524  SEEDEQTSLEESLISPSPC 542



 Score = 36.9 bits (85), Expect = 0.14
 Identities = 63/332 (18%), Positives = 117/332 (35%), Gaps = 34/332 (10%)

Query: 1886 TLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1945
            +L+   + +    S    S   N+   ++   S    ++T+TS  +  + T+  P +   
Sbjct: 262  SLDPTETSSEDGYSFSRSSAYPNSSPRQALPPSISLPQNTSTSGSLHSAQTSRRPNTTFD 321

Query: 1946 TTSSPESESTT--TSSLVSESTTTSSPE------SESTTTSSPESESTTTSS---LVSES 1994
              +S  ++ +    SSL   +T  SS        S S +  S  +  TT S      SES
Sbjct: 322  KAASSGTKDSLWSPSSLCGMATPPSSIGMSPLILSLSPSHLSGRAPGTTGSGKGEPASES 381

Query: 1995 T--TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN---- 2048
            T  T+  P   +   +  +  +++ S+P  E     S   +     +  +   +      
Sbjct: 382  TPSTSPPPPGLADDIVRAIFATSSRSAPRKEELQNESSFPKLVRQENLQNIEKSAEGGIL 441

Query: 2049 ----NPKSESTTTNNPASESITSSSPASESTTTSSPASE-------STTTSSPASESTTT 2097
                  +    T       S    SP S  T   S  ++       ST    PA   +  
Sbjct: 442  DAAVTEELLKLTNEKDDLGSRGLDSPFSRDTLLGSQRNKAQPELLVSTPDKGPAESQSAA 501

Query: 2098 SSPASESTTTSSPESESTTTS-SPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEH 2156
            +   S  T   +P  E  +   S   E T++EE  +SP          P + P + +F+ 
Sbjct: 502  NLRVSWFTPIENPMREEKSAPASEEDEQTSLEESLISPSPCSR-----PPQPPYDRLFDI 556

Query: 2157 TFAEIPNIDHSNQTDEAIPETFDAREEWPQCK 2188
               +   +  S +T EA+ +    R    + +
Sbjct: 557  ALPKTACLFLSRKTYEALLKEAGQRLSQEEGE 588



 Score = 32.6 bits (74), Expect = 3.3
 Identities = 38/184 (20%), Positives = 65/184 (35%), Gaps = 11/184 (5%)

Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
            + L  + T TSS +  S       S +   SSP      +IS    ++T+ S  S  T+ 
Sbjct: 259  AKLSLDPTETSSEDGYSF----SRSSAYPNSSPRQALPPSISLPQNTSTSGSLHSAQTSR 314

Query: 2048 N-NPKSESTTTNNPASESITSSSPASESTTTSS-PASESTTTSSPASESTTTSSPASEST 2105
              N   +   ++       + SS    +T  SS   S    + SP+  S           
Sbjct: 315  RPNTTFDKAASSGTKDSLWSPSSLCGMATPPSSIGMSPLILSLSPSHLSGRAPGTTGSGK 374

Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE--HTFAEIPN 2163
               + ES  +T+  P   +   ++   +  +    +    EE  NE  F        + N
Sbjct: 375  GEPASESTPSTSPPPPGLA---DDIVRAIFATSSRSAPRKEELQNESSFPKLVRQENLQN 431

Query: 2164 IDHS 2167
            I+ S
Sbjct: 432  IEKS 435


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone sialoprotein
            (BSP) is a major structural protein of the bone matrix
            that is specifically expressed by fully-differentiated
            osteoblasts. The expression of bone sialoprotein (BSP) is
            normally restricted to mineralised connective tissues of
            bones and teeth where it has been associated with mineral
            crystal formation. However, it has been found that
            ectopic expression of BSP occurs in various lesions,
            including oral and extraoral carcinomas, in which it has
            been associated with the formation of microcrystalline
            deposits and the metastasis of cancer cells to bone.
          Length = 291

 Score = 42.4 bits (99), Expect = 0.002
 Identities = 49/224 (21%), Positives = 74/224 (33%), Gaps = 31/224 (13%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
            + S E      SS E     TS+        ++ +S+       E+E+TT S      T 
Sbjct: 48   SDSSEENGDGDSSEEEGEEETSN-----EEENNEDSDGNEDEEAEAENTTLS------TV 96

Query: 1997 TSSPESESTTTIS---------PVSESTTTSSPVSESTTTISPESESTTTSSPAS---ES 2044
            T     ++T             P            E  +    E E       A      
Sbjct: 97   TLGYGGDATPGTGNIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENE 156

Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
              TN   + ST  ++    S   +    E  + +   +E TT + P     TT+SP    
Sbjct: 157  QGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVTEAEAEGTTVAGP-----TTTSPNGGF 211

Query: 2105 TTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEF 2148
              T+ P+    TT  P  + TT E QG     E+  ANE    +
Sbjct: 212  QPTTPPQEVYGTTDPPFGKVTTPEYQG---EYEQTGANEYDGGY 252



 Score = 37.4 bits (86), Expect = 0.086
 Identities = 44/230 (19%), Positives = 71/230 (30%), Gaps = 7/230 (3%)

Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1976
            + S E      SS       TS+ E  +  +   E E     +    + T       +  
Sbjct: 48   SDSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAENTTLSTVTLGYGGDATPG 107

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
            T +    +         +   ++ E ES        E     + V E+    +  S ++T
Sbjct: 108  TGNIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNST 167

Query: 2037 -TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
                    S   N  + E  +     +E  T + P     TT+SP      T+ P     
Sbjct: 168  EVDHGNGSSGGDNGEEGEEESVTEAEAEGTTVAGP-----TTTSPNGGFQPTTPPQEVYG 222

Query: 2096 TTSSPASESTTTSSP-ESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
            TT  P  + TT     E E T  +         E +   P  +   A ED
Sbjct: 223  TTDPPFGKVTTPEYQGEYEQTGANEYDGGYEIYESENGEPRGDSYRAYED 272



 Score = 36.6 bits (84), Expect = 0.14
 Identities = 37/200 (18%), Positives = 69/200 (34%), Gaps = 12/200 (6%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT-TSSPESESTTTSS 1959
            E+ +   NN +S+       E+E+TT S      T T     ++T  T +    +     
Sbjct: 67   ETSNEEENNEDSDGNEDEEAEAENTTLS------TVTLGYGGDATPGTGNIGLAALQLPK 120

Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
                +   ++ E ES      E E     + V E+   ++  S ++T +     + ++  
Sbjct: 121  KAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVD--HGNGSSGG 178

Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
               E     S        ++ A  +TT+ N   + TT   P  E   ++ P     TT  
Sbjct: 179  DNGEEGEEESVTEAEAEGTTVAGPTTTSPNGGFQPTT---PPQEVYGTTDPPFGKVTTPE 235

Query: 2080 PASESTTTSSPASESTTTSS 2099
               E   T +   +      
Sbjct: 236  YQGEYEQTGANEYDGGYEIY 255


>gnl|CDD|213932 TIGR04319, SerAla_Lrha_rpt, surface protein repeat Ser-Ala-175.  This
            serine and alanine-rich surface protein repeat, about 175
            amino acids long, occurs up to nine times in surface
            proteins of some Lactobacillus strains, particularly in
            Lactobacillus rhamnosus. Members proteins have the
            N-terminal variant signal sequence described by TIGR03715
            and C-terminal LPXTG signals for surface attachment by
            sortase.
          Length = 175

 Score = 41.0 bits (96), Expect = 0.002
 Identities = 46/187 (24%), Positives = 79/187 (42%), Gaps = 16/187 (8%)

Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
            ST +S   S +   SS  S      SL S + T SS  + S T+S   S S   S+  S 
Sbjct: 2    STASSVASSANAVASSAASRFPDNQSLASLAKTASS--ANSVTSSYAASASADASAASSL 59

Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
            +T  SS                  SS  S++ + ++  +    +S     S   +N  S 
Sbjct: 60   ATKVSSANK-------------AASSAASQANSALAAGNLDAASSYANQASKAASNASSL 106

Query: 2054 STTTNNPASESITSSSPASESTT-TSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
            +   N+ AS++++ +  AS +    SS AS + T +   + +    S AS ++  +S  S
Sbjct: 107  ADKANSAASKALSEALQASSAAAIASSAASSAATLAGSLASANDAKSDASAASDAASSAS 166

Query: 2113 ESTTTSS 2119
               +++S
Sbjct: 167  VVASSAS 173



 Score = 40.6 bits (95), Expect = 0.003
 Identities = 44/178 (24%), Positives = 86/178 (48%), Gaps = 9/178 (5%)

Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
            ST +S   S +   SS  S      S  S + T SS  S    TSS  + ++  +S  S 
Sbjct: 2    STASSVASSANAVASSAASRFPDNQSLASLAKTASSANS---VTSSYAASASADASAASS 58

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA-- 2061
              T +S  S +   SS  S++ + ++  +    +S     S   +N  S +   N+ A  
Sbjct: 59   LATKVS--SANKAASSAASQANSALAAGNLDAASSYANQASKAASNASSLADKANSAASK 116

Query: 2062 --SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
              SE++ +SS A+ +++ +S A+    + + A+++ + +S AS++ +++S  + S +T
Sbjct: 117  ALSEALQASSAAAIASSAASSAATLAGSLASANDAKSDASAASDAASSASVVASSAST 174



 Score = 39.1 bits (91), Expect = 0.009
 Identities = 42/167 (25%), Positives = 75/167 (44%), Gaps = 9/167 (5%)

Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
            ST +S   S +   SS  S      SL S + T SS  + S T+    S S   S+  S 
Sbjct: 2    STASSVASSANAVASSAASRFPDNQSLASLAKTASS--ANSVTSSYAASASADASAASSL 59

Query: 2024 STTTISPESESTTTSSPASESTTTNNPK--SESTTTNNPASE--SITSSSPASESTTTSS 2079
            +T   S        SS AS++ +         +++  N AS+  S  SS     ++  S 
Sbjct: 60   ATKVSSANK---AASSAASQANSALAAGNLDAASSYANQASKAASNASSLADKANSAASK 116

Query: 2080 PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
              SE+   SS A+ +++ +S A+    + +  +++ + +S AS++ +
Sbjct: 117  ALSEALQASSAAAIASSAASSAATLAGSLASANDAKSDASAASDAAS 163



 Score = 31.4 bits (71), Expect = 3.6
 Identities = 42/197 (21%), Positives = 76/197 (38%), Gaps = 26/197 (13%)

Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1963
            ST ++   S +   SS  S      SL S + T SS            + S T+S   S 
Sbjct: 2    STASSVASSANAVASSAASRFPDNQSLASLAKTASS------------ANSVTSSYAASA 49

Query: 1964 STTTSSPESESTTTSSPESESTTTSSLV-SESTTTSSPESESTTTISPVSESTTTSSPVS 2022
            S   S+  S +T  SS    +++ +S   S     +   + S    +  + S  +S    
Sbjct: 50   SADASAASSLATKVSSANKAASSAASQANSALAAGNLDAASSYANQASKAASNASSLADK 109

Query: 2023 ESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS 2082
             ++      SE+   SS A+ ++             + AS + T +   + +    S AS
Sbjct: 110  ANSAASKALSEALQASSAAAIAS-------------SAASSAATLAGSLASANDAKSDAS 156

Query: 2083 ESTTTSSPASESTTTSS 2099
             ++  +S AS   +++S
Sbjct: 157  AASDAASSASVVASSAS 173


>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
          Length = 486

 Score = 42.7 bits (100), Expect = 0.002
 Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 8/99 (8%)

Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
            S ++T+SS E+   +T+ + S   T +   S  T+     ++++T ++    +T  S P 
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
            S  TT        T T SL+S      S +   TT++  
Sbjct: 86   SHETTI-------TCTKSLIS-VPYYKSVDMNCTTSVGV 116



 Score = 42.7 bits (100), Expect = 0.003
 Identities = 27/97 (27%), Positives = 43/97 (44%), Gaps = 7/97 (7%)

Query: 1932 SESTTTSSPESESTT----TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
            S ++T+SS E+   +     S+P + + TTS+L S   T +   + +  T S    S   
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85

Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
            S    E+T T +    S      V  + TTS  V+ S
Sbjct: 86   S---HETTITCTKSLISVPYYKSVDMNCTTSVGVNYS 119



 Score = 42.3 bits (99), Expect = 0.003
 Identities = 19/67 (28%), Positives = 32/67 (47%)

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
            AS + TSSS  +   +T+   S   T +   S  T+     ++++T ++    +T  S P
Sbjct: 25   ASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQP 84

Query: 2121 ASESTTI 2127
             S  TTI
Sbjct: 85   HSHETTI 91



 Score = 42.3 bits (99), Expect = 0.003
 Identities = 22/94 (23%), Positives = 42/94 (44%), Gaps = 8/94 (8%)

Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
            S ++T+SS E+   +T+ + S   T +   S  T+     ++++T ++    +T  S P 
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85

Query: 2002 SESTTTISPVSESTTTSSPVSES-----TTTISP 2030
            S  TT        +  S P  +S     TT++  
Sbjct: 86   SHETTI---TCTKSLISVPYYKSVDMNCTTSVGV 116



 Score = 41.9 bits (98), Expect = 0.004
 Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 3/87 (3%)

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
            S ++T+SS E+   +T+  +S   T +   S  T+     ++++T  +  +ESTT +S  
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNAN-GTESTTQASQP 84

Query: 2022 SESTTTISPESESTTTSSPASESTTTN 2048
                TTI+     +  S P  +S   N
Sbjct: 85   HSHETTIT--CTKSLISVPYYKSVDMN 109



 Score = 41.5 bits (97), Expect = 0.006
 Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 1/88 (1%)

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
            S ++T+ S E+   +T+   S   T  +  S  T+     +++ T+++    +T  S P 
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85

Query: 2082 S-ESTTTSSPASESTTTSSPASESTTTS 2108
            S E+T T + +  S         + TTS
Sbjct: 86   SHETTITCTKSLISVPYYKSVDMNCTTS 113



 Score = 40.8 bits (95), Expect = 0.010
 Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 3/95 (3%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
            S  +T++S E+   +T   +S   T +   S  T+     ++++T ++    +T  S P 
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85

Query: 1952 SESTT---TSSLVSESTTTSSPESESTTTSSPESE 1983
            S  TT   T SL+S     S   + +T+     SE
Sbjct: 86   SHETTITCTKSLISVPYYKSVDMNCTTSVGVNYSE 120



 Score = 40.8 bits (95), Expect = 0.010
 Identities = 21/88 (23%), Positives = 39/88 (44%), Gaps = 1/88 (1%)

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
            S ++T+SS  +   +T   +S   T +   S  T+     ++++T  N    +  +S P 
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85

Query: 2072 S-ESTTTSSPASESTTTSSPASESTTTS 2098
            S E+T T + +  S         + TTS
Sbjct: 86   SHETTITCTKSLISVPYYKSVDMNCTTS 113



 Score = 38.4 bits (89), Expect = 0.045
 Identities = 18/71 (25%), Positives = 33/71 (46%)

Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
            S ++T+SS+ +   +T+  +S   T     S  T+     ++++T  +    +T  S P 
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85

Query: 2042 SESTTTNNPKS 2052
            S  TT    KS
Sbjct: 86   SHETTITCTKS 96



 Score = 38.4 bits (89), Expect = 0.054
 Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 1/88 (1%)

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
            S ++T+SS E+   +T    S   T +   S  T+     ++++T ++    +T  + P 
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85

Query: 2052 S-ESTTTNNPASESITSSSPASESTTTS 2078
            S E+T T   +  S+        + TTS
Sbjct: 86   SHETTITCTKSLISVPYYKSVDMNCTTS 113



 Score = 37.3 bits (86), Expect = 0.11
 Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 10/82 (12%)

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
            AS ++T+++ ++   +T      +   S+PA+ + TTS+  S   T +  ++ +  T S 
Sbjct: 25   ASGASTSSSIENSDNST------AEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTES- 77

Query: 2101 ASESTTTSSPESESTTTSSPAS 2122
               +T  S P S  TT +   S
Sbjct: 78   ---TTQASQPHSHETTITCTKS 96



 Score = 36.1 bits (83), Expect = 0.25
 Identities = 22/65 (33%), Positives = 34/65 (52%), Gaps = 12/65 (18%)

Query: 2081 ASESTTTSSPASESTT----TSSPASESTTTSSPESESTT-----TSSPASESTTIEEQG 2131
            AS ++T+SS  +   +     S+PA+ + TTS+  S   T     T++  +ESTT   Q 
Sbjct: 25   ASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTT---QA 81

Query: 2132 VSPHS 2136
              PHS
Sbjct: 82   SQPHS 86



 Score = 35.7 bits (82), Expect = 0.30
 Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 11/102 (10%)

Query: 2032 SESTTTSSPASESTT----TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
            S ++T+SS  +   +     + P + + TT+N  S   T +  ++ +  T S    +T  
Sbjct: 26   SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTES----TTQA 81

Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
            S P S  TT +      +  S P  +S   +   S      E
Sbjct: 82   SQPHSHETTIT---CTKSLISVPYYKSVDMNCTTSVGVNYSE 120



 Score = 31.1 bits (70), Expect = 8.2
 Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 8/100 (8%)

Query: 1855 AATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTT-NNPESE 1913
            A+T+ +I   DN +  + +T      T    T       + +TN+  +ESTT  + P S 
Sbjct: 28   ASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPHSH 87

Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
             TT        T T SL+S     S   + +T+     SE
Sbjct: 88   ETTI-------TCTKSLISVPYYKSVDMNCTTSVGVNYSE 120


>gnl|CDD|113196 pfam04415, DUF515, Protein of unknown function (DUF515).  Family of
            hypothetical Archaeal proteins.
          Length = 416

 Score = 42.6 bits (100), Expect = 0.002
 Identities = 26/80 (32%), Positives = 36/80 (45%), Gaps = 3/80 (3%)

Query: 2051 KSESTTTNN--PASESITSSSPASESTTTSSPASESTTTSSPASESTTTS-SPASESTTT 2107
            K   T       +   I SS   SES + S+  S S++TSS  S ST+ S   AS   + 
Sbjct: 257  KQNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQ 316

Query: 2108 SSPESESTTTSSPASESTTI 2127
             S    ST+ S   S S++ 
Sbjct: 317  RSQLQSSTSQSESESASSSY 336



 Score = 41.8 bits (98), Expect = 0.004
 Identities = 24/70 (34%), Positives = 36/70 (51%), Gaps = 1/70 (1%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE-SESTTTSSLVSESTTTS 1968
                   +S   SES + S+  S S++TSS ES ST+ S  + S   +  S +  ST+ S
Sbjct: 268  DSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQRSQLQSSTSQS 327

Query: 1969 SPESESTTTS 1978
              ES S++ S
Sbjct: 328  ESESASSSYS 337



 Score = 41.8 bits (98), Expect = 0.004
 Identities = 20/80 (25%), Positives = 36/80 (45%), Gaps = 2/80 (2%)

Query: 1932 SESTTTSS--PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1989
               T             +S   SES + S+  S S++TSS ES ST+ S  ++    +  
Sbjct: 258  QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQR 317

Query: 1990 LVSESTTTSSPESESTTTIS 2009
               +S+T+ S    ++++ S
Sbjct: 318  SQLQSSTSQSESESASSSYS 337



 Score = 41.5 bits (97), Expect = 0.005
 Identities = 25/82 (30%), Positives = 35/82 (42%), Gaps = 1/82 (1%)

Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS-SLVSESTT 1966
                               +S  VSES + S+  S S++TSS ES ST+ S    S   +
Sbjct: 256  LKQNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNS 315

Query: 1967 TSSPESESTTTSSPESESTTTS 1988
              S    ST+ S  ES S++ S
Sbjct: 316  QRSQLQSSTSQSESESASSSYS 337



 Score = 38.8 bits (90), Expect = 0.036
 Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 7/81 (8%)

Query: 1922 SESTTTSSLVSES----TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1977
               T    +V       ++ S  ES+S +TS+  S ST++S     S+T+ SP   S   
Sbjct: 258  QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSES---SSTSYSPGDASIQN 314

Query: 1978 SSPESESTTTSSLVSESTTTS 1998
            S      ++TS   SES ++S
Sbjct: 315  SQRSQLQSSTSQSESESASSS 335



 Score = 38.4 bits (89), Expect = 0.043
 Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 3/84 (3%)

Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTS-SLVSES 1934
                 T+    ++S     ++ +  ES+S +T+   S ST++S  ES ST+ S    S  
Sbjct: 256  LKQNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSS--ESSSTSYSPGDASIQ 313

Query: 1935 TTTSSPESESTTTSSPESESTTTS 1958
             +  S    ST+ S  ES S++ S
Sbjct: 314  NSQRSQLQSSTSQSESESASSSYS 337



 Score = 38.4 bits (89), Expect = 0.044
 Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 5/81 (6%)

Query: 1962 SESTTTSS--PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP--VSESTTT 2017
               T             +S   SES + S+  S S++TSS ES S+T+ SP   S   + 
Sbjct: 258  QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSES-SSTSYSPGDASIQNSQ 316

Query: 2018 SSPVSESTTTISPESESTTTS 2038
             S +  ST+    ES S++ S
Sbjct: 317  RSQLQSSTSQSESESASSSYS 337



 Score = 38.0 bits (88), Expect = 0.061
 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 7/81 (8%)

Query: 2012 SESTTTSSPVSESTTTISP----ESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
               T     V      +S     ES+S +TS+ +S ST++      S+T+ +P   SI +
Sbjct: 258  QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSS---SESSSTSYSPGDASIQN 314

Query: 2068 SSPASESTTTSSPASESTTTS 2088
            S  +   ++TS   SES ++S
Sbjct: 315  SQRSQLQSSTSQSESESASSS 335



 Score = 37.6 bits (87), Expect = 0.071
 Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 3/89 (3%)

Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
            SES + S+  S S++TSS  S S+T+ SP   S   S  +   ++TS  ESES   SS  
Sbjct: 280  SESQSQSTSTSSSSSTSSSES-SSTSYSPGDASIQNSQRSQLQSSTSQSESES--ASSSY 336

Query: 2122 SESTTIEEQGVSPHSEKLSANEDPEEFPN 2150
            S S  + E   +  + KL A+E   +  N
Sbjct: 337  SYSVNLPEILKAIAAGKLDADEIKAQLQN 365



 Score = 37.2 bits (86), Expect = 0.10
 Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 3/79 (3%)

Query: 1992 SESTTTSS--PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
               T             +   VSES + S+  S S++T S ES S+T+ SP   S   + 
Sbjct: 258  QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSES-SSTSYSPGDASIQNSQ 316

Query: 2050 PKSESTTTNNPASESITSS 2068
                 ++T+   SES +SS
Sbjct: 317  RSQLQSSTSQSESESASSS 335



 Score = 37.2 bits (86), Expect = 0.11
 Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 6/82 (7%)

Query: 1952 SESTTTSSLVSES----TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
               T    +V       ++ S  ES+S +TS+  S ST++S   S ST+ S  ++    +
Sbjct: 258  QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSE--SSSTSYSPGDASIQNS 315

Query: 2008 ISPVSESTTTSSPVSESTTTIS 2029
                 +S+T+ S    ++++ S
Sbjct: 316  QRSQLQSSTSQSESESASSSYS 337



 Score = 36.4 bits (84), Expect = 0.18
 Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 5/85 (5%)

Query: 1942 SESTTTSS--PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
               T             +S  VSES + S+  S S++TSS ES ST+ S   +    +  
Sbjct: 258  QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQR 317

Query: 2000 PESESTTTISPVSESTTTSSPVSES 2024
             + +S+T+    SES + SS  S S
Sbjct: 318  SQLQSSTS---QSESESASSSYSYS 339



 Score = 36.1 bits (83), Expect = 0.21
 Identities = 21/69 (30%), Positives = 38/69 (55%), Gaps = 1/69 (1%)

Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASES-TTTSSPASESTTTSSPASESTTTSSPAS 2102
            S + +  +S+ST+T++ +S S + SS  S S    S   S+ +   S  S+S + S+ +S
Sbjct: 276  SISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQRSQLQSSTSQSESESASSS 335

Query: 2103 ESTTTSSPE 2111
             S + + PE
Sbjct: 336  YSYSVNLPE 344



 Score = 31.8 bits (72), Expect = 4.5
 Identities = 24/86 (27%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 1972 SESTTTSS--PESESTTTSSLVSESTTTSSPESESTTTISPVSESTT-----TSSPVSES 2024
               T             +S  VSES + S+  S S++T S  S ST+      S   S+ 
Sbjct: 258  QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQR 317

Query: 2025 TTTISPESESTTTSSPASESTTTNNP 2050
            +   S  S+S + S+ +S S + N P
Sbjct: 318  SQLQSSTSQSESESASSSYSYSVNLP 343



 Score = 31.4 bits (71), Expect = 6.7
 Identities = 24/86 (27%), Positives = 35/86 (40%), Gaps = 4/86 (4%)

Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE-SESTT 2116
            N      I     +     +S   SES + S+  S S++TSS  S ST+ S  + S   +
Sbjct: 259  NGTIFYEIV---DSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNS 315

Query: 2117 TSSPASESTTIEEQGVSPHSEKLSAN 2142
              S    ST+  E   +  S   S N
Sbjct: 316  QRSQLQSSTSQSESESASSSYSYSVN 341


>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal.  Members of
            this family are found in the N terminal region of
            cytochrome B561, as well as in various other putative
            uncharacterized proteins.
          Length = 559

 Score = 42.8 bits (101), Expect = 0.002
 Identities = 32/231 (13%), Positives = 67/231 (29%), Gaps = 17/231 (7%)

Query: 1950 PESESTTTSSLVSESTTTSSPESESTT-TSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
             +    T  S   +S   S   +   T        S+ + S    ++ +      ST   
Sbjct: 105  AKDSQFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQ 164

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
            SP    +   +  S S  + S  S     +S  + S   ++P +  ++      +  T  
Sbjct: 165  SPQLTPSNKPASPSSSYQSPSYSSSLGPVNSSGNRSNLRSSPWALRSSG--DKKDITTDE 222

Query: 2069 SP-----ASESTTTSSPASEST--TTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
                   A          S +    T      S  +SSP+  + + ++ ++  +      
Sbjct: 223  KYLETFLAEVDEEQHMITSSAGKNATPPETINSFGSSSPSFWNYSRNASDAARSLKKRSY 282

Query: 2122 SESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDE 2172
              S         P  +K  A+  P++   E       +            +
Sbjct: 283  QLS-----PSPVPSKQK--ASTSPKKGEGEPPNMSLESASEVFKRVGVLPQ 326



 Score = 40.9 bits (96), Expect = 0.008
 Identities = 48/228 (21%), Positives = 89/228 (39%), Gaps = 20/228 (8%)

Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS-SPESESTTTS 1978
             +    T  S        S P S+++T  +         S  S+S + S SP  + + +S
Sbjct: 105  AKDSQFTVVS----QAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSS 160

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
            + +     T S    ++ +SS +S S ++      S+   S         SP +     S
Sbjct: 161  TIQQSPQLTPSN-KPASPSSSYQSPSYSSSLGPVNSSGNRSN-----LRSSPWALR---S 211

Query: 2039 SPASESTTTNNPKSESTTTN-NPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
            S   +  TT+    E+     +     ITSS+  +    T      S  +SSP+  + + 
Sbjct: 212  SGDKKDITTDEKYLETFLAEVDEEQHMITSSAGKN---ATPPETINSFGSSSPSFWNYSR 268

Query: 2098 SSP-ASESTTTSSPESESTTTSSPASESTTIEE-QGVSPHSEKLSANE 2143
            ++  A+ S    S +   +   S    ST+ ++ +G  P+    SA+E
Sbjct: 269  NASDAARSLKKRSYQLSPSPVPSKQKASTSPKKGEGEPPNMSLESASE 316



 Score = 35.5 bits (82), Expect = 0.35
 Identities = 33/198 (16%), Positives = 72/198 (36%), Gaps = 14/198 (7%)

Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS---SPESESTTTS 1928
                ++  + V  +  +   S+ +T  +        ++  S+S + S   S +   ++T 
Sbjct: 103  VKAKDSQFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTI 162

Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVS--ESTTTSSPESESTTTSSPESESTT 1986
                + T ++ P S S++  SP   S+      S   S   SSP +    +S  + + TT
Sbjct: 163  QQSPQLTPSNKPASPSSSYQSPSYSSSLGPVNSSGNRSNLRSSPWALR--SSGDKKDITT 220

Query: 1987 TSSL-------VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
                       V E     +  +    T      S  +SSP   + +  + ++  +    
Sbjct: 221  DEKYLETFLAEVDEEQHMITSSAGKNATPPETINSFGSSSPSFWNYSRNASDAARSLKKR 280

Query: 2040 PASESTTTNNPKSESTTT 2057
                S +    K +++T+
Sbjct: 281  SYQLSPSPVPSKQKASTS 298


>gnl|CDD|221548 pfam12361, DBP, Duffy-antigen binding protein.  This family of
            proteins is found in eukaryotes. Proteins in this family
            are typically between 449 and 1061 amino acids in length.
            The family is found in association with pfam05424. There
            are two conserved sequence motifs: NKNGG and QKHDF. This
            family is part of the Duffy-antigen binding protein of
            Plasmodium spp. This protein is an antigen on these
            parasites which enable them to invade erythrocytes.
          Length = 318

 Score = 42.0 bits (98), Expect = 0.002
 Identities = 42/196 (21%), Positives = 73/196 (37%), Gaps = 7/196 (3%)

Query: 1894 NTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
              T+NS  SEST   N   + T  S+     ++ + LV++      P  +++  S   S 
Sbjct: 95   LGTSNSRPSESTVEANSPGDGTVNSASIPVVSSENPLVTKHKGLE-PSKDNSDNSGSASH 153

Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS---- 2009
            +    +  S +   S+ + E T       E   T    + S  TSS   +STT++     
Sbjct: 154  ALAGENGESMAGPDSNSKGE-TADPQDNIEVKATKDSSNRSDGTSSATGDSTTSVDRAIN 212

Query: 2010 -PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
              V E    S     +    S   +   T +  S +    N   ++   N P S +  + 
Sbjct: 213  KGVPEDGDKSVGSKRAENEDSSAEKDGATVAGGSTNDPEQNVSVDTDNGNVPGSGNKQNE 272

Query: 2069 SPASESTTTSSPASES 2084
               + S   S  ++ES
Sbjct: 273  GATALSGAESLESNES 288



 Score = 41.6 bits (97), Expect = 0.004
 Identities = 56/276 (20%), Positives = 101/276 (36%), Gaps = 32/276 (11%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
            S +   NS   +STT      E+ T        T   S V  S    S  +++     P 
Sbjct: 20   SAHGNVNSGAGKSTT-----GEAVTGDGQNGNQTPAESNVQRSDIVESLSAKNVDPQKPV 74

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP------ESEST 2005
            SE +  +S V++     + +    T++S  SEST  ++   + T  S+       E+   
Sbjct: 75   SERSADTSSVTD--IAEAGKENLGTSNSRPSESTVEANSPGDGTVNSASIPVVSSENPLV 132

Query: 2006 TT---ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
            T    + P  +++  S   S +    + ES +   S+   E T       E   T + ++
Sbjct: 133  TKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGE-TADPQDNIEVKATKDSSN 191

Query: 2063 ESITSSSPASESTTT---------------SSPASESTTTSSPASESTTTSSPASESTTT 2107
             S  +SS   +STT+               S  +  +    S A +   T +  S +   
Sbjct: 192  RSDGTSSATGDSTTSVDRAINKGVPEDGDKSVGSKRAENEDSSAEKDGATVAGGSTNDPE 251

Query: 2108 SSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
             +   ++   + P S +   E       +E L +NE
Sbjct: 252  QNVSVDTDNGNVPGSGNKQNEGATALSGAESLESNE 287



 Score = 40.5 bits (94), Expect = 0.009
 Identities = 38/211 (18%), Positives = 77/211 (36%), Gaps = 8/211 (3%)

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
            S+P  ++  +S  E     ++     S    S   E+ T        T   S V  S   
Sbjct: 1    SNPIIQAVDSSKAEKVQGDSAHGNVNSGAGKSTTGEAVTGDGQNGNQTPAESNVQRSDIV 60

Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
             S  +++     PVSE +  +S V++       +    T++S  SEST   N   + T  
Sbjct: 61   ESLSAKNVDPQKPVSERSADTSSVTDIAEA--GKENLGTSNSRPSESTVEANSPGDGTVN 118

Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
            +      ++S +P         P+ +++  S  AS +    +  S +   S+ + E+   
Sbjct: 119  SASIP-VVSSENPLVTKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGETADP 177

Query: 2118 SSPAS-----ESTTIEEQGVSPHSEKLSANE 2143
                      +S+   +   S   +  ++ +
Sbjct: 178  QDNIEVKATKDSSNRSDGTSSATGDSTTSVD 208



 Score = 39.0 bits (90), Expect = 0.025
 Identities = 39/202 (19%), Positives = 71/202 (35%), Gaps = 6/202 (2%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
                 S S    ST+ +    + T NS      ++ NP         P  +++  S   S
Sbjct: 93   ENLGTSNSRPSESTVEANSPGDGTVNSASIPVVSSENPLVTKHKGLEPSKDNSDNSGSAS 152

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
             +    + ES +   S+ + E+      +    T  S    S  TSS   +STT+   V 
Sbjct: 153  HALAGENGESMAGPDSNSKGETADPQDNIEVKATKDSSNR-SDGTSSATGDSTTS---VD 208

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
             +     PE    +  S    +    S   +   T++  S +    + + ++   N P S
Sbjct: 209  RAINKGVPEDGDKSVGS--KRAENEDSSAEKDGATVAGGSTNDPEQNVSVDTDNGNVPGS 266

Query: 2053 ESTTTNNPASESITSSSPASES 2074
             +       + S   S  ++ES
Sbjct: 267  GNKQNEGATALSGAESLESNES 288



 Score = 37.4 bits (86), Expect = 0.076
 Identities = 47/239 (19%), Positives = 86/239 (35%), Gaps = 9/239 (3%)

Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
            +NP  ++  +S  E     ++     S    S   E+ T        T   S V  S   
Sbjct: 1    SNPIIQAVDSSKAEKVQGDSAHGNVNSGAGKSTTGEAVTGDGQNGNQTPAESNVQRSDIV 60

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
             S  +++     P SE +  +S V++     + +    T+ S  SEST  ++   + T  
Sbjct: 61   ESLSAKNVDPQKPVSERSADTSSVTD--IAEAGKENLGTSNSRPSESTVEANSPGDGTVN 118

Query: 2028 ISPESESTTTSSPASESTTTN--NPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
             +        SS     T      P  +++  +  AS ++   +  S +   S+   E+ 
Sbjct: 119  SAS---IPVVSSENPLVTKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGETA 175

Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
                      T  S  S  +  +S  +  +TTS   + +  + E G      K + NED
Sbjct: 176  DPQDNIEVKATKDS--SNRSDGTSSATGDSTTSVDRAINKGVPEDGDKSVGSKRAENED 232



 Score = 37.4 bits (86), Expect = 0.090
 Identities = 39/182 (21%), Positives = 70/182 (38%), Gaps = 13/182 (7%)

Query: 1877 NSESTVVMSTLNSLLS-----ENTTTNSPESEST----TTNNPESESTTTSSPESESTTT 1927
            NS S  V+S+ N L++     E +  NS  S S        N ES +   S+ + E+   
Sbjct: 118  NSASIPVVSSENPLVTKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGETADP 177

Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
               +    T  S    S  TSS   +STT+   V  +     PE    +  S  +E+  +
Sbjct: 178  QDNIEVKATKDSSNR-SDGTSSATGDSTTS---VDRAINKGVPEDGDKSVGSKRAENEDS 233

Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
            S+    +T      ++    +S  +++       ++     +  S + +  S  S   T 
Sbjct: 234  SAEKDGATVAGGSTNDPEQNVSVDTDNGNVPGSGNKQNEGATALSGAESLESNESVHKTI 293

Query: 2048 NN 2049
            +N
Sbjct: 294  DN 295



 Score = 35.9 bits (82), Expect = 0.23
 Identities = 38/232 (16%), Positives = 78/232 (33%), Gaps = 4/232 (1%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
            ++  ES S    +P+   +  S+  S  T  +    E+  TS+     +T  +      T
Sbjct: 57   SDIVESLSAKNVDPQKPVSERSADTSSVTDIAEAGKENLGTSNSRPSESTVEANSPGDGT 116

Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
             +S      ++ +P         P  +++  S   S +    + ES +    +   E+  
Sbjct: 117  VNSASIPVVSSENPLVTKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGETAD 176

Query: 2017 TSSPVSESTT----TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
                +    T      S  + S T  S  S     N    E    +  +  +    S A 
Sbjct: 177  PQDNIEVKATKDSSNRSDGTSSATGDSTTSVDRAINKGVPEDGDKSVGSKRAENEDSSAE 236

Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
            +   T +  S +    + + ++   + P S +       + S   S  ++ES
Sbjct: 237  KDGATVAGGSTNDPEQNVSVDTDNGNVPGSGNKQNEGATALSGAESLESNES 288


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
            Validated.
          Length = 830

 Score = 42.5 bits (100), Expect = 0.003
 Identities = 19/123 (15%), Positives = 41/123 (33%), Gaps = 2/123 (1%)

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
            +T   +P +      + +         A      N   S  +  +   ++    S  AS 
Sbjct: 420  ATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASA 479

Query: 2074 STTTSSPAS--ESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
              + + P +  E    ++  S +T  + P + +   +S E      + PA E+       
Sbjct: 480  PASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAA 539

Query: 2132 VSP 2134
             +P
Sbjct: 540  AAP 542



 Score = 36.8 bits (85), Expect = 0.20
 Identities = 21/159 (13%), Positives = 57/159 (35%), Gaps = 5/159 (3%)

Query: 1995 TTTSSPESESTTTISPVSESTTTS-SPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
               ++  + +   ++ V+ +   + +P + +    +         +P + +   ++    
Sbjct: 386  RAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADG 445

Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
                  PA  +  +S+ +      + P ++S + S+PA  S      A E    ++  S 
Sbjct: 446  DAPV--PAKANARASADSRCDERDAQPPADSGSASAPA--SDAPPDAAFEPAPRAAAPSA 501

Query: 2114 STTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNED 2152
            +T  + P + +     +  +P +    A E     P   
Sbjct: 502  ATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAA 540



 Score = 36.0 bits (83), Expect = 0.31
 Identities = 18/186 (9%), Positives = 58/186 (31%), Gaps = 19/186 (10%)

Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
            P   +   ++  + +    + V+     ++   ++   ++            + +   ++
Sbjct: 381  PAPGARAAAAVGASAVPAVTAVT-GAAGAALAPKAAAAAAATRAEAPP---AAPAPPATA 436

Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
               +              ++  S  +     +++    S  AS   +   P        +
Sbjct: 437  DRGDDAAD-GDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPP--------D 487

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
             A E    ++  S +T  + P + +   +S        + PA E+         +   ++
Sbjct: 488  AAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEAR------PPTPAAAA 541

Query: 2120 PASEST 2125
            PA+ + 
Sbjct: 542  PAARAG 547



 Score = 33.7 bits (77), Expect = 1.4
 Identities = 15/168 (8%), Positives = 58/168 (34%), Gaps = 3/168 (1%)

Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1979
            P   +   +++ + +    +  +     ++   ++   ++        ++P   +T    
Sbjct: 381  PAPGARAAAAVGASAVPAVTAVT-GAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRG 439

Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
             ++           +   S+          P ++S + S+P S++    + E      + 
Sbjct: 440  DDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAP 499

Query: 2040 PASESTTTNNPKSESTTTNN--PASESITSSSPASESTTTSSPASEST 2085
             A+      + ++ +  +    PA+ +  +      +   ++PA+ + 
Sbjct: 500  SAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAG 547


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
            Reviewed.
          Length = 460

 Score = 41.9 bits (98), Expect = 0.004
 Identities = 27/146 (18%), Positives = 44/146 (30%), Gaps = 4/146 (2%)

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
            E E    ++ A  S  +          N P   +    +  +E T   +  S        
Sbjct: 297  EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRD 356

Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPN 2150
              EST      SE+      +       +PA+     E    +P      A+E  +E   
Sbjct: 357  G-ESTPAVEETSEADIER-EQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDE--T 412

Query: 2151 EDVFEHTFAEIPNIDHSNQTDEAIPE 2176
            E       A IP+    ++   A P 
Sbjct: 413  EPEVPEKAAPIPDPAKPDELAVAGPG 438



 Score = 41.9 bits (98), Expect = 0.004
 Identities = 23/151 (15%), Positives = 47/151 (31%), Gaps = 4/151 (2%)

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
            E E    ++    S  ++           P   +    + V+E T  ++ ES        
Sbjct: 297  EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRD 356

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
              EST      SE+          +   +PA+      + ++     ++ ASE+   + P
Sbjct: 357  G-ESTPAVEETSEADIER-EQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEP 414

Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQG 2131
              E    ++P  +       A      +   
Sbjct: 415  --EVPEKAAPIPDPAKPDELAVAGPGDDPAE 443



 Score = 40.0 bits (93), Expect = 0.016
 Identities = 23/151 (15%), Positives = 46/151 (30%), Gaps = 5/151 (3%)

Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
            E E    ++    S  ++         + P+  +    + V+E T   + ES     +  
Sbjct: 297  EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQV-ADR 355

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
            + EST      SE+      +       +P +      +  +      +  SE+   + P
Sbjct: 356  DGESTPAVEETSEADIER-EQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEP 414

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPA 2071
              E      P  +    +  A        PA
Sbjct: 415  --EVPEKAAPIPDPAKPDELAVAG-PGDDPA 442



 Score = 39.2 bits (91), Expect = 0.030
 Identities = 21/153 (13%), Positives = 48/153 (31%), Gaps = 4/153 (2%)

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
            E E    ++    S  ++         + P+  +    + V+E T   +  S        
Sbjct: 297  EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVA-DR 355

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
            + EST      SE+      +        PA+  + + + ++     ++ ASE+   + P
Sbjct: 356  DGESTPAVEETSEADIE-REQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEP 414

Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
              E    ++P  +                   +
Sbjct: 415  --EVPEKAAPIPDPAKPDELAVAGPGDDPAEPD 445



 Score = 38.8 bits (90), Expect = 0.036
 Identities = 20/164 (12%), Positives = 46/164 (28%), Gaps = 16/164 (9%)

Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
            E E    ++    S  ++         + P+  +    + V+E T   + ES        
Sbjct: 297  EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVA--- 353

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
                  ++  V E++       +    +  A  +      + ++       + S     P
Sbjct: 354  -DRDGESTPAVEETSEADIEREQPGDLAGQAPAA-----HQVDAE------AASAAPEEP 401

Query: 2071 ASE-STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
            A+  S        E    ++P  +       A         E +
Sbjct: 402  AALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDDPAEPD 445



 Score = 37.3 bits (86), Expect = 0.10
 Identities = 21/151 (13%), Positives = 43/151 (28%), Gaps = 4/151 (2%)

Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
            E E     +    S  ++           P+  +    +  +E T     +S     +  
Sbjct: 297  EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRD 356

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
              ES  +    SE+              +PA+      + ++     ++  SE+   + P
Sbjct: 357  G-ESTPAVEETSEADIER-EQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEP 414

Query: 2121 ASESTTIEEQGVSPHSEKLSANEDPEEFPNE 2151
                        +   E   A   P + P E
Sbjct: 415  EVPEKAAPIPDPAKPDE--LAVAGPGDDPAE 443



 Score = 35.3 bits (81), Expect = 0.47
 Identities = 27/170 (15%), Positives = 50/170 (29%), Gaps = 11/170 (6%)

Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
            E    ++    S  +          + P   +       +E T      S    +     
Sbjct: 299  EPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDG- 357

Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV 2132
            EST      SE+              +PA+        E+ S     PA+ ++   ++  
Sbjct: 358  ESTPAVEETSEADIE-REQPGDLAGQAPAAHQ---VDAEAASAAPEEPAALASEAHDETE 413

Query: 2133 SPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDARE 2182
                EK +   DP + P+E        +    D   + D+     F +R 
Sbjct: 414  PEVPEKAAPIPDPAK-PDELAVAGPGDDPAEPDGIRRQDD-----FSSRR 457



 Score = 34.6 bits (79), Expect = 0.81
 Identities = 16/138 (11%), Positives = 36/138 (26%), Gaps = 14/138 (10%)

Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
              N P+  +       +E T   + ES            +T +  E+        +    
Sbjct: 322  EPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRD--GESTPAVEETSEADIEREQPGDL 379

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
                        ++ + ++   S+   E    +S   + T    PE       +P+ +  
Sbjct: 380  -------AGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKA-----APIPDPA 427

Query: 2016 TTSSPVSESTTTISPESE 2033
                           E +
Sbjct: 428  KPDELAVAGPGDDPAEPD 445


>gnl|CDD|221866 pfam12935, Sec16_N, Vesicle coat trafficking protein Sec16
            N-terminus.  Sec16 is a multi-domain vesicle coat
            protein. The overall function of Sec16 is in mediating
            the movement of protein-cargo between the organelles of
            the secretory pathway. Over-expression of truncated
            mutants of only the N-terminus are lethal, and this
            portion does not appear to be essential for function so
            may act as a stabilising region.
          Length = 246

 Score = 41.0 bits (96), Expect = 0.004
 Identities = 46/230 (20%), Positives = 65/230 (28%), Gaps = 31/230 (13%)

Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
               ST T  +       S  E  +    + E       S        S    +     S 
Sbjct: 20   NQLSTQTKPIYLPPENESRFEEGAPLLDNGEQNEPVEESAPQTVAIDSVFVEDEDDEGSD 79

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
               S      V E         +ST   S V +S   +     S    S E    T  + 
Sbjct: 80   FFNSLHEGEAVEEQQPPPHLTRKST---SQVLDSLGLNPDSLSS--PASAEPLDPTAQNE 134

Query: 2041 ASES---TTTNNPKSESTTTNNPASE-----SITSSSPASESTTTSSPASE--------- 2083
             S     +T  NP  ES + + P+SE         S   SEST T    +E         
Sbjct: 135  FSNVLAASTDGNP--ESESQSEPSSEEELAARAELSDDESESTPTEDDLAERWQAFLDND 192

Query: 2084 -----STTTSSPASESTTTSSPASESTTTSSP--ESESTTTSSPASESTT 2126
                    T+     +  T   +  +    SP    E  +   P +E TT
Sbjct: 193  DDLLLDDETALAEGPNGDTPENSQNTLNDDSPFGTPEFPSPVRPKAEPTT 242


>gnl|CDD|221242 pfam11816, DUF3337, Domain of unknown function (DUF3337).  This
            family of proteins are functionally uncharacterized. This
            family is only found in eukaryotes. This presumed domain
            is typically between 285 to 342 amino acids in length.
          Length = 320

 Score = 41.4 bits (97), Expect = 0.004
 Identities = 27/171 (15%), Positives = 51/171 (29%), Gaps = 11/171 (6%)

Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
            NN  S +   +  S+ +     +    E+    + E E     S  S +  T        
Sbjct: 5    NNKRSILSKDSSGSV-TLWDIPSGKVVETPGEVSEEEEIKELESVYSPNWFTVDSKEGKL 63

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES-------TTTS 1988
            ++S    +   +SS   +    S+   E     S ++    +S  E +           +
Sbjct: 64   SSSLFGKKFRMSSSLLKKCGAAST---EGKPQKSEKAIDLKSSKAEKDPEINLGGLLLRA 120

Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
             L        +P     +T  P  ++ T    +   TT I  E        
Sbjct: 121  LLEYWKELKCNPRVLVFSTFLPSLDNETPYLKLPPDTTIIISEESPDLGGG 171



 Score = 39.1 bits (91), Expect = 0.024
 Identities = 31/195 (15%), Positives = 52/195 (26%), Gaps = 8/195 (4%)

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
               S  S+ ++ S   +     S  V E+    S E E     S  S +  T        
Sbjct: 6    NKRSILSKDSSGSV--TLWDIPSGKVVETPGEVSEEEEIKELESVYSPNWFTVDSKEGKL 63

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
            ++S    +   + S + +    S+           E      SS A +    N       
Sbjct: 64   SSSLFGKKFRMSSSLLKKCGAASTEGKPQK----SEKAIDLKSSKAEKDPEINLGGLLLR 119

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
                   E   +      ST   S  +E+     P    TT              +    
Sbjct: 120  ALLEYWKELKCNPRVLVFSTFLPSLDNETPYLKLPPD--TTIIISEESPDLGGGRDLYRG 177

Query: 2116 TTSSPASESTTIEEQ 2130
               S + +   +EE 
Sbjct: 178  LVGSTSGDEELLEEN 192



 Score = 32.1 bits (73), Expect = 3.7
 Identities = 25/155 (16%), Positives = 49/155 (31%), Gaps = 21/155 (13%)

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
               S  S+ ++    V+     S  V E+   +S E E     S  S +  T + K    
Sbjct: 6    NKRSILSKDSSG--SVTLWDIPSGKVVETPGEVSEEEEIKELESVYSPNWFTVDSK---- 59

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
                     ++SS    +   +SS   +    S+      +  +   +S   S  E +  
Sbjct: 60   ------EGKLSSSLFGKKFRMSSSLLKKCGAASTEGKPQKSEKAIDLKS---SKAEKD-- 108

Query: 2116 TTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPN 2150
                P      +  + +  + ++L  N     F  
Sbjct: 109  ----PEINLGGLLLRALLEYWKELKCNPRVLVFST 139


>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 292

 Score = 41.5 bits (97), Expect = 0.004
 Identities = 33/174 (18%), Positives = 64/174 (36%), Gaps = 12/174 (6%)

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
            S+L + ST++S     S   S   + +T  ++       TS+  +    ++ P+S + T 
Sbjct: 29   SALKAPSTSSSEA-PASAEKSIDLNGATQANAQQPAPGPTSAENTSQDLSLPPISSTPTQ 87

Query: 2018 --SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT--SSSPASE 2073
                   +    +  + +    +      +  NN    ST    PA+ +    +S P +E
Sbjct: 88   GQEPLAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVAPVRNASVPTAE 147

Query: 2074 STTTSSPASESTTTS---SPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
                + P      +     P +  T T++ A     T+SP     T   PA + 
Sbjct: 148  RPAITRPVRAQAVSEPAVEPKAAKTATATEAK--VQTASPAQTPAT--PPAGKG 197



 Score = 36.5 bits (84), Expect = 0.13
 Identities = 34/185 (18%), Positives = 62/185 (33%), Gaps = 12/185 (6%)

Query: 1917 TSSPESESTTTSSL-VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
            TSS E+ ++   S+ ++ +T  ++ +     TS+  +    +   +S + T         
Sbjct: 36   TSSSEAPASAEKSIDLNGATQANAQQPAPGPTSAENTSQDLSLPPISSTPTQGQEPLAQD 95

Query: 1976 TTSSPESESTTTSSLVSESTTTS----SPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
                 E +    ++ V     +     +  S   T  + V+     S P +E      P 
Sbjct: 96   GQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVAPVRNASVPTAERPAITRPV 155

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
              +   S PA E      PK+  T T   A     S +    +      A+ S    S  
Sbjct: 156  -RAQAVSEPAVE------PKAAKTATATEAKVQTASPAQTPATPPAGKGAAASGQLKSAP 208

Query: 2092 SESTT 2096
            S   T
Sbjct: 209  SSHYT 213



 Score = 33.8 bits (77), Expect = 1.1
 Identities = 27/178 (15%), Positives = 59/178 (33%), Gaps = 18/178 (10%)

Query: 1947 TSSPESESTTTSSL-VSESTTTSSPESESTTTSSPESESTTTSSLVSES-TTTSSPES-- 2002
            TSS E+ ++   S+ ++ +T  ++ +     TS+  +    +   +S + T    P +  
Sbjct: 36   TSSSEAPASAEKSIDLNGATQANAQQPAPGPTSAENTSQDLSLPPISSTPTQGQEPLAQD 95

Query: 2003 -----ESTTTISPVSESTTTSSPV------SESTTTISPESESTTTSSPASESTTTNNPK 2051
                 E    ++  +      S +      S   T  +  +     S P +E      P 
Sbjct: 96   GQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVAPVRNASVPTAERPAITRPV 155

Query: 2052 SESTTTN---NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
                 +     P +    +++ A   T + +    +      A+ S    S  S   T
Sbjct: 156  RAQAVSEPAVEPKAAKTATATEAKVQTASPAQTPATPPAGKGAAASGQLKSAPSSHYT 213


>gnl|CDD|222274 pfam13634, Nucleoporin_FG, Nucleoporin FG repeat region.  This family
            includes a number of FG repeats that are found in
            nucleoporin proteins. This family includes the yeast
            nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup 145.
          Length = 106

 Score = 38.6 bits (90), Expect = 0.004
 Identities = 13/100 (13%), Positives = 38/100 (38%), Gaps = 4/100 (4%)

Query: 1907 TNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSLVSES 1964
            +++  + +++     S   T   L  ++   ++P S      +SS ++   +   L   +
Sbjct: 1    SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSA 60

Query: 1965 TTTSSPESESTT--TSSPESESTTTSSLVSESTTTSSPES 2002
              TS+  +      +++  + + T   L   +T      +
Sbjct: 61   AQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATT 100



 Score = 33.2 bits (76), Expect = 0.39
 Identities = 18/108 (16%), Positives = 38/108 (35%), Gaps = 10/108 (9%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSLVSES 1994
            +SS  + +++     S   T   L  ++   ++P S      +SS ++   +   L   +
Sbjct: 1    SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSA 60

Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
              TS+     TT       +T T++      T       +T      +
Sbjct: 61   AQTSAT----TTGGGLFGSTTATTTT----ATGGGLFGNATAAQPATT 100



 Score = 32.5 bits (74), Expect = 0.74
 Identities = 17/103 (16%), Positives = 37/103 (35%), Gaps = 1/103 (0%)

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
            +SS  + +++     S   T      ++    +P S      S  S++T         + 
Sbjct: 1    SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSA 60

Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
              + A+ +T      S + TT       +  ++ A++  TT  
Sbjct: 61   AQTSAT-TTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGG 102



 Score = 30.9 bits (70), Expect = 2.4
 Identities = 15/105 (14%), Positives = 38/105 (36%), Gaps = 5/105 (4%)

Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT- 1976
            SS  +   ++  L   +  T          ++  +  T+   L   S++ ++  S     
Sbjct: 1    SSSTTAGASSGGLFGSAPATGGG---LFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLF 57

Query: 1977 -TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
             +++  S +TT   L   +T T++  +      +  +    T+  
Sbjct: 58   GSAAQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGG 102



 Score = 30.5 bits (69), Expect = 2.8
 Identities = 17/103 (16%), Positives = 40/103 (38%), Gaps = 6/103 (5%)

Query: 1894 NTTTNSPESESTTTNNPESESTT--TSSPESESTTTSSLVSESTTTSSPESESTT--TSS 1949
            ++TT    S     + P +       ++  +  T+   L   S++ ++  S      +++
Sbjct: 2    SSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAA 61

Query: 1950 PESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSL 1990
              S +TT   L   +T T++  +       ++    +TT   L
Sbjct: 62   QTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGGGL 104



 Score = 30.1 bits (68), Expect = 4.7
 Identities = 13/74 (17%), Positives = 26/74 (35%), Gaps = 3/74 (4%)

Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
            S+TT   +S  +  S+PA   T        +  T+  +      SS +  +  +      
Sbjct: 2    SSTTAGASSGGLFGSAPA---TGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFG 58

Query: 2114 STTTSSPASESTTI 2127
            S   +S  +    +
Sbjct: 59   SAAQTSATTTGGGL 72



 Score = 29.8 bits (67), Expect = 5.7
 Identities = 13/109 (11%), Positives = 34/109 (31%), Gaps = 7/109 (6%)

Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
            SS  +   ++  L   +  T          ++  +  T+   L   S++ ++      + 
Sbjct: 1    SSSTTAGASSGGLFGSAPATGGG---LFGQNAANTTPTSGGGLFGSSSSQATQ----PSG 53

Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
                  +  TS+  +      S  + +TT +        T    + +  
Sbjct: 54   GGLFGSAAQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGG 102


>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB.  Isd proteins are
            iron-regulated surface proteins found in Bacillus,
            Staphylococcus and Listeria species and are responsible
            for heme scavenging from hemoproteins. The IsdB protein
            is only observed in Staphylococcus and consists of an
            N-terminal hydrophobic signal sequence, a pair of tandem
            NEAT (NEAr Transporter, pfam05031) domains which confers
            the ability to bind heme and a C-terminal sortase
            processing signal which targets the protein to the cell
            wall. IsdB is believed to make a direct contact with
            methemoglobin facilitating transfer of heme to IsdB. The
            heme is then transferred to other cell wall-bound NEAT
            domain proteins such as IsdA and IsdC.
          Length = 644

 Score = 41.8 bits (97), Expect = 0.005
 Identities = 36/174 (20%), Positives = 64/174 (36%), Gaps = 19/174 (10%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
            + E+ T  N +  +       S    T+       TT   E ES    S + ++  + S+
Sbjct: 449  DKEAFTKANADKTNKKEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSV 508

Query: 1961 VSESTTTSSPESESTTTSSP-----ESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
              E+  +S    + T  + P     ES STT + +VS +   + P              T
Sbjct: 509  EKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKP--------------T 554

Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
            T SS  ++     S  S     S+P  ++   N     + + NN  ++   + S
Sbjct: 555  TASSETTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKS 608



 Score = 40.3 bits (93), Expect = 0.016
 Identities = 39/166 (23%), Positives = 70/166 (42%), Gaps = 13/166 (7%)

Query: 1894 NTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
            N    + + +   +   E+   T S P     TT  +  ES    S + ++  + S E E
Sbjct: 457  NADKTNKKEQQDNSAKKETTPATPSKP-----TTPPVEKESQKQDSQKDDNKQSPSVEKE 511

Query: 1954 STTTSSLVSESTTTSSP---ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
            +  +S    + T  + P   E ES++T+  +  STT +  V++ TT SS  ++     S 
Sbjct: 512  NDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQN--VAKPTTASSETTKDVVQTSA 569

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
             S     S+P+ ++       +    T S  +++T  N  KS   T
Sbjct: 570  GSSEAKDSAPLQKANIK---NTNDGHTQSQNNKNTQENKAKSLPQT 612



 Score = 38.0 bits (87), Expect = 0.064
 Identities = 26/126 (20%), Positives = 54/126 (42%), Gaps = 2/126 (1%)

Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS-ESTTTISPESESTTTSSPASEST 2045
            T +   ++      ++ +    +P + S  T+ PV  ES    S + ++  + S   E+ 
Sbjct: 454  TKANADKTNKKEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSVEKEND 513

Query: 2046 TTNNPKSESTTTNNPASESITSSSPA-SESTTTSSPASESTTTSSPASESTTTSSPASES 2104
             ++    + T    PA   + SSS   ++  +T+   ++ TT SS  ++    +S  S  
Sbjct: 514  ASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTSAGSSE 573

Query: 2105 TTTSSP 2110
               S+P
Sbjct: 574  AKDSAP 579



 Score = 36.5 bits (83), Expect = 0.21
 Identities = 22/102 (21%), Positives = 48/102 (47%), Gaps = 2/102 (1%)

Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTN-NPESESTTTSSPESESTTTSSLVSESTTTS 1938
            ++V +STL  L+S      + E+  T T   P++E+  + +  +E    +  V+ + + S
Sbjct: 22   ASVAISTLLLLMSNGEAQAAEETGGTNTEAQPKTEAVASPTTTTEKAPEAKPVANAVSVS 81

Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
            + E E+ T+ + E++         E T    P +++   + P
Sbjct: 82   NKEVEAPTSETKEAKEVKEVKAPKE-TKEVKPAAKADNNTYP 122



 Score = 36.5 bits (83), Expect = 0.21
 Identities = 30/146 (20%), Positives = 65/146 (44%), Gaps = 9/146 (6%)

Query: 1968 SSPESESTTTSSP-ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
            ++P + S  T+ P E ES    S   ++  + S E E+  +    SES    +P ++   
Sbjct: 475  TTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSVEKENDAS----SESGKDKTPATKPAK 530

Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
                E ES++T+ P    +TT N    +T ++    + + +S+ +SE+  ++     +  
Sbjct: 531  G---EVESSSTT-PTKVVSTTQNVAKPTTASSETTKDVVQTSAGSSEAKDSAPLQKANIK 586

Query: 2087 TSSPASESTTTSSPASESTTTSSPES 2112
             ++     +  +    E+   S P++
Sbjct: 587  NTNDGHTQSQNNKNTQENKAKSLPQT 612



 Score = 36.1 bits (82), Expect = 0.29
 Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 8/128 (6%)

Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
            FT  N  ++       NS   E TT  +P     TT   E ES    S + ++  + S+ 
Sbjct: 453  FTKANADKTNKKEQQDNSAKKE-TTPATP--SKPTTPPVEKESQKQDSQKDDNKQSPSVE 509

Query: 1932 SESTTTSSPESESTTTSSP-----ESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1986
             E+  +S    + T  + P     ES STT + +VS +   + P + S+ T+    +++ 
Sbjct: 510  KENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTSA 569

Query: 1987 TSSLVSES 1994
             SS   +S
Sbjct: 570  GSSEAKDS 577



 Score = 35.3 bits (80), Expect = 0.48
 Identities = 37/170 (21%), Positives = 67/170 (39%), Gaps = 8/170 (4%)

Query: 1963 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
            E+ T ++ +  +       S    T+       TT   E ES    S   ++  + S   
Sbjct: 451  EAFTKANADKTNKKEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSVEK 510

Query: 2023 ESTTTISPESESTTTSSPAS---ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
            E+  +     + T  + PA    ES++T   K  STT N   ++  T+SS  ++    +S
Sbjct: 511  ENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQN--VAKPTTASSETTKDVVQTS 568

Query: 2080 PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
              S     S+P  ++   +   +    T S  +++T  +   S   T EE
Sbjct: 569  AGSSEAKDSAPLQKANIKN---TNDGHTQSQNNKNTQENKAKSLPQTGEE 615



 Score = 35.3 bits (80), Expect = 0.50
 Identities = 35/179 (19%), Positives = 68/179 (37%), Gaps = 9/179 (5%)

Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
            +++    + + E    S+   + TT ++P     TT   E ES    S   ++  + S E
Sbjct: 454  TKANADKTNKKEQQDNSA--KKETTPATP--SKPTTPPVEKESQKQDSQKDDNKQSPSVE 509

Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
             E+  +    SES    +P ++       ES STT +   S +     P + S+ T    
Sbjct: 510  KENDAS----SESGKDKTPATKPAKG-EVESSSTTPTKVVSTTQNVAKPTTASSETTKDV 564

Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
             ++   SS A +S        ++T      S++   +      +   + E  +   + P
Sbjct: 565  VQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLPQTGEESNKDMTLP 623



 Score = 34.1 bits (77), Expect = 1.0
 Identities = 33/181 (18%), Positives = 74/181 (40%), Gaps = 21/181 (11%)

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP-KSESTTTNNPASESITSSS 2069
            V +   T +   ++      ++ +   ++PA+ S  T  P + ES   ++   ++  S S
Sbjct: 448  VDKEAFTKANADKTNKKEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPS 507

Query: 2070 PASESTTTSSPASESTTTSSPA-----SESTTTSSPASESTTTSSPESESTTT------- 2117
               E+  +S    + T  + PA     S STT +   S +   + P + S+ T       
Sbjct: 508  VEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQT 567

Query: 2118 ---SSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAI 2174
               SS A +S  +++  +   ++  + +++     N++  E+    +P     +  D  +
Sbjct: 568  SAGSSEAKDSAPLQKANIKNTNDGHTQSQN-----NKNTQENKAKSLPQTGEESNKDMTL 622

Query: 2175 P 2175
            P
Sbjct: 623  P 623



 Score = 33.4 bits (75), Expect = 1.8
 Identities = 25/120 (20%), Positives = 53/120 (44%), Gaps = 8/120 (6%)

Query: 1888 NSLLSENTTTNSPESESTTTNNPESESTTTSSP-----ESESTTTSSLVSESTTTSSPES 1942
            +S   +N  + S E E+  ++    + T  + P     ES STT + +VS +   + P +
Sbjct: 496  DSQKDDNKQSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTT 555

Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
             S+ T+    +++  SS   +S        ++T     +S++   +    E+   S P++
Sbjct: 556  ASSETTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQ---ENKAKSLPQT 612


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
            replication regulator is required for loading and
            maintenance of Cdc45 on chromatin during DNA replication.
          Length = 437

 Score = 41.7 bits (98), Expect = 0.005
 Identities = 35/206 (16%), Positives = 67/206 (32%), Gaps = 14/206 (6%)

Query: 1913 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 1972
             S   S  + + T T         T      S  +SS +S   +   ++++   +SS   
Sbjct: 233  PSMKISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRRSLQRVLTDERKSSSR-- 290

Query: 1973 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPES 2032
                  +P    + T+S + E     S E+   +  S  S     +  + +    +S  S
Sbjct: 291  -----RTPSLLRSRTNSSLIEFLKRESSENLLPSLSSRTSSDLLKNKRLQKRQVDLSDSS 345

Query: 2033 ESTTTSSPASESTTTNNPKSESTTTN----NPASESITSSSPASESTTTSSPASESTTTS 2088
                      +    N  K E             E  +     +    +S        T+
Sbjct: 346  RQHEEK--LKKKQMLNEQKKELKRAISALKKSNRELSSKDIVETAEKRSSQFGQGVQVTA 403

Query: 2089 SPASESTTTSSPASESTTTSSPESES 2114
            +PA      +   +E+T++S P S+S
Sbjct: 404  TPAGNRKKDAGL-TEATSSSFPSSDS 428


>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
          Length = 332

 Score = 41.3 bits (97), Expect = 0.005
 Identities = 34/178 (19%), Positives = 61/178 (34%), Gaps = 13/178 (7%)

Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
            S + T +     S   S+  ++   T +P+S   TT  P  +  T  S            
Sbjct: 35   SNANTFTRTRPPSRPISNEEADQPNTLNPQSYVETTPPPFQQPQTEESESENEVQIQQEV 94

Query: 2002 SESTTTIS---PVSEST-TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
             +S   I    P  E      +  SE      P+ +S T ++ AS +        E T +
Sbjct: 95   EQSVDEIKITLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTI-------EETQS 147

Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
             N   E I SSS   +     +  +    + +        +    ++ T + PE+ + 
Sbjct: 148  PNVPIEGINSSSE--QLRVELAELAAEIYSDASHRVELAKNFMEPQAETEAQPEATTN 203


>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
          Length = 566

 Score = 41.3 bits (96), Expect = 0.006
 Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 5/128 (3%)

Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
             +  T  P+ E   TS+   +     +P   ++    PA   T+  + K +      PA+
Sbjct: 20   ANLNTNIPIPE-LHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAA 78

Query: 2063 ESITSSSPASESTTTSSP----ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
                  +PA     + +P    A +      P +    TS+  +      +  S ++   
Sbjct: 79   SEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKKP 138

Query: 2119 SPASESTT 2126
             PA+ +  
Sbjct: 139  DPAAHTQH 146



 Score = 40.5 bits (94), Expect = 0.013
 Identities = 22/172 (12%), Positives = 58/172 (33%), Gaps = 19/172 (11%)

Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
               +++ +       + +T    PE     TS+   +     +P   ++    P    T+
Sbjct: 6    IILIITIACINLIIANLNTNIPIPE---LHTSAATQKPDPAPAPHQAASRAPDPAVAPTS 62

Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT 2076
             +S   +     +P   ++    PA            ++   +PA     +++P      
Sbjct: 63   AASRKPD--LAQAPTPAASEKFDPAPAPH------QAASRAPDPAVAPQLAAAP------ 108

Query: 2077 TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
               P +    TS+  +      +  S ++    P + +  +  P + + ++E
Sbjct: 109  --KPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHSPPPFAYTRSME 158



 Score = 38.6 bits (89), Expect = 0.046
 Identities = 17/142 (11%), Positives = 44/142 (30%), Gaps = 13/142 (9%)

Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
             + +T    PE  ++  +          +P      + +P+     TS+   +     +P
Sbjct: 20   ANLNTNIPIPELHTSAATQK-----PDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAP 74

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT--TSSPVSESTTTI 2028
               ++    P             ++    P        +P  ++    TS+  +      
Sbjct: 75   TPAASEKFDPAPAPH------QAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQAHEAPAD 128

Query: 2029 SPESESTTTSSPASESTTTNNP 2050
            +  S ++    PA+ +  +  P
Sbjct: 129  AGTSAASKKPDPAAHTQHSPPP 150



 Score = 35.1 bits (80), Expect = 0.62
 Identities = 25/137 (18%), Positives = 50/137 (36%), Gaps = 16/137 (11%)

Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
              I+ +  +  T+ P+ E  T     S +T    PA            ++   +PA    
Sbjct: 13   ACINLIIANLNTNIPIPELHT-----SAATQKPDPAPAPH------QAASRAPDPAVAPT 61

Query: 2066 TSSSPASESTTTSSPASESTTTSSPASESTTTSSP----ASESTTTSSPESESTTTSSPA 2121
            +++S   +     +PA+      +PA     + +P    A +      P++    TS+  
Sbjct: 62   SAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQ 121

Query: 2122 SESTTIEEQGVSPHSEK 2138
            +      + G S  S+K
Sbjct: 122  AHEAPA-DAGTSAASKK 137



 Score = 34.3 bits (78), Expect = 0.88
 Identities = 20/133 (15%), Positives = 45/133 (33%), Gaps = 11/133 (8%)

Query: 1889 SLLSENTTTNSPE---SESTTTNNPESESTTTSS--PESESTTTSSLVSESTTTSSPESE 1943
             + + NT    PE   S +T   +P       +S  P+     TS+   +     +P   
Sbjct: 18   IIANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPA 77

Query: 1944 STTTSSPESESTTTSS------LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
            ++    P       +S      +  +      P++    TS+ ++      +  S ++  
Sbjct: 78   ASEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKK 137

Query: 1998 SSPESESTTTISP 2010
              P + +  +  P
Sbjct: 138  PDPAAHTQHSPPP 150


>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966).  Family of
            plant proteins with unknown function.
          Length = 308

 Score = 40.9 bits (96), Expect = 0.006
 Identities = 26/132 (19%), Positives = 48/132 (36%), Gaps = 10/132 (7%)

Query: 1995 TTTSSPESESTTTISPVSESTTTSSPV---SESTTTISPESESTTTSS-PASESTTTNNP 2050
             ++SS       +   + E + T       ++S ++          +  PA  ST T++ 
Sbjct: 90   DSSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQTDDR 149

Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
            +S  ++       S    SP       SS +S S+++S    ES   +      +  S  
Sbjct: 150  RSRDSSEAESTELSREEISPP------SSSSSPSSSSSPETLESLIKADGRLSLSFRSLE 203

Query: 2111 ESESTTTSSPAS 2122
            E ES      +S
Sbjct: 204  EDESAGRVRASS 215



 Score = 40.9 bits (96), Expect = 0.007
 Identities = 23/132 (17%), Positives = 51/132 (38%), Gaps = 13/132 (9%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPE---SESTTTSSL----VSESTTTSSPESESTTT 1947
             +++S       ++    E + T       ++S ++S L    V ++   +   +++   
Sbjct: 90   DSSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQTDDR 149

Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
             S +S    ++ L  E  +  S  S  +++SSPE   T  S + ++   + S  S     
Sbjct: 150  RSRDSSEAESTELSREEISPPSSSSSPSSSSSPE---TLESLIKADGRLSLSFRSLEE-- 204

Query: 2008 ISPVSESTTTSS 2019
                +     SS
Sbjct: 205  -DESAGRVRASS 215



 Score = 37.4 bits (87), Expect = 0.068
 Identities = 21/126 (16%), Positives = 45/126 (35%), Gaps = 5/126 (3%)

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
            SE   +SS   +     S + +  + + P +     N   S S        ++   +  +
Sbjct: 86   SEILDSSSSKGDPEEASSRKLQEESDTPPVNR--RANQSWSSSDLAEYKVYKAEEPADAS 143

Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
            +++    S  S    ++  + E  +  S +S  +++SSPE   T  S   ++        
Sbjct: 144  TQTDDRRSRDSSEAESTELSREEISPPSSSSSPSSSSSPE---TLESLIKADGRLSLSFR 200

Query: 2132 VSPHSE 2137
                 E
Sbjct: 201  SLEEDE 206



 Score = 35.5 bits (82), Expect = 0.32
 Identities = 28/140 (20%), Positives = 49/140 (35%), Gaps = 17/140 (12%)

Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
             ++SS       +S  + E + T          +   S S      V ++   +     S
Sbjct: 90   DSSSSKGDPEEASSRKLQEESDTPPVNR---RANQSWSSSDLAEYKVYKAEEPAD---AS 143

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            T T    S  ++ +     S   ISP S   ++SSP+S S        E+  +   A   
Sbjct: 144  TQTDDRRSRDSSEAESTELSREEISPPS---SSSSPSSSS------SPETLESLIKADGR 194

Query: 2065 ITSSSPASE--STTTSSPAS 2082
            ++ S  + E   +     AS
Sbjct: 195  LSLSFRSLEEDESAGRVRAS 214



 Score = 33.5 bits (77), Expect = 1.2
 Identities = 27/119 (22%), Positives = 44/119 (36%), Gaps = 5/119 (4%)

Query: 1932 SESTTTSSP--ESESTTTSSPESESTTTSSLVSESTTTSSPESES---TTTSSPESESTT 1986
            SE   +SS   + E  ++   + ES T       + + SS +           P   ST 
Sbjct: 86   SEILDSSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQ 145

Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
            T    S  ++ +     S   ISP S S++ SS  S  T     +++   + S  S   
Sbjct: 146  TDDRRSRDSSEAESTELSREEISPPSSSSSPSSSSSPETLESLIKADGRLSLSFRSLEE 204



 Score = 32.0 bits (73), Expect = 3.3
 Identities = 18/132 (13%), Positives = 37/132 (28%), Gaps = 9/132 (6%)

Query: 1871 IFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSL 1930
            I  ++++       S+                 +   N   S S        ++   +  
Sbjct: 88   ILDSSSSKGDPEEASSRKLQEES-----DTPPVNRRANQSWSSSDLAEYKVYKAEEPADA 142

Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1990
             +++    S +S    ++    E  +  S  S  +++SSPE    T  S        S  
Sbjct: 143  STQTDDRRSRDSSEAESTELSREEISPPSSSSSPSSSSSPE----TLESLIKADGRLSLS 198

Query: 1991 VSESTTTSSPES 2002
                    S   
Sbjct: 199  FRSLEEDESAGR 210


>gnl|CDD|221577 pfam12440, MAGE_N, Melanoma associated antigen family N terminal.
            This domain family is found in eukaryotes, and is
            typically between 82 and 96 amino acids in length. The
            family is found in association with pfam01454. This
            family is the N terminal of various melanoma associated
            antigens. These are tumour rejection antigens which are
            expressed on HLA-A1 of tumour cells and they are
            recognised by cytotoxic T lymphocytes (CTLs).
          Length = 96

 Score = 37.9 bits (88), Expect = 0.007
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 1/61 (1%)

Query: 2063 ESITSSSPASESTTTSSPASESTTTS-SPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
            ES +SSSP    T  S PA+ S +   SP   S+++++ A+ S + S   S S    SP+
Sbjct: 35   ESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSSSQEEESPS 94

Query: 2122 S 2122
            S
Sbjct: 95   S 95



 Score = 36.8 bits (85), Expect = 0.018
 Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 1/62 (1%)

Query: 2073 ESTTTSSPASESTTTSSPASESTTTS-SPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
            ES ++SSP    T  S PA+ S +   SP   S+++++  + S + S   S S   E   
Sbjct: 35   ESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSSSQEEESPS 94

Query: 2132 VS 2133
             S
Sbjct: 95   SS 96



 Score = 35.2 bits (81), Expect = 0.059
 Identities = 21/73 (28%), Positives = 39/73 (53%), Gaps = 6/73 (8%)

Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTS-SPESESTTTSSLVSESTTTSSPESEST 1975
             ++ E ES ++SS +   T  S P + S +   SP+  S++++++ + S + S   S   
Sbjct: 29   PAAEEEESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGS--- 85

Query: 1976 TTSSPESESTTTS 1988
              SS E ES ++S
Sbjct: 86   --SSQEEESPSSS 96



 Score = 33.3 bits (76), Expect = 0.28
 Identities = 19/73 (26%), Positives = 37/73 (50%), Gaps = 6/73 (8%)

Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTS-SPESESTTTSSLVSESTTTSSPESEST 2005
             ++ E ES ++SS +   T  S P + S +   SP+  S++++++ + S + S   S   
Sbjct: 29   PAAEEEESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSS-- 86

Query: 2006 TTISPVSESTTTS 2018
               S   ES ++S
Sbjct: 87   ---SQEEESPSSS 96



 Score = 31.8 bits (72), Expect = 0.86
 Identities = 21/68 (30%), Positives = 35/68 (51%), Gaps = 4/68 (5%)

Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
            + ES ++++P       S PA+ S   S P S    +SS  + + T+ S + E   +SS 
Sbjct: 33   EEESPSSSSPLIPGTPESVPAAGSP--SPPQSPQGASSSSTAVAATSWSQSDEG--SSSQ 88

Query: 2111 ESESTTTS 2118
            E ES ++S
Sbjct: 89   EEESPSSS 96



 Score = 30.6 bits (69), Expect = 2.2
 Identities = 18/67 (26%), Positives = 37/67 (55%), Gaps = 1/67 (1%)

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTI-SPESESTTTSSPASESTTTNNPKSEST 2055
             ++ E ES ++ SP+   T  S P + S +   SP+  S+++++ A+ S + ++  S S 
Sbjct: 29   PAAEEEESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSSSQ 88

Query: 2056 TTNNPAS 2062
               +P+S
Sbjct: 89   EEESPSS 95



 Score = 30.6 bits (69), Expect = 2.5
 Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 2/70 (2%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
             ++ E ES ++SSP    T  S   + S   S P+S    +SS  + + T+ S   E ++
Sbjct: 29   PAAEEEESPSSSSPLIPGTPESVPAAGSP--SPPQSPQGASSSSTAVAATSWSQSDEGSS 86

Query: 1997 TSSPESESTT 2006
            +   ES S++
Sbjct: 87   SQEEESPSSS 96



 Score = 30.2 bits (68), Expect = 3.3
 Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 3/61 (4%)

Query: 2081 ASESTT--TSSPASESTTTSSPASESTTTS-SPESESTTTSSPASESTTIEEQGVSPHSE 2137
            A E  +  +SSP    T  S PA+ S +   SP+  S+++++ A+ S +  ++G S   E
Sbjct: 31   AEEEESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSSSQEE 90

Query: 2138 K 2138
            +
Sbjct: 91   E 91


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 41.2 bits (96), Expect = 0.007
 Identities = 42/279 (15%), Positives = 69/279 (24%), Gaps = 18/279 (6%)

Query: 1903 ESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE------STTTSSPESESTT 1956
            +         +S  +  P+       +   E      P  E       T +  PE     
Sbjct: 526  DKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDP 585

Query: 1957 TSSLVSESTTTS-SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
                  E       P S    T     +      +        SP+S       P     
Sbjct: 586  KHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPK----RPPPPQR 641

Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
             +S    E    I        +  P        +PK +    ++    +  S    +   
Sbjct: 642  PSSPERPEGPKIIK-------SPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVV 694

Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
               S  S    T      +  T+            E        P +E     E    P 
Sbjct: 695  LDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPE 754

Query: 2136 SEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAI 2174
             E+   +E P + P  D+    F E      + + DEA+
Sbjct: 755  EERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAM 793



 Score = 33.1 bits (75), Expect = 2.4
 Identities = 36/273 (13%), Positives = 70/273 (25%), Gaps = 19/273 (6%)

Query: 1897 TNSPESESTTTNNPESESTTTSSP--ESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
            ++ P+         E E      P  E + +   +L  +      P+         + + 
Sbjct: 540  SDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKR 599

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
              ++   +   +   PE      S    ES  +          SSPE      I    + 
Sbjct: 600  PRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKP 659

Query: 2015 TTTSSP-----------------VSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
              +  P                  ++S  T +      +  S   E+         +T  
Sbjct: 660  PKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPR 719

Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
              P         P        +   +     +P  E  T        T      +E    
Sbjct: 720  PLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKE 779

Query: 2118 SSPASESTTIEEQGVSPHSEKLSANEDPEEFPN 2150
                +E+   +E    P S     ++ P + P+
Sbjct: 780  EDIHAETGEPDEAMKRPDSPSEHEDKPPGDHPS 812


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 614

 Score = 41.3 bits (97), Expect = 0.007
 Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 17/138 (12%)

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
             +    S             ++      P++ +  S SP+  S      A +S T  +  
Sbjct: 364  QKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGT 423

Query: 2092 SESTTTSSPASESTTTSSPESESTTTSSPASE---------STTIE-EQGVSPHSEKLSA 2141
              + +   PA+      S   ++   +    E         S      + +   +E+ + 
Sbjct: 424  PPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG 483

Query: 2142 NE-------DPEEFPNED 2152
            N          E F  ED
Sbjct: 484  NIKEAPTGTQKEIFTEED 501



 Score = 34.0 bits (78), Expect = 1.3
 Identities = 17/124 (13%), Positives = 33/124 (26%), Gaps = 4/124 (3%)

Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
             +    S           + ++      P + +  + SP   S        +S T  +  
Sbjct: 364  QKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGT 423

Query: 2002 SESTTTISPVSEST-TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST---TT 2057
              + +   P +      S+           E +    S  +S   +T  P  E     T 
Sbjct: 424  PPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG 483

Query: 2058 NNPA 2061
            N   
Sbjct: 484  NIKE 487


>gnl|CDD|203922 pfam08377, MAP2_projctn, MAP2/Tau projection domain.  This domain is
            found in the MAP2/Tau family of proteins which includes
            MAP2, MAP4, Tau, and their homologs. All isoforms contain
            a conserved C-terminal domain containing tubulin-binding
            repeats (pfam00418), and a N-terminal projection domain
            of varying size. This domain has a net negative charge
            and exerts a long-range repulsive force. This provides a
            mechanism that can regulate microtubule spacing which
            might facilitate efficient organelle transport.
          Length = 1134

 Score = 41.3 bits (96), Expect = 0.008
 Identities = 60/300 (20%), Positives = 112/300 (37%), Gaps = 36/300 (12%)

Query: 1878 SESTVVMSTLNS-LLSENTTTNSPESESTTTNNPESESTTTS--SPESESTTTSSLVSES 1934
            SE+T V+  ++S  +      N    E  TT+  + E++T S   P    T   + + E+
Sbjct: 9    SEATTVLGDVHSPAVEGFVGENISGEEKGTTDQEKKETSTPSVQEPTLTETEPQTKLEET 68

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES---ESTTTSSPESESTTTS--S 1989
            +  S  E+ +    S + +      + + +  + S E    +  T  + + +S   S   
Sbjct: 69   SKVSIEETVAKEEESLKLKDDKAGVIQTSTEHSFSKEDQKGQEQTIEALKQDSFPISLEQ 128

Query: 1990 LVSESTTTSSPESESTTTISPVSE----------------------STTTSS---PVSES 2024
             V+++   +    + T+    VSE                      S T +    P  E 
Sbjct: 129  AVTDAAMATKTLEKVTSEPEAVSEKREIQGLFEEDIADKSKLEGAGSATVAEVEMPFYED 188

Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASES 2084
             + +S   E++      + ST   +   E + +   A ES+ + SP ++       A  S
Sbjct: 189  KSGMSKYFETSALKEDVTRSTGLGSDYYELSDSRGNAQESLDTVSPKNQQDEKELLAKAS 248

Query: 2085 TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
               S PA E+    S  ++S T+  P       SSP     TI+ +      +  S N+D
Sbjct: 249  -QPSPPAHEAGY--STLAQSYTSDHPSELPEEPSSPQERMFTIDPKVYGEKRDLHSKNKD 305


>gnl|CDD|227680 COG5391, COG5391, Phox homology (PX) domain protein [Intracellular
            trafficking and secretion / General function prediction
            only].
          Length = 524

 Score = 40.9 bits (96), Expect = 0.009
 Identities = 29/136 (21%), Positives = 51/136 (37%), Gaps = 11/136 (8%)

Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP--------ESESTTTSSL 1960
            +P++ES+ + S  S S++ S   S           S+  S+P          ES+     
Sbjct: 8    SPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSGFESSAKLPR 67

Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSE--STTTSSPESESTTTISPVSESTTTS 2018
            +S++ +   P    T + +     +   S  SE  S          T+   P+S S T  
Sbjct: 68   ISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTSHTIL 127

Query: 2019 SPVSESTTTISPESES 2034
                 ST +  P+S +
Sbjct: 128  DYFISSTVSN-PQSLT 142



 Score = 39.4 bits (92), Expect = 0.025
 Identities = 33/142 (23%), Positives = 55/142 (38%), Gaps = 11/142 (7%)

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP--------ESESTTTSS 1989
            SSP++ES+ + S  S S++ S   S           S+  S+P          ES+    
Sbjct: 7    SSPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSGFESSAKLP 66

Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSPVSE--STTTISPESESTTTSSPASESTTT 2047
             +S++ +   P    T + +     +   S  SE  S   +      T+   P S S  T
Sbjct: 67   RISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTS-HT 125

Query: 2048 NNPKSESTTTNNPASESITSSS 2069
                  S+T +NP S ++   S
Sbjct: 126  ILDYFISSTVSNPQSLTLLVDS 147



 Score = 39.4 bits (92), Expect = 0.027
 Identities = 32/139 (23%), Positives = 55/139 (39%), Gaps = 5/139 (3%)

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
            SSP++ES+ + S  S S++ S   S           S+   +P+      S    ES+  
Sbjct: 7    SSPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSG--FESSAK 64

Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASE--SITSSSPASESTTTSSPASEST 2085
            +   S++ +   P    T +       +  ++ ASE  S+         T+   P S S 
Sbjct: 65   LPRISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTS- 123

Query: 2086 TTSSPASESTTTSSPASES 2104
             T      S+T S+P S +
Sbjct: 124  HTILDYFISSTVSNPQSLT 142



 Score = 34.4 bits (79), Expect = 0.97
 Identities = 27/122 (22%), Positives = 44/122 (36%), Gaps = 6/122 (4%)

Query: 2009 SPVSESTTTSSPVSESTTTIS-PESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
            SP +ES+ + S  S S++      +      SP + S  +     +   +     ES   
Sbjct: 8    SPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSG---FESSAK 64

Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTSSPASE--STTTSSPESESTTTSSPASEST 2125
                S++ +   P    T + + A   +   S ASE  S          T+   P S S 
Sbjct: 65   LPRISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTSH 124

Query: 2126 TI 2127
            TI
Sbjct: 125  TI 126



 Score = 34.0 bits (78), Expect = 1.3
 Identities = 28/138 (20%), Positives = 54/138 (39%), Gaps = 3/138 (2%)

Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1977
            SSP++ES+ + S  S S++ S   S           S+  S+ +           ES+  
Sbjct: 7    SSPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRY--SGFESSAK 64

Query: 1978 SSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE-STTTISPESESTT 2036
                S++ +        T + +     +   S  SE  +    +S    T++ P   ++ 
Sbjct: 65   LPRISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTSH 124

Query: 2037 TSSPASESTTTNNPKSES 2054
            T      S+T +NP+S +
Sbjct: 125  TILDYFISSTVSNPQSLT 142



 Score = 31.7 bits (72), Expect = 6.4
 Identities = 22/101 (21%), Positives = 35/101 (34%), Gaps = 12/101 (11%)

Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS--------ESTTTSSP 2110
            +P +ES  S S  S S++ S  +S           S+  S+P          ES+     
Sbjct: 8    SPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSGFESSAKLPR 67

Query: 2111 ESESTTTSSP---ASESTTIEEQGVSPHSEKLSANEDPEEF 2148
             S++ +   P    + S TI       HS   S      + 
Sbjct: 68   ISDAPSFVPPPGGHTISYTIAIHDSKIHSRA-SEFRSLRDM 107



 Score = 31.3 bits (71), Expect = 7.7
 Identities = 13/45 (28%), Positives = 21/45 (46%)

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            SSP +ES+ + S  S S++ S  +S           S+  S+P  
Sbjct: 7    SSPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLD 51


>gnl|CDD|227400 COG5068, ARG80, Regulator of arginine metabolism and related MADS
            box-containing transcription factors [Transcription].
          Length = 412

 Score = 40.8 bits (95), Expect = 0.009
 Identities = 37/253 (14%), Positives = 72/253 (28%), Gaps = 24/253 (9%)

Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS-SP 1970
            +E       E+    T +     +   S E +S   S   +  + +S   S S + S  P
Sbjct: 121  TEVLLLVISENGLVHTFTTPKLESVVKSLEGKSLIQSPCSNAPSDSSEEPSSSASFSVDP 180

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP-----------------VSE 2013
               +   S   + S  T+ +  ++  T   +  S+    P                 + E
Sbjct: 181  NDNNPMGSFQHNGSPQTNFIPLQNPQTQQYQQHSSRKDHPTVPHSNTNNGRPPAKFMIPE 240

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
              ++ S +   +  IS       +S+      +     +     NNP  E         E
Sbjct: 241  LHSSHSTLDLPSDFISDSGFPNQSSTSIFPLDSAIIQITPPHLPNNPPQE------NRHE 294

Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
              +  S     T         +   SP +  +  +     S    +   E  +   +  S
Sbjct: 295  LYSNDSSMVSETPPPKNLPNGSPNQSPLNNLSRGNPASPNSIIRENNQVEDESFNGRQGS 354

Query: 2134 PHSEKLSANEDPE 2146
                 L +   P 
Sbjct: 355  AIWNALISTTQPN 367


>gnl|CDD|173171 PRK14708, PRK14708, flagellin; Provisional.
          Length = 888

 Score = 41.1 bits (96), Expect = 0.009
 Identities = 42/268 (15%), Positives = 106/268 (39%), Gaps = 12/268 (4%)

Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
            + T +N  +T+  +T + L    + +N+  + +   +     ++T S  ++      S+ 
Sbjct: 105  YATKSNVSATIAGATADDLRGTQSFSNAVATSNVIFDGTAGGTSTASGTDTLGGGIVSIA 164

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT---S 1988
            + +  T    +++T   S  S  T  ++       +S     + T + P +  + T    
Sbjct: 165  AGTAVTVLGAADATALGSVLSVGTAAATATGADLISSLTNGSTATATGPAAGDSITVNGK 224

Query: 1989 SLVSESTTTSSPESESTTTI---SPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
            ++   +   ++ +S    TI     ++    T   ++ +TT     + S  T+      +
Sbjct: 225  TITFTTAGAATADSNGNYTIGLDQTLTALLATIDTINGNTT-----NPSVVTAGKLELHS 279

Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
             TN+P + S          +   +  + +  T++ A+ S TT    +    +++  ++ T
Sbjct: 280  GTNSPLTISDNAGGAVLAKLGLGAQVTTTAGTTAAANISATTQLFNTHGGLSTTAIADGT 339

Query: 2106 T-TSSPESESTTTSSPASESTTIEEQGV 2132
            T T + ++ +  TS     +  +   GV
Sbjct: 340  TLTVNGKTITFKTSDAPQGNNILTGSGV 367



 Score = 38.0 bits (88), Expect = 0.083
 Identities = 42/253 (16%), Positives = 94/253 (37%), Gaps = 6/253 (2%)

Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
            S  N  L  N    +  + S T     ++    +   S +  TS+++ + T   +  +  
Sbjct: 93   SIANQALQTNVGYATKSNVSATIAGATADDLRGTQSFSNAVATSNVIFDGTAGGTSTASG 152

Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE-----STTTSS 1999
            T T      S    + V+      +    S  +    + + T + L+S      + T + 
Sbjct: 153  TDTLGGGIVSIAAGTAVTVLGAADATALGSVLSVGTAAATATGADLISSLTNGSTATATG 212

Query: 2000 PESESTTTISPVSES-TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
            P +  + T++  + + TT  +  ++S    +   + T T+  A+  T   N  + S  T 
Sbjct: 213  PAAGDSITVNGKTITFTTAGAATADSNGNYTIGLDQTLTALLATIDTINGNTTNPSVVTA 272

Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
                    ++SP + S              +  + +  T++ A+ S TT    +    ++
Sbjct: 273  GKLELHSGTNSPLTISDNAGGAVLAKLGLGAQVTTTAGTTAAANISATTQLFNTHGGLST 332

Query: 2119 SPASESTTIEEQG 2131
            +  ++ TT+   G
Sbjct: 333  TAIADGTTLTVNG 345



 Score = 34.5 bits (79), Expect = 0.84
 Identities = 40/217 (18%), Positives = 78/217 (35%), Gaps = 6/217 (2%)

Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
             T S VS +   ++ +    T S   S +  TS+++ + T   +  +  T T      S 
Sbjct: 106  ATKSNVSATIAGATADDLRGTQSF--SNAVATSNVIFDGTAGGTSTASGTDTLGGGIVSI 163

Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
               + V+      +    S  ++   + + T +  +S  T   +     T T   A +S 
Sbjct: 164  AAGTAVTVLGAADATALGSVLSVGTAAATATGADLISSLTNGSTA----TATGPAAGDSI 219

Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
            T N      TT     ++S  + +   + T T+  A+  T   +  + S  T+      +
Sbjct: 220  TVNGKTITFTTAGAATADSNGNYTIGLDQTLTALLATIDTINGNTTNPSVVTAGKLELHS 279

Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
             T+SP + S            +  Q  +      +AN
Sbjct: 280  GTNSPLTISDNAGGAVLAKLGLGAQVTTTAGTTAAAN 316


>gnl|CDD|220779 pfam10488, PP1c_bdg, Phosphatase-1 catalytic subunit binding region. 
            This conserved C-terminus appears to be a protein
            phosphatase-1 catalytic subunit (PP1C) binding region,
            which may in some circumstances also be retroviral in
            origin since it is found in both herpes simplex virus and
            in mouse and man. This domain is found in Gadd-34
            apoptosis-associated proteins as well as the constitutive
            repressor of eIF2-alpha phosphorylation/protein
            phosphatase 1, regulatory (inhibitor) subunit 15b,
            otherwise known as CReP. Diverse stressful conditions are
            associated with phosphorylation of the {alpha} subunit of
            eukaryotic translation initiation factor 2 (eIF2{alpha})
            on serine 51. This signaling event, which is conserved
            from yeast to mammals, negatively regulates the guanine
            nucleotide exchange factor, eIF2-B and inhibits the
            recycling of eIF2 to its active GTP bound form. In
            mammalian cells eIF2{alpha} phosphorylation emerges as an
            important event in stress signaling that impacts on gene
            expression at both the translational and transcriptional
            levels.
          Length = 307

 Score = 40.4 bits (94), Expect = 0.010
 Identities = 36/177 (20%), Positives = 66/177 (37%), Gaps = 9/177 (5%)

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES-----TTTSSPESESTT 1986
            S+  ++S  ES S    S +    + SSL SES      E        T +  P +    
Sbjct: 24   SDLESSSDVESISWDEESEDDGFDSDSSL-SESDREQDDEGLHLWNSFTKSVDPYNPLNF 82

Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
            T+++ + +T    P S   +  S     ++   P+  +    S E +S  +S+   ES  
Sbjct: 83   TATIQTAATIKPKPPSSE-SDWSGEENVSSQEGPLPSTPEHSSSEDDSWESSADEEESLK 141

Query: 2047 TNNPKSESTTTNNPAS--ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
              N   ++    NP +      +S    + +   S  +     +  + +ST  S  A
Sbjct: 142  LWNSFCQNDDPYNPLNFKAPFQTSGKNPKGSKHDSKTNSEQNVAIRSLKSTRLSCKA 198



 Score = 36.2 bits (83), Expect = 0.18
 Identities = 37/185 (20%), Positives = 58/185 (31%), Gaps = 35/185 (18%)

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES--------ESTTTISPVSE 2013
            S+  ++S  ES S    S +    + SSL SES      E         +S    +P++ 
Sbjct: 24   SDLESSSDVESISWDEESEDDGFDSDSSL-SESDREQDDEGLHLWNSFTKSVDPYNPLNF 82

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA-SESITSSSPAS 2072
            + T  +       TI P+  S+ +     E    N    E    + P  S S   S  +S
Sbjct: 83   TATIQTAA-----TIKPKPPSSESDWSGEE----NVSSQEGPLPSTPEHSSSEDDSWESS 133

Query: 2073 ESTTTSSPASES----------------TTTSSPASESTTTSSPASESTTTSSPESESTT 2116
                 S     S                  TS    + +   S  +     +    +ST 
Sbjct: 134  ADEEESLKLWNSFCQNDDPYNPLNFKAPFQTSGKNPKGSKHDSKTNSEQNVAIRSLKSTR 193

Query: 2117 TSSPA 2121
             S  A
Sbjct: 194  LSCKA 198



 Score = 35.0 bits (80), Expect = 0.42
 Identities = 38/188 (20%), Positives = 64/188 (34%), Gaps = 21/188 (11%)

Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPES--------ESTTTSSPESE 1953
            S+  ++++ ES S    S +    + SSL SES      E         +S     P + 
Sbjct: 24   SDLESSSDVESISWDEESEDDGFDSDSSL-SESDREQDDEGLHLWNSFTKSVD---PYNP 79

Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
               T+++ + +T    P S  +  S  E+ S+    L S    +SS +          S 
Sbjct: 80   LNFTATIQTAATIKPKPPSSESDWSGEENVSSQEGPLPSTPEHSSSEDDS-----WESSA 134

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
                S  +  S              +P   S          + TN+  + +I S     +
Sbjct: 135  DEEESLKLWNSFCQNDDPYNPLNFKAPFQTSGKNPKGSKHDSKTNSEQNVAIRS----LK 190

Query: 2074 STTTSSPA 2081
            ST  S  A
Sbjct: 191  STRLSCKA 198



 Score = 30.8 bits (69), Expect = 8.9
 Identities = 28/134 (20%), Positives = 51/134 (38%), Gaps = 5/134 (3%)

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
            S+  ++S   S S      ESE     S +S S +      E     N  ++S+   +P 
Sbjct: 24   SDLESSSDVESISWDE---ESEDDGFDSDSSLSESDREQDDEGLHLWNSFTKSVDPYNPL 80

Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
            + + T  + A+       P+SES  +      S     P +   ++S   S  ++ +E+ 
Sbjct: 81   NFTATIQTAATIK--PKPPSSESDWSGEENVSSQEGPLPSTPEHSSSEDDSWESSADEEE 138

Query: 2132 VSPHSEKLSANEDP 2145
                      N+DP
Sbjct: 139  SLKLWNSFCQNDDP 152


>gnl|CDD|219929 pfam08604, Nup153, Nucleoporin Nup153-like.  This family contains
            both the nucleoporin Nup153 from human and Nup153 from
            fission yeast. These have been demonstrated to be
            functionally equivalent.
          Length = 519

 Score = 40.8 bits (95), Expect = 0.010
 Identities = 49/236 (20%), Positives = 89/236 (37%), Gaps = 23/236 (9%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            P  +   T    S ST  S  +  S T   P    +  S    E+       + +    +
Sbjct: 253  PPVQRLVTPKSRSVSTNRSGYIKPSLT---PSGVFSAVSRRLDEACEDDVRKN-ALPKQN 308

Query: 1970 PESESTT---TSSPESESTTTSS--LVSESTT---TSSPESESTTTISPVSESTTTSSPV 2021
            P+SE  +    S+P +   ++    +  E  +   +   E E    + P       +SP 
Sbjct: 309  PKSERFSYPIFSTPAANGLSSGGGKMTRERPSFASSKPHEEELEAPVLPKISLPIKTSPA 368

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
              + T  SPE  +T + SP S+ T   + + + T+     S   T SSP  +ST     +
Sbjct: 369  LPTFTFSSPEDTATFSHSPISKDTPAKSQEVKITS----PSPQFTFSSPIVKST----ES 420

Query: 2082 SESTTTSSPASESTTTSSPASESTTTSS---PESESTTTSSPASESTTIEEQGVSP 2134
            +    + S     +   +  +E+T   S   P+ E   T +   +ST +++    P
Sbjct: 421  NVEPPSPSKEFTFSVPVAKFTEATGDKSLVVPKFEFKPTHTATVQSTNLKDNEPKP 476



 Score = 39.3 bits (91), Expect = 0.030
 Identities = 40/232 (17%), Positives = 83/232 (35%), Gaps = 13/232 (5%)

Query: 1893 ENTTTNSPESESTTTNNPESESTT---TSSPESESTTTSS--LVSESTT---TSSPESES 1944
            +    +     +    NP+SE  +    S+P +   ++    +  E  +   +   E E 
Sbjct: 292  DEACEDDVRKNALPKQNPKSERFSYPIFSTPAANGLSSGGGKMTRERPSFASSKPHEEEL 351

Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
                 P+      +S    + T SSPE  +T + SP S+ T   S   + T+ S   + S
Sbjct: 352  EAPVLPKISLPIKTSPALPTFTFSSPEDTATFSHSPISKDTPAKSQEVKITSPSPQFTFS 411

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            +  +     +    SP  E T ++ P ++ T  +       +   PK E   T+    +S
Sbjct: 412  SPIVKSTESNVEPPSPSKEFTFSV-PVAKFTEATG----DKSLVVPKFEFKPTHTATVQS 466

Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
                    + T  +   +++    S      +    +S S    + +  + +
Sbjct: 467  TNLKDNEPKPTFGAFKPAKTLKEGSVLDLLKSPGFFSSPSPKREATQKTANS 518



 Score = 35.8 bits (82), Expect = 0.27
 Identities = 33/210 (15%), Positives = 77/210 (36%), Gaps = 10/210 (4%)

Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
            + +P    +  S  + E+         +    +P+SE  +     + +    S      T
Sbjct: 277  SLTPSGVFSAVSRRLDEACEDDV-RKNALPKQNPKSERFSYPIFSTPAANGLSSGGGKMT 335

Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
               P   S+       E    + P+      +SPA  + T ++P+  +T +++P S+   
Sbjct: 336  RERPSFASSKPHE--EELEAPVLPKISLPIKTSPALPTFTFSSPEDTATFSHSPISKDTP 393

Query: 2067 SSSPASESTTTSSPASESTTT--SSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
            + S   + T+ S   + S+    S+ ++    + S      T S P ++ T  +    +S
Sbjct: 394  AKSQEVKITSPSPQFTFSSPIVKSTESNVEPPSPSK---EFTFSVPVAKFTEATG--DKS 448

Query: 2125 TTIEEQGVSPHSEKLSANEDPEEFPNEDVF 2154
              + +    P       + + ++   +  F
Sbjct: 449  LVVPKFEFKPTHTATVQSTNLKDNEPKPTF 478



 Score = 32.8 bits (74), Expect = 2.8
 Identities = 34/148 (22%), Positives = 58/148 (39%), Gaps = 14/148 (9%)

Query: 1886 TLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1945
            T +S     T ++SP S+ T   + E + T+ S   + S+        +    SP S+  
Sbjct: 373  TFSSPEDTATFSHSPISKDTPAKSQEVKITSPSPQFTFSSPIVKSTESNVEPPSP-SKEF 431

Query: 1946 TTSSPE---SESTTTSSLVS---ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
            T S P    +E+T   SLV    E   T +   +ST     E + T  +   +++    S
Sbjct: 432  TFSVPVAKFTEATGDKSLVVPKFEFKPTHTATVQSTNLKDNEPKPTFGAFKPAKTLKEGS 491

Query: 2000 -------PESESTTTISPVSESTTTSSP 2020
                   P   S+ +    +   T +SP
Sbjct: 492  VLDLLKSPGFFSSPSPKREATQKTANSP 519


>gnl|CDD|219106 pfam06614, Neuromodulin, Neuromodulin.  This family consists of
            several neuromodulin (Axonal membrane protein GAP-43)
            sequences and is found in conjunction with pfam00612.
            GAP-43 is a neuronal calmodulin-binding phosphoprotein
            that is concentrated in growth cones and pre-synaptic
            terminals.
          Length = 174

 Score = 39.1 bits (90), Expect = 0.011
 Identities = 34/159 (21%), Positives = 62/159 (38%), Gaps = 9/159 (5%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
            T T + E+E+  T+ P  + ++ +  E +   +S    E     +P S     +S E+ES
Sbjct: 21   TATEATEAETPKTDEPTKDGSSPAE-EKKGEGSSDKPQEQPAPQAPASSEEKQASAETES 79

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
             T +S      T +SP S++      E         V+ +  T+    ++T   +P  E 
Sbjct: 80   ATKAS------TDNSPSSKADVAPLKEESKKADVPAVTAAAATTPAAEDATAKAAPQPEQ 133

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
             T  S  S+         E+  + S   E       K++
Sbjct: 134  ETAES--SQEEEKKDAVEETKPSESAQQEEAKEEEAKAD 170



 Score = 37.1 bits (85), Expect = 0.044
 Identities = 33/161 (20%), Positives = 60/161 (37%), Gaps = 11/161 (6%)

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
            + E+ T +      T  +   ++  ++ + E +   +     E     +P S      S 
Sbjct: 16   KGEAKTATEATEAETPKTDEPTKDGSSPAEEKKGEGSSDKPQEQPAPQAPASSEEKQASA 75

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS-PASESTTTSSPASESTTTSS 2089
            E+ES T +S      T N+P S++     P  E    +  PA  +   ++PA+E  T  +
Sbjct: 76   ETESATKAS------TDNSPSSKADVA--PLKEESKKADVPAVTAAAATTPAAEDATAKA 127

Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
                   T+  + E         E T  S  A +    EE+
Sbjct: 128  APQPEQETAESSQEEEKKD--AVEETKPSESAQQEEAKEEE 166



 Score = 34.5 bits (78), Expect = 0.39
 Identities = 26/138 (18%), Positives = 52/138 (37%), Gaps = 6/138 (4%)

Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1968
            N + E+ T +      T  +   ++  ++ + E +   +S    E     +  S     +
Sbjct: 14   NKKGEAKTATEATEAETPKTDEPTKDGSSPAEEKKGEGSSDKPQEQPAPQAPASSEEKQA 73

Query: 1969 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI 2028
            S E+ES T +S      T +S  S++      E      +  V+ +  T+    ++T   
Sbjct: 74   SAETESATKAS------TDNSPSSKADVAPLKEESKKADVPAVTAAAATTPAAEDATAKA 127

Query: 2029 SPESESTTTSSPASESTT 2046
            +P+ E  T  S   E   
Sbjct: 128  APQPEQETAESSQEEEKK 145



 Score = 32.5 bits (73), Expect = 1.5
 Identities = 40/174 (22%), Positives = 62/174 (35%), Gaps = 18/174 (10%)

Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
            P  E+      E+++ T ++ A    T    K  S+       E   SS    E     +
Sbjct: 7    PSEEAVENKKGEAKTATEATEAETPKTDEPTKDGSSPAEEKKGE--GSSDKPQEQPAPQA 64

Query: 2080 PASESTTTSSPASESTTT-------SSPASESTTTSSPESESTTTSSPASESTTIEE--- 2129
            PAS     +S  +ES T        SS A  +      +       + A+ +T   E   
Sbjct: 65   PASSEEKQASAETESATKASTDNSPSSKADVAPLKEESKKADVPAVTAAAATTPAAEDAT 124

Query: 2130 QGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
               +P  E+ +A    EE   + V E   +E      S Q +EA  E   A +E
Sbjct: 125  AKAAPQPEQETAESSQEEEKKDAVEETKPSE------SAQQEEAKEEEAKADQE 172


>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2. 
          Length = 466

 Score = 40.4 bits (95), Expect = 0.013
 Identities = 21/120 (17%), Positives = 39/120 (32%), Gaps = 10/120 (8%)

Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
            +  SLV ES+   S                TTSS  + +   ++P + +   S      T
Sbjct: 100  SIVSLVEESSIIESGAPIPPIPGDGSGFPITTSSTTTPAILDVTPTTRTVHVS-----RT 154

Query: 2046 TTNNPKSESTTTNNPASES-----ITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
              NNP     +   P   +     +  S     + +      ++   S   +    +S+P
Sbjct: 155  QYNNPLFTDPSVLQPPQPAEVSGHVLVSGQTIGTHSYEEIPMDTFAVSEGTTPPPISSTP 214



 Score = 38.0 bits (89), Expect = 0.069
 Identities = 25/115 (21%), Positives = 35/115 (30%), Gaps = 3/115 (2%)

Query: 1934 STTTSSPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
            S    + E E           S     L          + E  T S       TT     
Sbjct: 327  SPIAPAEEIELQPLGEHSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTE--DF 384

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
             ++   S  S ++TT + +  S+T   PV      + PES  TT   P      T
Sbjct: 385  STSQLVSSSSRTSTTNTTIPLSSTPDVPVYYGPDIVLPESPGTTPIVPVPPDLPT 439



 Score = 36.9 bits (86), Expect = 0.15
 Identities = 26/117 (22%), Positives = 36/117 (30%), Gaps = 7/117 (5%)

Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTT-----TSSLVSESTTTSSPESESTTTSSPES 1982
            S +         P  E +  +SP  +           L  E  T S       TT    +
Sbjct: 327  SPIAPAEEIELQPLGEHSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTEDFST 386

Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
                +SS  + +T T+ P S +     PV        P S  TT I P      T  
Sbjct: 387  SQLVSSSSRTSTTNTTIPLSSTPDV--PVYYGPDIVLPESPGTTPIVPVPPDLPTVI 441



 Score = 36.9 bits (86), Expect = 0.15
 Identities = 30/130 (23%), Positives = 43/130 (33%), Gaps = 30/130 (23%)

Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS-----SPESESTTTSSP 1980
            +  SLV ES+   S                TTS     STTT      +P + +   S  
Sbjct: 100  SIVSLVEESSIIESGAPIPPIPGDGSGFPITTS-----STTTPAILDVTPTTRTVHVSR- 153

Query: 1981 ESESTTTSSLVSE-STTTSSPESE-------STTTISPVS--ESTTTSSPVSESTTTISP 2030
               +   + L ++ S       +E       S  TI   S  E    +  VSE TT    
Sbjct: 154  ---TQYNNPLFTDPSVLQPPQPAEVSGHVLVSGQTIGTHSYEEIPMDTFAVSEGTTP--- 207

Query: 2031 ESESTTTSSP 2040
                  +S+P
Sbjct: 208  ---PPISSTP 214



 Score = 36.5 bits (85), Expect = 0.17
 Identities = 28/152 (18%), Positives = 44/152 (28%), Gaps = 17/152 (11%)

Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISP-ESESTTTSSPASESTTTNNPKSESTT 2056
             +       T +PV       S V  +  +I     ES+   S A       +      T
Sbjct: 71   GTRPVRVVGTGTPVRPPVVVESTVGPTDPSIVSLVEESSIIESGAPIPPIPGDGSGFPIT 130

Query: 2057 TNNPASESITSSSPASESTTTSSPASE-------STTTSSPASE-------STTTSSPAS 2102
            T++  + +I   +P + +   S            S       +E       S  T    S
Sbjct: 131  TSSTTTPAILDVTPTTRTVHVSRTQYNNPLFTDPSVLQPPQPAEVSGHVLVSGQTIGTHS 190

Query: 2103 ESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
                     + S  T+ P   ST I   GV  
Sbjct: 191  YEEIPMDTFAVSEGTTPPPISSTPI--PGVRR 220



 Score = 36.5 bits (85), Expect = 0.19
 Identities = 22/101 (21%), Positives = 29/101 (28%), Gaps = 2/101 (1%)

Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
                 S     L          + E  T S       TT    +    +SS  + +T T+
Sbjct: 343  HSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTEDFSTSQLVSSSSRTSTTNTT 402

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
             P S +               PES  TT I PV     T  
Sbjct: 403  IPLSSTPDVPVYYGPDIV--LPESPGTTPIVPVPPDLPTVI 441



 Score = 34.6 bits (80), Expect = 0.83
 Identities = 26/126 (20%), Positives = 42/126 (33%), Gaps = 11/126 (8%)

Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
            S    + E E    + P+ E +  +SPV +    I  + +                    
Sbjct: 327  SPIAPAEEIE----LQPLGEHSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLL-----L 377

Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
              TT + ++  + SSS  + +T T+ P S  +T   P         P S  TT   P   
Sbjct: 378  DETTEDFSTSQLVSSSSRTSTTNTTIPLS--STPDVPVYYGPDIVLPESPGTTPIVPVPP 435

Query: 2114 STTTSS 2119
               T  
Sbjct: 436  DLPTVI 441



 Score = 34.2 bits (79), Expect = 0.90
 Identities = 20/125 (16%), Positives = 37/125 (29%), Gaps = 20/125 (16%)

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
            +  SLV ES+   S                TTSS      TT+    +    ++P + + 
Sbjct: 100  SIVSLVEESSIIESGAPIPPIPGDGSGFPITTSS------TTTPAILD----VTPTTRTV 149

Query: 2016 TTSSPVSE-------STTTISPESEST---TTSSPASESTTTNNPKSESTTTNNPASESI 2065
              S            S       +E +     S     + +      ++   +   +   
Sbjct: 150  HVSRTQYNNPLFTDPSVLQPPQPAEVSGHVLVSGQTIGTHSYEEIPMDTFAVSEGTTPPP 209

Query: 2066 TSSSP 2070
             SS+P
Sbjct: 210  ISSTP 214



 Score = 33.4 bits (77), Expect = 1.6
 Identities = 28/165 (16%), Positives = 54/165 (32%), Gaps = 33/165 (20%)

Query: 1870 IIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS 1929
            ++ +T   ++ ++V     SL+ E++   S         +      TTSS      TT+ 
Sbjct: 89   VVESTVGPTDPSIV-----SLVEESSIIESGAPIPPIPGDGSGFPITTSS------TTTP 137

Query: 1930 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1989
             + + T                   TT +  VS +   +   ++ +    P+    +   
Sbjct: 138  AILDVT------------------PTTRTVHVSRTQYNNPLFTDPSVLQPPQPAEVSGHV 179

Query: 1990 LVSEST--TTSSPESESTTTISPVSEST--TTSSPVSESTTTISP 2030
            LVS  T  T S  E    T       +    +S+P+         
Sbjct: 180  LVSGQTIGTHSYEEIPMDTFAVSEGTTPPPISSTPIPGVRRVARL 224



 Score = 33.4 bits (77), Expect = 1.6
 Identities = 22/116 (18%), Positives = 33/116 (28%), Gaps = 3/116 (2%)

Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTT 1936
                 + +  L     + +       +     +P      T S +     T+   S S  
Sbjct: 330  APAEEIELQPLGEHSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTEDFSTSQL 389

Query: 1937 TSSPESESTT-TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
             SS    STT T+ P S +               PES  TT   P      T  + 
Sbjct: 390  VSSSSRTSTTNTTIPLSSTPDVPVYYGPDIV--LPESPGTTPIVPVPPDLPTVIIH 443



 Score = 32.3 bits (74), Expect = 4.2
 Identities = 22/119 (18%), Positives = 34/119 (28%), Gaps = 13/119 (10%)

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
            S         + P  E +  +SP  +         +         E  T S       TT
Sbjct: 327  SPIAPAEEIELQPLGEHSGDTSPVEDGLYDIYADPDPLDV-----ELDTYSDDLLLDETT 381

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTT--------SSPESESTTTSSPASESTTIE 2128
               ++    +SS  + +T T+ P S +             PES  TT   P        
Sbjct: 382  EDFSTSQLVSSSSRTSTTNTTIPLSSTPDVPVYYGPDIVLPESPGTTPIVPVPPDLPTV 440


>gnl|CDD|219594 pfam07816, DUF1645, Protein of unknown function (DUF1645).  These
            sequences are derived from a number of hypothetical plant
            proteins. The region in question is approximately 270
            amino acids long. Some members of this family are
            annotated as yeast pheromone receptor proteins AR781 but
            no literature was found to support this.
          Length = 191

 Score = 39.1 bits (91), Expect = 0.013
 Identities = 26/134 (19%), Positives = 51/134 (38%), Gaps = 23/134 (17%)

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASEST--TTNNPKSESTTTNNPASESIT 2066
            SP SES    + +  +  ++SPE    ++ S +++        P S   +++  +S   +
Sbjct: 36   SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGELEGVPPSSYCVSSSPASSSRKS 95

Query: 2067 SSSPASE---------------------STTTSSPASESTTTSSPASESTTTSSPASEST 2105
            SS+ +S+                           P  + +  SSPAS     S+ + ES+
Sbjct: 96   SSTGSSKRWRLSDLLLFRSASDGKDAFVFDAAKDPLLKYSPLSSPASPVKPASAKSRESS 155

Query: 2106 TTSSPESESTTTSS 2119
             +       T  S+
Sbjct: 156  ASKGKRRGKTVASA 169



 Score = 36.0 bits (83), Expect = 0.13
 Identities = 31/156 (19%), Positives = 56/156 (35%), Gaps = 19/156 (12%)

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
            SP SES    + +  +  + SPE    ++ S  ++                P S   ++S
Sbjct: 36   SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGELEGV--------PPSSYCVSSS 87

Query: 2039 SPASESTTTNNPKSE-----------STTTNNPASESITSSSPASESTTTSSPASESTTT 2087
              +S   +++   S+           S +    A     +  P  + +  SSPAS     
Sbjct: 88   PASSSRKSSSTGSSKRWRLSDLLLFRSASDGKDAFVFDAAKDPLLKYSPLSSPASPVKPA 147

Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
            S+ + ES+ +       T  S+ E    T  + A E
Sbjct: 148  SAKSRESSASKGKRRGKTVASAHELLYATNRAAAEE 183



 Score = 33.3 bits (76), Expect = 0.85
 Identities = 31/139 (22%), Positives = 48/139 (34%), Gaps = 23/139 (16%)

Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
            SP SES    + +  +  + SPE    ++ S  ++         E    SS    S+  S
Sbjct: 36   SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGE-----LEGVPPSSYCVSSSPAS 90

Query: 1979 SPESESTTTSS-------LV---SES--------TTTSSPESESTTTISPVSESTTTSSP 2020
            S    S+T SS       L+   S S             P  + +   SP S     S+ 
Sbjct: 91   SSRKSSSTGSSKRWRLSDLLLFRSASDGKDAFVFDAAKDPLLKYSPLSSPASPVKPASAK 150

Query: 2021 VSESTTTISPESESTTTSS 2039
              ES+ +       T  S+
Sbjct: 151  SRESSASKGKRRGKTVASA 169



 Score = 32.5 bits (74), Expect = 1.9
 Identities = 25/134 (18%), Positives = 44/134 (32%), Gaps = 13/134 (9%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSEST--TTSSPESESTTTSSPESESTT 1956
            SP SES         +  + SPE    ++ S  ++        P S   ++S   S   +
Sbjct: 36   SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGELEGVPPSSYCVSSSPASSSRKS 95

Query: 1957 TSS-----------LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
            +S+           L+  S +             P  + +  SS  S     S+   ES+
Sbjct: 96   SSTGSSKRWRLSDLLLFRSASDGKDAFVFDAAKDPLLKYSPLSSPASPVKPASAKSRESS 155

Query: 2006 TTISPVSESTTTSS 2019
             +       T  S+
Sbjct: 156  ASKGKRRGKTVASA 169



 Score = 31.3 bits (71), Expect = 4.1
 Identities = 31/142 (21%), Positives = 55/142 (38%), Gaps = 9/142 (6%)

Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES-TTTSSPESESTTT 2007
            SP SES    + +  +  + SPE    ++ S  ++      +   S   +SSP S     
Sbjct: 36   SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGELEGVPPSSYCVSSSPAS----- 90

Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
             S    S+T SS     +  +   S S      A       +P  + +  ++PAS    +
Sbjct: 91   -SSRKSSSTGSSKRWRLSDLLLFRSAS--DGKDAFVFDAAKDPLLKYSPLSSPASPVKPA 147

Query: 2068 SSPASESTTTSSPASESTTTSS 2089
            S+ + ES+ +       T  S+
Sbjct: 148  SAKSRESSASKGKRRGKTVASA 169


>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1.  Mating in
            fungi is controlled by the loci that determine the mating
            type of an individual, and only individuals with
            differing mating types can mate. Basidiomycete fungi have
            evolved a unique mating system, termed tetrapolar or
            bifactorial incompatibility, in which mating type is
            determined by two unlinked loci; compatibility at both
            loci is required for mating to occur. The multi-allelic
            tetrapolar mating system is considered to be a novel
            innovation that could have only evolved once, and is thus
            unique to the mushroom fungi. This domain is C-terminal
            to the homeodomain transcription factor region.
          Length = 418

 Score = 39.8 bits (93), Expect = 0.016
 Identities = 52/298 (17%), Positives = 97/298 (32%), Gaps = 38/298 (12%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
            SPE       +P   S    SP   S    S V  S  T      S+ +   +     + 
Sbjct: 73   SPER------SPALSSERLLSPSP-SVLDLSPVLASPQTGKRRRSSSPSDDEDEAERPSK 125

Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSES-----TTTSSPESESTTTISPVSE 2013
               S+S ++SS  ++      P   ++T   L   S     T + SP    T T +P  +
Sbjct: 126  RPRSDSISSSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDT-APSGK 184

Query: 2014 STTTSSPVSESTTTISPESES-TTTSS---PASESTTTNNPKSESTTTNNPASESITSSS 2069
                 S   +      P++ S   T S   P   +T  +     + +++     +     
Sbjct: 185  RKRRLSDGFQLPAPKRPQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLLLTGDIPP 244

Query: 2070 PASESTTTSS----------PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
            P S      S          P        +         +P + S+++S+    + T+SS
Sbjct: 245  PVSVFAPDDSTPLDISLFNFPLIPLLPPEALDL-----PAPTAVSSSSSTFAVPALTSSS 299

Query: 2120 PASESTTIEEQ----GVSPHSEKL-SANEDPEE-FPNEDVFEHTFAEIPNIDHSNQTD 2171
                +T +++     G + +SE L   N+      P+           P    ++ + 
Sbjct: 300  VDQSATPLDQGFSNFGSNMYSEPLNPTNDSLLYGLPSSSSLYANRTIFPAWASTSVSP 357



 Score = 39.0 bits (91), Expect = 0.032
 Identities = 49/266 (18%), Positives = 93/266 (34%), Gaps = 46/266 (17%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT--- 1956
            P S+S ++++  ++      P   ++T   L SE++    P    +   +P   + +   
Sbjct: 127  PRSDSISSSSSPAKPPEACLPSPAASTQDEL-SEASAAPLPTPSLSPPHTPTDTAPSGKR 185

Query: 1957 --TSSLVSESTTTSSPESES--TTTSSPESESTTTSSLVSESTT-TSSPESESTTTIS-P 2010
                S   +      P++ S   T S P     TT        T +SSP    T  I  P
Sbjct: 186  KRRLSDGFQLPAPKRPQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLLLTGDIPPP 245

Query: 2011 VS-------------------------ESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
            VS                         E+    +P + S+++ +    + T+SS    +T
Sbjct: 246  VSVFAPDDSTPLDISLFNFPLIPLLPPEALDLPAPTAVSSSSSTFAVPALTSSSVDQSAT 305

Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTT---TSSPASEST------TTSSPASESTT 2096
              +   S   +  N  SE +  ++ +        +S  A+ +       T+ SP   ST 
Sbjct: 306  PLDQGFSNFGS--NMYSEPLNPTNDSLLYGLPSSSSLYANRTIFPAWASTSVSPLDFSTL 363

Query: 2097 TSSPASESTTTSSPESESTTTSSPAS 2122
             + P+     + S  + +  TS    
Sbjct: 364  FNQPSPSPMASQSILAPAQPTSPSPV 389



 Score = 36.3 bits (84), Expect = 0.18
 Identities = 42/263 (15%), Positives = 83/263 (31%), Gaps = 28/263 (10%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT-----TSSLVS 1932
            S S+        L S   +T    SE++    P    +   +P   + +       S   
Sbjct: 134  SSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDTAPSGKRKRRLSDGF 193

Query: 1933 ESTTTSSPESES----------TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1982
            +      P++ S             ++        +   S S   +       +  +P+ 
Sbjct: 194  QLPAPKRPQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLLLTGDIPPPVSVFAPDD 253

Query: 1983 ESTTTSSL----VSESTTTSSPESESTTTISPVSES----TTTSSPVSESTTTISPESES 2034
             +    SL    +       + +  + T +S  S +      TSS V +S T +     S
Sbjct: 254  STPLDISLFNFPLIPLLPPEALDLPAPTAVSSSSSTFAVPALTSSSVDQSATPLDQ-GFS 312

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
               S+  SE     N        +  +S     +   + ++T+ SP   ST  + P+   
Sbjct: 313  NFGSNMYSEPLNPTNDSLLYGLPS-SSSLYANRTIFPAWASTSVSPLDFSTLFNQPSPSP 371

Query: 2095 TTTSS---PASESTTTSSPESES 2114
              + S   PA  ++ +      S
Sbjct: 372  MASQSILAPAQPTSPSPVALPSS 394


>gnl|CDD|221391 pfam12042, RP1-2, Tubuliform egg casing silk strands structural
            domain.  Spiders use fibroins to make silk strands. This
            family includes tubuliform silk fibroins which are used
            to protect egg cases. This domain is a structural domain
            which is found in repeats of up to 20 in many individuals
            (although this is not necessarily the case). RP1 makes up
            structural domains in the N terminal while RP2 makes up
            structural domains in the C terminal.
          Length = 167

 Score = 38.3 bits (89), Expect = 0.017
 Identities = 34/166 (20%), Positives = 75/166 (45%), Gaps = 12/166 (7%)

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES--ESTTTIS 2009
            + S   SS  S S+ ++  +S S+  +S    S+  SS  S S   S   +  +S     
Sbjct: 2    AASQAASSASSSSSASAFAQSLSSALASSSQFSSAFSSATSASAAGSLAYALGQSAARSL 61

Query: 2010 PVSESTTTSSPVSESTTTI----SPESESTTTSSPASESTTTN------NPKSESTTTNN 2059
             +S ++  +S V+++ +++    S  + +   S+   +           N  S +++  +
Sbjct: 62   GLSNASALASAVAQAVSSVGVGASASAYANAISNAIGQFLAGQGVLNASNASSLASSFAS 121

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
              S S  SS+  + S + ++ A++S   +S  S++ + SS  S ++
Sbjct: 122  ALSASAASSAAQAASASAAAAAAQSQAAASAFSQAASQSSSQSAAS 167


>gnl|CDD|221145 pfam11596, DUF3246, Protein of unknown function (DUF3246).  This is a
            small family of fungal proteins one of whose members from
            Pichia stipitis is described as being an extremely serine
            rich protein-mucin-like protein.
          Length = 208

 Score = 38.6 bits (89), Expect = 0.018
 Identities = 43/213 (20%), Positives = 80/213 (37%), Gaps = 12/213 (5%)

Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
            T   + S   TT    + + T+    S  +TT+   +   T S+ + +       + E  
Sbjct: 1    TVDPITSNDITTIGSSTVTITSGGSGSSVSTTAGSSTILPTGSATDDDDYDDEETDCEGQ 60

Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE-STTTISPESESTTTSSPASES 2044
            TT++     T T+ P   ++ T+ P   +TT     +    TTIS  +  TT +      
Sbjct: 61   TTAN--PTGTVTTDPTGTTSQTVVPTKPTTTDDDDDTTCVETTISDPTTITTPTG----- 113

Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP---ASESTTTSSPA 2101
             T N   + + TTN  A+ ++ ++      T T +  + +T  +       E+TT ++  
Sbjct: 114  -TVNGNPTGTVTTNGTATTTVITTVEGVAVTYTGTGQTFTTDGTEDDEDCDETTTYTTTY 172

Query: 2102 SESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
                TT        T       + T+    V  
Sbjct: 173  YTPYTTVIHGGTVYTNGVTVIATHTVYPTDVED 205



 Score = 35.1 bits (80), Expect = 0.28
 Identities = 30/162 (18%), Positives = 58/162 (35%), Gaps = 4/162 (2%)

Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
              + S+ ++ T ++   ++      + E  TT NP    TT  +  +  T   +  + + 
Sbjct: 31   TTAGSSTILPTGSATDDDDYDDEETDCEGQTTANPTGTVTTDPTGTTSQTVVPTKPTTTD 90

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
                     TT S P + +T T ++    T T +    +TTT     E    +   +  T
Sbjct: 91   DDDDTTCVETTISDPTTITTPTGTVNGNPTGTVTTNGTATTTVITTVEGVAVTYTGTGQT 150

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTT 2037
             T+    +        + +TT  +P     TT+       T 
Sbjct: 151  FTTDGTEDDEDCDETTTYTTTYYTP----YTTVIHGGTVYTN 188



 Score = 35.1 bits (80), Expect = 0.28
 Identities = 40/202 (19%), Positives = 83/202 (41%), Gaps = 14/202 (6%)

Query: 1886 TLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1945
            T++ + S + TT    + + T+    S  +TT+   +   T S+   +       + E  
Sbjct: 1    TVDPITSNDITTIGSSTVTITSGGSGSSVSTTAGSSTILPTGSATDDDDYDDEETDCEGQ 60

Query: 1946 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
            TT++P    T T +     TT+ +      TT+  + ++T   + +S+ TT ++P    T
Sbjct: 61   TTANP----TGTVTTDPTGTTSQTVVPTKPTTTDDDDDTTCVETTISDPTTITTP----T 112

Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
             T++     T T++  + +TT I+       T +   ++ TT+  + +         E+ 
Sbjct: 113  GTVNGNPTGTVTTNGTA-TTTVITTVEGVAVTYTGTGQTFTTDGTEDDED-----CDETT 166

Query: 2066 TSSSPASESTTTSSPASESTTT 2087
            T ++      TT        T 
Sbjct: 167  TYTTTYYTPYTTVIHGGTVYTN 188



 Score = 30.5 bits (68), Expect = 7.7
 Identities = 38/184 (20%), Positives = 66/184 (35%), Gaps = 15/184 (8%)

Query: 1879 ESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTS 1938
             STV +++  S  S +TT  S     T +   + +     +     TT +      T T+
Sbjct: 15   SSTVTITSGGSGSSVSTTAGSSTILPTGSATDDDDYDDEETDCEGQTTANPT---GTVTT 71

Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
             P   ++ T  P   +TT     +    T+  +  + TT +       T ++ +  T T 
Sbjct: 72   DPTGTTSQTVVPTKPTTTDDDDDTTCVETTISDPTTITTPTGTVNGNPTGTVTTNGTAT- 130

Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTTISPES-----ESTTTSSPASESTTTNNPKSE 2053
                  TT I+ V     T +   ++ TT   E      E+TT ++      TT      
Sbjct: 131  ------TTVITTVEGVAVTYTGTGQTFTTDGTEDDEDCDETTTYTTTYYTPYTTVIHGGT 184

Query: 2054 STTT 2057
              T 
Sbjct: 185  VYTN 188


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is a
            positive regulator of Vps4. Vps4 is an ATPase that is
            required in the multivesicular body (MVB) sorting pathway
            to dissociate the endosomal sorting complex required for
            transport (ESCRT). Vta1 promotes correct assembly of Vps4
            and stimulates its ATPase activity through its conserved
            Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 39.3 bits (92), Expect = 0.021
 Identities = 19/132 (14%), Positives = 44/132 (33%), Gaps = 8/132 (6%)

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
            E E    ++  S+++        ++ + S    S+              P S S ++  P
Sbjct: 157  EDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPE--DPSSPSDSSLPP 214

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
            A  S  ++ P     +  NP      S  P   +            +++  +  + +++P
Sbjct: 215  APSSFQSDTPPPSPESPTNP------SPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATP 268

Query: 2101 ASESTTTSSPES 2112
            A     T   ++
Sbjct: 269  APIGGITLDDDA 280


>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
          Length = 420

 Score = 39.4 bits (92), Expect = 0.024
 Identities = 22/85 (25%), Positives = 31/85 (36%), Gaps = 3/85 (3%)

Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
                   S S  T S  SES    + +S +T   S    S T S+    S +T    +  
Sbjct: 321  NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378

Query: 1975 TTTSSPESESTTTSSLVSESTTTSS 1999
             + S   S +   SS    S+  SS
Sbjct: 379  GSQSGGGS-TYGGSSTFDGSSRGSS 402



 Score = 38.7 bits (90), Expect = 0.039
 Identities = 22/89 (24%), Positives = 33/89 (37%), Gaps = 4/89 (4%)

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
            S S  T S  SES    + +S +T   S    S T S+    S +T    +   + S   
Sbjct: 328  SRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSG-- 383

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSP 2020
              ST   S   + ++  S  S   +   P
Sbjct: 384  GGSTYGGSSTFDGSSRGSSDSFGVSYFGP 412



 Score = 37.5 bits (87), Expect = 0.075
 Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 6/105 (5%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
                   S S  T +  SES    + +S +T   S    S T S+    S +T    +  
Sbjct: 321  NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
             + S     ST   S   + ++  S  S+S   S    + T   S
Sbjct: 379  GSQSG--GGSTYGGSSTFDGSSRGS--SDSFGVSYFGPQQTVGFS 419



 Score = 37.1 bits (86), Expect = 0.11
 Identities = 23/87 (26%), Positives = 33/87 (37%), Gaps = 5/87 (5%)

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
            S S  T   +SESI   +  S +T   S    S T S+    S +T   ++   + S   
Sbjct: 328  SRSGRTG--SSESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSGGG 385

Query: 2112 SESTTTSSPASESTTIEEQ--GVSPHS 2136
            S +   SS    S+       GVS   
Sbjct: 386  S-TYGGSSTFDGSSRGSSDSFGVSYFG 411



 Score = 37.1 bits (86), Expect = 0.12
 Identities = 21/98 (21%), Positives = 33/98 (33%), Gaps = 4/98 (4%)

Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
                   S S  T S  SES    + +S +T   S    S T S+    S +T    +  
Sbjct: 321  NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
             +       ST   S   + ++  S +S   +   P  
Sbjct: 379  GSQSG--GGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQ 414



 Score = 37.1 bits (86), Expect = 0.13
 Identities = 21/91 (23%), Positives = 34/91 (37%), Gaps = 4/91 (4%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
            S +  T S ES    T   +S +T   S    S T S+    S +T    +   + S   
Sbjct: 328  SRSGRTGSSESIRGFTY--DSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSG-- 383

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPES 1982
              ST   S   + ++  S +S   +   P+ 
Sbjct: 384  GGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQ 414



 Score = 36.0 bits (83), Expect = 0.25
 Identities = 22/91 (24%), Positives = 34/91 (37%), Gaps = 4/91 (4%)

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
            S S  T S  SES    + +S +T   S    S T S+    S +T    +   + S   
Sbjct: 328  SRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSG-- 383

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKS 2052
              ST   S   + ++  S  S   +   P+ 
Sbjct: 384  GGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQ 414



 Score = 35.6 bits (82), Expect = 0.36
 Identities = 21/85 (24%), Positives = 29/85 (34%), Gaps = 3/85 (3%)

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
                   S S  T S  SES    +  S +T   S    S T S+    S +T    +  
Sbjct: 321  NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378

Query: 1995 TTTSSPESESTTTISPVSESTTTSS 2019
             + S   S +    S    S+  SS
Sbjct: 379  GSQSGGGS-TYGGSSTFDGSSRGSS 402



 Score = 34.8 bits (80), Expect = 0.55
 Identities = 19/85 (22%), Positives = 29/85 (34%), Gaps = 3/85 (3%)

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
                   S S  T + +S    T    S +    S    S T S+    S +T   ++  
Sbjct: 321  NYGGQFNSRSGRTGSSESIRGFTY--DSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378

Query: 2095 TTTSSPASESTTTSSPESESTTTSS 2119
             + S   S +   SS    S+  SS
Sbjct: 379  GSQSGGGS-TYGGSSTFDGSSRGSS 402



 Score = 32.9 bits (75), Expect = 2.2
 Identities = 26/88 (29%), Positives = 35/88 (39%), Gaps = 13/88 (14%)

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
            S S  T S  SES    + +S +T   S    S T      +ST+T    S S   SS  
Sbjct: 328  SRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQT------DSTSTYG--SRSTFDSS-- 375

Query: 2072 SESTTTSSPASESTTTSSPASESTTTSS 2099
            +   + S   S +   SS    S+  SS
Sbjct: 376  TGGGSQSGGGS-TYGGSSTFDGSSRGSS 402



 Score = 32.9 bits (75), Expect = 2.2
 Identities = 28/98 (28%), Positives = 38/98 (38%), Gaps = 6/98 (6%)

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
            S S  T S ES    T    S +T   S    S T    +S ST  S    +S+T    +
Sbjct: 328  SRSGRTGSSESIRGFTYD--SSTTYGGSSYGTSQT----DSTSTYGSRSTFDSSTGGGSQ 381

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
            S   +T   +S    SS  +S+S   S    + T   S
Sbjct: 382  SGGGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQTVGFS 419



 Score = 32.9 bits (75), Expect = 2.5
 Identities = 21/95 (22%), Positives = 31/95 (32%), Gaps = 2/95 (2%)

Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSES 2054
                   S S  T S  SES    +  S +T   S    S T S+    S +T +  +  
Sbjct: 321  NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378

Query: 2055 TTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
             + +   S    SS+    S  +S     S     
Sbjct: 379  GSQSGGGSTYGGSSTFDGSSRGSSDSFGVSYFGPQ 413


>gnl|CDD|221734 pfam12722, Hid1, High-temperature-induced dauer-formation protein.
            Hid1 (high-temperature-induced dauer-formation protein 1)
            represents proteins of approximately 800 residues long
            and is conserved from fungi to humans. It contains up to
            seven potential transmembrane domains separated by
            regions of low complexity. Functionally it might be
            involved in vesicle secretion or be an inter-cellular
            signalling protein or be a novel insulin receptor.
          Length = 813

 Score = 39.7 bits (93), Expect = 0.025
 Identities = 24/143 (16%), Positives = 48/143 (33%), Gaps = 14/143 (9%)

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
                  SS E +  +  S     +   S +++S          + + S V E   +++  
Sbjct: 572  RNLILDSSQEEDERSNQSASGSLSDNPSNDNDS---------RSPSLSEVPEENKSLAIT 622

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
             +      PAS   +T+   +  +  + P      S     ++   S   S   + + P 
Sbjct: 623  DDF----DPASRENSTSEAAAPPSVNSVPLQLQGPSEKDRGKNPAGSLAFSRLNSATRPK 678

Query: 2092 SESTTTSSPASESTTTSSPESES 2114
              S  +S  + E    +S   ES
Sbjct: 679  WPSGLSSK-SKEKFPPTSDWVES 700


>gnl|CDD|144541 pfam00985, MSA_2, Merozoite Surface Antigen 2 (MSA-2) family. 
          Length = 171

 Score = 37.6 bits (86), Expect = 0.033
 Identities = 26/142 (18%), Positives = 53/142 (37%), Gaps = 8/142 (5%)

Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
            + +TTT N    ST+T++       + +         SP +++   +   S     S   
Sbjct: 1    TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59

Query: 2102 SESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEI 2161
            S    T   +++S T     +E++       +P +E+  + E      N+   +H     
Sbjct: 60   SNVPETQDADTKSPTAQPEQAENS-------APTAEQTESPELQSAPENKGTGQHGHMHG 112

Query: 2162 PNIDHSNQTDEAIPETFDAREE 2183
               +H   T ++  E  D  +E
Sbjct: 113  SRNNHPQNTSDSQKECTDGNKE 134



 Score = 36.4 bits (83), Expect = 0.080
 Identities = 18/93 (19%), Positives = 44/93 (47%), Gaps = 2/93 (2%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
            +  TTTN  E+ ++T++   + +   ++P+ E    S   +++   +   S     S  +
Sbjct: 2    TTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP--NQANKETQNNSNVQQDSQTK 59

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
            S    T    ++S T    ++E++  ++ ++ES
Sbjct: 60   SNVPETQDADTKSPTAQPEQAENSAPTAEQTES 92



 Score = 36.4 bits (83), Expect = 0.088
 Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 3/105 (2%)

Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1961
            + +TTTN+ E+ ++T+S   + +   ++   E    S    ++   +   S     S   
Sbjct: 2    TTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSPN--QANKETQNNSNVQQDSQTK 59

Query: 1962 SESTTTSSPESESTTTSSPESE-STTTSSLVSESTTTSSPESEST 2005
            S    T   +++S T    ++E S  T+         S+PE++ T
Sbjct: 60   SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104



 Score = 36.0 bits (82), Expect = 0.11
 Identities = 24/123 (19%), Positives = 47/123 (38%), Gaps = 1/123 (0%)

Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
            + +TTT++    ST+TSS         +         SP +++   +   S     S  +
Sbjct: 1    TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59

Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
            S    T    ++S T     +E++   + ++ES    S      T  +     +  N+P 
Sbjct: 60   SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGTGQHGHMHGSRNNHPQ 119

Query: 2062 SES 2064
            + S
Sbjct: 120  NTS 122



 Score = 34.9 bits (79), Expect = 0.26
 Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 2/105 (1%)

Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
            + +TTT++    ST+TSS         +         SP +++   +   S     S  +
Sbjct: 1    TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS-PVSEST 2015
            S    T   +++S T     +E++  ++ ++ES    S P ++ T
Sbjct: 60   SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104



 Score = 34.1 bits (77), Expect = 0.46
 Identities = 28/113 (24%), Positives = 51/113 (45%), Gaps = 13/113 (11%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
            TT N++E++   S+ N   +   T    E E  + N    E+   S+ + +S T S++  
Sbjct: 5    TTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSPNQANKETQNNSNVQQDSQTKSNV-- 62

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
                   PE++   T SP ++     +    +  T SPE +    S+PE++ T
Sbjct: 63   -------PETQDADTKSPTAQPEQAENSAPTAEQTESPELQ----SAPENKGT 104



 Score = 33.3 bits (75), Expect = 0.93
 Identities = 21/105 (20%), Positives = 45/105 (42%), Gaps = 2/105 (1%)

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
            + +TTT++    ST+T S         +         SP +++   +   S     +  K
Sbjct: 1    TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59

Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASEST-TTSSPASEST 2095
            S    T +  ++S T+    +E++  ++  +ES    S+P ++ T
Sbjct: 60   SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104



 Score = 32.6 bits (73), Expect = 1.7
 Identities = 22/105 (20%), Positives = 42/105 (40%), Gaps = 2/105 (1%)

Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
            + +TTT++    ST+TSS         +         SP +++       S     S   
Sbjct: 1    TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59

Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASEST-TTSSPASEST 2085
            S    T +  ++S T     +E+   ++  +ES    S+P ++ T
Sbjct: 60   SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104



 Score = 31.8 bits (71), Expect = 2.7
 Identities = 21/105 (20%), Positives = 43/105 (40%), Gaps = 2/105 (1%)

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
            + +TTT++    ST+TSS         +         S  +++   +   S     S  +
Sbjct: 1    TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQS-PNQANKETQNNSNVQQDSQTK 59

Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSEST-TTSSPVSEST 2025
            S    T    ++S T    ++E++   +  +ES    S+P ++ T
Sbjct: 60   SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104



 Score = 31.0 bits (69), Expect = 4.8
 Identities = 25/106 (23%), Positives = 48/106 (45%), Gaps = 4/106 (3%)

Query: 2012 SESTTTSSPVSESTTTISP-ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
            + +TTT++    ST+T S   + +   ++P  E    +  ++   T NN  S     S  
Sbjct: 1    TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSPNQANKETQNN--SNVQQDSQT 58

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASEST-TTSSPESEST 2115
             S    T    ++S T     +E++  ++  +ES    S+PE++ T
Sbjct: 59   KSNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104


>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease.  The Slx4 protein is a
            heteromeric structure-specific endonuclease found in
            fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
            substrates, particularly simple-Y, 5'-flap, or
            replication fork structures by cleaving the strand
            bearing the 5' non-homologous arm at the branch junction
            and thus generating ligatable nicked products from
            5'-flap or replication fork substrates.
          Length = 627

 Score = 39.2 bits (91), Expect = 0.034
 Identities = 53/288 (18%), Positives = 90/288 (31%), Gaps = 18/288 (6%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTT----TSSLVSESTTTSSPESESTTTSSP 1950
            + +  P     T  + +          +   +    T S   E  +    ES+S   S P
Sbjct: 197  SASQLPPDTELTDEDLQWLYDLDDEQMANDNSPLVMTLSQTMEDQSAIEKESDSYIDSEP 256

Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
             S  T       +   +      S   SS +  ST   S +       +  S+S + IS 
Sbjct: 257  NSSITEPYDHDIQVKNSEPEFKPSNEISSHQVNSTDNESSIISFPLHIADSSDSVSEISL 316

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS-- 2068
                T  S P S  +T   P       S+P     ++   K    + +   S S  S+  
Sbjct: 317  ----TEPSRPQSIDSTIEPPIEIPRKMSTPFFTPRSSILDKHIELSQD---SFSAVSTAT 369

Query: 2069 SPASESTTTS-----SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
            SP   S+              T+TS P  +S T +    +   TS  E  S        E
Sbjct: 370  SPFKVSSAQIINSDGDVPLTRTSTSIPTRQSGTAAYKKRKKLNTSRYEISSKLRVKDYQE 429

Query: 2124 STTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTD 2171
              T  +  +     K    ++  E  + +  + +   I  I  ++   
Sbjct: 430  DKTNNKAKLLKEETKRLPVDNLNEIADSESDDDSSLSIIEIVDTSVLQ 477


>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27.  This protein forms
            the C subunit of DNA polymerase delta. It carries the
            essential residues for binding to the Pol1 subunit of
            polymerase alpha, from residues 293-332, which are
            characterized by the motif D--G--VT, referred to as the
            DPIM motif. The first 160 residues of the protein form
            the minimal domain for binding to the B subunit, Cdc1, of
            polymerase delta, the final 10 C-terminal residues,
            362-372, being the DNA sliding clamp, PCNA, binding
            motif.
          Length = 427

 Score = 38.7 bits (90), Expect = 0.036
 Identities = 30/205 (14%), Positives = 56/205 (27%), Gaps = 20/205 (9%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
                  +  +PE +  +    +  S  T++   E T   +    ++   +P  +S   SS
Sbjct: 164  SSKPPKSIMSPEVKVKSAKKTQDTSKETTT---EKTEGKTSVKAASLKRNPPKKSNIMSS 220

Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
               + T     + E++ ++  E ES        ES        + +   + + E      
Sbjct: 221  FFKKKTKEKKEKKEASESTVKE-ESE------EESGKRDVILEDESAEPTGLDEDEDEDE 273

Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT-TS 2078
            P      + S E                   + E           I   SP  E  +   
Sbjct: 274  PKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDED------EEMEIVPESPVEEEESEEP 327

Query: 2079 SPASESTTTSSPASESTTTSSPASE 2103
             P            +   T SP   
Sbjct: 328  EPPPLPKK---EEEKEEVTVSPDGG 349



 Score = 37.5 bits (87), Expect = 0.076
 Identities = 28/155 (18%), Positives = 53/155 (34%), Gaps = 13/155 (8%)

Query: 1998 SSPESESTTT--ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
            ++P  +  T   + PV+ + + +   + +    S      +  SP  +  +    +  S 
Sbjct: 131  TNPNVKRRTGVGLPPVAPAASPALKPTANGKRPS-SKPPKSIMSPEVKVKSAKKTQDTSK 189

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP-ESES 2114
             T    +E  TS   AS      +P  +S   SS   + T       E++ ++   ESE 
Sbjct: 190  ETTTEKTEGKTSVKAAS---LKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEE 246

Query: 2115 TTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
             +                S     L  +ED +E  
Sbjct: 247  ESGKRDVILEDE------SAEPTGLDEDEDEDEPK 275



 Score = 37.5 bits (87), Expect = 0.085
 Identities = 28/151 (18%), Positives = 50/151 (33%), Gaps = 6/151 (3%)

Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
              P + + +     + +    S      + +SPE +  +       S  T   K+E  T+
Sbjct: 143  LPPVAPAASPALKPTANGKRPS-SKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTS 201

Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
               AS      +P  +S   SS   + T       E++ ++    ES   S         
Sbjct: 202  VKAAS---LKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKE-ESEEESGKRDVILED 257

Query: 2118 SSPASESTTIEEQGVSPH-SEKLSANEDPEE 2147
             S        +E    P  S + S +E+  E
Sbjct: 258  ESAEPTGLDEDEDEDEPKPSGERSDSEEETE 288


>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).  This
            family consists of several MGC-24 (or Cd164 antigen)
            proteins from eukaryotic organisms. MGC-24/CD164 is a
            sialomucin expressed in many normal and cancerous
            tissues. In humans, soluble and transmembrane forms of
            MGC-24 are produced by alternative splicing.
          Length = 187

 Score = 37.7 bits (87), Expect = 0.038
 Identities = 31/151 (20%), Positives = 49/151 (32%), Gaps = 23/151 (15%)

Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPES------------ESTTT 2037
            L + S     P       I        TS  V     T  PE              +   
Sbjct: 18   LAAGSNWAQLPNVTKGARIFG-----RTSLLVLNVWLTTYPEGCEHLNSCVSCVNRTHNN 72

Query: 2038 SSPASESTTTNNPK---SESTTTNNPASESITSSSPASESTT---TSSPASESTTTSSPA 2091
            S+   +      P    S++    +      T+ S +  +TT   T+S A  + T S   
Sbjct: 73   STCVWQQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTT 132

Query: 2092 SESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            S    TS P + +T T + + +  +T   AS
Sbjct: 133  SHHHVTSEPKTNTTVTPTSQPDRKSTFDAAS 163



 Score = 36.9 bits (85), Expect = 0.060
 Identities = 36/151 (23%), Positives = 56/151 (37%), Gaps = 13/151 (8%)

Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES--ESTTTISPVSESTTT 2017
            L + S     P                TS LV     T+ PE      + +S V+ +   
Sbjct: 18   LAAGSNWAQLPNVTKGARIFG-----RTSLLVLNVWLTTYPEGCEHLNSCVSCVNRTHNN 72

Query: 2018 SSPVSESTTTISPE---SESTTTSSPASESTTTNNPKSESTT---TNNPASESITSSSPA 2071
            S+ V +      P    S++    S      TT++    +TT   TN+ A  +IT S   
Sbjct: 73   STCVWQQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTT 132

Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPAS 2102
            S    TS P + +T T +   +  +T   AS
Sbjct: 133  SHHHVTSEPKTNTTVTPTSQPDRKSTFDAAS 163



 Score = 31.1 bits (70), Expect = 4.5
 Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 15/84 (17%)

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
            S++    S      TT S    S  T++P   ++T    ++ S TTS        TS P+
Sbjct: 90   SQAEVVKSGCQIYNTTDSC---SVATTTPVPTNSTAKPTITPSPTTSH----HHVTSEPK 142

Query: 1982 SESTTTSSLVSESTTTSSPESEST 2005
            + +T T         TS P+ +ST
Sbjct: 143  TNTTVTP--------TSQPDRKST 158


>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
          Length = 496

 Score = 38.7 bits (90), Expect = 0.039
 Identities = 18/93 (19%), Positives = 33/93 (35%), Gaps = 8/93 (8%)

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
                SI  S P   ST  ++       +     +  T+S P +++T   + +++      
Sbjct: 40   SLHPSIKRSRPGRCSTNGAAVPESPKPSRRGRKKKPTSSPPKAKTTRRRTKKTDQELDPE 99

Query: 2120 PASESTTIEEQGVSPHSEKLSANEDPEEFPNED 2152
             A E     E G           +D  +FP +D
Sbjct: 100  GAEEDQEAAEDG--------EDYDDGIDFPYDD 124


>gnl|CDD|234371 TIGR03839, termin_org_P1, adhesin P1.  Members of this protein family
            are the major adhesin of the Mycoplasma terminal
            organelle. The protein is called adhesin P1, cytadhesin
            P1, P140, attachment protein, and MgPa, with locus names
            MG191 in Mycoplasma genitalium and MPN141 in M.
            pneumoniae. A conserved C-terminal region is shared by
            additional paralogs in M. pneumoniae and M.
            gallisepticum, as well as by the member of this family
            [Cell envelope, Surface structures, Cellular processes,
            Pathogenesis].
          Length = 1425

 Score = 38.6 bits (89), Expect = 0.048
 Identities = 40/256 (15%), Positives = 76/256 (29%), Gaps = 15/256 (5%)

Query: 1819 PGAEFLIQCQYCDFDSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTN-NN 1877
            PGA  L++ +      +    S           ++ A+T  AI   D Y   ++  +   
Sbjct: 69   PGALVLVRSK-SAKGITAGSGSQQTTYPTRTEAALTASTTFAIRRYDLYGRALYDFDPGK 127

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTT----TSSPESESTTTSSLVSE 1933
                     L   +  N  T    S     N  E +S      T  PE  +     LV +
Sbjct: 128  LNPQTPTRDLTGKVGFNPFTGFGLSGDAPFNWNELKSKVPVEVTQDPEDPNVFYVLLVPD 187

Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
            +                E +   +           +  S++  T    +E T +S+    
Sbjct: 188  AAVQYEQLQRGLQEQKTEDQVFESYFGAMFGLKVKNAMSDAPKTGEKLAEGTASSAGSGS 247

Query: 1994 STTTSSPESESTTTISPV---------SESTTTSSPVSESTTTISPESESTTTSSPASES 2044
            S++ +   + + T    +         S   T       + T I   S+S       +  
Sbjct: 248  SSSAAGGGAVAPTAAKALKREVEEGSSSGMGTMLPKNDTAETPIKYNSDSGKIVKLKALL 307

Query: 2045 TTTNNPKSESTTTNNP 2060
             +T + +S +     P
Sbjct: 308  DSTESSESINGGRWRP 323


>gnl|CDD|227270 COG4934, COG4934, Predicted protease [Posttranslational modification,
            protein turnover, chaperones].
          Length = 1174

 Score = 38.6 bits (90), Expect = 0.050
 Identities = 46/380 (12%), Positives = 103/380 (27%), Gaps = 50/380 (13%)

Query: 1764 INSVSPNVTSKILTTDNYSEIIFTTNNNSESTVVMSTLNSLLSENEKLFKPHAKTPGAEF 1823
            + S +P  T+ +  ++  +  ++   N    +    T  ++                   
Sbjct: 759  VISYAPPFTTGLFLSNGTAYTVYWNGNLIAESNGTLTPQTIQFNTTYSGSNTVT-----N 813

Query: 1824 LIQCQYCDFDSSMNLLSVSPYITNNLLISMLAATAVAIS----VIDNYSEIIFTTNNNSE 1879
                Q          +    Y +    I               +    + + F +     
Sbjct: 814  QTIPQVGLLIPLFKFVYGYYYSSAIATIDAKYVFNEGNGPGAYIYVGSTPLYFFSAIIYP 873

Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS--LVSESTTT 1937
            +++     N  +  +         +T            +S  S  T +    ++      
Sbjct: 874  NSLS---YNIYVIGSIAIIPLPYNATLLEWVGPAIIPLTSSGSNFTFSFGYYVIQFPPGI 930

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTT---TSSPESEST-------TTSSPESESTTT 1987
             +  +         S   + S  VS       +S P S  T         +        T
Sbjct: 931  YTINTSIPGLDPYSSLINSKSGTVSNLQIYFLSSVPTSGLTGKSSDGGIKNFVIDVLVNT 990

Query: 1988 SSLVSESTTTSSP---ESESTTTISPVS---------------ESTTTSSPVSESTTTIS 2029
            + + + +  T +     S S  TIS  S                 T+ +S +S    ++S
Sbjct: 991  NGISAINNGTGNYYVIASVSNGTISFSSQIYGKDVYNITVAEGNITSVNSALSNLIVSLS 1050

Query: 2030 ----PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
                P  ++   S    E    N+     T  +      + S+SP+    +T     ++ 
Sbjct: 1051 STTVPIIKNVLPSLVYGEYNIINS----YTGNDFGVITIVISNSPSGSYPSTLYNTDQTQ 1106

Query: 2086 TTSSPASESTTTSSPASEST 2105
            T+S  +S     +   +   
Sbjct: 1107 TSSYISSTLPAHNYIINLIL 1126


>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor.  This family is secreted by
            gram-negative Gammaproteobacteria such as Pseudomonas
            syringae of tomato and the fire blight plant pathogen
            Erwinia amylovora, amongst others. It is an essential
            pathogenicity factor of approximately 198 kDa. Its
            injection into the host-plant is dependent upon the
            bacterial type III or Hrp secretion system. The family is
            long and carries a number of predicted functional
            regions, including an ERMS or endoplasmic reticulum
            membrane retention signal at both the C- and the
            N-termini, a leucine-zipper motif from residues 539-560,
            and a nuclear localisation signal at 1358-1361. this
            conserved AvrE-family of effectors is among the few that
            are required for full virulence of many phytopathogenic
            pseudomonads, erwinias and pantoeas.
          Length = 1771

 Score = 38.6 bits (90), Expect = 0.056
 Identities = 33/253 (13%), Positives = 65/253 (25%), Gaps = 17/253 (6%)

Query: 1886 TLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1945
             L S+ +   T   PE  + +   P     ++ SP   ++   SL SE         +  
Sbjct: 2    QLISINTATKTAVQPE-ATPSAGAPTGLQQSSESPTQRASH--SLASEGKKNRKKMPKVF 58

Query: 1946 TTSSPESESTTTSSLVSESTTTSSPESESTTTSS----PES-------ESTTTSSLVSES 1994
              SS   +           T  +   S   T       PE        ES+ ++  ++ S
Sbjct: 59   QKSSAPRQIQAAPPQALNPTAAAPQSSRGPTLRELLALPEDDGETQAPESSPSARRLTRS 118

Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSES 2054
               +  E E       V               + S      +     S         S +
Sbjct: 119  EGVARHEMEDLAGRPVVKPDADRQLRQDILNKSSSSRRPPVSKEEGTSSKMPATALASAA 178

Query: 2055 TTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
               ++   + + ++    +  + S  +       +          P    +     E E 
Sbjct: 179  LFKDDEIRQEVDAARS--DQASQSRLSRSRGNPPAI-PPDAAPRQPMLTRSAGGRFEGED 235

Query: 2115 TTTSSPASESTTI 2127
                      + I
Sbjct: 236  ENLERNLQPQSPI 248


>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease, PatA/PatG
            family.  This model describes a protease domain
            associated with the maturation of various members of the
            cyanobactin family of ribosomally produced, heavily
            modified bioactive metabolites. Members include the PatA
            protein and C-terminal domain of the PatG protein of
            Prochloron didemni, TenA and a region of TenG from Nostoc
            spongiaeforme var. tenue, etc.
          Length = 602

 Score = 38.2 bits (89), Expect = 0.058
 Identities = 22/109 (20%), Positives = 34/109 (31%), Gaps = 13/109 (11%)

Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
            T+S    ++        S   +    ES+T+  P   +               PA  SI 
Sbjct: 230  TMSEGLVTSEQDGVEEASGCGVQGTIESSTSVIPPGRAAE-------------PAPVSIP 276

Query: 2067 SSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
             ++P   +T  ++    S      A    T   PAS   T S   S   
Sbjct: 277  VAAPGEGATPAAAQIELSAGVLPNAISPATPVRPASNGVTPSQAPSAEP 325



 Score = 33.5 bits (77), Expect = 1.5
 Identities = 19/99 (19%), Positives = 31/99 (31%), Gaps = 3/99 (3%)

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
            T S    ++    +   S        ES+T++ P       + P   S    +P   +T 
Sbjct: 230  TMSEGLVTSEQDGVEEASGCGVQGTIESSTSVIPPGR---AAEPAPVSIPVAAPGEGATP 286

Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
             ++    S           T   PAS  +T S   S   
Sbjct: 287  AAAQIELSAGVLPNAISPATPVRPASNGVTPSQAPSAEP 325



 Score = 31.2 bits (71), Expect = 7.5
 Identities = 18/99 (18%), Positives = 26/99 (26%), Gaps = 3/99 (3%)

Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
            T S    ++     E  S   +    ES+T+  P   +                A+ +  
Sbjct: 230  TMSEGLVTSEQDGVEEASGCGVQGTIESSTSVIPPGRAAEPAPVSIPVAAPGEGATPAAA 289

Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
                   S      A    T   PAS   T S   S   
Sbjct: 290  QI---ELSAGVLPNAISPATPVRPASNGVTPSQAPSAEP 325


>gnl|CDD|218902 pfam06121, DUF959, Domain of Unknown Function (DUF959).  This
            N-terminal domain is not expressed in the 'Short' isoform
            of Collagen A.
          Length = 202

 Score = 37.1 bits (85), Expect = 0.061
 Identities = 39/177 (22%), Positives = 63/177 (35%), Gaps = 17/177 (9%)

Query: 1884 MSTLNSL--LSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPE 1941
            +ST      L +  T  SP + S       +   +T S  +          ESTT +S E
Sbjct: 16   LSTPKKPTWLWKPYTELSPTASSAAVPQASTPVQSTESTTTHVVPRPGETEESTTPASSE 75

Query: 1942 SES------TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS------ 1989
                          P + +TT +         SSP+      +   +E    +       
Sbjct: 76   EPKEIVEKGKQNVVPGTVATTPTVTPVAMDVASSPDLSEENIAGVGAEILNVAEGIRSFV 135

Query: 1990 -LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
             L  +  T +S ++    T  P+  +T  SS     TTT+ P S     SSP++ +T
Sbjct: 136  QLWEDKVTNASAQTPVPDTEMPLVLATPISSLPQNDTTTLWPSSH--IPSSPSANTT 190



 Score = 34.4 bits (78), Expect = 0.48
 Identities = 40/204 (19%), Positives = 68/204 (33%), Gaps = 20/204 (9%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
            +     T + +   + T    P +E + T+S  +    ++  +S    T S  +      
Sbjct: 7    TSADAETASLSTPKKPTWLWKPYTELSPTASSAAVPQASTPVQS----TESTTTHVVPRP 62

Query: 1959 SLVSESTTTSSPESES------TTTSSPESESTTTSSLVSESTTTSSPESESTTT----- 2007
                ESTT +S E              P + +TT +         SSP+           
Sbjct: 63   GETEESTTPASSEEPKEIVEKGKQNVVPGTVATTPTVTPVAMDVASSPDLSEENIAGVGA 122

Query: 2008 -ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
             I  V+E   +   + E   T    + S  T  P +E          S   N+  +   +
Sbjct: 123  EILNVAEGIRSFVQLWEDKVT----NASAQTPVPDTEMPLVLATPISSLPQNDTTTLWPS 178

Query: 2067 SSSPASESTTTSSPASESTTTSSP 2090
            S  P+S S  T+   + S  T  P
Sbjct: 179  SHIPSSPSANTTEAGTLSGPTKLP 202


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 38.5 bits (89), Expect = 0.062
 Identities = 39/196 (19%), Positives = 56/196 (28%), Gaps = 18/196 (9%)

Query: 1954 STTTSSLVSES---TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
            S TTS L S +     T  P    + T  P    TT S +   S     P       + P
Sbjct: 579  SPTTSQLASSAPSYAQTPWPVPHPSQTPEP---PTTQSHIPETSAPRQWPMPLRPIPMRP 635

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSS--------PASESTTTNN----PKSESTTTN 2058
            +     T + +   T    P+ E T            P   S T  N     +    T  
Sbjct: 636  LRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQ 695

Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
             P         PA+       PA+ +     PA+       PA+       P +      
Sbjct: 696  PPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 755

Query: 2119 SPASESTTIEEQGVSP 2134
             PA+          +P
Sbjct: 756  PPAAAPGRARPPAAAP 771



 Score = 35.4 bits (81), Expect = 0.45
 Identities = 39/198 (19%), Positives = 60/198 (30%), Gaps = 16/198 (8%)

Query: 1924 STTTSSLVSES---TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
            S TTS L S +     T  P    + T  P    TT S +   S     P         P
Sbjct: 579  SPTTSQLASSAPSYAQTPWPVPHPSQTPEP---PTTQSHIPETSAPRQWPMPLRPIPMRP 635

Query: 1981 ESESTTTSSLVSESTTTSSPESEST------TTISPVSESTTTSSPVSESTTTISPESES 2034
                  T +++   T    P+ E T      T I  +    + +       T +  +   
Sbjct: 636  LRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGA----NTMLPIQWAP 691

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
             T   P    T    P +       PA+ +  +  PA+       PA+       PA+  
Sbjct: 692  GTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAP 751

Query: 2095 TTTSSPASESTTTSSPES 2112
                 PA+       P +
Sbjct: 752  GRARPPAAAPGRARPPAA 769



 Score = 34.7 bits (79), Expect = 0.74
 Identities = 37/197 (18%), Positives = 56/197 (28%), Gaps = 17/197 (8%)

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
            +SP +    +S+P      T   V   + T  P    TT S     S      +      
Sbjct: 578  TSPTTSQLASSAPSY--AQTPWPVPHPSQTPEP---PTTQSHIPETSAPRQWPMPLRPIP 632

Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTIS--------PESESTTTSS----PASEST 2045
              P      T + +   T    P  E T            P   S T ++          
Sbjct: 633  MRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPG 692

Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
            T   P    T    PA+    +  PA+ +     PA+       PA+       PA+   
Sbjct: 693  TMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPG 752

Query: 2106 TTSSPESESTTTSSPAS 2122
                P +       PA+
Sbjct: 753  RARPPAAAPGRARPPAA 769


>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722).  This
            eukaryotic family of proteins has no known function.
          Length = 373

 Score = 37.9 bits (88), Expect = 0.063
 Identities = 53/255 (20%), Positives = 89/255 (34%), Gaps = 27/255 (10%)

Query: 1897 TNSPESESTTTNNPESESTTTSS--PESESTTTSSLVSESTTTSSPESESTTTSSPE--- 1951
                +S ST    PE   T  S   P S     SS  S  T   +        +SP    
Sbjct: 123  ALPTKSNSTGLLAPEQNGTNASPVPPSSYKFPPSS--SGLTPRHTVLPTHRRPNSPARIG 180

Query: 1952 -----SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
                 + +T T+    ES   SSP       S     + +  S    S  T+S   +  T
Sbjct: 181  AAAVANLATPTTPYKEESLGASSPLRRKKFGSQLHQRNMSLPSNTPTSGNTNSNIPKPAT 240

Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPAS-----ESTTTNNPKSESTTTNNPA 2061
            ++     S     P+ + + +    S+ + TS         ES    + +++S+++    
Sbjct: 241  SVLNFKPSPA--QPLHKQSKSAPQPSQESMTSFQHIIQWKPESQQKKHRRTKSSSSFG-- 296

Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
               I  +S +  S        +   + S   ++   S P  EST +   + ++ + SS  
Sbjct: 297  --VIDLNSISEASQVN--EDDDPPDSDSKERKNEENSDP--ESTPSDDNDDKTCSESSSR 350

Query: 2122 SESTTIEEQGVSPHS 2136
            SES      G  PH 
Sbjct: 351  SESPNRTNTGRYPHD 365



 Score = 31.4 bits (71), Expect = 7.3
 Identities = 33/139 (23%), Positives = 51/139 (36%), Gaps = 25/139 (17%)

Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS--PASE----STTTNNPK 2051
            S   +EST                S ST  ++PE   T  S   P+S     S++   P+
Sbjct: 111  SGGLAEST-------NPRQALPTKSNSTGLLAPEQNGTNASPVPPSSYKFPPSSSGLTPR 163

Query: 2052 SESTTTN-NPASES-ITSSSPASESTTTSSPASESTTTSSPASESTTTS----------S 2099
                 T+  P S + I +++ A+ +T T+    ES   SSP       S          S
Sbjct: 164  HTVLPTHRRPNSPARIGAAAVANLATPTTPYKEESLGASSPLRRKKFGSQLHQRNMSLPS 223

Query: 2100 PASESTTTSSPESESTTTS 2118
                S  T+S   +  T+ 
Sbjct: 224  NTPTSGNTNSNIPKPATSV 242


>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.  Genome
            duplication is precisely regulated by cyclin-dependent
            kinases CDKs, which bring about the onset of S phase by
            activating replication origins and then prevent
            relicensing of origins until mitosis is completed. The
            optimum sequence motif for CDK phosphorylation is
            S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11
            potential phosphorylation sites. Drc1 is required for DNA
            synthesis and S-M replication checkpoint control. Drc1
            associates with Cdc2 and is phosphorylated at the onset
            of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA
            replication by phosphorylating Drc1 and regulating its
            association with Cut5. Sld2 and Sld3 represent the
            minimal set of S-CDK substrates required for DNA
            replication.
          Length = 397

 Score = 37.9 bits (88), Expect = 0.063
 Identities = 45/183 (24%), Positives = 73/183 (39%), Gaps = 14/183 (7%)

Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
             L+S  T   SP+    ++   ES+ST    + S+     SP S   + SS ++E   T 
Sbjct: 46   KLLSAKTIEPSPKKRKHSSPDGESQSTPRKRIPSDVDPYDSP-SALRSPSSLKTELGPTP 104

Query: 1989 S----------LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
                       L+S ST   S  S+     S V+ +T  S+P S+   T+  E E     
Sbjct: 105  QRDGKVLSLFDLLSSSTPPESTPSKRKLA-SSVASATPFSTP-SKRRETLDAEDEDRPEY 162

Query: 2039 SPASESTTTNNPKSESTTTNN-PASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
             P SE T  ++ K         P S   +S +P+    +    ++ S   +S   +   +
Sbjct: 163  GPRSERTPLSSGKKVMLDLFFTPTSWRYSSETPSFLRRSNQDVSATSNPLNSAEPDFGVS 222

Query: 2098 SSP 2100
             SP
Sbjct: 223  PSP 225



 Score = 33.6 bits (77), Expect = 1.3
 Identities = 40/184 (21%), Positives = 65/184 (35%), Gaps = 16/184 (8%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
               S  T   +P+    ++   ES+ST    + S+     SP S   + SS ++E   T 
Sbjct: 46   KLLSAKTIEPSPKKRKHSSPDGESQSTPRKRIPSDVDPYDSP-SALRSPSSLKTELGPTP 104

Query: 1959 S-----------LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
                        L S +   S+P      +S   +   +T S   E   T   E E    
Sbjct: 105  QRDGKVLSLFDLLSSSTPPESTPSKRKLASSVASATPFSTPSKRRE---TLDAEDEDRPE 161

Query: 2008 ISPVSESTTTSSPVSES-TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
              P SE T  SS          +P S   ++ +P+    +  +  + S   N+   +   
Sbjct: 162  YGPRSERTPLSSGKKVMLDLFFTPTSWRYSSETPSFLRRSNQDVSATSNPLNSAEPDFGV 221

Query: 2067 SSSP 2070
            S SP
Sbjct: 222  SPSP 225


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
            consists of several rhoptry-associated protein 1 (RAP-1)
            sequences which appear to be specific to Plasmodium
            falciparum.
          Length = 790

 Score = 38.1 bits (88), Expect = 0.068
 Identities = 23/151 (15%), Positives = 51/151 (33%), Gaps = 6/151 (3%)

Query: 1865 DNYSEIIFTTNNNS-ESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESE 1923
            D +S+  F  N  S +   +  T  S   + +     +S   + +         S  + E
Sbjct: 62   DEFSDESFLENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDME 121

Query: 1924 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
                 +   +       + E   +SS      +  S     + ++S   ES ++    ++
Sbjct: 122  FKANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKS----GSKSASKHGESNSSDESATD 177

Query: 1984 STTTSSLVSESTTTSSPESES-TTTISPVSE 2013
            S   S+ V+           +   T++P+ E
Sbjct: 178  SGKASASVAGIVGADEEAPPAPKNTLTPLEE 208



 Score = 38.1 bits (88), Expect = 0.075
 Identities = 33/169 (19%), Positives = 56/169 (33%), Gaps = 17/169 (10%)

Query: 1894 NTTTNSPESESTTTNNPESES----TTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
            N+  +    ES   N    +      T +S   +++      S   + S+        S 
Sbjct: 58   NSWEDEFSDESFLENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSK 117

Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
             + E     +   +       + E   +SS      +  S     + ++S   ES ++  
Sbjct: 118  DDMEFKANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKS----GSKSASKHGESNSS-- 171

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
               ES T S   S S   I    E    + PA ++T T  P  E   TN
Sbjct: 172  --DESATDSGKASASVAGIVGADEE---APPAPKNTLT--PLEELYETN 213



 Score = 37.4 bits (86), Expect = 0.14
 Identities = 28/170 (16%), Positives = 62/170 (36%), Gaps = 5/170 (2%)

Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
            +S   E +  S  E++++     ++ + T+ +  +          +S   S+ A+     
Sbjct: 58   NSWEDEFSDESFLENKASKDDGNINLTDTSENGDASKKGH----GKSRVRSASAAAILEE 113

Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
            ++ K +     NP +E+     P        + +S+  + +S  S S + S     +++ 
Sbjct: 114  DDSKDDMEFKANP-NEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSD 172

Query: 2108 SSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHT 2157
             S       ++S A      EE   +P +      E  E   N    +H 
Sbjct: 173  ESATDSGKASASVAGIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHP 222


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
            Validated.
          Length = 647

 Score = 37.9 bits (89), Expect = 0.074
 Identities = 18/116 (15%), Positives = 36/116 (31%), Gaps = 2/116 (1%)

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
            P +       P   +    S ++ +  T++ A        P   S     PA     ++S
Sbjct: 361  PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTS 420

Query: 2070 PASESTT--TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
                +      +  +     S PA+ S      ++     S   + S    +PA +
Sbjct: 421  QLLAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKK 476


>gnl|CDD|132697 TIGR03658, IsdH_HarA, haptoglobin-binding heme uptake protein HarA.
            HarA is a heme-binding NEAT-domain (NEAr Transporter,
            pfam05031) protein which has been shown to bind to the
            haptoglobin-hemoglobin complex in order to extract heme
            from it. HarA has also been reported to bind hemoglobin
            directly. HarA (also known as IsdH) contains three NEAT
            domains as well as a sortase A C-terminal signal for
            localization to the cell wall. The heme bound at the
            third of these NEAT domains has been shown to be
            transferred to the IsdA protein also localized at the
            cell wall, presumably through an additional specific
            protein-protein interaction. Haptoglobin is a hemoglobin
            carrier protein involved in scavenging hemoglobin in the
            blood following red blood cell lysis and targetting it to
            the liver.
          Length = 895

 Score = 37.9 bits (87), Expect = 0.080
 Identities = 25/119 (21%), Positives = 56/119 (47%), Gaps = 2/119 (1%)

Query: 1866 NYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESEST 1925
            +Y++++F     ++ ++V S  N  +  N  ++S  S  T TN     ++T ++  ++  
Sbjct: 213  DYTKLVFAKPIYNDPSLVKSDTNDAVVTNDQSSSDASNQTNTNTSNQNTSTINNANNQPQ 272

Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
             T+++   +   SS  ++    SS  +  T ++   ++ T  SS +S+      P  ES
Sbjct: 273  ATTNMSQPAQPKSSANADQ--ASSQPAHETNSNGNTNDKTNESSNQSDVNQQYPPADES 329



 Score = 37.5 bits (86), Expect = 0.11
 Identities = 32/147 (21%), Positives = 60/147 (40%), Gaps = 7/147 (4%)

Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT--ISPESESTTT 2037
            P S+ T    +VS +      E+    T    ++       + +S T   +    +S++ 
Sbjct: 188  PVSDGTQELKIVSSTQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVTNDQSSSD 247

Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
            +S  + + T+N   S     NN    +   S PA   ++ ++       +S PA E T +
Sbjct: 248  ASNQTNTNTSNQNTSTINNANNQPQATTNMSQPAQPKSSANA----DQASSQPAHE-TNS 302

Query: 2098 SSPASESTTTSSPESESTTTSSPASES 2124
            +   ++ T  SS +S+      PA ES
Sbjct: 303  NGNTNDKTNESSNQSDVNQQYPPADES 329



 Score = 32.1 bits (72), Expect = 4.4
 Identities = 28/131 (21%), Positives = 56/131 (42%), Gaps = 2/131 (1%)

Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
            ST     E  +   + LV      + P    + T+     +  +SS  S  T T++  ++
Sbjct: 201  STQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVTNDQSSSDASNQTNTNT-SNQ 259

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
            +T+TI+  +     ++ +S+     S  +    +S PA E+ +  N   ++  ++N  S+
Sbjct: 260  NTSTINNANNQPQATTNMSQPAQPKSSANADQASSQPAHETNSNGNTNDKTNESSN-QSD 318

Query: 2064 SITSSSPASES 2074
                  PA ES
Sbjct: 319  VNQQYPPADES 329


>gnl|CDD|152561 pfam12126, DUF3583, Protein of unknown function (DUF3583).  This
            domain is found in eukaryotes, and is typically between
            302 and 338 amino acids in length. It is found in
            association with pfam00097 and pfam00643. Most members
            are promyelocytic leukemia proteins, and this family lies
            towards the C-terminus.
          Length = 284

 Score = 37.3 bits (86), Expect = 0.082
 Identities = 35/147 (23%), Positives = 52/147 (35%), Gaps = 12/147 (8%)

Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1963
            S  T   ++  +  +SPE+ ST           T+  E+ +TTTS     S T       
Sbjct: 149  SCITQGIDAAVSKKASPEAAST------PRDPVTTDTEASNTTTSQKRKCSQTDCPRKII 202

Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
               +     +   TSSPE    +TS  VS       P  ES      +        P S 
Sbjct: 203  KMESEEGNEDRLATSSPEQPRPSTSKAVSPPHLDGPPSPESPVPEKEI------LLPNSN 256

Query: 2024 STTTISPESESTTTSSPASESTTTNNP 2050
              T+ + E+E       +SE +   N 
Sbjct: 257  HVTSDTGETEERVVVISSSEDSDAENL 283


>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
          Length = 3095

 Score = 38.0 bits (88), Expect = 0.085
 Identities = 41/244 (16%), Positives = 84/244 (34%), Gaps = 26/244 (10%)

Query: 1925 TTTSSLVSESTTTSS----PESESTTTSSPESESTTTSSLV---SESTTTSSPESESTTT 1977
              T  ++  + + +       SEST  +  E      S+LV   S +   S P + +   
Sbjct: 329  WQTKIVIGTADSYADSSPKLHSESTDLTPHEHGEYDPSTLVGGASTNINISDPPARTDCR 388

Query: 1978 SSPESEST--TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP-ESES 2034
               E      +  S + + T  +S  +  +   S +SE  +  +      T     ++  
Sbjct: 389  RYSEGSVIHESVDSHIEDVTEATSVVAAWSDAFSDISEDYSHLTRPDLPATAHDVSKNGH 448

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
             T S   S  + + + +   + T   +SE+++S  P    +   S  S+          S
Sbjct: 449  DTKSDRRSRGSNSRHKRRRPSWTPPSSSENVSSDGPTFSQSRKPSRKSKRALDLDYGHLS 508

Query: 2095 TTTSSPASE-----STTTSSPESESTTTSSPASE---STTIEEQGV------SPHSEKLS 2140
               S    E     +   S+     +     +S+     +IE   +      +PH+  ++
Sbjct: 509  NEPSDVDGENSDSPAGAISNIPDNVSFNEFISSQARAEDSIEHLSLRNRPVFNPHT--VT 566

Query: 2141 ANED 2144
             N D
Sbjct: 567  GNLD 570



 Score = 35.0 bits (80), Expect = 0.77
 Identities = 41/220 (18%), Positives = 80/220 (36%), Gaps = 20/220 (9%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVS-ESTTTSSPESESTTTSSPESESTTT 1957
            + +S + ++    SEST  +  E      S+LV   ST  +  +  + T     SE +  
Sbjct: 337  TADSYADSSPKLHSESTDLTPHEHGEYDPSTLVGGASTNINISDPPARTDCRRYSEGSVI 396

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
                   +  S  E  +  TS   + S   S +  + +  + P+  +T     VS++   
Sbjct: 397  -----HESVDSHIEDVTEATSVVAAWSDAFSDISEDYSHLTRPDLPATA--HDVSKNGHD 449

Query: 2018 SSPVSESTTTIS----------PESESTTTSSPASESTTTNNP--KSESTTTNNPASESI 2065
            +     S  + S          P S S   SS     + +  P  KS+     +    S 
Sbjct: 450  TKSDRRSRGSNSRHKRRRPSWTPPSSSENVSSDGPTFSQSRKPSRKSKRALDLDYGHLSN 509

Query: 2066 TSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
              S    E++ + + A  +   +   +E  ++ + A +S 
Sbjct: 510  EPSDVDGENSDSPAGAISNIPDNVSFNEFISSQARAEDSI 549



 Score = 31.5 bits (71), Expect = 8.2
 Identities = 23/128 (17%), Positives = 47/128 (36%), Gaps = 7/128 (5%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE-----STTTSSPESESTT 1946
            SEN +++ P    +   + +S+          S   S +  E     +   S+     + 
Sbjct: 476  SENVSSDGPTFSQSRKPSRKSKRALDLDYGHLSNEPSDVDGENSDSPAGAISNIPDNVSF 535

Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
                 S++    S+   S       +  T T     ++T   SL ++  + S P S+ + 
Sbjct: 536  NEFISSQARAEDSIEHLSLRNRPVFNPHTVTG--NLDNTLRDSLWNDEYSGSYPLSDISD 593

Query: 2007 TISPVSES 2014
             I  ++ES
Sbjct: 594  MIDDITES 601


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 37.3 bits (86), Expect = 0.090
 Identities = 26/99 (26%), Positives = 43/99 (43%), Gaps = 7/99 (7%)

Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
            S     L +   TT++  + +TT SS + E+  ++     ST   +P+S+   T  P   
Sbjct: 10   SLALGLLTTGVITTTTQAANATTPSSTKVEAPQSTP---PSTKVEAPQSKPNATTPP--- 63

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTN-NPK 2051
            ST   +P      T    ++  T  SP ++   T  NPK
Sbjct: 64   STKVEAPQQTPNATTPSSTKVETPQSPTTKQVPTEINPK 102



 Score = 36.1 bits (83), Expect = 0.18
 Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 7/87 (8%)

Query: 1946 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
            TT++  + +TT SS   E+  ++ P   ST   +P+S+   T+     ST   +P+    
Sbjct: 22   TTTTQAANATTPSSTKVEAPQSTPP---STKVEAPQSKPNATTP---PSTKVEAPQQTPN 75

Query: 2006 TTISPVSESTTTSSPVSEST-TTISPE 2031
             T    ++  T  SP ++   T I+P+
Sbjct: 76   ATTPSSTKVETPQSPTTKQVPTEINPK 102



 Score = 34.2 bits (78), Expect = 0.71
 Identities = 21/72 (29%), Positives = 36/72 (50%), Gaps = 5/72 (6%)

Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE--STTTSSPASESTTIEEQGVS 2133
            TT++ A+ +TT SS   E+  ++ P   ST   +P+S+  +TT  S   E+        +
Sbjct: 22   TTTTQAANATTPSSTKVEAPQSTPP---STKVEAPQSKPNATTPPSTKVEAPQQTPNATT 78

Query: 2134 PHSEKLSANEDP 2145
            P S K+   + P
Sbjct: 79   PSSTKVETPQSP 90



 Score = 33.5 bits (76), Expect = 1.2
 Identities = 31/128 (24%), Positives = 51/128 (39%), Gaps = 13/128 (10%)

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
            TTI+  S +    +    +TTT     ++     P+S  + +      ST   +P S+  
Sbjct: 4    TTIAKTSLALGLLTTGVITTTT-----QAANATTPSSTKVEAPQSTPPSTKVEAPQSKPN 58

Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA--NE 2143
             T+ P   ST   +P      T+     ST   +P S +T      ++P  + L A   +
Sbjct: 59   ATTPP---STKVEAPQQTPNATTPS---STKVETPQSPTTKQVPTEINPKFKDLRAYYTK 112

Query: 2144 DPEEFPNE 2151
               EF NE
Sbjct: 113  PSLEFKNE 120



 Score = 33.5 bits (76), Expect = 1.2
 Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 9/80 (11%)

Query: 1976 TTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESEST 2035
            TT++  + +TT SS   E+  ++ P   ST   +P S+   T+ P   ST   +P+    
Sbjct: 22   TTTTQAANATTPSSTKVEAPQSTPP---STKVEAPQSKPNATTPP---STKVEAPQQTPN 75

Query: 2036 TTSSPASESTTTNNPKSEST 2055
             T+     ST    P+S +T
Sbjct: 76   ATTPS---STKVETPQSPTT 92



 Score = 32.7 bits (74), Expect = 2.1
 Identities = 20/82 (24%), Positives = 38/82 (46%), Gaps = 6/82 (7%)

Query: 1916 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
            TT++  + +TT SS   E+  ++ P   ST   +P+S+   T+     ST   +P+    
Sbjct: 22   TTTTQAANATTPSSTKVEAPQSTPP---STKVEAPQSKPNATTP---PSTKVEAPQQTPN 75

Query: 1976 TTSSPESESTTTSSLVSESTTT 1997
             T+   ++  T  S  ++   T
Sbjct: 76   ATTPSSTKVETPQSPTTKQVPT 97



 Score = 30.8 bits (69), Expect = 9.5
 Identities = 18/89 (20%), Positives = 35/89 (39%), Gaps = 5/89 (5%)

Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN 2048
             L++    T++ ++ + TT S        S+P S        +  +TT  S   E+    
Sbjct: 14   GLLTTGVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEA---- 69

Query: 2049 NPKSESTTTNNPASESITSSSPASESTTT 2077
             P+     T   +++  T  SP ++   T
Sbjct: 70   -PQQTPNATTPSSTKVETPQSPTTKQVPT 97


>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain.  This region of the APC
            family of proteins is known as the basic domain. It
            contains a high proportion of positively charged amino
            acids and interacts with microtubules.
          Length = 359

 Score = 37.4 bits (86), Expect = 0.091
 Identities = 38/162 (23%), Positives = 57/162 (35%), Gaps = 8/162 (4%)

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
              S+STT S       T  S   +S +     S S    +  SE    S P   S T  +
Sbjct: 16   NRSQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELS-PPPRSATPPA 74

Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST--TTSSPASESTTT 2087
              +++ ++SS  + + +   P+     T +    SI      S S    TSSPA      
Sbjct: 75   RLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTSSPA--RALL 132

Query: 2088 SSPASESTTTSSPA---SESTTTSSPESESTTTSSPASESTT 2126
            +S  S+  T  SP            P      +S P  E  +
Sbjct: 133  ASSGSQHKTQKSPVRIPFMQNPAKPPPLSKNASSRPRPEPGS 174



 Score = 37.4 bits (86), Expect = 0.094
 Identities = 50/254 (19%), Positives = 78/254 (30%), Gaps = 32/254 (12%)

Query: 1904 STTTNNP-ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1962
            S    +P   +  + S       +  + +S    +++P +    T S  S  T+T     
Sbjct: 35   SDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSATPPARLAKTPSSSSSQTSTP---- 90

Query: 1963 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV- 2021
             S     P    T ++   S      + +S+   TSSP        S  S+  T  SPV 
Sbjct: 91   -SQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTSSP--ARALLASSGSQHKTQKSPVR 147

Query: 2022 --SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
                      P      +S P  E       +  +     P +    S        +  S
Sbjct: 148  IPFMQNPAKPPPLSKNASSRPRPEPG----SRGRAGMNGGPGARG--SRLELVRMASAKS 201

Query: 2080 PASESTTT----------SSPASESTTTSSPASESTTTSSPESESTTTSSPASE-----S 2124
              SES  +           SP +     S  +S  +  SS +  S   S PA       S
Sbjct: 202  SGSESDRSGFRRQLTFIKESPGTLRRRRSELSSAESLASSSQPASPRRSRPALPAVFLCS 261

Query: 2125 TTIEEQGVSPHSEK 2138
            +   E   S HS  
Sbjct: 262  SRCPELRASTHSSV 275



 Score = 33.9 bits (77), Expect = 1.0
 Identities = 32/200 (16%), Positives = 52/200 (26%), Gaps = 10/200 (5%)

Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
            T    P   + + S+  S+              S    +  + S       +  + +S  
Sbjct: 7    TVIYIPGPANRSQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPP 66

Query: 1995 TTTSSPESESTTTISPVSESTTTSS---PVSESTTTISPESESTTTSSPASESTTTNNPK 2051
              +++P +    T S  S  T+T S   P      T S    S       S S       
Sbjct: 67   PRSATPPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTSS 126

Query: 2052 SESTTTNNPASESITSSSPA---SESTTTSSPASESTTTSSPASESTTTSSPASESTTTS 2108
                   +  S+  T  SP            P      +S P  E  +           +
Sbjct: 127  PARALLASSGSQHKTQKSPVRIPFMQNPAKPPPLSKNASSRPRPEPGSRGRAGMNGGPGA 186

Query: 2109 SPE----SESTTTSSPASES 2124
                       +  S  SES
Sbjct: 187  RGSRLELVRMASAKSSGSES 206



 Score = 32.0 bits (72), Expect = 4.1
 Identities = 46/240 (19%), Positives = 66/240 (27%), Gaps = 28/240 (11%)

Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL-----VSESTTTSSPE 1971
               P + S +T+         + P   S    SP      + SL      SE    S P 
Sbjct: 11   IPGPANRSQSTTPSKKGPPLKTQP---SDPPKSPSPGQQRSRSLHRPAKPSELAELSPPP 67

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSS---PESESTTTISPVSESTTTSSPVSESTTTI 2028
              +T    P +    T S  S  T+T S   P      T S    S       S S    
Sbjct: 68   RSAT----PPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPR 123

Query: 2029 SPESESTTTSSPASESTTTNNP---KSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
            +        +S  S+  T  +P            P      SS P  E  +         
Sbjct: 124  TSSPARALLASSGSQHKTQKSPVRIPFMQNPAKPPPLSKNASSRPRPEPGSRGRAGMNGG 183

Query: 2086 TTSSPA----SESTTTSSPASESTTTSSPESESTTTSSPA------SESTTIEEQGVSPH 2135
              +  +        +  S  SES  +      +    SP       SE ++ E    S  
Sbjct: 184  PGARGSRLELVRMASAKSSGSESDRSGFRRQLTFIKESPGTLRRRRSELSSAESLASSSQ 243



 Score = 30.9 bits (69), Expect = 9.6
 Identities = 56/336 (16%), Positives = 94/336 (27%), Gaps = 56/336 (16%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTS------------SLVSESTT------ 1936
            + + +P  +        S+   + SP  + + +             S    S T      
Sbjct: 18   SQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSATPPARLA 77

Query: 1937 -TSSPESESTTTSS---PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
             T S  S  T+T S   P      T S    S       S S    +        +S  S
Sbjct: 78   KTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTSSPARALLASSGS 137

Query: 1993 ESTTTSSP---ESESTTTISPVSESTTTSSPVSESTTTISPESES--------------T 2035
            +  T  SP            P      +S P  E  +                       
Sbjct: 138  QHKTQKSPVRIPFMQNPAKPPPLSKNASSRPRPEPGSRGRAGMNGGPGARGSRLELVRMA 197

Query: 2036 TTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
            +  S  SES  +   +  +    +P +     S  +S  +  SS    S   S PA  + 
Sbjct: 198  SAKSSGSESDRSGFRRQLTFIKESPGTLRRRRSELSSAESLASSSQPASPRRSRPALPAV 257

Query: 2096 TTSSPASE--STTTSSPESESTTTSSPASESTTIEEQ--------GVSPHSEKLSANEDP 2145
               S        +T S          P  +   IE           ++    + +++E P
Sbjct: 258  FLCSSRCPELRASTHSSVQAGGWRKLPPRQGPAIEYNQRRPAARPDIAERYGRRTSSESP 317

Query: 2146 EEFP------NEDVFEHTFAEIPNIDHSNQTDEAIP 2175
               P        +  +  +A +P+I    +TD A  
Sbjct: 318  SRLPVRAGPGKPETVKR-YASLPHISVWRRTDSASS 352


>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
            Provisional.
          Length = 418

 Score = 37.6 bits (87), Expect = 0.093
 Identities = 28/169 (16%), Positives = 51/169 (30%), Gaps = 11/169 (6%)

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
            S +   +S    S             ++    +       SE +    P     +   P 
Sbjct: 259  SSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHS---DPEPLPASLPPPP 315

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
                    PE+          E     N +++       ++ S  SSS  ++ + ++ P 
Sbjct: 316  VRRPRVKHPEA-------GKEEPDGARNAEAKEPAQPATSTSSKGSSSAQNKDSGSTGPG 368

Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTS-SPASESTTIEE 2129
            S     SS   +    S P   +T+     S S T++  P S   T   
Sbjct: 369  SSLAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLS 417



 Score = 36.8 bits (85), Expect = 0.14
 Identities = 33/171 (19%), Positives = 65/171 (38%), Gaps = 11/171 (6%)

Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS--ESTTTSSPESESTTTISP 2010
            + T+++ +++ S  TS+P+  S    +  + +  T+ L     S  +      ++    P
Sbjct: 256  QLTSSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLPPPP 315

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
            V           +     +  +E+   + PA   T+T++  S S       ++   S+ P
Sbjct: 316  VRRPRVKHPEAGKEEPDGARNAEAKEPAQPA---TSTSSKGSSSA-----QNKDSGSTGP 367

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS-SPESESTTTSSP 2120
             S     SS   +    S P   +T+     S S T++  P S   T  S 
Sbjct: 368  GSSLAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLSD 418



 Score = 34.1 bits (78), Expect = 1.1
 Identities = 18/92 (19%), Positives = 35/92 (38%), Gaps = 1/92 (1%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS-PESESTTT 1957
            + + E     N E++     +  + S  +SS  ++ + ++ P S     SS  E +   +
Sbjct: 326  AGKEEPDGARNAEAKEPAQPATSTSSKGSSSAQNKDSGSTGPGSSLAAASSFLEDDDFGS 385

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSS 1989
              L   ++    P    T+   P S   T  S
Sbjct: 386  PPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLS 417


>gnl|CDD|217602 pfam03535, Paxillin, Paxillin family. 
          Length = 193

 Score = 36.4 bits (84), Expect = 0.094
 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 2060 PASESITSSS--PASESTTTSSPASESTTTSSPASESTTTSSPASESTT-TSSPESESTT 2116
            P++E++  SS   AS S  +S P  ES    S +++ ++ S PA E     S P  + + 
Sbjct: 10   PSAEALNGSSWVEASSSYHSSQPQQESPKYRSSSAKPSSPSPPAGEEEHVYSFPNKQKSA 69

Query: 2117 TSSPASESTTI 2127
             SSPA  S+++
Sbjct: 70   ESSPAVMSSSL 80



 Score = 35.7 bits (82), Expect = 0.17
 Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 1/69 (1%)

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT-TSSPASESTTTSSPESES 2114
            ++   AS S  SS P  ES    S +++ ++ S PA E     S P  + +  SSP   S
Sbjct: 18   SSWVEASSSYHSSQPQQESPKYRSSSAKPSSPSPPAGEEEHVYSFPNKQKSAESSPAVMS 77

Query: 2115 TTTSSPASE 2123
            ++  S  SE
Sbjct: 78   SSLGSNLSE 86



 Score = 34.5 bits (79), Expect = 0.35
 Identities = 22/79 (27%), Positives = 38/79 (48%), Gaps = 3/79 (3%)

Query: 2038 SSPASEST--TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT-TSSPASES 2094
              P++E+   ++    S S  ++ P  ES    S +++ ++ S PA E     S P  + 
Sbjct: 8    PPPSAEALNGSSWVEASSSYHSSQPQQESPKYRSSSAKPSSPSPPAGEEEHVYSFPNKQK 67

Query: 2095 TTTSSPASESTTTSSPESE 2113
            +  SSPA  S++  S  SE
Sbjct: 68   SAESSPAVMSSSLGSNLSE 86



 Score = 33.4 bits (76), Expect = 0.92
 Identities = 36/181 (19%), Positives = 67/181 (37%), Gaps = 13/181 (7%)

Query: 1894 NTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPES 1952
            N ++    S S  ++ P+ ES    S  ++ ++ S    E     S P  + +  SSP  
Sbjct: 16   NGSSWVEASSSYHSSQPQQESPKYRSSSAKPSSPSPPAGEEEHVYSFPNKQKSAESSPAV 75

Query: 1953 ESTTTSSLVSE---------STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
             S++  S +SE         +   S P   +   +SP   S++    + E+  +   ++ 
Sbjct: 76   MSSSLGSNLSELDRLLLELNAVQHSPPSFPADEEASPPLPSSSIPHYIPENGGSPGGKAA 135

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
              T   P          V  S  ++  E ES   S P+     T +    S+     +S+
Sbjct: 136  PPTKEKPKRNGGRGIEDVRPSVESLLDELES---SVPSPVPAITVSQGETSSPQQVNSSQ 192

Query: 2064 S 2064
             
Sbjct: 193  Q 193


>gnl|CDD|206007 pfam13836, DUF4195, Domain of unknown function (DUF4195).  This
            family is found at the N-terminus of metazoan proteins
            that carry PHD-like zinc-finger domains. The function is
            not known.
          Length = 184

 Score = 36.2 bits (83), Expect = 0.099
 Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 10/148 (6%)

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE-STTTSSPESESTTTSSLVSES 1994
             TS      ++       +    S  S+S+    P S+   T +SP+ +S  +S L+ +S
Sbjct: 44   PTSQHYRNPSSNPVAALPNFHPESKSSDSSVIVQPFSKPDFTKNSPQVDSNNSSELLFDS 103

Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP-ASESTTTNNPKSE 2053
            T  + P S+   T+S      T+      ST+ ++    S     P  SES +  NP S 
Sbjct: 104  TQDTLPHSQGGPTLSRAGMDETSFLLKHPSTSKVN----SVNPKKPKTSESVSGINPSSS 159

Query: 2054 STTTNNPASESITSSSPA-SESTTTSSP 2080
             ++  +P   S+TSS    S+ T TSS 
Sbjct: 160  LSSQKSP---SVTSSQVVLSKGTNTSSQ 184



 Score = 36.2 bits (83), Expect = 0.10
 Identities = 32/141 (22%), Positives = 56/141 (39%), Gaps = 6/141 (4%)

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE-STTTSSPESESTTTISPVSES 2014
             TS      ++       +    S  S+S+      S+   T +SP+ +S  +   + +S
Sbjct: 44   PTSQHYRNPSSNPVAALPNFHPESKSSDSSVIVQPFSKPDFTKNSPQVDSNNSSELLFDS 103

Query: 2015 TTTSSPVSESTTTISPESESTTT---SSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
            T  + P S+   T+S      T+     P++    + NPK   T+ +       +S S  
Sbjct: 104  TQDTLPHSQGGPTLSRAGMDETSFLLKHPSTSKVNSVNPKKPKTSESVSGINPSSSLSSQ 163

Query: 2072 SESTTTSSPA--SESTTTSSP 2090
               + TSS    S+ T TSS 
Sbjct: 164  KSPSVTSSQVVLSKGTNTSSQ 184



 Score = 35.0 bits (80), Expect = 0.24
 Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 6/118 (5%)

Query: 1899 SPESESTTTNNPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
            S  S+S+    P S+   T +SP+ +S  +S L+ +ST  + P S+   T S      T+
Sbjct: 67   SKSSDSSVIVQPFSKPDFTKNSPQVDSNNSSELLFDSTQDTLPHSQGGPTLSRAGMDETS 126

Query: 1958 SSLVSESTT---TSSPESESTTTSSPESESTTTSSLVSESTTTSSPE--SESTTTISP 2010
              L   ST+   + +P+   T+ S      +++ S     + TSS    S+ T T S 
Sbjct: 127  FLLKHPSTSKVNSVNPKKPKTSESVSGINPSSSLSSQKSPSVTSSQVVLSKGTNTSSQ 184



 Score = 33.5 bits (76), Expect = 0.80
 Identities = 39/147 (26%), Positives = 61/147 (41%), Gaps = 8/147 (5%)

Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE-STTTISPVSESTTTSSPVSES 2024
             TS      ++       +    S  S+S+    P S+   T  SP  +S  +S  + +S
Sbjct: 44   PTSQHYRNPSSNPVAALPNFHPESKSSDSSVIVQPFSKPDFTKNSPQVDSNNSSELLFDS 103

Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP-ASESTTTSSPASE 2083
            T    P S+   T S A    T+   K  ST+  N    S+    P  SES +  +P+S 
Sbjct: 104  TQDTLPHSQGGPTLSRAGMDETSFLLKHPSTSKVN----SVNPKKPKTSESVSGINPSSS 159

Query: 2084 STTTSSPASESTTTSSPASESTTTSSP 2110
             ++  SP+  S+      S+ T TSS 
Sbjct: 160  LSSQKSPSVTSSQVV--LSKGTNTSSQ 184



 Score = 31.5 bits (71), Expect = 3.7
 Identities = 28/119 (23%), Positives = 54/119 (45%), Gaps = 18/119 (15%)

Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTT--------S 1928
            +S+S+V++   +     + T NSP+ +S  ++    +ST  + P S+   T        +
Sbjct: 69   SSDSSVIVQPFSKP---DFTKNSPQVDSNNSSELLFDSTQDTLPHSQGGPTLSRAGMDET 125

Query: 1929 SLVSESTTTS-----SPESESTTTSSPESESTTTSSLVSESTTTSSPE--SESTTTSSP 1980
            S + +  +TS     +P+   T+ S      +++ S     + TSS    S+ T TSS 
Sbjct: 126  SFLLKHPSTSKVNSVNPKKPKTSESVSGINPSSSLSSQKSPSVTSSQVVLSKGTNTSSQ 184


>gnl|CDD|227549 COG5224, HAP2, CCAAT-binding factor, subunit B [Transcription].
          Length = 248

 Score = 36.7 bits (84), Expect = 0.099
 Identities = 26/136 (19%), Positives = 52/136 (38%), Gaps = 8/136 (5%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
            + S+ V  T       N +  S  S S+ +  P + +T + SP + ++  +         
Sbjct: 31   TVSSEVTHTSEGYADSNDSRPSSISNSSESPAPINSATASMSPANNTSGNNIT------- 83

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
             SP        S    +T ++S       T  P+++S T++   S S   S     +   
Sbjct: 84   -SPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSGSQLGSFATQSTNGN 142

Query: 1998 SSPESESTTTISPVSE 2013
            +S  + +++   P S 
Sbjct: 143  NSTTTTTSSAAHPGSF 158



 Score = 35.9 bits (82), Expect = 0.18
 Identities = 31/155 (20%), Positives = 62/155 (40%), Gaps = 9/155 (5%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
            TN+ ++    T + E   T+    +S  +  SS +S S+ + +P + +T + SP + ++ 
Sbjct: 21   TNANDATVPATVSSEVTHTSEGYADSNDSRPSS-ISNSSESPAPINSATASMSPANNTSG 79

Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
             +          SP        S    +T ++S       T  P+++S T+    S S  
Sbjct: 80   NNIT--------SPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSGSQL 131

Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
             S     +    S  + +++ + P S      N K
Sbjct: 132  GSFATQSTNGNNSTTTTTSSAAHPGSFQPDYVNAK 166



 Score = 32.5 bits (73), Expect = 2.1
 Identities = 29/156 (18%), Positives = 68/156 (43%), Gaps = 3/156 (1%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS-SPESESTTTSS 1959
            E+     N   +     ++  +++T  +++ SE T TS   ++S  +  S  S S+ + +
Sbjct: 3    EAAEAAANGGSTGDDVNATNANDATVPATVSSEVTHTSEGYADSNDSRPSSISNSSESPA 62

Query: 1960 LVSESTTTSSPESEST--TTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
             ++ +T + SP + ++    +SP        S    +T ++S       T+ P ++S T+
Sbjct: 63   PINSATASMSPANNTSGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTS 122

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
            +   S S         +   +S  + +++  +P S 
Sbjct: 123  NLMSSGSQLGSFATQSTNGNNSTTTTTSSAAHPGSF 158



 Score = 32.5 bits (73), Expect = 2.7
 Identities = 29/146 (19%), Positives = 56/146 (38%), Gaps = 4/146 (2%)

Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
            S     ++  +   T  + VS   T +S     +  S P S S ++ S    ++ T+S  
Sbjct: 13   STGDDVNATNANDATVPATVSSEVTHTSEGYADSNDSRPSSISNSSESPAPINSATASMS 72

Query: 2002 SESTTTI----SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
              + T+     SP        S    +T + S       T  P ++S T+N   S S   
Sbjct: 73   PANNTSGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSGSQLG 132

Query: 2058 NNPASESITSSSPASESTTTSSPASE 2083
            +     +  ++S  + +++ + P S 
Sbjct: 133  SFATQSTNGNNSTTTTTSSAAHPGSF 158



 Score = 30.5 bits (68), Expect = 9.1
 Identities = 25/116 (21%), Positives = 51/116 (43%), Gaps = 8/116 (6%)

Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNN--PASESITSSSPASESTTTSS-- 2079
            ST      + +   + PA+ S+   +       +N+  P+S S +S SPA  ++ T+S  
Sbjct: 13   STGDDVNATNANDATVPATVSSEVTHTSEGYADSNDSRPSSISNSSESPAPINSATASMS 72

Query: 2080 PASES----TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
            PA+ +     T+ +   E   +S P + ++T+     + T      S ++ +   G
Sbjct: 73   PANNTSGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSG 128



 Score = 30.5 bits (68), Expect = 9.2
 Identities = 32/142 (22%), Positives = 59/142 (41%), Gaps = 11/142 (7%)

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT------NNP 2050
            T++ ++    T+S     T+     S  +    P S S ++ SPA  ++ T      NN 
Sbjct: 21   TNANDATVPATVSSEVTHTSEGYADSNDS---RPSSISNSSESPAPINSATASMSPANNT 77

Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE--STTTS 2108
               + T+ N   E   SS P + ++T+     + T      S ++   S  S+  S  T 
Sbjct: 78   SGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSGSQLGSFATQ 137

Query: 2109 SPESESTTTSSPASESTTIEEQ 2130
            S    ++TT++ +S +     Q
Sbjct: 138  STNGNNSTTTTTSSAAHPGSFQ 159


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 585

 Score = 37.5 bits (87), Expect = 0.10
 Identities = 14/104 (13%), Positives = 31/104 (29%), Gaps = 12/104 (11%)

Query: 2053 ESTTTNNPASESITSSSPA--------SESTTTSSPASESTTTSSP----ASESTTTSSP 2100
            E+     PA +    ++ A        + ST   + A+ +     P    A+       P
Sbjct: 357  EALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP 416

Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
             +     +   +   T ++   +          P  E+ +   D
Sbjct: 417  VAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKALIAD 460


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is a
            family of proteins conserved in fungi. The function is
            not known.
          Length = 436

 Score = 37.1 bits (86), Expect = 0.11
 Identities = 23/104 (22%), Positives = 37/104 (35%), Gaps = 11/104 (10%)

Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITS------SSPASESTTTSSPASESTTTSSPA 2091
            S P  ++ T    K +++  +    E I         S  ++       AS   T  +P 
Sbjct: 7    SLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPK 66

Query: 2092 SESTTTSSPASESTTTS-----SPESESTTTSSPASESTTIEEQ 2130
              S  T S A+ S         S ES  ++     + ST  EE+
Sbjct: 67   KSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEE 110



 Score = 34.0 bits (78), Expect = 1.1
 Identities = 29/147 (19%), Positives = 47/147 (31%), Gaps = 25/147 (17%)

Query: 1958 SSLVSESTTTSSPESESTTTSSPESE-----------------STTTSSLVSESTTTSSP 2000
             SL  E   T  P+    +  S E +                         S   T  +P
Sbjct: 6    DSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNP 65

Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
            +  S  T     ES+  SS    +    S ES    +S P S++ +T + + E       
Sbjct: 66   KKSSKPT-----ESSAASSEEKPAKPRKSAESTR--SSHPKSKAPSTESEEEEEPEETPD 118

Query: 2061 ASESITSS-SPASESTTTSSPASESTT 2086
               SI    S     T+T++  + +  
Sbjct: 119  PIASIGGWWSLWGSITSTATSTASAAV 145



 Score = 31.3 bits (71), Expect = 6.9
 Identities = 19/108 (17%), Positives = 34/108 (31%), Gaps = 6/108 (5%)

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
            S  ++        S   T  +P+  S  T S A+ S        +S       S   +  
Sbjct: 44   SEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAE-----STRSSHP 98

Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
               + ST +     E   T  P +      S     T+T++  + +  
Sbjct: 99   KSKAPSTESEEE-EEPEETPDPIASIGGWWSLWGSITSTATSTASAAV 145


>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal.  Mediator is a
            large complex of up to 33 proteins that is conserved from
            plants to fungi to humans - the number and representation
            of individual subunits varying with species. It is
            arranged into four different sections, a core, a head, a
            tail and a kinase-activity part, and the number of
            subunits within each of these is what varies with
            species. Overall, Mediator regulates the transcriptional
            activity of RNA polymerase II but it would appear that
            each of the four different sections has a slightly
            different function. Mediator subunit Hrs1/Med3 is a
            physical target for Cyc8-Tup1, a yeast transcriptional
            co-repressor.
          Length = 381

 Score = 36.9 bits (85), Expect = 0.11
 Identities = 17/117 (14%), Positives = 42/117 (35%), Gaps = 11/117 (9%)

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
             +       +  + ++ T +     +  S  A+ S+T N P +      N A      S+
Sbjct: 115  TLGTYNQLGNAGASASITKT-----SNGSDAATTSSTANTPAAAKVLKANAA------SA 163

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
            P + +   S+  + + + ++  + +TT   P     T  +  + +    + A     
Sbjct: 164  PNTTTGVGSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQ 220



 Score = 36.5 bits (84), Expect = 0.15
 Identities = 31/206 (15%), Positives = 66/206 (32%), Gaps = 18/206 (8%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNN----PESESTTTSSPESESTTTSSLVSE 1933
            S S    S  +   + ++T N+P +      N    P + +   S+  + + + ++  + 
Sbjct: 128  SASITKTSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTAAISATTATTP 187

Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST-----TTSSPESESTTTS 1988
            +TT   P     T  +  + +    +        S+     +      TS        T 
Sbjct: 188  TTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSMLAQIPNPTP 247

Query: 1989 SL-----VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
             +     VS +   +SP       +SP+       +  +    T S  + +    S  + 
Sbjct: 248  LMQLLNGVSPNNAMASP----LNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQSRENS 303

Query: 2044 STTTNNPKSESTTTNNPASESITSSS 2069
                  P +     NN    +I + S
Sbjct: 304  MNQGMTPSASMINLNNITPANILNMS 329



 Score = 36.1 bits (83), Expect = 0.22
 Identities = 27/172 (15%), Positives = 65/172 (37%), Gaps = 2/172 (1%)

Query: 1887 LNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1946
            L +L + N   N+  S S T  +  S++ TTSS  + +T  ++ V ++   S+P + +  
Sbjct: 113  LETLGTYNQLGNAGASASITKTSNGSDAATTSS--TANTPAAAKVLKANAASAPNTTTGV 170

Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
             S+  + + + ++  + +TT   P     T  +  + +    +        S+     + 
Sbjct: 171  GSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSL 230

Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
             +   +         +     ++  S +   +SP +  +   N        N
Sbjct: 231  GVPQNTSMLAQIPNPTPLMQLLNGVSPNNAMASPLNNMSPMRNLNQMGNQNN 282



 Score = 34.6 bits (79), Expect = 0.68
 Identities = 20/85 (23%), Positives = 39/85 (45%), Gaps = 2/85 (2%)

Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
                  T N   +    +S+  ++++  S  A+ S+T ++PA+     ++ AS   TT+ 
Sbjct: 112  VLETLGTYNQLGNAG--ASASITKTSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTG 169

Query: 2110 PESESTTTSSPASESTTIEEQGVSP 2134
              S +TT +  A+ +TT       P
Sbjct: 170  VGSAATTAAISATTATTPTTTQKKP 194



 Score = 32.7 bits (74), Expect = 2.6
 Identities = 23/191 (12%), Positives = 63/191 (32%), Gaps = 12/191 (6%)

Query: 1894 NTTTNSPESESTTTNNPESESTTT----SSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
             +  +   + S+T N P +         S+P + +   S+  + + + ++  + +TT   
Sbjct: 134  TSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTAAISATTATTPTTTQKK 193

Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
            P     T  +  + +    +        S+     +      +         +     ++
Sbjct: 194  PRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSMLAQIPNPTPLMQLLN 253

Query: 2010 PVSESTTTSSPV-----SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
             VS +   +SP+       +   +  ++     +  A+     N  +  S       S S
Sbjct: 254  GVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQSRENSMNQGMTPSAS 313

Query: 2065 ITSS---SPAS 2072
            + +    +PA+
Sbjct: 314  MINLNNITPAN 324



 Score = 31.9 bits (72), Expect = 4.0
 Identities = 24/134 (17%), Positives = 49/134 (36%), Gaps = 5/134 (3%)

Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST-TTNNP 2060
            S S T  S  S++ TTSS    +  T +        ++ A  +TT     + +   +   
Sbjct: 128  SASITKTSNGSDAATTSS----TANTPAAAKVLKANAASAPNTTTGVGSAATTAAISATT 183

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
            A+   T+     +   T      +   +  ++++   +S  ++  +   P++ S     P
Sbjct: 184  ATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSMLAQIP 243

Query: 2121 ASESTTIEEQGVSP 2134
                      GVSP
Sbjct: 244  NPTPLMQLLNGVSP 257


>gnl|CDD|178666 PLN03119, PLN03119, putative ADP-ribosylation factor
            GTPase-activating protein AGD14; Provisional.
          Length = 648

 Score = 37.5 bits (86), Expect = 0.11
 Identities = 51/197 (25%), Positives = 78/197 (39%), Gaps = 30/197 (15%)

Query: 1966 TTSSPESESTTTSSPESESTTTSSL---VSES-TTTSSPESESTTTISPVSESTTT---- 2017
            TTSS    S  ++    +S T+  L   VSES   T S + +++  +  V+EST      
Sbjct: 269  TTSSGSVRSVDSNFMSIKSYTSGGLGEAVSESRQNTGSQQGKTSNHVPLVAESTKAPIDL 328

Query: 2018 ----SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS--SSPA 2071
                 +PV++S  T  P        S A  S   N  ++  T +  PA+    +    P 
Sbjct: 329  FQLPGAPVAQSVDTFQP--------SIAPRSPPVNLQQAPQTYSFTPANSFAGNLGQQPT 380

Query: 2072 SESTTTSSPASE---STTTSSPASESTTT-SSPASESTTTSSPESESTTTSSPASESTTI 2127
            S  +  S+P +E   S     PA++ST   +SP          E    +TS       + 
Sbjct: 381  SRPSELSAPKNEGWASFDNPMPAAKSTNVITSPGDFQLELKIEEILQPSTSMQLPPYPST 440

Query: 2128 EEQGV----SPHSEKLS 2140
             +Q      SP  E LS
Sbjct: 441  VDQHALSIPSPWQEDLS 457


>gnl|CDD|185274 PRK15376, PRK15376, pathogenicity island 1 effector protein SipA;
            Provisional.
          Length = 670

 Score = 37.3 bits (86), Expect = 0.13
 Identities = 32/155 (20%), Positives = 55/155 (35%), Gaps = 9/155 (5%)

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
            ES  +T SS  S S +  +  +  T T + AS        A +   T+   +E+ T +S 
Sbjct: 336  ESHHSTNSSNVSHSHSRVDSTTHQTETAHSASTGAIDHGIAGKIDVTAHATAEAVTNASS 395

Query: 2091 ASESTT--TSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE----- 2143
             S+     TS   +   TTS  E +  T+ S   +       GV  + ++    E     
Sbjct: 396  ESKDGKVVTSEKGTTGETTSFDEVDGVTSKSIIGKPVQATVHGVDDNKQQSQTAEIVNVK 455

Query: 2144 --DPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPE 2176
                +    E+V   T      +   N+      +
Sbjct: 456  PLASQLAGVENVKTDTLQSDTTVITGNKAGTTDND 490


>gnl|CDD|179334 PRK01770, PRK01770, sec-independent translocase; Provisional.
          Length = 171

 Score = 35.6 bits (82), Expect = 0.13
 Identities = 27/101 (26%), Positives = 43/101 (42%), Gaps = 8/101 (7%)

Query: 2024 STTTISPE-SESTTTSSPASESTTT----NNPKSESTTTNNPASESITSSSPASESTTTS 2078
            S T +SPE   S      A+ES       N+P+  S   +   +  +  +  A E  T  
Sbjct: 70   SLTNLSPELKASVDELKQAAESMKRSYAANDPEKASDEAHTIHNPVVKDNEAAHEGVT-- 127

Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
             PA+  T  SSP  +  TT  P  +    + P++ + + SS
Sbjct: 128  -PAAAQTQASSPEQKPETTPEPVVKPAADAEPKTAAPSPSS 167



 Score = 31.7 bits (72), Expect = 2.5
 Identities = 20/86 (23%), Positives = 35/86 (40%), Gaps = 3/86 (3%)

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTS-SPASESTTTSSPESESTTTSS 2119
            A+ES+  S  A++    S  A          +E+     +PA+  T  SSPE +  TT  
Sbjct: 88   AAESMKRSYAANDPEKASDEAHTIHNPVVKDNEAAHEGVTPAAAQTQASSPEQKPETTPE 147

Query: 2120 PASESTTIEEQGVSPHSEKLSANEDP 2145
            P  +     +      +   S+++ P
Sbjct: 148  PVVKPA--ADAEPKTAAPSPSSSDKP 171


>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 702

 Score = 37.3 bits (86), Expect = 0.13
 Identities = 32/190 (16%), Positives = 54/190 (28%), Gaps = 7/190 (3%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
               ++    N ++++    +P S       +   +     PE E      PE E      
Sbjct: 374  QNGQAEVGLNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEPEPEPEPE 433

Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
               E      P  +         E     S V + T +   E        PV E      
Sbjct: 434  PEPEPEPEPQPNQDLMVFDPNHHELIGLESAVVQETVSVLEED-----FIPVPEQKLVQV 488

Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP--ASESTTT 2077
                    I PE  ST       E+++     ++ T+  +  SE +        +E   T
Sbjct: 489  QAETQVKQIEPEPASTAEPIGLFEASSAEFSLAQDTSAYDLVSEPVIEQQSLVQAEIVET 548

Query: 2078 SSPASESTTT 2087
             +   E   T
Sbjct: 549  VAVVKEPNAT 558



 Score = 33.1 bits (75), Expect = 2.1
 Identities = 29/152 (19%), Positives = 44/152 (28%), Gaps = 14/152 (9%)

Query: 1990 LVSESTTTS-------SPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
            LVSE    +       + ++++   I+PVS           +     PE E      P  
Sbjct: 367  LVSEPVQQNGQAEVGLNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEP 426

Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPAS------ESTTTSSPASESTTTSSPASESTT 2096
            E      P+ E      P ++ +    P        ES       S       P  E   
Sbjct: 427  EPEPEPEPEPEPEPEPQP-NQDLMVFDPNHHELIGLESAVVQETVSVLEEDFIPVPEQKL 485

Query: 2097 TSSPASESTTTSSPESESTTTSSPASESTTIE 2128
                A        PE  ST       E+++ E
Sbjct: 486  VQVQAETQVKQIEPEPASTAEPIGLFEASSAE 517



 Score = 33.1 bits (75), Expect = 2.5
 Identities = 19/110 (17%), Positives = 35/110 (31%), Gaps = 12/110 (10%)

Query: 2067 SSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
                  ++    +  +++    +P S        +  +     PE E      P  E   
Sbjct: 371  PVQQNGQAEVGLNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEPEPEP 430

Query: 2127 IEEQGVSPHSEKLSANEDPEEFPNED--VFEHTFAEIPNIDHSNQTDEAI 2174
              E    P         +PE  PN+D  VF+    E+  ++ S    E +
Sbjct: 431  EPEPEPEP---------EPEPQPNQDLMVFDPNHHELIGLE-SAVVQETV 470


>gnl|CDD|234351 TIGR03773, anch_rpt_wall, putative ABC transporter-associated repeat
            protein.  Members of this protein family occur in genomes
            that contain a three-gene ABC transporter operon
            associated with the presence of domain TIGR03769. That
            domain occurs as a single-copy insert in the
            substrate-binding protein, and occurs in two or more
            copies in members of this protein family. Members of this
            family typically are encoded adjacent to the said
            transporter operon and may serve as a substrate receptor.
          Length = 513

 Score = 36.9 bits (85), Expect = 0.14
 Identities = 18/127 (14%), Positives = 36/127 (28%), Gaps = 9/127 (7%)

Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESES 2034
            T T++ +       S     T T          I P   +T    P +++     P ++ 
Sbjct: 142  TVTATADLADGGAKS--KPETYTVVVGKVEVDKIDPARCATGAGKPQNDA---NGPAADK 196

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
                 PAS      +  + S           +   PA          +     ++P++ S
Sbjct: 197  PLFDDPASGVQALGDESAFSPGQQATVQIGKSVRLPAD----APLGVAAVVVKAAPSTGS 252

Query: 2095 TTTSSPA 2101
            +      
Sbjct: 253  SDAEGGL 259



 Score = 36.9 bits (85), Expect = 0.17
 Identities = 21/123 (17%), Positives = 36/123 (29%), Gaps = 11/123 (8%)

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            T T +         S     T T+           PA  +T    P++++   N PA++ 
Sbjct: 142  TVTATADLADGGAKS--KPETYTVVVGKVEVDKIDPARCATGAGKPQNDA---NGPAADK 196

Query: 2065 ITSSSPASESTTTSSPASES----TTTSSPASESTTTSSPASESTTTSS--PESESTTTS 2118
                 PAS        ++ S     T     S      +P   +       P + S+   
Sbjct: 197  PLFDDPASGVQALGDESAFSPGQQATVQIGKSVRLPADAPLGVAAVVVKAAPSTGSSDAE 256

Query: 2119 SPA 2121
               
Sbjct: 257  GGL 259



 Score = 32.2 bits (73), Expect = 3.5
 Identities = 19/118 (16%), Positives = 35/118 (29%), Gaps = 11/118 (9%)

Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
            T T++         S     T T           +P   +T    P +++     PA++ 
Sbjct: 142  TVTATADLADGGAKS--KPETYTVVVGKVEVDKIDPARCATGAGKPQNDAN---GPAADK 196

Query: 2075 TTTSSPASESTTTSSPASES----TTTSSPASESTTTSSPESESTTTSS--PASESTT 2126
                 PAS        ++ S     T     S      +P   +       P++ S+ 
Sbjct: 197  PLFDDPASGVQALGDESAFSPGQQATVQIGKSVRLPADAPLGVAAVVVKAAPSTGSSD 254


>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
          Length = 1004

 Score = 37.3 bits (86), Expect = 0.14
 Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)

Query: 2357 HSVKIIGWGKSSQNE----PYWLCTNSYNQGWGEQGLFKI 2392
            H+V I+G+G    +E     YW+  NS+ + WG++G FK+
Sbjct: 723  HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKV 762



 Score = 32.3 bits (73), Expect = 4.5
 Identities = 13/83 (15%), Positives = 27/83 (32%), Gaps = 5/83 (6%)

Query: 2053 ESTTTNNPASESITSSSPASESTTTSSP-----ASESTTTSSPASESTTTSSPASESTTT 2107
            E     N        +   +      SP     A+   +  +   ES+  + P  +    
Sbjct: 26   EDDDNGNIGGGQAGGTGGDNAGNIDGSPIGNLDANIHASFGADPKESSGANLPGKKEKKK 85

Query: 2108 SSPESESTTTSSPASESTTIEEQ 2130
                     ++S +  S++IE+Q
Sbjct: 86   KEIRGHDIMSNSDSQNSSSIEKQ 108


>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
            UL32.  The large phosphorylated protein (UL32-like) of
            herpes viruses is the polypeptide most frequently
            reactive in immuno-blotting analyses with antisera when
            compared with other viral proteins.
          Length = 777

 Score = 37.2 bits (86), Expect = 0.14
 Identities = 32/241 (13%), Positives = 67/241 (27%), Gaps = 16/241 (6%)

Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1976
              S E +     S   E        S  T     +   ++  S   ES       S    
Sbjct: 272  EDSLEYDDPGLES-TDEDDDDDGDSSLQTFKPLLDLTGSSLWSDDEESGDEDGDGSGFAP 330

Query: 1977 TSSPESESTTTSSLV-----SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
                +++S +  +LV       S    S ++  T++       + + +   E    ++ E
Sbjct: 331  EPLIKTDSRSNDTLVDLGRGGGSLKLDSVDAPGTSSYLFEPGLSPSPNSGKEMPGILTTE 390

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
            +     +S  S      + +  +   NN    +    +P           ++  + +  +
Sbjct: 391  NLDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKTPGLAV-------NDERSIAVSS 443

Query: 2092 SESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNE 2151
               T    P S     SS +    + S P+    +     +              E    
Sbjct: 444  DGITDVLDPPSPLRLHSSDKV-IDSVSPPSKRRVSAPASRLDDAKRP--EVTATPESSGS 500

Query: 2152 D 2152
            D
Sbjct: 501  D 501


>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
          Length = 796

 Score = 37.1 bits (86), Expect = 0.14
 Identities = 22/95 (23%), Positives = 40/95 (42%), Gaps = 15/95 (15%)

Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
              S ST  S+ AS S +++ P+  S              S  S  + T SP + S   S 
Sbjct: 1    GRSSSTMVSAAASTSLSSSRPQLSS-------------FSSRSPQSATRSPRASSIKCS- 46

Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
             AS S ++S+ +S ++  ++      + S+ +   
Sbjct: 47   -ASASASSSATSSSASLVANGAVALLSASAISGGG 80



 Score = 36.4 bits (84), Expect = 0.24
 Identities = 15/81 (18%), Positives = 41/81 (50%), Gaps = 2/81 (2%)

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
             ++++   + ++T++S      ++ S  S  + T +P++ S   +  AS S +SS+ +S 
Sbjct: 2    RSSSTMVSAAASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCS--ASASASSSATSSS 59

Query: 2074 STTTSSPASESTTTSSPASES 2094
            ++  ++ A    + S+ +   
Sbjct: 60   ASLVANGAVALLSASAISGGG 80



 Score = 35.6 bits (82), Expect = 0.42
 Identities = 14/81 (17%), Positives = 36/81 (44%), Gaps = 2/81 (2%)

Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
             ++++   + ++T+ S      ++ S  S  + T SP + S   S   S S ++S+  S 
Sbjct: 2    RSSSTMVSAAASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCS--ASASASSSATSSS 59

Query: 1974 STTTSSPESESTTTSSLVSES 1994
            ++  ++      + S++    
Sbjct: 60   ASLVANGAVALLSASAISGGG 80



 Score = 35.6 bits (82), Expect = 0.45
 Identities = 14/81 (17%), Positives = 34/81 (41%), Gaps = 2/81 (2%)

Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
             ++++   + ++T+ S      ++ S  S  + T SP + S   S   S S ++S+  S 
Sbjct: 2    RSSSTMVSAAASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCS--ASASASSSATSSS 59

Query: 2004 STTTISPVSESTTTSSPVSES 2024
            ++   +      + S+     
Sbjct: 60   ASLVANGAVALLSASAISGGG 80



 Score = 34.4 bits (79), Expect = 0.90
 Identities = 24/80 (30%), Positives = 33/80 (41%), Gaps = 8/80 (10%)

Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
              S ST  S+ AS S ++S P   S    S  S  + T SP + S   S+ AS S++   
Sbjct: 1    GRSSSTMVSAAASTSLSSSRPQLSS---FSSRSPQSATRSPRASSIKCSASASASSS--- 54

Query: 2130 QGVSPHSEKLSANEDPEEFP 2149
               +  S  L AN       
Sbjct: 55   --ATSSSASLVANGAVALLS 72



 Score = 31.0 bits (70), Expect = 9.7
 Identities = 16/81 (19%), Positives = 39/81 (48%), Gaps = 2/81 (2%)

Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
             ++++   + ++T++S      ++ S  S  + T SP + S   S  AS S +++   S 
Sbjct: 2    RSSSTMVSAAASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCS--ASASASSSATSSS 59

Query: 2054 STTTNNPASESITSSSPASES 2074
            ++   N A   +++S+ +   
Sbjct: 60   ASLVANGAVALLSASAISGGG 80


>gnl|CDD|227416 COG5084, YTH1, Cleavage and polyadenylation specificity factor (CPSF)
            Clipper subunit and related makorin family Zn-finger
            proteins [General function prediction only].
          Length = 285

 Score = 36.4 bits (84), Expect = 0.14
 Identities = 32/175 (18%), Positives = 50/175 (28%), Gaps = 10/175 (5%)

Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS----SPESESTTTSS 1929
            T NN  + V+ S++           S  S           S        S +   ++  S
Sbjct: 92   TPNNHVNPVLSSSVVCKFFLRGLCKSGFSCEFLHEYDLRSSQGPPCRSFSLKGSCSSGPS 151

Query: 1930 LVS--ESTTTSSPESE----STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
                     + +   +    +T    P   S   S  +   +  SSP    T   SP   
Sbjct: 152  CGYSHIDPDSFAGNCDQYSGATYGFCPLGASCKFSHTLKRVSYGSSPCGNYTPPFSPPGT 211

Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
             + + S       TS   S  +  I      T  S   S  T  I   SE  + +
Sbjct: 212  PSESVSSWGYGKGTSCSLSHPSLNIDIQQPQTAPSRKDSGGTNPIGASSEIGSEA 266



 Score = 35.6 bits (82), Expect = 0.25
 Identities = 50/296 (16%), Positives = 78/296 (26%), Gaps = 39/296 (13%)

Query: 1862 SVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPE 1921
                 Y+   +T+ N      V S             SP   S T  N  +    T+   
Sbjct: 16   GSGCTYNHSNYTSLN-DGLQSVSSKYMGA-----KQISPSLSSPTFKNKANLMQNTNDN- 68

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPE---SESTTTSSLVSESTTTSSPESESTTTS 1978
                 T + +S +   S   S  +T ++       S+            S    E     
Sbjct: 69   FVPGNTVACISRNFN-SIRGSRLSTPNNHVNPVLSSSVVCKFFLRGLCKSGFSCEFLHEY 127

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
               S         S   + SS  S   + I P      + +   +        S +T   
Sbjct: 128  DLRSSQGPPCRSFSLKGSCSSGPSCGYSHIDP-----DSFAGNCDQ------YSGATYGF 176

Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
             P            + + T    S     SSP    T   SP    + + S       TS
Sbjct: 177  CP-------LGASCKFSHTLKRVS---YGSSPCGNYTPPFSPPGTPSESVSSWGYGKGTS 226

Query: 2099 SPASESTTTSSPESESTTTSS-------PASESTTIEEQGVSPHSEKLSANEDPEE 2147
               S  +     +   T  S        P   S+ I  +        +S + D EE
Sbjct: 227  CSLSHPSLNIDIQQPQTAPSRKDSGGTNPIGASSEIGSEADGNMQNSISGSGDSEE 282


>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
           This is a family of eukaryotic translation initiation
           factors.
          Length = 194

 Score = 35.7 bits (83), Expect = 0.15
 Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 11/74 (14%)

Query: 494 SVAFNDTAECVLTGGIDN---DIKMWDLRTNSVVQKLRGHSDTVTGLSLSPDGSYILSNA 550
           ++ ++     VL  G  N    I+ WD++    +      S+  T    SPDG Y L+  
Sbjct: 105 TIFWSPFGRLVLLAGFGNLAGQIEFWDVKNKKKIATAE-ASNA-TDCEWSPDGRYFLTAT 162

Query: 551 ------MDNTVRIW 558
                 +DN  +IW
Sbjct: 163 TSPRLRVDNGFKIW 176


>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827).  This
            family contains the human KIAA1549 protein which has been
            found to be fused fused to BRAF gene in many cases of
            pilocytic astrocytomas. The fusion is due mainly to a
            tandem duplication of 2 Mb at 7q34. Although nothing is
            known about the function of KIAA1549 protein, the BRAF
            protein is a well characterized oncoprotein. It is a
            serine/threonine protein kinase which is implicated in
            MAP/ERK signalling, a critical pathway for the regulation
            of cell division, differentiation and secretion.
          Length = 684

 Score = 36.8 bits (85), Expect = 0.15
 Identities = 30/178 (16%), Positives = 53/178 (29%), Gaps = 11/178 (6%)

Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
              + +      SL  E+    +P+S+S+   +   +        S+  +  S  S  +  
Sbjct: 343  EPAPLPPLKKESLPIEDAEVPTPKSKSSQDGSSNKKRRRGRKSPSDGDSEGS--SVISNR 400

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPES--ESTTTSSPESESTTTSSLVSEST 1995
            SS E +S   S+  S +        E     +P S  +   +S+   E     S  S   
Sbjct: 401  SSRE-KSGRPSTTPSVTAQQKPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDP 459

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
               S        + P        +P        S +  +        E       KSE
Sbjct: 460  YDRSSGKIQLIAMQP------MPAPPVPPRFEPSRDDRAAENGKVNKEIQVALRHKSE 511



 Score = 33.8 bits (77), Expect = 1.4
 Identities = 31/154 (20%), Positives = 48/154 (31%), Gaps = 27/154 (17%)

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
             E     +P+S+S         S   SS +       SP    +  SS +S      S  
Sbjct: 357  IEDAEVPTPKSKS---------SQDGSSNKKRRRGRKSPSDGDSEGSSVISNR----SSR 403

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPAS--ESITSSSPASESTT-TSSPASESTTTS 2088
             +S   S+  S +      K E      P S  +   SS+   E     S P+S+    S
Sbjct: 404  EKSGRPSTTPSVTAQQKPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDPYDRS 463

Query: 2089 S-----------PASESTTTSSPASESTTTSSPE 2111
            S           PA        P+ +     + +
Sbjct: 464  SGKIQLIAMQPMPAPPVPPRFEPSRDDRAAENGK 497



 Score = 32.2 bits (73), Expect = 4.3
 Identities = 32/145 (22%), Positives = 49/145 (33%), Gaps = 17/145 (11%)

Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES--TT 1986
            SL  E     +P+S+S+   S  S         S S   S   S  +  SS E     +T
Sbjct: 354  SLPIEDAEVPTPKSKSSQDGS--SNKKRRRGRKSPSDGDSEGSSVISNRSSREKSGRPST 411

Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS-PESESTTTSS------ 2039
            T S+ ++   T     +     S   E  +++S + E    +S P S+    SS      
Sbjct: 412  TPSVTAQQKPTKEEGRKKPAPPSGTDEQLSSAS-IFEHVDRLSRPSSDPYDRSSGKIQLI 470

Query: 2040 -----PASESTTTNNPKSESTTTNN 2059
                 PA        P  +     N
Sbjct: 471  AMQPMPAPPVPPRFEPSRDDRAAEN 495


>gnl|CDD|224346 COG1429, CobN, Cobalamin biosynthesis protein CobN and related
            Mg-chelatases [Coenzyme metabolism].
          Length = 1388

 Score = 37.0 bits (86), Expect = 0.15
 Identities = 14/77 (18%), Positives = 25/77 (32%), Gaps = 5/77 (6%)

Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE----STTTSSPESE 1953
                + +T        +T  S+  S S+ T +     +   S         + T + E  
Sbjct: 1291 AFAPASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSLGAAGSVTGAGEGY 1350

Query: 1954 STTTSSLVSESTTTSSP 1970
              T  + VS S +T   
Sbjct: 1351 EMTKEA-VSGSESTGMS 1366



 Score = 36.6 bits (85), Expect = 0.24
 Identities = 20/83 (24%), Positives = 32/83 (38%), Gaps = 6/83 (7%)

Query: 1923 ESTTTSSLVSESTTTSSPESESTT-TSSPESESTTTSSLVSESTTTSSPESE----STTT 1977
             +T  ++    S T  +PES  TT  S+  S S+ T +     +   S         + T
Sbjct: 1285 AATRYAAFAPASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSLGAAGSVT 1344

Query: 1978 SSPESESTTTSSLVSESTTTSSP 2000
             + E    T  + VS S +T   
Sbjct: 1345 GAGEGYEMTKEA-VSGSESTGMS 1366



 Score = 35.8 bits (83), Expect = 0.35
 Identities = 18/91 (19%), Positives = 32/91 (35%), Gaps = 10/91 (10%)

Query: 1953 ESTTTSSLVSESTTTSSPESESTT-TSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
             +T  ++    S T  +PES  TT  S+  S S+ T +     +   S            
Sbjct: 1285 AATRYAAFAPASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSL------- 1337

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPAS 2042
                  S   +     ++ E+ S + S+  S
Sbjct: 1338 --GAAGSVTGAGEGYEMTKEAVSGSESTGMS 1366



 Score = 34.3 bits (79), Expect = 1.0
 Identities = 13/85 (15%), Positives = 32/85 (37%), Gaps = 5/85 (5%)

Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
               + +  ++   + +T  +  +  +T  S+ +S S+ T + +   +   S         
Sbjct: 1282 ATYAATRYAAFAPASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSL---G 1338

Query: 2119 SPASESTTIEEQGVSPHSEKLSANE 2143
            +  S +   E  G     E +S +E
Sbjct: 1339 AAGSVTGAGE--GYEMTKEAVSGSE 1361


>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
          Length = 2033

 Score = 37.1 bits (85), Expect = 0.16
 Identities = 37/152 (24%), Positives = 63/152 (41%), Gaps = 14/152 (9%)

Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS--- 2038
            S S+++SS  S S+ +S P   +T ++SP S S    +PV  S +    E +  + +   
Sbjct: 1817 SSSSSSSSSSSSSSPSSRPSRSATPSLSP-SPSPPRRAPVDRSRSGRRRERDRPSANPFR 1875

Query: 2039 -SPASESTTTNNPKSES------TTTNNPA-SESITSSSPASESTTTSSP--ASESTTTS 2088
             +P   S   ++P   +         + P     I + S A+   + S P  + + T T 
Sbjct: 1876 WAPRQRSRADHSPDGTAPGDAPLNLEDGPGRGRPIWTPSSATTLPSRSGPEDSVDETETE 1935

Query: 2089 SPASESTTTSSPASESTTTSSPESESTTTSSP 2120
              A  +    SP   S    S +SE    S+P
Sbjct: 1936 DSAPPARLAPSPLETSRAEDSEDSEYPEYSNP 1967



 Score = 36.8 bits (84), Expect = 0.22
 Identities = 36/186 (19%), Positives = 67/186 (36%), Gaps = 7/186 (3%)

Query: 1969 SPES-----ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
            SPE      +S   S P    +  ++       ++   S S+++ S  S S+ +S P   
Sbjct: 1779 SPERVLGRRQSRRDSVPVRRRSGAANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRS 1838

Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
            +T ++SP S S    +P   S +    +     + NP   +    S A  S   ++P   
Sbjct: 1839 ATPSLSP-SPSPPRRAPVDRSRSGRR-RERDRPSANPFRWAPRQRSRADHSPDGTAPGDA 1896

Query: 2084 STTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
                           +P+S +T  S    E +   +   +S        SP     + + 
Sbjct: 1897 PLNLEDGPGRGRPIWTPSSATTLPSRSGPEDSVDETETEDSAPPARLAPSPLETSRAEDS 1956

Query: 2144 DPEEFP 2149
            +  E+P
Sbjct: 1957 EDSEYP 1962



 Score = 35.6 bits (81), Expect = 0.46
 Identities = 46/199 (23%), Positives = 72/199 (36%), Gaps = 20/199 (10%)

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
            S+  S S+++SS  S S++ SS  S S T S   S S    +P   S +      +  + 
Sbjct: 1812 SAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRSRSGRRRERDRPSA 1871

Query: 1998 S----SPESESTTTISPVSESTTTSSPVS------ESTTTISPESESTTTSSPASESTTT 2047
            +    +P   S    SP   +    +P++            +P S +T  S    E +  
Sbjct: 1872 NPFRWAPRQRSRADHSP-DGTAPGDAPLNLEDGPGRGRPIWTPSSATTLPSRSGPEDSV- 1929

Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
                 + T T + A  +  + SP   S    S  SE    S+P       S PA +S   
Sbjct: 1930 -----DETETEDSAPPARLAPSPLETSRAEDSEDSEYPEYSNP---RLGKSPPALKSREA 1981

Query: 2108 SSPESESTTTSSPASESTT 2126
              P S+     S      T
Sbjct: 1982 RRPSSKQPRRPSSGKNGHT 2000



 Score = 34.1 bits (77), Expect = 1.3
 Identities = 45/197 (22%), Positives = 73/197 (37%), Gaps = 23/197 (11%)

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS------------- 1968
            S S+++SS  S S+ +S P   +T + SP S S    + V  S +               
Sbjct: 1817 SSSSSSSSSSSSSSPSSRPSRSATPSLSP-SPSPPRRAPVDRSRSGRRRERDRPSANPFR 1875

Query: 1969 -SPESESTTTSSPESESTTTSSL-VSESTTTSSP-ESESTTTISPVSESTTTSSPVSEST 2025
             +P   S    SP+  +   + L + +      P  + S+ T  P       S   +E+ 
Sbjct: 1876 WAPRQRSRADHSPDGTAPGDAPLNLEDGPGRGRPIWTPSSATTLPSRSGPEDSVDETETE 1935

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
             +  P   +    SP   S   ++  SE    +NP    +  S PA +S     P+S+  
Sbjct: 1936 DSAPP---ARLAPSPLETSRAEDSEDSEYPEYSNP---RLGKSPPALKSREARRPSSKQP 1989

Query: 2086 TTSSPASESTTTSSPAS 2102
               S      T  S AS
Sbjct: 1990 RRPSSGKNGHTDVSAAS 2006


>gnl|CDD|227928 COG5641, GAT1, GATA Zn-finger-containing transcription factor
            [Transcription].
          Length = 498

 Score = 36.8 bits (85), Expect = 0.16
 Identities = 54/265 (20%), Positives = 93/265 (35%), Gaps = 26/265 (9%)

Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTS--SPES 1942
             +L S   ++ ++ S    S   N+   E+  T   ES   +++S +++S        + 
Sbjct: 204  ISLKSDSIKSRSSRS----SHNNNDSNGENANT---ESIGNSSASKLTKSWEERPQGRQL 256

Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESEST---TTSSPESESTTTSSLVSESTTTSS 1999
             S   S     +   S L+     ++S +  ST      S +  ST T+S  +     +S
Sbjct: 257  LSDAGSLSPRSNNPKSPLLEGLMGSTSLQPVSTPKLVLPSDKKRSTLTTSTATPLWRRTS 316

Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
             +S  +   S  +     S           P       +S  S +T  N     ST TN 
Sbjct: 317  DKSSFSCNASGSALKPPGS---------KRPLLPKPDPNSKRSNATCMNC---SSTPTNK 364

Query: 2060 PASESITSSSPASESTTTSSPASESTT-TSSPASESTTTSSPASESTTTSSPESESTTTS 2118
              S   TS+SP ++    +   S   T          +   PA  S+   +P  +  +  
Sbjct: 365  ILSPPTTSNSPGAQVKLPNQTRSTGATKKKITRRRMNSGKIPALSSSMK-NPVPKEFSPL 423

Query: 2119 SPASESTTIEEQGVSPHSEKLSANE 2143
             P S  +    Q  S  + KL   E
Sbjct: 424  IPQSTESETPSQSKSSLTSKLEEFE 448


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 36.5 bits (84), Expect = 0.20
 Identities = 33/139 (23%), Positives = 55/139 (39%), Gaps = 15/139 (10%)

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
             +E T    P  E  + +     ++  S+P         PA+   T  + AS  TT +  
Sbjct: 167  PAEGTLAAPPLGE-GSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPS 225

Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHS----EKLSANEDPEEFPNEDVFEH 2156
             + S  +++  + STT ++P + +T   E   +P +    E   AN  P   P    +E 
Sbjct: 226  TTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPA--PEASRYEL 283

Query: 2157 TFAEIPNIDHSNQTDEAIP 2175
            T  +I  I        AIP
Sbjct: 284  TVTQIIQI--------AIP 294



 Score = 34.5 bits (79), Expect = 0.80
 Identities = 28/139 (20%), Positives = 48/139 (34%), Gaps = 7/139 (5%)

Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
            E  T +SL          E    T  +P     +       +    +P         PA+
Sbjct: 151  EGATNASLFPLGLAAFPAEG---TLAAPPLGEGSADGSCDPALPLSAPRLGPADVFVPAT 207

Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS----ESTTTSSPASESTTTS 2098
               T     S  TT     + S  S++  + STT ++P +    E+  T +P +     +
Sbjct: 208  PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEA 267

Query: 2099 SPASESTTTSSPESESTTT 2117
             PA+ +    +   E T T
Sbjct: 268  PPANATPAPEASRYELTVT 286



 Score = 31.5 bits (71), Expect = 6.6
 Identities = 20/97 (20%), Positives = 35/97 (36%), Gaps = 4/97 (4%)

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
              ++P     TT SP   + T S+  S  +TTI   S +       +       P     
Sbjct: 204  VPATPRPTPRTTASP-ETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPA---P 259

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
             T         +++PA E++      ++    + PAS
Sbjct: 260  PTPGGGEAPPANATPAPEASRYELTVTQIIQIAIPAS 296


>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
          Length = 448

 Score = 36.2 bits (84), Expect = 0.21
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 8/60 (13%)

Query: 2346 SCEGSINPRYIHSVKIIGWGKSSQ-NEPYWLCTNSYNQGWGEQGLFKIRR---GVNMCSI 2401
             C  S+N    H+V ++G G   +  + YW+  NS+   WGE G  ++ R   G + C I
Sbjct: 381  ECGKSLN----HAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGI 436


>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
          Length = 1467

 Score = 36.7 bits (84), Expect = 0.22
 Identities = 37/202 (18%), Positives = 77/202 (38%), Gaps = 17/202 (8%)

Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSLVSES---TTTSSPESESTTT------SSPE 1981
            VS    +  P   +  + S    ++  SS V+     +  + P   + T+      + P+
Sbjct: 1058 VSYFAASQGPSPFTFVSPSYIFLNSWASSYVAPGFLGSPYALPYFMNQTSALVGNTALPK 1117

Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPV---SESTTTSSPVSESTTTISPESESTTTS 2038
              +  +  +    T  S+    ++T  SPV     +   S   +  +   S  +++   S
Sbjct: 1118 GLNVFSGYMFGAGTVASAFLYMNSTPQSPVLALLLAPYISYKFNALSLGFSITADAAIFS 1177

Query: 2039 S---PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS-SPASES 2094
                PA +  ++  P + S    +P    I         T T +  +     S + +  +
Sbjct: 1178 LFGIPAPQLLSSYIP-TGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPA 1236

Query: 2095 TTTSSPASESTTTSSPESESTT 2116
             TT +P   S+++SS +S ST+
Sbjct: 1237 ATTPTPPPSSSSSSSAQSISTS 1258



 Score = 34.3 bits (78), Expect = 0.99
 Identities = 36/186 (19%), Positives = 64/186 (34%), Gaps = 21/186 (11%)

Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
              TS+LV  +         S          T  S+ +  ++T  SP        +P   S
Sbjct: 1104 NQTSALVGNTALPKGLNVFSGYMFGA---GTVASAFLYMNSTPQSPVL--ALLLAPYI-S 1157

Query: 1985 TTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS-- 2042
               ++L    + T+     S   I P  +  ++  P    T ++  +    T   P    
Sbjct: 1158 YKFNALSLGFSITADAAIFSLFGI-PAPQLLSSYIP----TGSVLYQDPIFTYIPPGIIG 1212

Query: 2043 -ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
               T T   K+     +        +SSP + +T T  P+S S++++   S S       
Sbjct: 1213 MSGTNTFTFKAAQLQLSA-------ASSPPAATTPTPPPSSSSSSSAQSISTSPGQIQIV 1265

Query: 2102 SESTTT 2107
               +TT
Sbjct: 1266 LNGSTT 1271



 Score = 34.3 bits (78), Expect = 1.1
 Identities = 20/92 (21%), Positives = 34/92 (36%), Gaps = 3/92 (3%)

Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1976
                 S   T S L  +   T  P           + + T  +   + +  SSP + +T 
Sbjct: 1184 PQLLSSYIPTGSVLYQDPIFTYIP---PGIIGMSGTNTFTFKAAQLQLSAASSPPAATTP 1240

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
            T  P S S++++  +S S          +TTI
Sbjct: 1241 TPPPSSSSSSSAQSISTSPGQIQIVLNGSTTI 1272



 Score = 32.4 bits (73), Expect = 3.6
 Identities = 24/141 (17%), Positives = 47/141 (33%), Gaps = 2/141 (1%)

Query: 1888 NSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1947
            ++ L  N+T  SP         P       +     S T  + +       +P+  S+  
Sbjct: 1134 SAFLYMNSTPQSPVLALLLA--PYISYKFNALSLGFSITADAAIFSLFGIPAPQLLSSYI 1191

Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
             +                      + + T  + + + +  SS  + +T T  P S S+++
Sbjct: 1192 PTGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPAATTPTPPPSSSSSSS 1251

Query: 2008 ISPVSESTTTSSPVSESTTTI 2028
               +S S      V   +TTI
Sbjct: 1252 AQSISTSPGQIQIVLNGSTTI 1272



 Score = 32.4 bits (73), Expect = 3.8
 Identities = 40/211 (18%), Positives = 71/211 (33%), Gaps = 36/211 (17%)

Query: 1838 LLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTT 1897
            + S+       LL S +   +V       Y + IFT        + MS  N+   +    
Sbjct: 1175 IFSLFGIPAPQLLSSYIPTGSVL------YQDPIFTYI--PPGIIGMSGTNTFTFKAAQL 1226

Query: 1898 NSPESESTTTNNPESESTTTSSPESESTT-TSSLVSESTTTSSPESESTTT--------- 1947
                S +++       +TT + P S S++ ++  +S S          +TT         
Sbjct: 1227 QL--SAASSP----PAATTPTPPPSSSSSSSAQSISTSPGQIQIVLNGSTTIHINFLFFP 1280

Query: 1948 --SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
              S+P+        +V+ S    S        S+  +    T   V  + T     ++  
Sbjct: 1281 ALSTPKIGQILAMPIVNSSGAFIS----LYVNSAISANFNVTIEYVFSNGTVIKRFTDEP 1336

Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTT 2036
              I P+      +         IS E+ESTT
Sbjct: 1337 GQIFPLP---LINGDEE---VIISVENESTT 1361



 Score = 31.6 bits (71), Expect = 6.4
 Identities = 16/75 (21%), Positives = 28/75 (37%), Gaps = 1/75 (1%)

Query: 2060 PASESITSSSP-ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
            PA + ++S  P  S                  +  +T T   A    + +S    +TT +
Sbjct: 1182 PAPQLLSSYIPTGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPAATTPT 1241

Query: 2119 SPASESTTIEEQGVS 2133
             P S S++   Q +S
Sbjct: 1242 PPPSSSSSSSAQSIS 1256


>gnl|CDD|177577 PHA03292, PHA03292, envelope glycoprotein I; Provisional.
          Length = 413

 Score = 36.1 bits (83), Expect = 0.24
 Identities = 24/148 (16%), Positives = 53/148 (35%), Gaps = 13/148 (8%)

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSS-PESESTTTSSPESESTTTSSLVSES 1994
            TT+ PE  +   ++P        +  + S + SS P   +T T +P  ++  T+      
Sbjct: 178  TTARPEPAAGYVATPTPRYLNAVTTSTYSRSMSSQPAGAATATPTPTLDTGLTTVAPPNE 237

Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSES 2054
            T  +   +       P +   T    +  +T  ++ +   T        S     P  + 
Sbjct: 238  TVVTGETALLCHWFQPSTRVPTLYLHLLGTTGNLTEDVLLTED------SEILRTPPPDP 291

Query: 2055 TTTNNPASESITSSSPASESTTTSSPAS 2082
            +++ +P +          + T ++SP  
Sbjct: 292  SSSRSPGAGD------DFKQTNSTSPKR 313



 Score = 34.5 bits (79), Expect = 0.63
 Identities = 27/142 (19%), Positives = 51/142 (35%), Gaps = 11/142 (7%)

Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSS-PESESTTTISPVSESTTTSSPVSES 2024
            TT+ PE  +   ++P        +  + S + SS P   +T T +P  ++  T+      
Sbjct: 178  TTARPEPAAGYVATPTPRYLNAVTTSTYSRSMSSQPAGAATATPTPTLDTGLTTVAPPNE 237

Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASES 2084
            T      +       P++   T        TT N      +T  S         +P  + 
Sbjct: 238  TVVTGETALLCHWFQPSTRVPTLYL-HLLGTTGNLTEDVLLTEDSEI-----LRTPPPDP 291

Query: 2085 TTTSSPASES----TTTSSPAS 2102
            +++ SP +      T ++SP  
Sbjct: 292  SSSRSPGAGDDFKQTNSTSPKR 313



 Score = 33.8 bits (77), Expect = 1.2
 Identities = 22/89 (24%), Positives = 35/89 (39%), Gaps = 2/89 (2%)

Query: 2029 SPESESTT-TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
             P+ E TT    PA+    T  P+  +  T +  S S  SS PA  +T T +P  ++  T
Sbjct: 172  VPDPEPTTARPEPAAGYVATPTPRYLNAVTTSTYSRS-MSSQPAGAATATPTPTLDTGLT 230

Query: 2088 SSPASESTTTSSPASESTTTSSPESESTT 2116
            +      T  +   +       P +   T
Sbjct: 231  TVAPPNETVVTGETALLCHWFQPSTRVPT 259


>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432).  This
            presumed domain is functionally uncharacterized. This
            domain is found in eukaryotes. This domain is about 100
            amino acids in length. This domain is found associated
            with pfam00096. This domain has two conserved sequence
            motifs: YPSPV and PSP.
          Length = 100

 Score = 33.6 bits (76), Expect = 0.24
 Identities = 29/99 (29%), Positives = 43/99 (43%), Gaps = 8/99 (8%)

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES------TTTISP 2030
             ++P S ++   S+ S S  +S P   +T+  SPV   T+ SSPVS        T+  SP
Sbjct: 3    KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPV--PTSYSSPVSSCYPSPVHTSFPSP 60

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
               +T  S   +  T        S  TN+ +S   T  S
Sbjct: 61   SIATTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTTPLS 99



 Score = 33.6 bits (76), Expect = 0.25
 Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 6/100 (6%)

Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT 2076
             ++PVS ++  IS  S S  +S P+  +T+  +P      T+  +  S    SP    T+
Sbjct: 3    KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSP----VPTSYSSPVSSCYPSPV--HTS 56

Query: 2077 TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
              SP+  +T  S   +  T  ++    S  T+S  S  TT
Sbjct: 57   FPSPSIATTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTT 96



 Score = 31.7 bits (71), Expect = 1.2
 Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 3/99 (3%)

Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSP--ESESTTTSSLVSESTTTSSPESES 2004
             ++P S ++   S+ S S  +S P   +T+  SP   S S+  SS       TS P    
Sbjct: 3    KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSPVHTSFPSPSI 62

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
             TT   VS +  T    S  ++ ++  S S+  ++P S+
Sbjct: 63   ATTYPSVSPTFQTQVATSFPSSVVT-NSFSSPVTTPLSD 100



 Score = 30.9 bits (69), Expect = 1.8
 Identities = 26/99 (26%), Positives = 45/99 (45%), Gaps = 3/99 (3%)

Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSP--ESESTTTSSLVSESTTTSSPESES 1974
             ++P S ++   S+ S S  +S P   +T+  SP   S S+  SS       TS P S S
Sbjct: 3    KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSPVHTSFP-SPS 61

Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
              T+ P    T  + + +   ++    S S+   +P+S+
Sbjct: 62   IATTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTTPLSD 100



 Score = 29.3 bits (65), Expect = 6.8
 Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 6/100 (6%)

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
             ++P S ++  IS  S S  +S P   +T+  SP    T+ SSP S       P    T+
Sbjct: 3    KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSP--VPTSYSSPVSSCY----PSPVHTS 56

Query: 2057 TNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
              +P+  +   S   +  T  ++    S  T+S +S  TT
Sbjct: 57   FPSPSIATTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTT 96


>gnl|CDD|219938 pfam08618, Opi1, Transcription factor Opi1.  Opi1 is a leucine zipper
            containing yeast transcription factor that negatively
            regulates phospholipid biosynthesis. It represses the
            expression of several UAS(INO) cis acting element
            containing genes and its activity is mediated by
            phosphorylations catalyzed by protein kinase A, protein
            kinase C and casein kinase II.
          Length = 387

 Score = 35.8 bits (82), Expect = 0.25
 Identities = 24/163 (14%), Positives = 39/163 (23%), Gaps = 5/163 (3%)

Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1965
            T       +   S+      T       STT   P + S  +    S    ++       
Sbjct: 48   TVAAVGRATGVESNNRWALNTPVRTTPSSTT--MPSALSKRSLDAASIHMASNGAPPLIQ 105

Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
             +S   +             T  S+   S              S    ++   +PV  S 
Sbjct: 106  KSSEKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSG---NSNRGAPVRTSA 162

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
             + S         SP   S+        + T N         S
Sbjct: 163  LSTSTLPGYDDHRSPRYSSSPVPQQPQTAVTANGGPRPPQPRS 205



 Score = 33.5 bits (76), Expect = 1.5
 Identities = 28/182 (15%), Positives = 53/182 (29%), Gaps = 10/182 (5%)

Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
              ST  +     +   S+      T       STT  S  S+ +  ++ +  ++  + P 
Sbjct: 45   VASTVAAVG-RATGVESNNRWALNTPVRTTPSSTTMPSALSKRSLDAASIHMASNGAPPL 103

Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
             + +   S        +   SE+  T+      +       ++       + +       
Sbjct: 104  IQKS---SEKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSGNSNRGAPVRT 160

Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTT----TSSPASESTTTSSPESESTTT 2117
            S   TS+ P  +     SP   S+        + T       P   S   S       T 
Sbjct: 161  SALSTSTLPGYDDH--RSPRYSSSPVPQQPQTAVTANGGPRPPQPRSAWQSGNGRVLITA 218

Query: 2118 SS 2119
            SS
Sbjct: 219  SS 220



 Score = 31.6 bits (71), Expect = 5.4
 Identities = 26/163 (15%), Positives = 51/163 (31%), Gaps = 5/163 (3%)

Query: 1883 VMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPES 1942
            V ST+ ++       ++      T       STT  S  S+ +  ++ +  ++  + P  
Sbjct: 45   VASTVAAVGRATGVESNNRWALNTPVRTTPSSTTMPSALSKRSLDAASIHMASNGAPPLI 104

Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
            +   +S   +             T  S +  S              S  S         +
Sbjct: 105  QK--SSEKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSGNSNRGAPVRTSA 162

Query: 2003 ESTTTISPVSESTT---TSSPVSESTTTISPESESTTTSSPAS 2042
             ST+T+    +  +   +SSPV +   T    +       P S
Sbjct: 163  LSTSTLPGYDDHRSPRYSSSPVPQQPQTAVTANGGPRPPQPRS 205



 Score = 31.2 bits (70), Expect = 6.9
 Identities = 18/155 (11%), Positives = 49/155 (31%), Gaps = 13/155 (8%)

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
             S +  ++T ++    +   S+      T       STT  S  S+ +   + +  ++  
Sbjct: 40   RSAIPVASTVAAVGRATGVESNNRWALNTPVRTTPSSTTMPSALSKRSLDAASIHMASNG 99

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
            + P+ + ++    E  +    +  +  T       +  +            +P+  S   
Sbjct: 100  APPLIQKSS----EKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSGNSNRG 155

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
            +             S  +T++ P  +   +    S
Sbjct: 156  APV---------RTSALSTSTLPGYDDHRSPRYSS 181



 Score = 30.8 bits (69), Expect = 9.4
 Identities = 29/178 (16%), Positives = 54/178 (30%), Gaps = 10/178 (5%)

Query: 1958 SSLVSESTTTSSPE----SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
            SS    S+ + SP     +E    S+    ST  +     +   S+      T +     
Sbjct: 17   SSHAYTSSKSYSPRFRYGAEIVERSAIPVASTVAAVG-RATGVESNNRWALNTPVRTTPS 75

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
            STT   P + S  ++   S    ++        ++   +        +    T  S    
Sbjct: 76   STT--MPSALSKRSLDAASIHMASNGAPPLIQKSSEKVNGGIDAIRNSETEGTLYSVDVG 133

Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
            S              S  S +       S  +T++ P  +     SP   S+ + +Q 
Sbjct: 134  SQGLRMRIQTQGYAPSGNS-NRGAPVRTSALSTSTLPGYDD--HRSPRYSSSPVPQQP 188


>gnl|CDD|227498 COG5170, CDC55, Serine/threonine protein phosphatase 2A, regulatory
           subunit [Signal transduction mechanisms].
          Length = 460

 Score = 35.8 bits (82), Expect = 0.28
 Identities = 52/301 (17%), Positives = 104/301 (34%), Gaps = 64/301 (21%)

Query: 380 SGYDRQIFIWSVYGECENIGVMS------GHTGAVMDLKFSTDGCHIFTCST-DQTLAVW 432
           S  D+ I +W +Y +  N+ V++           +     ST    +   S  D+ +A  
Sbjct: 105 STNDKTIKLWKIYEK--NLKVVAENNLSDSFHSPMGGPLTSTKELLLPRLSEHDEIIAAK 162

Query: 433 DLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDPRKKNQAVSMNN---- 488
                        H   +NS       + L+++  DD  + +W+    + + ++ +    
Sbjct: 163 -----PCRVYANAHPYHINSISFNSDKETLLSA--DDLRINLWNLEIIDGSFNIVDIKPH 215

Query: 489 -----TYQVTSVAFNDTAECVLTG--GIDNDIKMWDLRTNSV----------------VQ 525
                T  +TS  F+    C +        +IK+ DLR +++                V 
Sbjct: 216 NMEELTEVITSAEFH-PEMCNVFMYSSSKGEIKLNDLRQSALCDNSKKLFELTIDGVDVD 274

Query: 526 KLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFEK--- 582
                  +++    S +G YILS     TV+IWD+      +  +K +  H    ++   
Sbjct: 275 FFEEIVSSISDFKFSDNGRYILSRDY-LTVKIWDVN---MAKNPIKTIPMHCDLMDELND 330

Query: 583 --------NLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPK 634
                   +    ++S    +V +GS      I+ T +           G V ++     
Sbjct: 331 VYENDAIFDKFEISFSGDDKHVLSGSYSNNFGIYPTDS-----SGFKDVGHVVNLADGSA 385

Query: 635 E 635
           E
Sbjct: 386 E 386


>gnl|CDD|177555 PHA03193, PHA03193, tegument protein VP11/12; Provisional.
          Length = 594

 Score = 35.8 bits (82), Expect = 0.29
 Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 3/128 (2%)

Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
            +S      +   +       ++ +   I PE  S      A    + +  +  + TT + 
Sbjct: 440  DSPFQRKRAMPEDGGEIHEALANNGQAIFPECFSGDLPPIAQALLSAD--ELPNDTTAST 497

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
            ++E    +   +     +   +     +  A++ +  + PA      ++ ES ST   +P
Sbjct: 498  SNEMKGDAECPAAQDAAAILPASFQIENGGAADGSGLAIPA-AMCDATAVESPSTVAETP 556

Query: 2121 ASESTTIE 2128
                   E
Sbjct: 557  PERLLAAE 564



 Score = 34.3 bits (78), Expect = 0.95
 Identities = 27/159 (16%), Positives = 53/159 (33%), Gaps = 12/159 (7%)

Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
            +S      ++  +        + +   I P   S           +    E  + TT+S 
Sbjct: 440  DSPFQRKRAMPEDGGEIHEALANNGQAIFPECFSGDLPPIAQALLSAD--ELPNDTTAST 497

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
            ++E                PA++   +  PAS        A++ +  + PA+    T+  
Sbjct: 498  SNEMKGDAEC---------PAAQDAAAILPASFQIENGG-AADGSGLAIPAAMCDATAVE 547

Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKL 2139
            +  +   + PE      S P  ++T   + G S   E L
Sbjct: 548  SPSTVAETPPERLLAAESGPRCKATAKHKGGSSKVEEIL 586



 Score = 33.2 bits (75), Expect = 2.1
 Identities = 15/135 (11%), Positives = 42/135 (31%), Gaps = 9/135 (6%)

Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
            +SP         PE       +  +      ++  E  +   P       S+ E  + TT
Sbjct: 440  DSPFQRKRAM--PEDGGEIHEALANNG---QAIFPECFSGDLPPIAQALLSADELPNDTT 494

Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
            +S  +E    +   +     +   +     +   ++ +  + P +         +  + +
Sbjct: 495  ASTSNEMKGDAECPAAQDAAAILPASFQIENGGAADGSGLAIPAA----MCDATAVESPS 550

Query: 2018 SSPVSESTTTISPES 2032
            +   +     ++ ES
Sbjct: 551  TVAETPPERLLAAES 565



 Score = 32.0 bits (72), Expect = 5.0
 Identities = 20/134 (14%), Positives = 46/134 (34%), Gaps = 7/134 (5%)

Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLV---SESTTTSSPESESTTTSSP--- 1950
             +  E      NN ++      S +      + L      + TT+S  +E    +     
Sbjct: 451  EDGGEIHEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTSNEMKGDAECPAA 510

Query: 1951 -ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
             ++ +   +S   E+   +     +   +  ++ +  + S V+E+       +ES     
Sbjct: 511  QDAAAILPASFQIENGGAADGSGLAIPAAMCDATAVESPSTVAETPPERLLAAESGPRCK 570

Query: 2010 PVSESTTTSSPVSE 2023
              ++    SS V E
Sbjct: 571  ATAKHKGGSSKVEE 584


>gnl|CDD|227502 COG5175, MOT2, Transcriptional repressor [Transcription].
          Length = 480

 Score = 35.8 bits (82), Expect = 0.29
 Identities = 26/180 (14%), Positives = 62/180 (34%), Gaps = 9/180 (5%)

Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1979
            PE +S T   L +        E  +         ++T          T +P   +    +
Sbjct: 227  PEKDSLTKDELCNSQHKLHGSEVRNKNKKRIHRSTSTARYDTDLLNFTGTPSPAAM--EA 284

Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
                 T+      +       +  +T + +PV+ S ++S  +     ++   +E+TTT++
Sbjct: 285  QFKHKTSRVFKAPDKILFPPLDFTNTQSATPVTLSNSSSINLPTLNDSLGHHTETTTTTN 344

Query: 2040 PASESTTTNNPKSES--TTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
              + S +  + K +S          +++ ++     +   S    +    S  + E T  
Sbjct: 345  TNATSHSHGSKKKQSLAAEEYKDPYDALGNA-----ARLHSLSNYQKRPISIKSDEETYK 399



 Score = 31.6 bits (71), Expect = 6.3
 Identities = 21/144 (14%), Positives = 46/144 (31%), Gaps = 17/144 (11%)

Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTS-------- 1928
            N   T   + + +     T+      +       +  +T +++P + S ++S        
Sbjct: 272  NFTGTPSPAAMEAQFKHKTSRVFKAPDKILFPPLDFTNTQSATPVTLSNSSSINLPTLND 331

Query: 1929 SLVSESTTTSSPESESTTTSSP---------ESESTTTSSLVSESTTTSSPESESTTTSS 1979
            SL   + TT++  + +T+ S           E       +L + +   S    +    S 
Sbjct: 332  SLGHHTETTTTTNTNATSHSHGSKKKQSLAAEEYKDPYDALGNAARLHSLSNYQKRPISI 391

Query: 1980 PESESTTTSSLVSESTTTSSPESE 2003
               E T          T ++   E
Sbjct: 392  KSDEETYKKWDKKSDNTLANKLVE 415



 Score = 31.2 bits (70), Expect = 8.2
 Identities = 30/150 (20%), Positives = 55/150 (36%), Gaps = 12/150 (8%)

Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
            PE +S T   L +         SE         +    S+  +   T +      T T S
Sbjct: 227  PEKDSLTKDELCNSQHK--LHGSEVRNK---NKKRIHRSTSTARYDTDLLN---FTGTPS 278

Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
            PA+      +     T+    A + I        +T +++P + S ++S        +  
Sbjct: 279  PAAMEAQFKH----KTSRVFKAPDKILFPPLDFTNTQSATPVTLSNSSSINLPTLNDSLG 334

Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEE 2129
              +E+TTT++  + S +  S   +S   EE
Sbjct: 335  HHTETTTTTNTNATSHSHGSKKKQSLAAEE 364


>gnl|CDD|227600 COG5275, COG5275, BRCT domain type II [General function prediction
            only].
          Length = 276

 Score = 35.5 bits (81), Expect = 0.31
 Identities = 20/90 (22%), Positives = 35/90 (38%), Gaps = 5/90 (5%)

Query: 1925 TTTSSLVSESTTTSSPE-----SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1979
            T       +ST + S        E+TT+        T     S+S         +   + 
Sbjct: 16   TPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQ 75

Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTIS 2009
            P++E +TTS   S +TT ++  S S+ +  
Sbjct: 76   PKAEKSTTSKSKSHTTTATTHTSRSSKSKG 105



 Score = 33.6 bits (76), Expect = 1.1
 Identities = 23/94 (24%), Positives = 38/94 (40%)

Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
            S   S+   E     S  S S   I    E+TT+   V    T +   S+S         
Sbjct: 10   SDGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRA 69

Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
            +     PK+E +TT+   S + T+++  S S+ +
Sbjct: 70   TRKPAQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103



 Score = 33.6 bits (76), Expect = 1.2
 Identities = 25/139 (17%), Positives = 48/139 (34%), Gaps = 7/139 (5%)

Query: 1935 TTTSSPESESTTTSSPES-ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
            T     E +ST + S     S   ++   +       E ++T+ S P    T  +   ++
Sbjct: 16   TPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQ 75

Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
                  P++E +TT    S +TT ++  S S+ +      S   S         +     
Sbjct: 76   ------PKAEKSTTSKSKSHTTTATTHTSRSSKSKGLPRFSDEVSQALKNVPLIDVDSMG 129

Query: 2054 STTTNNPASESITSSSPAS 2072
                      + T+ +P S
Sbjct: 130  VMAPGTFYERAATTQTPGS 148



 Score = 33.2 bits (75), Expect = 1.7
 Identities = 20/88 (22%), Positives = 36/88 (40%), Gaps = 5/88 (5%)

Query: 1895 TTTNSPESESTTTNNPE-----SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
            T     E +ST + +        E+TT+        T     S+S         +   + 
Sbjct: 16   TPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQ 75

Query: 1950 PESESTTTSSLVSESTTTSSPESESTTT 1977
            P++E +TTS   S +TT ++  S S+ +
Sbjct: 76   PKAEKSTTSKSKSHTTTATTHTSRSSKS 103



 Score = 32.8 bits (74), Expect = 2.2
 Identities = 18/93 (19%), Positives = 39/93 (41%), Gaps = 4/93 (4%)

Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT- 1967
            +  S +      +  + + S     S   ++   +       E ++T+ S  V   T   
Sbjct: 11   DGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRAT 70

Query: 1968 ---SSPESESTTTSSPESESTTTSSLVSESTTT 1997
               + P++E +TTS  +S +TT ++  S S+ +
Sbjct: 71   RKPAQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103



 Score = 32.0 bits (72), Expect = 3.4
 Identities = 22/94 (23%), Positives = 34/94 (36%)

Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
            S   S+   E     S  S S        E+TT+        T +   S+S         
Sbjct: 10   SDGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRA 69

Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
            +     P++E +TTS   S +TT     S S+ +
Sbjct: 70   TRKPAQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103



 Score = 32.0 bits (72), Expect = 3.5
 Identities = 21/85 (24%), Positives = 37/85 (43%), Gaps = 5/85 (5%)

Query: 1911 ESESTTTSSPE-----SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1965
            E +ST + S        E+TT+  +V    T     S+S         +   +   +E +
Sbjct: 22   EQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQPKAEKS 81

Query: 1966 TTSSPESESTTTSSPESESTTTSSL 1990
            TTS  +S +TT ++  S S+ +  L
Sbjct: 82   TTSKSKSHTTTATTHTSRSSKSKGL 106



 Score = 31.6 bits (71), Expect = 4.2
 Identities = 21/93 (22%), Positives = 35/93 (37%), Gaps = 3/93 (3%)

Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESES 2034
            +TT     E  +T S       ++   + S   + PV     T+S        +     +
Sbjct: 14   STTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKP---VVHQTRAT 70

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
               + P +E +TT+  KS +TT     S S  S
Sbjct: 71   RKPAQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103



 Score = 31.6 bits (71), Expect = 4.5
 Identities = 25/133 (18%), Positives = 44/133 (33%), Gaps = 5/133 (3%)

Query: 1955 TTTSSLVSESTTTSSPE-----SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
            T       +ST + S        E+TT+        T     S+S         +     
Sbjct: 16   TPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQ 75

Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
            P +E +TTS   S +TT  +  S S+ +      S   +         +  +   +   +
Sbjct: 76   PKAEKSTTSKSKSHTTTATTHTSRSSKSKGLPRFSDEVSQALKNVPLIDVDSMGVMAPGT 135

Query: 2070 PASESTTTSSPAS 2082
                + TT +P S
Sbjct: 136  FYERAATTQTPGS 148



 Score = 31.2 bits (70), Expect = 6.3
 Identities = 21/86 (24%), Positives = 35/86 (40%)

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
             E     S  S S   I    E+TT+        T  +  S+S    +    +   + P 
Sbjct: 18   DEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQPK 77

Query: 2072 SESTTTSSPASESTTTSSPASESTTT 2097
            +E +TTS   S +TT ++  S S+ +
Sbjct: 78   AEKSTTSKSKSHTTTATTHTSRSSKS 103



 Score = 30.9 bits (69), Expect = 8.8
 Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 7/90 (7%)

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS---ESTTTSSPASESTTT---- 2087
            +TT     E  +T +       +N   + S     P     ++T+ S P    T      
Sbjct: 14   STTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKP 73

Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTT 2117
            + P +E +TTS   S +TT ++  S S+ +
Sbjct: 74   AQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103


>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32.  This family
            consists of several mammalian specific proacrosin binding
            protein sp32 sequences. sp32 is a sperm specific protein
            which is known to bind with with 55- and 53-kDa
            proacrosins and the 49-kDa acrosin intermediate. The
            exact function of sp32 is unclear, it is thought however
            that the binding of sp32 to proacrosin may be involved in
            packaging the acrosin zymogen into the acrosomal matrix.
          Length = 243

 Score = 35.0 bits (80), Expect = 0.33
 Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 1/56 (1%)

Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS-PHSEKLSANED 2144
            P S+  +  SP +      S E + TT + P +E  TI E     P  E+L  N +
Sbjct: 122  PCSQPVSILSPNTLKEAEPSAEVQPTTMTLPIAEHPTITENQSFQPWPERLHNNVE 177


>gnl|CDD|215187 PLN02328, PLN02328, lysine-specific histone demethylase 1 homolog.
          Length = 808

 Score = 35.7 bits (82), Expect = 0.34
 Identities = 28/91 (30%), Positives = 44/91 (48%), Gaps = 4/91 (4%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS---SPE 1951
            T T  PE  +   N+  SE+++  +  S S + S    E+   +SPE++S  T    SP 
Sbjct: 3    TETKEPEDPADNVNDVVSEASSPETDLSLSPSQSEQNIENDGQNSPETQSPLTELQPSPL 62

Query: 1952 SESTTTSSLVSEST-TTSSPESESTTTSSPE 1981
              +TT  + VS+S    SS E +    +S E
Sbjct: 63   PPNTTLDAPVSDSQGDESSSEQQPQNPNSTE 93



 Score = 35.0 bits (80), Expect = 0.62
 Identities = 27/99 (27%), Positives = 43/99 (43%), Gaps = 9/99 (9%)

Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
             T T  PE  +   +   SE+++  + +S S + S    E+   +SPE++S  T      
Sbjct: 2    ETETKEPEDPADNVNDVVSEASSPETDLSLSPSQSEQNIENDGQNSPETQSPLTE----- 56

Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPES 2032
                 SP   +TT  +PVS+S    S  S      +P S
Sbjct: 57   --LQPSPLPPNTTLDAPVSDSQGDES--SSEQQPQNPNS 91



 Score = 34.6 bits (79), Expect = 0.89
 Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 13/103 (12%)

Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
             T T  P   +   N+  SE+++      E+  S SP+       +    S  T SP +E
Sbjct: 2    ETETKEPEDPADNVNDVVSEASSP-----ETDLSLSPSQSEQNIENDGQNSPETQSPLTE 56

Query: 2094 STTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHS 2136
                 SP   +TT  +P S+S        + ++ E+Q  +P+S
Sbjct: 57   --LQPSPLPPNTTLDAPVSDSQ------GDESSSEQQPQNPNS 91



 Score = 33.0 bits (75), Expect = 2.4
 Identities = 27/107 (25%), Positives = 49/107 (45%), Gaps = 12/107 (11%)

Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1963
             T T  PE  +   +   SE+++  + +S S + S    E+   +SPE++S  T      
Sbjct: 2    ETETKEPEDPADNVNDVVSEASSPETDLSLSPSQSEQNIENDGQNSPETQSPLTE----- 56

Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
                 SP   +TT  +P S+     S   ES++   P++ ++T  +P
Sbjct: 57   --LQPSPLPPNTTLDAPVSD-----SQGDESSSEQQPQNPNSTEPAP 96



 Score = 31.5 bits (71), Expect = 7.4
 Identities = 26/117 (22%), Positives = 46/117 (39%), Gaps = 22/117 (18%)

Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
             T T  PE  +      VSE+++      E+  ++SP           SE    N+ ++ 
Sbjct: 2    ETETKEPEDPADNVNDVVSEASSP-----ETDLSLSPSQ---------SEQNIENDGQNS 47

Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
              T +          SP   +TT  +P S+S        ES++   P + ++T  +P
Sbjct: 48   PETQSPLTEL---QPSPLPPNTTLDAPVSDSQ-----GDESSSEQQPQNPNSTEPAP 96


>gnl|CDD|233186 TIGR00920, 2A060605, 3-hydroxy-3-methylglutaryl-coenzyme A reductase.
             [Transport and binding proteins, Carbohydrates, organic
            alcohols, and acids].
          Length = 889

 Score = 36.0 bits (83), Expect = 0.35
 Identities = 19/120 (15%), Positives = 38/120 (31%), Gaps = 14/120 (11%)

Query: 1867 YSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT 1926
             S+ IF +   +ESTV +   + +++             +T+  + E          + T
Sbjct: 330  ASKYIFFSQGETESTVSLKNGDPVVNPV-----------STDKKQLEYCCRRELTVSADT 378

Query: 1927 TSSLVSESTTTSSPESESTTTSSP---ESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
                + E    S           P    S+S   +S       + + +   +    PE E
Sbjct: 379  IVVSILEEALASKFVFFEVIKPLPTETGSDSWVEASFPVGHKYSGTEQPSCSAPKEPEEE 438


>gnl|CDD|219971 pfam08690, GET2, GET complex subunit GET2.  This family corresponds
            to the GET complex subunit GET2. The GET complex is
            involved in the retrieval of ER resident proteins from
            the Golgi.
          Length = 298

 Score = 35.1 bits (81), Expect = 0.36
 Identities = 22/118 (18%), Positives = 42/118 (35%), Gaps = 10/118 (8%)

Query: 1916 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
            T      +  + S L ++    +   + +   S+PE +       + E+      ESES 
Sbjct: 32   TGQGSSVKLVSKSVLDAKPEDNTGSTTSAHDQSTPEIQD------ILEAIDPPKDESESP 85

Query: 1976 TTS-SPESE--STTTSSLVSESTTTSSPESESTTTI-SPVSESTTTSSPVSESTTTIS 2029
              +  PE E            + + + P  +ST  + S + +      P SES  +  
Sbjct: 86   AENIDPEVEMFQQLAKMQQQGNGSDNPPADDSTADLFSMLLQMGGGDGPDSESPASAQ 143


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.  Members of
            this protein share two defining regions. One is a
            histidine/serine-rich cluster, typically
            H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
            the context of a pair of radical SAM proteins, HxsB and
            HxsC, and a fourth protein HxsD. The system is predicted
            to perform peptide modifications, likely in the
            His-Xaa-Ser region, to produce some uncharacterized
            natural product.
          Length = 186

 Score = 34.5 bits (79), Expect = 0.37
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 6/69 (8%)

Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTS----SPASESTTTSSPESESTTT--SSPA 2121
            SS  S S+ +S  +    + S P+ +++T S    SP+   +  SS +S  +TT     +
Sbjct: 57   SSHRSHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSSIQSLPSTTGVRPQS 116

Query: 2122 SESTTIEEQ 2130
            S      E+
Sbjct: 117  SAENANSEK 125



 Score = 31.0 bits (70), Expect = 4.8
 Identities = 16/71 (22%), Positives = 30/71 (42%), Gaps = 4/71 (5%)

Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
            S  S S+ +S  +    + + P  +++T + P      S SP+  S+  S P+  +T   
Sbjct: 58   SHRSHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSP--SYSPSPGSSIQSLPS--TTGVR 113

Query: 2089 SPASESTTTSS 2099
              +S     S 
Sbjct: 114  PQSSAENANSE 124



 Score = 30.6 bits (69), Expect = 7.0
 Identities = 15/59 (25%), Positives = 23/59 (38%), Gaps = 3/59 (5%)

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
             S +  S S  S  T+T S     + + SP+  S+  S P+  +T      S     S 
Sbjct: 69   YSGAGGSYSVPSGDTSTYS-YPVPSPSYSPSPGSSIQSLPS--TTGVRPQSSAENANSE 124


>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
            activation/proliferation-associated protein-1 C term.
            This family of proteins is found in eukaryotes. Proteins
            in this family are typically between 343 and 708 amino
            acids in length. This family is the C terminal region of
            caprin-1. Caprin-1 is a protein involved in regulating
            cellular proliferation. In mutated phenotypes, the G1
            phase of the cell cycle is greatly lengthened, impairing
            normal proliferation. The C terminal region of caprin-1
            contains RGG motifs which are characteristic of RNA
            binding domains. It is possible that caprin-1 functions
            through an RNA binding mechanism.
          Length = 319

 Score = 35.3 bits (81), Expect = 0.37
 Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
            P    +   SP SE  T+S P  + + T+ P  + T    P   S + +S ++ ++++  
Sbjct: 61   PEPTQVPMVSPTSEGYTSSPPLYQPSHTAEPRPQ-TDPIDPIQASMSLNSEQTPTSSSLP 119

Query: 2120 PASESTTIEEQGVSPHSEKLSANEDP 2145
             AS+    +      HS  ++ N  P
Sbjct: 120  AASQPQVFQTGSKPLHSSGINVNAAP 145


>gnl|CDD|240410 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
          Length = 593

 Score = 35.5 bits (82), Expect = 0.40
 Identities = 22/74 (29%), Positives = 34/74 (45%), Gaps = 5/74 (6%)

Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
            S L + +   T     ++++ T  N  S +T+  S  S ST+ S+     +  SSP   S
Sbjct: 525  SQLPAFVLSQTPEEPVKTKANTKTNTSSATTSGQSGSSGSTSNSN-----SNESSPTMSS 579

Query: 1945 TTTSSPESESTTTS 1958
            T   +  S STT S
Sbjct: 580  TELLNVSSTSTTGS 593



 Score = 32.8 bits (75), Expect = 2.8
 Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 4/53 (7%)

Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPASE---STTTSSPESESTTTS 2118
            + A+  T TSS  +   + SS  S S + S+ +S    ST   +  S STT S
Sbjct: 542  TKANTKTNTSSATTSGQSGSSG-STSNSNSNESSPTMSSTELLNVSSTSTTGS 593



 Score = 32.5 bits (74), Expect = 3.1
 Identities = 15/69 (21%), Positives = 30/69 (43%), Gaps = 8/69 (11%)

Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
              +      +  +  T TSS  +   + ++  S S + +N +S +++S       T   +
Sbjct: 533  SQTPEEPVKTKANTKTNTSSATTSGQSGSSG-STSNSNSNESSPTMSS-------TELLN 584

Query: 2080 PASESTTTS 2088
             +S STT S
Sbjct: 585  VSSTSTTGS 593



 Score = 31.7 bits (72), Expect = 6.8
 Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 4/62 (6%)

Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE---STTTNNPKSESTT 2056
             ++      +  +  T TSS  +   +  S  S S + S+ +S    ST   N  S STT
Sbjct: 533  SQTPEEPVKTKANTKTNTSSATTSGQSGSSG-STSNSNSNESSPTMSSTELLNVSSTSTT 591

Query: 2057 TN 2058
             +
Sbjct: 592  GS 593


>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B N-terminal. 
            This is the eukaryotic DNA polymerase alpha subunit B
            N-terminal domain which is involved in complex formation.
            Also see pfam04058.
          Length = 239

 Score = 34.7 bits (80), Expect = 0.40
 Identities = 28/134 (20%), Positives = 46/134 (34%), Gaps = 12/134 (8%)

Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS---SPASESTTTNNPKSESTTT 2057
            E    T  S  +       P +E +   S     TT S         +T   PK + + +
Sbjct: 63   EKRVRTPASIKTSKRLIEVPEAEESLLDS----YTTPSDKGGMLRILSTPELPKRKRSFS 118

Query: 2058 NNPASESITSSSPASES----TTTSSPASESTTTSSPASESTTTSSPA-SESTTTSSPES 2112
             +         SPAS S     +T SP S   ++ S   E   T +P   ++     P+S
Sbjct: 119  ASSLESPSLFFSPASFSPSAAPSTPSPNSAKFSSRSNPGEVVETLNPHLGQTPEGGGPDS 178

Query: 2113 ESTTTSSPASESTT 2126
            +     S   ++  
Sbjct: 179  DPKVKLSANFDAKK 192


>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
            factor.  RNA polymerase I-specific
            transcription-initiation factor Rrn6 and Rrn7 represent
            components of a multisubunit transcription factor
            essential for the initiation of rDNA transcription by Pol
            I. These proteins are found in fungi.
          Length = 753

 Score = 35.5 bits (82), Expect = 0.44
 Identities = 37/258 (14%), Positives = 79/258 (30%), Gaps = 38/258 (14%)

Query: 1761 SRDINSVSPNVTSKILTTDNYSEIIFTTNNNSESTVVM-----STLNSLLSENEKLFKPH 1815
               +    P++         +  +I   +++ +S  +      + LN L+    +  +  
Sbjct: 505  VLSLLDELPSLPDHDQNITEFDSLISQLSSHYQSEDLTFSSLINFLNQLIHVLSEESRTS 564

Query: 1816 AKTPGAEFLIQCQYCDFDSSMNLLSVSPYITNNLLISM-LAATAVAISVIDNYSEIIFTT 1874
                  + L+QC   +     +L      +   +   + L+   V+   ID+  E     
Sbjct: 565  LDDI-YDKLLQCWESNLPH--DLPGTKEKLIRKIAAEIGLSLIKVSKKEIDSRLEEFLDE 621

Query: 1875 NNNSESTVVMSTLNSLLS--ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
            N NS S  V     ++L   +     S   +S  T        T SS     +   ++ S
Sbjct: 622  NTNSLSEEV----KNILDHWDPGDDPSDVDDSQATQPDV----TDSSQLESQSQIPTIRS 673

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
                + + +  S                   S   S+P      +S P +  +++    S
Sbjct: 674  SQQVSQTRKGGS-------------------SVVPSAPAPRLAQSSQPPTSQSSSDLPPS 714

Query: 1993 ESTTTSSPESESTTTISP 2010
             S   S  +    +    
Sbjct: 715  SSQAFSLSDLPMQSQSES 732



 Score = 34.4 bits (79), Expect = 0.88
 Identities = 23/112 (20%), Positives = 42/112 (37%), Gaps = 3/112 (2%)

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
            SE    I    +     S   +S  T    ++S+      S+S   +  +S+  + +   
Sbjct: 627  SEEVKNILDHWDPGDDPSDVDDSQATQPDVTDSS---QLESQSQIPTIRSSQQVSQTRKG 683

Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
              S   S+PA     +S P +  +++  P S S   S       +  E G+S
Sbjct: 684  GSSVVPSAPAPRLAQSSQPPTSQSSSDLPPSSSQAFSLSDLPMQSQSESGLS 735



 Score = 32.0 bits (73), Expect = 4.3
 Identities = 22/115 (19%), Positives = 42/115 (36%), Gaps = 4/115 (3%)

Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTS--SPESESTTTSSLVSESTTTSSPE 2001
            S    S +   +     + E+T + S E ++        +  S    S  ++   T S +
Sbjct: 602  SLIKVSKKEIDSRLEEFLDENTNSLSEEVKNILDHWDPGDDPSDVDDSQATQPDVTDSSQ 661

Query: 2002 SESTTTISPVSESTTTSSP--VSESTTTISPESESTTTSSPASESTTTNNPKSES 2054
             ES + I  +  S   S       S    +P      +S P +  ++++ P S S
Sbjct: 662  LESQSQIPTIRSSQQVSQTRKGGSSVVPSAPAPRLAQSSQPPTSQSSSDLPPSSS 716



 Score = 32.0 bits (73), Expect = 5.2
 Identities = 15/74 (20%), Positives = 29/74 (39%)

Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            +   SS   S+S   +  +S+  + +     S   S+PA     +S P +  +++  P S
Sbjct: 655  DVTDSSQLESQSQIPTIRSSQQVSQTRKGGSSVVPSAPAPRLAQSSQPPTSQSSSDLPPS 714

Query: 2123 ESTTIEEQGVSPHS 2136
             S       +   S
Sbjct: 715  SSQAFSLSDLPMQS 728


>gnl|CDD|221067 pfam11301, DUF3103, Protein of unknown function (DUF3103).  This
            family of proteins with unknown function appear to be
            restricted to Proteobacteria.
          Length = 344

 Score = 35.0 bits (81), Expect = 0.44
 Identities = 15/67 (22%), Positives = 19/67 (28%), Gaps = 6/67 (8%)

Query: 1456 PALEASNDIAELLMECLATMKEYEVECKEMTAMGKPPPSLPPIMKALNVTSPRDYLMTVL 1515
            P      D  + L   LA M+       E     K     P    +           TVL
Sbjct: 125  PVFVVDLDSKKELKAGLAVMRA------EFAQAAKQMQLQPRSAASSAAAETAPISTTVL 178

Query: 1516 SRIRSTD 1522
             +IR  D
Sbjct: 179  KKIRLKD 185


>gnl|CDD|236782 PRK10871, nlpD, lipoprotein NlpD; Provisional.
          Length = 319

 Score = 34.8 bits (80), Expect = 0.51
 Identities = 18/74 (24%), Positives = 33/74 (44%), Gaps = 3/74 (4%)

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA---SESTTTS 2088
            +E      PA  ST     +   T + +   +S     P ++   T+  A   + + +T+
Sbjct: 125  AEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPAATTVTAPVTAPTASTT 184

Query: 2089 SPASESTTTSSPAS 2102
             P + ST+TS+P S
Sbjct: 185  EPTASSTSTSTPIS 198



 Score = 34.0 bits (78), Expect = 1.0
 Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 7/76 (9%)

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTN----NPKSESTTTNNP-ASESIT 2066
            +E      P   ST  ++  S+ T T S +S   + N    N K  +TT   P  + + +
Sbjct: 125  AEQGVVIKPAQNSTVAVA--SQPTITYSESSGEQSANKMLPNNKPAATTVTAPVTAPTAS 182

Query: 2067 SSSPASESTTTSSPAS 2082
            ++ P + ST+TS+P S
Sbjct: 183  TTEPTASSTSTSTPIS 198



 Score = 32.9 bits (75), Expect = 2.2
 Identities = 24/101 (23%), Positives = 39/101 (38%), Gaps = 9/101 (8%)

Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
            + S T I+  +  T     A+E      P   ST           S S   +S     P 
Sbjct: 107  NASGTPITGGNAITQAD--AAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPN 164

Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            ++   T       T T+   + + +T+ P + ST+TS+P S
Sbjct: 165  NKPAAT-------TVTAPVTAPTASTTEPTASSTSTSTPIS 198



 Score = 31.7 bits (72), Expect = 4.3
 Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 7/81 (8%)

Query: 1967 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPES--ESTTTISPVSE--STTTSSPVS 2022
            T +  +E      P   ST    + S+ T T S  S  +S   + P ++  +TT ++PV+
Sbjct: 120  TQADAAEQGVVIKPAQNSTVA--VASQPTITYSESSGEQSANKMLPNNKPAATTVTAPVT 177

Query: 2023 ESTTTIS-PESESTTTSSPAS 2042
              T + + P + ST+TS+P S
Sbjct: 178  APTASTTEPTASSTSTSTPIS 198



 Score = 31.3 bits (71), Expect = 6.7
 Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 1/92 (1%)

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
            + S T  +  +  T   + E       +  S     S P    + +S  +S +    +  
Sbjct: 107  NASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNK 166

Query: 1992 SESTTTSSPESESTTTISPVSESTT-TSSPVS 2022
              +TT ++P +  T + +  + S+T TS+P+S
Sbjct: 167  PAATTVTAPVTAPTASTTEPTASSTSTSTPIS 198



 Score = 31.0 bits (70), Expect = 8.0
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 1927 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1986
            T +  +E      P   ST   + +   T + S   +S     P ++   T       T 
Sbjct: 120  TQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPAAT-------TV 172

Query: 1987 TSSLVSESTTTSSPESESTTTISPVS 2012
            T+ + + + +T+ P + ST+T +P+S
Sbjct: 173  TAPVTAPTASTTEPTASSTSTSTPIS 198


>gnl|CDD|147982 pfam06112, Herpes_capsid, Gammaherpesvirus capsid protein.  This
            family consists of several Gammaherpesvirus capsid
            proteins. The exact function of this family is unknown.
          Length = 148

 Score = 33.7 bits (77), Expect = 0.51
 Identities = 16/65 (24%), Positives = 28/65 (43%), Gaps = 2/65 (3%)

Query: 1920 PESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSLVSESTTTSSPESESTTT 1977
            P++ S+  S+L + S++ S     +     SS  + S+   SL S S+ + S      T 
Sbjct: 82   PQTSSSIGSALSASSSSASGVPGGANQLSGSSGSALSSGPGSLSSSSSLSGSGAGAGDTA 141

Query: 1978 SSPES 1982
             S   
Sbjct: 142  PSSSK 146



 Score = 32.9 bits (75), Expect = 0.74
 Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 2/60 (3%)

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
              S SI S+  AS S+ +  P   +  + S  S S  +S P S S+++S   S +    +
Sbjct: 83   QTSSSIGSALSASSSSASGVPGGANQLSGS--SGSALSSGPGSLSSSSSLSGSGAGAGDT 140



 Score = 31.7 bits (72), Expect = 2.3
 Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 2/63 (3%)

Query: 1950 PESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSLVSESTTTSSPESESTTT 2007
            P++ S+  S+L + S++ S     +     SS  + S+   SL S S+ + S      T 
Sbjct: 82   PQTSSSIGSALSASSSSASGVPGGANQLSGSSGSALSSGPGSLSSSSSLSGSGAGAGDTA 141

Query: 2008 ISP 2010
             S 
Sbjct: 142  PSS 144



 Score = 31.4 bits (71), Expect = 3.1
 Identities = 15/65 (23%), Positives = 31/65 (47%), Gaps = 1/65 (1%)

Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
            P++ S+  +  ++ S ++S     +   S  +S S  +S P S S+++S   S +    +
Sbjct: 82   PQTSSSIGSALSASSSSASGVPGGANQLSG-SSGSALSSGPGSLSSSSSLSGSGAGAGDT 140

Query: 2110 PESES 2114
              S S
Sbjct: 141  APSSS 145



 Score = 30.6 bits (69), Expect = 4.6
 Identities = 14/64 (21%), Positives = 32/64 (50%), Gaps = 1/64 (1%)

Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
            +++  + P + S+  S+ ++ S++ S     +   S  +S S  +S P S S+++S   S
Sbjct: 75   QALRGAGPQTSSSIGSALSASSSSASGVPGGANQLSG-SSGSALSSGPGSLSSSSSLSGS 133

Query: 2123 ESTT 2126
             +  
Sbjct: 134  GAGA 137


>gnl|CDD|222819 PHA01077, PHA01077, putative lower collar protein.
          Length = 251

 Score = 34.7 bits (79), Expect = 0.51
 Identities = 18/102 (17%), Positives = 35/102 (34%)

Query: 1978 SSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTT 2037
            SS E E    S   +E    ++  ++ T+  +  S   +T    + +     P+SE    
Sbjct: 114  SSSEVEKYLQSQGFTEHNEDTTNNTDETSNQNATSLDNSTGMTANRNAYVSLPQSEVNID 173

Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
                +     NN      T N  ++ES  ++         + 
Sbjct: 174  VDNTTLRFADNNTIDNGKTVNKSSNESNQNAKRNQNQKGNAK 215


>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein.  The
            proteins in this family are poorly characterized, but an
            investigation has indicated that the immediate early
            protein is required the down-regulation of MHC class I
            expression in dendritic cells. Human herpesvirus 6
            immediate early protein is also referred to as U90.
          Length = 993

 Score = 35.1 bits (80), Expect = 0.58
 Identities = 43/261 (16%), Positives = 92/261 (35%), Gaps = 7/261 (2%)

Query: 1862 SVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPE 1921
            S++D  ++ + T   +  +     +L +L     T     S    T+N    + + +  +
Sbjct: 477  SLLDTQADSVVTQTVSKNNEAFNMSLYNLKRNEETYQDKNSRDKKTDNQAGPTFSRTDKK 536

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSS 1979
            + S     +        + + E        ++ T  + L+SE  +       S    + S
Sbjct: 537  TNSPAGILMERSIFNKDTQDKEQYFELFTMTDGTLDNPLISEMLSFGYETDHSAPYESES 596

Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
              ++     + V     T++    +T   +P S+S  +   V+ S T    + +   +++
Sbjct: 597  DNNDEIDYIASVDSGNRTNNIHMNNTNENTPFSKSGKSPPEVTPSKTFYKRDKKKDISTN 656

Query: 2040 PASESTTTNNPKSESTTTNNPASESI-TSSSPASESTTTSSPASESTTTSSPA-SESTTT 2097
               +  T    K ++       S+ I + S P   +    S  SE          +S   
Sbjct: 657  RKVKKRTA---KRKTVGYKTDKSKKIKSDSLPTDTNVIVISSESEDEEDGFNIIKKSQLK 713

Query: 2098 SSPASESTTTSSPESESTTTS 2118
                SE  + SS ES+  T+ 
Sbjct: 714  KKIKSELKSESSSESDDCTSE 734


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 34.7 bits (80), Expect = 0.59
 Identities = 16/95 (16%), Positives = 28/95 (29%), Gaps = 2/95 (2%)

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT-TNNPKSEST 2055
            + + +      IS +  +    S     +T    E++  TTS+   +     N  K +S 
Sbjct: 13   SGTTQKSKLQPISYIYSNVLVLS-KEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSE 71

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
                   E     S         +P     T   P
Sbjct: 72   KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106



 Score = 34.3 bits (79), Expect = 0.82
 Identities = 18/95 (18%), Positives = 26/95 (27%), Gaps = 3/95 (3%)

Query: 2018 SSPVSESTTT-ISPESESTTTSSPASESTTT-NNPKSESTTTNNPASESITSSSPASEST 2075
            S    +S    IS    +    S    ST +    K  +T+T     E   + S   +S 
Sbjct: 13   SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESK-KKSE 71

Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
                   E     S         +P     T   P
Sbjct: 72   KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106



 Score = 33.5 bits (77), Expect = 1.3
 Identities = 16/96 (16%), Positives = 26/96 (27%), Gaps = 4/96 (4%)

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSS---PESESTTTISPVSESTTTSSPVSES 2024
             S  ++ +         +    L  E  +T S    +  +T+T     E     S   +S
Sbjct: 12   FSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESK-KKS 70

Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
                  + E     S          PK    T   P
Sbjct: 71   EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106


>gnl|CDD|235895 PRK06945, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 651

 Score = 35.0 bits (81), Expect = 0.61
 Identities = 30/137 (21%), Positives = 53/137 (38%), Gaps = 15/137 (10%)

Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
            S  T+   + +  +    S  +T +      T  IS  S S+    P+   TTT++ ++ 
Sbjct: 425  SVATTDGSAIAAASPVRASAGSTNTG-----TGAISQGSVSSGY--PLPSGTTTLTYDAA 477

Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT--------TSSPASEST 2085
            + T S   + +T T      ++ T  PA+  +  +S A  S          + +PA   T
Sbjct: 478  TGTLSGFPAGTTVTVAGTPPTSVTITPATTPVPYTSGAGISLVFNGVSVTLSGTPADGDT 537

Query: 2086 TTSSPASESTTTSSPAS 2102
             T  P +  T     A 
Sbjct: 538  FTIGPNTGGTNDGRNAL 554


>gnl|CDD|178677 PLN03131, PLN03131, hypothetical protein; Provisional.
          Length = 705

 Score = 34.8 bits (79), Expect = 0.73
 Identities = 28/184 (15%), Positives = 58/184 (31%), Gaps = 10/184 (5%)

Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
              T+ +P  +      L       +    +++  SS +     T      S    SPE  
Sbjct: 368  PATSPAPPVDLFEIPPLDPAPAINAYQPPQTSLPSSIDLFGGITQQQSINSLDEKSPEL- 426

Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS-PES 2032
                S P++E   T   +    +T   E+ +  +I P    +     V      +  P  
Sbjct: 427  ----SIPKNEGWATFDGIQPIASTPGNENLTPFSIGPSMAGSANFDQVPSLDKGMQWPPF 482

Query: 2033 ESTTTSSPASESTTTNNPKSESTTTNNPASESITS----SSPASESTTTSSPASESTTTS 2088
            ++++    AS               +N ++++  +     S A         +SE  T +
Sbjct: 483  QNSSDEESASGPAPWLGDLHNVEAPDNTSAQNWNAFEFDDSVAGIPLEGIKQSSEPQTAA 542

Query: 2089 SPAS 2092
            +   
Sbjct: 543  NMPP 546


>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
          Length = 592

 Score = 34.7 bits (79), Expect = 0.74
 Identities = 55/276 (19%), Positives = 104/276 (37%), Gaps = 55/276 (19%)

Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS---- 1929
            T   +  T   + LN L+S     N  + +S  T    S ST  SS  S +  TS+    
Sbjct: 277  TGTYATDTAYATKLNDLIS---RYNLTQYDSGKTTGGNSGSTGNSSNSSNTGNTSNAKIY 333

Query: 1930 -LVSESTTTSSPESESTTTSSPESESTTTSSL--------VSESTTTSSPESESTTTSSP 1980
             +V   +      +   T ++ ++ +   S          VS  +TTS   +   +T + 
Sbjct: 334  TVVKGDSLWRIANNHKVTVANLKAWNNLKSDFIYPGQKLKVSAGSTTSDTNTSKPSTGTS 393

Query: 1981 ESESTTTSSLVSESTTTSSPES------ESTTTIS-------------------PVSEST 2015
             S+ +T +S  ++  T    +S       +  TI+                    VS  +
Sbjct: 394  TSKPSTGTSTNAKVYTVVKGDSLWRIANNNKVTIANLKSWNNLKSDFIYPGQKLKVSAGS 453

Query: 2016 TTSSPVSESTTTISPESESTTTSSPASEST----------TTNNPKSEST--TTNNPASE 2063
            T+++  S+ +T  +    ST T++ A   T            NN  + +   + NN  S+
Sbjct: 454  TSNTNTSKPSTNTNTSKPSTNTNTNAKVYTVAKGDSLWRIANNNKVTIANLKSWNNLKSD 513

Query: 2064 SITSSS--PASESTTTSSPASESTTTSSPASESTTT 2097
             I        S  +TT++  +   +T+ P++ +  T
Sbjct: 514  FIYPGQKLKVSAGSTTNNTNTAKPSTNKPSNSTVKT 549



 Score = 34.3 bits (78), Expect = 0.90
 Identities = 24/118 (20%), Positives = 48/118 (40%), Gaps = 6/118 (5%)

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
            S   T  + E+  +  ++   E+  T++PE+ +  T+ P    T   +   E     + +
Sbjct: 51   SADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQ 110

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
             E        +E  T +NP   +T+++ PA+ ++   S      T  S       +SS
Sbjct: 111  VEK-----APAEPATVSNP-DNATSSSTPATYNLLQKSALRSGATVQSFIQTIQASSS 162



 Score = 33.9 bits (77), Expect = 1.4
 Identities = 31/145 (21%), Positives = 69/145 (47%), Gaps = 12/145 (8%)

Query: 1986 TTSSLVSESTTTSSPESE---STTTISPVSESTTTSSPVS--ESTTTISPESESTTTSSP 2040
            T + + + +T+ + P  E   S    +P  E++ ++   +  E+  T +PE+ +  T  P
Sbjct: 30   TIAGVTAIATSITVPGIEVIVSADETAPADEASKSAEANTTKEAPATATPENTTEPTVEP 89

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
                T     ++++     PA++ +      +E  T S+P   +T++S+PA+ +    S 
Sbjct: 90   KQTETKE---QTKTPEEKQPAAKQV--EKAPAEPATVSNP-DNATSSSTPATYNLLQKSA 143

Query: 2101 -ASESTTTSSPESESTTTSSPASES 2124
              S +T  S  ++   ++S  A+E+
Sbjct: 144  LRSGATVQSFIQTIQASSSQIAAEN 168



 Score = 33.1 bits (75), Expect = 2.0
 Identities = 23/133 (17%), Positives = 52/133 (39%), Gaps = 16/133 (12%)

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
            S   ++P  E++ +    +     ++   E+TT    + + T T               +
Sbjct: 51   SADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQT-----------K 99

Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE--QG 2131
            +     PA++        +E  T S+P   +T++S+P + +    S      T++   Q 
Sbjct: 100  TPEEKQPAAKQVEK--APAEPATVSNP-DNATSSSTPATYNLLQKSALRSGATVQSFIQT 156

Query: 2132 VSPHSEKLSANED 2144
            +   S +++A  D
Sbjct: 157  IQASSSQIAAEND 169



 Score = 33.1 bits (75), Expect = 2.3
 Identities = 27/109 (24%), Positives = 50/109 (45%), Gaps = 12/109 (11%)

Query: 2074 STTTSSPASE---STTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
            +T+ + P  E   S   ++PA E++ ++   +     ++   E+TT  +   + T  +EQ
Sbjct: 38   ATSITVPGIEVIVSADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQ 97

Query: 2131 GVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFD 2179
              +P  EK  A +  E+ P E       A + N D  N T  + P T++
Sbjct: 98   TKTP-EEKQPAAKQVEKAPAEP------ATVSNPD--NATSSSTPATYN 137



 Score = 32.0 bits (72), Expect = 5.1
 Identities = 22/124 (17%), Positives = 50/124 (40%), Gaps = 4/124 (3%)

Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
            +++ +  +  ++S + T  + E+  +   N   E+  T++PE+ +  T       T   +
Sbjct: 39   TSITVPGIEVIVSADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQT 98

Query: 1940 PESESTTTSSPESESTT----TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
               E    ++ + E       T S    +T++S+P + +    S      T  S +    
Sbjct: 99   KTPEEKQPAAKQVEKAPAEPATVSNPDNATSSSTPATYNLLQKSALRSGATVQSFIQTIQ 158

Query: 1996 TTSS 1999
             +SS
Sbjct: 159  ASSS 162



 Score = 31.6 bits (71), Expect = 5.8
 Identities = 15/85 (17%), Positives = 34/85 (40%), Gaps = 6/85 (7%)

Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
            S   ++P  E++ ++   +     ++   E+TT  + E + T T               +
Sbjct: 51   SADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQ 110

Query: 2024 ------STTTISPESESTTTSSPAS 2042
                     T+S    +T++S+PA+
Sbjct: 111  VEKAPAEPATVSNPDNATSSSTPAT 135


>gnl|CDD|114299 pfam05568, ASFV_J13L, African swine fever virus J13L protein.  This
            family consists of several African swine fever virus J13L
            proteins.
          Length = 189

 Score = 33.7 bits (76), Expect = 0.75
 Identities = 27/115 (23%), Positives = 49/115 (42%), Gaps = 8/115 (6%)

Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS-SPASESTTTNNPKSESTTTNN 2059
            E E    I+P  +     + V+       P   ST ++  P  +   TN   ++   TN 
Sbjct: 64   EEEDIQFINPYQDQQW--AEVTPQPGIAKPAGASTASAGKPVMDRPATNRLVADKPATNK 121

Query: 2060 PASES--ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
            P  ++  + +  PA+ S   S+ AS+    + PA   TT ++  + S T  + E+
Sbjct: 122  PVMDNLGMAAGGPAAASAPASAAASDP---AHPAELYTTATTQNTASQTMPADEN 173



 Score = 32.9 bits (74), Expect = 1.4
 Identities = 17/92 (18%), Positives = 35/92 (38%), Gaps = 7/92 (7%)

Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
             + PA  ST +    +     + PA+  + +  PA+      +          PA+ S  
Sbjct: 88   IAKPAGASTAS----AGKPVMDRPATNRLVADKPATNKPVMDNLG---MAAGGPAAASAP 140

Query: 2097 TSSPASESTTTSSPESESTTTSSPASESTTIE 2128
             S+ AS+    +   + +TT ++ +      E
Sbjct: 141  ASAAASDPAHPAELYTTATTQNTASQTMPADE 172


>gnl|CDD|227568 COG5243, HRD1, HRD ubiquitin ligase complex, ER membrane component
            [Posttranslational modification, protein turnover,
            chaperones].
          Length = 491

 Score = 34.6 bits (79), Expect = 0.84
 Identities = 31/125 (24%), Positives = 49/125 (39%), Gaps = 25/125 (20%)

Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
            SP   S    +    +T   NP +  TTT  P    IT+SS   +   ++     +  +S
Sbjct: 350  SPTPASPNVRN-TQIATQVPNPDNTPTTTAVPG---ITNSSNQGDPQASTFNGVPNANSS 405

Query: 2089 SPASESTTTSSPA--------------SESTTTSSPESESTTTSSPASEST-----TIEE 2129
              A+ +   SS                S+ST+T++P   +T T+   S ST     T   
Sbjct: 406  GFAAHTQDLSSVIPGWTMLPIPGTRRISQSTSTTNP--SATPTTGDPSNSTYGGPQTFPN 463

Query: 2130 QGVSP 2134
             G +P
Sbjct: 464  SGNNP 468



 Score = 32.6 bits (74), Expect = 2.9
 Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 6/109 (5%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
            SP   S    N    +T   +P++  TTT+       T SS + +   ++     +  +S
Sbjct: 350  SPTPASPNVRN-TQIATQVPNPDNTPTTTAV---PGITNSSNQGDPQASTFNGVPNANSS 405

Query: 1959 SLVSESTTTSSPESESTTTSSPESE--STTTSSLVSESTTTSSPESEST 2005
               + +   SS     T    P +   S +TS+    +T T+   S ST
Sbjct: 406  GFAAHTQDLSSVIPGWTMLPIPGTRRISQSTSTTNPSATPTTGDPSNST 454



 Score = 31.9 bits (72), Expect = 5.0
 Identities = 30/127 (23%), Positives = 52/127 (40%), Gaps = 11/127 (8%)

Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
            SS    S    +    +T   +P++  TTT+       T SS + +   ++     +  +
Sbjct: 349  SSPTPASPNVRN-TQIATQVPNPDNTPTTTAV---PGITNSSNQGDPQASTFNGVPNANS 404

Query: 1988 SSLVSESTTTSSPESESTTTISP----VSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
            S   + +   SS     T    P    +S+ST+T++P   +T T    S S T   P + 
Sbjct: 405  SGFAAHTQDLSSVIPGWTMLPIPGTRRISQSTSTTNP--SATPTTGDPSNS-TYGGPQTF 461

Query: 2044 STTTNNP 2050
              + NNP
Sbjct: 462  PNSGNNP 468


>gnl|CDD|227268 COG4932, COG4932, Predicted outer membrane protein [Cell envelope
            biogenesis, outer membrane].
          Length = 1531

 Score = 34.8 bits (80), Expect = 0.87
 Identities = 28/149 (18%), Positives = 47/149 (31%), Gaps = 18/149 (12%)

Query: 1850 LISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNN 1909
             IS + ATA          +I              ST   +   +   N    ++ + N 
Sbjct: 21   NISPVLATANDEKTETTTLKITKEDK---------STKEKINGSSFEKNKETGKTISLNI 71

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            P    TTT S            +   TT    + + T +  E   T+TS+   E      
Sbjct: 72   PSEGLTTTDSLLVGDYEVKEKSAGLGTTLDEATYNVTLALKEEVITSTSTKTQE------ 125

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTS 1998
               E T   +PE       + ++++  T 
Sbjct: 126  ---EKTEIVTPEPSKKKLKAEITDNIFTP 151



 Score = 32.5 bits (74), Expect = 3.6
 Identities = 37/182 (20%), Positives = 60/182 (32%), Gaps = 21/182 (11%)

Query: 1784 IIFTTNNNSESTVVMSTLNSLLSENEKLFKPHAKT----PGAEFLIQCQYCDFDSSMNLL 1839
            + FT   N E  V ++  N   + +  L K  + +     GAEF +       D   N+L
Sbjct: 1319 VNFTIEFNQEEAVKVTKENDAKTGSVVLTKLDSSSGVTLEGAEFELL------DEEGNIL 1372

Query: 1840 --SVSPYITNNLLISMLAA-------TAVAISVIDNYSEIIFTTNNNSES--TVVMSTLN 1888
               +       LL+  LA        T        + + + FT   N E    V  +   
Sbjct: 1373 KEGLVTDENGQLLVDDLAPGDYQFVETKAPTGYELDATPVDFTIEFNQEEALKVTKTNKL 1432

Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
             +  E++  N   +E  T N  E       +P         +  E T    P+ E    S
Sbjct: 1433 FIEFEDSIGNQLNAEEHTGNVGEEYVFKAKNPGHYKEGDQPITFEPTEPPKPDPEKRLDS 1492

Query: 1949 SP 1950
            + 
Sbjct: 1493 NN 1494



 Score = 32.5 bits (74), Expect = 3.8
 Identities = 26/146 (17%), Positives = 53/146 (36%), Gaps = 1/146 (0%)

Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
            T N  ++E+TT    + + +T       S   +    ++ + + P    TTT S      
Sbjct: 28   TANDEKTETTTLKITKEDKSTKEKINGSSFEKNKETGKTISLNIPSEGLTTTDSLLVGDY 87

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTISPVSES 2014
                  +   TT    + + T +  E   T+TS+   E  T   +PE       + ++++
Sbjct: 88   EVKEKSAGLGTTLDEATYNVTLALKEEVITSTSTKTQEEKTEIVTPEPSKKKLKAEITDN 147

Query: 2015 TTTSSPVSESTTTISPESESTTTSSP 2040
              T   + +     +  +      SP
Sbjct: 148  IFTPVTLKDGNGYEANTTNRIPNGSP 173



 Score = 32.1 bits (73), Expect = 4.5
 Identities = 23/124 (18%), Positives = 49/124 (39%), Gaps = 4/124 (3%)

Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
            T +  ++E+TT    + + +T   +   S   +    ++ + + P    TTT SL+    
Sbjct: 28   TANDEKTETTTLKITKEDKSTKEKINGSSFEKNKETGKTISLNIPSEGLTTTDSLLVGDY 87

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
                  +   TT+   +    T +   E  T+ S +++   T     E +     K ++ 
Sbjct: 88   EVKEKSAGLGTTLDE-ATYNVTLALKEEVITSTSTKTQEEKTEIVTPEPSKK---KLKAE 143

Query: 2056 TTNN 2059
             T+N
Sbjct: 144  ITDN 147


>gnl|CDD|114140 pfam05399, EVI2A, Ectropic viral integration site 2A protein (EVI2A).
             This family contains several mammalian ectropic viral
            integration site 2A (EVI2A) proteins. The function of
            this protein is unknown although it is thought to be a
            membrane protein and may function as an oncogene in
            retrovirus induced myeloid tumours.
          Length = 227

 Score = 33.5 bits (76), Expect = 0.95
 Identities = 26/114 (22%), Positives = 38/114 (33%), Gaps = 1/114 (0%)

Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
            TT  S    +TT  + +   +  I        TS    E+  TN P  E    + P +E 
Sbjct: 15   TTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEP 74

Query: 2065 ITSSSPASESTTTS-SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
             TS+     ST+ S      S   S        TS    E+      E  ++  
Sbjct: 75   QTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFKKEVCEENTSNF 128



 Score = 32.7 bits (74), Expect = 1.9
 Identities = 25/103 (24%), Positives = 38/103 (36%), Gaps = 1/103 (0%)

Query: 1985 TTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE- 2043
            TT  SL   +TT  +     +  I        TS  + E+  T  P  E    S+P +E 
Sbjct: 15   TTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEP 74

Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
             T+T  P   ++ +N     S  +S        TS    E+  
Sbjct: 75   QTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFK 117



 Score = 32.3 bits (73), Expect = 2.1
 Identities = 22/103 (21%), Positives = 37/103 (35%), Gaps = 1/103 (0%)

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE- 2013
            TT  SL   +TT  +     +           TS  + E+  T+ P  E     +P +E 
Sbjct: 15   TTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEP 74

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
             T+T  P+  ++ +      S   S        T+  K E+  
Sbjct: 75   QTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFK 117



 Score = 32.0 bits (72), Expect = 3.2
 Identities = 20/103 (19%), Positives = 35/103 (33%), Gaps = 1/103 (0%)

Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE- 2003
            TT  S    +TT  + +   +           TS    E+  T+    E    S+P +E 
Sbjct: 15   TTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEP 74

Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
             T+T  P+  ++ ++     S            TS    E+  
Sbjct: 75   QTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFK 117



 Score = 31.2 bits (70), Expect = 6.2
 Identities = 21/103 (20%), Positives = 39/103 (37%), Gaps = 3/103 (2%)

Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE-ST 2085
            ++S  + +  T   A  +    +     T+ N P  E+  ++ P  E    S+P +E  T
Sbjct: 19   SLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIP--ETNNTNPPTPEVNGKSTPTAEPQT 76

Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
            +T  P   ++ ++     S   S        TS    E+   E
Sbjct: 77   STPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFKKE 119



 Score = 30.8 bits (69), Expect = 8.2
 Identities = 24/115 (20%), Positives = 45/115 (39%), Gaps = 1/115 (0%)

Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
               +      +M+T+ SL    TT  +     +           TS    E+  T+    
Sbjct: 3    HKGHYLHLAFLMTTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTP 62

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES-ESTTTSSPESESTT 1986
            E    S+P +E  T++     ST+ S+  + S+  +SP++     TS  + E+  
Sbjct: 63   EVNGKSTPTAEPQTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFK 117


>gnl|CDD|110602 pfam01611, Filo_glycop, Filovirus glycoprotein.  This family includes
            an extracellular region from the envelope glycoprotein of
            Ebola and Marburg viruses. This region is also produced
            as a separate transcript that gives rise to a
            non-structural, secreted glycoprotein, which is produced
            in large amounts and has an unknown function. Processing
            of this protein may be involved in viral pathogenicity.
          Length = 364

 Score = 34.0 bits (78), Expect = 1.1
 Identities = 32/155 (20%), Positives = 61/155 (39%), Gaps = 15/155 (9%)

Query: 1866 NYSEIIFTTNNNSESTVVMS-------TLN-SLLSENTTTNSPESESTTTNNPESESTTT 1917
            N    +F  NN +   +  S        LN ++   NT +N+      T + P  +S   
Sbjct: 214  NEFGTLFEVNNTTYVQLDPSHTPQFLPQLNETIYLTNTLSNTTGKLIWTVD-PSIDSG-- 270

Query: 1918 SSPESESTTTSSLVSE---STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
             S E     T   V++   S+T  S  S S  T++   ++ T       S   S+  + +
Sbjct: 271  -SGEWAFWETKKNVTKQGQSSTCLSTPSLSPRTTNHSRQAVTELDKNRTSLQPSTNNTTT 329

Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
             +T++    + +T S+  ++ T  + +S  T    
Sbjct: 330  ISTNNTSKHNFSTQSIPLQNFTNDNSQSTLTENEQ 364



 Score = 33.3 bits (76), Expect = 1.9
 Identities = 35/191 (18%), Positives = 79/191 (41%), Gaps = 16/191 (8%)

Query: 1894 NTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
            N T +S     T+TN  ++ +   +          +L   + TT      S T   P+  
Sbjct: 190  NMTLDSTSYYWTSTNEYQTNNFGCN-------EFGTLFEVNNTTYVQLDPSHT---PQFL 239

Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
                 ++   +T +++      T   P  +S +      E+    + + +S+T +S  S 
Sbjct: 240  PQLNETIYLTNTLSNTTGKLIWTVD-PSIDSGSGEWAFWETKKNVTKQGQSSTCLSTPSL 298

Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
            S  T++   ++ T    E +   TS   S + TT    + +T+ +N +++SI   +  ++
Sbjct: 299  SPRTTNHSRQAVT----ELDKNRTSLQPSTNNTTTISTN-NTSKHNFSTQSIPLQNFTND 353

Query: 2074 STTTSSPASES 2084
            ++ ++   +E 
Sbjct: 354  NSQSTLTENEQ 364



 Score = 31.4 bits (71), Expect = 6.8
 Identities = 28/162 (17%), Positives = 58/162 (35%), Gaps = 12/162 (7%)

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
            S P  E    +   +    TS+   ++      E  +   ++  +      S   +    
Sbjct: 182  SRPGQEPRNMTLDSTSYYWTSTNEYQTNNFGCNEFGTLFEVNNTTYVQLDPSHTPQFLPQ 241

Query: 2028 ISPESESTTTSSPASESTT-TNNPKSEST-------TTNNPASESITSSSPASESTTTSS 2079
            ++     T T S  +     T +P  +S         T    ++   SS+  S  T + S
Sbjct: 242  LNETIYLTNTLSNTTGKLIWTVDPSIDSGSGEWAFWETKKNVTKQGQSSTCLS--TPSLS 299

Query: 2080 PASESTTTSSPAS--ESTTTSSPASESTTTSSPESESTTTSS 2119
            P + + +  +     ++ T+  P++ +TTT S  + S    S
Sbjct: 300  PRTTNHSRQAVTELDKNRTSLQPSTNNTTTISTNNTSKHNFS 341


>gnl|CDD|148635 pfam07139, DUF1387, Protein of unknown function (DUF1387).  This
            family represents a conserved region approximately 300
            residues long within a number of hypothetical proteins of
            unknown function that seem to be restricted to mammals.
          Length = 301

 Score = 33.8 bits (77), Expect = 1.1
 Identities = 27/143 (18%), Positives = 49/143 (34%), Gaps = 8/143 (5%)

Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTN----NPASESITSSSPASESTTTSSPASEST 2085
            PE+ + + S   +       P  E    N    N +++   S    SE   ++S  +   
Sbjct: 18   PEAPAKSASKEETTPEEQAAPGDEKDEVNGFHANGSADDTESVDSLSEGLDSASLDAREP 77

Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN-ED 2144
               +        S  +S +   S  +S+    SSP S +          +++K  +    
Sbjct: 78   EAVTL---DAPPSPSSSLTNGLSDLQSKLELQSSPHSSAKPHPSSDQHKNAKKYVSKPSQ 134

Query: 2145 PEEFPNEDVFEHTFAEIPNIDHS 2167
            P    N    +   A  PNI+ S
Sbjct: 135  PVTPNNSAHHDAPAALGPNIEKS 157



 Score = 32.7 bits (74), Expect = 2.6
 Identities = 29/153 (18%), Positives = 63/153 (41%), Gaps = 14/153 (9%)

Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST-----TTSSPESESTTTSSLVS 1992
            S P+ E+   S+ + E+T       E       E +         S+ ++ES  + S   
Sbjct: 14   SKPKPEAPAKSASKEETT------PEEQAAPGDEKDEVNGFHANGSADDTESVDSLSEGL 67

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
            +S +  + E E+ T  +P S S++ ++ +S+  + +  +S   +++ P   S    N K 
Sbjct: 68   DSASLDAREPEAVTLDAPPSPSSSLTNGLSDLQSKLELQSSPHSSAKPHPSSDQHKNAKK 127

Query: 2053 ESTTTNNPASESITSSSPASESTTTSSPASEST 2085
              +    P+     ++S   ++     P  E +
Sbjct: 128  YVS---KPSQPVTPNNSAHHDAPAALGPNIEKS 157


>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
            Provisional.
          Length = 291

 Score = 33.6 bits (76), Expect = 1.1
 Identities = 50/227 (22%), Positives = 83/227 (36%), Gaps = 26/227 (11%)

Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS--PESEST 1975
            S PES+   TSS  ++     +  ++  +   P  E        SE         E E  
Sbjct: 41   SPPESKGLETSSQGTQDLKGGAAGAKENSPPLPTEEDDEDVDDDSEEGDDDDGGAEDEEE 100

Query: 1976 TTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE----STTTISPE 2031
                 +S    T +L S ST      SE  T +S  S  + + S   E     T T    
Sbjct: 101  EKVRGQSGQEGTVALGSGSTEKKLIGSEKQTELSISSAESISPSGSRELNVNLTQTEVEG 160

Query: 2032 SESTTTSSPASEST-TTNNPKS-------ESTTTNNPASESITSSSPASESTTT------ 2077
             + T  ++PA E+  TT N ++       E   +  P  + I S     E TT+      
Sbjct: 161  KKETDKNTPAVENPLTTGNGENTLPAGIVEGNPSPPPPQDGIHSREQDGEGTTSEGQKNV 220

Query: 2078 ------SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
                  ++P S     S    E T  ++  + +T T++ ++   +T+
Sbjct: 221  PLPETAATPQSHHDKGSEGTGEDTKATTVTANTTDTTNTQNSDGSTA 267



 Score = 33.2 bits (75), Expect = 1.4
 Identities = 41/168 (24%), Positives = 66/168 (39%), Gaps = 19/168 (11%)

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESE----STTTSSLVSESTTTSSPESESTTT 2007
            S ST    + SE  T  S  S  + + S   E     T T     + T  ++P  E+  T
Sbjct: 117  SGSTEKKLIGSEKQTELSISSAESISPSGSRELNVNLTQTEVEGKKETDKNTPAVENPLT 176

Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
                +   T  + + E   +  P  +   +     E TT+   K      N P  E  T+
Sbjct: 177  T--GNGENTLPAGIVEGNPSPPPPQDGIHSREQDGEGTTSEGQK------NVPLPE--TA 226

Query: 2068 SSPASESTTTSSPASEST-TTSSPASESTTTSSPASESTT----TSSP 2110
            ++P S     S    E T  T+  A+ + TT++  S+ +T    T+SP
Sbjct: 227  ATPQSHHDKGSEGTGEDTKATTVTANTTDTTNTQNSDGSTAVSHTTSP 274



 Score = 32.1 bits (72), Expect = 3.8
 Identities = 28/118 (23%), Positives = 52/118 (44%), Gaps = 5/118 (4%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
            T T     + T  N P  E+  T+    E+T  + +V E   +  P  +   +   + E 
Sbjct: 154  TQTEVEGKKETDKNTPAVENPLTTG-NGENTLPAGIV-EGNPSPPPPQDGIHSREQDGEG 211

Query: 1955 TTTSSL--VSESTTTSSPESESTTTSSPESESTTTSSLVSEST-TTSSPESESTTTIS 2009
            TT+     V    T ++P+S     S    E T  +++ + +T TT++  S+ +T +S
Sbjct: 212  TTSEGQKNVPLPETAATPQSHHDKGSEGTGEDTKATTVTANTTDTTNTQNSDGSTAVS 269


>gnl|CDD|191179 pfam05053, Menin, Menin.  MEN1, the gene responsible for multiple
            endocrine neoplasia type 1, is a tumour suppressor gene
            that encodes a protein called Menin which may be an
            atypical GTPase stimulated by nm23.
          Length = 618

 Score = 34.2 bits (78), Expect = 1.2
 Identities = 19/128 (14%), Positives = 46/128 (35%), Gaps = 11/128 (8%)

Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
            + +      PE E+  +   A E       +        P  ES +      ES     P
Sbjct: 451  IRQKVVIKLPEKEAKESKEAAGEEAREGRRRG-------PRRESKSQEPSGGESPNPELP 503

Query: 2081 A-SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG---VSPHS 2136
            A + ++ +++  +        A+ +   ++  + S T+      S   + +    ++ +S
Sbjct: 504  ANNNNSNSNNNNNNGADRKEAAATTGNATTTSNGSGTSVPLPVSSEPPQHKEGPVITFYS 563

Query: 2137 EKLSANED 2144
            EK+   ++
Sbjct: 564  EKMKGMKE 571


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 33.8 bits (77), Expect = 1.2
 Identities = 27/184 (14%), Positives = 55/184 (29%), Gaps = 12/184 (6%)

Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
            E   + + E E+T   S + +   +    + + T ++    + +  +    +  T     
Sbjct: 490  EEQESLAKELEATAHKSEIKKIAESEFKNAGAKTAAANIEPNCSADA---AAPATKRARP 546

Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
            E+ T +  V         +       S  S +   ++     T      +  T       
Sbjct: 547  ETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTAEALAGAIETLLTQ--- 603

Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
                S+ PA  S       +     S+PAS     +        TS+P  E++       
Sbjct: 604  ---ASAQPAGLSLPA---PAVPVNASTPASTPPPLAPQEPPQPGTSAPSLETSLPQQKPV 657

Query: 2123 ESTT 2126
             S  
Sbjct: 658  LSKG 661



 Score = 32.7 bits (74), Expect = 3.4
 Identities = 27/179 (15%), Positives = 60/179 (33%), Gaps = 14/179 (7%)

Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
            E   + +   E+T   S   +   +    + + T ++ +  + +  +    +  T     
Sbjct: 490  EEQESLAKELEATAHKSEIKKIAESEFKNAGAKTAAANIEPNCSADAA---APATKRARP 546

Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN-NPASESITSSSPA 2071
            E+ T    V      I          S  S +      +   T      A E++ + +  
Sbjct: 547  ETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTAEALAGAIETLLTQA-- 604

Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
                 ++ PA  S       +     S+PAS     +  E     TS+P+ E++  +++
Sbjct: 605  -----SAQPAGLSLPA---PAVPVNASTPASTPPPLAPQEPPQPGTSAPSLETSLPQQK 655



 Score = 31.1 bits (70), Expect = 9.3
 Identities = 20/132 (15%), Positives = 31/132 (23%), Gaps = 10/132 (7%)

Query: 2023 ESTTTISPESESTTTSSPAS--ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
            ++ +  +P       +  A      T   P              I  S PA    T   P
Sbjct: 349  KTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDG---IPYSVPARSPMTAYPP 405

Query: 2081 ASESTTTSSPASES-----TTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
              +        S        T+  P         P +      S A+       Q     
Sbjct: 406  VPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHPQEHGHE 465

Query: 2136 SEKLSANEDPEE 2147
             ++    E  EE
Sbjct: 466  RKRKRGGELKEE 477


>gnl|CDD|111090 pfam02158, Neuregulin, Neuregulin family. 
          Length = 406

 Score = 33.7 bits (76), Expect = 1.3
 Identities = 56/276 (20%), Positives = 103/276 (37%), Gaps = 23/276 (8%)

Query: 1865 DNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESE- 1923
            ++ S I+ ++  NS  +         L  +     P+  S   +  ++  +   SP SE 
Sbjct: 136  ESNSVIMMSSVENSRHSSPAGGPRGRL--HGIGGPPDDCSFLRHARDTPDSYRDSPHSER 193

Query: 1924 ---STTTSSLVS--ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
               + TT + +S  +  T  SP+S     S PES    +   V+ S      E      S
Sbjct: 194  YVSAMTTPARMSPVDFHTPISPKSPCLEMSPPESSLAVSMPSVAVSPFIEE-ERPLLLVS 252

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
             P            + T     +  ++   +P  +S  +S P +        E E+T   
Sbjct: 253  PPRLREKKYDHKTPQKT---HHKQHNSFHHNPAHDS--SSLPPNPLRIVEDEEYETTQEY 307

Query: 2039 SPASEST--TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST- 2095
             P+ E     TN+ +++ T  N   +  +   S     +++ S  SES T      E T 
Sbjct: 308  EPSLEPAKKLTNSRRAKRTKPNGHIANRLELDS----DSSSESSNSESETEDERIGEDTP 363

Query: 2096 --TTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
                 +P + S  ++     + + ++PA   +T EE
Sbjct: 364  FLGIQNPLAASLESAPAFRHADSRTNPAGRFSTQEE 399



 Score = 33.3 bits (75), Expect = 1.9
 Identities = 38/164 (23%), Positives = 64/164 (39%), Gaps = 18/164 (10%)

Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
            E+E++ ++S  + +   S+ V+++ + S     S + IS  S S    S V  S      
Sbjct: 96   ETETSFSTSHYTSTAHHSTTVTQTPSHSWSNGHSESMISEESNSVIMMSSVENS------ 149

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITS--SSPASES--TTTSSPASESTT 2086
               S+    P         P  + +   + A ++  S   SP SE   +  ++PA  S  
Sbjct: 150  -RHSSPAGGPRGRLHGIGGPPDDCSFLRH-ARDTPDSYRDSPHSERYVSAMTTPARMSPV 207

Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
                  +  T  SP S     S PES    +    + S  IEE+
Sbjct: 208  ------DFHTPISPKSPCLEMSPPESSLAVSMPSVAVSPFIEEE 245


>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D.  SpoVID, the
            stage VI sporulation protein D, is restricted to
            endospore-forming members of the bacteria, all of which
            are found among the Firmicutes. It is widely distributed
            but not quite universal in this group. Between
            well-conserved N-terminal and C-terminal domains is a
            poorly conserved, low-complexity region of variable
            length, rich enough in glutamic acid to cause spurious
            BLAST search results unless a filter is used. The seed
            alignment for this model was trimmed, in effect, by
            choosing member sequences in which these regions are
            relatively short. SpoVID is involved in spore coat
            assembly by the mother cell compartment late in the
            process of sporulation [Cellular processes, Sporulation
            and germination].
          Length = 338

 Score = 33.3 bits (76), Expect = 1.4
 Identities = 20/115 (17%), Positives = 32/115 (27%)

Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
            S S     PA E T      ++       A E     + +       S +       E  
Sbjct: 156  SFSAEFEHPAQEETAGEEERTDEPKVEHEAHEQHEQPADDDPDEWKISASEPFQLESEVE 215

Query: 2132 VSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREEWPQ 2186
             SP  E     ED  E   ED  +    +  +    +       +  +  EE  +
Sbjct: 216  ASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEEEKGE 270


>gnl|CDD|217490 pfam03318, ETX_MTX2, Clostridium epsilon toxin ETX/Bacillus
            mosquitocidal toxin MTX2.  This family appears to be
            distantly related to pfam01117.
          Length = 228

 Score = 33.2 bits (76), Expect = 1.4
 Identities = 46/196 (23%), Positives = 73/196 (37%), Gaps = 12/196 (6%)

Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSP-ESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
            TN          T+    +   T        +T TNN +S  T  +   S+  TT    +
Sbjct: 3    TNGFPSYINFNVTVLDEETTVKTLTPLYTGSNTLTNNTDSTQTLQTQSFSKKVTT----T 58

Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
             STTT+         S            V+E   T S   E   +S+  + ++TT +  +
Sbjct: 59   TSTTTTHGFKIGAKAS-----GKFGIPFVAEGGITLSVTGEYNFSSTTTNTTSTTETYTA 113

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
             S   + P   +T T++     TT S PV +  TT+S     +  SS +        P S
Sbjct: 114  PSQKVTVP-PHTTVTVTLYLYKTTYSGPV-DLYTTLSGTFFISIVSSVSFTRDGYVEPAS 171

Query: 2053 ESTTTNNPASESITSS 2068
               T + P  ++I  S
Sbjct: 172  YVLTASWPLYDTIFLS 187


>gnl|CDD|117051 pfam08474, MYT1, Myelin transcription factor 1.  This domain is found
            in the myelin transcription factor 1 (MYT1) of chordates.
            MYT1 contains C2HC zinc finger domains (pfam01530) and is
            expressed in developing neurons of the central nervous
            system where it is involved in the selection of neuronal
            precursor cells.
          Length = 257

 Score = 33.2 bits (75), Expect = 1.5
 Identities = 35/185 (18%), Positives = 57/185 (30%), Gaps = 11/185 (5%)

Query: 1968 SSPESESTTTSSPESES-----TTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
             SPES   ++    S S       T S V  S+     +SE+    +  +     S+   
Sbjct: 65   PSPESSHFSSYVKSSSSLPSAGAHTQSTVRASSFDYGQDSEAAHMAA--TAILNLSTRCR 122

Query: 2023 ESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS 2082
            E    +S + +         E    N     S   N    +SI  +S  +   T SSP S
Sbjct: 123  EMPDNLSTKPQDLRAKGADIE-VDENGTLDLSMKKNRIRDKSIPPTSSCTTIATPSSPMS 181

Query: 2083 ESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
                +S   + +        +      P   +        ES   +   + P  E L   
Sbjct: 182  PQKASSLLVNAA---FYQLCDQDGWDVPIDYTKPHRKTEEESKEKDPVNLDPSLENLEEK 238

Query: 2143 EDPEE 2147
            +   E
Sbjct: 239  KFAGE 243


>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN.  FtsN is a poorly
            conserved protein active in cell division in a number of
            Proteobacteria. The N-terminal 30 residue region tends to
            by Lys/Arg-rich, and is followed by a membrane-spanning
            region. This is followed by an acidic low-complexity
            region of variable length and a well-conserved C-terminal
            domain of two tandem regions matched by pfam05036
            (Sporulation related repeat), found in several cell
            division and sporulation proteins. The role of FtsN as a
            suppressor for other cell division mutations is poorly
            understood; it may involve cell wall hydrolysis [Cellular
            processes, Cell division].
          Length = 298

 Score = 33.1 bits (75), Expect = 1.5
 Identities = 27/190 (14%), Positives = 52/190 (27%), Gaps = 20/190 (10%)

Query: 1890 LLSENTTTNSPESE--STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1947
            LL+E+   N PE+      T N E+ +     PE   +       E              
Sbjct: 47   LLTESKQANEPETLQPKNQTENGETAADLPPKPEERWS-----YIEELEAREVLINDPEE 101

Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
             S       ++ L +E                 +  +       + S  T + E+   T 
Sbjct: 102  PSNGGGVEESAQLTAEQRQLLEQM-------QADMRAAEKVLATAPSEQTVAVEARKQTA 154

Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
                 ++ T              E+E   +    ++      PK  + T +N        
Sbjct: 155  EKKPQKARTA------EAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAP 208

Query: 2068 SSPASESTTT 2077
             +  ++ T  
Sbjct: 209  KADKADKTKP 218


>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048).  This
            presumed domain is functionally uncharacterized. This
            domain family is found in eukaryotes, and is typically
            between 228 and 257 amino acids in length.
          Length = 242

 Score = 32.8 bits (75), Expect = 1.6
 Identities = 17/81 (20%), Positives = 31/81 (38%), Gaps = 5/81 (6%)

Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE-- 2128
            A+ES T   P S  + + S +         +  S+ +S   +    TS   S+S  I+  
Sbjct: 116  ATESRTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVS 175

Query: 2129 ---EQGVSPHSEKLSANEDPE 2146
               E G+    +K  + +   
Sbjct: 176  FWSEFGIDTPGQKSKSPQKAS 196



 Score = 32.4 bits (74), Expect = 2.6
 Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 7/105 (6%)

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST-TTSSP 2090
            +ES T   P S  + + +           +  S  SS  ++    TS   S+S     S 
Sbjct: 117  TESRTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVSF 176

Query: 2091 ASE------STTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
             SE         + SP   S+T +   ++  + ++ +S    +++
Sbjct: 177  WSEFGIDTPGQKSKSPQKASSTPAGNTNQGQSQNAQSSNLLDVDD 221



 Score = 30.9 bits (70), Expect = 6.6
 Identities = 23/107 (21%), Positives = 45/107 (42%), Gaps = 3/107 (2%)

Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST-TTSS 2099
            A+ES T   P+S  + + + +   +     +  S+ +S  ++    TS   S+S     S
Sbjct: 116  ATESRTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVS 175

Query: 2100 PASE-STTTSSPESEST-TTSSPASESTTIEEQGVSPHSEKLSANED 2144
              SE    T   +S+S    SS  + +T   +   +  S  L  +++
Sbjct: 176  FWSEFGIDTPGQKSKSPQKASSTPAGNTNQGQSQNAQSSNLLDVDDN 222


>gnl|CDD|184920 PRK14956, PRK14956, DNA polymerase III subunits gamma and tau;
            Provisional.
          Length = 484

 Score = 33.4 bits (76), Expect = 1.6
 Identities = 14/74 (18%), Positives = 25/74 (33%)

Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
            + N PE                 + + +  TTS     S   S+ +  +   +   S+S 
Sbjct: 392  SKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLPSKSE 451

Query: 1956 TTSSLVSESTTTSS 1969
            + S   S    TS+
Sbjct: 452  SQSEPPSSKFDTST 465



 Score = 33.4 bits (76), Expect = 1.6
 Identities = 22/106 (20%), Positives = 35/106 (33%), Gaps = 15/106 (14%)

Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
            + N P+             +   +   +  TTS     S   S+    +   +   S+S 
Sbjct: 392  SKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLPSKSE 451

Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE-DPEEFPN 2150
            + S P       SS    ST I+        +K    E DP +FP 
Sbjct: 452  SQSEP------PSSKFDTSTEIK--------KKFLGTEVDPNQFPK 483



 Score = 33.0 bits (75), Expect = 2.0
 Identities = 14/78 (17%), Positives = 28/78 (35%)

Query: 1916 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
            + + PE                 + + +  TTS  +  S   S+    +   +   S+S 
Sbjct: 392  SKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLPSKSE 451

Query: 1976 TTSSPESESTTTSSLVSE 1993
            + S P S    TS+ + +
Sbjct: 452  SQSEPPSSKFDTSTEIKK 469



 Score = 33.0 bits (75), Expect = 2.2
 Identities = 14/82 (17%), Positives = 27/82 (32%)

Query: 1882 VVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPE 1941
            +V  + N                      + +  TTS  +  S   S+    +   +   
Sbjct: 388  MVQGSKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLP 447

Query: 1942 SESTTTSSPESESTTTSSLVSE 1963
            S+S + S P S    TS+ + +
Sbjct: 448  SKSESQSEPPSSKFDTSTEIKK 469



 Score = 31.1 bits (70), Expect = 8.1
 Identities = 16/74 (21%), Positives = 27/74 (36%)

Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1965
            + N PE                 +   +  TTS  +  S   S+ +  +   + L S+S 
Sbjct: 392  SKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLPSKSE 451

Query: 1966 TTSSPESESTTTSS 1979
            + S P S    TS+
Sbjct: 452  SQSEPPSSKFDTST 465


>gnl|CDD|171664 PRK12688, PRK12688, flagellin; Reviewed.
          Length = 751

 Score = 33.7 bits (77), Expect = 1.7
 Identities = 36/247 (14%), Positives = 99/247 (40%), Gaps = 13/247 (5%)

Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
            ++T +N  +T+  +T + L    +  ++  S +   +     +T  +   +   T  SL 
Sbjct: 105  YSTKSNVSTTISGATADDLRGTTSYASATASSNVLYDGAAGGATAATGATTLGGTAGSLA 164

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST--------TTSSPESE 1983
                T     +  T T +  + + TT++ +  +   +  ++ +         + ++P S 
Sbjct: 165  GTGATAGDGTTALTGTITLIATNGTTATGLLGNAQPADGDTLTVNGKTITFRSGAAPAST 224

Query: 1984 STTTSSLVSESTTT----SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
            +  + S VS +  T    +S     + T++ +  +   +S V ++ T  S  +    ++S
Sbjct: 225  AVPSGSGVSGNLVTDGNGNSTVYLGSATVNDLLSAIDLASGV-QTVTISSGAATIAVSAS 283

Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
              + S       +  ++T    S +  +    +   TT++ A  +T  ++  + + +  +
Sbjct: 284  GGAVSAAAAGAVTLKSSTGADLSVTGKADLLKALGLTTATGAGNATVNANRTTSAGSLGA 343

Query: 2100 PASESTT 2106
               + +T
Sbjct: 344  LIQDGST 350


>gnl|CDD|216289 pfam01080, Presenilin, Presenilin.  Mutations in presenilin-1 are a
            major cause of early onset Alzheimer's disease. It has
            been found that presenilin-1 binds to beta-catenin
            in-vivo. This family also contains SPE proteins from
            C.elegans.
          Length = 403

 Score = 33.3 bits (76), Expect = 1.7
 Identities = 27/93 (29%), Positives = 39/93 (41%), Gaps = 3/93 (3%)

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST-TTSSPESES 2114
            +     +E   S+   +  +T S+   +S  TS    E    SS    S   + S E+ES
Sbjct: 230  SNQEETNEGTPSTIRRTSKSTRSAANPDSAPTSHSTLELPEKSSTPELSDDESDSSETES 289

Query: 2115 TTTSSPASESTTIEEQGVSPHSEKLSANEDPEE 2147
             + SS A E    E+  V   S  L +NE  EE
Sbjct: 290  QSDSSLAPEEDAAEQPEVQ--SNSLPSNEKREE 320



 Score = 32.5 bits (74), Expect = 2.8
 Identities = 31/102 (30%), Positives = 50/102 (49%), Gaps = 6/102 (5%)

Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
            STVV+ T+ S   E T   +P +   T+      + + ++P+S  T+ S+L      +S+
Sbjct: 221  STVVVLTVGSN-QEETNEGTPSTIRRTSK----STRSAANPDSAPTSHSTL-ELPEKSST 274

Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
            PE     + S E+ES + SSL  E      PE +S +  S E
Sbjct: 275  PELSDDESDSSETESQSDSSLAPEEDAAEQPEVQSNSLPSNE 316



 Score = 31.3 bits (71), Expect = 6.7
 Identities = 19/69 (27%), Positives = 30/69 (43%), Gaps = 1/69 (1%)

Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST-TTSSPESES 1984
            +     +E T ++   +  +T S+   +S  TS    E    SS    S   + S E+ES
Sbjct: 230  SNQEETNEGTPSTIRRTSKSTRSAANPDSAPTSHSTLELPEKSSTPELSDDESDSSETES 289

Query: 1985 TTTSSLVSE 1993
             + SSL  E
Sbjct: 290  QSDSSLAPE 298


>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family.  This protein is found
            in bacteria. Proteins in this family are typically
            between 234 to 465 amino acids in length. There is a
            conserved GEV sequence motif.Most members are annotated
            as being SprA-related.
          Length = 261

 Score = 32.8 bits (75), Expect = 1.8
 Identities = 16/88 (18%), Positives = 42/88 (47%), Gaps = 4/88 (4%)

Query: 2064 SITSSSPASESTTTSSP---ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
            +I++S  +  S +++     A   T T + A  + ++S  ++  + +S  +++    +S 
Sbjct: 1    NISNSLSSISSGSSAPIGTSALRGTNTPAAAKPAPSSSEASNAGSGSSEQKAKLKGQAST 60

Query: 2121 ASESTTIEEQGVSP-HSEKLSANEDPEE 2147
            A+ S + E Q  +   +++    E+  E
Sbjct: 61   AAGSASQELQKQASESNDEEVVGEEEPE 88


>gnl|CDD|219081 pfam06546, Vert_HS_TF, Vertebrate heat shock transcription factor.
            This family represents the C-terminal region of
            vertebrate heat shock transcription factors. Heat shock
            transcription factors regulate the expression of heat
            shock proteins - a set of proteins that protect the cell
            from damage caused by stress and aid the cell's recovery
            after the removal of stress. This C-terminal region is
            found with the N-terminal pfam00447, and may contain a
            three-stranded coiled-coil trimerisation domain and a CE2
            regulatory region, the latter of which is involved in
            sustained heat shock response.
          Length = 252

 Score = 32.8 bits (74), Expect = 1.8
 Identities = 18/96 (18%), Positives = 35/96 (36%)

Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
            +++ S + +S  ++ P     T    +S   +      E   +SSP           S+S
Sbjct: 3    SSSGSYSPDSVASSGPIISDVTELAESSPVASPDGSIEERAVSSSPLVRIKEEPPSPSQS 62

Query: 2095 TTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
               S     S    +P S +T   S  +E   + ++
Sbjct: 63   PEQSEAVPGSDLVDTPLSPTTFIDSILNEEEPVSQE 98



 Score = 31.2 bits (70), Expect = 5.3
 Identities = 21/94 (22%), Positives = 35/94 (37%)

Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
            +++ S   +S  +S P     T  +  S   +    + E   +SSP           S+S
Sbjct: 3    SSSGSYSPDSVASSGPIISDVTELAESSPVASPDGSIEERAVSSSPLVRIKEEPPSPSQS 62

Query: 1985 TTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
               S  V  S    +P S +T   S ++E    S
Sbjct: 63   PEQSEAVPGSDLVDTPLSPTTFIDSILNEEEPVS 96


>gnl|CDD|185219 PRK15319, PRK15319, AIDA autotransporter-like protein ShdA;
            Provisional.
          Length = 2039

 Score = 33.5 bits (76), Expect = 1.9
 Identities = 61/292 (20%), Positives = 120/292 (41%), Gaps = 50/292 (17%)

Query: 1865 DNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSP--ESESTTTNNPESESTTTSSPES 1922
              Y ++   T +++  T     L+  +  +TT N P   ++ST T     +  + ++ ++
Sbjct: 341  TGYEDLNALTVSDANVTSDTVALH--VDGSTTINDPIELTDSTFTAPTAIKLGSKATIQA 398

Query: 1923 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1982
            E+TT +  + ++  +SS  S S         ST T S+ +  TT S  ++     + P  
Sbjct: 399  ENTTLTGNIVQTDASSSSLSLS-------QGSTLTGSVDAMFTTLSLDDTSQWNMTDP-- 449

Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSES--------------TTTSSPV------- 2021
              +T  +L ++   T    S ST T+  V  +              T  SSP+       
Sbjct: 450  --STVGNLTNDGDITLGNASGSTGTLLTVDNTLTLQDGSQINATLDTANSSPIIKAANVT 507

Query: 2022 -------SESTTTISPESES-----TTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
                   S + T ++PE++      T   S  + +T  ++   ++ T+  P   +I +  
Sbjct: 508  LDGTLNLSSTATFVAPETDEHFGSITLIDSQTAITTDFDSVTLDADTSAMPDYLTINAGV 567

Query: 2070 PASESTT--TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
             A+++T    S+  S     +S  +   T +  A  + T +S   E+T TS+
Sbjct: 568  DANDNTNYELSTGLSWYAGANSARAAHGTFTVDAGSTFTVTSELDETTATSN 619


>gnl|CDD|219210 pfam06873, SerH, Cell surface immobilisation antigen SerH.  This
            family consists of several cell surface immobilisation
            antigen SerH proteins which seem to be specific to
            Tetrahymena thermophila. The SerH locus of Tetrahymena
            thermophila is one of several paralogous loci with genes
            encoding variants of the major cell surface protein known
            as the immobilisation antigen (i-ag).
          Length = 407

 Score = 33.4 bits (76), Expect = 1.9
 Identities = 41/208 (19%), Positives = 86/208 (41%), Gaps = 17/208 (8%)

Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS---SPESES 1984
            ++ V+ S + S+    + T S     + TT + VS +    S  +   T S   +  + +
Sbjct: 100  TACVASSASCSNRRRGAWTDSDCTLCNPTTPAAVSGACQACSSITSGWTDSNCNACATTA 159

Query: 1985 TTTSSLVSESTTTSS--PESESTTTISPVSESTTTS--SPVSESTTTISPESESTTTSSP 2040
            +  +S V  ++  S+    S S  + S  + + T +     + +T  +  +  S   SS 
Sbjct: 160  SPKNSNVFANSAGSACVASSASCGSTSRGTTAWTDADCLLCNPTTPYLVGDKSSCAASSC 219

Query: 2041 ASESTTTN----------NPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
            A+ S++T+          N  +  +T N  A+ + +S   +S S  +SS  + + T    
Sbjct: 220  AACSSSTSGWTDSDCNACNTTASPSTKNIFANAAGSSCVASSASCGSSSRGTTAWTDGDC 279

Query: 2091 ASESTTTSSPASESTTTSSPESESTTTS 2118
               + +T +  + S  +S     S T+ 
Sbjct: 280  TLCTPSTPAVYASSDGSSCVACSSITSG 307



 Score = 32.2 bits (73), Expect = 3.6
 Identities = 46/230 (20%), Positives = 82/230 (35%), Gaps = 16/230 (6%)

Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES- 1972
            S  +++  +    T S  S ST++       T  S       TT ++   S  TS P + 
Sbjct: 18   SVISATAGNNVQCTGSGNSCSTSSCCTVPTITGCSWGTGTDATTCAITDCSCLTSGPATG 77

Query: 1973 ------ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
                  +S   S+    + +  +    S+ + S       T S  +    T+        
Sbjct: 78   LTDLFCQSCKGSNQNVFANSAGTACVASSASCSNRRRGAWTDSDCTLCNPTTPAAVSGAC 137

Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNN------PASESITSSSPASESTTTSSP 2080
                   S  T S  +   TT +PK+ +   N+       +S S  S+S  + + T +  
Sbjct: 138  QACSSITSGWTDSNCNACATTASPKNSNVFANSAGSACVASSASCGSTSRGTTAWTDADC 197

Query: 2081 ASESTTT---SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
               + TT       S    +S  A  S+T+   +S+    ++ AS ST  
Sbjct: 198  LLCNPTTPYLVGDKSSCAASSCAACSSSTSGWTDSDCNACNTTASPSTKN 247


>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
            Provisional.
          Length = 1164

 Score = 33.4 bits (76), Expect = 1.9
 Identities = 53/343 (15%), Positives = 106/343 (30%), Gaps = 44/343 (12%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
            ++N TT    ++    N   S S++ SS  +  +  SS  S  +  S+  S   +    +
Sbjct: 105  NDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLSNSLSSKHSPKVIK 164

Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
               +T  ++ S+    +SP ++  +    + ++  T +         SP + ST      
Sbjct: 165  ENQSTHVNISSD----NSPRNKEISNKQLKKQTNVTHT-TCYDKMRRSPRNTST---IKN 216

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASE---------------STTTNNPKSESTT 2056
            + +        E    I  + +    +   SE               S T  N K E   
Sbjct: 217  NTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNV 276

Query: 2057 TNNPASESITSSSPASESTTTSSPAS-----------ESTTTSSPASESTTTSSPASEST 2105
             +   S  +      S+     S                         S + +   S S 
Sbjct: 277  VSGIRSSVLLKRK--SQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSNNDDTSSSN 334

Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPN-----EDVFEHTFAE 2160
               S    +   SSP  + TT  +   +  + K +  +  ++F +       + + +   
Sbjct: 335  HLGSNRISNRNPSSPYKKQTTT-KHTNNTKNNKYNKTKTTQKFNHPLRHHATINKRSSML 393

Query: 2161 IPNIDHSNQTDEAIPE--TFDAREEWPQCKDVIGKVWDQGACQ 2201
              +        E       F   E     KD   K+ ++ +CQ
Sbjct: 394  PMSEQKGRGASEKSEYIKEFTMEEVAKLTKDTTIKLVEENSCQ 436



 Score = 33.0 bits (75), Expect = 2.3
 Identities = 38/158 (24%), Positives = 67/158 (42%), Gaps = 29/158 (18%)

Query: 2084 STTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
             TTTSS A   + T +  ++S T+ + E  ST+      + +        PH      N 
Sbjct: 670  QTTTSSKAKTHSKTKNDHNKSKTSKNKEPSSTSFLQDVKKKSD-------PH------NV 716

Query: 2144 DPEEFPNEDVFEHTFAEIPNIDHSNQTDEAI--------PETFDAREEWPQCKDVIG--- 2192
            D + F  +D   +    + NI  ++ TD+AI        P+    RE+  + K+V G   
Sbjct: 717  DFKSFIKQDQENYYVNLLRNI--TDPTDKAIRMMQLDVVPKYLPCREK--EIKEVHGFLE 772

Query: 2193 -KVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQER 2229
              +   G+ Q  ++S  P T     ++S I+  Q + +
Sbjct: 773  SGIKQSGSNQILYISGMPGTGKTATVYSVIQLLQHKTK 810



 Score = 33.0 bits (75), Expect = 2.4
 Identities = 32/158 (20%), Positives = 57/158 (36%), Gaps = 30/158 (18%)

Query: 2048 NNPKSESTTTNN---PASESITSSS-----PASESTTTSSPASESTTTSSPASESTTTSS 2099
            N P+ E     N   P    I +++       +E + T    +++ TT   A++    + 
Sbjct: 63   NTPRKEEKKKKNLNLPDYNQIQNNTHDFYIDLNERSKTPIKNNDNVTTPIKANKKEKHNL 122

Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEK-----------LSANEDPEEF 2148
             +S S++ SS  +  +  SSP S  + +     S HS K           +S++  P   
Sbjct: 123  DSSSSSSISSSLTNISFFSSPTSIYSCLSNSLSSKHSPKVIKENQSTHVNISSDNSPRN- 181

Query: 2149 PNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREEWPQ 2186
                       EI N     QT+      +D     P+
Sbjct: 182  ----------KEISNKQLKKQTNVTHTTCYDKMRRSPR 209



 Score = 33.0 bits (75), Expect = 2.5
 Identities = 25/142 (17%), Positives = 53/142 (37%), Gaps = 5/142 (3%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
              +E + T    +++ TT    ++    +   S S++ SS  +  +  SS  S  +  S+
Sbjct: 93   DLNERSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLSN 152

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
              S   +    +      S+ V+ S+  S    E +     + + T  +          S
Sbjct: 153  SLSSKHSPKVIKENQ---STHVNISSDNSPRNKEISNKQ--LKKQTNVTHTTCYDKMRRS 207

Query: 2030 PESESTTTSSPASESTTTNNPK 2051
            P + ST  ++   ++   N  K
Sbjct: 208  PRNTSTIKNNTNDKNKEKNKEK 229


>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
          Length = 417

 Score = 33.3 bits (76), Expect = 1.9
 Identities = 13/88 (14%), Positives = 25/88 (28%)

Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
            + T     V  + T        +   +P+       +    S +   P    +T      
Sbjct: 89   DKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEVKAP 148

Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSP 2040
                T S      T +SP+ + +  S  
Sbjct: 149  APALTPSQKDRLETLLSPKDKISLNSEK 176



 Score = 31.7 bits (72), Expect = 4.6
 Identities = 15/87 (17%), Positives = 25/87 (28%)

Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
            + T        + T    A       +P        + A  S +   P+   +T    A 
Sbjct: 89   DKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEVKAP 148

Query: 2103 ESTTTSSPESESTTTSSPASESTTIEE 2129
                T S +    T  SP  + +   E
Sbjct: 149  APALTPSQKDRLETLLSPKDKISLNSE 175



 Score = 31.3 bits (71), Expect = 6.6
 Identities = 14/88 (15%), Positives = 24/88 (27%)

Query: 2033 ESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
            + T        + T        +    P        + A  S +   P+   +T    A 
Sbjct: 89   DKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEVKAP 148

Query: 2093 ESTTTSSPASESTTTSSPESESTTTSSP 2120
                T S      T  SP+ + +  S  
Sbjct: 149  APALTPSQKDRLETLLSPKDKISLNSEK 176


>gnl|CDD|233909 TIGR02520, pilus_B_mal_scr, type IVB pilus formation outer membrane
            protein, R64 PilN family.  Several related protein
            families encode outer membrane pore proteins for type II
            secretion, type III secretion, and type IV pilus
            formation. This protein family appears to encode a
            secretin for pilus formation, although it is quite
            different from PilQ. Members include the PilN lipoprotein
            of the plasmid R64 thin pilus, a type IV pilus. Scoring
            between the trusted and noise cutoffs are examples of
            bundle-forming pilus B (bfpB) [Cell envelope, Surface
            structures, Protein fate, Protein and peptide secretion
            and trafficking].
          Length = 497

 Score = 33.3 bits (76), Expect = 1.9
 Identities = 45/222 (20%), Positives = 80/222 (36%), Gaps = 15/222 (6%)

Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
            ++ V ST +S     ++++     S +T +   +  ++   + + +  S L S  +   S
Sbjct: 169  NSSVTSTSSSTAGSGSSSSGGSGNSGSTQSTAVKLESSVHNDIQQSIKSMLSSSGSWHLS 228

Query: 1940 PESES-TTTSSPES----ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
              + S   T  PE      S   S     +          +       ++    SLV +S
Sbjct: 229  GSTGSLVVTDVPEVLDRVASYIDSQNRRLTRQVLLNVKVLSVQFKGSDQTGVDWSLVYKS 288

Query: 1995 TTTS--SPESESTTTISPVSEST---TTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
             +    S  +  T+T +    S        P + +T  I   S     S   S S TT N
Sbjct: 289  LSRFGLSLANAGTSTAATAGSSAGINVVDGPFAGTTALIRALSTQGKVSVVTSPSVTTLN 348

Query: 2050 ----PKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
                P   +T T   AS+S T+ +    S+T   P + +T  
Sbjct: 349  LQPAPFQIATQTGYLASQS-TTVTANVGSSTDLEPGTITTGF 389



 Score = 31.8 bits (72), Expect = 4.8
 Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 8/97 (8%)

Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLV---SESTTTSSPESESTTTIS----PV 2011
            SL +  T+T++    S   +  +     T++L+   S     S   S S TT++    P 
Sbjct: 295  SLANAGTSTAATAGSSAGINVVDGPFAGTTALIRALSTQGKVSVVTSPSVTTLNLQPAPF 354

Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTN 2048
              +T T    S+STT  +    S+T   P + +T  N
Sbjct: 355  QIATQTGYLASQSTTV-TANVGSSTDLEPGTITTGFN 390


>gnl|CDD|147777 pfam05808, Podoplanin, Podoplanin.  This family consists of several
            mammalian podoplanin like proteins which are thought to
            control specifically the unique shape of podocytes.
          Length = 162

 Score = 32.2 bits (73), Expect = 1.9
 Identities = 26/104 (25%), Positives = 45/104 (43%), Gaps = 11/104 (10%)

Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTT-SSPESESTTTSSLVSESTTTSS- 1969
            ++  +T  PE +  T     ++   T   E   TTT ++ E   +  + LV   T   + 
Sbjct: 20   AQGASTVRPEDDVVTPG--TTDGMVTPGVEDYITTTGATEELNESGLAPLVPTGTENVTK 77

Query: 1970 ------PESESTTTSSPESESTTTSSLV-SESTTTSSPESESTT 2006
                  P +E T     E +STTT ++V S S   +  E+++T 
Sbjct: 78   DHLEDLPTAEGTDHDGEEHKSTTTVTVVTSHSQDKTGDETQTTD 121


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 32.7 bits (75), Expect = 2.2
 Identities = 17/83 (20%), Positives = 33/83 (39%), Gaps = 1/83 (1%)

Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSP-ASESTTTSSPASESTTTSSPAS 2092
             T +  PA        P S ST  +   + +  +SS      TT +    E +  +  A 
Sbjct: 205  GTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAP 264

Query: 2093 ESTTTSSPASESTTTSSPESEST 2115
             S   ++PA+ +   ++  S ++
Sbjct: 265  SSAPATAPAAAAPQAAATSSSAS 287


>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT.  This family
            consists of several bacterial cobalamin biosynthesis
            (CobT) proteins. CobT is involved in the transformation
            of precorrin-3 into cobyrinic acid.
          Length = 282

 Score = 32.9 bits (75), Expect = 2.2
 Identities = 15/76 (19%), Positives = 31/76 (40%)

Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
            S+  ++ E      S+ + ++   ++PK +         ES +S S + +S  +S     
Sbjct: 204  SSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263

Query: 2084 STTTSSPASESTTTSS 2099
                ++ AS   T  S
Sbjct: 264  GEMEAAEASADDTPDS 279


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
            subunit (TFIIF-alpha).  Transcription initiation factor
            IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
            II-associating protein 74 (RAP74) is the large subunit of
            transcription factor IIF (TFIIF), which is essential for
            accurate initiation and stimulates elongation by RNA
            polymerase II.
          Length = 528

 Score = 33.0 bits (75), Expect = 2.3
 Identities = 38/191 (19%), Positives = 61/191 (31%), Gaps = 22/191 (11%)

Query: 1952 SESTTTSSLVSESTTTSSPESES-----TTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
            S+S+ + +   E     SPE  +         S ESE          S        +   
Sbjct: 288  SDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLS----KKGKKLKK 343

Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
                 +      S   + +     + E    S     +     PK E    +NP+S    
Sbjct: 344  LKGKKNGLDKDDSDSGDDSDDSDIDGE---DSVSLVTAKKQKEPKKEEPVDSNPSSP--G 398

Query: 2067 SSSPASESTTTSSPAS-------ESTTTSSPASESTTTSSPASES-TTTSSPESESTTTS 2118
            +S PA  S  +              +  S PA +  T ++P S S  +T    S S ++S
Sbjct: 399  NSGPARPSPESKDKGKRKAANEVSKSPASVPAKKLKTENAPKSSSGKSTPQTFSGSKSSS 458

Query: 2119 SPASESTTIEE 2129
            + A    T E 
Sbjct: 459  NAADGGVTEEA 469


>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
          Length = 413

 Score = 32.6 bits (74), Expect = 2.5
 Identities = 44/219 (20%), Positives = 63/219 (28%), Gaps = 17/219 (7%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            P   + TTS P+  +          + T + E           +   + S         S
Sbjct: 46   PAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGS--------GS 97

Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
                S T S   S     S L  E+T+ SSPES       P S S   S P        +
Sbjct: 98   ESVGSPTPSPSGSAEELASGLSPENTSGSSPES-------PASHSPPPSPPSHPGPHEPA 150

Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS--SPASESTTTSSPASESTTT 2087
            P      + +    S    + +        P SE    S   P SE+ T+S P       
Sbjct: 151  PPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDE 210

Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
                   T   +P+  +      E E T           
Sbjct: 211  PGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPG 249



 Score = 31.5 bits (71), Expect = 6.3
 Identities = 43/217 (19%), Positives = 72/217 (33%), Gaps = 17/217 (7%)

Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
             + P   + TTS P+  +      V+E     +     T   S   E         E   
Sbjct: 43   AAKPAPPAPTTSGPQVRA------VAEQGHRQTESDTETAEESRHGEK--------EERG 88

Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESES-TTTSSPASESTTTNNPKSEST 2055
               P    + ++   + S + S+    S   +SPE+ S ++  SPAS S   + P     
Sbjct: 89   QGGPSGSGSESVGSPTPSPSGSAEELASG--LSPENTSGSSPESPASHSPPPSPPSHPGP 146

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
                P      S +    S    S          P SE    S    +S T +S     +
Sbjct: 147  HEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQS 206

Query: 2116 TTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNED 2152
                P    +   +Q  SP++++   +ED    P  +
Sbjct: 207  PPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPERE 243


>gnl|CDD|213230 cd03263, ABC_subfamily_A, ATP-binding cassette domain of the lipid
            transporters, subfamily A.  The ABCA subfamily mediates
            the transport of a variety of lipid compounds. Mutations
            of members of ABCA subfamily are associated with human
            genetic diseases, such as, familial high-density
            lipoprotein (HDL) deficiency, neonatal surfactant
            deficiency, degenerative retinopathies, and congenital
            keratinization disorders. The ABCA1 protein is involved
            in disorders of cholesterol transport and high-density
            lipoprotein (HDL) biosynthesis. The ABCA4 (ABCR) protein
            transports vitamin A derivatives in the outer segments of
            photoreceptor cells, and therefore, performs a crucial
            step in the visual cycle. The ABCA genes are not present
            in yeast. However, evolutionary studies of ABCA genes
            indicate that they arose as transporters that
            subsequently duplicated and that certain sets of ABCA
            genes were lost in different eukaryotic lineages.
          Length = 220

 Score = 32.1 bits (74), Expect = 2.6
 Identities = 12/25 (48%), Positives = 16/25 (64%)

Query: 2241 ASVMSDRICIQSKGQVKPILSPQHL 2265
            A  + DRI I S G+++ I SPQ L
Sbjct: 195  AEALCDRIAIMSDGKLRCIGSPQEL 219


>gnl|CDD|218307 pfam04880, NUDE_C, NUDE protein, C-terminal conserved region.  This
            family represents the C-terminal conserved region of the
            NUDE proteins. NUDE proteins are involved in nuclear
            migration.
          Length = 166

 Score = 31.8 bits (71), Expect = 2.7
 Identities = 25/114 (21%), Positives = 43/114 (37%), Gaps = 3/114 (2%)

Query: 1891 LSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES---TTTSSPESESTTT 1947
            L +          +   + P       SSP +  T +S     S    T SSP +  T  
Sbjct: 43   LKQELIVQERLRNNNRKSRPAPVVNLGSSPSTPHTNSSMNSPRSPPNGTVSSPLTPPTKL 102

Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
            S   + +T T      S T+SS  S +  +  P  +++ + S  + S   + P+
Sbjct: 103  SLTLASATATDPAPPMSETSSSVNSLTAASGFPLQKASASESFGTRSLYGNRPQ 156


>gnl|CDD|173701 cd05610, STKc_MASTL, Catalytic domain of the Protein Serine/Threonine
            Kinase, Microtubule-associated serine/threonine-like
            kinase.  Serine/Threonine Kinases (STKs),
            Microtubule-associated serine/threonine (MAST) kinase
            subfamily, MAST-like (MASTL) kinases, catalytic (c)
            domain. STKs catalyze the transfer of the
            gamma-phosphoryl group from ATP to serine/threonine
            residues on protein substrates. The MAST kinase subfamily
            is part of a larger superfamily that includes the
            catalytic domains of other protein STKs, protein tyrosine
            kinases, RIO kinases, aminoglycoside phosphotransferase,
            choline kinase, and phosphoinositide 3-kinase. MAST
            kinases contain an N-terminal domain of unknown function,
            a central catalytic domain, and a C-terminal PDZ domain
            that mediates protein-protein interactions. The MASTL
            kinases in this group carry only a catalytic domain,
            which contains a long insertion relative to MAST kinases.
            The human MASTL gene has also been labelled FLJ14813. A
            missense mutation in FLJ14813 is associated with
            autosomal dominant thrombocytopenia. To date, the
            function of MASTL is unknown.
          Length = 669

 Score = 32.9 bits (75), Expect = 2.8
 Identities = 30/141 (21%), Positives = 46/141 (32%), Gaps = 7/141 (4%)

Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
            +P  E    S    ++  TSS     T T  P+      S P+S   +    E+  ++ S
Sbjct: 195  TPVGEKDQGSVNSGQNNGTSS---VRTGTSHPLLMINKESLPMSLKLSKSCLETSESSPS 251

Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
             P    T    P    +     AS S T S   +  ++  S    S       + S + S
Sbjct: 252  LPVRSLT----PNLLKSRKRPEASTSSTHSCMTNSLSSCESECCSSNLKLLEQASSPSQS 307

Query: 2099 SPASESTTTSSPESESTTTSS 2119
               S        E E +   S
Sbjct: 308  PRWSVDEGNIISEGEKSEKGS 328



 Score = 31.4 bits (71), Expect = 7.5
 Identities = 27/113 (23%), Positives = 42/113 (37%), Gaps = 4/113 (3%)

Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
            +PV E    S    ++  T    S  T TS P       + P S   + +   +   + S
Sbjct: 195  TPVGEKDQGSVNSGQNNGT---SSVRTGTSHPLLMINKESLPMSLKLSKSCLETSESSPS 251

Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPAS-ESTTTSSPESESTTTSSP 2120
             P    T     + +    S+ ++ S  T+S +S ES   SS        SSP
Sbjct: 252  LPVRSLTPNLLKSRKRPEASTSSTHSCMTNSLSSCESECCSSNLKLLEQASSP 304


>gnl|CDD|203570 pfam07058, Myosin_HC-like, Myosin II heavy chain-like.  This family
            represents a conserved region within a number of myosin
            II heavy chain-like proteins that seem to be specific to
            Arabidopsis thaliana.
          Length = 351

 Score = 32.7 bits (74), Expect = 2.8
 Identities = 23/126 (18%), Positives = 43/126 (34%), Gaps = 7/126 (5%)

Query: 2034 STTTSSPASES-TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
            +++   P +   + +N P    +      S   TS+   S+    SS    S T      
Sbjct: 163  NSSFVRPTTVGRSESNGPTRRQSLGGAETSPKFTSNGGLSKKRP-SSQLRGSLTGRISTV 221

Query: 2093 ESTTTSSPASESTTTSSPESESTTTSSPAS-----ESTTIEEQGVSPHSEKLSANEDPEE 2147
                  +  S    T S +      + P++     +      +G SP SE+ +  ED   
Sbjct: 222  LKHAKGTSISFDGGTRSMDRSKILANGPSNFPLNDKHEEGTSRGESPDSERKTEEEDGNA 281

Query: 2148 FPNEDV 2153
            +  + V
Sbjct: 282  YSEDSV 287


>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984).  This
            family of proteins is functionally uncharacterized. This
            family of proteins is found in eukaryotes. Proteins in
            this family are typically between 393 and 442 amino acids
            in length.
          Length = 301

 Score = 32.4 bits (74), Expect = 2.8
 Identities = 31/135 (22%), Positives = 59/135 (43%), Gaps = 13/135 (9%)

Query: 2006 TTISPVSESTTTS----SPVSESTTTISPESESTTTS--SPASESTTTNNPKSESTTTNN 2059
            T+  P+ +         +P   +T+ +S +S  TT    S +   + +   K + ++  +
Sbjct: 18   TSRFPLDDDDEERDYSYAPHPPTTSYLSSKSVPTTPGILSHSRSPSRSRLHKRKKSSRRS 77

Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP-ASESTTTSSPESESTTTS 2118
            P S+++       +S +++      +T S   S+S TTS    S S      +SE    +
Sbjct: 78   PMSDTLL------KSKSSAHLLHHQSTRSHRRSKSGTTSPRKPSSSAHRRRNDSEWLLRA 131

Query: 2119 SPASESTTIEEQGVS 2133
              A  S+T EE+G S
Sbjct: 132  GAALASSTREEKGQS 146


>gnl|CDD|140307 PTZ00284, PTZ00284, protein kinase; Provisional.
          Length = 467

 Score = 32.6 bits (74), Expect = 2.8
 Identities = 26/115 (22%), Positives = 44/115 (38%), Gaps = 3/115 (2%)

Query: 2055 TTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
             T+  P +    +S  A+ S +T    S ST ++   S S      A+ + +  +   E 
Sbjct: 18   YTSGAPVNALSGNSPKANNSASTGQTTSRSTNSAR-RSGSKRDRETATSTDSGRTKSHEG 76

Query: 2115 TTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQ 2169
              T+  A+ + T   +   P  +K      P +   E  F     E  +ID S Q
Sbjct: 77   AATTKQATTTPTTNVEVAPPPKKKKVTYALPNQSREEGHFYVVLGE--DIDVSTQ 129


>gnl|CDD|222890 PHA02584, 34, long tail fiber, proximal subunit; Provisional.
          Length = 1229

 Score = 32.8 bits (75), Expect = 2.9
 Identities = 26/199 (13%), Positives = 71/199 (35%), Gaps = 12/199 (6%)

Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
            FT N N  + +V S+  +     T  ++  +++T+         T+ +  S++ TT ++V
Sbjct: 913  FTKNTNLSAPLVSSSTATFGGSVTANSTLTTQNTSNGTVVVVDETSIAFYSQNNTTGNIV 972

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
                 T           +  +  T  ++ V+ +      E      ++  + + T    +
Sbjct: 973  FNIDGT-------VDPINVNANGTLNATGVATNGRAVYAEGGGIARTNNAARAITGGFTI 1025

Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSEST-TTISPESESTTTSSPASESTTTNNP 2050
                +T+     +    +       +    + +   TI+         +  S   T N+ 
Sbjct: 1026 RNDGSTTVFLLTAAGDQTGGFNGLKSLIINNANGQVTINDNYIINAGGTIMSGGLTVNSR 1085

Query: 2051 ----KSESTTTNNPASESI 2065
                 ++++ T  P ++++
Sbjct: 1086 IRSQGTKASYTRAPTADTV 1104


>gnl|CDD|183064 PRK11267, PRK11267, biopolymer transport protein ExbD; Provisional.
          Length = 141

 Score = 31.2 bits (71), Expect = 2.9
 Identities = 8/34 (23%), Positives = 20/34 (58%)

Query: 630 QFHPKEPIIMSASSDKTIYLGESPLHCDKAGSIL 663
           Q  P++P+ +S  +D ++++G  P+  +   + L
Sbjct: 57  QPRPEKPVYLSVKADNSMFIGNDPVTDETMITAL 90


>gnl|CDD|221583 pfam12449, DUF3684, Protein of unknown function (DUF3684).  This
            domain family is found in eukaryotes, and is typically
            between 1072 and 1090 amino acids in length.
          Length = 1084

 Score = 32.7 bits (75), Expect = 2.9
 Identities = 15/94 (15%), Positives = 28/94 (29%), Gaps = 13/94 (13%)

Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE 1933
            T+       + +T   ++       S   E     +              + +  S  S 
Sbjct: 49   TSVTRTVAQIDATWMKVVEWKPPAGSARREGQRVPDT-------------TGSLRSFFSR 95

Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
             T +SSP    T   +   E+     L   ST++
Sbjct: 96   LTGSSSPPKPKTPEPAKVEENLDAEDLTEISTSS 129



 Score = 31.9 bits (73), Expect = 5.7
 Identities = 14/69 (20%), Positives = 22/69 (31%), Gaps = 3/69 (4%)

Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
             +V       S   E          + +  S  S  T +SSP    T   +   E+    
Sbjct: 64   KVVEWKPPAGSARREGQRVPDT---TGSLRSFFSRLTGSSSPPKPKTPEPAKVEENLDAE 120

Query: 1989 SLVSESTTT 1997
             L   ST++
Sbjct: 121  DLTEISTSS 129



 Score = 31.1 bits (71), Expect = 9.1
 Identities = 25/112 (22%), Positives = 44/112 (39%), Gaps = 13/112 (11%)

Query: 1884 MSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSL--VSESTTTS--- 1938
              +L S  S  T ++SP    T       E+         ST++  L   + +  TS   
Sbjct: 86   TGSLRSFFSRLTGSSSPPKPKTPEPAKVEENLDAEDLTEISTSSVFLHIFTANIQTSVSQ 145

Query: 1939 --SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
              + E E  T   P    TT  +++    T+S  E +++  S  +S ++T  
Sbjct: 146  SFAAELERATKKPP--PKTTKLAIL----TSSYDEYDASKASDSKSSASTGD 191


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score = 31.7 bits (72), Expect = 3.1
 Identities = 21/91 (23%), Positives = 36/91 (39%), Gaps = 10/91 (10%)

Query: 705 RCRKRLRKLKKKEKYESPLHCDKAGSILRSGKGRVHTMVNDKHRQILCCHGNDNVVDLFY 764
           R  K L +LK++    S        ++++S         N +  Q      N+ V DLF 
Sbjct: 64  RVDKLLSELKEESLDSSQ-------AVMKSLNENFTDKENVEPEQPTY---NEMVEDLFD 113

Query: 765 FCTKDESSTRCRKRLRKLKKKEKKLQEEQME 795
               +         + +L+K   KL++EQ E
Sbjct: 114 QVKDEVDEKNGAALIEELQKHRDKLKKEQKE 144


>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5.  Members of this
            family of proteins are involved in maintaining Golgi
            structure. They stimulate the formation of Golgi stacks
            and ribbons, and are involved in intra-Golgi retrograde
            transport. Two main interactions have been characterized:
            one with RAB1A that has been activated by GTP-binding and
            another with isoform CASP of CUTL1.
          Length = 509

 Score = 32.5 bits (74), Expect = 3.1
 Identities = 25/97 (25%), Positives = 40/97 (41%), Gaps = 6/97 (6%)

Query: 2019 SPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS 2078
               +E          S+TTSSP   S + +  ++ S   +N A       +P    +  S
Sbjct: 18   RKATEEDDDEDLLEVSSTTSSPVG-SISWSVRETAS---SNKARSRSEKWNPDQPGSRVS 73

Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
            SP+S+   TS   S S+     AS  ++ SS + E  
Sbjct: 74   SPSSKKDGTS--RSLSSQVDDLASAVSSQSSSDLEDE 108


>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c.  This family of
            proteins is found in eukaryotes. Proteins in this family
            are typically between 263 and 318 amino acids in length.
            There is a conserved SFT sequence motif. MSA-2 is a
            plasma membrane glycoprotein which can be found in
            Babesia bovis species.
          Length = 201

 Score = 31.7 bits (72), Expect = 3.3
 Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 6/60 (10%)

Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP-ESESTTTSS 2119
             +E  +    +S+ T+T+ P+  S T +     ++   +P + +     P E+     SS
Sbjct: 138  PAEYYSPKHSSSQGTSTTRPSDGSATPN-----TSAPPTPGNPAAQPEKPAETPKGNGSS 192


>gnl|CDD|236504 PRK09418, PRK09418, bifunctional 2',3'-cyclic nucleotide
            2'-phosphodiesterase/3'-nucleotidase precursor protein;
            Reviewed.
          Length = 780

 Score = 32.8 bits (74), Expect = 3.3
 Identities = 36/145 (24%), Positives = 56/145 (38%), Gaps = 13/145 (8%)

Query: 1994 STTTSSPESESTTT----ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
            +T  SSP ++        IS V  S    +  +   T  + + + T   +P +  T   +
Sbjct: 617  TTFDSSPNAQKYIKKDGNISYVGPSENEFAKYAIDITKKNDDDKETGGENPTTPPTGEGD 676

Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA-SESTTTS 2108
                 TT   P  E     +P      T+ P  E     +P + ST   + A S  TTT 
Sbjct: 677  NGENPTTP--PTGEGNNGENP------TTPPTGEGNNGGNPTTPSTDEGNNAGSGQTTTD 728

Query: 2109 SPESESTTTSSPASESTTIEEQGVS 2133
            +  S+ TTT S   E   + + G S
Sbjct: 729  NQNSKETTTVSENKEERDLPKTGTS 753



 Score = 31.2 bits (70), Expect = 7.8
 Identities = 23/96 (23%), Positives = 35/96 (36%), Gaps = 4/96 (4%)

Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
                T  P  E     +P +  T   +     TT   P  E     NP + ST   N A 
Sbjct: 664  GENPTTPPTGEGDNGENPTTPPTGEGNNGENPTT--PPTGEGNNGGNPTTPSTDEGNNAG 721

Query: 2063 --ESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
              ++ T +  + E+TT S    E     +  S ++T
Sbjct: 722  SGQTTTDNQNSKETTTVSENKEERDLPKTGTSVAST 757


>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation region. 
          Length = 295

 Score = 32.0 bits (72), Expect = 3.3
 Identities = 54/274 (19%), Positives = 92/274 (33%), Gaps = 31/274 (11%)

Query: 1855 AATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESES 1914
            A T    S+ D  S   +  N  +     + + +S  S+   T S E E  T+   E   
Sbjct: 25   AGTGPNFSLADLSSSSYYDLNPGAGLRRSLPSTSSSSSKRPKTVSMEEEMDTSPGGEDFY 84

Query: 1915 TTTSSPESESTTTSSLVS--ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 1972
            T+ SSP S S     +     S T   P+    ++ SP+  S   S+         +  S
Sbjct: 85   TSPSSPSSSSANWHEVEGGMSSPTMKKPDKSLFSSPSPQDSSPRLSAFTQHHRPVITGHS 144

Query: 1973 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE- 2031
              + +  P                T SP    T+ I P   S+            + P+ 
Sbjct: 145  GISASPHP----------------TPSPLHFPTSPILPQQPSSYFPHTAIRYPPHLHPQD 188

Query: 2032 ---SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
                       P+S+     N   +    N+     + +  P         P +      
Sbjct: 189  PLKEFVQLVCDPSSQQAGQPNGSGQGKVPNHFLPTPMLAPPP-------PPPMARPVPLP 241

Query: 2089 SPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
             P ++  TTS+    ++ TS   + ST ++SPA+
Sbjct: 242  MPDTKPPTTSTEGGATSPTSP--TYSTPSTSPAN 273


>gnl|CDD|225249 COG2374, COG2374, Predicted extracellular nuclease [General function
            prediction only].
          Length = 798

 Score = 32.5 bits (74), Expect = 3.3
 Identities = 14/111 (12%), Positives = 33/111 (29%), Gaps = 5/111 (4%)

Query: 2027 TISPESESTTTSSPASESTTT-NNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
             +    +   T S   +  T      +++  T   ++    S   + + +      +  +
Sbjct: 112  YVLLNKDGGYTDSLGVQGGTPLTRWNTDAQQTLTTSAVKEDSFDGSVKESVNFEETATPS 171

Query: 2086 TTSSPA----SESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV 2132
            T    +     E +TT         TS  + +     S       +  +GV
Sbjct: 172  TYPGLSHVNIGELSTTQYGNEALVLTSIGQIQGEGHRSGPLGGGVVTIEGV 222


>gnl|CDD|234520 TIGR04246, nitrous_NosZ_Gp, nitrous-oxide reductase, Sec-dependent.
            This model represents the nitrous-oxide reductase
           protein NosZ as characterized in Geobacillus
           thermodenitrificans. In contrast to the related form in
           Pseudomonas stutzeri, this version lacks a recognizable
           twin-arginine translocation (TAT) signal at the
           N-terminus. Consequently, its accessory protein may
           differ. Some members of this family have an additional
           cytochrome c-like domain at the C-terminus.
          Length = 578

 Score = 32.3 bits (74), Expect = 3.3
 Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 5/45 (11%)

Query: 509 IDNDIKMWDLRTNSVVQKLR-----GHSDTVTGLSLSPDGSYILS 548
           +D+++  W+L T  VV K+      GH     G ++ PDG Y++S
Sbjct: 356 VDSEVVKWNLDTWEVVDKVPVHYSVGHLMAPEGDTVKPDGKYLVS 400


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
            Validated.
          Length = 824

 Score = 32.7 bits (75), Expect = 3.4
 Identities = 13/134 (9%), Positives = 41/134 (30%), Gaps = 5/134 (3%)

Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
              +             + +   ++P   +    +  + +   +   + +       +  +
Sbjct: 386  GVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAP--APAPPS 443

Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
               N  +    S  PA+  +   +PA  +    + A      ++PA      +     + 
Sbjct: 444  PAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPA---AAPAAPAAP 500

Query: 2116 TTSSPASESTTIEE 2129
               + A ++ T+ E
Sbjct: 501  AAPAGADDAATLRE 514


>gnl|CDD|215448 PLN02834, PLN02834, 3-dehydroquinate synthase.
          Length = 433

 Score = 32.4 bits (74), Expect = 3.4
 Identities = 14/68 (20%), Positives = 24/68 (35%), Gaps = 1/68 (1%)

Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTS-SPASESTTTSS 2109
            KS S   +   + ++ S SP+      +S    S               S A++S  T +
Sbjct: 2    KSSSADNSESNTPTVLSRSPSDAFFDQNSSIESSKEGDLTEVIHEKCPVSGANKSEVTKT 61

Query: 2110 PESESTTT 2117
              +  TT 
Sbjct: 62   ASATVTTV 69


>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
          Length = 238

 Score = 32.0 bits (73), Expect = 3.6
 Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 3/81 (3%)

Query: 1918 SSPESESTTT-SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1976
            S+PE +      +++SES    +    S   +  E+E T        +   SSP+S S  
Sbjct: 1    STPEQDPLEVKLAVISESEAEDNSPESSEEVTEQEAELTNPE--GDAAEAESSPDSGSAA 58

Query: 1977 TSSPESESTTTSSLVSESTTT 1997
            + +    +   + L  E  + 
Sbjct: 59   SETAADNAARLAQLEQELESL 79


>gnl|CDD|183582 PRK12543, PRK12543, RNA polymerase sigma factor; Provisional.
          Length = 179

 Score = 31.2 bits (71), Expect = 3.7
 Identities = 20/106 (18%), Positives = 43/106 (40%), Gaps = 24/106 (22%)

Query: 708 KRLRKLKKKEKYESPLHCDKAGSILRSGKGR-----VHTMVNDKHRQILCCHGNDNVVDL 762
           +R R  +K E+   P+  D +  +L     +     +H +   K RQ++       ++  
Sbjct: 79  RRFRIFEKAEEQRKPVSIDFSEDVLSKESNQELIELIHKL-PYKLRQVI-------ILRY 130

Query: 763 FYFCTKDESST-----------RCRKRLRKLKKKEKKLQEEQMEVV 797
            +  +++E +            R    L+KL++KE+  +    EV 
Sbjct: 131 LHDYSQEEIAQLLQIPIGTVKSRIHAALKKLRQKEQIEEIFLGEVG 176


>gnl|CDD|233367 TIGR01349, PDHac_trf_mito, pyruvate dehydrogenase complex
            dihydrolipoamide acetyltransferase, long form.  This
            model represents one of several closely related clades of
            the dihydrolipoamide acetyltransferase subunit of the
            pyruvate dehydrogenase complex. It includes sequences
            from mitochondria and from alpha and beta branches of the
            proteobacteria, as well as from some other bacteria.
            Sequences from Gram-positive bacteria are not included.
            The non-enzymatic homolog protein X, which serves as an
            E3 component binding protein, falls within the clade
            phylogenetically but is rejected by its low score [Energy
            metabolism, Pyruvate dehydrogenase].
          Length = 436

 Score = 32.1 bits (73), Expect = 3.8
 Identities = 16/82 (19%), Positives = 33/82 (40%), Gaps = 2/82 (2%)

Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
            +        +    +   ES+ + +P       ++P S    + +P+ +S   SSPA  S
Sbjct: 73   VLVEEKEDVADAFKNYKLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAPLS 132

Query: 2125 TTIEEQGV--SPHSEKLSANED 2144
                   +  SP ++KL+  + 
Sbjct: 133  DKESGDRIFASPLAKKLAKEKG 154



 Score = 32.1 bits (73), Expect = 4.7
 Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 15/92 (16%)

Query: 2067 SSSPASESTTT--SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
            S+SPA + +    ++P S    + +P  +S   SSPA      S  ES     +SP ++ 
Sbjct: 93   SASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAP----LSDKESGDRIFASPLAKK 148

Query: 2125 TTIEE-------QGVSPHSEKLSANEDPEEFP 2149
               E+        G  P+   +    D E F 
Sbjct: 149  LAKEKGIDLSAVAGSGPNGRIVKK--DIESFV 178



 Score = 31.3 bits (71), Expect = 7.3
 Identities = 15/59 (25%), Positives = 23/59 (38%)

Query: 2036 TTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
                    +    N K ES+ +  P    I  ++P S    + +P  +S   SSPA  S
Sbjct: 74   LVEEKEDVADAFKNYKLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAPLS 132


>gnl|CDD|216784 pfam01917, Arch_flagellin, Archaebacterial flagellin.  Members of
            this family are the proteins that form the flagella in
            archaebacteria.
          Length = 151

 Score = 31.0 bits (71), Expect = 3.8
 Identities = 25/107 (23%), Positives = 39/107 (36%), Gaps = 10/107 (9%)

Query: 1849 LLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSEN-TTTNSPESESTTT 1907
            + I+M+   AVA  V+ N S         + ST     L   LS +          ST+T
Sbjct: 10   VFIAMVLVAAVAAGVLINTS---GFLQQKASSTG--EELTEQLSTDLEIIGVVGDSSTST 64

Query: 1908 NNPESE---STTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSP 1950
               +         S+P   S T  +++ +      +    STT S P
Sbjct: 65   TIDKLTIYVKNAGSTPIDLSQTKITVLYDGGIVVINDTDYSTTVSDP 111


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
            unknown].
          Length = 869

 Score = 32.4 bits (73), Expect = 4.0
 Identities = 57/297 (19%), Positives = 108/297 (36%), Gaps = 27/297 (9%)

Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
            N S+S        +L +    + +   +S  ++  E+E + +S  E+E     +L+    
Sbjct: 76   NASKSGKSNKDHKNLNNTKEISLNDSDDSVNSDKLENEGSVSSIDENELVDLDTLLDNDQ 135

Query: 1936 TT----------SSPES--ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
                        +  E+  ES  +SS +SES  + S     ++ S  + E++ +      
Sbjct: 136  PEKNESGNNDHATDKENLLESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNLR 195

Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
            +   S    E+   S   S+   +I  +      ++  S S+ TI         S P  +
Sbjct: 196  NYIVSLKKDEADAESVLSSDDNDSIEEIKYDPHETNKESGSSETID--ITDLLDSIPMEQ 253

Query: 2044 STTTNNP-KSESTTTNNPASESITS---SSPASESTTTSSPASESTTTSSPASE------ 2093
               +  P  SES+  + P ++SI        A E T       +     +  S+      
Sbjct: 254  LKVSLKPLVSESSKLDAPLAKSIQDRLERQAAYEQTKNDLEKWKPIVADNRKSDQLIFPM 313

Query: 2094 -STTTSSPASESTTTS-SPESESTTTSSPASESTTIEEQGVSPHSEKLSANE-DPEE 2147
              T    P++    +S  P +ES      A     +E +      E+L+ N+   EE
Sbjct: 314  NETARPVPSNNGLASSFEPRTESERKMHQALLDAGLENESALKKQEELALNKLSVEE 370


>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein. 
          Length = 283

 Score = 31.8 bits (72), Expect = 4.0
 Identities = 16/96 (16%), Positives = 30/96 (31%), Gaps = 9/96 (9%)

Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPH---SEKLSANEDPEEFPNEDVFEH 2156
            P  + T +   E E       ++      E+  SP    +E+LS +E      ++     
Sbjct: 11   PRLDQTLSEIEEMEEQRADKSSTFQEDSVEEHTSPSYYLAEELSDSETEPSIEDDQGLYT 70

Query: 2157 TFAEIPNIDHSNQT------DEAIPETFDAREEWPQ 2186
                   ++   Q       D+ I   F+    W  
Sbjct: 71   QLPPAEQVEGFIQGPLDDIADDDIDVVFEEDRPWKP 106


>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
            dihydrolipoamide succinyltransferase.  This model
            represents an Actinobacterial clade of E2 enzyme, a
            component of the 2-oxoglutarate dehydrogenase complex
            involved in the TCA cycle. These proteins have multiple
            domains including the catalytic domain (pfam00198), one
            or two biotin domains (pfam00364) and an E3-component
            binding domain (pfam02817).
          Length = 579

 Score = 32.3 bits (73), Expect = 4.0
 Identities = 35/192 (18%), Positives = 60/192 (31%), Gaps = 13/192 (6%)

Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE-----SESTT 1986
            SE    +     +    +P    T  +   + +   +    E+T    PE     +E T 
Sbjct: 84   SEPAPAAPEPEAAPEPEAPAPAPTPAAEAPAPAAPQAGGSGEATEVKMPELGESVTEGTV 143

Query: 1987 TSSL--VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASES 2044
            TS L  V ++     P  E +T        T   SPV+ +   I    + T         
Sbjct: 144  TSWLKAVGDTVEVDEPLLEVSTD----KVDTEIPSPVAGTLLEIRAPEDDTVEVGTVLAI 199

Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
                N            + S   S PA +    +     +    +PA     T++PA+ +
Sbjct: 200  IGDANAAPAEPAEEEAPAPSEAGSEPAPD--PAARAPHAAPDPPAPAPAPAKTAAPAAAA 257

Query: 2105 TTTSSPESESTT 2116
              +S       T
Sbjct: 258  PVSSGDSGPYVT 269


>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
            [Transcription].
          Length = 610

 Score = 32.1 bits (73), Expect = 4.2
 Identities = 31/225 (13%), Positives = 69/225 (30%), Gaps = 9/225 (4%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS-LVSESTTTS 1968
                +   +S     +   S      + S P                 S         + 
Sbjct: 375  RHKPTAWQNSIRHNLSLNKSFEKVPRSASQPGKGCFWKIDYSYIYEKESKRNPRSPKKSP 434

Query: 1969 SPESESTTTS---SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
            S  S     S   +   +S  TS + S S+  +S     +T I      +  +  ++E  
Sbjct: 435  SAHSVHQKLSLHVNDLYQSPATSDIASSSSQVNSQPEFISTQIHSSKGVS--NVDLTEQD 492

Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
            +       +    S +    T         TT++   +S + ++P + S    +  + ++
Sbjct: 493  SQKEASKGNFLDDSGSLSPNTNEINSFSLNTTDSQQKQSPSHNAPTNNSLNEMASKNSNS 552

Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTT---TSSPASESTTI 2127
             T +  S     +  A    +    +    +   T + A+ES ++
Sbjct: 553  QTQASNSNENVAAVKAILDASAQMEKPYDLSQAATPTKATESASV 597


>gnl|CDD|225372 COG2815, COG2815, Uncharacterized protein conserved in bacteria
            [Function unknown].
          Length = 303

 Score = 32.0 bits (73), Expect = 4.2
 Identities = 16/116 (13%), Positives = 29/116 (25%), Gaps = 2/116 (1%)

Query: 1963 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
            E  ++  PE E  + S P   +    S +    +  +  +   + +  V           
Sbjct: 190  EYVSSDRPEGEVISQSPPAGTTVNVGSKIEIVVSKGAFVAPDLSGMFTVEAEPHPREEGD 249

Query: 2023 ESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS 2078
             S   I  +    T S   S       P    T     +  +             S
Sbjct: 250  TSQEVIRDKDADVTASGTDSSVNIQPPP--GGTIVLKGSEITSGIYQVVVNDKVIS 303


>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275).  This
            family of proteins with unknown function appear to be
            restricted to Proteobacteria.
          Length = 211

 Score = 31.4 bits (71), Expect = 4.3
 Identities = 16/71 (22%), Positives = 25/71 (35%), Gaps = 3/71 (4%)

Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASE-STTTSSPASESTTTSSPESES--T 2115
              + +    + P        SPAS +   S+PA   S  +  PAS   +           
Sbjct: 84   KLSRDEPRRTEPQEPDPLDESPASAAPVASAPAPAPSPQSPKPASRRASRDMRRIAPFGM 143

Query: 2116 TTSSPASESTT 2126
              S+PA E+  
Sbjct: 144  NASAPAQEAAQ 154


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 32.0 bits (73), Expect = 4.4
 Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 9/88 (10%)

Query: 776 RKRLRKLKKKEKKLQEEQMEV------VEENPVDPDDTEGGKGKPELVDVVKRLPTIKTA 829
           R R R++++ EK+L+E++  V      + E          GKG P  V     L  I+ A
Sbjct: 477 RARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEA 536

Query: 830 SKTGKIKSVDVIL---GGGGEIRLALLL 854
            +   IK  DVIL     GG  R A  L
Sbjct: 537 EEEYGIKEGDVILVEDPSGGGARTAEEL 564


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 32.1 bits (73), Expect = 4.5
 Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 3/48 (6%)

Query: 767 TKDESSTRCRKRLRKLKKKEKKLQEEQMEVVEENP-VDPDDTEGGKGK 813
            K E S R + RL++LKK++K+  ++ +E  ++N  +D D    GKG+
Sbjct: 53  AKAEISKREKARLKELKKQKKQEIQKILE--QQNAAIDADMNNKGKGR 98


>gnl|CDD|152451 pfam12016, Stonin2_N, Stonin 2.  Stonin 2 is involved in clathrin
            mediated endocytosis. It binds to Eps15 by its highly
            conserved NPF motif. The complex formed has been shown to
            directly associate with the clathrin adaptor complex
            AP-2, and to localize to clathrin-coated pits (CCPs). In
            addition, stonin2 was recently identified as a specific
            sorting adaptor for synaptotagmin, and may thus regulate
            synaptic vesicle recycling.
          Length = 341

 Score = 31.8 bits (71), Expect = 4.5
 Identities = 38/205 (18%), Positives = 71/205 (34%), Gaps = 17/205 (8%)

Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
             +T+ P  E+  T  P +    T  P  +S     L SES+ T+   SE T++ S     
Sbjct: 112  ASTSPPHKETAETALPLTMPCWTC-PSFDSLGRCPLTSESSWTT--HSEDTSSPSFACSY 168

Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS-- 2012
            T    + +E   +       +T +S   +      + + S    SP         PV+  
Sbjct: 169  TDLQLINAEEQASGQASGADSTDNSSSLQEDEEVEMEAISWQAGSPAMNGHPAAPPVTSA 228

Query: 2013 --------ESTTTSSPVSESTTTISPESESTTTSSP----ASESTTTNNPKSESTTTNNP 2060
                    +      P+    +   P +    +++P     S  +     + +ST  N P
Sbjct: 229  RFPSWVTFDDNEVGCPLPPVPSPKKPNTPPAASAAPDVPFNSMGSFKKRDRPKSTLMNFP 288

Query: 2061 ASESITSSSPASESTTTSSPASEST 2085
              + +  SS     +   +P   +T
Sbjct: 289  KVQKLDISSLNRPPSVIEAPPWRAT 313


>gnl|CDD|218803 pfam05904, DUF863, Plant protein of unknown function (DUF863).  This
            family consists of a number of hypothetical proteins from
            Arabidopsis thaliana and Oryza sativa. The function of
            this family is unknown.
          Length = 766

 Score = 32.3 bits (73), Expect = 4.5
 Identities = 48/314 (15%), Positives = 105/314 (33%), Gaps = 34/314 (10%)

Query: 1868 SEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTT 1927
            S  +   N  S +  +  T N    E     S  S         + S        ++   
Sbjct: 103  SNGLADLNEPSPTWGLTETANVQGQEVEERASDTSRDFLGRYGSNISHVQDQSLEKNLNH 162

Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE--SEST 1985
            +S++      S+P+S     S            V  +       S  T  S  +   E T
Sbjct: 163  NSVLEAGKEKSTPKSSLDLPSQEGQ--------VLSNKAFQPRYSLLTDQSKCKYVRERT 214

Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
            +++  V          S   +  SP+  S  ++ P         P+S  + +   +S   
Sbjct: 215  SSNLEVQNK-------SPGVSYQSPLESSVASNLP--RLNPFYRPDSAKSWSHWSSSWEN 265

Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
             ++    +ST  N+  ++ +     + E+++T++P+  ++  S+ ++      S  S+ +
Sbjct: 266  MSSGLDQKSTPLNSAQTQPV----LSFETSSTANPSFGTSCCSTNSNGFYNGFSSGSKES 321

Query: 2106 T--TSSPESESTTTSSPASESTTIEEQGVSPHSEKLS---------ANEDPEEFPNEDVF 2154
                S+  +    +S   +   +  E       E  S           + P +      F
Sbjct: 322  PFFASTGFNYPNISSGEEATEHSFVELQGPKSEECSSGLPWLRKKPTCKGPLDLNASSAF 381

Query: 2155 EHTFAEIPNIDHSN 2168
              + A + +++ SN
Sbjct: 382  YSSNANVIDVEPSN 395


>gnl|CDD|227520 COG5193, LHP1, La protein, small RNA-binding pol III transcript
            stabilizing protein and related La-motif-containing
            proteins involved in translation [Posttranslational
            modification, protein turnover, chaperones / Translation,
            ribosomal structure and biogenesis].
          Length = 438

 Score = 31.9 bits (72), Expect = 4.8
 Identities = 39/263 (14%), Positives = 76/263 (28%), Gaps = 23/263 (8%)

Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
            S T    E +   TSSL   S  T+S E       S +S +      ++ES+ +   +  
Sbjct: 9    SNTEHQAEDKKKQTSSLKLASEPTTSEEKS----KSQDSNTVIPVEELTESSKSKKEDKN 64

Query: 1974 STTTSSPESES-TTTSSLVSES--TTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
             +  +S    +        S S  T ++ P+ +   T +P ++      P+     T + 
Sbjct: 65   PSKLTSNTKWTLKQVEFYFSGSKDTDSNFPKDKFLKTTAPKNKKRDKWVPIKTI-ATFNR 123

Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
               S         S  +    +   + +    E  +S S  + +    S  ++ST+    
Sbjct: 124  MKNSG--------SPVSAVSGALRKSLDARVLEVSSSGSNKNRTEKLISNNNKSTSQM-- 173

Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV---SPHSEKLSANEDPEE 2147
                   +    E    +S   +                  +        K        E
Sbjct: 174  -QRDVYQNGFGKEDVNNASRPEQQEDLEIQFPPHYHAPPSQIRNRRDWLNKNFRGSVFVE 232

Query: 2148 FPNEDVFE-HTFAEIPNIDHSNQ 2169
            F      +        N  + N 
Sbjct: 233  FKYFREAQRFNNGFYRNKKYPND 255


>gnl|CDD|215361 PLN02673, PLN02673, quinolinate synthetase A.
          Length = 724

 Score = 31.9 bits (72), Expect = 5.0
 Identities = 21/82 (25%), Positives = 31/82 (37%), Gaps = 14/82 (17%)

Query: 2032 SESTTTSSPASESTTTNNPKSESTTTN------------NPASESIT--SSSPASESTTT 2077
            S S T+SS +S  +   NP     TT+            NP  +S     S P   + + 
Sbjct: 2    SSSPTSSSSSSFLSLLPNPSPNFRTTHPNFGSQRRIGTINPLFKSFKCIQSPPPDSAPSN 61

Query: 2078 SSPASESTTTSSPASESTTTSS 2099
            +SP S S    SP+  +     
Sbjct: 62   ASPFSCSAVAFSPSQTTELVPC 83


>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
            (NESP55).  This family consists of several mammalian
            neuroendocrine-specific golgi protein P55 (NESP55)
            sequences. NESP55 is a novel member of the chromogranin
            family and is a soluble, acidic, heat-stable secretory
            protein that is expressed exclusively in endocrine and
            nervous tissues, although less widely than chromogranins.
          Length = 261

 Score = 31.4 bits (70), Expect = 5.2
 Identities = 31/127 (24%), Positives = 45/127 (35%), Gaps = 9/127 (7%)

Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
            PE  S   S  E E         E       + ++ T S  E ES     + SE+   + 
Sbjct: 81   PEP-SEPESDHEDEDFEPELARPECLEYDEDDFDTETDSETEPES----DIESETEFETE 135

Query: 1970 PESESTT--TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
            PE+E  T  T+ PE+E       V     T       T  +  +   +  +SP     +T
Sbjct: 136  PETEPDTAPTTEPETEPEDEPGPVVPKGATFH--QSLTERLHALKLQSADASPRRAPPST 193

Query: 2028 ISPESES 2034
              PES  
Sbjct: 194  QEPESAR 200


>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682).  This
            domain family is found in eukaryotes, and is typically
            between 125 and 136 amino acids in length.
          Length = 133

 Score = 30.2 bits (68), Expect = 5.4
 Identities = 21/83 (25%), Positives = 35/83 (42%), Gaps = 2/83 (2%)

Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
             SS +S       P         PA+  + SS+ +S     SS ++ S TT   ++   +
Sbjct: 10   VSSGSSAPAPPAGPGPGPNAPPAPAAPGVDSSAGSSGGEAGSSGSNSSNTTGDSSTGDQS 69

Query: 2097 TSSPASESTTTSSPESESTTTSS 2119
                A+ +  +S PE  + TTS 
Sbjct: 70   --PAAAAAHNSSPPEGPAGTTSG 90


>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
            dihydrolipoamide acetyltransferase, long form.  This
            model describes a subset of pyruvate dehydrogenase
            complex dihydrolipoamide acetyltransferase specifically
            close by both phylogenetic and per cent identity (UPGMA)
            trees. Members of this set include two or three copies of
            the lipoyl-binding domain. E. coli AceF is a member of
            this model, while mitochondrial and some other bacterial
            forms belong to a separate model [Energy metabolism,
            Pyruvate dehydrogenase].
          Length = 546

 Score = 31.8 bits (72), Expect = 5.5
 Identities = 23/96 (23%), Positives = 38/96 (39%), Gaps = 1/96 (1%)

Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
            VS   + I+ ES+  +   PA  S    + K +   +       +T S   S   T  +P
Sbjct: 143  VSADQSLITLESDKASMEVPAPASGVVKSVKVKVGDSVPTGDLILTLSVAGSTPATAPAP 202

Query: 2081 ASESTTTSSPASES-TTTSSPASESTTTSSPESEST 2115
            AS      SPA+      ++PA+      +P+   T
Sbjct: 203  ASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGT 238


>gnl|CDD|177475 PHA02693, PHA02693, hypothetical protein; Provisional.
          Length = 710

 Score = 31.9 bits (72), Expect = 5.5
 Identities = 29/116 (25%), Positives = 46/116 (39%), Gaps = 10/116 (8%)

Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
            + + T+   S S +T S  S  +T S+ +   T T +P  +         ES     P +
Sbjct: 271  DESDTADSCSRSFSTQSTRSTRSTRSTRSGAETDTTDPDLDP-----DDDESFDEVGPLT 325

Query: 2073 ESTTTSSPASESTTTSSPAS--ESTTTSSPASESTTTSS---PESESTTTSSPASE 2123
               T +S A  ++  SS AS       S+  SE   +S+   P+     T   A E
Sbjct: 326  RRFTATSFAPRASVRSSSASMRLHARGSTRISEPLMSSAARVPKVSMAPTLDTAEE 381


>gnl|CDD|218190 pfam04651, Pox_A12, Poxvirus A12 protein. 
          Length = 188

 Score = 30.9 bits (70), Expect = 5.6
 Identities = 17/73 (23%), Positives = 29/73 (39%), Gaps = 1/73 (1%)

Query: 2046 TTNNPKSESTTTNNPASESITSSS-PASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
              N   + +   NNP+   + +     S S++ S   S ST+ + P S S+ + S A   
Sbjct: 36   QANRGGNLAGPENNPSDNEVKAGKRVTSASSSKSKRCSTSTSKTKPCSRSSRSRSGAPRR 95

Query: 2105 TTTSSPESESTTT 2117
              T+    E    
Sbjct: 96   RGTAFGSMEDPQI 108


>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein.  This family consists of mammalian
            MRVI1 proteins which are related to the
            lymphoid-restricted membrane protein (JAW1) and the IP3
            receptor associated cGMP kinase substrates A and B (IRAGA
            and IRAGB). The function of MRVI1 is unknown although
            mutations in the Mrvi1 gene induces myeloid leukaemia by
            altering the expression of a gene important for myeloid
            cell growth and/or differentiation so it has been
            speculated that Mrvi1 is a tumour suppressor gene. IRAG
            is very similar in sequence to MRVI1 and is an essential
            NO/cGKI-dependent regulator of IP3-induced calcium
            release. Activation of cGKI decreases IP3-stimulated
            elevations in intracellular calcium, induces smooth
            muscle relaxation and contributes to the
            antiproliferative and pro-apoptotic effects of NO/cGMP.
            Jaw1 is a member of a class of proteins with
            COOH-terminal hydrophobic membrane anchors and is
            structurally similar to proteins involved in vesicle
            targeting and fusion. This suggests that the function
            and/or the structure of the ER in lymphocytes may be
            modified by lymphoid-restricted resident ER proteins.
          Length = 538

 Score = 31.9 bits (72), Expect = 5.6
 Identities = 24/123 (19%), Positives = 36/123 (29%), Gaps = 7/123 (5%)

Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
                      PA ES        +       +S  +   + +   T TSSP  +     +
Sbjct: 40   ASQGENGVGEPAGES-----VGQKRELWPPTSSPPLLRGTSSDSGTETSSPRGQKILAMA 94

Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ-GVSPHSEKLSANEDPEEF 2148
                         ES   +SP  +   T S A E   +     V     +L A E+ E  
Sbjct: 95   SLDLDEKRLCGKEESKRAASPGLKQQGT-SLAEEHILLRNSNLVGKKLPELEAAEEQETS 153

Query: 2149 PNE 2151
              E
Sbjct: 154  EIE 156


>gnl|CDD|152863 pfam12429, DUF3676, Protein of unknown function (DUF3676).  This
            domain family is found in eukaryotes, and is
            approximately 230 amino acids in length.
          Length = 230

 Score = 31.0 bits (70), Expect = 5.6
 Identities = 54/217 (24%), Positives = 82/217 (37%), Gaps = 18/217 (8%)

Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTS--SLVSESTTTSSPESESTTTSS 1949
            SE +T +  E     T+  E ES     P + S+T    S VSE  T +    ES   S 
Sbjct: 11   SEESTASHEELTEDDTDKQEEESVHDPVPAAPSSTVVAGSSVSEPATAA----ESAENSR 66

Query: 1950 PE-----SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV---SESTTTSSPE 2001
            PE     SE  T+          S            +SE  T  + V   SES  T  PE
Sbjct: 67   PEDNAQLSEGETSQQATLNEDNESMQRDSDVQPQDLQSEELTEVTDVEGSSESNDTEQPE 126

Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
             E     +  S  +T+    S S  T +   +       ++E +  N+    + T    A
Sbjct: 127  EEGEA--NDRSGGSTSPVAASLSMETATAPVDGEHQVQQSTELSAENDDVRSTGTGTTGA 184

Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
             ES+  S  A +  +  +  S+S+ T S +    T++
Sbjct: 185  EESL--SLEAGDGNSERTMGSDSSLTPSKSDAEPTSA 219


>gnl|CDD|221321 pfam11928, DUF3446, Domain of unknown function (DUF3446).  This
            presumed domain is functionally uncharacterized. This
            domain is found in eukaryotes. This domain is typically
            between 80 to 99 amino acids in length. This domain is
            found associated with pfam00096. This domain has a single
            completely conserved residue P that may be functionally
            important.
          Length = 84

 Score = 29.1 bits (65), Expect = 5.7
 Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 3/50 (6%)

Query: 2077 TSSPASESTTTSSPASESTTTSSPASESTTTSSPE---SESTTTSSPASE 2123
            ++ P S S ++SS +S S++ S P S S   S P    S +   SS   +
Sbjct: 31   SNPPPSSSPSSSSSSSSSSSQSPPLSCSVHQSEPSPIYSAAPPYSSACGD 80


>gnl|CDD|227507 COG5180, PBP1, Protein interacting with poly(A)-binding protein [RNA
            processing and modification].
          Length = 654

 Score = 31.6 bits (71), Expect = 5.9
 Identities = 22/127 (17%), Positives = 42/127 (33%), Gaps = 6/127 (4%)

Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE---SESTTTSSPASESTTTNNP 2050
                  PE+ S          +   S   E    I  +   S     +   S        
Sbjct: 281  LLENRKPEAVSAPEAVSPQSKSEGPSSGQEKEKQIKEKKSFSYGWKHTKFDSSKNLLEVI 340

Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
            KS+  +  + +S  +   S         S A++    S P  ES  + S A++ + ++  
Sbjct: 341  KSKFKSLFDISSGELKWGSKPPWEAKAVSIATK---VSKPKKESVRSGSKAAKKSPSTKH 397

Query: 2111 ESESTTT 2117
             + S+T+
Sbjct: 398  TTRSSTS 404


>gnl|CDD|227478 COG5149, TOA1, Transcription initiation factor IIA, large chain
            [Transcription].
          Length = 293

 Score = 31.2 bits (70), Expect = 6.1
 Identities = 27/126 (21%), Positives = 43/126 (34%), Gaps = 17/126 (13%)

Query: 1856 ATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESEST 1915
            A AVA S I N S     TN + +S+ + +         +   +P    ++TN       
Sbjct: 77   APAVANSPILNQSA----TNISFDSSAIPNV-------QSNNTAPFPSYSSTN------Q 119

Query: 1916 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
            T  SP     +T++L       +   S        E E +   S ++    T   E    
Sbjct: 120  TADSPIINDHSTANLKIYGDIIAEVISLPNRLEQVEDELSIGKSAITTLRNTDWRERLID 179

Query: 1976 TTSSPE 1981
             T S  
Sbjct: 180  DTQSEW 185


>gnl|CDD|146285 pfam03566, Peptidase_A21, Peptidase family A21. 
          Length = 628

 Score = 31.8 bits (72), Expect = 6.4
 Identities = 28/173 (16%), Positives = 56/173 (32%), Gaps = 17/173 (9%)

Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
            P SE+          +T   P + +   S+ V      S P   S  + +P     ++  
Sbjct: 275  PISETQNAVPDIVAGSTFVGPSNVTRPGSATVVTLVWASLPPGGSAPSGTPTWTPNSSGQ 334

Query: 1960 LVSES-----------TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
                            T       E    ++P    T   +     T T +  + + T +
Sbjct: 335  FGQWRHGGFDASVILPTVPRGYTMEYGDFANPGDTLTFGQTGGDNVTITITAPTVTVTVL 394

Query: 2009 SPVSESTTTSSPVS-ESTTTISPESESTTTSS----PAS-ESTTTNNPKSEST 2055
            + ++ S      V+ +S   ++ ++ +    S    P +   T  N PK+E  
Sbjct: 395  ASLTSSNGVFRGVTADSGARLNLDTAALNRLSIPLPPLTFGQTMQNTPKTEQF 447


>gnl|CDD|227404 COG5072, ALK1, Serine/threonine kinase of the haspin family [Cell
            division and chromosome partitioning].
          Length = 488

 Score = 31.4 bits (71), Expect = 6.6
 Identities = 17/84 (20%), Positives = 35/84 (41%), Gaps = 1/84 (1%)

Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
            S  +  S P   S++  +   +  +    E  + ++ S V      + P  E++ TI  +
Sbjct: 49   SLESIHSKPSKTSSSKWNFWKKKGSYPENELLAKSSFSSVHTVIFPAGPRDEASKTIVSK 108

Query: 2032 SESTT-TSSPASESTTTNNPKSES 2054
             E T   +  A  S+ +N+ K + 
Sbjct: 109  KEVTNLLNHKALSSSLSNSLKHKP 132


>gnl|CDD|216095 pfam00748, Calpain_inhib, Calpain inhibitor.  This region is found
            multiple times in calpain inhibitor proteins.
          Length = 131

 Score = 29.8 bits (67), Expect = 7.0
 Identities = 19/82 (23%), Positives = 32/82 (39%), Gaps = 2/82 (2%)

Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
            S  T S +   + T+    E    ++ + E  +  S  S  +    P  +   + +  + 
Sbjct: 6    SDFTCSASPPPSPTAKKKKEEAEKTAASGEVVSAQSAPSVRSAAPPPEKKRDKMSDDALD 65

Query: 2134 PHSEKLSANE-DPEE-FPNEDV 2153
              S+ L   E DPEE  P ED 
Sbjct: 66   ALSDSLGQREPDPEEKKPVEDK 87


>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
          Length = 489

 Score = 31.3 bits (71), Expect = 7.2
 Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 13/56 (23%)

Query: 2346 SCEGSINPRYIHSVKIIGWG-------KSSQNEP--YWLCTNSYNQGWGEQGLFKI 2392
             C    N    H+V ++G+G        + + E   Y++  NS+ + WGE+G  +I
Sbjct: 415  ECGEEPN----HAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRI 466


>gnl|CDD|177303 PHA00735, PHA00735, hypothetical protein.
          Length = 808

 Score = 31.4 bits (71), Expect = 7.3
 Identities = 17/36 (47%), Positives = 21/36 (58%)

Query: 87  SSQLAVAYTNGSLKTFSLDTTDVISTFTGHKSAITV 122
           S+QL V Y NG+LKTFS+    VI+      S  TV
Sbjct: 198 SNQLYVYYYNGTLKTFSITPGQVINNQFYPLSLNTV 233


>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936).  This
            family consists of several hypothetical proteins from
            Arabidopsis thaliana and Oryza sativa. The function of
            this family is unknown.
          Length = 564

 Score = 31.3 bits (71), Expect = 7.3
 Identities = 45/285 (15%), Positives = 77/285 (27%), Gaps = 64/285 (22%)

Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
             S S  T+ P S     SS  S     S+ + +     S   +  + SS    S    S 
Sbjct: 192  PSPSGGTSCPSSSGGRRSSIGSRRLRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRARSS 251

Query: 1961 VSESTTTSSPESESTTTSSPESE----STTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
             ++S   SS + ++T   S  S       T+ S  SE       E++  ++    ++   
Sbjct: 252  SAKSPFKSSIQRKATKALSKLSLRASPKDTSKSSKSEVAPPKKSEAKVPSSSKKWTDGNV 311

Query: 2017 TSSPVSESTTTISPE-----------------------------------SESTTTSSPA 2041
            +   +  S + +  E                                   S S    +P 
Sbjct: 312  SWDSLPSSLSKLGKEALRQRDVAQKAALEALREASATESLIRCLSTFSELSSSAKEDNPL 371

Query: 2042 ---------------------SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
                                 S S + +            A   I ++     S  + S 
Sbjct: 372  PCIEKFLKFHQELDQAIKIAESLSKSRSPDAECRLERKKSALSWIRAALATDLSPFSLSG 431

Query: 2081 ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASEST 2125
                 +TS        TS    E  +     S  +   S   ES+
Sbjct: 432  KESKRSTSLKKLVPPKTSRSNDEGRS----SSVGSIKGSGLKESS 472



 Score = 31.3 bits (71), Expect = 7.5
 Identities = 36/202 (17%), Positives = 63/202 (31%), Gaps = 8/202 (3%)

Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
             ++      + + +       +S      +  S    ++  SSP        S    + T
Sbjct: 116  VAADSLAFFSDAVIQVIKRKKASSAPRRGSWDSSSKSASIDSSPTVIGPRPRSFSELNLT 175

Query: 2037 TSSPA--SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
              +PA    S +     S S  T+ P+S     SS  S     S  AS     +  ++  
Sbjct: 176  DRTPAKVRSSRSELGAPSPSGGTSCPSSSGGRRSSIGSRRLRGS--ASLRKKVAVLSAPR 233

Query: 2095 TTTSSPASESTTTSSPESESTTTSSPA-SESTTIEEQGVSPHSEKLSANEDPEEFPNEDV 2153
               S     S   SSP + S++  SP  S       + +S  S + S  +  +   +E  
Sbjct: 234  KPGSRS---SDCKSSPRARSSSAKSPFKSSIQRKATKALSKLSLRASPKDTSKSSKSEVA 290

Query: 2154 FEHTFAEIPNIDHSNQTDEAIP 2175
                            TD  + 
Sbjct: 291  PPKKSEAKVPSSSKKWTDGNVS 312


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
            eukaryotic proteins with undetermined function.
          Length = 321

 Score = 31.3 bits (71), Expect = 7.3
 Identities = 17/100 (17%), Positives = 35/100 (35%), Gaps = 14/100 (14%)

Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS- 2069
              ++  T SP S S++   P S    +++  SE+     P S     N+        +  
Sbjct: 220  EEDNDNTPSPKSGSSSPAKPTSILKKSAAKRSEA-----PSSSKAKKNSRGIPKPRDALS 274

Query: 2070 --------PASESTTTSSPASESTTTSSPASESTTTSSPA 2101
                        ++ + S A  ++ +   A  S+ +S   
Sbjct: 275  SLVVRKKAAPESTSQSPSSAEPTSESPQTAGNSSLSSLGD 314


>gnl|CDD|205996 pfam13825, Paramyxo_PNT, Paramyxovirus structural protein V/P
            N-terminus.  This family consists of several
            Paramyxoviridae structural protein P and V sequences.
            From a structural point of view, P is the
            best-characterized protein of the replicative complex. P
            is organised into two moieties that are functionally and
            structurally distinct: a C-terminal moiety (PCT) and an
            N-terminal moiety (PNT). PCT is the most conserved in
            sequence and contains all regions required for virus
            transcription, whereas PNT, which is poorly conserved,
            provides several additional functions required for
            replication. P protein plays a crucial role in the enzyme
            by positioning L onto the N/RNA template through an
            interaction with the C-terminal domain of N. Without P, L
            is not functional. The N, P, and L proteins of SeV and
            measles and mumps viruses are functionally equivalent.
            However, sequence identity between proteins from these
            viruses is limited, and the viruses have been placed in
            different genera (Respirovirus, Morbilivirus, and
            Rubulavirus, respectively). SeV P protein (568 aa) is a
            modular protein with distinct functional domains. The
            N-terminal part of P (PNT) is a chaperone for N and
            prevents it from binding to non-viral RNA in the infected
            cell.
          Length = 309

 Score = 31.0 bits (70), Expect = 7.6
 Identities = 24/91 (26%), Positives = 37/91 (40%), Gaps = 12/91 (13%)

Query: 2046 TTNNPKSESTTTNNPASESI--------TSSSPASESTTTSS---PASESTTTSS-PASE 2093
            T   P        +P+ + I         SS   +ES +T      A +ST  SS P + 
Sbjct: 206  TLQVPPIPDVKRGDPSCKPIKKGTEERSASSGTETESLSTGGATQSALKSTWGSSEPNAS 265

Query: 2094 STTTSSPASESTTTSSPESESTTTSSPASES 2124
            +      AS +      + ES TT+SP S++
Sbjct: 266  AGNVRQSASNAKMIQKCKQESGTTASPRSQN 296



 Score = 31.0 bits (70), Expect = 8.3
 Identities = 20/96 (20%), Positives = 32/96 (33%), Gaps = 11/96 (11%)

Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
             P+ +    S    +  T     S  T T S  +   T S+ +S              +S
Sbjct: 212  IPDVKRGDPSCKPIKKGTEERSASSGTETESLSTGGATQSALKSTW-----------GSS 260

Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTTISPESES 2034
             P + +       S +        ES TT SP S++
Sbjct: 261  EPNASAGNVRQSASNAKMIQKCKQESGTTASPRSQN 296


>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein.  This domain is found
            in the aflatoxin regulatory protein (AflR) which is
            involved in the regulation of the biosynthesis of
            aflatoxin in the fungal genus Aspergillus. It occurs
            together with the fungal Zn(2)-Cys(6) binuclear cluster
            domain (pfam00172).
          Length = 275

 Score = 31.1 bits (70), Expect = 7.7
 Identities = 32/160 (20%), Positives = 53/160 (33%), Gaps = 8/160 (5%)

Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
            +    T SSP   + TT+       TTSS   +    S P S      +P + + T+S  
Sbjct: 2    LETPNTASSPTIPANTTA------NTTSSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPS 55

Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT--TS 2078
               S  +   E E   +    + S       S   + N    +   S SP+         
Sbjct: 56   PKFSHQSPPAEPELWGSILSPNASNQDQGDLSSLLSVNTDFGQLFASLSPSPLFDGNDAD 115

Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
              A  +   S    E ++       ++  S P S  T+ +
Sbjct: 116  LHAEATGELSVADLEVSSPMQDLFLTSALSPPSSARTSHT 155


>gnl|CDD|132198 TIGR03154, sulfolob_CbsA, cytochrome b558/566, subunit A.  Members of
            this protein family are CbsA, one subunit of a highly
            glycosylated, heterodimeric, mono-heme cytochrome
            b558/566, found in Sulfolobus acidocaldarius and several
            other members of the Sulfolobales, a branch of the
            Crenarchaeota.
          Length = 465

 Score = 31.1 bits (70), Expect = 8.1
 Identities = 16/47 (34%), Positives = 27/47 (57%), Gaps = 1/47 (2%)

Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
            + + TSS ++    T+ P   ++TT S  S STTTSS +  +T  ++
Sbjct: 400  DKSITSSFLTLELVTTPPTPPTSTTTS-TSPSTTTSSAIPSTTLYVT 445



 Score = 31.1 bits (70), Expect = 8.6
 Identities = 16/40 (40%), Positives = 24/40 (60%)

Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
            +SITSS    E  TT      STTTS+  S +T+++ P++
Sbjct: 401  KSITSSFLTLELVTTPPTPPTSTTTSTSPSTTTSSAIPST 440


>gnl|CDD|221093 pfam11359, gpUL132, Glycoprotein UL132.  Glycoprotein UL132 is a
            low-abundance structural component of Human
            cytomegalovirus (HCMV). The function of this protein is
            not fully understood.
          Length = 235

 Score = 30.8 bits (69), Expect = 8.1
 Identities = 15/67 (22%), Positives = 32/67 (47%), Gaps = 2/67 (2%)

Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
             TSS  + + TT++      T+++    +  T++L   ++TT+ P S  T  +  +    
Sbjct: 1    MTSSTTTPANTTATVTVTVATSNTTSVSTNVTTAL--TASTTAEPGSVLTELLGIIIYCV 58

Query: 2016 TTSSPVS 2022
            +  S +S
Sbjct: 59   SGVSILS 65


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
            nuclear protein involved in bud-site selection. It plays
            a role in positioning the proximal bud pole signal. More
            recently it has been shown to be involved in ribosome
            biogenesis.
          Length = 424

 Score = 31.0 bits (70), Expect = 8.3
 Identities = 33/140 (23%), Positives = 52/140 (37%), Gaps = 11/140 (7%)

Query: 1893 ENTTTNSPE-SESTTTNNPESESTTTSSPESESTTTSS--------LVSESTTTSSPESE 1943
            E++  +  E SES   +  E  +   S  E E  + S         LV  S      E+ 
Sbjct: 165  ESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAP 224

Query: 1944 STT--TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
            S      + ESES  + S +SES + S  E  S  +  P+ + T+++ L S      S  
Sbjct: 225  SINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGS 284

Query: 2002 SESTTTISPVSESTTTSSPV 2021
             +       +        PV
Sbjct: 285  EDEDDDDEDIDPDQVVKKPV 304


>gnl|CDD|234665 PRK00145, PRK00145, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 223

 Score = 30.5 bits (69), Expect = 8.4
 Identities = 12/30 (40%), Positives = 20/30 (66%), Gaps = 4/30 (13%)

Query: 779 LRKLKKKEK----KLQEEQMEVVEENPVDP 804
           ++KL+ K K    KLQ+E M++ +E  V+P
Sbjct: 67  IKKLQAKYKNDPQKLQQEMMKLYKEKGVNP 96


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family belongs
            to the basic helix-loop-helix leucine zipper class of
            transcription factors, see pfam00010. Myc forms a
            heterodimer with Max, and this complex regulates cell
            growth through direct activation of genes involved in
            cell replication. Mutations in the C-terminal 20 residues
            of this domain cause unique changes in the induction of
            apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 31.1 bits (70), Expect = 8.8
 Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 1/84 (1%)

Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS-ESTTT 1967
            N  S+S+  +SP       +   S S++ S  ESE       E E      +V+ E   +
Sbjct: 195  NERSKSSKVASPTPRLGLRTPPNSSSSSGSDSESEEDEEEEEEEEEEEEIDVVTVEKRRS 254

Query: 1968 SSPESESTTTSSPESESTTTSSLV 1991
            SS    ST+ S         S LV
Sbjct: 255  SSNRKASTSESITVPSRRHHSPLV 278


>gnl|CDD|215592 PLN03126, PLN03126, Elongation factor Tu; Provisional.
          Length = 478

 Score = 31.1 bits (70), Expect = 8.8
 Identities = 20/70 (28%), Positives = 36/70 (51%), Gaps = 7/70 (10%)

Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
            S ++++SSL+  S+++SSP S + +  S        S  +   T +SS  S  +TT++  
Sbjct: 6    SAASSSSSLLLPSSSSSSPSSSTFSFKST-------SGKLKSLTLSSSFLSPFSTTTTST 58

Query: 1982 SESTTTSSLV 1991
            S+    S  V
Sbjct: 59   SQRRRRSFTV 68


>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus.  This family represents
            the C-terminus of eukaryotic neogenin precursor proteins,
            which contains several potential phosphorylation sites.
            Neogenin is a member of the N-CAM family of cell adhesion
            molecules (and therefore contains multiple copies of
            pfam00047 and pfam00041) and is closely related to the
            DCC tumour suppressor gene product - these proteins may
            play an integral role in regulating differentiation
            programmes and/or cell migration events within many adult
            and embryonic tissues.
          Length = 295

 Score = 30.7 bits (69), Expect = 9.0
 Identities = 42/259 (16%), Positives = 76/259 (29%), Gaps = 23/259 (8%)

Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
            SP    + T+ P   S   +   S + +         +    ESE + +S          
Sbjct: 23   SPHPNPSGTDTPIRSSQDITPVSSSAQSEPQSGQRRNSYRGHESEDSMSSLAARRGMRPK 82

Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
             ++   +    P   +    + ++       L S        +     ++ P+  ST T 
Sbjct: 83   MMIPMDSQPPQPVVSAHPIHTLDN-PQYPGILPSPRCGYLHHQF----SLRPMPFSTLTV 137

Query: 2019 SPVSESTTTISPESESTTT--SSPASESTTTNNPKSESTTTNNPASESITS----SSPAS 2072
                +        +ES  +   +P          +S +     P+    T+    + P  
Sbjct: 138  ----QRLYQHGDRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLK 193

Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT----SSPASESTTIE 2128
                 + PAS ST    P   ST   +    +    S ++ S  T     SP    T   
Sbjct: 194  SFAVPALPASMSTI--EPKLPSTPLLTQQGPTLPKHSVKTASVGTLGRARSPLLPVTVPS 251

Query: 2129 EQGVSPHSEKLSANEDPEE 2147
               V          ED +E
Sbjct: 252  APDVL--ETGGKMLEDTDE 268


>gnl|CDD|113413 pfam04642, DUF601, Protein of unknown function, DUF601.  This family
            represents a conserved region found in several
            uncharacterized plant proteins.
          Length = 311

 Score = 30.8 bits (69), Expect = 9.4
 Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 8/85 (9%)

Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
                      +P SES T S    ++T++  P+         +SE    S P S     E
Sbjct: 31   PSTLAGKNPDAPTSESRTPS----KATSSKDPSKRYADKKRKQSEKDARSPPRSSRPRTE 86

Query: 2129 EQGVSPHSEKLSANEDPEEFPNEDV 2153
            E+   P  +K    E  ++  ++D+
Sbjct: 87   EKDAGPSQQK----EKGKKGDSQDL 107


>gnl|CDD|227911 COG5624, TAF61, Transcription initiation factor TFIID, subunit TAF12
            (also component of histone acetyltransferase SAGA)
            [Transcription].
          Length = 505

 Score = 30.8 bits (69), Expect = 9.4
 Identities = 18/126 (14%), Positives = 28/126 (22%), Gaps = 13/126 (10%)

Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
            E++    P   + +    V        P           S    T          N  +S
Sbjct: 259  EASGMPPPAEWAGSNGLHVLPGRREEVPRGIFRCPSPESSRGEPTHLDYRNGMANNAQRS 318

Query: 2053 E-----STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
                  S    NP     ++  P              T T   A+     ++P       
Sbjct: 319  RFPGTCSIYPENPGKRWCSTKYPQP----LVHKGDRDTETGGCAAPDGGLATPG----RD 370

Query: 2108 SSPESE 2113
              P  E
Sbjct: 371  KGPLYE 376


>gnl|CDD|227625 COG5309, COG5309, Exo-beta-1,3-glucanase [Carbohydrate transport and
            metabolism].
          Length = 305

 Score = 30.6 bits (69), Expect = 9.6
 Identities = 16/47 (34%), Positives = 25/47 (53%)

Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
            S S+   S  S S   ++ +S S+  SS ASE +++SS  S S   +
Sbjct: 1    STSSMQFSSTSSSAALATLSSSSSALSSSASEVSSSSSRASASGFLA 47


>gnl|CDD|173135 PRK14672, uvrC, excinuclease ABC subunit C; Provisional.
          Length = 691

 Score = 31.2 bits (70), Expect = 9.6
 Identities = 26/114 (22%), Positives = 52/114 (45%), Gaps = 8/114 (7%)

Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
            +S+  +E   ++   ++ T T  P     T  +P+S + TT++P   ++  S+   +S  
Sbjct: 316  SSAGLAEHWLSHKAGTQCTVTLIPLHTFPTPQTPSS-TVTTNAPTLAASQNSNAVQDSGL 374

Query: 2097 TSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSE------KLSANED 2144
             S  +  ST  +  ++    T+S  +   T  E   +PH        +L+A+ED
Sbjct: 375  RSC-SETSTMHTLQKAHDACTASEGTRENTPHESAHTPHHRAILAMAQLNAHED 427


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 31.0 bits (70), Expect = 9.6
 Identities = 18/107 (16%), Positives = 32/107 (29%), Gaps = 1/107 (0%)

Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
             +  K + +     A  +   SS              ST+TSS +  +   +S A E   
Sbjct: 1    MSTSKPKGSAPRGFAPRTTVESSQKRIQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQ 60

Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDV 2153
                 +  +      +E   IE +       K S+    E    +  
Sbjct: 61   VDFQHNSESAVEEVEAE-DEIEVEQNQSDVLKSSSIVKEESISTDMD 106


>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator.  In
            eukaryotes, this family of proteins induces mitochondrial
            fission.
          Length = 248

 Score = 30.5 bits (69), Expect = 9.8
 Identities = 19/86 (22%), Positives = 31/86 (36%), Gaps = 4/86 (4%)

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
            +S  S+  ++      S+TTS  +S  T     E        P          + +ST+ 
Sbjct: 149  NSTTSDLLSSDESVPSSSTTSFPISPPT----EEPVLEVPPPPPPPPPPPPPSLQQSTSA 204

Query: 2028 ISPESESTTTSSPASESTTTNNPKSE 2053
            I    E     S A ++   + PKS 
Sbjct: 205  IDLIKERKGQRSAAGKTLVLSKPKSP 230


>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
            family.  Members of this protein family occur in
            Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
            related Mycoplasmas in small paralogous families that may
            also include truncated forms and/or pseudogenes. Members
            are predicted lipoproteins with a conserved signal
            peptidase II processing and lipid attachment site. Note
            that the name for certain characterized members, p72,
            reflects an anomalous apparent molecular weight, given a
            theoretical MW of about 61 kDa.
          Length = 541

 Score = 31.0 bits (70), Expect = 9.9
 Identities = 24/119 (20%), Positives = 35/119 (29%), Gaps = 11/119 (9%)

Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
            SI+  S  S STT+S+    S             ++P       +   SE+    S    
Sbjct: 15   SISFLSVVSCSTTSSN----SKQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNEN 70

Query: 2124 STTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDARE 2182
                      P S       DP +  N++  E    E    D   Q D+      D   
Sbjct: 71   KDEEN-----PSSTNPEKKPDPSK--NKEEIEKPKDEPKKPDKKPQADQPNNVHADQPN 122


>gnl|CDD|114648 pfam05937, EB1_binding, EB-1 Binding Domain.  This region at the
            C-terminus of the APC proteins binds the
            microtubule-associating protein EB-1. At the C-terminus
            of the alignment is also a pfam00595 binding domain. A
            short motif in the middle of the region appears to be
            found in the APC2 proteins.
          Length = 174

 Score = 30.2 bits (67), Expect = 10.0
 Identities = 20/97 (20%), Positives = 43/97 (44%), Gaps = 4/97 (4%)

Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
            NNP     T  +  +E T  SS  S S+  SSP        +P + + +     +++++ 
Sbjct: 74   NNPVPVQETNENSIAERTAFSS--SSSSKHSSPSGTVAARVTPFNYNPSPRKSNADNSSA 131

Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
               +  +   ++ +   + T S  ++S+ + SP+  S
Sbjct: 132  RPSQIPTPVNNNTKKRDSKTDS--TDSSGSQSPKRHS 166


>gnl|CDD|177952 PLN02318, PLN02318, phosphoribulokinase/uridine kinase.
          Length = 656

 Score = 31.0 bits (70), Expect = 10.0
 Identities = 35/172 (20%), Positives = 66/172 (38%), Gaps = 17/172 (9%)

Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS----ESTTT 2017
            S      S   E+ + +S +  +    S +S S +T   ++ S  T   V+    +   +
Sbjct: 429  SLDDDLVSSPKEALSRASADRRNKNLKSGLSHSYSTQRDKNLSKLTGLAVTNRRFDERNS 488

Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
             SP + +   I+  SE  ++ +   +  T+   +  S  +    S S  + +  +E+   
Sbjct: 489  ESPAALNQGAITQLSEQISSLNERMDEFTSRIEELNSKLSIKKNSPSQQNLALQAEACNG 548

Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
            S+P S   +     S            T +  P S S+  S  A ES  +EE
Sbjct: 549  SAPTSYFVSGLGNGS-----------LTGSILPLSSSS--SQLAKESPLMEE 587


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.129    0.379 

Gapped
Lambda     K      H
   0.267   0.0789    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 117,439,458
Number of extensions: 11348795
Number of successful extensions: 22542
Number of sequences better than 10.0: 1
Number of HSP's gapped: 16441
Number of HSP's successfully gapped: 1401
Length of query: 2435
Length of database: 10,937,602
Length adjustment: 113
Effective length of query: 2322
Effective length of database: 5,925,600
Effective search space: 13759243200
Effective search space used: 13759243200
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (29.5 bits)