RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy4257
         (419 letters)



>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
           unknown].
          Length = 394

 Score = 62.7 bits (152), Expect = 1e-10
 Identities = 82/264 (31%), Positives = 119/264 (45%), Gaps = 10/264 (3%)

Query: 6   NWLFLNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRITSLD 65
             L L++N L  L   L  L  L LL            ++     L SL  N NR+ S  
Sbjct: 50  LALNLSSNTLLLLPSSLSRLLSLDLLSPSGISSLDGSENLLNLLPLPSLDLNLNRLRSNI 109

Query: 66  GLLRGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNSSLSGLTKLAYLY 125
             L  LT L   ++D N IT +         NL  + L +N+I S+ S L  L  L  L 
Sbjct: 110 SELLELTNLTSLDLDNNNITDIPPLIGLLKSNLKELDLSDNKIESLPSPLRNLPNLKNLD 169

Query: 126 LSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQHNEIE 185
           LS N L++ L   +  L  L  +DLS NKI+      E  +       + EL L +N I 
Sbjct: 170 LSFNDLSD-LPKLLSNLSNLNNLDLSGNKISDLPPEIELLS------ALEELDLSNNSII 222

Query: 186 NLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSKTFLPAL 245
            L  +L  +  LS L+LS+NKL  + P+    L +L+ LD+S+N ++++       L  L
Sbjct: 223 ELLSSLSNLKNLSGLELSNNKLEDL-PESIGNLSNLETLDLSNNQISSISSLGS--LTNL 279

Query: 246 EELFVSHNSLTRLDKDFHGLPVLC 269
            EL +S NSL+        L +L 
Sbjct: 280 RELDLSGNSLSNALPLIALLLLLL 303



 Score = 62.7 bits (152), Expect = 1e-10
 Identities = 66/235 (28%), Positives = 116/235 (49%), Gaps = 11/235 (4%)

Query: 1   GLTDLNWLFLNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNR 60
            L  L+ L  +          L  L  L  L +  N+L +  S++   + L SL  +NN 
Sbjct: 68  RLLSLDLLSPSGISSLDGSENLLNLLPLPSLDLNLNRLRSNISELLELTNLTSLDLDNNN 127

Query: 61  ITSLDGLLRGLT-KLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNSSLSGLT 119
           IT +  L+  L   L+  ++  N+I  +     +NL NL ++ L  N ++ +   LS L+
Sbjct: 128 ITDIPPLIGLLKSNLKELDLSDNKIESLPS-PLRNLPNLKNLDLSFNDLSDLPKLLSNLS 186

Query: 120 KLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKL 179
            L  L LS N++++ L  +I  L  L  +DLS N I +        + +  + N+  L+L
Sbjct: 187 NLNNLDLSGNKISD-LPPEIELLSALEELDLSNNSIIEL------LSSLSNLKNLSGLEL 239

Query: 180 QHNEIENLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTL 234
            +N++E+L  ++  +  L  LDLS+N++ +IS      L +L+ LD+S N L+  
Sbjct: 240 SNNKLEDLPESIGNLSNLETLDLSNNQISSISS--LGSLTNLRELDLSGNSLSNA 292



 Score = 57.3 bits (138), Expect = 6e-09
 Identities = 55/186 (29%), Positives = 77/186 (41%), Gaps = 3/186 (1%)

Query: 2   LTDLNWLFLNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRI 61
           L +L  L L+ N L  L   L  LS L  L +  N++  LP +I+L S L  L  +NN I
Sbjct: 162 LPNLKNLDLSFNDLSDLPKLLSNLSNLNNLDLSGNKISDLPPEIELLSALEELDLSNNSI 221

Query: 62  TSLDGLLRGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNSSLSGLTKL 121
             L   L  L  L    +  N++  +  +   NL NL+++ L NNQI+S+ SSL  LT L
Sbjct: 222 IELLSSLSNLKNLSGLELSNNKLEDL-PESIGNLSNLETLDLSNNQISSI-SSLGSLTNL 279

Query: 122 AYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQH 181
             L LS N L+  L      L  L    L+     K          +             
Sbjct: 280 RELDLSGNSLSNALPLIALLLLLLEL-LLNLLLTLKALELKLNSILLNNNILSNGETSSP 338

Query: 182 NEIENL 187
             +  L
Sbjct: 339 EALSIL 344


>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat. 
          Length = 60

 Score = 55.6 bits (135), Expect = 2e-10
 Identities = 26/60 (43%), Positives = 38/60 (63%)

Query: 196 GLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSL 255
            L  LDLS+N+L  I    F GL +LK+LD+S N LT++   + + LP+L  L +S N+L
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 52.9 bits (128), Expect = 2e-09
 Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 1/57 (1%)

Query: 176 ELKLQHNEIENL-DGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLL 231
            L L +N +  + DGA  G+  L  LDLS N L +ISP+ F GL SL+ LD+S N L
Sbjct: 4   SLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 48.7 bits (117), Expect = 7e-08
 Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 1/60 (1%)

Query: 73  KLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNS-SLSGLTKLAYLYLSHNQL 131
            L+  ++  N++T++    F+ L NL  + L  N +TS++  + SGL  L  L LS N L
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 47.2 bits (113), Expect = 3e-07
 Identities = 24/60 (40%), Positives = 33/60 (55%), Gaps = 1/60 (1%)

Query: 97  NLDSISLQNNQITSMN-SSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKI 155
           NL S+ L NN++T +   +  GL  L  L LS N LT    +   GL  LR++DLS N +
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 46.8 bits (112), Expect = 3e-07
 Identities = 25/60 (41%), Positives = 38/60 (63%), Gaps = 1/60 (1%)

Query: 220 SLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLTRLDKD-FHGLPVLCKADLAHNNI 278
           +LK LD+S+N LT + + +   LP L+ L +S N+LT +  + F GLP L   DL+ NN+
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 43.7 bits (104), Expect = 4e-06
 Identities = 21/60 (35%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 50  QLGSLYANNNRITSL-DGLLRGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQI 108
            L SL  +NNR+T + DG  +GL  L+V ++  N +T +  + F  L +L S+ L  N +
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 34.1 bits (79), Expect = 0.009
 Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 2/59 (3%)

Query: 5  LNWLFLNNNRLKSLE-GQLGTLSKLQLLVIEQNQLEALPSD-IQLFSQLGSLYANNNRI 61
          L  L L+NNRL  +  G    L  L++L +  N L ++  +       L SL  + N +
Sbjct: 2  LKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60



 Score = 33.7 bits (78), Expect = 0.013
 Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 8/65 (12%)

Query: 245 LEELFVSHNSLTRLDKD-FHGLPVLCKADLAHNNIKAINIQLALKTQCQIFGLNSTLRIY 303
           L+ L +S+N LT +    F GLP L   DL+ NN+ +I+ +          GL S   + 
Sbjct: 2   LKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEA-------FSGLPSLRSLD 54

Query: 304 LEGNP 308
           L GN 
Sbjct: 55  LSGNN 59



 Score = 31.7 bits (73), Expect = 0.057
 Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 5/65 (7%)

Query: 120 KLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKL 179
            L  L LS+N+LT       +GL  L+ +DLS N +              G+ ++  L L
Sbjct: 1   NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSI-----SPEAFSGLPSLRSLDL 55

Query: 180 QHNEI 184
             N +
Sbjct: 56  SGNNL 60



 Score = 26.4 bits (59), Expect = 5.0
 Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 1  GLTDLNWLFLNNNRLKSLEGQ-LGTLSKLQLLVIEQNQL 38
          GL +L  L L+ N L S+  +    L  L+ L +  N L
Sbjct: 22 GLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60


>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
           kinase; Provisional.
          Length = 968

 Score = 58.3 bits (141), Expect = 4e-09
 Identities = 77/310 (24%), Positives = 133/310 (42%), Gaps = 29/310 (9%)

Query: 12  NNRLKSLEGQLGTLSKLQLLVIEQNQLEA-LPSDIQLFSQLGSLYANNNRITSL--DGLL 68
           NN    +   L +L +LQ+L +  N+    +P ++   + L  L  + N +T    +GL 
Sbjct: 318 NNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNLTVLDLSTNNLTGEIPEGLC 377

Query: 69  RG--LTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNSSLSGLTKLAYLY- 125
               L KL +F+   N +            +L  + LQ+N  +    S    TKL  +Y 
Sbjct: 378 SSGNLFKLILFS---NSLEGEIPKSLGACRSLRRVRLQDNSFSGELPS--EFTKLPLVYF 432

Query: 126 --LSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQHNE 183
             +S+N L   +      +  L+ + L+ NK   FG    G     G   +  L L  N+
Sbjct: 433 LDISNNNLQGRINSRKWDMPSLQMLSLARNKF--FG----GLPDSFGSKRLENLDLSRNQ 486

Query: 184 I-ENLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSKTFL 242
               +   L  +  L +L LS NKL    PD+      L  LD+SHN L+     S + +
Sbjct: 487 FSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEM 546

Query: 243 PALEELFVSHNSLT-RLDKDFHGLPVLCKADLAHNNIKAINIQLALKTQCQIFGLNSTLR 301
           P L +L +S N L+  + K+   +  L + +++HN++       +L +      +N++  
Sbjct: 547 PVLSQLDLSQNQLSGEIPKNLGNVESLVQVNISHNHLHG-----SLPSTGAFLAINASA- 600

Query: 302 IYLEGNPVLC 311
             + GN  LC
Sbjct: 601 --VAGNIDLC 608



 Score = 56.0 bits (135), Expect = 2e-08
 Identities = 71/250 (28%), Positives = 106/250 (42%), Gaps = 35/250 (14%)

Query: 11  NNNRLKSLEGQLGTLSKLQLLVIEQNQLEA-LPSDIQLFSQLGSLYANNNRITS-LDGLL 68
           NNN   S+    G++  L+ L +  N L   +P+DI  FS L  L    N +   +   L
Sbjct: 127 NNNFTGSIPR--GSIPNLETLDLSNNMLSGEIPNDIGSFSSLKVLDLGGNVLVGKIPNSL 184

Query: 69  RGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNN-----------QITSMN----- 112
             LT L+   +  NQ+      E   + +L  I L  N            +TS+N     
Sbjct: 185 TNLTSLEFLTLASNQLVGQIPRELGQMKSLKWIYLGYNNLSGEIPYEIGGLTSLNHLDLV 244

Query: 113 ---------SSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNE 163
                    SSL  L  L YL+L  N+L+  +   I  L++L ++DLS N ++  G   E
Sbjct: 245 YNNLTGPIPSSLGNLKNLQYLFLYQNKLSGPIPPSIFSLQKLISLDLSDNSLS--GEIPE 302

Query: 164 GKNQVQGVTNIFELKLQHNEIE-NLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLK 222
              Q+Q   N+  L L  N     +  AL  +  L  L L  NK     P +    ++L 
Sbjct: 303 LVIQLQ---NLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNLT 359

Query: 223 MLDISHNLLT 232
           +LD+S N LT
Sbjct: 360 VLDLSTNNLT 369



 Score = 49.1 bits (117), Expect = 4e-06
 Identities = 75/303 (24%), Positives = 114/303 (37%), Gaps = 28/303 (9%)

Query: 2   LTDLNWLFLNNNRLK-SLEGQLGTLSKLQLLVIEQNQLEA-LPSDIQLFSQLGSLYANNN 59
           L +L +LFL  N+L   +   + +L KL  L +  N L   +P  +     L  L+  +N
Sbjct: 259 LKNLQYLFLYQNKLSGPIPPSIFSLQKLISLDLSDNSLSGEIPELVIQLQNLEILHLFSN 318

Query: 60  RIT-SLDGLLRGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITS-MNSSLSG 117
             T  +   L  L +LQV  +  N+ +          +NL  + L  N +T  +   L  
Sbjct: 319 NFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNLTVLDLSTNNLTGEIPEGLCS 378

Query: 118 LTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGK-----------N 166
              L  L L  N L   +   +   + LR V L  N  +        K           N
Sbjct: 379 SGNLFKLILFSNSLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNN 438

Query: 167 QVQGVTNIFELKLQHNEIENLDGA----------LMGIHGLSRLDLSHNKLRTISPDDFI 216
            +QG   I   K     ++ L  A            G   L  LDLS N+     P    
Sbjct: 439 NLQG--RINSRKWDMPSLQMLSLARNKFFGGLPDSFGSKRLENLDLSRNQFSGAVPRKLG 496

Query: 217 GLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLT-RLDKDFHGLPVLCKADLAH 275
            L  L  L +S N L+       +    L  L +SHN L+ ++   F  +PVL + DL+ 
Sbjct: 497 SLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQ 556

Query: 276 NNI 278
           N +
Sbjct: 557 NQL 559



 Score = 40.6 bits (95), Expect = 0.002
 Identities = 66/258 (25%), Positives = 111/258 (43%), Gaps = 28/258 (10%)

Query: 51  LGSLYANNNRITSLD-----------GLLRGLTKLQVFNMDFNQIT-MVRRDEFQNLHNL 98
            G    N++R+ S+D             +  L  +Q  N+  NQ++  +  D F    +L
Sbjct: 61  QGITCNNSSRVVSIDLSGKNISGKISSAIFRLPYIQTINLSNNQLSGPIPDDIFTTSSSL 120

Query: 99  DSISLQNNQITSM--NSSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKIN 156
             ++L NN  T      S+  L  L    LS+N L+  + +DI     L+ +DL  N + 
Sbjct: 121 RYLNLSNNNFTGSIPRGSIPNLETLD---LSNNMLSGEIPNDIGSFSSLKVLDLGGNVLV 177

Query: 157 KFGTRNEGK--NQVQGVTNIFELKLQHNEIENLDGALMG-IHGLSRLDLSHNKLRTISPD 213
                  GK  N +  +T++  L L  N++       +G +  L  + L +N L    P 
Sbjct: 178 -------GKIPNSLTNLTSLEFLTLASNQLVGQIPRELGQMKSLKWIYLGYNNLSGEIPY 230

Query: 214 DFIGLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLT-RLDKDFHGLPVLCKAD 272
           +  GL SL  LD+ +N LT    +S   L  L+ LF+  N L+  +      L  L   D
Sbjct: 231 EIGGLTSLNHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSGPIPPSIFSLQKLISLD 290

Query: 273 LAHNNIKAINIQLALKTQ 290
           L+ N++     +L ++ Q
Sbjct: 291 LSDNSLSGEIPELVIQLQ 308


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 50.2 bits (120), Expect = 1e-06
 Identities = 20/58 (34%), Positives = 30/58 (51%), Gaps = 5/58 (8%)

Query: 329 TKIHGETICQPDSNETST-----TTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           T  +          ET++     T T T TTT  P+P  T++ TQ ST++TTT +P+ 
Sbjct: 786 TLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQ 843



 Score = 47.5 bits (113), Expect = 1e-05
 Identities = 15/37 (40%), Positives = 25/37 (67%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
            + T TTT++ +P  T +PT T+T  +TT++ +QT T
Sbjct: 810 VTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTT 846



 Score = 46.8 bits (111), Expect = 2e-05
 Identities = 19/39 (48%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 344 TSTTTTT-TTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           T T T T TTT++P PT   + T T  STT+TT+ + TT
Sbjct: 807 TKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTT 845



 Score = 44.9 bits (106), Expect = 7e-05
 Identities = 20/90 (22%), Positives = 29/90 (32%), Gaps = 14/90 (15%)

Query: 318 VIDAMETINNNTKIHGETICQPDSNETSTTTT-------------TTTTTTPEPTPAPTS 364
           ++D       +    G    +   + T T TT              TT+     T   T 
Sbjct: 755 IMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQ 814

Query: 365 TTTQRSTTSTTTQTPTTPIQEEYTETITLE 394
           TTT  S + T T +PT       T T   +
Sbjct: 815 TTTS-SPSPTQTTSPTQTSTSTTTTTSPSQ 843



 Score = 41.4 bits (97), Expect = 8e-04
 Identities = 14/35 (40%), Positives = 21/35 (60%), Gaps = 2/35 (5%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQT 378
           T+++ + T TT+P  T   T+TTT  S + TTT  
Sbjct: 816 TTSSPSPTQTTSPTQTSTSTTTTT--SPSQTTTGG 848



 Score = 39.8 bits (93), Expect = 0.002
 Identities = 23/83 (27%), Positives = 27/83 (32%), Gaps = 15/83 (18%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTT---------------TQRSTTSTTTQTPTTPIQE 385
              TS TT+ T T     +P  T TT               T    T T TQT TT    
Sbjct: 761 GTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSP 820

Query: 386 EYTETITLELPQPVETNNQIPVQ 408
             T+T +        T    P Q
Sbjct: 821 SPTQTTSPTQTSTSTTTTTSPSQ 843



 Score = 39.1 bits (91), Expect = 0.005
 Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 6/53 (11%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRST---TSTTTQTPTTPIQEEYTETITL 393
            +T+   T T T   T   + + TQ ++   TST+T T T+P Q   T T   
Sbjct: 800 ETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQ---TTTGGG 849



 Score = 28.7 bits (64), Expect = 7.3
 Identities = 14/75 (18%), Positives = 21/75 (28%), Gaps = 10/75 (13%)

Query: 343 ETSTTTTTTTTTTPEPTPA--------PTSTTTQRSTTS--TTTQTPTTPIQEEYTETIT 392
            +             PT          P S T    TTS   T + P +P + E T + +
Sbjct: 731 GSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYS 790

Query: 393 LELPQPVETNNQIPV 407
                 +       V
Sbjct: 791 AYSNTSILIETTSVV 805


>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein.  This family of
           trypanosomal proteins resemble vertebrate mucins. The
           protein consists of three regions. The N and C terminii
           are conserved between all members of the family, whereas
           the central region is not well conserved and contains a
           large number of threonine residues which can be
           glycosylated. Indirect evidence suggested that these
           genes might encode the core protein of parasite mucins,
           glycoproteins that were proposed to be involved in the
           interaction with, and invasion of, mammalian host cells.
           This family contains an N-terminal signal peptide.
          Length = 143

 Score = 46.0 bits (108), Expect = 4e-06
 Identities = 29/76 (38%), Positives = 38/76 (50%)

Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
           +       E   Q  +  T+TT  TTTTTT   T   T+TTT+ +TT+TTT T TT  + 
Sbjct: 32  DAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEA 91

Query: 386 EYTETITLELPQPVET 401
               T T E P   +T
Sbjct: 92  PSKNTTTSEAPTTTDT 107



 Score = 41.0 bits (95), Expect = 2e-04
 Identities = 25/58 (43%), Positives = 30/58 (51%)

Query: 349 TTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIP 406
           TTTTTTT  P    T+TTT  +T +TTT   TT      T T T E P    T ++ P
Sbjct: 45  TTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAP 102



 Score = 36.0 bits (82), Expect = 0.012
 Identities = 24/63 (38%), Positives = 30/63 (47%), Gaps = 5/63 (7%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVE 400
             ++ TTTTTTTTT P      T+TTT  +TT+ TT T  T      T T T     P +
Sbjct: 40  EGQSQTTTTTTTTTPPT-----TTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSK 94

Query: 401 TNN 403
              
Sbjct: 95  NTT 97



 Score = 34.8 bits (79), Expect = 0.026
 Identities = 21/52 (40%), Positives = 28/52 (53%)

Query: 324 TINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT 375
           T    T     TI    +  T+TTTTTTTTTT    P+  +TT++  TT+ T
Sbjct: 56  TTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDT 107



 Score = 34.5 bits (78), Expect = 0.034
 Identities = 23/62 (37%), Positives = 28/62 (45%)

Query: 320 DAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTP 379
               T    T     T     +  T TTTTT TTTT   T   T+TT   S  +TT++ P
Sbjct: 43  SQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAP 102

Query: 380 TT 381
           TT
Sbjct: 103 TT 104



 Score = 33.7 bits (76), Expect = 0.067
 Identities = 18/57 (31%), Positives = 23/57 (40%)

Query: 336 ICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
           +C     E             +     T+TTT   TT+TTT T TT I    T+T T
Sbjct: 21  VCATAQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTT 77



 Score = 32.5 bits (73), Expect = 0.17
 Identities = 20/39 (51%), Positives = 28/39 (71%), Gaps = 1/39 (2%)

Query: 344 TSTTTTTTTTTTPEPTPAPT-STTTQRSTTSTTTQTPTT 381
            +TTTTTTTTTT   T AP+ +TTT  + T+T T+ P++
Sbjct: 74  KTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPSS 112



 Score = 32.1 bits (72), Expect = 0.25
 Identities = 17/59 (28%), Positives = 21/59 (35%)

Query: 348 TTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIP 406
                      T   T+TTT  +TT+TTT T TT        T T        T  + P
Sbjct: 34  AVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92



 Score = 30.2 bits (67), Expect = 1.0
 Identities = 14/61 (22%), Positives = 18/61 (29%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVE 400
           +                 +   T+TTT    T+TTT T TT      T   T        
Sbjct: 25  AQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTT 84

Query: 401 T 401
           T
Sbjct: 85  T 85



 Score = 27.9 bits (61), Expect = 6.9
 Identities = 19/52 (36%), Positives = 23/52 (44%), Gaps = 7/52 (13%)

Query: 324 TINNNTKIHGETICQPDSNETSTTTTTTTTTTPEP-------TPAPTSTTTQ 368
           T    T              T+TTTTTTTTTT E        + APT+T T+
Sbjct: 57  TTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTR 108


>gnl|CDD|185285 PRK15387, PRK15387, E3 ubiquitin-protein ligase SspH2; Provisional.
          Length = 788

 Score = 48.2 bits (114), Expect = 5e-06
 Identities = 53/153 (34%), Positives = 76/153 (49%), Gaps = 23/153 (15%)

Query: 3   TDLNWLFLNNNRLKSLEGQLGTL-SKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRI 61
           ++L  L+  NN+L SL     TL S LQ L +  NQL +LP+   L S+L  L+A NNR+
Sbjct: 322 SELCKLWAYNNQLTSLP----TLPSGLQELSVSDNQLASLPT---LPSELYKLWAYNNRL 374

Query: 62  TSLDGLLRG----------LTKLQVFNMDFNQITMVRRDEFQNL----HNLDSISLQNNQ 107
           TSL  L  G          LT L V   +  ++ MV  +   +L      L S+S+  NQ
Sbjct: 375 TSLPALPSGLKELIVSGNRLTSLPVLPSELKEL-MVSGNRLTSLPMLPSGLLSLSVYRNQ 433

Query: 108 ITSMNSSLSGLTKLAYLYLSHNQLTEFLLDDIR 140
           +T +  SL  L+    + L  N L+E  L  +R
Sbjct: 434 LTRLPESLIHLSSETTVNLEGNPLSERTLQALR 466



 Score = 40.9 bits (95), Expect = 0.001
 Identities = 35/107 (32%), Positives = 57/107 (53%), Gaps = 13/107 (12%)

Query: 28  LQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRITSLDGLLRGLTKLQVFNMDFNQITMV 87
           LQ L +  NQL +LP+   L S+L  L+A NN++TSL  L  GL +L V +     +  +
Sbjct: 304 LQELSVSDNQLASLPA---LPSELCKLWAYNNQLTSLPTLPSGLQELSVSDNQLASLPTL 360

Query: 88  RRDEFQNLHNLDSISLQNNQITSMNSSLSGLTKLAYLYLSHNQLTEF 134
             + ++       +   NN++TS+ +  SGL +L    +S N+LT  
Sbjct: 361 PSELYK-------LWAYNNRLTSLPALPSGLKELI---VSGNRLTSL 397


>gnl|CDD|185268 PRK15370, PRK15370, E3 ubiquitin-protein ligase SlrP; Provisional.
          Length = 754

 Score = 44.3 bits (104), Expect = 9e-05
 Identities = 59/232 (25%), Positives = 98/232 (42%), Gaps = 49/232 (21%)

Query: 31  LVIEQNQLEALPSDIQLFSQLGSLYANNNRITSLDGLLRGLTKLQVFNMDFNQITMVRRD 90
           L+++ N+L++LP ++Q    + +LYAN+N++TS+   L     +Q   +  N+IT +   
Sbjct: 204 LILDNNELKSLPENLQ--GNIKTLYANSNQLTSIPATLP--DTIQEMELSINRITELPER 259

Query: 91  EFQNLHNLDSISLQNNQITSMNSSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDL 150
               L +LD   L +N+I+ +  +L                           + LR + +
Sbjct: 260 LPSALQSLD---LFHNKISCLPENLP--------------------------EELRYLSV 290

Query: 151 SYNKINKFGTRNEGKNQVQGVTNIFELKLQHNEIENLDGALMGIHGLSRLDLSHNKLRTI 210
             N I                + I  L +Q N +  L   L    GL  L+   N L T 
Sbjct: 291 YDNSIRTLPAHLP--------SGITHLNVQSNSLTALPETLP--PGLKTLEAGENAL-TS 339

Query: 211 SPDDFIGLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLTRLDKDF 262
            P        L++LD+S N +T L ET     P +  L VS N+LT L ++ 
Sbjct: 340 LPASLP--PELQVLDVSKNQITVLPET---LPPTITTLDVSRNALTNLPENL 386



 Score = 33.5 bits (76), Expect = 0.24
 Identities = 36/142 (25%), Positives = 60/142 (42%), Gaps = 28/142 (19%)

Query: 10  LNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRITSL-DGLL 68
           L+ NR+  L  +L   S LQ L +  N++  LP ++    +  S+Y N+  I +L   L 
Sbjct: 248 LSINRITELPERLP--SALQSLDLFHNKISCLPENLPEELRYLSVYDNS--IRTLPAHLP 303

Query: 69  RGLTKLQVFNMDFNQITMVRRDEFQNLHNLDS------------------ISLQNNQITS 110
            G+T L V +   N +T +       L  L++                  + +  NQIT 
Sbjct: 304 SGITHLNVQS---NSLTALPETLPPGLKTLEAGENALTSLPASLPPELQVLDVSKNQITV 360

Query: 111 MNSSLSGLTKLAYLYLSHNQLT 132
           +  +L     +  L +S N LT
Sbjct: 361 LPETLP--PTITTLDVSRNALT 380


>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies).  Leucine rich
           repeats are short sequence motifs present in a number of
           proteins with diverse functions and cellular locations.
           These repeats are usually involved in protein-protein
           interactions. Each Leucine Rich Repeat is composed of a
           beta-alpha unit. These units form elongated non-globular
           structures. Leucine Rich Repeats are often flanked by
           cysteine rich domains.
          Length = 43

 Score = 36.7 bits (86), Expect = 8e-04
 Identities = 18/40 (45%), Positives = 23/40 (57%), Gaps = 2/40 (5%)

Query: 119 TKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKF 158
           T L  L LS+NQ+T+  L  +  L  L T+DLS NKI   
Sbjct: 1   TNLETLDLSNNQITD--LPPLSNLPNLETLDLSGNKITDL 38



 Score = 34.4 bits (80), Expect = 0.006
 Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 96  HNLDSISLQNNQITSMNSSLSGLTKLAYLYLSHNQLTEF 134
            NL+++ L NNQIT +   LS L  L  L LS N++T+ 
Sbjct: 1   TNLETLDLSNNQITDL-PPLSNLPNLETLDLSGNKITDL 38



 Score = 31.3 bits (72), Expect = 0.070
 Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 197 LSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSK 239
           L  LDLS+N++  + P     L +L+ LD+S N +T L   S 
Sbjct: 3   LETLDLSNNQITDLPP--LSNLPNLETLDLSGNKITDLSPLSN 43



 Score = 30.5 bits (70), Expect = 0.100
 Identities = 14/42 (33%), Positives = 24/42 (57%), Gaps = 2/42 (4%)

Query: 72  TKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNS 113
           T L+  ++  NQIT +      NL NL+++ L  N+IT ++ 
Sbjct: 1   TNLETLDLSNNQITDL--PPLSNLPNLETLDLSGNKITDLSP 40



 Score = 30.5 bits (70), Expect = 0.11
 Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 50 QLGSLYANNNRITSLDGLLRGLTKLQVFNMDFNQIT 85
           L +L  +NN+IT L  L   L  L+  ++  N+IT
Sbjct: 2  NLETLDLSNNQITDLPPL-SNLPNLETLDLSGNKIT 36



 Score = 29.0 bits (66), Expect = 0.35
 Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 2/40 (5%)

Query: 220 SLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLTRLD 259
           +L+ LD+S+N +T L   S   LP LE L +S N +T L 
Sbjct: 2   NLETLDLSNNQITDLPPLSN--LPNLETLDLSGNKITDLS 39



 Score = 29.0 bits (66), Expect = 0.42
 Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 172 TNIFELKLQHNEIENLDGALMGIHGLSRLDLSHNKLRTISP 212
           TN+  L L +N+I +L   L  +  L  LDLS NK+  +SP
Sbjct: 1   TNLETLDLSNNQITDLP-PLSNLPNLETLDLSGNKITDLSP 40



 Score = 28.2 bits (64), Expect = 0.81
 Identities = 12/39 (30%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 5  LNWLFLNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPS 43
          L  L L+NN++  L   L  L  L+ L +  N++  L  
Sbjct: 3  LETLDLSNNQITDLPP-LSNLPNLETLDLSGNKITDLSP 40



 Score = 27.5 bits (62), Expect = 1.4
 Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 27 KLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRITSLDGL 67
           L+ L +  NQ+  LP  +     L +L  + N+IT L  L
Sbjct: 2  NLETLDLSNNQITDLPP-LSNLPNLETLDLSGNKITDLSPL 41



 Score = 25.9 bits (58), Expect = 5.4
 Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 1/40 (2%)

Query: 243 PALEELFVSHNSLTRLDKDFHGLPVLCKADLAHNNIKAIN 282
             LE L +S+N +T L      LP L   DL+ N I  ++
Sbjct: 1   TNLETLDLSNNQITDLP-PLSNLPNLETLDLSGNKITDLS 39


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 41.4 bits (97), Expect = 9e-04
 Identities = 16/84 (19%), Positives = 26/84 (30%), Gaps = 7/84 (8%)

Query: 332 HGETICQPDSNETSTTTTTTTTTTPE-------PTPAPTSTTTQRSTTSTTTQTPTTPIQ 384
             E I       ++    TT   +          TP   +T  +  ++S T +T     +
Sbjct: 27  QAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSE 86

Query: 385 EEYTETITLELPQPVETNNQIPVQ 408
              T T       P  T    PV+
Sbjct: 87  ATDTTTSEARTVTPAATETSKPVE 110



 Score = 33.7 bits (77), Expect = 0.21
 Identities = 19/71 (26%), Positives = 28/71 (39%), Gaps = 10/71 (14%)

Query: 317 AVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTT--TPEPTPAPTSTTTQRSTTST 374
              DA E  +N           P +  T+    +++ T  T +PT   T TTT  + T T
Sbjct: 48  VESDATEEADNTE--------TPVAATTAAEAPSSSETAETSDPTSEATDTTTSEARTVT 99

Query: 375 TTQTPTTPIQE 385
              T T+   E
Sbjct: 100 PAATETSKPVE 110



 Score = 32.5 bits (74), Expect = 0.54
 Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 1/75 (1%)

Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
             +T+    T  + D+ E +  T T    T       +S T + S  ++     TT    
Sbjct: 37  ATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEAR 96

Query: 386 EYTETITLELPQPVE 400
             T   T     PVE
Sbjct: 97  TVTPAATETSK-PVE 110



 Score = 29.1 bits (65), Expect = 5.6
 Identities = 17/69 (24%), Positives = 19/69 (27%), Gaps = 6/69 (8%)

Query: 341 SNETSTTTTTTTTTTPE-PTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPV 399
           SN          TTTP   T A  +T  +   T     T T         T   E P   
Sbjct: 21  SNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPV-----AATTAAEAPSSS 75

Query: 400 ETNNQIPVQ 408
           ET       
Sbjct: 76  ETAETSDPT 84


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 40.1 bits (94), Expect = 9e-04
 Identities = 15/44 (34%), Positives = 18/44 (40%)

Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
             S    TTT +T+T T        +TTT  S T T   T T  
Sbjct: 84  SSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVS 127



 Score = 37.0 bits (86), Expect = 0.009
 Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 5/56 (8%)

Query: 332 HGETICQPDSNETSTTTTTTTTTTPEP-----TPAPTSTTTQRSTTSTTTQTPTTP 382
              T     SN+T+T +T+T T+         T +P+ T T+ +TT+ +T+TPT  
Sbjct: 79  KLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTETPTEG 134



 Score = 36.6 bits (85), Expect = 0.011
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 6/69 (8%)

Query: 331 IHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
               T  +  +  +S +  TTT +T       T T+   STT TTT +P+    EE T T
Sbjct: 72  TTSATPPKLTTTSSSPSNDTTTAST------STKTSPTVSTTVTTTTSPSETDTEEATTT 125

Query: 391 ITLELPQPV 399
           ++ E P   
Sbjct: 126 VSTETPTEG 134



 Score = 33.9 bits (78), Expect = 0.10
 Identities = 16/57 (28%), Positives = 22/57 (38%)

Query: 329 TKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
           T     T   P  + T TTTT+ + T  E      ST T     S+    P+  + E
Sbjct: 92  TTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTETPTEGGSSAATDPSKNLLE 148



 Score = 32.8 bits (75), Expect = 0.23
 Identities = 18/65 (27%), Positives = 25/65 (38%), Gaps = 3/65 (4%)

Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYT---ETITLELP 396
           D  ++   TT+ T      T +  S  T  ++TST T    +      T   ET T E  
Sbjct: 64  DDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEAT 123

Query: 397 QPVET 401
             V T
Sbjct: 124 TTVST 128



 Score = 30.9 bits (70), Expect = 0.83
 Identities = 17/57 (29%), Positives = 24/57 (42%), Gaps = 4/57 (7%)

Query: 323 ETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTP 379
            T +++      T         + +TT TTTT+P  T    +TT    T ST T T 
Sbjct: 81  TTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATT----TVSTETPTE 133


>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
          Length = 486

 Score = 40.0 bits (93), Expect = 0.002
 Identities = 24/86 (27%), Positives = 36/86 (41%), Gaps = 11/86 (12%)

Query: 335 TICQPDSN--ETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETI- 391
           +I   D++  E  +T  T T TT   T +P  T T  ST +  T++ T   Q    ET  
Sbjct: 33  SIENSDNSTAEMQSTPATPTHTTSNLT-SPFGTGTDNSTNANGTESTTQASQPHSHETTI 91

Query: 392 -----TLELP--QPVETNNQIPVQDN 410
                 + +P  + V+ N    V  N
Sbjct: 92  TCTKSLISVPYYKSVDMNCTTSVGVN 117



 Score = 29.6 bits (66), Expect = 3.8
 Identities = 13/65 (20%), Positives = 21/65 (32%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVE 400
              TS++   +  +T E    P + T   S  ++   T T           T +  QP  
Sbjct: 27  GASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPHS 86

Query: 401 TNNQI 405
               I
Sbjct: 87  HETTI 91


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 39.8 bits (93), Expect = 0.002
 Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 1/76 (1%)

Query: 343 ETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETN 402
           + S  +        +P+  P +TTT  +TT+TTT  P T  + +  +   L L Q  E  
Sbjct: 233 QISINSENIQRINSQPSKRPNNTTTTTTTTTTTTFQPRT-RKRKSIDDHKLSLNQAPEKF 291

Query: 403 NQIPVQDNLEKEKQPD 418
                 D+  +    D
Sbjct: 292 KNNTKPDDDPQSDFSD 307



 Score = 34.4 bits (79), Expect = 0.11
 Identities = 16/51 (31%), Positives = 21/51 (41%), Gaps = 3/51 (5%)

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYT 388
           QP     +TTTTTTTTTT    P    T  ++S           P + +  
Sbjct: 247 QPSKRPNNTTTTTTTTTTTTFQPR---TRKRKSIDDHKLSLNQAPEKFKNN 294


>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
           Members of this family are the S-layer-forming
           halobacterial major cell surface glycoprotein. The
           highest scores below model cutoffs are fragmentary
           paralogs to actual members of the family. Modifications
           include at N-linked and O-linked glycosylation, a
           C-terminal diphytanylglyceryl modification, and probable
           cleavage of the PGF-CTERM tail.
          Length = 782

 Score = 39.9 bits (93), Expect = 0.003
 Identities = 20/48 (41%), Positives = 24/48 (50%), Gaps = 3/48 (6%)

Query: 338 QPD---SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
           +PD   S + +TTTT TTT   E T     TTT    T  TT   +TP
Sbjct: 713 RPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSSTP 760



 Score = 37.6 bits (87), Expect = 0.013
 Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 1/45 (2%)

Query: 335 TICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTP 379
           T  +  +  T+ TTT    TT    P  T T      T+T + TP
Sbjct: 717 TTSEDPTTTTTPTTTGPEETTETAEPTTT-TEEPTEETTTGSSTP 760



 Score = 35.6 bits (82), Expect = 0.051
 Identities = 13/51 (25%), Positives = 17/51 (33%)

Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
           D + T          T  P    +   T  +T +TT    TT   E  T T
Sbjct: 696 DGDNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTT 746



 Score = 34.9 bits (80), Expect = 0.077
 Identities = 19/47 (40%), Positives = 22/47 (46%), Gaps = 4/47 (8%)

Query: 334 ETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
            T   P +  T TTT    TT    T  PT TTT+  T  TTT + T
Sbjct: 717 TTSEDPTTTTTPTTTGPEETT---ETAEPT-TTTEEPTEETTTGSST 759



 Score = 33.7 bits (77), Expect = 0.19
 Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 7/65 (10%)

Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
           +N  +++ E + + +  +T+T+   TTTTTP  T  P  TT       T   T TT    
Sbjct: 698 DNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTT-GPEETTE------TAEPTTTTEEPT 750

Query: 386 EYTET 390
           E T T
Sbjct: 751 EETTT 755



 Score = 30.2 bits (68), Expect = 2.1
 Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 2/51 (3%)

Query: 318 VIDAMETINNNTKIHGETICQPDSNETSTTTTTTT--TTTPEPTPAPTSTT 366
           V++  E  +  T     T   P +     TT T    TTT EPT   T+ +
Sbjct: 707 VVEETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGS 757


>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
           This family consists of several MGC-24 (or Cd164
           antigen) proteins from eukaryotic organisms.
           MGC-24/CD164 is a sialomucin expressed in many normal
           and cancerous tissues. In humans, soluble and
           transmembrane forms of MGC-24 are produced by
           alternative splicing.
          Length = 187

 Score = 38.1 bits (88), Expect = 0.003
 Identities = 15/50 (30%), Positives = 23/50 (46%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
           S  T+T   T +T  P  TP+PT++    ++   T  T T   Q +   T
Sbjct: 109 SVATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTTVTPTSQPDRKST 158



 Score = 37.7 bits (87), Expect = 0.005
 Identities = 14/62 (22%), Positives = 24/62 (38%)

Query: 321 AMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
             E + +  +I+  T     +  T   T +T   T  P+P  +         + TT TPT
Sbjct: 91  QAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTTVTPT 150

Query: 381 TP 382
           + 
Sbjct: 151 SQ 152



 Score = 29.2 bits (65), Expect = 3.4
 Identities = 14/56 (25%), Positives = 22/56 (39%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPV 399
                TT + +    TP PT++T + + T + T +      E  T T      QP 
Sbjct: 99  CQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTTVTPTSQPD 154


>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2. 
          Length = 466

 Score = 38.8 bits (91), Expect = 0.004
 Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 9/70 (12%)

Query: 339 PDSNETSTTTTTTTTTTPEPTPA---PTSTTTQRSTTSTTTQTPTTPIQEEYTETITLEL 395
           P        +    TT+   TPA    T TT     + T    P       +T+   L+ 
Sbjct: 116 PIPPIPGDGSGFPITTSSTTTPAILDVTPTTRTVHVSRTQYNNPL------FTDPSVLQP 169

Query: 396 PQPVETNNQI 405
           PQP E +  +
Sbjct: 170 PQPAEVSGHV 179



 Score = 28.4 bits (64), Expect = 7.9
 Identities = 17/77 (22%), Positives = 27/77 (35%), Gaps = 5/77 (6%)

Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT---TQTPTTPIQEEYTETITLEL 395
            + +  S       TT    T    S++++ STT+TT   + TP  P+   Y   I L  
Sbjct: 366 VELDTYSDDLLLDETTEDFSTSQLVSSSSRTSTTNTTIPLSSTPDVPVY--YGPDIVLPE 423

Query: 396 PQPVETNNQIPVQDNLE 412
                    +P      
Sbjct: 424 SPGTTPIVPVPPDLPTV 440


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 38.9 bits (90), Expect = 0.004
 Identities = 16/46 (34%), Positives = 20/46 (43%), Gaps = 1/46 (2%)

Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
            +TT      T      P +T+ +RS  ST T  PTT  QE    T
Sbjct: 258 QSTTGDGQEHTQRRK-TPPATSNRRSPHSTATPPPTTKRQETGRPT 302



 Score = 32.7 bits (74), Expect = 0.32
 Identities = 11/60 (18%), Positives = 16/60 (26%), Gaps = 2/60 (3%)

Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQ 404
           S    + +   P  T +   +T         TQ   TP       +       P  T  Q
Sbjct: 239 SQRGPSGSPQHPPSTTSQDQST--TGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQ 296



 Score = 28.5 bits (63), Expect = 8.0
 Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 13/64 (20%)

Query: 347 TTTTTTTTTPEPT-----PAPTSTTTQ--------RSTTSTTTQTPTTPIQEEYTETITL 393
           T  TT+ TT  PT       P+  T Q        ++ T+    + T P+  + T T + 
Sbjct: 170 TAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSN 229

Query: 394 ELPQ 397
             PQ
Sbjct: 230 PEPQ 233



 Score = 28.1 bits (62), Expect = 9.1
 Identities = 21/85 (24%), Positives = 32/85 (37%), Gaps = 9/85 (10%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYT------ETITLE 394
           S E   T  TTT++ PEP   P  +  QR  + +    P+T  Q++ T       T   +
Sbjct: 215 STEPVGTQGTTTSSNPEPQTEPPPS--QRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRK 272

Query: 395 LPQPVETNNQIPVQDNLEKEKQPDH 419
            P P  +N + P             
Sbjct: 273 TP-PATSNRRSPHSTATPPPTTKRQ 296


>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160.  This
           family consists of several special lobe-specific silk
           protein SSP160 sequences which appear to be specific to
           Chironomus (Midge) species.
          Length = 758

 Score = 38.2 bits (88), Expect = 0.008
 Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 2/59 (3%)

Query: 317 AVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT 375
           A++   E    N    G       +N T+++TTTTTTTT   T APT+TTT+ +    T
Sbjct: 655 AILAMFEAAYQNCTAPGSVTVPAAANTTTSSTTTTTTTT--TTAAPTTTTTKAANAPFT 711



 Score = 36.7 bits (84), Expect = 0.024
 Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 8/78 (10%)

Query: 317 AVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTT 376
           A++ A+E   N +     T        +++ +T+ +TT    T    STT+  ST+S+ +
Sbjct: 268 AILAALEAFANGSASANST--------SNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSNS 319

Query: 377 QTPTTPIQEEYTETITLE 394
            T    I      TI L+
Sbjct: 320 STIAGCIDIAANFTIALQ 337



 Score = 33.2 bits (75), Expect = 0.27
 Identities = 17/66 (25%), Positives = 33/66 (50%)

Query: 315 MRAVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTST 374
           + A I A+++ + +   +  +     SN  STT+  +TT++   T    ST++  ST+S 
Sbjct: 91  ISANIQAIQSGSGSASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSG 150

Query: 375 TTQTPT 380
            T   +
Sbjct: 151 LTSGAS 156



 Score = 31.3 bits (70), Expect = 1.2
 Identities = 11/41 (26%), Positives = 24/41 (58%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           S   +++++  +T+    T +  STT+  STT+T+  T ++
Sbjct: 104 SASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSS 144



 Score = 30.1 bits (67), Expect = 2.7
 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 4/44 (9%)

Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
           P S        TTT++    T   T+TTT  + T+TTT+    P
Sbjct: 670 PGSVTVPAAANTTTSS----TTTTTTTTTTAAPTTTTTKAANAP 709



 Score = 28.6 bits (63), Expect = 6.9
 Identities = 12/28 (42%), Positives = 15/28 (53%)

Query: 354 TTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           T P     P +  T  S+T+TTT T TT
Sbjct: 668 TAPGSVTVPAAANTTTSSTTTTTTTTTT 695


>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 46 to 94 amino acids in length. This domain is
           found associated with pfam01462, pfam00560.
          Length = 122

 Score = 36.1 bits (83), Expect = 0.008
 Identities = 15/22 (68%), Positives = 17/22 (77%)

Query: 344 TSTTTTTTTTTTPEPTPAPTST 365
           T+TTTTTTTTT P  T  PT+T
Sbjct: 48  TTTTTTTTTTTMPTTTTLPTTT 69



 Score = 34.9 bits (80), Expect = 0.016
 Identities = 16/26 (61%), Positives = 18/26 (69%)

Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRS 370
           +TTTTTTTTTT  PT     TTT+ S
Sbjct: 47  TTTTTTTTTTTTMPTTTTLPTTTKMS 72



 Score = 32.6 bits (74), Expect = 0.12
 Identities = 22/69 (31%), Positives = 29/69 (42%), Gaps = 11/69 (15%)

Query: 325 INNNTKI--HGETICQPDSNETSTT---------TTTTTTTTPEPTPAPTSTTTQRSTTS 373
           I  N+ I         PDS + S +          TTTTTTT   T  PT+TT   +T  
Sbjct: 12  IQKNSGIVKDSSGYVDPDSAKCSGSGKPVRSIICPTTTTTTTTTTTTMPTTTTLPTTTKM 71

Query: 374 TTTQTPTTP 382
           +  + P  P
Sbjct: 72  SMVKVPLVP 80


>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
          Length = 328

 Score = 36.5 bits (84), Expect = 0.019
 Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 1/46 (2%)

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTST-TTQRSTTSTTTQTPTTP 382
            P   E +    +T       TPAP  T TT    T++  QT  TP
Sbjct: 182 TPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATP 227



 Score = 32.2 bits (73), Expect = 0.43
 Identities = 11/38 (28%), Positives = 18/38 (47%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
            ++T     T+TP P    T+   Q ++ + TT TP  
Sbjct: 192 VASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAA 229



 Score = 31.4 bits (71), Expect = 0.84
 Identities = 11/41 (26%), Positives = 14/41 (34%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQ 384
           T   T           PA TST   + T +T      +P Q
Sbjct: 182 TPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQ 222



 Score = 29.9 bits (67), Expect = 2.8
 Identities = 10/39 (25%), Positives = 13/39 (33%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
             T   T        T AP +T+T     + TT    T 
Sbjct: 180 AQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTA 218



 Score = 28.4 bits (63), Expect = 6.9
 Identities = 13/55 (23%), Positives = 19/55 (34%), Gaps = 8/55 (14%)

Query: 354 TTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIPVQ 408
            T +  P P + T +R+  +    +   P             P P ET    PVQ
Sbjct: 170 ATAKTEPKPVAQTPKRTEPAAPVASTKAP--------AATSTPAPKETATTAPVQ 216



 Score = 28.0 bits (62), Expect = 8.8
 Identities = 14/53 (26%), Positives = 18/53 (33%), Gaps = 5/53 (9%)

Query: 329 TKIHGETICQPDSNETSTTTTT-----TTTTTPEPTPAPTSTTTQRSTTSTTT 376
           T    E      S +    T+T     T TT P  T +P  TT   +    T 
Sbjct: 182 TPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTA 234


>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
           inhibitor (RI)-like subfamily. LRRs are 20-29 residue
           sequence motifs present in many proteins that
           participate in protein-protein interactions and have
           different functions and cellular locations. LRRs
           correspond to structural units consisting of a beta
           strand (LxxLxLxxN/CxL conserved pattern) and an alpha
           helix. This alignment contains 12 strands corresponding
           to 11 full repeats, consistent with the extent observed
           in the subfamily acting as Ran GTPase Activating
           Proteins (RanGAP1).
          Length = 319

 Score = 36.6 bits (85), Expect = 0.020
 Identities = 36/168 (21%), Positives = 55/168 (32%), Gaps = 17/168 (10%)

Query: 120 KLAYLYLSHNQLT----EFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGK----NQVQGV 171
            L  L L  N+L     E L   +R  + L+ ++L+ N I   G R   +    N    V
Sbjct: 138 ALEKLVLGRNRLEGASCEALAKALRANRDLKELNLANNGIGDAGIRALAEGLKANCNLEV 197

Query: 172 TNIFELKLQHNEIENLDGALMGIHGLSRLDLSHNKLR-----TISPDDFIGLDSLKMLDI 226
            ++    L       L   L  +  L  L+L  N L       ++        SL  L +
Sbjct: 198 LDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALASALLSPNISLLTLSL 257

Query: 227 SHNLLTTLE-ETSKTFLPA---LEELFVSHNSLTRLDKDFHGLPVLCK 270
           S N +T    +     L     L EL +  N             +L  
Sbjct: 258 SCNDITDDGAKDLAEVLAEKESLLELDLRGNKFGEEGAQLLAESLLEP 305



 Score = 33.5 bits (77), Expect = 0.17
 Identities = 44/212 (20%), Positives = 69/212 (32%), Gaps = 53/212 (25%)

Query: 26  SKLQLLVIEQNQLEALPSDI------QLFSQLGSLYANNNRITS-----LDGLLRGLTKL 74
           S LQ L +  N L      +       L   L  L    NR+       L   LR    L
Sbjct: 108 SSLQELKLNNNGLGDRGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANRDL 167

Query: 75  QVFNMDFNQITM--VRR--DEFQNLHNLDSISLQNNQITSMNSS-----LSGLTKLAYLY 125
           +  N+  N I    +R   +  +   NL+ + L NN +T   +S     L+ L  L  L 
Sbjct: 168 KELNLANNGIGDAGIRALAEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLN 227

Query: 126 LSHNQLTEF--------LLDDIRGLKRL-------------------------RTVDLSY 152
           L  N LT+         LL     L  L                           +DL  
Sbjct: 228 LGDNNLTDAGAAALASALLSPNISLLTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRG 287

Query: 153 NKINKFGTRNEGKNQVQGVTNIFELKLQHNEI 184
           NK  + G +   ++ ++    +  L ++ +  
Sbjct: 288 NKFGEEGAQLLAESLLEPGNELESLWVKDDSF 319


>gnl|CDD|218161 pfam04589, RFX1_trans_act, RFX1 transcription activation region.
           The RFX family is a family of winged-helix DNA binding
           proteins. RFX1 is a regulatory factor essential for
           expression of MHC class II genes. This region is to
           found N terminal to the RFX DNA binding region
           (pfam02257) in some mammalian RFX proteins, and is
           thought to activate transcription when associated with
           DNA. Deletion analysis has identified the region 233-351
           in human RFX1 as being required for maximal activation.
          Length = 150

 Score = 35.0 bits (80), Expect = 0.025
 Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 8/70 (11%)

Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPV 399
            S   S +  +    T  P  AP   + QRS    T+QT   P+Q+       L + +  
Sbjct: 3   TSEGGSDSPASVALQTSVPAQAPVPASQQRSVVQATSQTKGGPVQQ-------LPVHRVQ 55

Query: 400 ETNNQIP-VQ 408
           +   Q+  VQ
Sbjct: 56  QVPQQVQQVQ 65


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 35.9 bits (82), Expect = 0.032
 Identities = 19/89 (21%), Positives = 32/89 (35%), Gaps = 5/89 (5%)

Query: 321 AMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPT-----STTTQRSTTSTT 375
           A   +  NT  +  T  + ++     T     T   +P PAP+     + T   + T+T 
Sbjct: 267 AAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTN 326

Query: 376 TQTPTTPIQEEYTETITLELPQPVETNNQ 404
               +TP +   T T +          NQ
Sbjct: 327 NTNTSTPSKNTNTNTNSNTNTNSNTNANQ 355


>gnl|CDD|178695 PLN03150, PLN03150, hypothetical protein; Provisional.
          Length = 623

 Score = 35.6 bits (82), Expect = 0.056
 Identities = 29/109 (26%), Positives = 47/109 (43%), Gaps = 28/109 (25%)

Query: 124 LYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQHNE 183
           L L +  L  F+ +DI  L+ L++++LS N I        G                   
Sbjct: 423 LGLDNQGLRGFIPNDISKLRHLQSINLSGNSI-------RG------------------- 456

Query: 184 IENLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLT 232
             N+  +L  I  L  LDLS+N      P+    L SL++L+++ N L+
Sbjct: 457 --NIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503



 Score = 33.3 bits (76), Expect = 0.26
 Identities = 21/75 (28%), Positives = 36/75 (48%), Gaps = 1/75 (1%)

Query: 194 IHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHN 253
           +  L  ++LS N +R   P     + SL++LD+S+N        S   L +L  L ++ N
Sbjct: 441 LRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGN 500

Query: 254 SLT-RLDKDFHGLPV 267
           SL+ R+     G  +
Sbjct: 501 SLSGRVPAALGGRLL 515



 Score = 32.9 bits (75), Expect = 0.32
 Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 1/65 (1%)

Query: 90  DEFQNLHNLDSISLQNNQIT-SMNSSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTV 148
           ++   L +L SI+L  N I  ++  SL  +T L  L LS+N     + + +  L  LR +
Sbjct: 436 NDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRIL 495

Query: 149 DLSYN 153
           +L+ N
Sbjct: 496 NLNGN 500


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 35.3 bits (81), Expect = 0.057
 Identities = 17/46 (36%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 339 PDSNETSTT--TTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
             S ET+ T  TTT+  +T  P P+ T    Q  TT     TP  P
Sbjct: 215 TASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPP 260



 Score = 32.2 bits (73), Expect = 0.54
 Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 1/50 (2%)

Query: 334 ETICQPDSNET-STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
            T   P++  T STTT+  +TT P P+    +     +  +  T  P TP
Sbjct: 213 RTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262



 Score = 30.7 bits (69), Expect = 1.7
 Identities = 14/53 (26%), Positives = 20/53 (37%)

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
            P +   +  TT +  TTP P+   +  +T     STT   P      E   T
Sbjct: 204 VPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGT 256


>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein.  This is is a family of
           Papillomavirus proteins, E4, coded for by ORF4. A splice
           variant, E1--E4, exists but neither the function of E4
           or E1--E4 is known.
          Length = 95

 Score = 32.6 bits (75), Expect = 0.060
 Identities = 14/41 (34%), Positives = 22/41 (53%)

Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
           DS+ +STT+ ++  T    T  P + TT  S+ + T QT  
Sbjct: 45  DSDSSSTTSPSSPPTPQPTTECPWTVTTTGSSVTLTAQTKD 85


>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
          Length = 592

 Score = 35.4 bits (81), Expect = 0.060
 Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 1/73 (1%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVE 400
           + E   T T   TT P   P  T T  Q + T    Q     +++   E  T+  P    
Sbjct: 70  TKEAPATATPENTTEPTVEPKQTETKEQ-TKTPEEKQPAAKQVEKAPAEPATVSNPDNAT 128

Query: 401 TNNQIPVQDNLEK 413
           +++     + L+K
Sbjct: 129 SSSTPATYNLLQK 141


>gnl|CDD|132198 TIGR03154, sulfolob_CbsA, cytochrome b558/566, subunit A.  Members
           of this protein family are CbsA, one subunit of a highly
           glycosylated, heterodimeric, mono-heme cytochrome
           b558/566, found in Sulfolobus acidocaldarius and several
           other members of the Sulfolobales, a branch of the
           Crenarchaeota.
          Length = 465

 Score = 35.0 bits (80), Expect = 0.062
 Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 333 GETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           GET+   D + TS+  T    TTP   P  T+T+T  STT+++    TT
Sbjct: 395 GETLF--DKSITSSFLTLELVTTPPTPPTSTTTSTSPSTTTSSAIPSTT 441



 Score = 28.8 bits (64), Expect = 5.4
 Identities = 11/33 (33%), Positives = 18/33 (54%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTT 376
            +T  T  T+TT   +P+ T+++   STT   T
Sbjct: 413 VTTPPTPPTSTTTSTSPSTTTSSAIPSTTLYVT 445


>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp). 
          Length = 305

 Score = 34.7 bits (80), Expect = 0.067
 Identities = 15/53 (28%), Positives = 22/53 (41%), Gaps = 4/53 (7%)

Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQT 378
           +N T    ET C       + + TT  T +P PTP P+S      + +    T
Sbjct: 70  SNGTFSKTETRC----EADTPSPTTVATPSPSPTPVPSSPAVGNYSVTNGNGT 118



 Score = 28.9 bits (65), Expect = 4.4
 Identities = 10/40 (25%), Positives = 14/40 (35%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
           T +   + TT  P  T     T T  + T   T T    +
Sbjct: 7   TFSYNLSDTTLFPNATSKGVKTVTSSTDTKAPTNTTYRCV 46


>gnl|CDD|227281 COG4945, COG4945, Membrane-anchored protein predicted to be
           involved in regulation of amylopullulanase [Carbohydrate
           transport and metabolism].
          Length = 570

 Score = 34.9 bits (80), Expect = 0.081
 Identities = 14/63 (22%), Positives = 23/63 (36%), Gaps = 1/63 (1%)

Query: 343 ETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETN 402
             + T   + T T   T   T+T T+  TT+ T  T  T       + I    P+   + 
Sbjct: 489 YIAYTQGGSITLTWTQTYTLTNTVTEMRTTTGTMLTSETF-SSPILQPIPYLYPEADPST 547

Query: 403 NQI 405
             +
Sbjct: 548 LSV 550



 Score = 28.4 bits (63), Expect = 8.5
 Identities = 14/58 (24%), Positives = 19/58 (32%), Gaps = 4/58 (6%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTS---TTTQTPTTPIQEEYTETITLEL 395
              + T T  T T T   T     TTT    TS   ++      P      +  TL +
Sbjct: 494 QGGSITLTW-TQTYTLTNTVTEMRTTTGTMLTSETFSSPILQPIPYLYPEADPSTLSV 550


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 34.6 bits (80), Expect = 0.087
 Identities = 15/49 (30%), Positives = 19/49 (38%), Gaps = 1/49 (2%)

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTT-QTPTTPIQE 385
              S    T+TTT   TTP P     +T T   T +  T   P    Q+
Sbjct: 158 SGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQ 206



 Score = 33.1 bits (76), Expect = 0.24
 Identities = 12/45 (26%), Positives = 16/45 (35%)

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
           Q         T+TTT     P PA    TT  ++ +    T   P
Sbjct: 156 QNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAP 200



 Score = 28.8 bits (65), Expect = 5.6
 Identities = 8/45 (17%), Positives = 10/45 (22%)

Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQ 384
               T+        TTP  +  P   T               P Q
Sbjct: 170 TDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQ 214


>gnl|CDD|236782 PRK10871, nlpD, lipoprotein NlpD; Provisional.
          Length = 319

 Score = 34.0 bits (78), Expect = 0.12
 Identities = 15/43 (34%), Positives = 25/43 (58%), Gaps = 7/43 (16%)

Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           P++   +TT T   T       APT++TT+ + +ST+T TP +
Sbjct: 163 PNNKPAATTVTAPVT-------APTASTTEPTASSTSTSTPIS 198


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 33.8 bits (77), Expect = 0.14
 Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 8/63 (12%)

Query: 319 IDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQT 378
           ++A ++   +TK+       P S   +TT  +T    P+ TP     TT  ST   T Q+
Sbjct: 38  VEAPQSTPPSTKVEA-----PQSKPNATTPPSTKVEAPQQTPNA---TTPSSTKVETPQS 89

Query: 379 PTT 381
           PTT
Sbjct: 90  PTT 92



 Score = 28.8 bits (64), Expect = 4.7
 Identities = 22/75 (29%), Positives = 31/75 (41%), Gaps = 10/75 (13%)

Query: 339 PDSNETSTTTTTTTTTTPE-PTPAPTSTTTQRSTTSTTTQTP--TTPIQEEYTETITLEL 395
           P S +     +T  +T  E P   P +TT   +      QTP  TTP       +  +E 
Sbjct: 33  PSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTP------SSTKVET 86

Query: 396 PQPVETNNQIPVQDN 410
           PQ   T  Q+P + N
Sbjct: 87  PQS-PTTKQVPTEIN 100


>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
          Length = 1467

 Score = 34.3 bits (78), Expect = 0.16
 Identities = 11/66 (16%), Positives = 28/66 (42%), Gaps = 3/66 (4%)

Query: 331  IHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
            + G       + +   +  ++      PTP P+S+++  S+  + + +P   IQ     +
Sbjct: 1213 MSGTNTFTFKAAQLQLSAASSPPAATTPTPPPSSSSS--SSAQSISTSPGQ-IQIVLNGS 1269

Query: 391  ITLELP 396
             T+ + 
Sbjct: 1270 TTIHIN 1275


>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47. 
          Length = 627

 Score = 34.1 bits (77), Expect = 0.16
 Identities = 25/79 (31%), Positives = 35/79 (44%), Gaps = 11/79 (13%)

Query: 339 PDSNETSTTTTTTT-----TTTPEPTPAPTSTTTQRSTTSTTTQTPTTP-IQEEYTETIT 392
           P S  +ST+ + T+     T  P P P P +       T T   TP TP I  + TE+I 
Sbjct: 241 PSSTPSSTSASITSPHIPSTNIPTPEPPPVTKNFTELHTDTIKVTPNTPTITAQTTESIK 300

Query: 393 -----LELPQPVETNNQIP 406
                 + P+P+ T   IP
Sbjct: 301 KIVKRSDFPRPMYTPTDIP 319


>gnl|CDD|227655 COG5352, COG5352, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 169

 Score = 32.6 bits (74), Expect = 0.22
 Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 8/68 (11%)

Query: 352 TTTTPEPTPAPTSTTTQ------RSTTSTTTQTPTTPIQEEY-TETITLELPQPVETNNQ 404
             T        T++  +      +S   T T+T  T +Q E  T  +     +P E N  
Sbjct: 50  KPTAAPARQKKTTSAPRAPVRAAQSVRHTPTRTGATALQAEVATPVVAPRYIRPAE-NVV 108

Query: 405 IPVQDNLE 412
           +P+  +L 
Sbjct: 109 LPISRHLT 116


>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
          Length = 226

 Score = 32.5 bits (74), Expect = 0.28
 Identities = 12/73 (16%), Positives = 21/73 (28%), Gaps = 1/73 (1%)

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTS-TTTQTPTTPIQEEYTETITLELP 396
           Q    E S+T+ T    T  P  A          T+   T       ++    +     P
Sbjct: 111 QSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTP 170

Query: 397 QPVETNNQIPVQD 409
              + +    +Q 
Sbjct: 171 STPDAHVAGTLQA 183



 Score = 27.9 bits (62), Expect = 8.7
 Identities = 11/52 (21%), Positives = 16/52 (30%), Gaps = 1/52 (1%)

Query: 340 DSNETSTTTTTT-TTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
            S+  S  +         E   A  S   + S+TS T +  T P        
Sbjct: 87  PSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARD 138


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 32.7 bits (75), Expect = 0.28
 Identities = 16/78 (20%), Positives = 22/78 (28%), Gaps = 11/78 (14%)

Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPV 399
            + +   +T        +  P  T+   Q  T    T  P  P             P PV
Sbjct: 110 QTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQT-----------PAPV 158

Query: 400 ETNNQIPVQDNLEKEKQP 417
            T    PV   +E  K  
Sbjct: 159 RTQPAAPVTQAVEAPKVE 176



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 10/50 (20%), Positives = 15/50 (30%)

Query: 335 TICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQ 384
           T+      +      TT    P   P  T+   Q  T +     P  P+ 
Sbjct: 118 TVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVT 167



 Score = 30.0 bits (68), Expect = 2.2
 Identities = 16/77 (20%), Positives = 28/77 (36%), Gaps = 1/77 (1%)

Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQP 398
           P + +T     +T     +        TT +    T  +  T P+Q +    +  +   P
Sbjct: 106 PYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAP 165

Query: 399 VETNNQIPVQDNLEKEK 415
           V    + P  +  EKEK
Sbjct: 166 VTQAVEAPKVEA-EKEK 181


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 33.4 bits (76), Expect = 0.32
 Identities = 17/91 (18%), Positives = 26/91 (28%), Gaps = 13/91 (14%)

Query: 339  PDSNETSTTTTTTTTTTPEPTPAPTSTTT-----------QRSTTSTTTQTPTTPIQEEY 387
            P       T+   T   P P P P S              +R  + +    P  P +   
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV 2883

Query: 388  TETITLELPQPVETNNQIPVQDNLEKEKQPD 418
                   + +  E+    P  D  E+  QP 
Sbjct: 2884 RRLARPAVSRSTESFALPP--DQPERPPQPQ 2912


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 33.2 bits (75), Expect = 0.33
 Identities = 16/45 (35%), Positives = 23/45 (51%)

Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
           P S  TS T   T+ T    TP  TS TTQ+++ +    +PT  +
Sbjct: 488 PTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIV 532



 Score = 32.1 bits (72), Expect = 0.58
 Identities = 16/85 (18%), Positives = 27/85 (31%), Gaps = 4/85 (4%)

Query: 321 AMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
              +   N       +  P++   +T  T+ T     PTP     TT  ++  T T +  
Sbjct: 491 RTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVP 550

Query: 381 TPIQEEYTETITLELPQPVETNNQI 405
                + TE    E P        +
Sbjct: 551 NATSPQVTE----ESPVNNTNTPVV 571



 Score = 30.9 bits (69), Expect = 1.5
 Identities = 18/70 (25%), Positives = 26/70 (37%), Gaps = 2/70 (2%)

Query: 333 GETICQPDSNETSTTTTTTTTTTPEPT-PAPTSTTTQRSTTSTTTQTPTTPIQEEYTETI 391
           G       S     T+ T+ TT+  P   +PT   T  + TS TTQ  T+      + T 
Sbjct: 472 GTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQK-TSDTPNATSPTP 530

Query: 392 TLELPQPVET 401
            +       T
Sbjct: 531 IVIGVTTTAT 540



 Score = 30.9 bits (69), Expect = 1.6
 Identities = 24/83 (28%), Positives = 32/83 (38%), Gaps = 10/83 (12%)

Query: 327 NNTKIHGETICQPDSNETSTTTTTTTTTTPEPT------PAPTSTTTQRSTTSTTTQTPT 380
           NNT      +    S  TS  TT    T   PT      P+ + +T + ++TSTT     
Sbjct: 564 NNTNTP--VVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTT--PLL 619

Query: 381 TPIQEEYTETITLELPQPVETNN 403
           T       E IT E P    T +
Sbjct: 620 TSAHPTGGENITEETPSVPSTTH 642



 Score = 30.5 bits (68), Expect = 2.2
 Identities = 17/77 (22%), Positives = 23/77 (29%), Gaps = 1/77 (1%)

Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPT-PAPTSTTTQRSTTSTTTQTPTTPIQ 384
           N         I +  +N T+TT        P+ T       T        TT  PTTP  
Sbjct: 396 NPVADAKTLIITRTATNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSL 455

Query: 385 EEYTETITLELPQPVET 401
              +   T+    P   
Sbjct: 456 PPASTGPTVSTADPTSG 472


>gnl|CDD|212004 TIGR04281, peripla_PGF_1, putative ABC transporter
           PGF-CTERM-modified substrate-binding protein.  Members
           of this archaeal protein family resemble periplasmic
           substrate-binding proteins of ABC transporters and
           appear in gene neighborhoods with permease and
           ATP-binding cassette proteins. Notably, essentially all
           members also have the PGF-CTERM putative protein-sorting
           domain at the C-terminus, while more distant homologs
           (excluded by the trusted cutoff) instead have what
           appear to be lipoprotein signal peptides at the
           N-terminus.
          Length = 330

 Score = 32.3 bits (74), Expect = 0.39
 Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 9/59 (15%)

Query: 318 VIDAMETINNNTKIHGETICQPDSNET-STTTTTTTTTTPEPTPAPTSTTTQRSTTSTT 375
           V++A+ET+        E    P++ E   T  T   TT  + T      TT  ST + T
Sbjct: 253 VVEAVETL-------AEAF-HPEAYEEAETADTEADTTYDDSTTDAEDGTTTTSTGTAT 303


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
           Provisional.
          Length = 417

 Score = 32.3 bits (73), Expect = 0.47
 Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 5/83 (6%)

Query: 314 SMRAVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTP-EPTPAPTSTTTQRSTT 372
           ++ ++I +  T++  T    ET      +  ST T    T  P   +P+P S ++  S++
Sbjct: 7   TLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSSSSSSS 66

Query: 373 STTTQTPTTPIQEEYTETITLEL 395
           S+++ T       EY   IT EL
Sbjct: 67  SSSSIT----YDYEYENNITYEL 85



 Score = 28.1 bits (62), Expect = 10.0
 Identities = 15/58 (25%), Positives = 23/58 (39%), Gaps = 4/58 (6%)

Query: 342 NETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT-TQTPTTPIQEEYTETITLELPQP 398
           + +STT ++   ++   +P    T       S +   T T  I    TE  T E P P
Sbjct: 2   DNSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIP---TEIPTSESPSP 56


>gnl|CDD|221093 pfam11359, gpUL132, Glycoprotein UL132.  Glycoprotein UL132 is a
           low-abundance structural component of Human
           cytomegalovirus (HCMV). The function of this protein is
           not fully understood.
          Length = 235

 Score = 31.9 bits (72), Expect = 0.48
 Identities = 17/38 (44%), Positives = 19/38 (50%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           TS+TTT   TT        TS TT  ST  TT  T +T
Sbjct: 2   TSSTTTPANTTATVTVTVATSNTTSVSTNVTTALTAST 39



 Score = 28.1 bits (62), Expect = 8.6
 Identities = 12/45 (26%), Positives = 19/45 (42%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
           +   + TT T T T         ST    + T++TT  P + + E
Sbjct: 5   TTTPANTTATVTVTVATSNTTSVSTNVTTALTASTTAEPGSVLTE 49


>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
          Length = 295

 Score = 31.8 bits (72), Expect = 0.52
 Identities = 12/41 (29%), Positives = 16/41 (39%), Gaps = 2/41 (4%)

Query: 343 ETSTTTTTTTTT--TPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           ET  TT   TT    P    A TS  T ++  +  T +   
Sbjct: 3   ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKA 43


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 32.2 bits (74), Expect = 0.54
 Identities = 12/75 (16%), Positives = 24/75 (32%)

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQ 397
             +++  +  T     + P     P++  T+++ T+ +                T   P 
Sbjct: 368 IANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPT 427

Query: 398 PVETNNQIPVQDNLE 412
           P       P   NLE
Sbjct: 428 PPANAANAPPSLNLE 442


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 31.8 bits (71), Expect = 0.56
 Identities = 24/77 (31%), Positives = 30/77 (38%), Gaps = 7/77 (9%)

Query: 324 TINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTT--TQRSTTSTTTQTPTT 381
           TIN  TK+  + I       T   T T+T  T   T   +STT  T R T +TT     +
Sbjct: 99  TINVTTKVTAQNITA-----TEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLS 153

Query: 382 PIQEEYTETITLELPQP 398
                     T ELP  
Sbjct: 154 SKGTSNATKTTAELPTV 170



 Score = 30.3 bits (67), Expect = 1.4
 Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTP--TTPIQEEYTETIT 392
           S+  S    T TT    P+P+ +  +T +STT TTT  P  TT I    T T+T
Sbjct: 29  SSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVT 82



 Score = 29.9 bits (66), Expect = 2.0
 Identities = 19/49 (38%), Positives = 25/49 (51%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
           T+ +TT TTT+ P  T A  ST T   T++ TT TP        T  +T
Sbjct: 55  TNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVT 103



 Score = 29.1 bits (64), Expect = 3.8
 Identities = 16/37 (43%), Positives = 21/37 (56%)

Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
           T+TTT T+T TT  P P  ++ +T   TT  T Q  T
Sbjct: 76  TNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNIT 112


>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
          Length = 496

 Score = 32.2 bits (73), Expect = 0.59
 Identities = 11/61 (18%), Positives = 16/61 (26%), Gaps = 9/61 (14%)

Query: 328 NTKIHGETICQPDSNETSTTTTTTTTTTPEP---------TPAPTSTTTQRSTTSTTTQT 378
             K       +       +T       +P+P         T +P    T R  T  T Q 
Sbjct: 36  KPKASLHPSIKRSRPGRCSTNGAAVPESPKPSRRGRKKKPTSSPPKAKTTRRRTKKTDQE 95

Query: 379 P 379
            
Sbjct: 96  L 96


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 32.0 bits (72), Expect = 0.65
 Identities = 13/51 (25%), Positives = 26/51 (50%)

Query: 333 GETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
           G++   P +   S +TTTTT  +  P+      ++ R+  + +T + + PI
Sbjct: 271 GDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISGPI 321



 Score = 28.1 bits (62), Expect = 9.6
 Identities = 13/44 (29%), Positives = 19/44 (43%)

Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
                 STTTTT  ++ P     P S++    T ST + +   P
Sbjct: 279 TTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISGPIP 322


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 31.6 bits (71), Expect = 0.66
 Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 1/59 (1%)

Query: 335 TICQPDSNETSTTTTTTTTTTPEPT-PAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
            I    + +   + TTT +TTP+     P ST T +   ST + TP +P  ++    +T
Sbjct: 107 NISATPAPKQEQSQTTTESTTPKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDMT 165



 Score = 31.6 bits (71), Expect = 0.70
 Identities = 19/88 (21%), Positives = 35/88 (39%), Gaps = 12/88 (13%)

Query: 339 PDSNETSTTTTTTTTTTPEPT--------PAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
           P++NE  T+ +     +             A  +   ++S T+T + TP T +    T  
Sbjct: 80  PNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKV----TTP 135

Query: 391 ITLELPQPVETNNQIPVQDNLEKEKQPD 418
            +   PQP+++      Q    K+ Q D
Sbjct: 136 PSTNTPQPMQSTKSDTPQSPTIKQAQTD 163


>gnl|CDD|151471 pfam11024, DGF-1_4, Dispersed gene family protein 1 of Trypanosoma
           cruzi region 4.  This protein is likely to be highly
           expressed, and is expressed from the sub-telomeric
           region. However, the function is not known. Other
           domains on this protein include DGF-1_N, DGF-1_2, and
           DGF-1_5. This domain is just downstream from the
           C-terminus, but not the C-terminus of proteins, also
           annotated as being DGF-1, that constitute family
           DGF-1_C.
          Length = 75

 Score = 29.5 bits (66), Expect = 0.67
 Identities = 19/49 (38%), Positives = 24/49 (48%), Gaps = 5/49 (10%)

Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITL 393
           S T T T T +  PT  P+ +TT  S T      PT  +Q   TET+ L
Sbjct: 14  SHTATLTETRSLTPTWTPSMSTTHYSPTQ---YGPTETLQ--VTETVAL 57


>gnl|CDD|215633 PLN03210, PLN03210, Resistant to P. syringae 6; Provisional.
          Length = 1153

 Score = 31.8 bits (72), Expect = 0.85
 Identities = 66/261 (25%), Positives = 107/261 (40%), Gaps = 52/261 (19%)

Query: 3   TDLNWLFLNN-NRLKSLEGQLGTLSKLQLLVIEQ-NQLEALPSDIQLFSQLGSLYANNNR 60
           T+L  L L++ + L  L   +  L+KL+ L + +   LE LP+ I L     SLY  N  
Sbjct: 657 TNLETLKLSDCSSLVELPSSIQYLNKLEDLDMSRCENLEILPTGINL----KSLYRLN-- 710

Query: 61  ITSLDGLLRGLTKLQVF-----NMDFNQITMVRRDEF-QNLH--NLDSISLQNNQITSMN 112
                  L G ++L+ F     N+ +  +     +EF  NL   NLD + L   +   + 
Sbjct: 711 -------LSGCSRLKSFPDISTNISWLDLDETAIEEFPSNLRLENLDELILCEMKSEKLW 763

Query: 113 SSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRN---------- 162
             +  LT L  + LS + LT   L DI  L  L +   + +K+      N          
Sbjct: 764 ERVQPLTPLMTM-LSPS-LTRLFLSDIPSLVELPSSIQNLHKLEHLEIENCINLETLPTG 821

Query: 163 -----------EGKNQVQGV----TNIFELKLQHNEIENLDGALMGIHGLSRLDLSH-NK 206
                       G ++++      TNI +L L    IE +   +     LS LD++  N 
Sbjct: 822 INLESLESLDLSGCSRLRTFPDISTNISDLNLSRTGIEEVPWWIEKFSNLSFLDMNGCNN 881

Query: 207 LRTISPDDFIGLDSLKMLDIS 227
           L+ +S  +   L  L+ +D S
Sbjct: 882 LQRVSL-NISKLKHLETVDFS 901


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 31.4 bits (71), Expect = 0.85
 Identities = 13/56 (23%), Positives = 22/56 (39%), Gaps = 9/56 (16%)

Query: 344 TSTTTTTTTTTTPEPTP--APTSTTTQRSTTST-------TTQTPTTPIQEEYTET 390
           +   T T     P  TP  APT T +  ++ ++       +     +P +E  T T
Sbjct: 75  SKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATAT 130



 Score = 30.2 bits (68), Expect = 1.9
 Identities = 10/65 (15%), Positives = 19/65 (29%), Gaps = 3/65 (4%)

Query: 329 TKIHGETICQPDSNETSTTTTTTTT---TTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
            K     +  P +  TS  T T +         + AP S   ++S +  +          
Sbjct: 77  PKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPS 136

Query: 386 EYTET 390
               +
Sbjct: 137 TSVPS 141


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 31.3 bits (71), Expect = 1.0
 Identities = 14/87 (16%), Positives = 19/87 (21%), Gaps = 4/87 (4%)

Query: 317 AVIDAMETINNNTKIHGETICQPDSNE----TSTTTTTTTTTTPEPTPAPTSTTTQRSTT 372
           AVI+A+       +    T   P         ST           P      T T     
Sbjct: 354 AVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVP 413

Query: 373 STTTQTPTTPIQEEYTETITLELPQPV 399
                 P     E   +     +P   
Sbjct: 414 PRPVAPPVPHTPESAPKLTRAAIPVDE 440


>gnl|CDD|177958 PLN02324, PLN02324, triacylglycerol lipase.
          Length = 415

 Score = 30.8 bits (69), Expect = 1.4
 Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 8/62 (12%)

Query: 265 LPVLCKADLAHNNIKAINIQLALKTQCQIFGLNSTLRIYLEGNPVLCDDSMRAVIDAMET 324
           + VL  ADL +     INI L  K           + ++  G+P + D + + ++D+++ 
Sbjct: 228 MSVLSAADLVYGKKNKINISLQKKQVP--------ITVFAFGSPRIGDHNFKNLVDSLQP 279

Query: 325 IN 326
           +N
Sbjct: 280 LN 281


>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
          Length = 171

 Score = 30.1 bits (68), Expect = 1.4
 Identities = 8/44 (18%), Positives = 14/44 (31%), Gaps = 1/44 (2%)

Query: 341 SNETSTTTTTTTT-TTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
             +T T TT + +      TP       +   T+        P+
Sbjct: 74  PVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAAREVKTVVPV 117


>gnl|CDD|224664 COG1750, COG1750, Archaeal serine proteases [General function
           prediction only].
          Length = 579

 Score = 30.9 bits (70), Expect = 1.4
 Identities = 27/106 (25%), Positives = 36/106 (33%), Gaps = 15/106 (14%)

Query: 280 AINIQLALKTQCQIFGLNSTLRIYLEGNPVLCDDSMRAV--IDAMETINNNTKIHGETIC 337
            I I LA  +  Q     +        N  L D    A    DA E       + GE + 
Sbjct: 468 FIKIGLAENSGDQPVLALAY--YEYAENGQLQDTINAAAYYKDASEL----ALLMGEVVS 521

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
              S ET ++   + TT         ST T  +T   T Q  +T I
Sbjct: 522 STGSKETISSPIYSATTQT-------STATVTTTIENTVQLGSTAI 560


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 30.5 bits (68), Expect = 1.6
 Identities = 18/101 (17%), Positives = 33/101 (32%), Gaps = 2/101 (1%)

Query: 319 IDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTS-TTTQ 377
           I+    +      HG       +  ++ T ++  T +    P+P    +  +T S TTTQ
Sbjct: 284 IEKATGLKQ-IDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQ 342

Query: 378 TPTTPIQEEYTETITLELPQPVETNNQIPVQDNLEKEKQPD 418
                +         + LP  V      PV    +     +
Sbjct: 343 ASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTE 383


>gnl|CDD|234797 PRK00575, tatA, twin arginine translocase protein A; Provisional.
          Length = 92

 Score = 28.6 bits (64), Expect = 1.7
 Identities = 6/40 (15%), Positives = 13/40 (32%)

Query: 342 NETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           ++     +      P     PT   +QR   +  +   +T
Sbjct: 48  SDNKAEASAAAAQAPYQVATPTPVQSQRVDPAAASGQDST 87


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 14/88 (15%), Positives = 28/88 (31%)

Query: 330 KIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTE 389
               +   +P   + +     +      P+P+ +S   Q S   + TQ   TP       
Sbjct: 372 GRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDP 431

Query: 390 TITLELPQPVETNNQIPVQDNLEKEKQP 417
              + +  P      +      E++K P
Sbjct: 432 PAAVPVNPPSTAPQAVRPAQFKEEKKIP 459


>gnl|CDD|197688 smart00370, LRR, Leucine-rich repeats, outliers. 
          Length = 24

 Score = 26.5 bits (60), Expect = 2.2
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 197 LSRLDLSHNKLRTISPDDF 215
           L  LDLS+N+L ++ P  F
Sbjct: 4   LRELDLSNNQLSSLPPGAF 22


>gnl|CDD|197687 smart00369, LRR_TYP, Leucine-rich repeats, typical (most populated)
           subfamily. 
          Length = 24

 Score = 26.5 bits (60), Expect = 2.2
 Identities = 9/19 (47%), Positives = 13/19 (68%)

Query: 197 LSRLDLSHNKLRTISPDDF 215
           L  LDLS+N+L ++ P  F
Sbjct: 4   LRELDLSNNQLSSLPPGAF 22


>gnl|CDD|216233 pfam00997, Casein_kappa, Kappa casein.  Kappa-casein is a mammalian
           milk protein involved in a number of important
           physiological processes. In the gut, the ingested
           protein is split into an insoluble peptide (para
           kappa-casein) and a soluble hydrophilic glycopeptide
           (caseinomacropeptide). Caseinomacropeptide is
           responsible for increased efficiency of digestion,
           prevention of neonate hypersensitivity to ingested
           proteins, and inhibition of gastric pathogens.
          Length = 161

 Score = 29.1 bits (65), Expect = 3.2
 Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 7/54 (12%)

Query: 343 ETSTTTTTTTTTTPEPTPAPTS-------TTTQRSTTSTTTQTPTTPIQEEYTE 389
           + +   T  T  T EPTP PT+        T + S+    +   TT +     E
Sbjct: 107 DKTAIPTINTIATVEPTPVPTTEPTVNTVVTPEASSEFIISTPETTTVAVTSPE 160


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 29.5 bits (66), Expect = 3.4
 Identities = 10/53 (18%), Positives = 15/53 (28%), Gaps = 3/53 (5%)

Query: 358 PTPAPTSTTTQRSTTSTTTQ---TPTTPIQEEYTETITLELPQPVETNNQIPV 407
           P P P S++   S +         P           +    P  VE     P+
Sbjct: 386 PIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPL 438


>gnl|CDD|132431 TIGR03390, ascorbOXfungal, L-ascorbate oxidase, fungal type.  This
           model describes a family of fungal ascorbate oxidases,
           within a larger family of multicopper oxidases that also
           includes plant ascorbate oxidases (TIGR03388), plant
           laccases and laccase-like proteins (TIGR03389), and
           related proteins. The member from Acremonium sp. HI-25
           is characterized.
          Length = 538

 Score = 29.8 bits (67), Expect = 3.5
 Identities = 19/105 (18%), Positives = 32/105 (30%), Gaps = 10/105 (9%)

Query: 305 EGNPVLCDDSMRAVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTS 364
           +    LC    R      ET +      G  + +  S++ S   +      PE  P P  
Sbjct: 264 KTEDELCGGDKRQYFIQFETRDRPKVYRGYAVLRYRSDKASKLPSV-----PETPPLPLP 318

Query: 365 TTTQRSTTST-----TTQTPTTPIQEEYTETITLELPQPVETNNQ 404
            +T                   P  +E T  + ++  Q V+  N 
Sbjct: 319 NSTYDWLEYELEPLSEENNQDFPTLDEVTRRVVIDAHQNVDPLNG 363


>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein.  This is a viral
           attachment glycoprotein from region G of metaviruses. It
           is high in serine and threonine suggesting it is highly
           glycosylated.
          Length = 183

 Score = 28.8 bits (64), Expect = 3.5
 Identities = 15/54 (27%), Positives = 21/54 (38%)

Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQP 398
           S T TT      + T   + T T  +T++   QT TT       +  T    QP
Sbjct: 106 SRTQTTQAVHEKKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGKEPTTTSVQP 159



 Score = 27.7 bits (61), Expect = 9.3
 Identities = 17/48 (35%), Positives = 26/48 (54%)

Query: 343 ETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
           + +T TT+ T T P  + A   TTT  +T+ST  +  TT +Q   + T
Sbjct: 117 KKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGKEPTTTSVQPRSSAT 164


>gnl|CDD|177577 PHA03292, PHA03292, envelope glycoprotein I; Provisional.
          Length = 413

 Score = 29.5 bits (66), Expect = 3.6
 Identities = 15/60 (25%), Positives = 22/60 (36%)

Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQP 398
           P      TT+T + + + +P  A T+T T    T  TT  P           +     QP
Sbjct: 194 PRYLNAVTTSTYSRSMSSQPAGAATATPTPTLDTGLTTVAPPNETVVTGETALLCHWFQP 253


>gnl|CDD|215541 PLN03020, PLN03020, low-temperature-induced protein; Provisional.
          Length = 556

 Score = 29.3 bits (65), Expect = 4.8
 Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 2/38 (5%)

Query: 355 TPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
           +PEP   P  T +     ST+        Q+ YTE I+
Sbjct: 285 SPEPLS-PAETKSTEDPQSTS-NAEKPSNQKTYTEKIS 320


>gnl|CDD|113196 pfam04415, DUF515, Protein of unknown function (DUF515).  Family of
           hypothetical Archaeal proteins.
          Length = 416

 Score = 29.1 bits (65), Expect = 4.8
 Identities = 11/54 (20%), Positives = 29/54 (53%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLE 394
           S+ +ST+++ +++T+  P  A    + +    S+T+Q+ +      Y+ ++ L 
Sbjct: 290 SSSSSTSSSESSSTSYSPGDASIQNSQRSQLQSSTSQSESESASSSYSYSVNLP 343


>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
           This family consists of several Nucleopolyhedrovirus
           capsid protein P87 sequences. P87 is expressed late in
           infection and concentrated in infected cell nuclei.
          Length = 606

 Score = 29.1 bits (65), Expect = 5.1
 Identities = 18/99 (18%), Positives = 34/99 (34%), Gaps = 2/99 (2%)

Query: 313 DSMRAVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRS-- 370
           DS+  V +  +T+        +T     S +T   T     +TP      +  T  ++  
Sbjct: 248 DSVDDVRNFAKTVWRQKNTPAQTPMPERSWQTPAQTPARRISTPMTEEIKSWQTPLQTPA 307

Query: 371 TTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIPVQD 409
             S+  Q P       + E +    P  +   + +P  D
Sbjct: 308 MYSSDYQAPKPEPIYTWEELLRERFPSDLFAISSLPDSD 346


>gnl|CDD|219426 pfam07489, Tir_receptor_C, Translocated intimin receptor (Tir)
           C-terminus.  Intimin and its translocated intimin
           receptor (Tir) are bacterial proteins that mediate
           adhesion between mammalian cells and attaching and
           effacing (A/E) pathogens. A unique and essential feature
           of A/E bacterial pathogens is the formation of
           actin-rich pedestals beneath the intimately adherent
           bacteria and localised destruction of the intestinal
           brush border. The bacterial outer membrane adhesin,
           intimin, is necessary for the production of the A/E
           lesion and diarrhoea. The A/E bacteria translocate their
           own receptor for intimin, Tir, into the membrane of
           mammalian cells using the type III secretion system. The
           translocated Tir triggers additional host signalling
           events and actin nucleation, which are essential for
           lesion formation. This family represents the Tir
           C-terminal domain which has been reported to bind
           uninfected host cells and beta-1 integrins although the
           role of intimin binding to integrins is unclear. This
           intimin C-terminal domain has also been shown to be
           sufficient for Tir recognition.
          Length = 222

 Score = 28.7 bits (64), Expect = 5.3
 Identities = 16/38 (42%), Positives = 21/38 (55%), Gaps = 1/38 (2%)

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT 375
           QP   +T+TTTTTTTTT       P + T  +  T T+
Sbjct: 55  QP-VEQTTTTTTTTTTTHTTVENKPANNTPAQGNTDTS 91


>gnl|CDD|223714 COG0641, AslB, Arylsulfatase regulator (Fe-S oxidoreductase)
           [General function prediction only].
          Length = 378

 Score = 28.9 bits (65), Expect = 5.8
 Identities = 11/32 (34%), Positives = 19/32 (59%)

Query: 63  SLDGLLRGLTKLQVFNMDFNQITMVRRDEFQN 94
           + D +++GL  LQ   +DFN +T+V R    +
Sbjct: 140 TFDRVMKGLELLQAHGVDFNTLTVVNRQNVLH 171


>gnl|CDD|223259 COG0181, HemC, Porphobilinogen deaminase [Coenzyme metabolism].
          Length = 307

 Score = 28.7 bits (65), Expect = 6.1
 Identities = 8/32 (25%), Positives = 15/32 (46%)

Query: 185 ENLDGALMGIHGLSRLDLSHNKLRTISPDDFI 216
              D  ++   GL RL L +     + P++F+
Sbjct: 160 GEYDAIILAAAGLKRLGLENRITEILDPEEFL 191


>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region.  The
           defective chorion-1 gene (dec-1) in Drosophila encodes
           follicle cell proteins necessary for proper eggshell
           assembly. Multiple products of the dec-1 gene are formed
           by alternative RNA splicing and proteolytic processing.
           Cleavage products include S80 (80 kDa) which is
           incorporated into the eggshell, and further proteolysis
           of S80 gives S60 (60 kDa).
          Length = 407

 Score = 28.6 bits (63), Expect = 6.1
 Identities = 16/50 (32%), Positives = 19/50 (38%)

Query: 360 PAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIPVQD 409
           PAP  T  Q S  S     P  P  +E    I   L    E   Q+ V+D
Sbjct: 198 PAPAGTDAQASDISEVRVRPEDPYSQEAQMKIKSALEMEQERQQQVQVKD 247


>gnl|CDD|185693 cd08023, GH16_laminarinase_like, Laminarinase, member of the
           glycosyl hydrolase family 16.  Laminarinase, also known
           as glucan endo-1,3-beta-D-glucosidase, is a glycosyl
           hydrolase family 16 member that hydrolyzes
           1,3-beta-D-glucosidic linkages in 1,3-beta-D-glucans
           such as laminarins, curdlans, paramylons, and pachymans,
           with very limited action on mixed-link
           (1,3-1,4-)-beta-D-glucans.
          Length = 235

 Score = 28.4 bits (64), Expect = 6.6
 Identities = 9/39 (23%), Positives = 18/39 (46%), Gaps = 2/39 (5%)

Query: 319 IDAMETINNNT-KIHGETICQPDSNETSTTTTTTTTTTP 356
           ID ME + N    ++G T+    +N+ +  +  + T   
Sbjct: 116 IDIMEYVGNEPNTVYG-TLHGGATNDGNNGSGGSYTLPT 153


>gnl|CDD|221145 pfam11596, DUF3246, Protein of unknown function (DUF3246).  This is
           a small family of fungal proteins one of whose members
           from Pichia stipitis is described as being an extremely
           serine rich protein-mucin-like protein.
          Length = 208

 Score = 28.2 bits (62), Expect = 6.7
 Identities = 15/48 (31%), Positives = 20/48 (41%)

Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEY 387
           D +ET+T TTT  T         T  T   +  +T T  PT    + Y
Sbjct: 161 DCDETTTYTTTYYTPYTTVIHGGTVYTNGVTVIATHTVYPTDVEDDGY 208


>gnl|CDD|161659 TIGR00005, rluA_subfam, pseudouridine synthase, RluA family.  In E.
           coli, RluD (SfhB) modifies uridine to pseudouridine at
           23S RNA U1911, 1915, and 1917, RluC modifies 955, 2504
           and 2580, and RluA modifies U746 and tRNA U32. An
           additional homolog from E. coli outside this family,
           TruC (SP|Q46918), modifies uracil-65 in transfer RNAs to
           pseudouridine [Protein synthesis, tRNA and rRNA base
           modification].
          Length = 299

 Score = 28.4 bits (64), Expect = 6.7
 Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 3/33 (9%)

Query: 257 RLDKDFHGLPVLCKADLAHNNIKAINIQLALKT 289
           RLD+D  GL V+ K  LA   ++    QL  +T
Sbjct: 125 RLDRDTSGLMVVAKTPLALRELQR---QLKNRT 154


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 28.5 bits (63), Expect = 7.1
 Identities = 24/86 (27%), Positives = 32/86 (37%), Gaps = 11/86 (12%)

Query: 333 GETICQPDSNETSTTTTTTTTTTP----EPTPAPTSTTTQRSTTSTTTQTPTTPIQE-EY 387
           GE     ++    TT    TTT+P    +PT  P        TT       TTP  + EY
Sbjct: 184 GEEESVTEAEAEGTTVAGPTTTSPNGGFQPTTPPQEVYG---TTDPPFGKVTTPEYQGEY 240

Query: 388 TETITLELPQPVE---TNNQIPVQDN 410
            +T   E     E   + N  P  D+
Sbjct: 241 EQTGANEYDGGYEIYESENGEPRGDS 266


>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA.  Members
           of this protein share two defining regions. One is a
           histidine/serine-rich cluster, typically
           H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
           the context of a pair of radical SAM proteins, HxsB and
           HxsC, and a fourth protein HxsD. The system is predicted
           to perform peptide modifications, likely in the
           His-Xaa-Ser region, to produce some uncharacterized
           natural product.
          Length = 186

 Score = 27.9 bits (62), Expect = 8.0
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           S  T+T +   P P+ +P+  ++ +S  STT   P +
Sbjct: 80  SGDTSTYSYPVPSPSYSPSPGSSIQSLPSTTGVRPQS 116


>gnl|CDD|233534 TIGR01693, UTase_glnD, [Protein-PII] uridylyltransferase.  This
           model describes GlnD, the
           uridylyltransferase/uridylyl-removing enzyme for the
           nitrogen regulatory protein PII. Not all homologs of PII
           share the property of uridylyltransferase modification
           on the characteristic Tyr residue (see Prosite pattern
           PS00496 and document PDOC00439), but the modification
           site is preserved in the PII homolog of all species with
           a member of this family [Central intermediary
           metabolism, Nitrogen metabolism, Regulatory functions,
           Protein interactions].
          Length = 850

 Score = 28.5 bits (64), Expect = 8.5
 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 3/44 (6%)

Query: 148 VDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQHNEIENLDGAL 191
           + +   KI  FG + E    V  VT++F LKL   E + L   L
Sbjct: 805 LSIQSAKITTFGEKAE---DVFYVTDLFGLKLTDEEEQRLLEVL 845


>gnl|CDD|132138 TIGR03094, sulfo_cyanin, sulfocyanin.  Members of this family are
           blue-copper redox proteins designated sulfocyanin, from
           the archaeal genera Sulfolobus, Ferroplasma, and
           Picrophilus. The most closely related proteins
           characterized as functionally different are the
           rustacyanins [Energy metabolism, Electron transport].
          Length = 195

 Score = 27.9 bits (62), Expect = 8.5
 Identities = 12/42 (28%), Positives = 18/42 (42%)

Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
           S++TS  +T T+TT+    P  +S  T   T          P
Sbjct: 33  SSDTSAVSTGTSTTSKIYIPYNSSNKTVFLTLVVWGSGNAYP 74


>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
           factor (transactivator).  This family includes EBV BRLF1
           and similar ORF 50 proteins from other herpesviruses.
          Length = 500

 Score = 28.2 bits (63), Expect = 8.8
 Identities = 14/66 (21%), Positives = 27/66 (40%), Gaps = 2/66 (3%)

Query: 333 GETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
            ++    D    ST +    T+     PA  S +++R +T+   Q   +  Q E  + + 
Sbjct: 239 QQSESFSDVWPASTQSPREETSAEPLAPA--SPSSRRPSTAQEEQIACSSPQAEPEQGVQ 296

Query: 393 LELPQP 398
             +PQ 
Sbjct: 297 SYVPQS 302


>gnl|CDD|165473 PHA03207, PHA03207, serine/threonine kinase US3; Provisional.
          Length = 392

 Score = 28.3 bits (63), Expect = 9.3
 Identities = 16/80 (20%), Positives = 26/80 (32%)

Query: 322 METINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
           ME   +++K+    I        S T  T T+ + + T        +   +   T     
Sbjct: 1   MECHTDSSKVSDSNINLGYEAIFSLTGGTDTSDSKDTTGDKFDDCDELGDSDDVTHATDY 60

Query: 382 PIQEEYTETITLELPQPVET 401
              EE     T    +P ET
Sbjct: 61  DADEESLSPQTDVCQEPCET 80


>gnl|CDD|184042 PRK13415, PRK13415, flagella biosynthesis protein FliZ;
           Provisional.
          Length = 219

 Score = 27.8 bits (62), Expect = 9.9
 Identities = 8/27 (29%), Positives = 9/27 (33%)

Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTS 364
            PD     T T   T  T E   +  S
Sbjct: 38  HPDKCGKETPTGQQTQETAEAAASSVS 64


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.131    0.365 

Gapped
Lambda     K      H
   0.267   0.0925    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,312,326
Number of extensions: 1919894
Number of successful extensions: 3531
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2571
Number of HSP's successfully gapped: 300
Length of query: 419
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 320
Effective length of database: 6,546,556
Effective search space: 2094897920
Effective search space used: 2094897920
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 60 (26.5 bits)