RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12777
         (697 letters)



>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  158 bits (401), Expect = 3e-44
 Identities = 62/146 (42%), Positives = 85/146 (58%), Gaps = 7/146 (4%)

Query: 390 RIVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVEDVPPSDLLLRL 449
           RIVGG +A  G +PWQ+SL+        H CG +L +  W +TAAHCV    PS++ +RL
Sbjct: 1   RIVGGSEANIGSFPWQVSLQ---YGGGRHFCGGSLISPRWVLTAAHCVRGSDPSNIRVRL 57

Query: 450 GEHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYDLALLRFYEPVKFQPNIIPICVPE- 508
           G HDLS+ EE    Q  +V  V  HP ++P T++ D+ALL+  EPV    N+ PIC+P  
Sbjct: 58  GSHDLSSGEE---GQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSS 114

Query: 509 DDTNFVGTSAHVTGWGRLYEGRFRRS 534
           +     GT+  V+GWGR  EG     
Sbjct: 115 NYNVPAGTTCTVSGWGRTSEGAGSLP 140



 Score = 73.5 bits (181), Expect = 1e-14
 Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 3/50 (6%)

Query: 562 RIVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVED 611
           RIVGG +A  G +PWQ+SL+        H CG +L +  W +TAAHCV  
Sbjct: 1   RIVGGSEANIGSFPWQVSLQ---YGGGRHFCGGSLISPRWVLTAAHCVRG 47



 Score = 53.1 bits (128), Expect = 1e-07
 Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 4/59 (6%)

Query: 631 W-KMALADGPLPSVLQEVSVPVINNSLCETMYRAAGFIEHIPEIFICAGWRKGSFDSCE 688
           W + +   G LP  LQEV+VP+++N+ C    RA      I +  +CAG  +G  D+C+
Sbjct: 129 WGRTSEGAGSLPDTLQEVNVPIVSNATCR---RAYSGGGAITDNMLCAGGLEGGKDACQ 184


>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  154 bits (392), Expect = 5e-43
 Identities = 61/140 (43%), Positives = 84/140 (60%), Gaps = 6/140 (4%)

Query: 391 IVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVEDVPPSDLLLRLG 450
           IVGG +A  G +PWQ+SL+    +   H CG +L +  W +TAAHCV    PS+  +RLG
Sbjct: 1   IVGGSEAKIGSFPWQVSLQ---YTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLG 57

Query: 451 EHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYDLALLRFYEPVKFQPNIIPICVPEDD 510
            HDLS+ E   G Q  +V+ V  HP ++P T++ D+ALL+   PV    N+ PIC+P   
Sbjct: 58  SHDLSSNEG--GGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSG 115

Query: 511 TNF-VGTSAHVTGWGRLYEG 529
            N   GT+  V+GWGR  EG
Sbjct: 116 YNLPAGTTCTVSGWGRTSEG 135



 Score = 71.2 bits (175), Expect = 9e-14
 Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 563 IVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVED 611
           IVGG +A  G +PWQ+SL+    +   H CG +L +  W +TAAHCV  
Sbjct: 1   IVGGSEAKIGSFPWQVSLQ---YTGGRHFCGGSLISPRWVLTAAHCVYS 46



 Score = 50.4 bits (121), Expect = 7e-07
 Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 3/52 (5%)

Query: 637 DGPLPSVLQEVSVPVINNSLCETMYRAAGFIEHIPEIFICAGWRKGSFDSCE 688
            GPLP VLQEV+VP+++N+ C+  Y   G    I +  +CAG  +G  D+C+
Sbjct: 135 GGPLPDVLQEVNVPIVSNAECKRAYSYGGT---ITDNMLCAGGLEGGKDACQ 183


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  118 bits (297), Expect = 3e-30
 Identities = 59/154 (38%), Positives = 85/154 (55%), Gaps = 15/154 (9%)

Query: 391 IVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVEDVPPSDLLLRLG 450
           IVGG++A  G +PWQ+SL+    S+  H CG +L +ENW +TAAHCV +     + + LG
Sbjct: 1   IVGGDEAQPGSFPWQVSLQ---VSSGKHFCGGSLISENWVLTAAHCVSN--AKSVRVVLG 55

Query: 451 EHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYDLALLRFYEPVKFQPNIIPICVPEDD 510
            H++   E   G Q+  V+ V  HP ++P T   D+ALL+   PV     + PIC+P   
Sbjct: 56  AHNIVLREG--GEQKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLPTAS 112

Query: 511 TNF-VGTSAHVTGWGRLYEGRFRRSYGHPATRQE 543
           ++  VGT+  V+GWG           G P T QE
Sbjct: 113 SDLPVGTTCTVSGWGNTKTL------GLPDTLQE 140



 Score = 59.8 bits (145), Expect = 5e-10
 Identities = 25/49 (51%), Positives = 35/49 (71%), Gaps = 3/49 (6%)

Query: 563 IVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVED 611
           IVGG++A  G +PWQ+SL+    S+  H CG +L +ENW +TAAHCV +
Sbjct: 1   IVGGDEAQPGSFPWQVSLQ---VSSGKHFCGGSLISENWVLTAAHCVSN 46



 Score = 34.3 bits (79), Expect = 0.12
 Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 7/48 (14%)

Query: 640 LPSVLQEVSVPVINNSLCETMYRAAGFIEHIPEIFICAGWRKGSFDSC 687
           LP  LQEV+VPV++   C + Y        + +  ICAG   G  D+C
Sbjct: 134 LPDTLQEVTVPVVSRETCRSAYGGT-----VTDNMICAGA--GGKDAC 174


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 57.2 bits (138), Expect = 1e-08
 Identities = 32/145 (22%), Positives = 53/145 (36%), Gaps = 9/145 (6%)

Query: 388 SSRIVGGEKATFGKWPWQISLRQWIRSTYLHK-CGAALFNENWAVTAAHCVEDVPPSDLL 446
           SSRI+GG  A  G++P  ++L   I        CG +     + +TAAHC +   P    
Sbjct: 30  SSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPISSD 89

Query: 447 LRLGEHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYDLALLRFYEPVKFQPNIIPICV 506
           +     DL+   +    +   V+ +  H  + P     D+A+L        +   +P   
Sbjct: 90  VNRVVVDLNDSSQ---AERGHVRTIYVHEFYSPGNLGNDIAVLELA-----RAASLPRVK 141

Query: 507 PEDDTNFVGTSAHVTGWGRLYEGRF 531
                        VT    +  G F
Sbjct: 142 ITSFDASDTFLNSVTTVSPMTNGTF 166



 Score = 49.1 bits (117), Expect = 6e-06
 Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 560 SSRIVGGEKATFGKWPWQISLRQWIRSTYLHK-CGAALFNENWAVTAAHCVED 611
           SSRI+GG  A  G++P  ++L   I        CG +     + +TAAHC + 
Sbjct: 30  SSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADA 82


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 47.5 bits (113), Expect = 2e-05
 Identities = 34/144 (23%), Positives = 56/144 (38%), Gaps = 16/144 (11%)

Query: 222 SSSTIEEETNMIG-GGTAYGT-STTPSLVTW-----TTVDEIPVIPPDRTRPPPLVTTIK 274
           SS     +T  IG   T YG+    P+   W     T   ++ ++ P       + T+  
Sbjct: 712 SSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSG- 770

Query: 275 APPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPST 334
               T   P  S T++   +S +  S        T+ V+      +T T T+ T  SPS 
Sbjct: 771 --TFTAEVP-QSPTKTETTLSYSAYSNTSILIETTSVVI-----TKTVTQTQTTTSSPSP 822

Query: 335 VYETSSMSPSSPKPSPTTSTVSTT 358
              TS    S+   + T+ + +TT
Sbjct: 823 TQTTSPTQTSTSTTTTTSPSQTTT 846



 Score = 41.4 bits (97), Expect = 0.002
 Identities = 30/106 (28%), Positives = 44/106 (41%), Gaps = 10/106 (9%)

Query: 42  TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIE 101
           TT  V  +     + T      S +   E     P SP  + T +T+S +A+ + S  IE
Sbjct: 747 TTKRVKVRIMDPASGTGTSITTSGTFTAEV----PQSP--TKTETTLSYSAYSNTSILIE 800

Query: 102 SQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTVYETSSS 147
           +  + ++       T   TT   P  T TT  T+ S ST   TS S
Sbjct: 801 TTSVVITK----TVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPS 842



 Score = 38.7 bits (90), Expect = 0.013
 Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 1/56 (1%)

Query: 37  PIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTA 92
            I   TT+VV TK   +T TTT    P+ +T   T + + ++   SP+ +T     
Sbjct: 796 SILIETTSVVITKTVTQTQTTTSSPSPTQTTS-PTQTSTSTTTTTSPSQTTTGGGI 850



 Score = 30.2 bits (68), Expect = 5.0
 Identities = 21/90 (23%), Positives = 30/90 (33%), Gaps = 23/90 (25%)

Query: 42  TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIE 101
           T T ++   Y  T+   E T    +     +  + SSP P+ TTS   T+          
Sbjct: 783 TETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTS---------- 832

Query: 102 SQGINMSNYKEDVTTTVVTTKPYPETTITT 131
                        T+T  TT P   TT   
Sbjct: 833 -------------TSTTTTTSPSQTTTGGG 849



 Score = 29.8 bits (67), Expect = 5.6
 Identities = 22/103 (21%), Positives = 32/103 (31%), Gaps = 18/103 (17%)

Query: 238 AYGTSTTPSLVTWTTVDEIP--------VIPPDRTRPPPLVTTIKAPPSTVASPVTSSTE 289
             GTS T S      V + P              T      T++    +   +  T+S+ 
Sbjct: 761 GTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSP 820

Query: 290 SWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSP 332
                  +      P Q  T+T  TT P   + TTT      P
Sbjct: 821 -------SPTQTTSPTQTSTSTTTTTSP---SQTTTGGGICGP 853



 Score = 29.4 bits (66), Expect = 8.0
 Identities = 22/101 (21%), Positives = 35/101 (34%), Gaps = 20/101 (19%)

Query: 41  VTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEI 100
           +TT+   T   P++ T TE T    +    +  +  +S   + T +   TT         
Sbjct: 766 ITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTT-------- 817

Query: 101 ESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTV 141
                         T T   T+    +T TT  T PS +T 
Sbjct: 818 ---------SSPSPTQTTSPTQT---STSTTTTTSPSQTTT 846



 Score = 29.4 bits (66), Expect = 9.1
 Identities = 27/138 (19%), Positives = 42/138 (30%), Gaps = 29/138 (21%)

Query: 247 LVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQ 306
            + +T             + P  V   +A     +  V  + E+W  I  T         
Sbjct: 698 WLPYTNSLYKATTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMD 757

Query: 307 NV--TTTVVTTKPYPETTTTTEKTEPSPST-------------VYETSSMSP-------- 343
               T T +TT     T T      P+ +              + ET+S+          
Sbjct: 758 PASGTGTSITTS---GTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQ 814

Query: 344 ---SSPKPSPTTSTVSTT 358
              SSP P+ TTS   T+
Sbjct: 815 TTTSSPSPTQTTSPTQTS 832


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 40.1 bits (93), Expect = 0.004
 Identities = 27/94 (28%), Positives = 40/94 (42%), Gaps = 3/94 (3%)

Query: 242 STTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSF 301
           S   S  T+++     V+ P  + P   + +  A PS + SP TS   S +    TT S 
Sbjct: 576 SAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPASHLGSPSTTPSS 635

Query: 302 PPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTV 335
           P     V +T   +   PE++     TE S S V
Sbjct: 636 PESSIKVASTETAS---PESSIKVASTESSVSMV 666



 Score = 30.8 bits (69), Expect = 3.2
 Identities = 27/102 (26%), Positives = 40/102 (39%), Gaps = 6/102 (5%)

Query: 42  TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSP--TTSTVSTTAFIDESNE 99
           T +  T       TT +     SPSTV   S+  P+    SP  T S + + +    ++ 
Sbjct: 569 TNSTPTGSAASSNTTFSSD---SPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPASH 625

Query: 100 IESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTV 141
           + S     S+  E       T    PE++I    TE S S V
Sbjct: 626 LGSPSTTPSS-PESSIKVASTETASPESSIKVASTESSVSMV 666


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 39.3 bits (91), Expect = 0.005
 Identities = 45/168 (26%), Positives = 54/168 (32%), Gaps = 17/168 (10%)

Query: 192 SCCK---IAGETDVDISEPINNFIHNTNHIDEHSSSTIEEETNMIGGGTAYGTSTTPSLV 248
           SCCK    A  T    S P        +H    S  T + +    G  TA       S  
Sbjct: 163 SCCKEPKTAVTTSKTTSWP-----TEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTE 217

Query: 249 TWTTVDE-IPVIPPDRTRPPP----LVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPP 303
              T        P  +T PPP       + + PPST +     ST    Q        PP
Sbjct: 218 PVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQ--DQSTTGDGQEHTQRRKTPP 275

Query: 304 PQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPT 351
              N  +   T  P P TT   E   P+P     T S   S P  SP 
Sbjct: 276 ATSNRRSPHSTATP-PPTTKRQETGRPTPRPTATTQS-GSSPPHSSPP 321



 Score = 30.0 bits (67), Expect = 4.6
 Identities = 29/105 (27%), Positives = 37/105 (35%), Gaps = 20/105 (19%)

Query: 270 VTTIKAPPSTVASPVTSSTESWVQISL--------TTLSFPPPQQNVTTT----VVTTKP 317
           V+  K P + V    TS T SW             T  S P  Q + T T    + +T+P
Sbjct: 162 VSCCKEPKTAVT---TSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEP 218

Query: 318 YPETTTTT-----EKTEPSPSTVYETSSMSPSSPKPSPTTSTVST 357
                TTT      +TEP PS    + S        S   ST   
Sbjct: 219 VGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGD 263


>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain.  This family
           includes trypsin like peptidase domains.
          Length = 138

 Score = 37.2 bits (86), Expect = 0.007
 Identities = 28/179 (15%), Positives = 49/179 (27%), Gaps = 52/179 (29%)

Query: 426 NENWAVTAAHCVEDVPPSDLLLRLGEHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYD 485
           ++   +T AH VED   S++ + L                   ++VA+ P       + D
Sbjct: 8   SDGLILTNAHVVEDADASEIEVVL-----------PDGGRVPAEVVAADP-------DLD 49

Query: 486 LALLRFYEPVKFQPNIIPI-CVPEDDTNFVGTSAHVTGWGRLYEGRFRRSYGHPATRQEM 544
           LALL+   P+     ++     P   +  V       G G    G               
Sbjct: 50  LALLKVDGPLLPAAPLLASSAAPLGGSVVVVGGPGGIGLGASGGGGGVGGLVSG------ 103

Query: 545 ATCWNHFLGNRILFPSSRIVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAV 603
                  +  R +   +    G                   S      G  +F+ +  V
Sbjct: 104 ---SLGGVDGRYILTDADTSPG------------------SS------GGPVFDADGEV 135


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 39.2 bits (91), Expect = 0.009
 Identities = 18/102 (17%), Positives = 29/102 (28%)

Query: 251  TTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTT 310
            T         P  T  PP      AP +     +T    + +  S  +L  P    +   
Sbjct: 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA 2809

Query: 311  TVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTT 352
             V+          +     P P++   T+   P  P P    
Sbjct: 2810 AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851



 Score = 34.1 bits (78), Expect = 0.33
 Identities = 21/111 (18%), Positives = 30/111 (27%), Gaps = 6/111 (5%)

Query: 242  STTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVA-SPVTSSTESWVQISLTTLS 300
            +  P+ V        P   P    PPP      APP      P +      V        
Sbjct: 2805 ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRR 2864

Query: 301  FPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPT 351
             PP +             P          P+ S   E+ ++ P  P+  P 
Sbjct: 2865 RPPSRS--PAAKPAAPARPPVRRL---ARPAVSRSTESFALPPDQPERPPQ 2910



 Score = 34.1 bits (78), Expect = 0.35
 Identities = 19/98 (19%), Positives = 26/98 (26%), Gaps = 11/98 (11%)

Query: 260  PPDRTRPPPLVTTIKAPPSTVASP-VTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPY 318
            PP R+         + P   +A P V+ STES+          P PQ           P 
Sbjct: 2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQ----------APP 2915

Query: 319  PETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVS 356
            P          P P               P+   +   
Sbjct: 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953



 Score = 33.4 bits (76), Expect = 0.48
 Identities = 26/126 (20%), Positives = 43/126 (34%), Gaps = 16/126 (12%)

Query: 245  PSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFP-- 302
            P+     +   +P  P    +  P +    APP+  A P T    +      TT   P  
Sbjct: 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAP 2769

Query: 303  -PPQQNVTTTV-VTTKPYPETTTTTEKTEPSPS------------TVYETSSMSPSSPKP 348
             PP           T+P   + + + ++ PSP                   + SP+ P P
Sbjct: 2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829

Query: 349  SPTTST 354
             PT++ 
Sbjct: 2830 PPTSAQ 2835


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 39.0 bits (90), Expect = 0.010
 Identities = 51/285 (17%), Positives = 95/285 (33%), Gaps = 14/285 (4%)

Query: 35  TSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMS-PSSPKPSPTTSTVSTTAF 93
           TS   N T+          T+ TT+KT  +P+    T  +   ++   SP T T S    
Sbjct: 493 TSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNA 552

Query: 94  IDESNEIESQGINMSNYKEDVTTTVVTTKPYPE---TTITTEKTEPSPSTVYETSSSGRN 150
                  ES   N +        +V+T+        T  +    +P   +   ++    +
Sbjct: 553 TSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNS 612

Query: 151 IRHLPCVSRRTGETGLCMFAFSCAKANGTHLGTCIDRFYFGSCCKIAGETDVDISEPINN 210
               P ++      G  +   + +  + TH+ T       G+  +++G  +   S     
Sbjct: 613 TSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTS-RYPG 671

Query: 211 FIHNT----NHIDEHSSSTIEEETNMIGGGTAYGTSTTPSLVTWTTVDEIPVIPP----- 261
            +H T    N      S+   ++T +    +  G + + +  T  +       P      
Sbjct: 672 EVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGA 731

Query: 262 DRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQ 306
            RT P    T +    S+   P  + T   V     T+  PP Q 
Sbjct: 732 FRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPVPPTQH 776



 Score = 36.3 bits (83), Expect = 0.069
 Identities = 28/161 (17%), Positives = 49/161 (30%), Gaps = 15/161 (9%)

Query: 211 FIHNTNHIDEHSSSTIEEETNMIGGGTAYG------TSTTPSLVTWTTVDEIP-----VI 259
           F++   H++ H ++ +    ++    T         TS TP+  T +T+ E         
Sbjct: 434 FVYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTT 493

Query: 260 PPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTT-TVVTTKPY 318
                   P             SP T  T      +  T          T+    TT   
Sbjct: 494 SATPNATSPTPAV---TTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVP 550

Query: 319 PETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTA 359
             T+    +  P  +T     + +PS    + TT    T +
Sbjct: 551 NATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGS 591



 Score = 35.9 bits (82), Expect = 0.086
 Identities = 34/132 (25%), Positives = 52/132 (39%), Gaps = 6/132 (4%)

Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLS 300
             TTP+  + TT        P+ T P P+V       +T  SP T +T      S     
Sbjct: 505 AVTTPNATSPTTQKTSD--TPNATSPTPIVIG---VTTTATSPPTGTTSVPNATSPQVTE 559

Query: 301 FPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAF 360
              P  N  T VVT+ P   T+  T     + S+        PSS   +P +++ STT  
Sbjct: 560 -ESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPL 618

Query: 361 IDESNEIESQGI 372
           +  ++    + I
Sbjct: 619 LTSAHPTGGENI 630



 Score = 35.5 bits (81), Expect = 0.11
 Identities = 28/107 (26%), Positives = 46/107 (42%)

Query: 34  DTSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAF 93
           + SP+ N  T VVT+ P   T+  T     + S+        PSS   +P +++ STT  
Sbjct: 559 EESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPL 618

Query: 94  IDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPST 140
           +  ++    + I         TT V T  P P    T++ + P  S+
Sbjct: 619 LTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSS 665



 Score = 32.8 bits (74), Expect = 0.71
 Identities = 30/144 (20%), Positives = 46/144 (31%), Gaps = 7/144 (4%)

Query: 219 DEHSSSTIEEETNMIGGGTAYGTSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPS 278
           D   S         +         TTPSL   +T     V   D T   P  TT    P 
Sbjct: 427 DTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGP--TVSTADPTSGTPTGTTSSTLPE 484

Query: 279 TVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSP----ST 334
              SP + +T +    +  T +   P     TT  T+     T+ T      +       
Sbjct: 485 -DTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPP 543

Query: 335 VYETSSMSPSSPKPSPTTSTVSTT 358
              TS  + +SP+ +  +   +T 
Sbjct: 544 TGTTSVPNATSPQVTEESPVNNTN 567


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 35.9 bits (83), Expect = 0.040
 Identities = 20/80 (25%), Positives = 27/80 (33%), Gaps = 6/80 (7%)

Query: 254 DEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSS-TESWVQISLTTLSFPP-PQQNVTTT 311
             +P  P     PP    TIK  P+ V  PV S         +  TL  PP PQ      
Sbjct: 140 VLVPAAPE----PPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPPPPVV 195

Query: 312 VVTTKPYPETTTTTEKTEPS 331
              +       +  ++T  S
Sbjct: 196 PQPSTMVVHRRSRIKRTRSS 215



 Score = 32.9 bits (75), Expect = 0.43
 Identities = 17/104 (16%), Positives = 27/104 (25%), Gaps = 7/104 (6%)

Query: 261 PDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPE 320
                P P   T+  P   +        E      L   +  PP    T   +   P P 
Sbjct: 110 LGEEEPAPQEETVADPIQALQPRPRPDVEE----VLVPAAPEPPSYEET---IKPGPAPV 162

Query: 321 TTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDES 364
                      P+     +   P +P+P P      +T  +   
Sbjct: 163 EEPVDSMAIAVPAIDTPVTLELPPAPQPPPPVVPQPSTMVVHRR 206


>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
          Length = 292

 Score = 35.8 bits (82), Expect = 0.053
 Identities = 21/93 (22%), Positives = 40/93 (43%), Gaps = 10/93 (10%)

Query: 265 RPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTT 324
           R P ++ T  APP+ VA+PV S  E+    +    +   PQ +        +   E    
Sbjct: 106 RAPQVIDT--APPNDVATPVPSVPEA----TAEAAAKAGPQDDEADG----EQADEAPAH 155

Query: 325 TEKTEPSPSTVYETSSMSPSSPKPSPTTSTVST 357
             ++ P+ +    T +  P++  P  +++ V T
Sbjct: 156 NPESVPTRAARETTEANRPTATPPQSSSALVIT 188


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 36.0 bits (83), Expect = 0.072
 Identities = 18/95 (18%), Positives = 31/95 (32%), Gaps = 9/95 (9%)

Query: 41  VTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEI 100
           +          P T+T  E+T P      E+ +   +    +P  +T +  A        
Sbjct: 25  LAQAEEIVTTTPATSTEAEQTTPV-----ESDATEEADNTETPVAATTAAEAPSSSETAE 79

Query: 101 ESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTE 135
            S   + +      TTT       P  T T++  E
Sbjct: 80  TSDPTSEAT----DTTTSEARTVTPAATETSKPVE 110



 Score = 35.2 bits (81), Expect = 0.15
 Identities = 15/83 (18%), Positives = 29/83 (34%), Gaps = 7/83 (8%)

Query: 288 TESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPK 347
           ++S V ++L  L+       +          P T+T  E+T P      E+ +   +   
Sbjct: 7   SKSAVALTLALLTASN--PKLAQAEEIVTTTPATSTEAEQTTPV-----ESDATEEADNT 59

Query: 348 PSPTTSTVSTTAFIDESNEIESQ 370
            +P  +T +  A         S 
Sbjct: 60  ETPVAATTAAEAPSSSETAETSD 82



 Score = 32.5 bits (74), Expect = 0.94
 Identities = 19/100 (19%), Positives = 34/100 (34%), Gaps = 11/100 (11%)

Query: 275 APPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPST 334
           A      +P TS+         T +     ++   T             TT    PS S 
Sbjct: 28  AEEIVTTTPATSTEAE----QTTPVESDATEEADNTET-------PVAATTAAEAPSSSE 76

Query: 335 VYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINM 374
             ETS  +  +   + + +   T A  + S  +E Q +++
Sbjct: 77  TAETSDPTSEATDTTTSEARTVTPAATETSKPVEGQTVDV 116



 Score = 32.5 bits (74), Expect = 0.96
 Identities = 18/87 (20%), Positives = 30/87 (34%), Gaps = 9/87 (10%)

Query: 268 PLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEK 327
            +VTT  A  +      T+  ES           P       TT       P ++ T E 
Sbjct: 30  EIVTTTPATSTEAEQ--TTPVESDATEEADNTETPVA----ATTAAEA---PSSSETAET 80

Query: 328 TEPSPSTVYETSSMSPSSPKPSPTTST 354
           ++P+      T+S + +    +  TS 
Sbjct: 81  SDPTSEATDTTTSEARTVTPAATETSK 107



 Score = 31.0 bits (70), Expect = 2.3
 Identities = 17/73 (23%), Positives = 30/73 (41%), Gaps = 1/73 (1%)

Query: 35  TSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFI 94
           T+P+ +  T        P    TT    PS S   ETS  +  +   + + +   T A  
Sbjct: 45  TTPVESDATEEADNTETP-VAATTAAEAPSSSETAETSDPTSEATDTTTSEARTVTPAAT 103

Query: 95  DESNEIESQGINM 107
           + S  +E Q +++
Sbjct: 104 ETSKPVEGQTVDV 116


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 34.7 bits (80), Expect = 0.088
 Identities = 24/83 (28%), Positives = 32/83 (38%), Gaps = 7/83 (8%)

Query: 267 PPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTE 326
              VTT   PP    +  + S ++      TT S         +T VTT   P  T T E
Sbjct: 68  SEPVTTSATPPKLTTTSSSPSNDT------TTASTSTKTSPTVSTTVTTTTSPSETDTEE 121

Query: 327 KTEPSPSTVYETSSMSPSSPKPS 349
            T  + ST   T   S ++  PS
Sbjct: 122 AT-TTVSTETPTEGGSSAATDPS 143



 Score = 30.1 bits (68), Expect = 3.1
 Identities = 17/64 (26%), Positives = 25/64 (39%), Gaps = 7/64 (10%)

Query: 296 LTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTV 355
           +TT + PP          TT   P   TTT  T    S    T+  + +SP  + T    
Sbjct: 71  VTTSATPPKLT-------TTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEAT 123

Query: 356 STTA 359
           +T +
Sbjct: 124 TTVS 127



 Score = 29.7 bits (67), Expect = 4.4
 Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 8/71 (11%)

Query: 35  TSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFI 94
           +S   N TTT  T+     T +TT  T  SPS        + +      T ST + T   
Sbjct: 84  SSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSE-------TDTEEAT-TTVSTETPTEGG 135

Query: 95  DESNEIESQGI 105
             +    S+ +
Sbjct: 136 SSAATDPSKNL 146


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 35.3 bits (81), Expect = 0.10
 Identities = 26/118 (22%), Positives = 42/118 (35%), Gaps = 8/118 (6%)

Query: 260 PPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYP 319
           PP  T P P   T  +P  +  S V +  ++    + T  + PPP      +     PY 
Sbjct: 449 PP--TSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLS-----PYA 501

Query: 320 ETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSNY 377
                   T PSP           S+ +     ++   TA  DE +  + +   +S Y
Sbjct: 502 VYDDLKPPTSPSP-AAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPY 558



 Score = 33.0 bits (75), Expect = 0.57
 Identities = 25/111 (22%), Positives = 37/111 (33%), Gaps = 9/111 (8%)

Query: 34  DTSPIRNVTTTVVTTKPYPETTTTTEKTEP-SPSTVYETSSMSPSSPKPSPTT---STVS 89
           D  P     + V   +P        +KT P SP   YE     P+SP P+  T    +VS
Sbjct: 411 DVVPSPGSASNVPEVEPAQVEA---KKTRPLSPYARYEDLK-PPTSPSPTAPTGVSPSVS 466

Query: 90  TTAFIDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPST 140
           +T+ +    +                  +    PY         T PSP+ 
Sbjct: 467 STSSVPAVPDTAPATAATDA-AAPPPANMRPLSPYAVYDDLKPPTSPSPAA 516



 Score = 33.0 bits (75), Expect = 0.62
 Identities = 28/111 (25%), Positives = 43/111 (38%), Gaps = 19/111 (17%)

Query: 45  VVTTKPYPETTTTTEKTEP-----------SPSTVYETSSMSPSSPKPSPTTSTVSTTAF 93
           V T    PE  +   + EP           SP T YE     P+SP P+P +S+ +++  
Sbjct: 342 VPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLK-PPTSPIPTPPSSSPASSKS 400

Query: 94  IDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEP-SPSTVYE 143
           +D   +     +  S       + V   +P        +KT P SP   YE
Sbjct: 401 VDAVAKPAEPDVVPS---PGSASNVPEVEPAQV---EAKKTRPLSPYARYE 445



 Score = 32.6 bits (74), Expect = 0.76
 Identities = 27/124 (21%), Positives = 35/124 (28%), Gaps = 14/124 (11%)

Query: 242 STTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSF 301
           S T       +V     +P      P    T  A P        S    +  +   T   
Sbjct: 454 SPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPS 513

Query: 302 P--------PPQQNVTTTVVTTKPYPETTTTTEKTEP-----SPSTVYETSSMSPSSPKP 348
           P        P   N    V  + P           +P     SP T+YE     P+SP P
Sbjct: 514 PAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPYTMYEDLK-PPTSPTP 572

Query: 349 SPTT 352
           SP  
Sbjct: 573 SPVL 576



 Score = 29.5 bits (66), Expect = 7.0
 Identities = 23/91 (25%), Positives = 34/91 (37%), Gaps = 22/91 (24%)

Query: 302 PPPQQNVTTTVV----TTKPYPETTTTTEKTEP-----------SPSTVYETSSMSPSSP 346
            PP+++          T    PE  +   + EP           SP T YE     P+SP
Sbjct: 328 VPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLK-PPTSP 386

Query: 347 KPSPTTS------TVSTTAFIDESNEIESQG 371
            P+P +S      +V   A   E + + S G
Sbjct: 387 IPTPPSSSPASSKSVDAVAKPAEPDVVPSPG 417


>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
           Members of this family are the S-layer-forming
           halobacterial major cell surface glycoprotein. The
           highest scores below model cutoffs are fragmentary
           paralogs to actual members of the family. Modifications
           include at N-linked and O-linked glycosylation, a
           C-terminal diphytanylglyceryl modification, and probable
           cleavage of the PGF-CTERM tail.
          Length = 782

 Score = 34.9 bits (80), Expect = 0.19
 Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 3/65 (4%)

Query: 27  NFAMYKNDTSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTS 86
           N+ +  +D      V   VV     P+TTT+ +   P+ +T   T+    ++    PTT+
Sbjct: 689 NYTVEADDGDNTDRVNVEVVEETERPDTTTSED---PTTTTTPTTTGPEETTETAEPTTT 745

Query: 87  TVSTT 91
           T   T
Sbjct: 746 TEEPT 750



 Score = 32.9 bits (75), Expect = 0.66
 Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 3/51 (5%)

Query: 308 VTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTT 358
           V   VV     P+TTT+ +   P+ +T   T+    ++    PTT+T   T
Sbjct: 703 VNVEVVEETERPDTTTSED---PTTTTTPTTTGPEETTETAEPTTTTEEPT 750



 Score = 29.5 bits (66), Expect = 7.4
 Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 2/45 (4%)

Query: 314 TTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTT 358
           TT   P TTTT   T P  +T  ET+  + ++ +P+  T+T S+T
Sbjct: 717 TTSEDPTTTTTPTTTGPEETT--ETAEPTTTTEEPTEETTTGSST 759


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 34.3 bits (78), Expect = 0.22
 Identities = 20/75 (26%), Positives = 29/75 (38%), Gaps = 8/75 (10%)

Query: 287 STESWVQISLTTLSFPPPQQNVTTTVVTTKPYP---ETTTTTEKTEPSPSTVYETSSMSP 343
           + E     S       P Q+   +  ++  P P   ++ TTTE T P         +  P
Sbjct: 82  TNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKV-----TTPP 136

Query: 344 SSPKPSPTTSTVSTT 358
           S+  P P  ST S T
Sbjct: 137 STNTPQPMQSTKSDT 151


>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47. 
          Length = 627

 Score = 34.1 bits (77), Expect = 0.32
 Identities = 37/133 (27%), Positives = 50/133 (37%), Gaps = 12/133 (9%)

Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLS 300
           T  TP++   TT + I  I      P P+ T    P  T+    T  TE   +       
Sbjct: 285 TPNTPTITAQTT-ESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPK 343

Query: 301 FPPPQQNV-TTTVVTTKPYPETTTTTEKTEPSPSTVYET-------SSMSPSSPKPS--- 349
            PP   N   TT+   + +   T  T  T+   ST + T       +    SSPK S   
Sbjct: 344 SPPKPTNFENTTIRIPETFESATVATNATQKIESTTFATTIGIEEINDNIYSSPKNSIYL 403

Query: 350 PTTSTVSTTAFID 362
            + S  STT F D
Sbjct: 404 KSKSQQSTTKFTD 416


>gnl|CDD|233987 TIGR02733, desat_CrtD, C-3',4' desaturase CrtD.  Members of this
           family are slr1293, a carotenoid biosynthesis protein
           which was shown to be the C-3',4' desaturase (CrtD) of
           myxoxanthophyll biosynthesis in Synechocystis sp. strain
           PCC 6803, and close homologs (presumed to be
           functionally equivalent) from other cyanobacteria, where
           myxoxanthophyll biosynthesis is either known or
           expected. This enzyme can act on neurosporene and so
           presumably catalyzes the first step that is committed to
           myxoxanthophyll [Biosynthesis of cofactors, prosthetic
           groups, and carriers, Other].
          Length = 492

 Score = 33.5 bits (77), Expect = 0.37
 Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 5/40 (12%)

Query: 427 ENWAVTAAHCVEDVPPSDLLLRLGEHDLSTEEEPYGYQER 466
           E+  V A   V ++PP  LL  LG   L     P GY++R
Sbjct: 277 EDLNVKADDVVANLPPQSLLELLGPLGL-----PPGYRKR 311


>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp). 
          Length = 305

 Score = 32.8 bits (75), Expect = 0.56
 Identities = 34/124 (27%), Positives = 48/124 (38%), Gaps = 17/124 (13%)

Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLS 300
            S T    ++   D    + P+ T     V T+ +   T A P  ++      +S TT+ 
Sbjct: 1   YSVTELTFSYNLSDTT--LFPNATSKG--VKTVTSSTDTKA-PTNTTYRC---VSSTTV- 51

Query: 301 FPPPQQNVTTTVVTTKPY------PETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTST 354
            P     VT   VT + Y       +T T  E   PSP+TV   S  SP+    SP    
Sbjct: 52  -PMTNVTVTLHDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSP-SPTPVPSSPAVGN 109

Query: 355 VSTT 358
            S T
Sbjct: 110 YSVT 113


>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23.  All proteins
           in this family for which functions are known are
           components of a multiprotein complex used for targeting
           nucleotide excision repair to specific parts of the
           genome. In humans, Rad23 complexes with the XPC protein.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 378

 Score = 32.9 bits (75), Expect = 0.58
 Identities = 14/65 (21%), Positives = 24/65 (36%), Gaps = 2/65 (3%)

Query: 295 SLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTST 354
           + T    PP      T+  T  P P  +  +  +    S V E S    S+   +P + +
Sbjct: 79  TGTGKVAPPAAT--PTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPS 136

Query: 355 VSTTA 359
            S  +
Sbjct: 137 TSVPS 141



 Score = 32.9 bits (75), Expect = 0.62
 Identities = 23/91 (25%), Positives = 34/91 (37%), Gaps = 10/91 (10%)

Query: 42  TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKP--SPTTSTVSTTAFIDESNE 99
           T T     P    T+    T   P++     S +P+S     SP+  + + TA    S  
Sbjct: 79  TGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTS 138

Query: 100 IESQGINMSNYKEDVTTTVVTTKPYPETTIT 130
           + S G        D  +T+V      ETTI 
Sbjct: 139 VPSSG-------SDAASTLVVGSER-ETTIE 161



 Score = 30.2 bits (68), Expect = 4.2
 Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 2/70 (2%)

Query: 309 TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKP--SPTTSTVSTTAFIDESNE 366
           T T     P    T+    T   P++     S +P+S     SP+  + + TA    S  
Sbjct: 79  TGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTS 138

Query: 367 IESQGINMSN 376
           + S G + ++
Sbjct: 139 VPSSGSDAAS 148



 Score = 29.1 bits (65), Expect = 8.1
 Identities = 19/103 (18%), Positives = 31/103 (30%), Gaps = 19/103 (18%)

Query: 264 TRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTT 323
           ++P      +  P +T  S  T +            S P            +       +
Sbjct: 75  SKPKTGTGKVAPPAATPTSAPTPTP-----------SPPAS--------PASGMSAAPAS 115

Query: 324 TTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNE 366
             E+  PS  +   T+  SPS+  PS  +   ST     E   
Sbjct: 116 AVEEKSPSEESATATAPESPSTSVPSSGSDAASTLVVGSERET 158


>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein.  SOG2 proteins in
           Saccharomyces cerevisiae are involved in cell separation
           and cytokinesis.
          Length = 419

 Score = 32.8 bits (75), Expect = 0.61
 Identities = 28/144 (19%), Positives = 45/144 (31%), Gaps = 14/144 (9%)

Query: 222 SSSTIEEETNMIGGGTAYGTSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVA 281
           SS     +       TA  +S        +       + P R R P       + P+T+ 
Sbjct: 146 SSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTTLE 205

Query: 282 SPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSM 341
           SP      S +Q++      PPP  N T+   T       +        S +T     S 
Sbjct: 206 SP------SNLQVTTDV---PPPYSNGTSRSSTMSSSANLSI-----ISSLATPRSGESF 251

Query: 342 SPSSPKPSPTTSTVSTTAFIDESN 365
             +    S + + VS     +E  
Sbjct: 252 RSTPTSGSSSINPVSGLDEAEEDR 275


>gnl|CDD|225828 COG3291, COG3291, FOG: PKD repeat [General function prediction
           only].
          Length = 297

 Score = 32.2 bits (73), Expect = 0.85
 Identities = 20/126 (15%), Positives = 29/126 (23%), Gaps = 6/126 (4%)

Query: 27  NFAMYKNDTSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTS 86
           +F      TS  +N   T      Y  T T +  T    S                    
Sbjct: 114 DFGDGGVTTSTEQNPVHTYTDAGTYTVTLTVSNSTGSD-SKTKTDYVTVSEEGIEEAVPE 172

Query: 87  TVSTTAFIDESNEIESQG-----INMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTV 141
             ST      +              +  +++D  T      P     I      P P TV
Sbjct: 173 AASTVVTKPLTVSGTESSSGNLSSWVYVFEDDKGTNSTVKTPLLGGVIKVTLGSPLPDTV 232

Query: 142 YETSSS 147
              +  
Sbjct: 233 VYPTDK 238


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 31.9 bits (73), Expect = 1.2
 Identities = 18/93 (19%), Positives = 31/93 (33%), Gaps = 7/93 (7%)

Query: 274 KAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPS 333
            +  +    P   S              PP  Q   T    T P P  T T   +     
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPT-------PPLPQPTATAPPPTPPPPPPTATQASSNAPAQ 569

Query: 334 TVYETSSMSPSSPKPSPTTSTVSTTAFIDESNE 366
              ++S   P   +P+P+ +  S+   ID++ +
Sbjct: 570 IPADSSPPPPIPEEPTPSPTKDSSPEEIDKAAK 602



 Score = 29.2 bits (66), Expect = 9.6
 Identities = 11/85 (12%), Positives = 25/85 (29%), Gaps = 4/85 (4%)

Query: 281 ASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPE--TTTTTEKTEPSPSTVYET 338
            S   S   +    +  T +  P           + P  +   TT +     +   +   
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQ--PSAPKTKQAATTPSPPPAKASPPIPVP 418

Query: 339 SSMSPSSPKPSPTTSTVSTTAFIDE 363
           +  +  SP P    +    +  ++E
Sbjct: 419 AEPTEPSPTPPANAANAPPSLNLEE 443


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.5 bits (71), Expect = 2.1
 Identities = 19/85 (22%), Positives = 32/85 (37%), Gaps = 3/85 (3%)

Query: 271 TTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEP 330
             +  P  TV     +++E  V IS+ T S P   +  T       P   T+    +   
Sbjct: 304 NNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRI-- 361

Query: 331 SPSTVYETSSMSPSSPKPSPTTSTV 355
           + +T       +PS+   +P T  V
Sbjct: 362 ASATFRGLEK-NPSTAPSTPATPRV 385


>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 226

 Score = 30.6 bits (69), Expect = 2.4
 Identities = 19/91 (20%), Positives = 27/91 (29%), Gaps = 12/91 (13%)

Query: 255 EIPVIPPDRTRPPPLVTTIKA---------PPSTVASPVTSSTESWVQISLTT--LSFPP 303
            + V+      P  +   I+           P  VA P   ST + V ++  T     PP
Sbjct: 60  VVQVVALPTQPPEGVAQEIQDAGDAAAASVDPQPVAQPPVESTPAGVPVAAQTPKPVKPP 119

Query: 304 PQQNVTTTVVTTKPYPETTTTTE-KTEPSPS 333
            Q           P PE     E    P+  
Sbjct: 120 KQPPAGAVPAKPTPKPEPKPVAEPAAAPTGQ 150


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 30.9 bits (70), Expect = 2.8
 Identities = 12/86 (13%), Positives = 25/86 (29%), Gaps = 7/86 (8%)

Query: 271 TTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEP 330
             +   P+   +  T++  S V+ +    +   P+      +   +P  ET T       
Sbjct: 358 ALLVPVPAPQPAKPTAAAPSPVRPTPAPST--RPKAAAAANIPPKEPVRETAT---PPPV 412

Query: 331 SPSTVYETSSMSPSSPKPSPTTSTVS 356
            P  V     +  +       T    
Sbjct: 413 PPRPV--APPVPHTPESAPKLTRAAI 436



 Score = 29.4 bits (66), Expect = 7.4
 Identities = 13/89 (14%), Positives = 17/89 (19%), Gaps = 13/89 (14%)

Query: 242 STTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSF 301
           + +P   T                P   V     PP     PV                 
Sbjct: 375 APSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPV------------- 421

Query: 302 PPPQQNVTTTVVTTKPYPETTTTTEKTEP 330
           P   ++         P  E    T    P
Sbjct: 422 PHTPESAPKLTRAAIPVDEKPKYTPPAPP 450


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 30.8 bits (69), Expect = 2.8
 Identities = 39/220 (17%), Positives = 73/220 (33%), Gaps = 22/220 (10%)

Query: 161 TGETGLCMFAFSCAKANGTHLGTCIDRFYFGSCCKIAGETDV------DISEPINNFI-- 212
            GE G   +  S  +AN +           G   ++ G             +P+N  I  
Sbjct: 202 RGERGYSNWELSADRANASRRELLAGGLDEGKILRVVGMASTMRLKEQASDDPVNRRISI 261

Query: 213 --------HNTNHID-EHSSSTIEEETNMIGGGTAYGTSTTPSLVTWTTVDEIPVIPPDR 263
                   H+  H + ++ +  IE+ T +    T           + + V +   I P  
Sbjct: 262 LVLNKQSQHDIEHENLDNRALDIEKATGLKQIDTHGTVPVAAVTPS-SAVTQSSAITPSS 320

Query: 264 TRPPPLVTTIKAPPSTVASPVTSSTE--SWVQISLTTLSFPPPQQNVTTTVVTTKPYPET 321
              P       +  +  A+   +S    S   +  + ++ P          V  +P P +
Sbjct: 321 AAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMS 380

Query: 322 TTTTEK--TEPSPSTVYETSSMSPSSPKPSPTTSTVSTTA 359
           TT T++  T    ST    ++  P++P  +   S  S  A
Sbjct: 381 TTETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTSRDA 420


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
           Provisional.
          Length = 417

 Score = 30.4 bits (68), Expect = 3.2
 Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 278 STVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYE 337
           +T++S + SS+     +S TT        + + ++++T      T       PSP++   
Sbjct: 6   TTLSSIILSSS----TLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSS 61

Query: 338 TSSMSPSS 345
           +SS S SS
Sbjct: 62  SSSSSSSS 69



 Score = 29.2 bits (65), Expect = 9.1
 Identities = 24/99 (24%), Positives = 42/99 (42%), Gaps = 13/99 (13%)

Query: 278 STVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYE 337
           ST  S +  S+ +   +S TT        + + ++++T  + E   T   T  SPS    
Sbjct: 5   STTLSSIILSSST---LSPTTFFTIETSMDESKSIIST--FTEIIPTEIPTSESPS---- 55

Query: 338 TSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSN 376
                 S+   S ++S+ S T   +  N I  + IN+ N
Sbjct: 56  ----PNSNSSSSSSSSSSSITYDYEYENNITYELINIKN 90


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 30.4 bits (69), Expect = 3.3
 Identities = 15/90 (16%), Positives = 30/90 (33%)

Query: 268 PLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEK 327
           P   +   PPS+     +  +      S +  S PP   +  +      P   T  +   
Sbjct: 179 PASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPP 238

Query: 328 TEPSPSTVYETSSMSPSSPKPSPTTSTVST 357
              +P           S+ KP+P +++ + 
Sbjct: 239 GPAAPPPPPVQQVPPLSTAKPTPPSASATP 268


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 29.9 bits (66), Expect = 3.5
 Identities = 30/130 (23%), Positives = 50/130 (38%), Gaps = 5/130 (3%)

Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVT-----SSTESWVQIS 295
           TS+  S  +   V     +           T      +T ++P+T     S+  + V  +
Sbjct: 25  TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST 84

Query: 296 LTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTV 355
            TT++  P   N +T  VTTK   +  T TE    + + V    +   SS   + T  T 
Sbjct: 85  GTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITN 144

Query: 356 STTAFIDESN 365
           +TT     S+
Sbjct: 145 ATTLAPTLSS 154


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 30.1 bits (68), Expect = 4.4
 Identities = 22/113 (19%), Positives = 32/113 (28%), Gaps = 5/113 (4%)

Query: 268 PLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEK 327
           P+ T   A P   A+   S + S      +  + P   Q+ T    T             
Sbjct: 381 PVFTQPAAAPQPSAAAAASPSPS----QSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVP 436

Query: 328 TEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSNYKEV 380
             P PST  +    +    +     S VS+          E       N KE 
Sbjct: 437 VNP-PSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEA 488


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 30.0 bits (68), Expect = 4.9
 Identities = 11/126 (8%), Positives = 32/126 (25%), Gaps = 10/126 (7%)

Query: 241  TSTTPSLVTWTTVDEIPVI----------PPDRTRPPPLVTTIKAPPSTVASPVTSSTES 290
              + P +     V E PV+            + T P  +   +   P  +     +  + 
Sbjct: 882  VVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTEQPQVITESDVAVAQE 941

Query: 291  WVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSP 350
              + +   +       ++     T +          +              + ++ +P  
Sbjct: 942  VAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEV 1001

Query: 351  TTSTVS 356
              + V 
Sbjct: 1002 APAQVP 1007


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 30.1 bits (68), Expect = 4.9
 Identities = 18/106 (16%), Positives = 32/106 (30%), Gaps = 11/106 (10%)

Query: 251 TTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTT 310
                 P   P      P      APP+ VA+P  ++  +    +   ++  P       
Sbjct: 397 AAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQ-- 454

Query: 311 TVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVS 356
                   PET     +  P P+     S+    +  P+    T +
Sbjct: 455 ------AAPETVAIPVRVAPEPAV---ASAAPAPAAAPAAARLTPT 491


>gnl|CDD|235055 PRK02625, rpoC1, DNA-directed RNA polymerase subunit gamma;
           Provisional.
          Length = 627

 Score = 30.1 bits (68), Expect = 5.4
 Identities = 13/23 (56%), Positives = 14/23 (60%), Gaps = 1/23 (4%)

Query: 244 TPSLVTWTTVDEIPVIPPDRTRP 266
           T S   W  +D IPVIPPD  RP
Sbjct: 238 TGSRPEWMVLDVIPVIPPD-LRP 259


>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
          Length = 361

 Score = 29.6 bits (67), Expect = 6.2
 Identities = 17/90 (18%), Positives = 35/90 (38%), Gaps = 2/90 (2%)

Query: 272 TIKAPPSTVASPVTSSTESWVQ-ISLTTL-SFPPPQQNVTTTVVTTKPYPETTTTTEKTE 329
            +     T +     + +  V  +S +TL S  P    VT++     P        E +E
Sbjct: 199 DLSGGMGTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSE 258

Query: 330 PSPSTVYETSSMSPSSPKPSPTTSTVSTTA 359
           P+P+      + +P++  P    ++ S + 
Sbjct: 259 PTPTAPSSAPATAPAAAAPQAAATSSSASG 288


>gnl|CDD|131440 TIGR02387, rpoC1_cyan, DNA-directed RNA polymerase, gamma subunit. 
           The RNA polymerase gamma subunit, encoded by the rpoC1
           gene, is found in cyanobacteria and corresponds to the
           N-terminal region the beta' subunit, encoded by rpoC, in
           other bacteria. The equivalent subunit in plastids and
           chloroplasts is designated beta', while the product of
           the rpoC2 gene is designated beta''.
          Length = 619

 Score = 29.6 bits (66), Expect = 6.2
 Identities = 13/23 (56%), Positives = 14/23 (60%), Gaps = 1/23 (4%)

Query: 244 TPSLVTWTTVDEIPVIPPDRTRP 266
           T S   W  +D IPVIPPD  RP
Sbjct: 231 TSSRPEWMVLDVIPVIPPD-LRP 252


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 29.8 bits (67), Expect = 6.8
 Identities = 20/110 (18%), Positives = 38/110 (34%), Gaps = 6/110 (5%)

Query: 32  KNDTSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTT 91
           +    P  N TTT  TT     TTT   +T    S      S+   +P+     +     
Sbjct: 247 QPSKRP-NNTTTTTTTTT----TTTFQPRTRKRKSIDDHKLSL-NQAPEKFKNNTKPDDD 300

Query: 92  AFIDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTV 141
              D S++   +  ++ + + D  +  V+      +   TE      + +
Sbjct: 301 PQSDFSDKGSRKSGSLKDVRIDNISCSVSHNGVNISISPTETIISVSNKI 350


>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
           Provisional.
          Length = 408

 Score = 29.4 bits (66), Expect = 6.9
 Identities = 17/106 (16%), Positives = 28/106 (26%), Gaps = 11/106 (10%)

Query: 260 PPDRTRPPPLVTTIKAPPS----TVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTT 315
                  P  V   + P        ++P   +           L+              T
Sbjct: 156 TTPVADAPSTVLPAEKPTLLTKDMPSAPQDETHTLSSDEHEKGLTSAQLTTAQPDDAPGT 215

Query: 316 KPYPETTTTTEKTE-------PSPSTVYETSSMSPSSPKPSPTTST 354
              P T    E          PSP T   + +++P   +P PT + 
Sbjct: 216 PAQPLTPLAAEAQAKAEVISTPSPVTAAASPTITPHQTQPLPTAAA 261


>gnl|CDD|217330 pfam03035, RNA_capsid, Calicivirus putative RNA polymerase/capsid
           protein. 
          Length = 226

 Score = 28.9 bits (65), Expect = 7.6
 Identities = 22/106 (20%), Positives = 37/106 (34%), Gaps = 9/106 (8%)

Query: 52  PETTTTTEKTEPSPSTVY--ETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSN 109
                    +  S S+V    T S   SS   S ++++  T      S+ + SQ  N+  
Sbjct: 122 SSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSASSRT------SSWVRSQNSNLEP 175

Query: 110 YKEDVTTTVVTTKPYPETTITTEKTEPSPSTVYETSSSGRNIRHLP 155
           +      T   T P   T  ++      P  V ++ +   N R  P
Sbjct: 176 FMPGALQTAWVTPP-SSTASSSGTVSTVPKGVLDSWTPAFNTRRQP 220


>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein.  This family of
           trypanosomal proteins resemble vertebrate mucins. The
           protein consists of three regions. The N and C terminii
           are conserved between all members of the family, whereas
           the central region is not well conserved and contains a
           large number of threonine residues which can be
           glycosylated. Indirect evidence suggested that these
           genes might encode the core protein of parasite mucins,
           glycoproteins that were proposed to be involved in the
           interaction with, and invasion of, mammalian host cells.
           This family contains an N-terminal signal peptide.
          Length = 143

 Score = 28.3 bits (62), Expect = 7.7
 Identities = 16/74 (21%), Positives = 27/74 (36%)

Query: 276 PPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTV 335
             +T   P T++T +    ++TT +        TTT  TT     +  TT    P+ +  
Sbjct: 48  TTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDT 107

Query: 336 YETSSMSPSSPKPS 349
              SS+        
Sbjct: 108 RAPSSIREIDGSLG 121


>gnl|CDD|114140 pfam05399, EVI2A, Ectropic viral integration site 2A protein
           (EVI2A).  This family contains several mammalian
           ectropic viral integration site 2A (EVI2A) proteins. The
           function of this protein is unknown although it is
           thought to be a membrane protein and may function as an
           oncogene in retrovirus induced myeloid tumours.
          Length = 227

 Score = 28.9 bits (64), Expect = 7.8
 Identities = 28/99 (28%), Positives = 35/99 (35%), Gaps = 14/99 (14%)

Query: 286 SSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETS---SMS 342
            S E W  I     S   P+ N T           T T   +T   P  +Y TS     +
Sbjct: 34  VSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEPQTST-PVPLYSTSGSNFFT 92

Query: 343 PSSPKPSPTTSTVSTTAFIDESNEIESQGINMSNYKEVC 381
           PSS + SP T            N  +S+G      KEVC
Sbjct: 93  PSSAQNSPDTGGP--------GNTSKSKGETFK--KEVC 121


>gnl|CDD|115650 pfam07010, Endomucin, Endomucin.  This family consists of several
           mammalian endomucin proteins. Endomucin is an early
           endothelial-specific antigen that is also expressed on
           putative hematopoietic progenitor cells.
          Length = 259

 Score = 28.9 bits (64), Expect = 8.2
 Identities = 34/132 (25%), Positives = 55/132 (41%), Gaps = 17/132 (12%)

Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLS 300
           ++T  S+ T  TV    V  P    PP   TT +   +++ S  TS T    ++  TT  
Sbjct: 35  STTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELLKTSLMSTATSLTTPKHELKTTTTG 94

Query: 301 F-------------PPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPK 347
                              N  +T+ +++   E  ++   TE SP++V +      +SPK
Sbjct: 95  VRKNESSTSKVTVTNVTLSNAVSTLQSSQNKTENQSSIRTTEISPTSVLQPD----ASPK 150

Query: 348 PSPTTSTVSTTA 359
            + TTS   TTA
Sbjct: 151 KTGTTSASLTTA 162


>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
          Length = 226

 Score = 28.8 bits (65), Expect = 8.4
 Identities = 21/110 (19%), Positives = 32/110 (29%), Gaps = 12/110 (10%)

Query: 256 IPVIPP--DRTRP---PPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTT 310
           IP++P   DR  P   P     +   P   A+    + ++           PP       
Sbjct: 41  IPLVPKPGDRDEPDMMPAATQALPTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPE 100

Query: 311 TVVTTKPYPE-TTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTA 359
                 P P+       K +P           +P +PKP P        A
Sbjct: 101 PAPVEPPKPKPVEKPKPKPKPQQKV------EAPPAPKPEPKPVVEEKAA 144


>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
          Length = 943

 Score = 29.3 bits (65), Expect = 8.8
 Identities = 21/127 (16%), Positives = 37/127 (29%), Gaps = 5/127 (3%)

Query: 15  PKHAFCDEERERNFAMYKNDTSPIRNVTTTVVTTKPYPETTTTTEKTEP-----SPSTVY 69
           PK  F  + +E+ +  Y +  +  +   TTVV  + +      T    P     +P  + 
Sbjct: 663 PKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLP 722

Query: 70  ETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTI 129
                    P              I+     E +         D     +  + + E  I
Sbjct: 723 PKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDI 782

Query: 130 TTEKTEP 136
             E  EP
Sbjct: 783 HAETGEP 789


>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
          Length = 256

 Score = 28.9 bits (65), Expect = 8.9
 Identities = 11/60 (18%), Positives = 18/60 (30%), Gaps = 2/60 (3%)

Query: 292 VQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPT 351
           VQI        PP            P  +TT   +   P+P      + ++ +   P   
Sbjct: 119 VQIQQQAQQQQPPATTAQP--QPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVE 176


>gnl|CDD|189554 pfam00454, PI3_PI4_kinase, Phosphatidylinositol 3- and 4-kinase.
           Some members of this family probably do not have lipid
           kinase activity and are protein kinases, .
          Length = 233

 Score = 28.5 bits (64), Expect = 9.4
 Identities = 28/99 (28%), Positives = 38/99 (38%), Gaps = 7/99 (7%)

Query: 378 KEVCGRRLFPSSRIVGGEKATFGKW-PWQISLRQWIRSTYLHKCGAALFNENWAVTAAHC 436
            E   RRL     I  G  +   +W P   +L + I  TY+ K G  LFN +  V     
Sbjct: 31  GEGLDRRLAAYLVIPLGPGSGLIEWVPNSTTLAE-IPRTYMVKKGIPLFNYSRKVLVFES 89

Query: 437 VEDVPPSDLLLR-LGEHDLSTEEEPYGYQERRVQIVASH 474
              + P   LL+   +H    EE    + E R   V S 
Sbjct: 90  RTALFPKVGLLQWFVKHFPDAEE----WGEARKNFVRSC 124


>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
          Length = 278

 Score = 28.7 bits (64), Expect = 9.8
 Identities = 13/30 (43%), Positives = 16/30 (53%), Gaps = 1/30 (3%)

Query: 306 QNVTTTVVTTKPYPETTTTTEK-TEPSPST 334
           Q+VT T  T      TTTT+   T PS S+
Sbjct: 247 QHVTNTTTTHLATTTTTTTSPSFTIPSNSS 276


>gnl|CDD|140276 PTZ00249, PTZ00249, variable surface protein Vir28; Provisional.
          Length = 516

 Score = 28.8 bits (64), Expect = 10.0
 Identities = 22/106 (20%), Positives = 31/106 (29%), Gaps = 24/106 (22%)

Query: 260 PPDRTRPPPLVTTIKAPPSTV--ASPVTSSTESWVQISLT---------------TLSFP 302
            P  T  PPLVT    P S     +PV + T + V    +               T S P
Sbjct: 275 HPGETHTPPLVT---VPSSKAHDRNPVQTPTPTSVSGYSSQAKGLEKQAGGESERTSSVP 331

Query: 303 PPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKP 348
             Q  +   V+          ++E  E    T           P+ 
Sbjct: 332 SEQFPLPLPVLLPLGQSGPLESSESEE----TDEYAGPKGLPEPEL 373


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.130    0.404 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 34,968,328
Number of extensions: 3293296
Number of successful extensions: 4356
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3848
Number of HSP's successfully gapped: 219
Length of query: 697
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 593
Effective length of database: 6,324,786
Effective search space: 3750598098
Effective search space used: 3750598098
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (27.6 bits)