RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7302
         (741 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  266 bits (683), Expect = 2e-84
 Identities = 108/233 (46%), Positives = 140/233 (60%), Gaps = 9/233 (3%)

Query: 448 VVGGEDADPAEWCWQVALINSLNQYLCGGALIGTQWVLTAAHCVTNIVRSGDAVYVRVGD 507
           +VGG +A    + WQV+L  +  ++ CGG+LI  +WVLTAAHCV +   S     VR+G 
Sbjct: 1   IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYSSAPS--NYTVRLGS 58

Query: 508 HDLTRKYGSPGAQTLRVATTYIHHNHNSQTLDNDIALLKLHGQAELKDGVCLVCLPARGV 567
           HDL+      G Q ++V    +H N+N  T DNDIALLKL     L D V  +CLP+ G 
Sbjct: 59  HDLSS--NEGGGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGY 116

Query: 568 NHAAGKRCTVTGYGYMGEAGPIPLRVREAEIPIVSDTECIRKINAVTEKIFILPASSFCA 627
           N  AG  CTV+G+G   E GP+P  ++E  +PIVS+ EC R           +  +  CA
Sbjct: 117 NLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRA----YSYGGTITDNMLCA 172

Query: 628 GGEEGN-DACQGDGGGPLVCQDDGFYELAGLVSWGFGCGRQDVPGVYVKVSSF 679
           GG EG  DACQGD GGPLVC D+G   L G+VSWG GC R + PGVY +VSS+
Sbjct: 173 GGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSY 225


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  251 bits (643), Expect = 1e-78
 Identities = 106/235 (45%), Positives = 137/235 (58%), Gaps = 12/235 (5%)

Query: 447 RVVGGEDADPAEWCWQVALINSLNQYLCGGALIGTQWVLTAAHCVTNIVRSGDAVYVRVG 506
           R+VGG +A+   + WQV+L     ++ CGG+LI  +WVLTAAHCV     S   + VR+G
Sbjct: 1   RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCVRGSDPS--NIRVRLG 58

Query: 507 DHDLTRKYGSPGAQTLRVATTYIHHNHNSQTLDNDIALLKLHGQAELKDGVCLVCLPARG 566
            HDL+        Q ++V+   IH N+N  T DNDIALLKL     L D V  +CLP+  
Sbjct: 59  SHDLSS---GEEGQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSN 115

Query: 567 VNHAAGKRCTVTGYGYMGE-AGPIPLRVREAEIPIVSDTECIRKINAVTEKIFILPASSF 625
            N  AG  CTV+G+G   E AG +P  ++E  +PIVS+  C R           +  +  
Sbjct: 116 YNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRRA----YSGGGAITDNML 171

Query: 626 CAGGEE-GNDACQGDGGGPLVCQDDGFYELAGLVSWGFGCGRQDVPGVYVKVSSF 679
           CAGG E G DACQGD GGPLVC D  +  L G+VSWG GC R   PGVY +VSS+
Sbjct: 172 CAGGLEGGKDACQGDSGGPLVCNDGRWV-LVGIVSWGSGCARPGKPGVYTRVSSY 225


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  187 bits (477), Expect = 6e-55
 Identities = 98/233 (42%), Positives = 126/233 (54%), Gaps = 18/233 (7%)

Query: 448 VVGGEDADPAEWCWQVALINSLNQYLCGGALIGTQWVLTAAHCVTNIVRSGDAVYVRVGD 507
           +VGG++A P  + WQV+L  S  ++ CGG+LI   WVLTAAHCV+N      +V V +G 
Sbjct: 1   IVGGDEAQPGSFPWQVSLQVSSGKHFCGGSLISENWVLTAAHCVSN----AKSVRVVLGA 56

Query: 508 HDLTRKYGSPGAQTLRVATTYIHHNHNSQTLDNDIALLKLHGQAELKDGVCLVCLPARGV 567
           H++  + G  G Q   V    +H N+N  T  NDIALLKL     L D V  +CLP    
Sbjct: 57  HNIVLREG--GEQKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLPTASS 113

Query: 568 NHAAGKRCTVTGYGYMGEAGPIPLRVREAEIPIVSDTECIRKINAVTEKIFILPASSFCA 627
           +   G  CTV+G+G     G  P  ++E  +P+VS   C            I      CA
Sbjct: 114 DLPVGTTCTVSGWGNTKTLGL-PDTLQEVTVPVVSRETCRSAYGGTVTDNMI------CA 166

Query: 628 GGEEGNDACQGDGGGPLVCQDDGFYELAGLVSWGFGCGRQDVPGVYVKVSSFN 680
           G   G DACQGD GGPLVC D    EL G+VSWG+GC   + PGVY  VSS+ 
Sbjct: 167 GA-GGKDACQGDSGGPLVCSDG---ELIGIVSWGYGCASGNYPGVYTPVSSYL 215


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score =  111 bits (278), Expect = 4e-26
 Identities = 77/265 (29%), Positives = 115/265 (43%), Gaps = 38/265 (14%)

Query: 439 GTARERGARVVGGEDADPAEWCWQVALINSLNQY----LCGGALIGTQWVLTAAHCVT-N 493
            TA E  +R++GG +A+  E+   VAL++ ++ Y     CGG+ +G ++VLTAAHC   +
Sbjct: 24  QTADEVSSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADAS 83

Query: 494 IVRSGDAVYVRVGDHDLTRKYGSPGAQTLRVATTYIHHNHNSQTLDNDIALLKLHGQAEL 553
              S D   V V  +D ++      A+   V T Y+H  ++   L NDIA+L+L   A L
Sbjct: 84  SPISSDVNRVVVDLNDSSQ------AERGHVRTIYVHEFYSPGNLGNDIAVLELARAASL 137

Query: 554 KDGVCLVCLPARGVNHAA---GKRCTVTGYGYMGEAG-------------PIPLRVREAE 597
                    P   +           +VT    M                 P    + E  
Sbjct: 138 ---------PRVKITSFDASDTFLNSVTTVSPMTNGTFGVTTPSDVPRSSPKGTILHEVA 188

Query: 598 IPIVSDTECIRKINAVTEKIFILPASSFCAGGEEGNDACQGDGGGPLVCQDDGFYELAGL 657
           +  V  + C +              + FCAG     DACQGD GGP+  + +      G+
Sbjct: 189 VLFVPLSTCAQYKGCANASDGATGLTGFCAG-RPPKDACQGDSGGPIFHKGEEGRVQRGV 247

Query: 658 VSWGFG-CGRQDVPGVYVKVSSFNG 681
           VSWG G CG   +PGVY  VS++  
Sbjct: 248 VSWGDGGCGGTLIPGVYTNVSNYQD 272


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 44.2 bits (104), Expect = 3e-04
 Identities = 51/263 (19%), Positives = 69/263 (26%), Gaps = 45/263 (17%)

Query: 28   PPNLVIMERNGTKPNSGSQSSSTTPYVSSTSAV--------TQTTTSRPQQPAGSKPCHG 79
            P  LV        P +  Q+S   P   +  AV             +RP   AG      
Sbjct: 2712 PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP 2771

Query: 80   LLDTITSTAESKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGST 139
                                A+A+L  +        PSP        PA+        + 
Sbjct: 2772 PAAPAAGPP-----RRLTRPAVASLSESRE----SLPSPW------DPADPPAAVLAPAA 2816

Query: 140  TSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKIDTGGNNNCLGVTTTNKP 199
                  +SP+   PP           P  P     P G S  +  GG+          +P
Sbjct: 2817 ALPP-AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS--VAPGGD-------VRRRP 2866

Query: 200  PSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPRPT 259
            PS  P+  P  P             RP V        SR +         P   P+P+  
Sbjct: 2867 PSRSPAAKPAAP------------ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAP 2914

Query: 260  QPPTTTTVATTTAVPDDRPECPG 282
             PP           P   P  P 
Sbjct: 2915 PPPQPQPQPPPPPQPQPPPPPPP 2937



 Score = 36.1 bits (83), Expect = 0.098
 Identities = 32/192 (16%), Positives = 51/192 (26%), Gaps = 19/192 (9%)

Query: 111 DDVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTS---KPYKKPQ 167
                P P +    +        A G      A   + +   P  ++  +    P   P 
Sbjct: 252 IAAPAPPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPA 311

Query: 168 NPEKRPQPPGASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPK------CPGFCLLNIMAA 221
            P+  P  P    + +   +     V+   +P    P   PK       P   L ++ A 
Sbjct: 312 PPDPPPPAPAGDAEEEDDEDGAMEVVSPLPRPRQHYPLGFPKRRRPTWTPPSSLEDLSAG 371

Query: 222 FCERPSVLIPQT---------STCSRGSMCCDNTRLPPPT-KPKPRPTQPPTTTTVATTT 271
                   +P           +  +RG    D TR   P     P P   P   +     
Sbjct: 372 RHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPP 431

Query: 272 AVPDDRPECPGS 283
           A P    E    
Sbjct: 432 ATPLPSAEPGSD 443



 Score = 34.5 bits (79), Expect = 0.24
 Identities = 36/246 (14%), Positives = 59/246 (23%), Gaps = 29/246 (11%)

Query: 41   PNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAESKDCPGTCVHA 100
            P +G     T P V+S S   ++  S                 +   A     P   +  
Sbjct: 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS----PAGPLPP 2830

Query: 101  IATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTP------- 153
              +              P                A G        S      P       
Sbjct: 2831 PTSA--------QPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPP 2882

Query: 154  ------PTLYTTSKPYKKPQNPEKRPQPPGASNKIDTGGNNNCLGVTTTNKPPSSQPSNL 207
                  P +  +++ +  P +  +RP  P A                   +PP   P   
Sbjct: 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ---PPPPPQPQPPPPPPP-R 2938

Query: 208  PKCPGFCLLNIMAAFCERPSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTV 267
            P+ P     +   A     +V  P       G +     R+P P   +  P       T 
Sbjct: 2939 PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTG 2998

Query: 268  ATTTAV 273
             + + V
Sbjct: 2999 HSLSRV 3004



 Score = 33.8 bits (77), Expect = 0.42
 Identities = 26/170 (15%), Positives = 43/170 (25%), Gaps = 20/170 (11%)

Query: 116  PSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQP 175
            P P       +  +   P             +P   + P          +  +P +RP+ 
Sbjct: 2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR 2685

Query: 176  PGASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTST 235
              A   + +        +T+   PP   P+  P           A     P    P  + 
Sbjct: 2686 RAARPTVGS--------LTSLADPPPPPPTPEPAPH--------ALVSATPLPPGPAAAR 2729

Query: 236  CSRGSMCCDNTRLPPPTKP----KPRPTQPPTTTTVATTTAVPDDRPECP 281
             +  ++         P  P     P     P TT      A P      P
Sbjct: 2730 QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779



 Score = 32.2 bits (73), Expect = 1.3
 Identities = 42/245 (17%), Positives = 55/245 (22%), Gaps = 32/245 (13%)

Query: 35   ERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAESKD-- 92
            ER    P  G  S          +A   +   RP++ A       L              
Sbjct: 2650 ERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPE 2709

Query: 93   -CPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTT 151
              P   V A                 P+       PA   TP           T+ P   
Sbjct: 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAP 2769

Query: 152  TPPTLYTTSKPYKKPQNP-----EKRPQPPGASNKIDTGG---------NNNCLGVTTTN 197
             PP       P +  +       E R   P   +  D                       
Sbjct: 2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829

Query: 198  KPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPR 257
             P S+QP+  P  PG                  P        +   D  R PP   P  +
Sbjct: 2830 PPTSAQPTAPPPPPGP---------------PPPSLPLGGSVAPGGDVRRRPPSRSPAAK 2874

Query: 258  PTQPP 262
            P  P 
Sbjct: 2875 PAAPA 2879



 Score = 31.8 bits (72), Expect = 1.7
 Identities = 25/164 (15%), Positives = 40/164 (24%), Gaps = 27/164 (16%)

Query: 125  ESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQ-------NPEKRPQPPG 177
            ++P     P A      +    +P +  PP  +    P   P               PP 
Sbjct: 2590 DAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPP 2649

Query: 178  ASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCS 237
               + D                 ++Q S+ P+ P             R     P   + +
Sbjct: 2650 ERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP-------------RRRAARPTVGSLT 2696

Query: 238  RGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECP 281
                   +   PPP  P P P      +           R   P
Sbjct: 2697 -------SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP 2733



 Score = 30.3 bits (68), Expect = 5.3
 Identities = 29/177 (16%), Positives = 45/177 (25%), Gaps = 17/177 (9%)

Query: 112  DVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEK 171
                P+P  R      A     AA  S+         +  T  +L + + P   P  PE 
Sbjct: 2652 PRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP 2710

Query: 172  RPQPPGASNKIDTGGNNNCLGVTTTNKPPSSQPS-NLPKCPGFCLLNIMAAFCERPSVLI 230
             P    ++  +  G              P+       P  PG       A     P+   
Sbjct: 2711 APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPG-----GPARPARPPTTAG 2765

Query: 231  PQTSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECPGSCIVS 287
            P                  P   P  R T+P   +   +  ++P           V 
Sbjct: 2766 PPAPAPPAA----------PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVL 2812


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 38.7 bits (90), Expect = 0.011
 Identities = 17/64 (26%), Positives = 28/64 (43%)

Query: 42  NSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAESKDCPGTCVHAI 101
           N+     +T+  ++ T   TQTTTS P     + P      T T+T+ S+   G  +   
Sbjct: 794 NTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGGGICGP 853

Query: 102 ATLI 105
             +I
Sbjct: 854 IVII 857



 Score = 32.5 bits (74), Expect = 0.99
 Identities = 30/149 (20%), Positives = 52/149 (34%), Gaps = 18/149 (12%)

Query: 132 TPAAGGSTTSNAGTSSPSTTTP---PTLYTTSKPYKKPQNPEKRPQPPGA----SNKIDT 184
           T +   +TT ++   +P T       T+Y + + +   +N    P          +    
Sbjct: 702 TNSLYKATTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASG 761

Query: 185 GGNNNCLGVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCSRGSMCCD 244
            G +     T T + P S P+       +   +  +   E  SV+I +T T         
Sbjct: 762 TGTSITTSGTFTAEVPQS-PTKTETTLSYSAYSNTSILIETTSVVITKTVT--------- 811

Query: 245 NTRLPPPTKPKPRPTQPPTTTTVATTTAV 273
                  + P P  T  PT T+ +TTT  
Sbjct: 812 -QTQTTTSSPSPTQTTSPTQTSTSTTTTT 839


>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
          Length = 327

 Score = 37.0 bits (86), Expect = 0.032
 Identities = 27/164 (16%), Positives = 44/164 (26%), Gaps = 28/164 (17%)

Query: 126 SPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKIDTG 185
           SP+     + G   +    T+ P   + P +  T      P      PQ P    +I+  
Sbjct: 94  SPSAANNTSDGHDASGVKNTAPPQDISAPPISPT------PTQAAP-PQTPNGQQRIELP 146

Query: 186 GN-NNCL-----GVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTSTCSRG 239
           GN ++ L      V   ++      S LP                 P+ + P        
Sbjct: 147 GNISDALSQQQGQVNAASQNAQGNTSTLPT---------------APATVAPSKGAKVPA 191

Query: 240 SMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECPGS 283
           +     T    P   KP      T T           +     +
Sbjct: 192 TAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASA 235


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 36.3 bits (83), Expect = 0.071
 Identities = 51/290 (17%), Positives = 84/290 (28%), Gaps = 30/290 (10%)

Query: 18  VSKDAYSGTLPPNLVIMERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAG-SKP 76
            +  A      P   +   N T P +   +S T    S T  V   TT+    P G +  
Sbjct: 491 RTTSATPNATSPTPAVTTPNATSPTTQK-TSDTPNATSPTPIVIGVTTTATSPPTGTTSV 549

Query: 77  CHGLLDTITSTAESKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAG 136
            +     +T  +   +     V +  +++ + V         S         + +     
Sbjct: 550 PNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPR 609

Query: 137 GSTTS--------------NAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKI 182
            ++TS              N    +PS  +  T  +T  P   P    +   P  +S   
Sbjct: 610 SNSTSTTPLLTSAHPTGGENITEETPSVPS-TTHVSTLSPGPGPGTTSQVSGPGNSSTSR 668

Query: 183 DTGGNNNCLGV----TTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTS--TC 236
             G  +   G+     T+   PS Q + +P                  S L+  TS  T 
Sbjct: 669 YPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTN 728

Query: 237 SRGSMCCDNTRLPPPT-------KPKPRPTQPPTTTTVATTTAVPDDRPE 279
                                  +P+   T PP TT  AT    P   P+
Sbjct: 729 EGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPVPPTQHPD 778



 Score = 31.3 bits (70), Expect = 2.6
 Identities = 30/153 (19%), Positives = 51/153 (33%), Gaps = 14/153 (9%)

Query: 24  SGTLPPNLVIMERNGTKPNSGS---QSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGL 80
           + +LPP       +   P SG+    +SST P  +S ++ T + T     P  +      
Sbjct: 452 TPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTT--- 508

Query: 81  LDTITSTAESKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGSTT 140
               TS    K        +   ++       V   + S      S  N T+P      +
Sbjct: 509 -PNATSPTTQKTSDTPNATSPTPIVIG-----VTTTATSPPTGTTSVPNATSPQV-TEES 561

Query: 141 SNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRP 173
               T++P  T+ P++  TS            P
Sbjct: 562 PVNNTNTPVVTSAPSV-LTSAVTTGQHGTGSSP 593


>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein.  This family consists of
           AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
           retardation syndrome) nuclear proteins. These proteins
           have been linked to human diseases such as acute
           lymphoblastic leukaemia and mental retardation. The
           family also contains a Drosophila AF4 protein homologue
           Lilliputian which contains an AT-hook domain.
           Lilliputian represents a novel pair-rule gene that acts
           in cytoskeleton regulation, segmentation and
           morphogenesis in Drosophila.
          Length = 1154

 Score = 34.1 bits (78), Expect = 0.30
 Identities = 25/166 (15%), Positives = 45/166 (27%), Gaps = 3/166 (1%)

Query: 116 PSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQP 175
            SP+       P        G   +S  G+     +  P   ++SK  ++P+  +K P+ 
Sbjct: 459 VSPAESVSSNPPIKQPMEKEGKVKSS--GSQYHPESKEPPPKSSSKEKRRPRTAQKGPES 516

Query: 176 PGASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQTST 235
                K            T   K P  +P           L   +     P     QT  
Sbjct: 517 GRGKQKSPAQSEAPPQRRTVGKKQP-KKPEKASAGDERTGLRPESEPGTLPYGSSVQTPP 575

Query: 236 CSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECP 281
               +    + +  P  +PK             + + +     E  
Sbjct: 576 DRPKAATKGSRKPSPRKEPKSSVPPAAEKRKYKSPSKIVPKSREFI 621


>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
           This family includes attachment proteins from
           respiratory synctial virus. Glycoprotein G has not been
           shown to have any neuraminidase or hemagglutinin
           activity. The amino terminus is thought to be
           cytoplasmic, and the carboxyl terminus extracellular.
           The extracellular region contains four completely
           conserved cysteine residues.
          Length = 263

 Score = 33.5 bits (76), Expect = 0.32
 Identities = 34/155 (21%), Positives = 44/155 (28%), Gaps = 38/155 (24%)

Query: 125 ESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKIDT 184
           +     T+P+   +TT    T     TTP T  T        +     P           
Sbjct: 84  QHNQLSTSPSNQSTTTPLIHTIL-DDTTPGTKSTYQHTTVGTKGRTTTP----------- 131

Query: 185 GGNNNCLGVTTTNKPP-SSQPSNLPKCPG----FCLLNIMAAFCERPSVLIPQTSTCSRG 239
                      TNKPP   + SN P+ P     F + N +              S C   
Sbjct: 132 ---------AQTNKPPTKPRQSNPPEKPQDDFHFEVFNFVPC------------SICENN 170

Query: 240 SMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVP 274
             C    +  P  KP   PT  PT      TT   
Sbjct: 171 PACLSICKRIPEKKPGKAPTTKPTKKPKPKTTKKD 205



 Score = 29.7 bits (66), Expect = 4.8
 Identities = 35/162 (21%), Positives = 47/162 (29%), Gaps = 27/162 (16%)

Query: 117 SPSMRCCLESPANGTTPAAGGST--TSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQ 174
           +P +   L+    GT      +T  T    T+   T  PPT    S P +KPQ+      
Sbjct: 99  TPLIHTILDDTTPGTKSTYQHTTVGTKGRTTTPAQTNKPPTKPRQSNPPEKPQDDFHFEV 158

Query: 175 PPGASNKIDTGGNNNCLGV--TTTNKPPSSQPSNLPKCPGFCLLNIMAAFCERPSVLIPQ 232
                  I    N  CL +      K P   P+  P                      P+
Sbjct: 159 FNFVPCSI-CENNPACLSICKRIPEKKPGKAPTTKPTKK-------------------PK 198

Query: 233 TSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVP 274
             T  + +           T     PT  PT  T  T T  P
Sbjct: 199 PKTTKKDTKTQTTKSKEATTH---HPTSEPTKLTTKTNTTTP 237


>gnl|CDD|185588 PTZ00385, PTZ00385, lysyl-tRNA synthetase; Provisional.
          Length = 659

 Score = 33.5 bits (76), Expect = 0.43
 Identities = 12/81 (14%), Positives = 24/81 (29%), Gaps = 17/81 (20%)

Query: 225 RPSVLIPQTSTCSRGSMCCDNTRLPPPTKPK-PRPTQPPTTTTVATTTAVPDDRPECPGS 283
            P + + Q ++ S             P + K P      T T     +  P  + + P  
Sbjct: 29  LPGLQLRQVASLSSS---------RSPLELKKPISKASATKTVTQEASRAPRSKLDLPA- 78

Query: 284 CIVSYLSFTCFRNAEMTEIFK 304
                 +++ FR        +
Sbjct: 79  ------AYSSFRGITPISEVR 93


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 33.5 bits (76), Expect = 0.45
 Identities = 28/133 (21%), Positives = 46/133 (34%), Gaps = 14/133 (10%)

Query: 30  NLVIMERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAE 89
            L  ++ +GT P +           + +SAVTQ++   P   A   P       I S+  
Sbjct: 289 GLKQIDTHGTVPVAAV---------TPSSAVTQSSAITPSSAAIPSP-----AVIPSSVT 334

Query: 90  SKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESPANGTTPAAGGSTTSNAGTSSPS 149
           ++    T   A+A      +  DV  P        E       P +   T  ++  +  S
Sbjct: 335 TQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITS 394

Query: 150 TTTPPTLYTTSKP 162
           T   PT    + P
Sbjct: 395 TANGPTTSLPAAP 407



 Score = 30.5 bits (68), Expect = 3.4
 Identities = 21/92 (22%), Positives = 31/92 (33%), Gaps = 7/92 (7%)

Query: 126 SPANGTTPAAG--GSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKID 183
           SPA   +       +TT  +  +  S    P+  T       P       QP   S    
Sbjct: 325 SPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTET 384

Query: 184 TGGNNNCLGVTTTNKPPSSQP----SNLPKCP 211
              +   +  +T N P +S P    SN+P  P
Sbjct: 385 QQSSTGNI-TSTANGPTTSLPAAPASNIPVSP 415


>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein.  This
           family consists of REP proteins from Dictyostelium
           (Slime molds). REP protein is likely involved in
           transcription regulation and control of DNA replication,
           specifically amplification of plasmid at low copy
           numbers. The formation of homomultimers may be required
           for their regulatory activity.
          Length = 910

 Score = 33.6 bits (77), Expect = 0.47
 Identities = 25/138 (18%), Positives = 34/138 (24%), Gaps = 45/138 (32%)

Query: 124 LESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTS----------------------- 160
           LE   +     +  S       S PS     T  TT+                       
Sbjct: 224 LEVNESDIEQISINSENIQRINSQPSKRPNNTTTTTTTTTTTTFQPRTRKRKSIDDHKLS 283

Query: 161 ---KPYKKPQNPEKRPQPPGASNKI---DTGG-------NNNCL----GVTTTNKPPS-- 201
               P K   N +    P    +      +G        N +C     GV  +  P    
Sbjct: 284 LNQAPEKFKNNTKPDDDPQSDFSDKGSRKSGSLKDVRIDNISCSVSHNGVNISISPTETI 343

Query: 202 SQPSNLPKCPGFCLLNIM 219
              SN  K    CLL + 
Sbjct: 344 ISVSNKIKL---CLLEVS 358


>gnl|CDD|205161 pfam12956, DUF3845, Domain of Unknown Function with PDB structure. 
           Member PDB:3GF6 has statistically significant similarity
           to TNF-like jelly roll fold may indicate an
           immunomodulatory function or a bioadhesion role.
          Length = 244

 Score = 32.3 bits (73), Expect = 0.67
 Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 9/73 (12%)

Query: 490 CVTNIVRSGDAVYVRVGDHDLTRKYGSP----GAQTLRVATTYIHHNHNSQTL---DNDI 542
           C+T  VR  DA+Y  VG+ D+ R  G P    G + + VA +Y   ++   TL     D 
Sbjct: 14  CLTGCVRDNDAIYYPVGNVDIER--GGPALEVGEEDVLVARSYNEEDYVLDTLAQYPGDP 71

Query: 543 ALLKLHGQAELKD 555
            L KL     LK+
Sbjct: 72  TLGKLTFMINLKN 84


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 32.0 bits (72), Expect = 1.0
 Identities = 23/126 (18%), Positives = 40/126 (31%), Gaps = 33/126 (26%)

Query: 158 TTSKPYKKPQNPEKRPQPPGASNKIDTGGNNNCLGVTTTNKPPSS--QPSNLPKCPGFCL 215
            T++     Q    + +    +N+  T  +     +   ++P     +  N+   P    
Sbjct: 61  ATTQAANTRQERTPKLEKAPNTNEEKTSASK----IEKISQPKQEEQKSLNISATPA--- 113

Query: 216 LNIMAAFCERPSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPD 275
                     P     QT+T S              T PK + T PP+T T     +   
Sbjct: 114 ----------PKQEQSQTTTES--------------TTPKTKVTTPPSTNTPQPMQSTKS 149

Query: 276 DRPECP 281
           D P+ P
Sbjct: 150 DTPQSP 155


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 31.9 bits (72), Expect = 1.2
 Identities = 31/102 (30%), Positives = 45/102 (44%), Gaps = 24/102 (23%)

Query: 79  GLLDT--ITSTAESKDCPGTCVHAIATLICAEVLDDVQCPSPSMRCCLESP---ANGTTP 133
           GLL T  IT+T ++         A AT   +  ++  Q   PS +  +E+P    N TTP
Sbjct: 14  GLLTTGVITTTTQA---------ANATTPSSTKVEAPQSTPPSTK--VEAPQSKPNATTP 62

Query: 134 AAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQP 175
               ST   A   +P+ TTP      S   + PQ+P  +  P
Sbjct: 63  P---STKVEAPQQTPNATTPS-----STKVETPQSPTTKQVP 96


>gnl|CDD|168775 PRK07034, PRK07034, hypothetical protein; Provisional.
          Length = 536

 Score = 32.1 bits (72), Expect = 1.3
 Identities = 24/99 (24%), Positives = 32/99 (32%), Gaps = 16/99 (16%)

Query: 130 GTTPAAGGSTTSNA-------------GTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPP 176
           G  P +G S+TS A             G +   T+ PP      +P      P      P
Sbjct: 42  GIVPRSGSSSTSQAENYKEGLADQAASGNNMARTSAPPVTLYQQQPNANDSYPNGNNNNP 101

Query: 177 GASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCPGFCL 215
              N    G NNN   +    + P      L + PG  L
Sbjct: 102 NGDNNNPNGSNNN---IARVQRMPHGISRGLYERPGMLL 137


>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c.  This family of
           proteins is found in eukaryotes. Proteins in this family
           are typically between 263 and 318 amino acids in length.
           There is a conserved SFT sequence motif. MSA-2 is a
           plasma membrane glycoprotein which can be found in
           Babesia bovis species.
          Length = 201

 Score = 30.9 bits (70), Expect = 1.7
 Identities = 14/51 (27%), Positives = 30/51 (58%), Gaps = 5/51 (9%)

Query: 131 TTPAAGGSTTS--NAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGAS 179
              ++ G++T+  + G+++P+T+ PPT      P  +P+ P + P+  G+S
Sbjct: 145 KHSSSQGTSTTRPSDGSATPNTSAPPT---PGNPAAQPEKPAETPKGNGSS 192


>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II).  Bone
           sialoprotein (BSP) is a major structural protein of the
           bone matrix that is specifically expressed by
           fully-differentiated osteoblasts. The expression of bone
           sialoprotein (BSP) is normally restricted to mineralised
           connective tissues of bones and teeth where it has been
           associated with mineral crystal formation. However, it
           has been found that ectopic expression of BSP occurs in
           various lesions, including oral and extraoral
           carcinomas, in which it has been associated with the
           formation of microcrystalline deposits and the
           metastasis of cancer cells to bone.
          Length = 291

 Score = 30.8 bits (69), Expect = 2.3
 Identities = 19/46 (41%), Positives = 23/46 (50%), Gaps = 1/46 (2%)

Query: 125 ESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPE 170
           E+ A GTT  AG +TTS  G   P+T       TT  P+ K   PE
Sbjct: 191 EAEAEGTT-VAGPTTTSPNGGFQPTTPPQEVYGTTDPPFGKVTTPE 235


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 30.5 bits (69), Expect = 4.1
 Identities = 12/56 (21%), Positives = 16/56 (28%), Gaps = 2/56 (3%)

Query: 226 PSVLIPQTSTCSRGSMCCDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECP 281
           PS + P  +  +R           PP +P      PP          VP      P
Sbjct: 376 PSPVRPTPAPSTRPKAAAAAN--IPPKEPVRETATPPPVPPRPVAPPVPHTPESAP 429


>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
           glycoprotein G. 
          Length = 408

 Score = 30.0 bits (67), Expect = 4.7
 Identities = 35/176 (19%), Positives = 52/176 (29%), Gaps = 19/176 (10%)

Query: 40  KPNSGSQSSSTTPY---VSSTSAVTQTTTSRPQQPAGSKPCHGLLDTITSTAESKDCPGT 96
           +P +   +S TT +   VS  +  +Q T        G +      +   S+ E     GT
Sbjct: 167 EPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATA--NQRLSSTEPVGTQGT 224

Query: 97  CVHAIATLICAEVLDDVQCPSPSMRCCLESP-ANGTTPAAGGSTTSNAGTSSPSTTTPPT 155
              +          +    P PS R    SP    +T +   STT +    +    TPP 
Sbjct: 225 TTSSNP--------EPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPP- 275

Query: 156 LYTTSKPYKKPQNPEKRPQPPGASNKIDTGGNNNCLGVTTTNKPPSSQPSNLPKCP 211
              TS            P          T          +   PP S P  +   P
Sbjct: 276 --ATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGS--SPPHSSPPGVQANP 327


>gnl|CDD|217541 pfam03409, Glycoprotein, Transmembrane glycoprotein.  This family
           of proteins has some GO annotations for positive
           regulation of growth rate and nematode larval
           development. This is probably a family of membrane
           glycoproteins.
          Length = 353

 Score = 30.0 bits (68), Expect = 5.2
 Identities = 8/24 (33%), Positives = 11/24 (45%)

Query: 248 LPPPTKPKPRPTQPPTTTTVATTT 271
           LP  T      T   +T+ + TTT
Sbjct: 310 LPTTTSAPGSTTTTTSTSRIETTT 333


>gnl|CDD|227676 COG5386, COG5386, Cell surface protein [Cell envelope biogenesis,
           outer membrane].
          Length = 352

 Score = 29.9 bits (67), Expect = 5.7
 Identities = 16/85 (18%), Positives = 30/85 (35%), Gaps = 11/85 (12%)

Query: 129 NGTTPAAGG--STTSNAGTSSPSTTTPPTLYTTSKPYK-KPQNPEKRPQPPGASNKIDTG 185
           +     A    +   +   + P TTTP T    + P K K +NP+      G +      
Sbjct: 178 DTAIDQAANAATPNDDKNVAQPGTTTPKTPTPQTTPNKPKVENPK-----TGLTTTSYLI 232

Query: 186 GNN---NCLGVTTTNKPPSSQPSNL 207
             +        T+T    + +P+ +
Sbjct: 233 DFHVVKAGTNETSTMFTYTQKPAIV 257


>gnl|CDD|226099 COG3569, COG3569, Topoisomerase IB [DNA replication, recombination,
           and repair].
          Length = 354

 Score = 29.8 bits (67), Expect = 5.9
 Identities = 10/25 (40%), Positives = 12/25 (48%)

Query: 499 DAVYVRVGDHDLTRKYGSPGAQTLR 523
           D   +RVG+    R  GS G  TL 
Sbjct: 145 DGARIRVGNFAYARDNGSYGLTTLL 169


>gnl|CDD|220189 pfam09342, DUF1986, Domain of unknown function (DUF1986).  This
           domain is found in serine proteases and is predicted to
           contain disulphide bonds.
          Length = 267

 Score = 29.2 bits (65), Expect = 6.4
 Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 1/70 (1%)

Query: 455 DPAEWCWQVALINSLNQYLCGGALIGTQWVLTAAHCVTNIVRSGDAVYVRVGDHDLTRKY 514
           +   W W +A +     Y C G LI   WVL +  C+ +       + V +G H   +  
Sbjct: 12  EELHWPW-IAKVYVEGNYRCTGVLIDLSWVLVSHSCLWDTSLEHSYISVVLGGHKTLKSV 70

Query: 515 GSPGAQTLRV 524
             P  Q  RV
Sbjct: 71  KGPYEQIYRV 80


>gnl|CDD|183346 PRK11865, PRK11865, pyruvate ferredoxin oxidoreductase subunit
           beta; Provisional.
          Length = 299

 Score = 29.3 bits (66), Expect = 6.9
 Identities = 19/56 (33%), Positives = 26/56 (46%), Gaps = 13/56 (23%)

Query: 22  AYSGTLPPN---LVIMERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGS 74
           + SG +      L +M  N    N+G Q S +TP+ +S      TTTS    PAG 
Sbjct: 110 SLSGAMERGHNILYLMYDNEAYMNTGIQRSGSTPFGAS------TTTS----PAGK 155


>gnl|CDD|184256 PRK13700, PRK13700, conjugal transfer protein TraD; Provisional.
          Length = 732

 Score = 29.5 bits (66), Expect = 7.0
 Identities = 15/71 (21%), Positives = 25/71 (35%), Gaps = 2/71 (2%)

Query: 143 AGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASN-KIDTGGNNNCLGVTTTNKPPS 201
           A     +    P      +  ++PQ P ++P  P  ++ K D G N    G+    K   
Sbjct: 611 ASGEDVTQAEQPQQPQQPQQPQQPQQP-QQPVSPVINDKKSDAGVNVPAGGIEQELKMKP 669

Query: 202 SQPSNLPKCPG 212
            +       PG
Sbjct: 670 EEEMEQQLPPG 680


>gnl|CDD|222915 PHA02670, PHA02670, ORF112 putative chemokine-binding protein;
           Provisional.
          Length = 287

 Score = 29.2 bits (65), Expect = 8.2
 Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 2/39 (5%)

Query: 243 CDNTRLPPPTKPKPRPTQPPTTTTVATTTAVPDDRPECP 281
           CD+ + P   +P P+P   P+TT   T      D  + P
Sbjct: 165 CDHQKFPE--EPTPKPEPTPSTTPGPTLLETTLDLTDEP 201


>gnl|CDD|239471 cd03376, TPP_PFOR_porB_like, Thiamine pyrophosphate (TPP family),
           PFOR porB-like subfamily, TPP-binding module; composed
           of proteins similar to the beta subunit (porB) of the
           Helicobacter pylori four-subunit pyruvate ferredoxin
           oxidoreductase (PFOR), which are also found in archaea
           and some hyperthermophilic bacteria. PFOR catalyzes the
           oxidative decarboxylation of pyruvate to form
           acetyl-CoA, a crucial step in many metabolic pathways.
           Archaea, anaerobic bacteria and eukaryotes that lack
           mitochondria (and therefore pyruvate dehydrogenase) use
           PFOR to oxidatively decarboxylate pyruvate, with
           ferredoxin or flavodoxin as the electron acceptor. The
           36-kDa porB subunit contains the binding sites for the
           cofactors, TPP and a divalent metal cation, which are
           required for activity.
          Length = 235

 Score = 28.7 bits (65), Expect = 8.4
 Identities = 16/46 (34%), Positives = 19/46 (41%), Gaps = 10/46 (21%)

Query: 31  LVIMERNGTKPNSGSQSSSTTPYVSSTSAVTQTTTSRPQQPAGSKP 76
           L I   N    N+G Q S +TPY + T     TTT     P G   
Sbjct: 111 LYICYDNEAYMNTGIQRSGSTPYGAWT-----TTT-----PVGKVS 146


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 29.3 bits (66), Expect = 9.5
 Identities = 16/56 (28%), Positives = 20/56 (35%), Gaps = 1/56 (1%)

Query: 124 LESPANGTTPAAGGSTTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGAS 179
           LE  A     AA G+ T  A  + P    PP            + P   P PP A+
Sbjct: 377 LERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAAR-PAPAPAPPAAA 431


>gnl|CDD|218116 pfam04503, SSDP, Single-stranded DNA binding protein, SSDP.  This
           is a family of eukaryotic single-stranded DNA binding
           proteins with specificity to a pyrimidine-rich element
           found in the promoter region of the alpha2(I) collagen
           gene.
          Length = 293

 Score = 28.9 bits (64), Expect = 9.6
 Identities = 21/91 (23%), Positives = 30/91 (32%), Gaps = 8/91 (8%)

Query: 125 ESPANGTTPAAGG--STTSNAGTSSPSTTTPPTLYTTSKPYKKPQNPEKRPQPPGASNKI 182
            SP   T P  GG    T    + + ST +   +YT   P    Q     P  PG+   +
Sbjct: 182 SSPGEYTGPPGGGGPPGTPIMPSPADSTNSSDNMYTVMNPVPPGQTRSNFPMGPGSDGPM 241

Query: 183 DTGGN------NNCLGVTTTNKPPSSQPSNL 207
                      N  LG    +    + P+N 
Sbjct: 242 GGMSGMEPHHMNGSLGSGDMDSLSKNSPNNQ 272


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.133    0.422 

Gapped
Lambda     K      H
   0.267   0.0738    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,052,705
Number of extensions: 3363583
Number of successful extensions: 3869
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3638
Number of HSP's successfully gapped: 89
Length of query: 741
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 637
Effective length of database: 6,324,786
Effective search space: 4028888682
Effective search space used: 4028888682
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 63 (28.0 bits)