RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15052
         (422 letters)



>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
           synthesized as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. Alignment contains also inactive enzymes
           that have substitutions of the catalytic triad residues.
          Length = 232

 Score =  230 bits (588), Expect = 8e-74
 Identities = 93/192 (48%), Positives = 119/192 (61%), Gaps = 7/192 (3%)

Query: 177 VNSLLTSQIKIRVGEYDFSKLEEPYPYQERGVVKKMVHPKYNFFTYEYDLALVRLETPVE 236
           V S   S   +R+G +D S  E     Q   V K +VHP YN  TY+ D+AL++L+ PV 
Sbjct: 44  VYSSAPSNYTVRLGSHDLSSNEGGG--QVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVT 101

Query: 237 FAPNIVPICLPGSDDLLI-GENATVTGWGRLSEGGSLPPVLQKVTVPIVSNEKCRSMFLR 295
            + N+ PICLP S   L  G   TV+GWGR SEGG LP VLQ+V VPIVSN +C+  +  
Sbjct: 102 LSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKRAY-- 159

Query: 296 AGRYEFISDIFMCAGFDNGGRDSCQGDSGGPLQVKGKDGRYFLAGIISWGIGCAEANLPG 355
                 I+D  +CAG   GG+D+CQGDSGGPL V   +GR  L GI+SWG GCA  N PG
Sbjct: 160 -SYGGTITDNMLCAGGLEGGKDACQGDSGGPL-VCNDNGRGVLVGIVSWGSGCARPNYPG 217

Query: 356 VCTRISKFVPWV 367
           V TR+S ++ W+
Sbjct: 218 VYTRVSSYLDWI 229


>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease.  Many of these
           are synthesised as inactive precursor zymogens that are
           cleaved during limited proteolysis to generate their
           active forms. A few, however, are active as single chain
           molecules, and others are inactive due to substitutions
           of the catalytic triad residues.
          Length = 229

 Score =  220 bits (562), Expect = 6e-70
 Identities = 92/193 (47%), Positives = 119/193 (61%), Gaps = 10/193 (5%)

Query: 177 VNSLLTSQIKIRVGEYDFSKLEEPYPYQERGVVKKMVHPKYNFFTYEYDLALVRLETPVE 236
           V     S I++R+G +D S  EE    Q   V K ++HP YN  TY+ D+AL++L+ PV 
Sbjct: 45  VRGSDPSNIRVRLGSHDLSSGEE---GQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVT 101

Query: 237 FAPNIVPICLPGSDDLLI-GENATVTGWGRLSEG-GSLPPVLQKVTVPIVSNEKCRSMFL 294
            + N+ PICLP S+  +  G   TV+GWGR SEG GSLP  LQ+V VPIVSN  CR    
Sbjct: 102 LSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCRR--- 158

Query: 295 RAGRYEFISDIFMCAGFDNGGRDSCQGDSGGPLQVKGKDGRYFLAGIISWGIGCAEANLP 354
                  I+D  +CAG   GG+D+CQGDSGGPL     DGR+ L GI+SWG GCA    P
Sbjct: 159 AYSGGGAITDNMLCAGGLEGGKDACQGDSGGPLVC--NDGRWVLVGIVSWGSGCARPGKP 216

Query: 355 GVCTRISKFVPWV 367
           GV TR+S ++ W+
Sbjct: 217 GVYTRVSSYLDWI 229


>gnl|CDD|215708 pfam00089, Trypsin, Trypsin. 
          Length = 218

 Score =  164 bits (417), Expect = 1e-48
 Identities = 76/186 (40%), Positives = 106/186 (56%), Gaps = 16/186 (8%)

Query: 183 SQIKIRVGEYDFSKLEEPYPYQERGVVKKMVHPKYNFFTYEYDLALVRLETPVEFAPNIV 242
             +++ +G ++    E     Q+  V K +VHP YN  T   D+AL++L++PV     + 
Sbjct: 48  KSVRVVLGAHNIVLREGGE--QKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVR 104

Query: 243 PICLPGSDDLL-IGENATVTGWGRLSEGGSLPPVLQKVTVPIVSNEKCRSMFLRAGRYEF 301
           PICLP +   L +G   TV+GWG     G LP  LQ+VTVP+VS E CRS +        
Sbjct: 105 PICLPTASSDLPVGTTCTVSGWGNTKTLG-LPDTLQEVTVPVVSRETCRSAYGGT----- 158

Query: 302 ISDIFMCAGFDNGGRDSCQGDSGGPLQVKGKDGRYFLAGIISWGIGCAEANLPGVCTRIS 361
           ++D  +CAG   GG+D+CQGDSGGPL          L GI+SWG GCA  N PGV T +S
Sbjct: 159 VTDNMICAGA--GGKDACQGDSGGPLVCSDG----ELIGIVSWGYGCASGNYPGVYTPVS 212

Query: 362 KFVPWV 367
            ++ W+
Sbjct: 213 SYLDWI 218


>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 413

 Score = 67.6 bits (165), Expect = 3e-12
 Identities = 56/202 (27%), Positives = 81/202 (40%), Gaps = 30/202 (14%)

Query: 203 YQERGVVKKM-VHPKYNFFTYEYDLALVRLETPVEFAPNIVPICLPGSDDLLIGENATVT 261
             ERG V+ + VH  Y+      D+A++ L          +P     S D       +VT
Sbjct: 102 QAERGHVRTIYVHEFYSPGNLGNDIAVLELARAA-----SLPRVKITSFDASDTFLNSVT 156

Query: 262 GWGRLSEGG------SLPP-------VLQKVTVPIVSNEKCRSMFLRA----GRYEFISD 304
               ++ G       S  P       +L +V V  V    C      A    G       
Sbjct: 157 TVSPMTNGTFGVTTPSDVPRSSPKGTILHEVAVLFVPLSTCAQYKGCANASDGATGLTG- 215

Query: 305 IFMCAGFDNGGRDSCQGDSGGPLQVKGKDGRYFLAGIISWGIG-CAEANLPGVCTRISKF 363
              CAG     +D+CQGDSGGP+  KG++GR    G++SWG G C    +PGV T +S +
Sbjct: 216 --FCAG--RPPKDACQGDSGGPIFHKGEEGRVQR-GVVSWGDGGCGGTLIPGVYTNVSNY 270

Query: 364 VPWVLDTGDSGGPLQVKGKDGR 385
             W+    +    LQ +    R
Sbjct: 271 QDWIAAMTNGLSYLQFRPLGYR 292



 Score = 51.8 bits (124), Expect = 4e-07
 Identities = 22/50 (44%), Positives = 31/50 (62%), Gaps = 2/50 (4%)

Query: 371 GDSGGPLQVKGKDGRYFLAGIISWGIG-CAEANLPGVCTRISKFVPWVLD 419
           GDSGGP+  KG++GR    G++SWG G C    +PGV T +S +  W+  
Sbjct: 228 GDSGGPIFHKGEEGRVQR-GVVSWGDGGCGGTLIPGVYTNVSNYQDWIAA 276


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 33.2 bits (76), Expect = 0.17
 Identities = 23/77 (29%), Positives = 29/77 (37%), Gaps = 2/77 (2%)

Query: 74  ISPLAQEKTSPTLLYTKPSTPHKTSPTVHKHTTSMKPTPTTTILYSKPYRQ-TPRPMLSY 132
              L    +SP+   T  ST  KTSPTV    T+   +P+ T          T  P    
Sbjct: 77  PPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTT-TSPSETDTEEATTTVSTETPTEGG 135

Query: 133 YSHKDISSKNTTQRPYV 149
            S     SKN  +R  V
Sbjct: 136 SSAATDPSKNLLERKEV 152


>gnl|CDD|215315 PLN02578, PLN02578, hydrolase.
          Length = 354

 Score = 33.3 bits (76), Expect = 0.22
 Identities = 16/61 (26%), Positives = 22/61 (36%), Gaps = 19/61 (31%)

Query: 136 KDISSKNTTQRPYVKPLKESLGRPV-------------------NVYSNNEKVDDFSTES 176
             +  +    R  VKPLKE   R V                   +VY +   VDD+  ES
Sbjct: 199 AIVVEETVLTRFVVKPLKEWFQRVVLGFLFWQAKQPSRIESVLKSVYKDKSNVDDYLVES 258

Query: 177 V 177
           +
Sbjct: 259 I 259


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 32.4 bits (73), Expect = 0.44
 Identities = 20/91 (21%), Positives = 30/91 (32%), Gaps = 3/91 (3%)

Query: 66  SHNVTKNEISPLAQEKTSPTLLYTKPSTPHKTSPTVHKHTTSMKPTPTTTILYSKPYRQT 125
               + ++I  ++Q K          +TP          T S  P    T   + P   T
Sbjct: 84  EEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVT---TPPSTNT 140

Query: 126 PRPMLSYYSHKDISSKNTTQRPYVKPLKESL 156
           P+PM S  S    S      +  + P  E L
Sbjct: 141 PQPMQSTKSDTPQSPTIKQAQTDMTPKYEDL 171


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
           family). 
          Length = 619

 Score = 32.1 bits (73), Expect = 0.64
 Identities = 25/137 (18%), Positives = 36/137 (26%), Gaps = 10/137 (7%)

Query: 65  CSHNVTKNEISPLAQEKTSPTLLYTKPSTP--HKTSPTVHKHTTSMKPTPTTTILYSKPY 122
              N T   + P      S       PSTP   +    +  +T S+K TP  T       
Sbjct: 448 KLANKTSTVMEPPYGSTESSV-----PSTPSTRRNDRNITSNTPSLKRTPNLTKSSLSQE 502

Query: 123 RQTPRPMLSYYS-HKDISSKNTTQRPYVKPLKESLGRPVNVYSNNEKVDDFSTESVNSL- 180
                        H       T  +              +  + N   D  S  S+NS  
Sbjct: 503 ASLISKSTGNTHKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKS 562

Query: 181 -LTSQIKIRVGEYDFSK 196
              S   +RV +     
Sbjct: 563 PEHSVPLVRVFDIHLRA 579


>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein.  This family of
           trypanosomal proteins resemble vertebrate mucins. The
           protein consists of three regions. The N and C terminii
           are conserved between all members of the family, whereas
           the central region is not well conserved and contains a
           large number of threonine residues which can be
           glycosylated. Indirect evidence suggested that these
           genes might encode the core protein of parasite mucins,
           glycoproteins that were proposed to be involved in the
           interaction with, and invasion of, mammalian host cells.
           This family contains an N-terminal signal peptide.
          Length = 143

 Score = 30.2 bits (67), Expect = 1.1
 Identities = 20/83 (24%), Positives = 35/83 (42%)

Query: 80  EKTSPTLLYTKPSTPHKTSPTVHKHTTSMKPTPTTTILYSKPYRQTPRPMLSYYSHKDIS 139
           + T+ T   T P+T   T+ T    TT+   T TTT   +     T  P  +  + +  +
Sbjct: 44  QTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPT 103

Query: 140 SKNTTQRPYVKPLKESLGRPVNV 162
           + +T     ++ +  SLG    V
Sbjct: 104 TTDTRAPSSIREIDGSLGSSAWV 126


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 31.1 bits (70), Expect = 1.6
 Identities = 30/126 (23%), Positives = 41/126 (32%), Gaps = 22/126 (17%)

Query: 191 EYDFSKLEEPYPYQERGVVKKMVHPK-------YNFFTYEYDLALVRLETPVEFAPNIVP 243
           E +   LE    Y+ RG                   FT+ +      L           P
Sbjct: 334 ESEAMSLESSLFYEFRGGEHLAGFYSAFGDIKRILLFTWSFKKLGTLL-----------P 382

Query: 244 ICLPG--SDDLLIGENATVTGWGRLSE-GGSLPPVLQKVTVPIVSNEKCRSMFLRAGRYE 300
             LPG  S  +  GE      W   SE G      ++K     + + K RS      + E
Sbjct: 383 S-LPGYSSGGMDYGEEYRSLLWELGSEVGDPDSGPVRKWMRKDLFDAKVRSGVSFGKQEE 441

Query: 301 FISDIF 306
           F+SDIF
Sbjct: 442 FVSDIF 447


>gnl|CDD|235846 PRK06654, fliL, flagellar basal body-associated protein FliL;
           Reviewed.
          Length = 181

 Score = 28.6 bits (64), Expect = 3.9
 Identities = 14/47 (29%), Positives = 18/47 (38%), Gaps = 1/47 (2%)

Query: 101 VHKHTTSMKPTPTTTILYSKPYRQTPRPMLSYYSHKDISSKNTTQRP 147
           V K   S    P    ++S  Y +  RPML +Y        NT   P
Sbjct: 54  VSKMVVSQSGAPNAFPVFSNEYLEK-RPMLQWYESIGEIRGNTADTP 99


>gnl|CDD|165411 PHA03139, PHA03139, helicase-primase primase subunit; Provisional.
          Length = 860

 Score = 29.2 bits (65), Expect = 4.6
 Identities = 22/74 (29%), Positives = 34/74 (45%), Gaps = 5/74 (6%)

Query: 95  HKTSPTVHKHTTSMKPTPTTTILYSKPYRQTPRPML---SYYSHKDISSKNTTQRPYVKP 151
           HK+   + K+ T     PT+   +   Y QT   +L    YY  ++  S  TT + YV+ 
Sbjct: 122 HKS--LIIKYLTGKFLLPTSPFWFLSTYGQTEGMLLLTMYYYLFEEQKSTITTTKNYVQC 179

Query: 152 LKESLGRPVNVYSN 165
             E+ G  V  YS+
Sbjct: 180 FTENTGDMVFTYSS 193


>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
           Provisional.
          Length = 784

 Score = 29.1 bits (65), Expect = 6.0
 Identities = 14/28 (50%), Positives = 18/28 (64%)

Query: 251 DLLIGENATVTGWGRLSEGGSLPPVLQK 278
           DLL  E A V G G+L+  GSL  V+Q+
Sbjct: 609 DLLTIETACVPGKGKLTYTGSLGEVMQE 636


>gnl|CDD|234137 TIGR03176, AllC, allantoate amidohydrolase.  This enzyme catalyzes
           the breakdown of allantoate , first to ureidoglycine by
           hydrolysis and then decarboxylation of one of the two
           equivalent ureido groups. Ureidoglycine then
           spontaneously exchanges ammonia for water resulting in
           ureidoglycolate. This enzyme is an alternative to
           allantoicase (3.5.3.4) which releases urea [Central
           intermediary metabolism, Nitrogen metabolism].
          Length = 406

 Score = 28.7 bits (64), Expect = 6.8
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query: 23  KINPKPCSVNSVEGRCMFVWECINTD 48
           K+ P P +VN V G   F  +C +TD
Sbjct: 261 KVEPVPNTVNVVPGETTFTIDCRHTD 286


>gnl|CDD|191916 pfam07982, Herpes_UL74, Herpes UL74 glycoproteins.  Members of this
           family are viral glycoproteins that form part of an
           envelope complex.
          Length = 417

 Score = 28.5 bits (63), Expect = 8.6
 Identities = 14/46 (30%), Positives = 24/46 (52%)

Query: 73  EISPLAQEKTSPTLLYTKPSTPHKTSPTVHKHTTSMKPTPTTTILY 118
           + S  +Q  T+P   YT  +  + T+   +  TT+ +  PT+TI Y
Sbjct: 225 KKSKKSQSTTTPYSSYTTSTAFNVTTNVTYSATTTTRRVPTSTIGY 270


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.138    0.434 

Gapped
Lambda     K      H
   0.267   0.0659    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,184,010
Number of extensions: 1997952
Number of successful extensions: 1468
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1437
Number of HSP's successfully gapped: 28
Length of query: 422
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 323
Effective length of database: 6,546,556
Effective search space: 2114537588
Effective search space used: 2114537588
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.0 bits)