RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14715
         (1147 letters)



>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases, insulinase-like
            [General function prediction only].
          Length = 978

 Score =  706 bits (1824), Expect = 0.0
 Identities = 333/1002 (33%), Positives = 523/1002 (52%), Gaps = 70/1002 (6%)

Query: 149  GFLVKNVTPIPEFQMTAIKLQHVKTLAEYFHLSRDDSNNVFAVAFRTPPPDSTGITHILE 208
            GF V +   +PE       L+H KT AE  H+  +D NNVF++AF+T P DSTG+ HILE
Sbjct: 5    GFKVISSEDLPEISSKGYILEHEKTGAELAHIKNEDPNNVFSIAFKTEPHDSTGVAHILE 64

Query: 209  HLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFYPFSSQNHCDYFNLMSIYLDAV 268
            H  LCGS KYP +DPF KML RS+ TF+NA T PD T YP SS N  D++NL+S+YLDAV
Sbjct: 65   HTVLCGSKKYPVKDPFFKMLKRSLNTFLNAFTFPDKTVYPASSANEKDFYNLLSVYLDAV 124

Query: 269  FNPQLKQLDFMQEGWRLEHEDIKDQNSPIIFKGVVFNEMKGAFSDNSYIFGEALMNNILP 328
            F+P L +  F+QEGWR+E +D     S + +KGVV+NEMKGA+S    +   A+  ++ P
Sbjct: 125  FHPLLTKESFLQEGWRIEFKDE----SNLKYKGVVYNEMKGAYSSGESVLSRAMQQSLFP 180

Query: 329  TYCYKHVSGGDPIKILNLKYENLVNYHKKHYHPTNSKFFSYGNFNLEDHLSFINTNYLSK 388
               Y   SGGDP  I +L YE    +HKKHYHP+N K F YGN   E  L FI    L  
Sbjct: 181  GTTYGVNSGGDPKNIPDLTYEEFRAFHKKHYHPSNCKIFVYGNIPTERLLDFIEEKVLR- 239

Query: 389  INPYQHHRSSTAVLPEPAWDKPRQLHIHGRHD-PLASENQSHIAIAYKCAVMDNFKDVFV 447
              P+        +  + A+ KPR+  +          E+Q  +++++      + +D   
Sbjct: 240  --PFGKRELDVPIPDQKAFKKPRRKVLEYPISFDEEDEDQGLLSLSWLGGSASDAEDSLA 297

Query: 448  LNILGDLLLKGPNAPFYKNLVESGLGLSFSPVTG-YEASIHDTLFTVGLQGVDSNKFDEI 506
            L +L ++LL    +P  + L+ESGLG  F+ V+G Y++ + +T+F+V             
Sbjct: 298  LEVLEEILLDSAASPLTQALIESGLG--FADVSGSYDSDLKETIFSV------------- 342

Query: 507  IGAVNKTIDEVIAEGFDKERVAIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVASV 566
                                   GL+GV   K  ++K  V  T+ E++  G DK+ + ++
Sbjct: 343  -----------------------GLKGVSEEKIAKLKNLVLSTLKELVKNGIDKKLIEAI 379

Query: 567  LHSLELSLKH-QSSNFGLNLLFWLVPFMNHDCDVIHLLHINDRLNWFKKHIQENPTYLQE 625
            LH LE SLK  +S  FGL L+F  +    +  D    L   D L   ++ +++ P Y ++
Sbjct: 380  LHQLEFSLKEVKSYPFGLGLMFRSLYGWLNGGDPEDSLRFLDYLQNLREKLEKGP-YFEK 438

Query: 626  KVDEYLRNNPHKLIITMSPEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNGTELRKE 685
             + +Y  +NPH + + + P    +EKL+K E+++L+ R S++ D+DL K+  +  +L++ 
Sbjct: 439  LIRKYFLDNPHYVTVIVLPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKER 498

Query: 686  QEK---EQNIDVLPTLKISDVDDHVERVVTTDKHILQVPIQLSTQPTNGVTYFRSVVDTS 742
            Q++   E+++  LPTLK+ DV D +E+     +   +  +      TNG+TY R   D  
Sbjct: 499  QDQPDSEEDLATLPTLKLGDVPDPIEKTSLETEVSNEAKVLHHDLFTNGITYLRLYFDLD 558

Query: 743  KLSPELKPLVPLFNYVINQMRTKNYDFREMDQLIHMSTGGISFNSHLGESCSTPNGFEEA 802
             L  EL P +PLF + +  + T+ Y ++E+   I   TGGIS +  +         +  +
Sbjct: 559  MLPSELLPYLPLFAFALTNLGTETYSYKELLNQIERHTGGISVSLSVDTDPGDDGEYRPS 618

Query: 803  ILVSSHCLEHNNDKMFDVLSELFNNVQLTDLNRFTTLVNTLSSELINGISGNGHRYAMSI 862
              +S   L    +K+F+++ E+  N    D  R   L+    S+L + +  +GH  A S+
Sbjct: 619  FSISGKALRSKVEKLFELIREILANTDFHDRERLKELLEQYLSDLTSSVRNSGHSIASSL 678

Query: 863  ASSLVDPVSEQKEIYSGLSFVSKIKEI----AQSPKLENILQDIQSIGAHVLRKDSMRCA 918
            A+S +      KE+ +GLS V  ++E+     ++ + E I   +Q++   + + +++R A
Sbjct: 679  ANSRLSSAGALKELLNGLSQVKFLRELSSNFEENFEKE-IADKLQALRKKIFQTNNLRIA 737

Query: 919  LNMSAQSNAPERLES----FLQSIPGDFT--SQPGQTVHSFNVSGIQKVSHVLPFPVNFT 972
            + +       + LE+    FL+ +   F   + P         S  +      P   N  
Sbjct: 738  I-IGDIDKILDLLENPLLKFLEHLLPGFELPTPPKNPHLDLISSLSEATIIPSPVAYNAL 796

Query: 973  AKSLRGVPFLHKDYVALKVLSKFLTTKYLLREVREKNGAYGAGAVVSP-SGVIQFYSYRD 1031
            A S+ G+P+ H DY AL+VLS++L + YL  ++REK GAYGA A +    GV  F SYRD
Sbjct: 797  AFSIGGLPYTHPDYAALQVLSEYLGSGYLWNKIREKGGAYGASASIDANRGVFSFASYRD 856

Query: 1032 PYALETLATFDQSTQFLADTKLSVQDLDEAKLGVFKEVDAPIPPGSKGMSKFLY---GKT 1088
            P  L+T   F +S + LA      +DL+EA LG+   +D P  P S+G   F     G T
Sbjct: 857  PNILKTYKVFRKSVKDLASGNFDERDLEEAILGIISTLDTPESPASEGSKSFYRDLSGLT 916

Query: 1089 DEMIEQYRLSVKQVTEDDIRRVADTYLSRDATEKLSSYVVIG 1130
            DE  + +R  +  VT++DI+ V D YL   ++E   S  V  
Sbjct: 917  DEERQAFRERLLDVTKEDIKEVMDKYLLNFSSEN--SIAVFA 956


>gnl|CDD|240416 PTZ00432, PTZ00432, falcilysin; Provisional.
          Length = 1119

 Score =  345 bits (886), Expect = e-101
 Identities = 270/1153 (23%), Positives = 498/1153 (43%), Gaps = 139/1153 (12%)

Query: 79   SYCRPITDYGSIVYQSAKPSAKGKLNSTHNNFIRLCTGAFRSSPLDSLFCEMAPEKKFIS 138
             Y   I++ G         + K + N    +F+     +  ++ L S   +M    +++ 
Sbjct: 11   GYLNIISNLGKYASSLHNKAEKRRYNVFSEDFL----YSINTTNLYSFTADMNKSPEWVQ 66

Query: 139  HSFEEGAEVEGFLVKNVTPIPEFQMTAIKLQHVKTLAEYFHLSRDDSNNV---FAVAFRT 195
             +     E E +       +P+F M A    H KT  +   L  +DS+     F     T
Sbjct: 67   EAMP--VEHEAYEKIEKRYLPDFGMVATVYSHKKTGLQVISLKTNDSSGKEMCFDFYVPT 124

Query: 196  PPPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFYPFSSQNHC 255
            PP +  GI HILEH  L GS KY  +D F  ++     +F+NA T  D T Y F+S N  
Sbjct: 125  PPHNDKGIPHILEHSVLSGSKKYNYKDSFSLLVQGGFNSFLNAYTFKDRTSYLFASTNEK 184

Query: 256  DYFNLMSIYLDAVFNPQLKQLD--FMQEGWRLEHEDIKDQNSP-----------IIFKGV 302
            D++N   +Y+D+VF P + +    F QEGW  +   +KD               + + G+
Sbjct: 185  DFYNTADVYMDSVFQPNILEDKDIFKQEGWHYKVTKLKDDEKNADELGNVHDRHVSYSGI 244

Query: 303  VFNEMKGAFSDNSYIFGEALMNNILPTYCYKHVSGGDPIKILNLKYENLVNYHKKHYHPT 362
            V++EMK  FSD        +  N+  +  YK+ SGGDP  I+ L YE LV ++K +Y P 
Sbjct: 245  VYSEMKKRFSDPLSFGYSVIYQNLF-SNVYKYDSGGDPKDIVELTYEELVEFYKTYYGPK 303

Query: 363  NSKFFSYGNFNLEDHLSFINTNYLSKINPYQHHRSSTA--------VLPEPAWDKPRQLH 414
             +  + YG  ++ + L F++ NYL+K +P     S TA        +L E   DKP+  H
Sbjct: 304  TATVYFYGPNDVTERLEFVD-NYLTK-HPKTGQLSHTAYREDADENLLYEEYKDKPK--H 359

Query: 415  IHGRHDPLASENQSHIAIAY-----KCAVMDNFK------DVFVLNILGDLLLKGPNAPF 463
            +  +    + E ++ +++++          D  K      D   L +L  LLL  P +  
Sbjct: 360  VKKKFSSHSEEEENLMSVSWLLNPKHNGSKDYDKSLIDPVDYLALLVLNYLLLGTPESVL 419

Query: 464  YKNLVESGLGLSFSPVTGYEASIHDTLFTVGLQGVDSNKFDEIIGAVNKTIDEVIAEGFD 523
            YK L++SGLG      +G +     ++F++GL+G+             K  D+V      
Sbjct: 420  YKALIDSGLGKKVV-GSGLDDYFKQSIFSIGLKGIKETNE--------KRKDKVHYT--- 467

Query: 524  KERVAIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVASVLHSLELSLKHQSSNF-- 581
                           F+++   V   + +V+ EGF+K  V + L+++E  +K  +     
Sbjct: 468  ---------------FEKV---VLNALTKVVTEGFNKSAVEASLNNIEFVMKELNLGTYP 509

Query: 582  -GLNLLFWLVPFMNHDCDVIHLLHINDRLNWFKKHIQENPTYLQEKVDEYLRNNPHKLII 640
             GL L+F +   + +  D   +L     LN  K  I     YL++ ++++L NN H++ +
Sbjct: 510  KGLMLIFLMQSRLQYGKDPFEILRFEKLLNELKLRIDNESKYLEKLIEKHLLNNNHRVTV 569

Query: 641  TMS--PEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNGTELRKEQEKE---QNIDVL 695
             +       ++++ +K+ KD LK+R+S +  + ++++     + +KE+E +   +++D  
Sbjct: 570  HLEAVESSKYEKEFNKLVKDELKERLSHLTKEQVDEMEKAYEKFKKEREADDDPEHLDSF 629

Query: 696  PTLKISDVDDHVERVVT----TDKHILQ-----------VPIQLSTQPTNGVTYFRSVVD 740
            P L +SD++   E + T         L+           V + +    + G+ Y      
Sbjct: 630  PILSLSDLNKETEEIPTKLYKLSSDSLKENMDLDSDGGSVTVLVHPIESRGILYLDFAFS 689

Query: 741  TSKLSPELKPLVPLFNYVINQMRTKNYDFREMDQLIHMSTGGISFNSHL-----GESCST 795
               L+ +    + LF  ++ +  T      E       + GG+S ++         +   
Sbjct: 690  LDSLTVDELKYLNLFKALLKENGTDKLSSEEFTYKREKNLGGLSASTAFYSETNNLTYDD 749

Query: 796  PNGFEEAILVSSHCLEHNNDKMFDVLSELFNNVQLTDLNRFTTLVNTLSSELINGISGNG 855
            P      + V +  L+H  ++M D++ E   +   ++  +   ++    + +    S  G
Sbjct: 750  PYNGVGYLNVRAKVLKHKVNEMVDIVLEALKDADFSNSKKGVEILKRKINGMKTVFSSKG 809

Query: 856  HRYAMSIASSLVDPVSEQKEIYSGLSFVSKIKE----IAQS--PKLENILQDIQSIGAHV 909
            H++A+    S         E+ +G S +  +KE    +A+    K+E+ L +I++    +
Sbjct: 810  HKFALKRMKSKFSVSDYADELVNGYSQLLFLKETLVPLAEKDWSKVESKLNEIRN---KL 866

Query: 910  LRKDSMRCALNMSAQSNAPERL----ESFLQSIPGDFTSQPGQTVHSFNVSGI------Q 959
            L   ++   +N++  S   + L     +FL+ +   F     ++     V  +      +
Sbjct: 867  LSMKNL--TVNVTGDSELLDSLLDDSTTFLKKLSSTFKENDNKSSDKVWVKEVLDKKLME 924

Query: 960  KVSH----VLPFPVNFTAKSLRGVPFLHKDYV--ALKVLSKFLTTKYLLREVREKNGAYG 1013
             V      VLP  VNF    + G  F   D V  + +V+  +L   YL + VR   GAYG
Sbjct: 925  SVDKNEFIVLPTRVNFV--GMGGKLFDKSDKVDGSFQVIVHYLKNSYLWKTVRMSLGAYG 982

Query: 1014 AGAVVSPSGVIQFYSYRDPYALETLATFDQSTQFL--ADTKLSVQDLDEAKLGVFKEVDA 1071
              A +  +G + F SY DP   +TL  + +    L  A   L+ +DL   K+G    +D 
Sbjct: 983  VFADLLYTGHVIFMSYADPNFEKTLEVYKEVASALREAAETLTDKDLLRYKIGKISNIDK 1042

Query: 1072 PIPPGSKGMSKF---LYGKTDEMIEQYRLSVKQVTEDDIRRVADTYLSRDATEKLSSYVV 1128
            P+             +  ++DE  +++R  + + T++D  R+AD        EK+ + VV
Sbjct: 1043 PLHVDELSKLALLRIIRNESDEDRQKFRKDILETTKEDFYRLADLMEKSKEWEKVIA-VV 1101

Query: 1129 IGPKSNNLGDEWK 1141
                S+    ++K
Sbjct: 1102 NSKTSDTAPSDFK 1114


>gnl|CDD|116948 pfam08367, M16C_assoc, Peptidase M16C associated.  This domain
           appears in eukaryotes as well as bacteria and tends to
           be found near the C-terminus of the metalloprotease M16C
           (pfam05193).
          Length = 248

 Score =  225 bits (575), Expect = 8e-67
 Identities = 82/248 (33%), Positives = 140/248 (56%)

Query: 642 MSPEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNGTELRKEQEKEQNIDVLPTLKIS 701
           MSP++T +EKL++ E++ L+ + S ++++D  K+   G EL + Q   +++  LPTL +S
Sbjct: 1   MSPDETLEEKLNEEEREKLEKKKSSLSEEDKEKIIERGLELEELQSTPEDLSCLPTLTLS 60

Query: 702 DVDDHVERVVTTDKHILQVPIQLSTQPTNGVTYFRSVVDTSKLSPELKPLVPLFNYVINQ 761
           D+    +     ++ +  VP+      TNG+ Y R V D + L  EL P +PLF  V+ +
Sbjct: 61  DIPRSGDEYPLEEEKVGDVPVLHHDVDTNGIVYVRLVFDLNALPEELLPYLPLFCDVLTK 120

Query: 762 MRTKNYDFREMDQLIHMSTGGISFNSHLGESCSTPNGFEEAILVSSHCLEHNNDKMFDVL 821
           + T+NY +RE+++ I + TGGISF+  +  +    + +E + ++S   L     K+F+++
Sbjct: 121 LGTENYSYRELEEEILLHTGGISFSVSVSSNPDDTDKYEPSFVISGKALNSKVPKLFELI 180

Query: 822 SELFNNVQLTDLNRFTTLVNTLSSELINGISGNGHRYAMSIASSLVDPVSEQKEIYSGLS 881
           SE+ N     D  R   L+   +S L N I+ +GH YAMS A+S +       E+ SGL 
Sbjct: 181 SEILNETDFDDEERLKELIRESASRLTNSIADSGHSYAMSRAASYLSIAGAISELLSGLD 240

Query: 882 FVSKIKEI 889
            V  IKE+
Sbjct: 241 QVKFIKEL 248


>gnl|CDD|218490 pfam05193, Peptidase_M16_C, Peptidase M16 inactive domain.
           Peptidase M16 consists of two structurally related
           domains. One is the active peptidase, whereas the other
           is inactive. The two domains hold the substrate like a
           clamp.
          Length = 182

 Score = 94.9 bits (236), Expect = 3e-22
 Identities = 39/184 (21%), Positives = 82/184 (44%), Gaps = 8/184 (4%)

Query: 345 NLKYENLVNYHKKHYHPTNSKFFSYGNFNLEDHLSFINTNYLSKINPYQHHRSSTAVLPE 404
           +L  E+L +++KKHY P N      G+ +L++ L+     Y   +              E
Sbjct: 1   SLTREDLKDFYKKHYSPDNMVLVVVGDVDLDELLALAEK-YFGDLPASPPKPKPREPPLE 59

Query: 405 PAWDKPRQLHIHGRHDPLASENQSHIAIAYKCAVMDNFKDVFVLNILGDLLLKGPNAPFY 464
           P     +++ +  +        Q+ +A+A+    + N  D   L++L +LL  G ++  +
Sbjct: 60  PEELTGKEVVVPDK-----DVPQAKLALAFPGPPLGNDPDSAALDVLAELLGGGASSRLF 114

Query: 465 KNLVESGLGLSFSPVTGYEASIHDTLFTVGLQGVDSNKFDEIIGAVNKTIDEVIAEGFDK 524
           + L E   GL++S    +++     LF +    +D    DE+I  + + + ++  EG  +
Sbjct: 115 QELREK-EGLAYSVSAFFDSYSDTGLFGIYAD-LDPENLDEVIELIFEELKKLAEEGITE 172

Query: 525 ERVA 528
           E + 
Sbjct: 173 EELE 176



 Score = 51.7 bits (124), Expect = 2e-07
 Identities = 37/150 (24%), Positives = 63/150 (42%), Gaps = 15/150 (10%)

Query: 928  PERLESFLQSIPGDFTSQPGQTVHSFN------VSGIQKVSHVLPFPVNFTAKSLRGVPF 981
             + L +  +   GD  + P +            ++G + V      P    A +  G P 
Sbjct: 30   LDELLALAEKYFGDLPASPPKPKPREPPLEPEELTGKEVVVPDKDVPQAKLALAFPGPPL 89

Query: 982  L-HKDYVALKVLSKFL---TTKYLLREVREKNG-AYGAGAVV---SPSGVIQFYSYRDPY 1033
                D  AL VL++ L    +  L +E+REK G AY   A     S +G+   Y+  DP 
Sbjct: 90   GNDPDSAALDVLAELLGGGASSRLFQELREKEGLAYSVSAFFDSYSDTGLFGIYADLDPE 149

Query: 1034 AL-ETLATFDQSTQFLADTKLSVQDLDEAK 1062
             L E +    +  + LA+  ++ ++L+ AK
Sbjct: 150  NLDEVIELIFEELKKLAEEGITEEELERAK 179


>gnl|CDD|223685 COG0612, PqqL, Predicted Zn-dependent peptidases [General function
           prediction only].
          Length = 438

 Score = 77.9 bits (192), Expect = 7e-15
 Identities = 68/345 (19%), Positives = 125/345 (36%), Gaps = 34/345 (9%)

Query: 188 VFAVAFRTPPPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFY 247
                 R  P    GI H LEH++  G+   P  +   +   +      NA T  DYT Y
Sbjct: 44  WVKAGSRAEPAGKAGIAHFLEHMAFKGTTGLPSAE-LAEAFEKLGGQL-NAFTSFDYTVY 101

Query: 248 PFSSQNHCDYFN-LMSIYLDAVFNPQLKQLDFMQEGWRLEHEDIKDQNSPIIFKGVVFNE 306
             S     D  +  + +  D + NP   + +  +E                  KGV+  E
Sbjct: 102 YLSVLP--DNLDKALDLLADILLNPTFDEEEVERE------------------KGVILEE 141

Query: 307 MKGAFSDNSYIFGEALMNNILPTYCYKHVSGGDPIKILNLKYENLVNYHKKHYHPTNSKF 366
           ++    D   +  E L+  +   +       G    I  +  E+L ++++K Y P N   
Sbjct: 142 IRMRQDDPDDLAFERLLEALYGNHPLGRPILGTEESIEAITREDLKDFYQKWYQPDNMVL 201

Query: 367 FSYGNFNLEDHLSFINTNYLSKINPYQHHRSSTAVLPEPAWDKPRQLHIHGRHDPLASEN 426
              G+ + E+ +  I   Y   +           + PEP     R + ++    P     
Sbjct: 202 VVVGDVDAEEVVELIE-KYFGDLPG---AAPPPKIPPEPPLGPERVVRVNDPEQPDLE-- 255

Query: 427 QSHIAIAYKCAVMDNFKDVFVLNILGDLLLKGPNAPFYKNLVESGLGLSFSPVTGYEASI 486
           Q+ +A+ Y     D+  D   L +L  LL  G ++  ++ L E   GL++S  +  +   
Sbjct: 256 QAWLALGYPGPDYDSPDDYAALLLLNGLLGGGFSSRLFQELRE-KRGLAYSVSSFSDFLS 314

Query: 487 HDTLFTVGLQGVDSNKFDEIIGAVNKTIDEVIAEG---FDKERVA 528
              LF++   G      ++    V + +  +       F +E + 
Sbjct: 315 DSGLFSI-YAGTAPENPEKTAELVEEILKALKKGLKGPFTEEELD 358



 Score = 45.5 bits (108), Expect = 1e-04
 Identities = 39/226 (17%), Positives = 73/226 (32%), Gaps = 22/226 (9%)

Query: 929  ERLESFLQSIPGDFTSQPGQTVHSFNVSGIQKVSHVLPFPVNFT--AKSLRGVPFLHK-D 985
            E +E +   +PG                 + +V+      +     A    G  +    D
Sbjct: 214  ELIEKYFGDLPGAAPPPKIPPEPPLGPERVVRVNDPEQPDLEQAWLALGYPGPDYDSPDD 273

Query: 986  YVALKVLSKFL---TTKYLLREVREKNG-AYGAGAVVS------PSGVIQFYSYRDPYAL 1035
            Y AL +L+  L    +  L +E+REK G AY   +            +    +  +P   
Sbjct: 274  YAALLLLNGLLGGGFSSRLFQELREKRGLAYSVSSFSDFLSDSGLFSIYAGTAPENPEKT 333

Query: 1036 -----ETLATFDQSTQFLADTKLSVQDLDEAKLGVFKEVDAPIPPGSKGMSKFLYGKTDE 1090
                 E L    +  +     +            +   +D+P           L G +  
Sbjct: 334  AELVEEILKALKKGLKGPFTEEELDAAKQLLIGLLLLSLDSPSSIAELLGQYLLLGGSLI 393

Query: 1091 MIEQYRLSVKQVTEDDIRRVADTYLSRDATEKLSSYVVIGPKSNNL 1136
             +E+    ++ VT +D+  VA   L      +  + VV+GP+    
Sbjct: 394  TLEELLERIEAVTLEDVNAVAKKLL----APENLTIVVLGPEKALK 435


>gnl|CDD|223956 COG1025, Ptr, Secreted/periplasmic Zn-dependent peptidases,
           insulinase-like [Posttranslational modification, protein
           turnover, chaperones].
          Length = 937

 Score = 53.5 bits (129), Expect = 6e-07
 Identities = 38/180 (21%), Positives = 66/180 (36%), Gaps = 26/180 (14%)

Query: 197 PPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFYPFSSQNHCD 256
           P +  G+ H LEH+   GS KYP    F + L++   +  NA T  + T + F    +  
Sbjct: 60  PEEYPGLAHFLEHMLFMGSEKYPDEGGFSEFLSKHGGSH-NASTAGERTAFYFEV-ENDA 117

Query: 257 YFNLMSIYLDAVFNPQLKQLDFMQEGWRLEHEDIKDQNSPIIFKGVVFNEMKGAFSDNSY 316
               +  + D    P   +    +E                  +  V +E     + + +
Sbjct: 118 LEGALDRFADFFIEPLFNKEALDRE------------------RNAVNSEFTMNLTSDGW 159

Query: 317 IFGEALMNNILPTYCYKHVSGGDPIKILNLK-----YENLVNYHKKHYHPTNSKFFSYGN 371
              +       P +     S G+ ++ L+ K      + L  +H+KHY   N K   YGN
Sbjct: 160 RMYQVQALTANPGHPLSKFSTGN-LETLSDKPGLVVQQELKEFHEKHYSANNMKLVIYGN 218


>gnl|CDD|216056 pfam00675, Peptidase_M16, Insulinase (Peptidase family M16). 
          Length = 149

 Score = 41.9 bits (99), Expect = 4e-04
 Identities = 23/100 (23%), Positives = 42/100 (42%), Gaps = 6/100 (6%)

Query: 197 PPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYTFYPFSSQNHCD 256
           P D+ G+ H LEH++  G+ KYP  D   + L +   +  NA T  ++T Y     N  D
Sbjct: 27  PDDNNGLAHFLEHMAFKGTKKYPSED-LEEELEKLGGSL-NAYTSREHTVYYVEVLN--D 82

Query: 257 YFN-LMSIYLDAVFNPQLKQLDFMQEGWRLEHEDIKDQNS 295
                +    D   NP     +  +    +   +++  ++
Sbjct: 83  DLPKAVDRLADFFLNPLFSPSEV-ERERLVVLYEVEAVDA 121


>gnl|CDD|185056 PRK15101, PRK15101, protease3; Provisional.
          Length = 961

 Score = 37.6 bits (88), Expect = 0.043
 Identities = 20/52 (38%), Positives = 25/52 (48%), Gaps = 2/52 (3%)

Query: 197 PPDSTGITHILEHLSLCGSVKYPCRDPFMKMLTRSMATFMNAMTGPDYT-FY 247
           P    G+ H LEH+ L GS KYP  D   + L +   +  NA T    T FY
Sbjct: 80  PDAQQGLAHYLEHMVLMGSKKYPQPDSLAEFLKKHGGSH-NASTASYRTAFY 130


>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
           represents the eukaryotic large ribosomal protein P2.
           Eukaryotic P1 and P2 are functionally equivalent to the
           bacterial protein L7/L12, but are not homologous to
           L7/L12. P2 is located in the L12 stalk, with proteins
           P1, P0, L11, and 28S rRNA. P1 and P2 are the only
           proteins in the ribosome to occur as multimers, always
           appearing as sets of heterodimers. Recent data indicate
           that eukaryotes have four copies (two heterodimers),
           while most archaeal species contain six copies of L12p
           (three homodimers). Bacteria may have four or six copies
           of L7/L12 (two or three homodimers) depending on the
           species. Experiments using S. cerevisiae P1 and P2
           indicate that P1 proteins are positioned more internally
           with limited reactivity in the C-terminal domains, while
           P2 proteins seem to be more externally located and are
           more likely to interact with other cellular components.
           In lower eukaryotes, P1 and P2 are further subdivided
           into P1A, P1B, P2A, and P2B, which form P1A/P2B and
           P1B/P2A heterodimers. Some plants have a third
           P-protein, called P3, which is not homologous to P1 and
           P2. In humans, P1 and P2 are strongly autoimmunogenic.
           They play a significant role in the etiology and
           pathogenesis of systemic lupus erythema (SLE). In
           addition, the ribosome-inactivating protein
           trichosanthin (TCS) interacts with human P0, P1, and P2,
           with its primary binding site in the C-terminal region
           of P2. TCS inactivates the ribosome by depurinating a
           specific adenine in the sarcin-ricin loop of 28S rRNA.
          Length = 109

 Score = 33.8 bits (78), Expect = 0.11
 Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 14/57 (24%)

Query: 510 VNKTIDEVIAEGFDKERVAIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVASV 566
           V K +  V  E  D E++        +    E++G   K ++E+IA G  KE++ASV
Sbjct: 23  VKKILGSVGVE-VDDEKL--------NKVISELEG---KDVEELIAAG--KEKLASV 65


>gnl|CDD|227699 COG5412, COG5412, Phage-related protein [Function unknown].
          Length = 637

 Score = 34.0 bits (78), Expect = 0.58
 Identities = 19/87 (21%), Positives = 36/87 (41%), Gaps = 6/87 (6%)

Query: 466 NLVESGLGLSFSPVTGYEASIHDT---LFTVGLQGVDS--NKFDEIIGAVNKTIDEVIAE 520
           +L+ S +G++  PV G   ++ +T     + G   + S   +      ++   +  +I  
Sbjct: 408 SLIGSKVGMAIIPVLGIITTVMNTIKSTISNGSIAIKSVATRIFSGRSSIFNGVRNLITI 467

Query: 521 GFDKERVAI-GLQGVDSNKFDEIKGAV 546
            FD  R A  GL    S  F ++   V
Sbjct: 468 VFDGIRGAWGGLTSAVSPIFSKVGNFV 494


>gnl|CDD|233732 TIGR02110, PQQ_syn_pqqF, coenzyme PQQ biosynthesis probable
           peptidase PqqF.  In a subset of species that make
           coenzyme PQQ (pyrrolo-quinoline-quinone), this probable
           peptidase is found in the PQQ biosynthesis region and is
           thought to act as a protease on PqqA (TIGR02107), a
           probable peptide precursor of the coenzyme. PQQ is
           required for some glucose dehydrogenases and alcohol
           dehydrogenases [Biosynthesis of cofactors, prosthetic
           groups, and carriers, Other].
          Length = 696

 Score = 34.1 bits (78), Expect = 0.59
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 202 GITHILEHLSLCGSVKYPCRDPFMKMLTR 230
           G+ H LEHL   G  ++   D  M  + R
Sbjct: 41  GLAHFLEHLLFLGGERFQGDDRLMPWVQR 69


>gnl|CDD|239248 cd02950, TxlA, TRX-like protein A (TxlA) family; TxlA was
          originally isolated from the cyanobacterium
          Synechococcus. It is found only in oxygenic
          photosynthetic organisms. TRX is a small enzyme that
          participate in redox reactions, via the reversible
          oxidation of an active site dithiol present in a CXXC
          motif. Disruption of the txlA gene suggests that the
          protein is involved in the redox regulation  of the
          structure and function of photosynthetic apparatus. The
          plant homolog (designated as HCF164) is localized in
          the chloroplast and is involved in the assembly of the
          cytochrome b6f complex, which takes a central position
          in photosynthetic electron transport.
          Length = 142

 Score = 31.5 bits (72), Expect = 0.93
 Identities = 7/26 (26%), Positives = 13/26 (50%)

Query: 28 KWLGLIVDSKLNWIPQINYLNNKLKL 53
          KWL  I   +++ IP   +L+ +   
Sbjct: 64 KWLPEIDRYRVDGIPHFVFLDREGNE 89


>gnl|CDD|215730 pfam00118, Cpn60_TCP1, TCP-1/cpn60 chaperonin family.  This family
           includes members from the HSP60 chaperone family and the
           TCP-1 (T-complex protein) family.
          Length = 481

 Score = 33.0 bits (76), Expect = 1.1
 Identities = 11/51 (21%), Positives = 20/51 (39%), Gaps = 1/51 (1%)

Query: 72  ILLKIYNSYCRPITDYGSIVYQSAKPSAKGKLNSTHNNFI-RLCTGAFRSS 121
           I LK       P++D    +   A+ S   K++S  +  + +L   A    
Sbjct: 100 IALKALEELSIPVSDDDEDLLNVARTSLNSKISSRESELLGKLVVDAVLLI 150


>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
           unknown].
          Length = 869

 Score = 32.8 bits (74), Expect = 1.2
 Identities = 53/235 (22%), Positives = 89/235 (37%), Gaps = 30/235 (12%)

Query: 360 HPTNSKFFSYGNFNLEDHLSFINTNYLSKINPYQHHRSSTAVLPEPAWDKPRQLHIHGRH 419
           H TN +  S    ++ D L  I    L          SS         D P    I  R 
Sbjct: 228 HETNKESGSSETIDITDLLDSIPMEQLKVSLKPLVSESSK-------LDAPLAKSIQDRL 280

Query: 420 DPLASENQSHIAIAYKCAVMDNFKDVFVLNILGDLLLKGPNAPFYKNLVESGLGLSFSPV 479
           +  A+  Q+   +       + +K +   N   D L+   N         +GL  SF P 
Sbjct: 281 ERQAAYEQTKNDL-------EKWKPIVADNRKSDQLIFPMNETARPVPSNNGLASSFEPR 333

Query: 480 TGYEASIHDTLFTVGLQGVDS-NKFDEIIGAVNKTIDEVIAEG-----------FDKERV 527
           T  E  +H  L   GL+   +  K +E+  A+NK   E +AE            F +ER 
Sbjct: 334 TESERKMHQALLDAGLENESALKKQEEL--ALNKLSVEEVAERTRQLRFMRELMFREERK 391

Query: 528 AIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVAS-VLHSLE-LSLKHQSSN 580
           A  +  + S  + +I+    +    +I +  D E   S    +LE ++ +H++++
Sbjct: 392 AKRVAKIKSKTYRKIRKNRKEKEMALIPKSEDLENEKSEEARALERMTQRHKNTS 446


>gnl|CDD|187687 cd09629, DOMON_CIL1_like, DOMON-like domain of Brassica carinata
           CIL1 and similar proteins.  Brassica carinata CIL1 has
           been described as involved in suppression of axillary
           meristem development. It contains a single DOMON domain,
           the function of which is unclear. Members in this
           diverse family of plant proteins may have a cytochrome
           b561 domain C-terminal to the DOMON domain, some members
           from Arabidopsis have been characterized as
           auxin-responsive or auxin-induced proteins. DOMON
           domains were initially thought to confer protein-protein
           interactions. They were subsequently found as a
           heme-binding motif in cellobiose dehydrogenase, an
           extracellular fungal oxidoreductase that degrades both
           lignin and cellulose, and in ethylbenzene dehydrogenase,
           an enzyme that aids in the anaerobic degradation of
           hydrocarbons. The domain interacts with sugars in the
           type 9 carbohydrate binding modules (CBM9), which are
           present in a variety of glycosyl hydrolases, and it can
           also be found at the N-terminus of sensor histidine
           kinases.
          Length = 152

 Score = 31.1 bits (71), Expect = 1.6
 Identities = 13/30 (43%), Positives = 17/30 (56%)

Query: 173 TLAEYFHLSRDDSNNVFAVAFRTPPPDSTG 202
           TL    H + + SN+   VAFR  PP S+G
Sbjct: 9   TLGASLHWTYNASNSTLDVAFRATPPSSSG 38


>gnl|CDD|116688 pfam08083, PROCN, PROCN (NUC071) domain.  The PROCN domain is the
           central domain in pre-mRNA splicing factors of PRO8
           family.
          Length = 408

 Score = 31.6 bits (72), Expect = 2.5
 Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 5/81 (6%)

Query: 532 QGVDSNKFDEIKGAVNKTIDEVIAEGFDKERVASVLHSLELSLKHQSSNFGLNLLFWLVP 591
           Q V+S    E++ AV K I ++I EG  + +  ++L  L  + +   +N     + W VP
Sbjct: 282 QRVESYYDLELRAAVMKDILDMIPEGLRQNKARTILQHLSEAWRCWKAN-----VPWKVP 336

Query: 592 FMNHDCDVIHLLHINDRLNWF 612
            +      I L ++  + +W+
Sbjct: 337 GLPAPIKNIILRYVKAKADWW 357


>gnl|CDD|150004 pfam09167, DUF1942, Domain of unknown function (DUF1942).  Members
           of this family of bacterial proteins assume a
           beta-sandwich structure consisting of two antiparallel
           beta-sheets similar to an immunoglobulin-like fold, with
           an additional small, antiparallel beta-sheet. The
           longer-stranded beta-sheet is made up of four
           antiparallel beta-strands. The shorter-stranded
           beta-sheet consists of five beta-strands, four of these
           beta-strands form an antiparallel beta-sheet. The exact
           function of this family of proteins is unkown, though a
           putative role includes involvement in host-bacterial
           interactions involved in endocytosis or phagocytosis,
           possibly during bacterial internalisation.
          Length = 127

 Score = 30.1 bits (68), Expect = 2.7
 Identities = 10/31 (32%), Positives = 16/31 (51%)

Query: 939 PGDFTSQPGQTVHSFNVSGIQKVSHVLPFPV 969
           P +     G  V  + VS ++  S V+P+PV
Sbjct: 11  PAELVDAGGTVVQDWTVSDLKPSSDVIPYPV 41


>gnl|CDD|130386 TIGR01319, glmL_fam, conserved hypothetical protein.  This small
           family includes, so far, an uncharacterized protein from
           E. coli O157:H7 and GlmL from Clostridium tetanomorphum
           and Clostridium cochlearium. GlmL is located between the
           genes for the two subunits, epsilon (GlmE) and sigma
           (GlmS), of the coenzyme-B12-dependent glutamate mutase
           (methylaspartate mutase), the first enzyme in a pathway
           of glutamate fermentation. Members shows significant
           sequence similarity to the hydantoinase branch of the
           hydantoinase/oxoprolinase family (pfam01968).
          Length = 463

 Score = 30.8 bits (69), Expect = 5.4
 Identities = 21/71 (29%), Positives = 27/71 (38%), Gaps = 16/71 (22%)

Query: 431 AIAYKCAVMDNFKDVFVLNILG----------------DLLLKGPNAPFYKNLVESGLGL 474
           AIA      D   D  +++I G                D  L G   PF K  VE  LG+
Sbjct: 237 AIAEGTDKDDGIGDFILIDIGGATTDVHSAAAGELSKPDTTLHGLEEPFAKRTVEGDLGM 296

Query: 475 SFSPVTGYEAS 485
             S ++  EAS
Sbjct: 297 RVSAISLGEAS 307


>gnl|CDD|220425 pfam09822, ABC_transp_aux, ABC-type uncharacterized transport
           system.  This domain is found in various eukaryotic and
           prokaryotic intra-flagellar transport proteins involved
           in gliding motility, as well as in several hypothetical
           proteins.
          Length = 268

 Score = 30.0 bits (68), Expect = 6.0
 Identities = 29/158 (18%), Positives = 62/158 (39%), Gaps = 24/158 (15%)

Query: 621 TYLQEKVDEYLRNNPHKLIITMS-PEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNG 679
             ++E ++EY   +P K+ +    P++   E  +  EK  ++    Q+ +   + V +  
Sbjct: 46  PRVRELLEEYAALSPGKIKVEFIDPDEPPSEAEEAAEKYGIQPVPLQVRENKSSVVTIYF 105

Query: 680 TELRKEQEKEQNIDVLPTLKISDVDDHVERVVTTDKHILQVPIQLSTQPTNGVTYFRSVV 739
             + +  +K + I  L          ++E  +T    I +V     T+P     YF    
Sbjct: 106 GLVVEYGDKREVIPFLDN----VSLQNLEYELTR--AIQKV-----TRPKKPKVYF---- 150

Query: 740 DTSKLS--PELKPLVP--LFNYVINQMRTKNYDFREMD 773
               LS   EL         + +I+++  + YD   ++
Sbjct: 151 ----LSGHGELPDEGGGESLSTLIDELEKEGYDVEPLN 184


>gnl|CDD|176017 cd04052, C2B_Tricalbin-like, C2 domain second repeat present in
          Tricalbin-like proteins.  5 to 6 copies of the C2
          domain are present in Tricalbin, a yeast homolog of
          Synaptotagmin, which is involved in membrane
          trafficking and sorting.  C2 domains fold into an
          8-standed beta-sandwich that can adopt 2 structural
          arrangements: Type I and Type II, distinguished by a
          circular permutation involving their N- and C-terminal
          beta strands. Many C2 domains are Ca2+-dependent
          membrane-targeting modules that bind a wide variety of
          substances including bind phospholipids, inositol
          polyphosphates, and intracellular proteins.  Most C2
          domain proteins are either signal transduction enzymes
          that contain a single C2 domain, such as protein kinase
          C, or membrane trafficking proteins which contain at
          least two C2 domains, such as synaptotagmin 1.
          However, there are a few exceptions to this including
          RIM isoforms and some splice variants of
          piccolo/aczonin and intersectin which only have a
          single C2 domain.  C2 domains with a calcium binding
          region have negatively charged residues, primarily
          aspartates, that serve as ligands for calcium ions.
          This cd contains the second C2 repeat, C2B, and has a
          type-II topology.
          Length = 111

 Score = 28.7 bits (65), Expect = 7.0
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query: 46 YLNNKLKLSSNILKMLNNPSWG 67
          YLN KL  ++ + K  NNPSW 
Sbjct: 20 YLNGKLVYTTRVKKKTNNPSWN 41


>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
           Provisional.
          Length = 693

 Score = 30.0 bits (69), Expect = 8.5
 Identities = 16/72 (22%), Positives = 27/72 (37%), Gaps = 13/72 (18%)

Query: 504 DEIIGAVNKTIDEVIAEGF---DK-ERVAIGLQGVDSNKFDEIKGAVNKTIDEVIAEGFD 559
           +E+  AV +  +  + E +   DK ER             D IK  V + +     E  D
Sbjct: 239 EELKAAVKELAEAKLKEAYQITDKQER---------EAALDAIKEEVLEALAAEEEEEED 289

Query: 560 KERVASVLHSLE 571
           ++ +      LE
Sbjct: 290 EKEIKEAFKKLE 301


>gnl|CDD|165202 PHA02869, PHA02869, C4L/C10L-like gene family protein; Provisional.
          Length = 418

 Score = 30.0 bits (67), Expect = 8.8
 Identities = 53/226 (23%), Positives = 99/226 (43%), Gaps = 28/226 (12%)

Query: 622 YLQEKVDEYLRNNPHKLIIT-MSPEKTFDEKLDKVEKDILKDRISQMNDQDLNKVYVNGT 680
           YL +K++ Y R N  +L    M  E+  ++K  +    ++ DR  +       K + NG+
Sbjct: 207 YLGKKINLYDRENDLQLCYCDMVIERMTEDK--EYSLGMISDRSGR-----CIKSHHNGS 259

Query: 681 ELRKEQEKEQNIDVLPTLKISDVDDHVERVVTTD-KHILQVPIQLSTQ----PTNGVTYF 735
            +R  +E+  + D L    +++VD+    + T D KHI+   I   T     P + V Y 
Sbjct: 260 IVRYRKEEYGSFDALCIYNMNEVDE----IWTGDKKHIIWSTIDKKTGTSFIPIDPVLYE 315

Query: 736 RSVVDTSKLSPELKPLVPLFN----YVINQMRTKNYDFREMDQLIHMSTGGISFNSHLGE 791
           +    +SK   E K L    N    Y+   +    +D      LIH     I +++   +
Sbjct: 316 KLKAISSKEHKEYKDLRGFCNSRTEYICCSVSKYYFDLPTKTDLIHEVINSIDYDT---K 372

Query: 792 SCSTPNGFEEAILVSSHCLEH-NNDKMFDVLSELFNNVQLTDLNRF 836
           S  TP+ +   I V    L + + +++F+++     N+ L + N +
Sbjct: 373 SVGTPDWYTLPIEVKQTILGNMSYEELFNIVR---GNIALEEDNEY 415


>gnl|CDD|218956 pfam06248, Zw10, Centromere/kinetochore Zw10.  Zw10 and rough deal
           proteins are both required for correct metaphase
           check-pointing during mitosis. These proteins bind to
           the centromere/kinetochore.
          Length = 593

 Score = 29.7 bits (67), Expect = 9.9
 Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 5/79 (6%)

Query: 624 QEKVDEYLRNNPHKLIITMSPEKTFDEKLDKVEKDI---LKDRISQMNDQDLNKVYVNGT 680
           +E+V + +       +  MS  +   E++  + +DI   LK  +S    QDLN       
Sbjct: 27  KEEVRDMINKKYSDFLPNMSSAEDLLEQVSSLSEDISDLLKSEVSNEILQDLNDAVEELA 86

Query: 681 ELRKEQEKEQNIDVLPTLK 699
           EL+  ++ E+ I  L  L+
Sbjct: 87  ELK--RQLEEKILGLKILQ 103


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.318    0.135    0.397 

Gapped
Lambda     K      H
   0.267   0.0854    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 58,550,932
Number of extensions: 5846696
Number of successful extensions: 5361
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5327
Number of HSP's successfully gapped: 44
Length of query: 1147
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 1040
Effective length of database: 6,191,724
Effective search space: 6439392960
Effective search space used: 6439392960
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (28.6 bits)