Query         psy10314
Match_columns 233
No_of_seqs    176 out of 1240
Neff          9.0 
Searched_HMMs 46136
Date          Fri Aug 16 23:19:29 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy10314.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10314hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 smart00216 VWD von Willebrand  100.0 5.5E-37 1.2E-41  237.8  17.4  152    7-166     1-162 (162)
  2 PF00094 VWD:  von Willebrand f 100.0 1.1E-28 2.4E-33  190.0  13.9  146   18-167     1-159 (159)
  3 smart00832 C8 C8 domain. This   98.0 7.8E-07 1.7E-11   59.8  -1.2   29  205-233     3-33  (76)
  4 KOG4338|consensus               98.0 1.2E-05 2.7E-10   78.9   5.8  160   16-187  1354-1531(1680)
  5 PF08742 C8:  C8 domain;  Inter  97.5 1.3E-05 2.8E-10   53.5  -1.4   29  205-233     3-31  (74)
  6 PF06535 RGM_N:  Repulsive guid  97.3 0.00051 1.1E-08   52.1   5.1   61   15-81     99-160 (161)
  7 PF07481 DUF1521:  Domain of Un  92.5       2 4.4E-05   32.7   9.2   69    5-82     27-97  (171)
  8 PF05375 Pacifastin_I:  Pacifas  62.0     3.7   8E-05   23.6   0.6   15    2-16     17-32  (39)
  9 smart00215 VWC_out von Willebr  57.5     5.2 0.00011   25.9   0.8   21    1-21     22-43  (67)
 10 KOG1216|consensus               53.1      31 0.00068   31.8   5.6   46  122-167   400-446 (553)
 11 cd06909 M14_ASPA Aspartoacylas  30.4      41 0.00089   28.4   2.3   19   22-40    240-258 (282)
 12 PF08394 Arc_trans_TRASH:  Arch  29.7      35 0.00077   19.3   1.2   18   20-37      8-25  (37)
 13 PRK14111 F0F1 ATP synthase sub  28.0      22 0.00047   30.1   0.2   11  147-157    88-98  (290)
 14 TIGR01146 ATPsyn_F1gamma ATP s  27.5      17 0.00038   30.6  -0.4   12  147-158    85-96  (287)
 15 PRK05621 F0F1 ATP synthase sub  27.5      16 0.00035   30.7  -0.6   11  147-157    84-94  (284)
 16 PRK13424 F0F1 ATP synthase sub  26.9      17 0.00037   30.7  -0.6   11  147-157    86-96  (291)
 17 PRK14110 F0F1 ATP synthase sub  26.6      18 0.00039   30.6  -0.5   11  147-157    87-97  (291)
 18 PHA03357 Alkaline exonuclease;  26.1      44 0.00095   22.0   1.3   37  142-184     8-44  (81)
 19 PRK13422 F0F1 ATP synthase sub  25.5      18 0.00039   30.7  -0.7   12  147-158    85-96  (298)
 20 COG4892 Predicted heme/steroid  25.1      48   0.001   21.7   1.4   16   23-38     17-32  (81)
 21 PRK13425 F0F1 ATP synthase sub  24.8      21 0.00045   30.2  -0.4   11  147-157    86-96  (291)
 22 PF06668 ITI_HC_C:  Inter-alpha  24.3 3.6E+02  0.0078   21.3   7.4   24  133-156   110-133 (188)
 23 PRK13427 F0F1 ATP synthase sub  23.8      28 0.00061   29.4   0.2   12  147-158    86-97  (289)
 24 PF08091 Toxin_21:  Spider inse  23.2      79  0.0017   17.8   1.8   14    6-19     22-37  (39)
 25 PRK13426 F0F1 ATP synthase sub  23.0      27 0.00059   29.5  -0.0   11  147-157    85-95  (291)
 26 PRK13423 F0F1 ATP synthase sub  22.7      29 0.00063   29.3   0.1   11  147-157    85-95  (288)
 27 PF00231 ATP-synt:  ATP synthas  22.0      22 0.00047   30.0  -0.8   11  147-157    85-95  (290)
 28 KOG2875|consensus               20.5      26 0.00056   29.5  -0.7   22   16-38    141-163 (323)

No 1  
>smart00216 VWD von Willebrand factor (vWF) type D domain. Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Probab=100.00  E-value=5.5e-37  Score=237.76  Aligned_cols=152  Identities=36%  Similarity=0.776  Sum_probs=126.9

Q ss_pred             eeeecCCCCeEEEEECCCeeeccCCCeeeeecCceEEEEEecCCCCCCceEEEEEEeecCCCCceeeeEEEEEeCCCccc
Q psy10314          7 WKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNA   86 (233)
Q Consensus         7 w~C~~~~c~~~C~~~G~~h~~TFDg~~y~f~g~C~YvL~~~~~~~~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~~~   86 (233)
                      |.|+..+|+++|.++|++||+||||+.|+|+|.|.|+|+++|..  .++|+|++++.+++.. .++.++|+|.+++    
T Consensus         1 ~~c~~~~~~~~C~v~g~~~~~TFDg~~y~~~g~C~yvL~~~~~~--~~~f~V~~~~~~~~~~-~~~~~~v~v~~~~----   73 (162)
T smart00216        1 WCCTQGECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSS--EPTFSVLLKNVPCGGG-ATCLKSVKVELNG----   73 (162)
T ss_pred             CccccCCCCCEEEEcCCCCeECcCCCEEeecCceEEEEEEECCC--CCCEEEEEEecCCCCC-ceEEEEEEEEECC----
Confidence            88999999999999999999999999999999999999999864  4699999999887653 6789999999984    


Q ss_pred             eEEEEeC--------Cc--ccCCCCCcceEEEEEcccEEEEEEccceEEEEEcCCeEEEEecCCCCCCCcccceeecCCC
Q psy10314         87 ESITLTR--------NK--PLPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDN  156 (233)
Q Consensus        87 ~~i~~~~--------~~--~lp~~~~~~~~~i~~~g~~~~v~~~~~gl~v~~d~~~~~~v~l~~~~~g~~~GLCGn~ng~  156 (233)
                      ..|.+..        |.  .+|.......+.+++.+.++++. +..||+|.|||...++|.+|+.|+|+||||||||||+
T Consensus        74 ~~i~~~~~~~~v~vng~~v~~p~~~~~~~v~~~~~~~~~v~~-~~~gl~v~~dg~~~~~V~~~~~~~g~~~GLCGn~ng~  152 (162)
T smart00216       74 DEIELKDDNGTVTVNGQQVSLPYKTSDGSIQIRSSGGYVVVI-TSLGLQVTFDGLTLLSVQLPSRYRGKTCGLCGNFDGE  152 (162)
T ss_pred             EEEEEEeCCCEEEECCEEeeCCcCcCCceEEEEECceEEEEE-CCCCEEEEECCCcEEEEEECHHHCCCeeEccCCCCCC
Confidence            3444432        22  34554333346666666555554 6899999999887799999999999999999999999


Q ss_pred             CCCCccCCCC
Q psy10314        157 ELDDFQSPSG  166 (233)
Q Consensus       157 ~~dD~~~p~g  166 (233)
                      +.|||++|+|
T Consensus       153 ~~dDf~~p~g  162 (162)
T smart00216      153 PEDDFRTPDG  162 (162)
T ss_pred             ccccccCCCC
Confidence            9999999987


No 2  
>PF00094 VWD:  von Willebrand factor type D domain;  InterPro: IPR001846 A family of growth regulators (originally called cef10, connective tissue growth factor, fisp-12, cyr61, or, alternatively, beta IG-M1 and beta IG-M2), all belong to immediate-early genes expressed after induction by growth factors or certain oncogenes. Sequence analysis of this family revealed the presence of four distinct modules. Each module has homologues in other extracellular mosaic proteins such as Von Willebrand factor, slit, thrombospondins, fibrillar collagens, IGF-binding proteins and mucins. Classification and analysis of these modules suggests the location of binding regions and, by analogy to better characterised modules in other proteins, sheds some light onto the structure of this new family []. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. One of the functions of von Willebrand factor (vWF) is to serve as a carrier of clotting factor VIII (FVIII). The native conformation of the D' domain of vWF is not only required for factor VIII (FVIII) binding but also for normal multimerisation and optimal secretion. The interaction between blood clotting factor VIII and VWF is necessary for normal survival of blood clotting factor VIII in blood circulation. The VWFD domain is a highly structured region, in which the first conserved Cys has been found to form a disulphide bridge with the second conserved one [].
Probab=99.96  E-value=1.1e-28  Score=190.03  Aligned_cols=146  Identities=35%  Similarity=0.730  Sum_probs=104.9

Q ss_pred             EEEECCCeeeccCCCeeeeecCceEE--EEEecCCCCCCceEEEEEEeecCCCCceeeeEEEEEeCCCccceEEEEeCC-
Q psy10314         18 CTAWGDSHFQTFDAKNFDFQGACEYV--LVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNAESITLTRN-   94 (233)
Q Consensus        18 C~~~G~~h~~TFDg~~y~f~g~C~Yv--L~~~~~~~~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~i~~~~~-   94 (233)
                      |+++|++||+||||+.|.|++.|.|+  |++++.+.....|.|.+++.+.+....+|+++|.+.+.+    ..|.+..+ 
T Consensus         1 C~v~g~~~~~TFDg~~y~~~~~c~y~~vl~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~v~i~~~~----~~i~i~~~~   76 (159)
T PF00094_consen    1 CSVYGDPHITTFDGKSYSFPGNCTYILVLAQDCSSDPKFSFSVENKNCPCGQSGTSCTRKVTIRLGN----HEIEIKPNN   76 (159)
T ss_pred             CEEeCCCeEEcCCCCEEecCCCceEEEEeecccCccccccccceeeeeccCCCCcceeeEEEEEccc----ceeeEeccc
Confidence            89999999999999999999999999  999986511124455555555444456699999999884    34444333 


Q ss_pred             --------cc--cCCCCCcceEEEEEcccEEEEEEccceEEEEEcCCeEEEEecCCCCCCCcccceeecCCCCCCCccCC
Q psy10314         95 --------KP--LPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSP  164 (233)
Q Consensus        95 --------~~--lp~~~~~~~~~i~~~g~~~~v~~~~~gl~v~~d~~~~~~v~l~~~~~g~~~GLCGn~ng~~~dD~~~p  164 (233)
                              ..  +|+......+.+...+..+++.....+|++.|++...+.|.+|++|+|+|+||||+||+++.|||++|
T Consensus        77 ~~~v~vng~~~~~p~~~~~~~i~~~~~~~~~v~~~~~~~v~~~~~~~~~~~v~~~~~~~g~t~GLCG~~dg~~~dDf~~~  156 (159)
T PF00094_consen   77 GDQVTVNGQPVSLPYWSSGGSIERYSGGFVVVVFSSGVRVQVNWDGNMSVYVSVPPWYKGKTCGLCGNFDGEPEDDFTTP  156 (159)
T ss_pred             cceEEEeeecccCccccCcceEEeeccccEEEEEecCCeEEEEEecccccccccccccccccccccCCCCCCccceecCC
Confidence                    22  23322222344443333333333456666678776789999999999999999999999999999999


Q ss_pred             CCC
Q psy10314        165 SGG  167 (233)
Q Consensus       165 ~g~  167 (233)
                      +|+
T Consensus       157 ~g~  159 (159)
T PF00094_consen  157 DGC  159 (159)
T ss_pred             CCC
Confidence            984


No 3  
>smart00832 C8 C8 domain. This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Probab=97.99  E-value=7.8e-07  Score=59.75  Aligned_cols=29  Identities=45%  Similarity=0.828  Sum_probs=25.5

Q ss_pred             hhhccCccccCC--CCcCCcCCCCCcccccC
Q psy10314        205 WAVKQCNLLKSE--LFQPCHSEVPVDSFFER  233 (233)
Q Consensus       205 ~a~~~C~~l~~~--~F~~Ch~~v~p~~y~~~  233 (233)
                      .+.+.|++|++.  +|++||.+|||++||++
T Consensus         3 ~a~~~C~~l~~~~g~F~~Ch~~V~p~~f~~~   33 (76)
T smart00832        3 YACSQCGILLSPRGPFAACHSVVDPEPFFEN   33 (76)
T ss_pred             chhhhhHhhcCCCCChHHHhCcCChHHHHHH
Confidence            467899999865  99999999999999973


No 4  
>KOG4338|consensus
Probab=97.95  E-value=1.2e-05  Score=78.86  Aligned_cols=160  Identities=25%  Similarity=0.371  Sum_probs=105.6

Q ss_pred             eEEEEECCCeeeccCCCeeee-ecCceEEEEEecCCC-CCCceEEEEEEeecCCCCceeeeEEEEEeCCCccceEEEEe-
Q psy10314         16 GICTAWGDSHFQTFDAKNFDF-QGACEYVLVKGANEE-EEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNAESITLT-   92 (233)
Q Consensus        16 ~~C~~~G~~h~~TFDg~~y~f-~g~C~YvL~~~~~~~-~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~i~~~-   92 (233)
                      +.|.+ |-+...|||-..|.= .-.|.++.+++|..+ ..+.|.|++++...+...    ..|.+-.+.    .+|++. 
T Consensus      1354 a~~~v-~ls~~~~Fd~~~y~n~~~~~~sl~a~~~~~ek~~~~~~vl~~~~~~~~~~----~~vk~vy~e----~ei~~~~ 1424 (1680)
T KOG4338|consen 1354 ALCKV-GLSRINTFDIVAYNNPLTPCYSLTAKDCSEEKATPRFAVLIKKINKNSEE----VAVKVVYGE----REIVVKK 1424 (1680)
T ss_pred             EEEEe-ccccccccchhhhcCcccceEEEEeecccccccCceEEEEecCCCCCchh----hhheeeeee----eeeEEee
Confidence            67866 667899999999964 489999999999762 236899998876543211    112222211    122221 


Q ss_pred             ---------CCcccCCCC----CcceEEEEEcccEEEEEEccceEEEEEcCCeEEEEecCCCCC-CCcccceeecCCCCC
Q psy10314         93 ---------RNKPLPNSG----EYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWK-NKVKGLCGNYNDNEL  158 (233)
Q Consensus        93 ---------~~~~lp~~~----~~~~~~i~~~g~~~~v~~~~~gl~v~~d~~~~~~v~l~~~~~-g~~~GLCGn~ng~~~  158 (233)
                               ++..+-...    ......|...+...++- ...+.+|+||| ....+.+|..+. |+.||||||++++..
T Consensus      1425 ~~~~~~~~~~~~k~~~t~~~~~~~~q~~i~~~~es~~~~-~l~~gtv~~~g-~~~~~~~~~~~~~~q~~~~~~n~~~~~~ 1502 (1680)
T KOG4338|consen 1425 LHEKKMVKVDGKKIITTLNIIYELKQYNIEYLGESLIVF-DLPGGTVQFDG-YNARITLPSVRRKNQLCGLCGNNDDESE 1502 (1680)
T ss_pred             cccceEEeecchhcccchHHHHHHHHHHHHhhccceeEe-ecCCceEEecc-cccccccHHHhhccchhhhhccCCcccc
Confidence                     122111100    01123344444333332 35799999997 778888887766 999999999999999


Q ss_pred             CCccCCCCCccccChhhhccceecc-CCCC
Q psy10314        159 DDFQSPSGGISEVSPVVFGDSWRVH-TYCP  187 (233)
Q Consensus       159 dD~~~p~g~~~~~~~~~f~~sW~~~-~~C~  187 (233)
                      .+|.+.... ...+..+|..|+-+. ..|.
T Consensus      1503 ~~~~t~~~~-~~~~i~e~h~s~~l~n~~~~ 1531 (1680)
T KOG4338|consen 1503 TEFYTALNI-ETEDIEEFHRSYLLKNEECE 1531 (1680)
T ss_pred             chhhhcccc-hhhhHHHHHHHHcccchhhh
Confidence            999999884 557889999999885 4564


No 5  
>PF08742 C8:  C8 domain;  InterPro: IPR014853 The proteins in this entry contained a domain rich in positionally conserved cysteine residues. Most proteins contains 7 or 8 cysteine residues. The domain is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing IPR001846 from INTERPRO and IPR002919 from INTERPRO. 
Probab=97.48  E-value=1.3e-05  Score=53.50  Aligned_cols=29  Identities=41%  Similarity=0.740  Sum_probs=25.7

Q ss_pred             hhhccCccccCCCCcCCcCCCCCcccccC
Q psy10314        205 WAVKQCNLLKSELFQPCHSEVPVDSFFER  233 (233)
Q Consensus       205 ~a~~~C~~l~~~~F~~Ch~~v~p~~y~~~  233 (233)
                      .+...|++|.++.|++||.+|||++||++
T Consensus         3 ~a~~~C~~l~~~~F~~C~~~v~~~~f~~~   31 (74)
T PF08742_consen    3 RAESYCGILLDPEFAPCHSVVDPDPFYEA   31 (74)
T ss_pred             hHhHHhHHHcCchhhhhcccCccHHHHHH
Confidence            46789999988779999999999999863


No 6  
>PF06535 RGM_N:  Repulsive guidance molecule (RGM) N-terminus;  InterPro: IPR010536 This entry represents the N-terminal region of several mammalian and one bird sequence from Gallus gallus (Chicken). All of the mammalian proteins are hypothetical and have no known function but Q8JG54 from SWISSPROT from the chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum [].
Probab=97.30  E-value=0.00051  Score=52.07  Aligned_cols=61  Identities=16%  Similarity=0.307  Sum_probs=44.1

Q ss_pred             CeEEEEECCCeeeccCCCeeeeecCceEEEEEecCCCCCCceEEEEEEeecC-CCCceeeeEEEEEeC
Q psy10314         15 PGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCG-STGVSCSKSVKISTG   81 (233)
Q Consensus        15 ~~~C~~~G~~h~~TFDg~~y~f~g~C~YvL~~~~~~~~~~~f~V~~~~~~~~-~~~~~~~~~v~v~~~   81 (233)
                      -..|.++||||++||++...+=.-...+=|+.      +..+.|++.+.+-. ...++.+.+|+|.+.
T Consensus        99 y~hCglFGDPHLRTF~~~fqTC~v~GAWPLId------N~yL~VQvTn~pv~~gs~aTattKvTVI~K  160 (161)
T PF06535_consen   99 YRHCGLFGDPHLRTFNDEFQTCKVEGAWPLID------NDYLSVQVTNSPVVPGSSATATTKVTVIFK  160 (161)
T ss_pred             cceecccCChhHhhccCCceeeeeecceeeec------CCcEEEEeeCccccCCccceeeeeEEEEEc
Confidence            47899999999999999876544344455554      46789999988752 234666778887764


No 7  
>PF07481 DUF1521:  Domain of Unknown Function (DUF1521);  InterPro: IPR011086 This domain of unknown function is found in a limited set of Bradyrhizobium proteins. There appears to be a periodic -DG- motif in the domain.
Probab=92.45  E-value=2  Score=32.67  Aligned_cols=69  Identities=26%  Similarity=0.623  Sum_probs=49.2

Q ss_pred             ceeeeecCCCCeEEEEECCCee-eccCCCe-eeeecCceEEEEEecCCCCCCceEEEEEEeecCCCCceeeeEEEEEeCC
Q psy10314          5 GHWKCSKEECPGICTAWGDSHF-QTFDAKN-FDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGS   82 (233)
Q Consensus         5 g~w~C~~~~c~~~C~~~G~~h~-~TFDg~~-y~f~g~C~YvL~~~~~~~~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~   82 (233)
                      +.|.=+...-..+=++|||||| ..=||+. |+|.+.-++.|--        -=.|.+...+.+. +++....++|..+.
T Consensus        27 s~~~itnk~tG~~TriwGDPHvd~~Gdg~~~fDFk~~~tf~L~D--------GTKItV~T~p~gn-g~T~askLtIt~Gd   97 (171)
T PF07481_consen   27 SEWTITNKQTGETTRIWGDPHVDADGDGKTDFDFKGDMTFQLDD--------GTKITVDTVPWGN-GMTYASKLTITNGD   97 (171)
T ss_pred             ceEEEecCCCCcEeEEeCCCccccCCCcccceeecCceEEEeCC--------CCEEEEeeeecCC-CcEEeeeEEEEcCC
Confidence            4566555544556689999999 5666555 9999988888753        2257777777665 77778888888774


No 8  
>PF05375 Pacifastin_I:  Pacifastin inhibitor (LCMII);  InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) [].  Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=62.00  E-value=3.7  Score=23.55  Aligned_cols=15  Identities=47%  Similarity=1.553  Sum_probs=11.4

Q ss_pred             CC-CceeeeecCCCCe
Q psy10314          2 CS-QGHWKCSKEECPG   16 (233)
Q Consensus         2 C~-~g~w~C~~~~c~~   16 (233)
                      |. +|+|.|+...|+.
T Consensus        17 C~~~G~~~CT~~~C~~   32 (39)
T PF05375_consen   17 CSSDGKWACTRKACPP   32 (39)
T ss_dssp             EBTTSSEEEE-SSSS-
T ss_pred             CCCCCcCccCcccCcC
Confidence            66 8999999999864


No 9  
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=57.50  E-value=5.2  Score=25.93  Aligned_cols=21  Identities=33%  Similarity=0.662  Sum_probs=16.4

Q ss_pred             CCCCceeeeecCCCCeE-EEEE
Q psy10314          1 TCSQGHWKCSKEECPGI-CTAW   21 (233)
Q Consensus         1 ~C~~g~w~C~~~~c~~~-C~~~   21 (233)
                      +|.+|.|.|++..|+.. |.+.
T Consensus        22 tC~~G~v~Ct~~~C~~~~c~l~   43 (67)
T smart00215       22 TCLNGRVSCTKVWCGPKPCLLH   43 (67)
T ss_pred             EecCCCEEecCCcCCCchhhcc
Confidence            58899999999999644 5443


No 10 
>KOG1216|consensus
Probab=53.12  E-value=31  Score=31.78  Aligned_cols=46  Identities=22%  Similarity=0.427  Sum_probs=32.8

Q ss_pred             ceEEEEEcCCeEEEEe-cCCCCCCCcccceeecCCCCCCCccCCCCC
Q psy10314        122 IGVTVQWDKGTRVSVT-LSPKWKNKVKGLCGNYNDNELDDFQSPSGG  167 (233)
Q Consensus       122 ~gl~v~~d~~~~~~v~-l~~~~~g~~~GLCGn~ng~~~dD~~~p~g~  167 (233)
                      .++...|.....+.+. .......+.+++++++|++..|++...++.
T Consensus       400 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~  446 (553)
T KOG1216|consen  400 VVLECSWPTDCVVELSRAAEESEIRLCGLCADGNGNTPDGFPLGDGN  446 (553)
T ss_pred             eeccCCCCchheeccCcchhhccccccceeccCCcCCCCCeeccCCc
Confidence            3444555544444443 456677889999999999998888887775


No 11 
>cd06909 M14_ASPA Aspartoacylase (ASPA) belongs to the Succinylglutamate desuccinylase/aspartoacylase subfamily of the M14 family of metallocarboxypeptidases. ASPA (also known as aminoacylase 2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD.
Probab=30.40  E-value=41  Score=28.35  Aligned_cols=19  Identities=37%  Similarity=0.737  Sum_probs=16.9

Q ss_pred             CCCeeeccCCCeeeeecCc
Q psy10314         22 GDSHFQTFDAKNFDFQGAC   40 (233)
Q Consensus        22 G~~h~~TFDg~~y~f~g~C   40 (233)
                      |||-|.||||+...|.|..
T Consensus       240 Gdp~f~~~~G~~i~~~g~~  258 (282)
T cd06909         240 GDPLFLTFDGETIPYEGDE  258 (282)
T ss_pred             CCceEecCCCCEEEecCCC
Confidence            9999999999999888753


No 12 
>PF08394 Arc_trans_TRASH:  Archaeal TRASH domain;  InterPro: IPR013603 This region is found in the C terminus of a number of archaeal transcriptional regulators. It is thought to function as a metal-sensing regulatory module []. 
Probab=29.69  E-value=35  Score=19.29  Aligned_cols=18  Identities=17%  Similarity=0.198  Sum_probs=15.6

Q ss_pred             EECCCeeeccCCCeeeee
Q psy10314         20 AWGDSHFQTFDAKNFDFQ   37 (233)
Q Consensus        20 ~~G~~h~~TFDg~~y~f~   37 (233)
                      +.|.|+...++++.|.|=
T Consensus         8 I~~eP~~~k~~~~~y~fC   25 (37)
T PF08394_consen    8 ITGEPIVVKIGNKVYYFC   25 (37)
T ss_pred             ccCCEEEEEECCeEEEEE
Confidence            678999999999999873


No 13 
>PRK14111 F0F1 ATP synthase subunit gamma; Provisional
Probab=28.03  E-value=22  Score=30.10  Aligned_cols=11  Identities=45%  Similarity=0.809  Sum_probs=9.0

Q ss_pred             ccceeecCCCC
Q psy10314        147 KGLCGNYNDNE  157 (233)
Q Consensus       147 ~GLCGn~ng~~  157 (233)
                      .||||.||.+-
T Consensus        88 rGLCG~fN~~l   98 (290)
T PRK14111         88 RGLAGSLNANV   98 (290)
T ss_pred             CCccccccHHH
Confidence            49999999763


No 14 
>TIGR01146 ATPsyn_F1gamma ATP synthase, F1 gamma subunit. This model describes the ATP synthase gamma subunit in bacteria and its equivalents in organelles, namely, mitochondria and chloroplast. F1/F0-ATP synthase is a multisubunit, membrane associated enzyme found in bacteria and organelles of higher eukaryotes, namely, mitochondria and chloroplast. This enzyme is principally involed in the synthesis of ATP from ADP and inorganic phosphate by coupling the energy derived from the proton electrochemical gradient across the biological membrane. A brief description of this multisubunit enzyme complex: F1 and F0 represent two major clusters of subunits. The gamma subunit is the part of F1 cluster. Surrounding the gamma subunit in a cylinder-like structure are three alpha and three subunits in an alternating fashion. This is the central catalytic unit whose different conformations permit the binding of ADP and inorganic phosphate and release of ATP.
Probab=27.53  E-value=17  Score=30.56  Aligned_cols=12  Identities=67%  Similarity=1.381  Sum_probs=9.5

Q ss_pred             ccceeecCCCCC
Q psy10314        147 KGLCGNYNDNEL  158 (233)
Q Consensus       147 ~GLCGn~ng~~~  158 (233)
                      .||||.||.+--
T Consensus        85 rGLCG~fN~~v~   96 (287)
T TIGR01146        85 RGLCGGYNSNVL   96 (287)
T ss_pred             CCccccccHHHH
Confidence            499999997653


No 15 
>PRK05621 F0F1 ATP synthase subunit gamma; Validated
Probab=27.46  E-value=16  Score=30.71  Aligned_cols=11  Identities=64%  Similarity=1.362  Sum_probs=9.1

Q ss_pred             ccceeecCCCC
Q psy10314        147 KGLCGNYNDNE  157 (233)
Q Consensus       147 ~GLCGn~ng~~  157 (233)
                      .||||.||.+-
T Consensus        84 ~GLCG~fN~~v   94 (284)
T PRK05621         84 RGLCGGYNSNI   94 (284)
T ss_pred             CcccchhhHHH
Confidence            49999999764


No 16 
>PRK13424 F0F1 ATP synthase subunit gamma; Provisional
Probab=26.91  E-value=17  Score=30.71  Aligned_cols=11  Identities=55%  Similarity=1.268  Sum_probs=9.1

Q ss_pred             ccceeecCCCC
Q psy10314        147 KGLCGNYNDNE  157 (233)
Q Consensus       147 ~GLCGn~ng~~  157 (233)
                      .||||.||.+-
T Consensus        86 rGLCG~fN~~v   96 (291)
T PRK13424         86 RGLCGSFNANL   96 (291)
T ss_pred             CcccccchHHH
Confidence            49999999764


No 17 
>PRK14110 F0F1 ATP synthase subunit gamma; Provisional
Probab=26.61  E-value=18  Score=30.57  Aligned_cols=11  Identities=45%  Similarity=0.951  Sum_probs=9.2

Q ss_pred             ccceeecCCCC
Q psy10314        147 KGLCGNYNDNE  157 (233)
Q Consensus       147 ~GLCGn~ng~~  157 (233)
                      .||||.||.+-
T Consensus        87 rGLCG~fNs~v   97 (291)
T PRK14110         87 RGLAGAFNSNV   97 (291)
T ss_pred             CCccccccHHH
Confidence            59999999764


No 18 
>PHA03357 Alkaline exonuclease; Provisional
Probab=26.11  E-value=44  Score=21.99  Aligned_cols=37  Identities=16%  Similarity=0.338  Sum_probs=23.3

Q ss_pred             CCCCcccceeecCCCCCCCccCCCCCccccChhhhccceeccC
Q psy10314        142 WKNKVKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWRVHT  184 (233)
Q Consensus       142 ~~g~~~GLCGn~ng~~~dD~~~p~g~~~~~~~~~f~~sW~~~~  184 (233)
                      -.|-.||+|=.+|     .+.+.+|.+++-++ ++-+.+.+++
T Consensus         8 g~gg~C~~CkR~N-----~LvT~sGetvaLda-d~FEdF~lDE   44 (81)
T PHA03357          8 GRGGICGLCKRYN-----ELVTCDGETLALNA-DCFEDFDFDE   44 (81)
T ss_pred             cCCceechhcccC-----eEEecCCcEEEeCh-hhhhcccccc
Confidence            3567788887665     68999997544343 3445555543


No 19 
>PRK13422 F0F1 ATP synthase subunit gamma; Provisional
Probab=25.51  E-value=18  Score=30.74  Aligned_cols=12  Identities=50%  Similarity=0.966  Sum_probs=9.7

Q ss_pred             ccceeecCCCCC
Q psy10314        147 KGLCGNYNDNEL  158 (233)
Q Consensus       147 ~GLCGn~ng~~~  158 (233)
                      .||||-||.+--
T Consensus        85 rGLCG~fN~~v~   96 (298)
T PRK13422         85 RGLCGGLNINLF   96 (298)
T ss_pred             cccccchhHHHH
Confidence            499999997654


No 20 
>COG4892 Predicted heme/steroid binding protein [General function prediction only]
Probab=25.09  E-value=48  Score=21.66  Aligned_cols=16  Identities=6%  Similarity=0.208  Sum_probs=13.5

Q ss_pred             CCeeeccCCCeeeeec
Q psy10314         23 DSHFQTFDAKNFDFQG   38 (233)
Q Consensus        23 ~~h~~TFDg~~y~f~g   38 (233)
                      .|.|+.|||+.|+..+
T Consensus        17 GpaYiA~~G~VYDvS~   32 (81)
T COG4892          17 GPAYIAVNGTVYDVSL   32 (81)
T ss_pred             CCeEEEECCEEEeecc
Confidence            3789999999999764


No 21 
>PRK13425 F0F1 ATP synthase subunit gamma; Provisional
Probab=24.81  E-value=21  Score=30.25  Aligned_cols=11  Identities=55%  Similarity=1.244  Sum_probs=9.2

Q ss_pred             ccceeecCCCC
Q psy10314        147 KGLCGNYNDNE  157 (233)
Q Consensus       147 ~GLCGn~ng~~  157 (233)
                      .||||.||.+-
T Consensus        86 rGLCG~fN~~v   96 (291)
T PRK13425         86 RGLCGAFNTNI   96 (291)
T ss_pred             CcccccchHHH
Confidence            49999999763


No 22 
>PF06668 ITI_HC_C:  Inter-alpha-trypsin inhibitor heavy chain C-terminus;  InterPro: IPR010600 This entry represents the C-terminal region of inter-alpha-trypsin inhibitor heavy chains. Inter-alpha-trypsin inhibitors are glycoproteins with a high inhibitory activity against trypsin, built up from different combinations of four polypeptides: bikunin and the three heavy chains that belong to this family (HC1, HC2, HC3). The heavy chains do not have any protease inhibitory properties but have the capacity to interact in vitro and in vivo with hyaluronic acid, which promotes the stability of the extra-cellular matrix. This domain is associated with the VWA domain IPR002035 from INTERPRO.; GO: 0004867 serine-type endopeptidase inhibitor activity, 0030212 hyaluronan metabolic process
Probab=24.25  E-value=3.6e+02  Score=21.26  Aligned_cols=24  Identities=21%  Similarity=0.266  Sum_probs=17.9

Q ss_pred             EEEEecCCCCCCCcccceeecCCC
Q psy10314        133 RVSVTLSPKWKNKVKGLCGNYNDN  156 (233)
Q Consensus       133 ~~~v~l~~~~~g~~~GLCGn~ng~  156 (233)
                      .++|.=+..+..++-||.|-|-.+
T Consensus       110 GfYi~ds~~lS~~vhGLLGQF~~~  133 (188)
T PF06668_consen  110 GFYILDSHGLSPSVHGLLGQFYHE  133 (188)
T ss_pred             EEEecCCCCCCCcccccccCccCC
Confidence            345555567889999999998655


No 23 
>PRK13427 F0F1 ATP synthase subunit gamma; Provisional
Probab=23.84  E-value=28  Score=29.39  Aligned_cols=12  Identities=50%  Similarity=1.088  Sum_probs=9.6

Q ss_pred             ccceeecCCCCC
Q psy10314        147 KGLCGNYNDNEL  158 (233)
Q Consensus       147 ~GLCGn~ng~~~  158 (233)
                      .||||.||.+--
T Consensus        86 rGLcG~fN~~v~   97 (289)
T PRK13427         86 RGLCGGFNSNLN   97 (289)
T ss_pred             CCccccccHHHH
Confidence            499999997653


No 24 
>PF08091 Toxin_21:  Spider insecticidal peptide;  InterPro: IPR012626 This family consists of insecticidal peptides isolated from venom of spiders of Aptostichus schlingeri (Trap-door spider) and Calisoga sp. Nine insecticidal peptides were isolated from the venom of the A. schlinger spider and seven of these toxins cause flaccid paralysis to insect larvae within 10 min of injection. However, all nine peptides were lethal within 24 hours [].; GO: 0009405 pathogenesis, 0005576 extracellular region
Probab=23.18  E-value=79  Score=17.79  Aligned_cols=14  Identities=29%  Similarity=1.070  Sum_probs=10.0

Q ss_pred             eeeee--cCCCCeEEE
Q psy10314          6 HWKCS--KEECPGICT   19 (233)
Q Consensus         6 ~w~C~--~~~c~~~C~   19 (233)
                      -|+|.  ..+|..+|.
T Consensus        22 fWtC~~~~~~CSk~C~   37 (39)
T PF08091_consen   22 FWTCQIRGDGCSKECL   37 (39)
T ss_pred             eEEEEEcCCCccceec
Confidence            57777  677777774


No 25 
>PRK13426 F0F1 ATP synthase subunit gamma; Provisional
Probab=22.97  E-value=27  Score=29.50  Aligned_cols=11  Identities=45%  Similarity=0.990  Sum_probs=9.0

Q ss_pred             ccceeecCCCC
Q psy10314        147 KGLCGNYNDNE  157 (233)
Q Consensus       147 ~GLCGn~ng~~  157 (233)
                      .||||.||.+-
T Consensus        85 rGLcG~fN~~v   95 (291)
T PRK13426         85 TSLCGGFNANI   95 (291)
T ss_pred             CcccchhhHHH
Confidence            49999999654


No 26 
>PRK13423 F0F1 ATP synthase subunit gamma; Provisional
Probab=22.71  E-value=29  Score=29.29  Aligned_cols=11  Identities=55%  Similarity=1.277  Sum_probs=9.1

Q ss_pred             ccceeecCCCC
Q psy10314        147 KGLCGNYNDNE  157 (233)
Q Consensus       147 ~GLCGn~ng~~  157 (233)
                      .||||.||.+-
T Consensus        85 rGLCG~fN~~i   95 (288)
T PRK13423         85 RGLCGGFNSNL   95 (288)
T ss_pred             CCCcchhhHHH
Confidence            49999999764


No 27 
>PF00231 ATP-synt:  ATP synthase This Pfam entry corresponds to chain g;  InterPro: IPR000131 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. The ATPase F1 complex gamma subunit forms the central shaft that connects the F0 rotary motor to the F1 catalytic core. The gamma subunit functions as a rotary motor inside the cylinder formed by the alpha(3)beta(3) subunits in the F1 complex []. The best-conserved region of the gamma subunit is its C terminus, which seems to be essential for assembly and catalysis. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046933 hydrogen ion transporting ATP synthase activity, rotational mechanism, 0046961 proton-transporting ATPase activity, rotational mechanism, 0015986 ATP synthesis coupled proton transport, 0045261 proton-transporting ATP synthase complex, catalytic core F(1); PDB: 3OFN_G 3FKS_P 3OEE_Y 2HLD_Y 3OEH_Y 2XOK_G 3ZRY_G 3OE7_P 2WPD_G 3OAA_e ....
Probab=21.97  E-value=22  Score=29.98  Aligned_cols=11  Identities=55%  Similarity=1.304  Sum_probs=8.8

Q ss_pred             ccceeecCCCC
Q psy10314        147 KGLCGNYNDNE  157 (233)
Q Consensus       147 ~GLCGn~ng~~  157 (233)
                      .||||.||.+-
T Consensus        85 rGLCG~fN~~v   95 (290)
T PF00231_consen   85 RGLCGGFNSNV   95 (290)
T ss_dssp             TSSSTTHHHHH
T ss_pred             ccccccccHHH
Confidence            48999999653


No 28 
>KOG2875|consensus
Probab=20.50  E-value=26  Score=29.50  Aligned_cols=22  Identities=23%  Similarity=0.676  Sum_probs=17.7

Q ss_pred             eEEEEECCCeeeccCCCeee-eec
Q psy10314         16 GICTAWGDSHFQTFDAKNFD-FQG   38 (233)
Q Consensus        16 ~~C~~~G~~h~~TFDg~~y~-f~g   38 (233)
                      .-|..+| +++.+|||..|+ ||.
T Consensus       141 ~fc~~fG-~~i~~~dg~~~h~FPs  163 (323)
T KOG2875|consen  141 RFCQAFG-PRIIQLDGVDYHGFPS  163 (323)
T ss_pred             HHHHhhC-cceEeecCcccccCcc
Confidence            3578888 699999999985 764


Done!