Query psy10314
Match_columns 233
No_of_seqs 176 out of 1240
Neff 9.0
Searched_HMMs 46136
Date Fri Aug 16 23:19:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10314.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10314hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00216 VWD von Willebrand 100.0 5.5E-37 1.2E-41 237.8 17.4 152 7-166 1-162 (162)
2 PF00094 VWD: von Willebrand f 100.0 1.1E-28 2.4E-33 190.0 13.9 146 18-167 1-159 (159)
3 smart00832 C8 C8 domain. This 98.0 7.8E-07 1.7E-11 59.8 -1.2 29 205-233 3-33 (76)
4 KOG4338|consensus 98.0 1.2E-05 2.7E-10 78.9 5.8 160 16-187 1354-1531(1680)
5 PF08742 C8: C8 domain; Inter 97.5 1.3E-05 2.8E-10 53.5 -1.4 29 205-233 3-31 (74)
6 PF06535 RGM_N: Repulsive guid 97.3 0.00051 1.1E-08 52.1 5.1 61 15-81 99-160 (161)
7 PF07481 DUF1521: Domain of Un 92.5 2 4.4E-05 32.7 9.2 69 5-82 27-97 (171)
8 PF05375 Pacifastin_I: Pacifas 62.0 3.7 8E-05 23.6 0.6 15 2-16 17-32 (39)
9 smart00215 VWC_out von Willebr 57.5 5.2 0.00011 25.9 0.8 21 1-21 22-43 (67)
10 KOG1216|consensus 53.1 31 0.00068 31.8 5.6 46 122-167 400-446 (553)
11 cd06909 M14_ASPA Aspartoacylas 30.4 41 0.00089 28.4 2.3 19 22-40 240-258 (282)
12 PF08394 Arc_trans_TRASH: Arch 29.7 35 0.00077 19.3 1.2 18 20-37 8-25 (37)
13 PRK14111 F0F1 ATP synthase sub 28.0 22 0.00047 30.1 0.2 11 147-157 88-98 (290)
14 TIGR01146 ATPsyn_F1gamma ATP s 27.5 17 0.00038 30.6 -0.4 12 147-158 85-96 (287)
15 PRK05621 F0F1 ATP synthase sub 27.5 16 0.00035 30.7 -0.6 11 147-157 84-94 (284)
16 PRK13424 F0F1 ATP synthase sub 26.9 17 0.00037 30.7 -0.6 11 147-157 86-96 (291)
17 PRK14110 F0F1 ATP synthase sub 26.6 18 0.00039 30.6 -0.5 11 147-157 87-97 (291)
18 PHA03357 Alkaline exonuclease; 26.1 44 0.00095 22.0 1.3 37 142-184 8-44 (81)
19 PRK13422 F0F1 ATP synthase sub 25.5 18 0.00039 30.7 -0.7 12 147-158 85-96 (298)
20 COG4892 Predicted heme/steroid 25.1 48 0.001 21.7 1.4 16 23-38 17-32 (81)
21 PRK13425 F0F1 ATP synthase sub 24.8 21 0.00045 30.2 -0.4 11 147-157 86-96 (291)
22 PF06668 ITI_HC_C: Inter-alpha 24.3 3.6E+02 0.0078 21.3 7.4 24 133-156 110-133 (188)
23 PRK13427 F0F1 ATP synthase sub 23.8 28 0.00061 29.4 0.2 12 147-158 86-97 (289)
24 PF08091 Toxin_21: Spider inse 23.2 79 0.0017 17.8 1.8 14 6-19 22-37 (39)
25 PRK13426 F0F1 ATP synthase sub 23.0 27 0.00059 29.5 -0.0 11 147-157 85-95 (291)
26 PRK13423 F0F1 ATP synthase sub 22.7 29 0.00063 29.3 0.1 11 147-157 85-95 (288)
27 PF00231 ATP-synt: ATP synthas 22.0 22 0.00047 30.0 -0.8 11 147-157 85-95 (290)
28 KOG2875|consensus 20.5 26 0.00056 29.5 -0.7 22 16-38 141-163 (323)
No 1
>smart00216 VWD von Willebrand factor (vWF) type D domain. Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Probab=100.00 E-value=5.5e-37 Score=237.76 Aligned_cols=152 Identities=36% Similarity=0.776 Sum_probs=126.9
Q ss_pred eeeecCCCCeEEEEECCCeeeccCCCeeeeecCceEEEEEecCCCCCCceEEEEEEeecCCCCceeeeEEEEEeCCCccc
Q psy10314 7 WKCSKEECPGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNA 86 (233)
Q Consensus 7 w~C~~~~c~~~C~~~G~~h~~TFDg~~y~f~g~C~YvL~~~~~~~~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~~~ 86 (233)
|.|+..+|+++|.++|++||+||||+.|+|+|.|.|+|+++|.. .++|+|++++.+++.. .++.++|+|.+++
T Consensus 1 ~~c~~~~~~~~C~v~g~~~~~TFDg~~y~~~g~C~yvL~~~~~~--~~~f~V~~~~~~~~~~-~~~~~~v~v~~~~---- 73 (162)
T smart00216 1 WCCTQGECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSS--EPTFSVLLKNVPCGGG-ATCLKSVKVELNG---- 73 (162)
T ss_pred CccccCCCCCEEEEcCCCCeECcCCCEEeecCceEEEEEEECCC--CCCEEEEEEecCCCCC-ceEEEEEEEEECC----
Confidence 88999999999999999999999999999999999999999864 4699999999887653 6789999999984
Q ss_pred eEEEEeC--------Cc--ccCCCCCcceEEEEEcccEEEEEEccceEEEEEcCCeEEEEecCCCCCCCcccceeecCCC
Q psy10314 87 ESITLTR--------NK--PLPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDN 156 (233)
Q Consensus 87 ~~i~~~~--------~~--~lp~~~~~~~~~i~~~g~~~~v~~~~~gl~v~~d~~~~~~v~l~~~~~g~~~GLCGn~ng~ 156 (233)
..|.+.. |. .+|.......+.+++.+.++++. +..||+|.|||...++|.+|+.|+|+||||||||||+
T Consensus 74 ~~i~~~~~~~~v~vng~~v~~p~~~~~~~v~~~~~~~~~v~~-~~~gl~v~~dg~~~~~V~~~~~~~g~~~GLCGn~ng~ 152 (162)
T smart00216 74 DEIELKDDNGTVTVNGQQVSLPYKTSDGSIQIRSSGGYVVVI-TSLGLQVTFDGLTLLSVQLPSRYRGKTCGLCGNFDGE 152 (162)
T ss_pred EEEEEEeCCCEEEECCEEeeCCcCcCCceEEEEECceEEEEE-CCCCEEEEECCCcEEEEEECHHHCCCeeEccCCCCCC
Confidence 3444432 22 34554333346666666555554 6899999999887799999999999999999999999
Q ss_pred CCCCccCCCC
Q psy10314 157 ELDDFQSPSG 166 (233)
Q Consensus 157 ~~dD~~~p~g 166 (233)
+.|||++|+|
T Consensus 153 ~~dDf~~p~g 162 (162)
T smart00216 153 PEDDFRTPDG 162 (162)
T ss_pred ccccccCCCC
Confidence 9999999987
No 2
>PF00094 VWD: von Willebrand factor type D domain; InterPro: IPR001846 A family of growth regulators (originally called cef10, connective tissue growth factor, fisp-12, cyr61, or, alternatively, beta IG-M1 and beta IG-M2), all belong to immediate-early genes expressed after induction by growth factors or certain oncogenes. Sequence analysis of this family revealed the presence of four distinct modules. Each module has homologues in other extracellular mosaic proteins such as Von Willebrand factor, slit, thrombospondins, fibrillar collagens, IGF-binding proteins and mucins. Classification and analysis of these modules suggests the location of binding regions and, by analogy to better characterised modules in other proteins, sheds some light onto the structure of this new family []. The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. One of the functions of von Willebrand factor (vWF) is to serve as a carrier of clotting factor VIII (FVIII). The native conformation of the D' domain of vWF is not only required for factor VIII (FVIII) binding but also for normal multimerisation and optimal secretion. The interaction between blood clotting factor VIII and VWF is necessary for normal survival of blood clotting factor VIII in blood circulation. The VWFD domain is a highly structured region, in which the first conserved Cys has been found to form a disulphide bridge with the second conserved one [].
Probab=99.96 E-value=1.1e-28 Score=190.03 Aligned_cols=146 Identities=35% Similarity=0.730 Sum_probs=104.9
Q ss_pred EEEECCCeeeccCCCeeeeecCceEE--EEEecCCCCCCceEEEEEEeecCCCCceeeeEEEEEeCCCccceEEEEeCC-
Q psy10314 18 CTAWGDSHFQTFDAKNFDFQGACEYV--LVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNAESITLTRN- 94 (233)
Q Consensus 18 C~~~G~~h~~TFDg~~y~f~g~C~Yv--L~~~~~~~~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~i~~~~~- 94 (233)
|+++|++||+||||+.|.|++.|.|+ |++++.+.....|.|.+++.+.+....+|+++|.+.+.+ ..|.+..+
T Consensus 1 C~v~g~~~~~TFDg~~y~~~~~c~y~~vl~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~v~i~~~~----~~i~i~~~~ 76 (159)
T PF00094_consen 1 CSVYGDPHITTFDGKSYSFPGNCTYILVLAQDCSSDPKFSFSVENKNCPCGQSGTSCTRKVTIRLGN----HEIEIKPNN 76 (159)
T ss_pred CEEeCCCeEEcCCCCEEecCCCceEEEEeecccCccccccccceeeeeccCCCCcceeeEEEEEccc----ceeeEeccc
Confidence 89999999999999999999999999 999986511124455555555444456699999999884 34444333
Q ss_pred --------cc--cCCCCCcceEEEEEcccEEEEEEccceEEEEEcCCeEEEEecCCCCCCCcccceeecCCCCCCCccCC
Q psy10314 95 --------KP--LPNSGEYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWKNKVKGLCGNYNDNELDDFQSP 164 (233)
Q Consensus 95 --------~~--lp~~~~~~~~~i~~~g~~~~v~~~~~gl~v~~d~~~~~~v~l~~~~~g~~~GLCGn~ng~~~dD~~~p 164 (233)
.. +|+......+.+...+..+++.....+|++.|++...+.|.+|++|+|+|+||||+||+++.|||++|
T Consensus 77 ~~~v~vng~~~~~p~~~~~~~i~~~~~~~~~v~~~~~~~v~~~~~~~~~~~v~~~~~~~g~t~GLCG~~dg~~~dDf~~~ 156 (159)
T PF00094_consen 77 GDQVTVNGQPVSLPYWSSGGSIERYSGGFVVVVFSSGVRVQVNWDGNMSVYVSVPPWYKGKTCGLCGNFDGEPEDDFTTP 156 (159)
T ss_pred cceEEEeeecccCccccCcceEEeeccccEEEEEecCCeEEEEEecccccccccccccccccccccCCCCCCccceecCC
Confidence 22 23322222344443333333333456666678776789999999999999999999999999999999
Q ss_pred CCC
Q psy10314 165 SGG 167 (233)
Q Consensus 165 ~g~ 167 (233)
+|+
T Consensus 157 ~g~ 159 (159)
T PF00094_consen 157 DGC 159 (159)
T ss_pred CCC
Confidence 984
No 3
>smart00832 C8 C8 domain. This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Probab=97.99 E-value=7.8e-07 Score=59.75 Aligned_cols=29 Identities=45% Similarity=0.828 Sum_probs=25.5
Q ss_pred hhhccCccccCC--CCcCCcCCCCCcccccC
Q psy10314 205 WAVKQCNLLKSE--LFQPCHSEVPVDSFFER 233 (233)
Q Consensus 205 ~a~~~C~~l~~~--~F~~Ch~~v~p~~y~~~ 233 (233)
.+.+.|++|++. +|++||.+|||++||++
T Consensus 3 ~a~~~C~~l~~~~g~F~~Ch~~V~p~~f~~~ 33 (76)
T smart00832 3 YACSQCGILLSPRGPFAACHSVVDPEPFFEN 33 (76)
T ss_pred chhhhhHhhcCCCCChHHHhCcCChHHHHHH
Confidence 467899999865 99999999999999973
No 4
>KOG4338|consensus
Probab=97.95 E-value=1.2e-05 Score=78.86 Aligned_cols=160 Identities=25% Similarity=0.371 Sum_probs=105.6
Q ss_pred eEEEEECCCeeeccCCCeeee-ecCceEEEEEecCCC-CCCceEEEEEEeecCCCCceeeeEEEEEeCCCccceEEEEe-
Q psy10314 16 GICTAWGDSHFQTFDAKNFDF-QGACEYVLVKGANEE-EEEAFVVVIEVVPCGSTGVSCSKSVKISTGSKKNAESITLT- 92 (233)
Q Consensus 16 ~~C~~~G~~h~~TFDg~~y~f-~g~C~YvL~~~~~~~-~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~i~~~- 92 (233)
+.|.+ |-+...|||-..|.= .-.|.++.+++|..+ ..+.|.|++++...+... ..|.+-.+. .+|++.
T Consensus 1354 a~~~v-~ls~~~~Fd~~~y~n~~~~~~sl~a~~~~~ek~~~~~~vl~~~~~~~~~~----~~vk~vy~e----~ei~~~~ 1424 (1680)
T KOG4338|consen 1354 ALCKV-GLSRINTFDIVAYNNPLTPCYSLTAKDCSEEKATPRFAVLIKKINKNSEE----VAVKVVYGE----REIVVKK 1424 (1680)
T ss_pred EEEEe-ccccccccchhhhcCcccceEEEEeecccccccCceEEEEecCCCCCchh----hhheeeeee----eeeEEee
Confidence 67866 667899999999964 489999999999762 236899998876543211 112222211 122221
Q ss_pred ---------CCcccCCCC----CcceEEEEEcccEEEEEEccceEEEEEcCCeEEEEecCCCCC-CCcccceeecCCCCC
Q psy10314 93 ---------RNKPLPNSG----EYHKLSVREAGLFVFVELHDIGVTVQWDKGTRVSVTLSPKWK-NKVKGLCGNYNDNEL 158 (233)
Q Consensus 93 ---------~~~~lp~~~----~~~~~~i~~~g~~~~v~~~~~gl~v~~d~~~~~~v~l~~~~~-g~~~GLCGn~ng~~~ 158 (233)
++..+-... ......|...+...++- ...+.+|+||| ....+.+|..+. |+.||||||++++..
T Consensus 1425 ~~~~~~~~~~~~k~~~t~~~~~~~~q~~i~~~~es~~~~-~l~~gtv~~~g-~~~~~~~~~~~~~~q~~~~~~n~~~~~~ 1502 (1680)
T KOG4338|consen 1425 LHEKKMVKVDGKKIITTLNIIYELKQYNIEYLGESLIVF-DLPGGTVQFDG-YNARITLPSVRRKNQLCGLCGNNDDESE 1502 (1680)
T ss_pred cccceEEeecchhcccchHHHHHHHHHHHHhhccceeEe-ecCCceEEecc-cccccccHHHhhccchhhhhccCCcccc
Confidence 122111100 01123344444333332 35799999997 778888887766 999999999999999
Q ss_pred CCccCCCCCccccChhhhccceecc-CCCC
Q psy10314 159 DDFQSPSGGISEVSPVVFGDSWRVH-TYCP 187 (233)
Q Consensus 159 dD~~~p~g~~~~~~~~~f~~sW~~~-~~C~ 187 (233)
.+|.+.... ...+..+|..|+-+. ..|.
T Consensus 1503 ~~~~t~~~~-~~~~i~e~h~s~~l~n~~~~ 1531 (1680)
T KOG4338|consen 1503 TEFYTALNI-ETEDIEEFHRSYLLKNEECE 1531 (1680)
T ss_pred chhhhcccc-hhhhHHHHHHHHcccchhhh
Confidence 999999884 557889999999885 4564
No 5
>PF08742 C8: C8 domain; InterPro: IPR014853 The proteins in this entry contained a domain rich in positionally conserved cysteine residues. Most proteins contains 7 or 8 cysteine residues. The domain is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing IPR001846 from INTERPRO and IPR002919 from INTERPRO.
Probab=97.48 E-value=1.3e-05 Score=53.50 Aligned_cols=29 Identities=41% Similarity=0.740 Sum_probs=25.7
Q ss_pred hhhccCccccCCCCcCCcCCCCCcccccC
Q psy10314 205 WAVKQCNLLKSELFQPCHSEVPVDSFFER 233 (233)
Q Consensus 205 ~a~~~C~~l~~~~F~~Ch~~v~p~~y~~~ 233 (233)
.+...|++|.++.|++||.+|||++||++
T Consensus 3 ~a~~~C~~l~~~~F~~C~~~v~~~~f~~~ 31 (74)
T PF08742_consen 3 RAESYCGILLDPEFAPCHSVVDPDPFYEA 31 (74)
T ss_pred hHhHHhHHHcCchhhhhcccCccHHHHHH
Confidence 46789999988779999999999999863
No 6
>PF06535 RGM_N: Repulsive guidance molecule (RGM) N-terminus; InterPro: IPR010536 This entry represents the N-terminal region of several mammalian and one bird sequence from Gallus gallus (Chicken). All of the mammalian proteins are hypothetical and have no known function but Q8JG54 from SWISSPROT from the chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum [].
Probab=97.30 E-value=0.00051 Score=52.07 Aligned_cols=61 Identities=16% Similarity=0.307 Sum_probs=44.1
Q ss_pred CeEEEEECCCeeeccCCCeeeeecCceEEEEEecCCCCCCceEEEEEEeecC-CCCceeeeEEEEEeC
Q psy10314 15 PGICTAWGDSHFQTFDAKNFDFQGACEYVLVKGANEEEEEAFVVVIEVVPCG-STGVSCSKSVKISTG 81 (233)
Q Consensus 15 ~~~C~~~G~~h~~TFDg~~y~f~g~C~YvL~~~~~~~~~~~f~V~~~~~~~~-~~~~~~~~~v~v~~~ 81 (233)
-..|.++||||++||++...+=.-...+=|+. +..+.|++.+.+-. ...++.+.+|+|.+.
T Consensus 99 y~hCglFGDPHLRTF~~~fqTC~v~GAWPLId------N~yL~VQvTn~pv~~gs~aTattKvTVI~K 160 (161)
T PF06535_consen 99 YRHCGLFGDPHLRTFNDEFQTCKVEGAWPLID------NDYLSVQVTNSPVVPGSSATATTKVTVIFK 160 (161)
T ss_pred cceecccCChhHhhccCCceeeeeecceeeec------CCcEEEEeeCccccCCccceeeeeEEEEEc
Confidence 47899999999999999876544344455554 46789999988752 234666778887764
No 7
>PF07481 DUF1521: Domain of Unknown Function (DUF1521); InterPro: IPR011086 This domain of unknown function is found in a limited set of Bradyrhizobium proteins. There appears to be a periodic -DG- motif in the domain.
Probab=92.45 E-value=2 Score=32.67 Aligned_cols=69 Identities=26% Similarity=0.623 Sum_probs=49.2
Q ss_pred ceeeeecCCCCeEEEEECCCee-eccCCCe-eeeecCceEEEEEecCCCCCCceEEEEEEeecCCCCceeeeEEEEEeCC
Q psy10314 5 GHWKCSKEECPGICTAWGDSHF-QTFDAKN-FDFQGACEYVLVKGANEEEEEAFVVVIEVVPCGSTGVSCSKSVKISTGS 82 (233)
Q Consensus 5 g~w~C~~~~c~~~C~~~G~~h~-~TFDg~~-y~f~g~C~YvL~~~~~~~~~~~f~V~~~~~~~~~~~~~~~~~v~v~~~~ 82 (233)
+.|.=+...-..+=++|||||| ..=||+. |+|.+.-++.|-- -=.|.+...+.+. +++....++|..+.
T Consensus 27 s~~~itnk~tG~~TriwGDPHvd~~Gdg~~~fDFk~~~tf~L~D--------GTKItV~T~p~gn-g~T~askLtIt~Gd 97 (171)
T PF07481_consen 27 SEWTITNKQTGETTRIWGDPHVDADGDGKTDFDFKGDMTFQLDD--------GTKITVDTVPWGN-GMTYASKLTITNGD 97 (171)
T ss_pred ceEEEecCCCCcEeEEeCCCccccCCCcccceeecCceEEEeCC--------CCEEEEeeeecCC-CcEEeeeEEEEcCC
Confidence 4566555544556689999999 5666555 9999988888753 2257777777665 77778888888774
No 8
>PF05375 Pacifastin_I: Pacifastin inhibitor (LCMII); InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) []. Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=62.00 E-value=3.7 Score=23.55 Aligned_cols=15 Identities=47% Similarity=1.553 Sum_probs=11.4
Q ss_pred CC-CceeeeecCCCCe
Q psy10314 2 CS-QGHWKCSKEECPG 16 (233)
Q Consensus 2 C~-~g~w~C~~~~c~~ 16 (233)
|. +|+|.|+...|+.
T Consensus 17 C~~~G~~~CT~~~C~~ 32 (39)
T PF05375_consen 17 CSSDGKWACTRKACPP 32 (39)
T ss_dssp EBTTSSEEEE-SSSS-
T ss_pred CCCCCcCccCcccCcC
Confidence 66 8999999999864
No 9
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=57.50 E-value=5.2 Score=25.93 Aligned_cols=21 Identities=33% Similarity=0.662 Sum_probs=16.4
Q ss_pred CCCCceeeeecCCCCeE-EEEE
Q psy10314 1 TCSQGHWKCSKEECPGI-CTAW 21 (233)
Q Consensus 1 ~C~~g~w~C~~~~c~~~-C~~~ 21 (233)
+|.+|.|.|++..|+.. |.+.
T Consensus 22 tC~~G~v~Ct~~~C~~~~c~l~ 43 (67)
T smart00215 22 TCLNGRVSCTKVWCGPKPCLLH 43 (67)
T ss_pred EecCCCEEecCCcCCCchhhcc
Confidence 58899999999999644 5443
No 10
>KOG1216|consensus
Probab=53.12 E-value=31 Score=31.78 Aligned_cols=46 Identities=22% Similarity=0.427 Sum_probs=32.8
Q ss_pred ceEEEEEcCCeEEEEe-cCCCCCCCcccceeecCCCCCCCccCCCCC
Q psy10314 122 IGVTVQWDKGTRVSVT-LSPKWKNKVKGLCGNYNDNELDDFQSPSGG 167 (233)
Q Consensus 122 ~gl~v~~d~~~~~~v~-l~~~~~g~~~GLCGn~ng~~~dD~~~p~g~ 167 (233)
.++...|.....+.+. .......+.+++++++|++..|++...++.
T Consensus 400 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 446 (553)
T KOG1216|consen 400 VVLECSWPTDCVVELSRAAEESEIRLCGLCADGNGNTPDGFPLGDGN 446 (553)
T ss_pred eeccCCCCchheeccCcchhhccccccceeccCCcCCCCCeeccCCc
Confidence 3444555544444443 456677889999999999998888887775
No 11
>cd06909 M14_ASPA Aspartoacylase (ASPA) belongs to the Succinylglutamate desuccinylase/aspartoacylase subfamily of the M14 family of metallocarboxypeptidases. ASPA (also known as aminoacylase 2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD.
Probab=30.40 E-value=41 Score=28.35 Aligned_cols=19 Identities=37% Similarity=0.737 Sum_probs=16.9
Q ss_pred CCCeeeccCCCeeeeecCc
Q psy10314 22 GDSHFQTFDAKNFDFQGAC 40 (233)
Q Consensus 22 G~~h~~TFDg~~y~f~g~C 40 (233)
|||-|.||||+...|.|..
T Consensus 240 Gdp~f~~~~G~~i~~~g~~ 258 (282)
T cd06909 240 GDPLFLTFDGETIPYEGDE 258 (282)
T ss_pred CCceEecCCCCEEEecCCC
Confidence 9999999999999888753
No 12
>PF08394 Arc_trans_TRASH: Archaeal TRASH domain; InterPro: IPR013603 This region is found in the C terminus of a number of archaeal transcriptional regulators. It is thought to function as a metal-sensing regulatory module [].
Probab=29.69 E-value=35 Score=19.29 Aligned_cols=18 Identities=17% Similarity=0.198 Sum_probs=15.6
Q ss_pred EECCCeeeccCCCeeeee
Q psy10314 20 AWGDSHFQTFDAKNFDFQ 37 (233)
Q Consensus 20 ~~G~~h~~TFDg~~y~f~ 37 (233)
+.|.|+...++++.|.|=
T Consensus 8 I~~eP~~~k~~~~~y~fC 25 (37)
T PF08394_consen 8 ITGEPIVVKIGNKVYYFC 25 (37)
T ss_pred ccCCEEEEEECCeEEEEE
Confidence 678999999999999873
No 13
>PRK14111 F0F1 ATP synthase subunit gamma; Provisional
Probab=28.03 E-value=22 Score=30.10 Aligned_cols=11 Identities=45% Similarity=0.809 Sum_probs=9.0
Q ss_pred ccceeecCCCC
Q psy10314 147 KGLCGNYNDNE 157 (233)
Q Consensus 147 ~GLCGn~ng~~ 157 (233)
.||||.||.+-
T Consensus 88 rGLCG~fN~~l 98 (290)
T PRK14111 88 RGLAGSLNANV 98 (290)
T ss_pred CCccccccHHH
Confidence 49999999763
No 14
>TIGR01146 ATPsyn_F1gamma ATP synthase, F1 gamma subunit. This model describes the ATP synthase gamma subunit in bacteria and its equivalents in organelles, namely, mitochondria and chloroplast. F1/F0-ATP synthase is a multisubunit, membrane associated enzyme found in bacteria and organelles of higher eukaryotes, namely, mitochondria and chloroplast. This enzyme is principally involed in the synthesis of ATP from ADP and inorganic phosphate by coupling the energy derived from the proton electrochemical gradient across the biological membrane. A brief description of this multisubunit enzyme complex: F1 and F0 represent two major clusters of subunits. The gamma subunit is the part of F1 cluster. Surrounding the gamma subunit in a cylinder-like structure are three alpha and three subunits in an alternating fashion. This is the central catalytic unit whose different conformations permit the binding of ADP and inorganic phosphate and release of ATP.
Probab=27.53 E-value=17 Score=30.56 Aligned_cols=12 Identities=67% Similarity=1.381 Sum_probs=9.5
Q ss_pred ccceeecCCCCC
Q psy10314 147 KGLCGNYNDNEL 158 (233)
Q Consensus 147 ~GLCGn~ng~~~ 158 (233)
.||||.||.+--
T Consensus 85 rGLCG~fN~~v~ 96 (287)
T TIGR01146 85 RGLCGGYNSNVL 96 (287)
T ss_pred CCccccccHHHH
Confidence 499999997653
No 15
>PRK05621 F0F1 ATP synthase subunit gamma; Validated
Probab=27.46 E-value=16 Score=30.71 Aligned_cols=11 Identities=64% Similarity=1.362 Sum_probs=9.1
Q ss_pred ccceeecCCCC
Q psy10314 147 KGLCGNYNDNE 157 (233)
Q Consensus 147 ~GLCGn~ng~~ 157 (233)
.||||.||.+-
T Consensus 84 ~GLCG~fN~~v 94 (284)
T PRK05621 84 RGLCGGYNSNI 94 (284)
T ss_pred CcccchhhHHH
Confidence 49999999764
No 16
>PRK13424 F0F1 ATP synthase subunit gamma; Provisional
Probab=26.91 E-value=17 Score=30.71 Aligned_cols=11 Identities=55% Similarity=1.268 Sum_probs=9.1
Q ss_pred ccceeecCCCC
Q psy10314 147 KGLCGNYNDNE 157 (233)
Q Consensus 147 ~GLCGn~ng~~ 157 (233)
.||||.||.+-
T Consensus 86 rGLCG~fN~~v 96 (291)
T PRK13424 86 RGLCGSFNANL 96 (291)
T ss_pred CcccccchHHH
Confidence 49999999764
No 17
>PRK14110 F0F1 ATP synthase subunit gamma; Provisional
Probab=26.61 E-value=18 Score=30.57 Aligned_cols=11 Identities=45% Similarity=0.951 Sum_probs=9.2
Q ss_pred ccceeecCCCC
Q psy10314 147 KGLCGNYNDNE 157 (233)
Q Consensus 147 ~GLCGn~ng~~ 157 (233)
.||||.||.+-
T Consensus 87 rGLCG~fNs~v 97 (291)
T PRK14110 87 RGLAGAFNSNV 97 (291)
T ss_pred CCccccccHHH
Confidence 59999999764
No 18
>PHA03357 Alkaline exonuclease; Provisional
Probab=26.11 E-value=44 Score=21.99 Aligned_cols=37 Identities=16% Similarity=0.338 Sum_probs=23.3
Q ss_pred CCCCcccceeecCCCCCCCccCCCCCccccChhhhccceeccC
Q psy10314 142 WKNKVKGLCGNYNDNELDDFQSPSGGISEVSPVVFGDSWRVHT 184 (233)
Q Consensus 142 ~~g~~~GLCGn~ng~~~dD~~~p~g~~~~~~~~~f~~sW~~~~ 184 (233)
-.|-.||+|=.+| .+.+.+|.+++-++ ++-+.+.+++
T Consensus 8 g~gg~C~~CkR~N-----~LvT~sGetvaLda-d~FEdF~lDE 44 (81)
T PHA03357 8 GRGGICGLCKRYN-----ELVTCDGETLALNA-DCFEDFDFDE 44 (81)
T ss_pred cCCceechhcccC-----eEEecCCcEEEeCh-hhhhcccccc
Confidence 3567788887665 68999997544343 3445555543
No 19
>PRK13422 F0F1 ATP synthase subunit gamma; Provisional
Probab=25.51 E-value=18 Score=30.74 Aligned_cols=12 Identities=50% Similarity=0.966 Sum_probs=9.7
Q ss_pred ccceeecCCCCC
Q psy10314 147 KGLCGNYNDNEL 158 (233)
Q Consensus 147 ~GLCGn~ng~~~ 158 (233)
.||||-||.+--
T Consensus 85 rGLCG~fN~~v~ 96 (298)
T PRK13422 85 RGLCGGLNINLF 96 (298)
T ss_pred cccccchhHHHH
Confidence 499999997654
No 20
>COG4892 Predicted heme/steroid binding protein [General function prediction only]
Probab=25.09 E-value=48 Score=21.66 Aligned_cols=16 Identities=6% Similarity=0.208 Sum_probs=13.5
Q ss_pred CCeeeccCCCeeeeec
Q psy10314 23 DSHFQTFDAKNFDFQG 38 (233)
Q Consensus 23 ~~h~~TFDg~~y~f~g 38 (233)
.|.|+.|||+.|+..+
T Consensus 17 GpaYiA~~G~VYDvS~ 32 (81)
T COG4892 17 GPAYIAVNGTVYDVSL 32 (81)
T ss_pred CCeEEEECCEEEeecc
Confidence 3789999999999764
No 21
>PRK13425 F0F1 ATP synthase subunit gamma; Provisional
Probab=24.81 E-value=21 Score=30.25 Aligned_cols=11 Identities=55% Similarity=1.244 Sum_probs=9.2
Q ss_pred ccceeecCCCC
Q psy10314 147 KGLCGNYNDNE 157 (233)
Q Consensus 147 ~GLCGn~ng~~ 157 (233)
.||||.||.+-
T Consensus 86 rGLCG~fN~~v 96 (291)
T PRK13425 86 RGLCGAFNTNI 96 (291)
T ss_pred CcccccchHHH
Confidence 49999999763
No 22
>PF06668 ITI_HC_C: Inter-alpha-trypsin inhibitor heavy chain C-terminus; InterPro: IPR010600 This entry represents the C-terminal region of inter-alpha-trypsin inhibitor heavy chains. Inter-alpha-trypsin inhibitors are glycoproteins with a high inhibitory activity against trypsin, built up from different combinations of four polypeptides: bikunin and the three heavy chains that belong to this family (HC1, HC2, HC3). The heavy chains do not have any protease inhibitory properties but have the capacity to interact in vitro and in vivo with hyaluronic acid, which promotes the stability of the extra-cellular matrix. This domain is associated with the VWA domain IPR002035 from INTERPRO.; GO: 0004867 serine-type endopeptidase inhibitor activity, 0030212 hyaluronan metabolic process
Probab=24.25 E-value=3.6e+02 Score=21.26 Aligned_cols=24 Identities=21% Similarity=0.266 Sum_probs=17.9
Q ss_pred EEEEecCCCCCCCcccceeecCCC
Q psy10314 133 RVSVTLSPKWKNKVKGLCGNYNDN 156 (233)
Q Consensus 133 ~~~v~l~~~~~g~~~GLCGn~ng~ 156 (233)
.++|.=+..+..++-||.|-|-.+
T Consensus 110 GfYi~ds~~lS~~vhGLLGQF~~~ 133 (188)
T PF06668_consen 110 GFYILDSHGLSPSVHGLLGQFYHE 133 (188)
T ss_pred EEEecCCCCCCCcccccccCccCC
Confidence 345555567889999999998655
No 23
>PRK13427 F0F1 ATP synthase subunit gamma; Provisional
Probab=23.84 E-value=28 Score=29.39 Aligned_cols=12 Identities=50% Similarity=1.088 Sum_probs=9.6
Q ss_pred ccceeecCCCCC
Q psy10314 147 KGLCGNYNDNEL 158 (233)
Q Consensus 147 ~GLCGn~ng~~~ 158 (233)
.||||.||.+--
T Consensus 86 rGLcG~fN~~v~ 97 (289)
T PRK13427 86 RGLCGGFNSNLN 97 (289)
T ss_pred CCccccccHHHH
Confidence 499999997653
No 24
>PF08091 Toxin_21: Spider insecticidal peptide; InterPro: IPR012626 This family consists of insecticidal peptides isolated from venom of spiders of Aptostichus schlingeri (Trap-door spider) and Calisoga sp. Nine insecticidal peptides were isolated from the venom of the A. schlinger spider and seven of these toxins cause flaccid paralysis to insect larvae within 10 min of injection. However, all nine peptides were lethal within 24 hours [].; GO: 0009405 pathogenesis, 0005576 extracellular region
Probab=23.18 E-value=79 Score=17.79 Aligned_cols=14 Identities=29% Similarity=1.070 Sum_probs=10.0
Q ss_pred eeeee--cCCCCeEEE
Q psy10314 6 HWKCS--KEECPGICT 19 (233)
Q Consensus 6 ~w~C~--~~~c~~~C~ 19 (233)
-|+|. ..+|..+|.
T Consensus 22 fWtC~~~~~~CSk~C~ 37 (39)
T PF08091_consen 22 FWTCQIRGDGCSKECL 37 (39)
T ss_pred eEEEEEcCCCccceec
Confidence 57777 677777774
No 25
>PRK13426 F0F1 ATP synthase subunit gamma; Provisional
Probab=22.97 E-value=27 Score=29.50 Aligned_cols=11 Identities=45% Similarity=0.990 Sum_probs=9.0
Q ss_pred ccceeecCCCC
Q psy10314 147 KGLCGNYNDNE 157 (233)
Q Consensus 147 ~GLCGn~ng~~ 157 (233)
.||||.||.+-
T Consensus 85 rGLcG~fN~~v 95 (291)
T PRK13426 85 TSLCGGFNANI 95 (291)
T ss_pred CcccchhhHHH
Confidence 49999999654
No 26
>PRK13423 F0F1 ATP synthase subunit gamma; Provisional
Probab=22.71 E-value=29 Score=29.29 Aligned_cols=11 Identities=55% Similarity=1.277 Sum_probs=9.1
Q ss_pred ccceeecCCCC
Q psy10314 147 KGLCGNYNDNE 157 (233)
Q Consensus 147 ~GLCGn~ng~~ 157 (233)
.||||.||.+-
T Consensus 85 rGLCG~fN~~i 95 (288)
T PRK13423 85 RGLCGGFNSNL 95 (288)
T ss_pred CCCcchhhHHH
Confidence 49999999764
No 27
>PF00231 ATP-synt: ATP synthase This Pfam entry corresponds to chain g; InterPro: IPR000131 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. The ATPase F1 complex gamma subunit forms the central shaft that connects the F0 rotary motor to the F1 catalytic core. The gamma subunit functions as a rotary motor inside the cylinder formed by the alpha(3)beta(3) subunits in the F1 complex []. The best-conserved region of the gamma subunit is its C terminus, which seems to be essential for assembly and catalysis. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046933 hydrogen ion transporting ATP synthase activity, rotational mechanism, 0046961 proton-transporting ATPase activity, rotational mechanism, 0015986 ATP synthesis coupled proton transport, 0045261 proton-transporting ATP synthase complex, catalytic core F(1); PDB: 3OFN_G 3FKS_P 3OEE_Y 2HLD_Y 3OEH_Y 2XOK_G 3ZRY_G 3OE7_P 2WPD_G 3OAA_e ....
Probab=21.97 E-value=22 Score=29.98 Aligned_cols=11 Identities=55% Similarity=1.304 Sum_probs=8.8
Q ss_pred ccceeecCCCC
Q psy10314 147 KGLCGNYNDNE 157 (233)
Q Consensus 147 ~GLCGn~ng~~ 157 (233)
.||||.||.+-
T Consensus 85 rGLCG~fN~~v 95 (290)
T PF00231_consen 85 RGLCGGFNSNV 95 (290)
T ss_dssp TSSSTTHHHHH
T ss_pred ccccccccHHH
Confidence 48999999653
No 28
>KOG2875|consensus
Probab=20.50 E-value=26 Score=29.50 Aligned_cols=22 Identities=23% Similarity=0.676 Sum_probs=17.7
Q ss_pred eEEEEECCCeeeccCCCeee-eec
Q psy10314 16 GICTAWGDSHFQTFDAKNFD-FQG 38 (233)
Q Consensus 16 ~~C~~~G~~h~~TFDg~~y~-f~g 38 (233)
.-|..+| +++.+|||..|+ ||.
T Consensus 141 ~fc~~fG-~~i~~~dg~~~h~FPs 163 (323)
T KOG2875|consen 141 RFCQAFG-PRIIQLDGVDYHGFPS 163 (323)
T ss_pred HHHHhhC-cceEeecCcccccCcc
Confidence 3578888 699999999985 764
Done!