Query psy2655
Match_columns 84
No_of_seqs 184 out of 1030
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 21:51:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2655.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2655hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG4826 Serine protease inhibi 99.7 3.6E-18 7.8E-23 115.2 4.3 77 7-83 113-190 (410)
2 KOG2392|consensus 99.7 3.6E-16 7.8E-21 108.2 7.7 76 8-83 84-160 (380)
3 cd02054 angiotensinogen Angiot 99.6 8.3E-16 1.8E-20 106.0 8.5 67 12-78 92-159 (372)
4 cd02043 plant_SERPIN SERine Pr 99.6 2.1E-16 4.5E-21 109.0 5.5 72 12-83 81-153 (381)
5 cd00172 SERPIN SERine Proteina 99.6 3.2E-16 6.9E-21 107.2 6.4 72 12-83 78-149 (364)
6 cd02048 neuroserpin Neuroserpi 99.6 2.1E-16 4.5E-21 109.3 5.2 73 11-83 79-151 (388)
7 cd02044 ov-serpin ovalbumin fa 99.6 6.2E-16 1.4E-20 106.2 7.3 73 11-83 78-151 (370)
8 cd02059 ovalbumin_like The ova 99.6 8E-16 1.7E-20 106.4 7.6 73 11-83 95-168 (389)
9 cd02046 hsp47 Heat shock prote 99.6 6E-16 1.3E-20 106.5 6.7 67 12-78 84-150 (366)
10 cd02051 PAI-1_nexin-1 Plasmino 99.6 3.1E-16 6.8E-21 108.1 4.8 69 12-80 84-152 (377)
11 cd02057 maspin_like Maspin (ma 99.6 2.9E-16 6.3E-21 108.1 4.6 72 12-83 79-151 (372)
12 PF00079 Serpin: Serpin (serin 99.6 2E-15 4.3E-20 103.0 8.4 74 10-83 78-151 (371)
13 cd02045 antithrombin-III_like 99.6 3.9E-16 8.3E-21 107.9 4.5 73 11-83 89-162 (381)
14 cd02058 PAI-2 Plasminogen Acti 99.6 8.7E-16 1.9E-20 105.8 6.1 73 11-83 82-155 (380)
15 smart00093 SERPIN SERine Prot 99.6 1.3E-15 2.9E-20 104.1 6.6 69 11-79 73-142 (359)
16 cd02056 alpha-1-antitrypsin_li 99.6 3.4E-15 7.4E-20 102.2 6.1 67 12-78 80-146 (361)
17 cd02049 bacterial_SERPIN SERin 99.6 3.3E-15 7.1E-20 102.5 5.7 68 11-79 80-147 (364)
18 cd02047 HCII Heparin cofactor 99.5 3.4E-14 7.3E-19 99.9 7.2 67 11-78 152-218 (436)
19 cd02055 PZI Protein Z-dependen 99.5 1.7E-14 3.6E-19 99.3 2.9 63 16-78 85-149 (365)
20 cd02050 C1_inh C1 inhibitor (C 99.4 8.5E-14 1.8E-18 95.3 2.2 63 15-78 74-136 (352)
21 cd02052 PEDF Pigment epitheliu 99.4 1.4E-13 3E-18 95.1 3.2 64 14-79 95-158 (374)
22 cd02053 alpha2AP Alpha2-antipl 99.4 3.4E-13 7.4E-18 92.3 4.5 65 12-78 75-140 (351)
23 PHA02660 serpin-like protein; 99.3 2.2E-12 4.8E-17 89.0 2.1 61 15-78 72-133 (364)
24 PHA02948 serine protease inhib 99.1 5.8E-11 1.3E-15 82.2 2.7 61 16-83 100-160 (373)
25 PF07037 DUF1323: Putative tra 73.3 2.4 5.2E-05 25.5 1.4 12 57-68 13-24 (122)
26 PF10957 DUF2758: Protein of u 65.5 7 0.00015 20.6 2.1 30 46-75 4-33 (60)
27 PF13092 CENP-L: Kinetochore c 55.1 17 0.00037 22.7 2.8 45 29-74 16-60 (162)
28 KOG2729|consensus 52.4 11 0.00023 23.2 1.5 19 48-66 35-53 (137)
29 PF03311 Cornichon: Cornichon 50.7 15 0.00033 22.0 2.0 19 48-66 34-52 (128)
30 TIGR01617 arsC_related transcr 48.1 53 0.0011 18.9 4.7 56 22-79 3-58 (117)
31 PLN00174 predicted protein; Pr 47.4 14 0.00031 23.2 1.6 19 48-66 40-58 (160)
32 PRK01655 spxA transcriptional 46.8 56 0.0012 19.4 4.1 55 22-78 4-58 (131)
33 TIGR03044 PS_II_psb27 photosys 46.1 21 0.00046 21.9 2.1 15 55-69 66-80 (135)
34 PLN00064 photosystem II protei 44.9 22 0.00048 22.5 2.1 14 56-69 96-109 (166)
35 PF14409 Herpeto_peptide: Ribo 42.7 29 0.00062 18.0 2.0 18 51-68 17-36 (58)
36 cd03036 ArsC_like Arsenate Red 41.0 61 0.0013 18.5 3.6 26 43-68 23-48 (111)
37 PRK13344 spxA transcriptional 40.5 51 0.0011 19.7 3.2 55 22-78 4-58 (132)
38 cd03034 ArsC_ArsC Arsenate Red 40.3 42 0.00092 19.3 2.8 36 43-79 23-58 (112)
39 PF09373 PMBR: Pseudomurein-bi 35.0 35 0.00075 15.4 1.5 21 55-77 11-31 (33)
40 PF06806 DUF1233: Putative exc 35.0 32 0.00069 18.8 1.6 11 58-68 59-69 (72)
41 PF03960 ArsC: ArsC family; I 34.7 90 0.0019 17.7 4.7 38 42-80 19-56 (110)
42 PF13326 PSII_Pbs27: Photosyst 31.5 25 0.00055 21.6 0.9 14 55-68 78-91 (145)
43 cd03032 ArsC_Spx Arsenate Redu 29.9 1.1E+02 0.0025 17.4 3.8 56 22-79 4-59 (115)
44 PF06531 DUF1108: Protein of u 26.5 46 0.001 18.7 1.3 13 58-70 62-74 (86)
45 PF06891 P2_Phage_GpR: P2 phag 26.0 39 0.00084 20.4 1.0 56 15-70 26-82 (135)
46 PF10750 DUF2536: Protein of u 25.4 80 0.0017 17.0 2.1 23 45-67 11-33 (68)
47 COG1393 ArsC Arsenate reductas 24.1 79 0.0017 18.6 2.1 36 42-78 24-59 (117)
48 TIGR02055 APS_reductase thiore 24.0 1.2E+02 0.0026 19.1 3.1 31 20-50 20-52 (191)
49 PRK12559 transcriptional regul 23.7 1.2E+02 0.0026 18.1 2.9 55 22-78 4-58 (131)
50 cd02977 ArsC_family Arsenate R 22.8 1.3E+02 0.0028 16.7 2.8 35 43-78 23-57 (105)
51 PF06294 DUF1042: Domain of Un 22.3 81 0.0018 19.6 2.0 31 35-65 28-58 (158)
52 COG5144 TFB2 RNA polymerase II 22.3 1.3E+02 0.0029 21.5 3.1 57 18-74 82-155 (447)
53 cd03035 ArsC_Yffb Arsenate Red 21.4 1.4E+02 0.0031 16.9 2.8 33 43-78 23-55 (105)
54 PF04465 DUF499: Protein of un 20.9 1.9E+02 0.0041 23.7 4.1 56 25-84 245-301 (1035)
55 PF08876 DUF1836: Domain of un 20.1 89 0.0019 18.1 1.7 20 54-76 33-52 (105)
No 1
>COG4826 Serine protease inhibitor [Posttranslational modification, protein turnover, chaperones]
Probab=99.72 E-value=3.6e-18 Score=115.24 Aligned_cols=77 Identities=31% Similarity=0.540 Sum_probs=70.8
Q ss_pred CCCCCCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCC-CcHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 7 SNYEISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFS-QPPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 7 ~~~~~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~-~~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
-+-++.++.|..||++|+++++++++.|+..++++|+++|..+||. +|..++..||+||+++|+|+|++|++.+.|.
T Consensus 113 i~sp~~~y~le~AnalWvqedyplke~yv~n~r~yy~aevtnlDF~~~p~~s~dtINkWvEekT~gkIkdlVp~~~i~ 190 (410)
T COG4826 113 INSPNDSYELETANALWVQEDYPLKENYVNNVRNYYDAEVTNLDFVNKPDASRDTINKWVEEKTNGKIKDLVPEDYIG 190 (410)
T ss_pred ccCCccchhhhhhhhHHHhcCCCCCHHHHHHHHhhhhcceeecccccCchHHHHHHHHHHHHhhcchHhhcCCccccC
Confidence 3456778999999999999999999999999999999999999997 5788999999999999999999999987653
No 2
>KOG2392|consensus
Probab=99.66 E-value=3.6e-16 Score=108.25 Aligned_cols=76 Identities=37% Similarity=0.609 Sum_probs=71.3
Q ss_pred CCCCCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCC-CcHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 8 NYEISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFS-QPPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 8 ~~~~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~-~~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
.....++.+.++|++|+++++.+.++|.+.++++|+++++.+||. +++.+++.||.||+++|+|+|+++++++.++
T Consensus 84 ~~~~~~~~l~~an~l~~~~~~~~~~~f~~~~~~~y~a~~~~~dF~~~~e~~~~~IN~wv~~~T~gkI~~ll~~~~~~ 160 (380)
T KOG2392|consen 84 NSTESGYELKLANRLFVSKSLKLLQKFLELVKKLYKAEAESLDFSNDTEEAAKEINEWVEEHTNGKIKDLLSPDSID 160 (380)
T ss_pred ccccCCceEEEeeeheecCCccccHHHHHHHHHHHhCcceeecCccChHHHHHHHHHHHHHHcCCchhhhcCCCCCC
Confidence 456677899999999999999999999999999999999999999 6889999999999999999999999998875
No 3
>cd02054 angiotensinogen Angiotensinogen is part of the renin-angiotensin system (RAS), which plays an important role in blood pressure regulation, renal haemodynamics, fluid and electrolyte homeostasis. It is also involved in normal and abnormal growth processes. The growth promoting actions of angiotensin have been shown in a variety of cells and tissues. This subgroup represents clade A8 of the serpin superfamily. In general, SERine Proteinase INhibitors (serpins) exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors which regulate blood coagulation cascades. Non-inhibitory serpins perform many diverse functions such as chaperoning proteins or transporting hormones.
Probab=99.65 E-value=8.3e-16 Score=106.02 Aligned_cols=67 Identities=22% Similarity=0.349 Sum_probs=63.0
Q ss_pred CCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCc-ceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 12 SSPELKIANKIYFAKDIELNPAYQTQAVDNFNSE-LGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 12 ~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~-v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
....+..+|++|++++++++++|.+.++++|+++ ++.+||.++..++..||+||+++|+|+|++++.
T Consensus 92 ~~~~l~~a~~i~~~~~~~l~~~f~~~~~~~y~a~~~~~vdF~~~~~a~~~IN~WV~~~T~g~I~~ll~ 159 (372)
T cd02054 92 RQLLLSTVVWTFTAPGIHLSQPFVQGLADFSDASFPRSVDFTEPDVAEEKINNFVQATSDGKVKSSLK 159 (372)
T ss_pred CccEEEEEEEEEecCCCCccHHHHHHHHHHhCcccceeecCCChHHHHHHHHHHHHHHhcCchhhhcc
Confidence 4568999999999999999999999999999997 999999998888999999999999999999985
No 4
>cd02043 plant_SERPIN SERine Proteinase INhibitors (serpins), plant specific subgroup. It has been suggested that plant serpins play a role in defense against insect predators. This subgroup corresponds to clade P of the serpin superfamily. In general, serpins exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors which regulate blood coagulation cascades. Non-inhibitory serpins perform many diverse functions such as chaperoning proteins or transporting hormones.
Probab=99.65 E-value=2.1e-16 Score=109.01 Aligned_cols=72 Identities=31% Similarity=0.647 Sum_probs=66.4
Q ss_pred CCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCC-cHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 12 SSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQ-PPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 12 ~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~-~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
....|.++|+||++++++++++|.+.++++|+++++.+||.+ ++.++..||+||+++|+|+|+++++++.++
T Consensus 81 ~~~~l~~an~l~~~~~~~~~~~f~~~~~~~y~~~~~~vdF~~~~~~a~~~IN~wv~~~T~g~I~~~~~~~~i~ 153 (381)
T cd02043 81 GGPRLSFANGVWVDKSLSLKPSFKDLAANSYKAEARPVDFRTKAEEVRREVNSWVEKATNGLIKDILPPGSVD 153 (381)
T ss_pred CCcEEehhhhHHHhCCCCcCHHHHHHHHhhhhccceeecCCCCHHHHHHHHHHHHHHHhcChHHhhCCCCCcC
Confidence 456899999999999999999999999999999999999975 678999999999999999999999877654
No 5
>cd00172 SERPIN SERine Proteinase INhibitors (serpins) exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors which regulate blood coagulation cascades. Non-inhibitory serpins perform many diverse functions such as chaperoning proteins or transporting hormones. Serpins are of medical interest because mutants have been associated with blood clotting disorders, emphysema, cirrhosis, and dementia.
Probab=99.65 E-value=3.2e-16 Score=107.16 Aligned_cols=72 Identities=36% Similarity=0.669 Sum_probs=67.1
Q ss_pred CCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 12 SSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 12 ~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
++..+.++|++|++++++++++|.+.++++|++++..+||.++..++..||.||+++|+|+|+++++++.++
T Consensus 78 ~~~~l~~an~~~~~~~~~~~~~f~~~~~~~~~~~~~~~dF~~~~~a~~~IN~wv~~~T~g~I~~~~~~~~i~ 149 (364)
T cd00172 78 KGVELKLANRLFVQKGLTVKEDFLDLAKKYYDAEVESVDFANPEAAAAQINNWVEEKTNGKIKDLLSPDALD 149 (364)
T ss_pred CCceeeeehhhhhhCCCchhHHHHHHHHHHhcCeeEEEcCCChHHHHHHHHHHHHhhhcccchhhCCCCCCC
Confidence 357899999999999999999999999999999999999998888999999999999999999999886553
No 6
>cd02048 neuroserpin Neuroserpin is a inhibitory member of the SERine Proteinase INhibitor (serpin) family that reacts preferentially with tissue-type plasminogen activator (tPA). It is located in neurons in regions of the brain where tPA is also found, suggesting that neuroserpin is the selective inhibitor of tPA in the central nervous system (CNS). This subgroup corresponds to clade I of the serpin superfamily.
Probab=99.64 E-value=2.1e-16 Score=109.30 Aligned_cols=73 Identities=33% Similarity=0.571 Sum_probs=67.7
Q ss_pred CCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 11 ISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 11 ~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
..++.|.++|++|++++++++++|.+.++++|+++++.+||.++..++..||.||+++|+|+|+++++++.++
T Consensus 79 ~~~~~l~~an~i~v~~~~~l~~~f~~~~~~~f~~~~~~vdF~~~~~~~~~IN~wv~~~T~g~I~~l~~~~~l~ 151 (388)
T cd02048 79 EKEYVFNLANSLYLQNGFHVKEKFLQSNKKYFNAAVKLVDFSQVKAVAEHINKWVENHTNNKIKDMFSSRDFT 151 (388)
T ss_pred CCccchHHhhhhHhhcCCCccHHHHHHHHHHhcCeeEEECCCChHHHHHHHHHHHHHhccCchhhcCCCccCC
Confidence 4567899999999999999999999999999999999999999888999999999999999999999876653
No 7
>cd02044 ov-serpin ovalbumin family of serpins (ov-serpins). Family of closely related proteins, whose members can be secreted (ovalbumin), cytosolic (leukocyte elastase inhibitor, LEI), or targeted to both compartments (plasminogen activator inhibitor 2, PAI-2). This subgroup corresponds to clade B of the serpin superfamily. In general, serpins exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors which regulate blood coagulation cascades. Non-inhibitory serpins perform many diverse functions such as chaperoning proteins or transporting hormones. Serpins are of medical interest because mutants can cause blood clotting disorders, emphysema, cirrhosis, and dementia.
Probab=99.64 E-value=6.2e-16 Score=106.19 Aligned_cols=73 Identities=25% Similarity=0.495 Sum_probs=66.6
Q ss_pred CCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCC-cHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 11 ISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQ-PPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 11 ~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~-~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
+.++.|.++|++|++++++++++|.+.++++|++++..+||.+ +..++..||.||+++|+|+|++++++++++
T Consensus 78 ~~~~~l~~a~~~~~~~~~~l~~~f~~~~~~~~~~~~~~vdF~~~~~~a~~~IN~wv~~~T~g~I~~lv~~~~l~ 151 (370)
T cd02044 78 NSFYSLKLVNRLYGEKRYNFLPEFLSSTKKPYAKELETVDFKDKAEETRGQINSWIKDQTKGKIENLLPENSVD 151 (370)
T ss_pred CCCceEeeeeehhhcCCCcccHHHHHHHHHhhccceEeccccCCHHHHHHHHHHHHHHhcCCchhhcCCCCCCC
Confidence 3457899999999999999999999999999999999999986 567889999999999999999999877653
No 8
>cd02059 ovalbumin_like The ovalbumin_like group of serpins contains ovalbumin, the squamous cell carcinoma antigen 1 (SCCA1) and other closely related serpins of clade B of the serpin superfamily. Ovalbumin, the major protein component of avian egg white, is a non-inhibitory member of SERine Proteinase INhibitorS (serpins). In contrast, SCCA1 inhibits cysteine proteinases such as cathepsin S, K, L, and papain, a so called cross-class serpin.
Probab=99.64 E-value=8e-16 Score=106.35 Aligned_cols=73 Identities=33% Similarity=0.540 Sum_probs=66.5
Q ss_pred CCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCc-HHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 11 ISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQP-PAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 11 ~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~-~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
..++.|.++|++|++++++++++|.+.++++|++++..+||.++ ..++..||.||+++|+|+|++++++++++
T Consensus 95 ~~~~~l~~an~l~~~~~~~l~~~f~~~~~~~~~~~~~~vdF~~~~~~~~~~IN~wv~~~T~g~I~~~~~~~~i~ 168 (389)
T cd02059 95 NDDYELSIANRLYGEKTYPFHQEYLDCVEKLYRAKLEPVDFQNAAEASRKKINSWVESQTNGKIKNLFGKGTID 168 (389)
T ss_pred CCCcEEEEeeehhhhCCCCccHHHHHHHHHHhcCeeEEEcCCCCHHHHHHHHHHHHHHhccchhhhcCCCCCCC
Confidence 45678999999999999999999999999999999999999875 56778999999999999999999877664
No 9
>cd02046 hsp47 Heat shock protein 47 (Hsp47), also called colligin, because of its collagen binding ability, is a chaperone specific for procollagen. It has been shown to be essential for collagen biosynthesis, but its exact function is still unclear. Hsp47 is a non-inhibitory member of the SERPIN superfamily and corresponds to clade H.
Probab=99.64 E-value=6e-16 Score=106.47 Aligned_cols=67 Identities=21% Similarity=0.485 Sum_probs=63.5
Q ss_pred CCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 12 SSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 12 ~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
.+..+.++|++|+++++.++++|.+.++++|++++..+||.++..+++.||+||+++|+|+|++++.
T Consensus 84 ~~~~l~~an~~~~~~~~~l~~~F~~~~~~~y~~~~~~vdF~~~~~a~~~IN~wv~~~T~g~I~~~~~ 150 (366)
T cd02046 84 RNVTWKIGNRLYGPSSVSFADDFVKNSKKHYNYEHSKINFRDKRSALNSINEWAAQTTDGKLPEVTK 150 (366)
T ss_pred CCcEEEEeeehhhcCCCChhHHHHHHHHHHhcCeeEEecCCChHHHHHHHHHHHHHhcCCchhHhhc
Confidence 4578999999999999999999999999999999999999998888999999999999999999974
No 10
>cd02051 PAI-1_nexin-1 Plasminogen activator inhibitor-1_like. Plasminogen activator inhibitor-1 (PAI-1) is the primary, fast-acting inhibitor of plasminogen activators. It is often bound to vitronectin, an abundant component of the extracellular matrix in many tissues. Protease nexin-1 is a potent serpin able to inhibit thrombin, plasmin, and plasminogen activators. PAI-1 and nexin-1 are members of the serpin superfamily and represent clade E. In general, SERine Proteinase INhibitors (serpins) exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors which regulate blood coagulation cascades. Non-inhibitory serpins perform many diverse functions such as chaperoning proteins or transporting hormones.
Probab=99.63 E-value=3.1e-16 Score=108.08 Aligned_cols=69 Identities=28% Similarity=0.520 Sum_probs=64.8
Q ss_pred CCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccCC
Q psy2655 12 SSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKAG 80 (84)
Q Consensus 12 ~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~~ 80 (84)
+...|.++|++|++++++++++|.+.++++|++++..+||.+++.++..||.||+++|+|+|+++++++
T Consensus 84 ~~~~l~~an~i~~~~~~~l~~~f~~~~~~~f~a~~~~vdF~~~~~a~~~IN~Wv~~~T~g~I~~~l~~~ 152 (377)
T cd02051 84 NKDIVTTANAVFAQSGFKMEVPFVPRNKEVFQCEVKSVDFSDPETAAFSINDWVKNETKGMIDNLLSPD 152 (377)
T ss_pred CccchHHHHHHHhcCCCchhHHHHHHHHHHhcCeeEEECCCCHHHHHHHHHHHHHHhcCCCccccCCCC
Confidence 345799999999999999999999999999999999999999888999999999999999999999875
No 11
>cd02057 maspin_like Maspin (mammary serine proteinase inhibitor), a member of the serpin superfamily, with a multitude of effects on cells and tissues at an assortment of developmental stages. Maspin has tumor suppressing activity against breast and prostate cancer. In general, SERine Proteinase INhibitors (serpins) exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors which regulate blood coagulation cascades. Non-inhibitory serpins perform many diverse functions such as chaperoning proteins or transporting hormones.
Probab=99.63 E-value=2.9e-16 Score=108.09 Aligned_cols=72 Identities=18% Similarity=0.397 Sum_probs=65.7
Q ss_pred CCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCC-cHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 12 SSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQ-PPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 12 ~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~-~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
..+.|.++|++|++++++++++|.+.++++|++++..+||.+ +..++..||+||+++|+|+|+++++++.++
T Consensus 79 ~~~~l~~an~l~v~~~~~~~~~f~~~~~~~y~~~~~~vdF~~~~~~a~~~IN~wv~~~T~g~I~~l~~~~~i~ 151 (372)
T cd02057 79 SFYSLKLIKRLYVDKSLNLSTDFINSTKRPYPKELETVDFKDKLEETRGQINNSIKELTDGHFENILNENSVN 151 (372)
T ss_pred ccchhHhhhhhhhhCCCCccHHHHHHHHHHhcCeeEEecCCCCHHHHHHHHHHHHHHhccChhhhcCCCCCCC
Confidence 346799999999999999999999999999999999999976 567889999999999999999999877653
No 12
>PF00079 Serpin: Serpin (serine protease inhibitor); InterPro: IPR023796 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. Serpins (SERine Proteinase INhibitors) [, , ] belong to MEROPS inhibitor family I4, clan ID. Serpins are proteins that are primarily known as irreversible serine protease inhibitors active against S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO) and C14 (IPR002398 from INTERPRO) peptidases. There are both extra- and intra-cellular serpins, which are found in all groups of organisms with the notable exception of fungi [, ]. In contrast to "rigid" proteinase inhibitors, such as those of the Kunitz or Kazal families, the serpins are metastable proteins (active-state proteins) which interact with their substrate and irreversibly trap the acyl intermediate as a result of a major conformational change []; they are best described as suicide substrate inhibitors. The common structure of these proteins is a multi-domain fold containing a bundle of 8 or 9 alpha helices and a beta sandwich formed by 3 beta sheets. The reactive centre loop (RCL) is found in the C-terminal part of these proteins. Serpins and their homologues are a group of high molecular weight (40 to 50 kDa) structurally related proteins involved in a number of fundamental biological processes such as blood coagulation, complement activation, fibrinolysis, angiogenesis, inflammation, tumour suppression and hormone transport. All known serpins have been classified into 16 clades and 10 orphan sequences; the vertebrate serpins can be conveniently classified into six sub-groups []. In human plasma they represent approximately 2% of the total protein, of which 70% is alpha-1-antitrypsin. On the basis of strong sequence similarities, a number of proteins with no known inhibitory activity also belong to this family, these include: angiotensinogen, corticosteroid-binding globulin and thyroxin-binding globulin []. This entry represents the structural domain of serpins. It consists of a cluster of helices and a beta-sandwich.; PDB: 3PZF_A 3B9F_I 1MTP_A 1SNG_A 1ANT_I 2ZNH_B 1BR8_I 1SR5_A 1E03_L 1AZX_L ....
Probab=99.63 E-value=2e-15 Score=103.03 Aligned_cols=74 Identities=43% Similarity=0.810 Sum_probs=69.3
Q ss_pred CCCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 10 EISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 10 ~~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
...+..+.++|++|+++++++.++|.+.++++|+++++.+||.++..++..||.||+++|+|+|++++.+++++
T Consensus 78 ~~~~~~~~~~~~l~~~~~~~~~~~f~~~~~~~~~~~~~~~df~~~~~~~~~IN~wv~~~T~g~I~~~v~~~~i~ 151 (371)
T PF00079_consen 78 PSDNPQLKLANKLFVDKNLPLKPEFKKTLKKYFNAEVESVDFSDPPQAAKEINKWVSEKTNGKIKNLVSPNDID 151 (371)
T ss_dssp STSSSEEEEEEEEEEETTSEBEHHHHHHHHHHHSEEEEEEETTSHHHHHHHHHHHHHHHTTTSSSTSSSTTSSS
T ss_pred ccccccceeeeeeecccccccccccchhhhhccccccccccccccchhhhhhcchhhhhccceeeccccccchh
Confidence 45678999999999999999999999999999999999999999877999999999999999999999987764
No 13
>cd02045 antithrombin-III_like Antithrombin is a serine proteinase inhibitor (serpin) which controls the process of coagulation. It is the most important anticoagulant molecule in mammalian circulation systems, controlled by its interaction with the co-factor, heparin, which accelerates its interaction with target proteases, such as thrombin and factor Xa. This subgroup corresponds to clade C of the serpin superfamily.
Probab=99.62 E-value=3.9e-16 Score=107.86 Aligned_cols=73 Identities=29% Similarity=0.544 Sum_probs=65.8
Q ss_pred CCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCC-cHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 11 ISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQ-PPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 11 ~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~-~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
+.+..|.++|++|++++++++++|.+.++++|+++++.+||.+ +..++..||+||+++|+|+|++++++++++
T Consensus 89 ~~~~~l~~an~l~~~~~~~l~~~f~~~~~~~y~~~~~~vdF~~~~~~~~~~IN~wv~~~T~g~I~~ll~~~~l~ 162 (381)
T cd02045 89 NKSSELISANRLFGDKSLTFNETYQDISEIVYGAKLWPLDFKEKPELSRITINEWIANKTENRITDVIPEGAID 162 (381)
T ss_pred CCcchhhhhhhhhhcCCCchhHHHHHHHHHhhcCeeEEecCCCCHHHHHHHHHHHHHHHHhhHHHhhCCCCCCC
Confidence 3456899999999999999999999999999999999999974 566788999999999999999999876653
No 14
>cd02058 PAI-2 Plasminogen Activator Inhibitor-2 (PAI-2). PAI-2 is a serine protease inhibitor that belongs to the ov-serpin branch of the serpin superfamily. It is is an effective inhibitor of urinary plasminogen activator (urokinase or uPA) and is involved in cell differentiation, tissue growth and regeneration.
Probab=99.62 E-value=8.7e-16 Score=105.80 Aligned_cols=73 Identities=29% Similarity=0.453 Sum_probs=66.7
Q ss_pred CCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCC-cHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 11 ISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQ-PPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 11 ~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~-~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
+.++.|.++|++|++++++++++|.+.++++|++++..+||.+ +..+++.||.||+++|+|+|++++++++++
T Consensus 82 ~~~~~l~~~n~l~~~~~~~l~~~f~~~~~~~~~~~~~~vdF~~~~~~a~~~IN~wv~~~T~g~I~~~~~~~~i~ 155 (380)
T cd02058 82 GTNYLLKSANRLYGEKTYPFLEKFLQLTQKYYQAEPQAVDFVEAAEQARKEINSWVERQTEGKIQNLLPPGSVD 155 (380)
T ss_pred CCccEeehhhhhhhcCCCcHhHHHHHHHHhhhcCceEEecCCCCHHHHHHHHHHHHHHhcccchhhcCCcCCCC
Confidence 3557899999999999999999999999999999999999975 567899999999999999999999877664
No 15
>smart00093 SERPIN SERine Proteinase INhibitors.
Probab=99.61 E-value=1.3e-15 Score=104.10 Aligned_cols=69 Identities=36% Similarity=0.592 Sum_probs=63.6
Q ss_pred CCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCc-HHHHHHHHHHHHHHhCccccccccC
Q psy2655 11 ISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQP-PAAAKEMNDWVSNHTNDKIKDLIKA 79 (84)
Q Consensus 11 ~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~-~~~~~~IN~wv~~~T~g~I~~ll~~ 79 (84)
+.+..|.+||++|++++++++++|.+.++++|++++..+||.++ ..++..||.||+++|+|+|++++++
T Consensus 73 ~~~~~l~~an~i~~~~~~~l~~~f~~~~~~~~~~~~~~~dF~~~~~~a~~~IN~wv~~~T~g~I~~~~~~ 142 (359)
T smart00093 73 DSQLELKTANALFVDKSLKLKDSFLEDIKKLYGAEVQSVDFSDKAEEAKKQINDWVEKKTQGKIKDLLSD 142 (359)
T ss_pred CCcceEEeeeeehccCCCchhHHHHHHHHHHhcCeeEEecCCCCHHHHHHHHHHHHHHhhhchHHHhccc
Confidence 34568999999999999999999999999999999999999985 6699999999999999999999863
No 16
>cd02056 alpha-1-antitrypsin_like alpha-1-antitrypsin_like. This family contains a variety of different members of clade A of the serpin superfamily. They include the classical serine proteinase inhibitors, alpha-1-antitrypsin and alpha-1-antichymotrypsin, protein C inhibitor, kallistatin, and noninhibitory serpins, like corticosteroid and thyroxin binding globulins. In general, SERine Proteinase INhibitors (serpins) exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors which regulate blood coagulation cascades. Non-inhibitory serpins perform many diverse functions such as chaperoning proteins or transporting hormones. Serpins are of medical interest because mutants have been associated with blood clotting disorders, emphysema, cirrhosis, and dementia.
Probab=99.58 E-value=3.4e-15 Score=102.22 Aligned_cols=67 Identities=28% Similarity=0.467 Sum_probs=63.6
Q ss_pred CCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 12 SSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 12 ~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
.+..+.++|++|++++++++++|.+.++++|++++..+||.++..++..||.||+++|+|+|+++++
T Consensus 80 ~~~~l~~an~l~~~~~~~~~~~f~~~~~~~y~~~~~~~dF~~~~~~~~~IN~wv~~~T~g~I~~~~~ 146 (361)
T cd02056 80 SGLQLNMGNALFLDKRLKPLDKFLEDVKHLYESEAFSTDFQDSAEAKKQINDYVEKKTHGKIVDLVK 146 (361)
T ss_pred CcceEEeeeehhhhCCCChhHHHHHHHHHHhcCceEeeCCCChHHHHHHHHHHHHHHhhhhHHhhcC
Confidence 4578999999999999999999999999999999999999998889999999999999999999985
No 17
>cd02049 bacterial_SERPIN SERine Proteinase INhibitors (serpins), prokaryotic subgroup. Little information about specific functions is available for this subgroup, most likely they are inhibitory members of the serpin superfamily. In general, serpins exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors.
Probab=99.58 E-value=3.3e-15 Score=102.45 Aligned_cols=68 Identities=32% Similarity=0.573 Sum_probs=63.0
Q ss_pred CCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccC
Q psy2655 11 ISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKA 79 (84)
Q Consensus 11 ~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~ 79 (84)
+.+..|.++|++|++++++++++|.+.++++|++++..+||.++ .++..||+||+++|+|+|++++.+
T Consensus 80 ~~~~~l~~an~i~~~~~~~~~~~f~~~~~~~~~~~v~~~dF~~~-~a~~~IN~wv~~~T~g~I~~~~~~ 147 (364)
T cd02049 80 DKTVELIIANSIWIEPGFTLKPDFLQTIKDYYQAYVLELDFQSP-AAAEEINRWVKEKTKGKIDKIVDK 147 (364)
T ss_pred CCCcEEEeehhhHhhCCCCccHHHHHHHHHhhcccceeeCCCCH-HHHHHHHHHHHHhhcChHHHhcCC
Confidence 34578999999999999999999999999999999999999987 488999999999999999999854
No 18
>cd02047 HCII Heparin cofactor II (HCII) inhibits thrombin, the final protease of the coagulation cascade. HCII is allosterically activated by binding to cell surface glycosaminoglycans (GAGs). The specificity of HCII for thrombin is conferred by a highly acidic hirudin-like N-terminal tail, which becomes available after GAG binding for interaction with the anion-binding exosite I of thrombin. This subgroup corresponds to clade D of the serpin superfamily.
Probab=99.52 E-value=3.4e-14 Score=99.95 Aligned_cols=67 Identities=21% Similarity=0.329 Sum_probs=61.5
Q ss_pred CCCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 11 ISSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 11 ~~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
..++.|.++|++|++++++++++|.+.++++|+++++.+||.++. ++..||+||+++|+|+|++++.
T Consensus 152 ~~~~~L~~an~l~v~~~~~l~~~F~~~~~~~y~a~v~~vdF~~~~-~~~~IN~wV~~~T~g~I~~~l~ 218 (436)
T cd02047 152 NFGYTLRSVNDLYIKKDFPILLDFKNNVKTYYFAEAQIADFSDPA-FITKTNNRIQKLTKGLIKEALE 218 (436)
T ss_pred CCCcEEeeeeehhhhCCCCccHHHHHHHHHHhcCceEEeCCCCHH-HHHHHHHHHHHhcCCccccccc
Confidence 346789999999999999999999999999999999999999875 4568999999999999999885
No 19
>cd02055 PZI Protein Z-dependent protease inhibitor (ZPI) is a member of the serpin superfamily of proteinase inhibitors (clade A10). ZPI inhibits coagulation factor Xa , dependent on protein Z (PZ), a vitamin K-dependent plasma protein. ZPI also inhibits factor XIa in a process that does not require PZ. In general, SERine Proteinase INhibitors (serpins) exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms.
Probab=99.48 E-value=1.7e-14 Score=99.33 Aligned_cols=63 Identities=22% Similarity=0.328 Sum_probs=57.4
Q ss_pred EEEee--eeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 16 LKIAN--KIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 16 l~~an--~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
+.+++ ++|++++++++++|.+.++++|++++..+||.++..+++.||+||+++|+|+|+++++
T Consensus 85 ~~l~~~~~~~v~~~~~l~~~F~~~~~~~y~~~~~~vdF~~~~~a~~~IN~wv~~~T~g~I~~~~~ 149 (365)
T cd02055 85 LGFTQGIFAFIHKDFDVKEAFFNLSKQYFDMECLCMDFQNASQAKFLINHNIKKETKGKIPELFD 149 (365)
T ss_pred eehHHHHHHHHHCCCCccHHHHHHHHHHhcCeeEEECCCChHHHHHHHHHHHHHhhcchHHHHhh
Confidence 44455 4599999999999999999999999999999998888999999999999999999985
No 20
>cd02050 C1_inh C1 inhibitor (C1-Inh) is a protease inhibitor of the serpin family. It plays a pivotal role in regulating the activation of the classical complement pathway and of the contact system, via regulating bradykinin formation, inhibiting factor XII and kallikrein of the contact system, and via acting on factor XI in the coagulation cascade. This subgroup corresponds to clade G of the serpin superfamily.
Probab=99.41 E-value=8.5e-14 Score=95.32 Aligned_cols=63 Identities=21% Similarity=0.411 Sum_probs=58.4
Q ss_pred eEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 15 ELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 15 ~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
.|.++|++|++++++++++|.+.++++|++++..+||. +..++..||+||+++|+|+|++++.
T Consensus 74 ~l~~an~i~~~~~~~l~~~f~~~~~~~y~~~~~~~~~~-~~~a~~~IN~wv~~~T~g~I~~~~~ 136 (352)
T cd02050 74 GLLSASQIFHHPDLHLRESFTNESWQFYKARPRELSNN-SELNLEMINSWVAKATNNKIPRLLD 136 (352)
T ss_pred hhHHHHHHHHhCCCCccHHHHHHHHHHhCCcceeecCC-hHHHHHHHHHHHHHHhcchHHHHhc
Confidence 58899999999999999999999999999999999995 4568899999999999999999984
No 21
>cd02052 PEDF Pigment epithelium-derived factor (PEDF)_like. PEDF is non-inhibitory member of the Serpin superfamily. It exhibits neurotrophic, neuroprotective and antiangiogenic properties and is widely expressed in the developing and adult nervous systems. This subgroup corresponds to clade F1 of the serpin superfamily.
Probab=99.41 E-value=1.4e-13 Score=95.06 Aligned_cols=64 Identities=22% Similarity=0.415 Sum_probs=58.9
Q ss_pred ceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccC
Q psy2655 14 PELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKA 79 (84)
Q Consensus 14 ~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~ 79 (84)
..|+++|++|++++++++++|.+.++++|++++..+||.. .++..||+||+++|+|+|++++++
T Consensus 95 ~~l~~an~i~v~~~~~l~~~f~~~~~~~~~~~~~~~~~~~--~~~~~IN~wv~~~T~g~I~~~l~~ 158 (374)
T cd02052 95 KGLKSASRILLERKLRLRLEFVNQVEKSYGERPRILAGNA--LDLQEINDWVQQQTGGKVDRFVKE 158 (374)
T ss_pred ccchhhhHHHHhCCCchhHHHHHHHHHHhhCccccccCCH--HHHHHHHHHHHHhhcCcHHHhCCC
Confidence 3699999999999999999999999999999999999843 588999999999999999999864
No 22
>cd02053 alpha2AP Alpha2-antiplasmin (alpha2AP) is the primary inhibitor of plasmin, a proteinase that digests fibrin, the main component of blood clots. Alpha2-Antiplasmin forms an inactive 1 : 1 stoichiometric complex with plasmin. It also rapidly crosslinks to fibrin during blood clotting by activated coagulation factor XIII, and as a consequence fibrin becomes more resistant to fibrinolysis. Therefore alpha2AP is important in modulating the effectiveness and persistence of fibrin with respect to its susceptibility to digestion and removal by plasmin. This subgroup corresponds to clade F2 of the serpin superfamily.
Probab=99.39 E-value=3.4e-13 Score=92.31 Aligned_cols=65 Identities=17% Similarity=0.467 Sum_probs=57.6
Q ss_pred CCceEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCC-CcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 12 SSPELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFS-QPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 12 ~~~~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~-~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
.+..|.++|++|++++++++++|.+.++++|++++ +||. ++..++..||+||+++|+|+|+++++
T Consensus 75 ~~~~l~~an~i~v~~~~~~~~~f~~~~~~~~~~~~--~~~~~~~~~~~~~IN~wv~~~T~g~I~~~~~ 140 (351)
T cd02053 75 GPGALRLATRMYLQKGFEIKESFLEESEKLYGAKP--VSLTGTKEDDLANINKWVKEATEGQIPNFLS 140 (351)
T ss_pred CccEeehhhHHHHhCCCchhHHHHHHHHHHcCCEE--eecCCCCHHHHHHHHHHHHHhccCchhHhcC
Confidence 34689999999999999999999999999999875 5664 35678899999999999999999985
No 23
>PHA02660 serpin-like protein; Provisional
Probab=99.25 E-value=2.2e-12 Score=88.98 Aligned_cols=61 Identities=18% Similarity=0.442 Sum_probs=54.9
Q ss_pred eEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCC-cHHHHHHHHHHHHHHhCcccccccc
Q psy2655 15 ELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQ-PPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 15 ~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~-~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
.+.+||++|++++++++++|.+.++. |++++..+||.+ +..+++.||.||+++|+ |++++.
T Consensus 72 ~l~~an~l~v~~~~~l~~~F~~~~~~-~~~~v~~vdF~~~~~~a~~~IN~WV~~~T~--I~~~~~ 133 (364)
T PHA02660 72 HIHNITKVYVDSHLPIHSAFVASMND-MGIDVILADLANHAEPIRRSINEWVYEKTN--IINFLH 133 (364)
T ss_pred hHHHHHHHHHhcCCchhHHHHHHHHH-cCCceEEeccccCHHHHHHHHHHHHHhccC--Cccccc
Confidence 47789999999999999999999987 899999999976 56788999999999998 888873
No 24
>PHA02948 serine protease inhibitor-like protein; Provisional
Probab=99.08 E-value=5.8e-11 Score=82.23 Aligned_cols=61 Identities=13% Similarity=0.342 Sum_probs=52.6
Q ss_pred EEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccCCCCC
Q psy2655 16 LKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKAGNLT 83 (84)
Q Consensus 16 l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~~~i~ 83 (84)
+.++|++|+++++.++++|.+ .+|.++++.+||.+ .++..||+||+++| +|++++++++++
T Consensus 100 ~~~an~~~v~~~~~~~~~f~~---~~~~~~v~~vdF~~--~a~~~IN~WV~~~T--~I~~~~~~~~i~ 160 (373)
T PHA02948 100 TDLTYQSFVDNTVCIKPSYYQ---QYHRFGLYRLNFRR--DAVNKINSIVERRS--GMSNVVDSTMLD 160 (373)
T ss_pred HHHHHHHHHhcCCCccHHHHH---HHhhCceEecCccH--HHHHHHHHHHHhcC--CCcccCCCCCCC
Confidence 478999999999999999997 55678999999973 58899999999998 599999876654
No 25
>PF07037 DUF1323: Putative transcription regulator (DUF1323); InterPro: IPR010749 This family consists of several hypothetical Enterobacterial proteins of around 120 residues in length. The function of this family is unknown.
Probab=73.31 E-value=2.4 Score=25.45 Aligned_cols=12 Identities=25% Similarity=0.736 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHH
Q psy2655 57 AAKEMNDWVSNH 68 (84)
Q Consensus 57 ~~~~IN~wv~~~ 68 (84)
+++.||.||.++
T Consensus 13 srQTINrWvRke 24 (122)
T PF07037_consen 13 SRQTINRWVRKE 24 (122)
T ss_pred cHHHHHHHHHhc
Confidence 678999999884
No 26
>PF10957 DUF2758: Protein of unknown function (DUF2758); InterPro: IPR020296 Cse60 is expressed during sporulation in Bacillus subtilis. Transcription commences around 2h after the start of sporulation and had an absolute requirement for the transcription factor sigmaE. Maximal expression of cse60 further depended on the DNA-binding protein SpoIIID. Cse60 is an acidic product of only 60 residues, whose function is not known [].
Probab=65.51 E-value=7 Score=20.56 Aligned_cols=30 Identities=17% Similarity=0.340 Sum_probs=18.7
Q ss_pred ceeeCCCCcHHHHHHHHHHHHHHhCccccc
Q psy2655 46 LGKVDFSQPPAAAKEMNDWVSNHTNDKIKD 75 (84)
Q Consensus 46 v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ 75 (84)
|+.+|-.........||.|.++.....|-+
T Consensus 4 VKvFd~~he~dLe~~vN~fL~~~~~~~viD 33 (60)
T PF10957_consen 4 VKVFDEEHEKDLEDQVNDFLAKLDDDQVID 33 (60)
T ss_pred EEEEehhhHHHHHHHHHHHHHhCCCCcEEE
Confidence 344454444456689999998865555443
No 27
>PF13092 CENP-L: Kinetochore complex Sim4 subunit Fta1
Probab=55.06 E-value=17 Score=22.67 Aligned_cols=45 Identities=11% Similarity=0.203 Sum_probs=36.5
Q ss_pred CcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCcccc
Q psy2655 29 ELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIK 74 (84)
Q Consensus 29 ~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~ 74 (84)
.+...+.+.+++.|++.+..+-|... .-...+..|+...+.....
T Consensus 16 ~l~~~v~~~L~~~FDc~is~l~l~~~-~L~~~~e~w~~~~~~~~~~ 60 (162)
T PF13092_consen 16 SLRQAVISWLSTTFDCRISPLRLPSS-FLCWMLEMWIATLTSSSPG 60 (162)
T ss_pred HHHHHHHHHHHHhcCceEEeeecCHH-HHHHHHHHHHHhcCCCCCC
Confidence 55678888899999999999998754 4678899999988876554
No 28
>KOG2729|consensus
Probab=52.36 E-value=11 Score=23.17 Aligned_cols=19 Identities=21% Similarity=0.375 Sum_probs=16.2
Q ss_pred eeCCCCcHHHHHHHHHHHH
Q psy2655 48 KVDFSQPPAAAKEMNDWVS 66 (84)
Q Consensus 48 ~vdf~~~~~~~~~IN~wv~ 66 (84)
..|+.||-+....+|.||-
T Consensus 35 e~DyiNPid~c~rlN~lVl 53 (137)
T KOG2729|consen 35 ETDYINPIDLCSRLNRLVL 53 (137)
T ss_pred hhccCCHHHHHHHHhHHHh
Confidence 4689999888999999983
No 29
>PF03311 Cornichon: Cornichon protein; InterPro: IPR003377 The drosophila cornichon protein (gene: cni) [] is required in the germline for dorsal-ventral signalling. The dorsal-ventral pattern formation involves a reorganisation of the microtubule network correlated with the movement of the oocyte nucleus, and depending on the initial correct establishment of the anterior-posterior axis via a signal from the oocyte produced by cornichon and gurken and received by torpedo protein in the follicle cells. The biochemical function of the cornichon protein is currently not known. It is a protein of 144 residues that seems to contain three transmembrane regions. ; GO: 0035556 intracellular signal transduction, 0016020 membrane
Probab=50.74 E-value=15 Score=22.04 Aligned_cols=19 Identities=21% Similarity=0.368 Sum_probs=16.2
Q ss_pred eeCCCCcHHHHHHHHHHHH
Q psy2655 48 KVDFSQPPAAAKEMNDWVS 66 (84)
Q Consensus 48 ~vdf~~~~~~~~~IN~wv~ 66 (84)
+.|+.||-+..++||.||.
T Consensus 34 e~D~iNp~d~c~~lN~lv~ 52 (128)
T PF03311_consen 34 ECDYINPIDLCSRLNPLVL 52 (128)
T ss_pred HhccCCHHHHHHHHHHHHH
Confidence 4578899889999999985
No 30
>TIGR01617 arsC_related transcriptional regulator, Spx/MgsR family. This model represents a portion of the proteins within the larger set covered by Pfam model pfam03960. That larger family includes a glutaredoxin-dependent arsenate reductase (TIGR00014). Characterized members of this family include Spx and MgsR from Bacillus subtili. Spx is a global regulator for response to thiol-specific oxidative stress. It interacts with RNA polymerase. MgsR (modulator of the general stress response, also called YqgZ) provides a second level of regulation for more than a third of the proteins in the B. subtilis general stress regulon controlled by Sigma-B.
Probab=48.14 E-value=53 Score=18.91 Aligned_cols=56 Identities=16% Similarity=0.138 Sum_probs=33.4
Q ss_pred eeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccC
Q psy2655 22 IYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKA 79 (84)
Q Consensus 22 lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~ 79 (84)
+|..++.+-...=.+.++. .+.+...+|......+...+-.|++.. +..+.+++..
T Consensus 3 iY~~~~C~~c~ka~~~L~~-~~i~~~~idi~~~~~~~~el~~l~~~~-~~~~~~lin~ 58 (117)
T TIGR01617 3 VYGSPNCTTCKKARRWLEA-NGIEYQFIDIGEDGPTREELLDILSLL-EDGIDPLLNT 58 (117)
T ss_pred EEeCCCCHHHHHHHHHHHH-cCCceEEEecCCChhhHHHHHHHHHHc-CCCHHHheeC
Confidence 4555554333332222322 366888899876556788999998874 4456666654
No 31
>PLN00174 predicted protein; Provisional
Probab=47.41 E-value=14 Score=23.18 Aligned_cols=19 Identities=16% Similarity=0.235 Sum_probs=16.3
Q ss_pred eeCCCCcHHHHHHHHHHHH
Q psy2655 48 KVDFSQPPAAAKEMNDWVS 66 (84)
Q Consensus 48 ~vdf~~~~~~~~~IN~wv~ 66 (84)
..|+.||-.....||.||-
T Consensus 40 E~DYiNPiD~CnrLN~lVl 58 (160)
T PLN00174 40 STDTVNPVEVCDKVNQLKV 58 (160)
T ss_pred hhccCCHHHHHHHhhHHHh
Confidence 4688899889999999983
No 32
>PRK01655 spxA transcriptional regulator Spx; Reviewed
Probab=46.81 E-value=56 Score=19.42 Aligned_cols=55 Identities=5% Similarity=0.132 Sum_probs=33.2
Q ss_pred eeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 22 IYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 22 lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
||..++.+-...=.+.+.. .+.....+|+.....+...+-.|++....| +.+++.
T Consensus 4 iY~~~~C~~C~ka~~~L~~-~gi~~~~idi~~~~~~~~eL~~~l~~~~~g-~~~lin 58 (131)
T PRK01655 4 LFTSPSCTSCRKAKAWLEE-HDIPFTERNIFSSPLTIDEIKQILRMTEDG-TDEIIS 58 (131)
T ss_pred EEeCCCChHHHHHHHHHHH-cCCCcEEeeccCChhhHHHHHHHHHHhcCC-HHHHHh
Confidence 5666655433332222222 266788899865556788999999885444 555554
No 33
>TIGR03044 PS_II_psb27 photosystem II protein Psb27. Members of this family are the Psb27 protein of the cyanobacterial photosynthetic supracomplex, photosystem II. Although most protein components of both cyanobacterial and chloroplast versions of photosystem II are closely related and described together by single model families, this family is strictly bacterial. Some uncharacterized proteins with highly divergent sequences, from Arabidopsis, score between trusted and noise cutoffs for this model but are not at this time assigned as functionally equivalent photosystem II proteins.
Probab=46.14 E-value=21 Score=21.87 Aligned_cols=15 Identities=27% Similarity=0.425 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHh
Q psy2655 55 PAAAKEMNDWVSNHT 69 (84)
Q Consensus 55 ~~~~~~IN~wv~~~T 69 (84)
.++++.||+||++-=
T Consensus 66 ~~ar~~indyvsrYR 80 (135)
T TIGR03044 66 AEARQLINDYISRYR 80 (135)
T ss_pred HHHHHHHHHHHHHhc
Confidence 346789999998843
No 34
>PLN00064 photosystem II protein Psb27; Provisional
Probab=44.87 E-value=22 Score=22.46 Aligned_cols=14 Identities=21% Similarity=0.461 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHh
Q psy2655 56 AAAKEMNDWVSNHT 69 (84)
Q Consensus 56 ~~~~~IN~wv~~~T 69 (84)
++++.||.||++-=
T Consensus 96 eaR~~iNdyvSrYR 109 (166)
T PLN00064 96 ELRETSNSWVAKYR 109 (166)
T ss_pred HHHHHHHHHHHHhc
Confidence 46789999998843
No 35
>PF14409 Herpeto_peptide: Ribosomally synthesized peptide in Herpetosiphon
Probab=42.71 E-value=29 Score=18.05 Aligned_cols=18 Identities=33% Similarity=0.383 Sum_probs=12.9
Q ss_pred CCCc--HHHHHHHHHHHHHH
Q psy2655 51 FSQP--PAAAKEMNDWVSNH 68 (84)
Q Consensus 51 f~~~--~~~~~~IN~wv~~~ 68 (84)
|++. +..+.+||.||+.-
T Consensus 17 ~~~~~le~eaa~~~~~V~~~ 36 (58)
T PF14409_consen 17 FGLTYLEEEAAEINDVVGCW 36 (58)
T ss_pred HhhHHHHHHHHHHHHHHHhh
Confidence 5543 55678999999763
No 36
>cd03036 ArsC_like Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC. Proteins containing a redox active CXXC motif like TRX and glutaredoxin (GRX) function as protein disulfide oxidoreductases, altering the redox state of target proteins via the reversible oxidation of the active site dithiol. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via GRX, through a single catalytic cysteine.
Probab=40.97 E-value=61 Score=18.54 Aligned_cols=26 Identities=15% Similarity=0.564 Sum_probs=19.2
Q ss_pred CCcceeeCCCCcHHHHHHHHHHHHHH
Q psy2655 43 NSELGKVDFSQPPAAAKEMNDWVSNH 68 (84)
Q Consensus 43 ~~~v~~vdf~~~~~~~~~IN~wv~~~ 68 (84)
+.....+|+.....+...+..|++..
T Consensus 23 ~i~~~~idi~~~~~~~~el~~~~~~~ 48 (111)
T cd03036 23 GVDYTAIDIVEEPPSKEELKKWLEKS 48 (111)
T ss_pred CCceEEecccCCcccHHHHHHHHHHc
Confidence 56778888864445778899998763
No 37
>PRK13344 spxA transcriptional regulator Spx; Reviewed
Probab=40.46 E-value=51 Score=19.69 Aligned_cols=55 Identities=11% Similarity=0.165 Sum_probs=31.8
Q ss_pred eeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 22 IYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 22 lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
||..++..-...=.+.++. .+.+...+|+.....+...+..|++... +.+.+++.
T Consensus 4 iY~~~~C~~crkA~~~L~~-~~i~~~~~d~~~~~~s~~eL~~~l~~~~-~~~~~lin 58 (132)
T PRK13344 4 IYTISSCTSCKKAKTWLNA-HQLSYKEQNLGKEPLTKEEILAILTKTE-NGIESIVS 58 (132)
T ss_pred EEeCCCCHHHHHHHHHHHH-cCCCeEEEECCCCCCCHHHHHHHHHHhC-CCHHHhhc
Confidence 5555554332222222222 3668888998644457889999988843 33555554
No 38
>cd03034 ArsC_ArsC Arsenate Reductase (ArsC) family, ArsC subfamily; arsenic reductases similar to that encoded by arsC on the R733 plasmid of Escherichia coli. E. coli ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], the first step in the detoxification of arsenic, using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX). ArsC contains a single catalytic cysteine, within a thioredoxin fold, that forms a covalent thiolate-As(V) intermediate, which is reduced by GRX through a mixed GSH-arsenate intermediate. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases.
Probab=40.30 E-value=42 Score=19.29 Aligned_cols=36 Identities=19% Similarity=0.374 Sum_probs=25.3
Q ss_pred CCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccC
Q psy2655 43 NSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKA 79 (84)
Q Consensus 43 ~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~ 79 (84)
+..+..+|+....-+..++..|++.. ++.+.+++..
T Consensus 23 ~i~~~~~di~~~~~t~~el~~~l~~~-~~~~~~lin~ 58 (112)
T cd03034 23 GIEPEIVEYLKTPPTAAELRELLAKL-GISPRDLLRT 58 (112)
T ss_pred CCCeEEEecccCCcCHHHHHHHHHHc-CCCHHHHHhc
Confidence 56788899865445788999998874 4446666543
No 39
>PF09373 PMBR: Pseudomurein-binding repeat; InterPro: IPR018975 Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) is a methanogenic Gram-positive microorganism with a cell wall consisting of pseudomurein. This repeat specifically binds to pseudomurein. This repeat is found at the N terminus of PeiW and PeiP which are pseudomurein binding phage proteins.
Probab=35.02 E-value=35 Score=15.39 Aligned_cols=21 Identities=14% Similarity=0.396 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHhCccccccc
Q psy2655 55 PAAAKEMNDWVSNHTNDKIKDLI 77 (84)
Q Consensus 55 ~~~~~~IN~wv~~~T~g~I~~ll 77 (84)
..++..++.|+.+ +|+.++-+
T Consensus 11 ~d~a~rv~~f~~~--ngRlPnyV 31 (33)
T PF09373_consen 11 LDMASRVNNFYES--NGRLPNYV 31 (33)
T ss_pred HHHHHHHHHHHHH--cCCCCCee
Confidence 4577888999776 57776654
No 40
>PF06806 DUF1233: Putative excisionase (DUF1233); InterPro: IPR009634 This family consists of several putative phage and prophage excisionase proteins of around 80 residues in length.; PDB: 2KVV_A.
Probab=34.95 E-value=32 Score=18.78 Aligned_cols=11 Identities=27% Similarity=0.924 Sum_probs=7.7
Q ss_pred HHHHHHHHHHH
Q psy2655 58 AKEMNDWVSNH 68 (84)
Q Consensus 58 ~~~IN~wv~~~ 68 (84)
...||.||+++
T Consensus 59 ~~~I~~wi~~~ 69 (72)
T PF06806_consen 59 RPEIDQWIENQ 69 (72)
T ss_dssp HHHHHHHHH--
T ss_pred HHHHHHHHHhh
Confidence 46899999875
No 41
>PF03960 ArsC: ArsC family; InterPro: IPR006660 Several bacterial taxon have a chromosomal resistance system, encoded by the ars operon, for the detoxification of arsenate, arsenite, and antimonite []. This system transports arsenite and antimonite out of the cell. The pump is composed of two polypeptides, the products of the arsA and arsB genes. This two-subunit enzyme produces resistance to arsenite and antimonite. Arsenate, however, must first be reduced to arsenite before it is extruded. A third gene, arsC, expands the substrate specificity to allow for arsenate pumping and resistance. ArsC is an approximately 150-residue arsenate reductase that uses reduced glutathione (GSH) to convert arsenate to arsenite with a redox active cysteine residue in the active site. ArsC forms an active quaternary complex with GSH, arsenate, and glutaredoxin 1 (Grx1). The three ligands must be present simultaneously for reduction to occur []. The arsC family also comprises the Spx proteins which are GRAM-positive bacterial transcription factors that regulate the transcription of multiple genes in response to disulphide stress []. The arsC protein structure has been solved []. It belongs to the thioredoxin superfamily fold which is defined by a beta-sheet core surrounded by alpha-helices. The active cysteine residue of ArsC is located in the loop between the first beta-strand and the first helix, which is also conserved in the Spx protein and its homologues.; PDB: 2KOK_A 1SK1_A 1SK2_A 1JZW_A 1J9B_A 1S3C_A 1SD8_A 1SD9_A 1I9D_A 1SK0_A ....
Probab=34.75 E-value=90 Score=17.68 Aligned_cols=38 Identities=24% Similarity=0.374 Sum_probs=24.4
Q ss_pred hCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccCC
Q psy2655 42 FNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKAG 80 (84)
Q Consensus 42 y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~~ 80 (84)
.+.++..+|+....-+..++..|++....| +.+++...
T Consensus 19 ~gi~~~~~d~~k~p~s~~el~~~l~~~~~~-~~~lin~~ 56 (110)
T PF03960_consen 19 NGIEYEFIDYKKEPLSREELRELLSKLGNG-PDDLINTR 56 (110)
T ss_dssp TT--EEEEETTTS---HHHHHHHHHHHTSS-GGGGB-TT
T ss_pred cCCCeEeehhhhCCCCHHHHHHHHHHhccc-HHHHhcCc
Confidence 366888999976556889999999886544 77777543
No 42
>PF13326 PSII_Pbs27: Photosystem II Pbs27; PDB: 2KND_A 2KMF_A 2Y6X_A.
Probab=31.49 E-value=25 Score=21.63 Aligned_cols=14 Identities=36% Similarity=0.460 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHH
Q psy2655 55 PAAAKEMNDWVSNH 68 (84)
Q Consensus 55 ~~~~~~IN~wv~~~ 68 (84)
..+++.||.||++-
T Consensus 78 ~~ar~~in~~vs~Y 91 (145)
T PF13326_consen 78 AEARELINDYVSRY 91 (145)
T ss_dssp HHHHHHHHHHHCCC
T ss_pred HHHHHHHHHHHHHh
Confidence 34678999999763
No 43
>cd03032 ArsC_Spx Arsenate Reductase (ArsC) family, Spx subfamily; Spx is a unique RNA polymerase (RNAP)-binding protein present in bacilli and some mollicutes. It inhibits transcription by binding to the C-terminal domain of the alpha subunit of RNAP, disrupting complex formation between RNAP and certain transcriptional activator proteins like ResD and ComA. In response to oxidative stress, Spx can also activate transcription, making it a general regulator that exerts both positive and negative control over transcription initiation. Spx has been shown to exert redox-sensitive transcriptional control over genes like trxA (TRX) and trxB (TRX reductase), genes that function in thiol homeostasis. This redox-sensitive activity is dependent on the presence of a CXXC motif, present in some members of the Spx subfamily, that acts as a thiol/disulfide switch. Spx has also been shown to repress genes in a sulfate-dependent manner independent of the presence of the CXXC motif.
Probab=29.93 E-value=1.1e+02 Score=17.44 Aligned_cols=56 Identities=13% Similarity=0.227 Sum_probs=32.3
Q ss_pred eeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCccccccccC
Q psy2655 22 IYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIKA 79 (84)
Q Consensus 22 lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~~ 79 (84)
+|..++.+-...=...++. .+.+...+|+.....+...+-.|++....| +.+++..
T Consensus 4 iY~~~~C~~c~ka~~~L~~-~gi~~~~idi~~~~~~~~el~~~~~~~~~~-~~~l~n~ 59 (115)
T cd03032 4 LYTSPSCSSCRKAKQWLEE-HQIPFEERNLFKQPLTKEELKEILSLTENG-VEDIIST 59 (115)
T ss_pred EEeCCCCHHHHHHHHHHHH-CCCceEEEecCCCcchHHHHHHHHHHhcCC-HHHHHhc
Confidence 5665554433332222222 366888899865445778899998875323 5555543
No 44
>PF06531 DUF1108: Protein of unknown function (DUF1108); InterPro: IPR009494 This entry is represented by Bacteriophage 92, Orf49. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several bacterial proteins from Staphylococcus aureus as well as a number of phage proteins. The function of this family is unknown.
Probab=26.51 E-value=46 Score=18.74 Aligned_cols=13 Identities=15% Similarity=0.751 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHhC
Q psy2655 58 AKEMNDWVSNHTN 70 (84)
Q Consensus 58 ~~~IN~wv~~~T~ 70 (84)
.+.|.+|++++|.
T Consensus 62 ~~aiyewIE~nTD 74 (86)
T PF06531_consen 62 NQAIYEWIEENTD 74 (86)
T ss_pred HHHHHHHHHhCcc
Confidence 4578899999995
No 45
>PF06891 P2_Phage_GpR: P2 phage tail completion protein R (GpR); InterPro: IPR009678 This family consists of P2 phage tail completion protein R (GpR) and similar sequences from related phage. GpR is thought to be a tail completion protein which is essential for stable head joining []. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=25.99 E-value=39 Score=20.37 Aligned_cols=56 Identities=13% Similarity=0.207 Sum_probs=39.5
Q ss_pred eEEEeeeeeecCCCCcCHHHHHHHHhhhCCcceeeCCC-CcHHHHHHHHHHHHHHhC
Q psy2655 15 ELKIANKIYFAKDIELNPAYQTQAVDNFNSELGKVDFS-QPPAAAKEMNDWVSNHTN 70 (84)
Q Consensus 15 ~l~~an~lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~-~~~~~~~~IN~wv~~~T~ 70 (84)
.+-+-++-.+..+-....+|...+.-.|.+.+.-.||. ++......|-.|+.++=.
T Consensus 26 ~~~v~~G~i~~t~~~~~~s~~~~~~~~Y~~~i~i~df~~~~d~l~~~v~~WL~~nqp 82 (135)
T PF06891_consen 26 DSFVDNGEIVATGKDPGLSFYRLAEYRYTAVISIEDFPGDPDLLMAPVLAWLRENQP 82 (135)
T ss_pred EEEEeCcEEEECcccCCCCccEEEEEEEEEEEEEEECCCCHHHHHHHHHHHHHhcCc
Confidence 33444444444443466777777778899999999996 677778899999988543
No 46
>PF10750 DUF2536: Protein of unknown function (DUF2536); InterPro: IPR019686 This entry represents proteins with unknown function appears to be restricted to Bacillus spp.
Probab=25.42 E-value=80 Score=17.01 Aligned_cols=23 Identities=9% Similarity=0.195 Sum_probs=14.6
Q ss_pred cceeeCCCCcHHHHHHHHHHHHH
Q psy2655 45 ELGKVDFSQPPAAAKEMNDWVSN 67 (84)
Q Consensus 45 ~v~~vdf~~~~~~~~~IN~wv~~ 67 (84)
.|+-+...+-....+.||.||..
T Consensus 11 KVEfFEA~~l~~LEkkIneqIe~ 33 (68)
T PF10750_consen 11 KVEFFEANDLQTLEKKINEQIEH 33 (68)
T ss_pred eeeEEecchHHHHHHHHHHHHhc
Confidence 34444444445567899999985
No 47
>COG1393 ArsC Arsenate reductase and related proteins, glutaredoxin family [Inorganic ion transport and metabolism]
Probab=24.06 E-value=79 Score=18.60 Aligned_cols=36 Identities=19% Similarity=0.422 Sum_probs=24.8
Q ss_pred hCCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 42 FNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 42 y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
++.+...+|+....-++..+..|++....| +..++.
T Consensus 24 ~gi~~~~~~y~~~~~s~~eL~~~l~~~g~~-~~~li~ 59 (117)
T COG1393 24 HGIEYTFIDYLKTPPSREELKKILSKLGDG-VEELIN 59 (117)
T ss_pred cCCCcEEEEeecCCCCHHHHHHHHHHcCcc-HHHHHH
Confidence 366777889864434788999999885544 555543
No 48
>TIGR02055 APS_reductase thioredoxin-dependent adenylylsulfate APS reductase. This model describes recently identified adenosine 5'-phosphosulfate (APS) reductase activity found in sulfate-assimilatory prokaryotes, thus separating it from the traditionally described phosphoadenosine 5'-phosphosulfate (PAPS) reductases found in bacteria and fungi. Homologous to PAPS reductase in enterobacteria, cyanobacteria, and yeast, APS reductase here clusters with, and demonstrates greater homology to plant APS reductase. Additionally, the presence of two conserved C-terminal motifs (CCXXRKXXPL & SXGCXXCT) distinguishes APS substrate specificity and serves as a FeS cluster.
Probab=24.03 E-value=1.2e+02 Score=19.12 Aligned_cols=31 Identities=3% Similarity=0.120 Sum_probs=25.7
Q ss_pred eeeeecCCCCcCHH--HHHHHHhhhCCcceeeC
Q psy2655 20 NKIYFAKDIELNPA--YQTQAVDNFNSELGKVD 50 (84)
Q Consensus 20 n~lf~~~~~~l~~~--f~~~~~~~y~~~v~~vd 50 (84)
-.+|++.++.+.+. |.+.+.+.|+..+..+.
T Consensus 20 ~vifvDTg~~FpET~~~~d~~~~~~~l~i~~~~ 52 (191)
T TIGR02055 20 KVFFLDTGRLFKETYETIDQVRERYDILIDVLS 52 (191)
T ss_pred cEEEecCCCCCHHHHHHHHHHHHHhCCceEEEc
Confidence 35899999999886 99999999988777663
No 49
>PRK12559 transcriptional regulator Spx; Provisional
Probab=23.69 E-value=1.2e+02 Score=18.07 Aligned_cols=55 Identities=5% Similarity=0.070 Sum_probs=30.8
Q ss_pred eeecCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 22 IYFAKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 22 lf~~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
+|..++..-...=...+. ..+.....+|+.....+...+..|++....| +.+++.
T Consensus 4 iY~~~~C~~crkA~~~L~-~~gi~~~~~di~~~~~s~~el~~~l~~~~~g-~~~lin 58 (131)
T PRK12559 4 LYTTASCASCRKAKAWLE-ENQIDYTEKNIVSNSMTVDELKSILRLTEEG-ATEIIS 58 (131)
T ss_pred EEeCCCChHHHHHHHHHH-HcCCCeEEEEeeCCcCCHHHHHHHHHHcCCC-HHHHHh
Confidence 555555433222222222 2366778888864445788999998863233 444443
No 50
>cd02977 ArsC_family Arsenate Reductase (ArsC) family; composed of TRX-fold arsenic reductases and similar proteins including the transcriptional regulator, Spx. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX), through a single catalytic cysteine. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases. Spx is a general regulator that exerts negative and positive control over transcription initiation by binding to the C-terminal domain of the alpha subunit of RNA polymerase.
Probab=22.78 E-value=1.3e+02 Score=16.71 Aligned_cols=35 Identities=17% Similarity=0.385 Sum_probs=21.5
Q ss_pred CCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 43 NSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 43 ~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
+.+...+|+.....+...+..|.+.. ...+.+++.
T Consensus 23 ~i~~~~idi~~~~~~~~~l~~~~~~~-~~~~~~li~ 57 (105)
T cd02977 23 GIEYEFIDYLKEPPTKEELKELLAKL-GLGVEDLFN 57 (105)
T ss_pred CCCcEEEeeccCCCCHHHHHHHHHhc-CCCHHHHHh
Confidence 55777888864334667888887653 333444444
No 51
>PF06294 DUF1042: Domain of Unknown Function (DUF1042); InterPro: IPR010441 This is a family of proteins of unknown function.; PDB: 2EE7_A.
Probab=22.28 E-value=81 Score=19.57 Aligned_cols=31 Identities=13% Similarity=0.306 Sum_probs=16.9
Q ss_pred HHHHHhhhCCcceeeCCCCcHHHHHHHHHHH
Q psy2655 35 QTQAVDNFNSELGKVDFSQPPAAAKEMNDWV 65 (84)
Q Consensus 35 ~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv 65 (84)
.+.+..+|.-.|.--+|.+.......++.|-
T Consensus 28 AEIl~~y~p~~vdlh~y~~~~s~~~Kl~NW~ 58 (158)
T PF06294_consen 28 AEILSRYYPKLVDLHNYSNGNSVAQKLNNWE 58 (158)
T ss_dssp HHHHHHH-TTT---SS----SSHHHHHHHHH
T ss_pred HHHHHHHCCCCccccccCCCCCHHHHHHHHH
Confidence 3456777777777778877666778899995
No 52
>COG5144 TFB2 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB2 [Transcription / DNA replication, recombination, and repair]
Probab=22.26 E-value=1.3e+02 Score=21.51 Aligned_cols=57 Identities=23% Similarity=0.405 Sum_probs=33.8
Q ss_pred EeeeeeecCC--CCcCHHHHHHH---------HhhhCC----cceeeCCCCcHHH--HHHHHHHHHHHhCcccc
Q psy2655 18 IANKIYFAKD--IELNPAYQTQA---------VDNFNS----ELGKVDFSQPPAA--AKEMNDWVSNHTNDKIK 74 (84)
Q Consensus 18 ~an~lf~~~~--~~l~~~f~~~~---------~~~y~~----~v~~vdf~~~~~~--~~~IN~wv~~~T~g~I~ 74 (84)
..+.+|.+++ +.+.|+|.... ++.|++ .+.++||-+...+ -+.|=.++-..-..+|+
T Consensus 82 k~~hi~~~~g~~i~ln~~fk~sl~~altgg~~~nsfgv~i~E~lvsvd~ld~ys~~kwEtILhfmVgtpeakip 155 (447)
T COG5144 82 KGNHIYLDEGLMIRLNPEFKISLMDALTGGTMENSFGVCIGEKLVSVDMLDSYSSRKWETILHFMVGTPEAKIP 155 (447)
T ss_pred ccceEEecCCceEEeChHHHHHHHHHhhccccccccceeeccceeeeehhhhhhhhhHHHHHHHHhCCccccCc
Confidence 3577888888 67788887765 445555 3456888775433 23455554432234443
No 53
>cd03035 ArsC_Yffb Arsenate Reductase (ArsC) family, Yffb subfamily; Yffb is an uncharacterized bacterial protein encoded by the yffb gene, related to the thioredoxin-fold arsenic reductases, ArsC. The structure of Yffb and the conservation of the catalytic cysteine suggest that it is likely to function as a glutathione (GSH)-dependent thiol reductase. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from GSH via glutaredoxin, through a single catalytic cysteine.
Probab=21.37 E-value=1.4e+02 Score=16.91 Aligned_cols=33 Identities=12% Similarity=0.291 Sum_probs=22.4
Q ss_pred CCcceeeCCCCcHHHHHHHHHHHHHHhCcccccccc
Q psy2655 43 NSELGKVDFSQPPAAAKEMNDWVSNHTNDKIKDLIK 78 (84)
Q Consensus 43 ~~~v~~vdf~~~~~~~~~IN~wv~~~T~g~I~~ll~ 78 (84)
+.+...+|+....-+..++..|++.. | +.+++.
T Consensus 23 ~i~~~~~di~~~p~s~~eL~~~l~~~--g-~~~li~ 55 (105)
T cd03035 23 GVAYTFHDYRKDGLDAATLERWLAKV--G-WETLLN 55 (105)
T ss_pred CCCeEEEecccCCCCHHHHHHHHHHh--C-hHHHHc
Confidence 66788888864334778999999853 3 455443
No 54
>PF04465 DUF499: Protein of unknown function (DUF499); InterPro: IPR007555 This is a family of uncharacterised hypothetical prokaryotic proteins.
Probab=20.88 E-value=1.9e+02 Score=23.74 Aligned_cols=56 Identities=14% Similarity=0.214 Sum_probs=37.6
Q ss_pred cCCCCcCHHHHHHHHhhhCCcceeeCCCCcHHHHHHHHHHHHHHh-CccccccccCCCCCC
Q psy2655 25 AKDIELNPAYQTQAVDNFNSELGKVDFSQPPAAAKEMNDWVSNHT-NDKIKDLIKAGNLTL 84 (84)
Q Consensus 25 ~~~~~l~~~f~~~~~~~y~~~v~~vdf~~~~~~~~~IN~wv~~~T-~g~I~~ll~~~~i~~ 84 (84)
...++++|++.+.+...- ..++|.....+.+..-.+|+.-- .+.=..|+.|++|+|
T Consensus 245 ~~sYPFHPel~d~L~~~w----~s~~FQRTRGvLRLma~VV~~Lw~~~~~~~LI~P~~IpL 301 (1035)
T PF04465_consen 245 RASYPFHPELFDRLYEFW----SSLPFQRTRGVLRLMAAVVRSLWERGDDSPLIMPGHIPL 301 (1035)
T ss_pred HHhCCCCHHHHHHHHHHH----hccchHHHHHHHHHHHHHHHHHHhCCCCCcEEeccCCCC
Confidence 467999999999987643 44568766556666655555533 344556777888876
No 55
>PF08876 DUF1836: Domain of unknown function (DUF1836); InterPro: IPR014975 This group of proteins are functionally uncharacterised.
Probab=20.13 E-value=89 Score=18.14 Aligned_cols=20 Identities=15% Similarity=0.147 Sum_probs=13.9
Q ss_pred cHHHHHHHHHHHHHHhCcccccc
Q psy2655 54 PPAAAKEMNDWVSNHTNDKIKDL 76 (84)
Q Consensus 54 ~~~~~~~IN~wv~~~T~g~I~~l 76 (84)
..-+...||+||+. |.|+..
T Consensus 33 ~~lT~tMInNYvK~---~li~~P 52 (105)
T PF08876_consen 33 KILTKTMINNYVKR---GLIPPP 52 (105)
T ss_pred ccCCHHHHHHHHhc---ccCCCc
Confidence 34567899999987 555443
Done!