Query         psy10995
Match_columns 265
No_of_seqs    178 out of 1311
Neff          7.4 
Searched_HMMs 46136
Date          Fri Aug 16 15:30:03 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy10995.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10995hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF00093 VWC:  von Willebrand f  99.4 5.9E-13 1.3E-17   91.7   4.4   56   79-135     1-57  (57)
  2 PF00093 VWC:  von Willebrand f  99.4   6E-13 1.3E-17   91.7   4.2   54   12-66      1-57  (57)
  3 smart00214 VWC von Willebrand   99.3 6.5E-12 1.4E-16   87.1   5.3   55   12-66      1-59  (59)
  4 smart00214 VWC von Willebrand   99.3 7.6E-12 1.6E-16   86.8   5.5   56   79-135     1-59  (59)
  5 smart00215 VWC_out von Willebr  98.2 1.1E-06 2.4E-11   62.3   3.1   42   12-53      1-42  (67)
  6 smart00215 VWC_out von Willebr  97.7 2.6E-05 5.6E-10   55.3   3.0   42   79-122     1-42  (67)
  7 PF12714 TILa:  TILa domain      97.3 0.00045 9.7E-09   47.2   4.4   40   79-118     2-43  (56)
  8 PF12714 TILa:  TILa domain      97.2 0.00055 1.2E-08   46.8   4.4   38   11-48      1-42  (56)
  9 PF05375 Pacifastin_I:  Pacifas  95.6   0.016 3.4E-07   36.6   3.0   29   20-48      3-32  (39)
 10 CHL00038 psbL photosystem II p  95.3    0.04 8.7E-07   33.8   4.1   32  231-262     5-36  (38)
 11 PRK00753 psbL photosystem II r  94.9    0.05 1.1E-06   33.6   3.7   32  231-262     6-37  (39)
 12 PF02419 PsbL:  PsbL protein;    94.8   0.034 7.3E-07   34.1   2.7   25  238-262    11-35  (37)
 13 PF05825 PSP94:  Beta-microsemi  94.6   0.043 9.4E-07   41.5   3.6   39   75-113    14-52  (94)
 14 PF05375 Pacifastin_I:  Pacifas  93.8   0.063 1.4E-06   33.8   2.6   28   89-117     4-32  (39)
 15 PF05825 PSP94:  Beta-microsemi  89.6    0.35 7.7E-06   36.5   2.9   35   10-44     17-52  (94)
 16 KOG1544|consensus               89.3    0.33 7.2E-06   44.5   3.0   57   58-116    87-147 (470)
 17 cd00061 FN1 Fibronectin type 1  84.0     1.4   3E-05   28.3   3.0   30   83-112     8-42  (43)
 18 cd00061 FN1 Fibronectin type 1  83.5     1.6 3.4E-05   28.1   3.0   28   16-43      9-42  (43)
 19 KOG1544|consensus               81.2     1.1 2.3E-05   41.3   2.2   37   11-47    111-147 (470)
 20 PF14828 Amnionless:  Amnionles  80.9    0.85 1.8E-05   43.9   1.6   34  195-228   192-234 (437)
 21 PF00039 fn1:  Fibronectin type  78.2     2.2 4.8E-05   26.8   2.3   28   83-110     6-39  (39)
 22 PF15430 SVWC:  Single domain v  76.7     3.5 7.5E-05   28.2   3.3   35  175-210     1-40  (65)
 23 TIGR03063 srtB_target sortase   76.4     2.9 6.2E-05   24.6   2.3   13  252-264    17-29  (29)
 24 PF14828 Amnionless:  Amnionles  75.8     1.3 2.8E-05   42.6   1.2   39   30-69    192-235 (437)
 25 COG4537 ComGC Competence prote  74.8     2.7 5.8E-05   32.1   2.4   18  244-261    16-33  (107)
 26 smart00058 FN1 Fibronectin typ  70.4     6.6 0.00014   25.5   3.2   31   83-113     6-43  (45)
 27 PF15430 SVWC:  Single domain v  70.3     7.2 0.00016   26.5   3.6   35   12-48      1-39  (65)
 28 smart00058 FN1 Fibronectin typ  69.7     7.4 0.00016   25.2   3.3   29   16-44      7-43  (45)
 29 PF12191 stn_TNFRSF12A:  Tumour  69.7     2.4 5.3E-05   33.6   1.2   69  183-265    36-106 (129)
 30 PRK11875 psbT photosystem II r  62.8     9.4  0.0002   22.6   2.4   20  244-263     5-24  (31)
 31 PF06459 RR_TM4-6:  Ryanodine R  61.2     7.2 0.00016   35.2   2.7   19  245-263   175-193 (274)
 32 CHL00031 psbT photosystem II p  60.9     8.9 0.00019   23.1   2.1   21  243-263     4-24  (33)
 33 PF15106 TMEM156:  TMEM156 prot  60.2     8.7 0.00019   33.2   2.8   25  237-263   173-197 (226)
 34 PF08693 SKG6:  Transmembrane a  57.9     6.2 0.00013   25.0   1.2   18  248-265    21-40  (40)
 35 PF09443 CFC:  Cripto_Frl-1_Cry  57.8     3.4 7.5E-05   25.5   0.0   25   88-113     5-29  (36)
 36 TIGR03024 arch_pef_cterm PEF-C  55.6      15 0.00033   20.9   2.5   16  249-264     9-24  (26)
 37 PF15102 TMEM154:  TMEM154 prot  55.4     6.8 0.00015   32.0   1.4   19  246-264    66-84  (146)
 38 PF04530 Viral_Beta_CD:  Viral   51.9      36 0.00077   26.9   4.8   10  219-228     5-14  (122)
 39 PF06692 MNSV_P7B:  Melon necro  51.7      17 0.00036   24.6   2.5   18  245-262    14-31  (61)
 40 TIGR03068 srtB_sig_NPQTN sorta  48.8      17 0.00037   21.8   2.0   15  245-262    11-25  (33)
 41 PF01405 PsbT:  Photosystem II   48.0      26 0.00056   20.5   2.6   20  244-263     5-24  (29)
 42 PF15168 TRIQK:  Triple QxxK/R   45.5      23 0.00049   25.6   2.6   11  252-262    57-67  (79)
 43 COG4839 FtsL Protein required   44.6      23  0.0005   27.8   2.8   22  241-262    37-58  (120)
 44 PF09889 DUF2116:  Uncharacteri  43.7      35 0.00076   23.4   3.3   15  240-254    35-49  (59)
 45 PHA02831 EEV host range protei  43.5      89  0.0019   28.2   6.7   13  251-263   243-255 (268)
 46 KOG3208|consensus               42.2      25 0.00054   30.7   2.9   22  240-262   208-229 (231)
 47 PHA02819 hypothetical protein;  41.6      44 0.00096   23.8   3.5   21  240-260    44-64  (71)
 48 PF01034 Syndecan:  Syndecan do  39.2      10 0.00022   26.5   0.1   15  250-264    23-37  (64)
 49 PF09889 DUF2116:  Uncharacteri  38.9      30 0.00064   23.8   2.3   21  237-257    38-58  (59)
 50 PHA03240 envelope glycoprotein  38.4      27 0.00058   30.5   2.4   26  237-262   207-232 (258)
 51 PHA02650 hypothetical protein;  36.2      66  0.0014   23.4   3.8   19  242-260    49-67  (81)
 52 PHA03054 IMV membrane protein;  36.0      71  0.0015   22.8   3.8   20  241-260    47-66  (72)
 53 PF06084 Cytomega_TRL10:  Cytom  35.9      20 0.00044   28.0   1.2   14  251-264    70-83  (150)
 54 PF03908 Sec20:  Sec20;  InterP  34.8      41 0.00088   24.9   2.7   17  247-263    75-91  (92)
 55 PF15220 HILPDA:  Hypoxia-induc  34.5      50  0.0011   22.3   2.8   18  243-260     6-23  (63)
 56 PF11027 DUF2615:  Protein of u  34.2      40 0.00088   25.9   2.6   16  247-262    57-72  (103)
 57 COG3726 AhpA Uncharacterized m  33.6      44 0.00095   28.7   3.0   22  240-261   161-183 (214)
 58 PHA02844 putative transmembran  33.6      68  0.0015   23.1   3.5   16  244-259    50-65  (75)
 59 PF10717 ODV-E18:  Occlusion-de  32.3      52  0.0011   24.2   2.8   15  248-262    31-45  (85)
 60 PF12911 OppC_N:  N-terminal TM  32.0      54  0.0012   21.5   2.7   15  246-260    19-33  (56)
 61 PTZ00370 STEVOR; Provisional    31.9      44 0.00095   30.4   2.9   20  246-265   264-283 (296)
 62 TIGR01478 STEVOR variant surfa  31.7      44 0.00096   30.3   2.9   20  246-265   268-287 (295)
 63 PF11346 DUF3149:  Protein of u  30.9      65  0.0014   20.6   2.7   16  246-261    14-29  (42)
 64 PF06679 DUF1180:  Protein of u  30.5      67  0.0015   26.7   3.6   25  240-264    94-118 (163)
 65 PF04272 Phospholamban:  Phosph  30.1      98  0.0021   20.1   3.4   26  236-261    20-46  (52)
 66 PF00746 Gram_pos_anchor:  Gram  29.9      17 0.00038   22.2   0.0   21  244-264    19-39  (39)
 67 PF04021 Class_IIIsignal:  Clas  29.6      73  0.0016   18.4   2.6    6  252-257    16-21  (28)
 68 TIGR01294 P_lamban phospholamb  29.5      99  0.0021   20.1   3.4   27  235-261    19-46  (52)
 69 PHA03093 EEV glycoprotein; Pro  29.4      60  0.0013   27.5   3.1   22  239-260    38-59  (185)
 70 PF06024 DUF912:  Nucleopolyhed  28.5      49  0.0011   25.1   2.3    6  256-261    77-82  (101)
 71 PF15188 CCDC-167:  Coiled-coil  28.5      61  0.0013   24.0   2.7   17  245-261    68-84  (85)
 72 PHA02673 ORF109 EEV glycoprote  28.4      65  0.0014   26.7   3.1   21  240-260    36-56  (161)
 73 PF15201 Rod_cone_degen:  Progr  28.2      46 0.00099   21.8   1.7   15  250-264     4-18  (54)
 74 PF06682 DUF1183:  Protein of u  28.0      45 0.00098   30.8   2.3   16  247-262   159-174 (318)
 75 PF13150 DUF3989:  Protein of u  27.7      91   0.002   23.0   3.5   24  237-260    23-46  (85)
 76 KOG3462|consensus               27.5   1E+02  0.0022   23.3   3.6   30  233-262    24-53  (105)
 77 PHA01815 hypothetical protein   27.4   1E+02  0.0023   20.0   3.2   14  249-262    38-51  (55)
 78 PF09604 Potass_KdpF:  F subuni  27.1      78  0.0017   17.9   2.3   15  248-262     4-18  (25)
 79 PF05781 MRVI1:  MRVI1 protein;  27.0      55  0.0012   32.3   2.8   21  242-262   478-498 (538)
 80 PHA02955 hypothetical protein;  26.8      57  0.0012   28.3   2.6   21  244-264   181-201 (213)
 81 PF02495 7kD_coat:  7kD viral c  25.3      61  0.0013   21.9   2.1   16  246-261     2-17  (59)
 82 PF05545 FixQ:  Cbb3-type cytoc  24.8 1.2E+02  0.0026   19.5   3.4   19  243-261    12-30  (49)
 83 COG0690 SecE Preprotein transl  24.8 1.1E+02  0.0024   21.8   3.4   21  237-257    39-59  (73)
 84 KOG3385|consensus               24.1      72  0.0016   25.0   2.5   15  247-261   100-114 (118)
 85 PF02553 CbiN:  Cobalt transpor  23.9      71  0.0015   23.0   2.2   16  246-261     6-21  (74)
 86 PF03742 PetN:  PetN ;  InterPr  23.4 1.3E+02  0.0027   17.6   2.8   18  246-263     9-26  (29)
 87 PHA03231 glycoprotein BALF4; P  23.2      60  0.0013   33.9   2.4   17  246-262   709-725 (829)
 88 PHA03255 BDLF3; Provisional     23.1      63  0.0014   27.2   2.1   15  245-259   194-208 (234)
 89 PF11712 Vma12:  Endoplasmic re  23.1      97  0.0021   24.8   3.2   25  235-259    70-94  (142)
 90 PF05084 GRA6:  Granule antigen  22.6   1E+02  0.0023   25.7   3.3   27  237-264   146-173 (215)
 91 TIGR02115 potass_kdpF K+-trans  22.6   1E+02  0.0022   17.6   2.3    7  256-262    11-17  (26)
 92 PF07204 Orthoreo_P10:  Orthore  22.4      60  0.0013   24.5   1.7   12  254-265    55-66  (98)
 93 PF12575 DUF3753:  Protein of u  22.0 1.5E+02  0.0032   21.3   3.5   12  247-258    53-64  (72)
 94 PHA02692 hypothetical protein;  21.8 1.4E+02   0.003   21.3   3.3   10  250-259    54-63  (70)
 95 PF03935 SKN1:  Beta-glucan syn  21.7      78  0.0017   31.1   2.8   27  234-261    36-62  (504)
 96 PTZ00227 variable surface prot  21.3      65  0.0014   30.9   2.1   22  241-262   324-346 (418)
 97 PF11857 DUF3377:  Domain of un  21.3      73  0.0016   22.9   1.8   21  240-260    34-54  (74)
 98 PF09788 Tmemb_55A:  Transmembr  20.7 1.2E+02  0.0027   27.0   3.6   12  222-233   179-190 (256)
 99 PRK03625 tatE twin arginine tr  20.1      75  0.0016   22.4   1.7   10  251-260    11-20  (67)
100 PF05795 Plasmodium_Vir:  Plasm  20.1      77  0.0017   28.6   2.3   19  244-262   285-303 (354)
101 PHA02975 hypothetical protein;  20.0 1.3E+02  0.0027   21.4   2.8   16  244-259    46-61  (69)

No 1  
>PF00093 VWC:  von Willebrand factor type C domain;  InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=99.37  E-value=5.9e-13  Score=91.72  Aligned_cols=56  Identities=46%  Similarity=1.234  Sum_probs=34.3

Q ss_pred             ccccCCeEEcCCcEEccCCCccceecCCeEEEeecCCCCCCCCCC-CCCCCCCCcccc
Q psy10995         79 CMDFHGELKLEGQHWTLNPCTDCTCREGKVLCYSQQCPAAACSNP-RPPEPGTCCPRC  135 (265)
Q Consensus        79 C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~v~C~~~~Cp~~~C~~~-~~~~~g~CCp~C  135 (265)
                      |. ++|++|.+|++|+++.|+.|+|.+|+|.|.++.||.+.|+++ ....+|+|||+|
T Consensus         1 C~-~~g~~y~~g~~w~~~~C~~C~C~~G~v~C~~~~Cp~~~C~~~~~~~~~g~CCp~C   57 (57)
T PF00093_consen    1 CS-FNGRVYQNGESWHPDPCTTCTCQDGEVQCSRIQCPPLDCPNPELVKPPGECCPVC   57 (57)
T ss_dssp             EE-SSS-EE-SS-EE-S-SSEEEEEETTEEEEEE---S-SS-S-------SS-S--S-
T ss_pred             Ce-eCCEEeCCCCEECCCCCcEeEecCCEEEEcCCCCcCCCCCCCceecCCCCcCCCC
Confidence            55 899999999999999999999999999999999999999998 223899999986


No 2  
>PF00093 VWC:  von Willebrand factor type C domain;  InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=99.36  E-value=6e-13  Score=91.72  Aligned_cols=54  Identities=37%  Similarity=1.003  Sum_probs=33.3

Q ss_pred             ecCCCeEeCCCCeee-cCCcceEECCCceEEeeccCCCCCCCCC--CCCCCCCCcCCC
Q psy10995         12 QDEDGVWRDDSEIWR-DGCRECHCIDGREMCTLIACPTLSCQHP--IPANSTQCCPYC   66 (265)
Q Consensus        12 c~~~G~~y~~Ge~w~-~~C~~C~C~~G~v~C~~~~Cp~~~C~~~--~~~~~g~CCp~C   66 (265)
                      |.++|+.|.+|++|+ ++|+.|+|++|.|.|.++.||+++|+++  +. .+|+|||+|
T Consensus         1 C~~~g~~y~~g~~w~~~~C~~C~C~~G~v~C~~~~Cp~~~C~~~~~~~-~~g~CCp~C   57 (57)
T PF00093_consen    1 CSFNGRVYQNGESWHPDPCTTCTCQDGEVQCSRIQCPPLDCPNPELVK-PPGECCPVC   57 (57)
T ss_dssp             EESSS-EE-SS-EE-S-SSEEEEEETTEEEEEE---S-SS-S--------SS-S--S-
T ss_pred             CeeCCEEeCCCCEECCCCCcEeEecCCEEEEcCCCCcCCCCCCCceec-CCCCcCCCC
Confidence            568999999999999 6699999999999999999999999998  55 899999986


No 3  
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=99.27  E-value=6.5e-12  Score=87.13  Aligned_cols=55  Identities=35%  Similarity=0.984  Sum_probs=49.3

Q ss_pred             ecCCCeEeCCCCeee-cCCcceEECCCc-eEEeeccCCCC-CCCCCCC-CCCCCCcCCC
Q psy10995         12 QDEDGVWRDDSEIWR-DGCRECHCIDGR-EMCTLIACPTL-SCQHPIP-ANSTQCCPYC   66 (265)
Q Consensus        12 c~~~G~~y~~Ge~w~-~~C~~C~C~~G~-v~C~~~~Cp~~-~C~~~~~-~~~g~CCp~C   66 (265)
                      |.++|+.|++|++|+ ++|+.|+|.+|. |.|.++.||++ +|+++.. ..+|+|||+|
T Consensus         1 C~~~g~~y~~G~~W~~~~C~~C~C~~g~~v~C~~~~Cp~~~~C~~~~~~~~~g~CCp~C   59 (59)
T smart00214        1 CVHNGEVYNDGETWKPDPCQICTCLDGETVLCDPVECPPPPDCPNPERVKPPGECCPRC   59 (59)
T ss_pred             CccCCEEeCCCCEECCCCCeECCcCCCCEEEeeeecCCCCCCCCCCcccCCCCCcCCCC
Confidence            568999999999999 569999999999 99999999986 8998872 3899999976


No 4  
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=99.27  E-value=7.6e-12  Score=86.78  Aligned_cols=56  Identities=46%  Similarity=1.271  Sum_probs=50.8

Q ss_pred             ccccCCeEEcCCcEEccCCCccceecCCe-EEEeecCCCCC-CCCCCCC-CCCCCCcccc
Q psy10995         79 CMDFHGELKLEGQHWTLNPCTDCTCREGK-VLCYSQQCPAA-ACSNPRP-PEPGTCCPRC  135 (265)
Q Consensus        79 C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~-v~C~~~~Cp~~-~C~~~~~-~~~g~CCp~C  135 (265)
                      |. ++|++|.+|++|++++|++|+|.+|. |.|.++.||.+ .|+++.. ..+|+|||+|
T Consensus         1 C~-~~g~~y~~G~~W~~~~C~~C~C~~g~~v~C~~~~Cp~~~~C~~~~~~~~~g~CCp~C   59 (59)
T smart00214        1 CV-HNGEVYNDGETWKPDPCQICTCLDGETVLCDPVECPPPPDCPNPERVKPPGECCPRC   59 (59)
T ss_pred             Cc-cCCEEeCCCCEECCCCCeECCcCCCCEEEeeeecCCCCCCCCCCcccCCCCCcCCCC
Confidence            55 89999999999999999999999999 99999999986 8998872 3899999976


No 5  
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=98.21  E-value=1.1e-06  Score=62.28  Aligned_cols=42  Identities=29%  Similarity=0.770  Sum_probs=38.4

Q ss_pred             ecCCCeEeCCCCeeecCCcceEECCCceEEeeccCCCCCCCC
Q psy10995         12 QDEDGVWRDDSEIWRDGCRECHCIDGREMCTLIACPTLSCQH   53 (265)
Q Consensus        12 c~~~G~~y~~Ge~w~~~C~~C~C~~G~v~C~~~~Cp~~~C~~   53 (265)
                      |.++|+.|++|+.|.++|.+|+|.+|.+.|+++.|++..|..
T Consensus         1 C~~~G~~y~~g~~w~d~Cn~CtC~~G~v~Ct~~~C~~~~c~l   42 (67)
T smart00215        1 CYNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCGPKPCLL   42 (67)
T ss_pred             CeECCEEcCCCCccccCCCCCEecCCCEEecCCcCCCchhhc
Confidence            568999999999999999999999999999999999877653


No 6  
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=97.73  E-value=2.6e-05  Score=55.33  Aligned_cols=42  Identities=31%  Similarity=0.813  Sum_probs=37.1

Q ss_pred             ccccCCeEEcCCcEEccCCCccceecCCeEEEeecCCCCCCCCC
Q psy10995         79 CMDFHGELKLEGQHWTLNPCTDCTCREGKVLCYSQQCPAAACSN  122 (265)
Q Consensus        79 C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~v~C~~~~Cp~~~C~~  122 (265)
                      |+ ++|+.|.+|++|. +.|..|+|.+|.|.|.+..|+..+|..
T Consensus         1 C~-~~G~~y~~g~~w~-d~Cn~CtC~~G~v~Ct~~~C~~~~c~l   42 (67)
T smart00215        1 CY-NNGSYYPPGAKWD-DDCNRCTCLNGRVSCTKVWCGPKPCLL   42 (67)
T ss_pred             Ce-ECCEEcCCCCccc-cCCCCCEecCCCEEecCCcCCCchhhc
Confidence            44 7899999999997 899999999999999999998766653


No 7  
>PF12714 TILa:  TILa domain
Probab=97.29  E-value=0.00045  Score=47.22  Aligned_cols=40  Identities=38%  Similarity=1.021  Sum_probs=34.2

Q ss_pred             ccccCCeEEcCCcEEccCCCc-ccee-cCCeEEEeecCCCCC
Q psy10995         79 CMDFHGELKLEGQHWTLNPCT-DCTC-REGKVLCYSQQCPAA  118 (265)
Q Consensus        79 C~~~~g~~y~~Ge~w~~~~C~-~C~C-~~g~v~C~~~~Cp~~  118 (265)
                      |.+++|+.|+.||+|..+.|+ .|+| .+|.|.|+...|++.
T Consensus         2 C~d~~G~yy~~Ge~~~~~~C~~~C~C~~~g~v~C~~~~C~~~   43 (56)
T PF12714_consen    2 CTDYNGRYYPPGESWWTDDCTQRCTCQPNGQVQCQPSSCPPG   43 (56)
T ss_pred             CcCcCCEEECCCCEEeCCCCCEeEEEcCCCeEEEeCCCCCCC
Confidence            556799999999999987776 7999 689999999888753


No 8  
>PF12714 TILa:  TILa domain
Probab=97.24  E-value=0.00055  Score=46.79  Aligned_cols=38  Identities=29%  Similarity=0.764  Sum_probs=32.6

Q ss_pred             eec-CCCeEeCCCCeee-cCCc-ceEE-CCCceEEeeccCCC
Q psy10995         11 CQD-EDGVWRDDSEIWR-DGCR-ECHC-IDGREMCTLIACPT   48 (265)
Q Consensus        11 cc~-~~G~~y~~Ge~w~-~~C~-~C~C-~~G~v~C~~~~Cp~   48 (265)
                      +|. ++|+.|..||+|. +.|+ .|+| .+|.|+|++..|++
T Consensus         1 GC~d~~G~yy~~Ge~~~~~~C~~~C~C~~~g~v~C~~~~C~~   42 (56)
T PF12714_consen    1 GCTDYNGRYYPPGESWWTDDCTQRCTCQPNGQVQCQPSSCPP   42 (56)
T ss_pred             CCcCcCCEEECCCCEEeCCCCCEeEEEcCCCeEEEeCCCCCC
Confidence            355 9999999999998 6786 7999 67999999998875


No 9  
>PF05375 Pacifastin_I:  Pacifastin inhibitor (LCMII);  InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) [].  Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=95.56  E-value=0.016  Score=36.63  Aligned_cols=29  Identities=34%  Similarity=0.909  Sum_probs=24.4

Q ss_pred             CCCCeeecCCcceEEC-CCceEEeeccCCC
Q psy10995         20 DDSEIWRDGCRECHCI-DGREMCTLIACPT   48 (265)
Q Consensus        20 ~~Ge~w~~~C~~C~C~-~G~v~C~~~~Cp~   48 (265)
                      ..|+.|...|..|+|. +|...|+++.|++
T Consensus         3 ~pG~~~k~dCN~C~C~~~G~~~CT~~~C~~   32 (39)
T PF05375_consen    3 TPGSTFKKDCNTCTCSSDGKWACTRKACPP   32 (39)
T ss_dssp             -TTTEEEETTEEEEEBTTSSEEEE-SSSS-
T ss_pred             CCCCeeeCCCCCCCCCCCCcCccCcccCcC
Confidence            3689999999999999 8999999999975


No 10 
>CHL00038 psbL photosystem II protein L
Probab=95.31  E-value=0.04  Score=33.83  Aligned_cols=32  Identities=31%  Similarity=0.541  Sum_probs=24.7

Q ss_pred             CCCccchhhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995        231 SPESATNRQIRESLYILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  262 (265)
                      +|-.+.-+-=|-|||++|+.||.+.+||--|.
T Consensus         5 NPN~q~VELNRTSLy~GLLlifvl~vlfssyf   36 (38)
T CHL00038          5 NPNKQNVELNRTSLYWGLLLIFVLAVLFSNYF   36 (38)
T ss_pred             CCCCCccchhhhhHHHHHHHHHHHHHHHHHHh
Confidence            34444444457899999999999999998774


No 11 
>PRK00753 psbL photosystem II reaction center L; Provisional
Probab=94.93  E-value=0.05  Score=33.56  Aligned_cols=32  Identities=31%  Similarity=0.564  Sum_probs=24.2

Q ss_pred             CCCccchhhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995        231 SPESATNRQIRESLYILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  262 (265)
                      +|-.+.-+--|-|||++|+.||.+.+||--|.
T Consensus         6 NpN~q~VELNRTSLy~GlLlifvl~vLFssYf   37 (39)
T PRK00753          6 NPNKQPVELNRTSLYLGLLLVFVLGILFSSYF   37 (39)
T ss_pred             CCCCCCceechhhHHHHHHHHHHHHHHHHhhc
Confidence            33334434457899999999999999998773


No 12 
>PF02419 PsbL:  PsbL protein;  InterPro: IPR003372 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].  This family represents the low molecular weight transmembrane protein PsbL found in PSII. PsbL is located in a gene cluster with PsbE, PsbF and PsbJ (PsbEFJL). Both PsbL and PsbJ (IPR002682 from INTERPRO) are essential for proper assembly of the OEC. Mutations in PsbL prevent the formation of both PSII core dimers and PSII-light harvesting complex []. In addition, both PsbL and PsbJ are involved in the unidirectional flow of electrons, where PsbJ regulates the forward electron flow from D2 (Qa) to the plastoquinone pool, and PsbL prevents the reduction of PSII by back electron flow from plastoquinol protecting PSII from photo-inactivation [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_L 3A0B_l 3ARC_l 1S5L_l 2AXT_l 3BZ2_L 4FBY_L 3PRQ_L 3PRR_L 3KZI_L ....
Probab=94.77  E-value=0.034  Score=34.08  Aligned_cols=25  Identities=36%  Similarity=0.595  Sum_probs=20.4

Q ss_pred             hhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995        238 RQIRESLYILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       238 ~~~~~~~~~~~~~~~~~~~~~~~~~  262 (265)
                      +-=|-|||++|+.||.+.+||--|.
T Consensus        11 ELNRTSLY~GLllifvl~vLFssyf   35 (37)
T PF02419_consen   11 ELNRTSLYWGLLLIFVLAVLFSSYF   35 (37)
T ss_dssp             E--CCHHHHHHHHHHHHHHHHHHHH
T ss_pred             chhHHhHHHHHHHHHHHHHHhhhhh
Confidence            3346899999999999999998774


No 13 
>PF05825 PSP94:  Beta-microseminoprotein (PSP-94);  InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=94.61  E-value=0.043  Score=41.46  Aligned_cols=39  Identities=26%  Similarity=0.755  Sum_probs=33.7

Q ss_pred             cCccccccCCeEEcCCcEEccCCCccceecCCeEEEeec
Q psy10995         75 EAAVCMDFHGELKLEGQHWTLNPCTDCTCREGKVLCYSQ  113 (265)
Q Consensus        75 ~~~~C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~v~C~~~  113 (265)
                      ....|.+.+|+.|.-|++|..+.|..|+|.+..+.|-..
T Consensus        14 ~~~~C~D~dG~~h~l~S~W~T~~C~~C~C~~~Gi~CC~~   52 (94)
T PF05825_consen   14 SPNECKDWDGKKHPLNSTWKTDDCEECSCSEDGISCCST   52 (94)
T ss_dssp             STTCEEETTTEEEETTEEEEETTSEEEEEESSEEEEEES
T ss_pred             CCCccEeCCCcEEeCCCeECCCCCcEEEecCCceEeccC
Confidence            345798889999999999999999999999888888543


No 14 
>PF05375 Pacifastin_I:  Pacifastin inhibitor (LCMII);  InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) [].  Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=93.81  E-value=0.063  Score=33.85  Aligned_cols=28  Identities=36%  Similarity=1.022  Sum_probs=23.4

Q ss_pred             CCcEEccCCCccceec-CCeEEEeecCCCC
Q psy10995         89 EGQHWTLNPCTDCTCR-EGKVLCYSQQCPA  117 (265)
Q Consensus        89 ~Ge~w~~~~C~~C~C~-~g~v~C~~~~Cp~  117 (265)
                      +|++|. +.|..|+|. +|.+.|.++.|++
T Consensus         4 pG~~~k-~dCN~C~C~~~G~~~CT~~~C~~   32 (39)
T PF05375_consen    4 PGSTFK-KDCNTCTCSSDGKWACTRKACPP   32 (39)
T ss_dssp             TTTEEE-ETTEEEEEBTTSSEEEE-SSSS-
T ss_pred             CCCeee-CCCCCCCCCCCCcCccCcccCcC
Confidence            588888 469999999 8999999999975


No 15 
>PF05825 PSP94:  Beta-microseminoprotein (PSP-94);  InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=89.57  E-value=0.35  Score=36.53  Aligned_cols=35  Identities=34%  Similarity=0.797  Sum_probs=29.7

Q ss_pred             ceecCCCeEeCCCCeee-cCCcceEECCCceEEeec
Q psy10995         10 ECQDEDGVWRDDSEIWR-DGCRECHCIDGREMCTLI   44 (265)
Q Consensus        10 ~cc~~~G~~y~~Ge~w~-~~C~~C~C~~G~v~C~~~   44 (265)
                      .|...+|+.|.-|+.|. +.|..|+|.+..+.|-..
T Consensus        17 ~C~D~dG~~h~l~S~W~T~~C~~C~C~~~Gi~CC~~   52 (94)
T PF05825_consen   17 ECKDWDGKKHPLNSTWKTDDCEECSCSEDGISCCST   52 (94)
T ss_dssp             CEEETTTEEEETTEEEEETTSEEEEEESSEEEEEES
T ss_pred             ccEeCCCcEEeCCCeECCCCCcEEEecCCceEeccC
Confidence            45668999999999999 899999999877877543


No 16 
>KOG1544|consensus
Probab=89.30  E-value=0.33  Score=44.55  Aligned_cols=57  Identities=25%  Similarity=0.659  Sum_probs=44.2

Q ss_pred             CCCCCcCC----CCCCCCCCccCccccccCCeEEcCCcEEccCCCccceecCCeEEEeecCCC
Q psy10995         58 NSTQCCPY----CPDATSSGHEAAVCMDFHGELKLEGQHWTLNPCTDCTCREGKVLCYSQQCP  116 (265)
Q Consensus        58 ~~g~CCp~----C~~~~~~~~~~~~C~~~~g~~y~~Ge~w~~~~C~~C~C~~g~v~C~~~~Cp  116 (265)
                      ...+|||.    |.+......+..+|+ .+|+.|.+|.... +.|..|+|.++.+.|.-..|-
T Consensus        87 ~~sDCCPDf~~fCRg~pp~~Qp~~gc~-~gg~~y~~G~t~~-~NCn~CTC~n~qWKCdq~~CL  147 (470)
T KOG1544|consen   87 TVSDCCPDFWDFCRGVPPPFQPIQGCM-HGGRIYPVGGTYW-DNCNRCTCQNRQWKCDQEPCL  147 (470)
T ss_pred             CCcccCcCHHHHhcCCCCCCCChhhcc-cCceecccCCeee-ccccceeecCCceecCCceee
Confidence            35678883    665433333446898 7999999999888 689999999999999987774


No 17 
>cd00061 FN1 Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices.  Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
Probab=84.03  E-value=1.4  Score=28.34  Aligned_cols=30  Identities=27%  Similarity=0.626  Sum_probs=24.7

Q ss_pred             CCeEEcCCcEEcc---CCCccceecC--CeEEEee
Q psy10995         83 HGELKLEGQHWTL---NPCTDCTCRE--GKVLCYS  112 (265)
Q Consensus        83 ~g~~y~~Ge~w~~---~~C~~C~C~~--g~v~C~~  112 (265)
                      .++.|..|++|..   ..=..|+|..  |.+.|..
T Consensus         8 t~~~Y~ige~W~r~~~~~~~~C~C~G~~G~~~C~~   42 (43)
T cd00061           8 TGVFYRVGETWERPSEGHVLQCTCLGNRGEARCDP   42 (43)
T ss_pred             cccEEeCCCEEEeecCCeEEEEEEeCCCCceEEee
Confidence            4899999999984   3456899998  9999875


No 18 
>cd00061 FN1 Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices.  Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
Probab=83.50  E-value=1.6  Score=28.10  Aligned_cols=28  Identities=29%  Similarity=0.803  Sum_probs=23.5

Q ss_pred             CeEeCCCCeee---c-CCcceEECC--CceEEee
Q psy10995         16 GVWRDDSEIWR---D-GCRECHCID--GREMCTL   43 (265)
Q Consensus        16 G~~y~~Ge~w~---~-~C~~C~C~~--G~v~C~~   43 (265)
                      |+.|+.|++|.   + .=..|+|..  |.+.|.+
T Consensus         9 ~~~Y~ige~W~r~~~~~~~~C~C~G~~G~~~C~~   42 (43)
T cd00061           9 GVFYRVGETWERPSEGHVLQCTCLGNRGEARCDP   42 (43)
T ss_pred             ccEEeCCCEEEeecCCeEEEEEEeCCCCceEEee
Confidence            89999999997   3 345799998  9999875


No 19 
>KOG1544|consensus
Probab=81.22  E-value=1.1  Score=41.33  Aligned_cols=37  Identities=19%  Similarity=0.396  Sum_probs=33.9

Q ss_pred             eecCCCeEeCCCCeeecCCcceEECCCceEEeeccCC
Q psy10995         11 CQDEDGVWRDDSEIWRDGCRECHCIDGREMCTLIACP   47 (265)
Q Consensus        11 cc~~~G~~y~~Ge~w~~~C~~C~C~~G~v~C~~~~Cp   47 (265)
                      +|..+|+.|.+|.+..++|..|||+++...|....|-
T Consensus       111 gc~~gg~~y~~G~t~~~NCn~CTC~n~qWKCdq~~CL  147 (470)
T KOG1544|consen  111 GCMHGGRIYPVGGTYWDNCNRCTCQNRQWKCDQEPCL  147 (470)
T ss_pred             hcccCceecccCCeeeccccceeecCCceecCCceee
Confidence            5678999999999999999999999999999988884


No 20 
>PF14828 Amnionless:  Amnionless
Probab=80.93  E-value=0.85  Score=43.91  Aligned_cols=34  Identities=15%  Similarity=0.374  Sum_probs=28.7

Q ss_pred             eEEEECCCCe-------e--cCCCCcCCCcccCCCCCCCCCCC
Q psy10995        195 TNCTCLHGGQ-------K--KCTASMCERKALEKGDTPTPPSG  228 (265)
Q Consensus       195 ~~C~C~~g~v-------~--~Cp~~~C~~p~~~~g~CC~~C~~  228 (265)
                      .-|.|.+..+       .  +|+.+.|.+|+...|+||+.|=.
T Consensus       192 ~gC~C~n~~~l~~ICs~v~~~C~~~~C~~pl~P~GhCC~iCGa  234 (437)
T PF14828_consen  192 SGCPCGNDEVLEWICSNVLQRCPKPHCRSPLRPEGHCCPICGA  234 (437)
T ss_pred             ccCccCcccchhhhhHHhhCcCCCCccCCCCCCCCCchhhcce
Confidence            5889976655       4  89999999999999999999853


No 21 
>PF00039 fn1:  Fibronectin type I domain;  InterPro: IPR000083 Fibronectin type I repeats are one of the three repeats found in the fibronectin protein. Fibronectin is a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin. Type I domain (FN1) is approximately 40 residues in length. Four conserved cysteines are involved in disulphide bonds. The 3D structure of the FN1 domain has been determined [, , ]. It consists of two antiparallel beta-sheets, first a double-stranded one, that is linked by a disulphide bond to a triple-stranded beta-sheet. The second conserved disulphide bridge links the C-terminal adjacent strands of the domain.  In human tissue plasminogen activator chain A the FN1 domain together with the following epidermal growth factor (EGF)-like domain are involved in fibrin-binding []. It has been suggested that these two modules form a single structural and functional unit []. The two domains keep their specific tertiary structure, but interact intimately to bury a hydrophobic core; the inter-module linker makes up the third strand of the EGF-module's major beta-sheet.; GO: 0005576 extracellular region; PDB: 2RKY_A 1FBR_A 2RL0_D 3M7P_A 3MQL_A 3ZRZ_A 2RKZ_F 2CKU_A 2CG6_A 3CAL_C ....
Probab=78.22  E-value=2.2  Score=26.77  Aligned_cols=28  Identities=29%  Similarity=0.716  Sum_probs=21.0

Q ss_pred             CCeEEcCCcEEcc---CCCccceecC---CeEEE
Q psy10995         83 HGELKLEGQHWTL---NPCTDCTCRE---GKVLC  110 (265)
Q Consensus        83 ~g~~y~~Ge~w~~---~~C~~C~C~~---g~v~C  110 (265)
                      .++.|..|++|..   ..=..|+|..   |.+.|
T Consensus         6 t~~~Y~vge~W~R~~~g~~~~CtC~G~g~G~~~C   39 (39)
T PF00039_consen    6 TGQFYQVGETWERPYQGHMMQCTCLGNGRGEWKC   39 (39)
T ss_dssp             TTEEEETTEEEEEEETSSEEEEEEEETTTTEEEE
T ss_pred             CccEEeCCCEEEeecCCeEEEeEECCCCccceEC
Confidence            4999999999983   3345699953   66765


No 22 
>PF15430 SVWC:  Single domain von Willebrand factor type C
Probab=76.71  E-value=3.5  Score=28.16  Aligned_cols=35  Identities=31%  Similarity=0.779  Sum_probs=28.7

Q ss_pred             ccccCCeEeeCCCEecC-CCceEEEEC--CCCe--ecCCCC
Q psy10995        175 CLLENGTLLHVGDTYFA-DPCTNCTCL--HGGQ--KKCTAS  210 (265)
Q Consensus       175 C~~~~g~~y~~Ge~w~~-~~C~~C~C~--~g~v--~~Cp~~  210 (265)
                      |.| +|+.+.+|+.|.. ++|..=+|.  +|.+  ..|+..
T Consensus         1 C~y-~~~~~~~g~~~~~~~pC~~~~C~~~~~~v~v~~C~~~   40 (65)
T PF15430_consen    1 CYY-NGRTIPSGESYNPEEPCERWTCDASDGYVTVEGCPPP   40 (65)
T ss_pred             CEE-CCEEcCCCcEecCCCCCceEEEECCCCEEEEEeCCCC
Confidence            556 7999999999874 799999998  5666  888874


No 23 
>TIGR03063 srtB_target sortase B cell surface sorting signal. Two different classes of sorting signal, both analogous to the sortase A signal LPXTG, may be recognized by the sortase SrtB. These are given as NXZTN and NPKXZ. Proteins sorted by this class of sortase are less common than the sortase A and LPXTG system. This model describes a number of cell surface protein C-terminal regions from Gram-positive bacteria that appear to be sortase B (SrtB) sorting signals.
Probab=76.44  E-value=2.9  Score=24.57  Aligned_cols=13  Identities=31%  Similarity=0.327  Sum_probs=8.1

Q ss_pred             HHHHHHHHHhhhc
Q psy10995        252 FAVSTLFAFYACR  264 (265)
Q Consensus       252 ~~~~~~~~~~~~~  264 (265)
                      |..|.+||+++.|
T Consensus        17 ~~~s~~~Li~k~~   29 (29)
T TIGR03063        17 FLGSGLFLIRKRK   29 (29)
T ss_pred             HHHHHHHHhhccC
Confidence            3446678887653


No 24 
>PF14828 Amnionless:  Amnionless
Probab=75.76  E-value=1.3  Score=42.62  Aligned_cols=39  Identities=28%  Similarity=0.913  Sum_probs=32.7

Q ss_pred             cceEECCCc---eEEeec--cCCCCCCCCCCCCCCCCCcCCCCCC
Q psy10995         30 RECHCIDGR---EMCTLI--ACPTLSCQHPIPANSTQCCPYCPDA   69 (265)
Q Consensus        30 ~~C~C~~G~---v~C~~~--~Cp~~~C~~~~~~~~g~CCp~C~~~   69 (265)
                      +-|.|.+..   .+|..+  .||.+.|..|.. +.|+||++|-..
T Consensus       192 ~gC~C~n~~~l~~ICs~v~~~C~~~~C~~pl~-P~GhCC~iCGa~  235 (437)
T PF14828_consen  192 SGCPCGNDEVLEWICSNVLQRCPKPHCRSPLR-PEGHCCPICGAI  235 (437)
T ss_pred             ccCccCcccchhhhhHHhhCcCCCCccCCCCC-CCCCchhhcceE
Confidence            579998754   568887  999999999988 899999999753


No 25 
>COG4537 ComGC Competence protein ComGC [Intracellular trafficking and secretion]
Probab=74.79  E-value=2.7  Score=32.06  Aligned_cols=18  Identities=39%  Similarity=0.670  Sum_probs=15.3

Q ss_pred             hHHHHHHHHHHHHHHHHh
Q psy10995        244 LYILLIVLFAVSTLFAFY  261 (265)
Q Consensus       244 ~~~~~~~~~~~~~~~~~~  261 (265)
                      |-=||+|++.||+|||++
T Consensus        16 LvEMLiVLlIISiLlLl~   33 (107)
T COG4537          16 LVEMLIVLLIISILLLLF   33 (107)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            445899999999999986


No 26 
>smart00058 FN1 Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type  contributes to fibrin-binding.
Probab=70.40  E-value=6.6  Score=25.46  Aligned_cols=31  Identities=26%  Similarity=0.529  Sum_probs=23.9

Q ss_pred             CCeEEcCCcEEcc---C-CCccceecC---CeEEEeec
Q psy10995         83 HGELKLEGQHWTL---N-PCTDCTCRE---GKVLCYSQ  113 (265)
Q Consensus        83 ~g~~y~~Ge~w~~---~-~C~~C~C~~---g~v~C~~~  113 (265)
                      .++.|..|++|..   . .-..|+|..   |++.|...
T Consensus         6 t~~~Y~ige~W~r~~~~g~~~~C~C~G~g~G~~~C~~~   43 (45)
T smart00058        6 TGTTYRIGDTWERPYEGGHVLQCTCLGNGRGEWKCDPV   43 (45)
T ss_pred             cccEEecCCEEEeecCCCcEEEeEEcCCCceEEEEEec
Confidence            4899999999984   2 456899985   45999864


No 27 
>PF15430 SVWC:  Single domain von Willebrand factor type C
Probab=70.28  E-value=7.2  Score=26.51  Aligned_cols=35  Identities=23%  Similarity=0.430  Sum_probs=26.5

Q ss_pred             ecCCCeEeCCCCeee--cCCcceEEC--CCceEEeeccCCC
Q psy10995         12 QDEDGVWRDDSEIWR--DGCRECHCI--DGREMCTLIACPT   48 (265)
Q Consensus        12 c~~~G~~y~~Ge~w~--~~C~~C~C~--~G~v~C~~~~Cp~   48 (265)
                      |.++|+.+.+|+.|.  ++|..-+|.  +|.++  ...|+.
T Consensus         1 C~y~~~~~~~g~~~~~~~pC~~~~C~~~~~~v~--v~~C~~   39 (65)
T PF15430_consen    1 CYYNGRTIPSGESYNPEEPCERWTCDASDGYVT--VEGCPP   39 (65)
T ss_pred             CEECCEEcCCCcEecCCCCCceEEEECCCCEEE--EEeCCC
Confidence            567899999999998  799999999  45444  334543


No 28 
>smart00058 FN1 Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type  contributes to fibrin-binding.
Probab=69.71  E-value=7.4  Score=25.23  Aligned_cols=29  Identities=17%  Similarity=0.538  Sum_probs=22.8

Q ss_pred             CeEeCCCCeee---c--CCcceEECCCc---eEEeec
Q psy10995         16 GVWRDDSEIWR---D--GCRECHCIDGR---EMCTLI   44 (265)
Q Consensus        16 G~~y~~Ge~w~---~--~C~~C~C~~G~---v~C~~~   44 (265)
                      |+.|+.|++|.   +  .-..|+|..+.   +.|.+.
T Consensus         7 ~~~Y~ige~W~r~~~~g~~~~C~C~G~g~G~~~C~~~   43 (45)
T smart00058        7 GTTYRIGDTWERPYEGGHVLQCTCLGNGRGEWKCDPV   43 (45)
T ss_pred             ccEEecCCEEEeecCCCcEEEeEEcCCCceEEEEEec
Confidence            79999999997   2  34579999654   888864


No 29 
>PF12191 stn_TNFRSF12A:  Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain;  InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=69.67  E-value=2.4  Score=33.59  Aligned_cols=69  Identities=22%  Similarity=0.335  Sum_probs=16.4

Q ss_pred             eeCCCEecC--CCceEEEECCCCeecCCCCcCCCcccCCCCCCCCCCCCCCCCccchhhhhhhhHHHHHHHHHHHHHHHH
Q psy10995        183 LHVGDTYFA--DPCTNCTCLHGGQKKCTASMCERKALEKGDTPTPPSGNDSPESATNRQIRESLYILLIVLFAVSTLFAF  260 (265)
Q Consensus       183 y~~Ge~w~~--~~C~~C~C~~g~v~~Cp~~~C~~p~~~~g~CC~~C~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  260 (265)
                      -+.|+.|+.  ++|..|+       .|+..       ..-.=|..|...+...-..--.|.-|...+++|+.++|.|+..
T Consensus        36 Cp~G~sWs~dLdkCm~Cs-------sC~~~-------P~S~~C~~C~a~p~~p~~l~~pi~~sal~v~lVl~llsg~lv~  101 (129)
T PF12191_consen   36 CPRGSSWSADLDKCMSCS-------SCPAY-------PKSPFCQGCPAAPPAPFPLLWPILGSALSVVLVLALLSGFLVW  101 (129)
T ss_dssp             S-SSEEEETTTTEEEECC-------CHCC--------TT-CCCCCHSS-SSS-SSSS-----------------------
T ss_pred             CCCCCcCcccccccccCc-------cCCCC-------CCCccccCCCCCCCCCcceehhhhhhHHHHHHHHHHHHHHHHH
Confidence            356889986  5888772       23320       1111123343322111222233555554556666666655555


Q ss_pred             hhhcC
Q psy10995        261 YACRQ  265 (265)
Q Consensus       261 ~~~~~  265 (265)
                      .+||+
T Consensus       102 rrcrr  106 (129)
T PF12191_consen  102 RRCRR  106 (129)
T ss_dssp             -----
T ss_pred             hhhhc
Confidence            55553


No 30 
>PRK11875 psbT photosystem II reaction center protein T; Reviewed
Probab=62.80  E-value=9.4  Score=22.62  Aligned_cols=20  Identities=35%  Similarity=0.501  Sum_probs=16.0

Q ss_pred             hHHHHHHHHHHHHHHHHhhh
Q psy10995        244 LYILLIVLFAVSTLFAFYAC  263 (265)
Q Consensus       244 ~~~~~~~~~~~~~~~~~~~~  263 (265)
                      .|++|++..+.+.+||++-+
T Consensus         5 ~Ytfll~~tlgiiFFAIfFR   24 (31)
T PRK11875          5 AYILILTLALVTLFFAIAFR   24 (31)
T ss_pred             HHHHHHHHHHHHHHHhhhcc
Confidence            57888888888888888754


No 31 
>PF06459 RR_TM4-6:  Ryanodine Receptor TM 4-6;  InterPro: IPR009460  The release of Ca2+ ions from intracellular stores is a key step in a wide variety of cellular functions. In striated muscle, the release of Ca2+ from the sarcoplasmic reticulum (SR) leads to muscle contraction. Ca2+ release occurs through large, high-conductance Ca2+ release channels, also known as ryanodine receptors (RyRs) because they bind the plant alkaloid ryanodine with high affinity and specificity []. This region covers TM regions 4-6 of the ryanodine receptor 1 family.; GO: 0005219 ryanodine-sensitive calcium-release channel activity, 0006874 cellular calcium ion homeostasis, 0016021 integral to membrane
Probab=61.22  E-value=7.2  Score=35.21  Aligned_cols=19  Identities=26%  Similarity=0.728  Sum_probs=16.4

Q ss_pred             HHHHHHHHHHHHHHHHhhh
Q psy10995        245 YILLIVLFAVSTLFAFYAC  263 (265)
Q Consensus       245 ~~~~~~~~~~~~~~~~~~~  263 (265)
                      |+-|+|.|||=+++|||+-
T Consensus       175 ~lALflAFaINFILLFYKV  193 (274)
T PF06459_consen  175 FLALFLAFAINFILLFYKV  193 (274)
T ss_pred             HHHHHHHHHHHHHHHHHHh
Confidence            4668999999999999983


No 32 
>CHL00031 psbT photosystem II protein T
Probab=60.86  E-value=8.9  Score=23.07  Aligned_cols=21  Identities=24%  Similarity=0.333  Sum_probs=16.6

Q ss_pred             hhHHHHHHHHHHHHHHHHhhh
Q psy10995        243 SLYILLIVLFAVSTLFAFYAC  263 (265)
Q Consensus       243 ~~~~~~~~~~~~~~~~~~~~~  263 (265)
                      -.|++|++..+.+.+||++-+
T Consensus         4 lvYtfll~~tlgilFFAI~FR   24 (33)
T CHL00031          4 LVYTFLLVSTLGIIFFAIFFR   24 (33)
T ss_pred             hHHHHHHHHHHHHHHHhheec
Confidence            368899998888888888654


No 33 
>PF15106 TMEM156:  TMEM156 protein family
Probab=60.21  E-value=8.7  Score=33.22  Aligned_cols=25  Identities=20%  Similarity=0.558  Sum_probs=21.1

Q ss_pred             hhhhhhhhHHHHHHHHHHHHHHHHhhh
Q psy10995        237 NRQIRESLYILLIVLFAVSTLFAFYAC  263 (265)
Q Consensus       237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~  263 (265)
                      +..|.  +|+|++++|.|.+.|.||+-
T Consensus       173 smKIT--WYvLVllVfiflii~iI~KI  197 (226)
T PF15106_consen  173 SMKIT--WYVLVLLVFIFLIILIIYKI  197 (226)
T ss_pred             ehhhH--HHHHHHHHHHHHHHHHHHHH
Confidence            34455  99999999999999999984


No 34 
>PF08693 SKG6:  Transmembrane alpha-helix domain;  InterPro: IPR014805 SKG6 and AXL2 are membrane proteins that show polarised intracellular localisation [, ]. This entry represents the highly conserved transmembrane alpha-helical domain found in these proteins [, ]. The full-length AXL2 protein has a negative regulatory function in cytokinesis [].
Probab=57.95  E-value=6.2  Score=24.97  Aligned_cols=18  Identities=28%  Similarity=0.458  Sum_probs=9.0

Q ss_pred             HHHHHHHHHHHHH--hhhcC
Q psy10995        248 LIVLFAVSTLFAF--YACRQ  265 (265)
Q Consensus       248 ~~~~~~~~~~~~~--~~~~~  265 (265)
                      +.||.++..+|||  |++++
T Consensus        21 V~vI~~vl~~~l~~~~rR~k   40 (40)
T PF08693_consen   21 VGVIIIVLGAFLFFWYRRKK   40 (40)
T ss_pred             hHHHHHHHHHHhheEEeccC
Confidence            3444444444444  76653


No 35 
>PF09443 CFC:  Cripto_Frl-1_Cryptic (CFC);  InterPro: IPR019011 This entry represents the CFC domain found in the membrane protein Cripto (or teratocarcinoma-derived growth factor), a protein over expressed in many tumours [, ] and structurally similar to the C-terminal extracellular portions of Jagged 1 and Jagged 2 []. CFC is approx 40-residues long, compacted by three internal disulphide bridges, and binds Alk4 via a hydrophobic patch. CFC is structurally homologous to the VWFC-like domain [].  The protein Cripto is the founding member of the extra-cellular EGF-CFC growth factors, which are composed of two adjacent cysteine-rich domains: the EGF-like (IPR006210 from INTERPRO) and the CFC domains. Members of the EGF-CFC family play key roles in embryonic development and are also implicated in tumourigenesis []. The Cripto protein could play a role in the determination of the epiblastic cells that subsequently give rise to the mesoderm. Although both the EGF and CFC domains are involved in the tumourigenic activity of Crispto proteins, the CFC domain appears to play a crucial role, as it is through the CFC domain that Crispto interferes with the onco-suppressive activity of Activins, either by blocking the Activin receptor ALK4 or by antagonising proteins of the TGF-beta family [].  The Cryptic protein is involved in the correct establishment of the left-right axis. May play a role in mesoderm and/or neural patterning during gastrulation.; PDB: 2J5H_A.
Probab=57.85  E-value=3.4  Score=25.49  Aligned_cols=25  Identities=32%  Similarity=0.950  Sum_probs=6.9

Q ss_pred             cCCcEEccCCCccceecCCeEEEeec
Q psy10995         88 LEGQHWTLNPCTDCTCREGKVLCYSQ  113 (265)
Q Consensus        88 ~~Ge~w~~~~C~~C~C~~g~v~C~~~  113 (265)
                      .+|+ |....|..|+|..|...|.+.
T Consensus         5 ~Hg~-W~~~~C~lCrC~~G~LhC~p~   29 (36)
T PF09443_consen    5 PHGE-WVPKGCSLCRCWYGTLHCFPQ   29 (36)
T ss_dssp             --TT--SS---------SSS--S--S
T ss_pred             cCCC-eeecCceeeEeecCcEEEcCc
Confidence            3454 888899999999999999864


No 36 
>TIGR03024 arch_pef_cterm PEF-C-terminal archaeal protein sorting domain. This domain, distantly related to the PEP-Cterm domain described in model TIGR02595, is found in Methanosarcina mazei in four different proteins, as well as in other archaea such as Methanococcoides burtonii. Several proteins with this domain have their genes only a short distance from a distant homology of EpsH, a proposed integral membrane transpeptidase.
Probab=55.59  E-value=15  Score=20.95  Aligned_cols=16  Identities=13%  Similarity=0.299  Sum_probs=10.1

Q ss_pred             HHHHHHHHHHHHhhhc
Q psy10995        249 IVLFAVSTLFAFYACR  264 (265)
Q Consensus       249 ~~~~~~~~~~~~~~~~  264 (265)
                      +.|.|+.++++++.+|
T Consensus         9 l~I~all~i~~i~~rr   24 (26)
T TIGR03024         9 LPIIALLAIIVILRRR   24 (26)
T ss_pred             HHHHHHHHHHHHHhhc
Confidence            4555666667777665


No 37 
>PF15102 TMEM154:  TMEM154 protein family
Probab=55.44  E-value=6.8  Score=31.97  Aligned_cols=19  Identities=26%  Similarity=0.490  Sum_probs=11.5

Q ss_pred             HHHHHHHHHHHHHHHhhhc
Q psy10995        246 ILLIVLFAVSTLFAFYACR  264 (265)
Q Consensus       246 ~~~~~~~~~~~~~~~~~~~  264 (265)
                      ++|.++++++|++++|.+|
T Consensus        66 VLLvlLLl~vV~lv~~~kR   84 (146)
T PF15102_consen   66 VLLVLLLLSVVCLVIYYKR   84 (146)
T ss_pred             HHHHHHHHHHHHheeEEee
Confidence            4555555666666666655


No 38 
>PF04530 Viral_Beta_CD:  Viral Beta C/D like family;  InterPro: IPR007617 This is a family of ssRNA positive-strand viral proteins. Conserved region is found in the Beta C and Beta D transcripts.
Probab=51.86  E-value=36  Score=26.92  Aligned_cols=10  Identities=10%  Similarity=-0.204  Sum_probs=5.3

Q ss_pred             CCCCCCCCCC
Q psy10995        219 KGDTPTPPSG  228 (265)
Q Consensus       219 ~g~CC~~C~~  228 (265)
                      .|-+|+.|.-
T Consensus         5 ~~C~CsdC~w   14 (122)
T PF04530_consen    5 SGCCCSDCQW   14 (122)
T ss_pred             CCCcCCcccC
Confidence            4455666553


No 39 
>PF06692 MNSV_P7B:  Melon necrotic spot virus P7B protein;  InterPro: IPR009575 This family consists of several Melon necrotic spot virus (MNSV) P7B proteins. The function of this family is unknown.
Probab=51.67  E-value=17  Score=24.65  Aligned_cols=18  Identities=17%  Similarity=0.180  Sum_probs=12.5

Q ss_pred             HHHHHHHHHHHHHHHHhh
Q psy10995        245 YILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       245 ~~~~~~~~~~~~~~~~~~  262 (265)
                      ..+|+|||.-.+||+||.
T Consensus        14 ~~~lLiliis~~f~lI~~   31 (61)
T PF06692_consen   14 SGPLLILIISFVFFLITS   31 (61)
T ss_pred             hhHHHHHHHHHHHHHHhh
Confidence            346667777777888874


No 40 
>TIGR03068 srtB_sig_NPQTN sortase B signal domain, NPQTN class. This model represents one of the boutique (rare) sortase signals, recognized by sortase B (SrtB) rather than by the housekeeping-type SrtA class sortase. This sequence, beginning NPQTN, shows little similarity to several other SrtB substrates.
Probab=48.82  E-value=17  Score=21.85  Aligned_cols=15  Identities=33%  Similarity=0.392  Sum_probs=8.6

Q ss_pred             HHHHHHHHHHHHHHHHhh
Q psy10995        245 YILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       245 ~~~~~~~~~~~~~~~~~~  262 (265)
                      |+.++.++|   |||+|.
T Consensus        11 y~y~Ip~v~---lflL~~   25 (33)
T TIGR03068        11 YIYAIPVAS---LALLIA   25 (33)
T ss_pred             hhhHHHHHH---HHHHHH
Confidence            556665555   565554


No 41 
>PF01405 PsbT:  Photosystem II reaction centre T protein;  InterPro: IPR001743 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].   This family represents the low molecular weight transmembrane protein PsbT found in PSII, which is thought to be associated with the D1 (PsbA) - D2 (PsbD) heterodimer. PsbT may be involved in the formation and/or stabilisation of dimeric PSII complexes, because in the absence of this protein dimeric PSII complexes were found to be less abundant. Furthermore, although PsbT does not confer photo-protection, it is required for the efficient recovery of photo-damaged PSII [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3BZ1_T 1S5L_t 2AXT_t 3KZI_T 3PRQ_T 3BZ2_T 3PRR_T 4FBY_g 3A0H_t 3A0B_T ....
Probab=48.04  E-value=26  Score=20.54  Aligned_cols=20  Identities=20%  Similarity=0.330  Sum_probs=14.8

Q ss_pred             hHHHHHHHHHHHHHHHHhhh
Q psy10995        244 LYILLIVLFAVSTLFAFYAC  263 (265)
Q Consensus       244 ~~~~~~~~~~~~~~~~~~~~  263 (265)
                      .|++|++..+.+.+||+.-+
T Consensus         5 vY~~ll~~tlgilffAI~FR   24 (29)
T PF01405_consen    5 VYTFLLIGTLGILFFAIFFR   24 (29)
T ss_dssp             HHHHHHHHHHHHHHHHHHSS
T ss_pred             HHHHHHHHHHHHHHhhhhcc
Confidence            57888888777777877643


No 42 
>PF15168 TRIQK:  Triple QxxK/R motif-containing protein family
Probab=45.53  E-value=23  Score=25.61  Aligned_cols=11  Identities=45%  Similarity=0.667  Sum_probs=5.0

Q ss_pred             HHHHHHHHHhh
Q psy10995        252 FAVSTLFAFYA  262 (265)
Q Consensus       252 ~~~~~~~~~~~  262 (265)
                      +.+..||+||+
T Consensus        57 ail~lL~a~Ya   67 (79)
T PF15168_consen   57 AILVLLLAFYA   67 (79)
T ss_pred             HHHHHHHHHHH
Confidence            33444555553


No 43 
>COG4839 FtsL Protein required for the initiation of cell division [Cell division and chromosome partitioning]
Probab=44.63  E-value=23  Score=27.85  Aligned_cols=22  Identities=23%  Similarity=0.211  Sum_probs=17.2

Q ss_pred             hhhhHHHHHHHHHHHHHHHHhh
Q psy10995        241 RESLYILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       241 ~~~~~~~~~~~~~~~~~~~~~~  262 (265)
                      .--+|+++.|++++.+++++|.
T Consensus        37 EKvly~~~~va~L~vai~ii~~   58 (120)
T COG4839          37 EKVLYTTLAVAALVVAISIISV   58 (120)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            3447889999988888888774


No 44 
>PF09889 DUF2116:  Uncharacterized protein containing a Zn-ribbon (DUF2116);  InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=43.67  E-value=35  Score=23.45  Aligned_cols=15  Identities=20%  Similarity=0.547  Sum_probs=7.0

Q ss_pred             hhhhhHHHHHHHHHH
Q psy10995        240 IRESLYILLIVLFAV  254 (265)
Q Consensus       240 ~~~~~~~~~~~~~~~  254 (265)
                      ++-..||++.++++|
T Consensus        35 ~~~~~~i~~~~~i~~   49 (59)
T PF09889_consen   35 MRKTQYIFFGIFILF   49 (59)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            333444555554443


No 45 
>PHA02831 EEV host range protein; Provisional
Probab=43.53  E-value=89  Score=28.17  Aligned_cols=13  Identities=23%  Similarity=0.761  Sum_probs=6.8

Q ss_pred             HHHHHHHHHHhhh
Q psy10995        251 LFAVSTLFAFYAC  263 (265)
Q Consensus       251 ~~~~~~~~~~~~~  263 (265)
                      ||.+.+++||-.|
T Consensus       243 ~~~~~~~~~~~~~  255 (268)
T PHA02831        243 IFVLGLIALFLSC  255 (268)
T ss_pred             HHHHHHHHHhhcc
Confidence            4455555555444


No 46 
>KOG3208|consensus
Probab=42.24  E-value=25  Score=30.68  Aligned_cols=22  Identities=41%  Similarity=0.560  Sum_probs=17.7

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHhh
Q psy10995        240 IRESLYILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       240 ~~~~~~~~~~~~~~~~~~~~~~~  262 (265)
                      -|.+| ||..||.+-..||+||+
T Consensus       208 rrdsl-ILa~Vis~C~llllfy~  229 (231)
T KOG3208|consen  208 RRDSL-ILAAVISVCTLLLLFYW  229 (231)
T ss_pred             hhhhH-HHHHHHHHHHHHHHHHH
Confidence            34555 88899998889999996


No 47 
>PHA02819 hypothetical protein; Provisional
Probab=41.61  E-value=44  Score=23.77  Aligned_cols=21  Identities=14%  Similarity=0.227  Sum_probs=10.2

Q ss_pred             hhhhhHHHHHHHHHHHHHHHH
Q psy10995        240 IRESLYILLIVLFAVSTLFAF  260 (265)
Q Consensus       240 ~~~~~~~~~~~~~~~~~~~~~  260 (265)
                      ..-.+.|++++++++.+||.|
T Consensus        44 ~~~~~~ii~l~~~~~~~~~~f   64 (71)
T PHA02819         44 FLRYYLIIGLVTIVFVIIFII   64 (71)
T ss_pred             hhHHHHHHHHHHHHHHHHHHH
Confidence            333444555555555555544


No 48 
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=39.17  E-value=10  Score=26.51  Aligned_cols=15  Identities=27%  Similarity=0.481  Sum_probs=0.6

Q ss_pred             HHHHHHHHHHHhhhc
Q psy10995        250 VLFAVSTLFAFYACR  264 (265)
Q Consensus       250 ~~~~~~~~~~~~~~~  264 (265)
                      +.+.+.+||++|+.|
T Consensus        23 l~ailLIlf~iyR~r   37 (64)
T PF01034_consen   23 LFAILLILFLIYRMR   37 (64)
T ss_dssp             --------------S
T ss_pred             HHHHHHHHHHHHHHH
Confidence            334445678888765


No 49 
>PF09889 DUF2116:  Uncharacterized protein containing a Zn-ribbon (DUF2116);  InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=38.87  E-value=30  Score=23.81  Aligned_cols=21  Identities=19%  Similarity=0.417  Sum_probs=10.1

Q ss_pred             hhhhhhhhHHHHHHHHHHHHH
Q psy10995        237 NRQIRESLYILLIVLFAVSTL  257 (265)
Q Consensus       237 ~~~~~~~~~~~~~~~~~~~~~  257 (265)
                      .++|...+.++++++|++.++
T Consensus        38 ~~~i~~~~~i~~l~v~~~~~~   58 (59)
T PF09889_consen   38 TQYIFFGIFILFLAVWIFMTF   58 (59)
T ss_pred             HHHHHHHHHHHHHHHHHHHHh
Confidence            455555554444444444443


No 50 
>PHA03240 envelope glycoprotein M; Provisional
Probab=38.41  E-value=27  Score=30.46  Aligned_cols=26  Identities=8%  Similarity=0.442  Sum_probs=16.7

Q ss_pred             hhhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995        237 NRQIRESLYILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       237 ~~~~~~~~~~~~~~~~~~~~~~~~~~  262 (265)
                      +....|.-.|+++||+.++++++|++
T Consensus       207 ~~~aaH~~WIiilIIiIiIIIL~cfK  232 (258)
T PHA03240        207 SKDAAHIAWIFIAIIIIIVIILFFFK  232 (258)
T ss_pred             ccccchHhHHHHHHHHHHHHHHHHHh
Confidence            33467777777777776665555554


No 51 
>PHA02650 hypothetical protein; Provisional
Probab=36.24  E-value=66  Score=23.45  Aligned_cols=19  Identities=32%  Similarity=0.640  Sum_probs=8.9

Q ss_pred             hhhHHHHHHHHHHHHHHHH
Q psy10995        242 ESLYILLIVLFAVSTLFAF  260 (265)
Q Consensus       242 ~~~~~~~~~~~~~~~~~~~  260 (265)
                      -.+.|++++++++.+||.|
T Consensus        49 ~~~~ii~i~~v~i~~l~~f   67 (81)
T PHA02650         49 GQNFIFLIFSLIIVALFSF   67 (81)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3344555444445555443


No 52 
>PHA03054 IMV membrane protein; Provisional
Probab=36.03  E-value=71  Score=22.76  Aligned_cols=20  Identities=10%  Similarity=0.243  Sum_probs=9.6

Q ss_pred             hhhhHHHHHHHHHHHHHHHH
Q psy10995        241 RESLYILLIVLFAVSTLFAF  260 (265)
Q Consensus       241 ~~~~~~~~~~~~~~~~~~~~  260 (265)
                      .-.+.|++++++++.+||.|
T Consensus        47 ~~~~~ii~l~~v~~~~l~~f   66 (72)
T PHA03054         47 GWYWLIIIFFIVLILLLLIY   66 (72)
T ss_pred             hHHHHHHHHHHHHHHHHHHH
Confidence            33444555554455555444


No 53 
>PF06084 Cytomega_TRL10:  Cytomegalovirus TRL10 protein;  InterPro: IPR009284 This family consists of several Cytomegalovirus TRL10 proteins. TRL10 represents a structural component of the virus particle and like the other HCMV envelope glycoproteins, is present in a disulphide-linked complex [].
Probab=35.88  E-value=20  Score=28.02  Aligned_cols=14  Identities=14%  Similarity=0.477  Sum_probs=9.8

Q ss_pred             HHHHHHHHHHhhhc
Q psy10995        251 LFAVSTLFAFYACR  264 (265)
Q Consensus       251 ~~~~~~~~~~~~~~  264 (265)
                      |+++++||.||++.
T Consensus        70 iillviffviy~re   83 (150)
T PF06084_consen   70 IILLVIFFVIYSRE   83 (150)
T ss_pred             HHHHHHhheeEecc
Confidence            44566788899863


No 54 
>PF03908 Sec20:  Sec20;  InterPro: IPR005606 Sec20 is a membrane glycoprotein associated with secretory pathway.
Probab=34.76  E-value=41  Score=24.87  Aligned_cols=17  Identities=12%  Similarity=0.467  Sum_probs=10.5

Q ss_pred             HHHHHHHHHHHHHHhhh
Q psy10995        247 LLIVLFAVSTLFAFYAC  263 (265)
Q Consensus       247 ~~~~~~~~~~~~~~~~~  263 (265)
                      +.+.+|.+++++.+|++
T Consensus        75 ~~~~~f~~~v~yI~~rR   91 (92)
T PF03908_consen   75 FAFLFFLLVVLYILWRR   91 (92)
T ss_pred             HHHHHHHHHHHHHhhhc
Confidence            45556666666666654


No 55 
>PF15220 HILPDA:  Hypoxia-inducible lipid droplet-associated 
Probab=34.46  E-value=50  Score=22.30  Aligned_cols=18  Identities=33%  Similarity=0.539  Sum_probs=15.1

Q ss_pred             hhHHHHHHHHHHHHHHHH
Q psy10995        243 SLYILLIVLFAVSTLFAF  260 (265)
Q Consensus       243 ~~~~~~~~~~~~~~~~~~  260 (265)
                      +||++..|+.++|+|+-+
T Consensus         6 nlyllgvvltllsifvrl   23 (63)
T PF15220_consen    6 NLYLLGVVLTLLSIFVRL   23 (63)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            689999999999988754


No 56 
>PF11027 DUF2615:  Protein of unknown function (DUF2615);  InterPro: IPR020309 This entry represents a group of uncharacterised protein from the Metazoa, including CD034 (or C4orf34) and YQF4 (or C34C12.4).
Probab=34.24  E-value=40  Score=25.86  Aligned_cols=16  Identities=13%  Similarity=0.453  Sum_probs=8.8

Q ss_pred             HHHHHHHHHHHHHHhh
Q psy10995        247 LLIVLFAVSTLFAFYA  262 (265)
Q Consensus       247 ~~~~~~~~~~~~~~~~  262 (265)
                      ++.++|.+++++||+-
T Consensus        57 ~~~~~w~~~A~~ly~~   72 (103)
T PF11027_consen   57 MMMMLWMVLAMALYLL   72 (103)
T ss_pred             HHHHHHHHHHHHHHHc
Confidence            4445566666555553


No 57 
>COG3726 AhpA Uncharacterized membrane protein affecting hemolysin expression [General function prediction only]
Probab=33.63  E-value=44  Score=28.68  Aligned_cols=22  Identities=18%  Similarity=0.415  Sum_probs=17.7

Q ss_pred             hhhhhHHHHHHHHHHHHHHH-Hh
Q psy10995        240 IRESLYILLIVLFAVSTLFA-FY  261 (265)
Q Consensus       240 ~~~~~~~~~~~~~~~~~~~~-~~  261 (265)
                      --|-+|.||+++|++.+|++ +|
T Consensus       161 ~~~~l~~mlll~~l~gvlLa~~l  183 (214)
T COG3726         161 TTNILRLMLLLSLLIGVLLARLL  183 (214)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            34668899999999999887 54


No 58 
>PHA02844 putative transmembrane protein; Provisional
Probab=33.57  E-value=68  Score=23.09  Aligned_cols=16  Identities=25%  Similarity=0.650  Sum_probs=6.7

Q ss_pred             hHHHHHHHHHHHHHHH
Q psy10995        244 LYILLIVLFAVSTLFA  259 (265)
Q Consensus       244 ~~~~~~~~~~~~~~~~  259 (265)
                      +.|++++++++++||.
T Consensus        50 ~~ii~i~~v~~~~~~~   65 (75)
T PHA02844         50 IWILTIIFVVFATFLT   65 (75)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3344444444444443


No 59 
>PF10717 ODV-E18:  Occlusion-derived virus envelope protein ODV-E18;  InterPro: IPR019655  Baculovirus occlusion-derived virus (ODV) derives its envelope from an intranuclear membrane source. Occlusion-derived viral envelope proteins that are detected in viral-induced intranuclear microvesicles, but not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. This entry represents ODV-E18 protein which is encoded by baculovirus late genes with transcription initiating from a TAAG motif. ODV-E18 exists as a dimer in the ODV envelope, which contains a hydrophobic domain that putatively acts as a target or retention signal for intranuclear microvesicles []. ; GO: 0019031 viral envelope
Probab=32.33  E-value=52  Score=24.23  Aligned_cols=15  Identities=33%  Similarity=0.576  Sum_probs=6.7

Q ss_pred             HHHHHHHHHHHHHhh
Q psy10995        248 LIVLFAVSTLFAFYA  262 (265)
Q Consensus       248 ~~~~~~~~~~~~~~~  262 (265)
                      |+|++.|+.|.+|++
T Consensus        31 LivLVIIiLlImlfq   45 (85)
T PF10717_consen   31 LIVLVIIILLIMLFQ   45 (85)
T ss_pred             HHHHHHHHHHHHHHh
Confidence            344444444444443


No 60 
>PF12911 OppC_N:  N-terminal TM domain of oligopeptide transport permease C
Probab=32.04  E-value=54  Score=21.47  Aligned_cols=15  Identities=13%  Similarity=0.476  Sum_probs=6.9

Q ss_pred             HHHHHHHHHHHHHHH
Q psy10995        246 ILLIVLFAVSTLFAF  260 (265)
Q Consensus       246 ~~~~~~~~~~~~~~~  260 (265)
                      ++.++|+++.+++++
T Consensus        19 ~~gl~il~~~vl~ai   33 (56)
T PF12911_consen   19 VIGLIILLILVLLAI   33 (56)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            344444444444444


No 61 
>PTZ00370 STEVOR; Provisional
Probab=31.87  E-value=44  Score=30.39  Aligned_cols=20  Identities=15%  Similarity=0.398  Sum_probs=12.2

Q ss_pred             HHHHHHHHHHHHHHHhhhcC
Q psy10995        246 ILLIVLFAVSTLFAFYACRQ  265 (265)
Q Consensus       246 ~~~~~~~~~~~~~~~~~~~~  265 (265)
                      ++++++++|+...++|++|.
T Consensus       264 llil~vvliilYiwlyrrRK  283 (296)
T PTZ00370        264 LLILAVVLIILYIWLYRRRK  283 (296)
T ss_pred             HHHHHHHHHHHHHHHHHhhc
Confidence            44444555555667888873


No 62 
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=31.67  E-value=44  Score=30.32  Aligned_cols=20  Identities=10%  Similarity=0.376  Sum_probs=12.1

Q ss_pred             HHHHHHHHHHHHHHHhhhcC
Q psy10995        246 ILLIVLFAVSTLFAFYACRQ  265 (265)
Q Consensus       246 ~~~~~~~~~~~~~~~~~~~~  265 (265)
                      ++++++++|+...++|++|.
T Consensus       268 llil~vvliiLYiWlyrrRK  287 (295)
T TIGR01478       268 LIILTVVLIILYIWLYRRRK  287 (295)
T ss_pred             HHHHHHHHHHHHHHHHHhhc
Confidence            44444455555567888873


No 63 
>PF11346 DUF3149:  Protein of unknown function (DUF3149);  InterPro: IPR021494  This bacterial family of proteins has no known function. 
Probab=30.94  E-value=65  Score=20.57  Aligned_cols=16  Identities=25%  Similarity=0.403  Sum_probs=9.3

Q ss_pred             HHHHHHHHHHHHHHHh
Q psy10995        246 ILLIVLFAVSTLFAFY  261 (265)
Q Consensus       246 ~~~~~~~~~~~~~~~~  261 (265)
                      +.|+||+..+.+.++|
T Consensus        14 ~Sl~vI~~~igm~~~~   29 (42)
T PF11346_consen   14 MSLIVIVFTIGMGVFF   29 (42)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4566666666555554


No 64 
>PF06679 DUF1180:  Protein of unknown function (DUF1180);  InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=30.48  E-value=67  Score=26.74  Aligned_cols=25  Identities=20%  Similarity=0.364  Sum_probs=17.0

Q ss_pred             hhhhhHHHHHHHHHHHHHHHHhhhc
Q psy10995        240 IRESLYILLIVLFAVSTLFAFYACR  264 (265)
Q Consensus       240 ~~~~~~~~~~~~~~~~~~~~~~~~~  264 (265)
                      +.-++|+++.+..++++.|+|...|
T Consensus        94 l~R~~~Vl~g~s~l~i~yfvir~~R  118 (163)
T PF06679_consen   94 LKRALYVLVGLSALAILYFVIRTFR  118 (163)
T ss_pred             hhhhHHHHHHHHHHHHHHHHHHHHh
Confidence            4456677777777777777776654


No 65 
>PF04272 Phospholamban:  Phospholamban;  InterPro: IPR005984  Phospholamban (PLB) is a small protein (52 amino acids) that regulates the affinity of the cardiac sarcoplasmic reticulum Ca2+-ATPase (SERCA2a) for calcium. PLB is present in cardiac myocytes, in slow-twitch and smooth muscle and is expressed also in aorta endothelial cells in which it could play a role in tissue relaxation. The phosphorylation/dephosphorylation of phospholamban removes and restores, respectively, its inhibitory activity on SERCA2a. It has in fact been shown that phospholamban, in its non-phosphorylated form, binds to SERCA2a and inhibits this pump by lowering its affinity for Ca2+, whereas the phosphorylated form does not exert the inhibition. PLB is phosphorylated at two sites, namely at Ser-16 for a cAMP-dependent phosphokinase and at Thr-17 for a Ca2+/calmodulin-dependent phosphokinase, phosphorylation at Ser-16 being a prerequisite for the phosphorylation at Thr-17.   The structure of a 36-amino-acid-long N-terminal fragment of human phospholamban phosphorylated at Ser-16 and Thr-17 and Cys36Ser mutated was determined from nuclear magnetic resonance data. The peptide assumes a conformation characterised by two alpha-helices connected by an irregular strand, which comprises the amino acids from Arg-13 to Pro-21. The proline is in a trans conformation. The two phosphate groups on Ser-16 and Thr-17 are shown to interact preferably with the side chains of Arg-14 and Arg-13, respectively [].; GO: 0005246 calcium channel regulator activity, 0042030 ATPase inhibitor activity, 0006816 calcium ion transport, 0016020 membrane; PDB: 1N7L_A 1FJP_A 1FJK_A 2HYN_C 1ZLL_D 1PLP_A 3O7L_I.
Probab=30.10  E-value=98  Score=20.07  Aligned_cols=26  Identities=19%  Similarity=0.220  Sum_probs=13.2

Q ss_pred             chhhhhhhhHHH-HHHHHHHHHHHHHh
Q psy10995        236 TNRQIRESLYIL-LIVLFAVSTLFAFY  261 (265)
Q Consensus       236 ~~~~~~~~~~~~-~~~~~~~~~~~~~~  261 (265)
                      .|+|+|..+.-+ ++....++-|+++|
T Consensus        20 v~~qa~qnlqelfvnfclilicllli~   46 (52)
T PF04272_consen   20 VPQQARQNLQELFVNFCLILICLLLIC   46 (52)
T ss_dssp             SCHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            466777766432 22333344555555


No 66 
>PF00746 Gram_pos_anchor:  Gram positive anchor;  InterPro: IPR019948 Viruses, parasites and bacteria are covered in protein and sugar molecules that help them gain entry into a host by counteracting the host's defences. One such molecule is the M protein produced by certain streptococcal bacteria. M proteins embody a motif that is now known to be shared by many Gram-positive bacterial surface proteins. The motif includes a conserved hexapeptide, which precedes a hydrophobic C-terminal membrane anchor, which itself precedes a cluster of basic residues [, ]. This structure is represented in the following schematic representation:  +--------------------------------------------+-+--------+-+ | Variable length extracellular domain |H| Anchor |B| +--------------------------------------------+-+--------+-+ 'H': conserved hexapeptide. 'B': cluster of basic residues.  It has been proposed that this hexapeptide sequence is responsible for a post- translational modification necessary for the proper anchoring of the proteins which bear it, to the cell wall.; PDB: 2XTL_B 3QDH_A 2Y1V_C 2X9X_A 3RPK_A 2X9W_A 2X9Y_A.
Probab=29.88  E-value=17  Score=22.22  Aligned_cols=21  Identities=14%  Similarity=0.159  Sum_probs=0.0

Q ss_pred             hHHHHHHHHHHHHHHHHhhhc
Q psy10995        244 LYILLIVLFAVSTLFAFYACR  264 (265)
Q Consensus       244 ~~~~~~~~~~~~~~~~~~~~~  264 (265)
                      .++++-++++++++++++++|
T Consensus        19 ~~~~~G~l~~~~~~~~~~krr   39 (39)
T PF00746_consen   19 ILTILGALLLLGGGLLLVKRR   39 (39)
T ss_dssp             ---------------------
T ss_pred             HHHHHHHHHHHHHHHHheecC
Confidence            334444455555555555544


No 67 
>PF04021 Class_IIIsignal:  Class III signal peptide;  InterPro: IPR007166 This entry represents an amino terminal motif QXSXEXXXL thought to be part of a class III signal sequence for a family of archaeal proteins. The Q residue is the +1 residue of the signal peptidase cleavage site []. Two proteins containing this motif are cleaved by a type IV pilin-like signal peptidase. 
Probab=29.63  E-value=73  Score=18.39  Aligned_cols=6  Identities=17%  Similarity=0.218  Sum_probs=2.1

Q ss_pred             HHHHHH
Q psy10995        252 FAVSTL  257 (265)
Q Consensus       252 ~~~~~~  257 (265)
                      +.+++.
T Consensus        16 vlv~~~   21 (28)
T PF04021_consen   16 VLVIAI   21 (28)
T ss_pred             HHHHHH
Confidence            333333


No 68 
>TIGR01294 P_lamban phospholamban. This model represents the short (52 residue) transmembrane phosphoprotein phospholamban. Phospholamban, in its unphosphorylated form, inhibits SERCA2, the cardiac sarcoplasmic reticulum Ca-ATPase.
Probab=29.45  E-value=99  Score=20.06  Aligned_cols=27  Identities=19%  Similarity=0.215  Sum_probs=13.6

Q ss_pred             cchhhhhhhhHHH-HHHHHHHHHHHHHh
Q psy10995        235 ATNRQIRESLYIL-LIVLFAVSTLFAFY  261 (265)
Q Consensus       235 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~  261 (265)
                      ..|+|+|..|.-+ ++....++-|+++|
T Consensus        19 e~~~qar~~lq~lfvnf~lilicllli~   46 (52)
T TIGR01294        19 EMPQQARQNLQNLFINFCLILICLLLIC   46 (52)
T ss_pred             cCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3467788777532 22223334455554


No 69 
>PHA03093 EEV glycoprotein; Provisional
Probab=29.37  E-value=60  Score=27.52  Aligned_cols=22  Identities=32%  Similarity=0.487  Sum_probs=17.2

Q ss_pred             hhhhhhHHHHHHHHHHHHHHHH
Q psy10995        239 QIRESLYILLIVLFAVSTLFAF  260 (265)
Q Consensus       239 ~~~~~~~~~~~~~~~~~~~~~~  260 (265)
                      -+|.+.-|.++++.++++++||
T Consensus        38 ~~RisiiiSIlsL~~i~~~LAl   59 (185)
T PHA03093         38 CIRISIIISILSLIAITATLAL   59 (185)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            4777787888888888887776


No 70 
>PF06024 DUF912:  Nucleopolyhedrovirus protein of unknown function (DUF912);  InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=28.47  E-value=49  Score=25.08  Aligned_cols=6  Identities=50%  Similarity=1.093  Sum_probs=2.5

Q ss_pred             HHHHHh
Q psy10995        256 TLFAFY  261 (265)
Q Consensus       256 ~~~~~~  261 (265)
                      +|+++|
T Consensus        77 ily~Iy   82 (101)
T PF06024_consen   77 ILYAIY   82 (101)
T ss_pred             HHhhhe
Confidence            334444


No 71 
>PF15188 CCDC-167:  Coiled-coil domain-containing protein 167
Probab=28.45  E-value=61  Score=23.99  Aligned_cols=17  Identities=18%  Similarity=0.247  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHHHHHHh
Q psy10995        245 YILLIVLFAVSTLFAFY  261 (265)
Q Consensus       245 ~~~~~~~~~~~~~~~~~  261 (265)
                      =.|++.+..|.++|+||
T Consensus        68 K~~~ls~~l~~v~~LvY   84 (85)
T PF15188_consen   68 KSMLLSVALFFVCFLVY   84 (85)
T ss_pred             hhHHHHHHHHHHHHHHc
Confidence            34677777788888888


No 72 
>PHA02673 ORF109 EEV glycoprotein; Provisional
Probab=28.37  E-value=65  Score=26.66  Aligned_cols=21  Identities=29%  Similarity=0.350  Sum_probs=14.2

Q ss_pred             hhhhhHHHHHHHHHHHHHHHH
Q psy10995        240 IRESLYILLIVLFAVSTLFAF  260 (265)
Q Consensus       240 ~~~~~~~~~~~~~~~~~~~~~  260 (265)
                      +|.+.-|.++|+.+++|++|+
T Consensus        36 ~Ri~~~iSIisL~~l~v~LaL   56 (161)
T PHA02673         36 FRLMAAIAIIVLAILVVILAL   56 (161)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            566666667777777776665


No 73 
>PF15201 Rod_cone_degen:  Progressive rod-cone degeneration
Probab=28.22  E-value=46  Score=21.83  Aligned_cols=15  Identities=40%  Similarity=0.563  Sum_probs=11.6

Q ss_pred             HHHHHHHHHHHhhhc
Q psy10995        250 VLFAVSTLFAFYACR  264 (265)
Q Consensus       250 ~~~~~~~~~~~~~~~  264 (265)
                      -+|++|+|..++.+|
T Consensus         4 tlfllstlamlwrrr   18 (54)
T PF15201_consen    4 TLFLLSTLAMLWRRR   18 (54)
T ss_pred             hHHHHHHHHHHHHHH
Confidence            367888888888776


No 74 
>PF06682 DUF1183:  Protein of unknown function (DUF1183);  InterPro: IPR009567 This family consists of several eukaryotic proteins of around 360 residues in length. The function of this family is unknown.
Probab=27.98  E-value=45  Score=30.81  Aligned_cols=16  Identities=25%  Similarity=0.613  Sum_probs=9.2

Q ss_pred             HHHHHHHHHHHHHHhh
Q psy10995        247 LLIVLFAVSTLFAFYA  262 (265)
Q Consensus       247 ~~~~~~~~~~~~~~~~  262 (265)
                      |+.|||+++++|+||+
T Consensus       159 lf~ii~l~vla~ivY~  174 (318)
T PF06682_consen  159 LFWIIFLLVLAFIVYS  174 (318)
T ss_pred             hhhHHHHHHHHHHHHH
Confidence            4455555566666664


No 75 
>PF13150 DUF3989:  Protein of unknown function (DUF3989)
Probab=27.70  E-value=91  Score=22.99  Aligned_cols=24  Identities=25%  Similarity=0.455  Sum_probs=14.0

Q ss_pred             hhhhhhhhHHHHHHHHHHHHHHHH
Q psy10995        237 NRQIRESLYILLIVLFAVSTLFAF  260 (265)
Q Consensus       237 ~~~~~~~~~~~~~~~~~~~~~~~~  260 (265)
                      ++..|.-+-+.++++||+..|+.+
T Consensus        23 sp~~R~~vvl~ml~~fa~l~ly~~   46 (85)
T PF13150_consen   23 SPKQRLRVVLVMLVLFAALCLYMT   46 (85)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHH
Confidence            444666565666666666655544


No 76 
>KOG3462|consensus
Probab=27.53  E-value=1e+02  Score=23.31  Aligned_cols=30  Identities=17%  Similarity=0.202  Sum_probs=22.9

Q ss_pred             CccchhhhhhhhHHHHHHHHHHHHHHHHhh
Q psy10995        233 ESATNRQIRESLYILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  262 (265)
                      +.+.++....++..+|-+||++.-||+=|+
T Consensus        24 ~~~~~eD~~pdYmn~lgmIfsmcGlM~r~K   53 (105)
T KOG3462|consen   24 QGAANEDPPPDYMNFLGMIFSMCGLMFRLK   53 (105)
T ss_pred             ccccccCCChhHHHHHHHHHHHHHHHHHHH
Confidence            344556667778889999999999987664


No 77 
>PHA01815 hypothetical protein
Probab=27.35  E-value=1e+02  Score=20.00  Aligned_cols=14  Identities=29%  Similarity=0.702  Sum_probs=9.4

Q ss_pred             HHHHHHHHHHHHhh
Q psy10995        249 IVLFAVSTLFAFYA  262 (265)
Q Consensus       249 ~~~~~~~~~~~~~~  262 (265)
                      ++||-++.|+.+|+
T Consensus        38 ~iifyiifl~viya   51 (55)
T PHA01815         38 LIIFYIIFLMVIYA   51 (55)
T ss_pred             HHHHHHHHHHHHHH
Confidence            56666677777775


No 78 
>PF09604 Potass_KdpF:  F subunit of K+-transporting ATPase (Potass_KdpF);  InterPro: IPR011726 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles []. P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H+, Na+, K+, Mg2+, Ca2+, Ag+ and Ag2+, Zn2+, Co2+, Pb2+, Ni2+, Cd2+, Cu+ and Cu2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the F subunit (KdpF) of a P-type K+-translocating ATPase (Kdp). KdpF is a very small integral membrane peptide. The kdpABC operon of Escherichia coli codes for the high affinity K+-translocating Kdp complex []. KdpF is found upstream of the KdpA subunit (IPR004623 from INTERPRO). Because of its very small size and highly hydrophobic character, it is sometimes missed in genome annotation. More information about this protein can be found at Protein of the Month: ATP Synthases [].
Probab=27.09  E-value=78  Score=17.86  Aligned_cols=15  Identities=20%  Similarity=0.279  Sum_probs=6.2

Q ss_pred             HHHHHHHHHHHHHhh
Q psy10995        248 LIVLFAVSTLFAFYA  262 (265)
Q Consensus       248 ~~~~~~~~~~~~~~~  262 (265)
                      +.++.+.....++|+
T Consensus         4 ~~~v~~~L~~YL~~a   18 (25)
T PF09604_consen    4 GGIVAVALFVYLFYA   18 (25)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            333333333444543


No 79 
>PF05781 MRVI1:  MRVI1 protein;  InterPro: IPR008677 This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene []. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP []. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins [].
Probab=27.00  E-value=55  Score=32.34  Aligned_cols=21  Identities=19%  Similarity=0.523  Sum_probs=14.2

Q ss_pred             hhhHHHHHHHHHHHHHHHHhh
Q psy10995        242 ESLYILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       242 ~~~~~~~~~~~~~~~~~~~~~  262 (265)
                      --|+|+|.+|+.|.+|+.||.
T Consensus       478 K~LWIsvAliVLLAaLlSfLt  498 (538)
T PF05781_consen  478 KVLWISVALIVLLAALLSFLT  498 (538)
T ss_pred             HHHHHHHHHHHHHHHHHHHHh
Confidence            345677777777777777775


No 80 
>PHA02955 hypothetical protein; Provisional
Probab=26.77  E-value=57  Score=28.34  Aligned_cols=21  Identities=29%  Similarity=0.444  Sum_probs=16.0

Q ss_pred             hHHHHHHHHHHHHHHHHhhhc
Q psy10995        244 LYILLIVLFAVSTLFAFYACR  264 (265)
Q Consensus       244 ~~~~~~~~~~~~~~~~~~~~~  264 (265)
                      .||++.|+.+++++|+-|..|
T Consensus       181 w~ii~~v~ii~~~v~l~yikR  201 (213)
T PHA02955        181 WFIIYIVLCLLILIILGYIYR  201 (213)
T ss_pred             chhHHHHHHHHHHHHHHHHHH
Confidence            577888888778888777766


No 81 
>PF02495 7kD_coat:  7kD viral coat protein;  InterPro: IPR003411 This family consists of a 7 kDa coat protein from Carlavirus and Potexvirus [].
Probab=25.29  E-value=61  Score=21.95  Aligned_cols=16  Identities=31%  Similarity=0.627  Sum_probs=7.8

Q ss_pred             HHHHHHHHHHHHHHHh
Q psy10995        246 ILLIVLFAVSTLFAFY  261 (265)
Q Consensus       246 ~~~~~~~~~~~~~~~~  261 (265)
                      ++++.+.++++++++|
T Consensus         2 ~~l~~i~~l~~~~~l~   17 (59)
T PF02495_consen    2 ILLLLIGLLAFLLTLY   17 (59)
T ss_pred             cHHHHHHHHHHHHHHH
Confidence            3444455555544444


No 82 
>PF05545 FixQ:  Cbb3-type cytochrome oxidase component FixQ;  InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=24.85  E-value=1.2e+02  Score=19.51  Aligned_cols=19  Identities=21%  Similarity=0.376  Sum_probs=7.4

Q ss_pred             hhHHHHHHHHHHHHHHHHh
Q psy10995        243 SLYILLIVLFAVSTLFAFY  261 (265)
Q Consensus       243 ~~~~~~~~~~~~~~~~~~~  261 (265)
                      ++.+++++++.+.+++..|
T Consensus        12 ~~~~v~~~~~F~gi~~w~~   30 (49)
T PF05545_consen   12 SIGTVLFFVFFIGIVIWAY   30 (49)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3433333333333333333


No 83 
>COG0690 SecE Preprotein translocase subunit SecE [Intracellular trafficking and secretion]
Probab=24.81  E-value=1.1e+02  Score=21.79  Aligned_cols=21  Identities=24%  Similarity=0.365  Sum_probs=9.1

Q ss_pred             hhhhhhhhHHHHHHHHHHHHH
Q psy10995        237 NRQIRESLYILLIVLFAVSTL  257 (265)
Q Consensus       237 ~~~~~~~~~~~~~~~~~~~~~  257 (265)
                      .+|+..+..+-|.+++.++++
T Consensus        39 rke~~~~t~~Vl~~v~~~s~~   59 (73)
T COG0690          39 RKELIRSTLIVLVVVAFFSLF   59 (73)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            455555553333333333333


No 84 
>KOG3385|consensus
Probab=24.11  E-value=72  Score=25.03  Aligned_cols=15  Identities=13%  Similarity=0.578  Sum_probs=9.1

Q ss_pred             HHHHHHHHHHHHHHh
Q psy10995        247 LLIVLFAVSTLFAFY  261 (265)
Q Consensus       247 ~~~~~~~~~~~~~~~  261 (265)
                      ..+.+|++++||+||
T Consensus       100 ~~m~~f~lV~~fi~~  114 (118)
T KOG3385|consen  100 CWMAVFSLVAFFILW  114 (118)
T ss_pred             HHHHHHHHHHHHHhh
Confidence            334566666777665


No 85 
>PF02553 CbiN:  Cobalt transport protein component CbiN;  InterPro: IPR003705 The cobalt transport protein CbiN is part of the active cobalt transport system involved in uptake of cobalt in to the cell involved with cobalamin biosynthesis (vitamin B12). It has been suggested that CbiN may function as the periplasmic binding protein component of the active cobalt transport system [].; GO: 0015087 cobalt ion transmembrane transporter activity, 0006824 cobalt ion transport, 0009236 cobalamin biosynthetic process, 0016020 membrane
Probab=23.90  E-value=71  Score=23.02  Aligned_cols=16  Identities=25%  Similarity=0.451  Sum_probs=7.6

Q ss_pred             HHHHHHHHHHHHHHHh
Q psy10995        246 ILLIVLFAVSTLFAFY  261 (265)
Q Consensus       246 ~~~~~~~~~~~~~~~~  261 (265)
                      |+++|++.+++.|++|
T Consensus         6 l~~~vv~l~~~pl~~~   21 (74)
T PF02553_consen    6 LLLLVVALAVVPLFLY   21 (74)
T ss_pred             HHHHHHHHHHHHHHHh
Confidence            4445444444444444


No 86 
>PF03742 PetN:  PetN ;  InterPro: IPR005497 PetN is a small hydrophobic protein, crucial for cytochrome b6-f complex assembly and/or stability. It is found in bacteria and plants. Cytochrome b6-f complex is composed of 4 large subunits: cytochrome b6, subunit IV (17 kDa polypeptide, petD), cytochrome f and the Rieske protein, as well as 4 small subunits: petG, petL, petM and petN. The complex functions as a dimer. The cytochrome b6-f complex mediates electron transfer between photosystem II (PSII) and photosystem I (PSI) [].; GO: 0045158 electron transporter, transferring electrons within cytochrome b6/f complex of photosystem II activity, 0017004 cytochrome complex assembly, 0009512 cytochrome b6f complex; PDB: 2ZT9_H 2D2C_H 2E76_H 1VF5_U 2E75_H 2E74_H.
Probab=23.43  E-value=1.3e+02  Score=17.65  Aligned_cols=18  Identities=22%  Similarity=0.403  Sum_probs=13.4

Q ss_pred             HHHHHHHHHHHHHHHhhh
Q psy10995        246 ILLIVLFAVSTLFAFYAC  263 (265)
Q Consensus       246 ~~~~~~~~~~~~~~~~~~  263 (265)
                      ..|.++|.||.-+..+.+
T Consensus         9 aal~~~ftfSlalVVWGR   26 (29)
T PF03742_consen    9 AALMVVFTFSLALVVWGR   26 (29)
T ss_dssp             HHHHHHHHHHHHHHHHHC
T ss_pred             HHHHHHHhccceeEEEec
Confidence            567888888887776654


No 87 
>PHA03231 glycoprotein BALF4; Provisional
Probab=23.20  E-value=60  Score=33.90  Aligned_cols=17  Identities=29%  Similarity=0.477  Sum_probs=7.7

Q ss_pred             HHHHHHHHHHHHHHHhh
Q psy10995        246 ILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       246 ~~~~~~~~~~~~~~~~~  262 (265)
                      |+|+||+.|++.|++|.
T Consensus       709 iillvia~vv~v~l~~r  725 (829)
T PHA03231        709 IGLLVIAVLVAVFLAYR  725 (829)
T ss_pred             HHHHHHHHhhhhhHHHH
Confidence            44444444444444443


No 88 
>PHA03255 BDLF3; Provisional
Probab=23.13  E-value=63  Score=27.16  Aligned_cols=15  Identities=27%  Similarity=0.831  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHH
Q psy10995        245 YILLIVLFAVSTLFA  259 (265)
Q Consensus       245 ~~~~~~~~~~~~~~~  259 (265)
                      |+||+.|||...+|+
T Consensus       194 flmlilifaagimma  208 (234)
T PHA03255        194 FLMLILIFAAGLMMS  208 (234)
T ss_pred             HHHHHHHHHhhHhhh


No 89 
>PF11712 Vma12:  Endoplasmic reticulum-based factor for assembly of V-ATPase;  InterPro: IPR021013 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include:   F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP.   V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins [].  The yeast vacuolar proton-translocating ATPase (V-ATPase) is the best characterised member of the V-ATPase family. A total of thirteen genes are required for encoding the subunits of the enzyme complex itself and an additional three for providing factors necessary for the assembly of the whole. Vma12 is one of these latter, all three of which are localised to the endoplasmic reticulum []. 
Probab=23.11  E-value=97  Score=24.80  Aligned_cols=25  Identities=16%  Similarity=0.328  Sum_probs=18.5

Q ss_pred             cchhhhhhhhHHHHHHHHHHHHHHH
Q psy10995        235 ATNRQIRESLYILLIVLFAVSTLFA  259 (265)
Q Consensus       235 ~~~~~~~~~~~~~~~~~~~~~~~~~  259 (265)
                      +...+++..+.+.+++|+++++.|+
T Consensus        70 ~~~k~~~~qls~v~Nilvsv~~~~~   94 (142)
T PF11712_consen   70 QELKSVKRQLSTVFNILVSVFAVFF   94 (142)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4457788888888888888765544


No 90 
>PF05084 GRA6:  Granule antigen protein (GRA6);  InterPro: IPR008119  Toxoplasma gondii is an obligate intracellular apicomplexan protozoan parasite, with a complex lifestyle involving varied hosts []. It has two phases of growth: an intestinal phase in feline hosts, and an extra-intestinal phase in other mammals. Oocysts from infected cats develop into tachyzoites, and eventually, bradyzoites and zoitocysts in the extraintestinal host []. Transmission of the parasite occurs through contact with infected cats or raw/undercooked meat; in immunocompromised individuals, it can cause severe and often lethal toxoplasmosis. Acute infection in healthy humans can sometimes also cause tissue damage [].  The protozoan utilises a variety of secretory and antigenic proteins to invade a host and gain access to the intracellular environment []. These originate from distinct organelles in the T. gondii cell termed micronemes, rhoptries, and dense granules. They are released at specific times during invasion to ensure the proteins are allocated to their correct target destinations []. Dense granule antigens (GRAs) are released from the T. gondii tachyzoite while still encapsulated in a host vacuole. Gra6, one of these moieties, is associated with the parasitophorous vacuole []. It possesses a hydrophobic central region flanked by two hydrophilic domains, and is present as a single copy gene in the Toxoplasma gondii genome []. Gra6 shares a similar function with Gra2, in that it is rapidly targeted to a network of membranous tubules that connect with the vacuolar membrane []. Indeed, these two proteins, together with Gra4, form a multimeric complex that stabilises the parasite within the vacuole.
Probab=22.64  E-value=1e+02  Score=25.68  Aligned_cols=27  Identities=33%  Similarity=0.314  Sum_probs=16.8

Q ss_pred             hhhhhhhhHHHHHHHHHHH-HHHHHhhhc
Q psy10995        237 NRQIRESLYILLIVLFAVS-TLFAFYACR  264 (265)
Q Consensus       237 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~  264 (265)
                      ..|.||-| |.-.||++++ +|++||.+|
T Consensus       146 R~Q~RHR~-IG~~VlA~~VA~L~~~F~RR  173 (215)
T PF05084_consen  146 RTQKRHRL-IGAVVLAVSVAMLTWFFLRR  173 (215)
T ss_pred             HHHHHHHH-HHHHHHHHHHHHHHHHHHHh
Confidence            46889988 6666655443 455556554


No 91 
>TIGR02115 potass_kdpF K+-transporting ATPase, KdpF subunit. This model describes a very small integral membrane peptide KdpF, a subunit of the K(+)-translocating Kdp complex. It is found upstream of the KdpA subunit (TIGR00680). Because of its very small size and highly hydrophobic character, it is sometimes missed in genome annotation.
Probab=22.61  E-value=1e+02  Score=17.59  Aligned_cols=7  Identities=43%  Similarity=1.032  Sum_probs=2.8

Q ss_pred             HHHHHhh
Q psy10995        256 TLFAFYA  262 (265)
Q Consensus       256 ~~~~~~~  262 (265)
                      ...|+|+
T Consensus        11 ~~YL~~a   17 (26)
T TIGR02115        11 FIYLFYA   17 (26)
T ss_pred             HHHHHHH
Confidence            3344443


No 92 
>PF07204 Orthoreo_P10:  Orthoreovirus membrane fusion protein p10;  InterPro: IPR009854 This family consists of several Orthoreovirus membrane fusion protein p10 sequences. p10 is thought to be a multifunctional protein that plays a key role in virus-host interaction [].
Probab=22.43  E-value=60  Score=24.50  Aligned_cols=12  Identities=17%  Similarity=0.412  Sum_probs=7.3

Q ss_pred             HHHHHHHhhhcC
Q psy10995        254 VSTLFAFYACRQ  265 (265)
Q Consensus       254 ~~~~~~~~~~~~  265 (265)
                      +++|+|||-||+
T Consensus        55 lIii~Lv~CC~~   66 (98)
T PF07204_consen   55 LIIIALVCCCRA   66 (98)
T ss_pred             HHHHHHHHHhhh
Confidence            334777777763


No 93 
>PF12575 DUF3753:  Protein of unknown function (DUF3753);  InterPro: IPR009175 This group represents an uncharacterised conserved protein belonging to poxvirus family I2.
Probab=21.95  E-value=1.5e+02  Score=21.30  Aligned_cols=12  Identities=17%  Similarity=0.445  Sum_probs=4.7

Q ss_pred             HHHHHHHHHHHH
Q psy10995        247 LLIVLFAVSTLF  258 (265)
Q Consensus       247 ~~~~~~~~~~~~  258 (265)
                      ++++++++++||
T Consensus        53 i~ii~v~ii~~l   64 (72)
T PF12575_consen   53 ISIIFVLIIVLL   64 (72)
T ss_pred             HHHHHHHHHHHH
Confidence            333333334443


No 94 
>PHA02692 hypothetical protein; Provisional
Probab=21.78  E-value=1.4e+02  Score=21.26  Aligned_cols=10  Identities=20%  Similarity=0.401  Sum_probs=3.8

Q ss_pred             HHHHHHHHHH
Q psy10995        250 VLFAVSTLFA  259 (265)
Q Consensus       250 ~~~~~~~~~~  259 (265)
                      +++++++||+
T Consensus        54 ~~~~~~vll~   63 (70)
T PHA02692         54 IAAAIGVLLC   63 (70)
T ss_pred             HHHHHHHHHH
Confidence            3333344443


No 95 
>PF03935 SKN1:  Beta-glucan synthesis-associated protein (SKN1);  InterPro: IPR005629  This family consists of the beta-glucan synthesis-associated proteins KRE6 and SKN1. Beta1,6-Glucan is a key component of the yeast cell wall, interconnecting cell wall proteins, beta1,3-glucan, and chitin. It has been postulated that the synthesis of beta1,6-glucan begins in the endoplasmic reticulum with the formation of protein-bound primer structures and that these primer structures are extended in the Golgi complex by two putative glucosyltransferases that are functionally redundant, Kre6 and Skn1. This is followed by maturation steps at the cell surface and by coupling to other cell wall macromolecules []. 
Probab=21.74  E-value=78  Score=31.15  Aligned_cols=27  Identities=22%  Similarity=0.335  Sum_probs=17.5

Q ss_pred             ccchhhhhhhhHHHHHHHHHHHHHHHHh
Q psy10995        234 SATNRQIRESLYILLIVLFAVSTLFAFY  261 (265)
Q Consensus       234 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~  261 (265)
                      ..+.|.+. ++..|+++++++++||+.|
T Consensus        36 ~~s~RGl~-Nlg~LilL~lGLL~LFigy   62 (504)
T PF03935_consen   36 IFSRRGLL-NLGGLILLILGLLMLFIGY   62 (504)
T ss_pred             ccchhhhh-HHHHHHHHHHHHHHHHHHh
Confidence            34445444 4556677777788888877


No 96 
>PTZ00227 variable surface protein Vir14; Provisional
Probab=21.30  E-value=65  Score=30.91  Aligned_cols=22  Identities=23%  Similarity=0.395  Sum_probs=12.1

Q ss_pred             hhhhHHHHHHHHHHH-HHHHHhh
Q psy10995        241 RESLYILLIVLFAVS-TLFAFYA  262 (265)
Q Consensus       241 ~~~~~~~~~~~~~~~-~~~~~~~  262 (265)
                      ++.+...++.||+++ +||++|+
T Consensus       324 ~~~~~~~~~~i~~~~~~~~~~yk  346 (418)
T PTZ00227        324 TDDITPPLLGIVGVLLILWALYK  346 (418)
T ss_pred             hhhccchHHHHHHHHHHHHHHHh
Confidence            344444455555544 4778884


No 97 
>PF11857 DUF3377:  Domain of unknown function (DUF3377);  InterPro: IPR021805  This domain is functionally uncharacterised and found at the C terminus of peptidases belonging to MEROPS peptidase family M10A, membrane-type matrix metallopeptidases (clan MA). ; GO: 0004222 metalloendopeptidase activity
Probab=21.25  E-value=73  Score=22.94  Aligned_cols=21  Identities=29%  Similarity=0.569  Sum_probs=14.2

Q ss_pred             hhhhhHHHHHHHHHHHHHHHH
Q psy10995        240 IRESLYILLIVLFAVSTLFAF  260 (265)
Q Consensus       240 ~~~~~~~~~~~~~~~~~~~~~  260 (265)
                      |-.-+.++|+|++.+-++|.|
T Consensus        34 VviPl~L~LCiLvl~yai~~f   54 (74)
T PF11857_consen   34 VVIPLVLLLCILVLIYAIFQF   54 (74)
T ss_pred             EeHHHHHHHHHHHHHHHhhee
Confidence            334466788888887777753


No 98 
>PF09788 Tmemb_55A:  Transmembrane protein 55A;  InterPro: IPR019178  Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction:  1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.  
Probab=20.75  E-value=1.2e+02  Score=27.02  Aligned_cols=12  Identities=8%  Similarity=-0.297  Sum_probs=5.9

Q ss_pred             CCCCCCCCCCCC
Q psy10995        222 TPTPPSGNDSPE  233 (265)
Q Consensus       222 CC~~C~~~~~~~  233 (265)
                      =||.|..-.+-+
T Consensus       179 RCPHCrKvSSVG  190 (256)
T PF09788_consen  179 RCPHCRKVSSVG  190 (256)
T ss_pred             cCCCCceecccc
Confidence            355666544333


No 99 
>PRK03625 tatE twin arginine translocase protein E; Validated
Probab=20.11  E-value=75  Score=22.40  Aligned_cols=10  Identities=20%  Similarity=0.398  Sum_probs=4.2

Q ss_pred             HHHHHHHHHH
Q psy10995        251 LFAVSTLFAF  260 (265)
Q Consensus       251 ~~~~~~~~~~  260 (265)
                      |+++++|++|
T Consensus        11 iIlvI~lllF   20 (67)
T PRK03625         11 VVAALVVLLF   20 (67)
T ss_pred             HHHHHHHHHc
Confidence            3344444443


No 100
>PF05795 Plasmodium_Vir:  Plasmodium vivax Vir protein;  InterPro: IPR008780 This family consists of several Vir proteins specific to the genus Plasmodium and Plasmodium vivax in particular. The vir genes are present at about 600-1,000 copies per haploid genome and encode proteins that are immunovariant in natural infections, indicating that they may have a functional role in establishing chronic infection through antigenic variation [].
Probab=20.08  E-value=77  Score=28.62  Aligned_cols=19  Identities=26%  Similarity=0.272  Sum_probs=14.9

Q ss_pred             hHHHHHHHHHHHHHHHHhh
Q psy10995        244 LYILLIVLFAVSTLFAFYA  262 (265)
Q Consensus       244 ~~~~~~~~~~~~~~~~~~~  262 (265)
                      ....|.++.++++||+||+
T Consensus       285 v~~~~~~~G~~~~~f~LYK  303 (354)
T PF05795_consen  285 VSPVLSVLGIPLIFFLLYK  303 (354)
T ss_pred             hhhhhhhHHHHHHHHHHhc
Confidence            3456777888888999997


No 101
>PHA02975 hypothetical protein; Provisional
Probab=20.01  E-value=1.3e+02  Score=21.38  Aligned_cols=16  Identities=19%  Similarity=0.636  Sum_probs=6.3

Q ss_pred             hHHHHHHHHHHHHHHH
Q psy10995        244 LYILLIVLFAVSTLFA  259 (265)
Q Consensus       244 ~~~~~~~~~~~~~~~~  259 (265)
                      +.|++++.+++++||.
T Consensus        46 ~~ii~i~~v~~~~~~~   61 (69)
T PHA02975         46 ILIIFIIFITCIAVFT   61 (69)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3344433333444443


Done!