Query         psy3106
Match_columns 433
No_of_seqs    342 out of 600
Neff          7.1 
Searched_HMMs 46136
Date          Fri Aug 16 16:38:03 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy3106.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3106hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 smart00241 ZP Zona pellucida (  99.8 1.2E-19 2.6E-24  175.5  16.8  195  154-359    16-242 (253)
  2 cd01099 PAN_AP_HGF Subfamily o  99.6 9.4E-16   2E-20  122.8   8.7   79   31-111     2-80  (80)
  3 PF00100 Zona_pellucida:  Zona   99.4 1.7E-12 3.8E-17  124.6  12.8  151  205-358    87-254 (265)
  4 PF00024 PAN_1:  PAN domain Thi  99.4 8.7E-13 1.9E-17  104.1   6.5   77   33-112     2-79  (79)
  5 smart00473 PAN_AP divergent su  98.7 4.6E-08   1E-12   76.5   7.3   76   32-111     3-78  (78)
  6 cd01099 PAN_AP_HGF Subfamily o  98.2   3E-06 6.4E-11   67.8   5.9   58  122-180     6-64  (80)
  7 PF00024 PAN_1:  PAN domain Thi  97.6 5.9E-05 1.3E-09   59.0   3.8   59  121-180     3-61  (79)
  8 smart00473 PAN_AP divergent su  97.1  0.0015 3.2E-08   50.6   6.0   54  122-176     6-59  (78)
  9 cd01100 APPLE_Factor_XI_like S  95.6   0.017 3.8E-07   44.9   4.0   55   33-89      4-58  (73)
 10 cd00129 PAN_APPLE PAN/APPLE-li  95.6   0.036 7.9E-07   44.3   5.9   73   34-110     3-79  (80)
 11 PF08276 PAN_2:  PAN-like domai  94.7   0.093   2E-06   40.1   5.8   58   29-88      1-61  (66)
 12 smart00223 APPLE APPLE domain.  92.9    0.16 3.4E-06   40.5   4.1   49   38-88      6-57  (79)
 13 PF14295 PAN_4:  PAN domain; PD  92.7   0.079 1.7E-06   37.6   2.1   33   51-85     13-50  (51)
 14 cd01098 PAN_AP_plant Plant PAN  91.3    0.35 7.5E-06   37.9   4.5   54   33-89     12-65  (84)
 15 PF08276 PAN_2:  PAN-like domai  86.9     1.6 3.5E-05   33.2   5.1   52  122-174     7-60  (66)
 16 cd00129 PAN_APPLE PAN/APPLE-li  82.4     2.7 5.9E-05   33.5   4.7   36  141-177    25-62  (80)
 17 cd01100 APPLE_Factor_XI_like S  71.5     5.7 0.00012   30.6   3.7   36  139-176    23-58  (73)
 18 PF08277 PAN_3:  PAN-like domai  63.1      17 0.00036   27.5   4.8   39   49-91     15-53  (71)
 19 PF05506 DUF756:  Domain of unk  62.4      64  0.0014   25.7   8.3   43  253-295    32-77  (89)
 20 PF14295 PAN_4:  PAN domain; PD  61.9     5.4 0.00012   27.9   1.7   33  139-172    14-50  (51)
 21 smart00223 APPLE APPLE domain.  60.2      13 0.00027   29.6   3.7   44  130-175    11-57  (79)
 22 cd01098 PAN_AP_plant Plant PAN  51.2      26 0.00056   27.0   4.2   32  141-174    32-63  (84)
 23 PTZ00265 multidrug resistance   49.3       8 0.00017   46.9   1.3   15  306-320   598-612 (1466)
 24 KOG3441|consensus               37.4      82  0.0018   27.1   5.2   67  250-325    54-124 (149)
 25 KOG4574|consensus               35.8      26 0.00057   39.4   2.5   32   35-66    515-547 (1007)
 26 COG3485 PcaH Protocatechuate 3  33.5      71  0.0015   30.6   4.8   71  254-331    74-145 (226)
 27 KOG4574|consensus               29.4      32  0.0007   38.7   1.9    6  380-385   948-953 (1007)
 28 smart00765 MANEC The MANEC dom  28.6 1.1E+02  0.0024   25.2   4.5   52   35-88     10-74  (93)
 29 PF02822 Antistasin:  Antistasi  28.3      33 0.00072   21.3   1.1    9  310-318    15-23  (26)
 30 PF07502 MANEC:  MANEC domain;   28.0 1.2E+02  0.0026   24.8   4.6   44   33-78      7-58  (92)
 31 PRK05483 rplN 50S ribosomal pr  26.9 1.2E+02  0.0026   26.2   4.7   61  269-329    31-105 (122)
 32 PF07502 MANEC:  MANEC domain;   24.3      82  0.0018   25.7   3.0   53  122-175     9-73  (92)
 33 PF10836 DUF2574:  Protein of u  23.9   1E+02  0.0022   25.0   3.3   57  242-303    26-82  (93)
 34 PF05688 DUF824:  Salmonella re  23.5      98  0.0021   22.1   2.9   18  269-286     7-24  (47)
 35 TIGR01067 rplN_bact ribosomal   23.3 1.5E+02  0.0033   25.6   4.6   61  269-329    31-105 (122)
 36 smart00605 CW CW domain.        22.1 2.5E+02  0.0055   22.4   5.6   37   51-90     19-55  (94)
 37 TIGR03673 rpl14p_arch 50S ribo  20.1 1.6E+02  0.0035   25.8   4.1   61  269-329    44-115 (131)

No 1  
>smart00241 ZP Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).
Probab=99.83  E-value=1.2e-19  Score=175.50  Aligned_cols=195  Identities=21%  Similarity=0.333  Sum_probs=141.2

Q ss_pred             CceEEEEecCCCC-ccceEEec--CCCceeeccccccccccccccc------------eeEEEe-eCCceeeeccc-eeE
Q psy3106         154 PYRCHSYDYGDTG-DLVCRLSH--HSRATLVGIQEYRNTVGIEVGN------------RNTVIQ-HHDTIITSSDL-GLA  216 (433)
Q Consensus       154 ~F~Crs~~~~~~~-~~~C~Ls~--~s~~t~~~~~~P~~~v~~~Cg~------------~~vvVq-~hp~~~T~~Dr-~y~  216 (433)
                      .|.++.+..+.+. +..|....  .++..+ .++.|+    ..||+            ++|+++ +||.+||..|+ .|+
T Consensus        16 ~~~g~i~~~~l~l~d~~C~~~~~~~~~~~~-~f~~~l----~~CGt~~~~~~~~~~ysn~v~~~~~~~~~itr~~~~~~~   90 (253)
T smart00241       16 LFPGGIYVKGLYLGDPSCRPVFTDSTSAFV-SFEVPL----NGCGTRRQVNPDGIVYSNTLVVSPFHPGFITRDDRAAYH   90 (253)
T ss_pred             CCCCeEEEeeEEeCCCCCCCccccCCCcEE-EEEecc----ccCCCeEEECCCeEEEEEEEEEccCCCCceEecCceEEE
Confidence            6778887776432 56899852  222222 123333    27987            789999 79999999999 999


Q ss_pred             EEEeecCCCceeeeceeeccccccccccccceecCCCcEEEEEEcCC--CCccccee---EeCCeEEEEEEEeCC-CCCc
Q psy3106         217 VTCQYDLTNKTVSNEVDLGVQGDVKSALTEEVVVDSPNVAMKITDRS--GADVKPSA---EVGDPLALRFEILDP-NSPY  290 (433)
Q Consensus       217 v~C~y~~~~~~v~~~~~v~~~~~~~~~~~~~~~~~~P~c~~~I~~~~--g~~v~~~a---~vGd~l~~~w~~~~~-~~~~  290 (433)
                      ++|.|....+ +...+.+....  +..+. ......|.|+|+|+.++  |+|+ ..+   +|||+|+|+|++... +..|
T Consensus        91 ~~C~y~~~~~-~~~~~~~~~~~--~~~~~-~~~~g~~~~~m~l~~~~~~~~~~-~~~~~~~lg~~l~~~~~~~~~~~~~~  165 (253)
T smart00241       91 FQCFYPENEK-VSLNLDVSTIP--PTELS-SVSEGPPTCSYRLYKDDSFGSPY-QSADYPVLGDPVYHEWSCDGADDPPL  165 (253)
T ss_pred             EEEEEeCCCc-eEEEEEecCCC--CCCcc-cccCCCcEEEEEeccCCCCCCcc-cCCCCcccCCeEEEEEEEccCCCCCe
Confidence            9999987555 33333333211  11121 12256789999999753  4555 333   799999999999654 7999


Q ss_pred             cEEEEEEEEEcCCC---CceEEEEcCCCCcccccccCcceeccc-cC----cccccccCCcceeEEE-eccCCcCCcc
Q psy3106         291 EIFVRELVAMDGVD---SSEIVLIDSNGCPTDHFIMGPLYKAAD-TG----KFLNCPHEGKCTLTMS-KVSSCPRSSI  359 (433)
Q Consensus       291 ~~~V~~C~a~dg~~---~~~~~lID~~GC~~d~~l~~~~~y~~~-~~----~sl~~~F~~~s~l~~~-~v~~C~~s~~  359 (433)
                      +|+|++|+|.++.+   +.++.||| +|||+|+.+++.+.|..+ ..    ....++|.+.+.+|++ .+..|.....
T Consensus       166 ~l~v~~C~at~~~~~~~~~~~~lI~-~GC~~d~~~~~~~~~~~~~~~~~~f~~~aF~F~~~~~v~~hC~v~vC~~~~~  242 (253)
T smart00241      166 GLLVDNCYATPGSDPSSGPKYFIID-NGCPVDGYLDSTIPYNSSPNHYARFSVKVFKFADRSLVYFHCQIRLCDKSDG  242 (253)
T ss_pred             EEEEeeEEEcCCCCCCCCCcEEEEE-CccCCCCccccceecCCCCcceEEEEEEEEEecCCCcEEEEEEEEEECCCCC
Confidence            99999999998852   68999999 999999999999999876 21    1346799999999997 8888876544


No 2  
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=99.63  E-value=9.4e-16  Score=122.82  Aligned_cols=79  Identities=34%  Similarity=0.538  Sum_probs=72.2

Q ss_pred             CCeEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCcccccCCCCcccCCCceEEec
Q psy3106          31 RAWCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDRITVAGHGAFEETKDIEYLEN  110 (433)
Q Consensus        31 ~~~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~~t~~~p~~f~~~~~~dY~En  110 (433)
                      ..|.|++++++.|+|+++.++.+.|+++|+.+|+.+..|.||||+|++.+++|+|+++++.+.  +..+..+.++|||||
T Consensus         2 ~~~~f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~~~f~CrSf~y~~~~~~C~L~~~~~~~~--~~~~~~~~~~dyyE~   79 (80)
T cd01099           2 NDFKFVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNYKSKECILSDEDRMSS--GVKLLYDSNVDYYEN   79 (80)
T ss_pred             CcccEEEecceeecceeeEEEecCCHHHHHHHhCCCCCceEeEEEEEcCCCEEEEeCCCcccc--ccccccccCceeeec
Confidence            358999999999999999999999999999999998889999999999999999999999876  466666668999998


Q ss_pred             C
Q psy3106         111 N  111 (433)
Q Consensus       111 ~  111 (433)
                      .
T Consensus        80 ~   80 (80)
T cd01099          80 K   80 (80)
T ss_pred             C
Confidence            4


No 3  
>PF00100 Zona_pellucida:  Zona pellucida-like domain;  InterPro: IPR001507 A large domain, containing around 260 amino acids, has been recognised in a variety of receptor-like eukaryotic glycoproteins []. All of these proteins are mosaic proteins composed of various domains and that all have a large extracellular region followed by either a transmembrane region and a very short cytoplasmic region or by a GPI-anchor. The domain common to all these proteins is located in the C-terminal portion of the extracellular region, and contains 8 conserved Cys residues, which are probably involved in disulphide bond formation. This entry includes a domain found in the zona pellucida proteins. The zona pellucida is a specialised extracellular matrix of mammailian eggs, which plays a crucial role in fertilisation by directly mediating species-restricted recongition between gametes. In mammals, up to four zona pellucida subnits are known, named ZP1-4. The full-length crystal structure of murine ZP3 has been solved []. This entry is also found in other glycoproteins, such as CD105 (also called endoglin) which is the regulatory component of the TGF-beta receptor complex. It is a modulator of cellular responses to TGF-beta 1. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 3D4G_G 3EF7_A 3D4C_A 3NK4_B 3NK3_B 3QW9_A.
Probab=99.41  E-value=1.7e-12  Score=124.61  Aligned_cols=151  Identities=21%  Similarity=0.335  Sum_probs=92.5

Q ss_pred             CceeeeccceeEEEEeecCCCceeeeceeeccccccccccccceecCCCcEEEEEEcCCC--C---cccceeEeCCeEEE
Q psy3106         205 DTIITSSDLGLAVTCQYDLTNKTVSNEVDLGVQGDVKSALTEEVVVDSPNVAMKITDRSG--A---DVKPSAEVGDPLAL  279 (433)
Q Consensus       205 p~~~T~~Dr~y~v~C~y~~~~~~v~~~~~v~~~~~~~~~~~~~~~~~~P~c~~~I~~~~g--~---~v~~~a~vGd~l~~  279 (433)
                      +.+++..|+.|+++|.|......+..... ...  ..+............++|++.....  .   .....+++||+|++
T Consensus        87 ~~i~~~~~~~~~~~C~y~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~m~l~~~~~~~~~~~~~~~~~~vg~~v~~  163 (265)
T PF00100_consen   87 GNITRDSDRRFNFTCTYSSSQYSVSSGSP-NIS--SVTIHISQESSGSFSFSMRLYKDDSFSSPYSQPPATVEVGDRVYV  163 (265)
T ss_dssp             TTECSSS-EEEEEEEEE-EC---------------B-TTT---STT--EEEEEEEESTTTSSCB---SSEEE-TT-EEEE
T ss_pred             CceEEecccccccEEEECCccceeccccc-cce--eccccceeccCCccceEEEeccCccccccccCCceeEEeeeeEEE
Confidence            35667789999999999433333322211 110  0010111112334457777776431  1   22256899999999


Q ss_pred             EEEEeCCC-CCccEEEEEEEEEcC---CCCceEEEEcCCCCcccccccCcceec---cccCc----ccccccCCcceeEE
Q psy3106         280 RFEILDPN-SPYEIFVRELVAMDG---VDSSEIVLIDSNGCPTDHFIMGPLYKA---ADTGK----FLNCPHEGKCTLTM  348 (433)
Q Consensus       280 ~w~~~~~~-~~~~~~V~~C~a~dg---~~~~~~~lID~~GC~~d~~l~~~~~y~---~~~~~----sl~~~F~~~s~l~~  348 (433)
                      +|++.... ..+.|+|++|+|.++   .++.++.|||++|||+|...++.+.+.   .+...    ...++|.....+|+
T Consensus       164 ~v~~~~~~~~~~~~~v~~C~a~~~~~~~~~~~~~lId~~GC~~~~~~~~~~~~~~~~~~~~~~~f~f~aF~f~~~~~v~i  243 (265)
T PF00100_consen  164 EVSVSPDSSDPLGLFVDSCWATPSSDPDSSPSYTLIDENGCPVDSDDFVTFEYSGPNGDSSRARFSFRAFRFPDSSSVYI  243 (265)
T ss_dssp             EEEEEECCC-TEEEEEEEEEEESSSSTTSSSCEEEEBCTTEEGGGCTTTEEEEETTTCCCCEEEEEEE-EEETSSSEEEE
T ss_pred             EEEeccccceeeEEEeCEEEEeCCCCcccccccceeccccccccccceeEEEeccccccccEEEEEeeEEEecCCCeEEE
Confidence            99974332 237799999999988   568899999999999999999999987   33211    24568888999999


Q ss_pred             E-eccCCcCCc
Q psy3106         349 S-KVSSCPRSS  358 (433)
Q Consensus       349 ~-~v~~C~~s~  358 (433)
                      + .+..|....
T Consensus       244 ~C~v~~C~~~~  254 (265)
T PF00100_consen  244 HCSVRVCDKSD  254 (265)
T ss_dssp             EEEEEEEETTS
T ss_pred             EEEEEEECCCC
Confidence            7 888887754


No 4  
>PF00024 PAN_1:  PAN domain This Prosite entry concerns apple domains, a subset of PAN domains;  InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=99.38  E-value=8.7e-13  Score=104.05  Aligned_cols=77  Identities=31%  Similarity=0.713  Sum_probs=68.3

Q ss_pred             eEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCcccccCCCCccc-CCCceEEecC
Q psy3106          33 WCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDRITVAGHGAFEE-TKDIEYLENN  111 (433)
Q Consensus        33 ~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~~t~~~p~~f~~-~~~~dY~En~  111 (433)
                      |.|++++|+.|.|.....+.+.++++|+.+|..+.. .|+||+|++..+.|.|++.++.+.  +..+.. ...++|||+.
T Consensus         2 ~~f~~~~~~~l~~~~~~~~~v~s~~~C~~~C~~~~~-~C~s~~y~~~~~~C~L~~~~~~~~--~~~~~~~~~~~~~ye~~   78 (79)
T PF00024_consen    2 WAFERIPGYRLSGHSIKEINVPSLEECAQLCLNEPR-RCKSFNYDPSSKTCYLSSSDRSSL--PPRLTPSSPNVDYYEKS   78 (79)
T ss_dssp             -TEEEEEEEEEESCEEEEEEESSHHHHHHHHHHSTT--ESEEEEETTTTEEEEECSSSSSE--STEEEEEEEEEEEEEEG
T ss_pred             CCeEEECCEEEeCCcceEEcCCCHHHHHhhcCcCcc-cCCeEEEECCCCEEEEcCCCCCcc--cceeEeccCcEEEEEec
Confidence            789999999999998888888999999999999876 799999999999999999999985  245555 7899999998


Q ss_pred             C
Q psy3106         112 C  112 (433)
Q Consensus       112 C  112 (433)
                      |
T Consensus        79 C   79 (79)
T PF00024_consen   79 C   79 (79)
T ss_dssp             C
T ss_pred             C
Confidence            8


No 5  
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=98.71  E-value=4.6e-08  Score=76.45  Aligned_cols=76  Identities=26%  Similarity=0.439  Sum_probs=61.8

Q ss_pred             CeEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCcccccCCCCcccCCCceEEecC
Q psy3106          32 AWCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDRITVAGHGAFEETKDIEYLENN  111 (433)
Q Consensus        32 ~~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~~t~~~p~~f~~~~~~dY~En~  111 (433)
                      ...|.++++..|.+++...+.+.++++|+++|+.+ .+.|+||+|.+..+.|+|+.+.....  +. ......++|||+.
T Consensus         3 ~~~f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~-~~~C~s~~y~~~~~~C~l~~~~~~~~--~~-~~~~~~~~~y~~~   78 (78)
T smart00473        3 DDCFVRLPNTKLPGFSRIVISVASLEECASKCLNS-NCSCRSFTYNNGTKGCLLWSESSLGD--AR-LFPSGGVDLYEKI   78 (78)
T ss_pred             CceeEEecCccCCCCcceeEcCCCHHHHHHHhCCC-CCceEEEEEcCCCCEEEEeeCCcccc--ce-ecccCCceeEEeC
Confidence            35799999999987777667889999999999985 58999999998899999999555444  22 4456788999974


No 6  
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=98.20  E-value=3e-06  Score=67.76  Aligned_cols=58  Identities=31%  Similarity=0.543  Sum_probs=48.7

Q ss_pred             eeeecCceecccceeeecccCHHHHHHhhhCC-CceEEEEecCCCCccceEEecCCCcee
Q psy3106         122 FKKLAGRILKTVDSVYQDVASIEECRELCLNS-PYRCHSYDYGDTGDLVCRLSHHSRATL  180 (433)
Q Consensus       122 ~~~~~~~~l~~~d~~~~~v~s~~eC~~~Cl~s-~F~Crs~~~~~~~~~~C~Ls~~s~~t~  180 (433)
                      |.+.++++|.+.......+.|+++|+++|+++ +|.||||+|.. ...+|+|+..++.+.
T Consensus         6 f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~~~f~CrSf~y~~-~~~~C~L~~~~~~~~   64 (80)
T cd01099           6 FVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNY-KSKECILSDEDRMSS   64 (80)
T ss_pred             EEEecceeecceeeEEEecCCHHHHHHHhCCCCCceEeEEEEEc-CCCEEEEeCCCcccc
Confidence            55778888877665555678999999999997 99999999997 589999999988544


No 7  
>PF00024 PAN_1:  PAN domain This Prosite entry concerns apple domains, a subset of PAN domains;  InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=97.61  E-value=5.9e-05  Score=58.97  Aligned_cols=59  Identities=37%  Similarity=0.731  Sum_probs=48.5

Q ss_pred             EeeeecCceecccceeeecccCHHHHHHhhhCCCceEEEEecCCCCccceEEecCCCcee
Q psy3106         121 EFKKLAGRILKTVDSVYQDVASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSHHSRATL  180 (433)
Q Consensus       121 ~~~~~~~~~l~~~d~~~~~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~~~C~Ls~~s~~t~  180 (433)
                      .|+++.|..|.+......+|.++++|+++|+..+..|++|.|.. ....|.|...++.+.
T Consensus         3 ~f~~~~~~~l~~~~~~~~~v~s~~~C~~~C~~~~~~C~s~~y~~-~~~~C~L~~~~~~~~   61 (79)
T PF00024_consen    3 AFERIPGYRLSGHSIKEINVPSLEECAQLCLNEPRRCKSFNYDP-SSKTCYLSSSDRSSL   61 (79)
T ss_dssp             TEEEEEEEEEESCEEEEEEESSHHHHHHHHHHSTT-ESEEEEET-TTTEEEEECSSSSSE
T ss_pred             CeEEECCEEEeCCcceEEcCCCHHHHHhhcCcCcccCCeEEEEC-CCCEEEEcCCCCCcc
Confidence            58888998888864444488999999999999866999999997 588999999888764


No 8  
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=97.07  E-value=0.0015  Score=50.61  Aligned_cols=54  Identities=37%  Similarity=0.642  Sum_probs=42.3

Q ss_pred             eeeecCceecccceeeecccCHHHHHHhhhCCCceEEEEecCCCCccceEEecCC
Q psy3106         122 FKKLAGRILKTVDSVYQDVASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSHHS  176 (433)
Q Consensus       122 ~~~~~~~~l~~~d~~~~~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~~~C~Ls~~s  176 (433)
                      |.++++..|...+.....+.++++|+++|+++.+.|++|+|.. ....|+|...+
T Consensus         6 f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~~~~C~s~~y~~-~~~~C~l~~~~   59 (78)
T smart00473        6 FVRLPNTKLPGFSRIVISVASLEECASKCLNSNCSCRSFTYNN-GTKGCLLWSES   59 (78)
T ss_pred             eEEecCccCCCCcceeEcCCCHHHHHHHhCCCCCceEEEEEcC-CCCEEEEeeCC
Confidence            6667777776444333456799999999999779999999987 68899999833


No 9  
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=95.55  E-value=0.017  Score=44.93  Aligned_cols=55  Identities=20%  Similarity=0.372  Sum_probs=47.0

Q ss_pred             eEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCC
Q psy3106          33 WCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMD   89 (433)
Q Consensus        33 ~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~   89 (433)
                      .+|+..++..+.|....+..+.+.++|+++|+.+  -.|+.|.|....+.|.|-...
T Consensus         4 ~C~~~~~~~~~~g~d~~~~~~~s~~~Cq~~C~~~--~~C~afT~~~~~~~C~lk~~~   58 (73)
T cd01100           4 SCFRQGSNVDFRGGDLSTVFASSAEQCQAACTAD--PGCLAFTYNTKSKKCFLKSSE   58 (73)
T ss_pred             ccccccCCCccccCCcceeecCCHHHHHHHcCCC--CCceEEEEECCCCeEEcccCC
Confidence            4566667888888877777788999999999986  689999999989999998875


No 10 
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins,  plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=95.55  E-value=0.036  Score=44.27  Aligned_cols=73  Identities=19%  Similarity=0.327  Sum_probs=48.0

Q ss_pred             EEEEecCeEEeccee--eEeecCCHHHHHHHccc-CCCceeeeEEeeCCCCceEeecCCc-ccccCCCCcccCCCceEEe
Q psy3106          34 CFDRVHGYSLLGFVK--KVGTATSRTDCFEMCLG-ERDFICRSANYNNATGECHMSDMDR-ITVAGHGAFEETKDIEYLE  109 (433)
Q Consensus        34 ~Fer~~g~~L~g~~~--~~~~~~s~~eC~~~C~~-e~~f~CrS~~y~~~~~~C~Ls~~~~-~t~~~p~~f~~~~~~dY~E  109 (433)
                      .|-+.+|..|.....  ....+.+++||+++|++ +.++.|+++.|....+.|.+-..+. ...   ..+. +.+.+.|+
T Consensus         3 ~~~~~~g~fl~~~~~klpd~~~~s~~eC~~~Cl~~~~nCsC~Aya~~~~~~gC~~W~~~l~~d~---~~~~-~~g~~Ly~   78 (80)
T cd00129           3 EFCKSAGTTLIKIALKIKTTKANTADECANRCEKNGLPFSCKAFVFAKARKQCLWFPFNSMSGV---RKEF-SHGFDLYE   78 (80)
T ss_pred             eeeecCCeEEEeecccCCcccccCHHHHHHHHhcCCCCCCceeeeccCCCCCeEEecCcchhhH---Hhcc-CCCceeEe
Confidence            456667666642211  12244799999999998 3369999999976678999998877 332   2221 33566665


Q ss_pred             c
Q psy3106         110 N  110 (433)
Q Consensus       110 n  110 (433)
                      +
T Consensus        79 r   79 (80)
T cd00129          79 N   79 (80)
T ss_pred             E
Confidence            4


No 11 
>PF08276 PAN_2:  PAN-like domain;  InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=94.71  E-value=0.093  Score=40.11  Aligned_cols=58  Identities=22%  Similarity=0.429  Sum_probs=45.9

Q ss_pred             CCCCeEEEEecCeEEecceeeEe-ecCCHHHHHHHcccCCCceeeeEEeeC--CCCceEeecC
Q psy3106          29 CERAWCFDRVHGYSLLGFVKKVG-TATSRTDCFEMCLGERDFICRSANYNN--ATGECHMSDM   88 (433)
Q Consensus        29 C~~~~~Fer~~g~~L~g~~~~~~-~~~s~~eC~~~C~~e~~f~CrS~~y~~--~~~~C~Ls~~   88 (433)
                      |+..-.|.++++..|.+.....+ .+.++++|..+|++.  =.|.++.|..  ..+.|.|-..
T Consensus         1 C~~~d~F~~l~~~~~p~~~~~~~~~~~s~~~C~~~Cl~n--CsC~Ayay~~~~~~~~C~lW~~   61 (66)
T PF08276_consen    1 CGSGDGFLKLPNMKLPDFDNAIVDSSVSLEECEKACLSN--CSCTAYAYSNLSGGGGCLLWYG   61 (66)
T ss_pred             CcCCCEEEEECCeeCCCCcceeeecCCCHHHHHhhcCCC--CCEeeEEeeccCCCCEEEEEcC
Confidence            44456899999999977655554 679999999999976  5799999975  5689987653


No 12 
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=92.88  E-value=0.16  Score=40.52  Aligned_cols=49  Identities=14%  Similarity=0.218  Sum_probs=43.5

Q ss_pred             ecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCC---ceEeecC
Q psy3106          38 VHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATG---ECHMSDM   88 (433)
Q Consensus        38 ~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~---~C~Ls~~   88 (433)
                      ..+..+.|...+.+.+.+.++|++.|...  -.|+.|.|.....   .|.|-..
T Consensus         6 ~~~~df~G~Dl~~~~~~~~~~Cq~~Ct~~--~~C~~FTf~~~~~~~~~C~LK~s   57 (79)
T smart00223        6 YKNVDFRGSDINTVYVPSAQVCQKRCTSH--PRCLFFTFSTNEPPEEKCLLKDS   57 (79)
T ss_pred             ccCccccCceeeeeecCCHHHHHHhhcCC--CCccEEEeeCCCCCCCEeEeCcC
Confidence            45777889999999999999999999976  5899999998888   9999876


No 13 
>PF14295 PAN_4:  PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=92.72  E-value=0.079  Score=37.60  Aligned_cols=33  Identities=27%  Similarity=0.631  Sum_probs=17.3

Q ss_pred             eecCCHHHHHHHcccCCCceeeeEEeeC-----CCCceEe
Q psy3106          51 GTATSRTDCFEMCLGERDFICRSANYNN-----ATGECHM   85 (433)
Q Consensus        51 ~~~~s~~eC~~~C~~e~~f~CrS~~y~~-----~~~~C~L   85 (433)
                      +.+.+.++|+++|...  ..|..+.|..     ..+.|.|
T Consensus        13 ~~~~s~~~C~~~C~~~--~~C~~~~~~~~~~~~~~~~C~L   50 (51)
T PF14295_consen   13 VTASSPEECQAACAAD--PGCQAFTFNPPGCPSSSGRCYL   50 (51)
T ss_dssp             -----HHHHHHHHHTS--TT--EEEEETTEE---------
T ss_pred             ccCCCHHHHHHHccCC--CCCCEEEEECCCcccccccccC
Confidence            3788999999999976  5799999998     5777776


No 14 
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=91.33  E-value=0.35  Score=37.92  Aligned_cols=54  Identities=28%  Similarity=0.428  Sum_probs=42.7

Q ss_pred             eEEEEecCeEEecceeeEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCC
Q psy3106          33 WCFDRVHGYSLLGFVKKVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMD   89 (433)
Q Consensus        33 ~~Fer~~g~~L~g~~~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~   89 (433)
                      ..|..+++..+.+..... ...++++|.++|++.  =.|.+|.|....+.|.+-...
T Consensus        12 ~~f~~~~~~~~~~~~~~~-~~~s~~~C~~~Cl~n--CsC~a~~~~~~~~~C~~~~~~   65 (84)
T cd01098          12 DGFLKLPDVKLPDNASAI-TAISLEECREACLSN--CSCTAYAYNNGSGGCLLWNGL   65 (84)
T ss_pred             CEEEEeCCeeCCCchhhh-ccCCHHHHHHHHhcC--CCcceeeecCCCCeEEEEece
Confidence            689888888776554443 778999999999975  689999998667999885444


No 15 
>PF08276 PAN_2:  PAN-like domain;  InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=86.93  E-value=1.6  Score=33.23  Aligned_cols=52  Identities=33%  Similarity=0.606  Sum_probs=35.7

Q ss_pred             eeeecCceecccceee-ecccCHHHHHHhhhCCCceEEEEecCCC-CccceEEec
Q psy3106         122 FKKLAGRILKTVDSVY-QDVASIEECRELCLNSPYRCHSYDYGDT-GDLVCRLSH  174 (433)
Q Consensus       122 ~~~~~~~~l~~~d~~~-~~v~s~~eC~~~Cl~s~F~Crs~~~~~~-~~~~C~Ls~  174 (433)
                      |.++++..++..+..+ ....++++|+.+|++. =.|.+|+|... +...|.+-.
T Consensus         7 F~~l~~~~~p~~~~~~~~~~~s~~~C~~~Cl~n-CsC~Ayay~~~~~~~~C~lW~   60 (66)
T PF08276_consen    7 FLKLPNMKLPDFDNAIVDSSVSLEECEKACLSN-CSCTAYAYSNLSGGGGCLLWY   60 (66)
T ss_pred             EEEECCeeCCCCcceeeecCCCHHHHHhhcCCC-CCEeeEEeeccCCCCEEEEEc
Confidence            5566666665554333 2446999999999996 13999998741 457898854


No 16 
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins,  plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=82.37  E-value=2.7  Score=33.46  Aligned_cols=36  Identities=17%  Similarity=0.580  Sum_probs=29.2

Q ss_pred             cCHHHHHHhhhCC--CceEEEEecCCCCccceEEecCCC
Q psy3106         141 ASIEECRELCLNS--PYRCHSYDYGDTGDLVCRLSHHSR  177 (433)
Q Consensus       141 ~s~~eC~~~Cl~s--~F~Crs~~~~~~~~~~C~Ls~~s~  177 (433)
                      .+++||+++|+++  .+.|++|.|.. ....|.+-..+.
T Consensus        25 ~s~~eC~~~Cl~~~~nCsC~Aya~~~-~~~gC~~W~~~l   62 (80)
T cd00129          25 NTADECANRCEKNGLPFSCKAFVFAK-ARKQCLWFPFNS   62 (80)
T ss_pred             cCHHHHHHHHhcCCCCCCceeeeccC-CCCCeEEecCcc
Confidence            4899999999994  68899999875 245899987655


No 17 
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=71.55  E-value=5.7  Score=30.60  Aligned_cols=36  Identities=25%  Similarity=0.593  Sum_probs=29.5

Q ss_pred             cccCHHHHHHhhhCCCceEEEEecCCCCccceEEecCC
Q psy3106         139 DVASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSHHS  176 (433)
Q Consensus       139 ~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~~~C~Ls~~s  176 (433)
                      .+.+.++|++.|+..+ .|+.|.|.. ....|.|-...
T Consensus        23 ~~~s~~~Cq~~C~~~~-~C~afT~~~-~~~~C~lk~~~   58 (73)
T cd01100          23 FASSAEQCQAACTADP-GCLAFTYNT-KSKKCFLKSSE   58 (73)
T ss_pred             ecCCHHHHHHHcCCCC-CceEEEEEC-CCCeEEcccCC
Confidence            3669999999999974 599999985 57899997654


No 18 
>PF08277 PAN_3:  PAN-like domain;  InterPro: IPR006583 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs The PAN-3 or CW is a domain associated with a number of Caenorhabditis elegans hypothetical proteins.
Probab=63.14  E-value=17  Score=27.51  Aligned_cols=39  Identities=28%  Similarity=0.589  Sum_probs=32.9

Q ss_pred             eEeecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCcc
Q psy3106          49 KVGTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDRI   91 (433)
Q Consensus        49 ~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~~   91 (433)
                      ......+.++|+..|...  -.|--|-|.  .+.|.|......
T Consensus        15 ~~~~~~sw~~Cv~~C~~~--~~C~la~~~--~~~C~~y~~~~i   53 (71)
T PF08277_consen   15 SSTTNTSWDDCVQKCYND--ENCVLAYFD--SGKCYLYNYGSI   53 (71)
T ss_pred             ccccCCCHHHHhHHhCCC--CEEEEEEeC--CCCEEEEEcCCE
Confidence            445678999999999987  589999887  889999988764


No 19 
>PF05506 DUF756:  Domain of unknown function (DUF756);  InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=62.45  E-value=64  Score=25.74  Aligned_cols=43  Identities=16%  Similarity=0.343  Sum_probs=33.0

Q ss_pred             CcEEEEEEcC---CCCcccceeEeCCeEEEEEEEeCCCCCccEEEE
Q psy3106         253 PNVAMKITDR---SGADVKPSAEVGDPLALRFEILDPNSPYEIFVR  295 (433)
Q Consensus       253 P~c~~~I~~~---~g~~v~~~a~vGd~l~~~w~~~~~~~~~~~~V~  295 (433)
                      -.+.++|.+.   .+.+..-.++-|+.+.+.|......+-|++.|+
T Consensus        32 ~~~~~~v~~~~y~~~~~~~~~v~ag~~~~~~w~l~~s~gwYDl~v~   77 (89)
T PF05506_consen   32 AAVTFTVYDNAYGGGGPWTYTVAAGQTVSLTWPLAASGGWYDLTVT   77 (89)
T ss_pred             CcEEEEEEeCCcCCCCCEEEEECCCCEEEEEEeecCCCCcEEEEEE
Confidence            3567777762   235666788889999999988556899999886


No 20 
>PF14295 PAN_4:  PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=61.89  E-value=5.4  Score=27.86  Aligned_cols=33  Identities=24%  Similarity=0.588  Sum_probs=16.1

Q ss_pred             cccCHHHHHHhhhCCCceEEEEecCCC----CccceEE
Q psy3106         139 DVASIEECRELCLNSPYRCHSYDYGDT----GDLVCRL  172 (433)
Q Consensus       139 ~v~s~~eC~~~Cl~s~F~Crs~~~~~~----~~~~C~L  172 (433)
                      .+.+.++|+++|... -.|..|+|...    ....|+|
T Consensus        14 ~~~s~~~C~~~C~~~-~~C~~~~~~~~~~~~~~~~C~L   50 (51)
T PF14295_consen   14 TASSPEECQAACAAD-PGCQAFTFNPPGCPSSSGRCYL   50 (51)
T ss_dssp             ----HHHHHHHHHTS-TT--EEEEETTEE---------
T ss_pred             cCCCHHHHHHHccCC-CCCCEEEEECCCcccccccccC
Confidence            456999999999975 35999988751    4567765


No 21 
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=60.23  E-value=13  Score=29.55  Aligned_cols=44  Identities=20%  Similarity=0.430  Sum_probs=31.7

Q ss_pred             ecccceeeecccCHHHHHHhhhCCCceEEEEecCCCCcc---ceEEecC
Q psy3106         130 LKTVDSVYQDVASIEECRELCLNSPYRCHSYDYGDTGDL---VCRLSHH  175 (433)
Q Consensus       130 l~~~d~~~~~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~~---~C~Ls~~  175 (433)
                      +.+.|.....+.+.++|+++|... -.|+.|.|.. ...   .|+|-..
T Consensus        11 f~G~Dl~~~~~~~~~~Cq~~Ct~~-~~C~~FTf~~-~~~~~~~C~LK~s   57 (79)
T smart00223       11 FRGSDINTVYVPSAQVCQKRCTSH-PRCLFFTFST-NEPPEEKCLLKDS   57 (79)
T ss_pred             ccCceeeeeecCCHHHHHHhhcCC-CCccEEEeeC-CCCCCCEeEeCcC
Confidence            334444444567999999999975 3599999985 344   8999653


No 22 
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=51.18  E-value=26  Score=27.00  Aligned_cols=32  Identities=44%  Similarity=0.910  Sum_probs=24.9

Q ss_pred             cCHHHHHHhhhCCCceEEEEecCCCCccceEEec
Q psy3106         141 ASIEECRELCLNSPYRCHSYDYGDTGDLVCRLSH  174 (433)
Q Consensus       141 ~s~~eC~~~Cl~s~F~Crs~~~~~~~~~~C~Ls~  174 (433)
                      .++++|+++|++. =.|..|.|.. ....|.+-.
T Consensus        32 ~s~~~C~~~Cl~n-CsC~a~~~~~-~~~~C~~~~   63 (84)
T cd01098          32 ISLEECREACLSN-CSCTAYAYNN-GSGGCLLWN   63 (84)
T ss_pred             CCHHHHHHHHhcC-CCcceeeecC-CCCeEEEEe
Confidence            4889999999986 3488888875 367898753


No 23 
>PTZ00265 multidrug resistance protein (mdr1); Provisional
Probab=49.26  E-value=8  Score=46.93  Aligned_cols=15  Identities=20%  Similarity=0.481  Sum_probs=9.2

Q ss_pred             ceEEEEcCCCCcccc
Q psy3106         306 SEIVLIDSNGCPTDH  320 (433)
Q Consensus       306 ~~~~lID~~GC~~d~  320 (433)
                      -+++|+||-==++|+
T Consensus       598 P~ILlLDEpTSaLD~  612 (1466)
T PTZ00265        598 PKILILDEATSSLDN  612 (1466)
T ss_pred             CCEEEEeCcccccCH
Confidence            466777766555553


No 24 
>KOG3441|consensus
Probab=37.43  E-value=82  Score=27.14  Aligned_cols=67  Identities=25%  Similarity=0.325  Sum_probs=42.5

Q ss_pred             cCCCcEEEEEEcCCCCcccceeEeCCeEEEEEEEeCCCCCccEEEEEEEEEc----CCCCceEEEEcCCCCcccccccCc
Q psy3106         250 VDSPNVAMKITDRSGADVKPSAEVGDPLALRFEILDPNSPYEIFVRELVAMD----GVDSSEIVLIDSNGCPTDHFIMGP  325 (433)
Q Consensus       250 ~~~P~c~~~I~~~~g~~v~~~a~vGd~l~~~w~~~~~~~~~~~~V~~C~a~d----g~~~~~~~lID~~GC~~d~~l~~~  325 (433)
                      ...|.|. .+....     ..+++||.+.+-+.=   .-.-.+.|..|.-..    ..+.+.+.|||++|=|+...|--+
T Consensus        54 gr~PrCI-HVYkkr-----gvg~~GDkiLvAIkG---QmkKa~vVGh~~~~k~~~P~fDsNniVLiddnGnPlGtRI~~P  124 (149)
T KOG3441|consen   54 GRLPRCI-HVYKKR-----GVGELGDKILVAIKG---QMKKAYVVGHVHYRKHGVPVFDSNNIVLIDDNGNPLGTRITAP  124 (149)
T ss_pred             CCCCceE-EEEecc-----cccccccEEEEEEec---ceeeeEEEEeeccCCCCCcccCCCcEEEECCCCCcccceEecc
Confidence            4566664 343333     367889999877753   333355777765532    123468899999999998766443


No 25 
>KOG4574|consensus
Probab=35.75  E-value=26  Score=39.39  Aligned_cols=32  Identities=19%  Similarity=0.017  Sum_probs=18.8

Q ss_pred             EEEecCeEE-ecceeeEeecCCHHHHHHHcccC
Q psy3106          35 FDRVHGYSL-LGFVKKVGTATSRTDCFEMCLGE   66 (433)
Q Consensus        35 Fer~~g~~L-~g~~~~~~~~~s~~eC~~~C~~e   66 (433)
                      ||..+=+++ +.+.--.+....+++=+..|+.+
T Consensus       515 fe~~klRe~rksedsn~~S~pEied~ai~mLDe  547 (1007)
T KOG4574|consen  515 FEAPKLRELRKSEDSNRLSAPEIEDLAILMLDE  547 (1007)
T ss_pred             CccchhhHHhhhcccCcccchhHHHHHHHHhcc
Confidence            555554444 34445555666677777777765


No 26 
>COG3485 PcaH Protocatechuate 3,4-dioxygenase beta subunit [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=33.50  E-value=71  Score=30.64  Aligned_cols=71  Identities=17%  Similarity=0.174  Sum_probs=46.4

Q ss_pred             cEEEEEEcCCCCcccceeEeCCeEEEEEEEeCCCCCccEEEEEEEE-EcCCCCceEEEEcCCCCcccccccCcceeccc
Q psy3106         254 NVAMKITDRSGADVKPSAEVGDPLALRFEILDPNSPYEIFVRELVA-MDGVDSSEIVLIDSNGCPTDHFIMGPLYKAAD  331 (433)
Q Consensus       254 ~c~~~I~~~~g~~v~~~a~vGd~l~~~w~~~~~~~~~~~~V~~C~a-~dg~~~~~~~lID~~GC~~d~~l~~~~~y~~~  331 (433)
                      .++.+|+|.+|.|| ..|.     .-.|.| +..+.|.-.--+=.+ .+.-.|...++.|++|=-.-..|.|....-.+
T Consensus        74 ~l~G~VlD~~G~Pv-~~A~-----VEiWQA-da~GrY~~~~d~~~~~~~~f~g~Gr~~Td~~G~y~F~Ti~Pg~yp~~~  145 (226)
T COG3485          74 LLEGRVLDGNGRPV-PDAL-----VEIWQA-DADGRYSHPKDSRLAPLPNFNGRGRTITDEDGEYRFRTIKPGPYPWRN  145 (226)
T ss_pred             EEEEEEECCCCCCC-CCCE-----EEEEEc-CCCCcccCccccccCcCccccceEEEEeCCCceEEEEEeecccccCCC
Confidence            67889999999998 6665     578999 456666420000000 01233677899999998877788886555444


No 27 
>KOG4574|consensus
Probab=29.37  E-value=32  Score=38.73  Aligned_cols=6  Identities=33%  Similarity=0.512  Sum_probs=2.4

Q ss_pred             CCceee
Q psy3106         380 NGESVN  385 (433)
Q Consensus       380 ng~~~n  385 (433)
                      +|+.+-
T Consensus       948 n~t~ll  953 (1007)
T KOG4574|consen  948 NGTKLL  953 (1007)
T ss_pred             Ccccee
Confidence            344433


No 28 
>smart00765 MANEC The MANEC domain was formerly called MANSC. This domain, comprising 8 conserved cysteines, is found in the N terminus of higher multicellular animal membrane and extracellular proteins. It is postulated that this domain may play a role in the formation of protein complexes involving various protease activators and inhibitors. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulfide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase.
Probab=28.60  E-value=1.1e+02  Score=25.15  Aligned_cols=52  Identities=17%  Similarity=0.326  Sum_probs=35.3

Q ss_pred             EEEecCeEEe-cc----eeeE----eecCCHHHHHHHcccCCCceeeeEEeeC----CCCceEeecC
Q psy3106          35 FDRVHGYSLL-GF----VKKV----GTATSRTDCFEMCLGERDFICRSANYNN----ATGECHMSDM   88 (433)
Q Consensus        35 Fer~~g~~L~-g~----~~~~----~~~~s~~eC~~~C~~e~~f~CrS~~y~~----~~~~C~Ls~~   88 (433)
                      |++.++++|. +.    ..+.    ..+.+.++|..+|=...  .|-=|.|..    ..+.|.|..=
T Consensus        10 f~~~~n~ii~t~~S~~~Ga~fL~~p~~~~s~edC~~aCC~~~--~CnlAv~e~~~~~~~~~CyLf~C   74 (93)
T smart00765       10 FRVLENAIIRTEESLSAGARFLKSPIAVNTWEDCVRACCSTP--NCNLAVFELRREDAEGNCYLFNC   74 (93)
T ss_pred             cccCCCeeeccchhhhccccccCCccccCCHHHHHHHHcCCC--CCcEEEEeccCCCCCCceEEEEc
Confidence            7778888884 11    1122    46789999999999775  488888852    2466766553


No 29 
>PF02822 Antistasin:  Antistasin family;  InterPro: IPR004094 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This group of serine protease inhibitors belong to MEROPS inhibitor family I15, clan IO. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) [] and are characterised by a well conserved pattern of cysteine residues. Many of the proteins that belong to this family are anti-coagulants.; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1HIA_I 1BX8_A 1BX7_A 3BG4_D 1C9P_B 1EJA_B 1SKZ_A.
Probab=28.28  E-value=33  Score=21.26  Aligned_cols=9  Identities=67%  Similarity=1.265  Sum_probs=6.3

Q ss_pred             EEcCCCCcc
Q psy3106         310 LIDSNGCPT  318 (433)
Q Consensus       310 lID~~GC~~  318 (433)
                      ..|++||++
T Consensus        15 ~~D~~GC~~   23 (26)
T PF02822_consen   15 KTDENGCPT   23 (26)
T ss_dssp             EE-TTSSEE
T ss_pred             cCCCCCCCc
Confidence            568999986


No 30 
>PF07502 MANEC:  MANEC domain;  InterPro: IPR011106 The MANSC (motif at N terminus with seven cysteines) domain is a module with a well-conserved seven cysteine motif that is present at the N terminus of higher multicellular animal membrane and extracellular proteins. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulphide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase [].
Probab=28.04  E-value=1.2e+02  Score=24.77  Aligned_cols=44  Identities=18%  Similarity=0.353  Sum_probs=31.7

Q ss_pred             eEEEEecCeEEe-------cce-eeEeecCCHHHHHHHcccCCCceeeeEEeeC
Q psy3106          33 WCFDRVHGYSLL-------GFV-KKVGTATSRTDCFEMCLGERDFICRSANYNN   78 (433)
Q Consensus        33 ~~Fer~~g~~L~-------g~~-~~~~~~~s~~eC~~~C~~e~~f~CrS~~y~~   78 (433)
                      +.|+..++++|.       |.. .....+.+.++|+.+|=..++  |-=|.|..
T Consensus         7 ~~f~~~~d~iIdt~~S~~~GA~fl~~~~v~s~~dC~~aCC~~~~--Cnlav~e~   58 (92)
T PF07502_consen    7 DDFRSFPDFIIDTEESVENGATFLSSPEVSSAEDCVRACCSTPN--CNLAVFEE   58 (92)
T ss_pred             cccccCCCeEEehHHHHHCCCeecCccccCCHHHHHHHhCCCCC--CcEEEEec
Confidence            458888898885       111 122378999999999997754  88888864


No 31 
>PRK05483 rplN 50S ribosomal protein L14; Validated
Probab=26.93  E-value=1.2e+02  Score=26.16  Aligned_cols=61  Identities=25%  Similarity=0.403  Sum_probs=40.4

Q ss_pred             ceeEeCCeEEEEEEEeC-------CCCCccEEEEEEEE---EcC----CCCceEEEEcCCCCcccccccCcceec
Q psy3106         269 PSAEVGDPLALRFEILD-------PNSPYEIFVRELVA---MDG----VDSSEIVLIDSNGCPTDHFIMGPLYKA  329 (433)
Q Consensus       269 ~~a~vGd~l~~~w~~~~-------~~~~~~~~V~~C~a---~dg----~~~~~~~lID~~GC~~d~~l~~~~~y~  329 (433)
                      ..|.|||.+...+.=..       ++-...+-|+.-.-   .||    .+++.+.|||++|=|+-..|++++.+.
T Consensus        31 ~~a~iGD~I~vsVkk~~~~~~~kkg~v~~AvIVrtkk~~~r~dG~~i~F~dNavVLin~~~~p~GTrI~Gpv~~e  105 (122)
T PRK05483         31 RYASIGDVIVVSVKEAIPRGKVKKGDVVKAVVVRTKKGVRRPDGSYIRFDDNAAVLLNNDGEPRGTRIFGPVARE  105 (122)
T ss_pred             CccccCCEEEEEEEEcCCCCcccCCCEeeEEEEEeccceecCCCCEEEcCCCEEEEECCCCCEeEeEEeccchHH
Confidence            57888888877664211       12223444555443   233    335788999999999999999998764


No 32 
>PF07502 MANEC:  MANEC domain;  InterPro: IPR011106 The MANSC (motif at N terminus with seven cysteines) domain is a module with a well-conserved seven cysteine motif that is present at the N terminus of higher multicellular animal membrane and extracellular proteins. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulphide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase [].
Probab=24.33  E-value=82  Score=25.75  Aligned_cols=53  Identities=26%  Similarity=0.503  Sum_probs=33.5

Q ss_pred             eeeecCceecccceee--------ecccCHHHHHHhhhCCCceEEEEecCCCCc---cc-eEEecC
Q psy3106         122 FKKLAGRILKTVDSVY--------QDVASIEECRELCLNSPYRCHSYDYGDTGD---LV-CRLSHH  175 (433)
Q Consensus       122 ~~~~~~~~l~~~d~~~--------~~v~s~~eC~~~Cl~s~F~Crs~~~~~~~~---~~-C~Ls~~  175 (433)
                      |..+++.+|..-+++.        ..|.+.++|..+|=..+= |-.+++.....   .. |.|..=
T Consensus         9 f~~~~d~iIdt~~S~~~GA~fl~~~~v~s~~dC~~aCC~~~~-Cnlav~e~~~~~~~~~~CyLf~C   73 (92)
T PF07502_consen    9 FRSFPDFIIDTEESVENGATFLSSPEVSSAEDCVRACCSTPN-CNLAVFEERPTGSEDNSCYLFNC   73 (92)
T ss_pred             cccCCCeEEehHHHHHCCCeecCccccCCHHHHHHHhCCCCC-CcEEEEeccCCCCCCceEEEEEc
Confidence            5556666666533211        267899999999987632 88888764322   23 888664


No 33 
>PF10836 DUF2574:  Protein of unknown function (DUF2574)  ;  InterPro: IPR020386 This entry contains proteins with no known function.
Probab=23.90  E-value=1e+02  Score=24.97  Aligned_cols=57  Identities=18%  Similarity=0.179  Sum_probs=34.0

Q ss_pred             cccccceecCCCcEEEEEEcCCCCcccceeEeCCeEEEEEEEeCCCCCccEEEEEEEEEcCC
Q psy3106         242 SALTEEVVVDSPNVAMKITDRSGADVKPSAEVGDPLALRFEILDPNSPYEIFVRELVAMDGV  303 (433)
Q Consensus       242 ~~~~~~~~~~~P~c~~~I~~~~g~~v~~~a~vGd~l~~~w~~~~~~~~~~~~V~~C~a~dg~  303 (433)
                      .+|.+.+....|+|+++|++..-     .-+=|+.++|.=.-...+.+-.=.+.+-++-.+.
T Consensus        26 ATLtIsGrv~~PTCS~dvVn~~~-----QQrCGq~~~~~~~~~~~~tp~~GV~Tevv~vp~d   82 (93)
T PF10836_consen   26 ATLTISGRVSPPTCSTDVVNNQL-----QQRCGQNTYLVNTQNRASTPARGVTTEVVTVPGD   82 (93)
T ss_pred             eEEEEcceEcCCcccHHHhccch-----hhhhccceEeeeccccccccccceEEEEEEcCCC
Confidence            35666677889999999997542     2234666554432211133444457777777664


No 34 
>PF05688 DUF824:  Salmonella repeat of unknown function (DUF824);  InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=23.49  E-value=98  Score=22.15  Aligned_cols=18  Identities=28%  Similarity=0.551  Sum_probs=15.5

Q ss_pred             ceeEeCCeEEEEEEEeCC
Q psy3106         269 PSAEVGDPLALRFEILDP  286 (433)
Q Consensus       269 ~~a~vGd~l~~~w~~~~~  286 (433)
                      ..||+||.|.|.++..+.
T Consensus         7 akaK~Ge~I~ltVt~kda   24 (47)
T PF05688_consen    7 AKAKVGETIPLTVTVKDA   24 (47)
T ss_pred             hheecCCeEEEEEEEECC
Confidence            589999999999998653


No 35 
>TIGR01067 rplN_bact ribosomal protein L14, bacterial/organelle. This model distinguishes bacterial and most organellar examples of ribosomal protein L14 from all archaeal and eukaryotic forms.
Probab=23.35  E-value=1.5e+02  Score=25.57  Aligned_cols=61  Identities=28%  Similarity=0.365  Sum_probs=39.5

Q ss_pred             ceeEeCCeEEEEEEEeC-------CCCCccEEEEEEEE---EcC----CCCceEEEEcCCCCcccccccCcceec
Q psy3106         269 PSAEVGDPLALRFEILD-------PNSPYEIFVRELVA---MDG----VDSSEIVLIDSNGCPTDHFIMGPLYKA  329 (433)
Q Consensus       269 ~~a~vGd~l~~~w~~~~-------~~~~~~~~V~~C~a---~dg----~~~~~~~lID~~GC~~d~~l~~~~~y~  329 (433)
                      ..|.|||.+...+.=..       ++-.+.+-|+.-.-   .||    .+++.+.|||++|=|+-..|++++.+.
T Consensus        31 ~~a~iGD~I~vsVk~~~~~~~~kkg~v~~AvIVrtkk~~~r~dG~~i~F~~Na~VLin~~~~p~GTrI~Gpv~~e  105 (122)
T TIGR01067        31 RYATVGDVIVVVVKDAIPNGKVKKGDVVKAVIVRTKKGVRRKDGSYIRFDDNACVLINKNKEPRGTRIFGPVARE  105 (122)
T ss_pred             CccccCCEEEEEEEEcCCCCccccccEEEEEEEEeecceEeCCCCEEECCCceEEEECCCCCEeeeEEEccchHH
Confidence            46888888877664211       11223444554433   233    345788999999999999999988764


No 36 
>smart00605 CW CW domain.
Probab=22.08  E-value=2.5e+02  Score=22.43  Aligned_cols=37  Identities=19%  Similarity=0.395  Sum_probs=28.7

Q ss_pred             eecCCHHHHHHHcccCCCceeeeEEeeCCCCceEeecCCc
Q psy3106          51 GTATSRTDCFEMCLGERDFICRSANYNNATGECHMSDMDR   90 (433)
Q Consensus        51 ~~~~s~~eC~~~C~~e~~f~CrS~~y~~~~~~C~Ls~~~~   90 (433)
                      ....+.++|...|...  ..|--|-+. ....|.|.....
T Consensus        19 ~~~~sw~~Ci~~C~~~--~~Cvlay~~-~~~~C~~f~~~~   55 (94)
T smart00605       19 AATLSWDECIQKCYED--SNCVLAYGN-SSETCYLFSYGT   55 (94)
T ss_pred             ccCCCHHHHHHHHhCC--CceEEEecC-CCCceEEEEcCC
Confidence            3577899999999976  589866443 368999988864


No 37 
>TIGR03673 rpl14p_arch 50S ribosomal protein L14P. Part of the 50S ribosomal subunit. Forms a cluster with proteins L3 and L24e, part of which may contact the 16S rRNA in 2 intersubunit bridges.
Probab=20.13  E-value=1.6e+02  Score=25.82  Aligned_cols=61  Identities=25%  Similarity=0.420  Sum_probs=40.6

Q ss_pred             ceeEeCCeEEEEEEEeC----CCCCccEEEEEEEE---EcC----CCCceEEEEcCCCCcccccccCcceec
Q psy3106         269 PSAEVGDPLALRFEILD----PNSPYEIFVRELVA---MDG----VDSSEIVLIDSNGCPTDHFIMGPLYKA  329 (433)
Q Consensus       269 ~~a~vGd~l~~~w~~~~----~~~~~~~~V~~C~a---~dg----~~~~~~~lID~~GC~~d~~l~~~~~y~  329 (433)
                      .+|.|||.+...+.=..    ++-.+.+-|+.-.-   .||    .+++.+.|||++|=|+-..|++++.+.
T Consensus        44 ~~a~iGD~IvvsVK~~~p~~kg~v~kAVIVRtkk~~~R~dGs~i~FddNa~VLin~~~~P~GTRI~GpV~rE  115 (131)
T TIGR03673        44 PCAGVGDMVVVSVKKGTPEMRKQVFKAVVVRQRKEYRRPDGTRVKFEDNAVVIVTPDGEPKGTEIKGPVARE  115 (131)
T ss_pred             CccccCCEEEEEEEECCccccCCEeEEEEEEeCcceecCCCcEEEeCCcEEEEECCCCCEeeeEEEccchHH
Confidence            57889998877765211    12234444544433   233    235778999999999999999998765


Done!