Query         psy1623
Match_columns 522
No_of_seqs    224 out of 412
Neff          5.8 
Searched_HMMs 46136
Date          Sat Aug 17 00:11:30 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1623.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1623hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 smart00241 ZP Zona pellucida ( 100.0 8.8E-30 1.9E-34  252.9  24.4  220  129-390     1-239 (253)
  2 PF00100 Zona_pellucida:  Zona   99.9 5.8E-24 1.3E-28  208.4  22.4  224  129-392     1-254 (265)
  3 cd01099 PAN_AP_HGF Subfamily o  99.3 8.1E-12 1.8E-16  104.3   6.5   77   23-108     4-80  (80)
  4 PF00024 PAN_1:  PAN domain Thi  98.5 1.7E-07 3.6E-12   76.1   5.7   74   23-108     4-78  (79)
  5 smart00473 PAN_AP divergent su  98.0 1.6E-05 3.5E-10   64.0   6.3   75   21-108     4-78  (78)
  6 cd01100 APPLE_Factor_XI_like S  91.4    0.28   6E-06   39.9   4.1   56   20-84      3-58  (73)
  7 cd00129 PAN_APPLE PAN/APPLE-li  91.2    0.35 7.7E-06   40.6   4.6   56   42-107    23-79  (80)
  8 PF14295 PAN_4:  PAN domain; PD  88.0    0.31 6.7E-06   36.1   1.7   38   41-80     13-50  (51)
  9 smart00223 APPLE APPLE domain.  86.0     1.1 2.3E-05   37.6   4.0   47   33-83     11-57  (79)
 10 PF08276 PAN_2:  PAN-like domai  74.7     5.7 0.00012   31.8   4.5   59   20-85      4-63  (66)
 11 cd01098 PAN_AP_plant Plant PAN  49.4      18 0.00038   29.2   3.0   37   42-85     30-66  (84)
 12 PF00558 Vpu:  Vpu protein;  In  36.2      24 0.00052   30.0   1.8   20  502-521     8-27  (81)
 13 PF05454 DAG1:  Dystroglycan (D  31.0      16 0.00035   38.0   0.0   23  499-522   149-171 (290)
 14 PF10177 DUF2371:  Uncharacteri  28.3      48   0.001   31.1   2.6   30  490-522    35-64  (141)
 15 PF14814 UB2H:  Bifunctional tr  27.1      98  0.0021   26.0   4.1   34  357-390    33-66  (85)
 16 PF15102 TMEM154:  TMEM154 prot  25.8      52  0.0011   31.0   2.4   23  499-522    61-83  (146)
 17 PF09680 Tiny_TM_bacill:  Prote  24.2      44 0.00096   22.0   1.1   15  502-516     7-21  (24)
 18 PF01299 Lamp:  Lysosome-associ  22.9      78  0.0017   32.8   3.3   39  484-522   258-296 (306)
 19 PF07502 MANEC:  MANEC domain;   22.4 1.2E+02  0.0025   26.3   3.7   41   41-83     33-73  (92)
 20 COG4064 MtrG Tetrahydromethano  22.3 1.1E+02  0.0024   25.4   3.3   23  493-515    50-72  (75)
 21 PF07787 DUF1625:  Protein of u  21.8      88  0.0019   31.5   3.3   25  498-522   223-247 (248)

No 1  
>smart00241 ZP Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).
Probab=99.97  E-value=8.8e-30  Score=252.87  Aligned_cols=220  Identities=22%  Similarity=0.370  Sum_probs=169.6

Q ss_pred             EeCCCcEEEEEeCCCCceEEEEecCCcc---CceEeccCCceeEEEEecccCCCCCCceee--C--ceEEEEEEEE-ecC
Q psy1623         129 VCSEDGMEFTLRTPEGFLGRIYTYGYYD---RCFYRGNGGTANVLRISGAQGYPDCGTQRY--G--DTMTNIVVVQ-FSD  200 (522)
Q Consensus       129 ~C~~~~m~V~v~t~~pF~G~IYvkg~~~---~C~~~~~~~~~~~l~i~l~~~~~~CG~~~~--~--~~~s~~VVVQ-~hp  200 (522)
                      +|.+++|.|.|++..+|.|+||+++.+.   +|+..+...+...+.|.++  .++||+++.  +  .+|+|+|++| +|+
T Consensus         1 ~C~~~~m~v~v~~~~~~~g~i~~~~l~l~d~~C~~~~~~~~~~~~~f~~~--l~~CGt~~~~~~~~~~ysn~v~~~~~~~   78 (253)
T smart00241        1 QCGEDQMVVSVSTDLLFPGGIYVKGLYLGDPSCRPVFTDSTSAFVSFEVP--LNGCGTRRQVNPDGIVYSNTLVVSPFHP   78 (253)
T ss_pred             CCCCCEEEEEEEecCCCCCeEEEeeEEeCCCCCCCccccCCCcEEEEEec--cccCCCeEEECCCeEEEEEEEEEccCCC
Confidence            5999999999999999999999999774   8999643333334445444  459999853  2  2799999999 899


Q ss_pred             eEeeccce-EEEEEeeeCCCCCeEEEeeccccCCCCCcceeecCccCCCCCceEEEEee---CCCceeee---ecCCeEE
Q psy1623         201 YVQTSRDK-RYNLTCLFRGPGEAVVTSSYMTANSGSPIPIEYLPAENSLSSRVRLQILF---QGRPTNTI---AVGDPLT  273 (522)
Q Consensus       201 ~~vT~~Dr-~y~V~C~y~~~~e~~vt~~~~~~~~g~p~pi~~l~~e~~~~~~v~m~I~~---~G~~v~~a---~VGd~L~  273 (522)
                      .++|..|+ .|+++|.|....+ +. ..+...   ...+. .+.......+.++|++..   .|+++..+   ++||+|+
T Consensus        79 ~~itr~~~~~~~~~C~y~~~~~-~~-~~~~~~---~~~~~-~~~~~~~g~~~~~m~l~~~~~~~~~~~~~~~~~lg~~l~  152 (253)
T smart00241       79 GFITRDDRAAYHFQCFYPENEK-VS-LNLDVS---TIPPT-ELSSVSEGPPTCSYRLYKDDSFGSPYQSADYPVLGDPVY  152 (253)
T ss_pred             CceEecCceEEEEEEEEeCCCc-eE-EEEEec---CCCCC-CcccccCCCcEEEEEeccCCCCCCcccCCCCcccCCeEE
Confidence            99999999 9999999997433 32 222111   11111 111112234578999985   46677544   6999999


Q ss_pred             EEEEEeCCCCceecEEEEEEEEECCCC---CceEEEEcCCCCceecccccCCCCCCCCCCCccccccceeccCCCCCCcc
Q psy1623         274 FKLDSQDGYNYVTDIFATNVIARDPYS---GRSVQLIDRYGYWIIESTFNHLPVNQCSPQRSFSALTVTIFSFRCPVDNF  350 (522)
Q Consensus       274 l~we~~~~~~~~y~i~V~~C~A~Dg~~---~~~~~LIDe~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ii~ngC~~D~~  350 (522)
                      |+|++.......++|+|++|+|.++.+   +.++.||| +|                                 ||+|+.
T Consensus       153 ~~~~~~~~~~~~~~l~v~~C~at~~~~~~~~~~~~lI~-~G---------------------------------C~~d~~  198 (253)
T smart00241      153 HEWSCDGADDPPLGLLVDNCYATPGSDPSSGPKYFIID-NG---------------------------------CPVDGY  198 (253)
T ss_pred             EEEEEccCCCCCeEEEEeeEEEcCCCCCCCCCcEEEEE-Cc---------------------------------cCCCCc
Confidence            999998642358899999999999853   68888998 88                                 999999


Q ss_pred             ccCCeeeccC-CCeEEEEEEeEEcCCCCeEEEEEEEEEccC
Q psy1623         351 VFPALDRSRE-GDGLEARFNAFKIPESNFLVFEATVRTCRD  390 (522)
Q Consensus       351 i~~~l~~~~~-~~~l~a~f~AFKFpds~~V~f~C~V~~C~~  390 (522)
                      +++.+.+..+ +..++..|+||||+++..|+|+|+|++|.+
T Consensus       199 ~~~~~~~~~~~~~~~~f~~~aF~F~~~~~v~~hC~v~vC~~  239 (253)
T smart00241      199 LDSTIPYNSSPNHYARFSVKVFKFADRSLVYFHCQIRLCDK  239 (253)
T ss_pred             cccceecCCCCcceEEEEEEEEEecCCCcEEEEEEEEEECC
Confidence            9999998765 678889999999999999999999999985


No 2  
>PF00100 Zona_pellucida:  Zona pellucida-like domain;  InterPro: IPR001507 A large domain, containing around 260 amino acids, has been recognised in a variety of receptor-like eukaryotic glycoproteins []. All of these proteins are mosaic proteins composed of various domains and that all have a large extracellular region followed by either a transmembrane region and a very short cytoplasmic region or by a GPI-anchor. The domain common to all these proteins is located in the C-terminal portion of the extracellular region, and contains 8 conserved Cys residues, which are probably involved in disulphide bond formation. This entry includes a domain found in the zona pellucida proteins. The zona pellucida is a specialised extracellular matrix of mammailian eggs, which plays a crucial role in fertilisation by directly mediating species-restricted recongition between gametes. In mammals, up to four zona pellucida subnits are known, named ZP1-4. The full-length crystal structure of murine ZP3 has been solved []. This entry is also found in other glycoproteins, such as CD105 (also called endoglin) which is the regulatory component of the TGF-beta receptor complex. It is a modulator of cellular responses to TGF-beta 1. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 3D4G_G 3EF7_A 3D4C_A 3NK4_B 3NK3_B 3QW9_A.
Probab=99.92  E-value=5.8e-24  Score=208.42  Aligned_cols=224  Identities=25%  Similarity=0.413  Sum_probs=146.7

Q ss_pred             EeCCCcEEEEEeCCCCceEEEEec-CCcc--CceE-eccCCceeEEEEecccCCCCCCceeeC---c--eEEEEEEEEec
Q psy1623         129 VCSEDGMEFTLRTPEGFLGRIYTY-GYYD--RCFY-RGNGGTANVLRISGAQGYPDCGTQRYG---D--TMTNIVVVQFS  199 (522)
Q Consensus       129 ~C~~~~m~V~v~t~~pF~G~IYvk-g~~~--~C~~-~~~~~~~~~l~i~l~~~~~~CG~~~~~---~--~~s~~VVVQ~h  199 (522)
                      +|+++.|.|.|++..+|.|.||+. ++..  +|+. ...+.....++|  ++....||+.+..   +  .|+++|+++.+
T Consensus         1 ~C~~~~~~v~i~~~~~~~~~i~~~~~~~~~~~C~~~~~~~~~~~~~~~--~~~~~~Cg~~~~~~~~~~~~y~n~i~~~~~   78 (265)
T PF00100_consen    1 TCGSDSMTVTIKKDYLFTGNIYVSDLHLNDPSCRARSDTNSTHITFSI--PFNSPSCGTTRSIENGDSIVYSNTIVSSQD   78 (265)
T ss_dssp             EE-SSEEEEEEESSTTSSS-B-GGGEEETTTTB--EEECTTTEEEEEE--ECCC--CTSEEEE--SSEEEEEEEEEEE--
T ss_pred             CcCCCEEEEEEEeccccCCCEEeeeeEECCCCCCCcccCCCcEEEEEE--ecCCCCCcccccccccCcceeeeeeeeecc
Confidence            699999999999999999999984 4443  7999 444444434444  3332249998332   2  58999998876


Q ss_pred             C--------eEeeccceEEEEEeeeCCCCCeEEEeeccccCCCCCcceeecCccCCCCCceEEEEeeC-------CCcee
Q psy1623         200 D--------YVQTSRDKRYNLTCLFRGPGEAVVTSSYMTANSGSPIPIEYLPAENSLSSRVRLQILFQ-------GRPTN  264 (522)
Q Consensus       200 p--------~~vT~~Dr~y~V~C~y~~~~e~~vt~~~~~~~~g~p~pi~~l~~e~~~~~~v~m~I~~~-------G~~v~  264 (522)
                      .        .+++..|..|+++|.|... ...++.......   ..++.... .......++|++..+       ..+..
T Consensus        79 ~~~~~~~~~~i~~~~~~~~~~~C~y~~~-~~~~~~~~~~~~---~~~~~~~~-~~~~~~~~~m~l~~~~~~~~~~~~~~~  153 (265)
T PF00100_consen   79 VSFNDSPGGNITRDSDRRFNFTCTYSSS-QYSVSSGSPNIS---SVTIHISQ-ESSGSFSFSMRLYKDDSFSSPYSQPPA  153 (265)
T ss_dssp             SSSS---STTECSSS-EEEEEEEEE-EC----------------B-TTT----STT--EEEEEEEESTTTSSCB---SSE
T ss_pred             ccccccccCceEEecccccccEEEECCc-cceeccccccce---ecccccee-ccCCccceEEEeccCccccccccCCce
Confidence            4        5777899999999999321 212221111000   00110000 111122477777742       24557


Q ss_pred             eeecCCeEEEEEEEeCCCCceecEEEEEEEEECC---CCCceEEEEcCCCCceecccccCCCCCCCCCCCccccccceec
Q psy1623         265 TIAVGDPLTFKLDSQDGYNYVTDIFATNVIARDP---YSGRSVQLIDRYGYWIIESTFNHLPVNQCSPQRSFSALTVTIF  341 (522)
Q Consensus       265 ~a~VGd~L~l~we~~~~~~~~y~i~V~~C~A~Dg---~~~~~~~LIDe~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ii  341 (522)
                      .+++||+|+++|++.....+.+.++|++|+|.++   .++..+.|||++|                              
T Consensus       154 ~~~vg~~v~~~v~~~~~~~~~~~~~v~~C~a~~~~~~~~~~~~~lId~~G------------------------------  203 (265)
T PF00100_consen  154 TVEVGDRVYVEVSVSPDSSDPLGLFVDSCWATPSSDPDSSPSYTLIDENG------------------------------  203 (265)
T ss_dssp             EE-TT-EEEEEEEEEECCC-TEEEEEEEEEEESSSSTTSSSCEEEEBCTT------------------------------
T ss_pred             eEEeeeeEEEEEEeccccceeeEEEeCEEEEeCCCCcccccccceecccc------------------------------
Confidence            8899999999999876532226699999999998   6678899999999                              


Q ss_pred             cCCCCCCccccCCeeec---cCCCeEEEEEEeEEcCCCCeEEEEEEEEEccCCC
Q psy1623         342 SFRCPVDNFVFPALDRS---REGDGLEARFNAFKIPESNFLVFEATVRTCRDGC  392 (522)
Q Consensus       342 ~ngC~~D~~i~~~l~~~---~~~~~l~a~f~AFKFpds~~V~f~C~V~~C~~~C  392 (522)
                         ||+|...++.+.+.   .++..+...|+||||+++..|+|+|+|++|...|
T Consensus       204 ---C~~~~~~~~~~~~~~~~~~~~~~~f~f~aF~f~~~~~v~i~C~v~~C~~~~  254 (265)
T PF00100_consen  204 ---CPVDSDDFVTFEYSGPNGDSSRARFSFRAFRFPDSSSVYIHCSVRVCDKSD  254 (265)
T ss_dssp             ---EEGGGCTTTEEEEETTTCCCCEEEEEEE-EEETSSSEEEEEEEEEEEETTS
T ss_pred             ---ccccccceeEEEeccccccccEEEEEeeEEEecCCCeEEEEEEEEEECCCC
Confidence               99999999999987   5677899999999999999999999999999875


No 3  
>cd01099 PAN_AP_HGF Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=99.27  E-value=8.1e-12  Score=104.29  Aligned_cols=77  Identities=32%  Similarity=0.508  Sum_probs=63.5

Q ss_pred             cccccccccccccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCccccccCCCCceeecCCc
Q psy1623          23 SNNIGVRMRMRRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDARDLDMNNPSYFEPTGDF  102 (522)
Q Consensus        23 f~~vg~~~r~~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~~sl~~~~p~~~~~~~~~  102 (522)
                      |..+-..++++.+++++.+.+.|+++|+++|+.+.+|.||||+|+..     ...|+|++.++.+.    +..+....++
T Consensus         4 ~~f~~~~~~~l~~~~~~~~~~~s~~~C~~~C~~~~~f~CrSf~y~~~-----~~~C~L~~~~~~~~----~~~~~~~~~~   74 (80)
T cd01099           4 FKFVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNYK-----SKECILSDEDRMSS----GVKLLYDSNV   74 (80)
T ss_pred             ccEEEecceeecceeeEEEecCCHHHHHHHhCCCCCceEeEEEEEcC-----CCEEEEeCCCcccc----ccccccccCc
Confidence            33444456778999999999999999999999999999999999864     79999999999754    2334445589


Q ss_pred             ceeecc
Q psy1623         103 DFYEKS  108 (522)
Q Consensus       103 dYyE~~  108 (522)
                      |||||.
T Consensus        75 dyyE~~   80 (80)
T cd01099          75 DYYENK   80 (80)
T ss_pred             eeeecC
Confidence            999984


No 4  
>PF00024 PAN_1:  PAN domain This Prosite entry concerns apple domains, a subset of PAN domains;  InterPro: IPR003014 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs It has been shown that, the N-terminal N domains of members of the plasminogen/hepatocyte growth factor family, the apple domains of the plasma prekallikrein/coagulation factor XI family, and domains of various nematode proteins belong to the same module superfamily, the PAN module []. PAN contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge that links the N and C termini of the domain.; PDB: 1GP9_C 2QJ2_B 1GMO_H 1NK1_B 3MKP_B 1BHT_B 3HN4_A 1GMN_A 3HMS_A 3HMT_B ....
Probab=98.51  E-value=1.7e-07  Score=76.08  Aligned_cols=74  Identities=31%  Similarity=0.551  Sum_probs=58.1

Q ss_pred             cccccccccccccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCccccccCCCCceee-cCC
Q psy1623          23 SNNIGVRMRMRRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDARDLDMNNPSYFEP-TGD  101 (522)
Q Consensus        23 f~~vg~~~r~~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~~sl~~~~p~~~~~-~~~  101 (522)
                      |+++. +. .+.+.....+.+.++++|+++|..+.. .|+||+|+..     .+.|+|++.+..+.    +..+.. .++
T Consensus         4 f~~~~-~~-~l~~~~~~~~~v~s~~~C~~~C~~~~~-~C~s~~y~~~-----~~~C~L~~~~~~~~----~~~~~~~~~~   71 (79)
T PF00024_consen    4 FERIP-GY-RLSGHSIKEINVPSLEECAQLCLNEPR-RCKSFNYDPS-----SKTCYLSSSDRSSL----PPRLTPSSPN   71 (79)
T ss_dssp             EEEEE-EE-EEESCEEEEEEESSHHHHHHHHHHSTT--ESEEEEETT-----TTEEEEECSSSSSE----STEEEEEEEE
T ss_pred             eEEEC-CE-EEeCCcceEEcCCCHHHHHhhcCcCcc-cCCeEEEECC-----CCEEEEcCCCCCcc----cceeEeccCc
Confidence            66666 23 345566666799999999999999987 8999999864     79999999999764    234555 789


Q ss_pred             cceeecc
Q psy1623         102 FDFYEKS  108 (522)
Q Consensus       102 ~dYyE~~  108 (522)
                      ++|||+.
T Consensus        72 ~~~ye~~   78 (79)
T PF00024_consen   72 VDYYEKS   78 (79)
T ss_dssp             EEEEEEG
T ss_pred             EEEEEec
Confidence            9999984


No 5  
>smart00473 PAN_AP divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Probab=97.99  E-value=1.6e-05  Score=63.95  Aligned_cols=75  Identities=27%  Similarity=0.497  Sum_probs=53.1

Q ss_pred             cccccccccccccccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCccccccCCCCceeecC
Q psy1623          21 PISNNIGVRMRMRRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDARDLDMNNPSYFEPTG  100 (522)
Q Consensus        21 ~~f~~vg~~~r~~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~~sl~~~~p~~~~~~~  100 (522)
                      ..|..+.+.. + .++......+.++++|+++|+.+ ++.|+||+|+.     +.+.|.|++....    .++. ...+.
T Consensus         4 ~~f~~~~~~~-l-~~~~~~~~~~~s~~~C~~~C~~~-~~~C~s~~y~~-----~~~~C~l~~~~~~----~~~~-~~~~~   70 (78)
T smart00473        4 DCFVRLPNTK-L-PGFSRIVISVASLEECASKCLNS-NCSCRSFTYNN-----GTKGCLLWSESSL----GDAR-LFPSG   70 (78)
T ss_pred             ceeEEecCcc-C-CCCcceeEcCCCHHHHHHHhCCC-CCceEEEEEcC-----CCCEEEEeeCCcc----ccce-ecccC
Confidence            4566666332 3 34555456789999999999997 68999999974     2689999993322    2222 45577


Q ss_pred             Ccceeecc
Q psy1623         101 DFDFYEKS  108 (522)
Q Consensus       101 ~~dYyE~~  108 (522)
                      +.+|||+.
T Consensus        71 ~~~~y~~~   78 (78)
T smart00473       71 GVDLYEKI   78 (78)
T ss_pred             CceeEEeC
Confidence            89999974


No 6  
>cd01100 APPLE_Factor_XI_like Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=91.41  E-value=0.28  Score=39.88  Aligned_cols=56  Identities=18%  Similarity=0.258  Sum_probs=40.8

Q ss_pred             ccccccccccccccccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCC
Q psy1623          20 LPISNNIGVRMRMRRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRD   84 (522)
Q Consensus        20 ~~~f~~vg~~~r~~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d   84 (522)
                      ..+|.++ ... -..++......+.+.++|+++|+.+.  .|..|+|+..     .+.|.|....
T Consensus         3 ~~C~~~~-~~~-~~~g~d~~~~~~~s~~~Cq~~C~~~~--~C~afT~~~~-----~~~C~lk~~~   58 (73)
T cd01100           3 SSCFRQG-SNV-DFRGGDLSTVFASSAEQCQAACTADP--GCLAFTYNTK-----SKKCFLKSSE   58 (73)
T ss_pred             ccccccc-CCC-ccccCCcceeecCCHHHHHHHcCCCC--CceEEEEECC-----CCeEEcccCC
Confidence            4566665 111 13345555667899999999999975  7999999843     7999998875


No 7  
>cd00129 PAN_APPLE PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins,  plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Probab=91.17  E-value=0.35  Score=40.61  Aligned_cols=56  Identities=21%  Similarity=0.476  Sum_probs=39.2

Q ss_pred             eccCHHHHHHHhhc-CCCeEEeeEEeccCCCCCCCCCeEeccCCccccccCCCCceeecCCcceeec
Q psy1623          42 TAASLAACERECAE-SRDFICRSFNYRAVPYGGQRENCELSDRDARDLDMNNPSYFEPTGDFDFYEK  107 (522)
Q Consensus        42 ~v~S~~~C~~~C~~-e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~~sl~~~~p~~~~~~~~~dYyE~  107 (522)
                      ...+.++|+++|+. +-++.|+.|.|...     ...|.+-..+.. +++.   .+ ...+.|.|+|
T Consensus        23 ~~~s~~eC~~~Cl~~~~nCsC~Aya~~~~-----~~gC~~W~~~l~-~d~~---~~-~~~g~~Ly~r   79 (80)
T cd00129          23 KANTADECANRCEKNGLPFSCKAFVFAKA-----RKQCLWFPFNSM-SGVR---KE-FSHGFDLYEN   79 (80)
T ss_pred             cccCHHHHHHHHhcCCCCCCceeeeccCC-----CCCeEEecCcch-hhHH---hc-cCCCceeEeE
Confidence            34799999999998 44799999999643     357999988873 2222   12 2346677765


No 8  
>PF14295 PAN_4:  PAN domain; PDB: 2YIL_E 2YIP_C 2YIO_A.
Probab=87.98  E-value=0.31  Score=36.07  Aligned_cols=38  Identities=26%  Similarity=0.545  Sum_probs=17.7

Q ss_pred             eeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEe
Q psy1623          41 TTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCEL   80 (522)
Q Consensus        41 ~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~L   80 (522)
                      +.+.+.++|+++|..+.  .|..|+|.........+.|.|
T Consensus        13 ~~~~s~~~C~~~C~~~~--~C~~~~~~~~~~~~~~~~C~L   50 (51)
T PF14295_consen   13 VTASSPEECQAACAADP--GCQAFTFNPPGCPSSSGRCYL   50 (51)
T ss_dssp             -----HHHHHHHHHTST--T--EEEEETTEE---------
T ss_pred             ccCCCHHHHHHHccCCC--CCCEEEEECCCcccccccccC
Confidence            47889999999999865  699999986321123677876


No 9  
>smart00223 APPLE APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
Probab=86.00  E-value=1.1  Score=37.65  Aligned_cols=47  Identities=21%  Similarity=0.386  Sum_probs=36.9

Q ss_pred             cccceeeeeeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccC
Q psy1623          33 RRQFIKKSTTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDR   83 (522)
Q Consensus        33 ~~~~~~r~~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~   83 (522)
                      ..|.+-+.+.+.+.++|+++|..+.  .|+.|+|+....  ....|.|-+.
T Consensus        11 f~G~Dl~~~~~~~~~~Cq~~Ct~~~--~C~~FTf~~~~~--~~~~C~LK~s   57 (79)
T smart00223       11 FRGSDINTVYVPSAQVCQKRCTSHP--RCLFFTFSTNEP--PEEKCLLKDS   57 (79)
T ss_pred             ccCceeeeeecCCHHHHHHhhcCCC--CccEEEeeCCCC--CCCEeEeCcC
Confidence            4466777889999999999999875  799999986421  1238999876


No 10 
>PF08276 PAN_2:  PAN-like domain;  InterPro: IPR013227 PAN domains have significant functional versatility fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions []. These domains contain a hair-pin loop like structure, similar to knottins, but the pattern of disulphide bonds differs
Probab=74.69  E-value=5.7  Score=31.77  Aligned_cols=59  Identities=20%  Similarity=0.374  Sum_probs=39.7

Q ss_pred             ccccccccccccccccceeeee-eccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCc
Q psy1623          20 LPISNNIGVRMRMRRQFIKKST-TAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDA   85 (522)
Q Consensus        20 ~~~f~~vg~~~r~~~~~~~r~~-~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~   85 (522)
                      .+.|-++. +.++ +....... ...++++|+.+|+..=  .|..|.|....   +.+.|.|-..+.
T Consensus         4 ~d~F~~l~-~~~~-p~~~~~~~~~~~s~~~C~~~Cl~nC--sC~Ayay~~~~---~~~~C~lW~~~L   63 (66)
T PF08276_consen    4 GDGFLKLP-NMKL-PDFDNAIVDSSVSLEECEKACLSNC--SCTAYAYSNLS---GGGGCLLWYGDL   63 (66)
T ss_pred             CCEEEEEC-CeeC-CCCcceeeecCCCHHHHHhhcCCCC--CEeeEEeeccC---CCCEEEEEcCEe
Confidence            44566665 3333 33333332 6799999999999964  79999997432   358999877654


No 11 
>cd01098 PAN_AP_plant Plant PAN/APPLE-like domain; present in plant S-receptor protein kinases and secreted glycoproteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions. S-receptor protein kinases and S-locus glycoproteins are involved in sporophytic self-incompatibility response in Brassica, one of probably many molecular mechanisms, by which hermaphrodite flowering plants avoid self-fertilization.
Probab=49.36  E-value=18  Score=29.20  Aligned_cols=37  Identities=27%  Similarity=0.522  Sum_probs=29.1

Q ss_pred             eccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccCCc
Q psy1623          42 TAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDRDA   85 (522)
Q Consensus        42 ~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~d~   85 (522)
                      ...++++|+++|+..  =.|.+|.|...     .+.|.+-..+.
T Consensus        30 ~~~s~~~C~~~Cl~n--CsC~a~~~~~~-----~~~C~~~~~~~   66 (84)
T cd01098          30 TAISLEECREACLSN--CSCTAYAYNNG-----SGGCLLWNGLL   66 (84)
T ss_pred             ccCCHHHHHHHHhcC--CCcceeeecCC-----CCeEEEEecee
Confidence            678999999999985  58999999742     58898854443


No 12 
>PF00558 Vpu:  Vpu protein;  InterPro: IPR008187 The Human immunodeficiency virus 1 (HIV-1) Vpu protein acts in the degradation of CD4 in the endoplasmic reticulum and in the enhancement of virion release from the plasma membrane of infected cells [].; GO: 0019076 release of virus from host; PDB: 2JPX_A 1PI8_A 2GOH_A 2GOF_A 1PI7_A 1PJE_A 1VPU_A 2K7Y_A.
Probab=36.22  E-value=24  Score=30.01  Aligned_cols=20  Identities=15%  Similarity=0.321  Sum_probs=9.4

Q ss_pred             HHHHHHHHHHHHHhhhceee
Q psy1623         502 TLFILLTFLIVVTAFAGFYF  521 (522)
Q Consensus       502 ~l~i~~~~~~~~~~~~~~~~  521 (522)
                      +++.+.+++|.+++||.+.|
T Consensus         8 ~iialiv~~iiaIvvW~iv~   27 (81)
T PF00558_consen    8 AIIALIVALIIAIVVWTIVY   27 (81)
T ss_dssp             HHHHHHHHHHHHHHHHHHH-
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            33444444444555665544


No 13 
>PF05454 DAG1:  Dystroglycan (Dystrophin-associated glycoprotein 1);  InterPro: IPR008465 Dystroglycan is one of the dystrophin-associated glycoproteins, which is encoded by a 5.5 kb transcript in Homo sapiens. The protein product is cleaved into two non-covalently associated subunits, [alpha] (N-terminal) and [beta] (C-terminal). In skeletal muscle the dystroglycan complex works as a transmembrane linkage between the extracellular matrix and the cytoskeleton [alpha]-dystroglycan is extracellular and binds to merosin ([alpha]-2 laminin) in the basement membrane, while [beta]-dystroglycan is a transmembrane protein and binds to dystrophin, which is a large rod-like cytoskeletal protein, absent in Duchenne muscular dystrophy patients. Dystrophin binds to intracellular actin cables. In this way, the dystroglycan complex, which links the extracellular matrix to the intracellular actin cables, is thought to provide structural integrity in muscle tissues. The dystroglycan complex is also known to serve as an agrin receptor in muscle, where it may regulate agrin-induced acetylcholine receptor clustering at the neuromuscular junction. There is also evidence which suggests the function of dystroglycan as a part of the signal transduction pathway because it is shown that Grb2, a mediator of the Ras-related signal pathway, can interact with the cytoplasmic domain of dystroglycan. In general, aberrant expression of dystrophin-associated protein complex underlies the pathogenesis of Duchenne muscular dystrophy, Becker muscular dystrophy and severe childhood autosomal recessive muscular dystrophy. Interestingly, no genetic disease has been described for either [alpha]- or [beta]-dystroglycan. Dystroglycan is widely distributed in non-muscle tissues as well as in muscle tissues. During epithelial morphogenesis of kidney, the dystroglycan complex is shown to act as a receptor for the basement membrane. Dystroglycan expression in Mus musculus brain and neural retina has also been reported. However, the physiological role of dystroglycan in non-muscle tissues has remained unclear [].; PDB: 1EG4_P.
Probab=30.98  E-value=16  Score=38.01  Aligned_cols=23  Identities=13%  Similarity=0.369  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHHHhhhceeeC
Q psy1623         499 LLTTLFILLTFLIVVTAFAGFYFK  522 (522)
Q Consensus       499 ~~~~l~i~~~~~~~~~~~~~~~~~  522 (522)
                      +|.+++|+++|| ++.++|++|||
T Consensus       149 ~IpaVVI~~iLL-IA~iIa~icyr  171 (290)
T PF05454_consen  149 FIPAVVIAAILL-IAGIIACICYR  171 (290)
T ss_dssp             ------------------------
T ss_pred             HHHHHHHHHHHH-HHHHHHHHhhh
Confidence            445554444444 56666666765


No 14 
>PF10177 DUF2371:  Uncharacterised conserved protein (DUF2371);  InterPro: IPR018787  This family of proteins with no known function is conserved from nematodes to humans. It includes members of the TMEM200 family of transmembrane proteins. 
Probab=28.34  E-value=48  Score=31.07  Aligned_cols=30  Identities=27%  Similarity=0.315  Sum_probs=23.1

Q ss_pred             cccchhhhHHHHHHHHHHHHHHHHHhhhceeeC
Q psy1623         490 CMSNREYYSLLTTLFILLTFLIVVTAFAGFYFK  522 (522)
Q Consensus       490 C~s~~~~~~~~~~l~i~~~~~~~~~~~~~~~~~  522 (522)
                      |.|.+   ++++.+.+++++.=.+.|+.++|.+
T Consensus        35 l~s~S---g~~l~lG~lvllvGiaMAv~GYwp~   64 (141)
T PF10177_consen   35 LCSPS---GLFLLLGILVLLVGIAMAVLGYWPK   64 (141)
T ss_pred             EecHH---HHHHHHHHHHHHHhhHhheeecccc
Confidence            55664   8888888888888888888888753


No 15 
>PF14814 UB2H:  Bifunctional transglycosylase second domain; PDB: 3FWL_A 3VMA_A.
Probab=27.08  E-value=98  Score=26.02  Aligned_cols=34  Identities=18%  Similarity=0.247  Sum_probs=22.9

Q ss_pred             eccCCCeEEEEEEeEEcCCCCeEEEEEEEEEccC
Q psy1623         357 RSREGDGLEARFNAFKIPESNFLVFEATVRTCRD  390 (522)
Q Consensus       357 ~~~~~~~l~a~f~AFKFpds~~V~f~C~V~~C~~  390 (522)
                      |..+++.+...-.+|.|||...=-....|++=.+
T Consensus        33 y~~~g~~i~i~~R~F~F~Dg~e~~~~~~l~f~~~   66 (85)
T PF14814_consen   33 YSRSGNRIEIYTRGFDFPDGQEPARRVRLTFSGG   66 (85)
T ss_dssp             EEEETTEEEEEE--EEETTCEE--EEEEEEEETT
T ss_pred             EEEECCEEEEEECCCCCCCCCccCEEEEEEECCC
Confidence            3446778999999999999988777777776543


No 16 
>PF15102 TMEM154:  TMEM154 protein family
Probab=25.82  E-value=52  Score=30.96  Aligned_cols=23  Identities=30%  Similarity=0.655  Sum_probs=10.8

Q ss_pred             HHHHHHHHHHHHHHHHhhhceeeC
Q psy1623         499 LLTTLFILLTFLIVVTAFAGFYFK  522 (522)
Q Consensus       499 ~~~~l~i~~~~~~~~~~~~~~~~~  522 (522)
                      .+|-+++|+||||+++++ +.+||
T Consensus        61 IlIP~VLLvlLLl~vV~l-v~~~k   83 (146)
T PF15102_consen   61 ILIPLVLLVLLLLSVVCL-VIYYK   83 (146)
T ss_pred             EeHHHHHHHHHHHHHHHh-eeEEe
Confidence            344455555555555444 44443


No 17 
>PF09680 Tiny_TM_bacill:  Protein of unknown function (Tiny_TM_bacill);  InterPro: IPR010070 This entry represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV.
Probab=24.24  E-value=44  Score=22.02  Aligned_cols=15  Identities=20%  Similarity=0.545  Sum_probs=7.8

Q ss_pred             HHHHHHHHHHHHHhh
Q psy1623         502 TLFILLTFLIVVTAF  516 (522)
Q Consensus       502 ~l~i~~~~~~~~~~~  516 (522)
                      +|++++.|||+++-+
T Consensus         7 alivVLFILLiIvG~   21 (24)
T PF09680_consen    7 ALIVVLFILLIIVGA   21 (24)
T ss_pred             hhHHHHHHHHHHhcc
Confidence            455555555555543


No 18 
>PF01299 Lamp:  Lysosome-associated membrane glycoprotein (Lamp);  InterPro: IPR002000 Lysosome-associated membrane glycoproteins (lamp) [] are integral membrane proteins, specific to lysosomes, and whose exact biological function is not yet clear. Structurally, the lamp proteins consist of two internally homologous lysosome-luminal domains separated by a proline-rich hinge region; at the C-terminal extremity there is a transmembrane region (TM) followed by a very short cytoplasmic tail (C). In each of the duplicated domains, there are two conserved disulphide bonds. This structure is schematically represented in the figure below.   +-----+ +-----+ +-----+ +-----+ | | | | | | | | xCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxxxCxxxxxCxxxxxxxxxxxxCxxxxxCxxxxxxxx +--------------------------++Hinge++--------------------------++TM++C+  In mammals, there are two closely related types of lamp: lamp-1 and lamp-2, which form major components of the lysosome membrane. In chicken lamp-1 is known as LEP100.  Also included in this entry is the macrophage protein CD68 (or macrosialin) [] is a heavily glycosylated integral membrane protein whose structure consists of a mucin-like domain followed by a proline-rich hinge; a single lamp-like domain; a transmembrane region and a short cytoplasmic tail.   Similar to CD68, mammalian lamp-3, which is expressed in lymphoid organs, dendritic cells and in lung, contains all the C-terminal regions but lacks the N-terminal lamp-like region []. In a lamp-family protein from nematodes [] only the part C-terminal to the hinge is conserved. ; GO: 0016020 membrane
Probab=22.90  E-value=78  Score=32.83  Aligned_cols=39  Identities=13%  Similarity=0.005  Sum_probs=20.2

Q ss_pred             CCCCcccccchhhhHHHHHHHHHHHHHHHHHhhhceeeC
Q psy1623         484 AIPDSICMSNREYYSLLTTLFILLTFLIVVTAFAGFYFK  522 (522)
Q Consensus       484 ~~~~~~C~s~~~~~~~~~~l~i~~~~~~~~~~~~~~~~~  522 (522)
                      .+....|-++..-..+-|++++++.+|++++++|-++.|
T Consensus       258 Fg~a~~C~~D~~~~~vPIaVG~~La~lvlivLiaYli~R  296 (306)
T PF01299_consen  258 FGTAEECSSDDTSDLVPIAVGAALAGLVLIVLIAYLIGR  296 (306)
T ss_pred             CCChhcCCcCCccchHHHHHHHHHHHHHHHHHHhheeEe
Confidence            344567877543233334444445555555555655554


No 19 
>PF07502 MANEC:  MANEC domain;  InterPro: IPR011106 The MANSC (motif at N terminus with seven cysteines) domain is a module with a well-conserved seven cysteine motif that is present at the N terminus of higher multicellular animal membrane and extracellular proteins. It is possible that some of the cysteine residues in the MANSC domain form structurally important disulphide bridges. All of the MANSC-containing proteins contain predicted transmembrane regions and signal peptides. It has been proposed that the MANSC domain in HAI-1 might function through binding with hepatocyte growth factor activator and matriptase [].
Probab=22.41  E-value=1.2e+02  Score=26.30  Aligned_cols=41  Identities=22%  Similarity=0.536  Sum_probs=31.7

Q ss_pred             eeccCHHHHHHHhhcCCCeEEeeEEeccCCCCCCCCCeEeccC
Q psy1623          41 TTAASLAACERECAESRDFICRSFNYRAVPYGGQRENCELSDR   83 (522)
Q Consensus        41 ~~v~S~~~C~~~C~~e~~F~CRS~nY~~~~~~~~~~~C~Ls~~   83 (522)
                      ..+.+.++|.++|=....  |-=+-|...+.+.+...|.|.+=
T Consensus        33 ~~v~s~~dC~~aCC~~~~--Cnlav~e~~~~~~~~~~CyLf~C   73 (92)
T PF07502_consen   33 PEVSSAEDCVRACCSTPN--CNLAVFEERPTGSEDNSCYLFNC   73 (92)
T ss_pred             cccCCHHHHHHHhCCCCC--CcEEEEeccCCCCCCceEEEEEc
Confidence            479999999999998875  99999987654432233999876


No 20 
>COG4064 MtrG Tetrahydromethanopterin S-methyltransferase, subunit G [Coenzyme metabolism]
Probab=22.34  E-value=1.1e+02  Score=25.35  Aligned_cols=23  Identities=17%  Similarity=0.377  Sum_probs=16.7

Q ss_pred             chhhhHHHHHHHHHHHHHHHHHh
Q psy1623         493 NREYYSLLTTLFILLTFLIVVTA  515 (522)
Q Consensus       493 ~~~~~~~~~~l~i~~~~~~~~~~  515 (522)
                      ....||++++|+++++..+..++
T Consensus        50 IGILYGlVIGlil~~i~~~l~~~   72 (75)
T COG4064          50 IGILYGLVIGLILCMIYILLGVA   72 (75)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            45679999999877766655544


No 21 
>PF07787 DUF1625:  Protein of unknown function (DUF1625);  InterPro: IPR012430 Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species. The region in question is approximately 250 residues long. 
Probab=21.76  E-value=88  Score=31.46  Aligned_cols=25  Identities=16%  Similarity=0.461  Sum_probs=20.1

Q ss_pred             HHHHHHHHHHHHHHHHHhhhceeeC
Q psy1623         498 SLLTTLFILLTFLIVVTAFAGFYFK  522 (522)
Q Consensus       498 ~~~~~l~i~~~~~~~~~~~~~~~~~  522 (522)
                      ..++++++.+.+.++++++|-++||
T Consensus       223 ~~~~~~~~s~~lsl~~Ia~aW~~yR  247 (248)
T PF07787_consen  223 LFLVAFIISFSLSLLTIALAWLFYR  247 (248)
T ss_pred             HHHHHHHHHHHHHHHHHHHhheeeC
Confidence            3456677888888889999999998


Done!