Query         psy4200
Match_columns 388
No_of_seqs    251 out of 1536
Neff          9.5 
Searched_HMMs 46136
Date          Fri Aug 16 19:25:48 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4200.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4200hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4289|consensus              100.0 8.1E-60 1.8E-64  468.1  35.5  364    1-387   364-937 (2531)
  2 KOG4289|consensus              100.0 3.2E-57   7E-62  449.7  33.5  360    1-382   259-826 (2531)
  3 KOG1219|consensus              100.0 1.6E-53 3.5E-58  435.2  40.7  360    1-381  2168-2722(4289)
  4 KOG1219|consensus              100.0 5.8E-52 1.3E-56  424.0  39.9  366    1-386   735-1208(4289)
  5 cd00031 CA Cadherin repeat dom 100.0 1.2E-28 2.5E-33  215.5  28.1  190   16-266     2-199 (199)
  6 cd00031 CA Cadherin repeat dom  99.9 3.5E-21 7.5E-26  168.1  23.0  129  164-304     1-130 (199)
  7 PF00028 Cadherin:  Cadherin do  99.7 7.9E-16 1.7E-20  117.1  13.9   92  165-265     1-93  (93)
  8 smart00112 CA Cadherin repeats  99.6 2.1E-14 4.6E-19  105.6  10.5   78  185-272     1-79  (79)
  9 KOG1834|consensus               99.5 4.1E-13 8.8E-18  128.0  18.9  201    5-267    27-245 (952)
 10 PF00028 Cadherin:  Cadherin do  99.5 1.7E-13 3.6E-18  104.2  11.8   70   16-88      1-76  (93)
 11 KOG1834|consensus               99.3 2.5E-11 5.4E-16  116.0  15.5  149  147-304    20-176 (952)
 12 smart00112 CA Cadherin repeats  99.3   1E-11 2.2E-16   91.2   9.9   69   41-158     8-79  (79)
 13 PF08266 Cadherin_2:  Cadherin-  97.7 4.5E-05 9.7E-10   55.9   3.8   66  165-231     3-71  (84)
 14 PF08266 Cadherin_2:  Cadherin-  97.6 4.8E-05   1E-09   55.8   3.0   59   19-78      5-71  (84)
 15 PF08758 Cadherin_pro:  Cadheri  97.4 0.00037   8E-09   51.8   5.7   76    7-88      2-78  (90)
 16 PF08758 Cadherin_pro:  Cadheri  96.0   0.091   2E-06   39.0   9.0   77  156-243     2-78  (90)
 17 smart00736 CADG Dystroglycan-t  94.2     1.1 2.4E-05   33.8  10.6   69  184-269    24-96  (97)
 18 smart00736 CADG Dystroglycan-t  93.6     1.4   3E-05   33.2  10.2   49   35-87     23-77  (97)
 19 TIGR01965 VCBS_repeat VCBS rep  93.0     1.3 2.9E-05   33.4   9.0   39  130-173    60-98  (99)
 20 TIGR01965 VCBS_repeat VCBS rep  92.2     1.1 2.3E-05   33.9   7.5   87  180-287     2-98  (99)
 21 PF07495 Y_Y_Y:  Y_Y_Y domain;   82.2     7.7 0.00017   26.4   6.5   45  191-243     6-50  (66)
 22 TIGR00845 caca sodium/calcium   78.2   1E+02  0.0023   33.2  18.2   47  153-201   395-441 (928)
 23 TIGR03660 T1SS_rpt_143 T1SS-14  74.9      43 0.00093   27.0   9.9   60  215-287    69-128 (137)
 24 PF05345 He_PIG:  Putative Ig d  68.0      25 0.00055   22.6   5.5   34  204-242    13-46  (49)
 25 TIGR00845 caca sodium/calcium   56.6 2.9E+02  0.0063   30.0  22.2   59    3-66    394-460 (928)
 26 KOG3597|consensus               49.1      38 0.00083   33.0   5.6   62  319-383    24-85  (442)
 27 PF03160 Calx-beta:  Calx-beta   47.4      92   0.002   23.1   6.6   53  324-382     2-54  (100)
 28 KOG3597|consensus               33.7   1E+02  0.0022   30.3   5.8   59  142-201    24-82  (442)
 29 PF05895 DUF859:  Siphovirus pr  26.6 7.4E+02   0.016   25.7  13.3  102  130-243   301-423 (624)
 30 PF09100 Qn_am_d_aIV:  Quinohem  26.1 1.2E+02  0.0026   24.0   3.9   30  305-335   103-132 (133)
 31 cd02848 Chitinase_N_term Chiti  20.4 1.2E+02  0.0027   23.1   3.0   28  231-266    79-106 (106)

No 1  
>KOG4289|consensus
Probab=100.00  E-value=8.1e-60  Score=468.14  Aligned_cols=364  Identities=29%  Similarity=0.453  Sum_probs=314.6

Q ss_pred             CCccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC---CCeEEEEEecC---CcEEEeCCeeEEEEcccCCCccc
Q psy4200           1 MEAYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE---GSKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYN   74 (388)
Q Consensus         1 ~~d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D---~~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~   74 (388)
                      |+|.|||+|.|.++.|. +.|.|+..++++|++|+|+|.|   |+.++|+|.++   +.|.||..||+|.+..+||+|..
T Consensus       364 V~D~NDNaPqFse~~Yv-vqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~Sgn~~G~f~id~~tGel~vv~plD~e~~  442 (2531)
T KOG4289|consen  364 VEDENDNAPQFSEKRYV-VQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIASGNGRGQFYIDSLTGELDVVEPLDFENS  442 (2531)
T ss_pred             EEecCCCCccccccceE-EEecccCCCCceEEEEEecccCCCcCceEEEEeeccCccccEEEecccceEEEeccccccCC
Confidence            68999999999999999 9999999999999999999998   79999999974   48999999999999999999988


Q ss_pred             ceeeEEEEEEEEec--C---------------------------------------------------------------
Q psy4200          75 STNTSTIVLTLEGV--D---------------------------------------------------------------   89 (388)
Q Consensus        75 ~~~~~~l~v~a~D~--~---------------------------------------------------------------   89 (388)
                       .|+  +.|+|+|.  |                                                               
T Consensus       443 -~yt--l~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~lg~~v~~vqaidadsg~na~l~y~laG  519 (2531)
T KOG4289|consen  443 -EYT--LRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPLGYLVCHVQAIDADSGENARLHYSLAG  519 (2531)
T ss_pred             -eeE--EEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccccceEEEEecccCCCCcccceeeeecc
Confidence             899  88888882  2                                                               


Q ss_pred             ----------------------------------------------------------CCCccceeee-------eccee
Q psy4200          90 ----------------------------------------------------------PEGSKVKYGI-------YGTDW  104 (388)
Q Consensus        90 ----------------------------------------------------------P~f~~~~~~~-------~g~~v  104 (388)
                                                                                |.|++..|..       .|+.|
T Consensus       520 ~~pf~I~~~SG~Itvtk~ldrEt~~~ysl~V~ard~gtp~l~tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI  599 (2531)
T KOG4289|consen  520 VGPFQINNGSGWITVTKELDRETVEHYSLGVEARDHGTPPLSTSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSI  599 (2531)
T ss_pred             CCCeeEecCCceEEEeecccccccceEEEEEEEcCCCCCcccccceEEEEecccCCCCCccccCceEEEecCCccccceE
Confidence                                                                      5565555554       45555


Q ss_pred             EE---ecCCccE-E--EE----------------------cCCCCccccceEEEEEEEeeCCCceeEEEEEEEEeecCCC
Q psy4200         105 FS---LDRDSGE-L--RV----------------------AQPLYREDGHHTSKTNVDIRDGHHTSKTNVDIRVGDVQNT  156 (388)
Q Consensus       105 ~~---~D~d~g~-~--~~----------------------~~~~~~~~~~~~~~~~v~a~d~~~~~~~~v~V~V~dvNd~  156 (388)
                      ++   +|.|... +  .+                      ..++++.. ..+..+.|+|+||++.+.+.|.|.|.|.|.+
T Consensus       600 ~tvtAvD~d~~s~ityqi~g~ntrn~Fsi~si~g~Glitlalp~dkKq-e~~~vl~vtAtDg~l~d~~~V~v~I~danTh  678 (2531)
T KOG4289|consen  600 VTVTAVDRDANSVITYQITGGNTRNRFSISSIGGGGLITLALPLDKKQ-ERQYVLAVTATDGTLQDTCSVNVNITDANTH  678 (2531)
T ss_pred             EEEEEeccccccceEEEecCCcccccceeeccCCcceEEeecchhhcc-cceEEEEEEecCCccccceEEEEEeeecccC
Confidence            54   4444311 0  00                      00011000 1112677899999999999999999999999


Q ss_pred             CCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCCCCceEEeCCeeEEEecccCCccccCCcccEEEE
Q psy4200         157 PPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNPDEFFLIDSNTGELKTAKPLDREILGGTNGVISL  236 (388)
Q Consensus       157 ~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l  236 (388)
                      .|.|...+|.++|.|..|.|+.|..+.|+|.|.|+|++|+|-+.   +..|+||+.+|.+++...||||.    +-.|++
T Consensus       679 rpvFqs~pfTvsI~e~rP~G~tvvtlsasd~D~geNARI~y~le---d~~Frid~dsg~i~t~~~ld~ed----qvtytl  751 (2531)
T KOG4289|consen  679 RPVFQSSPFTVSINEDRPLGTTVVTLSASDEDTGENARITYILE---DEAFRIDPDSGAIYTQAELDYED----QVTYTL  751 (2531)
T ss_pred             CcccccCCeeEeeccCCcCCceeEEEecccCCCCccceEEEEec---ccceeecCCCCceEEeeeeeccc----ceeeEe
Confidence            99999999999999999999999999999999999999999443   24599999999999999999999    668999


Q ss_pred             EEEEEEccCCCCcCCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC------------
Q psy4200         237 TVRAREMVDGKPLQEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD------------  304 (388)
Q Consensus       237 ~v~a~D~~~g~~~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D------------  304 (388)
                      .++|+|  +|.|    +..++.+|.|.|.|+|||+|+|..+.|.++|.|++|++|.+.  +++|+|+|            
T Consensus       752 ~itA~D--~~~p----q~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv~Tsvl--QVSatDaD~g~Ng~v~y~~q  823 (2531)
T KOG4289|consen  752 AITARD--NGIP----QKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPVFTSVL--QVSATDADSGPNGRVYYTFQ  823 (2531)
T ss_pred             eeeecC--CCCC----CcCccEEEEEEeecccccCcccchhhceeEeecCCCCcceEE--EEEEeccCCCCCceEEEEec
Confidence            999998  7765    678999999999999999999999999999999999999995  99999999            


Q ss_pred             ----------------------------------eEEEEecCCCceeEEEEEEEEEEEeCCCCCCeeeccceEEEEecCC
Q psy4200         305 ----------------------------------LVIAEETHTAEKLSSSATLIVQVTDVNDNVPSFELNAYTGNVLETA  350 (388)
Q Consensus       305 ----------------------------------~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~  350 (388)
                                                        .+.|+| .|.|++++.+.|+|+|+|+|||||+|.+.+|...|.||.
T Consensus       824 g~~d~p~~F~IEptSGviRtl~rLdRE~~avy~L~a~avD-rg~p~ls~~~eItvtvldvNDnaPvfe~~e~e~~I~ens  902 (2531)
T KOG4289|consen  824 GGDDGPGDFYIEPTSGVIRTLRRLDRENVAVYVLAAYAVD-RGNPPLSAPVEITVTVLDVNDNAPVFEQDELELFIEENS  902 (2531)
T ss_pred             CCCCCCCceEEccCcceeehhhhhcchheeEEEEEEEEee-CCCCCcCCceEEEEEEEecCCCCCCCCCcceeeEEeecC
Confidence                                              234455 688999999999999999999999999999999999999


Q ss_pred             CCCcEEEEEEEEcCCCCCCCceeEEEEEEcCceeEEe
Q psy4200         351 QAGTSITTITALDSDGGDYGTGGIVYELLGEYGIMYV  387 (388)
Q Consensus       351 ~~g~~v~~v~a~D~D~~~~~~~~i~ysi~~~~~~~~~  387 (388)
                      +.|+.+++|.|.|+|+|+|+.  |.|+|+++...-+|
T Consensus       903 pvgs~va~i~a~dpdEG~NA~--IsYqIvgg~d~~~f  937 (2531)
T KOG4289|consen  903 PVGSVVALITADDPDEGPNAH--ISYQIVGGNDPELF  937 (2531)
T ss_pred             ccceeeEEEEccCCCcCCcce--EEEeeccCccHHHH
Confidence            999999999999999999988  99999977655444


No 2  
>KOG4289|consensus
Probab=100.00  E-value=3.2e-57  Score=449.73  Aligned_cols=360  Identities=30%  Similarity=0.483  Sum_probs=319.4

Q ss_pred             CCccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC---CCeEEEEEecC---CcEEEeCCeeEEEEcccCCCccc
Q psy4200           1 MEAYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE---GSKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYN   74 (388)
Q Consensus         1 ~~d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D---~~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~   74 (388)
                      |.|.|||.|+|.++.|. -++.||.++|+.|.+|+|+|.|   |+.|+|++.++   +.|.||+.+|.|+++.+||||+.
T Consensus       259 V~D~nDhsPvFEq~~Y~-e~lREn~evGy~vLtvrAtD~Dsp~Nani~Yrl~eg~~~~~f~in~rSGvI~T~a~lDRE~~  337 (2531)
T KOG4289|consen  259 VLDTNDHSPVFEQDEYR-EELRENLEVGYEVLTVRATDGDSPPNANIRYRLLEGNAKNVFEINPRSGVISTRAPLDREEL  337 (2531)
T ss_pred             EeecCCCCcccchhHHH-HHHhhccccCceEEEEEeccCCCCCCCceEEEecCCCccceeEEcCccceeeccCccCHHhh
Confidence            56999999999999999 9999999999999999999998   79999999975   57999999999999999999999


Q ss_pred             ceeeEEEEEEEEecC---------------------CCCccceeee-------ecceeEE--------------------
Q psy4200          75 STNTSTIVLTLEGVD---------------------PEGSKVKYGI-------YGTDWFS--------------------  106 (388)
Q Consensus        75 ~~~~~~l~v~a~D~~---------------------P~f~~~~~~~-------~g~~v~~--------------------  106 (388)
                      ..|.  |.|.|.|.+                     |.|++..|..       +++.+.+                    
T Consensus       338 ~~y~--L~VeAsDqG~~pgp~Ta~V~itV~D~NDNaPqFse~~Yvvqv~Edvt~~avvlrV~AtDrD~g~Ng~VHYsi~S  415 (2531)
T KOG4289|consen  338 ESYQ--LDVEASDQGRPPGPRTAMVEITVEDENDNAPQFSEKRYVVQVREDVTPPAVVLRVTATDRDKGTNGKVHYSIAS  415 (2531)
T ss_pred             hheE--EEEEeccCCCCCCCceEEEEEEEEecCCCCccccccceEEEecccCCCCceEEEEEecccCCCcCceEEEEeec
Confidence            9999  777777721                     9999888876       4444444                    


Q ss_pred             --------ecCCccEEEEcCCCCccccceEEEEEEEeeCCC---ceeEEEEEEEEeecCCCCCeeeCCcceEEEecCCCC
Q psy4200         107 --------LDRDSGELRVAQPLYREDGHHTSKTNVDIRDGH---HTSKTNVDIRVGDVQNTPPIFINSSFSGEIMESAPI  175 (388)
Q Consensus       107 --------~D~d~g~~~~~~~~~~~~~~~~~~~~v~a~d~~---~~~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~~~  175 (388)
                              +|..+|+|.+..++|.|..  ...+.++|.||+   ++...-+.|.|+|+|||+|.|....+..+|.|+.+.
T Consensus       416 gn~~G~f~id~~tGel~vv~plD~e~~--~ytl~IrAqDggrPpLsn~sgl~iqVlDINDhaPifvstpfq~tvlEnv~l  493 (2531)
T KOG4289|consen  416 GNGRGQFYIDSLTGELDVVEPLDFENS--EYTLRIRAQDGGRPPLSNTSGLVIQVLDINDHAPIFVSTPFQATVLENVPL  493 (2531)
T ss_pred             cCccccEEEecccceEEEeccccccCC--eeEEEEEcccCCCCCccCCCceEEEEEecCCCCceeEechhhhhhhhcccc
Confidence                    4444455566777777776  446889999986   456667779999999999999999999999999999


Q ss_pred             CcEEEEEEEeeCCCCCCceEEEEEecCCCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCce
Q psy4200         176 GSVVLRVEAKDGDLAQPRSIYYDLLTNPDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQAT  255 (388)
Q Consensus       176 g~~v~~v~A~D~D~~~~~~v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~  255 (388)
                      |..++.+.|.|+|.|.|+.+.|++.+  -+.|.|+..+|.|++.+.||||+    ...|.|.|.|+|  .|.|    +++
T Consensus       494 g~~v~~vqaidadsg~na~l~y~laG--~~pf~I~~~SG~Itvtk~ldrEt----~~~ysl~V~ard--~gtp----~l~  561 (2531)
T KOG4289|consen  494 GYLVCHVQAIDADSGENARLHYSLAG--VGPFQINNGSGWITVTKELDRET----VEHYSLGVEARD--HGTP----PLS  561 (2531)
T ss_pred             cceEEEEecccCCCCcccceeeeecc--CCCeeEecCCceEEEeecccccc----cceEEEEEEEcC--CCCC----ccc
Confidence            99999999999999999999999975  35899999999999999999999    458999999998  5665    688


Q ss_pred             EEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC-------------------------------
Q psy4200         256 AFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD-------------------------------  304 (388)
Q Consensus       256 ~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D-------------------------------  304 (388)
                      +.+.|.|.+.|+|||.|+|++.+|+..+.|+.+.|+.|.  +++|+|.|                               
T Consensus       562 tstsI~Vtv~dvndndP~Ft~~eytl~inED~pvgsSI~--tvtAvD~d~~s~ityqi~g~ntrn~Fsi~si~g~Glitl  639 (2531)
T KOG4289|consen  562 TSTSISVTVLDVNDNDPTFTQKEYTLRINEDAPVGSSIV--TVTAVDRDANSVITYQITGGNTRNRFSISSIGGGGLITL  639 (2531)
T ss_pred             ccceEEEEecccCCCCCccccCceEEEecCCccccceEE--EEEEeccccccceEEEecCCcccccceeeccCCcceEEe
Confidence            899999999999999999999999999999999999995  99999999                               


Q ss_pred             --------------------------------------------------------------------------------
Q psy4200         305 --------------------------------------------------------------------------------  304 (388)
Q Consensus       305 --------------------------------------------------------------------------------  304 (388)
                                                                                                      
T Consensus       640 alp~dkKqe~~~vl~vtAtDg~l~d~~~V~v~I~danThrpvFqs~pfTvsI~e~rP~G~tvvtlsasd~D~geNARI~y  719 (2531)
T KOG4289|consen  640 ALPLDKKQERQYVLAVTATDGTLQDTCSVNVNITDANTHRPVFQSSPFTVSINEDRPLGTTVVTLSASDEDTGENARITY  719 (2531)
T ss_pred             ecchhhcccceEEEEEEecCCccccceEEEEEeeecccCCcccccCCeeEeeccCCcCCceeEEEecccCCCCccceEEE
Confidence                                                                                            


Q ss_pred             --------------------------------eEEEEecCCCceeEEEEEEEEEEEeCCCCCCeeeccceEEEEecCCCC
Q psy4200         305 --------------------------------LVIAEETHTAEKLSSSATLIVQVTDVNDNVPSFELNAYTGNVLETAQA  352 (388)
Q Consensus       305 --------------------------------~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~~~  352 (388)
                                                      .++|.| .+.|+...+++|.|.|.|+|||+|+|..+.|.++|.|++++
T Consensus       720 ~led~~Frid~dsg~i~t~~~ld~edqvtytl~itA~D-~~~pq~adtttveV~v~diNDnaPqf~assyt~sV~Ed~Pv  798 (2531)
T KOG4289|consen  720 ILEDEAFRIDPDSGAIYTQAELDYEDQVTYTLAITARD-NGIPQKADTTTVEVLVNDINDNAPQFLASSYTGSVFEDAPV  798 (2531)
T ss_pred             EecccceeecCCCCceEEeeeeecccceeeEeeeeecC-CCCCCcCccEEEEEEeecccccCcccchhhceeEeecCCCC
Confidence                                            123333 57788999999999999999999999999999999999999


Q ss_pred             CcEEEEEEEEcCCCCCCCceeEEEEEEcCc
Q psy4200         353 GTSITTITALDSDGGDYGTGGIVYELLGEY  382 (388)
Q Consensus       353 g~~v~~v~a~D~D~~~~~~~~i~ysi~~~~  382 (388)
                      +|.|++|.|+|+|.|.||+  +-|.+.|+.
T Consensus       799 ~TsvlQVSatDaD~g~Ng~--v~y~~qg~~  826 (2531)
T KOG4289|consen  799 FTSVLQVSATDADSGPNGR--VYYTFQGGD  826 (2531)
T ss_pred             cceEEEEEEeccCCCCCce--EEEEecCCC
Confidence            9999999999999999997  888887443


No 3  
>KOG1219|consensus
Probab=100.00  E-value=1.6e-53  Score=435.24  Aligned_cols=360  Identities=28%  Similarity=0.406  Sum_probs=303.5

Q ss_pred             CCccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC-CCeEEEEEec----CCcEEEeCCeeEEEEcccCCCcccc
Q psy4200           1 MEAYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE-GSKVKYGIYG----TDRFSLDRDSGELRVAQPLDREYNS   75 (388)
Q Consensus         1 ~~d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D-~~~i~y~i~~----~~~F~Id~~tG~i~~~~~lD~e~~~   75 (388)
                      |+|+|||||+|.+..|. ++++|++++|+.|.++.|+|.| |..|.|+|.+    ...|.|+..||.|++.+.||||+..
T Consensus      2168 V~dIndn~PvFeqlsYt-~sisE~s~igt~viqilATdsDsn~~isYsl~g~s~~sk~f~In~sTG~it~~g~ldyE~~q 2246 (4289)
T KOG1219|consen 2168 VGDINDNPPVFEQLSYT-ISISENSKIGTKVIQILATDSDSNREISYSLEGNSEISKPFRINVSTGWITVAGKLDYEENQ 2246 (4289)
T ss_pred             ecccCCCCchhheeeEE-EEccCCCccCceEEEEEeccCCCCCceEEEeecCCccccceEEecccceEEEeeecChhhcc
Confidence            68999999999999999 9999999999999999999999 8999999998    4589999999999999999999999


Q ss_pred             eeeEEEEEEEEec--------------------CCCCccceeee-------ecceeEE---ecCCc--------------
Q psy4200          76 TNTSTIVLTLEGV--------------------DPEGSKVKYGI-------YGTDWFS---LDRDS--------------  111 (388)
Q Consensus        76 ~~~~~l~v~a~D~--------------------~P~f~~~~~~~-------~g~~v~~---~D~d~--------------  111 (388)
                      .|.  +.|+|.|.                    +|.|.+..|+.       -|..+..   .|+|+              
T Consensus      2247 ~f~--~fvratdggk~lSseviv~V~VeD~Ndn~Pef~q~~~ea~vsd~a~~g~fit~v~a~D~Dssd~lk~ey~~~~~l 2324 (4289)
T KOG1219|consen 2247 EFR--FFVRATDGGKPLSSEVIVEVHVEDFNDNPPEFNQRNYEAFVSDPARSGHFITVVNAHDLDSSDHLKLEYNSNHFL 2324 (4289)
T ss_pred             eEE--EEEEEccCCCcccccEEEEEEehhcCCCCchhccccceeecCCCccceeEEEEEEeccCCccchhhhhhccccee
Confidence            998  77777762                    28888877775       1222222   33332              


Q ss_pred             -----cEE------------------------------------------------------------------------
Q psy4200         112 -----GEL------------------------------------------------------------------------  114 (388)
Q Consensus       112 -----g~~------------------------------------------------------------------------  114 (388)
                           |.+                                                                        
T Consensus      2325 ~~s~~G~iTlfNl~k~~l~~s~~lrv~vsD~v~~at~~vl~~~~~~n~~~~lveka~l~Tv~~~~~~~~~~f~~~gt~~~ 2404 (4289)
T KOG1219|consen 2325 ILSENGIITLFNLLKSPLQTSYPLRVTVSDGVFRATMEVLFHPHSRNHFSELVEKADLVTVVEHDEQEDADFGAYGTSIY 2404 (4289)
T ss_pred             eeccCceEEehhhcccccccccceeeeeccCcceeeeEEEEEecCcccchhhhhccceeEEEEecCccccccccCCceee
Confidence                 111                                                                        


Q ss_pred             -------------------EEcCCCCcccc-ceEEEEEEEeeCC-CceeEEEEEEEEeecCCCCCeeeCCcceEEEecCC
Q psy4200         115 -------------------RVAQPLYREDG-HHTSKTNVDIRDG-HHTSKTNVDIRVGDVQNTPPIFINSSFSGEIMESA  173 (388)
Q Consensus       115 -------------------~~~~~~~~~~~-~~~~~~~v~a~d~-~~~~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~  173 (388)
                                         ...+.+++|.- .+...+.+.|.|+ ++.+.++++|.++|+|||+|.|....|+.+|.|++
T Consensus      2405 ~si~s~~sd~~~in~~GqI~t~~kld~e~s~~~vi~i~v~a~Da~gr~af~tvti~ltDiNDnpPqF~a~~Y~~nI~ena 2484 (4289)
T KOG1219|consen 2405 YSINSRASDHFEINKSGQIKTLSKLDREYSEELVIIIAVMAFDAGGRVAFCTVTIILTDINDNPPQFDAQLYRVNITENA 2484 (4289)
T ss_pred             eeechhccCceeECCCccEEeeehhhhccCceEEEEEEEEEecCCCeEEEEEEEEEEEecCCCCccccceeEEEEeeccc
Confidence                               01111111110 0111455667785 68899999999999999999999999999999999


Q ss_pred             CCCcEEEEEEEeeCCCCCCceEEEEEecC--CCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCC
Q psy4200         174 PIGSVVLRVEAKDGDLAQPRSIYYDLLTN--PDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQE  251 (388)
Q Consensus       174 ~~g~~v~~v~A~D~D~~~~~~v~y~l~~~--~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~  251 (388)
                      .-|..|+++.|+|.|.+.|+.++|.+.+.  -..-|.|++ +|.|.+++.|+.+.    +..|.|.|+|.|  +|.|   
T Consensus      2485 skg~~V~~v~A~D~De~snadvty~i~~e~~~~~v~~in~-sG~Itv~~sL~~~e----n~tl~l~vkA~D--~g~P--- 2554 (4289)
T KOG1219|consen 2485 SKGKLVGHVIARDADEGSNADVTYEIVGESDVKHVFEINE-SGVITVKRSLDGLE----NSTLHLFVKAID--DGKP--- 2554 (4289)
T ss_pred             CCCceEEEEEEecCCCCCcccEEEEecCchhhhheeeecC-CceEEeehhhhccc----CcEEEEEEEecc--CCCC---
Confidence            99999999999999999999999999864  345678887 99999999999998    778999999998  7776   


Q ss_pred             CCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC---------------------------
Q psy4200         252 DQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD---------------------------  304 (388)
Q Consensus       252 ~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D---------------------------  304 (388)
                       ++.+..+|.|+|.+..++.|.|..+.|.++|+|+.+.|..|+  ++.|.|.|                           
T Consensus      2555 -~~~s~ttV~v~vl~e~v~lPrFSep~y~fsvpEDv~vG~~Ig--~v~a~~a~~~~i~~~v~~gt~Esn~d~~Fsvdr~T 2631 (4289)
T KOG1219|consen 2555 -RRRSNTTVIVTVLPEDVNLPRFSEPIYTFSVPEDVPVGEEIG--QVSASDADEHVIYSLVLGGTPESNPDLPFSVDRNT 2631 (4289)
T ss_pred             -CcccceEEEEEecCcccCcccccCceEEEeccccCCCCCeee--EEeecccCCceEEEEEeCCCCCCCCCCceEEcCCC
Confidence             778899999999999999999999999999999999999998  88998887                           


Q ss_pred             -------------------eEEEEecCCCceeEEEEEEEEEEEeCCCCCCeeeccceEEEEecCCCCCcEEEEEEEEcCC
Q psy4200         305 -------------------LVIAEETHTAEKLSSSATLIVQVTDVNDNVPSFELNAYTGNVLETAQAGTSITTITALDSD  365 (388)
Q Consensus       305 -------------------~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~~~g~~v~~v~a~D~D  365 (388)
                                         .+.|.. .  ...-+.+.|.|.|.|+|||+|.|..+.|.+.+.||++.|+.|++++|.|.|
T Consensus      2632 G~i~v~ksLD~E~kk~yqi~v~a~~-~--~~vva~tsv~vqVkDvNDNaPvFe~d~y~f~i~En~pvGtsV~qf~AsD~D 2708 (4289)
T KOG1219|consen 2632 GMIKVNKSLDHEKKKSYQIKVKATC-G--QWVVAETSVFVQVKDVNDNAPVFEKDPYLFIIEENSPVGTSVIQFHASDMD 2708 (4289)
T ss_pred             ceEEeccccchhhhceEEEEEEeec-C--CceEEEEEEEEEeecccCCCccccCCceeEEEeccCCCCceEEEEEeeccC
Confidence                               122222 1  114678899999999999999999999999999999999999999999999


Q ss_pred             CCCCCceeEEEEEEcC
Q psy4200         366 GGDYGTGGIVYELLGE  381 (388)
Q Consensus       366 ~~~~~~~~i~ysi~~~  381 (388)
                      .+.+|+  |+|||...
T Consensus      2709 s~~nGq--irysl~~~ 2722 (4289)
T KOG1219|consen 2709 SGNNGQ--IRYSLTSP 2722 (4289)
T ss_pred             CCCCce--EEEEEcCC
Confidence            999999  99999976


No 4  
>KOG1219|consensus
Probab=100.00  E-value=5.8e-52  Score=423.98  Aligned_cols=366  Identities=29%  Similarity=0.458  Sum_probs=318.7

Q ss_pred             CCccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC---CCeEEEEEec-CCcEEEeCCeeEEEEcccCCCcccce
Q psy4200           1 MEAYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE---GSKVKYGIYG-TDRFSLDRDSGELRVAQPLDREYNST   76 (388)
Q Consensus         1 ~~d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D---~~~i~y~i~~-~~~F~Id~~tG~i~~~~~lD~e~~~~   76 (388)
                      |+|.|||+|+|.+..|. ++|.|+..+|+.|++|.|+|.|   ||.++|+|.. .+.|+||+.||.|.+.++||||....
T Consensus       735 vkd~ndn~p~f~e~sy~-vtvsedtepgs~Ia~vetnd~D~g~NG~v~fsL~n~sdvfsIdp~tGivv~~~sLdrE~q~~  813 (4289)
T KOG1219|consen  735 VKDYNDNTPIFVERSYH-VTVSEDTEPGSFIAHVETNDTDGGNNGMVSFSLLNKSDVFSIDPFTGIVVTSKSLDREGQTS  813 (4289)
T ss_pred             EEecccCCccccccceE-EEEecCCCCCceEEEEEecccCCCCCceEEEEecCCcceEEecCcccEEEeccccCcccCce
Confidence            57999999999999999 9999999999999999999998   6999999997 78999999999999999999999999


Q ss_pred             eeEEEEEEEEecC---------------------CCCccceeee-------ecceeEE---ecCCccE------------
Q psy4200          77 NTSTIVLTLEGVD---------------------PEGSKVKYGI-------YGTDWFS---LDRDSGE------------  113 (388)
Q Consensus        77 ~~~~l~v~a~D~~---------------------P~f~~~~~~~-------~g~~v~~---~D~d~g~------------  113 (388)
                      |.  |.|.|.|.|                     |.|-...+..       .|+.+..   .|+|-|.            
T Consensus       814 y~--l~I~a~dqp~pq~~svv~l~vsvedVndnpPkci~~hsr~kipedlp~gt~~~~l~A~d~diGq~~kvry~l~~~~  891 (4289)
T KOG1219|consen  814 YH--LKIEARDQPPPQLFSVVELDVSVEDVNDNPPKCIIRHSRSKIPEDLPYGTVTWQLVALDPDIGQLGKVRYYLTDDT  891 (4289)
T ss_pred             eE--EEEEEcCCCCCceEEEEEEEEEEeeccCCCCccccccccccCcccCCCceEEEEhhhcCcccCcCceeEEEEecCc
Confidence            99  888888744                     3333222222       4555554   5666542            


Q ss_pred             -----------EEEcCCCCccccceEEEEEEEeeCCC---ceeEEEEEEEEeecCCC--CCeeeCCcceEEEecCCCCCc
Q psy4200         114 -----------LRVAQPLYREDGHHTSKTNVDIRDGH---HTSKTNVDIRVGDVQNT--PPIFINSSFSGEIMESAPIGS  177 (388)
Q Consensus       114 -----------~~~~~~~~~~~~~~~~~~~v~a~d~~---~~~~~~v~V~V~dvNd~--~P~f~~~~~~~~v~E~~~~g~  177 (388)
                                 +.+.+++|++...+. .|.|+|.|++   +++.+.+.|.++|+|.|  ||.|..-.-.++|.||+|.|+
T Consensus       892 v~~rvd~~sGavfi~~~LDf~k~~fy-nLsv~a~d~g~p~lss~chl~Vevldv~enlhpp~F~~~v~e~~V~EnapiGT  970 (4289)
T KOG1219|consen  892 VGERVDFPSGAVFIGKPLDFEKSDFY-NLSVTAVDRGTPILSSICHLEVEVLDVNENLHPPEFISFVTEGHVLENAPIGT  970 (4289)
T ss_pred             cccccccccccEEEecccccccccce-EEEEEEecCCCcceeeeEEEEEEEeccCCCCCCcchheeeeeeeEeecCCcce
Confidence                       345566666654443 7899999976   56788999999999876  999998888899999999999


Q ss_pred             EEEEEEEeeCCCCCCceEEEEEec-CCCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceE
Q psy4200         178 VVLRVEAKDGDLAQPRSIYYDLLT-NPDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATA  256 (388)
Q Consensus       178 ~v~~v~A~D~D~~~~~~v~y~l~~-~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~  256 (388)
                      .++++.|.|-|.|..+.++|+|.. +..+.|+||..+|.|++.+.||||.    +..|.|+|.|.|  .|.+    ++++
T Consensus       971 ~vi~i~A~dedsgldg~l~Y~I~~gdg~g~FsId~~tG~irTl~~lDrE~----ks~YwltveA~D--~gt~----~~ss 1040 (4289)
T KOG1219|consen  971 IVIRIQARDEDSGLDGELSYKIRTGDGDGIFSIDSTTGSIRTLKALDREK----KSSYWLTVEAKD--LGTV----PLSS 1040 (4289)
T ss_pred             EEEEEEEecCCCCccceEEEEEEcCCcceeEEecCCcceEeechhhchhh----cceEEEEEEEEe--cCCC----cccc
Confidence            999999999999999999999996 5678999999999999999999999    779999999998  5654    6888


Q ss_pred             EEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC--------------------------------
Q psy4200         257 FAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD--------------------------------  304 (388)
Q Consensus       257 ~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D--------------------------------  304 (388)
                      .+.+.|.|+|+|||+|+|.+..|..+|.|+++.+..|  +++.|+|+|                                
T Consensus      1041 v~~vyI~ieDvNDn~Pq~s~pvy~asI~enSp~~vsi--vq~ea~D~Dsssn~kLmykI~sGnyq~FF~Id~~TG~iTt~ 1118 (4289)
T KOG1219|consen 1041 VCEVYIEIEDVNDNVPQFSSPVYYASISENSPETVSI--VQAEANDPDSSSNQKLMYKITSGNYQGFFQIDPETGLITTI 1118 (4289)
T ss_pred             ceeEEEEEEecCCCCcccCCceEeeeeccCCCCceEE--EEeccCCCCcccCcceEEEEccCCccceEEEccccceeeee
Confidence            9999999999999999999999999999999999999  599999999                                


Q ss_pred             ------------eEEEEecCCCceeEEEEEEEEEEEeCCCCCCeeeccceEEEEecCCCCCcEEEEEEEEcCCCCCCCce
Q psy4200         305 ------------LVIAEETHTAEKLSSSATLIVQVTDVNDNVPSFELNAYTGNVLETAQAGTSITTITALDSDGGDYGTG  372 (388)
Q Consensus       305 ------------~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~  372 (388)
                                  |-+...+.|.|.+.+.+.|.|.|+|+|||+|+|.+..|...++|...+  .+.++.|.|.|.|.|++ 
T Consensus      1119 r~LDRE~qdEHiLeVTi~D~gep~l~s~~rviV~IldvNdnsp~Flqk~~~~~v~~r~s~--plyRl~a~d~DeG~nar- 1195 (4289)
T KOG1219|consen 1119 RRLDREKQDEHILEVTIQDNGEPWLCSNQRVIVSILDVNDNSPRFLQKKTFLRVPERSSP--PLYRLAAQDNDEGNNAR- 1195 (4289)
T ss_pred             hhhcccccccceEEEEEecCCCCccccceEEEEEEeeccCCchhhhhheeEEEeeeccCC--ceeEEEEEecCCCcceE-
Confidence                        223334468899999999999999999999999999999999998875  88999999999999988 


Q ss_pred             eEEEEEEcCceeEE
Q psy4200         373 GIVYELLGEYGIMY  386 (388)
Q Consensus       373 ~i~ysi~~~~~~~~  386 (388)
                       |+|+|..+++.|+
T Consensus      1196 -ityniedgde~Fs 1208 (4289)
T KOG1219|consen 1196 -ITYNIEDGDEVFS 1208 (4289)
T ss_pred             -EEEecccCceEEE
Confidence             9999998888743


No 5  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.97  E-value=1.2e-28  Score=215.48  Aligned_cols=190  Identities=41%  Similarity=0.650  Sum_probs=168.4

Q ss_pred             CcceEEecCCCCCcEEEEEEEECCCC---CeEEEEEecC---CcEEEeCCeeEEEEcccCCCcccceeeEEEEEEEEec-
Q psy4200          16 NSPLVVEENTPPGTIVSTLEGVDPEG---SKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTLEGV-   88 (388)
Q Consensus        16 ~~~~~v~E~~~~gt~v~~v~a~D~D~---~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a~D~-   88 (388)
                      |. +.|+|+++.|+.|+++.|.|+|.   +.++|+|.++   .+|.|++.+|.|++++.||||....|.  |.|.|.|. 
T Consensus         2 ~~-~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~~~~~~--l~v~a~D~g   78 (199)
T cd00031           2 YS-VSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREEQSEYT--LTVVASDGG   78 (199)
T ss_pred             eE-EEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcCCceEE--EEEEEEECC
Confidence            56 89999999999999999999994   6899999974   389999999999999999999999999  88888882 


Q ss_pred             CCCCccceeeeecceeEEecCCccEEEEcCCCCccccceEEEEEEEeeCCCceeEEEEEEEEeecCCCCCeeeCCcceEE
Q psy4200          89 DPEGSKVKYGIYGTDWFSLDRDSGELRVAQPLYREDGHHTSKTNVDIRDGHHTSKTNVDIRVGDVQNTPPIFINSSFSGE  168 (388)
Q Consensus        89 ~P~f~~~~~~~~g~~v~~~D~d~g~~~~~~~~~~~~~~~~~~~~v~a~d~~~~~~~~v~V~V~dvNd~~P~f~~~~~~~~  168 (388)
                      .|                                                ..+....++|.|.|+||++|.|....|.+.
T Consensus        79 ~~------------------------------------------------~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~  110 (199)
T cd00031          79 GP------------------------------------------------PLSSTATVTVTVLDVNDNPPVFEQSSYEAS  110 (199)
T ss_pred             cC------------------------------------------------cceeEEEEEEEEccCCCCCCcccccceEEE
Confidence            11                                                123678999999999999999999999999


Q ss_pred             EecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCCC-CceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCC
Q psy4200         169 IMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNPD-EFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGK  247 (388)
Q Consensus       169 v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~~-~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~  247 (388)
                      |.|+.++|+.++++.|+|+|.+.++.++|+|.+... .+|.|++.+|.|++.+.||+|.    ...|.+.|.|+|  .|.
T Consensus       111 v~e~~~~~~~i~~~~a~D~D~~~~~~~~y~l~~~~~~~~f~i~~~~G~i~~~~~ld~e~----~~~~~l~v~a~D--~~~  184 (199)
T cd00031         111 VPENAPPGTVVGTVTATDADSGENAKLTYSILSGNDKELFSIDPNTGIITLAKPLDREE----KSSYELTVVATD--GGG  184 (199)
T ss_pred             EeCCCCCCCEEEEEEEEcCCCCCCccEEEEEeCCCCCCEEEEeCCceEEEeCCccCCcc----CceEEEEEEEEE--CCC
Confidence            999999999999999999999888899999997544 8999999999999999999999    558999999998  343


Q ss_pred             CcCCCCceEEEEEEEEEcc
Q psy4200         248 PLQEDQATAFAQVTVTILD  266 (388)
Q Consensus       248 ~~~~~~~~~~~~v~I~V~d  266 (388)
                      +    .++.++.+.|.|.|
T Consensus       185 ~----~~~~~~~i~i~v~d  199 (199)
T cd00031         185 P----PLSSTATVTVTVLD  199 (199)
T ss_pred             C----CceeEEEEEEEEEC
Confidence            2    46888888888865


No 6  
>cd00031 CA Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein; exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here
Probab=99.89  E-value=3.5e-21  Score=168.14  Aligned_cols=129  Identities=43%  Similarity=0.674  Sum_probs=117.8

Q ss_pred             cceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCCC-CceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEE
Q psy4200         164 SFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNPD-EFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRARE  242 (388)
Q Consensus       164 ~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~~-~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D  242 (388)
                      .|.++|.|+.+.|+.++++.|.|+|.+.++.++|+|.++.. .+|.|++.+|.|++.+.||||.    ...|.|.|+|+|
T Consensus         1 ~~~~~i~En~~~g~~v~~~~a~D~D~~~~~~~~y~i~~~~~~~~F~i~~~tG~l~~~~~lD~e~----~~~~~l~v~a~D   76 (199)
T cd00031           1 SYSVSVPENAPPGTVVGTVSATDPDSGENGRVTYSILGGNEDGLFSIDPNTGVITTTKPLDREE----QSEYTLTVVASD   76 (199)
T ss_pred             CeEEEEeCCCCCCCEEEEEEEECCCCCCCceEEEEEeCCCCcccEEEeCCCCEEEECCCCCCcC----CceEEEEEEEEE
Confidence            36789999999999999999999999988899999997543 7999999999999999999999    568999999997


Q ss_pred             ccCCCCcCCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEEEEeeCC
Q psy4200         243 MVDGKPLQEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDMIVTDSD  304 (388)
Q Consensus       243 ~~~g~~~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l~a~D~D  304 (388)
                        .|.+    .++....+.|.|.|+||++|.|....|.+.|.|+.+.|+.++  ++.|+|+|
T Consensus        77 --~g~~----~~~~~~~v~I~V~d~Nd~~P~~~~~~~~~~v~e~~~~~~~i~--~~~a~D~D  130 (199)
T cd00031          77 --GGGP----PLSSTATVTVTVLDVNDNPPVFEQSSYEASVPENAPPGTVVG--TVTATDAD  130 (199)
T ss_pred             --CCcC----cceeEEEEEEEEccCCCCCCcccccceEEEEeCCCCCCCEEE--EEEEEcCC
Confidence              4554    456899999999999999999998899999999999999997  89999999


No 7  
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism and modulate a wide variety of processes including cell polarisation and migration. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionarily related to the desmogleins, which are components of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and an N-terminal cytoplasmic domain. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats.
A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding and to confer rigidity upon the extracellular domain, and are essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.69  E-value=7.9e-16  Score=117.05  Aligned_cols=92  Identities=42%  Similarity=0.610  Sum_probs=82.9

Q ss_pred             ceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCC-CCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEc
Q psy4200         165 FSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNP-DEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREM  243 (388)
Q Consensus       165 ~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~-~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~  243 (388)
                      |.++|+|+.++|+.++++.|.|+|.+.|+.+.|+|.++. ..+|.|++.+|.|++.++||||.    ...|.|.|.|+|.
T Consensus         1 Y~~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~----~~~y~l~v~a~D~   76 (93)
T PF00028_consen    1 YSFSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRET----QSSYQLTVRATDS   76 (93)
T ss_dssp             EEEEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTT----TSEEEEEEEEEET
T ss_pred             CEEEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCccc----CCEEEEEEEEEEC
Confidence            788999999999999999999999999999999999744 79999999999999999999999    6689999999983


Q ss_pred             cCCCCcCCCCceEEEEEEEEEc
Q psy4200         244 VDGKPLQEDQATAFAQVTVTIL  265 (388)
Q Consensus       244 ~~g~~~~~~~~~~~~~v~I~V~  265 (388)
                       .|.+    .++++++|.|+|+
T Consensus        77 -~~~~----~~~~~~~V~I~V~   93 (93)
T PF00028_consen   77 -GGSP----PLSSTATVTINVL   93 (93)
T ss_dssp             -TTSS----EEEEEEEEEEEEE
T ss_pred             -CCCC----CCEEEEEEEEEEC
Confidence             1444    6889999999874


No 8  
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.58  E-value=2.1e-14  Score=105.61  Aligned_cols=78  Identities=44%  Similarity=0.685  Sum_probs=69.2

Q ss_pred             eeCCCCCCceEEEEEecCCC-CceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceEEEEEEEE
Q psy4200         185 KDGDLAQPRSIYYDLLTNPD-EFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATAFAQVTVT  263 (388)
Q Consensus       185 ~D~D~~~~~~v~y~l~~~~~-~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~  263 (388)
                      +|+|.|.|+.++|+|.++.. .+|.|++.+|.|.+.++||||.    ...|.|.|+|+|  .|.+    .+++.+.|.|.
T Consensus         1 ~D~D~g~n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~----~~~y~l~v~a~D--~~~~----~~~~~~~v~I~   70 (79)
T smart00112        1 TDADSGENGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREE----QPEYTLTVEATD--GGGP----PLSSTATVTVT   70 (79)
T ss_pred             CCCCCCcCcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeC----CCeEEEEEEEEE--CCCC----CcccEEEEEEE
Confidence            48899889999999997544 8999999999999999999998    568999999998  4543    57899999999


Q ss_pred             EcccCCCCC
Q psy4200         264 ILDVNDSPP  272 (388)
Q Consensus       264 V~dvNd~~P  272 (388)
                      |.|+|||+|
T Consensus        71 V~D~Nd~~P   79 (79)
T smart00112       71 VLDVNDNAP   79 (79)
T ss_pred             EEECCCCCC
Confidence            999999998


No 9  
>KOG1834|consensus
Probab=99.55  E-value=4.1e-13  Score=128.04  Aligned_cols=201  Identities=28%  Similarity=0.430  Sum_probs=149.7

Q ss_pred             CCCCCeeccCCCcceEEecCCCCCcEEE--EEEEECCCC------CeEEEEEecCC-cEE---EeCCee--EEEEcccCC
Q psy4200           5 GNSPPSFTTDVNSPLVVEENTPPGTIVS--TLEGVDPEG------SKVKYGIYGTD-RFS---LDRDSG--ELRVAQPLD   70 (388)
Q Consensus         5 nd~~P~F~~~~~~~~~v~E~~~~gt~v~--~v~a~D~D~------~~i~y~i~~~~-~F~---Id~~tG--~i~~~~~lD   70 (388)
                      |-+-|. ....|. ..|.||-.  +++.  -+.|.|.|.      .-.-|.|.+.+ +|.   +|..||  .|+.+.+||
T Consensus        27 nkhkpw-ie~ey~-gvV~Endn--tvll~Ppl~aLdkdaplr~ageiC~fklhgq~vPFdavVvdK~TGegvlRaK~~lD  102 (952)
T KOG1834|consen   27 NKHKPW-IEEEYH-GVVTENDN--TVLLDPPLAALDKDAPLRYAGEICGFKLHGQPVPFDAVVVDKYTGEGVLRAKEPLD  102 (952)
T ss_pred             cccCcc-ccccee-EEEEeCCc--eEEeCCCeeeecCCCCcccccccceeEecCCCCCceEEEEeccCCceEEeecCccc
Confidence            333443 344688 88999863  3332  356777763      44567777743 454   577765  788899999


Q ss_pred             CcccceeeEEEEEEEEe--cCCCCccceeeeecceeEEecCCccEEEEcCCCCccccceEEEEEEEeeCCCceeEEEEEE
Q psy4200          71 REYNSTNTSTIVLTLEG--VDPEGSKVKYGIYGTDWFSLDRDSGELRVAQPLYREDGHHTSKTNVDIRDGHHTSKTNVDI  148 (388)
Q Consensus        71 ~e~~~~~~~~l~v~a~D--~~P~f~~~~~~~~g~~v~~~D~d~g~~~~~~~~~~~~~~~~~~~~v~a~d~~~~~~~~v~V  148 (388)
                      .|.+..|+  |+|+|.|  .+|.-++                                           -..+..++++|
T Consensus       103 Celqkeyt--f~iQAydCg~gpdgtn-------------------------------------------~kKShkatvhI  137 (952)
T KOG1834|consen  103 CELQKEYT--FTIQAYDCGNGPDGTN-------------------------------------------TKKSHKATVHI  137 (952)
T ss_pred             ccccccce--EEEEEEecCCCCCccc-------------------------------------------cccccceEEEE
Confidence            99999999  9999998  2232110                                           12356679999


Q ss_pred             EEeecCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCC-Cce-EEEEEecCCCCceEEeCCeeEEEecccCCccc
Q psy4200         149 RVGDVQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQ-PRS-IYYDLLTNPDEFFLIDSNTGELKTAKPLDREI  226 (388)
Q Consensus       149 ~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~-~~~-v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~  226 (388)
                      .|.|+|+.+|+|....|.+.|.|. ++-..|++|.|.|.|-+. +++ ..|.|.. ++-+|.||. .|.|+...+|.|..
T Consensus       138 rVkDvNe~AP~f~ep~Yka~V~EG-K~yd~il~veAiD~DCspq~sqIC~YEI~t-~d~PFaIdn-~G~irnTekLny~k  214 (952)
T KOG1834|consen  138 RVKDVNEFAPVFKEPWYKAHVTEG-KVYDSILRVEAIDKDCSPQYSQICEYEITT-PDVPFAIDN-DGNIRNTEKLNYTK  214 (952)
T ss_pred             EeccccccCchhcccceeeEEecc-eeeeeeEEEEeecCCCCCcccceeEEEecC-CCCceEEcC-CCcccccccccccc
Confidence            999999999999999999999998 566689999999999864 555 4688885 678999974 79999999999988


Q ss_pred             cCCcccEEEEEEEEEEccCCCCcCCCCceEEEEEEEEEccc
Q psy4200         227 LGGTNGVISLTVRAREMVDGKPLQEDQATAFAQVTVTILDV  267 (388)
Q Consensus       227 ~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~V~dv  267 (388)
                          ...|.|+|.|.|+  |..    +....+.|+|+|...
T Consensus       215 ----e~~Y~ltVtAyDC--g~k----raa~d~lV~v~Vkp~  245 (952)
T KOG1834|consen  215 ----EHQYKLTVTAYDC--GKK----RAASDSLVTVHVKPT  245 (952)
T ss_pred             ----ceeEEEEEEEEec--ccc----cccCcceEEEEecCc
Confidence                6789999999984  332    223346777777654


No 10 
>PF00028 Cadherin:  Cadherin domain;  InterPro: IPR002126 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism and modulate a wide variety of processes including cell polarisation and migration. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionarily related to the desmogleins, which are components of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and an N-terminal cytoplasmic domain. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats.
A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding and to confer rigidity upon the extracellular domain, and are essential for cadherin adhesive function and for protection against protease digestion.; GO: 0005509 calcium ion binding, 0007156 homophilic cell adhesion, 0016020 membrane; PDB: 2A4E_A 2A4C_B 2O72_A 2QVI_A 1NCJ_A 3Q2W_A 3Q2N_A 3LNH_B 3LNI_A 3Q2L_A ....
Probab=99.52  E-value=1.7e-13  Score=104.19  Aligned_cols=70  Identities=37%  Similarity=0.701  Sum_probs=65.3

Q ss_pred             CcceEEecCCCCCcEEEEEEEECCC---CCeEEEEEecC---CcEEEeCCeeEEEEcccCCCcccceeeEEEEEEEEec
Q psy4200          16 NSPLVVEENTPPGTIVSTLEGVDPE---GSKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTLEGV   88 (388)
Q Consensus        16 ~~~~~v~E~~~~gt~v~~v~a~D~D---~~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a~D~   88 (388)
                      |. ++|+|++++|+.|+++.|.|+|   ++.+.|+|.++   .+|.|++.+|.|+++++||||....|.  |.|.|.|.
T Consensus         1 Y~-~~v~E~~~~g~~v~~v~a~D~D~~~n~~i~y~i~~~~~~~~F~I~~~tg~i~~~~~LD~E~~~~y~--l~v~a~D~   76 (93)
T PF00028_consen    1 YS-FSVPENAPPGTVVGQVTATDPDSGPNSQITYSILGGNPDGLFSIDPNTGEISLKKPLDRETQSSYQ--LTVRATDS   76 (93)
T ss_dssp             EE-EEEETTGSTSSEEEEEEEEESSTSTTSSEEEEEEETTSTTSEEEETTTTEEEESSSSCTTTTSEEE--EEEEEEET
T ss_pred             CE-EEEECCCCCCCEEEEEEEEeCCCCCCceEEEEEecCcccCceEEeeeeeccccceecCcccCCEEE--EEEEEEEC
Confidence            56 8999999999999999999998   69999999964   589999999999999999999999999  99999983


No 11 
>KOG1834|consensus
Probab=99.35  E-value=2.5e-11  Score=116.04  Aligned_cols=149  Identities=23%  Similarity=0.292  Sum_probs=116.1

Q ss_pred             EEEEeecCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCC--Cc-eEEEEEecCCCCceE---EeCCe--eEEEe
Q psy4200         147 DIRVGDVQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQ--PR-SIYYDLLTNPDEFFL---IDSNT--GELKT  218 (388)
Q Consensus       147 ~V~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~--~~-~v~y~l~~~~~~~F~---id~~t--G~i~~  218 (388)
                      ....--+|-+.|... ..|.+-|.||...-...--+.|-|.|.+.  .+ ..-|.|.+. .-+|.   +|..|  |.|+.
T Consensus        20 ~~~aarankhkpwie-~ey~gvV~Endntvll~Ppl~aLdkdaplr~ageiC~fklhgq-~vPFdavVvdK~TGegvlRa   97 (952)
T KOG1834|consen   20 HHHAARANKHKPWIE-EEYHGVVTENDNTVLLDPPLAALDKDAPLRYAGEICGFKLHGQ-PVPFDAVVVDKYTGEGVLRA   97 (952)
T ss_pred             ccccccccccCcccc-cceeEEEEeCCceEEeCCCeeeecCCCCcccccccceeEecCC-CCCceEEEEeccCCceEEee
Confidence            445566787888776 88999999996544444458888988753  12 456788753 34555   47766  57888


Q ss_pred             cccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCCCCCCeeeeeEE
Q psy4200         219 AKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDIPDGSLLPDLDM  298 (388)
Q Consensus       219 ~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~l~l  298 (388)
                      +.+||.|.    +..|+|+|+|.|+..|..+.....+..++|.|.|.|+|+.+|.|....|.+.|.|+.-....+   ++
T Consensus        98 K~~lDCel----qkeytf~iQAydCg~gpdgtn~kKShkatvhIrVkDvNe~AP~f~ep~Yka~V~EGK~yd~il---~v  170 (952)
T KOG1834|consen   98 KEPLDCEL----QKEYTFTIQAYDCGNGPDGTNTKKSHKATVHIRVKDVNEFAPVFKEPWYKAHVTEGKVYDSIL---RV  170 (952)
T ss_pred             cCcccccc----cccceEEEEEEecCCCCCccccccccceEEEEEeccccccCchhcccceeeEEecceeeeeeE---EE
Confidence            99999998    568999999999765544444467788999999999999999999999999999998777665   89


Q ss_pred             EEeeCC
Q psy4200         299 IVTDSD  304 (388)
Q Consensus       299 ~a~D~D  304 (388)
                      .|.|.|
T Consensus       171 eAiD~D  176 (952)
T KOG1834|consen  171 EAIDKD  176 (952)
T ss_pred             EeecCC
Confidence            999999


No 12 
>smart00112 CA Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Probab=99.34  E-value=1e-11  Score=91.25  Aligned_cols=69  Identities=30%  Similarity=0.489  Sum_probs=59.3

Q ss_pred             CCeEEEEEecC---CcEEEeCCeeEEEEcccCCCcccceeeEEEEEEEEecCCCCccceeeeecceeEEecCCccEEEEc
Q psy4200          41 GSKVKYGIYGT---DRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTLEGVDPEGSKVKYGIYGTDWFSLDRDSGELRVA  117 (388)
Q Consensus        41 ~~~i~y~i~~~---~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a~D~~P~f~~~~~~~~g~~v~~~D~d~g~~~~~  117 (388)
                      ++.++|+|.++   .+|.|++.+|.|++.++||||....|.  |.|.|.|.+                            
T Consensus         8 n~~i~Y~i~~~~~~~~F~i~~~tg~i~~~~~LD~e~~~~y~--l~v~a~D~~----------------------------   57 (79)
T smart00112        8 NGKVTYSILSGNEDGLFSIDPETGEITTTKPLDREEQPEYT--LTVEATDGG----------------------------   57 (79)
T ss_pred             CcEEEEEEecCCCCCEEEEeCCccEEEeCCccCeeCCCeEE--EEEEEEECC----------------------------
Confidence            58899999864   689999999999999999999999999  999988821                            


Q ss_pred             CCCCccccceEEEEEEEeeCCCceeEEEEEEEEeecCCCCC
Q psy4200         118 QPLYREDGHHTSKTNVDIRDGHHTSKTNVDIRVGDVQNTPP  158 (388)
Q Consensus       118 ~~~~~~~~~~~~~~~v~a~d~~~~~~~~v~V~V~dvNd~~P  158 (388)
                                         ...+++.+.|+|.|.|+|||+|
T Consensus        58 -------------------~~~~~~~~~v~I~V~D~Nd~~P   79 (79)
T smart00112       58 -------------------GPPLSSTATVTVTVLDVNDNAP   79 (79)
T ss_pred             -------------------CCCcccEEEEEEEEEECCCCCC
Confidence                               0125677899999999999998


No 13 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism and modulate a wide variety of processes including cell polarisation and migration. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionarily related to the desmogleins, which are components of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and an N-terminal cytoplasmic domain. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats.
A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding and to confer rigidity upon the extracellular domain, and are essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.69  E-value=4.5e-05  Score=55.91  Aligned_cols=66  Identities=27%  Similarity=0.467  Sum_probs=40.8

Q ss_pred             ceEEEecCCCCCcEEEEEEEeeCCCCCC--ceEEEEEec-CCCCceEEeCCeeEEEecccCCccccCCcc
Q psy4200         165 FSGEIMESAPIGSVVLRVEAKDGDLAQP--RSIYYDLLT-NPDEFFLIDSNTGELKTAKPLDREILGGTN  231 (388)
Q Consensus       165 ~~~~v~E~~~~g~~v~~v~A~D~D~~~~--~~v~y~l~~-~~~~~F~id~~tG~i~~~~~LD~E~~~~~~  231 (388)
                      ...+|+|..+.|+.|+.+ |.|.-....  ....|++.+ ....+|.++..+|.|.++..+|||.+...+
T Consensus         3 i~YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~LC~~~   71 (84)
T PF08266_consen    3 IRYSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREELCGQS   71 (84)
T ss_dssp             EEEEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC-TTS
T ss_pred             eEEEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHHHCCCC
Confidence            457899999999999998 445432211  123566654 456899999999999999999999976433


No 14 
>PF08266 Cadherin_2:  Cadherin-like;  InterPro: IPR013164 Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues of the organism and modulate a wide variety of processes including cell polarisation and migration. Cadherin-mediated cell-cell junctions are formed as a result of interaction between extracellular domains of identical cadherins, which are located on the membranes of the neighbouring cells. The stability of these adhesive junctions is ensured by binding of the intracellular cadherin domain with the actin cytoskeleton. There are a number of different isoforms distributed in a tissue-specific manner in a wide variety of organisms. Cells containing different cadherins tend to segregate in vitro, while those that contain the same cadherins tend to preferentially aggregate together. This observation is linked to the finding that cadherin expression causes morphological changes involving the positional segregation of cells into layers, suggesting they may play an important role in the sorting of different cell types during morphogenesis, histogenesis and regeneration. They may also be involved in the regulation of tight and gap junctions, and in the control of intercellular spacing. Cadherins are evolutionarily related to the desmogleins, which are components of intercellular desmosome junctions involved in the interaction of plaque proteins. Structurally, cadherins comprise a number of domains: classically, these include a signal sequence; a propeptide of around 130 residues; a single transmembrane domain and five tandemly repeated extracellular cadherin domains, 4 of which are cadherin repeats, and the fifth contains 4 conserved cysteines and an N-terminal cytoplasmic domain. However, proteins are designated as members of the broadly defined cadherin family if they have one or more cadherin repeats.
A cadherin repeat is an independently folding sequence of approximately 110 amino acids that contains motifs with the conserved sequences DRE, DXNDNAPXF, and DXD. Crystal structures have revealed that multiple cadherin domains form Ca2+-dependent rod-like structures with a conserved Ca2+-binding pocket at the domain-domain interface. Cadherins depend on calcium for their function: calcium ions bind to specific residues in each cadherin repeat to ensure its proper folding and to confer rigidity upon the extracellular domain, and are essential for cadherin adhesive function and for protection against protease digestion. This entry represents a cadherin domain that is usually found at the N terminus of cadherin proteins.; PDB: 1WUZ_A 1WYJ_A.
Probab=97.61  E-value=4.8e-05  Score=55.76  Aligned_cols=59  Identities=27%  Similarity=0.481  Sum_probs=38.8

Q ss_pred             eEEecCCCCCcEEEEEEEECCC-----CCeEEEEEec---CCcEEEeCCeeEEEEcccCCCcccceee
Q psy4200          19 LVVEENTPPGTIVSTLEGVDPE-----GSKVKYGIYG---TDRFSLDRDSGELRVAQPLDREYNSTNT   78 (388)
Q Consensus        19 ~~v~E~~~~gt~v~~v~a~D~D-----~~~i~y~i~~---~~~F~Id~~tG~i~~~~~lD~e~~~~~~   78 (388)
                      ++|+|..+.|+.|+.| |.|..     ...-.|++.+   ..+|.+++.+|.|+++..+|||+.+...
T Consensus         5 YsV~EE~~~Gt~IGni-a~dL~l~~~~l~~~~~ri~s~~~~~~~~v~~~tG~L~v~~rIDRE~LC~~~   71 (84)
T PF08266_consen    5 YSVPEEMPPGTVIGNI-AKDLGLDPQSLSSRNFRIVSEGNSQYFRVNEKTGDLFVSERIDREELCGQS   71 (84)
T ss_dssp             EEEESS--TT-EEEEC-CCCCT--HHHHCCTTBEEE-SSSS-SEEE-TTTSEEEESS--SCCCC-TTS
T ss_pred             EEeecCCCCCCEEEEh-HHhhCCCcccccccceEEeecCCcceeEecCCceeEEeCCccCHHHHCCCC
Confidence            7899999999999999 55653     1223455543   4599999999999999999999976543


No 15 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium-dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=97.43  E-value=0.00037  Score=51.79  Aligned_cols=76  Identities=14%  Similarity=0.235  Sum_probs=42.1

Q ss_pred             CCCeeccCCCcceEEecCCCCCcEEEEEEEECCCC-CeEEEEEecCCcEEEeCCeeEEEEcccCCCcccceeeEEEEEEE
Q psy4200           7 SPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPEG-SKVKYGIYGTDRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTL   85 (388)
Q Consensus         7 ~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D~-~~i~y~i~~~~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a   85 (388)
                      +.|=|.+..|. +.|+.+...|..|++|.-.|..+ ..+.|....+ .|.|.+ .|.|++++++..... ...  +.|.|
T Consensus         2 C~pGF~~~~~~-~~Vp~~l~~g~~lg~V~f~dC~~~~~~~~~ssDp-dF~V~~-DGsVy~~r~v~l~~~-~~~--F~V~a   75 (90)
T PF08758_consen    2 CRPGFSQKKYT-FEVPSNLEAGQPLGKVNFEDCTGRRRVIFESSDP-DFRVLE-DGSVYAKRPVQLSSE-QRS--FTVHA   75 (90)
T ss_dssp             ---B--S-EEE-E----SS-SS--EEE---B--SS---EEEE---S-EEEEET-TTEEEEES--S-SSS--EE--EEEEE
T ss_pred             CcCCcccceEE-EEcCchhhCCcEEEEEEeccCCCCCceEEecCCC-CEEEcC-CCeEEEeeeEecCCC-ceE--EEEEE
Confidence            45889999999 99999999999999999999875 5688876655 999998 799999999877533 235  99999


Q ss_pred             Eec
Q psy4200          86 EGV   88 (388)
Q Consensus        86 ~D~   88 (388)
                      .|.
T Consensus        76 ~D~   78 (90)
T PF08758_consen   76 WDS   78 (90)
T ss_dssp             EET
T ss_pred             ECC
Confidence            993


No 16 
>PF08758 Cadherin_pro:  Cadherin prodomain like;  InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium-dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=95.96  E-value=0.091  Score=39.04  Aligned_cols=77  Identities=18%  Similarity=0.226  Sum_probs=40.8

Q ss_pred             CCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEecCCCCceEEeCCeeEEEecccCCccccCCcccEEE
Q psy4200         156 TPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLTNPDEFFLIDSNTGELKTAKPLDREILGGTNGVIS  235 (388)
Q Consensus       156 ~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~  235 (388)
                      +.|-|.+..|.+.|+.+...|..|++|.-.|-...  ..+.|... +  ..|.|.+ .|.|.+++++.....     .-.
T Consensus         2 C~pGF~~~~~~~~Vp~~l~~g~~lg~V~f~dC~~~--~~~~~~ss-D--pdF~V~~-DGsVy~~r~v~l~~~-----~~~   70 (90)
T PF08758_consen    2 CRPGFSQKKYTFEVPSNLEAGQPLGKVNFEDCTGR--RRVIFESS-D--PDFRVLE-DGSVYAKRPVQLSSE-----QRS   70 (90)
T ss_dssp             ---B--S-EEEE----SS-SS--EEE---B--SS-----EEEE------SEEEEET-TTEEEEES--S-SSS------EE
T ss_pred             CcCCcccceEEEEcCchhhCCcEEEEEEeccCCCC--CceEEecC-C--CCEEEcC-CCeEEEeeeEecCCC-----ceE
Confidence            46889999999999999999999999999887433  46777764 2  3799964 799999999887542     358


Q ss_pred             EEEEEEEc
Q psy4200         236 LTVRAREM  243 (388)
Q Consensus       236 l~v~a~D~  243 (388)
                      |.|.|.|.
T Consensus        71 F~V~a~D~   78 (90)
T PF08758_consen   71 FTVHAWDS   78 (90)
T ss_dssp             EEEEEEET
T ss_pred             EEEEEECC
Confidence            99999984


No 17 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=94.17  E-value=1.1  Score=33.77  Aligned_cols=69  Identities=30%  Similarity=0.526  Sum_probs=49.0

Q ss_pred             EeeCCCCCCceEEEEEecC----CCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceEEEE
Q psy4200         184 AKDGDLAQPRSIYYDLLTN----PDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATAFAQ  259 (388)
Q Consensus       184 A~D~D~~~~~~v~y~l~~~----~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~  259 (388)
                      ..|+| +  ..++|++...    -..|..+++.++.+.-. |.....     +.|.+.|.|+|. .|       .+....
T Consensus        24 F~d~d-~--~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~~~-----g~~~i~v~a~D~-~g-------~~~~~~   86 (97)
T smart00736       24 FTDAD-G--DTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNSDV-----GSLSLKVTATDS-SG-------ASASDT   86 (97)
T ss_pred             eECCC-C--CeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCCCC-----cEEEEEEEEEEC-CC-------CEEEEE
Confidence            45666 2  3788888631    24699999998887773 444332     369999999973 12       467888


Q ss_pred             EEEEEcccCC
Q psy4200         260 VTVTILDVND  269 (388)
Q Consensus       260 v~I~V~dvNd  269 (388)
                      +.|.|.+.|+
T Consensus        87 f~i~V~~~~~   96 (97)
T smart00736       87 FTITVVNTND   96 (97)
T ss_pred             EEEEEeCCCC
Confidence            9999999886


No 18 
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=93.63  E-value=1.4  Score=33.23  Aligned_cols=49  Identities=20%  Similarity=0.338  Sum_probs=33.9

Q ss_pred             EEECCCCCeEEEEEec------CCcEEEeCCeeEEEEcccCCCcccceeeEEEEEEEEe
Q psy4200          35 EGVDPEGSKVKYGIYG------TDRFSLDRDSGELRVAQPLDREYNSTNTSTIVLTLEG   87 (388)
Q Consensus        35 ~a~D~D~~~i~y~i~~------~~~F~Id~~tG~i~~~~~lD~e~~~~~~~~l~v~a~D   87 (388)
                      ...|+|+..++|++..      +.+.+.|+.++.++=. +...+ ...+.  ++|.|+|
T Consensus        23 tF~d~d~~~lty~~~~~~~~~lP~Wl~fd~~~~~~~Gt-P~~~~-~g~~~--i~v~a~D   77 (97)
T smart00736       23 TFTDADGDTLTYSATLSDGSALPSWLSFDSDTGTLSGT-PTNSD-VGSLS--LKVTATD   77 (97)
T ss_pred             ceECCCCCeEEEEEEeCCCCCCCCeEEEeCCCCEEEEE-CCCCC-CcEEE--EEEEEEE
Confidence            3578888899999973      3478999988887764 33333 34477  6666666


No 19 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found in multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggest a role for this domain in adhesion.
Probab=92.98  E-value=1.3  Score=33.36  Aligned_cols=39  Identities=18%  Similarity=0.313  Sum_probs=28.0

Q ss_pred             EEEEEeeCCCceeEEEEEEEEeecCCCCCeeeCCcceEEEecCC
Q psy4200         130 KTNVDIRDGHHTSKTNVDIRVGDVQNTPPIFINSSFSGEIMESA  173 (388)
Q Consensus       130 ~~~v~a~d~~~~~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~  173 (388)
                      .|++.+.||.   ...|+|.|.-.|| +|+.. ..-...+.|+.
T Consensus        60 sFtvtv~DGt---t~~vtItI~GtND-apvi~-~~~~g~v~ED~   98 (99)
T TIGR01965        60 TFTVTSADGT---SQTVTITITGAND-AAVIG-GADTGSVTEDS   98 (99)
T ss_pred             EEEEEEeCCC---eEEEEEEEEccCC-CCEEe-cccceeEecCC
Confidence            5667777773   7789999999997 57665 34456777763


No 20 
>TIGR01965 VCBS_repeat VCBS repeat. This domain of about 100 residues is found in multiple (up to 35) copies in long proteins from several species of Vibrio, Colwellia, Bradyrhizobium, and Shewanella (hence the name VCBS) and in smaller copy numbers in proteins from several other bacteria. The large protein size and repeat copy numbers, species distribution, and suggested activities of several member proteins suggest a role for this domain in adhesion.
Probab=92.20  E-value=1.1  Score=33.93  Aligned_cols=87  Identities=24%  Similarity=0.258  Sum_probs=53.3

Q ss_pred             EEEEEeeCCCCCCceEEEEEec--CCCCceEEeCCeeEEEecc--------cCCccccCCcccEEEEEEEEEEccCCCCc
Q psy4200         180 LRVEAKDGDLAQPRSIYYDLLT--NPDEFFLIDSNTGELKTAK--------PLDREILGGTNGVISLTVRAREMVDGKPL  249 (388)
Q Consensus       180 ~~v~A~D~D~~~~~~v~y~l~~--~~~~~F~id~~tG~i~~~~--------~LD~E~~~~~~~~~~l~v~a~D~~~g~~~  249 (388)
                      +++.++|+|.+.  ...+++..  +..+.|.|++ +|.....-        .|....    ...-.|.+.+.   ||.  
T Consensus         2 G~Lt~sD~D~gd--~~~~s~~~~~g~yGtlti~~-~G~wtYtl~n~~~avq~L~~Ge----~~tdsFtvtv~---DGt--   69 (99)
T TIGR01965         2 GQLTISDADAGQ--AHFIAQTDAAGQYGTFSIDA-DGQWTYQADNSQTAVQALKAGE----TLTDTFTVTSA---DGT--   69 (99)
T ss_pred             CceEEeCCCCCC--ceEEecccccCCcEEEEECC-CCcEEEEeCCCcHHHHhhcCCC----EEEEEEEEEEe---CCC--
Confidence            468889999875  45555542  4467788876 66554421        122111    33567888888   452  


Q ss_pred             CCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCC
Q psy4200         250 QEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDI  287 (388)
Q Consensus       250 ~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~  287 (388)
                             +..|.|+|...||.| .... .-...+.|+.
T Consensus        70 -------t~~vtItI~GtNDap-vi~~-~~~g~v~ED~   98 (99)
T TIGR01965        70 -------SQTVTITITGANDAA-VIGG-ADTGSVTEDS   98 (99)
T ss_pred             -------eEEEEEEEEccCCCC-EEec-ccceeEecCC
Confidence                   678999999999755 4332 2245666653


No 21 
>PF07495 Y_Y_Y:  Y_Y_Y domain;  InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=82.19  E-value=7.7  Score=26.44  Aligned_cols=45  Identities=24%  Similarity=0.300  Sum_probs=26.8

Q ss_pred             CCceEEEEEecCCCCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEEc
Q psy4200         191 QPRSIYYDLLTNPDEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRAREM  243 (388)
Q Consensus       191 ~~~~v~y~l~~~~~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~  243 (388)
                      .+-..+|+|.+-...+..+...+-.+      .+-.++  .+.|+|.|+|.|.
T Consensus         6 ~~~~Y~Y~l~g~d~~W~~~~~~~~~~------~~~~L~--~G~Y~l~V~a~~~   50 (66)
T PF07495_consen    6 ENIRYRYRLEGFDDEWITLGSYSNSI------SYTNLP--PGKYTLEVRAKDN   50 (66)
T ss_dssp             TTEEEEEEEETTESSEEEESSTS-EE------EEES----SEEEEEEEEEEET
T ss_pred             CceEEEEEEECCCCeEEECCCCcEEE------EEEeCC--CEEEEEEEEEECC
Confidence            34467788876555666664332222      222333  5789999999985


No 22 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=78.24  E-value=1e+02  Score=33.17  Aligned_cols=47  Identities=19%  Similarity=0.214  Sum_probs=28.3

Q ss_pred             cCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEec
Q psy4200         153 VQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLT  201 (388)
Q Consensus       153 vNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~  201 (388)
                      .||.++.|.-..-..+|.|++  |+.-..|.-...|.+....+.|+..+
T Consensus       395 ~dd~~s~i~Fe~~~Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~D  441 (928)
T TIGR00845       395 ENDPVSKIFFEPGHYTCLENC--GTVALTVVRRGGDLTNTVYVDYRTED  441 (928)
T ss_pred             ccCCcceEEecCCeEEEeecC--cEEEEEEEEccCCCCceEEEEEEccC
Confidence            455566655555556899984  66656665544444444567777654


No 23 
>TIGR03660 T1SS_rpt_143 T1SS-143 repeat domain. This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion.
Probab=74.94  E-value=43  Score=27.04  Aligned_cols=60  Identities=25%  Similarity=0.416  Sum_probs=37.3

Q ss_pred             EEEecccCCccccCCcccEEEEEEEEEEccCCCCcCCCCceEEEEEEEEEcccCCCCCccCCCcEEEEEeCCC
Q psy4200         215 ELKTAKPLDREILGGTNGVISLTVRAREMVDGKPLQEDQATAFAQVTVTILDVNDSPPVFNRKEYVVHIPEDI  287 (388)
Q Consensus       215 ~i~~~~~LD~E~~~~~~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~V~dvNd~~P~f~~~~~~~~v~E~~  287 (388)
                      .+.+.++||...- ...-...|.|.|+|. +|..       +...+.|+|.|  |. |...... ..+|.|+.
T Consensus        69 tftL~~~lDH~~g-~d~l~l~~~v~a~D~-DGD~-------s~~~l~VtI~D--D~-P~~~~~~-~~~V~E~~  128 (137)
T TIGR03660        69 EFTLEGPLDHAAG-SDELTLNFPIIATDF-DGDT-------SSITLPVTIVD--DV-PTITDVD-ALTVDEDD  128 (137)
T ss_pred             EEEEcccccCCCC-CceEEEeeeEEEEeC-CCCc-------cccEEEEEEEC--CC-Ceecccc-ceEEeccc
Confidence            4455666666431 113457889999987 6653       23588888887  44 6665433 37888853


No 24 
>PF05345 He_PIG:  Putative Ig domain;  InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=67.97  E-value=25  Score=22.61  Aligned_cols=34  Identities=21%  Similarity=0.306  Sum_probs=25.1

Q ss_pred             CCceEEeCCeeEEEecccCCccccCCcccEEEEEEEEEE
Q psy4200         204 DEFFLIDSNTGELKTAKPLDREILGGTNGVISLTVRARE  242 (388)
Q Consensus       204 ~~~F~id~~tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D  242 (388)
                      ..+..||+.+|.|.-.-.-.-+     .+.|.+.|.|+|
T Consensus        13 P~gLs~d~~tG~isGtp~~~~~-----~G~y~~~vtatd   46 (49)
T PF05345_consen   13 PSGLSLDPSTGTISGTPTSSVQ-----PGTYTFTVTATD   46 (49)
T ss_pred             CCcEEEeCCCCEEEeecCCCcc-----ccEEEEEEEEEc
Confidence            4678999999999886332211     247999999996


No 25 
>TIGR00845 caca sodium/calcium exchanger 1. This model is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family
Probab=56.63  E-value=2.9e+02  Score=29.95  Aligned_cols=59  Identities=15%  Similarity=0.141  Sum_probs=36.3

Q ss_pred             ccCCCCCeeccCCCcceEEecCCCCCcEEEEEEEECCC-C--CeEEEEEecC-----CcEEEeCCeeEEEEc
Q psy4200           3 AYGNSPPSFTTDVNSPLVVEENTPPGTIVSTLEGVDPE-G--SKVKYGIYGT-----DRFSLDRDSGELRVA   66 (388)
Q Consensus         3 d~nd~~P~F~~~~~~~~~v~E~~~~gt~v~~v~a~D~D-~--~~i~y~i~~~-----~~F~Id~~tG~i~~~   66 (388)
                      +.||..++|....-. ..|.|+.  |+.-.+|.-...+ +  -.+.|+..++     .-|.  +.+|.|...
T Consensus       394 ~~dd~~s~i~Fe~~~-Y~V~En~--GtV~VtV~R~GGdl~~tVsVdY~T~DGTA~AG~DY~--~~sGTLtF~  460 (928)
T TIGR00845       394 EENDPVSKIFFEPGH-YTCLENC--GTVALTVVRRGGDLTNTVYVDYRTEDGTANAGSDYE--FTEGTLVFK  460 (928)
T ss_pred             cccCCcceEEecCCe-EEEeecC--cEEEEEEEEccCCCCceEEEEEEccCCccCCCCCcc--ccCceEEEC
Confidence            356667777666666 7899996  7776666655433 2  5588887642     2333  235766544


No 26 
>KOG3597|consensus
Probab=49.15  E-value=38  Score=33.05  Aligned_cols=62  Identities=23%  Similarity=0.228  Sum_probs=46.7

Q ss_pred             EEEEEEEEEEeCCCCCCeeeccceEEEEecCCCCCcEEEEEEEEcCCCCCCCceeEEEEEEcCce
Q psy4200         319 SSATLIVQVTDVNDNVPSFELNAYTGNVLETAQAGTSITTITALDSDGGDYGTGGIVYELLGEYG  383 (388)
Q Consensus       319 ~~~~v~I~V~dvND~~P~f~~~~y~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~~i~ysi~~~~~  383 (388)
                      -+....|.|..+||.+..+....+.+-+.|+...-.-.-.+.+.|+|...-   ++.|+|.+...
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~~---~l~f~v~~t~~   85 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAPL---PLEFQVLGTSS   85 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCcc---ceEEEEccCCC
Confidence            566788999999997777777777777777765433345699999998874   38999986543


No 27 
>PF03160 Calx-beta:  Calx-beta domain;  InterPro: IPR003644 The calx-beta motif is present as a tandem repeat in the cytoplasmic domains of Calx Na-Ca exchangers, which are used to expel calcium from cells. This motif overlaps domains used for calcium binding and regulation. The calx-beta motif is also present in the cytoplasmic tail of mammalian integrin-beta4, which mediates the bi-directional transfer of signals across the plasma membrane, as well as in some cyanobacterial proteins. This motif contains a series of beta-strands and turns that form a self-contained beta-sheet.; GO: 0007154 cell communication, 0016021 integral to membrane; PDB: 3H6A_B 3FSO_A 3FQ4_B 2DPK_A 2QVM_A 3GIN_B 2QVK_A 2FWU_A 2FWS_A 3E9U_A ....
Probab=47.37  E-value=92  Score=23.05  Aligned_cols=53  Identities=21%  Similarity=0.248  Sum_probs=28.1

Q ss_pred             EEEEEeCCCCCCeeeccceEEEEecCCCCCcEEEEEEEEcCCCCCCCceeEEEEEEcCc
Q psy4200         324 IVQVTDVNDNVPSFELNAYTGNVLETAQAGTSITTITALDSDGGDYGTGGIVYELLGEY  382 (388)
Q Consensus       324 ~I~V~dvND~~P~f~~~~y~~~v~e~~~~g~~v~~v~a~D~D~~~~~~~~i~ysi~~~~  382 (388)
                      +|.|+| ||. |.+.-..-..++.|+.  |..-..|.-...+....  -.+.|+..++.
T Consensus         2 tvtI~d-~d~-~~v~f~~~~~~v~E~~--~~~~v~V~~~~~~~~~~--v~v~~~~~~gt   54 (100)
T PF03160_consen    2 TVTILD-DDD-PTVSFSSPSYTVSEGD--GTVTVTVTRSGGSLDGP--VTVNYSTVDGT   54 (100)
T ss_dssp             EEEEE--TTS-EEEEESSSEEEEETTS--SEEEEEEEEESS-TSSE--EEEEEEEEESS
T ss_pred             EEEEEC-CCC-CEEEEeCCEEEEEeCC--CEEEEEEEEcccCCCcc--eEEEEEEeCCc
Confidence            567778 664 4766555556777875  33444455444432222  23777766543


No 28 
>KOG3597|consensus
Probab=33.71  E-value=1e+02  Score=30.27  Aligned_cols=59  Identities=20%  Similarity=0.180  Sum_probs=45.9

Q ss_pred             eEEEEEEEEeecCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCCCCCceEEEEEec
Q psy4200         142 SKTNVDIRVGDVQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDLAQPRSIYYDLLT  201 (388)
Q Consensus       142 ~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~~~~~~v~y~l~~  201 (388)
                      .+....|.|..+||.+..+....+.+-+.|+...-.....+.+.|+|... ..+.|++.+
T Consensus        24 ~~~~~~i~v~pvndpp~~~~~~~~~l~~~~~~~k~l~~~~l~~~d~d~~~-~~l~f~v~~   82 (442)
T KOG3597|consen   24 QTDVLRIHVNPVNDPPSLIFPSGSLLVILEGGQKVLDPELLTAADPDSAP-LPLEFQVLG   82 (442)
T ss_pred             EEeeecccccccCCCcceeecccceEEeecCCceeccceEeeccCCCCCc-cceEEEEcc
Confidence            45678899999999888777777778888886555556778899999774 468888885


No 29 
>PF05895 DUF859:  Siphovirus protein of unknown function (DUF859);  InterPro: IPR008577 This entry is represented by Streptococcus phage 7201, Orf39. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several uncharacterised proteins from a number of the Siphoviruses as well as some bacterial proteins from Streptococcus species. Some of the members of this family are described as putative minor structural proteins.
Probab=26.65  E-value=7.4e+02  Score=25.70  Aligned_cols=102  Identities=13%  Similarity=0.154  Sum_probs=52.8

Q ss_pred             EEEEEeeCC--CceeEEEEEEEEeecCCCCCeeeCCcceEEEecCCCCCcEEEEEEEeeCCC------CCCceEEEEEec
Q psy4200         130 KTNVDIRDG--HHTSKTNVDIRVGDVQNTPPIFINSSFSGEIMESAPIGSVVLRVEAKDGDL------AQPRSIYYDLLT  201 (388)
Q Consensus       130 ~~~v~a~d~--~~~~~~~v~V~V~dvNd~~P~f~~~~~~~~v~E~~~~g~~v~~v~A~D~D~------~~~~~v~y~l~~  201 (388)
                      .+++.++|.  ..++.....|.|++=.  +|.+.-..++..-.++    .......|.=...      -...++.|+...
T Consensus       301 Ti~atVtDSRGr~S~~~~~tItVl~Y~--~P~lsfsv~R~~~~~~----~~~v~~~a~Iapl~v~g~qKN~~~lt~~~a~  374 (624)
T PF05895_consen  301 TIRATVTDSRGRTSDPKTKTITVLEYS--PPTLSFSVYRCGSSGN----TLTVTRNAKIAPLTVNGVQKNTMTLTFKVAP  374 (624)
T ss_pred             EEEEEEEECCCccCCceEEEEEEEEcC--CCcEEEEEEEeCCCCc----EEEEEEEEEEeEEEEcccccceEEEEEEEEE
Confidence            445555664  3456778999999874  7877533332222222    1112222211111      112356777665


Q ss_pred             CCCCceEEeCC-------------eeEEEecccCCccccCCcccEEEEEEEEEEc
Q psy4200         202 NPDEFFLIDSN-------------TGELKTAKPLDREILGGTNGVISLTVRAREM  243 (388)
Q Consensus       202 ~~~~~F~id~~-------------tG~i~~~~~LD~E~~~~~~~~~~l~v~a~D~  243 (388)
                      -....|.+|..             .+...+...+|-+.      .|.+.+.++|.
T Consensus       375 ~gt~~~t~d~~~a~~~~s~~s~~~~~~~~L~g~y~~~k------Sy~V~~~l~D~  423 (624)
T PF05895_consen  375 LGTGTFTTDNGSASGTWSSISELTNSSANLGGTYDAEK------SYDVRGTLSDK  423 (624)
T ss_pred             cCcceEEEEccccccceeeeeeecccceeeccccCCCc------eEEEEEEEEEE
Confidence            34455655432             11234445566655      79999999984


No 30 
>PF09100 Qn_am_d_aIV:  Quinohemoprotein amine dehydrogenase, alpha subunit domain IV;  InterPro: IPR015184 This domain is predominantly found in the prokaryotic protein quinohemoprotein amine dehydrogenase, adopting an immunoglobulin-like beta-sandwich fold, with seven strands arranged into two beta sheets; the fold is possibly related to the immunoglobulin and/or fibronectin type III superfamilies. The precise function of this domain has not, as yet, been defined.; PDB: 1JMZ_A 1JMX_A 1PBY_A 1JJU_A.
Probab=26.14  E-value=1.2e+02  Score=23.98  Aligned_cols=30  Identities=33%  Similarity=0.415  Sum_probs=15.9

Q ss_pred             eEEEEecCCCceeEEEEEEEEEEEeCCCCCC
Q psy4200         305 LVIAEETHTAEKLSSSATLIVQVTDVNDNVP  335 (388)
Q Consensus       305 ~v~~~~~~~~~~~s~~~~v~I~V~dvND~~P  335 (388)
                      .|+|+-..+...++..+.+.|+|.+.|+ +|
T Consensus       103 ~VvAtv~d~~~~l~~e~~liVtVqr~~~-pp  132 (133)
T PF09100_consen  103 KVVATVKDGGKPLTGEAHLIVTVQRWNN-PP  132 (133)
T ss_dssp             EEEEEETTTT---EEEEEEEEE---S----S
T ss_pred             EEEEEEccCCcccceeEeEEEEeecccC-CC
Confidence            3455555667789999999999999987 66


No 31 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=20.45  E-value=1.2e+02  Score=23.14  Aligned_cols=28  Identities=21%  Similarity=0.264  Sum_probs=18.7

Q ss_pred             ccEEEEEEEEEEccCCCCcCCCCceEEEEEEEEEcc
Q psy4200         231 NGVISLTVRAREMVDGKPLQEDQATAFAQVTVTILD  266 (388)
Q Consensus       231 ~~~~~l~v~a~D~~~g~~~~~~~~~~~~~v~I~V~d  266 (388)
                      .+.|.+.|+++|. +|+       +..+.+.|.|-|
T Consensus        79 gG~y~m~V~lCn~-dGC-------S~S~~~~I~VAD  106 (106)
T cd02848          79 GGRYQMQVALCNG-DGC-------STSAAKEIVVAD  106 (106)
T ss_pred             CCeEEEEEEEECC-CCc-------cCcCCEEEEecC
Confidence            6789999999975 443       344555665543


Done!