Query         psy3300
Match_columns 97
No_of_seqs    142 out of 1037
Neff          9.5 
Searched_HMMs 46136
Date          Fri Aug 16 21:29:56 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy3300.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3300hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd00173 SH2 Src homology 2 dom  99.9 2.1E-24 4.5E-29  121.5  11.2   86    1-91      7-94  (94)
  2 smart00252 SH2 Src homology 2   99.9 1.4E-22 3.1E-27  112.1  10.2   75    1-81      8-83  (84)
  3 PF00017 SH2:  SH2 domain;  Int  99.9 9.5E-23 2.1E-27  111.2   7.6   70    1-76      6-77  (77)
  4 KOG1930|consensus               99.8 8.4E-21 1.8E-25  128.0   6.7   95    1-95    219-325 (483)
  5 KOG4278|consensus               99.8 3.6E-18 7.9E-23  121.5   8.1   87    1-93    159-246 (1157)
  6 KOG4637|consensus               99.7 1.4E-17 2.9E-22  111.5   6.0   84    2-91    340-428 (464)
  7 KOG4637|consensus               99.7   5E-17 1.1E-21  108.8   6.1   85    1-91     30-119 (464)
  8 KOG0197|consensus               99.7 2.6E-16 5.7E-21  108.8   7.7   91    1-92     88-184 (468)
  9 KOG0790|consensus               99.7 4.5E-16 9.8E-21  106.6   8.5   91    1-92    117-216 (600)
 10 KOG4226|consensus               99.6 6.3E-16 1.4E-20  100.4   7.4   73    1-80    288-361 (379)
 11 KOG2996|consensus               99.6 6.3E-16 1.4E-20  108.6   6.2   88    1-94    692-784 (865)
 12 KOG1264|consensus               99.6 2.8E-15 6.2E-20  108.5   8.6   87    3-95    544-643 (1267)
 13 KOG4792|consensus               99.6 4.2E-15 9.1E-20   94.3   7.6   73    1-80     18-98  (293)
 14 KOG1264|consensus               99.6 5.2E-15 1.1E-19  107.1   6.9   84    1-92    655-739 (1267)
 15 KOG0790|consensus               99.4 2.8E-13   6E-18   93.1   6.8   90    1-96     11-109 (600)
 16 KOG0194|consensus               99.3 4.3E-11 9.3E-16   83.8  10.3   91    1-97     56-153 (474)
 17 KOG3601|consensus               99.1 8.9E-12 1.9E-16   78.3  -0.4   75    1-82     65-140 (222)
 18 PF14633 SH2_2:  SH2 domain; PD  98.7 1.4E-07   3E-12   60.6   8.6   72    1-78     44-123 (220)
 19 KOG3751|consensus               98.5 2.7E-08 5.9E-13   69.9   1.4   85    1-90    526-616 (622)
 20 PF14633 SH2_2:  SH2 domain; PD  97.6  0.0002 4.4E-09   46.1   5.0   71    3-78    143-218 (220)
 21 KOG1856|consensus               97.4  0.0002 4.2E-09   55.0   4.3   73    1-78   1114-1193(1299)
 22 KOG4566|consensus               97.3 0.00077 1.7E-08   44.4   5.2   73    2-79     60-138 (258)
 23 KOG3697|consensus               97.1  0.0012 2.6E-08   44.0   4.6   62   26-94    278-344 (345)
 24 KOG1856|consensus               96.3   0.031 6.7E-07   43.6   7.9   72    2-80   1212-1288(1299)
 25 PF02762 Cbl_N3:  CBL proto-onc  96.2   0.029 6.3E-07   30.4   5.1   36    2-37     10-47  (86)
 26 KOG3667|consensus               92.6    0.37   8E-06   36.0   5.2   47    1-51    570-616 (682)
 27 smart00557 IG_FLMN Filamin-typ  86.8     2.4 5.3E-05   23.3   4.5   24   14-37     54-77  (93)
 28 KOG1785|consensus               86.3     1.3 2.7E-05   31.5   3.6   34    4-37    266-301 (563)
 29 PF10505 NARG2_C:  NMDA recepto  70.4      10 0.00022   24.7   3.9   28    8-37    138-165 (220)
 30 PF00630 Filamin:  Filamin/ABP2  65.8     9.3  0.0002   20.9   2.8   24   14-37     68-91  (101)
 31 PF07935 SSV1_ORF_D-335:  ORF D  63.3      22 0.00047   19.0   6.2   36   43-81     15-50  (72)
 32 PF09430 DUF2012:  Protein of u  62.1      28  0.0006   20.3   4.4   34   14-53     28-61  (123)
 33 KOG2293|consensus               54.2      13 0.00027   27.5   2.3   67    1-79    474-542 (547)
 34 PF00408 PGM_PMM_IV:  Phosphogl  50.0      16 0.00034   19.1   1.8   15   14-28     33-48  (73)
 35 PF05303 DUF727:  Protein of un  49.8      49  0.0011   19.0   4.6   67    3-77      3-89  (108)
 36 cd00214 Calpain_III Calpain, s  45.4      43 0.00094   20.1   3.5   24   14-37    116-142 (150)
 37 PHA00730 int integrase          45.0 1.1E+02  0.0023   21.5   5.7   35   43-80     17-51  (337)
 38 COG5453 Uncharacterized conser  44.3      58  0.0013   18.3   5.4   52   14-76     27-79  (96)
 39 PF09625 VP9:  VP9 protein;  In  42.3      51  0.0011   17.9   3.0   18   19-36     29-46  (79)
 40 PF01067 Calpain_III:  Calpain   41.9      41 0.00088   19.7   3.0   24   14-37    115-141 (147)
 41 PF13912 zf-C2H2_6:  C2H2-type   41.8      21 0.00045   14.5   1.2   18   62-79      7-24  (27)
 42 smart00720 calpain_III calpain  40.5      47   0.001   19.5   3.1   24   14-37    111-137 (143)
 43 PF07240 Turandot:  Stress-indu  38.6      29 0.00062   19.2   1.7   18   65-82      8-25  (85)
 44 COG1576 Uncharacterized conser  37.0      31 0.00067   21.3   1.9   21    3-23     54-74  (155)
 45 COG1724 Predicted RNA binding   36.6      45 0.00097   17.6   2.2   20    2-23      8-27  (66)
 46 KOG0518|consensus               34.7      63  0.0014   26.1   3.5   26   13-38    904-929 (1113)
 47 cd00122 MBD MeCP2, MBD1, MBD2,  34.0      63  0.0014   16.3   2.5   24   57-80     28-51  (62)
 48 PF01429 MBD:  Methyl-CpG bindi  32.2      52  0.0011   17.5   2.1   25   57-81     34-58  (77)
 49 TIGR03769 P_ac_wall_RPT actino  30.4      62  0.0013   15.1   2.0   13   25-37     10-22  (41)
 50 cd01396 MeCP2_MBD MeCP2, MBD1,  30.1      56  0.0012   17.5   2.0   24   57-80     29-52  (77)
 51 PF09634 DUF2025:  Protein of u  29.0      31 0.00068   19.4   0.9   16   13-28     23-38  (106)
 52 cd00068 GGL G protein gamma su  28.6      70  0.0015   16.0   2.1   18   68-85     25-44  (57)
 53 KOG3413|consensus               28.1 1.2E+02  0.0026   18.7   3.3   51   14-75     89-139 (156)
 54 PF14822 Vasohibin:  Vasohibin   27.5 1.7E+02  0.0036   19.6   4.2   61   16-79    120-190 (246)
 55 smart00391 MBD Methyl-CpG bind  25.2   1E+02  0.0022   16.5   2.5   25   57-81     31-55  (77)
 56 PF04485 NblA:  Phycobilisome d  24.2      57  0.0012   16.4   1.3   11    1-11     22-32  (53)
 57 PF03443 Glyco_hydro_61:  Glyco  23.9      45 0.00098   21.6   1.1    9   14-22    147-155 (218)
 58 KOG4680|consensus               23.8 1.1E+02  0.0023   18.8   2.5   24   14-37    100-128 (153)
 59 cd01229 PH_etc2 Epithelial cel  22.2 1.9E+02  0.0041   17.3   3.3   30   20-50     80-109 (129)
 60 PF06851 DUF1247:  Protein of u  21.3      62  0.0014   19.7   1.2   15   65-79    120-134 (148)
 61 PF13620 CarboxypepD_reg:  Carb  20.4 1.4E+02  0.0031   15.2   2.9   22   14-37     36-57  (82)
 62 PF08272 Topo_Zn_Ribbon:  Topoi  20.1     7.7 0.00017   18.5  -2.2    9   15-23     12-20  (42)

No 1  
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=99.92  E-value=2.1e-24  Score=121.52  Aligned_cols=86  Identities=37%  Similarity=0.570  Sum_probs=78.0

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCC
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMP   80 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~   80 (97)
                      |+|++|+++|++.++|+||||.|++.++.|+||++..     +.++||+|...++++.+......|+||.|||+||+.++
T Consensus         7 i~r~~Ae~~L~~~~~G~FLiR~s~~~~~~~~Lsv~~~-----~~v~H~~I~~~~~~~~~~~~~~~f~sl~eLv~~y~~~~   81 (94)
T cd00173           7 ISREEAEELLKKKPDGTFLVRDSESSPGDYVLSVRVK-----GKVKHYRIERTDDGYYLLGEGRSFPSLPELIEHYQKNP   81 (94)
T ss_pred             CCHHHHHHHHhcCCCceEEEEecCCCCCCEEEEEEEC-----CEEEEEEEEECCCCeEEecCCCccCCHHHHHHHHhhCc
Confidence            7899999999988999999999998899999999996     79999999999888776654678999999999999998


Q ss_pred             --CCcceeecCCC
Q psy3300          81 --ELLPCTLSLHR   91 (97)
Q Consensus        81 --~~l~~~L~~p~   91 (97)
                        +++.+.|+.|+
T Consensus        82 ~~~~~~~~L~~p~   94 (94)
T cd00173          82 LSDGLGVKLRYPV   94 (94)
T ss_pred             cCCCcccEeCCcC
Confidence              78889998884


No 2  
>smart00252 SH2 Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
Probab=99.89  E-value=1.4e-22  Score=112.10  Aligned_cols=75  Identities=32%  Similarity=0.530  Sum_probs=68.0

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeC-CcEEEecccCCCCCHHHHHHHhhhC
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTA-KGYKIKGFTKEFSSLTSLITHHSVM   79 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~-~~~~l~~~~~~F~sl~~Lv~~y~~~   79 (97)
                      |+|++|+++|.+.++|+||||.|++.++.|+||++..     +.++||+|...+ ++|.+.+ ...|+||.+||+||+.+
T Consensus         8 i~r~~Ae~lL~~~~~G~FLvR~s~~~~~~~~Lsv~~~-----~~~~h~~I~~~~~~~~~l~~-~~~F~sl~eLI~~y~~~   81 (84)
T smart00252        8 ISREEAEKLLKNEGDGDFLVRDSESEPGDYVLSVRVK-----GKVKHYRIRRNEDGKFYLDG-GRKFPSLVELVEHYQKN   81 (84)
T ss_pred             CCHHHHHHHHhcCCCcEEEEEcCCCCCCCEEEEEEEC-----CEEEEEEEEECCCCcEEECC-CCccCCHHHHHHHHhhC
Confidence            6899999999988899999999998899999999995     899999999987 4588876 78999999999999987


Q ss_pred             CC
Q psy3300          80 PE   81 (97)
Q Consensus        80 ~~   81 (97)
                      ++
T Consensus        82 ~~   83 (84)
T smart00252       82 SL   83 (84)
T ss_pred             CC
Confidence            64


No 3  
>PF00017 SH2:  SH2 domain;  InterPro: IPR000980 The Src homology 2 (SH2) domain is a protein domain of about 100 amino-acid residues first identified as a conserved sequence region between the oncoproteins Src and Fps []. Similar sequences were later found in many other intracellular signal-transducing proteins []. SH2 domains function as regulatory modules of intracellular signalling cascades by interacting with high affinity to phosphotyrosine-containing target peptides in a sequence-specific, SH2 domains recognise between 3-6 residues C-terminal to the phosphorylated tyrosine in a fashion that differs from one SH2 domain to another, and strictly phosphorylation-dependent manner [, , , ]. They are found in a wide variety of protein contexts e.g., in association with catalytic domains of phospholipase Cy (PLCy) and the non-receptor protein tyrosine kinases; within structural proteins such as fodrin and tensin; and in a group of small adaptor molecules, i.e Crk and Nck. The domains are frequently found as repeats in a single protein sequence and will then often bind both mono- and di-phosphorylated substrates.  The structure of the SH2 domain belongs to the alpha+beta class, its overall shape forming a compact flattened hemisphere. The core structural elements comprise a central hydrophobic anti-parallel beta-sheet, flanked by 2 short alpha-helices. The loop between strands 2 and 3 provides many of the binding interactions with the phosphate group of its phosphopeptide ligand, and is hence designated the phosphate binding loop, the phosphorylated ligand binds perpendicular to the beta-sheet and typically interacts with the phosphate binding loop and a hydrophobic binding pocket that interacts with a pY+3 side chain. The N- and C-termini of the domain are close together in space and on the opposite face from the phosphopeptide binding surface and it has been speculated that this has facilitated their integration into surface-exposed regions of host proteins [].; GO: 0005515 protein binding; PDB: 1M27_A 1KA6_A 1D4W_B 1D4T_A 1D1Z_B 1KA7_A 1UUR_A 1UUS_A 1BLJ_A 1BLK_A ....
Probab=99.89  E-value=9.5e-23  Score=111.19  Aligned_cols=70  Identities=37%  Similarity=0.634  Sum_probs=64.5

Q ss_pred             CCHHHHHHHhCC-CCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc-EEEecccCCCCCHHHHHHHh
Q psy3300           1 MIQEITLEVLSQ-EPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG-YKIKGFTKEFSSLTSLITHH   76 (97)
Q Consensus         1 isr~~ae~~L~~-~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~-~~l~~~~~~F~sl~~Lv~~y   76 (97)
                      |+|++|+++|+. .++|+||||.|.+.+|.|+||++..     +.++||+|.+.+++ |.+.+ ...|+||.+||+||
T Consensus         6 isr~~Ae~~L~~~~~~G~FLvR~s~~~~~~~~Lsv~~~-----~~v~h~~I~~~~~~~~~~~~-~~~F~sl~~LV~~y   77 (77)
T PF00017_consen    6 ISRQEAERLLMQGKPDGTFLVRPSSSKPGKYVLSVRFD-----GKVKHFRINRTENGGYFLSD-GKKFPSLSDLVEHY   77 (77)
T ss_dssp             SHHHHHHHHHHTTSSTTEEEEEEESSSTTSEEEEEEET-----TEEEEEEEEEETTSEEESST-SSEBSSHHHHHHHH
T ss_pred             CCHHHHHHHHHhcCCCCeEEEEeccccccccccccccc-----cccEEEEEEecCCceEEccC-CCcCCCHHHHHHhC
Confidence            689999999999 7999999999998899999999996     79999999999988 77765 67799999999998


No 4  
>KOG1930|consensus
Probab=99.83  E-value=8.4e-21  Score=128.04  Aligned_cols=95  Identities=32%  Similarity=0.565  Sum_probs=85.5

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCC--------C--CCceeEEEEEEeCCcEEEecccC--CCCC
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEF--------H--HLGIAHYLILRTAKGYKIKGFTK--EFSS   68 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~--------~--~~~v~h~~I~~~~~~~~l~~~~~--~F~s   68 (97)
                      |||++|..||+++++|+||||+|.+.+|.|-|++++..+.        .  ...|+||.|+....|..|.|+.+  .|.|
T Consensus       219 isREQAIalLrdkePGtFvvRDS~SfrGayGLAlKVstPPPs~~~~~g~~~neLVRHFLIE~spkGVkLKGC~nEP~FGS  298 (483)
T KOG1930|consen  219 ISREQAIALLRDKEPGTFVVRDSHSFRGAYGLALKVSTPPPSVQPGDGSDSNELVRHFLIEPSPKGVKLKGCDNEPVFGS  298 (483)
T ss_pred             CCHHHHHHHhhcCCCCeEEEecCCcCCCccceEEEeccCCCcccCCCCCchhhhhhhheeccCCCceeccCCCCCCccch
Confidence            7999999999999999999999999999999999986421        1  25799999999999999988755  5999


Q ss_pred             HHHHHHHhhhCCCCcceeecCCCCCCC
Q psy3300          69 LTSLITHHSVMPELLPCTLSLHRYNPN   95 (97)
Q Consensus        69 l~~Lv~~y~~~~~~l~~~L~~p~~~p~   95 (97)
                      |..||-.|....+.|+|+|.+|.++|.
T Consensus       299 LSALV~QHSIt~LALPckL~iP~rDp~  325 (483)
T KOG1930|consen  299 LSALVYQHSITALALPCKLVIPDRDPL  325 (483)
T ss_pred             hHHHHhhccchhhhcceeEeccCCCcc
Confidence            999999999999999999999999984


No 5  
>KOG4278|consensus
Probab=99.75  E-value=3.6e-18  Score=121.52  Aligned_cols=87  Identities=30%  Similarity=0.442  Sum_probs=79.6

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc-EEEecccCCCCCHHHHHHHhhhC
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG-YKIKGFTKEFSSLTSLITHHSVM   79 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~-~~l~~~~~~F~sl~~Lv~~y~~~   79 (97)
                      +||..||.+|..-..|+||||+|++.+|+|+||++..     ++|.||+|.-..+| .++.. ...|.+|.|||.||...
T Consensus       159 vSRsaaEy~LsSgInGSFLVRESEsSpgQ~sISlRye-----GRVyHYRINt~~dgK~yvt~-EsrF~TLaELVHHHStv  232 (1157)
T KOG4278|consen  159 VSRSAAEYILSSGINGSFLVRESESSPGQYSISLRYE-----GRVYHYRINTDNDGKMYVTQ-ESRFRTLAELVHHHSTV  232 (1157)
T ss_pred             cccchhhhhhhcCcccceEEeeccCCCcceeEEEEec-----ceEEEEEeeccCCccEEEee-hhhhhHHHHHHhhcccc
Confidence            6899999999888999999999999999999999996     99999999988777 66654 77899999999999999


Q ss_pred             CCCcceeecCCCCC
Q psy3300          80 PELLPCTLSLHRYN   93 (97)
Q Consensus        80 ~~~l~~~L~~p~~~   93 (97)
                      .|||.|.|.+|..+
T Consensus       233 ADGLittLhYPApK  246 (1157)
T KOG4278|consen  233 ADGLITTLHYPAPK  246 (1157)
T ss_pred             ccceeEeeeccCcc
Confidence            99999999999643


No 6  
>KOG4637|consensus
Probab=99.72  E-value=1.4e-17  Score=111.48  Aligned_cols=84  Identities=32%  Similarity=0.484  Sum_probs=75.7

Q ss_pred             CHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhC--
Q psy3300           2 IQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVM--   79 (97)
Q Consensus         2 sr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~--   79 (97)
                      .|..||.+|+..++|+||||.| ++.|+|++||...     +.|+|+.|..+..||.+..+.+.+.||.+||.||+..  
T Consensus       340 ~r~kAe~llrg~~dGtFLIR~s-s~~g~yalSV~~~-----~~V~HClIy~tatG~GFa~pyn~y~tlk~lV~hY~h~SL  413 (464)
T KOG4637|consen  340 NRDKAEELLRGKPDGTFLIRES-SKGGCYALSVVHD-----GEVKHCLIYQTATGFGFAEPYNLYSTLKELVLHYQHTSL  413 (464)
T ss_pred             hHHHHHHHhcCCCCCeEEEeec-cCCCceEEEEEEC-----CceeeeEEeeccccccccchhHHHHHHHHHHHHHhhhhH
Confidence            5889999999999999999996 6899999999995     9999999999999999987677799999999999876  


Q ss_pred             ---CCCcceeecCCC
Q psy3300          80 ---PELLPCTLSLHR   91 (97)
Q Consensus        80 ---~~~l~~~L~~p~   91 (97)
                         .|.|.++|+.|.
T Consensus       414 e~HnDal~ttLr~Pv  428 (464)
T KOG4637|consen  414 EQHNDALTTTLRYPV  428 (464)
T ss_pred             Hhhccccccccccce
Confidence               567888999885


No 7  
>KOG4637|consensus
Probab=99.69  E-value=5e-17  Score=108.80  Aligned_cols=85  Identities=25%  Similarity=0.305  Sum_probs=76.0

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCC
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMP   80 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~   80 (97)
                      |||+++...|.+.|+|+||||+.++.+|.|+|+++..     +.++-++|...++.|.+.+ ...|+|+.+||+||++.+
T Consensus        30 isReev~~~L~d~PDGsFlVRdAstm~GdYTLtl~k~-----g~~KLikI~h~DgKyGF~d-~ltf~SVVelIn~yr~~S  103 (464)
T KOG4637|consen   30 ISREEVNKKLRDQPDGSFLVRDASTMQGDYTLTLRKG-----GNNKLIKIVHRDGKYGFSD-PLTFNSVVELINHYRNES  103 (464)
T ss_pred             cCHHHHHHHhcCCCCCcEEeeccccCCCceEEEEecC-----CccceeeeEEecCccCCCC-chhhHHHHHHHHHHhhhH
Confidence            7999999999999999999999888999999999994     8999999999988777665 889999999999999886


Q ss_pred             C-----CcceeecCCC
Q psy3300          81 E-----LLPCTLSLHR   91 (97)
Q Consensus        81 ~-----~l~~~L~~p~   91 (97)
                      +     .|.++|.+|.
T Consensus       104 L~~yN~~LDvrLlyPV  119 (464)
T KOG4637|consen  104 LAQYNPKLDVRLLYPV  119 (464)
T ss_pred             HHhhCcccceeeechH
Confidence            5     4668888883


No 8  
>KOG0197|consensus
Probab=99.67  E-value=2.6e-16  Score=108.83  Aligned_cols=91  Identities=29%  Similarity=0.269  Sum_probs=76.6

Q ss_pred             CCHHHHHHHhCCC--CCceEEEeecCCCCCeeEEEEEecCCC-CCCceeEEEEEEeCCc-EE--EecccCCCCCHHHHHH
Q psy3300           1 MIQEITLEVLSQE--PVGAFMVRESTTKPGCFALSLRVPKEF-HHLGIAHYLILRTAKG-YK--IKGFTKEFSSLTSLIT   74 (97)
Q Consensus         1 isr~~ae~~L~~~--~~G~fLiR~s~~~~~~~~Ls~~~~~~~-~~~~v~h~~I~~~~~~-~~--l~~~~~~F~sl~~Lv~   74 (97)
                      |||++||+.|..+  +.|+||||+|++.+|.|+|+++..... .+..|+||+|...++| ++  +.. ...|.+|.+||+
T Consensus        88 isR~~ae~~ll~p~~~~G~flvR~se~~~g~yslsv~~~~~~~~~~~v~hyri~~~~~~~~~~~~~~-~~~F~~l~~lv~  166 (468)
T KOG0197|consen   88 ISREEAERQLLAPENKEGAFLVRESESDKGDYSLSVREGDSGGLGAKVKHYRIRQLDGGGLYPYIDE-RELFSSLQQLVN  166 (468)
T ss_pred             ccHHHHHHhhcCCCCCccceeeecccCCcCCeeEEEEeccccCCccceeeeeeeEcCCCCeecCCCH-HHhhhhHHHHHh
Confidence            7899999666544  679999999999999999999986321 2458999999998876 55  543 777999999999


Q ss_pred             HhhhCCCCcceeecCCCC
Q psy3300          75 HHSVMPELLPCTLSLHRY   92 (97)
Q Consensus        75 ~y~~~~~~l~~~L~~p~~   92 (97)
                      ||+.+++++...|..|+.
T Consensus       167 ~~~~~~~gl~~~l~~p~~  184 (468)
T KOG0197|consen  167 YYSKNADGLCTRLRDPCS  184 (468)
T ss_pred             hhhccCcchhhcccCchh
Confidence            999999999999998874


No 9  
>KOG0790|consensus
Probab=99.67  E-value=4.5e-16  Score=106.58  Aligned_cols=91  Identities=29%  Similarity=0.408  Sum_probs=75.6

Q ss_pred             CCHHHHHHHhCCC-CCceEEEeecCCCCCeeEEEEEecC----CCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHH
Q psy3300           1 MIQEITLEVLSQE-PVGAFMVRESTTKPGCFALSLRVPK----EFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITH   75 (97)
Q Consensus         1 isr~~ae~~L~~~-~~G~fLiR~s~~~~~~~~Ls~~~~~----~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~   75 (97)
                      +|..+||.+|... ..|+||||+|.+.||.|+||++..+    +....+|.|..|.+.++.|-+.| ...|.||.+||+|
T Consensus       117 LsgkeAekLl~ekgk~gsfLvReSqs~PGdfVlSvrTdd~~~~~~~~~kVtHvmI~~q~~kydVGg-ge~F~sltdLidh  195 (600)
T KOG0790|consen  117 LSGKEAEKLLQEKGKHGSFLVRESQSHPGDFVLSVRTDDKKESNDSKLKVTHVMIRCQEGKYDVGG-GERFDSLTDLVEH  195 (600)
T ss_pred             CCchhHHHHHHhcCCCccEEEeccccCCCceEEEEEcCCcccCCCCccceEEEEEEecccccccCC-ccccchHHHHHHH
Confidence            4678999999554 8999999999999999999999953    12246899999999887799866 8899999999999


Q ss_pred             hhhCCC----CcceeecCCCC
Q psy3300          76 HSVMPE----LLPCTLSLHRY   92 (97)
Q Consensus        76 y~~~~~----~l~~~L~~p~~   92 (97)
                      |.+++-    |-...|+.|..
T Consensus       196 ykknpmvEt~gtvv~LrqP~n  216 (600)
T KOG0790|consen  196 YKKNPMVETLGTVVYLRQPLN  216 (600)
T ss_pred             hccCchhhhcceeEEeecccc
Confidence            999853    45567887754


No 10 
>KOG4226|consensus
Probab=99.65  E-value=6.3e-16  Score=100.40  Aligned_cols=73  Identities=25%  Similarity=0.484  Sum_probs=65.4

Q ss_pred             CCHHHHHHHhCCC-CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhC
Q psy3300           1 MIQEITLEVLSQE-PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVM   79 (97)
Q Consensus         1 isr~~ae~~L~~~-~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~   79 (97)
                      |+|.+||.+|..+ .+|.||||+|++.+|.|++|++..     ++.+||++...++-|.| | ...|.++.+||+||++.
T Consensus       288 itR~qae~~Ln~hG~eGdFLiRDSEsnpgD~SvSlka~-----grNKHFkVq~~d~~ycI-G-qRkF~tmd~Lv~HY~ka  360 (379)
T KOG4226|consen  288 ITRHQAECALNEHGHEGDFLIRDSESNPGDFSVSLKAS-----GRNKHFKVQLVDNVYCI-G-QRKFHTMDELVEHYKKA  360 (379)
T ss_pred             ccHHHHHHHHhccCccCceEEecCCCCCcceeEEeecc-----CCCcceEEEEecceEEe-c-cceeccHHHHHHhhhcC
Confidence            6899999999655 899999999999999999999996     89999999877666776 4 88999999999999886


Q ss_pred             C
Q psy3300          80 P   80 (97)
Q Consensus        80 ~   80 (97)
                      +
T Consensus       361 P  361 (379)
T KOG4226|consen  361 P  361 (379)
T ss_pred             C
Confidence            4


No 11 
>KOG2996|consensus
Probab=99.63  E-value=6.3e-16  Score=108.61  Aligned_cols=88  Identities=25%  Similarity=0.314  Sum_probs=76.9

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCC
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMP   80 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~   80 (97)
                      |-|++||..|+..+.|+||||.+....+.|+||++..     ..|+|++|...++.++|.. ...|+|+.|||+||+.++
T Consensus       692 MERaqaes~Lk~~~ngT~LVR~r~kea~e~AISikyn-----nevKHikI~~~dg~~~i~E-~k~F~sl~ELVeyYq~~s  765 (865)
T KOG2996|consen  692 MERAQAESTLKNRPNGTYLVRYRTKEAKEFAISIKYN-----NEVKHIKIETNDGKVHITE-DKKFNSLVELVEYYQSHS  765 (865)
T ss_pred             HhhhhhhhHhhcCCCceEEEEecccchhheeEEEEec-----cccceEEEEecCCeEEech-hhhhhhHHHHHHHHHhcc
Confidence            4689999999999999999999988899999999996     8999999998866688876 888999999999999997


Q ss_pred             C-----CcceeecCCCCCC
Q psy3300          81 E-----LLPCTLSLHRYNP   94 (97)
Q Consensus        81 ~-----~l~~~L~~p~~~p   94 (97)
                      +     .|...|+.|-..|
T Consensus       766 Lke~Fk~LDTtLk~Pyke~  784 (865)
T KOG2996|consen  766 LKEIFKALDTTLKFPYKEP  784 (865)
T ss_pred             HHHHHHHhhhhhcCCCcCh
Confidence            6     3557888886544


No 12 
>KOG1264|consensus
Probab=99.62  E-value=2.8e-15  Score=108.46  Aligned_cols=87  Identities=28%  Similarity=0.311  Sum_probs=73.5

Q ss_pred             HHHHHHHhCC------CCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeC-Cc---EEEecccCCCCCHHHH
Q psy3300           3 QEITLEVLSQ------EPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTA-KG---YKIKGFTKEFSSLTSL   72 (97)
Q Consensus         3 r~~ae~~L~~------~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~-~~---~~l~~~~~~F~sl~~L   72 (97)
                      |.+|+.+|..      .++|+||||+|++..|.|+||++..     ++|.|++|.... +|   |++.+ +..|.||-+|
T Consensus       544 R~eAekll~eycke~G~~dGtFlVReS~tFvgDytLSfwr~-----grv~HcRIrsk~e~gt~Kyyl~d-N~vfdslY~L  617 (1267)
T KOG1264|consen  544 RTEAEKLLQEYCKETGGKDGTFLVRESETFVGDYTLSFWRS-----GRVQHCRIRSKMEGGTLKYYLTD-NLVFDSLYAL  617 (1267)
T ss_pred             chHHHHHHHHHHHHhCCCCccEEEeeccccccceeeeeeEC-----CceeeEEEEeeecCCceeEEEec-chhHHHHHHH
Confidence            7899999974      2699999999999999999999996     999999999764 33   88877 8899999999


Q ss_pred             HHHhhhCCCC---cceeecCCCCCCC
Q psy3300          73 ITHHSVMPEL---LPCTLSLHRYNPN   95 (97)
Q Consensus        73 v~~y~~~~~~---l~~~L~~p~~~p~   95 (97)
                      |.||+.+.+.   ....|+.|+++|.
T Consensus       618 I~~Y~~~~Lr~aeF~m~LtePvPqp~  643 (1267)
T KOG1264|consen  618 IQHYRETHLRCAEFEMRLTEPVPQPN  643 (1267)
T ss_pred             HHHHHhccccccceEEEecCCCCCCC
Confidence            9999998653   3368888776653


No 13 
>KOG4792|consensus
Probab=99.61  E-value=4.2e-15  Score=94.29  Aligned_cols=73  Identities=29%  Similarity=0.517  Sum_probs=63.9

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEe--C------CcEEEecccCCCCCHHHH
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRT--A------KGYKIKGFTKEFSSLTSL   72 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~--~------~~~~l~~~~~~F~sl~~L   72 (97)
                      |||+||..+|..+..|.||||+|++-+|.|+|+|+..     .+|.||.|...  .      .++.|.  ...|++|+.|
T Consensus        18 mSRqeA~~lL~~~r~G~FLvRDSst~pGdYvLsV~E~-----srVshYiIn~~~p~~~~~~~~~~rIg--dQ~Fd~lPaL   90 (293)
T KOG4792|consen   18 MSRQEAVALLQGQRHGVFLVRDSSTSPGDYVLSVSEN-----SRVSHYIINSSPPSPAQPPPSRLRIG--DQEFDSLPAL   90 (293)
T ss_pred             ccHHHHHHHhcCcceeeEEEecCCCCCCceEEEEecC-----cceeeeeecCCCCCccCCCcceeeec--cccccchHHH
Confidence            7999999999988899999999999999999999985     88999999872  1      146664  7789999999


Q ss_pred             HHHhhhCC
Q psy3300          73 ITHHSVMP   80 (97)
Q Consensus        73 v~~y~~~~   80 (97)
                      ++||..+-
T Consensus        91 L~fykihy   98 (293)
T KOG4792|consen   91 LEFYKIHY   98 (293)
T ss_pred             HhheeEee
Confidence            99998764


No 14 
>KOG1264|consensus
Probab=99.58  E-value=5.2e-15  Score=107.14  Aligned_cols=84  Identities=25%  Similarity=0.360  Sum_probs=71.2

Q ss_pred             CCHHHHHHHhCCC-CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhC
Q psy3300           1 MIQEITLEVLSQE-PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVM   79 (97)
Q Consensus         1 isr~~ae~~L~~~-~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~   79 (97)
                      .+|++||++|..- .+|+|||| +....+.|+||++..     ++++|++|.+.+.-|.+ | ...|+||.+||+||.++
T Consensus       655 ~treqAE~mL~rvp~DGaFLiR-~~~~~nsy~iSfr~~-----gkikHcRi~rdGr~fvl-~-t~~FesLv~lv~yY~k~  726 (1267)
T KOG1264|consen  655 LTREQAEDMLMRVPRDGAFLIR-KREGSNSYAISFRAR-----GKIKHCRINRDGRHFVL-G-TSAFESLVELVSYYEKH  726 (1267)
T ss_pred             ccHHHHHHHHhhCccCcceEEE-eccCCceEEEEEEEc-----CcEeEEEEccCceEEEe-c-cHHHHHHHHHHHHHhcC
Confidence            4799999999876 67999999 546688899999996     99999999876554665 4 66799999999999999


Q ss_pred             CCCcceeecCCCC
Q psy3300          80 PELLPCTLSLHRY   92 (97)
Q Consensus        80 ~~~l~~~L~~p~~   92 (97)
                      +.--.++|++|..
T Consensus       727 ~lyR~mkLr~PVn  739 (1267)
T KOG1264|consen  727 PLYRKMKLRYPVN  739 (1267)
T ss_pred             hhhhcccccCcCC
Confidence            8877789998853


No 15 
>KOG0790|consensus
Probab=99.45  E-value=2.8e-13  Score=93.12  Aligned_cols=90  Identities=22%  Similarity=0.285  Sum_probs=78.1

Q ss_pred             CCHHHHHHHhCCC-CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhC
Q psy3300           1 MIQEITLEVLSQE-PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVM   79 (97)
Q Consensus         1 isr~~ae~~L~~~-~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~   79 (97)
                      |+..+||.+|+.. .+|+||.|.|++.+|.|+|+++..     +.+.|.+|...++-|.+.+ ...|.++.|||+||..+
T Consensus        11 ~~g~~ae~Ll~~~g~dgsfl~r~s~sNp~~fsl~~r~~-----~~v~hikiq~~~~~~~l~~-gekfat~~ELvqyyme~   84 (600)
T KOG0790|consen   11 LSGVEAETLLKERGVDGSFLARPSESNPGDFSLSVRRG-----DKVTHIKIQNSGDFYDLYG-GEKFATLAELVQYYMEH   84 (600)
T ss_pred             ccchhHHHHHHHhccccchhhccccCCCcceeEEEEeC-----CceEEEEEeecCccccccC-CccccchHHHHHHHHhh
Confidence            4567999999876 899999999999999999999994     8999999987776688887 88999999999999887


Q ss_pred             C------CCcceeecCC--CCCCCC
Q psy3300          80 P------ELLPCTLSLH--RYNPNF   96 (97)
Q Consensus        80 ~------~~l~~~L~~p--~~~p~~   96 (97)
                      .      .|...-|++|  |.+|+-
T Consensus        85 ~~~lkekng~~ielK~pl~cAdpts  109 (600)
T KOG0790|consen   85 HGQLKEKNGDVIELKYPLNCADPTS  109 (600)
T ss_pred             hHHHHhcCCCEEEecCCCccCCchh
Confidence            5      5677888888  577763


No 16 
>KOG0194|consensus
Probab=99.30  E-value=4.3e-11  Score=83.80  Aligned_cols=91  Identities=24%  Similarity=0.354  Sum_probs=72.5

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCe---e-EEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHh
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGC---F-ALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHH   76 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~---~-~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y   76 (97)
                      +.|++|+.+|+.  +|+||||.|+...|.   + +||+....   ...++||.|...+++++..+ ...|+++.+||.||
T Consensus        56 l~red~~~lL~~--~GDfLvR~s~~~~~~~~~~~vlSv~~~~---~~~~~h~vi~~~~~~~~~~~-~~~F~si~~li~~~  129 (474)
T KOG0194|consen   56 LPREDAEKLLKN--DGDFLVRASEPKEGEKREFVVLSVKWSV---FKKIKHYVIKRNGNLFFFEG-LRKFPTISELVNYY  129 (474)
T ss_pred             ccHhHHHHHhCC--CCceEEEeecccCCcceeEEEEEEEeec---CCceeEEEEEEcCCeeEEec-cccCCcHHHHHHHH
Confidence            579999999977  799999999877652   3 78888731   25888999999888766665 78999999999999


Q ss_pred             hhCCCCcc---eeecCCCCCCCCC
Q psy3300          77 SVMPELLP---CTLSLHRYNPNFE   97 (97)
Q Consensus        77 ~~~~~~l~---~~L~~p~~~p~~~   97 (97)
                      ..+.....   ..|..|-..+.||
T Consensus       130 ~~~~~~~~~~~~~L~~PI~r~~We  153 (474)
T KOG0194|consen  130 KFSKLEITGKNFFLKRPIPRQKWE  153 (474)
T ss_pred             HhcccceeccceeecccccccccE
Confidence            98876543   4688887776664


No 17 
>KOG3601|consensus
Probab=99.11  E-value=8.9e-12  Score=78.32  Aligned_cols=75  Identities=31%  Similarity=0.610  Sum_probs=65.9

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc-EEEecccCCCCCHHHHHHHhhhC
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG-YKIKGFTKEFSSLTSLITHHSVM   79 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~-~~l~~~~~~F~sl~~Lv~~y~~~   79 (97)
                      |+|..||++|....+|+||+|.+++.+|.|+++++..     ..|.|+++.+...| |++  +...|+++.+|++||..+
T Consensus        65 i~r~~ae~~l~~~~~G~fl~r~s~sSPg~fsgsvr~~-----d~vqhfkvvrpa~~k~~l--w~skfnslnplv~Y~rt~  137 (222)
T KOG3601|consen   65 IPRPLAEDLLSKKRDGDFLIRLSESSPGDFSGSVRFP-----DGVQHFKVVRPAFGKYFL--WSSKFNSLNPLVSYHRTA  137 (222)
T ss_pred             cccchhhhhhhccCcchhhhhhhhcCcccccccccCC-----CCceeccccccCcccccc--chhhccCCCCCcccCccc
Confidence            5789999999998999999999999999999999995     89999999877655 665  377899999999999887


Q ss_pred             CCC
Q psy3300          80 PEL   82 (97)
Q Consensus        80 ~~~   82 (97)
                      +..
T Consensus       138 s~~  140 (222)
T KOG3601|consen  138 SQS  140 (222)
T ss_pred             ccc
Confidence            543


No 18 
>PF14633 SH2_2:  SH2 domain; PDB: 3GXX_A 3GXW_B 3PJP_B 2XP1_A.
Probab=98.74  E-value=1.4e-07  Score=60.55  Aligned_cols=72  Identities=25%  Similarity=0.403  Sum_probs=55.4

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCC------c--EEEecccCCCCCHHHH
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAK------G--YKIKGFTKEFSSLTSL   72 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~------~--~~l~~~~~~F~sl~~L   72 (97)
                      ++-.+|+++|++.+.|.++||+|+.-.+..+++++..    .+...|+.|...+.      |  ..+ + ...|++|.||
T Consensus        44 ~~~~qAe~~L~~~~~Ge~iIRPSSkG~dhL~vTwKv~----d~vyqHidV~E~~K~n~~slG~~L~i-~-~~~yeDLDEi  117 (220)
T PF14633_consen   44 FNYKQAEEYLADQDVGEVIIRPSSKGPDHLTVTWKVA----DGVYQHIDVKEEDKENEFSLGKTLKI-G-GEEYEDLDEI  117 (220)
T ss_dssp             S-HHHHHHHHCCS-TT-EEEEE-TTTTTEEEEEEEEE----TTEEEEEEEEEECSSSTTS-SSEEEE-T-TEEESSHHHH
T ss_pred             CCHHHHHHHHhcCCCCCEEEeeCCCCCCeEEEEEEEc----CCcEEEEEEEECCCcCccccCcEEEE-C-CeEECCHHHH
Confidence            4678999999999999999999987778899999997    47788999988652      2  333 3 7789999999


Q ss_pred             HHHhhh
Q psy3300          73 ITHHSV   78 (97)
Q Consensus        73 v~~y~~   78 (97)
                      |.-|..
T Consensus       118 i~r~V~  123 (220)
T PF14633_consen  118 IARHVE  123 (220)
T ss_dssp             HHHCHH
T ss_pred             HHHHHH
Confidence            988754


No 19 
>KOG3751|consensus
Probab=98.54  E-value=2.7e-08  Score=69.93  Aligned_cols=85  Identities=28%  Similarity=0.529  Sum_probs=70.8

Q ss_pred             CCHHHHHHHhCCC--CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeC-Cc---EEEecccCCCCCHHHHHH
Q psy3300           1 MIQEITLEVLSQE--PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTA-KG---YKIKGFTKEFSSLTSLIT   74 (97)
Q Consensus         1 isr~~ae~~L~~~--~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~-~~---~~l~~~~~~F~sl~~Lv~   74 (97)
                      |+|+++..+++++  .+|.|++|+|.+.|..|++++...     .+++||.|.... .|   +.+.+....|.++..|++
T Consensus       526 is~~es~~~ikq~glv~~~~l~r~sqsnP~~~~~~~~~~-----~~v~~~~~~P~~~~~~~~~t~~~g~t~~sd~~ql~~  600 (622)
T KOG3751|consen  526 ISRDESQRLIKQQGLVDGLFLVRDSQSNPKIFVLSLCHP-----QKVKHFQILPVEDDGCTFFTLDDGPTKFSDLIQLVE  600 (622)
T ss_pred             cCchhhhhHHHhcccceeeeeecccccCcchhhhhccCC-----ccccceEEecCCCCCceeeccCCCCccccccccccc
Confidence            6899999988876  789999999999999999999885     899999998764 34   334333567999999999


Q ss_pred             HhhhCCCCcceeecCC
Q psy3300          75 HHSVMPELLPCTLSLH   90 (97)
Q Consensus        75 ~y~~~~~~l~~~L~~p   90 (97)
                      +|+-+...++|.|+.=
T Consensus       601 ~~ql~k~~l~~al~~~  616 (622)
T KOG3751|consen  601 FYQLNKGVLPCALKHC  616 (622)
T ss_pred             hhhcCCCcchHHHHHH
Confidence            9999988888887653


No 20 
>PF14633 SH2_2:  SH2 domain; PDB: 3GXX_A 3GXW_B 3PJP_B 2XP1_A.
Probab=97.56  E-value=0.0002  Score=46.13  Aligned_cols=71  Identities=25%  Similarity=0.329  Sum_probs=49.9

Q ss_pred             HHHHHHHhCCC-----CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhh
Q psy3300           3 QEITLEVLSQE-----PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHS   77 (97)
Q Consensus         3 r~~ae~~L~~~-----~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~   77 (97)
                      +++++..|...     .--.|.+..+...||.|.|++....   ...+.+.-|.-..+||.+.  ...|+++.+|+.+++
T Consensus       143 ~~e~e~~L~~~k~~nP~~i~Y~f~~~~~~PG~F~L~y~~~~---~~~~~~~~v~V~p~Gf~~r--~~~f~~~~~L~~~FK  217 (220)
T PF14633_consen  143 KEEVEEWLKEEKKANPKRIPYAFCISKEHPGYFILSYKPNK---NPRHEYWPVKVTPDGFRFR--KQVFPSLDRLINWFK  217 (220)
T ss_dssp             CCCCHHHHHCHHHHSTTS-EEEEEE-TTSTTEEEEEEESST---TS-EEEEEEEE-SSSEEET--TEEESSHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHhCCCCceEEEEECCCCCCEEEEEEEcCC---CCceEEeeEEEecCcEEEe--cccCCCHHHHHHHHh
Confidence            45666777642     3356888888889999999998852   1333443377788999985  778999999999987


Q ss_pred             h
Q psy3300          78 V   78 (97)
Q Consensus        78 ~   78 (97)
                      .
T Consensus       218 ~  218 (220)
T PF14633_consen  218 K  218 (220)
T ss_dssp             H
T ss_pred             h
Confidence            5


No 21 
>KOG1856|consensus
Probab=97.44  E-value=0.0002  Score=54.97  Aligned_cols=73  Identities=19%  Similarity=0.274  Sum_probs=57.0

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCC-c-EEEe-----cccCCCCCHHHHH
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAK-G-YKIK-----GFTKEFSSLTSLI   73 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~-~-~~l~-----~~~~~F~sl~~Lv   73 (97)
                      ++-++|+.+|.....|.++||+|+......+++++..    .+...|+.|...+. + |.+.     + +..|++|.++|
T Consensus      1114 ~n~eQAe~yL~~~d~ge~iiRpSSrgddhLvvtwKVs----D~iYqhidV~E~eKEn~fslg~~l~i~-~e~feDLDEiI 1188 (1299)
T KOG1856|consen 1114 LNAEQAEAYLSDMDQGELIIRPSSRGDDHLVVTWKVS----DGIYQHIDVQELEKENYFSLGKTLWIG-GEEFEDLDEII 1188 (1299)
T ss_pred             CCHHHHHHHHHhcccccEEeccccCCCCceEEEEEec----CchhhhhhhhhhhccccccccceEEEC-CcccccHHHHH
Confidence            4568999999998999999999988777888999887    46778888876432 1 3332     3 77899999999


Q ss_pred             HHhhh
Q psy3300          74 THHSV   78 (97)
Q Consensus        74 ~~y~~   78 (97)
                      .=|..
T Consensus      1189 ~r~vq 1193 (1299)
T KOG1856|consen 1189 ARYVQ 1193 (1299)
T ss_pred             HHHHH
Confidence            87754


No 22 
>KOG4566|consensus
Probab=97.29  E-value=0.00077  Score=44.40  Aligned_cols=73  Identities=25%  Similarity=0.397  Sum_probs=52.7

Q ss_pred             CHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEE----eccc--CCCCCHHHHHHH
Q psy3300           2 IQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKI----KGFT--KEFSSLTSLITH   75 (97)
Q Consensus         2 sr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l----~~~~--~~F~sl~~Lv~~   75 (97)
                      +.++|+..|...+.|+|+||+|...+-.+.|++...     .+.+.-+|.+.++.|..    .+..  ..|++...++.+
T Consensus        60 ~~~~~~~~l~~~p~~~~l~Rdss~~~~~~~I~vk~~-----~g~~~~r~~~~~~sfr~ds~~~~~~pl~~~e~~~~~~~~  134 (258)
T KOG4566|consen   60 TANEAELCLGQEPRGTFLVRDSSHRPYLFTISVKTH-----SGPKNLRIQYQDSSFRLDSSHRHTPPLISFEDVEVLIHH  134 (258)
T ss_pred             cchhHHHhhcCCCccceeeecCcCccccceeEeeec-----cCCCCccccccccceecccccccCCCcccChhHHHHHHh
Confidence            456777778778999999999987777788998885     45555566555543443    1212  369999999999


Q ss_pred             hhhC
Q psy3300          76 HSVM   79 (97)
Q Consensus        76 y~~~   79 (97)
                      |+..
T Consensus       135 y~~~  138 (258)
T KOG4566|consen  135 YRQS  138 (258)
T ss_pred             hhhh
Confidence            9554


No 23 
>KOG3697|consensus
Probab=97.09  E-value=0.0012  Score=44.04  Aligned_cols=62  Identities=19%  Similarity=0.183  Sum_probs=38.9

Q ss_pred             CCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCCCCcc-----eeecCCCCCC
Q psy3300          26 KPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMPELLP-----CTLSLHRYNP   94 (97)
Q Consensus        26 ~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~~~l~-----~~L~~p~~~p   94 (97)
                      .+|.|+|+=..     ++..+|..+. ...|..-.. ...|+|+..||+||..+.+.+.     .+|+.|...+
T Consensus       278 ~~gqyvltgl~-----~~~~khlllv-dpegvvrtk-d~~fdsishli~yh~~~~lpiis~~sel~l~~pv~r~  344 (345)
T KOG3697|consen  278 TPGQYVLTGLQ-----SGQPKHLLLV-DPEGVVRTK-DRRFDSISHLINYHMDNHLPIISAGSELCLQQPVERK  344 (345)
T ss_pred             CCccEEEeccc-----CCCcceEEEE-CCccceecc-cchhhHHHHHHHHhhhccCceecCcchhhhhCccccc
Confidence            45555555444     2566776654 233422222 6679999999999999876544     4677776543


No 24 
>KOG1856|consensus
Probab=96.35  E-value=0.031  Score=43.62  Aligned_cols=72  Identities=17%  Similarity=0.265  Sum_probs=57.8

Q ss_pred             CHHHHHHHhCCC--CC---ceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHh
Q psy3300           2 IQEITLEVLSQE--PV---GAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHH   76 (97)
Q Consensus         2 sr~~ae~~L~~~--~~---G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y   76 (97)
                      ++.+.+.+|...  ..   --|...-|...||.|.|+++..     ++++|--+.-...||.+.  +..|+||.+|+.|+
T Consensus      1212 t~~~~ek~L~~~k~~np~~~~Y~F~~s~~~PG~F~L~y~~~-----~k~~heyv~v~p~g~~~r--g~~f~tld~L~~~F 1284 (1299)
T KOG1856|consen 1212 TKKEVEKLLRDYKKVNPKKSVYFFCASHEHPGKFCLSYKPS-----SKPRHEYVKVVPEGFRFR--GQNFGTLDELCRWF 1284 (1299)
T ss_pred             CHHHHHHHHHHHhccCCCeeeEEEEecccCCceEEEEeccC-----CCccceeEEEcccceEEe--cccchhHHHHHHHH
Confidence            356778888731  22   3456677888999999999985     889999888889998887  67899999999999


Q ss_pred             hhCC
Q psy3300          77 SVMP   80 (97)
Q Consensus        77 ~~~~   80 (97)
                      ..+-
T Consensus      1285 K~h~ 1288 (1299)
T KOG1856|consen 1285 KRHY 1288 (1299)
T ss_pred             HHHh
Confidence            8764


No 25 
>PF02762 Cbl_N3:  CBL proto-oncogene N-terminus, SH2-like domain;  InterPro: IPR014742 Cbl (Casitas B-lineage lymphoma) is an adaptor protein that functions as a negative regulator of many signalling pathways that start from receptors at the cell surface. The N-terminal region of Cbl contains a Cbl-type phosphotyrosine-binding (Cbl-PTB) domain, which is composed of three evolutionarily conserved domains: an N-terminal four-helix bundle (4H) domain, an EF hand-like calcium-binding domain, and a divergent SH2-like domain. The calcium-bound EF-hand wedges between the 4H and SH2 domains, and roughly determines their relative orientation. The Cbl-PTB domain has also been named Cbl N-terminal (Cbl-N) or tyrosine kinase binding (TKB) domain [, ]. The N-terminal 4H domain contains four long alpha-helices. The C and D helices in this domain pack against the adjacent EF-hand-like domain, and a highly conserved loop connecting the A and B helices contacts the SH2-like domain. The EF-hand motif is similar to classical EF-hand proteins. The SH2-like domain retains the general helix-sheet-helix architecture of the SH2 fold, but lacks the secondary beta-sheet, comprising beta-strands D', E and F, and also a prominent BG loop [].  This entry represents the SH2-like domain.; PDB: 3PFV_A 3VGO_A 3PLF_B 2Y1M_A 2CBL_A 3BUX_B 3BUN_B 3BUM_B 3OB1_B 3BUW_B ....
Probab=96.16  E-value=0.029  Score=30.37  Aligned_cols=36  Identities=19%  Similarity=0.302  Sum_probs=29.9

Q ss_pred             CHHHHHHHhCC--CCCceEEEeecCCCCCeeEEEEEec
Q psy3300           2 IQEITLEVLSQ--EPVGAFMVRESTTKPGCFALSLRVP   37 (97)
Q Consensus         2 sr~~ae~~L~~--~~~G~fLiR~s~~~~~~~~Ls~~~~   37 (97)
                      +=+|+...|..  ..+|+|+.|-|.++-|.|.+-....
T Consensus        10 TYdevk~~L~~~~~kpGsYiFRlSCTrLGQWAIGyV~~   47 (86)
T PF02762_consen   10 TYDEVKARLQHYRDKPGSYIFRLSCTRLGQWAIGYVTQ   47 (86)
T ss_dssp             -HHHHHHHHGGGTTSTTEEEEEEESSSTTSEEEEEEET
T ss_pred             eHHHHHHHHHHHhCCcccEEEeeccccccceeEEEEcC
Confidence            45677788874  4789999999999999999988875


No 26 
>KOG3667|consensus
Probab=92.58  E-value=0.37  Score=35.95  Aligned_cols=47  Identities=17%  Similarity=0.285  Sum_probs=34.3

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEE
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLIL   51 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~   51 (97)
                      ++++.+..+|...++|+|++|-|++.-|..+++....+    ....++.|.
T Consensus       570 inkq~~~~ll~~~~~GtflLrfs~S~~GgiT~~~v~~~----~~~~~~~i~  616 (682)
T KOG3667|consen  570 INKQQERALLMTKPDGTFLLRFSASEEGGITIAWVEDQ----KQNLIMMIM  616 (682)
T ss_pred             cchhhhhhhhhcCCCCCceeeeeccccCceeEEecccc----cccceeEec
Confidence            46677788888889999999999888776666655542    222567776


No 27 
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=86.80  E-value=2.4  Score=23.33  Aligned_cols=24  Identities=21%  Similarity=0.501  Sum_probs=13.7

Q ss_pred             CCceEEEeecCCCCCeeEEEEEec
Q psy3300          14 PVGAFMVRESTTKPGCFALSLRVP   37 (97)
Q Consensus        14 ~~G~fLiR~s~~~~~~~~Ls~~~~   37 (97)
                      .+|+|.+.-.....|.|.|++...
T Consensus        54 ~dGty~v~y~P~~~G~~~i~V~~~   77 (93)
T smart00557       54 GDGTYTVSYTPTEPGDYTVTVKFG   77 (93)
T ss_pred             CCCEEEEEEEeCCCEeEEEEEEEC
Confidence            455666665555555555555553


No 28 
>KOG1785|consensus
Probab=86.29  E-value=1.3  Score=31.50  Aligned_cols=34  Identities=21%  Similarity=0.335  Sum_probs=28.1

Q ss_pred             HHHHHHhCC--CCCceEEEeecCCCCCeeEEEEEec
Q psy3300           4 EITLEVLSQ--EPVGAFMVRESTTKPGCFALSLRVP   37 (97)
Q Consensus         4 ~~ae~~L~~--~~~G~fLiR~s~~~~~~~~Ls~~~~   37 (97)
                      +|....|..  ..+|+|+.|-|.+..|.|.|-++..
T Consensus       266 DEVk~RLqk~~~KpGSYIFRlSCTRlGQWAIGYVt~  301 (563)
T KOG1785|consen  266 DEVKARLQKYIKKPGSYIFRLSCTRLGQWAIGYVTA  301 (563)
T ss_pred             HHHHHHHHHHhcCCCceEEeeccCcccceeEEEEcC
Confidence            556666654  3799999999999999999988876


No 29 
>PF10505 NARG2_C:  NMDA receptor-regulated gene protein 2 C-terminus;  InterPro: IPR019535  The transition of neuronal cells from pre-cursor to mature state is regulated by the N-methyl-d-aspartate (NMDA) receptor, a glutamate-gated ion channel that is permeable to Ca2+. NMDA receptors probably mediate this activity by permitting expression of NARG2. NARG2 (NMDA receptor-regulated gene protein 2) is transiently expressed, being a regulatory protein that is present in the nucleus of dividing cells and then down-regulated as progenitors exit the cell cycle and begin to differentiate. NARG2 contains repeats of (S/T)PXX, (11 in mouse, 6 in human), a putative DNA-binding motif that is found in many gene-regulatory proteins including Kruppel, Hunchback and Antennapedi []. 
Probab=70.39  E-value=10  Score=24.71  Aligned_cols=28  Identities=21%  Similarity=0.420  Sum_probs=19.7

Q ss_pred             HHhCCCCCceEEEeecCCCCCeeEEEEEec
Q psy3300           8 EVLSQEPVGAFMVRESTTKPGCFALSLRVP   37 (97)
Q Consensus         8 ~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~   37 (97)
                      ..|.+-++|.||+|...  .+.|++.++..
T Consensus       138 k~L~~L~pG~YLLrh~~--~d~f~~i~k~~  165 (220)
T PF10505_consen  138 KKLSSLQPGSYLLRHTP--GDSFVLIYKSS  165 (220)
T ss_pred             HHHcCCCCCcEEEEecC--CCCcEEEEEcC
Confidence            34445599999999986  55566666654


No 30 
>PF00630 Filamin:  Filamin/ABP280 repeat;  InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=65.81  E-value=9.3  Score=20.88  Aligned_cols=24  Identities=25%  Similarity=0.567  Sum_probs=17.5

Q ss_pred             CCceEEEeecCCCCCeeEEEEEec
Q psy3300          14 PVGAFMVRESTTKPGCFALSLRVP   37 (97)
Q Consensus        14 ~~G~fLiR~s~~~~~~~~Ls~~~~   37 (97)
                      .+|+|.++-....+|.|.|++...
T Consensus        68 ~~G~y~v~y~p~~~G~y~i~V~~~   91 (101)
T PF00630_consen   68 GDGTYTVSYTPTEPGKYKISVKIN   91 (101)
T ss_dssp             SSSEEEEEEEESSSEEEEEEEEES
T ss_pred             CCCEEEEEEEeCccEeEEEEEEEC
Confidence            567777777777777777777764


No 31 
>PF07935 SSV1_ORF_D-335:  ORF D-335-like protein;  InterPro: IPR012922 The sequences featured in this family are similar to a probable integrase (P20214 from SWISSPROT) expressed by the SSV1 virus of the archaeon Sulfolobus shibatae. This protein may be necessary for the integration of the virus into the host genome by a process of site-specific recombination []. 
Probab=63.26  E-value=22  Score=19.04  Aligned_cols=36  Identities=14%  Similarity=0.101  Sum_probs=20.8

Q ss_pred             CceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCCC
Q psy3300          43 LGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMPE   81 (97)
Q Consensus        43 ~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~~   81 (97)
                      ++..-|+|+...+|-.-   ...-.+|.++|+.|.+...
T Consensus        15 GkYYVY~iE~~~~G~~r---e~YVGpL~~VVe~Yik~K~   50 (72)
T PF07935_consen   15 GKYYVYKIEKDINGERR---ETYVGPLDDVVETYIKLKE   50 (72)
T ss_pred             CeEEEEEEEeccCCcee---eeeeccHHHHHHHHHhccc
Confidence            34444555553443110   2234689999999988743


No 32 
>PF09430 DUF2012:  Protein of unknown function (DUF2012);  InterPro: IPR019008  This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development []. 
Probab=62.13  E-value=28  Score=20.30  Aligned_cols=34  Identities=21%  Similarity=0.466  Sum_probs=23.7

Q ss_pred             CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEe
Q psy3300          14 PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRT   53 (97)
Q Consensus        14 ~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~   53 (97)
                      .+|+|.+++=+  +|.|.|.+...    .-....+||.-.
T Consensus        28 ~dG~F~f~~Vp--~GsY~L~V~s~----~~~F~~~RVdV~   61 (123)
T PF09430_consen   28 SDGSFVFHNVP--PGSYLLEVHSP----DYVFPPYRVDVS   61 (123)
T ss_pred             CCCEEEeCCCC--CceEEEEEECC----CccccCEEEEEe
Confidence            68999998864  68899988875    234445666554


No 33 
>KOG2293|consensus
Probab=54.23  E-value=13  Score=27.48  Aligned_cols=67  Identities=12%  Similarity=0.086  Sum_probs=40.2

Q ss_pred             CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc--EEEecccCCCCCHHHHHHHhhh
Q psy3300           1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG--YKIKGFTKEFSSLTSLITHHSV   78 (97)
Q Consensus         1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~--~~l~~~~~~F~sl~~Lv~~y~~   78 (97)
                      |||.+|.-.|++  .|+|-|    ..-|.+.|.|-.      +.+-|-.+....+.  ..|-|-.-.|..-.++|.-|..
T Consensus       474 ISRRQa~IkL~n--~GsF~I----kNlGK~~I~vng------~~l~~gq~~~L~~nclveIrg~~FiF~~N~~~v~~~l~  541 (547)
T KOG2293|consen  474 ISRRQALIKLKN--DGSFFI----KNLGKRSILVNG------GELDRGQKVILKNNCLVEIRGLRFIFEINQEAVGQYLK  541 (547)
T ss_pred             eeccceeEEecc--CCcEEe----ccCcceeEEeCC------ccccCCceEEeccCcEEEEccceEEEeecHHHHHHHHH
Confidence            566666666644  699999    456778887765      34444433333333  3344323346667888888865


Q ss_pred             C
Q psy3300          79 M   79 (97)
Q Consensus        79 ~   79 (97)
                      .
T Consensus       542 ~  542 (547)
T KOG2293|consen  542 E  542 (547)
T ss_pred             h
Confidence            4


No 34 
>PF00408 PGM_PMM_IV:  Phosphoglucomutase/phosphomannomutase, C-terminal domain;  InterPro: IPR005843 The alpha-D-phosphohexomutase superfamily is composed of four related enzymes, each of which catalyses a phosphoryl transfer on their sugar substrates: phosphoglucomutase (PGM), phosphoglucomutase/phosphomannomutase (PGM/PMM), phosphoglucosamine mutase (PNGM), and phosphoacetylglucosamine mutase (PAGM) []. PGM (5.4.2.2 from EC) converts D-glucose 1-phosphate into D-glucose 6-phosphate, and participates in both the breakdown and synthesis of glucose []. PGM/PMM (5.4.2.2 from EC; 5.4.2.8 from EC) are primarily bacterial enzymes that use either glucose or mannose as substrate, participating in the biosynthesis of a variety of carbohydrates such as lipopolysaccharides and alginate [, ]. Both PNGM (5.4.2.3 from EC) and PAGM (5.4.2.10 from EC) are involved in the biosynthesis of UDP-N-acetylglucosamine [, ].  Despite differences in substrate specificity, these enzymes share a similar catalytic mechanism, converting 1-phospho-sugars to 6-phospho-sugars via a biphosphorylated 1,6-phospho-sugar. The active enzyme is phosphorylated at a conserved serine residue and binds one magnesium ion; residues around the active site serine are well conserved among family members. The reaction mechanism involves phosphoryl transfer from the phosphoserine to the substrate to create a biophosphorylated sugar, followed by a phosphoryl transfer from the substrate back to the enzyme []. The structures of PGM and PGM/PMM have been determined, and were found to be very similar in topology. These enzymes are both composed of four domains and a large central active site cleft, where each domain contains residues essential for catalysis and/or substrate recognition. Domain I contains the catalytic phosphoserine, domain II contains a metal-binding loop to coordinate the magnesium ion, domain III contains the sugar-binding loop that recognises the two different binding orientations of the 1- and 6-phospho-sugars, and domain IV contains a phosphate-binding site required for orienting the incoming phospho-sugar substrate. This entry represents the C-terminal domain alpha-D-phosphohexomutase enzymes.; GO: 0016868 intramolecular transferase activity, phosphotransferases, 0005975 carbohydrate metabolic process; PDB: 1KFQ_B 1KFI_A 3PDK_B 2F7L_A 1TUO_A 2FKM_X 3C04_A 1K2Y_X 1P5G_X 2H4L_X ....
Probab=49.98  E-value=16  Score=19.09  Aligned_cols=15  Identities=40%  Similarity=0.403  Sum_probs=12.4

Q ss_pred             CCceEE-EeecCCCCC
Q psy3300          14 PVGAFM-VRESTTKPG   28 (97)
Q Consensus        14 ~~G~fL-iR~s~~~~~   28 (97)
                      .+|..+ ||.|-+.|-
T Consensus        33 ~dG~~l~vR~SgTEP~   48 (73)
T PF00408_consen   33 EDGWRLLVRPSGTEPK   48 (73)
T ss_dssp             TTEEEEEEEEESSSSE
T ss_pred             CCceEEEEECCCCCce
Confidence            689999 999987764


No 35 
>PF05303 DUF727:  Protein of unknown function (DUF727);  InterPro: IPR007967 This family consists of several uncharacterised eukaryotic proteins of unknown function.; PDB: 1SGO_A.
Probab=49.79  E-value=49  Score=19.02  Aligned_cols=67  Identities=22%  Similarity=0.214  Sum_probs=33.3

Q ss_pred             HHHHHHHhCCC--CCceEEEeecCCC---CCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecc---------------
Q psy3300           3 QEITLEVLSQE--PVGAFMVRESTTK---PGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGF---------------   62 (97)
Q Consensus         3 r~~ae~~L~~~--~~G~fLiR~s~~~---~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~---------------   62 (97)
                      +.||+..+.+-  ....-.|  |+.-   .+...|-+...      --..|.|.....||++.+.               
T Consensus         3 ~~Ea~aai~dv~~~V~~i~v--s~~Lp~~~~l~Ylnv~Tl------Eg~~~cIelt~~Gf~V~s~~~D~~~~~~~~~~~~   74 (108)
T PF05303_consen    3 RLEAEAAINDVAFAVKEISV--SEKLPRTGDLIYLNVTTL------EGRTYCIELTTKGFRVVSSTFDCMDPDPSQSSLH   74 (108)
T ss_dssp             HHHHHHHHHHHGGGSSEEEE----SS-EETTEEEEEEE-T------T--EEEEEEETTEEEEEESSTT---TT-------
T ss_pred             HHHHHHHHHHHHHHhhEEEe--cccCCCCCCEEEEEEEEe------cCCEEEEEEECCeEEEeeecCCCcCccccccccc
Confidence            56777777653  2333334  3222   23344555552      3345666667788887542               


Q ss_pred             cCCCCCHHHHHHHhh
Q psy3300          63 TKEFSSLTSLITHHS   77 (97)
Q Consensus        63 ~~~F~sl~~Lv~~y~   77 (97)
                      ...|.|+.+|+..++
T Consensus        75 ~~~~eTl~~LL~~iS   89 (108)
T PF05303_consen   75 TKYFETLYALLDSIS   89 (108)
T ss_dssp             --EESSSHHHHHHH-
T ss_pred             chHHhhHHHHHHHHC
Confidence            112567777776643


No 36 
>cd00214 Calpain_III Calpain, subdomain III. Calpains are  calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
Probab=45.41  E-value=43  Score=20.07  Aligned_cols=24  Identities=33%  Similarity=0.574  Sum_probs=14.3

Q ss_pred             CCceEEEeecCCCC---CeeEEEEEec
Q psy3300          14 PVGAFMVRESTTKP---GCFALSLRVP   37 (97)
Q Consensus        14 ~~G~fLiR~s~~~~---~~~~Ls~~~~   37 (97)
                      ++|.|+|-.|...+   |.|.|.|...
T Consensus       116 ~pG~YvIIPsT~~p~~~g~F~LrVfs~  142 (150)
T cd00214         116 PPGEYVIVPSTFEPGEEGEFLLRVFSE  142 (150)
T ss_pred             CCCCEEEEeeecCCCCcccEEEEEEec
Confidence            56777777665443   3466666553


No 37 
>PHA00730 int integrase
Probab=44.96  E-value=1.1e+02  Score=21.45  Aligned_cols=35  Identities=20%  Similarity=0.176  Sum_probs=21.2

Q ss_pred             CceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCC
Q psy3300          43 LGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMP   80 (97)
Q Consensus        43 ~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~   80 (97)
                      ++..-|+|+..++|-.-   ...-.+|.++|+.|-...
T Consensus        17 ~~~~~~~~~~~~~~~~~---~~y~~~~~~~v~~~~~~~   51 (337)
T PHA00730         17 GKYYVYLIEKDNEGNRR---DTYVGSLDQVVESYLKMK   51 (337)
T ss_pred             CeEEEEEEEecCCCcch---hcccccHHHHHHHHHHHh
Confidence            45555666655444111   223468999999997763


No 38 
>COG5453 Uncharacterized conserved protein [Function unknown]
Probab=44.29  E-value=58  Score=18.26  Aligned_cols=52  Identities=19%  Similarity=0.301  Sum_probs=32.4

Q ss_pred             CCceEEEeecC-CCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHh
Q psy3300          14 PVGAFMVREST-TKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHH   76 (97)
Q Consensus        14 ~~G~fLiR~s~-~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y   76 (97)
                      ..+.|||+..+ +..|.|.++=+....++ +.++.++..+          ...|.++.|-|+.-
T Consensus        27 ehkdflI~atP~~eggQyr~aG~I~k~id-gevk~h~FiR----------aD~ft~~~DAve~~   79 (96)
T COG5453          27 EHKDFLIYATPISEGGQYRLAGRIEKEID-GEVKVHRFIR----------ADLFTSLDDAVEIA   79 (96)
T ss_pred             cccCeEEEeeecccCCeEEEeeEeeeecC-ceeeeeeeee----------hhhccchhHHHHHH
Confidence            45889999876 45678888766654332 4444444332          33577888777654


No 39 
>PF09625 VP9:  VP9 protein;  InterPro: IPR018587  VP9 is a protein containing a ferredoxin fold. Two dimers come together to form one asymmetric unit which possesses a DNA recognition fold and specific metal binding sites possibly for zinc. It is postulated that being a non-structural protein VP9 is involved in the transcriptional regulation of the White spot syndrome virus (WSSV), from which it comes. WSSV is the major viral pathogen in shrimp aquaculture []. VP9 is found N-terminal to the IPR009766 from INTERPRO. ; PDB: 2ZUG_A 2GJ2_A 2GJI_A.
Probab=42.28  E-value=51  Score=17.90  Aligned_cols=18  Identities=22%  Similarity=0.525  Sum_probs=10.8

Q ss_pred             EEeecCCCCCeeEEEEEe
Q psy3300          19 MVRESTTKPGCFALSLRV   36 (97)
Q Consensus        19 LiR~s~~~~~~~~Ls~~~   36 (97)
                      -||.|+-..|.|+..++.
T Consensus        29 siRkSEl~d~~~iV~Lk~   46 (79)
T PF09625_consen   29 SIRKSELRDGVYIVQLKK   46 (79)
T ss_dssp             EEEE-SS-TTEEEEEE-T
T ss_pred             HHHHhhcccceEEEEEec
Confidence            478887777777776665


No 40 
>PF01067 Calpain_III:  Calpain large subunit, domain III;  InterPro: IPR022682 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad [].  This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only [].  All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit:    A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb.  Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related [].  Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad [].   Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma [].   This entry represents domain III. It is found in association with PF00648 from PFAM. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions. ; PDB: 1QXP_B 2QFE_A 1DF0_A 1U5I_A 3DF0_A 3BOW_A 1KFU_L 1KFX_L.
Probab=41.88  E-value=41  Score=19.69  Aligned_cols=24  Identities=29%  Similarity=0.497  Sum_probs=15.0

Q ss_pred             CCceEEEeecCCC---CCeeEEEEEec
Q psy3300          14 PVGAFMVRESTTK---PGCFALSLRVP   37 (97)
Q Consensus        14 ~~G~fLiR~s~~~---~~~~~Ls~~~~   37 (97)
                      ++|.|+|=.|.-.   .|.|.|.+...
T Consensus       115 ~~G~YvIVPsT~~~~~~g~F~L~v~s~  141 (147)
T PF01067_consen  115 PPGTYVIVPSTYEPGQEGEFTLRVFSD  141 (147)
T ss_dssp             -SEEEEEEEEESSTT--EEEEEEEEES
T ss_pred             CCCCEEEEEecCCCCCeeeEEEEEEEC
Confidence            5788888777433   44577777664


No 41 
>PF13912 zf-C2H2_6:  C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=41.79  E-value=21  Score=14.46  Aligned_cols=18  Identities=39%  Similarity=0.460  Sum_probs=14.0

Q ss_pred             ccCCCCCHHHHHHHhhhC
Q psy3300          62 FTKEFSSLTSLITHHSVM   79 (97)
Q Consensus        62 ~~~~F~sl~~Lv~~y~~~   79 (97)
                      ....|.++.+|+.|-+..
T Consensus         7 C~~~F~~~~~l~~H~~~h   24 (27)
T PF13912_consen    7 CGKTFSSLSALREHKRSH   24 (27)
T ss_dssp             TTEEESSHHHHHHHHCTT
T ss_pred             cCCccCChhHHHHHhHHh
Confidence            356799999999997554


No 42 
>smart00720 calpain_III calpain_III.
Probab=40.46  E-value=47  Score=19.55  Aligned_cols=24  Identities=33%  Similarity=0.571  Sum_probs=14.0

Q ss_pred             CCceEEEeecCCCC---CeeEEEEEec
Q psy3300          14 PVGAFMVRESTTKP---GCFALSLRVP   37 (97)
Q Consensus        14 ~~G~fLiR~s~~~~---~~~~Ls~~~~   37 (97)
                      ++|.|+|=.|...+   |.|.|.|...
T Consensus       111 ~~G~Y~iVPsT~~p~~~g~F~LrV~s~  137 (143)
T smart00720      111 PPGEYVIVPSTFEPNQEGDFLLRVFSE  137 (143)
T ss_pred             CCCCEEEEEeecCCCCccCEEEEEEec
Confidence            56777777664443   3466666553


No 43 
>PF07240 Turandot:  Stress-inducible humoral factor Turandot;  InterPro: IPR010825 This family consists of several Drosophila species specific Turandot proteins. The Turandot A (TotA) gene encodes a humoral factor, which is secreted from the fat body and accumulates in the body fluids. TotA is strongly induced upon bacterial challenge, as well as by other types of stress such as high temperature, mechanical pressure, dehydration, UV irradiation, and oxidative agents. It is also upregulated during metamorphosis and at high age. Flies that overexpress TotA show prolonged survival and retain normal activity at otherwise lethal temperatures. Although TotA is only induced by severe stress, it responds to a much wider range of stimuli than heat shock genes such as hsp70 or immune genes such as Cecropin A1 [].
Probab=38.64  E-value=29  Score=19.24  Aligned_cols=18  Identities=11%  Similarity=0.154  Sum_probs=14.5

Q ss_pred             CCCCHHHHHHHhhhCCCC
Q psy3300          65 EFSSLTSLITHHSVMPEL   82 (97)
Q Consensus        65 ~F~sl~~Lv~~y~~~~~~   82 (97)
                      .-.++++||++|.+.+..
T Consensus         8 K~rni~eLi~fY~ky~~~   25 (85)
T PF07240_consen    8 KIRNIQELIAFYEKYSPR   25 (85)
T ss_pred             HHhhHHHHHHHHHHcCcc
Confidence            457899999999987654


No 44 
>COG1576 Uncharacterized conserved protein [Function unknown]
Probab=37.04  E-value=31  Score=21.33  Aligned_cols=21  Identities=19%  Similarity=0.328  Sum_probs=16.5

Q ss_pred             HHHHHHHhCCCCCceEEEeec
Q psy3300           3 QEITLEVLSQEPVGAFMVRES   23 (97)
Q Consensus         3 r~~ae~~L~~~~~G~fLiR~s   23 (97)
                      +.|++.+|..-++|+|+|--.
T Consensus        54 ~~E~~~il~~i~~~~~vi~Ld   74 (155)
T COG1576          54 KKEGEAILAAIPKGSYVVLLD   74 (155)
T ss_pred             HHHHHHHHHhcCCCCeEEEEe
Confidence            578889988778898888543


No 45 
>COG1724 Predicted RNA binding protein (dsRBD-like fold), HicA family    [General function prediction only]
Probab=36.57  E-value=45  Score=17.55  Aligned_cols=20  Identities=20%  Similarity=0.258  Sum_probs=14.4

Q ss_pred             CHHHHHHHhCCCCCceEEEeec
Q psy3300           2 IQEITLEVLSQEPVGAFMVRES   23 (97)
Q Consensus         2 sr~~ae~~L~~~~~G~fLiR~s   23 (97)
                      +-.+++..|.  .+|.|++|..
T Consensus         8 ~~ke~ik~Le--~~Gf~~vrqk   27 (66)
T COG1724           8 KAKEVIKALE--KDGFQLVRQK   27 (66)
T ss_pred             CHHHHHHHHH--hCCcEEEEee
Confidence            3456777774  4799999975


No 46 
>KOG0518|consensus
Probab=34.74  E-value=63  Score=26.15  Aligned_cols=26  Identities=27%  Similarity=0.448  Sum_probs=23.2

Q ss_pred             CCCceEEEeecCCCCCeeEEEEEecC
Q psy3300          13 EPVGAFMVRESTTKPGCFALSLRVPK   38 (97)
Q Consensus        13 ~~~G~fLiR~s~~~~~~~~Ls~~~~~   38 (97)
                      -.+|+|-||..+.++|.|+|++....
T Consensus       904 ~~~~~y~vrFtP~e~G~~tl~V~y~~  929 (1113)
T KOG0518|consen  904 LGQGTYQVRFTPKEPGNHTLSVKYKD  929 (1113)
T ss_pred             CCCceEEEEecCCCCCceEEEEEecC
Confidence            36799999999999999999999864


No 47 
>cd00122 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin.  MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.
Probab=34.01  E-value=63  Score=16.35  Aligned_cols=24  Identities=13%  Similarity=0.019  Sum_probs=19.4

Q ss_pred             EEEecccCCCCCHHHHHHHhhhCC
Q psy3300          57 YKIKGFTKEFSSLTSLITHHSVMP   80 (97)
Q Consensus        57 ~~l~~~~~~F~sl~~Lv~~y~~~~   80 (97)
                      |++.-+...|.|..|+..|...+.
T Consensus        28 ~Y~sP~Gk~~Rs~~ev~~yL~~~~   51 (62)
T cd00122          28 YYYSPCGKKLRSKPEVARYLEKTG   51 (62)
T ss_pred             EEECCCCceecCHHHHHHHHHhCC
Confidence            666555778999999999998763


No 48 
>PF01429 MBD:  Methyl-CpG binding domain;  InterPro: IPR001739 Methylation at CpG dinucleotide, the most common DNA modification in eukaryotes, has been correlated with gene silencing associated with various phenomena such as genomic imprinting, transposon and chromosome X inactivation, differentiation, and cancer. Effects of DNA methylation are mediated through proteins which bind to symmetrically methylated CpGs. Such proteins contain a specific domain of ~70 residues, the methyl-CpG-binding domain (MBD), which is linked to additional domains associated with chromatin, such as the bromodomain, the AT hook motif,the SET domain, or the PHD finger. MBD-containing proteins appear to act as structural proteins, which recruit a variety of histone deacetylase (HDAC) complexes and chromatin remodelling factors, leading to chromatin compaction and, consequently, to transcriptional repression. The MBD of MeCP2, MBD1, MBD2, MBD4 and BAZ2 mediates binding to DNA, in case of MeCP2, MBD1 and MBD2 preferentially to methylated CpG. In case of human MBD3 and SETDB1 the MBD has been shown to mediate protein-protein interactions [, ]. The MBD folds into an alpha/beta sandwich structure comprising a layer of twisted beta sheet, backed by another layer formed by the alpha1 helix and a hairpin loop at the C terminus. These layers are both amphipathic, with the alpha1 helix and the beta sheet lying parallel and the hydrophobic faces tightly packed against each other. The beta sheet is composed of two long inner strands (beta2 and beta3) sandwiched by two shorter outer strands (beta1 and beta4) [].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 2KY8_A 1UB1_A 1D9N_A 1IG4_A 1QK9_A 3C2I_A.
Probab=32.21  E-value=52  Score=17.48  Aligned_cols=25  Identities=20%  Similarity=0.226  Sum_probs=19.7

Q ss_pred             EEEecccCCCCCHHHHHHHhhhCCC
Q psy3300          57 YKIKGFTKEFSSLTSLITHHSVMPE   81 (97)
Q Consensus        57 ~~l~~~~~~F~sl~~Lv~~y~~~~~   81 (97)
                      |++.-++..|.|..|++.|+..+..
T Consensus        34 ~Y~sP~Gk~~RS~~eV~~yL~~~~~   58 (77)
T PF01429_consen   34 YYYSPCGKRFRSKKEVVRYLKENPS   58 (77)
T ss_dssp             EEEETTSEEESSHHHHHHHHTTSS-
T ss_pred             EEECCCCCEEeCHHHHHHHHHhCCC
Confidence            6665557789999999999988764


No 49 
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=30.42  E-value=62  Score=15.12  Aligned_cols=13  Identities=38%  Similarity=0.874  Sum_probs=10.1

Q ss_pred             CCCCeeEEEEEec
Q psy3300          25 TKPGCFALSLRVP   37 (97)
Q Consensus        25 ~~~~~~~Ls~~~~   37 (97)
                      ++||.|.|.+...
T Consensus        10 T~PG~Y~l~~~a~   22 (41)
T TIGR03769        10 TKPGTYTLTVQAT   22 (41)
T ss_pred             CCCeEEEEEEEEE
Confidence            5688888888774


No 50 
>cd01396 MeCP2_MBD MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin.  MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
Probab=30.06  E-value=56  Score=17.53  Aligned_cols=24  Identities=25%  Similarity=0.141  Sum_probs=19.7

Q ss_pred             EEEecccCCCCCHHHHHHHhhhCC
Q psy3300          57 YKIKGFTKEFSSLTSLITHHSVMP   80 (97)
Q Consensus        57 ~~l~~~~~~F~sl~~Lv~~y~~~~   80 (97)
                      |++.-....|.|..||..|+..+.
T Consensus        29 yY~sP~Gkk~RS~~ev~~yL~~~~   52 (77)
T cd01396          29 YYISPTGKKFRSKVELARYLEKNG   52 (77)
T ss_pred             EEECCCCCEEECHHHHHHHHHhCC
Confidence            666655778999999999998863


No 51 
>PF09634 DUF2025:  Protein of unknown function (DUF2025);  InterPro: IPR018595  This protein is produced from gene PA1123 in Pseudomonas. It appears to be present in the biofilm layer and may be a lipoprotein. It contains three alpha helices and six beta strands and is thought to be monomeric.; PDB: 2HG6_A.
Probab=29.05  E-value=31  Score=19.40  Aligned_cols=16  Identities=31%  Similarity=0.372  Sum_probs=11.4

Q ss_pred             CCCceEEEeecCCCCC
Q psy3300          13 EPVGAFMVRESTTKPG   28 (97)
Q Consensus        13 ~~~G~fLiR~s~~~~~   28 (97)
                      .+.|.++||-|+...|
T Consensus        23 ~KtG~~ivRFSEDsFG   38 (106)
T PF09634_consen   23 RKTGQHIVRFSEDSFG   38 (106)
T ss_dssp             TTTTEEEEES-GGG-S
T ss_pred             ccCceEEEEecccccC
Confidence            3789999999976555


No 52 
>cd00068 GGL G protein gamma subunit-like motifs, the alpha-helical G-gamma chain dimerizes with the G-beta propeller subunit as part of the heterotrimeric G-protein complex; involved in signal transduction via G-protein-coupled receptors
Probab=28.60  E-value=70  Score=16.01  Aligned_cols=18  Identities=11%  Similarity=0.027  Sum_probs=12.4

Q ss_pred             CHHHHHHHhhhC--CCCcce
Q psy3300          68 SLTSLITHHSVM--PELLPC   85 (97)
Q Consensus        68 sl~~Lv~~y~~~--~~~l~~   85 (97)
                      ...+|+.|...+  .|.+..
T Consensus        25 a~~~l~~y~e~~~~~Dpll~   44 (57)
T cd00068          25 AAAELLKYCEQNAENDPLLT   44 (57)
T ss_pred             HHHHHHHHHHhcCCCCCCCC
Confidence            467888888777  555543


No 53 
>KOG3413|consensus
Probab=28.07  E-value=1.2e+02  Score=18.71  Aligned_cols=51  Identities=20%  Similarity=0.360  Sum_probs=26.2

Q ss_pred             CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHH
Q psy3300          14 PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITH   75 (97)
Q Consensus        14 ~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~   75 (97)
                      ..|+|+|-.-+....   |.+.+.    -.+.++|-+.+.+...+.-    .=.+|.+|++-
T Consensus        89 ~~GTYViNKQ~PnkQ---IWlSSP----~SGPkryD~~~~~~Wiy~r----~g~sL~~lL~e  139 (156)
T KOG3413|consen   89 SVGTYVINKQPPNKQ---IWLSSP----VSGPKRYDLCRAGEWIYLR----DGTSLHELLEE  139 (156)
T ss_pred             CceeEEecCCCCcce---eeeeCC----CCCCccccccccCceEEec----CCccHHHHHHH
Confidence            457888855433332   222222    2677888887633332222    22566766654


No 54 
>PF14822 Vasohibin:  Vasohibin
Probab=27.51  E-value=1.7e+02  Score=19.61  Aligned_cols=61  Identities=21%  Similarity=0.251  Sum_probs=34.4

Q ss_pred             ceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc-EEEec---------ccCCCCCHHHHHHHhhhC
Q psy3300          16 GAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG-YKIKG---------FTKEFSSLTSLITHHSVM   79 (97)
Q Consensus        16 G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~-~~l~~---------~~~~F~sl~~Lv~~y~~~   79 (97)
                      |.||--.- ..-..|.+++++.-  .+...+|+.+.-..+| |...|         ....|.+|.+||.-|...
T Consensus       120 ~~ylT~~~-~~~~Rfpi~Fks~~--~g~~~~HIVL~v~~~~~~GalGlSRr~~Lm~k~l~~~sls~li~~y~~s  190 (246)
T PF14822_consen  120 GIYLTAGL-PDVERFPISFKSQF--SGRVFRHIVLGVRCGGKYGALGLSRRSDLMYKPLEFRSLSDLIFDYKES  190 (246)
T ss_pred             HHHHcCCc-ccceEeeceeEEEE--cCcEEEEEEEEEEECCeecccccccchhhcccchhhHHHHHHHHHHHHH
Confidence            44544322 22334778887752  2345666654444444 44333         133599999999988654


No 55 
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=25.25  E-value=1e+02  Score=16.48  Aligned_cols=25  Identities=20%  Similarity=0.138  Sum_probs=20.2

Q ss_pred             EEEecccCCCCCHHHHHHHhhhCCC
Q psy3300          57 YKIKGFTKEFSSLTSLITHHSVMPE   81 (97)
Q Consensus        57 ~~l~~~~~~F~sl~~Lv~~y~~~~~   81 (97)
                      |++.-+...|.|..|+..|...+..
T Consensus        31 ~Y~sP~GkklRs~~ev~~YL~~~~~   55 (77)
T smart00391       31 YYISPCGKKLRSKSELARYLHKNGD   55 (77)
T ss_pred             EEECCCCCeeeCHHHHHHHHHhCCC
Confidence            6666667789999999999987754


No 56 
>PF04485 NblA:  Phycobilisome degradation protein nblA ;  InterPro: IPR007574 In the cyanobacterium Synechococcus species PCC 7942 (P35087 from SWISSPROT), nblA triggers degradation of light-harvesting phycobiliproteins in response to deprivation nutrients including nitrogen, phosphorus and sulphur. The mechanism of nblA function is not known, but it has been hypothesised that nblA may act by disrupting phycobilisome structure, activating a protease or tagging phycobiliproteins for proteolysis. Members of this family have also been identified in the chloroplasts of some red algae.; PDB: 3CS5_D 1OJH_L 2QDO_B 2Q8V_A.
Probab=24.22  E-value=57  Score=16.37  Aligned_cols=11  Identities=27%  Similarity=0.353  Sum_probs=7.0

Q ss_pred             CCHHHHHHHhC
Q psy3300           1 MIQEITLEVLS   11 (97)
Q Consensus         1 isr~~ae~~L~   11 (97)
                      ||+++|.++|.
T Consensus        22 ls~Eqaq~~Lv   32 (53)
T PF04485_consen   22 LSREQAQELLV   32 (53)
T ss_dssp             S-HHHHHHHHH
T ss_pred             hCHHHHHHHHH
Confidence            57788877663


No 57 
>PF03443 Glyco_hydro_61:  Glycosyl hydrolase family 61;  InterPro: IPR005103 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The only known activity within this family is that of endoglucanase (3.2.1.4 from EC) GH61 from CAZY ; PDB: 4EIS_B 2VTC_A 4EIR_B 3EJA_D 3EII_A.
Probab=23.94  E-value=45  Score=21.62  Aligned_cols=9  Identities=33%  Similarity=0.999  Sum_probs=8.0

Q ss_pred             CCceEEEee
Q psy3300          14 PVGAFMVRE   22 (97)
Q Consensus        14 ~~G~fLiR~   22 (97)
                      ++|.||||.
T Consensus       147 ~~G~YLlR~  155 (218)
T PF03443_consen  147 PPGQYLLRH  155 (218)
T ss_dssp             BSEEEEEEE
T ss_pred             CCCCceEEe
Confidence            789999995


No 58 
>KOG4680|consensus
Probab=23.77  E-value=1.1e+02  Score=18.80  Aligned_cols=24  Identities=29%  Similarity=0.560  Sum_probs=17.4

Q ss_pred             CCceEEEeecCCC-----CCeeEEEEEec
Q psy3300          14 PVGAFMVRESTTK-----PGCFALSLRVP   37 (97)
Q Consensus        14 ~~G~fLiR~s~~~-----~~~~~Ls~~~~   37 (97)
                      .+|.|++-.|..-     ||.|.|-++.-
T Consensus       100 epG~f~~~hsq~LPg~tPPG~Y~lkm~~~  128 (153)
T KOG4680|consen  100 EPGDFLVAHSQVLPGYTPPGSYVLKMTAY  128 (153)
T ss_pred             CcCceeeeeeEeccCcCCCceEEEEEEee
Confidence            6899999876544     56788766653


No 59 
>cd01229 PH_etc2 Epithelial cell transforming 2 (ECT2) pleckstrin homology (PH) domain. Epithelial cell transforming 2 (ECT2) pleckstrin homology (PH) domain.  PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding.  Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinases, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
Probab=22.17  E-value=1.9e+02  Score=17.26  Aligned_cols=30  Identities=23%  Similarity=0.378  Sum_probs=16.2

Q ss_pred             EeecCCCCCeeEEEEEecCCCCCCceeEEEE
Q psy3300          20 VRESTTKPGCFALSLRVPKEFHHLGIAHYLI   50 (97)
Q Consensus        20 iR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I   50 (97)
                      ||+++.-++.|.|-++...+.. +....|.|
T Consensus        80 I~DtE~c~~aF~L~~R~p~d~~-~~L~sF~l  109 (129)
T cd01229          80 IRDTEDCHNAFALLVRPPTEQA-NVLLSFQM  109 (129)
T ss_pred             eeccccccceeEEEEeCCccch-heEEEEEe
Confidence            4566555667777777542211 34555554


No 60 
>PF06851 DUF1247:  Protein of unknown function (DUF1247);  InterPro: IPR009657 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf2; it is a family of uncharacterised viral proteins.
Probab=21.26  E-value=62  Score=19.73  Aligned_cols=15  Identities=40%  Similarity=0.536  Sum_probs=12.4

Q ss_pred             CCCCHHHHHHHhhhC
Q psy3300          65 EFSSLTSLITHHSVM   79 (97)
Q Consensus        65 ~F~sl~~Lv~~y~~~   79 (97)
                      -|.||.+|+.||--.
T Consensus       120 gF~s~~~LL~~Yny~  134 (148)
T PF06851_consen  120 GFESLKDLLTHYNYE  134 (148)
T ss_pred             CcccHHHHHHHcCCC
Confidence            499999999998543


No 61 
>PF13620 CarboxypepD_reg:  Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=20.37  E-value=1.4e+02  Score=15.22  Aligned_cols=22  Identities=27%  Similarity=0.596  Sum_probs=15.1

Q ss_pred             CCceEEEeecCCCCCeeEEEEEec
Q psy3300          14 PVGAFMVRESTTKPGCFALSLRVP   37 (97)
Q Consensus        14 ~~G~fLiR~s~~~~~~~~Ls~~~~   37 (97)
                      ..|.|.++.-.  +|.|.|.+...
T Consensus        36 ~~G~f~~~~l~--~g~Y~l~v~~~   57 (82)
T PF13620_consen   36 SDGRFSFEGLP--PGTYTLRVSAP   57 (82)
T ss_dssp             TTSEEEEEEE---SEEEEEEEEBT
T ss_pred             CCceEEEEccC--CEeEEEEEEEC
Confidence            57888888653  37788877663


No 62 
>PF08272 Topo_Zn_Ribbon:  Topoisomerase I zinc-ribbon-like ;  InterPro: IPR013263 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks []. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis [, ]. DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA []. Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA.  This entry represents the C-terminal zinc-ribbon-like domain found in bacterial topoisomerase I (type IA) enzymes. Escherichia coli topoisomerase I proteins contain five copies of a zinc-ribbon-like domain at their C terminus, two of which have lost their cysteine residues and are therefore probably not able to bind zinc []. This domain is still considered to be a member of the zinc-ribbon superfamily despite not being able to bind zinc. More information about this protein can be found at Protein of the Month: DNA Topoisomerase [].; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0006265 DNA topological change, 0005694 chromosome; PDB: 1YUA_A.
Probab=20.15  E-value=7.7  Score=18.50  Aligned_cols=9  Identities=22%  Similarity=0.173  Sum_probs=4.0

Q ss_pred             CceEEEeec
Q psy3300          15 VGAFMVRES   23 (97)
Q Consensus        15 ~G~fLiR~s   23 (97)
                      +.-|.||.-
T Consensus        12 ~k~qyv~se   20 (42)
T PF08272_consen   12 TKEQYVRSE   20 (42)
T ss_dssp             TTEEEEEE-
T ss_pred             CcEEEEecC
Confidence            344555543


Done!