Query psy3300
Match_columns 97
No_of_seqs 142 out of 1037
Neff 9.5
Searched_HMMs 46136
Date Fri Aug 16 21:29:56 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3300.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3300hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00173 SH2 Src homology 2 dom 99.9 2.1E-24 4.5E-29 121.5 11.2 86 1-91 7-94 (94)
2 smart00252 SH2 Src homology 2 99.9 1.4E-22 3.1E-27 112.1 10.2 75 1-81 8-83 (84)
3 PF00017 SH2: SH2 domain; Int 99.9 9.5E-23 2.1E-27 111.2 7.6 70 1-76 6-77 (77)
4 KOG1930|consensus 99.8 8.4E-21 1.8E-25 128.0 6.7 95 1-95 219-325 (483)
5 KOG4278|consensus 99.8 3.6E-18 7.9E-23 121.5 8.1 87 1-93 159-246 (1157)
6 KOG4637|consensus 99.7 1.4E-17 2.9E-22 111.5 6.0 84 2-91 340-428 (464)
7 KOG4637|consensus 99.7 5E-17 1.1E-21 108.8 6.1 85 1-91 30-119 (464)
8 KOG0197|consensus 99.7 2.6E-16 5.7E-21 108.8 7.7 91 1-92 88-184 (468)
9 KOG0790|consensus 99.7 4.5E-16 9.8E-21 106.6 8.5 91 1-92 117-216 (600)
10 KOG4226|consensus 99.6 6.3E-16 1.4E-20 100.4 7.4 73 1-80 288-361 (379)
11 KOG2996|consensus 99.6 6.3E-16 1.4E-20 108.6 6.2 88 1-94 692-784 (865)
12 KOG1264|consensus 99.6 2.8E-15 6.2E-20 108.5 8.6 87 3-95 544-643 (1267)
13 KOG4792|consensus 99.6 4.2E-15 9.1E-20 94.3 7.6 73 1-80 18-98 (293)
14 KOG1264|consensus 99.6 5.2E-15 1.1E-19 107.1 6.9 84 1-92 655-739 (1267)
15 KOG0790|consensus 99.4 2.8E-13 6E-18 93.1 6.8 90 1-96 11-109 (600)
16 KOG0194|consensus 99.3 4.3E-11 9.3E-16 83.8 10.3 91 1-97 56-153 (474)
17 KOG3601|consensus 99.1 8.9E-12 1.9E-16 78.3 -0.4 75 1-82 65-140 (222)
18 PF14633 SH2_2: SH2 domain; PD 98.7 1.4E-07 3E-12 60.6 8.6 72 1-78 44-123 (220)
19 KOG3751|consensus 98.5 2.7E-08 5.9E-13 69.9 1.4 85 1-90 526-616 (622)
20 PF14633 SH2_2: SH2 domain; PD 97.6 0.0002 4.4E-09 46.1 5.0 71 3-78 143-218 (220)
21 KOG1856|consensus 97.4 0.0002 4.2E-09 55.0 4.3 73 1-78 1114-1193(1299)
22 KOG4566|consensus 97.3 0.00077 1.7E-08 44.4 5.2 73 2-79 60-138 (258)
23 KOG3697|consensus 97.1 0.0012 2.6E-08 44.0 4.6 62 26-94 278-344 (345)
24 KOG1856|consensus 96.3 0.031 6.7E-07 43.6 7.9 72 2-80 1212-1288(1299)
25 PF02762 Cbl_N3: CBL proto-onc 96.2 0.029 6.3E-07 30.4 5.1 36 2-37 10-47 (86)
26 KOG3667|consensus 92.6 0.37 8E-06 36.0 5.2 47 1-51 570-616 (682)
27 smart00557 IG_FLMN Filamin-typ 86.8 2.4 5.3E-05 23.3 4.5 24 14-37 54-77 (93)
28 KOG1785|consensus 86.3 1.3 2.7E-05 31.5 3.6 34 4-37 266-301 (563)
29 PF10505 NARG2_C: NMDA recepto 70.4 10 0.00022 24.7 3.9 28 8-37 138-165 (220)
30 PF00630 Filamin: Filamin/ABP2 65.8 9.3 0.0002 20.9 2.8 24 14-37 68-91 (101)
31 PF07935 SSV1_ORF_D-335: ORF D 63.3 22 0.00047 19.0 6.2 36 43-81 15-50 (72)
32 PF09430 DUF2012: Protein of u 62.1 28 0.0006 20.3 4.4 34 14-53 28-61 (123)
33 KOG2293|consensus 54.2 13 0.00027 27.5 2.3 67 1-79 474-542 (547)
34 PF00408 PGM_PMM_IV: Phosphogl 50.0 16 0.00034 19.1 1.8 15 14-28 33-48 (73)
35 PF05303 DUF727: Protein of un 49.8 49 0.0011 19.0 4.6 67 3-77 3-89 (108)
36 cd00214 Calpain_III Calpain, s 45.4 43 0.00094 20.1 3.5 24 14-37 116-142 (150)
37 PHA00730 int integrase 45.0 1.1E+02 0.0023 21.5 5.7 35 43-80 17-51 (337)
38 COG5453 Uncharacterized conser 44.3 58 0.0013 18.3 5.4 52 14-76 27-79 (96)
39 PF09625 VP9: VP9 protein; In 42.3 51 0.0011 17.9 3.0 18 19-36 29-46 (79)
40 PF01067 Calpain_III: Calpain 41.9 41 0.00088 19.7 3.0 24 14-37 115-141 (147)
41 PF13912 zf-C2H2_6: C2H2-type 41.8 21 0.00045 14.5 1.2 18 62-79 7-24 (27)
42 smart00720 calpain_III calpain 40.5 47 0.001 19.5 3.1 24 14-37 111-137 (143)
43 PF07240 Turandot: Stress-indu 38.6 29 0.00062 19.2 1.7 18 65-82 8-25 (85)
44 COG1576 Uncharacterized conser 37.0 31 0.00067 21.3 1.9 21 3-23 54-74 (155)
45 COG1724 Predicted RNA binding 36.6 45 0.00097 17.6 2.2 20 2-23 8-27 (66)
46 KOG0518|consensus 34.7 63 0.0014 26.1 3.5 26 13-38 904-929 (1113)
47 cd00122 MBD MeCP2, MBD1, MBD2, 34.0 63 0.0014 16.3 2.5 24 57-80 28-51 (62)
48 PF01429 MBD: Methyl-CpG bindi 32.2 52 0.0011 17.5 2.1 25 57-81 34-58 (77)
49 TIGR03769 P_ac_wall_RPT actino 30.4 62 0.0013 15.1 2.0 13 25-37 10-22 (41)
50 cd01396 MeCP2_MBD MeCP2, MBD1, 30.1 56 0.0012 17.5 2.0 24 57-80 29-52 (77)
51 PF09634 DUF2025: Protein of u 29.0 31 0.00068 19.4 0.9 16 13-28 23-38 (106)
52 cd00068 GGL G protein gamma su 28.6 70 0.0015 16.0 2.1 18 68-85 25-44 (57)
53 KOG3413|consensus 28.1 1.2E+02 0.0026 18.7 3.3 51 14-75 89-139 (156)
54 PF14822 Vasohibin: Vasohibin 27.5 1.7E+02 0.0036 19.6 4.2 61 16-79 120-190 (246)
55 smart00391 MBD Methyl-CpG bind 25.2 1E+02 0.0022 16.5 2.5 25 57-81 31-55 (77)
56 PF04485 NblA: Phycobilisome d 24.2 57 0.0012 16.4 1.3 11 1-11 22-32 (53)
57 PF03443 Glyco_hydro_61: Glyco 23.9 45 0.00098 21.6 1.1 9 14-22 147-155 (218)
58 KOG4680|consensus 23.8 1.1E+02 0.0023 18.8 2.5 24 14-37 100-128 (153)
59 cd01229 PH_etc2 Epithelial cel 22.2 1.9E+02 0.0041 17.3 3.3 30 20-50 80-109 (129)
60 PF06851 DUF1247: Protein of u 21.3 62 0.0014 19.7 1.2 15 65-79 120-134 (148)
61 PF13620 CarboxypepD_reg: Carb 20.4 1.4E+02 0.0031 15.2 2.9 22 14-37 36-57 (82)
62 PF08272 Topo_Zn_Ribbon: Topoi 20.1 7.7 0.00017 18.5 -2.2 9 15-23 12-20 (42)
No 1
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=99.92 E-value=2.1e-24 Score=121.52 Aligned_cols=86 Identities=37% Similarity=0.570 Sum_probs=78.0
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCC
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMP 80 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~ 80 (97)
|+|++|+++|++.++|+||||.|++.++.|+||++.. +.++||+|...++++.+......|+||.|||+||+.++
T Consensus 7 i~r~~Ae~~L~~~~~G~FLiR~s~~~~~~~~Lsv~~~-----~~v~H~~I~~~~~~~~~~~~~~~f~sl~eLv~~y~~~~ 81 (94)
T cd00173 7 ISREEAEELLKKKPDGTFLVRDSESSPGDYVLSVRVK-----GKVKHYRIERTDDGYYLLGEGRSFPSLPELIEHYQKNP 81 (94)
T ss_pred CCHHHHHHHHhcCCCceEEEEecCCCCCCEEEEEEEC-----CEEEEEEEEECCCCeEEecCCCccCCHHHHHHHHhhCc
Confidence 7899999999988999999999998899999999996 79999999999888776654678999999999999998
Q ss_pred --CCcceeecCCC
Q psy3300 81 --ELLPCTLSLHR 91 (97)
Q Consensus 81 --~~l~~~L~~p~ 91 (97)
+++.+.|+.|+
T Consensus 82 ~~~~~~~~L~~p~ 94 (94)
T cd00173 82 LSDGLGVKLRYPV 94 (94)
T ss_pred cCCCcccEeCCcC
Confidence 78889998884
No 2
>smart00252 SH2 Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
Probab=99.89 E-value=1.4e-22 Score=112.10 Aligned_cols=75 Identities=32% Similarity=0.530 Sum_probs=68.0
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeC-CcEEEecccCCCCCHHHHHHHhhhC
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTA-KGYKIKGFTKEFSSLTSLITHHSVM 79 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~-~~~~l~~~~~~F~sl~~Lv~~y~~~ 79 (97)
|+|++|+++|.+.++|+||||.|++.++.|+||++.. +.++||+|...+ ++|.+.+ ...|+||.+||+||+.+
T Consensus 8 i~r~~Ae~lL~~~~~G~FLvR~s~~~~~~~~Lsv~~~-----~~~~h~~I~~~~~~~~~l~~-~~~F~sl~eLI~~y~~~ 81 (84)
T smart00252 8 ISREEAEKLLKNEGDGDFLVRDSESEPGDYVLSVRVK-----GKVKHYRIRRNEDGKFYLDG-GRKFPSLVELVEHYQKN 81 (84)
T ss_pred CCHHHHHHHHhcCCCcEEEEEcCCCCCCCEEEEEEEC-----CEEEEEEEEECCCCcEEECC-CCccCCHHHHHHHHhhC
Confidence 6899999999988899999999998899999999995 899999999987 4588876 78999999999999987
Q ss_pred CC
Q psy3300 80 PE 81 (97)
Q Consensus 80 ~~ 81 (97)
++
T Consensus 82 ~~ 83 (84)
T smart00252 82 SL 83 (84)
T ss_pred CC
Confidence 64
No 3
>PF00017 SH2: SH2 domain; InterPro: IPR000980 The Src homology 2 (SH2) domain is a protein domain of about 100 amino-acid residues first identified as a conserved sequence region between the oncoproteins Src and Fps []. Similar sequences were later found in many other intracellular signal-transducing proteins []. SH2 domains function as regulatory modules of intracellular signalling cascades by interacting with high affinity to phosphotyrosine-containing target peptides in a sequence-specific, SH2 domains recognise between 3-6 residues C-terminal to the phosphorylated tyrosine in a fashion that differs from one SH2 domain to another, and strictly phosphorylation-dependent manner [, , , ]. They are found in a wide variety of protein contexts e.g., in association with catalytic domains of phospholipase Cy (PLCy) and the non-receptor protein tyrosine kinases; within structural proteins such as fodrin and tensin; and in a group of small adaptor molecules, i.e Crk and Nck. The domains are frequently found as repeats in a single protein sequence and will then often bind both mono- and di-phosphorylated substrates. The structure of the SH2 domain belongs to the alpha+beta class, its overall shape forming a compact flattened hemisphere. The core structural elements comprise a central hydrophobic anti-parallel beta-sheet, flanked by 2 short alpha-helices. The loop between strands 2 and 3 provides many of the binding interactions with the phosphate group of its phosphopeptide ligand, and is hence designated the phosphate binding loop, the phosphorylated ligand binds perpendicular to the beta-sheet and typically interacts with the phosphate binding loop and a hydrophobic binding pocket that interacts with a pY+3 side chain. The N- and C-termini of the domain are close together in space and on the opposite face from the phosphopeptide binding surface and it has been speculated that this has facilitated their integration into surface-exposed regions of host proteins [].; GO: 0005515 protein binding; PDB: 1M27_A 1KA6_A 1D4W_B 1D4T_A 1D1Z_B 1KA7_A 1UUR_A 1UUS_A 1BLJ_A 1BLK_A ....
Probab=99.89 E-value=9.5e-23 Score=111.19 Aligned_cols=70 Identities=37% Similarity=0.634 Sum_probs=64.5
Q ss_pred CCHHHHHHHhCC-CCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc-EEEecccCCCCCHHHHHHHh
Q psy3300 1 MIQEITLEVLSQ-EPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG-YKIKGFTKEFSSLTSLITHH 76 (97)
Q Consensus 1 isr~~ae~~L~~-~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~-~~l~~~~~~F~sl~~Lv~~y 76 (97)
|+|++|+++|+. .++|+||||.|.+.+|.|+||++.. +.++||+|.+.+++ |.+.+ ...|+||.+||+||
T Consensus 6 isr~~Ae~~L~~~~~~G~FLvR~s~~~~~~~~Lsv~~~-----~~v~h~~I~~~~~~~~~~~~-~~~F~sl~~LV~~y 77 (77)
T PF00017_consen 6 ISRQEAERLLMQGKPDGTFLVRPSSSKPGKYVLSVRFD-----GKVKHFRINRTENGGYFLSD-GKKFPSLSDLVEHY 77 (77)
T ss_dssp SHHHHHHHHHHTTSSTTEEEEEEESSSTTSEEEEEEET-----TEEEEEEEEEETTSEEESST-SSEBSSHHHHHHHH
T ss_pred CCHHHHHHHHHhcCCCCeEEEEeccccccccccccccc-----cccEEEEEEecCCceEEccC-CCcCCCHHHHHHhC
Confidence 689999999999 7999999999998899999999996 79999999999988 77765 67799999999998
No 4
>KOG1930|consensus
Probab=99.83 E-value=8.4e-21 Score=128.04 Aligned_cols=95 Identities=32% Similarity=0.565 Sum_probs=85.5
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCC--------C--CCceeEEEEEEeCCcEEEecccC--CCCC
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEF--------H--HLGIAHYLILRTAKGYKIKGFTK--EFSS 68 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~--------~--~~~v~h~~I~~~~~~~~l~~~~~--~F~s 68 (97)
|||++|..||+++++|+||||+|.+.+|.|-|++++..+. . ...|+||.|+....|..|.|+.+ .|.|
T Consensus 219 isREQAIalLrdkePGtFvvRDS~SfrGayGLAlKVstPPPs~~~~~g~~~neLVRHFLIE~spkGVkLKGC~nEP~FGS 298 (483)
T KOG1930|consen 219 ISREQAIALLRDKEPGTFVVRDSHSFRGAYGLALKVSTPPPSVQPGDGSDSNELVRHFLIEPSPKGVKLKGCDNEPVFGS 298 (483)
T ss_pred CCHHHHHHHhhcCCCCeEEEecCCcCCCccceEEEeccCCCcccCCCCCchhhhhhhheeccCCCceeccCCCCCCccch
Confidence 7999999999999999999999999999999999986421 1 25799999999999999988755 5999
Q ss_pred HHHHHHHhhhCCCCcceeecCCCCCCC
Q psy3300 69 LTSLITHHSVMPELLPCTLSLHRYNPN 95 (97)
Q Consensus 69 l~~Lv~~y~~~~~~l~~~L~~p~~~p~ 95 (97)
|..||-.|....+.|+|+|.+|.++|.
T Consensus 299 LSALV~QHSIt~LALPckL~iP~rDp~ 325 (483)
T KOG1930|consen 299 LSALVYQHSITALALPCKLVIPDRDPL 325 (483)
T ss_pred hHHHHhhccchhhhcceeEeccCCCcc
Confidence 999999999999999999999999984
No 5
>KOG4278|consensus
Probab=99.75 E-value=3.6e-18 Score=121.52 Aligned_cols=87 Identities=30% Similarity=0.442 Sum_probs=79.6
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc-EEEecccCCCCCHHHHHHHhhhC
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG-YKIKGFTKEFSSLTSLITHHSVM 79 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~-~~l~~~~~~F~sl~~Lv~~y~~~ 79 (97)
+||..||.+|..-..|+||||+|++.+|+|+||++.. ++|.||+|.-..+| .++.. ...|.+|.|||.||...
T Consensus 159 vSRsaaEy~LsSgInGSFLVRESEsSpgQ~sISlRye-----GRVyHYRINt~~dgK~yvt~-EsrF~TLaELVHHHStv 232 (1157)
T KOG4278|consen 159 VSRSAAEYILSSGINGSFLVRESESSPGQYSISLRYE-----GRVYHYRINTDNDGKMYVTQ-ESRFRTLAELVHHHSTV 232 (1157)
T ss_pred cccchhhhhhhcCcccceEEeeccCCCcceeEEEEec-----ceEEEEEeeccCCccEEEee-hhhhhHHHHHHhhcccc
Confidence 6899999999888999999999999999999999996 99999999988777 66654 77899999999999999
Q ss_pred CCCcceeecCCCCC
Q psy3300 80 PELLPCTLSLHRYN 93 (97)
Q Consensus 80 ~~~l~~~L~~p~~~ 93 (97)
.|||.|.|.+|..+
T Consensus 233 ADGLittLhYPApK 246 (1157)
T KOG4278|consen 233 ADGLITTLHYPAPK 246 (1157)
T ss_pred ccceeEeeeccCcc
Confidence 99999999999643
No 6
>KOG4637|consensus
Probab=99.72 E-value=1.4e-17 Score=111.48 Aligned_cols=84 Identities=32% Similarity=0.484 Sum_probs=75.7
Q ss_pred CHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhC--
Q psy3300 2 IQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVM-- 79 (97)
Q Consensus 2 sr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~-- 79 (97)
.|..||.+|+..++|+||||.| ++.|+|++||... +.|+|+.|..+..||.+..+.+.+.||.+||.||+..
T Consensus 340 ~r~kAe~llrg~~dGtFLIR~s-s~~g~yalSV~~~-----~~V~HClIy~tatG~GFa~pyn~y~tlk~lV~hY~h~SL 413 (464)
T KOG4637|consen 340 NRDKAEELLRGKPDGTFLIRES-SKGGCYALSVVHD-----GEVKHCLIYQTATGFGFAEPYNLYSTLKELVLHYQHTSL 413 (464)
T ss_pred hHHHHHHHhcCCCCCeEEEeec-cCCCceEEEEEEC-----CceeeeEEeeccccccccchhHHHHHHHHHHHHHhhhhH
Confidence 5889999999999999999996 6899999999995 9999999999999999987677799999999999876
Q ss_pred ---CCCcceeecCCC
Q psy3300 80 ---PELLPCTLSLHR 91 (97)
Q Consensus 80 ---~~~l~~~L~~p~ 91 (97)
.|.|.++|+.|.
T Consensus 414 e~HnDal~ttLr~Pv 428 (464)
T KOG4637|consen 414 EQHNDALTTTLRYPV 428 (464)
T ss_pred Hhhccccccccccce
Confidence 567888999885
No 7
>KOG4637|consensus
Probab=99.69 E-value=5e-17 Score=108.80 Aligned_cols=85 Identities=25% Similarity=0.305 Sum_probs=76.0
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCC
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMP 80 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~ 80 (97)
|||+++...|.+.|+|+||||+.++.+|.|+|+++.. +.++-++|...++.|.+.+ ...|+|+.+||+||++.+
T Consensus 30 isReev~~~L~d~PDGsFlVRdAstm~GdYTLtl~k~-----g~~KLikI~h~DgKyGF~d-~ltf~SVVelIn~yr~~S 103 (464)
T KOG4637|consen 30 ISREEVNKKLRDQPDGSFLVRDASTMQGDYTLTLRKG-----GNNKLIKIVHRDGKYGFSD-PLTFNSVVELINHYRNES 103 (464)
T ss_pred cCHHHHHHHhcCCCCCcEEeeccccCCCceEEEEecC-----CccceeeeEEecCccCCCC-chhhHHHHHHHHHHhhhH
Confidence 7999999999999999999999888999999999994 8999999999988777665 889999999999999886
Q ss_pred C-----CcceeecCCC
Q psy3300 81 E-----LLPCTLSLHR 91 (97)
Q Consensus 81 ~-----~l~~~L~~p~ 91 (97)
+ .|.++|.+|.
T Consensus 104 L~~yN~~LDvrLlyPV 119 (464)
T KOG4637|consen 104 LAQYNPKLDVRLLYPV 119 (464)
T ss_pred HHhhCcccceeeechH
Confidence 5 4668888883
No 8
>KOG0197|consensus
Probab=99.67 E-value=2.6e-16 Score=108.83 Aligned_cols=91 Identities=29% Similarity=0.269 Sum_probs=76.6
Q ss_pred CCHHHHHHHhCCC--CCceEEEeecCCCCCeeEEEEEecCCC-CCCceeEEEEEEeCCc-EE--EecccCCCCCHHHHHH
Q psy3300 1 MIQEITLEVLSQE--PVGAFMVRESTTKPGCFALSLRVPKEF-HHLGIAHYLILRTAKG-YK--IKGFTKEFSSLTSLIT 74 (97)
Q Consensus 1 isr~~ae~~L~~~--~~G~fLiR~s~~~~~~~~Ls~~~~~~~-~~~~v~h~~I~~~~~~-~~--l~~~~~~F~sl~~Lv~ 74 (97)
|||++||+.|..+ +.|+||||+|++.+|.|+|+++..... .+..|+||+|...++| ++ +.. ...|.+|.+||+
T Consensus 88 isR~~ae~~ll~p~~~~G~flvR~se~~~g~yslsv~~~~~~~~~~~v~hyri~~~~~~~~~~~~~~-~~~F~~l~~lv~ 166 (468)
T KOG0197|consen 88 ISREEAERQLLAPENKEGAFLVRESESDKGDYSLSVREGDSGGLGAKVKHYRIRQLDGGGLYPYIDE-RELFSSLQQLVN 166 (468)
T ss_pred ccHHHHHHhhcCCCCCccceeeecccCCcCCeeEEEEeccccCCccceeeeeeeEcCCCCeecCCCH-HHhhhhHHHHHh
Confidence 7899999666544 679999999999999999999986321 2458999999998876 55 543 777999999999
Q ss_pred HhhhCCCCcceeecCCCC
Q psy3300 75 HHSVMPELLPCTLSLHRY 92 (97)
Q Consensus 75 ~y~~~~~~l~~~L~~p~~ 92 (97)
||+.+++++...|..|+.
T Consensus 167 ~~~~~~~gl~~~l~~p~~ 184 (468)
T KOG0197|consen 167 YYSKNADGLCTRLRDPCS 184 (468)
T ss_pred hhhccCcchhhcccCchh
Confidence 999999999999998874
No 9
>KOG0790|consensus
Probab=99.67 E-value=4.5e-16 Score=106.58 Aligned_cols=91 Identities=29% Similarity=0.408 Sum_probs=75.6
Q ss_pred CCHHHHHHHhCCC-CCceEEEeecCCCCCeeEEEEEecC----CCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHH
Q psy3300 1 MIQEITLEVLSQE-PVGAFMVRESTTKPGCFALSLRVPK----EFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITH 75 (97)
Q Consensus 1 isr~~ae~~L~~~-~~G~fLiR~s~~~~~~~~Ls~~~~~----~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~ 75 (97)
+|..+||.+|... ..|+||||+|.+.||.|+||++..+ +....+|.|..|.+.++.|-+.| ...|.||.+||+|
T Consensus 117 LsgkeAekLl~ekgk~gsfLvReSqs~PGdfVlSvrTdd~~~~~~~~~kVtHvmI~~q~~kydVGg-ge~F~sltdLidh 195 (600)
T KOG0790|consen 117 LSGKEAEKLLQEKGKHGSFLVRESQSHPGDFVLSVRTDDKKESNDSKLKVTHVMIRCQEGKYDVGG-GERFDSLTDLVEH 195 (600)
T ss_pred CCchhHHHHHHhcCCCccEEEeccccCCCceEEEEEcCCcccCCCCccceEEEEEEecccccccCC-ccccchHHHHHHH
Confidence 4678999999554 8999999999999999999999953 12246899999999887799866 8899999999999
Q ss_pred hhhCCC----CcceeecCCCC
Q psy3300 76 HSVMPE----LLPCTLSLHRY 92 (97)
Q Consensus 76 y~~~~~----~l~~~L~~p~~ 92 (97)
|.+++- |-...|+.|..
T Consensus 196 ykknpmvEt~gtvv~LrqP~n 216 (600)
T KOG0790|consen 196 YKKNPMVETLGTVVYLRQPLN 216 (600)
T ss_pred hccCchhhhcceeEEeecccc
Confidence 999853 45567887754
No 10
>KOG4226|consensus
Probab=99.65 E-value=6.3e-16 Score=100.40 Aligned_cols=73 Identities=25% Similarity=0.484 Sum_probs=65.4
Q ss_pred CCHHHHHHHhCCC-CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhC
Q psy3300 1 MIQEITLEVLSQE-PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVM 79 (97)
Q Consensus 1 isr~~ae~~L~~~-~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~ 79 (97)
|+|.+||.+|..+ .+|.||||+|++.+|.|++|++.. ++.+||++...++-|.| | ...|.++.+||+||++.
T Consensus 288 itR~qae~~Ln~hG~eGdFLiRDSEsnpgD~SvSlka~-----grNKHFkVq~~d~~ycI-G-qRkF~tmd~Lv~HY~ka 360 (379)
T KOG4226|consen 288 ITRHQAECALNEHGHEGDFLIRDSESNPGDFSVSLKAS-----GRNKHFKVQLVDNVYCI-G-QRKFHTMDELVEHYKKA 360 (379)
T ss_pred ccHHHHHHHHhccCccCceEEecCCCCCcceeEEeecc-----CCCcceEEEEecceEEe-c-cceeccHHHHHHhhhcC
Confidence 6899999999655 899999999999999999999996 89999999877666776 4 88999999999999886
Q ss_pred C
Q psy3300 80 P 80 (97)
Q Consensus 80 ~ 80 (97)
+
T Consensus 361 P 361 (379)
T KOG4226|consen 361 P 361 (379)
T ss_pred C
Confidence 4
No 11
>KOG2996|consensus
Probab=99.63 E-value=6.3e-16 Score=108.61 Aligned_cols=88 Identities=25% Similarity=0.314 Sum_probs=76.9
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCC
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMP 80 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~ 80 (97)
|-|++||..|+..+.|+||||.+....+.|+||++.. ..|+|++|...++.++|.. ...|+|+.|||+||+.++
T Consensus 692 MERaqaes~Lk~~~ngT~LVR~r~kea~e~AISikyn-----nevKHikI~~~dg~~~i~E-~k~F~sl~ELVeyYq~~s 765 (865)
T KOG2996|consen 692 MERAQAESTLKNRPNGTYLVRYRTKEAKEFAISIKYN-----NEVKHIKIETNDGKVHITE-DKKFNSLVELVEYYQSHS 765 (865)
T ss_pred HhhhhhhhHhhcCCCceEEEEecccchhheeEEEEec-----cccceEEEEecCCeEEech-hhhhhhHHHHHHHHHhcc
Confidence 4689999999999999999999988899999999996 8999999998866688876 888999999999999997
Q ss_pred C-----CcceeecCCCCCC
Q psy3300 81 E-----LLPCTLSLHRYNP 94 (97)
Q Consensus 81 ~-----~l~~~L~~p~~~p 94 (97)
+ .|...|+.|-..|
T Consensus 766 Lke~Fk~LDTtLk~Pyke~ 784 (865)
T KOG2996|consen 766 LKEIFKALDTTLKFPYKEP 784 (865)
T ss_pred HHHHHHHhhhhhcCCCcCh
Confidence 6 3557888886544
No 12
>KOG1264|consensus
Probab=99.62 E-value=2.8e-15 Score=108.46 Aligned_cols=87 Identities=28% Similarity=0.311 Sum_probs=73.5
Q ss_pred HHHHHHHhCC------CCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeC-Cc---EEEecccCCCCCHHHH
Q psy3300 3 QEITLEVLSQ------EPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTA-KG---YKIKGFTKEFSSLTSL 72 (97)
Q Consensus 3 r~~ae~~L~~------~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~-~~---~~l~~~~~~F~sl~~L 72 (97)
|.+|+.+|.. .++|+||||+|++..|.|+||++.. ++|.|++|.... +| |++.+ +..|.||-+|
T Consensus 544 R~eAekll~eycke~G~~dGtFlVReS~tFvgDytLSfwr~-----grv~HcRIrsk~e~gt~Kyyl~d-N~vfdslY~L 617 (1267)
T KOG1264|consen 544 RTEAEKLLQEYCKETGGKDGTFLVRESETFVGDYTLSFWRS-----GRVQHCRIRSKMEGGTLKYYLTD-NLVFDSLYAL 617 (1267)
T ss_pred chHHHHHHHHHHHHhCCCCccEEEeeccccccceeeeeeEC-----CceeeEEEEeeecCCceeEEEec-chhHHHHHHH
Confidence 7899999974 2699999999999999999999996 999999999764 33 88877 8899999999
Q ss_pred HHHhhhCCCC---cceeecCCCCCCC
Q psy3300 73 ITHHSVMPEL---LPCTLSLHRYNPN 95 (97)
Q Consensus 73 v~~y~~~~~~---l~~~L~~p~~~p~ 95 (97)
|.||+.+.+. ....|+.|+++|.
T Consensus 618 I~~Y~~~~Lr~aeF~m~LtePvPqp~ 643 (1267)
T KOG1264|consen 618 IQHYRETHLRCAEFEMRLTEPVPQPN 643 (1267)
T ss_pred HHHHHhccccccceEEEecCCCCCCC
Confidence 9999998653 3368888776653
No 13
>KOG4792|consensus
Probab=99.61 E-value=4.2e-15 Score=94.29 Aligned_cols=73 Identities=29% Similarity=0.517 Sum_probs=63.9
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEe--C------CcEEEecccCCCCCHHHH
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRT--A------KGYKIKGFTKEFSSLTSL 72 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~--~------~~~~l~~~~~~F~sl~~L 72 (97)
|||+||..+|..+..|.||||+|++-+|.|+|+|+.. .+|.||.|... . .++.|. ...|++|+.|
T Consensus 18 mSRqeA~~lL~~~r~G~FLvRDSst~pGdYvLsV~E~-----srVshYiIn~~~p~~~~~~~~~~rIg--dQ~Fd~lPaL 90 (293)
T KOG4792|consen 18 MSRQEAVALLQGQRHGVFLVRDSSTSPGDYVLSVSEN-----SRVSHYIINSSPPSPAQPPPSRLRIG--DQEFDSLPAL 90 (293)
T ss_pred ccHHHHHHHhcCcceeeEEEecCCCCCCceEEEEecC-----cceeeeeecCCCCCccCCCcceeeec--cccccchHHH
Confidence 7999999999988899999999999999999999985 88999999872 1 146664 7789999999
Q ss_pred HHHhhhCC
Q psy3300 73 ITHHSVMP 80 (97)
Q Consensus 73 v~~y~~~~ 80 (97)
++||..+-
T Consensus 91 L~fykihy 98 (293)
T KOG4792|consen 91 LEFYKIHY 98 (293)
T ss_pred HhheeEee
Confidence 99998764
No 14
>KOG1264|consensus
Probab=99.58 E-value=5.2e-15 Score=107.14 Aligned_cols=84 Identities=25% Similarity=0.360 Sum_probs=71.2
Q ss_pred CCHHHHHHHhCCC-CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhC
Q psy3300 1 MIQEITLEVLSQE-PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVM 79 (97)
Q Consensus 1 isr~~ae~~L~~~-~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~ 79 (97)
.+|++||++|..- .+|+|||| +....+.|+||++.. ++++|++|.+.+.-|.+ | ...|+||.+||+||.++
T Consensus 655 ~treqAE~mL~rvp~DGaFLiR-~~~~~nsy~iSfr~~-----gkikHcRi~rdGr~fvl-~-t~~FesLv~lv~yY~k~ 726 (1267)
T KOG1264|consen 655 LTREQAEDMLMRVPRDGAFLIR-KREGSNSYAISFRAR-----GKIKHCRINRDGRHFVL-G-TSAFESLVELVSYYEKH 726 (1267)
T ss_pred ccHHHHHHHHhhCccCcceEEE-eccCCceEEEEEEEc-----CcEeEEEEccCceEEEe-c-cHHHHHHHHHHHHHhcC
Confidence 4799999999876 67999999 546688899999996 99999999876554665 4 66799999999999999
Q ss_pred CCCcceeecCCCC
Q psy3300 80 PELLPCTLSLHRY 92 (97)
Q Consensus 80 ~~~l~~~L~~p~~ 92 (97)
+.--.++|++|..
T Consensus 727 ~lyR~mkLr~PVn 739 (1267)
T KOG1264|consen 727 PLYRKMKLRYPVN 739 (1267)
T ss_pred hhhhcccccCcCC
Confidence 8877789998853
No 15
>KOG0790|consensus
Probab=99.45 E-value=2.8e-13 Score=93.12 Aligned_cols=90 Identities=22% Similarity=0.285 Sum_probs=78.1
Q ss_pred CCHHHHHHHhCCC-CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhC
Q psy3300 1 MIQEITLEVLSQE-PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVM 79 (97)
Q Consensus 1 isr~~ae~~L~~~-~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~ 79 (97)
|+..+||.+|+.. .+|+||.|.|++.+|.|+|+++.. +.+.|.+|...++-|.+.+ ...|.++.|||+||..+
T Consensus 11 ~~g~~ae~Ll~~~g~dgsfl~r~s~sNp~~fsl~~r~~-----~~v~hikiq~~~~~~~l~~-gekfat~~ELvqyyme~ 84 (600)
T KOG0790|consen 11 LSGVEAETLLKERGVDGSFLARPSESNPGDFSLSVRRG-----DKVTHIKIQNSGDFYDLYG-GEKFATLAELVQYYMEH 84 (600)
T ss_pred ccchhHHHHHHHhccccchhhccccCCCcceeEEEEeC-----CceEEEEEeecCccccccC-CccccchHHHHHHHHhh
Confidence 4567999999876 899999999999999999999994 8999999987776688887 88999999999999887
Q ss_pred C------CCcceeecCC--CCCCCC
Q psy3300 80 P------ELLPCTLSLH--RYNPNF 96 (97)
Q Consensus 80 ~------~~l~~~L~~p--~~~p~~ 96 (97)
. .|...-|++| |.+|+-
T Consensus 85 ~~~lkekng~~ielK~pl~cAdpts 109 (600)
T KOG0790|consen 85 HGQLKEKNGDVIELKYPLNCADPTS 109 (600)
T ss_pred hHHHHhcCCCEEEecCCCccCCchh
Confidence 5 5677888888 577763
No 16
>KOG0194|consensus
Probab=99.30 E-value=4.3e-11 Score=83.80 Aligned_cols=91 Identities=24% Similarity=0.354 Sum_probs=72.5
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCe---e-EEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHh
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGC---F-ALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHH 76 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~---~-~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y 76 (97)
+.|++|+.+|+. +|+||||.|+...|. + +||+.... ...++||.|...+++++..+ ...|+++.+||.||
T Consensus 56 l~red~~~lL~~--~GDfLvR~s~~~~~~~~~~~vlSv~~~~---~~~~~h~vi~~~~~~~~~~~-~~~F~si~~li~~~ 129 (474)
T KOG0194|consen 56 LPREDAEKLLKN--DGDFLVRASEPKEGEKREFVVLSVKWSV---FKKIKHYVIKRNGNLFFFEG-LRKFPTISELVNYY 129 (474)
T ss_pred ccHhHHHHHhCC--CCceEEEeecccCCcceeEEEEEEEeec---CCceeEEEEEEcCCeeEEec-cccCCcHHHHHHHH
Confidence 579999999977 799999999877652 3 78888731 25888999999888766665 78999999999999
Q ss_pred hhCCCCcc---eeecCCCCCCCCC
Q psy3300 77 SVMPELLP---CTLSLHRYNPNFE 97 (97)
Q Consensus 77 ~~~~~~l~---~~L~~p~~~p~~~ 97 (97)
..+..... ..|..|-..+.||
T Consensus 130 ~~~~~~~~~~~~~L~~PI~r~~We 153 (474)
T KOG0194|consen 130 KFSKLEITGKNFFLKRPIPRQKWE 153 (474)
T ss_pred HhcccceeccceeecccccccccE
Confidence 98876543 4688887776664
No 17
>KOG3601|consensus
Probab=99.11 E-value=8.9e-12 Score=78.32 Aligned_cols=75 Identities=31% Similarity=0.610 Sum_probs=65.9
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc-EEEecccCCCCCHHHHHHHhhhC
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG-YKIKGFTKEFSSLTSLITHHSVM 79 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~-~~l~~~~~~F~sl~~Lv~~y~~~ 79 (97)
|+|..||++|....+|+||+|.+++.+|.|+++++.. ..|.|+++.+...| |++ +...|+++.+|++||..+
T Consensus 65 i~r~~ae~~l~~~~~G~fl~r~s~sSPg~fsgsvr~~-----d~vqhfkvvrpa~~k~~l--w~skfnslnplv~Y~rt~ 137 (222)
T KOG3601|consen 65 IPRPLAEDLLSKKRDGDFLIRLSESSPGDFSGSVRFP-----DGVQHFKVVRPAFGKYFL--WSSKFNSLNPLVSYHRTA 137 (222)
T ss_pred cccchhhhhhhccCcchhhhhhhhcCcccccccccCC-----CCceeccccccCcccccc--chhhccCCCCCcccCccc
Confidence 5789999999998999999999999999999999995 89999999877655 665 377899999999999887
Q ss_pred CCC
Q psy3300 80 PEL 82 (97)
Q Consensus 80 ~~~ 82 (97)
+..
T Consensus 138 s~~ 140 (222)
T KOG3601|consen 138 SQS 140 (222)
T ss_pred ccc
Confidence 543
No 18
>PF14633 SH2_2: SH2 domain; PDB: 3GXX_A 3GXW_B 3PJP_B 2XP1_A.
Probab=98.74 E-value=1.4e-07 Score=60.55 Aligned_cols=72 Identities=25% Similarity=0.403 Sum_probs=55.4
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCC------c--EEEecccCCCCCHHHH
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAK------G--YKIKGFTKEFSSLTSL 72 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~------~--~~l~~~~~~F~sl~~L 72 (97)
++-.+|+++|++.+.|.++||+|+.-.+..+++++.. .+...|+.|...+. | ..+ + ...|++|.||
T Consensus 44 ~~~~qAe~~L~~~~~Ge~iIRPSSkG~dhL~vTwKv~----d~vyqHidV~E~~K~n~~slG~~L~i-~-~~~yeDLDEi 117 (220)
T PF14633_consen 44 FNYKQAEEYLADQDVGEVIIRPSSKGPDHLTVTWKVA----DGVYQHIDVKEEDKENEFSLGKTLKI-G-GEEYEDLDEI 117 (220)
T ss_dssp S-HHHHHHHHCCS-TT-EEEEE-TTTTTEEEEEEEEE----TTEEEEEEEEEECSSSTTS-SSEEEE-T-TEEESSHHHH
T ss_pred CCHHHHHHHHhcCCCCCEEEeeCCCCCCeEEEEEEEc----CCcEEEEEEEECCCcCccccCcEEEE-C-CeEECCHHHH
Confidence 4678999999999999999999987778899999997 47788999988652 2 333 3 7789999999
Q ss_pred HHHhhh
Q psy3300 73 ITHHSV 78 (97)
Q Consensus 73 v~~y~~ 78 (97)
|.-|..
T Consensus 118 i~r~V~ 123 (220)
T PF14633_consen 118 IARHVE 123 (220)
T ss_dssp HHHCHH
T ss_pred HHHHHH
Confidence 988754
No 19
>KOG3751|consensus
Probab=98.54 E-value=2.7e-08 Score=69.93 Aligned_cols=85 Identities=28% Similarity=0.529 Sum_probs=70.8
Q ss_pred CCHHHHHHHhCCC--CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeC-Cc---EEEecccCCCCCHHHHHH
Q psy3300 1 MIQEITLEVLSQE--PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTA-KG---YKIKGFTKEFSSLTSLIT 74 (97)
Q Consensus 1 isr~~ae~~L~~~--~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~-~~---~~l~~~~~~F~sl~~Lv~ 74 (97)
|+|+++..+++++ .+|.|++|+|.+.|..|++++... .+++||.|.... .| +.+.+....|.++..|++
T Consensus 526 is~~es~~~ikq~glv~~~~l~r~sqsnP~~~~~~~~~~-----~~v~~~~~~P~~~~~~~~~t~~~g~t~~sd~~ql~~ 600 (622)
T KOG3751|consen 526 ISRDESQRLIKQQGLVDGLFLVRDSQSNPKIFVLSLCHP-----QKVKHFQILPVEDDGCTFFTLDDGPTKFSDLIQLVE 600 (622)
T ss_pred cCchhhhhHHHhcccceeeeeecccccCcchhhhhccCC-----ccccceEEecCCCCCceeeccCCCCccccccccccc
Confidence 6899999988876 789999999999999999999885 899999998764 34 334333567999999999
Q ss_pred HhhhCCCCcceeecCC
Q psy3300 75 HHSVMPELLPCTLSLH 90 (97)
Q Consensus 75 ~y~~~~~~l~~~L~~p 90 (97)
+|+-+...++|.|+.=
T Consensus 601 ~~ql~k~~l~~al~~~ 616 (622)
T KOG3751|consen 601 FYQLNKGVLPCALKHC 616 (622)
T ss_pred hhhcCCCcchHHHHHH
Confidence 9999988888887653
No 20
>PF14633 SH2_2: SH2 domain; PDB: 3GXX_A 3GXW_B 3PJP_B 2XP1_A.
Probab=97.56 E-value=0.0002 Score=46.13 Aligned_cols=71 Identities=25% Similarity=0.329 Sum_probs=49.9
Q ss_pred HHHHHHHhCCC-----CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhh
Q psy3300 3 QEITLEVLSQE-----PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHS 77 (97)
Q Consensus 3 r~~ae~~L~~~-----~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~ 77 (97)
+++++..|... .--.|.+..+...||.|.|++.... ...+.+.-|.-..+||.+. ...|+++.+|+.+++
T Consensus 143 ~~e~e~~L~~~k~~nP~~i~Y~f~~~~~~PG~F~L~y~~~~---~~~~~~~~v~V~p~Gf~~r--~~~f~~~~~L~~~FK 217 (220)
T PF14633_consen 143 KEEVEEWLKEEKKANPKRIPYAFCISKEHPGYFILSYKPNK---NPRHEYWPVKVTPDGFRFR--KQVFPSLDRLINWFK 217 (220)
T ss_dssp CCCCHHHHHCHHHHSTTS-EEEEEE-TTSTTEEEEEEESST---TS-EEEEEEEE-SSSEEET--TEEESSHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCCCCceEEEEECCCCCCEEEEEEEcCC---CCceEEeeEEEecCcEEEe--cccCCCHHHHHHHHh
Confidence 45666777642 3356888888889999999998852 1333443377788999985 778999999999987
Q ss_pred h
Q psy3300 78 V 78 (97)
Q Consensus 78 ~ 78 (97)
.
T Consensus 218 ~ 218 (220)
T PF14633_consen 218 K 218 (220)
T ss_dssp H
T ss_pred h
Confidence 5
No 21
>KOG1856|consensus
Probab=97.44 E-value=0.0002 Score=54.97 Aligned_cols=73 Identities=19% Similarity=0.274 Sum_probs=57.0
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCC-c-EEEe-----cccCCCCCHHHHH
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAK-G-YKIK-----GFTKEFSSLTSLI 73 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~-~-~~l~-----~~~~~F~sl~~Lv 73 (97)
++-++|+.+|.....|.++||+|+......+++++.. .+...|+.|...+. + |.+. + +..|++|.++|
T Consensus 1114 ~n~eQAe~yL~~~d~ge~iiRpSSrgddhLvvtwKVs----D~iYqhidV~E~eKEn~fslg~~l~i~-~e~feDLDEiI 1188 (1299)
T KOG1856|consen 1114 LNAEQAEAYLSDMDQGELIIRPSSRGDDHLVVTWKVS----DGIYQHIDVQELEKENYFSLGKTLWIG-GEEFEDLDEII 1188 (1299)
T ss_pred CCHHHHHHHHHhcccccEEeccccCCCCceEEEEEec----CchhhhhhhhhhhccccccccceEEEC-CcccccHHHHH
Confidence 4568999999998999999999988777888999887 46778888876432 1 3332 3 77899999999
Q ss_pred HHhhh
Q psy3300 74 THHSV 78 (97)
Q Consensus 74 ~~y~~ 78 (97)
.=|..
T Consensus 1189 ~r~vq 1193 (1299)
T KOG1856|consen 1189 ARYVQ 1193 (1299)
T ss_pred HHHHH
Confidence 87754
No 22
>KOG4566|consensus
Probab=97.29 E-value=0.00077 Score=44.40 Aligned_cols=73 Identities=25% Similarity=0.397 Sum_probs=52.7
Q ss_pred CHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEE----eccc--CCCCCHHHHHHH
Q psy3300 2 IQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKI----KGFT--KEFSSLTSLITH 75 (97)
Q Consensus 2 sr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l----~~~~--~~F~sl~~Lv~~ 75 (97)
+.++|+..|...+.|+|+||+|...+-.+.|++... .+.+.-+|.+.++.|.. .+.. ..|++...++.+
T Consensus 60 ~~~~~~~~l~~~p~~~~l~Rdss~~~~~~~I~vk~~-----~g~~~~r~~~~~~sfr~ds~~~~~~pl~~~e~~~~~~~~ 134 (258)
T KOG4566|consen 60 TANEAELCLGQEPRGTFLVRDSSHRPYLFTISVKTH-----SGPKNLRIQYQDSSFRLDSSHRHTPPLISFEDVEVLIHH 134 (258)
T ss_pred cchhHHHhhcCCCccceeeecCcCccccceeEeeec-----cCCCCccccccccceecccccccCCCcccChhHHHHHHh
Confidence 456777778778999999999987777788998885 45555566555543443 1212 369999999999
Q ss_pred hhhC
Q psy3300 76 HSVM 79 (97)
Q Consensus 76 y~~~ 79 (97)
|+..
T Consensus 135 y~~~ 138 (258)
T KOG4566|consen 135 YRQS 138 (258)
T ss_pred hhhh
Confidence 9554
No 23
>KOG3697|consensus
Probab=97.09 E-value=0.0012 Score=44.04 Aligned_cols=62 Identities=19% Similarity=0.183 Sum_probs=38.9
Q ss_pred CCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCCCCcc-----eeecCCCCCC
Q psy3300 26 KPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMPELLP-----CTLSLHRYNP 94 (97)
Q Consensus 26 ~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~~~l~-----~~L~~p~~~p 94 (97)
.+|.|+|+=.. ++..+|..+. ...|..-.. ...|+|+..||+||..+.+.+. .+|+.|...+
T Consensus 278 ~~gqyvltgl~-----~~~~khlllv-dpegvvrtk-d~~fdsishli~yh~~~~lpiis~~sel~l~~pv~r~ 344 (345)
T KOG3697|consen 278 TPGQYVLTGLQ-----SGQPKHLLLV-DPEGVVRTK-DRRFDSISHLINYHMDNHLPIISAGSELCLQQPVERK 344 (345)
T ss_pred CCccEEEeccc-----CCCcceEEEE-CCccceecc-cchhhHHHHHHHHhhhccCceecCcchhhhhCccccc
Confidence 45555555444 2566776654 233422222 6679999999999999876544 4677776543
No 24
>KOG1856|consensus
Probab=96.35 E-value=0.031 Score=43.62 Aligned_cols=72 Identities=17% Similarity=0.265 Sum_probs=57.8
Q ss_pred CHHHHHHHhCCC--CC---ceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHh
Q psy3300 2 IQEITLEVLSQE--PV---GAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHH 76 (97)
Q Consensus 2 sr~~ae~~L~~~--~~---G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y 76 (97)
++.+.+.+|... .. --|...-|...||.|.|+++.. ++++|--+.-...||.+. +..|+||.+|+.|+
T Consensus 1212 t~~~~ek~L~~~k~~np~~~~Y~F~~s~~~PG~F~L~y~~~-----~k~~heyv~v~p~g~~~r--g~~f~tld~L~~~F 1284 (1299)
T KOG1856|consen 1212 TKKEVEKLLRDYKKVNPKKSVYFFCASHEHPGKFCLSYKPS-----SKPRHEYVKVVPEGFRFR--GQNFGTLDELCRWF 1284 (1299)
T ss_pred CHHHHHHHHHHHhccCCCeeeEEEEecccCCceEEEEeccC-----CCccceeEEEcccceEEe--cccchhHHHHHHHH
Confidence 356778888731 22 3456677888999999999985 889999888889998887 67899999999999
Q ss_pred hhCC
Q psy3300 77 SVMP 80 (97)
Q Consensus 77 ~~~~ 80 (97)
..+-
T Consensus 1285 K~h~ 1288 (1299)
T KOG1856|consen 1285 KRHY 1288 (1299)
T ss_pred HHHh
Confidence 8764
No 25
>PF02762 Cbl_N3: CBL proto-oncogene N-terminus, SH2-like domain; InterPro: IPR014742 Cbl (Casitas B-lineage lymphoma) is an adaptor protein that functions as a negative regulator of many signalling pathways that start from receptors at the cell surface. The N-terminal region of Cbl contains a Cbl-type phosphotyrosine-binding (Cbl-PTB) domain, which is composed of three evolutionarily conserved domains: an N-terminal four-helix bundle (4H) domain, an EF hand-like calcium-binding domain, and a divergent SH2-like domain. The calcium-bound EF-hand wedges between the 4H and SH2 domains, and roughly determines their relative orientation. The Cbl-PTB domain has also been named Cbl N-terminal (Cbl-N) or tyrosine kinase binding (TKB) domain [, ]. The N-terminal 4H domain contains four long alpha-helices. The C and D helices in this domain pack against the adjacent EF-hand-like domain, and a highly conserved loop connecting the A and B helices contacts the SH2-like domain. The EF-hand motif is similar to classical EF-hand proteins. The SH2-like domain retains the general helix-sheet-helix architecture of the SH2 fold, but lacks the secondary beta-sheet, comprising beta-strands D', E and F, and also a prominent BG loop []. This entry represents the SH2-like domain.; PDB: 3PFV_A 3VGO_A 3PLF_B 2Y1M_A 2CBL_A 3BUX_B 3BUN_B 3BUM_B 3OB1_B 3BUW_B ....
Probab=96.16 E-value=0.029 Score=30.37 Aligned_cols=36 Identities=19% Similarity=0.302 Sum_probs=29.9
Q ss_pred CHHHHHHHhCC--CCCceEEEeecCCCCCeeEEEEEec
Q psy3300 2 IQEITLEVLSQ--EPVGAFMVRESTTKPGCFALSLRVP 37 (97)
Q Consensus 2 sr~~ae~~L~~--~~~G~fLiR~s~~~~~~~~Ls~~~~ 37 (97)
+=+|+...|.. ..+|+|+.|-|.++-|.|.+-....
T Consensus 10 TYdevk~~L~~~~~kpGsYiFRlSCTrLGQWAIGyV~~ 47 (86)
T PF02762_consen 10 TYDEVKARLQHYRDKPGSYIFRLSCTRLGQWAIGYVTQ 47 (86)
T ss_dssp -HHHHHHHHGGGTTSTTEEEEEEESSSTTSEEEEEEET
T ss_pred eHHHHHHHHHHHhCCcccEEEeeccccccceeEEEEcC
Confidence 45677788874 4789999999999999999988875
No 26
>KOG3667|consensus
Probab=92.58 E-value=0.37 Score=35.95 Aligned_cols=47 Identities=17% Similarity=0.285 Sum_probs=34.3
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEE
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLIL 51 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~ 51 (97)
++++.+..+|...++|+|++|-|++.-|..+++....+ ....++.|.
T Consensus 570 inkq~~~~ll~~~~~GtflLrfs~S~~GgiT~~~v~~~----~~~~~~~i~ 616 (682)
T KOG3667|consen 570 INKQQERALLMTKPDGTFLLRFSASEEGGITIAWVEDQ----KQNLIMMIM 616 (682)
T ss_pred cchhhhhhhhhcCCCCCceeeeeccccCceeEEecccc----cccceeEec
Confidence 46677788888889999999999888776666655542 222567776
No 27
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=86.80 E-value=2.4 Score=23.33 Aligned_cols=24 Identities=21% Similarity=0.501 Sum_probs=13.7
Q ss_pred CCceEEEeecCCCCCeeEEEEEec
Q psy3300 14 PVGAFMVRESTTKPGCFALSLRVP 37 (97)
Q Consensus 14 ~~G~fLiR~s~~~~~~~~Ls~~~~ 37 (97)
.+|+|.+.-.....|.|.|++...
T Consensus 54 ~dGty~v~y~P~~~G~~~i~V~~~ 77 (93)
T smart00557 54 GDGTYTVSYTPTEPGDYTVTVKFG 77 (93)
T ss_pred CCCEEEEEEEeCCCEeEEEEEEEC
Confidence 455666665555555555555553
No 28
>KOG1785|consensus
Probab=86.29 E-value=1.3 Score=31.50 Aligned_cols=34 Identities=21% Similarity=0.335 Sum_probs=28.1
Q ss_pred HHHHHHhCC--CCCceEEEeecCCCCCeeEEEEEec
Q psy3300 4 EITLEVLSQ--EPVGAFMVRESTTKPGCFALSLRVP 37 (97)
Q Consensus 4 ~~ae~~L~~--~~~G~fLiR~s~~~~~~~~Ls~~~~ 37 (97)
+|....|.. ..+|+|+.|-|.+..|.|.|-++..
T Consensus 266 DEVk~RLqk~~~KpGSYIFRlSCTRlGQWAIGYVt~ 301 (563)
T KOG1785|consen 266 DEVKARLQKYIKKPGSYIFRLSCTRLGQWAIGYVTA 301 (563)
T ss_pred HHHHHHHHHHhcCCCceEEeeccCcccceeEEEEcC
Confidence 556666654 3799999999999999999988876
No 29
>PF10505 NARG2_C: NMDA receptor-regulated gene protein 2 C-terminus; InterPro: IPR019535 The transition of neuronal cells from pre-cursor to mature state is regulated by the N-methyl-d-aspartate (NMDA) receptor, a glutamate-gated ion channel that is permeable to Ca2+. NMDA receptors probably mediate this activity by permitting expression of NARG2. NARG2 (NMDA receptor-regulated gene protein 2) is transiently expressed, being a regulatory protein that is present in the nucleus of dividing cells and then down-regulated as progenitors exit the cell cycle and begin to differentiate. NARG2 contains repeats of (S/T)PXX, (11 in mouse, 6 in human), a putative DNA-binding motif that is found in many gene-regulatory proteins including Kruppel, Hunchback and Antennapedi [].
Probab=70.39 E-value=10 Score=24.71 Aligned_cols=28 Identities=21% Similarity=0.420 Sum_probs=19.7
Q ss_pred HHhCCCCCceEEEeecCCCCCeeEEEEEec
Q psy3300 8 EVLSQEPVGAFMVRESTTKPGCFALSLRVP 37 (97)
Q Consensus 8 ~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~ 37 (97)
..|.+-++|.||+|... .+.|++.++..
T Consensus 138 k~L~~L~pG~YLLrh~~--~d~f~~i~k~~ 165 (220)
T PF10505_consen 138 KKLSSLQPGSYLLRHTP--GDSFVLIYKSS 165 (220)
T ss_pred HHHcCCCCCcEEEEecC--CCCcEEEEEcC
Confidence 34445599999999986 55566666654
No 30
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=65.81 E-value=9.3 Score=20.88 Aligned_cols=24 Identities=25% Similarity=0.567 Sum_probs=17.5
Q ss_pred CCceEEEeecCCCCCeeEEEEEec
Q psy3300 14 PVGAFMVRESTTKPGCFALSLRVP 37 (97)
Q Consensus 14 ~~G~fLiR~s~~~~~~~~Ls~~~~ 37 (97)
.+|+|.++-....+|.|.|++...
T Consensus 68 ~~G~y~v~y~p~~~G~y~i~V~~~ 91 (101)
T PF00630_consen 68 GDGTYTVSYTPTEPGKYKISVKIN 91 (101)
T ss_dssp SSSEEEEEEEESSSEEEEEEEEES
T ss_pred CCCEEEEEEEeCccEeEEEEEEEC
Confidence 567777777777777777777764
No 31
>PF07935 SSV1_ORF_D-335: ORF D-335-like protein; InterPro: IPR012922 The sequences featured in this family are similar to a probable integrase (P20214 from SWISSPROT) expressed by the SSV1 virus of the archaeon Sulfolobus shibatae. This protein may be necessary for the integration of the virus into the host genome by a process of site-specific recombination [].
Probab=63.26 E-value=22 Score=19.04 Aligned_cols=36 Identities=14% Similarity=0.101 Sum_probs=20.8
Q ss_pred CceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCCC
Q psy3300 43 LGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMPE 81 (97)
Q Consensus 43 ~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~~ 81 (97)
++..-|+|+...+|-.- ...-.+|.++|+.|.+...
T Consensus 15 GkYYVY~iE~~~~G~~r---e~YVGpL~~VVe~Yik~K~ 50 (72)
T PF07935_consen 15 GKYYVYKIEKDINGERR---ETYVGPLDDVVETYIKLKE 50 (72)
T ss_pred CeEEEEEEEeccCCcee---eeeeccHHHHHHHHHhccc
Confidence 34444555553443110 2234689999999988743
No 32
>PF09430 DUF2012: Protein of unknown function (DUF2012); InterPro: IPR019008 This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development [].
Probab=62.13 E-value=28 Score=20.30 Aligned_cols=34 Identities=21% Similarity=0.466 Sum_probs=23.7
Q ss_pred CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEe
Q psy3300 14 PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRT 53 (97)
Q Consensus 14 ~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~ 53 (97)
.+|+|.+++=+ +|.|.|.+... .-....+||.-.
T Consensus 28 ~dG~F~f~~Vp--~GsY~L~V~s~----~~~F~~~RVdV~ 61 (123)
T PF09430_consen 28 SDGSFVFHNVP--PGSYLLEVHSP----DYVFPPYRVDVS 61 (123)
T ss_pred CCCEEEeCCCC--CceEEEEEECC----CccccCEEEEEe
Confidence 68999998864 68899988875 234445666554
No 33
>KOG2293|consensus
Probab=54.23 E-value=13 Score=27.48 Aligned_cols=67 Identities=12% Similarity=0.086 Sum_probs=40.2
Q ss_pred CCHHHHHHHhCCCCCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc--EEEecccCCCCCHHHHHHHhhh
Q psy3300 1 MIQEITLEVLSQEPVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG--YKIKGFTKEFSSLTSLITHHSV 78 (97)
Q Consensus 1 isr~~ae~~L~~~~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~--~~l~~~~~~F~sl~~Lv~~y~~ 78 (97)
|||.+|.-.|++ .|+|-| ..-|.+.|.|-. +.+-|-.+....+. ..|-|-.-.|..-.++|.-|..
T Consensus 474 ISRRQa~IkL~n--~GsF~I----kNlGK~~I~vng------~~l~~gq~~~L~~nclveIrg~~FiF~~N~~~v~~~l~ 541 (547)
T KOG2293|consen 474 ISRRQALIKLKN--DGSFFI----KNLGKRSILVNG------GELDRGQKVILKNNCLVEIRGLRFIFEINQEAVGQYLK 541 (547)
T ss_pred eeccceeEEecc--CCcEEe----ccCcceeEEeCC------ccccCCceEEeccCcEEEEccceEEEeecHHHHHHHHH
Confidence 566666666644 699999 456778887765 34444433333333 3344323346667888888865
Q ss_pred C
Q psy3300 79 M 79 (97)
Q Consensus 79 ~ 79 (97)
.
T Consensus 542 ~ 542 (547)
T KOG2293|consen 542 E 542 (547)
T ss_pred h
Confidence 4
No 34
>PF00408 PGM_PMM_IV: Phosphoglucomutase/phosphomannomutase, C-terminal domain; InterPro: IPR005843 The alpha-D-phosphohexomutase superfamily is composed of four related enzymes, each of which catalyses a phosphoryl transfer on their sugar substrates: phosphoglucomutase (PGM), phosphoglucomutase/phosphomannomutase (PGM/PMM), phosphoglucosamine mutase (PNGM), and phosphoacetylglucosamine mutase (PAGM) []. PGM (5.4.2.2 from EC) converts D-glucose 1-phosphate into D-glucose 6-phosphate, and participates in both the breakdown and synthesis of glucose []. PGM/PMM (5.4.2.2 from EC; 5.4.2.8 from EC) are primarily bacterial enzymes that use either glucose or mannose as substrate, participating in the biosynthesis of a variety of carbohydrates such as lipopolysaccharides and alginate [, ]. Both PNGM (5.4.2.3 from EC) and PAGM (5.4.2.10 from EC) are involved in the biosynthesis of UDP-N-acetylglucosamine [, ]. Despite differences in substrate specificity, these enzymes share a similar catalytic mechanism, converting 1-phospho-sugars to 6-phospho-sugars via a biphosphorylated 1,6-phospho-sugar. The active enzyme is phosphorylated at a conserved serine residue and binds one magnesium ion; residues around the active site serine are well conserved among family members. The reaction mechanism involves phosphoryl transfer from the phosphoserine to the substrate to create a biophosphorylated sugar, followed by a phosphoryl transfer from the substrate back to the enzyme []. The structures of PGM and PGM/PMM have been determined, and were found to be very similar in topology. These enzymes are both composed of four domains and a large central active site cleft, where each domain contains residues essential for catalysis and/or substrate recognition. Domain I contains the catalytic phosphoserine, domain II contains a metal-binding loop to coordinate the magnesium ion, domain III contains the sugar-binding loop that recognises the two different binding orientations of the 1- and 6-phospho-sugars, and domain IV contains a phosphate-binding site required for orienting the incoming phospho-sugar substrate. This entry represents the C-terminal domain alpha-D-phosphohexomutase enzymes.; GO: 0016868 intramolecular transferase activity, phosphotransferases, 0005975 carbohydrate metabolic process; PDB: 1KFQ_B 1KFI_A 3PDK_B 2F7L_A 1TUO_A 2FKM_X 3C04_A 1K2Y_X 1P5G_X 2H4L_X ....
Probab=49.98 E-value=16 Score=19.09 Aligned_cols=15 Identities=40% Similarity=0.403 Sum_probs=12.4
Q ss_pred CCceEE-EeecCCCCC
Q psy3300 14 PVGAFM-VRESTTKPG 28 (97)
Q Consensus 14 ~~G~fL-iR~s~~~~~ 28 (97)
.+|..+ ||.|-+.|-
T Consensus 33 ~dG~~l~vR~SgTEP~ 48 (73)
T PF00408_consen 33 EDGWRLLVRPSGTEPK 48 (73)
T ss_dssp TTEEEEEEEEESSSSE
T ss_pred CCceEEEEECCCCCce
Confidence 689999 999987764
No 35
>PF05303 DUF727: Protein of unknown function (DUF727); InterPro: IPR007967 This family consists of several uncharacterised eukaryotic proteins of unknown function.; PDB: 1SGO_A.
Probab=49.79 E-value=49 Score=19.02 Aligned_cols=67 Identities=22% Similarity=0.214 Sum_probs=33.3
Q ss_pred HHHHHHHhCCC--CCceEEEeecCCC---CCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecc---------------
Q psy3300 3 QEITLEVLSQE--PVGAFMVRESTTK---PGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGF--------------- 62 (97)
Q Consensus 3 r~~ae~~L~~~--~~G~fLiR~s~~~---~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~--------------- 62 (97)
+.||+..+.+- ....-.| |+.- .+...|-+... --..|.|.....||++.+.
T Consensus 3 ~~Ea~aai~dv~~~V~~i~v--s~~Lp~~~~l~Ylnv~Tl------Eg~~~cIelt~~Gf~V~s~~~D~~~~~~~~~~~~ 74 (108)
T PF05303_consen 3 RLEAEAAINDVAFAVKEISV--SEKLPRTGDLIYLNVTTL------EGRTYCIELTTKGFRVVSSTFDCMDPDPSQSSLH 74 (108)
T ss_dssp HHHHHHHHHHHGGGSSEEEE----SS-EETTEEEEEEE-T------T--EEEEEEETTEEEEEESSTT---TT-------
T ss_pred HHHHHHHHHHHHHHhhEEEe--cccCCCCCCEEEEEEEEe------cCCEEEEEEECCeEEEeeecCCCcCccccccccc
Confidence 56777777653 2333334 3222 23344555552 3345666667788887542
Q ss_pred cCCCCCHHHHHHHhh
Q psy3300 63 TKEFSSLTSLITHHS 77 (97)
Q Consensus 63 ~~~F~sl~~Lv~~y~ 77 (97)
...|.|+.+|+..++
T Consensus 75 ~~~~eTl~~LL~~iS 89 (108)
T PF05303_consen 75 TKYFETLYALLDSIS 89 (108)
T ss_dssp --EESSSHHHHHHH-
T ss_pred chHHhhHHHHHHHHC
Confidence 112567777776643
No 36
>cd00214 Calpain_III Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
Probab=45.41 E-value=43 Score=20.07 Aligned_cols=24 Identities=33% Similarity=0.574 Sum_probs=14.3
Q ss_pred CCceEEEeecCCCC---CeeEEEEEec
Q psy3300 14 PVGAFMVRESTTKP---GCFALSLRVP 37 (97)
Q Consensus 14 ~~G~fLiR~s~~~~---~~~~Ls~~~~ 37 (97)
++|.|+|-.|...+ |.|.|.|...
T Consensus 116 ~pG~YvIIPsT~~p~~~g~F~LrVfs~ 142 (150)
T cd00214 116 PPGEYVIVPSTFEPGEEGEFLLRVFSE 142 (150)
T ss_pred CCCCEEEEeeecCCCCcccEEEEEEec
Confidence 56777777665443 3466666553
No 37
>PHA00730 int integrase
Probab=44.96 E-value=1.1e+02 Score=21.45 Aligned_cols=35 Identities=20% Similarity=0.176 Sum_probs=21.2
Q ss_pred CceeEEEEEEeCCcEEEecccCCCCCHHHHHHHhhhCC
Q psy3300 43 LGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHHSVMP 80 (97)
Q Consensus 43 ~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y~~~~ 80 (97)
++..-|+|+..++|-.- ...-.+|.++|+.|-...
T Consensus 17 ~~~~~~~~~~~~~~~~~---~~y~~~~~~~v~~~~~~~ 51 (337)
T PHA00730 17 GKYYVYLIEKDNEGNRR---DTYVGSLDQVVESYLKMK 51 (337)
T ss_pred CeEEEEEEEecCCCcch---hcccccHHHHHHHHHHHh
Confidence 45555666655444111 223468999999997763
No 38
>COG5453 Uncharacterized conserved protein [Function unknown]
Probab=44.29 E-value=58 Score=18.26 Aligned_cols=52 Identities=19% Similarity=0.301 Sum_probs=32.4
Q ss_pred CCceEEEeecC-CCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHHh
Q psy3300 14 PVGAFMVREST-TKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITHH 76 (97)
Q Consensus 14 ~~G~fLiR~s~-~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~y 76 (97)
..+.|||+..+ +..|.|.++=+....++ +.++.++..+ ...|.++.|-|+.-
T Consensus 27 ehkdflI~atP~~eggQyr~aG~I~k~id-gevk~h~FiR----------aD~ft~~~DAve~~ 79 (96)
T COG5453 27 EHKDFLIYATPISEGGQYRLAGRIEKEID-GEVKVHRFIR----------ADLFTSLDDAVEIA 79 (96)
T ss_pred cccCeEEEeeecccCCeEEEeeEeeeecC-ceeeeeeeee----------hhhccchhHHHHHH
Confidence 45889999876 45678888766654332 4444444332 33577888777654
No 39
>PF09625 VP9: VP9 protein; InterPro: IPR018587 VP9 is a protein containing a ferredoxin fold. Two dimers come together to form one asymmetric unit which possesses a DNA recognition fold and specific metal binding sites possibly for zinc. It is postulated that being a non-structural protein VP9 is involved in the transcriptional regulation of the White spot syndrome virus (WSSV), from which it comes. WSSV is the major viral pathogen in shrimp aquaculture []. VP9 is found N-terminal to the IPR009766 from INTERPRO. ; PDB: 2ZUG_A 2GJ2_A 2GJI_A.
Probab=42.28 E-value=51 Score=17.90 Aligned_cols=18 Identities=22% Similarity=0.525 Sum_probs=10.8
Q ss_pred EEeecCCCCCeeEEEEEe
Q psy3300 19 MVRESTTKPGCFALSLRV 36 (97)
Q Consensus 19 LiR~s~~~~~~~~Ls~~~ 36 (97)
-||.|+-..|.|+..++.
T Consensus 29 siRkSEl~d~~~iV~Lk~ 46 (79)
T PF09625_consen 29 SIRKSELRDGVYIVQLKK 46 (79)
T ss_dssp EEEE-SS-TTEEEEEE-T
T ss_pred HHHHhhcccceEEEEEec
Confidence 478887777777776665
No 40
>PF01067 Calpain_III: Calpain large subunit, domain III; InterPro: IPR022682 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only []. All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit: A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb. Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related []. Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad []. Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma []. This entry represents domain III. It is found in association with PF00648 from PFAM. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions. ; PDB: 1QXP_B 2QFE_A 1DF0_A 1U5I_A 3DF0_A 3BOW_A 1KFU_L 1KFX_L.
Probab=41.88 E-value=41 Score=19.69 Aligned_cols=24 Identities=29% Similarity=0.497 Sum_probs=15.0
Q ss_pred CCceEEEeecCCC---CCeeEEEEEec
Q psy3300 14 PVGAFMVRESTTK---PGCFALSLRVP 37 (97)
Q Consensus 14 ~~G~fLiR~s~~~---~~~~~Ls~~~~ 37 (97)
++|.|+|=.|.-. .|.|.|.+...
T Consensus 115 ~~G~YvIVPsT~~~~~~g~F~L~v~s~ 141 (147)
T PF01067_consen 115 PPGTYVIVPSTYEPGQEGEFTLRVFSD 141 (147)
T ss_dssp -SEEEEEEEEESSTT--EEEEEEEEES
T ss_pred CCCCEEEEEecCCCCCeeeEEEEEEEC
Confidence 5788888777433 44577777664
No 41
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=41.79 E-value=21 Score=14.46 Aligned_cols=18 Identities=39% Similarity=0.460 Sum_probs=14.0
Q ss_pred ccCCCCCHHHHHHHhhhC
Q psy3300 62 FTKEFSSLTSLITHHSVM 79 (97)
Q Consensus 62 ~~~~F~sl~~Lv~~y~~~ 79 (97)
....|.++.+|+.|-+..
T Consensus 7 C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 7 CGKTFSSLSALREHKRSH 24 (27)
T ss_dssp TTEEESSHHHHHHHHCTT
T ss_pred cCCccCChhHHHHHhHHh
Confidence 356799999999997554
No 42
>smart00720 calpain_III calpain_III.
Probab=40.46 E-value=47 Score=19.55 Aligned_cols=24 Identities=33% Similarity=0.571 Sum_probs=14.0
Q ss_pred CCceEEEeecCCCC---CeeEEEEEec
Q psy3300 14 PVGAFMVRESTTKP---GCFALSLRVP 37 (97)
Q Consensus 14 ~~G~fLiR~s~~~~---~~~~Ls~~~~ 37 (97)
++|.|+|=.|...+ |.|.|.|...
T Consensus 111 ~~G~Y~iVPsT~~p~~~g~F~LrV~s~ 137 (143)
T smart00720 111 PPGEYVIVPSTFEPNQEGDFLLRVFSE 137 (143)
T ss_pred CCCCEEEEEeecCCCCccCEEEEEEec
Confidence 56777777664443 3466666553
No 43
>PF07240 Turandot: Stress-inducible humoral factor Turandot; InterPro: IPR010825 This family consists of several Drosophila species specific Turandot proteins. The Turandot A (TotA) gene encodes a humoral factor, which is secreted from the fat body and accumulates in the body fluids. TotA is strongly induced upon bacterial challenge, as well as by other types of stress such as high temperature, mechanical pressure, dehydration, UV irradiation, and oxidative agents. It is also upregulated during metamorphosis and at high age. Flies that overexpress TotA show prolonged survival and retain normal activity at otherwise lethal temperatures. Although TotA is only induced by severe stress, it responds to a much wider range of stimuli than heat shock genes such as hsp70 or immune genes such as Cecropin A1 [].
Probab=38.64 E-value=29 Score=19.24 Aligned_cols=18 Identities=11% Similarity=0.154 Sum_probs=14.5
Q ss_pred CCCCHHHHHHHhhhCCCC
Q psy3300 65 EFSSLTSLITHHSVMPEL 82 (97)
Q Consensus 65 ~F~sl~~Lv~~y~~~~~~ 82 (97)
.-.++++||++|.+.+..
T Consensus 8 K~rni~eLi~fY~ky~~~ 25 (85)
T PF07240_consen 8 KIRNIQELIAFYEKYSPR 25 (85)
T ss_pred HHhhHHHHHHHHHHcCcc
Confidence 457899999999987654
No 44
>COG1576 Uncharacterized conserved protein [Function unknown]
Probab=37.04 E-value=31 Score=21.33 Aligned_cols=21 Identities=19% Similarity=0.328 Sum_probs=16.5
Q ss_pred HHHHHHHhCCCCCceEEEeec
Q psy3300 3 QEITLEVLSQEPVGAFMVRES 23 (97)
Q Consensus 3 r~~ae~~L~~~~~G~fLiR~s 23 (97)
+.|++.+|..-++|+|+|--.
T Consensus 54 ~~E~~~il~~i~~~~~vi~Ld 74 (155)
T COG1576 54 KKEGEAILAAIPKGSYVVLLD 74 (155)
T ss_pred HHHHHHHHHhcCCCCeEEEEe
Confidence 578889988778898888543
No 45
>COG1724 Predicted RNA binding protein (dsRBD-like fold), HicA family [General function prediction only]
Probab=36.57 E-value=45 Score=17.55 Aligned_cols=20 Identities=20% Similarity=0.258 Sum_probs=14.4
Q ss_pred CHHHHHHHhCCCCCceEEEeec
Q psy3300 2 IQEITLEVLSQEPVGAFMVRES 23 (97)
Q Consensus 2 sr~~ae~~L~~~~~G~fLiR~s 23 (97)
+-.+++..|. .+|.|++|..
T Consensus 8 ~~ke~ik~Le--~~Gf~~vrqk 27 (66)
T COG1724 8 KAKEVIKALE--KDGFQLVRQK 27 (66)
T ss_pred CHHHHHHHHH--hCCcEEEEee
Confidence 3456777774 4799999975
No 46
>KOG0518|consensus
Probab=34.74 E-value=63 Score=26.15 Aligned_cols=26 Identities=27% Similarity=0.448 Sum_probs=23.2
Q ss_pred CCCceEEEeecCCCCCeeEEEEEecC
Q psy3300 13 EPVGAFMVRESTTKPGCFALSLRVPK 38 (97)
Q Consensus 13 ~~~G~fLiR~s~~~~~~~~Ls~~~~~ 38 (97)
-.+|+|-||..+.++|.|+|++....
T Consensus 904 ~~~~~y~vrFtP~e~G~~tl~V~y~~ 929 (1113)
T KOG0518|consen 904 LGQGTYQVRFTPKEPGNHTLSVKYKD 929 (1113)
T ss_pred CCCceEEEEecCCCCCceEEEEEecC
Confidence 36799999999999999999999864
No 47
>cd00122 MBD MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and BAZ2A-like proteins constitute a family of proteins that share the methyl-CpG-binding domain (MBD). The MBD consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1. The MBDs present in putative chromatin remodelling subunit, BAZ2A, and putative histone methyltransferase, CLLD8, represent two phylogenetically distinct groups within the MBD protein family.
Probab=34.01 E-value=63 Score=16.35 Aligned_cols=24 Identities=13% Similarity=0.019 Sum_probs=19.4
Q ss_pred EEEecccCCCCCHHHHHHHhhhCC
Q psy3300 57 YKIKGFTKEFSSLTSLITHHSVMP 80 (97)
Q Consensus 57 ~~l~~~~~~F~sl~~Lv~~y~~~~ 80 (97)
|++.-+...|.|..|+..|...+.
T Consensus 28 ~Y~sP~Gk~~Rs~~ev~~yL~~~~ 51 (62)
T cd00122 28 YYYSPCGKKLRSKPEVARYLEKTG 51 (62)
T ss_pred EEECCCCceecCHHHHHHHHHhCC
Confidence 666555778999999999998763
No 48
>PF01429 MBD: Methyl-CpG binding domain; InterPro: IPR001739 Methylation at CpG dinucleotide, the most common DNA modification in eukaryotes, has been correlated with gene silencing associated with various phenomena such as genomic imprinting, transposon and chromosome X inactivation, differentiation, and cancer. Effects of DNA methylation are mediated through proteins which bind to symmetrically methylated CpGs. Such proteins contain a specific domain of ~70 residues, the methyl-CpG-binding domain (MBD), which is linked to additional domains associated with chromatin, such as the bromodomain, the AT hook motif,the SET domain, or the PHD finger. MBD-containing proteins appear to act as structural proteins, which recruit a variety of histone deacetylase (HDAC) complexes and chromatin remodelling factors, leading to chromatin compaction and, consequently, to transcriptional repression. The MBD of MeCP2, MBD1, MBD2, MBD4 and BAZ2 mediates binding to DNA, in case of MeCP2, MBD1 and MBD2 preferentially to methylated CpG. In case of human MBD3 and SETDB1 the MBD has been shown to mediate protein-protein interactions [, ]. The MBD folds into an alpha/beta sandwich structure comprising a layer of twisted beta sheet, backed by another layer formed by the alpha1 helix and a hairpin loop at the C terminus. These layers are both amphipathic, with the alpha1 helix and the beta sheet lying parallel and the hydrophobic faces tightly packed against each other. The beta sheet is composed of two long inner strands (beta2 and beta3) sandwiched by two shorter outer strands (beta1 and beta4) [].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 2KY8_A 1UB1_A 1D9N_A 1IG4_A 1QK9_A 3C2I_A.
Probab=32.21 E-value=52 Score=17.48 Aligned_cols=25 Identities=20% Similarity=0.226 Sum_probs=19.7
Q ss_pred EEEecccCCCCCHHHHHHHhhhCCC
Q psy3300 57 YKIKGFTKEFSSLTSLITHHSVMPE 81 (97)
Q Consensus 57 ~~l~~~~~~F~sl~~Lv~~y~~~~~ 81 (97)
|++.-++..|.|..|++.|+..+..
T Consensus 34 ~Y~sP~Gk~~RS~~eV~~yL~~~~~ 58 (77)
T PF01429_consen 34 YYYSPCGKRFRSKKEVVRYLKENPS 58 (77)
T ss_dssp EEEETTSEEESSHHHHHHHHTTSS-
T ss_pred EEECCCCCEEeCHHHHHHHHHhCCC
Confidence 6665557789999999999988764
No 49
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=30.42 E-value=62 Score=15.12 Aligned_cols=13 Identities=38% Similarity=0.874 Sum_probs=10.1
Q ss_pred CCCCeeEEEEEec
Q psy3300 25 TKPGCFALSLRVP 37 (97)
Q Consensus 25 ~~~~~~~Ls~~~~ 37 (97)
++||.|.|.+...
T Consensus 10 T~PG~Y~l~~~a~ 22 (41)
T TIGR03769 10 TKPGTYTLTVQAT 22 (41)
T ss_pred CCCeEEEEEEEEE
Confidence 5688888888774
No 50
>cd01396 MeCP2_MBD MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
Probab=30.06 E-value=56 Score=17.53 Aligned_cols=24 Identities=25% Similarity=0.141 Sum_probs=19.7
Q ss_pred EEEecccCCCCCHHHHHHHhhhCC
Q psy3300 57 YKIKGFTKEFSSLTSLITHHSVMP 80 (97)
Q Consensus 57 ~~l~~~~~~F~sl~~Lv~~y~~~~ 80 (97)
|++.-....|.|..||..|+..+.
T Consensus 29 yY~sP~Gkk~RS~~ev~~yL~~~~ 52 (77)
T cd01396 29 YYISPTGKKFRSKVELARYLEKNG 52 (77)
T ss_pred EEECCCCCEEECHHHHHHHHHhCC
Confidence 666655778999999999998863
No 51
>PF09634 DUF2025: Protein of unknown function (DUF2025); InterPro: IPR018595 This protein is produced from gene PA1123 in Pseudomonas. It appears to be present in the biofilm layer and may be a lipoprotein. It contains three alpha helices and six beta strands and is thought to be monomeric.; PDB: 2HG6_A.
Probab=29.05 E-value=31 Score=19.40 Aligned_cols=16 Identities=31% Similarity=0.372 Sum_probs=11.4
Q ss_pred CCCceEEEeecCCCCC
Q psy3300 13 EPVGAFMVRESTTKPG 28 (97)
Q Consensus 13 ~~~G~fLiR~s~~~~~ 28 (97)
.+.|.++||-|+...|
T Consensus 23 ~KtG~~ivRFSEDsFG 38 (106)
T PF09634_consen 23 RKTGQHIVRFSEDSFG 38 (106)
T ss_dssp TTTTEEEEES-GGG-S
T ss_pred ccCceEEEEecccccC
Confidence 3789999999976555
No 52
>cd00068 GGL G protein gamma subunit-like motifs, the alpha-helical G-gamma chain dimerizes with the G-beta propeller subunit as part of the heterotrimeric G-protein complex; involved in signal transduction via G-protein-coupled receptors
Probab=28.60 E-value=70 Score=16.01 Aligned_cols=18 Identities=11% Similarity=0.027 Sum_probs=12.4
Q ss_pred CHHHHHHHhhhC--CCCcce
Q psy3300 68 SLTSLITHHSVM--PELLPC 85 (97)
Q Consensus 68 sl~~Lv~~y~~~--~~~l~~ 85 (97)
...+|+.|...+ .|.+..
T Consensus 25 a~~~l~~y~e~~~~~Dpll~ 44 (57)
T cd00068 25 AAAELLKYCEQNAENDPLLT 44 (57)
T ss_pred HHHHHHHHHHhcCCCCCCCC
Confidence 467888888777 555543
No 53
>KOG3413|consensus
Probab=28.07 E-value=1.2e+02 Score=18.71 Aligned_cols=51 Identities=20% Similarity=0.360 Sum_probs=26.2
Q ss_pred CCceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCcEEEecccCCCCCHHHHHHH
Q psy3300 14 PVGAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKGYKIKGFTKEFSSLTSLITH 75 (97)
Q Consensus 14 ~~G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~~~l~~~~~~F~sl~~Lv~~ 75 (97)
..|+|+|-.-+.... |.+.+. -.+.++|-+.+.+...+.- .=.+|.+|++-
T Consensus 89 ~~GTYViNKQ~PnkQ---IWlSSP----~SGPkryD~~~~~~Wiy~r----~g~sL~~lL~e 139 (156)
T KOG3413|consen 89 SVGTYVINKQPPNKQ---IWLSSP----VSGPKRYDLCRAGEWIYLR----DGTSLHELLEE 139 (156)
T ss_pred CceeEEecCCCCcce---eeeeCC----CCCCccccccccCceEEec----CCccHHHHHHH
Confidence 457888855433332 222222 2677888887633332222 22566766654
No 54
>PF14822 Vasohibin: Vasohibin
Probab=27.51 E-value=1.7e+02 Score=19.61 Aligned_cols=61 Identities=21% Similarity=0.251 Sum_probs=34.4
Q ss_pred ceEEEeecCCCCCeeEEEEEecCCCCCCceeEEEEEEeCCc-EEEec---------ccCCCCCHHHHHHHhhhC
Q psy3300 16 GAFMVRESTTKPGCFALSLRVPKEFHHLGIAHYLILRTAKG-YKIKG---------FTKEFSSLTSLITHHSVM 79 (97)
Q Consensus 16 G~fLiR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I~~~~~~-~~l~~---------~~~~F~sl~~Lv~~y~~~ 79 (97)
|.||--.- ..-..|.+++++.- .+...+|+.+.-..+| |...| ....|.+|.+||.-|...
T Consensus 120 ~~ylT~~~-~~~~Rfpi~Fks~~--~g~~~~HIVL~v~~~~~~GalGlSRr~~Lm~k~l~~~sls~li~~y~~s 190 (246)
T PF14822_consen 120 GIYLTAGL-PDVERFPISFKSQF--SGRVFRHIVLGVRCGGKYGALGLSRRSDLMYKPLEFRSLSDLIFDYKES 190 (246)
T ss_pred HHHHcCCc-ccceEeeceeEEEE--cCcEEEEEEEEEEECCeecccccccchhhcccchhhHHHHHHHHHHHHH
Confidence 44544322 22334778887752 2345666654444444 44333 133599999999988654
No 55
>smart00391 MBD Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain
Probab=25.25 E-value=1e+02 Score=16.48 Aligned_cols=25 Identities=20% Similarity=0.138 Sum_probs=20.2
Q ss_pred EEEecccCCCCCHHHHHHHhhhCCC
Q psy3300 57 YKIKGFTKEFSSLTSLITHHSVMPE 81 (97)
Q Consensus 57 ~~l~~~~~~F~sl~~Lv~~y~~~~~ 81 (97)
|++.-+...|.|..|+..|...+..
T Consensus 31 ~Y~sP~GkklRs~~ev~~YL~~~~~ 55 (77)
T smart00391 31 YYISPCGKKLRSKSELARYLHKNGD 55 (77)
T ss_pred EEECCCCCeeeCHHHHHHHHHhCCC
Confidence 6666667789999999999987754
No 56
>PF04485 NblA: Phycobilisome degradation protein nblA ; InterPro: IPR007574 In the cyanobacterium Synechococcus species PCC 7942 (P35087 from SWISSPROT), nblA triggers degradation of light-harvesting phycobiliproteins in response to deprivation nutrients including nitrogen, phosphorus and sulphur. The mechanism of nblA function is not known, but it has been hypothesised that nblA may act by disrupting phycobilisome structure, activating a protease or tagging phycobiliproteins for proteolysis. Members of this family have also been identified in the chloroplasts of some red algae.; PDB: 3CS5_D 1OJH_L 2QDO_B 2Q8V_A.
Probab=24.22 E-value=57 Score=16.37 Aligned_cols=11 Identities=27% Similarity=0.353 Sum_probs=7.0
Q ss_pred CCHHHHHHHhC
Q psy3300 1 MIQEITLEVLS 11 (97)
Q Consensus 1 isr~~ae~~L~ 11 (97)
||+++|.++|.
T Consensus 22 ls~Eqaq~~Lv 32 (53)
T PF04485_consen 22 LSREQAQELLV 32 (53)
T ss_dssp S-HHHHHHHHH
T ss_pred hCHHHHHHHHH
Confidence 57788877663
No 57
>PF03443 Glyco_hydro_61: Glycosyl hydrolase family 61; InterPro: IPR005103 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The only known activity within this family is that of endoglucanase (3.2.1.4 from EC) GH61 from CAZY ; PDB: 4EIS_B 2VTC_A 4EIR_B 3EJA_D 3EII_A.
Probab=23.94 E-value=45 Score=21.62 Aligned_cols=9 Identities=33% Similarity=0.999 Sum_probs=8.0
Q ss_pred CCceEEEee
Q psy3300 14 PVGAFMVRE 22 (97)
Q Consensus 14 ~~G~fLiR~ 22 (97)
++|.||||.
T Consensus 147 ~~G~YLlR~ 155 (218)
T PF03443_consen 147 PPGQYLLRH 155 (218)
T ss_dssp BSEEEEEEE
T ss_pred CCCCceEEe
Confidence 789999995
No 58
>KOG4680|consensus
Probab=23.77 E-value=1.1e+02 Score=18.80 Aligned_cols=24 Identities=29% Similarity=0.560 Sum_probs=17.4
Q ss_pred CCceEEEeecCCC-----CCeeEEEEEec
Q psy3300 14 PVGAFMVRESTTK-----PGCFALSLRVP 37 (97)
Q Consensus 14 ~~G~fLiR~s~~~-----~~~~~Ls~~~~ 37 (97)
.+|.|++-.|..- ||.|.|-++.-
T Consensus 100 epG~f~~~hsq~LPg~tPPG~Y~lkm~~~ 128 (153)
T KOG4680|consen 100 EPGDFLVAHSQVLPGYTPPGSYVLKMTAY 128 (153)
T ss_pred CcCceeeeeeEeccCcCCCceEEEEEEee
Confidence 6899999876544 56788766653
No 59
>cd01229 PH_etc2 Epithelial cell transforming 2 (ECT2) pleckstrin homology (PH) domain. Epithelial cell transforming 2 (ECT2) pleckstrin homology (PH) domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinases, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
Probab=22.17 E-value=1.9e+02 Score=17.26 Aligned_cols=30 Identities=23% Similarity=0.378 Sum_probs=16.2
Q ss_pred EeecCCCCCeeEEEEEecCCCCCCceeEEEE
Q psy3300 20 VRESTTKPGCFALSLRVPKEFHHLGIAHYLI 50 (97)
Q Consensus 20 iR~s~~~~~~~~Ls~~~~~~~~~~~v~h~~I 50 (97)
||+++.-++.|.|-++...+.. +....|.|
T Consensus 80 I~DtE~c~~aF~L~~R~p~d~~-~~L~sF~l 109 (129)
T cd01229 80 IRDTEDCHNAFALLVRPPTEQA-NVLLSFQM 109 (129)
T ss_pred eeccccccceeEEEEeCCccch-heEEEEEe
Confidence 4566555667777777542211 34555554
No 60
>PF06851 DUF1247: Protein of unknown function (DUF1247); InterPro: IPR009657 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf2; it is a family of uncharacterised viral proteins.
Probab=21.26 E-value=62 Score=19.73 Aligned_cols=15 Identities=40% Similarity=0.536 Sum_probs=12.4
Q ss_pred CCCCHHHHHHHhhhC
Q psy3300 65 EFSSLTSLITHHSVM 79 (97)
Q Consensus 65 ~F~sl~~Lv~~y~~~ 79 (97)
-|.||.+|+.||--.
T Consensus 120 gF~s~~~LL~~Yny~ 134 (148)
T PF06851_consen 120 GFESLKDLLTHYNYE 134 (148)
T ss_pred CcccHHHHHHHcCCC
Confidence 499999999998543
No 61
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=20.37 E-value=1.4e+02 Score=15.22 Aligned_cols=22 Identities=27% Similarity=0.596 Sum_probs=15.1
Q ss_pred CCceEEEeecCCCCCeeEEEEEec
Q psy3300 14 PVGAFMVRESTTKPGCFALSLRVP 37 (97)
Q Consensus 14 ~~G~fLiR~s~~~~~~~~Ls~~~~ 37 (97)
..|.|.++.-. +|.|.|.+...
T Consensus 36 ~~G~f~~~~l~--~g~Y~l~v~~~ 57 (82)
T PF13620_consen 36 SDGRFSFEGLP--PGTYTLRVSAP 57 (82)
T ss_dssp TTSEEEEEEE---SEEEEEEEEBT
T ss_pred CCceEEEEccC--CEeEEEEEEEC
Confidence 57888888653 37788877663
No 62
>PF08272 Topo_Zn_Ribbon: Topoisomerase I zinc-ribbon-like ; InterPro: IPR013263 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks []. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis [, ]. DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA []. Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA. This entry represents the C-terminal zinc-ribbon-like domain found in bacterial topoisomerase I (type IA) enzymes. Escherichia coli topoisomerase I proteins contain five copies of a zinc-ribbon-like domain at their C terminus, two of which have lost their cysteine residues and are therefore probably not able to bind zinc []. This domain is still considered to be a member of the zinc-ribbon superfamily despite not being able to bind zinc. More information about this protein can be found at Protein of the Month: DNA Topoisomerase [].; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0006265 DNA topological change, 0005694 chromosome; PDB: 1YUA_A.
Probab=20.15 E-value=7.7 Score=18.50 Aligned_cols=9 Identities=22% Similarity=0.173 Sum_probs=4.0
Q ss_pred CceEEEeec
Q psy3300 15 VGAFMVRES 23 (97)
Q Consensus 15 ~G~fLiR~s 23 (97)
+.-|.||.-
T Consensus 12 ~k~qyv~se 20 (42)
T PF08272_consen 12 TKEQYVRSE 20 (42)
T ss_dssp TTEEEEEE-
T ss_pred CcEEEEecC
Confidence 344555543
Done!