Query psy7177
Match_columns 66
No_of_seqs 191 out of 1007
Neff 7.6
Searched_HMMs 46136
Date Fri Aug 16 20:10:24 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7177.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7177hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd02667 Peptidase_C19K A subfa 99.8 9.5E-21 2.1E-25 122.7 5.3 53 1-53 206-279 (279)
2 cd02673 Peptidase_C19Q A subfa 99.8 1.1E-20 2.4E-25 121.7 5.2 53 1-53 188-245 (245)
3 cd02664 Peptidase_C19H A subfa 99.8 6.7E-21 1.5E-25 125.8 4.1 53 1-53 247-327 (327)
4 cd02662 Peptidase_C19F A subfa 99.8 1.5E-20 3.1E-25 119.9 4.8 53 1-53 167-240 (240)
5 cd02659 peptidase_C19C A subfa 99.8 2.6E-20 5.6E-25 122.1 5.3 56 1-56 256-333 (334)
6 cd02658 Peptidase_C19B A subfa 99.8 5.9E-20 1.3E-24 119.7 6.2 53 1-53 256-311 (311)
7 cd02663 Peptidase_C19G A subfa 99.8 2.8E-20 6E-25 121.4 4.5 51 1-53 241-300 (300)
8 cd02657 Peptidase_C19A A subfa 99.8 2.7E-20 5.9E-25 120.9 4.4 53 1-53 245-305 (305)
9 KOG1865|consensus 99.8 1.1E-19 2.5E-24 126.6 6.7 54 1-55 356-410 (545)
10 cd02665 Peptidase_C19I A subfa 99.8 7.3E-20 1.6E-24 117.3 4.6 53 1-53 168-228 (228)
11 PF00443 UCH: Ubiquitin carbox 99.8 1.6E-19 3.5E-24 112.6 5.1 52 1-52 215-269 (269)
12 cd02668 Peptidase_C19L A subfa 99.8 1.7E-19 3.7E-24 118.7 5.1 53 1-53 250-324 (324)
13 cd02670 Peptidase_C19N A subfa 99.8 2.5E-19 5.3E-24 115.6 5.1 53 1-53 171-241 (241)
14 COG5560 UBP12 Ubiquitin C-term 99.8 2.7E-19 5.8E-24 126.9 5.3 56 1-56 768-823 (823)
15 cd02669 Peptidase_C19M A subfa 99.8 3E-19 6.5E-24 122.3 5.3 53 1-53 387-440 (440)
16 cd02674 Peptidase_C19R A subfa 99.8 5.5E-19 1.2E-23 110.5 6.0 53 1-53 178-230 (230)
17 cd02660 Peptidase_C19D A subfa 99.8 5.7E-19 1.2E-23 115.5 5.6 52 1-53 277-328 (328)
18 cd02661 Peptidase_C19E A subfa 99.8 8.9E-19 1.9E-23 112.8 5.2 52 1-53 252-304 (304)
19 KOG1868|consensus 99.7 2.9E-18 6.3E-23 122.2 6.4 60 1-60 592-651 (653)
20 cd02257 Peptidase_C19 Peptidas 99.7 3E-18 6.6E-23 105.1 5.5 53 1-53 197-255 (255)
21 cd02671 Peptidase_C19O A subfa 99.7 1.2E-17 2.6E-22 111.3 4.0 47 1-53 276-332 (332)
22 KOG1870|consensus 99.7 6.9E-18 1.5E-22 122.9 2.9 56 1-56 787-842 (842)
23 cd02672 Peptidase_C19P A subfa 99.7 5.7E-17 1.2E-21 105.6 5.4 47 1-53 217-268 (268)
24 KOG1867|consensus 99.7 1.8E-17 3.9E-22 115.4 2.8 56 1-58 432-487 (492)
25 cd02666 Peptidase_C19J A subfa 99.6 4.6E-16 1E-20 104.4 4.7 53 1-53 285-343 (343)
26 COG5533 UBP5 Ubiquitin C-termi 99.6 2E-15 4.3E-20 101.0 5.8 53 1-55 361-414 (415)
27 KOG4598|consensus 99.5 2.6E-15 5.7E-20 108.2 1.8 59 1-59 374-445 (1203)
28 KOG1866|consensus 99.5 6.9E-16 1.5E-20 111.0 -1.9 59 1-59 357-438 (944)
29 COG5077 Ubiquitin carboxyl-ter 99.5 1.6E-14 3.5E-19 104.6 1.3 61 1-61 435-517 (1089)
30 KOG1873|consensus 99.4 1.3E-14 2.7E-19 104.5 0.1 55 1-55 803-877 (877)
31 KOG0944|consensus 99.4 4.9E-13 1.1E-17 95.6 4.7 51 1-55 711-762 (763)
32 KOG1864|consensus 99.3 3E-12 6.6E-17 91.0 4.4 54 1-55 515-572 (587)
33 COG5207 UBP14 Isopeptidase T [ 99.3 4.1E-12 8.9E-17 89.5 4.2 55 1-55 692-749 (749)
34 KOG1872|consensus 99.3 2.7E-12 5.9E-17 88.8 3.2 55 1-56 408-470 (473)
35 KOG1871|consensus 98.8 1.6E-08 3.4E-13 69.3 5.2 56 1-56 360-420 (420)
36 KOG1863|consensus 98.7 4.7E-09 1E-13 78.9 2.4 61 1-61 415-491 (1093)
37 PF13423 UCH_1: Ubiquitin carb 98.6 7.3E-08 1.6E-12 63.0 4.3 34 1-34 259-295 (295)
38 KOG1275|consensus 97.6 0.00022 4.7E-09 53.7 6.5 52 1-52 795-860 (1118)
39 KOG2026|consensus 96.9 4.8E-05 1E-09 52.6 -3.2 47 9-55 395-441 (442)
40 PF05408 Peptidase_C28: Foot-a 93.7 0.2 4.2E-06 31.9 4.5 47 5-58 130-178 (193)
41 PF15499 Peptidase_C98: Ubiqui 93.6 0.081 1.7E-06 35.2 2.8 22 11-33 231-252 (275)
42 PF01473 CW_binding_1: Putativ 89.6 0.35 7.5E-06 19.3 1.6 14 17-32 3-16 (19)
43 PF00770 Peptidase_C5: Adenovi 76.1 4.3 9.3E-05 25.7 3.1 31 1-31 22-53 (183)
44 PF08770 SoxZ: Sulphur oxidati 71.5 7.1 0.00015 22.1 3.1 41 13-60 41-81 (100)
45 PF08715 Viral_protease: Papai 69.7 6.4 0.00014 26.9 3.0 33 4-41 266-298 (320)
46 PF02099 Josephin: Josephin; 61.2 17 0.00037 22.2 3.5 28 1-34 101-128 (157)
47 smart00460 TGc Transglutaminas 59.9 18 0.00039 17.9 3.0 20 10-31 47-66 (68)
48 PF05986 ADAM_spacer1: ADAM-TS 54.4 39 0.00085 19.2 4.2 51 11-66 37-88 (114)
49 cd05860 Ig4_SCFR Fourth immuno 53.9 21 0.00046 20.4 2.9 15 8-22 74-88 (101)
50 cd05896 Ig1_IL1RAPL-1_like Fir 51.4 11 0.00024 21.7 1.4 13 9-21 80-92 (104)
51 cd04480 RPA1_DBD_A_like RPA1_D 49.4 40 0.00087 17.9 3.9 32 23-54 51-82 (86)
52 PF09149 DUF1935: Domain of un 47.5 14 0.00031 21.1 1.5 15 22-36 30-44 (104)
53 PF01841 Transglut_core: Trans 47.3 24 0.00053 19.1 2.4 20 10-30 93-112 (113)
54 COG0093 RplN Ribosomal protein 46.9 38 0.00082 20.2 3.2 18 23-40 73-90 (122)
55 cd00524 SORL Superoxide reduct 45.1 50 0.0011 17.8 6.5 40 8-56 29-68 (86)
56 PF14509 GH97_C: Glycosyl-hydr 44.5 18 0.0004 20.5 1.7 38 11-48 13-55 (103)
57 cd05751 Ig1_LILRB1_like First 44.4 21 0.00045 19.1 1.8 12 10-21 64-75 (91)
58 PHA02131 hypothetical protein 41.3 31 0.00068 18.0 2.1 26 10-40 21-46 (70)
59 PRK08571 rpl14p 50S ribosomal 39.6 55 0.0012 19.7 3.2 19 22-40 83-101 (132)
60 cd03172 SORL_classII Superoxid 39.2 73 0.0016 18.0 5.1 41 9-56 38-80 (104)
61 PTZ00054 60S ribosomal protein 38.8 56 0.0012 19.8 3.2 19 22-40 90-108 (139)
62 cd05750 Ig_Pro_neuregulin Immu 38.0 55 0.0012 16.1 2.8 14 9-22 53-66 (75)
63 cd04474 RPA1_DBD_A RPA1_DBD_A: 37.7 75 0.0016 17.6 3.8 32 23-54 67-98 (104)
64 PHA01365 hypothetical protein 36.9 41 0.00088 18.8 2.2 25 28-52 15-41 (91)
65 PF01088 Peptidase_C12: Ubiqui 36.8 28 0.0006 22.2 1.8 24 12-37 166-189 (214)
66 PF12252 SidE: Dot/Icm substra 36.3 52 0.0011 26.7 3.3 24 8-32 59-82 (1439)
67 cd05756 Ig1_IL1R_like First im 36.3 32 0.0007 18.7 1.8 14 8-21 69-82 (94)
68 cd05747 Ig5_Titin_like M5, fif 36.1 63 0.0014 17.0 2.9 21 8-28 68-88 (92)
69 PF02721 DUF223: Domain of unk 35.3 62 0.0013 17.6 2.9 35 22-56 29-63 (95)
70 cd05855 Ig_TrkB_d5 Fifth domai 34.9 46 0.001 17.7 2.2 15 8-22 53-67 (79)
71 PF10572 UPF0556: Uncharacteri 34.8 80 0.0017 19.6 3.5 26 8-33 69-94 (158)
72 PRK05483 rplN 50S ribosomal pr 33.2 72 0.0016 18.8 3.0 19 22-40 72-90 (122)
73 TIGR03673 rpl14p_arch 50S ribo 33.0 78 0.0017 19.0 3.1 19 22-40 82-100 (131)
74 PF04473 DUF553: Transglutamin 32.7 81 0.0018 19.2 3.3 19 12-32 109-127 (153)
75 PRK04233 hypothetical protein; 32.1 38 0.00081 20.2 1.7 9 23-31 120-128 (129)
76 COG3012 Uncharacterized protei 31.8 35 0.00075 21.0 1.5 12 23-34 119-130 (151)
77 cd05854 Ig6_Contactin-2 Sixth 31.5 48 0.001 17.5 2.0 14 9-22 57-70 (85)
78 cd05744 Ig_Myotilin_C_like Imm 30.5 52 0.0011 16.8 2.0 15 8-22 50-64 (75)
79 TIGR03834 EAGR_box EAGR box. T 30.2 58 0.0013 14.4 1.7 19 7-31 2-23 (28)
80 cd07702 Ig2_VEGFR-1 Second imm 30.1 64 0.0014 16.8 2.3 16 8-23 45-60 (72)
81 PF05717 TnpB_IS66: IS66 Orf2 29.7 88 0.0019 17.8 2.9 38 8-58 31-68 (107)
82 cd04976 Ig2_VEGFR Second immun 29.4 48 0.001 16.8 1.7 15 8-22 44-58 (71)
83 cd04971 Ig_TrKABC_d5 Fifth dom 29.1 95 0.0021 16.2 2.9 15 8-22 55-69 (81)
84 cd05735 Ig8_DSCAM Eight immuno 28.9 62 0.0013 17.1 2.1 15 8-22 56-70 (88)
85 PF02750 Synapsin_C: Synapsin, 28.6 32 0.0007 22.2 1.1 17 11-27 107-123 (203)
86 cd05869 Ig5_NCAM-1 Fifth immun 28.6 1E+02 0.0022 16.4 3.0 14 9-22 73-86 (97)
87 PRK00183 hypothetical protein; 28.0 42 0.00092 20.7 1.5 11 23-33 120-130 (157)
88 KOG1415|consensus 27.9 70 0.0015 20.9 2.5 24 12-37 160-183 (222)
89 PRK10579 hypothetical protein; 27.8 1.2E+02 0.0027 17.2 4.5 36 17-52 53-93 (94)
90 cd07693 Ig1_Robo First immunog 27.6 56 0.0012 16.9 1.8 14 8-21 73-86 (100)
91 TIGR01067 rplN_bact ribosomal 27.3 72 0.0016 18.8 2.3 19 22-40 72-90 (122)
92 cd04967 Ig1_Contactin First Ig 27.1 58 0.0013 17.1 1.8 14 9-22 69-82 (91)
93 cd05895 Ig_Pro_neuregulin-1 Im 27.1 65 0.0014 16.2 1.9 15 8-22 53-67 (76)
94 cd05863 Ig2_VEGFR-3 Second imm 26.9 55 0.0012 16.7 1.6 15 8-22 40-54 (67)
95 cd05730 Ig3_NCAM-1_like Third 26.4 63 0.0014 17.0 1.9 15 8-22 67-81 (95)
96 PF06680 DUF1181: Protein of u 26.3 71 0.0015 18.4 2.1 25 10-34 43-67 (120)
97 PF13529 Peptidase_C39_2: Pept 26.3 64 0.0014 17.7 2.0 23 8-31 120-142 (144)
98 cd04974 Ig3_FGFR Third immunog 26.2 1E+02 0.0022 16.2 2.7 14 9-22 65-78 (90)
99 cd05893 Ig_Palladin_C C-termin 25.8 65 0.0014 16.6 1.8 13 9-21 51-63 (75)
100 cd05740 Ig_CEACAM_D4 Fourth im 25.5 67 0.0015 17.2 1.9 15 8-22 64-78 (91)
101 cd05746 Ig4_Peroxidasin Fourth 25.4 60 0.0013 16.1 1.6 15 8-22 45-59 (69)
102 cd05892 Ig_Myotilin_C C-termin 24.8 70 0.0015 16.6 1.8 15 8-22 50-64 (75)
103 COG5263 FOG: Glucan-binding do 24.4 1.1E+02 0.0025 21.0 3.1 15 23-37 221-235 (313)
104 cd04969 Ig5_Contactin_like Fif 24.2 63 0.0014 16.1 1.6 14 8-21 48-61 (73)
105 cd05876 Ig3_L1-CAM Third immun 24.2 69 0.0015 16.0 1.7 15 8-22 45-59 (71)
106 cd04978 Ig4_L1-NrCAM_like Four 24.1 71 0.0015 15.8 1.7 13 9-21 51-63 (76)
107 cd05712 Ig_Siglec_N Immunoglob 24.1 1.4E+02 0.0031 16.6 3.8 25 9-33 90-114 (119)
108 cd05852 Ig5_Contactin-1 Fifth 23.9 74 0.0016 16.2 1.8 15 8-22 48-62 (73)
109 cd05853 Ig6_Contactin-4 Sixth 23.9 61 0.0013 17.5 1.5 14 9-22 57-70 (85)
110 cd05736 Ig2_Follistatin_like S 23.6 86 0.0019 15.6 2.0 14 8-21 48-61 (76)
111 COG1998 RPS31 Ribosomal protei 23.2 94 0.002 15.7 2.0 17 24-40 5-21 (51)
112 cd05865 Ig1_NCAM-1 First immun 22.9 1.2E+02 0.0027 16.4 2.6 14 8-21 69-82 (96)
113 PF08053 Tna_leader: Tryptopha 22.9 52 0.0011 13.7 0.8 12 23-34 9-20 (24)
114 cd04973 Ig1_FGFR First immunog 22.7 88 0.0019 16.0 2.0 14 8-21 54-67 (79)
115 PF10949 DUF2777: Protein of u 22.7 1.8E+02 0.0039 18.5 3.6 19 23-41 25-43 (185)
116 PRK01842 hypothetical protein; 22.7 86 0.0019 19.2 2.1 10 23-32 137-146 (149)
117 cd05856 Ig2_FGFRL1-like Second 22.6 82 0.0018 15.6 1.8 15 8-22 57-71 (82)
118 cd05879 Ig_P0 Immunoglobulin ( 22.6 1.1E+02 0.0025 17.4 2.5 15 8-22 89-103 (116)
119 cd05894 Ig_C5_MyBP-C C5 immuno 22.3 88 0.0019 16.4 1.9 15 8-22 61-75 (86)
120 cd05864 Ig2_VEGFR-2 Second imm 22.3 65 0.0014 16.5 1.4 15 8-22 43-57 (70)
121 PRK09806 tryptophanase leader 21.8 63 0.0014 13.8 1.0 13 23-35 9-21 (26)
122 PRK01617 hypothetical protein; 21.4 69 0.0015 19.6 1.5 15 17-33 117-131 (154)
123 cd05734 Ig7_DSCAM Seventh immu 21.3 1E+02 0.0022 15.6 2.0 14 8-21 53-66 (79)
124 cd05725 Ig3_Robo Third immunog 21.1 90 0.002 15.3 1.7 14 8-21 44-57 (69)
125 CHL00057 rpl14 ribosomal prote 21.1 1.2E+02 0.0026 17.9 2.4 20 22-41 72-91 (122)
126 cd05868 Ig4_NrCAM Fourth immun 21.0 91 0.002 15.8 1.8 14 9-22 51-64 (76)
127 PF06865 DUF1255: Protein of u 20.9 1.8E+02 0.0038 16.5 4.1 37 17-53 53-94 (94)
128 PF05766 NinG: Bacteriophage L 20.8 33 0.00072 21.8 0.0 26 5-34 97-122 (189)
129 PTZ00320 ribosomal protein L14 20.7 1.4E+02 0.003 19.1 2.8 20 22-41 138-157 (188)
130 KOG1296|consensus 20.7 56 0.0012 20.3 1.0 29 8-37 120-148 (161)
131 cd05723 Ig4_Neogenin Fourth im 20.6 1E+02 0.0023 15.3 1.9 14 8-21 46-59 (71)
132 PF07206 Baculo_LEF-10: Baculo 20.4 1.4E+02 0.003 16.2 2.4 18 44-61 22-39 (71)
133 cd05718 Ig1_PVR_like First imm 20.2 1.5E+02 0.0033 15.4 2.7 19 9-27 72-90 (98)
No 1
>cd02667 Peptidase_C19K A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.82 E-value=9.5e-21 Score=122.70 Aligned_cols=53 Identities=32% Similarity=0.541 Sum_probs=49.3
Q ss_pred CEEEeeecCCCCeEEEEEECCC---------------------CCcEEEEcCceEEEcCCCcCCCCCeEEEEEE
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPN---------------------TKEWHYFNDETVLEQRPQDEDFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~---------------------~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~ 53 (66)
|||+|.|+.++|||+||+|... +++|++|||+.|++++.+++.+.+||||||+
T Consensus 206 ~vi~H~G~~~~GHY~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~W~~~dD~~V~~v~~~~v~~~~aYiLfYe 279 (279)
T cd02667 206 GVVEHSGTMRSGHYVAYVKVRPPQQRLSDLTKSKPAADEAGPGSGQWYYISDSDVREVSLEEVLKSEAYLLFYE 279 (279)
T ss_pred EEEEEeCCCCCCEeEEEEEcCccccccccccccccccccCCCCCCcEEEEECCccEECCHHHhccCCcEEEEeC
Confidence 7999999779999999999854 6899999999999999999999999999995
No 2
>cd02673 Peptidase_C19Q A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.82 E-value=1.1e-20 Score=121.70 Aligned_cols=53 Identities=25% Similarity=0.479 Sum_probs=48.5
Q ss_pred CEEEeee-cCCCCeEEEEEECCC-CCcEEEEcCceEEEcCCCcCC---CCCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPN-TKEWHYFNDETVLEQRPQDED---FNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~-~~~W~~fdD~~V~~~~~~~~~---~~~aYiLfY~ 53 (66)
|||+|.| ++++|||+||+|... +++||+|||+.|++++.+++. +++||||||+
T Consensus 188 ~VV~H~G~~~~~GHY~a~vk~~~~~~~Wy~fnD~~V~~v~~~~v~~~~~~~aYiLFY~ 245 (245)
T cd02673 188 AVICHLGESPYDGHYIAYTKELYNGSSWLYCSDDEIRPVSKNDVSTNARSSGYLIFYD 245 (245)
T ss_pred EEEEECCCCCCCceEEEEEEcCCCCCeEEEeeCceeeEcCHHHHhhccCCceEEEEEC
Confidence 6999999 689999999999875 689999999999999999876 6899999995
No 3
>cd02664 Peptidase_C19H A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.82 E-value=6.7e-21 Score=125.83 Aligned_cols=53 Identities=32% Similarity=0.551 Sum_probs=47.4
Q ss_pred CEEEeee-cCCCCeEEEEEECCC--------------------CCcEEEEcCceEEEcCCCcCCC-------CCeEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPN--------------------TKEWHYFNDETVLEQRPQDEDF-------NNAYILFY 52 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~--------------------~~~W~~fdD~~V~~~~~~~~~~-------~~aYiLfY 52 (66)
|||+|.| ++++|||+||+|... .++|++|||+.|++++.+++.+ .+||||||
T Consensus 247 ~Vi~H~G~~~~~GHY~a~~r~~~~~~~~~~~~~~~~~~~~~~~~~~W~~fnD~~V~~~~~~~v~~~~~~~~~~~aYlLfY 326 (327)
T cd02664 247 AVVVHSGYSSESGHYFTYARDQTDADSTGQECPEPKDAEENDESKNWYLFNDSRVTFSSFESVQNVTSRFPKDTPYILFY 326 (327)
T ss_pred EEEEEccCCCCCcceEEEEecCCccccccccccccccccccCCCCCEEEEeCCceEECCHHHHHHhhCCCCCCCEEEEEe
Confidence 7999999 689999999999863 3789999999999999887754 89999999
Q ss_pred E
Q psy7177 53 K 53 (66)
Q Consensus 53 ~ 53 (66)
+
T Consensus 327 ~ 327 (327)
T cd02664 327 E 327 (327)
T ss_pred C
Confidence 5
No 4
>cd02662 Peptidase_C19F A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.82 E-value=1.5e-20 Score=119.95 Aligned_cols=53 Identities=32% Similarity=0.590 Sum_probs=49.0
Q ss_pred CEEEeeecCCCCeEEEEEECC--------------------CCCcEEEEcCceEEEcCCCcC-CCCCeEEEEEE
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHP--------------------NTKEWHYFNDETVLEQRPQDE-DFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~--------------------~~~~W~~fdD~~V~~~~~~~~-~~~~aYiLfY~ 53 (66)
|||+|.|+.++|||++|+|.+ ..++||+|||+.|++++.+++ .+.+||||||+
T Consensus 167 avi~H~G~~~~GHY~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~W~~fnD~~V~~v~~~~v~~~~~aY~LfYe 240 (240)
T cd02662 167 AVVVHYGSHSSGHYVCYRRKPLFSKDKEPGSFVRMREGPSSTSHPWWRISDTTVKEVSESEVLEQKSAYMLFYE 240 (240)
T ss_pred EEEEEeccCCCceEEEEEeCCCcccccccccccccccccCccCCCEEEEechheEEeCHHHHhhCCCEEEEEeC
Confidence 799999987999999999986 348999999999999999998 89999999995
No 5
>cd02659 peptidase_C19C A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.81 E-value=2.6e-20 Score=122.08 Aligned_cols=56 Identities=32% Similarity=0.634 Sum_probs=50.0
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCC----------------------CCCeEEEEEEEcC
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDED----------------------FNNAYILFYKKRD 56 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~----------------------~~~aYiLfY~r~~ 56 (66)
|||+|.|+.++|||++++|...+++|+.|||+.|++++.+++. +.+||||||+|++
T Consensus 256 ~vI~H~G~~~~GHY~~~vk~~~~~~W~~~nD~~V~~i~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~ay~l~Y~~~~ 333 (334)
T cd02659 256 GVLVHSGDAHGGHYYSYIKDRDDGKWYKFNDDVVTPFDPNDAEEECFGGEETQKTYDSGPRAFKRTTNAYMLFYERKS 333 (334)
T ss_pred EEEEecCCCCCCCeEEEEECCCCCceEEEeCcccEECCHHHHHHHcCCCccccccccccccccccccceEEEEEEEeC
Confidence 7999999999999999999976799999999999999877652 4679999999976
No 6
>cd02658 Peptidase_C19B A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.81 E-value=5.9e-20 Score=119.71 Aligned_cols=53 Identities=30% Similarity=0.521 Sum_probs=49.0
Q ss_pred CEEEeee-cCCCCeEEEEEECC--CCCcEEEEcCceEEEcCCCcCCCCCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHP--NTKEWHYFNDETVLEQRPQDEDFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~--~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~ 53 (66)
|||+|.| ++++|||++++|.. .+++|++|||+.|++++..++..++||||||+
T Consensus 256 ~vI~H~G~~~~~GHY~~~vk~~~~~~~~W~~fnD~~V~~~~~~~~~~~~~YilfY~ 311 (311)
T cd02658 256 AFISHKGTSVHSGHYVAHIKKEIDGEGKWVLFNDEKVVASQDPPEMKKLGYIYFYQ 311 (311)
T ss_pred EEEEccCCCCCCcceEEEEeCCCCCCCCEEEecCceeEECCcccccCCcceEEEEC
Confidence 7999999 69999999999986 34899999999999999999999999999995
No 7
>cd02663 Peptidase_C19G A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.81 E-value=2.8e-20 Score=121.42 Aligned_cols=51 Identities=31% Similarity=0.535 Sum_probs=46.6
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCC--------CCCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDED--------FNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~--------~~~aYiLfY~ 53 (66)
|||+|.| ++++|||++|+|. +++|++|||+.|++++.+++. +.+||||||+
T Consensus 241 ~vi~H~G~~~~~GHY~a~~k~--~~~W~~fdD~~V~~~~~~~v~~~~~~~~~~~~aYiLfY~ 300 (300)
T cd02663 241 AVVVHIGGGPNHGHYVSIVKS--HGGWLLFDDETVEKIDENAVEEFFGDSPNQATAYVLFYQ 300 (300)
T ss_pred EEEEEecCCCCCCceEEEEEC--CCcEEEEcCCceEEcCHHHHHHhcCCCCCCCceEEEEeC
Confidence 7999999 5999999999999 789999999999999887764 6899999995
No 8
>cd02657 Peptidase_C19A A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyse bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.81 E-value=2.7e-20 Score=120.92 Aligned_cols=53 Identities=32% Similarity=0.543 Sum_probs=48.0
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCC-------CCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDF-------NNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~-------~~aYiLfY~ 53 (66)
|||+|.| ++++|||++|+|....++|++|||+.|++++.+++.. .+||||||+
T Consensus 245 ~vI~H~G~~~~~GHY~~~~~~~~~~~W~~fdD~~V~~~~~~~v~~~~~~~~~~~aYiL~Y~ 305 (305)
T cd02657 245 AVITHQGRSADSGHYVAWVRRKNDGKWIKFDDDKVSEVTEEDILKLSGGGDWHIAYILLYK 305 (305)
T ss_pred EEEEecCCCCCCcEEEEEEEcCCCCeEEEEECCceEEeCHHHHHhhcCCCCCceEEEEEEC
Confidence 7999999 6999999999999755899999999999999888753 699999995
No 9
>KOG1865|consensus
Probab=99.80 E-value=1.1e-19 Score=126.59 Aligned_cols=54 Identities=24% Similarity=0.457 Sum_probs=52.1
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEc
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKR 55 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~ 55 (66)
|||+|.| +..+|||++|||.. +|+||+|||+.|+.++.+.+.++.||||||.|.
T Consensus 356 avlVH~g~~~~~GHY~cYvks~-~g~Wy~~DDS~V~~~~~~~VLsq~AYmLfY~R~ 410 (545)
T KOG1865|consen 356 AVLVHLGTSCHSGHYFCYVKSQ-NGQWYKMDDSEVTQSSIESVLSQQAYILFYARK 410 (545)
T ss_pred EEEEeccccccCCceEEEEEcC-CCceEEccCceeeeccccceecccceEEEEEee
Confidence 7999999 99999999999997 789999999999999999999999999999997
No 10
>cd02665 Peptidase_C19I A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.79 E-value=7.3e-20 Score=117.28 Aligned_cols=53 Identities=30% Similarity=0.506 Sum_probs=47.9
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCC--------CCCeEEEEEE
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDED--------FNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~--------~~~aYiLfY~ 53 (66)
|||+|.|++++|||++|+|+..+++|++|||+.|++++.+++. ..+||||||.
T Consensus 168 aVi~H~G~~~~GHY~~~i~~~~~~~W~~fdD~~V~~~~~~~v~~~~fGg~~~~~AYiLfYv 228 (228)
T cd02665 168 AVLVHEGQANAGHYWAYIYKQSRQEWEKYNDISVTESSWEEVERDSFGGGRNPSAYCLMYI 228 (228)
T ss_pred EEEEecCCCCCCEEEEEEEcCCCCEEEEEECCeeEEcCHHHHhhhccCCCCCCceEEEEEC
Confidence 7999999999999999999877899999999999999887764 4589999994
No 11
>PF00443 UCH: Ubiquitin carboxyl-terminal hydrolase; InterPro: IPR001394 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C19 (ubiquitin-specific protease family, clan CA). Families within the CA clan are loosely termed papain-like as protein fold of the peptidase unit resembles that of papain, the type example for clan CA. Predicted active site residues for members of this family and family C1 occur in the same order in the sequence: N/Q, C, H. The type example is human ubiquitin-specific protease 14. Ubiquitin is highly conserved, commonly found conjugated to proteins in eukaryotic cells, where it may act as a marker for rapid degradation, or it may have a chaperone function in protein assembly []. The ubiquitin is released by cleavage from the bound protein by a protease []. A number of deubiquitinising proteases are known: all are activated by thiol compounds [, ], and inhibited by thiol-blocking agents and ubiquitin aldehyde [, ], and as such have the properties of cysteine proteases []. The deubiquitinsing proteases can be split into 2 size ranges (20-30 kDa, IPR001578 from INTERPRO, and 100-200 kDa) []: this family are the 100-200 kDa peptides which includes the Ubp1 ubiquitin peptidase from yeast. Only one conserved cysteine can be identified, along with two conserved histidines. The spacing between the cysteine and the second histidine is thought to be more representative of the cysteine/histidine spacing of a cysteine protease catalytic dyad [].; GO: 0004221 ubiquitin thiolesterase activity, 0006511 ubiquitin-dependent protein catabolic process; PDB: 2LBC_A 3MHH_A 3MHS_A 3M99_A 2Y6E_D 2VHF_A 2HD5_A 3NHE_A 2IBI_A 1NBF_B ....
Probab=99.79 E-value=1.6e-19 Score=112.56 Aligned_cols=52 Identities=29% Similarity=0.522 Sum_probs=48.5
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCC---CCeEEEEE
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDF---NNAYILFY 52 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~---~~aYiLfY 52 (66)
|||+|.|+.++|||+|++|+..+++|++|||+.|++++.+++.. .+||||||
T Consensus 215 avi~H~G~~~~GHY~a~v~~~~~~~W~~~dD~~v~~~~~~~v~~~~~~~~yll~Y 269 (269)
T PF00443_consen 215 AVIVHYGSADSGHYVAYVRDSDDGKWYKFDDSRVTEVSWEEVIKSSNSTAYLLFY 269 (269)
T ss_dssp EEEEEESSTTSEEEEEEEEETTTTEEEEEETTEEEEESHHHHCCGGSTCEEEEEE
T ss_pred hhhccccccccceEEEeeccccCCeEEEeeCCceEECCHHHHhhccCCceEEEeC
Confidence 69999999999999999999766779999999999999988877 99999999
No 12
>cd02668 Peptidase_C19L A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.78 E-value=1.7e-19 Score=118.71 Aligned_cols=53 Identities=34% Similarity=0.464 Sum_probs=46.5
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCc---------------------CCCCCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQD---------------------EDFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~---------------------~~~~~aYiLfY~ 53 (66)
|||+|.| ++++|||++++|+..+++|++|||+.|++++... +.+..||||||+
T Consensus 250 ~vI~H~G~~~~~GHY~~~~k~~~~~~W~~fdD~~V~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~l~y~ 324 (324)
T cd02668 250 GVLIHQGVSAYSGHYIAHIKDEQTGEWYKFNDEDVEEMPGKPLKLGNSEDPAKPRKSEIKKGTHSSRTAYMLVYK 324 (324)
T ss_pred EEEEEcCCCCCCEeeEEEEECCCCCcEEEEECCceEEcCHHHhhcccccccccccccccCCCccccCceEEEEeC
Confidence 6999999 6899999999999767899999999999986553 246899999995
No 13
>cd02670 Peptidase_C19N A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.78 E-value=2.5e-19 Score=115.61 Aligned_cols=53 Identities=23% Similarity=0.523 Sum_probs=45.5
Q ss_pred CEEEeee-cCCCCeEEEEEECCC-----------CCcEEEEcCceEEEcCCC------cCCCCCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPN-----------TKEWHYFNDETVLEQRPQ------DEDFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~-----------~~~W~~fdD~~V~~~~~~------~~~~~~aYiLfY~ 53 (66)
|||+|.| ++++|||+||||... +++|++|||..|+.+... .+.+++||||||+
T Consensus 171 aVi~H~G~s~~sGHYva~vr~~~~~~~~~~~~~~~~~W~~FDD~~v~~~~~~~~~~~~~~~~~~aYmLFYq 241 (241)
T cd02670 171 SAVCHRGTSLETGHYVAFVRYGSYSLTETDNEAYNAQWVFFDDMADRDGVSNGFNIPAARLLEDPYMLFYQ 241 (241)
T ss_pred EEEEeCCCCCCCcCeEEEEECCcccccccccCCCCCeEEEecCcccccccccccccchhcccCCceEEEeC
Confidence 6999999 699999999999975 379999999988776433 4578999999996
No 14
>COG5560 UBP12 Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones]
Probab=99.78 E-value=2.7e-19 Score=126.91 Aligned_cols=56 Identities=34% Similarity=0.678 Sum_probs=52.8
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcC
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRD 56 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~ 56 (66)
||=+|.|.+.+||||||+|+..+++||+|||++|+++++++..++.||+|||+|+.
T Consensus 768 avDNHygglsgGHYtAyarn~~n~~wy~fdDsritevdped~vtssaYvLFyrrk~ 823 (823)
T COG5560 768 AVDNHYGGLSGGHYTAYARNFANNGWYLFDDSRITEVDPEDSVTSSAYVLFYRRKS 823 (823)
T ss_pred eccccccccCCcceeeeeecccCCceEEecCccccccCccccccceeEEEEEEecC
Confidence 46689999999999999999999999999999999999999999999999999974
No 15
>cd02669 Peptidase_C19M A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.78 E-value=3e-19 Score=122.30 Aligned_cols=53 Identities=21% Similarity=0.295 Sum_probs=50.2
Q ss_pred CEEEeeecC-CCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEE
Q psy7177 1 MCVRPSKGV-SAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G~~-~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~ 53 (66)
|||+|.|++ ++|||++++|+..+++||+|||+.|++++.+.+..++||||||+
T Consensus 387 avI~H~G~~~~sGHY~a~v~~~~~~~W~~fdD~~V~~v~~~~v~~~eaYll~Y~ 440 (440)
T cd02669 387 ANIVHEGTPQEDGTWRVQLRHKSTNKWFEIQDLNVKEVLPQLIFLSESYIQIWE 440 (440)
T ss_pred EEEEEeccCCCCeeEEEEEEcCCCCeEEEEECCeeeEcCHHHhccCCceEEEeC
Confidence 799999977 99999999998768999999999999999999999999999995
No 16
>cd02674 Peptidase_C19R A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.77 E-value=5.5e-19 Score=110.47 Aligned_cols=53 Identities=34% Similarity=0.593 Sum_probs=49.6
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEE
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~ 53 (66)
|||+|.|..++|||++++|...+++|+.|||+.|++++.+++...+||||||+
T Consensus 178 ~vI~H~G~~~~GHY~~~~~~~~~~~W~~fnD~~V~~i~~~~~~~~~~YlL~Y~ 230 (230)
T cd02674 178 AVVNHYGSLNGGHYTAYCKNNETNDWYKFDDSRVTKVSESSVVSSSAYILFYE 230 (230)
T ss_pred EEEEeeCCCCCcEEEEEEECCCCCceEEEcCCeEEEcCHHHccCCCceEEEeC
Confidence 69999997799999999999766899999999999999999989999999995
No 17
>cd02660 Peptidase_C19D A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.77 E-value=5.7e-19 Score=115.51 Aligned_cols=52 Identities=29% Similarity=0.549 Sum_probs=48.9
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEE
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~ 53 (66)
|||+|.|+..+|||++++|.. +++|++|||+.|++++.+++...+||||||.
T Consensus 277 avi~H~G~~~~GHY~~~~~~~-~~~W~~~nD~~V~~~~~~~v~~~~ayil~Y~ 328 (328)
T cd02660 277 AVVVHKGTLDTGHYTAYCRQG-DGQWFKFDDAMITRVSEEEVLKSQAYLLFYH 328 (328)
T ss_pred EEEEeeccCCCCcEEEEEECC-CCcEEEEECCeeEECCHHHhcCCCcEEEEeC
Confidence 799999988999999999996 4899999999999999999999999999994
No 18
>cd02661 Peptidase_C19E A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.76 E-value=8.9e-19 Score=112.75 Aligned_cols=52 Identities=27% Similarity=0.420 Sum_probs=48.3
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~ 53 (66)
|||+|.| .+++|||++++|.. +++|++|||+.|++++.+++...+||||||.
T Consensus 252 ~vi~H~G~~~~~GHY~~~~~~~-~~~W~~~nD~~V~~v~~~~v~~~~aYil~Y~ 304 (304)
T cd02661 252 AVLVHSGFSPHSGHYYCYVKSS-NGKWYNMDDSKVSPVSIETVLSQKAYILFYI 304 (304)
T ss_pred EEEEECCCCCCCcCCEEEEECC-CCCEEEEeCCeeEECCHHHhcCCCcEEEEeC
Confidence 6999999 55999999999985 6899999999999999999999999999994
No 19
>KOG1868|consensus
Probab=99.74 E-value=2.9e-18 Score=122.19 Aligned_cols=60 Identities=32% Similarity=0.546 Sum_probs=55.1
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcCCCCC
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRDEDSD 60 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~~~~~ 60 (66)
||++|.|++.+|||||+|+....+.|+.|||+.|+.+....+.++.||||||+|....+.
T Consensus 592 aVv~H~Gtl~sGHYta~~~~~~~~~W~~fdDs~Vs~~~~~~~~~s~aYIlFY~~~~~~~~ 651 (653)
T KOG1868|consen 592 AVVNHSGTLNSGHYTAYVYKNEKQRWFTFDDSEVSPISETDVGSSSAYILFYERLGIFEE 651 (653)
T ss_pred EEEeccCcccCCceEEEEeecCCCceEEecCeeeeccccccccCCCceEEEeecCCcccc
Confidence 699999999999999999887778999999999999999888899999999999876553
No 20
>cd02257 Peptidase_C19 Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyse bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.74 E-value=3e-18 Score=105.10 Aligned_cols=53 Identities=38% Similarity=0.653 Sum_probs=48.5
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcC-----CCCCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDE-----DFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~-----~~~~aYiLfY~ 53 (66)
|||+|.| ++.+|||++++|...+++|++|||..|++++.+++ ...+||+|||+
T Consensus 197 ~vi~h~G~~~~~GHY~~~~~~~~~~~W~~~nD~~V~~v~~~~~~~~~~~~~~~yll~Y~ 255 (255)
T cd02257 197 AVVVHSGTSADSGHYVAYVKDPSDGKWYKFNDDKVTEVSEEEVLEFGSLSSSAYILFYE 255 (255)
T ss_pred EEEEEecCCCCCcCeEEEEeCCCCCceEEEeccccEEcCHHHhhhccCCCCceEEEEEC
Confidence 6999999 55999999999997668999999999999999987 67999999995
No 21
>cd02671 Peptidase_C19O A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.70 E-value=1.2e-17 Score=111.31 Aligned_cols=47 Identities=40% Similarity=0.693 Sum_probs=41.4
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcC---------CCCCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDE---------DFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~---------~~~~aYiLfY~ 53 (66)
|||+|.| ++++|||+||+| |++|||+.|++++++++ ...+||||||+
T Consensus 276 ~VI~H~G~~~~~GHY~a~vr------W~~fdD~~V~~~~~~~~~~~~~~~~~~~~~aYiLfY~ 332 (332)
T cd02671 276 AVVMHSGATISSGHYTAYVR------WLLFDDSEVKVTEEKDFLEALSPNTSSTSTPYLLFYK 332 (332)
T ss_pred EEEEEcCCCCCCCeEEEEEE------EEEEcCcceEEccHHHHHhhcCCCCCCCCceEEEEEC
Confidence 6999999 689999999999 99999999999876543 35799999995
No 22
>KOG1870|consensus
Probab=99.70 E-value=6.9e-18 Score=122.89 Aligned_cols=56 Identities=34% Similarity=0.604 Sum_probs=53.7
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcC
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRD 56 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~ 56 (66)
||.+|+|.+.+|||+||+|+..+++|+.|||+.|++++++++.++.||+|||+|++
T Consensus 787 av~nHyG~l~~GHYta~~k~~~~~~w~~fdDs~v~~~~~~~i~t~~aY~Lfy~r~~ 842 (842)
T KOG1870|consen 787 AVGNHYGQLSGGHYTAYAKNVGDGKWYLFDDSSVSEVDEDEIDTEAAYVLFYRRLD 842 (842)
T ss_pred eeecccCCcCCcchhhhhhcCCCCceEEeccccCCCCChhhcccccceEEEEEecC
Confidence 78999999999999999999889999999999999999999999999999999974
No 23
>cd02672 Peptidase_C19P A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.68 E-value=5.7e-17 Score=105.61 Aligned_cols=47 Identities=28% Similarity=0.404 Sum_probs=42.4
Q ss_pred CEEEeee-cCCCCeEEEEEECCC----CCcEEEEcCceEEEcCCCcCCCCCeEEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPN----TKEWHYFNDETVLEQRPQDEDFNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~----~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~ 53 (66)
|||+|.| +.++|||+|++|... .++||.|||..|++++. .||||||+
T Consensus 217 gvV~hig~~~~~GHyva~vk~~~~~~~~~~WylFND~~V~~vs~------~aYiLfY~ 268 (268)
T cd02672 217 GYVCEINDSSRGQHNVVFVIKVNEESTHGRWYLFNDFLVTPVSE------LAYILLYQ 268 (268)
T ss_pred EEEEEecCCCCCCcEEEEEEccCCCCCCCcEEEecCeEEEEcCc------hheeeecC
Confidence 7999999 459999999999864 57899999999999887 99999995
No 24
>KOG1867|consensus
Probab=99.68 E-value=1.8e-17 Score=115.43 Aligned_cols=56 Identities=29% Similarity=0.512 Sum_probs=52.2
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcCCC
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRDED 58 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~~~ 58 (66)
|||+|.|..++|||+||+|. .+.|++|||+.|+.++.+++.+..||+|||.++...
T Consensus 432 AVV~H~G~~~SGHY~aY~r~--~~~~~~~dDs~v~~~s~~eVl~~~aylLFY~~~~~~ 487 (492)
T KOG1867|consen 432 AVVVHHGTVGSGHYVAYRRQ--SGGWFKCDDSTVTKVSEEEVLSSQAYLLFYTQEQVE 487 (492)
T ss_pred EEEEeccCCCCCceEEEEEe--CCCcEEEcCeEEEEeeHHHhhhchhhheehhHHhhh
Confidence 79999999999999999999 578999999999999999999999999999886543
No 25
>cd02666 Peptidase_C19J A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Probab=99.62 E-value=4.6e-16 Score=104.38 Aligned_cols=53 Identities=26% Similarity=0.378 Sum_probs=47.5
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCC------CCCeEEEEEE
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDED------FNNAYILFYK 53 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~------~~~aYiLfY~ 53 (66)
||++|.|+.++|||++++|+..++.|++|||+.|++++..++. +.+||||+|.
T Consensus 285 avv~H~G~~~~GHY~~~~~~~~~~~W~~~dD~~V~~v~~~ev~~~~~~~~~~pY~l~Yv 343 (343)
T cd02666 285 AVFIHRGEASSGHYWVYIKDFEENVWRKYNDETVTVVPASEVFLFTLGNTATPYFLVYV 343 (343)
T ss_pred EEEEeecCCCCCeEEEEEEECCCCeEEEEECCeeEEecHHHHhhcccCCCCCCEEEEeC
Confidence 6999999889999999999987789999999999999886643 4799999994
No 26
>COG5533 UBP5 Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones]
Probab=99.60 E-value=2e-15 Score=100.98 Aligned_cols=53 Identities=23% Similarity=0.470 Sum_probs=47.4
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCc-CCCCCeEEEEEEEc
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQD-EDFNNAYILFYKKR 55 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~-~~~~~aYiLfY~r~ 55 (66)
||+||.|+.++|||+++++. ++.|+.|||+.|++++... ....++|||||+|.
T Consensus 361 gv~Ch~G~L~gGHY~s~v~~--~~~W~~~dDs~vr~~~~~t~~~~pSsYilFY~r~ 414 (415)
T COG5533 361 GVVCHNGTLNGGHYFSEVKR--SGTWNVYDDSQVRKGSRTTSGSHPSSYILFYTRS 414 (415)
T ss_pred EEEeecceecCceeEEeeee--cCceEEechhheeeccceecccCCcceEEEEEec
Confidence 69999999999999999998 6899999999999987653 45679999999985
No 27
>KOG4598|consensus
Probab=99.53 E-value=2.6e-15 Score=108.22 Aligned_cols=59 Identities=31% Similarity=0.524 Sum_probs=52.7
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCC-------------CCCeEEEEEEEcCCCC
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDED-------------FNNAYILFYKKRDEDS 59 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~-------------~~~aYiLfY~r~~~~~ 59 (66)
+|.+|+|++.+|||+||+++.+++.||.|||.+|+.++..++. +.+||||+|+|.+..-
T Consensus 374 ~imihsg~a~gghy~ayik~~d~~~w~~fnd~~v~~~t~~~i~~sfgg~~~~~~~s~tnaymlmyr~id~kr 445 (1203)
T KOG4598|consen 374 SVMVHSGNAAGGHYFAYIKNLDQDRWYVFNDTRVDFATPLEIEKSFGGHPSGWNQSNTNAYMLMYRRIDPKR 445 (1203)
T ss_pred hhheecCCCCCceeeeeecccCcCceEEecCccccccCHHHHHHhhCCCCCCccccCcchhhhhhhhcCccc
Confidence 5789999999999999999999999999999999999887743 4789999999987654
No 28
>KOG1866|consensus
Probab=99.52 E-value=6.9e-16 Score=111.03 Aligned_cols=59 Identities=32% Similarity=0.544 Sum_probs=50.7
Q ss_pred CEEEeeecCCCCeEEEEEECC---CCCcEEEEcCceEEEcCCCcC--------------------CCCCeEEEEEEEcCC
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHP---NTKEWHYFNDETVLEQRPQDE--------------------DFNNAYILFYKKRDE 57 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~---~~~~W~~fdD~~V~~~~~~~~--------------------~~~~aYiLfY~r~~~ 57 (66)
||++|+|.+.+|||++|++.. .+++||+|||..|++.+.+++ .-.+||||||+|++.
T Consensus 357 GVlvHSGqAsaGHYySfIk~rr~~~~~kWykfnD~~Vte~~~n~me~~cfGGey~q~~~~~~~rrR~WNAYmlFYer~~d 436 (944)
T KOG1866|consen 357 GVLVHSGQASAGHYYSFIKQRRGEDGNKWYKFNDGDVTECKMNEMENECFGGEYMQMMKRMSYRRRWWNAYMLFYERMDD 436 (944)
T ss_pred EEEEecccccCcchhhhhhhhccCCCCceEeccCccccccchhhHHHHhhcchhhhcccccchHHHhhhhHHHHHHHhcC
Confidence 699999999999999999975 357899999999999987762 237999999999976
Q ss_pred CC
Q psy7177 58 DS 59 (66)
Q Consensus 58 ~~ 59 (66)
.+
T Consensus 437 ~p 438 (944)
T KOG1866|consen 437 IP 438 (944)
T ss_pred CC
Confidence 65
No 29
>COG5077 Ubiquitin carboxyl-terminal hydrolase [Posttranslational modification, protein turnover, chaperones]
Probab=99.45 E-value=1.6e-14 Score=104.58 Aligned_cols=61 Identities=23% Similarity=0.335 Sum_probs=52.3
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCC---------------C-------CCeEEEEEEEcCCC
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDED---------------F-------NNAYILFYKKRDED 58 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~---------------~-------~~aYiLfY~r~~~~ 58 (66)
||++|.|..+.|||+|++|...+++||+|||++|+..+..++. + -+||||+|-|++..
T Consensus 435 gVlVHsGDl~~GHyYallKpe~dg~WykfdDtrVtrat~kevleeNfGgd~~~~~k~r~~~~~kRfmsAYmLvYlRks~~ 514 (1089)
T COG5077 435 GVLVHSGDLHEGHYYALLKPEKDGRWYKFDDTRVTRATEKEVLEENFGGDHPYKDKIRDHSGIKRFMSAYMLVYLRKSML 514 (1089)
T ss_pred EEEEeccccCCceEEEEeccccCCCceeecceehhhHHHHHHHHHhcCCCCCCcccccCCchhhhhhhhheeeeehHhHH
Confidence 6999999999999999999888999999999999998766521 1 46899999999877
Q ss_pred CCc
Q psy7177 59 SDV 61 (66)
Q Consensus 59 ~~~ 61 (66)
++.
T Consensus 515 ddL 517 (1089)
T COG5077 515 DDL 517 (1089)
T ss_pred Hhh
Confidence 643
No 30
>KOG1873|consensus
Probab=99.44 E-value=1.3e-14 Score=104.52 Aligned_cols=55 Identities=33% Similarity=0.500 Sum_probs=50.3
Q ss_pred CEEEeeecCCCCeEEEEEECC--------------------CCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEc
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHP--------------------NTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKR 55 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~--------------------~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~ 55 (66)
|||.|.|.+..|||++|+|.. ..++||+.-|+.|++++++++...+||||||+|.
T Consensus 803 gvvehsgtm~~ghyvayv~~~t~~~~~~~~~~~~~~sd~~~~~~~Wy~iSDs~VrevS~d~vLkseAYlLFYERI 877 (877)
T KOG1873|consen 803 GVVEHSGTMSYGHYVAYVRGGTFLDLSAPSNSKDFESDAGIPSGRWYYISDSIVREVSLDEVLKSEAYLLFYERI 877 (877)
T ss_pred cceeccccccCCcchhhhhccchhhccCccccccchhccCCCCcceEEecchheecccHHHHhhhhhhhhheecC
Confidence 689999999999999999953 2468999999999999999999999999999984
No 31
>KOG0944|consensus
Probab=99.38 E-value=4.9e-13 Score=95.59 Aligned_cols=51 Identities=24% Similarity=0.461 Sum_probs=45.2
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEc
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKR 55 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~ 55 (66)
|+|.|.| ++.+|||+|++|.. |+|+.|||++|- ..++.-...+||.||+|.
T Consensus 711 a~IsHmGts~~sGHYV~hirKe--gkWVlfNDeKv~--~S~~ppK~lgYvY~y~R~ 762 (763)
T KOG0944|consen 711 AFISHMGTSAHSGHYVCHIRKE--GKWVLFNDEKVA--ASQEPPKDLGYVYLYTRI 762 (763)
T ss_pred EEEecCCCCCCCcceEEEEeec--CcEEEEcchhhh--hccCChhhcceEEEEEec
Confidence 6899999 99999999999994 789999999987 455556789999999985
No 32
>KOG1864|consensus
Probab=99.29 E-value=3e-12 Score=90.95 Aligned_cols=54 Identities=24% Similarity=0.349 Sum_probs=45.2
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCC---CCCeEEEEEEEc
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDED---FNNAYILFYKKR 55 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~---~~~aYiLfY~r~ 55 (66)
|||+|.| +++.|||+|++|... ..|+.|||+.|++++.+.+. ..+.|++||...
T Consensus 515 avVvH~G~~p~~GHYia~~r~~~-~nWl~fdD~~V~~~s~~~v~~~~~~s~~~~~~~~~ 572 (587)
T KOG1864|consen 515 AVVVHLGSTPNRGHYVAYVKSLD-FNWLLFDDDNVEPISEEPVSEFTGSSGDTLFYYVQ 572 (587)
T ss_pred EEEEeccCCCCCcceEEEEeeCC-CCceecccccccccCcchhhhccCCCccceeeeEE
Confidence 7999999 999999999999853 33999999999999888765 467777777654
No 33
>COG5207 UBP14 Isopeptidase T [Posttranslational modification, protein turnover, chaperones]
Probab=99.27 E-value=4.1e-12 Score=89.49 Aligned_cols=55 Identities=22% Similarity=0.407 Sum_probs=48.9
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCC--cEEEEcCceEEEcCCCcCCCCCeEEEEEEEc
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTK--EWHYFNDETVLEQRPQDEDFNNAYILFYKKR 55 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~--~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~ 55 (66)
|||+|.| ++.+|||++|+|+...+ +|+.|||+++-..+.-+....+.||.||+|.
T Consensus 692 AvI~HkG~s~haGHYv~fIrk~~~~K~kWvl~nDek~v~~~svE~~k~nGYiylf~R~ 749 (749)
T COG5207 692 AVICHKGDSIHAGHYVWFIRKNGKDKWKWVLKNDEKTVLNSSVEVLKDNGYIYLFKRC 749 (749)
T ss_pred eEEeccCCcccccceEEEEecccCcceeEEEEccchheehhhHHHHhhCCeEEEEecC
Confidence 7999999 99999999999986443 6899999999888888888899999999883
No 34
>KOG1872|consensus
Probab=99.27 E-value=2.7e-12 Score=88.80 Aligned_cols=55 Identities=25% Similarity=0.450 Sum_probs=49.2
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCC-------CCeEEEEEEEcC
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDF-------NNAYILFYKKRD 56 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~-------~~aYiLfY~r~~ 56 (66)
|||.|.| +..+|||.+++|.+ .++|++|||..|+.+..+.+.+ .+||||.|+-+.
T Consensus 408 ~vithkgrss~sghy~aw~r~s-~~~w~~fdd~~vs~v~~e~i~~lsgggd~~~ayvllyk~~~ 470 (473)
T KOG1872|consen 408 LVITHKGRSSKSGHYVAWNRVS-EDKWGHFDDDMVSFVLGETILSLSGGGDWHSAYVLLYKARV 470 (473)
T ss_pred EeeeccccccCCCcceEEEecc-CCceeeccccccccccccceeeecCCCccchhhheeecccc
Confidence 6899999 99999999999997 5699999999999998888653 789999998654
No 35
>KOG1871|consensus
Probab=98.76 E-value=1.6e-08 Score=69.32 Aligned_cols=56 Identities=21% Similarity=0.430 Sum_probs=49.9
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCC----CCCeEEEEEEEcC
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDED----FNNAYILFYKKRD 56 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~----~~~aYiLfY~r~~ 56 (66)
||+.|.| ++..|||+..+.+..-+.|+.+||..|..+..+++. +.+||+|.|++.+
T Consensus 360 ~vvyhtgtsatvghYl~dv~~s~~~gw~rIDD~~i~~v~q~dv~~~t~~r~~yllyY~~~d 420 (420)
T KOG1871|consen 360 AVVYHTGTSATVGHYLEDVSRSVPSGWQRIDDALILFVAQEDVEKVTGSRTPYLLYYIEAD 420 (420)
T ss_pred EEEEecccccccCceEEeeeecccCceeEeccceeeeccHhhhccccCccchheeEeeecC
Confidence 5889999 999999999999988889999999999999888765 4789999998853
No 36
>KOG1863|consensus
Probab=98.74 E-value=4.7e-09 Score=78.87 Aligned_cols=61 Identities=26% Similarity=0.435 Sum_probs=50.7
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCC----------C------CCeEEEEEEEcCCCCCc
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDED----------F------NNAYILFYKKRDEDSDV 61 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~----------~------~~aYiLfY~r~~~~~~~ 61 (66)
||.+|.|..++|||+++++.+..++|++|||..|+.++..++. . ..||+|+|.|.+..+..
T Consensus 415 ~v~vh~g~~~~ghy~~~i~~~~~~~w~kfdd~~v~~~~~~~~l~~~~g~~~~~~~~~~~~~~~~~lv~~~~s~~~~~ 491 (1093)
T KOG1863|consen 415 AVLVHSGDAHSGHYVAYINPKLDGKWVKFDDLVVTVVSEKEALEQNYGTEEIELSSTADFKNAYMLVYIRDSCESKI 491 (1093)
T ss_pred hhhcccccccCccceeeecchhhccceeccCceeeeccHHHHHHhhCCCcchhhhcccccCCcceEEEEecCcHHhh
Confidence 5889988999999999999888899999999999999765421 1 23899999999876544
No 37
>PF13423 UCH_1: Ubiquitin carboxyl-terminal hydrolase
Probab=98.59 E-value=7.3e-08 Score=62.96 Aligned_cols=34 Identities=24% Similarity=0.337 Sum_probs=29.5
Q ss_pred CEEEeee-cCCCCeEEEEEECCC--CCcEEEEcCceE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPN--TKEWHYFNDETV 34 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~--~~~W~~fdD~~V 34 (66)
|+|+|.| +.+.|||+|++|... +.+||.|||-.|
T Consensus 259 ~~V~~i~~~~~~~HlVs~vrv~~~~~~~W~lFNDflV 295 (295)
T PF13423_consen 259 SMVCHIGDSIESGHLVSLVRVGPSDDSQWYLFNDFLV 295 (295)
T ss_pred EEEEEecCCCCCCceEEEEEcCCCCCCcEEEECcEeC
Confidence 6899999 799999999999863 369999999654
No 38
>KOG1275|consensus
Probab=97.63 E-value=0.00022 Score=53.71 Aligned_cols=52 Identities=19% Similarity=0.247 Sum_probs=44.6
Q ss_pred CEEEeee-cCCCCeEEEEEECC--------CCCcEEEEcCceEEEcCCCcCC-----CCCeEEEEE
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHP--------NTKEWHYFNDETVLEQRPQDED-----FNNAYILFY 52 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~--------~~~~W~~fdD~~V~~~~~~~~~-----~~~aYiLfY 52 (66)
|+|+|.+ +-+.+|.+++|+.. .+.+||.|||-.|.++++++.. -+.+-||+|
T Consensus 795 a~V~~I~d~~~e~~lVs~Ikv~~~~~~~~~~dsqWylFNDfLV~~ite~EAl~~~~~WKvP~Il~Y 860 (1118)
T KOG1275|consen 795 AMVHAIGDNENEVNLVSPIKVLRPYHVIKPDDSQWYLFNDFLVSEITEEEALHFDGPWKVPAILYY 860 (1118)
T ss_pred eEEEEeccCCCccceEEEEEccCcccccCcCcceeEEEcceeeeeCChHHheEeccCccCcEEEEE
Confidence 6899999 77899999999953 2358999999999999998855 378999999
No 39
>KOG2026|consensus
Probab=96.92 E-value=4.8e-05 Score=52.60 Aligned_cols=47 Identities=19% Similarity=0.369 Sum_probs=42.4
Q ss_pred CCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEc
Q psy7177 9 VSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKR 55 (66)
Q Consensus 9 ~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~ 55 (66)
...|||...+++.+.++||..+|-.|++...+.+.-..+||-+|++.
T Consensus 395 ~e~~~~riqi~~~~s~kW~eiqdl~v~e~~~qmi~L~Es~iQiwe~~ 441 (442)
T KOG2026|consen 395 DEDGNFRIQIYDNSSEKWYEIQDLHVTERLPQMIFLKESFIQIWEKQ 441 (442)
T ss_pred cccCceEEEEEeCCCcceEEecccchhhhhhHHHHHHHHHHHHHhcc
Confidence 56799999999999999999999999999998888888888888764
No 40
>PF05408 Peptidase_C28: Foot-and-mouth virus L-proteinase; InterPro: IPR008739 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to MEROPS peptidase family C28 (clan CA).The protein fold of the peptidase unit for members of this family resembles that of papain. The leader peptidase of Foot-and-mouth disease virus cleaves itself from the growing polyprotein and also cleaves the host translation initiation factor 4GI (eIF4G), thus inhibiting 5'-cap dependent translation [].; GO: 0004197 cysteine-type endopeptidase activity, 0016032 viral reproduction, 0019082 viral protein processing; PDB: 2JQF_R 1QMY_B 1QOL_G 2JQG_R.
Probab=93.69 E-value=0.2 Score=31.88 Aligned_cols=47 Identities=15% Similarity=0.239 Sum_probs=31.0
Q ss_pred eee-c-CCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcCCC
Q psy7177 5 PSK-G-VSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRDED 58 (66)
Q Consensus 5 H~G-~-~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~~~ 58 (66)
|.| . -..||-+.+++. .+.||.+||+.+.+-+++-. =+|.|..-+.+
T Consensus 130 ~agi~~~g~~Havfa~~t--s~gWy~iDDe~~y~~tPdp~-----~VLvfvp~D~E 178 (193)
T PF05408_consen 130 HAGIFLKGQEHAVFACVT--SDGWYAIDDEDFYPWTPDPS-----DVLVFVPYDQE 178 (193)
T ss_dssp EEEEEEESTTEEEEEEEE--TTCEEEEETTEEEE----GG-----GEEEEEESSSS
T ss_pred hhHheecCCcceEEEEEe--eCcEEEecCCeeeeCCCChh-----heEEEcccCcc
Confidence 566 3 447999988887 57899999999998766432 24666655433
No 41
>PF15499 Peptidase_C98: Ubiquitin-specific peptidase-like, SUMO isopeptidase
Probab=93.59 E-value=0.081 Score=35.22 Aligned_cols=22 Identities=14% Similarity=0.382 Sum_probs=19.5
Q ss_pred CCeEEEEEECCCCCcEEEEcCce
Q psy7177 11 AGHYTAYTLHPNTKEWHYFNDET 33 (66)
Q Consensus 11 ~GHY~a~~~~~~~~~W~~fdD~~ 33 (66)
--|+++++++. +|.|.++||-+
T Consensus 231 ~~HFvtWi~~~-dGsWLecDDLk 252 (275)
T PF15499_consen 231 LNHFVTWIRDS-DGSWLECDDLK 252 (275)
T ss_pred CceeEEEEEcC-CCCeEeeccCC
Confidence 46999999997 78899999976
No 42
>PF01473 CW_binding_1: Putative cell wall binding repeat; InterPro: IPR018337 The cell wall-binding repeat (CW) is an about 20 amino acid residue module, essentially found in two bacterial Gram-positive protein families; the choline binding proteins and glucosyltransferases (2.4.1.5 from EC). In choline-binding proteins cell wall binding repeats bind to choline moieties of both teichoic and lipoteichoic acids, two components peculiar to the cell surface of Gram-positive bacteria [, ]. In glucosyltransferases the region spanning the CW repeats is a glucan binding domain []. Several crystal structures of CW have been solved [, ]. In the choline binding protein LytA, the repeats adopt a solenoid fold consisting exclusively of beta-hairpins that stack to form a left-handed superhelix with a boomerang-like shape. The choline groups bind between beta-hairpin 'steps' of the superhelix []. In Cpl-1 CW repeats assemble in two sub-domains: an N-terminal superhelical moiety similar to the LytA one and a C-terminal beta-sheet involved in interactions with the lysozyme domain. Choline is bound between repeats 1 and 2, and, 2 and 3 of the superhelical sub-domain []. Some proteins known to contain cell-wall binding repeats include: Pneumococcal N-acetylmuramoyl-L-alanine amidase (autolysin, lytA) (3.5.1.28 from EC). It is a surface-exposed enzyme that rules the self-destruction of pneumococcal cells through degradation of their peptidoglycan backbone. It mediates the release of toxic substances that damage the host tissues. Pneumococcal endo-beta-N-acetylglucosaminidase (lytB) (3.2.1.96 from EC). It plays an important role in cell wall degradation and cell separation. Pneumococcal teichoic acid phosphorylcholine esterase (pce or cbpE), a cell wall hydrolase important for cellular adhesion and colonisation. Lactobacillales glucosyltransferase. It catalyses the transfer of glucosyl units from the cleavage of sucrose to a growing chain of glucan. Clostridium difficile toxin A (tcdA) and toxin B (tcdb). They are the causative agents of the antibiotic-associated pseudomembranous colitis. They are intracellular acting toxins that reach their targets after receptor-mediated endocytosis. Clostridium acetobutylicum cspA protein. Siphoviridae bacteriophages N-acetylmuramoyl-L-alanine amidase. It lyses the bacterial host cell wall. Podoviridae lysozyme protein (cpl-1). It is capable of digesting the pneumococcal cell wall. The cell wall binding repeats are also known as the choline-binding repeats (ChBr) or the choline-binding domain (ChBD). ; PDB: 1GVM_C 2BML_B 1HCX_A 1OBA_A 1H09_A 2J8F_A 2IXU_A 2J8G_A 2IXV_A 2X8O_A ....
Probab=89.57 E-value=0.35 Score=19.32 Aligned_cols=14 Identities=21% Similarity=0.843 Sum_probs=10.3
Q ss_pred EEECCCCCcEEEEcCc
Q psy7177 17 YTLHPNTKEWHYFNDE 32 (66)
Q Consensus 17 ~~~~~~~~~W~~fdD~ 32 (66)
+++. ++.||.|+++
T Consensus 3 W~~~--~~~wYy~~~~ 16 (19)
T PF01473_consen 3 WVQD--NGNWYYFDSD 16 (19)
T ss_dssp EEEE--TTEEEEETTT
T ss_pred CEEE--CCEEEEeCCC
Confidence 4555 6899999874
No 43
>PF00770 Peptidase_C5: Adenovirus endoprotease; InterPro: IPR000855 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine aminopeptidases belong to the peptidase family C5 (adenain family, clan CE). Several adenovirus proteins are synthesised as precursors, requiring processing by a protease before the virion is assembled [, ]. Until recently, the adenovirus endopeptidase was classified as a serine protease, having been reported to be inhibited by serine protease inhibitors [, ]. However, it has since been shown to be inhibited by cysteine protease inhibitors, and the catalytic residues are believed to be His-54 and Cys-104 [, ].; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 1NLN_A 1AVP_A.
Probab=76.11 E-value=4.3 Score=25.68 Aligned_cols=31 Identities=16% Similarity=0.393 Sum_probs=21.2
Q ss_pred CEEEeee-cCCCCeEEEEEECCCCCcEEEEcC
Q psy7177 1 MCVRPSK-GVSAGHYTAYTLHPNTKEWHYFND 31 (66)
Q Consensus 1 avv~H~G-~~~~GHY~a~~~~~~~~~W~~fdD 31 (66)
|||+-.| ...+=|..|++.++.+.+.|.||-
T Consensus 22 AIVNT~~retGGvHWlA~Aw~P~s~t~YmFDP 53 (183)
T PF00770_consen 22 AIVNTGGRETGGVHWLAFAWDPRSRTFYMFDP 53 (183)
T ss_dssp EEEESS-TTT--S-EEEEEEETTTTEEEEE-T
T ss_pred EEEecCCcccCceeEEEEEecCCcceEEEeCC
Confidence 4666666 566669999999998889999984
No 44
>PF08770 SoxZ: Sulphur oxidation protein SoxZ; InterPro: IPR014880 SoxZ forms an anti parallel beta structure and forms a complex with SoxY. Sulphur oxidation occurs at the thiol of a conserved cysteine residue of the SoxY subunit []. ; PDB: 1V8H_B 2OX5_E 2OXG_E 2OXH_C.
Probab=71.54 E-value=7.1 Score=22.14 Aligned_cols=41 Identities=17% Similarity=0.194 Sum_probs=29.0
Q ss_pred eEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcCCCCC
Q psy7177 13 HYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRDEDSD 60 (66)
Q Consensus 13 HY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~~~~~ 60 (66)
||+.-+. +.+||..|-..+.....++++|+-|+-+-.....
T Consensus 41 ~~I~~v~-------v~~ng~~v~~~~~~~siS~NP~l~F~~~~~~~g~ 81 (100)
T PF08770_consen 41 HFIEEVE-------VTYNGKPVFRADWGPSISENPYLRFSFKGKKSGT 81 (100)
T ss_dssp B-EEEEE-------EEETTEEEEEEEE-TTB-SS-EEEEEEEESSSEE
T ss_pred HheEEEE-------EEECCEEEEEEEeCCcccCCCcEEEEEecCCCcE
Confidence 6655554 3689999999999989999999999988755443
No 45
>PF08715 Viral_protease: Papain like viral protease; InterPro: IPR014827 This family of viral proteases are similar to the papain protease and are required for proteolytic processing of the replicase polyprotein. The structure of this protein has shown it adopts a fold similar to that of de-ubiquitinating enzymes []. ; GO: 0004197 cysteine-type endopeptidase activity, 0008242 omega peptidase activity, 0016740 transferase activity; PDB: 3MP2_A 3EWP_B 3EWO_B 2FE8_A 3MJ5_B 3EKE_A 3EJF_A 3JZT_H 3ETI_E 3E9S_A.
Probab=69.73 E-value=6.4 Score=26.89 Aligned_cols=33 Identities=18% Similarity=0.306 Sum_probs=20.7
Q ss_pred EeeecCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCc
Q psy7177 4 RPSKGVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQD 41 (66)
Q Consensus 4 ~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~ 41 (66)
+-.|+.++|||+ |.+.. +. .+|-.++.+.+...
T Consensus 266 vF~G~~~~GHYt-~~~~~--~~--~~Dg~~~~k~~~~~ 298 (320)
T PF08715_consen 266 VFTGSTDSGHYT-HDVAG--KA--MYDGDKVKKFSDNK 298 (320)
T ss_dssp EEESCTTTCEEE-EEEES--SS--EEETTEEEEESEEE
T ss_pred EEEcCCCccceE-EEccC--ce--eEEcCcccccCccc
Confidence 346888999999 76653 22 24666666655443
No 46
>PF02099 Josephin: Josephin; InterPro: IPR006155 Human genes containing triplet repeats can markedly expand in length, leading to neuropsychiatric disease. Expansion of triplet repeats explains the phenomenon of anticipation, i.e. the increasing severity or earlier age of onset in successive generations in a pedigree []. A novel gene containing CAG repeats has been identified and mapped to chromosome 14q32.1, the genetic locus for Machado-Joseph disease (MJD). Normally, the gene contains 13-36 CAG repeats, but most clinically diagnosed patients and all affected members of a family with the clinical and pathological diagnosis of MJD show expansion of the repeat number, from 68-79 []. Similar abnormalities in related genes may give rise to diseases similar to MJD. MJD is a neurodegenerative disorder characterised by cerebellar ataxia, pyramidal and extra-pyramidal signs, peripheral nerve palsy, external ophtalmoplegia, facial and lingual fasciculation and bulging. The disease is autosomal dominant, with late onset of symptoms, generally after the fourth decade.; GO: 0008242 omega peptidase activity; PDB: 3O65_G 1YZB_A 2JRI_A 2DOS_A 2AGA_A.
Probab=61.20 E-value=17 Score=22.22 Aligned_cols=28 Identities=11% Similarity=0.143 Sum_probs=19.9
Q ss_pred CEEEeeecCCCCeEEEEEECCCCCcEEEEcCceE
Q psy7177 1 MCVRPSKGVSAGHYTAYTLHPNTKEWHYFNDETV 34 (66)
Q Consensus 1 avv~H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V 34 (66)
|+|++ ..-|.++.-|- ++.||.+|-..-
T Consensus 101 gfI~N----~~~HWf~iRki--~~~wyNLDS~l~ 128 (157)
T PF02099_consen 101 GFICN----LSRHWFAIRKI--GGQWYNLDSKLK 128 (157)
T ss_dssp EEEEE----CTTEEEEEEEE--TTEEEEECTTTS
T ss_pred EEEec----cCcceEEEEee--CCeeEeccCCCC
Confidence 34555 45688876666 799999987543
No 47
>smart00460 TGc Transglutaminase/protease-like homologues. Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.
Probab=59.92 E-value=18 Score=17.88 Aligned_cols=20 Identities=15% Similarity=0.069 Sum_probs=16.1
Q ss_pred CCCeEEEEEECCCCCcEEEEcC
Q psy7177 10 SAGHYTAYTLHPNTKEWHYFND 31 (66)
Q Consensus 10 ~~GHY~a~~~~~~~~~W~~fdD 31 (66)
...|-+.-++. +++|+.||-
T Consensus 47 ~~~H~W~ev~~--~~~W~~~D~ 66 (68)
T smart00460 47 WEAHAWAEVYL--EGGWVPVDP 66 (68)
T ss_pred CCcEEEEEEEE--CCCeEEEeC
Confidence 57888888887 489999984
No 48
>PF05986 ADAM_spacer1: ADAM-TS Spacer 1; InterPro: IPR010294 This domain represents the Spacer-1 domain from the ADAM-TS family of metalloproteinases []. A cellular disintegrin and metalloproteinase (ADAM) is a family of genes with structural homology to the snake venom metalloproteinases and disintegrins []. There is variation amongst members of the family, however, all have a similar domain organisation comprising a preproregion, a reprolysin-type catalytic domain, a disintegrin-like domain, a thrombospondin type-1 (TS) module, a cysteine-rich domain, a spacer domain without cysteine residues, and a COOH-terminal TS module [, ]. They are involved in embryogenesis and have been implicated in some cancers and inflammatory diseases [].; GO: 0004222 metalloendopeptidase activity, 0031012 extracellular matrix
Probab=54.41 E-value=39 Score=19.25 Aligned_cols=51 Identities=12% Similarity=0.147 Sum_probs=29.0
Q ss_pred CCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcCC-CCCcccCCC
Q psy7177 11 AGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRDE-DSDVEVSGP 66 (66)
Q Consensus 11 ~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~~-~~~~~~~gP 66 (66)
+.+|.| +|+. +++++.=.+..+..-.. ..-..=.+.|+|.+. .+.+...||
T Consensus 37 s~n~La-lk~~-~g~y~lNg~~~i~~~~~---~~~aGt~~~Y~~~~~~~E~i~~~GP 88 (114)
T PF05986_consen 37 SSNYLA-LKNS-DGKYVLNGNWVISWPGT---YSVAGTTFEYSRSDDNLERITAPGP 88 (114)
T ss_pred CccEEE-EEec-CCcEEEcCCccccCCcC---EEeCCeEEEEEecCCCCEEEEcCCC
Confidence 347775 5554 36665555554432211 223333688988777 667777777
No 49
>cd05860 Ig4_SCFR Fourth immunoglobulin (Ig)-like domain of stem cell factor receptor (SCFR). Ig4_SCFR: The fourth Immunoglobulin (Ig)-like domain in stem cell factor receptor (SCFR). SCFR is organized as an extracellular component having five IG-like domains, a transmembrane segment, and a cytoplasmic portion having protein tyrosine kinase activity. SCFR and its ligand SCF are critical for normal hematopoiesis, mast cell development, melanocytes and gametogenesis. SCF binds to the second and third Ig-like domains of SCFR. This fourth Ig-like domain participates in SCFR dimerization, which follows ligand binding. Deletion of this fourth domain abolishes the ligand-induced dimerization of SCFR and completely inhibits signal transduction.
Probab=53.86 E-value=21 Score=20.36 Aligned_cols=15 Identities=27% Similarity=0.519 Sum_probs=12.5
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|+|+..+++..
T Consensus 74 ~~E~G~YTf~a~N~~ 88 (101)
T cd05860 74 GTEGGTYTFLVSNSD 88 (101)
T ss_pred hhhCcEEEEEEECCC
Confidence 467899999999863
No 50
>cd05896 Ig1_IL1RAPL-1_like First immunoglobulin (Ig)-like domain of X-linked interleukin-1 receptor accessory protein-like 1 (IL1RAPL-1). Ig1_ IL1RAPL-1_like: domain similar to the first immunoglobulin (Ig)-like domain of X-linked interleukin-1 receptor accessory protein-like 1 (IL1RAPL-1). IL-1 alpha and IL-1 beta are cytokines which participates in the regulation of inflammation, immune responses, and hematopoiesis. These cytokines bind to the IL-1 receptor type 1 (IL1R1), which is activated on additional association with an accessory protein, IL1RAP. IL-1 also binds a second receptor designated type II (IL1R2). Mature IL1R1 consists of three Ig-like domains, a transmembrane domain, and a large cytoplasmic domain. Mature IL1R2 is organized similarly except that it has a short cytoplasmic domain. The latter does not initiate signal transduction. A naturally occurring cytokine IL-1RA (IL-1 receptor antagonist) is widely expressed and binds to IL-1 receptors, inhibiting the binding of
Probab=51.42 E-value=11 Score=21.69 Aligned_cols=13 Identities=23% Similarity=0.325 Sum_probs=11.4
Q ss_pred CCCCeEEEEEECC
Q psy7177 9 VSAGHYTAYTLHP 21 (66)
Q Consensus 9 ~~~GHY~a~~~~~ 21 (66)
.++|+|+|-+|+.
T Consensus 80 eDSG~Y~C~~rN~ 92 (104)
T cd05896 80 QDSGLYTCVLRNS 92 (104)
T ss_pred hhCeEEEEEECCC
Confidence 4789999999986
No 51
>cd04480 RPA1_DBD_A_like RPA1_DBD_A_like: A subgroup of uncharacterized plant OB folds with similarity to the second OB fold, the ssDNA-binding domain (DBD)-A, of human RPA1 (also called RPA70). RPA1 is the large subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-A, RPA1 contains three other OB folds: DBD-B, DBD-C, and RPA1N. The major DNA binding activity of RPA is associated with DBD-A and DBD-B of RPA1. RPA1 DBD-C is involved in trimerization. The ssDNA-binding mechanism is believed to be multistep and to involve conformational change.
Probab=49.37 E-value=40 Score=17.89 Aligned_cols=32 Identities=9% Similarity=0.193 Sum_probs=25.4
Q ss_pred CCcEEEEcCceEEEcCCCcCCCCCeEEEEEEE
Q psy7177 23 TKEWHYFNDETVLEQRPQDEDFNNAYILFYKK 54 (66)
Q Consensus 23 ~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r 54 (66)
.|+||.+..=.|.+.+..--.....|++.+..
T Consensus 51 eg~vy~is~f~v~~~~~~y~~~~~~y~I~f~~ 82 (86)
T cd04480 51 EGKWYTISNFEVAPNTGSYRPTDHPYKIKFMS 82 (86)
T ss_pred eCCEEEEeeEEEEcCCCcccccCCcEEEEeec
Confidence 57899999999988876666677889887654
No 52
>PF09149 DUF1935: Domain of unknown function (DUF1935); InterPro: IPR015232 This entry represents a conserved region found in various bacterial and eukaryotic hypothetical proteins, as well as in the cysteine protease calpain. Its function has not, as yet, been defined. ; PDB: 1R75_A 2FE0_A.
Probab=47.48 E-value=14 Score=21.11 Aligned_cols=15 Identities=27% Similarity=0.729 Sum_probs=9.9
Q ss_pred CCCcEEEEcCceEEE
Q psy7177 22 NTKEWHYFNDETVLE 36 (66)
Q Consensus 22 ~~~~W~~fdD~~V~~ 36 (66)
.+++|..+||+.--.
T Consensus 30 ~~~~WaFYNDT~~y~ 44 (104)
T PF09149_consen 30 KEGRWAFYNDTKDYE 44 (104)
T ss_dssp TTTEEEEEE--SSEE
T ss_pred CCCEEEEEeCCCcEE
Confidence 478999999987544
No 53
>PF01841 Transglut_core: Transglutaminase-like superfamily; InterPro: IPR002931 This domain is found in many proteins known to have transglutaminase activity, i.e. which cross-link proteins through an acyl-transfer reaction between the gamma-carboxamide group of peptide-bound glutamine and the epsilon-amino group of peptide-bound lysine, resulting in a epsilon-(gamma-glutamyl)lysine isopeptide bond. Tranglutaminases have been found in a diverse range of species, from bacteria through to mammals. The enzymes require calcium binding and their activity leads to post-translational modification of proteins through acyl-transfer reactions, involving peptidyl glutamine residues as acyl donors and a variety of primary amines as acyl acceptors, with the generation of proteinase resistant isopeptide bonds []. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterised transglutaminase, the human blood clotting factor XIIIa' []. On the basis of the experimentally demonstrated activity of the Methanobacterium phage psiM2 pseudomurein endoisopeptidase [], it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease []. A subunit of plasma Factor XIII revealed that each Factor XIIIA subunit is composed of four domains (termed N-terminal beta-sandwich, core domain (containing the catalytic and the regulatory sites), and C-terminal beta-barrels 1 and 2) and that two monomers assemble into the native dimer through the surfaces in domains 1 and 2, in opposite orientation. This organisation in four domains is highly conserved during evolution among transglutaminase isoforms [].; PDB: 2F4M_A 2F4O_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B ....
Probab=47.32 E-value=24 Score=19.07 Aligned_cols=20 Identities=15% Similarity=0.244 Sum_probs=16.0
Q ss_pred CCCeEEEEEECCCCCcEEEEc
Q psy7177 10 SAGHYTAYTLHPNTKEWHYFN 30 (66)
Q Consensus 10 ~~GHY~a~~~~~~~~~W~~fd 30 (66)
..+|.+.-+..+ +++|+.+|
T Consensus 93 ~~~H~w~ev~~~-~~~W~~~D 112 (113)
T PF01841_consen 93 NDNHAWVEVYLP-GGGWIPLD 112 (113)
T ss_dssp EEEEEEEEEEET-TTEEEEEE
T ss_pred CCCEEEEEEEEc-CCcEEEcC
Confidence 446988888884 68899987
No 54
>COG0093 RplN Ribosomal protein L14 [Translation, ribosomal structure and biogenesis]
Probab=46.86 E-value=38 Score=20.17 Aligned_cols=18 Identities=17% Similarity=0.379 Sum_probs=14.9
Q ss_pred CCcEEEEcCceEEEcCCC
Q psy7177 23 TKEWHYFNDETVLEQRPQ 40 (66)
Q Consensus 23 ~~~W~~fdD~~V~~~~~~ 40 (66)
+|.|.+|||.-.-.++.+
T Consensus 73 DGs~i~FddNA~Viin~~ 90 (122)
T COG0093 73 DGSYIKFDDNAAVIINPD 90 (122)
T ss_pred CCCEEEeCCceEEEECCC
Confidence 789999999887777665
No 55
>cd00524 SORL Superoxide reductase-like (SORL) domain; present in a family of mononuclear non-heme iron proteins that includes superoxide reductase and desulfoferrodoxin. Superoxide reductase-like proteins scavenge superoxide anion radicals as a defense mechanism against reactive oxygen species and are found in anaerobic bacteria and archeae, and microaerophilic Treponema pallidum. The SORL domain contains an active iron site, Fe[His4Cys(Glu)], which in the reduced state loses the glutamate ligand. Superoxide reductase (class II) forms a homotetramer with four Fe[His4Cys(Glu)] centers. Desulfoferrodoxin (class I) is a homodimeric protein, with each protomer comprised of two domains, the N-terminal desulforedoxin (DSRD) domain and C-terminal SORL domain. Each domain has a distinct iron center: the DSRD iron center I, Fe(S-Cys)4; and the SORL iron center II, Fe[His4Cys(Glu)].
Probab=45.11 E-value=50 Score=17.78 Aligned_cols=40 Identities=23% Similarity=0.239 Sum_probs=27.7
Q ss_pred cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcC
Q psy7177 8 GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRD 56 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~ 56 (66)
-+.-.||+.++.. .++|..|....... ..+++.-|..+.+
T Consensus 29 pm~~~HyI~wI~l-------~~~d~~v~~~~l~P--~~~P~~~F~l~~~ 68 (86)
T cd00524 29 PMTEEHYIEWIEL-------YFGDEKVGRVELTP--GTKPEATFTVKAP 68 (86)
T ss_pred CCCCCcEEEEEEE-------EECCeEEEEEECCC--CCCCEEEEEEecC
Confidence 4667899998864 57776777665544 5677777776643
No 56
>PF14509 GH97_C: Glycosyl-hydrolase 97 C-terminal, oligomerisation; PDB: 3A24_A 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A.
Probab=44.48 E-value=18 Score=20.55 Aligned_cols=38 Identities=18% Similarity=0.311 Sum_probs=21.6
Q ss_pred CCeEEEEEECCC-CCcEEE--EcCc--eEEEcCCCcCCCCCeE
Q psy7177 11 AGHYTAYTLHPN-TKEWHY--FNDE--TVLEQRPQDEDFNNAY 48 (66)
Q Consensus 11 ~GHY~a~~~~~~-~~~W~~--fdD~--~V~~~~~~~~~~~~aY 48 (66)
-|.|...+|.+. ++.||. .|++ +.-.++.+.......|
T Consensus 13 pGeyvviARr~~~G~~Wyvg~in~~~~r~i~l~L~FL~~g~~y 55 (103)
T PF14509_consen 13 PGEYVVIARRKRDGDDWYVGGINGEDARTITLPLSFLDKGKKY 55 (103)
T ss_dssp TTTEEEEEEEETTTTEEEEEEEE-TT-EEEEEEGCCS-TT--E
T ss_pred CceEEEEEEEcCCCCCEEEEEeeCCCceEEEEECcccCCCCcE
Confidence 499999999963 589993 4444 3333444444444355
No 57
>cd05751 Ig1_LILRB1_like First immunoglobulin (Ig)-like domain found in Leukocyte Ig-like receptors (LILR)B1 (also known as LIR-1) and similar proteins. Ig1_LILRB1_like: domain similar to the first immunoglobulin (Ig)-like domain found in Leukocyte Ig-like receptors (LILR)B1 (also known as LIR-1). This group includes, LILRA5 (LIR9), an activating natural cytotoxicity receptor NKp46, and the immune-type receptor glycoprotein VI (GPVI). LILRs are a family of immunoreceptors expressed on expressed on T and B cells, on monocytes, dendritic cells, and subgroups of natural killer (NK) cells. The human LILR family contains nine proteins (LILRA1-3,and 5, and LILRB1-5). From functional assays, and as the cytoplasmic domains of various LILRs, for example LILRB1 (LIR-1), LILRB2 (LIR-2), and LILRB3 (LIR-3) contain immunoreceptor tyrosine-based inhibitory motifs (ITIMs) it is thought that LIR proteins are inhibitory receptors. Of the eight LIR family proteins, only LIR-1(LILRB1), and LIR-2 (LILRB2),
Probab=44.36 E-value=21 Score=19.13 Aligned_cols=12 Identities=33% Similarity=0.540 Sum_probs=10.2
Q ss_pred CCCeEEEEEECC
Q psy7177 10 SAGHYTAYTLHP 21 (66)
Q Consensus 10 ~~GHY~a~~~~~ 21 (66)
++|.|.|..++.
T Consensus 64 ~~G~Y~C~~~~~ 75 (91)
T cd05751 64 HAGRYRCYYRSG 75 (91)
T ss_pred HCEEEEEEEECC
Confidence 589999999874
No 58
>PHA02131 hypothetical protein
Probab=41.30 E-value=31 Score=18.03 Aligned_cols=26 Identities=23% Similarity=0.469 Sum_probs=18.3
Q ss_pred CCCeEEEEEECCCCCcEEEEcCceEEEcCCC
Q psy7177 10 SAGHYTAYTLHPNTKEWHYFNDETVLEQRPQ 40 (66)
Q Consensus 10 ~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~ 40 (66)
-.|||+--+ ..|..|.|.+|..++..
T Consensus 21 irgh~~~g~-----~c~imfk~~~v~dctfk 46 (70)
T PHA02131 21 IRGHYRFGI-----SCWIMFKNDQVIDCTFK 46 (70)
T ss_pred eccceecce-----EEEEEEcCCCEEEeeec
Confidence 357776433 25999999999887543
No 59
>PRK08571 rpl14p 50S ribosomal protein L14P; Reviewed
Probab=39.58 E-value=55 Score=19.67 Aligned_cols=19 Identities=16% Similarity=0.232 Sum_probs=15.7
Q ss_pred CCCcEEEEcCceEEEcCCC
Q psy7177 22 NTKEWHYFNDETVLEQRPQ 40 (66)
Q Consensus 22 ~~~~W~~fdD~~V~~~~~~ 40 (66)
.+|.|..|||..+--++.+
T Consensus 83 ~dGs~i~F~dNa~VLin~~ 101 (132)
T PRK08571 83 PDGTRVKFEDNAAVIVTPE 101 (132)
T ss_pred CCCcEEEeCCcEEEEECCC
Confidence 3788999999998888755
No 60
>cd03172 SORL_classII Superoxide reductase-like (SORL) domain, class II; SORL-domains are present in a family of mononuclear non-heme iron proteins that includes superoxide reductase and desulfoferrodoxin. Superoxide reductase-like proteins scavenge superoxide anion radicals as a defense mechanism against reactive oxygen species and are found in anaerobic bacteria and archeae, and microaerophilic Treponema pallidum. The SORL domain contains an active iron site, Fe[His4Cys(Glu)], which in the reduced state loses the glutamate ligand. Superoxide reductase (class II) forms a homotetramer with four Fe[His4Cys(Glu)] centers.
Probab=39.23 E-value=73 Score=18.00 Aligned_cols=41 Identities=12% Similarity=0.042 Sum_probs=26.5
Q ss_pred CCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcC--CCCCeEEEEEEEcC
Q psy7177 9 VSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDE--DFNNAYILFYKKRD 56 (66)
Q Consensus 9 ~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~--~~~~aYiLfY~r~~ 56 (66)
+..+||+.++.. .++|..|........ ....+..-|..+.+
T Consensus 38 M~~~HyI~wI~l-------~~~~~~v~r~~l~p~~~~~~~P~a~F~l~~~ 80 (104)
T cd03172 38 NTTEHHIEWIEL-------YFGVYLLGRVEFTAHGGVYTKPEATFTVKIP 80 (104)
T ss_pred CCCCeEEEEEEE-------EECCEEEEEEEEeCCCCCCCCCeEEEEEEcc
Confidence 667999998864 566767766655432 25666677765544
No 61
>PTZ00054 60S ribosomal protein L23; Provisional
Probab=38.80 E-value=56 Score=19.82 Aligned_cols=19 Identities=21% Similarity=0.436 Sum_probs=15.9
Q ss_pred CCCcEEEEcCceEEEcCCC
Q psy7177 22 NTKEWHYFNDETVLEQRPQ 40 (66)
Q Consensus 22 ~~~~W~~fdD~~V~~~~~~ 40 (66)
.+|.|+.|||.-+--++.+
T Consensus 90 ~dGs~i~F~dNA~VLin~~ 108 (139)
T PTZ00054 90 KDGVFIYFEDNAGVIVNPK 108 (139)
T ss_pred CCCcEEEeCCcEEEEECCC
Confidence 3789999999998888765
No 62
>cd05750 Ig_Pro_neuregulin Immunoglobulin (Ig)-like domain in neuregulins (NRGs). Ig_Pro_neuregulin: immunoglobulin (Ig)-like domain in neuregulins (NRGs). NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. There are four members of the neuregulin gene family (NRG1, -2, -3, and -4). The NRG-1 protein, binds to and activates the tyrosine kinases receptors ErbB3 and ErbB4, initiating signaling cascades. The other NRGs proteins bind one or the other or both of these ErbBs. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitters receptors; in the peripheral nervous system NRG-1 regulates processes such as target cell differentiation, and Schwann cell surv
Probab=37.96 E-value=55 Score=16.15 Aligned_cols=14 Identities=21% Similarity=0.268 Sum_probs=10.6
Q ss_pred CCCCeEEEEEECCC
Q psy7177 9 VSAGHYTAYTLHPN 22 (66)
Q Consensus 9 ~~~GHY~a~~~~~~ 22 (66)
.+.|.|+|.+.+..
T Consensus 53 ~D~G~Y~C~a~N~~ 66 (75)
T cd05750 53 ADSGEYTCVVENIL 66 (75)
T ss_pred ccCeEEEEEEEEcC
Confidence 56788888888753
No 63
>cd04474 RPA1_DBD_A RPA1_DBD_A: A subfamily of OB folds corresponding to the second OB fold, the ssDNA-binding domain (DBD)-A, of human RPA1 (also called RPA70). RPA1 is the large subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-A, RPA1 contains three other OB folds: DBD-B, DBD-C, and RPA1N. The major DNA binding activity of human RPA (hRPA) and Saccharomyces cerevisiae RPA (ScRPA) is associated with DBD-A and DBD-B of RPA1. RPA1 DBD-C is involved in trimerization. The ssDNA-binding mechanism is believed to be multistep and to involve conformational change. Although ScRPA and the hRPA have similar ssDNA-binding properties, they differ funct
Probab=37.69 E-value=75 Score=17.63 Aligned_cols=32 Identities=13% Similarity=0.247 Sum_probs=26.0
Q ss_pred CCcEEEEcCceEEEcCCCcCCCCCeEEEEEEE
Q psy7177 23 TKEWHYFNDETVLEQRPQDEDFNNAYILFYKK 54 (66)
Q Consensus 23 ~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r 54 (66)
.|+||.+..-.|.+.+..--.....|.+.+.+
T Consensus 67 eG~vy~i~~~~V~~a~~~y~~~~~~yeI~f~~ 98 (104)
T cd04474 67 VGKVYYISKGSVKVANKKFNTLKNDYEITFNR 98 (104)
T ss_pred cccEEEEeccEEeeccccCCCCCCcEEEEECC
Confidence 57899999999999877666667899887765
No 64
>PHA01365 hypothetical protein
Probab=36.86 E-value=41 Score=18.78 Aligned_cols=25 Identities=12% Similarity=0.207 Sum_probs=15.7
Q ss_pred EEcCceEEEc--CCCcCCCCCeEEEEE
Q psy7177 28 YFNDETVLEQ--RPQDEDFNNAYILFY 52 (66)
Q Consensus 28 ~fdD~~V~~~--~~~~~~~~~aYiLfY 52 (66)
+|+|++|.++ ...+..-..-||.|=
T Consensus 15 ~fkdskieki~~lps~~dv~~KYiif~ 41 (91)
T PHA01365 15 CFKDSSIDVIFMSCNNLSPHKKYMIID 41 (91)
T ss_pred HhCCCceEEEEEecCCCCccccEEEEE
Confidence 5788888776 333334466777753
No 65
>PF01088 Peptidase_C12: Ubiquitin carboxyl-terminal hydrolase, family 1; InterPro: IPR001578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C12 (ubiquitin C-terminal hydrolase family, clan CA). Families within the CA clan are loosely termed papain-like as protein fold of the peptidase unit resembles that of papain, the type example for clan CA. The type example is the human ubiquitin C-terminal hydrolase UCH-L1. Ubiquitin is highly conserved, commonly found conjugated to proteins in eukaryotic cells, where it may act as a marker for rapid degradation, or it may have a chaperone function in protein assembly []. The ubiquitin is released by cleavage from the bound protein by a protease []. A number of deubiquitinising proteases are known: all are activated by thiol compounds [, ], and inhibited by thiol-blocking agents and ubiquitin aldehyde [, ], and as such have the properties of cysteine proteases []. The deubiquitinsing proteases can be split into 2 size ranges (20-30 kDa and 100-200 kDa, IPR001394 from INTERPRO) []: this family are the 20-30 kDa ppeptides which includes the yeast yuh1. Yeast yuh1 protease is known to be active only against small ubiquitin conjugates, being inactive against conjugated beta-galactosidase []. A mammalian homologue, UCH (ubiquitin conjugate hydrolase), is one of the most abundant proteins in the brain []. Only one conserved cysteine can be identified, along with two conserved histidines. The spacing between the cysteine and the second histidine is thought to be more representative of the cysteine/histidine spacing of a cysteine protease catalytic dyad [].; GO: 0004221 ubiquitin thiolesterase activity, 0006511 ubiquitin-dependent protein catabolic process, 0005622 intracellular; PDB: 1CMX_A 4DM9_A 2ETL_A 3IRT_A 3KW5_A 3IFW_A 2LEN_A 3KVF_A 2WDT_C 2WE6_B ....
Probab=36.84 E-value=28 Score=22.21 Aligned_cols=24 Identities=8% Similarity=-0.009 Sum_probs=18.6
Q ss_pred CeEEEEEECCCCCcEEEEcCceEEEc
Q psy7177 12 GHYTAYTLHPNTKEWHYFNDETVLEQ 37 (66)
Q Consensus 12 GHY~a~~~~~~~~~W~~fdD~~V~~~ 37 (66)
=||+|||.. +|.=|++|..+-.++
T Consensus 166 ~HFI~fV~~--~G~LyELDG~k~~Pi 189 (214)
T PF01088_consen 166 FHFIAFVPV--DGHLYELDGRKSGPI 189 (214)
T ss_dssp EEEEEEEEE--TTEEEEEETTSSS-E
T ss_pred ccEEEEEeE--CCeEEEcCCCCCCCe
Confidence 499999996 788889988765444
No 66
>PF12252 SidE: Dot/Icm substrate protein; InterPro: IPR021014 This entry represents bacterial proteins that are typically between 397 and 1543 amino acids in length including SidE protein in the Dot/Icm pathway of Legionella pneumophila bacteria. There is little literature describing the family.
Probab=36.33 E-value=52 Score=26.66 Aligned_cols=24 Identities=13% Similarity=0.268 Sum_probs=20.6
Q ss_pred cCCCCeEEEEEECCCCCcEEEEcCc
Q psy7177 8 GVSAGHYTAYTLHPNTKEWHYFNDE 32 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~~~~W~~fdD~ 32 (66)
..++||++..++-+ ++++|.||.-
T Consensus 59 ~~~sghwimlikg~-gn~y~lfdpl 82 (1439)
T PF12252_consen 59 RQDSGHWIMLIKGQ-GNQYYLFDPL 82 (1439)
T ss_pred CCcCceeEEEEEcC-CCceEEeccc
Confidence 57899999999987 7899999853
No 67
>cd05756 Ig1_IL1R_like First immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R) and similar proteins. Ig1_IL1R_like: domain similar to the first immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R). IL-1 alpha and IL-1 beta are cytokines which participate in the regulation of inflammation, immune responses, and hematopoiesis. These cytokines bind to the IL-1 receptor type 1 (IL1R1), which is activated on additional association with an accessory protein, IL1RAP. IL-1 also binds a second receptor designated type II (IL1R2). Mature IL1R1 consists of three Ig-like domains, a transmembrane domain, and a large cytoplasmic domain. Mature IL1R2 is organized similarly except that it has a short cytoplasmic domain. The latter does not initiate signal transduction. A naturally occurring cytokine IL-1RA (IL-1 receptor antagonist) is widely expressed and binds to IL-1 receptors, inhibiting the binding of IL-1 alpha and IL-1 beta.
Probab=36.31 E-value=32 Score=18.67 Aligned_cols=14 Identities=21% Similarity=0.218 Sum_probs=11.8
Q ss_pred cCCCCeEEEEEECC
Q psy7177 8 GVSAGHYTAYTLHP 21 (66)
Q Consensus 8 ~~~~GHY~a~~~~~ 21 (66)
..++|.|+|.+++.
T Consensus 69 ~~DsG~Y~C~~~N~ 82 (94)
T cd05756 69 LEDSGLYTCVVRNS 82 (94)
T ss_pred cccCeEEEEEEcCC
Confidence 46789999999986
No 68
>cd05747 Ig5_Titin_like M5, fifth immunoglobulin (Ig)-like domain of human titin C terminus and similar proteins. Ig5_Titin_like: domain similar to the M5, fifth immunoglobulin (Ig)-like domain from the human titin C terminus. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic; depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone, and appears to function similar to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching.
Probab=36.10 E-value=63 Score=16.99 Aligned_cols=21 Identities=19% Similarity=0.158 Sum_probs=14.7
Q ss_pred cCCCCeEEEEEECCCCCcEEE
Q psy7177 8 GVSAGHYTAYTLHPNTKEWHY 28 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~~~~W~~ 28 (66)
..++|-|+|.+.+..+..+..
T Consensus 68 ~~D~G~Y~C~a~N~~G~~~~~ 88 (92)
T cd05747 68 MSDEGNYTVVVENSEGKQEAQ 88 (92)
T ss_pred cccCEeEEEEEEcCCCCEeEE
Confidence 467889999988875544443
No 69
>PF02721 DUF223: Domain of unknown function DUF223; InterPro: IPR003871 The function of this domain has not been characterised, but may be involved in nucleic acid or nucleotide binding.
Probab=35.28 E-value=62 Score=17.58 Aligned_cols=35 Identities=11% Similarity=0.183 Sum_probs=27.0
Q ss_pred CCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcC
Q psy7177 22 NTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRD 56 (66)
Q Consensus 22 ~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~ 56 (66)
..|.||.+.+=.|.+....--.+...|.+.+....
T Consensus 29 ~Eg~~y~i~~F~V~~~~~~yr~t~h~y~I~f~~~T 63 (95)
T PF02721_consen 29 KEGSWYTISNFTVSPNSGSYRPTDHKYKINFMPNT 63 (95)
T ss_pred ccCCEEEeEeEEEEeCCCceeccCCCEEEEECCcC
Confidence 36899999998888877666667888988776554
No 70
>cd05855 Ig_TrkB_d5 Fifth domain (immunoglobulin-like) of Trk receptor TrkB. TrkB_d5: the fifth domain of Trk receptor TrkB, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors, which mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. The Trks are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkB shares significant sequence homology and domain organization with TrkA, and TrkC. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrKB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. In some cell systems NT-3 can activate TrkA and TrkB receptors. TrKB transcripts are found throughout multiple structures of the central and peripheral nervous systems.
Probab=34.86 E-value=46 Score=17.73 Aligned_cols=15 Identities=20% Similarity=0.251 Sum_probs=11.9
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..+.|.|+|-+++..
T Consensus 53 ~~D~G~YtC~A~N~~ 67 (79)
T cd05855 53 HLNNGIYTLVAKNEY 67 (79)
T ss_pred cccCEEEEEEEEcCC
Confidence 357899999999863
No 71
>PF10572 UPF0556: Uncharacterised protein family UPF0556; InterPro: IPR018887 This family of proteins has no known function.
Probab=34.75 E-value=80 Score=19.64 Aligned_cols=26 Identities=23% Similarity=0.529 Sum_probs=15.7
Q ss_pred cCCCCeEEEEEECCCCCcEEEEcCce
Q psy7177 8 GVSAGHYTAYTLHPNTKEWHYFNDET 33 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~~~~W~~fdD~~ 33 (66)
+.+.+||+|.|-.+.......|-+=+
T Consensus 69 s~d~~~~sC~I~RPqgkSYL~F~qFk 94 (158)
T PF10572_consen 69 SDDNGHYSCIIWRPQGKSYLFFTQFK 94 (158)
T ss_pred cCCCCceEEEEECCCCCcEEEEEEEE
Confidence 45667777777766554555555444
No 72
>PRK05483 rplN 50S ribosomal protein L14; Validated
Probab=33.22 E-value=72 Score=18.82 Aligned_cols=19 Identities=11% Similarity=0.204 Sum_probs=16.0
Q ss_pred CCCcEEEEcCceEEEcCCC
Q psy7177 22 NTKEWHYFNDETVLEQRPQ 40 (66)
Q Consensus 22 ~~~~W~~fdD~~V~~~~~~ 40 (66)
.+|.|..|||..+--++.+
T Consensus 72 ~dG~~i~F~dNavVLin~~ 90 (122)
T PRK05483 72 PDGSYIRFDDNAAVLLNND 90 (122)
T ss_pred CCCCEEEcCCCEEEEECCC
Confidence 4788999999999888765
No 73
>TIGR03673 rpl14p_arch 50S ribosomal protein L14P. Part of the 50S ribosomal subunit. Forms a cluster with proteins L3 and L24e, part of which may contact the 16S rRNA in 2 intersubunit bridges.
Probab=32.99 E-value=78 Score=18.98 Aligned_cols=19 Identities=21% Similarity=0.275 Sum_probs=15.8
Q ss_pred CCCcEEEEcCceEEEcCCC
Q psy7177 22 NTKEWHYFNDETVLEQRPQ 40 (66)
Q Consensus 22 ~~~~W~~fdD~~V~~~~~~ 40 (66)
.+|.|+.|||..+--++.+
T Consensus 82 ~dGs~i~FddNa~VLin~~ 100 (131)
T TIGR03673 82 PDGTRVKFEDNAVVIVTPD 100 (131)
T ss_pred CCCcEEEeCCcEEEEECCC
Confidence 3788999999998888765
No 74
>PF04473 DUF553: Transglutaminase-like domain; InterPro: IPR007562 This entry represents a transglutaminase-like domain found in a family of uncharacterised archaeal proteins that had previously been called DUF553 and UPF0252.
Probab=32.72 E-value=81 Score=19.25 Aligned_cols=19 Identities=11% Similarity=0.204 Sum_probs=16.1
Q ss_pred CeEEEEEECCCCCcEEEEcCc
Q psy7177 12 GHYTAYTLHPNTKEWHYFNDE 32 (66)
Q Consensus 12 GHY~a~~~~~~~~~W~~fdD~ 32 (66)
||-.+.++- +|+||..|-.
T Consensus 109 ~Haa~aV~i--ng~~yvlDq~ 127 (153)
T PF04473_consen 109 GHAAVAVKI--NGKYYVLDQH 127 (153)
T ss_pred CeEEEEEEE--CCEEEEEeCC
Confidence 899999998 7899988743
No 75
>PRK04233 hypothetical protein; Provisional
Probab=32.07 E-value=38 Score=20.19 Aligned_cols=9 Identities=22% Similarity=0.851 Sum_probs=7.0
Q ss_pred CCcEEEEcC
Q psy7177 23 TKEWHYFND 31 (66)
Q Consensus 23 ~~~W~~fdD 31 (66)
+|+|++.|.
T Consensus 120 ~g~W~YvDg 128 (129)
T PRK04233 120 DGRWYYLDA 128 (129)
T ss_pred CCEEEEecC
Confidence 688998874
No 76
>COG3012 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.81 E-value=35 Score=21.04 Aligned_cols=12 Identities=17% Similarity=0.584 Sum_probs=9.5
Q ss_pred CCcEEEEcCceE
Q psy7177 23 TKEWHYFNDETV 34 (66)
Q Consensus 23 ~~~W~~fdD~~V 34 (66)
+++||+.|...-
T Consensus 119 ngrWyyiDgtv~ 130 (151)
T COG3012 119 NGRWYYIDGTVP 130 (151)
T ss_pred CCEEEEECCCCC
Confidence 688999987653
No 77
>cd05854 Ig6_Contactin-2 Sixth Ig domain of contactin-2. Ig6_Contactin-2: Sixth Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between IG domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=31.46 E-value=48 Score=17.46 Aligned_cols=14 Identities=29% Similarity=0.266 Sum_probs=10.4
Q ss_pred CCCCeEEEEEECCC
Q psy7177 9 VSAGHYTAYTLHPN 22 (66)
Q Consensus 9 ~~~GHY~a~~~~~~ 22 (66)
.++|-|+|.+++..
T Consensus 57 ~D~G~YtC~A~n~~ 70 (85)
T cd05854 57 SHAGTYTCTAQTVV 70 (85)
T ss_pred hhCeEEEEEEecCC
Confidence 56788888888753
No 78
>cd05744 Ig_Myotilin_C_like Immunoglobulin (Ig)-like domain of myotilin, palladin, and myopalladin. Ig_Myotilin_like_C: immunoglobulin (Ig)-like domain in myotilin, palladin, and myopalladin. Myotilin, palladin, and myopalladin function as scaffolds that regulate actin organization. Myotilin and myopalladin are most abundant in skeletal and cardiac muscle; palladin is ubiquitously expressed in the organs of developing vertebrates and plays a key role in cellular morphogenesis. The three family members each interact with specific molecular partners: all three bind to alpha-actinin; in addition, palladin also binds to vasodilator-stimulated phosphoprotein (VASP) and ezrin, myotilin binds to filamin and actin, and myopalladin also binds to nebulin and cardiac ankyrin repeat protein (CARP).
Probab=30.54 E-value=52 Score=16.78 Aligned_cols=15 Identities=27% Similarity=0.379 Sum_probs=11.4
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|.|+|.+.+..
T Consensus 50 ~~D~G~Y~C~a~N~~ 64 (75)
T cd05744 50 KEDAGWYTVSAVNEA 64 (75)
T ss_pred cccCEEEEEEEEcCC
Confidence 467888988888763
No 79
>TIGR03834 EAGR_box EAGR box. The EAGR box (Enriched in Aromatic and Glycine Residues) is found in three different proteins of the Mycoplasma genitalium terminal organelle, which acts in both cytadherence and gliding motility. The presence of this domain in a genome predicts the Mycoplasma-type terminal organelle structure, gliding motility, and cytadherence. The EAGR box may occur from one to nine times in a protein.
Probab=30.18 E-value=58 Score=14.44 Aligned_cols=19 Identities=32% Similarity=0.758 Sum_probs=12.0
Q ss_pred ecCCCCeEEEEEECCCCCcEE---EEcC
Q psy7177 7 KGVSAGHYTAYTLHPNTKEWH---YFND 31 (66)
Q Consensus 7 G~~~~GHY~a~~~~~~~~~W~---~fdD 31 (66)
|+...|||- .++.|. .||+
T Consensus 2 Gn~~YG~yd------~~~~WiW~GyFDe 23 (28)
T TIGR03834 2 GNEEYGYYD------ENGEWIWKGYFDE 23 (28)
T ss_pred CccccccCc------CCCcEEEeeEeCc
Confidence 566778883 256776 5554
No 80
>cd07702 Ig2_VEGFR-1 Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 1 (VEGFR-1). Ig2_VEGFR-1: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 1 (VEGFR-1). VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-1 binds VEGF-A strongly; VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGFR-1 may play an inhibitory rolet in the function of VEGFR-2 by binding VEGF-A and interfering with its interaction with VEGFR-2. VEGFR-1 has a signaling role in mediating monocyte chemotaxis and may mediate a chemotactic and a survival signal in hematopoietic stem cells or leukemia cells.
Probab=30.08 E-value=64 Score=16.76 Aligned_cols=16 Identities=25% Similarity=0.094 Sum_probs=11.9
Q ss_pred cCCCCeEEEEEECCCC
Q psy7177 8 GVSAGHYTAYTLHPNT 23 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~~ 23 (66)
..++|-|+|.+++...
T Consensus 45 ~~D~G~YtC~a~~~~~ 60 (72)
T cd07702 45 EEDAGIYTILLGIKQY 60 (72)
T ss_pred HHHCEEEEEEEEcccc
Confidence 3567999999888643
No 81
>PF05717 TnpB_IS66: IS66 Orf2 like protein; InterPro: IPR008878 Thess proteins are found in insertion sequences related to IS66. The function of these proteins is uncertain, but they are probably essential for transposition []. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=29.70 E-value=88 Score=17.84 Aligned_cols=38 Identities=11% Similarity=0.183 Sum_probs=25.1
Q ss_pred cCCCCeEEEEEECCCCCcEEEEcCceEEEcCCCcCCCCCeEEEEEEEcCCC
Q psy7177 8 GVSAGHYTAYTLHPNTKEWHYFNDETVLEQRPQDEDFNNAYILFYKKRDED 58 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~~~~~~~~~~aYiLfY~r~~~~ 58 (66)
.+.+||-+.|+-.. -+.|+.+-+ ..+.|.|+|+|.+..
T Consensus 31 dp~~g~~fvF~nr~---------r~riKiL~w----d~~G~~L~~KRLe~G 68 (107)
T PF05717_consen 31 DPFSGDLFVFCNRR---------RDRIKILYW----DGDGFWLYYKRLERG 68 (107)
T ss_pred CCCcceEEEEEecc---------CCceEEEec----cCCceEEeeeeecCC
Confidence 56677777666553 245555533 478999999987653
No 82
>cd04976 Ig2_VEGFR Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor (VEGFR). Ig2_VEGFR: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor (VEGFR). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. The VEGFR family consists of three members, VEGFR-1 (Flt-1), VEGFR-2 (KDR/Flk-1) and VEGFR-3 (Flt-4). VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGFR-2 is a major mediator of the mitogenic, angiogenic and microvascular permeability-enhancing effects of VEGF-A. VEGFR-1 may play an inhibitory part in these processes by binding VEGF and interfering with its interaction with VEGFR-2. VEGFR-1 has a signa
Probab=29.35 E-value=48 Score=16.75 Aligned_cols=15 Identities=27% Similarity=0.357 Sum_probs=12.0
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|-|+|.+.+..
T Consensus 44 ~~D~G~YtC~a~N~~ 58 (71)
T cd04976 44 EEDAGNYTVVLTNKQ 58 (71)
T ss_pred HHHCEEEEEEEEcCC
Confidence 467899999999864
No 83
>cd04971 Ig_TrKABC_d5 Fifth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d5: the fifth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrkB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor foun
Probab=29.10 E-value=95 Score=16.25 Aligned_cols=15 Identities=27% Similarity=0.361 Sum_probs=12.0
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|-|+|.+.+..
T Consensus 55 ~~D~G~YtC~A~N~~ 69 (81)
T cd04971 55 HVNNGNYTLVASNEY 69 (81)
T ss_pred cccCeEEEEEEEeCC
Confidence 357899999999864
No 84
>cd05735 Ig8_DSCAM Eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig8_DSCAM: the eight immunoglobulin (Ig) domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=28.85 E-value=62 Score=17.10 Aligned_cols=15 Identities=7% Similarity=0.306 Sum_probs=11.1
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|-|+|.+++..
T Consensus 56 ~~D~G~YtC~A~N~~ 70 (88)
T cd05735 56 REDSGFFSCHAINSY 70 (88)
T ss_pred cccCEEEEEEEEcCC
Confidence 457788888888763
No 85
>PF02750 Synapsin_C: Synapsin, ATP binding domain; InterPro: IPR020898 The synapsins are a family of neuron-specific phosphoproteins that coat synaptic vesicles and are involved in the binding between these vesicles and the cytoskeleton (including actin filaments). The family comprises 5 homologous proteins Ia, Ib, IIa, IIb and III. Synapsins I, II, and III are encoded by 3 different genes. The a and b isoforms of synapsin I and II are splice variants of the primary transcripts []. Synapsin I is mainly associated with regulation of neurotransmitter release from presynaptic neuron terminals []. Synapsin II, as well as being involved in neurotransmitter release, has a role in the synaptogenesis and synaptic plasticity responsible for long term potentiation []. Recent studies implicate synapsin III with a developmental role in neurite elongation and synapse formation that is distinct from the functions of synapsins I and II []. Structurally, synapsins are multidomain proteins, of which 3 domains are common to all the mammalian forms. The N-terminal `A' domain is ~30 residues long and contains a serine residue that serves as an acceptor site for protein kinase-mediated phosphorylation. This is followed by the `B' linker domain, which is ~80 residues long and is relatively poorly conserved. Domain `C' is the longest, spanning approximately 300 residues. This domain is highly conserved across all the synapsins (including those from Drosophila) and is possessed by all splice variants. The remaining six domains, D-I, are not shared by all the synapsins and differ both between the primary transcripts and the splice variants. This entry represent the ATP-grasp fold found in synapsins, which is responsible for Ca dependent ATP binding. ; PDB: 1PX2_A 1PK8_F 1AUV_B 1AUX_A 2P0A_A 1I7N_A 1I7L_A.
Probab=28.61 E-value=32 Score=22.19 Aligned_cols=17 Identities=24% Similarity=0.604 Sum_probs=13.1
Q ss_pred CCeEEEEEECCCCCcEE
Q psy7177 11 AGHYTAYTLHPNTKEWH 27 (66)
Q Consensus 11 ~GHY~a~~~~~~~~~W~ 27 (66)
+.+|.||.|.+.++.|-
T Consensus 107 G~~ykA~~R~sis~nWK 123 (203)
T PF02750_consen 107 GNNYKAYMRTSISGNWK 123 (203)
T ss_dssp TTEEEEEEEEESSSTSS
T ss_pred cCeEEEEEEcccccccc
Confidence 57999999986556664
No 86
>cd05869 Ig5_NCAM-1 Fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). Ig5_NCAM-1: The fifth immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM) and heterophilic (NCAM-non-NCAM) interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the surface of opposing cells (tr
Probab=28.58 E-value=1e+02 Score=16.39 Aligned_cols=14 Identities=21% Similarity=0.287 Sum_probs=10.7
Q ss_pred CCCCeEEEEEECCC
Q psy7177 9 VSAGHYTAYTLHPN 22 (66)
Q Consensus 9 ~~~GHY~a~~~~~~ 22 (66)
.+.|.|+|.+++..
T Consensus 73 ~D~G~Y~C~A~N~~ 86 (97)
T cd05869 73 TDAGEYLCTASNTI 86 (97)
T ss_pred CcCEEEEEEEecCC
Confidence 46788888888764
No 87
>PRK00183 hypothetical protein; Provisional
Probab=27.97 E-value=42 Score=20.65 Aligned_cols=11 Identities=9% Similarity=0.570 Sum_probs=8.8
Q ss_pred CCcEEEEcCce
Q psy7177 23 TKEWHYFNDET 33 (66)
Q Consensus 23 ~~~W~~fdD~~ 33 (66)
+++|++.|...
T Consensus 120 ~g~W~YvDG~~ 130 (157)
T PRK00183 120 QGRWYFIDPTV 130 (157)
T ss_pred CCEEEeccccC
Confidence 68999998744
No 88
>KOG1415|consensus
Probab=27.94 E-value=70 Score=20.91 Aligned_cols=24 Identities=4% Similarity=0.087 Sum_probs=19.3
Q ss_pred CeEEEEEECCCCCcEEEEcCceEEEc
Q psy7177 12 GHYTAYTLHPNTKEWHYFNDETVLEQ 37 (66)
Q Consensus 12 GHY~a~~~~~~~~~W~~fdD~~V~~~ 37 (66)
=||+||+.. +|.-|..|...--++
T Consensus 160 ~HFI~~v~~--~G~lYELDgR~~fPi 183 (222)
T KOG1415|consen 160 LHFICFVNK--NGHLYELDGRKPFPI 183 (222)
T ss_pred eEEEEEEcc--CCeEEEecCCcCCCc
Confidence 399999996 789999998755444
No 89
>PRK10579 hypothetical protein; Provisional
Probab=27.82 E-value=1.2e+02 Score=17.16 Aligned_cols=36 Identities=17% Similarity=0.404 Sum_probs=25.0
Q ss_pred EEECCCCCcEEEEcCceEEEcCCCc-----CCCCCeEEEEE
Q psy7177 17 YTLHPNTKEWHYFNDETVLEQRPQD-----EDFNNAYILFY 52 (66)
Q Consensus 17 ~~~~~~~~~W~~fdD~~V~~~~~~~-----~~~~~aYiLfY 52 (66)
.++.+....|-.|.--.--.++.+. +....+|+..|
T Consensus 53 ~V~Lpg~~ew~~~~aG~sF~VpanssF~l~v~~~t~Y~C~y 93 (94)
T PRK10579 53 NVLLPGATDWQVYEAGEVFNVPGHSEFHLQVAEPTSYLCRY 93 (94)
T ss_pred EEECCCCcccEEeCCCCEEEECCCCeEEEEECcceeeEEEc
Confidence 5566666789888877666666554 44578888776
No 90
>cd07693 Ig1_Robo First immunoglobulin (Ig)-like domain in Robo (roundabout) receptors and similar proteins. Ig1_Robo: domain similar to the first immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit res
Probab=27.56 E-value=56 Score=16.86 Aligned_cols=14 Identities=14% Similarity=0.254 Sum_probs=11.3
Q ss_pred cCCCCeEEEEEECC
Q psy7177 8 GVSAGHYTAYTLHP 21 (66)
Q Consensus 8 ~~~~GHY~a~~~~~ 21 (66)
..+.|.|+|.+.+.
T Consensus 73 ~~D~G~Y~C~a~N~ 86 (100)
T cd07693 73 RSDEGVYVCVAHNS 86 (100)
T ss_pred cCcCEEEEEEEEcc
Confidence 45889999999875
No 91
>TIGR01067 rplN_bact ribosomal protein L14, bacterial/organelle. This model distinguishes bacterial and most organellar examples of ribosomal protein L14 from all archaeal and eukaryotic forms.
Probab=27.30 E-value=72 Score=18.80 Aligned_cols=19 Identities=11% Similarity=0.228 Sum_probs=15.9
Q ss_pred CCCcEEEEcCceEEEcCCC
Q psy7177 22 NTKEWHYFNDETVLEQRPQ 40 (66)
Q Consensus 22 ~~~~W~~fdD~~V~~~~~~ 40 (66)
.+|.|+.|||..+--++.+
T Consensus 72 ~dG~~i~F~~Na~VLin~~ 90 (122)
T TIGR01067 72 KDGSYIRFDDNACVLINKN 90 (122)
T ss_pred CCCCEEECCCceEEEECCC
Confidence 3788999999998888765
No 92
>cd04967 Ig1_Contactin First Ig domain of contactin. Ig1_Contactin: First Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 weeks postnata
Probab=27.14 E-value=58 Score=17.09 Aligned_cols=14 Identities=21% Similarity=0.278 Sum_probs=11.9
Q ss_pred CCCCeEEEEEECCC
Q psy7177 9 VSAGHYTAYTLHPN 22 (66)
Q Consensus 9 ~~~GHY~a~~~~~~ 22 (66)
.+.|.|+|.+++..
T Consensus 69 ~d~G~Y~C~a~N~~ 82 (91)
T cd04967 69 KDAGRYQCLASNIV 82 (91)
T ss_pred CCCEEEEEEEEcCC
Confidence 68999999999863
No 93
>cd05895 Ig_Pro_neuregulin-1 Immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. Ig_Pro_neuregulin-1: immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. There are many NRG-1 isoforms which arise from the alternative splicing of mRNA. NRG-1 belongs to the neuregulin gene family, which is comprised of four genes. This group represents NRG-1. NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, and heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. The NRG-1 protein binds to and activates the tyrosine kinases receptors ErbB3 and ErbB4, initiating signaling cascades. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitters receptors; in the peripheral nervous system NRG-1 regulates process
Probab=27.08 E-value=65 Score=16.16 Aligned_cols=15 Identities=13% Similarity=0.118 Sum_probs=11.8
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|-|+|.+.+..
T Consensus 53 ~~DsG~Y~C~a~N~~ 67 (76)
T cd05895 53 LADNGEYKCMVSSKL 67 (76)
T ss_pred cccCEEEEEEEEeCC
Confidence 467899999998763
No 94
>cd05863 Ig2_VEGFR-3 Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 3 (VEGFR-3). Ig2_VEGFR-3: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 3 (VEGFR-3). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-3 (Flt-4) binds two members of the VEGF family (VEGF-C and -D) and is involved in tumor angiogenesis and growth.
Probab=26.94 E-value=55 Score=16.74 Aligned_cols=15 Identities=33% Similarity=0.390 Sum_probs=11.8
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|-|+|-+.+..
T Consensus 40 ~~D~G~YtC~a~N~~ 54 (67)
T cd05863 40 EASAGTYTLVLWNSA 54 (67)
T ss_pred HHHCEEEEEEEEECC
Confidence 357899999998863
No 95
>cd05730 Ig3_NCAM-1_like Third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). Ig3_NCAM-1_like: domain similar to the third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-non-NCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1,and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the
Probab=26.39 E-value=63 Score=16.98 Aligned_cols=15 Identities=13% Similarity=0.231 Sum_probs=12.1
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..+.|.|+|.+.+..
T Consensus 67 ~~D~G~Y~C~a~N~~ 81 (95)
T cd05730 67 KLDEAEYTCIAENKA 81 (95)
T ss_pred hhhCEEEEEEEEcCC
Confidence 457899999999864
No 96
>PF06680 DUF1181: Protein of unknown function (DUF1181); InterPro: IPR009566 This family consists of several hypothetical proteins of around 120 residues in length which are found specifically in Trypanosoma brucei. The function of this family is unknown.
Probab=26.31 E-value=71 Score=18.38 Aligned_cols=25 Identities=28% Similarity=0.708 Sum_probs=19.2
Q ss_pred CCCeEEEEEECCCCCcEEEEcCceE
Q psy7177 10 SAGHYTAYTLHPNTKEWHYFNDETV 34 (66)
Q Consensus 10 ~~GHY~a~~~~~~~~~W~~fdD~~V 34 (66)
.+|||.-++-...+..|-+-|.-.|
T Consensus 43 ~~ghy~~wvargesedwdysnsfvv 67 (120)
T PF06680_consen 43 NSGHYHRWVARGESEDWDYSNSFVV 67 (120)
T ss_pred ccCceEEeeecCCccccCccCceEE
Confidence 5799999998877778877766443
No 97
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=26.26 E-value=64 Score=17.69 Aligned_cols=23 Identities=30% Similarity=0.514 Sum_probs=13.1
Q ss_pred cCCCCeEEEEEECCCCCcEEEEcC
Q psy7177 8 GVSAGHYTAYTLHPNTKEWHYFND 31 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~~~~W~~fdD 31 (66)
....|||+..+--..++ ++.++|
T Consensus 120 ~~~~~H~vvi~Gy~~~~-~~~v~D 142 (144)
T PF13529_consen 120 GTYGGHYVVIIGYDEDG-YVYVND 142 (144)
T ss_dssp E-TTEEEEEEEEE-SSE--EEEE-
T ss_pred CCcCCEEEEEEEEeCCC-EEEEeC
Confidence 34689999888754343 566655
No 98
>cd04974 Ig3_FGFR Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig3_FGFR: third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs.
Probab=26.17 E-value=1e+02 Score=16.20 Aligned_cols=14 Identities=29% Similarity=0.306 Sum_probs=10.4
Q ss_pred CCCCeEEEEEECCC
Q psy7177 9 VSAGHYTAYTLHPN 22 (66)
Q Consensus 9 ~~~GHY~a~~~~~~ 22 (66)
.+.|.|+|.+.+..
T Consensus 65 ~D~G~Y~C~A~N~~ 78 (90)
T cd04974 65 DDAGEYTCLAGNSI 78 (90)
T ss_pred ccCcEEEEEeeccc
Confidence 57788888888753
No 99
>cd05893 Ig_Palladin_C C-terminal immunoglobulin (Ig)-like domain of palladin. Ig_Palladin_C: C-terminal immunoglobulin (Ig)-like domain of palladin. Palladin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to this family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Palladin binds to alpha-actinin ezrin, vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP, mDia interacting protein), and Src. Palladin also binds F-actin directly, via its Ig3 domain. Palladin is expressed as several alternatively spliced isoforms, having various combinations of Ig-like domains, in a cell-type-specific manner. It has been suggested that palladin's different Ig-like domains may be specialized for distinct functions.
Probab=25.77 E-value=65 Score=16.62 Aligned_cols=13 Identities=31% Similarity=0.373 Sum_probs=9.9
Q ss_pred CCCCeEEEEEECC
Q psy7177 9 VSAGHYTAYTLHP 21 (66)
Q Consensus 9 ~~~GHY~a~~~~~ 21 (66)
.++|-|+|.+.+.
T Consensus 51 ~D~G~Y~C~A~N~ 63 (75)
T cd05893 51 EDAGWYTVSAKNE 63 (75)
T ss_pred HHCEEEEEEEEcC
Confidence 4678888888875
No 100
>cd05740 Ig_CEACAM_D4 Fourth immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM). Ig_CEACAM_D4: immunoglobulin (Ig)-like domain 4 in carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) protein subfamily. The CEA family is a group of anchored or secreted glycoproteins, expressed by epithelial cells, leukocytes, endothelial cells and placenta. The CEA family is divided into the CEACAM and pregnancy-specific glycoprotein (PSG) subfamilies. This group represents the CEACAM subfamily. CEACAM1 has many important cellular functions, it is a cell adhesion molecule, and a signaling molecule that regulates the growth of tumor cells, it is an angiogenic factor, and is a receptor for bacterial and viral pathogens, including mouse hepatitis virus (MHV). In mice, four isoforms of CEACAM1 generated by alternative splicing have either two [D1, D4] or four [D1-D4] Ig-like domains on the cell surface. This family corresponds to the
Probab=25.47 E-value=67 Score=17.18 Aligned_cols=15 Identities=20% Similarity=0.319 Sum_probs=12.1
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..+.|.|.|-+++..
T Consensus 64 ~~D~G~Y~C~a~N~~ 78 (91)
T cd05740 64 RSDTGHYQCEASNEV 78 (91)
T ss_pred hhhCEEEEEEEEcCC
Confidence 357899999999864
No 101
>cd05746 Ig4_Peroxidasin Fourth immunoglobulin (Ig)-like domain of peroxidasin. Ig4_Peroxidasin: the fourth immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted, and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells, which have undergone programmed cell death, and protection of the organism against non-self.
Probab=25.36 E-value=60 Score=16.08 Aligned_cols=15 Identities=13% Similarity=0.186 Sum_probs=12.0
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..+.|.|+|.+.+..
T Consensus 45 ~~D~G~Y~C~a~N~~ 59 (69)
T cd05746 45 VADQGRYECVARNTI 59 (69)
T ss_pred hhhCEEEEEEEECCC
Confidence 467899999998863
No 102
>cd05892 Ig_Myotilin_C C-terminal immunoglobulin (Ig)-like domain of myotilin. Ig_Myotilin_C: C-terminal immunoglobulin (Ig)-like domain of myotilin. Mytolin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to the latter family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Myotilin is most abundant in skeletal and cardiac muscle, and is involved in maintaining sarcomere integrity. It binds to alpha-actinin, filamin and actin. Mutations in myotilin lead to muscle disorders.
Probab=24.78 E-value=70 Score=16.56 Aligned_cols=15 Identities=27% Similarity=0.384 Sum_probs=10.8
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..+.|.|+|.+.+..
T Consensus 50 ~~D~G~Y~C~A~N~~ 64 (75)
T cd05892 50 KKDAGWYTVSAVNEA 64 (75)
T ss_pred hhhCEEEEEEEEcCc
Confidence 456788888888753
No 103
>COG5263 FOG: Glucan-binding domain (YG repeat) [General function prediction only]
Probab=24.42 E-value=1.1e+02 Score=20.95 Aligned_cols=15 Identities=20% Similarity=0.598 Sum_probs=8.2
Q ss_pred CCcEEEEcCceEEEc
Q psy7177 23 TKEWHYFNDETVLEQ 37 (66)
Q Consensus 23 ~~~W~~fdD~~V~~~ 37 (66)
++.||.|||..+...
T Consensus 221 ~GsWYyl~~~g~m~t 235 (313)
T COG5263 221 KGSWYYLDDSGDMAT 235 (313)
T ss_pred cceEEEEecCcccee
Confidence 456666665554443
No 104
>cd04969 Ig5_Contactin_like Fifth Ig domain of contactin. Ig5_Contactin_like: Fifth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=24.24 E-value=63 Score=16.12 Aligned_cols=14 Identities=21% Similarity=0.399 Sum_probs=11.6
Q ss_pred cCCCCeEEEEEECC
Q psy7177 8 GVSAGHYTAYTLHP 21 (66)
Q Consensus 8 ~~~~GHY~a~~~~~ 21 (66)
..+.|.|+|.+.+.
T Consensus 48 ~~D~G~Y~C~a~N~ 61 (73)
T cd04969 48 KSDEGKYTCFAENF 61 (73)
T ss_pred hhHCEEEEEEEECC
Confidence 45789999999986
No 105
>cd05876 Ig3_L1-CAM Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig3_L1-CAM: third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Probab=24.24 E-value=69 Score=16.00 Aligned_cols=15 Identities=13% Similarity=0.271 Sum_probs=11.8
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..+.|.|+|.+.+..
T Consensus 45 ~~D~G~Y~C~a~N~~ 59 (71)
T cd05876 45 ESDDGEYVCTAENSE 59 (71)
T ss_pred HHhCEEEEEEEEcCC
Confidence 457899999998863
No 106
>cd04978 Ig4_L1-NrCAM_like Fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). Ig4_L1-NrCAM_like: fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). These proteins belong to the L1 subfamily of cell adhesion molecules (CAMs) and are comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. These molecules are primarily expressed in the nervous system. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.
Probab=24.11 E-value=71 Score=15.79 Aligned_cols=13 Identities=8% Similarity=0.059 Sum_probs=10.7
Q ss_pred CCCCeEEEEEECC
Q psy7177 9 VSAGHYTAYTLHP 21 (66)
Q Consensus 9 ~~~GHY~a~~~~~ 21 (66)
.+.|.|+|.+++.
T Consensus 51 ~D~G~Y~C~A~N~ 63 (76)
T cd04978 51 NDTAVYQCNASNV 63 (76)
T ss_pred hhCEEEEEEEEcc
Confidence 5678999999875
No 107
>cd05712 Ig_Siglec_N Immunoglobulin (Ig) domain at the N terminus of Siglec (sialic acid-binding Ig-like lectins). Ig_Siglec_N: immunoglobulin (Ig) domain at the N terminus of Siglec (sialic acid-binding Ig-like lectins). Siglec refers to a structurally related protein family that specifically recognizes sialic acid in oligosaccharide chains of glycoproteins and glycolipids. Siglecs are type I transmembrane proteins, organized as an extracellular module composed of Ig-like domains (an N-terminal variable set of Ig-like carbohydrate recognition domains, and 1 to 16 constant Ig-like domains), followed by transmembrane and short cytoplasmic domains. Human siglecs are classified into two subgroups, one subgroup is comprised of sialoadhesin (Siglec-1), CD22 (Siglec-2), and MAG, the other subgroup is comprised of CD33-related Siglecs which include CD33 (Siglec-3) and human Siglecs 5-11.
Probab=24.09 E-value=1.4e+02 Score=16.58 Aligned_cols=25 Identities=8% Similarity=0.292 Sum_probs=12.6
Q ss_pred CCCCeEEEEEECCCCCcEEEEcCce
Q psy7177 9 VSAGHYTAYTLHPNTKEWHYFNDET 33 (66)
Q Consensus 9 ~~~GHY~a~~~~~~~~~W~~fdD~~ 33 (66)
.++|-|.|.+....+.+|.--.+..
T Consensus 90 ~Dsg~Y~C~~~~~~~~~~~f~~gt~ 114 (119)
T cd05712 90 EDSGKYFFRVELGDSNKYNFLKNTL 114 (119)
T ss_pred hhCeEEEEEEEcCCccEEEecCCeE
Confidence 4566666666554344444333333
No 108
>cd05852 Ig5_Contactin-1 Fifth Ig domain of contactin-1. Ig5_Contactin-1: fifth Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=23.90 E-value=74 Score=16.25 Aligned_cols=15 Identities=20% Similarity=0.488 Sum_probs=11.8
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..+.|-|+|.+.+..
T Consensus 48 ~~D~G~Y~C~A~N~~ 62 (73)
T cd05852 48 KLDEGSYTCFAENNR 62 (73)
T ss_pred hhHCEEEEEEEECCC
Confidence 457899999999863
No 109
>cd05853 Ig6_Contactin-4 Sixth Ig domain of contactin-4. Ig6_Contactin-4: sixth Ig domain of the neural cell adhesion molecule contactin-4. Contactins are neural cell adhesion molecules, and are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. Highest expresson of contactin-4 is in testes, thyroid, small intestine, uterus and brain. Contactin-4 plays a role in the response of neuroblastoma cells to differentiating agents, such as retinoids. The contactin 4 gene is associated with cerebellar degeneration in spinocerebellar ataxia type 16.
Probab=23.88 E-value=61 Score=17.48 Aligned_cols=14 Identities=21% Similarity=0.290 Sum_probs=10.5
Q ss_pred CCCCeEEEEEECCC
Q psy7177 9 VSAGHYTAYTLHPN 22 (66)
Q Consensus 9 ~~~GHY~a~~~~~~ 22 (66)
.++|-|+|.+++..
T Consensus 57 ~dsG~YtC~a~n~~ 70 (85)
T cd05853 57 KHAGKYVCMVQTSV 70 (85)
T ss_pred HHCEEEEEEEEccc
Confidence 35688999998753
No 110
>cd05736 Ig2_Follistatin_like Second immunoglobulin (Ig)-like domain of a follistatin-like molecule encoded by the Mahya gene and similar proteins. Ig2_Follistatin_like: domain similar to the second immunoglobulin (Ig)-like domain found in a follistatin-like molecule encoded by the CNS-related Mahya gene. Mahya genes have been retained in certain Bilaterian branches during evolution. They are conserved in Hymenoptera and Deuterostomes, but are absent from other metazoan species such as fruit fly and nematode. Mahya proteins are secretory, with a follistatin-like domain (Kazal-type serine/threonine protease inhibitor domain and EF-hand calcium-binding domain), two Ig-like domains, and a novel C-terminal domain. Mahya may be involved in learning and memory and in processing of sensory information in Hymenoptera and vertebrates. Follistatin is a secreted, multidomain protein that binds activins with high affinity and antagonizes their signaling.
Probab=23.64 E-value=86 Score=15.65 Aligned_cols=14 Identities=21% Similarity=0.247 Sum_probs=10.8
Q ss_pred cCCCCeEEEEEECC
Q psy7177 8 GVSAGHYTAYTLHP 21 (66)
Q Consensus 8 ~~~~GHY~a~~~~~ 21 (66)
..+.|.|+|.+.+.
T Consensus 48 ~~D~G~Y~C~a~N~ 61 (76)
T cd05736 48 YEDTGAYTCIAKNE 61 (76)
T ss_pred cccCEEEEEEEEcC
Confidence 35778898888875
No 111
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=23.22 E-value=94 Score=15.72 Aligned_cols=17 Identities=18% Similarity=0.178 Sum_probs=13.3
Q ss_pred CcEEEEcCceEEEcCCC
Q psy7177 24 KEWHYFNDETVLEQRPQ 40 (66)
Q Consensus 24 ~~W~~fdD~~V~~~~~~ 40 (66)
.+||+.+|+.|+.....
T Consensus 5 ~~yY~v~~~kv~rk~~~ 21 (51)
T COG1998 5 LKYYEVDDEKVKRKNRF 21 (51)
T ss_pred eeEEEEcCCcEEEcccc
Confidence 47999999998876543
No 112
>cd05865 Ig1_NCAM-1 First immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1. Ig1_NCAM-1: first immunoglobulin (Ig)-like domain of neural cell adhesion molecule NCAM-1. NCAM-1 plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-nonNCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves the Ig1, Ig2, and Ig3 domains. By this model, Ig1 and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the surface of opposing cells (trans
Probab=22.95 E-value=1.2e+02 Score=16.41 Aligned_cols=14 Identities=21% Similarity=0.140 Sum_probs=11.0
Q ss_pred cCCCCeEEEEEECC
Q psy7177 8 GVSAGHYTAYTLHP 21 (66)
Q Consensus 8 ~~~~GHY~a~~~~~ 21 (66)
..++|-|+|.+.+.
T Consensus 69 ~~D~G~YtC~A~N~ 82 (96)
T cd05865 69 IDDAGIYKCVVSNE 82 (96)
T ss_pred hhhCEEEEEEEEcC
Confidence 46788888888875
No 113
>PF08053 Tna_leader: Tryptophanese operon leader peptide; InterPro: IPR012620 This entry defines the apparent leader peptides of tryptophanase operons in Escherichia coli, Vibrio cholerae, Photobacterium profundum, Haemophilus influenzae, and related species. It has been suggested that these peptides act in cis to alter the behaviour of the translating ribosome []. The tryptophanese (tna) operon leader peptide catalyses the degradation of L-tryptophan to indole, pyruvate and ammonia, enabling the bacteria to utilise tryptophan as a source of carbon, nitrogen and energy. The tna operon of Escherichia coli contains two major structural genes, tnaA and tnaB. Preceding tnaA in the tna operon is a 319 -nucleotide transcribed regulatory region that contains the coding region for a 24-residue leader peptide, TnaC. The RNA sequence in the vicinity of the tnaC stop codon is rich in Cytidylate residues which is required for efficient Rho -dependent termination in the leader region of the tna operon [].; GO: 0031554 regulation of transcription termination, DNA-dependent, 0031556 transcriptional attenuation by ribosome
Probab=22.89 E-value=52 Score=13.74 Aligned_cols=12 Identities=25% Similarity=0.720 Sum_probs=8.9
Q ss_pred CCcEEEEcCceE
Q psy7177 23 TKEWHYFNDETV 34 (66)
Q Consensus 23 ~~~W~~fdD~~V 34 (66)
..+|+..|...|
T Consensus 9 tskwfnidnkiv 20 (24)
T PF08053_consen 9 TSKWFNIDNKIV 20 (24)
T ss_pred eeeeEeccCeec
Confidence 358999987655
No 114
>cd04973 Ig1_FGFR First immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig1_FGFR: The first immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for all FGFs.
Probab=22.70 E-value=88 Score=16.00 Aligned_cols=14 Identities=29% Similarity=0.437 Sum_probs=11.3
Q ss_pred cCCCCeEEEEEECC
Q psy7177 8 GVSAGHYTAYTLHP 21 (66)
Q Consensus 8 ~~~~GHY~a~~~~~ 21 (66)
..++|-|+|.+++.
T Consensus 54 ~~DsG~Y~C~a~n~ 67 (79)
T cd04973 54 PRDSGLYACVTSSP 67 (79)
T ss_pred hhhCEEEEEEEeCC
Confidence 45789999999875
No 115
>PF10949 DUF2777: Protein of unknown function (DUF2777); InterPro: IPR024488 This family of proteins with unknown function appears to be restricted to Bacillaceae.
Probab=22.70 E-value=1.8e+02 Score=18.50 Aligned_cols=19 Identities=26% Similarity=0.550 Sum_probs=13.2
Q ss_pred CCcEEEEcCceEEEcCCCc
Q psy7177 23 TKEWHYFNDETVLEQRPQD 41 (66)
Q Consensus 23 ~~~W~~fdD~~V~~~~~~~ 41 (66)
+++|+.|||+.=+....++
T Consensus 25 ~~~w~ffddE~~Eallle~ 43 (185)
T PF10949_consen 25 NDEWIFFDDEEGEALLLEE 43 (185)
T ss_pred eCcEEEEEcCCCceechhh
Confidence 5789999998755444433
No 116
>PRK01842 hypothetical protein; Provisional
Probab=22.67 E-value=86 Score=19.19 Aligned_cols=10 Identities=20% Similarity=0.900 Sum_probs=8.2
Q ss_pred CCcEEEEcCc
Q psy7177 23 TKEWHYFNDE 32 (66)
Q Consensus 23 ~~~W~~fdD~ 32 (66)
+|+|++.|..
T Consensus 137 ~G~W~YvDG~ 146 (149)
T PRK01842 137 QGRWRYVDGD 146 (149)
T ss_pred CCeEEEeCCE
Confidence 6899999863
No 117
>cd05856 Ig2_FGFRL1-like Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal peptide, three extracellular Ig-like modules, a transmembrane segment, and a short intracellular domain. FGFRL1 is expressed preferentially in skeletal tissues. Similar to FGF receptors, the expressed protein interacts specifically with heparin and with FGF2. FGFRL1 does not have a protein tyrosine kinase domain at its C terminus; neither does its cytoplasmic domain appear to interact with a signaling partner. It has been suggested that FGFRL1 may not have any direct signaling function, but instead acts as a decoy receptor trapping FGFs and preventing them from binding other receptors.
Probab=22.62 E-value=82 Score=15.64 Aligned_cols=15 Identities=20% Similarity=0.264 Sum_probs=11.5
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|-|+|.+.+..
T Consensus 57 ~~D~G~Y~C~a~N~~ 71 (82)
T cd05856 57 PEDSGKYTCHVSNRA 71 (82)
T ss_pred hhhCEEEEEEEEcCC
Confidence 457889999988753
No 118
>cd05879 Ig_P0 Immunoglobulin (Ig)-like domain of Protein zero (P0). Ig_P0ex: immunoglobulin (Ig) domain of Protein zero (P0). P0 accounts for over 50% of the total protein in peripheral nervous system (PNS) myelin. P0 is a single-pass transmembrane glycoprotein with a highly basic intracellular domain and an Ig domain. The extracellular domain of P0 (P0-ED) is similar to the Ig variable domain, carrying one acceptor sequence for N-linked glycosylation. P0 plays a role in membrane adhesion in the spiral wraps of the myelin sheath. The intracellular domain is thought to mediate membrane apposition of the cytoplasmic faces and may, through electrostatic interactions, interact directly with lipid headgroups. It is thought that homophilic interactions of the P0 extracellular domain mediate membrane juxtaposition in the extracellular space of PNS myelin.
Probab=22.56 E-value=1.1e+02 Score=17.36 Aligned_cols=15 Identities=20% Similarity=0.304 Sum_probs=11.4
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
-.+.|-|+|.++++.
T Consensus 89 ~sD~G~Y~C~v~n~p 103 (116)
T cd05879 89 YTDNGTFTCDVKNPP 103 (116)
T ss_pred cccCEEEEEEEEcCC
Confidence 357788999988764
No 119
>cd05894 Ig_C5_MyBP-C C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). Ig_C5_MyBP_C : the C5 immunoglobulin (Ig) domain of cardiac myosin binding protein C (MyBP-C). MyBP_C consists of repeated domains, Ig and fibronectin type 3, and various linkers. Three isoforms of MYBP_C exist and are included in this group: cardiac(c), and fast and slow skeletal muscle (s) MyBP_C. cMYBP_C has insertions between and inside domains and an additional cardiac-specific Ig domain at the N-terminus. For cMYBP_C an interaction has been demonstrated between this C5 domain and the Ig C8 domain.
Probab=22.33 E-value=88 Score=16.45 Aligned_cols=15 Identities=27% Similarity=0.337 Sum_probs=12.1
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|.|++.+++..
T Consensus 61 ~~D~G~Y~c~a~N~~ 75 (86)
T cd05894 61 REDEGVYTITVTNPV 75 (86)
T ss_pred cCcCEEEEEEEEeCC
Confidence 467899999999863
No 120
>cd05864 Ig2_VEGFR-2 Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 2 (VEGFR-2). Ig2_VEGF-2: Second immunoglobulin (Ig)-like domain of vascular endothelial growth factor receptor 2 (VEGFR-2). The VEGFRs have an extracellular component with seven Ig-like domains, a transmembrane segment, and an intracellular tyrosine kinase domain interrupted by a kinase-insert domain. VEGFRs bind VEGFs with high affinity at the Ig-like domains. VEGFR-2 (KDR/Flk-1) is a major mediator of the mitogenic, angiogenic and microvascular permeability-enhancing effects of VEGF-A; VEGF-A is important to the growth and maintenance of vascular endothelial cells and to the development of new blood- and lymphatic-vessels in physiological and pathological states. VEGF-A also interacts with VEGFR-1, which it binds more strongly than VEGFR-2. VEGFR-2 and -1 may mediate a chemotactic and a survival signal in hematopoietic stem cells or leukemia cells.
Probab=22.30 E-value=65 Score=16.47 Aligned_cols=15 Identities=33% Similarity=0.465 Sum_probs=11.6
Q ss_pred cCCCCeEEEEEECCC
Q psy7177 8 GVSAGHYTAYTLHPN 22 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~ 22 (66)
..++|-|+|.+.+..
T Consensus 43 ~~D~G~YtC~a~N~~ 57 (70)
T cd05864 43 EKDAGNYTVVLTNPI 57 (70)
T ss_pred HHHCEEEEEEEEECC
Confidence 356799999998863
No 121
>PRK09806 tryptophanase leader peptide; Provisional
Probab=21.84 E-value=63 Score=13.75 Aligned_cols=13 Identities=23% Similarity=0.562 Sum_probs=9.6
Q ss_pred CCcEEEEcCceEE
Q psy7177 23 TKEWHYFNDETVL 35 (66)
Q Consensus 23 ~~~W~~fdD~~V~ 35 (66)
..+|+..|...|.
T Consensus 9 tskwfnidnkivd 21 (26)
T PRK09806 9 TSKWFNIDNKIVD 21 (26)
T ss_pred eeeEEeccCeeec
Confidence 3589999877664
No 122
>PRK01617 hypothetical protein; Provisional
Probab=21.36 E-value=69 Score=19.56 Aligned_cols=15 Identities=13% Similarity=0.578 Sum_probs=10.9
Q ss_pred EEECCCCCcEEEEcCce
Q psy7177 17 YTLHPNTKEWHYFNDET 33 (66)
Q Consensus 17 ~~~~~~~~~W~~fdD~~ 33 (66)
|+|. +|+|++.|...
T Consensus 117 F~r~--~g~W~Yvdg~~ 131 (154)
T PRK01617 117 FLKE--NGQWYYIDGTR 131 (154)
T ss_pred eEEe--CCCEEecCCCC
Confidence 4554 68999998754
No 123
>cd05734 Ig7_DSCAM Seventh immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig7_DSCAM: the seventh immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion.
Probab=21.28 E-value=1e+02 Score=15.58 Aligned_cols=14 Identities=14% Similarity=0.159 Sum_probs=10.3
Q ss_pred cCCCCeEEEEEECC
Q psy7177 8 GVSAGHYTAYTLHP 21 (66)
Q Consensus 8 ~~~~GHY~a~~~~~ 21 (66)
..+.|.|.|-+.+.
T Consensus 53 ~~D~G~Y~C~a~N~ 66 (79)
T cd05734 53 EEDSGYYLCKVSND 66 (79)
T ss_pred cccCEEEEEEEEeC
Confidence 35678888888775
No 124
>cd05725 Ig3_Robo Third immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Ig3_Robo: domain similar to the third immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antagoni
Probab=21.11 E-value=90 Score=15.28 Aligned_cols=14 Identities=21% Similarity=0.211 Sum_probs=11.2
Q ss_pred cCCCCeEEEEEECC
Q psy7177 8 GVSAGHYTAYTLHP 21 (66)
Q Consensus 8 ~~~~GHY~a~~~~~ 21 (66)
..++|-|+|.+.+.
T Consensus 44 ~~D~G~Y~C~a~N~ 57 (69)
T cd05725 44 AGDEGSYTCEAENM 57 (69)
T ss_pred hhHCEEEEEEEEcC
Confidence 45789999999875
No 125
>CHL00057 rpl14 ribosomal protein L14
Probab=21.10 E-value=1.2e+02 Score=17.89 Aligned_cols=20 Identities=10% Similarity=0.146 Sum_probs=16.1
Q ss_pred CCCcEEEEcCceEEEcCCCc
Q psy7177 22 NTKEWHYFNDETVLEQRPQD 41 (66)
Q Consensus 22 ~~~~W~~fdD~~V~~~~~~~ 41 (66)
.+|.|.+|||..+--++.+.
T Consensus 72 ~dG~~i~F~~Na~VLin~~~ 91 (122)
T CHL00057 72 DNGMIIRFDDNAAVVIDQEG 91 (122)
T ss_pred CCCcEEEcCCceEEEECCCC
Confidence 36889999999988887653
No 126
>cd05868 Ig4_NrCAM Fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). Ig4_ NrCAM: fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). NrCAM belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six IG-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. NrCAM is primarily expressed in the nervous system.
Probab=21.04 E-value=91 Score=15.84 Aligned_cols=14 Identities=14% Similarity=0.109 Sum_probs=10.7
Q ss_pred CCCCeEEEEEECCC
Q psy7177 9 VSAGHYTAYTLHPN 22 (66)
Q Consensus 9 ~~~GHY~a~~~~~~ 22 (66)
.+.|.|.|.+++..
T Consensus 51 ~D~G~Y~C~A~N~~ 64 (76)
T cd05868 51 RSSAVYQCNASNEY 64 (76)
T ss_pred hhCEEEEEEEEcCC
Confidence 56788888888753
No 127
>PF06865 DUF1255: Protein of unknown function (DUF1255); InterPro: IPR009664 This family consists of several conserved hypothetical bacterial proteins of around 95 residues in length. The function of this family is unknown; PDB: 2OYZ_A 3HQX_A.
Probab=20.92 E-value=1.8e+02 Score=16.50 Aligned_cols=37 Identities=19% Similarity=0.326 Sum_probs=20.2
Q ss_pred EEECCCCCcEEEEcCceEEEcCCCc-----CCCCCeEEEEEE
Q psy7177 17 YTLHPNTKEWHYFNDETVLEQRPQD-----EDFNNAYILFYK 53 (66)
Q Consensus 17 ~~~~~~~~~W~~fdD~~V~~~~~~~-----~~~~~aYiLfY~ 53 (66)
.++.+....|-.|..-..-.++.+. +....+|+..|.
T Consensus 53 ~V~lpg~~ew~~~~aGesF~VpanssF~v~v~~~~~Y~C~y~ 94 (94)
T PF06865_consen 53 EVKLPGEDEWQTYSAGESFEVPANSSFDVKVKEPTAYLCSYL 94 (94)
T ss_dssp EEEETT-SS-EEEETT-EEEE-TTEEEEEEESS-EEEEEEE-
T ss_pred EEEcCCCcccEEeCCCCeEEECCCCeEEEEECcceeeEEEeC
Confidence 4555555678877776666665554 446788888773
No 128
>PF05766 NinG: Bacteriophage Lambda NinG protein; InterPro: IPR008713 The ninR region of phage lambda contains two recombination genes, ninB (also known as orf) and ninG (also known as rap). These genes are involved in the RecF and RecBCD recombination pathways of Escherichia coli that operate on phage lambda [, ]. NinB and NinG participate in Red recombination, the primary pathway operating when wild-type lambda grows lytically in rec+ cells [].
Probab=20.78 E-value=33 Score=21.83 Aligned_cols=26 Identities=27% Similarity=0.428 Sum_probs=17.0
Q ss_pred eeecCCCCeEEEEEECCCCCcEEEEcCceE
Q psy7177 5 PSKGVSAGHYTAYTLHPNTKEWHYFNDETV 34 (66)
Q Consensus 5 H~G~~~~GHY~a~~~~~~~~~W~~fdD~~V 34 (66)
|.|..+.|||.+--..+ -..||-.++
T Consensus 97 ~~~~~dagHy~s~g~~~----~lRF~~~N~ 122 (189)
T PF05766_consen 97 HGGQWDAGHYRSRGAAP----ELRFNEDNI 122 (189)
T ss_pred CCCCcccccccccccCc----ccccChhhH
Confidence 55688999998764433 356665554
No 129
>PTZ00320 ribosomal protein L14; Provisional
Probab=20.74 E-value=1.4e+02 Score=19.12 Aligned_cols=20 Identities=15% Similarity=0.074 Sum_probs=16.4
Q ss_pred CCCcEEEEcCceEEEcCCCc
Q psy7177 22 NTKEWHYFNDETVLEQRPQD 41 (66)
Q Consensus 22 ~~~~W~~fdD~~V~~~~~~~ 41 (66)
.+|.|+.|||.-+-.++.+.
T Consensus 138 ~DGs~IrFDdNAaVLIN~qg 157 (188)
T PTZ00320 138 MSGLQTNFDRNTCILMNDQR 157 (188)
T ss_pred CCCCEEEeCCcEEEEECCCC
Confidence 47889999999988887653
No 130
>KOG1296|consensus
Probab=20.69 E-value=56 Score=20.27 Aligned_cols=29 Identities=17% Similarity=0.234 Sum_probs=18.6
Q ss_pred cCCCCeEEEEEECCCCCcEEEEcCceEEEc
Q psy7177 8 GVSAGHYTAYTLHPNTKEWHYFNDETVLEQ 37 (66)
Q Consensus 8 ~~~~GHY~a~~~~~~~~~W~~fdD~~V~~~ 37 (66)
++.+|--..-+-.. ++.|+.|||..-..|
T Consensus 120 s~esG~~f~eidls-e~ew~dYDdk~~e~V 148 (161)
T KOG1296|consen 120 SAESGTAFSEIDLS-EKEWTDYDDKAGESV 148 (161)
T ss_pred ecccCceeeeeecc-cccceehhhcCCCee
Confidence 35566655555443 669999999764444
No 131
>cd05723 Ig4_Neogenin Fourth immunoglobulin (Ig)-like domain in neogenin and similar proteins. Ig4_Neogenin: fourth immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC, which is deleted in colorectal carcinoma . DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain.
Probab=20.60 E-value=1e+02 Score=15.30 Aligned_cols=14 Identities=14% Similarity=0.259 Sum_probs=10.8
Q ss_pred cCCCCeEEEEEECC
Q psy7177 8 GVSAGHYTAYTLHP 21 (66)
Q Consensus 8 ~~~~GHY~a~~~~~ 21 (66)
..++|-|+|.+.+.
T Consensus 46 ~~D~G~Y~C~A~N~ 59 (71)
T cd05723 46 KSDEGFYQCIAEND 59 (71)
T ss_pred cccCEEEEEEEEcC
Confidence 35678888888875
No 132
>PF07206 Baculo_LEF-10: Baculovirus late expression factor 10 (LEF-10); InterPro: IPR009855 This family consists of several Baculovirus specific late expression factor 10 (LEF-10) sequences. LEF-10 is thought to be a late expressed structural protein although its exact function is unknown [].
Probab=20.41 E-value=1.4e+02 Score=16.16 Aligned_cols=18 Identities=22% Similarity=0.543 Sum_probs=13.0
Q ss_pred CCCeEEEEEEEcCCCCCc
Q psy7177 44 FNNAYILFYKKRDEDSDV 61 (66)
Q Consensus 44 ~~~aYiLfY~r~~~~~~~ 61 (66)
-.+.||+||..-.....+
T Consensus 22 idn~Yii~nVvd~~~~~i 39 (71)
T PF07206_consen 22 IDNKYIILNVVDQESGEI 39 (71)
T ss_pred ecCcEEEEEEEeCCCCce
Confidence 378899999887655443
No 133
>cd05718 Ig1_PVR_like First immunoglobulin (Ig) domain of poliovirus receptor (PVR, also known as CD155) and similar proteins. Ig1_PVR_like: domain similar to the first immunoglobulin (Ig) domain of poliovirus receptor (PVR, also known as CD155). Poliovirus (PV) binds to its cellular receptor (PVR/CD155) to initiate infection. CD155 is a membrane-anchored, single-span glycoprotein; its extracellular region has three Ig-like domains. There are four different isotypes of CD155 (referred to as alpha, beta, gamma, and delta), that result from alternate splicing of the CD155 mRNA, and have identical extracellular domains. CD155-beta and - gamma, are secreted, CD155-alpha and delta are membrane-bound and function as PV receptors. The virus recognition site is contained in the amino-terminal domain, D1. Having the virus attachment site on the receptor distal from the plasma membrane, may be important for successful initiation of infection of cells by the virus. CD155 binds in the poliovirus "c
Probab=20.23 E-value=1.5e+02 Score=15.41 Aligned_cols=19 Identities=11% Similarity=0.045 Sum_probs=11.4
Q ss_pred CCCCeEEEEEECCCCCcEE
Q psy7177 9 VSAGHYTAYTLHPNTKEWH 27 (66)
Q Consensus 9 ~~~GHY~a~~~~~~~~~W~ 27 (66)
.++|-|+|-+.....|.|.
T Consensus 72 ~D~G~Y~C~v~~~~~g~~~ 90 (98)
T cd05718 72 EDEGNYICEFATFPQGNRQ 90 (98)
T ss_pred ccCEEEEEEEEeCCCCcEE
Confidence 5677787777653344443
Done!