Query psy1572
Match_columns 74
No_of_seqs 56 out of 58
Neff 2.7
Searched_HMMs 46136
Date Fri Aug 16 19:42:12 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1572.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1572hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3732|consensus 99.5 3.3E-15 7.1E-20 117.0 -0.5 54 21-74 286-339 (339)
2 PF00035 dsrm: Double-stranded 98.0 6.2E-05 1.4E-09 42.7 6.9 36 30-67 32-67 (67)
3 smart00358 DSRM Double-strande 97.7 0.00017 3.6E-09 40.5 5.2 37 29-67 30-66 (67)
4 cd00048 DSRM Double-stranded R 97.4 0.00057 1.2E-08 38.0 4.9 37 29-67 32-68 (68)
5 PRK00102 rnc ribonuclease III; 95.9 0.049 1.1E-06 37.9 6.7 38 29-68 190-227 (229)
6 TIGR02191 RNaseIII ribonucleas 95.0 0.062 1.3E-06 36.9 4.7 37 29-67 184-220 (220)
7 PHA03103 double-strand RNA-bin 94.1 0.11 2.4E-06 38.1 4.6 39 28-68 138-176 (183)
8 PRK12371 ribonuclease III; Rev 93.7 0.16 3.4E-06 37.2 4.8 41 27-69 191-231 (235)
9 PHA02701 ORF020 dsRNA-binding 93.3 0.15 3.2E-06 37.7 4.2 38 29-68 139-176 (183)
10 PF14709 DND1_DSRM: double str 93.1 0.31 6.6E-06 30.6 4.8 39 30-68 34-80 (80)
11 PF03368 Dicer_dimer: Dicer di 91.8 0.79 1.7E-05 29.0 5.6 46 26-71 28-75 (90)
12 PRK12372 ribonuclease III; Rev 90.1 1.1 2.3E-05 36.7 6.2 40 28-68 182-221 (413)
13 PRK14718 ribonuclease III; Pro 87.5 1.4 3E-05 36.7 5.3 37 30-68 184-221 (467)
14 COG0571 Rnc dsRNA-specific rib 87.5 1.2 2.7E-05 33.2 4.5 28 42-69 204-231 (235)
15 KOG3732|consensus 86.8 1.9 4.1E-05 34.7 5.5 45 23-70 63-107 (339)
16 PF14954 LIX1: Limb expression 86.0 0.99 2.1E-05 35.2 3.4 32 30-64 58-92 (252)
17 KOG3769|consensus 79.9 2.6 5.6E-05 34.0 3.7 32 37-68 261-302 (333)
18 PF04694 Corona_3: Coronavirus 77.7 2.9 6.3E-05 26.5 2.7 25 35-59 29-53 (60)
19 KOG2777|consensus 72.6 4.4 9.5E-05 34.3 3.3 27 43-69 128-154 (542)
20 PF02169 LPP20: LPP20 lipoprot 66.7 5.6 0.00012 23.7 2.1 23 46-68 17-39 (92)
21 PF12909 DUF3832: Protein of u 66.5 6.1 0.00013 26.0 2.4 37 27-68 15-51 (89)
22 KOG4334|consensus 65.7 5.6 0.00012 34.4 2.6 28 40-67 414-441 (650)
23 cd00099 IgV Immunoglobulin var 62.1 9 0.0002 22.5 2.4 26 26-51 77-102 (105)
24 PF01413 C4: C-terminal tandem 60.9 4.7 0.0001 27.7 1.1 18 32-49 48-65 (114)
25 PF03681 UPF0150: Uncharacteri 59.8 7.8 0.00017 21.4 1.7 16 44-59 23-38 (48)
26 cd05733 Ig6_L1-CAM_like Sixth 59.3 9.6 0.00021 22.1 2.1 9 28-36 55-63 (77)
27 smart00111 C4 C-terminal tande 58.6 4.4 9.6E-05 27.8 0.7 21 32-53 48-68 (114)
28 PF12695 Abhydrolase_5: Alpha/ 57.5 13 0.00029 22.2 2.6 16 41-56 1-16 (145)
29 COG1598 Predicted nuclease of 56.8 9 0.00019 23.4 1.8 16 44-59 25-40 (73)
30 cd04967 Ig1_Contactin First Ig 54.4 7.9 0.00017 22.9 1.2 11 26-36 69-79 (91)
31 PRK04930 glutathione-regulated 51.8 26 0.00056 25.0 3.7 35 35-69 143-177 (184)
32 PRK03661 hypothetical protein; 50.9 59 0.0013 22.9 5.4 40 30-70 120-160 (164)
33 cd02647 nuc_hydro_TvIAG nuc_hy 50.8 23 0.0005 26.9 3.5 28 42-72 31-58 (312)
34 PRK03657 hypothetical protein; 46.9 62 0.0014 23.1 5.0 42 30-72 126-167 (170)
35 cd05874 Ig6_NrCAM Sixth immuno 46.8 21 0.00045 21.0 2.2 10 28-37 55-64 (77)
36 cd05739 Ig3_RPTP_IIa_LAR_like 44.1 17 0.00037 21.0 1.6 15 22-36 42-56 (69)
37 PF04098 Rad52_Rad22: Rad52/22 41.9 56 0.0012 22.3 4.0 39 27-65 72-121 (154)
38 PF09888 DUF2115: Uncharacteri 40.9 14 0.00031 26.2 1.0 29 15-45 124-159 (163)
39 cd05875 Ig6_hNeurofascin_like 40.4 31 0.00067 20.2 2.3 9 28-36 55-63 (77)
40 cd05760 Ig2_PTK7 Second immuno 39.2 19 0.00042 20.8 1.2 11 26-36 48-58 (77)
41 KOG2938|consensus 39.2 37 0.0008 27.1 3.2 27 42-73 52-78 (350)
42 TIGR03056 bchO_mg_che_rel puta 38.2 1E+02 0.0022 20.3 4.6 32 26-57 12-46 (278)
43 cd05868 Ig4_NrCAM Fourth immun 37.9 21 0.00045 20.7 1.2 12 25-36 50-61 (76)
44 KOG0286|consensus 36.4 68 0.0015 26.2 4.3 45 4-48 290-334 (343)
45 PRK10443 rihA ribonucleoside h 36.3 49 0.0011 24.9 3.3 27 41-72 32-58 (311)
46 cd05873 Ig_Sema4D_like Immunog 36.2 22 0.00048 22.3 1.2 14 25-38 57-70 (87)
47 smart00408 IGc2 Immunoglobulin 35.1 26 0.00057 17.4 1.2 11 26-36 49-59 (63)
48 PF05165 GGDN: GGDN family; I 35.0 28 0.00061 26.8 1.8 26 36-61 75-100 (246)
49 cd05723 Ig4_Neogenin Fourth im 34.9 25 0.00054 20.0 1.2 11 26-36 47-57 (71)
50 cd05750 Ig_Pro_neuregulin Immu 34.6 27 0.00058 19.3 1.3 11 26-36 53-63 (75)
51 TIGR01738 bioH putative pimelo 33.7 45 0.00097 20.8 2.3 19 40-58 5-23 (245)
52 cd05848 Ig1_Contactin-5 First 33.6 26 0.00057 21.2 1.2 11 26-36 69-79 (94)
53 cd05738 Ig2_RPTP_IIa_LAR_like 33.6 27 0.00058 20.0 1.2 12 25-36 47-58 (74)
54 cd05850 Ig1_Contactin-2 First 33.5 26 0.00057 21.1 1.2 11 26-36 69-79 (94)
55 cd05852 Ig5_Contactin-1 Fifth 33.4 27 0.00058 20.4 1.2 11 26-36 49-59 (73)
56 COG2429 Archaeal GTP cyclohydr 33.1 36 0.00078 26.7 2.2 23 42-65 84-106 (250)
57 PF13927 Ig_3: Immunoglobulin 33.1 33 0.00072 18.1 1.5 12 25-36 63-74 (75)
58 cd05867 Ig4_L1-CAM_like Fourth 32.6 29 0.00062 19.8 1.2 13 26-38 51-63 (76)
59 cd05895 Ig_Pro_neuregulin-1 Im 32.5 29 0.00063 19.6 1.3 12 25-36 53-64 (76)
60 KOG1817|consensus 32.4 48 0.001 28.5 2.9 24 45-68 479-502 (533)
61 cd05722 Ig1_Neogenin First imm 31.9 29 0.00064 20.5 1.2 13 26-38 69-81 (95)
62 cd05849 Ig1_Contactin-1 First 31.6 30 0.00064 21.2 1.2 11 26-36 68-78 (93)
63 cd04970 Ig6_Contactin_like Six 31.3 31 0.00067 20.0 1.2 11 26-36 57-67 (85)
64 cd05740 Ig_CEACAM_D4 Fourth im 31.1 30 0.00066 21.0 1.2 11 26-36 65-75 (91)
65 cd05857 Ig2_FGFR Second immuno 31.0 32 0.00069 19.6 1.2 11 26-36 61-71 (85)
66 cd02649 nuc_hydro_CeIAG nuc_hy 30.7 69 0.0015 24.1 3.3 26 42-72 31-56 (306)
67 PF02464 CinA: Competence-dama 30.4 1.8E+02 0.0039 19.9 6.4 39 30-69 114-152 (154)
68 PRK02240 GTP cyclohydrolase II 29.6 63 0.0014 25.0 3.0 25 41-66 83-107 (254)
69 cd04971 Ig_TrKABC_d5 Fifth dom 29.3 36 0.00078 20.4 1.3 10 27-36 57-66 (81)
70 cd05876 Ig3_L1-CAM Third immun 29.2 35 0.00077 19.3 1.2 10 27-36 47-56 (71)
71 cd05731 Ig3_L1-CAM_like Third 28.8 38 0.00082 18.4 1.3 12 26-37 46-57 (71)
72 cd05724 Ig2_Robo Second immuno 28.2 38 0.00081 19.2 1.2 11 26-36 61-71 (86)
73 smart00409 IG Immunoglobulin. 28.1 43 0.00093 17.4 1.4 11 26-36 61-71 (86)
74 smart00410 IG_like Immunoglobu 28.1 43 0.00093 17.4 1.4 11 26-36 61-71 (86)
75 cd07693 Ig1_Robo First immunog 28.0 38 0.00083 19.2 1.2 12 25-36 73-84 (100)
76 cd05745 Ig3_Peroxidasin Third 27.9 39 0.00083 19.2 1.2 11 26-36 50-60 (74)
77 PF00047 ig: Immunoglobulin do 27.8 35 0.00077 18.2 1.0 9 27-35 56-64 (64)
78 cd05763 Ig_1 Subgroup of the i 27.8 39 0.00084 19.0 1.2 11 26-36 50-60 (75)
79 TIGR02296 HpaC 4-hydroxyphenyl 27.6 94 0.002 21.1 3.3 40 7-50 8-47 (154)
80 KOG4528|consensus 27.5 64 0.0014 24.0 2.6 23 45-67 42-64 (168)
81 PF07897 DUF1675: Protein of u 27.1 45 0.00097 26.1 1.8 18 42-59 254-271 (284)
82 cd00455 nuc_hydro nuc_hydro: N 26.9 66 0.0014 23.8 2.6 27 41-72 28-54 (295)
83 cd07706 IgV_TCR_delta Immunogl 26.8 58 0.0013 20.0 2.0 28 24-51 80-111 (116)
84 PHA02447 hypothetical protein 26.8 36 0.00079 22.7 1.1 45 22-66 35-79 (86)
85 COG2820 Udp Uridine phosphoryl 26.6 37 0.00079 26.5 1.2 25 21-51 45-70 (248)
86 cd08342 HPPD_N_like N-terminal 26.5 72 0.0016 19.9 2.4 23 8-30 104-126 (136)
87 cd04978 Ig4_L1-NrCAM_like Four 26.3 44 0.00094 18.5 1.2 11 26-36 51-61 (76)
88 cd05737 Ig_Myomesin_like_C C-t 25.6 44 0.00095 19.8 1.2 11 26-36 68-78 (92)
89 cd05727 Ig2_Contactin-2-like S 25.6 78 0.0017 20.4 2.5 26 13-38 54-82 (96)
90 PF00388 PI-PLC-X: Phosphatidy 25.5 78 0.0017 20.6 2.5 7 42-48 55-61 (146)
91 cd05748 Ig_Titin_like Immunogl 25.5 46 0.001 18.6 1.3 11 26-36 50-60 (74)
92 cd04974 Ig3_FGFR Third immunog 25.4 45 0.00097 19.8 1.2 11 26-36 65-75 (90)
93 cd05736 Ig2_Follistatin_like S 25.4 46 0.00099 18.7 1.2 11 26-36 49-59 (76)
94 TIGR00702 uncharacterized doma 25.1 94 0.002 24.1 3.2 26 41-66 47-72 (377)
95 TIGR03266 methan_mark_1 putati 24.8 1E+02 0.0022 23.9 3.4 24 42-65 52-75 (376)
96 PRK15486 hpaC 4-hydroxyphenyla 24.6 1E+02 0.0023 21.6 3.2 39 7-49 17-55 (170)
97 KOG3075|consensus 24.5 1E+02 0.0022 24.3 3.3 30 33-65 8-37 (261)
98 PRK06806 fructose-bisphosphate 23.8 60 0.0013 24.5 1.9 17 42-58 202-219 (281)
99 cd05712 Ig_Siglec_N Immunoglob 23.6 98 0.0021 19.4 2.6 26 24-50 88-113 (119)
100 cd04972 Ig_TrkABC_d4 Fourth do 23.6 51 0.0011 19.3 1.2 9 28-36 68-76 (90)
101 cd05752 Ig1_FcgammaR_like Frst 23.4 31 0.00067 20.6 0.3 12 25-36 55-66 (78)
102 cd04969 Ig5_Contactin_like Fif 23.3 53 0.0011 18.4 1.2 11 26-36 49-59 (73)
103 cd05746 Ig4_Peroxidasin Fourth 23.3 53 0.0011 18.4 1.2 10 27-36 47-56 (69)
104 cd05747 Ig5_Titin_like M5, fif 23.2 53 0.0011 19.4 1.2 11 26-36 69-79 (92)
105 cd04973 Ig1_FGFR First immunog 23.1 51 0.0011 19.1 1.2 12 25-36 54-65 (79)
106 cd05856 Ig2_FGFRL1-like Second 23.0 55 0.0012 18.1 1.2 11 26-36 58-68 (82)
107 PF13827 DUF4189: Domain of un 23.0 83 0.0018 19.5 2.2 18 45-62 66-83 (100)
108 PRK01022 hypothetical protein; 22.7 45 0.00098 23.9 1.0 20 15-36 126-145 (167)
109 cd05756 Ig1_IL1R_like First im 22.7 48 0.001 20.4 1.1 15 25-39 69-83 (94)
110 cd05743 Ig_Perlecan_D2_like Im 22.5 56 0.0012 18.7 1.2 10 27-36 53-62 (78)
111 PF07679 I-set: Immunoglobulin 22.3 58 0.0012 18.4 1.2 14 26-39 66-79 (90)
112 cd05854 Ig6_Contactin-2 Sixth 22.2 56 0.0012 19.5 1.2 12 26-37 57-68 (85)
113 cd05730 Ig3_NCAM-1_like Third 22.1 57 0.0012 19.1 1.2 11 26-36 68-78 (95)
114 cd02651 nuc_hydro_IU_UC_XIUA n 21.8 1.1E+02 0.0024 22.6 2.9 26 42-72 30-55 (302)
115 cd05891 Ig_M-protein_C C-termi 21.7 58 0.0013 19.5 1.2 11 26-36 68-78 (92)
116 TIGR03693 ocin_ThiF_like putat 21.2 71 0.0015 27.8 2.1 45 24-70 383-437 (637)
117 PF06675 DUF1177: Protein of u 21.2 1.6E+02 0.0035 23.4 3.9 29 46-74 220-248 (276)
118 cd05853 Ig6_Contactin-4 Sixth 21.1 59 0.0013 20.2 1.2 11 27-37 58-68 (85)
119 cd05900 Ig_Aggrecan Immunoglob 21.1 1.2E+02 0.0025 19.7 2.6 27 10-36 61-94 (112)
120 cd05878 Ig_Aggrecan_like Immun 21.0 1.4E+02 0.003 18.8 2.9 32 7-38 56-94 (110)
121 cd05858 Ig3_FGFR-2 Third immun 20.7 62 0.0013 19.0 1.2 10 27-36 66-75 (90)
122 cd04979 Ig_Semaphorin_C Immuno 20.7 62 0.0013 18.9 1.2 14 25-38 60-73 (89)
123 PF04101 Glyco_tran_28_C: Glyc 20.6 43 0.00094 21.6 0.6 14 42-55 75-88 (167)
124 PRK05428 HPr kinase/phosphoryl 20.1 3.8E+02 0.0082 21.1 5.7 66 4-71 223-288 (308)
125 cd05851 Ig3_Contactin-1 Third 20.1 66 0.0014 19.1 1.2 10 27-36 64-73 (88)
126 cd04968 Ig3_Contactin_like Thi 20.1 67 0.0015 18.6 1.3 11 26-36 63-73 (88)
No 1
>KOG3732|consensus
Probab=99.48 E-value=3.3e-15 Score=116.96 Aligned_cols=54 Identities=48% Similarity=0.697 Sum_probs=51.2
Q ss_pred ceeeeccccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHHhhcCC
Q psy1572 21 SIAFLHSTGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRLMTKSK 74 (74)
Q Consensus 21 ~ieels~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~~k 74 (74)
.|++.|++|++||+|++||.|++||||+++|+..|+.+||++||+||+||.+.+
T Consensus 286 d~~~~t~sg~~~~~~e~s~~~~~~~~~s~~t~~~s~~~aa~~~L~yL~~~~~f~ 339 (339)
T KOG3732|consen 286 DIEELTLSGQNQCLVELSTSPATVCHGSSITPPSAQSEAAHNALEYLKIMAGFK 339 (339)
T ss_pred hhHHhhhhhHHHHHHHhcCCCceeeeccCCCCCcccchhHHHHHHHHHHHhcCC
Confidence 368899999999999999999999999999999999999999999999999764
No 2
>PF00035 dsrm: Double-stranded RNA binding motif; InterPro: IPR001159 The DsRBD domain is found in a variety of RNA-binding proteins with different structures and exhibiting a diversity of functions []. It is involved in localisation of at least five different mRNAs in the early Drosophila embryo and by interferon-induced protein kinase in humans, which is part of the cellular response to dsRNA.; GO: 0003725 double-stranded RNA binding, 0005622 intracellular; PDB: 1EKZ_A 1STU_A 1QU6_A 2L2M_A 3ADJ_A 1WHN_A 3LLH_B 2B7V_A 2L3J_A 1UHZ_A ....
Probab=97.98 E-value=6.2e-05 Score=42.66 Aligned_cols=36 Identities=31% Similarity=0.406 Sum_probs=30.5
Q ss_pred ceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572 30 QFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYL 67 (74)
Q Consensus 30 ~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyL 67 (74)
.|.|-|.+ ....+..|.|.|-.+|+..||..||++|
T Consensus 32 ~f~~~~~i--~~~~~~~g~g~sKk~Ak~~AA~~al~~L 67 (67)
T PF00035_consen 32 RFICTVYI--DGKEYGEGEGSSKKEAKQQAAKKALQKL 67 (67)
T ss_dssp EEEEEEEE--TTEEEEEEEESSHHHHHHHHHHHHHHHH
T ss_pred eEEEEEEE--CCEEEeEeccCCHHHHHHHHHHHHHHhC
Confidence 45554444 7788889999999999999999999998
No 3
>smart00358 DSRM Double-stranded RNA binding motif.
Probab=97.67 E-value=0.00017 Score=40.45 Aligned_cols=37 Identities=30% Similarity=0.403 Sum_probs=32.4
Q ss_pred cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572 29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYL 67 (74)
Q Consensus 29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyL 67 (74)
..|.|-|.+.-. .+.-|.|.|-++|+.+||++||++|
T Consensus 30 ~~f~~~v~i~~~--~~~~g~g~sKk~Ak~~AA~~al~~L 66 (67)
T smart00358 30 PRFTVTVKVGGE--YTGEGEGSSKKEAKQRAAEAALRSL 66 (67)
T ss_pred CcEEEEEEECCE--EEEEeccCCHHHHHHHHHHHHHHhc
Confidence 589999998543 6678999999999999999999987
No 4
>cd00048 DSRM Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
Probab=97.39 E-value=0.00057 Score=37.95 Aligned_cols=37 Identities=30% Similarity=0.394 Sum_probs=32.1
Q ss_pred cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572 29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYL 67 (74)
Q Consensus 29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyL 67 (74)
..|.|-|.+.- ..++.|.|.|.++|+..||..||+.|
T Consensus 32 ~~f~~~v~i~~--~~~~~g~g~sKk~Ak~~AA~~al~~L 68 (68)
T cd00048 32 PRFTVEVTVGG--KITGEGEGSSKKEAKQNAAEAALRKL 68 (68)
T ss_pred CeEEEEEEECC--EEEEEeecCCHHHHHHHHHHHHHHhC
Confidence 57888888844 68889999999999999999999865
No 5
>PRK00102 rnc ribonuclease III; Reviewed
Probab=95.87 E-value=0.049 Score=37.88 Aligned_cols=38 Identities=26% Similarity=0.430 Sum_probs=30.9
Q ss_pred cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572 29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk 68 (74)
..|.|-|.+. --.+..|.|.|-++|+..||.+||+.|+
T Consensus 190 ~~f~~~v~i~--~~~~~~g~g~skk~Ae~~AA~~Al~~l~ 227 (229)
T PRK00102 190 KEFTVEVTVN--GKELGEGTGSSKKEAEQAAAKQALKKLK 227 (229)
T ss_pred ceEEEEEEEC--CEEEEEeeeCCHHHHHHHHHHHHHHHHh
Confidence 4566666654 3568899999999999999999999885
No 6
>TIGR02191 RNaseIII ribonuclease III, bacterial. This family consists of bacterial examples of ribonuclease III. This enzyme cleaves double-stranded rRNA. It is involved in processing ribosomal RNA precursors. It is found even in minimal genones such as Mycoplasma genitalium and Buchnera aphidicola, and in some cases has been shown to be an essential gene. These bacterial proteins contain a double-stranded RNA binding motif (pfam00035) and a ribonuclease III domain (pfam00636). Eukaryotic homologs tend to be much longer proteins with additional domains, localized to the nucleus, and not included in this family.
Probab=94.95 E-value=0.062 Score=36.95 Aligned_cols=37 Identities=32% Similarity=0.432 Sum_probs=31.3
Q ss_pred cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572 29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYL 67 (74)
Q Consensus 29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyL 67 (74)
..|.|-|.+.= -.+..|+|.|-++|+..||..||+-|
T Consensus 184 ~~f~~~v~~~~--~~~~~g~g~skk~A~~~AA~~Al~~l 220 (220)
T TIGR02191 184 KEFTVEVSVNG--EPYGEGKGKSKKEAEQNAAKAALEKL 220 (220)
T ss_pred ceEEEEEEECC--EEEEEeeeCCHHHHHHHHHHHHHHhC
Confidence 47888888863 55889999999999999999999753
No 7
>PHA03103 double-strand RNA-binding protein; Provisional
Probab=94.09 E-value=0.11 Score=38.12 Aligned_cols=39 Identities=28% Similarity=0.306 Sum_probs=30.9
Q ss_pred ccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572 28 TGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 28 ~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk 68 (74)
+-.|.+-|.+.-.+.. .|.|.|-.+|+.+||+.||+.|.
T Consensus 138 ~p~F~v~V~I~g~~~g--~G~G~SKKeAEQ~AAk~AL~~L~ 176 (183)
T PHA03103 138 SPTFTASVIISGIKFK--PAIGSTKKEAKNNAAKLAMDKIL 176 (183)
T ss_pred CceEEEEEEECCEEEE--EeeeCCHHHHHHHHHHHHHHHHH
Confidence 3456677777664444 59999999999999999998874
No 8
>PRK12371 ribonuclease III; Reviewed
Probab=93.67 E-value=0.16 Score=37.20 Aligned_cols=41 Identities=22% Similarity=0.381 Sum_probs=32.7
Q ss_pred cccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572 27 STGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRL 69 (74)
Q Consensus 27 ~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLki 69 (74)
.+..|.|-|.+.=.. +-.|.|.|..+|...||..||+.|+.
T Consensus 191 h~~~F~v~v~v~~~~--~~~g~G~sKK~Ae~~AA~~al~~~~~ 231 (235)
T PRK12371 191 HDPRFTVEVEVKGFA--PETGEGRSKRAAEQVAAEKMLEREGV 231 (235)
T ss_pred CCCeEEEEEEECCEE--EEEeeeCCHHHHHHHHHHHHHHHhhh
Confidence 345677777775433 66899999999999999999988764
No 9
>PHA02701 ORF020 dsRNA-binding PKR inhibitor; Provisional
Probab=93.33 E-value=0.15 Score=37.69 Aligned_cols=38 Identities=18% Similarity=0.259 Sum_probs=30.1
Q ss_pred cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572 29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk 68 (74)
..|-+-|.++ .-.+.-|.|.|-.+|..+||+.||+.|.
T Consensus 139 ~~Ftv~V~V~--g~~~g~G~G~SKKeAEQ~AAk~AL~~L~ 176 (183)
T PHA02701 139 PLFTCTIVVS--GKVVATASGCSKKLARHAACADALTILI 176 (183)
T ss_pred ceEEEEEEEC--CEEEEEEEeCCHHHHHHHHHHHHHHHHH
Confidence 3455556655 4556689999999999999999998875
No 10
>PF14709 DND1_DSRM: double strand RNA binding domain from DEAD END PROTEIN 1
Probab=93.06 E-value=0.31 Score=30.59 Aligned_cols=39 Identities=31% Similarity=0.389 Sum_probs=28.8
Q ss_pred ceeEEEEeccCCc-----eEeec---CCCCHHHHHHHHHHHHHHHHH
Q psy1572 30 QFQALVQLATYPV-----AVCHG---QGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 30 ~yqcLV~LST~P~-----~VCHG---sG~S~~eA~~~AA~NALqyLk 68 (74)
.|.|-|.+.+.+. .-|.| .-.+-++|++.||..||++|.
T Consensus 34 ~F~ykV~i~~~~~~~~~~~~~~~p~~~~~~~k~Ak~~AA~~~L~~Lg 80 (80)
T PF14709_consen 34 LFLYKVVIPGLEYPFEGSIECFGPTKPSSTKKEAKESAAQQALQALG 80 (80)
T ss_pred EEEEEEEEcCCCCCCcceEEEccCCCcCccHHHHHHHHHHHHHHhcC
Confidence 5667777766555 35555 235789999999999999983
No 11
>PF03368 Dicer_dimer: Dicer dimerisation domain; InterPro: IPR005034 This domain is found in members of the Dicer protein family of dsRNA nucleases. This entry represents a dsRNA-binding domain. RNA interference (RNAi) is an ancient gene-silencing process that plays a fundamental role in diverse eukaryotic functions including viral defence, chromatin remodelling, genome rearrangement, developmental timing, brain morphogenesis, and stem cell maintenance. All RNAi pathways require the multidomain ribonuclease Dicer, which initiates RNAi by cleaving double-stranded RNA (dsRNA) substrates into small fragments ~25 nuleotides in length. A typical eukaryotic Dicer consists of a helicase domain (PDOC51192 from PROSITEDOC), a domain of unknown function, and a PAZ domain (PDOC50821 from PROSITEDOC) at the amino (N)-terminus as well as two ribonuclease III domains (PDOC00448 from PROSITEDOC) and a dsRNA-binding domain (dsRBD) (PDOC50137 from PROSITEDOC) at the carboxy (C)-terminus. The domain of unknown function of ~100 amino acids is predicted to adopt the canonical alpha-beta-beta-beta-alpha-fold found in all dsRBDs [, , , ].; GO: 0016891 endoribonuclease activity, producing 5'-phosphomonoesters; PDB: 2KOU_A.
Probab=91.82 E-value=0.79 Score=28.98 Aligned_cols=46 Identities=24% Similarity=0.229 Sum_probs=31.1
Q ss_pred ccccceeEEEEeccC-CceEeecC-CCCHHHHHHHHHHHHHHHHHHhh
Q psy1572 26 HSTGQFQALVQLATY-PVAVCHGQ-GKTQQEAKTVAAHNAIEYLRLMT 71 (74)
Q Consensus 26 s~~G~yqcLV~LST~-P~~VCHGs-G~S~~eA~~~AA~NALqyLkiM~ 71 (74)
...+.|+|-|.|-.+ |+..--|. -.|...|+..||..|.+.|-.|+
T Consensus 28 ~~~~~~~c~v~LP~~~pi~~i~g~~~~sk~~AK~sAAf~Ac~~L~~~g 75 (90)
T PF03368_consen 28 KIGSGFICTVILPINSPIRSIEGPPMRSKKLAKRSAAFEACKKLHEAG 75 (90)
T ss_dssp E--G-EEEEEE--TT-SS--EEEE--SSHHHHHHHHHHHHHHHHHHH-
T ss_pred EcCCcEEEEEECCCCCCCCeEEccccccHHHHHHHHHHHHHHHHHHcC
Confidence 345589999999985 65555554 46889999999999999998876
No 12
>PRK12372 ribonuclease III; Reviewed
Probab=90.15 E-value=1.1 Score=36.67 Aligned_cols=40 Identities=23% Similarity=0.378 Sum_probs=30.3
Q ss_pred ccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572 28 TGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 28 ~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk 68 (74)
+..|.+-|.+.-..+. -.|.|.|.++|+..||+.||+.|+
T Consensus 182 ~~~F~V~V~v~g~~~~-g~G~G~SKKeAEQ~AAr~AL~kL~ 221 (413)
T PRK12372 182 NQQFEVECTVPKLDVK-VSGSGASRRAAEQAAAKKALDEVM 221 (413)
T ss_pred CceEEEEEEECCeEEE-EEEEeCCHHHHHHHHHHHHHHHHh
Confidence 3466666666432222 389999999999999999999887
No 13
>PRK14718 ribonuclease III; Provisional
Probab=87.54 E-value=1.4 Score=36.71 Aligned_cols=37 Identities=22% Similarity=0.410 Sum_probs=28.0
Q ss_pred ceeEEEEeccCCceE-eecCCCCHHHHHHHHHHHHHHHHH
Q psy1572 30 QFQALVQLATYPVAV-CHGQGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 30 ~yqcLV~LST~P~~V-CHGsG~S~~eA~~~AA~NALqyLk 68 (74)
.|.+-|.+ .-..+ ..|.|.|-.+|+..||+.||+.|+
T Consensus 184 ~F~V~V~v--~g~~~~G~G~G~SKKeAEQ~AAk~AL~kL~ 221 (467)
T PRK14718 184 QFEVECTV--PKLDIKVSGSGASRRAAEQAAAKKALDEVT 221 (467)
T ss_pred eEEEEEEE--CCeeeEEEEEcCCHHHHHHHHHHHHHHHhc
Confidence 44444444 33223 789999999999999999999887
No 14
>COG0571 Rnc dsRNA-specific ribonuclease [Transcription]
Probab=87.50 E-value=1.2 Score=33.22 Aligned_cols=28 Identities=29% Similarity=0.501 Sum_probs=25.1
Q ss_pred ceEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572 42 VAVCHGQGKTQQEAKTVAAHNAIEYLRL 69 (74)
Q Consensus 42 ~~VCHGsG~S~~eA~~~AA~NALqyLki 69 (74)
-.+.-|.|.|..+|...||.+||+-|.+
T Consensus 204 ~~~g~G~G~skk~AEq~AA~~al~~l~~ 231 (235)
T COG0571 204 KELGTGKGRSKKEAEQAAAEQALKKLGV 231 (235)
T ss_pred eeEEEecccCHHHHHHHHHHHHHHHhcc
Confidence 6678899999999999999999987754
No 15
>KOG3732|consensus
Probab=86.79 E-value=1.9 Score=34.67 Aligned_cols=45 Identities=22% Similarity=0.220 Sum_probs=39.2
Q ss_pred eeeccccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHHh
Q psy1572 23 AFLHSTGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRLM 70 (74)
Q Consensus 23 eels~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM 70 (74)
+.--..+.|.|-|++. ..+|=|-|.|-..|+-.||.++|+-|+-.
T Consensus 63 eGp~H~~~fv~rvtvg---~~~a~GeG~sKK~AKh~AA~~~L~~lk~l 107 (339)
T KOG3732|consen 63 EGPPHMPNFVFRVTVG---EITATGEGKSKKLAKHRAAEALLKELKKL 107 (339)
T ss_pred cCCccCCCeEEEEEEe---eeEEecCCCchhHHHHHHHHHHHHHHhcC
Confidence 3344678899999998 89999999999999999999999988753
No 16
>PF14954 LIX1: Limb expression 1
Probab=86.01 E-value=0.99 Score=35.17 Aligned_cols=32 Identities=34% Similarity=0.525 Sum_probs=26.1
Q ss_pred ceeEEEEeccCCceEeecC---CCCHHHHHHHHHHHHH
Q psy1572 30 QFQALVQLATYPVAVCHGQ---GKTQQEAKTVAAHNAI 64 (74)
Q Consensus 30 ~yqcLV~LST~P~~VCHGs---G~S~~eA~~~AA~NAL 64 (74)
-|.|.|+| |=.-|+|+ =.|.+||+.+||.-||
T Consensus 58 pyVcyVTL---PGGSCFGnfq~C~tkAEARR~AAKiAL 92 (252)
T PF14954_consen 58 PYVCYVTL---PGGSCFGNFQNCPTKAEARRSAAKIAL 92 (252)
T ss_pred CeEEEEeC---CCCCccCccccCCcHHHHHhhhHHHHH
Confidence 47888854 77788886 4688999999999988
No 17
>KOG3769|consensus
Probab=79.89 E-value=2.6 Score=34.02 Aligned_cols=32 Identities=34% Similarity=0.363 Sum_probs=25.3
Q ss_pred eccCCceEee----------cCCCCHHHHHHHHHHHHHHHHH
Q psy1572 37 LATYPVAVCH----------GQGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 37 LST~P~~VCH----------GsG~S~~eA~~~AA~NALqyLk 68 (74)
.|-.|+-|-| |-|.|...|.+.||++||.-|=
T Consensus 261 ~S~~PvyvVgiYs~kkllGqG~Gesl~~A~e~AA~dAL~k~y 302 (333)
T KOG3769|consen 261 NSAEPVYVVGIYSGKKLLGQGQGESLKLAEEQAARDALIKLY 302 (333)
T ss_pred CccCceEEEEeecCchhhccCcchHHHHHHHHHHHHHHHHHH
Confidence 4556666654 8999999999999999996543
No 18
>PF04694 Corona_3: Coronavirus ORF3 protein; InterPro: IPR006784 This family represents the Coronavirus ORF3 protein, also known as the X2A protein.
Probab=77.72 E-value=2.9 Score=26.50 Aligned_cols=25 Identities=40% Similarity=0.470 Sum_probs=22.5
Q ss_pred EEeccCCceEeecCCCCHHHHHHHH
Q psy1572 35 VQLATYPVAVCHGQGKTQQEAKTVA 59 (74)
Q Consensus 35 V~LST~P~~VCHGsG~S~~eA~~~A 59 (74)
|+..|...-||-|.|-|..||+..|
T Consensus 29 vef~tgkllVcigFGDTl~~A~~ka 53 (60)
T PF04694_consen 29 VEFKTGKLLVCIGFGDTLFEAEQKA 53 (60)
T ss_pred EEEecCcEEEEEecchHHHHHHHhh
Confidence 6778999999999999999998765
No 19
>KOG2777|consensus
Probab=72.60 E-value=4.4 Score=34.31 Aligned_cols=27 Identities=33% Similarity=0.381 Sum_probs=24.0
Q ss_pred eEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572 43 AVCHGQGKTQQEAKTVAAHNAIEYLRL 69 (74)
Q Consensus 43 ~VCHGsG~S~~eA~~~AA~NALqyLki 69 (74)
.+|-|.|.|-.+|+.+||..||+||--
T Consensus 128 ~~~~~~~~sKk~ak~~aa~~al~~l~~ 154 (542)
T KOG2777|consen 128 RWFEGGGRSKKEAKQEAAMAALQVLFK 154 (542)
T ss_pred EEccCCCcchHHHHHHHHHHHHHHHHh
Confidence 467888999999999999999999853
No 20
>PF02169 LPP20: LPP20 lipoprotein; InterPro: IPR002217 A major antigen has been recognised in Helicobacter pylori, a protein with an apparent molecular weight of 20,000 and mass 18,283 kDa []. DNA sequence analysis revealed a 525 bp gene, encoding a 175-amino acid residue product with a typical 21-residue lipoprotein signal peptide and consensus prolipoprotein processing site []. Results of experimental work with Lpp20 are consistent with it being a nonessential lipoprotein []. Prokaryotic membrane lipoproteins are synthesised with precursor signal peptides that are cleaved by specific peptidases (signal peptidase II). The enzyme recognises a conserved sequence, cutting upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached [].; GO: 0009279 cell outer membrane
Probab=66.73 E-value=5.6 Score=23.72 Aligned_cols=23 Identities=30% Similarity=0.397 Sum_probs=18.7
Q ss_pred ecCCCCHHHHHHHHHHHHHHHHH
Q psy1572 46 HGQGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 46 HGsG~S~~eA~~~AA~NALqyLk 68 (74)
-|.|.+.+.|..+|-++..+.|.
T Consensus 17 ~G~~~~~~~A~~~A~~~la~~i~ 39 (92)
T PF02169_consen 17 VGSGSSREQAKQDALANLAEQIS 39 (92)
T ss_pred EEcccChHHHHHHHHHHHHHhee
Confidence 38899999999999888877553
No 21
>PF12909 DUF3832: Protein of unknown function (DUF3832); InterPro: IPR024308 This domain found in bacteria and Acanthamoeba polyphaga mimivirus (APMV) L591 protein has no known function. In Q0AZ30 from SWISSPROT and other uncharacterised proteins this domain is found C-terminal, while the N terminus shows remote homology to the bacterial toxin/antitoxin 'addiction module' (PF12910 from PFAM).; PDB: 3K6Q_C.
Probab=66.49 E-value=6.1 Score=26.00 Aligned_cols=37 Identities=22% Similarity=0.256 Sum_probs=28.9
Q ss_pred cccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572 27 STGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 27 ~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk 68 (74)
-.|.+ -++|--.| -+|.|.|.++|-++.+-+.++|=.
T Consensus 15 ~dg~~--~~~l~~ld---i~~~g~T~eeA~~~lie~l~dYAe 51 (89)
T PF12909_consen 15 DDGSI--TVSLPELD---IYANGPTLEEAIEDLIEDLRDYAE 51 (89)
T ss_dssp ETTEE--EEEECCGT---EEEEESSHHHHHHHHHHHHHHHHH
T ss_pred CCCeE--EEEecCee---EEecCCCHHHHHHHHHHHHHHHHH
Confidence 34555 55666666 589999999999999999999843
No 22
>KOG4334|consensus
Probab=65.74 E-value=5.6 Score=34.44 Aligned_cols=28 Identities=32% Similarity=0.398 Sum_probs=23.5
Q ss_pred CCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572 40 YPVAVCHGQGKTQQEAKTVAAHNAIEYL 67 (74)
Q Consensus 40 ~P~~VCHGsG~S~~eA~~~AA~NALqyL 67 (74)
+-+.+=-|.|+|-..|+..|||.+|+.|
T Consensus 414 d~~~yGsG~g~sKK~Ak~~AAR~tLeiL 441 (650)
T KOG4334|consen 414 DLFPYGSGVGASKKTAKLVAARDTLEIL 441 (650)
T ss_pred cccccccccccchHHHHHHHHHHHHHHh
Confidence 3355666888999999999999999886
No 23
>cd00099 IgV Immunoglobulin variable domain (IgV). IgV: Immunoglobulin variable domain (IgV). Members of the IgV family are components of immunoglobulin (Ig) and T cell receptors. The basic structure of Ig molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. In Ig, each chain is composed of one variable domain (IgV) and one or more constant domains (IgC); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is the disulfide bridge connecting 2 beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=62.12 E-value=9 Score=22.51 Aligned_cols=26 Identities=19% Similarity=0.300 Sum_probs=18.8
Q ss_pred ccccceeEEEEeccCCceEeecCCCC
Q psy1572 26 HSTGQFQALVQLATYPVAVCHGQGKT 51 (74)
Q Consensus 26 s~~G~yqcLV~LST~P~~VCHGsG~S 51 (74)
+=+|.|.|-+.-..-+-...+|.|..
T Consensus 77 ~D~G~Y~C~v~~~~~~~~~~~~~gt~ 102 (105)
T cd00099 77 EDSAVYYCAVSLSGGTYKLYFGQGTR 102 (105)
T ss_pred HHCEEEEEEEecCCCceeeEECCCEE
Confidence 34699999997444334888998864
No 24
>PF01413 C4: C-terminal tandem repeated domain in type 4 procollagen; InterPro: IPR001442 Collagens are major components of the extracellular matrices of all metazoan life and play crucial roles in developmental processes and tissue homeostasis. Collagens are composed of three polypeptide chains (alpha chains) that fold together to form the characteristic triple helical collagenous domain. Some types of triple helical protomers contain genetically identical alpha chains forming homotrimers, whereas others contain two or three different alpha chains forming heterotrimers. The sequences required to form a collagenous domain are Gly-X-Y repeats in which the X and Y positions are frequently proline and hydroxyproline. Glycine is required every third residue as it is the only amino acid small enough to pack into the central core of the triple helix. The triple helix-forming parts are surrounded by non-collagenous (NC) domains of variable sequence, size, and shape. Even if the triple helical parts represent the most striking feature of collagens, tissue specificity as well as defined binding of non-collagens seem to be encoded in the NC domains. The terminal NC domains are excised, modified, or incorporated directly into the final suprastructure, depending on protomer type and function [, ]. Type IV collagen is one of the major constituents of basement membranes, a specialised form of extracellular matrix underlying epithelia that compartmentalises tissues and provides molecular signals for influencing cell behaviour. Each type IV chain contains a long triple-helical collagenous domain flanked by a short 7S domain of 25 residues and a globular non-collagenous NC1 domain of ~230 residues at the N and C terminus, respectively. In protomer assembly, the NC1 domains (monomers) of three chains interact, forming an NC1 trimer, to select and register chains for triple helix formation. In network assembly, the NC1 trimers of two protomers interact, forming a NC1 hexamer structure, to select and connect protomers [, , ]. The collagen IV NC1 domain contains 12 cysteines, and all of them are involved in disulphide bonds. It folds into a tertiary structure with predominantly beta-strands. The collagen IV NC1 domain is composed of two similarly folded subdomains stabilised by 3 intrachain dissulphide bonds involving the following pairs: C1-C6, C2-C5, and C3-C4. Each subdomain represents a compact disulphide-stabilised triangular structure, from which a finger-like hairpin loop projects into an incompletely formed six-stranded beta-sheet of an adjacent subdomain of the same or of an adjacent chain clamping the subdomains tightly together [, , ]. This duplicated domain is present at the C-terminal of type 4 collagen, the major structural component of glomerular basement membranes (GMB) forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome.; GO: 0005201 extracellular matrix structural constituent, 0005581 collagen; PDB: 1T61_F 1M3D_L 1LI1_D.
Probab=60.87 E-value=4.7 Score=27.67 Aligned_cols=18 Identities=33% Similarity=0.748 Sum_probs=15.4
Q ss_pred eEEEEeccCCceEeecCC
Q psy1572 32 QALVQLATYPVAVCHGQG 49 (74)
Q Consensus 32 qcLV~LST~P~~VCHGsG 49 (74)
-||-..+|.|...|||.|
T Consensus 48 SCL~~F~~~Pfi~C~~~g 65 (114)
T PF01413_consen 48 SCLERFRTMPFIECNGRG 65 (114)
T ss_dssp GEESSEESS-EEEEETTS
T ss_pred hhHHHHhcCcceecCCCC
Confidence 489999999999999966
No 25
>PF03681 UPF0150: Uncharacterised protein family (UPF0150); InterPro: IPR005357 This family of small proteins is uncharacterised. In Q9A3L8 from SWISSPROT this domain is found next to a DNA binding helix-turn-helix domain IPR002145 from INTERPRO, which suggests that this is some kind of ligand binding domain.; PDB: 2DSY_C 2YZT_A 3KWR_A.
Probab=59.78 E-value=7.8 Score=21.36 Aligned_cols=16 Identities=44% Similarity=0.488 Sum_probs=13.3
Q ss_pred EeecCCCCHHHHHHHH
Q psy1572 44 VCHGQGKTQQEAKTVA 59 (74)
Q Consensus 44 VCHGsG~S~~eA~~~A 59 (74)
=|+..|.|.+||.++|
T Consensus 23 g~~t~G~t~eea~~~~ 38 (48)
T PF03681_consen 23 GCFTQGDTLEEALENA 38 (48)
T ss_dssp TCEEEESSHHHHHHHH
T ss_pred ChhhcCCCHHHHHHHH
Confidence 4778899999998775
No 26
>cd05733 Ig6_L1-CAM_like Sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig6_L1-CAM_like: domain similar to the sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains NrCAM [Ng(neuronglia)CAM-related cell adhesion molecule], which is primarily expressed in the nervous system, and human neurofascin.
Probab=59.30 E-value=9.6 Score=22.12 Aligned_cols=9 Identities=22% Similarity=0.623 Sum_probs=7.9
Q ss_pred ccceeEEEE
Q psy1572 28 TGQFQALVQ 36 (74)
Q Consensus 28 ~G~yqcLV~ 36 (74)
.|.|+|.++
T Consensus 55 ~G~Y~C~A~ 63 (77)
T cd05733 55 EGEYQCYAS 63 (77)
T ss_pred CEEEEEEEE
Confidence 799999986
No 27
>smart00111 C4 C-terminal tandem repeated domain in type 4 procollagens. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome.
Probab=58.58 E-value=4.4 Score=27.83 Aligned_cols=21 Identities=29% Similarity=0.615 Sum_probs=17.7
Q ss_pred eEEEEeccCCceEeecCCCCHH
Q psy1572 32 QALVQLATYPVAVCHGQGKTQQ 53 (74)
Q Consensus 32 qcLV~LST~P~~VCHGsG~S~~ 53 (74)
-||-..++.|...|||.| ++.
T Consensus 48 SCL~~F~~~Pfi~C~~~~-~C~ 68 (114)
T smart00111 48 SCLERFRTMPFIECNGRG-VCN 68 (114)
T ss_pred cchhhhccCCeEEECCCC-eec
Confidence 589999999999999965 443
No 28
>PF12695 Abhydrolase_5: Alpha/beta hydrolase family; PDB: 3D0K_B 2I3D_B 3DOH_B 3DOI_B 3PFB_A 3S2Z_B 3PFC_A 3QM1_A 3PF8_B 3PF9_A ....
Probab=57.51 E-value=13 Score=22.24 Aligned_cols=16 Identities=25% Similarity=0.499 Sum_probs=12.8
Q ss_pred CceEeecCCCCHHHHH
Q psy1572 41 PVAVCHGQGKTQQEAK 56 (74)
Q Consensus 41 P~~VCHGsG~S~~eA~ 56 (74)
|+.+|||.|.+..+-.
T Consensus 1 ~vv~~HG~~~~~~~~~ 16 (145)
T PF12695_consen 1 VVVLLHGWGGSRRDYQ 16 (145)
T ss_dssp EEEEECTTTTTTHHHH
T ss_pred CEEEECCCCCCHHHHH
Confidence 6789999999876543
No 29
>COG1598 Predicted nuclease of the RNAse H fold, HicB family [General function prediction only]
Probab=56.77 E-value=9 Score=23.40 Aligned_cols=16 Identities=50% Similarity=0.663 Sum_probs=13.8
Q ss_pred EeecCCCCHHHHHHHH
Q psy1572 44 VCHGQGKTQQEAKTVA 59 (74)
Q Consensus 44 VCHGsG~S~~eA~~~A 59 (74)
=||-.|.|.+||..++
T Consensus 25 gc~s~G~T~eea~~n~ 40 (73)
T COG1598 25 GCHSQGETLEEALQNA 40 (73)
T ss_pred CccccCCCHHHHHHHH
Confidence 4888999999998875
No 30
>cd04967 Ig1_Contactin First Ig domain of contactin. Ig1_Contactin: First Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 weeks postnata
Probab=54.44 E-value=7.9 Score=22.94 Aligned_cols=11 Identities=27% Similarity=0.679 Sum_probs=9.6
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-+|.|+|.++
T Consensus 69 ~d~G~Y~C~a~ 79 (91)
T cd04967 69 KDAGRYQCLAS 79 (91)
T ss_pred CCCEEEEEEEE
Confidence 46899999997
No 31
>PRK04930 glutathione-regulated potassium-efflux system ancillary protein KefG; Provisional
Probab=51.80 E-value=26 Score=25.04 Aligned_cols=35 Identities=9% Similarity=-0.011 Sum_probs=28.9
Q ss_pred EEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572 35 VQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRL 69 (74)
Q Consensus 35 V~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLki 69 (74)
.-+.-.|+.+|||.....++.....|....++|..
T Consensus 143 ~Gm~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 177 (184)
T PRK04930 143 CRMHWLSPIIIYWARRQSPEELASHARAYGDWLAN 177 (184)
T ss_pred cCCeEcCcEEEecCCCCCHHHHHHHHHHHHHHHhh
Confidence 36777899999999888888888888888888764
No 32
>PRK03661 hypothetical protein; Validated
Probab=50.89 E-value=59 Score=22.87 Aligned_cols=40 Identities=10% Similarity=-0.011 Sum_probs=28.8
Q ss_pred ceeEEEEecc-CCceEeecCCCCHHHHHHHHHHHHHHHHHHh
Q psy1572 30 QFQALVQLAT-YPVAVCHGQGKTQQEAKTVAAHNAIEYLRLM 70 (74)
Q Consensus 30 ~yqcLV~LST-~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM 70 (74)
-|.++..=.. .-...+|..| ++++-+..|+..||+.|..+
T Consensus 120 v~i~i~~~~~~~~~~~~~~~g-~R~~ir~~~~~~AL~~L~~~ 160 (164)
T PRK03661 120 VWFGFASASGEGITRRECFSG-DRDAVRRQATAYALQTLWQQ 160 (164)
T ss_pred EEEEEEeCCCcEEEEEEecCC-CHHHHHHHHHHHHHHHHHHH
Confidence 4666654222 1244688888 89999999999999988754
No 33
>cd02647 nuc_hydro_TvIAG nuc_hydro_ TvIAG: Nucleoside hydrolases similar to the Inosine-adenosine-guanosine-preferring nucleoside hydrolase from Trypanosoma vivax. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. Nucleoside hydrolases vary in their substrate specificity. This group contains eukaryotic and bacterial proteins similar to the purine specific inosine-adenosine-guanosine-preferring nucleoside hydrolase (IAG-NH) from T. vivax. T. vivax IAG-NH is of the order of a thousand to ten thousand fold more specific towards the naturally occurring purine nucleosides, than towards the pyrimidine nucleosides.
Probab=50.75 E-value=23 Score=26.92 Aligned_cols=28 Identities=7% Similarity=0.006 Sum_probs=21.7
Q ss_pred ceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572 42 VAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK 72 (74)
Q Consensus 42 ~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~ 72 (74)
++++||.|... .+++++||+..|..|.+
T Consensus 31 It~v~~~GN~~---~~~~~~na~~ll~~~g~ 58 (312)
T cd02647 31 IGVSGIDADCY---VEPAVSVTRKLIDRLGQ 58 (312)
T ss_pred EEEecCcCCcc---HHHHHHHHHHHHHHhCC
Confidence 56675667663 47899999999999875
No 34
>PRK03657 hypothetical protein; Validated
Probab=46.87 E-value=62 Score=23.14 Aligned_cols=42 Identities=17% Similarity=-0.026 Sum_probs=28.8
Q ss_pred ceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572 30 QFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK 72 (74)
Q Consensus 30 ~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~ 72 (74)
-|.++..=...-...+|-.| .+++-+..|+..||+.|...-+
T Consensus 126 V~iai~~~~~~~~~~~~~~g-~R~~ir~~a~~~al~~L~~~l~ 167 (170)
T PRK03657 126 VWFAWNIKGQTYTARMHFAG-DCETVLAKAVRFALAQLLQLLL 167 (170)
T ss_pred EEEEEEcCCcEEEEEEecCC-CHHHHHHHHHHHHHHHHHHHHH
Confidence 35555432111244678888 7999999999999998865443
No 35
>cd05874 Ig6_NrCAM Sixth immunoglobulin (Ig)-like domain of NrCAM (Ng (neuronglia) CAM-related cell adhesion molecule). Ig6_NrCAM: sixth immunoglobulin (Ig)-like domain of NrCAM (Ng (neuronglia) CAM-related cell adhesion molecule). NrCAM belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region, and an intracellular domain. NrCAM is primarily expressed in the nervous system.
Probab=46.84 E-value=21 Score=20.99 Aligned_cols=10 Identities=20% Similarity=0.355 Sum_probs=8.2
Q ss_pred ccceeEEEEe
Q psy1572 28 TGQFQALVQL 37 (74)
Q Consensus 28 ~G~yqcLV~L 37 (74)
.|.|+|.++=
T Consensus 55 ~G~Y~C~A~N 64 (77)
T cd05874 55 EGVYQCTARN 64 (77)
T ss_pred CEEEEEEEEc
Confidence 5999999863
No 36
>cd05739 Ig3_RPTP_IIa_LAR_like Third immunoglobulin (Ig)-like domain of the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. Ig3_RPTP_IIa_LAR_like: domain similar to the third immunoglobulin (Ig)-like domain found in the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. LAR belongs to the RPTP type IIa subfamily. Members of this subfamily are cell adhesion molecule-like proteins involved in central nervous system (CNS) development. They have large extracellular portions, comprised of multiple IG-like domains and two to nine fibronectin type III (FNIII) domains, and a cytoplasmic portion having two tandem phosphatase domains. Included in this group is Drosophila LAR (DLAR).
Probab=44.11 E-value=17 Score=20.96 Aligned_cols=15 Identities=0% Similarity=0.211 Sum_probs=11.7
Q ss_pred eeeeccccceeEEEE
Q psy1572 22 IAFLHSTGQFQALVQ 36 (74)
Q Consensus 22 ieels~~G~yqcLV~ 36 (74)
|..++-+|.|.|.++
T Consensus 42 i~~~~d~G~Y~C~A~ 56 (69)
T cd05739 42 LTNIYESANYTCVAI 56 (69)
T ss_pred EeccccCeeEEEEEE
Confidence 455667899999976
No 37
>PF04098 Rad52_Rad22: Rad52/22 family double-strand break repair protein; InterPro: IPR007232 The DNA single-strand annealing proteins (SSAPs), such as RecT, Red-beta, ERF and Rad52, function in RecA-dependent and RecA-independent DNA recombination pathways. This family includes proteins related to Rad52. These proteins contain two helix-hairpin-helix motifs [].; GO: 0006281 DNA repair, 0006310 DNA recombination; PDB: 1KN0_I 1H2I_U.
Probab=41.94 E-value=56 Score=22.28 Aligned_cols=39 Identities=18% Similarity=0.228 Sum_probs=24.0
Q ss_pred cccceeEEEEeccCCce----EeecCCC-------CHHHHHHHHHHHHHH
Q psy1572 27 STGQFQALVQLATYPVA----VCHGQGK-------TQQEAKTVAAHNAIE 65 (74)
Q Consensus 27 ~~G~yqcLV~LST~P~~----VCHGsG~-------S~~eA~~~AA~NALq 65 (74)
.+-.|.|.|++...... |.+|... +.++|...|.-+||+
T Consensus 72 ~~~~~~~~vritLkdg~~~e~~G~g~~~~~~~k~~a~~~akK~AvtdAlK 121 (154)
T PF04098_consen 72 YSVGYSAIVRITLKDGTYREDIGYGWSENMDGKGMAIEKAKKGAVTDALK 121 (154)
T ss_dssp EEEEEEEEEEEEETTS-EEEEEEEEEEESES-HHHHHHHHHHHHHHHHHH
T ss_pred EEEEEEEEEEEEEeCCceeeecceEEEecCCccchhhhhhHHHHHHHHHH
Confidence 45568899999887776 3334333 345666666666664
No 38
>PF09888 DUF2115: Uncharacterized protein conserved in archaea (DUF2115); InterPro: IPR019215 This entry represents various hypothetical archaeal proteins, has no known function.
Probab=40.93 E-value=14 Score=26.20 Aligned_cols=29 Identities=14% Similarity=0.165 Sum_probs=22.1
Q ss_pred eCCCCCceeeeccccceeEEE-------EeccCCceEe
Q psy1572 15 STNGSNSIAFLHSTGQFQALV-------QLATYPVAVC 45 (74)
Q Consensus 15 ~~~~~n~ieels~~G~yqcLV-------~LST~P~~VC 45 (74)
..|||..|++ -+|.|.|-| ..|.++-.||
T Consensus 124 ~FPGG~~V~~--~~g~YYCPVKdkq~d~~~aLC~FC~a 159 (163)
T PF09888_consen 124 PFPGGFKVEE--KNGNYYCPVKDKQKDVPFALCNFCPA 159 (163)
T ss_pred CCCCCeEEEE--ECCEEeCccchhhcCCCCccCCCcCc
Confidence 4799999999 699999987 3455555554
No 39
>cd05875 Ig6_hNeurofascin_like Sixth immunoglobulin (Ig)-like domain of human neurofascin (NF). Ig6_hNeurofascin_like: the sixth immunoglobulin (Ig)-like domain of human neurofascin (NF). NF belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region, and a cytoplasmic domain. NF has many alternatively spliced isoforms having different temporal expression patterns during development. NF participates in axon subcellular targeting and synapse formation, however little is known of the functions of the different isoforms.
Probab=40.38 E-value=31 Score=20.21 Aligned_cols=9 Identities=22% Similarity=0.744 Sum_probs=7.8
Q ss_pred ccceeEEEE
Q psy1572 28 TGQFQALVQ 36 (74)
Q Consensus 28 ~G~yqcLV~ 36 (74)
.|.|+|.++
T Consensus 55 ~G~Y~C~A~ 63 (77)
T cd05875 55 EGEYQCFAR 63 (77)
T ss_pred CEEEEEEEE
Confidence 699999975
No 40
>cd05760 Ig2_PTK7 Second immunoglobulin (Ig)-like domain of protein tyrosine kinase (PTK) 7, also known as CCK4. Ig2_PTK7: domain similar to the second immunoglobulin (Ig)-like domain in protein tyrosine kinase (PTK) 7, also known as CCK4. PTK7 is a subfamily of the receptor protein tyrosine kinase family, and is referred to as an RPTK-like molecule. RPTKs transduce extracellular signals across the cell membrane, and play important roles in regulating cell proliferation, migration, and differentiation. PTK7 is organized as an extracellular portion having seven Ig-like domains, a single transmembrane region, and a cytoplasmic tyrosine kinase-like domain. PTK7 is considered a pseudokinase as it has several unusual residues in some of the highly conserved tyrosine kinase (TK) motifs; it is predicted to lack TK activity. PTK7 may function as a cell-adhesion molecule. PTK7 mRNA is expressed at high levels in placenta, melanocytes, liver, lung, pancreas, and kidney. PTK7 is overexpressed in s
Probab=39.18 E-value=19 Score=20.81 Aligned_cols=11 Identities=9% Similarity=0.338 Sum_probs=9.1
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 48 ~D~G~Y~C~a~ 58 (77)
T cd05760 48 DDSGLYYCCAH 58 (77)
T ss_pred ccCEEEEEEEE
Confidence 45799999986
No 41
>KOG2938|consensus
Probab=39.16 E-value=37 Score=27.12 Aligned_cols=27 Identities=26% Similarity=0.452 Sum_probs=21.1
Q ss_pred ceEeecCCCCHHHHHHHHHHHHHHHHHHhhcC
Q psy1572 42 VAVCHGQGKTQQEAKTVAAHNAIEYLRLMTKS 73 (74)
Q Consensus 42 ~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~~ 73 (74)
++=|||. .+.+ ++++||++.|+.+.+.
T Consensus 52 lTtv~Gn-~~~~----~t~~NA~~~L~l~~r~ 78 (350)
T KOG2938|consen 52 LTTVHGN-VTVE----DTDRNALDLLSLLGRL 78 (350)
T ss_pred eeEeeCC-ccHh----hhhhhHHHHHHhcCCc
Confidence 4558998 5554 7899999999998753
No 42
>TIGR03056 bchO_mg_che_rel putative magnesium chelatase accessory protein. Members of this family belong to the alpha/beta fold family hydrolases (PFAM model pfam00561). Members are found in bacterial genomes if and only if they encoded for anoxygenic photosynthetic systems similar to that of Rhodobacter capsulatus and other alpha-Proteobacteria. Members often are encoded in the same operon as subunits of the protoporphyrin IX magnesium chelatase, and were once designated BchO. No literature supports a role as an actual subunit of magnesium chelatase, but an accessory role is possible, as suggested by placement by its probable hydrolase activity.
Probab=38.21 E-value=1e+02 Score=20.28 Aligned_cols=32 Identities=9% Similarity=0.114 Sum_probs=20.0
Q ss_pred ccccceeEEEEecc---CCceEeecCCCCHHHHHH
Q psy1572 26 HSTGQFQALVQLAT---YPVAVCHGQGKTQQEAKT 57 (74)
Q Consensus 26 s~~G~yqcLV~LST---~P~~VCHGsG~S~~eA~~ 57 (74)
+++|...+..+... .|+.+|||.|.+...-+.
T Consensus 12 ~~~~~~~~~~~~g~~~~~~vv~~hG~~~~~~~~~~ 46 (278)
T TIGR03056 12 TVGPFHWHVQDMGPTAGPLLLLLHGTGASTHSWRD 46 (278)
T ss_pred eECCEEEEEEecCCCCCCeEEEEcCCCCCHHHHHH
Confidence 34444444444432 478899999988776544
No 43
>cd05868 Ig4_NrCAM Fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). Ig4_ NrCAM: fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). NrCAM belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six IG-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. NrCAM is primarily expressed in the nervous system.
Probab=37.88 E-value=21 Score=20.70 Aligned_cols=12 Identities=17% Similarity=0.171 Sum_probs=9.2
Q ss_pred eccccceeEEEE
Q psy1572 25 LHSTGQFQALVQ 36 (74)
Q Consensus 25 ls~~G~yqcLV~ 36 (74)
.+-.|.|+|.++
T Consensus 50 ~~D~G~Y~C~A~ 61 (76)
T cd05868 50 ERSSAVYQCNAS 61 (76)
T ss_pred HhhCEEEEEEEE
Confidence 345899999875
No 44
>KOG0286|consensus
Probab=36.40 E-value=68 Score=26.21 Aligned_cols=45 Identities=16% Similarity=0.149 Sum_probs=40.1
Q ss_pred ccceeeeEEeeeCCCCCceeeeccccceeEEEEeccCCceEeecC
Q psy1572 4 SGLLDTRVCLISTNGSNSIAFLHSTGQFQALVQLATYPVAVCHGQ 48 (74)
Q Consensus 4 s~l~~~~v~~~~~~~~n~ieels~~G~yqcLV~LST~P~~VCHGs 48 (74)
.+--|+.++..|+-|+-.+--|+-++--.+-+.+|.+-.+||-|+
T Consensus 290 agy~d~~c~vWDtlk~e~vg~L~GHeNRvScl~~s~DG~av~TgS 334 (343)
T KOG0286|consen 290 AGYDDFTCNVWDTLKGERVGVLAGHENRVSCLGVSPDGMAVATGS 334 (343)
T ss_pred eeecCCceeEeeccccceEEEeeccCCeeEEEEECCCCcEEEecc
Confidence 345689999999999999999998888888899999999999886
No 45
>PRK10443 rihA ribonucleoside hydrolase 1; Provisional
Probab=36.25 E-value=49 Score=24.88 Aligned_cols=27 Identities=22% Similarity=0.379 Sum_probs=20.1
Q ss_pred CceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572 41 PVAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK 72 (74)
Q Consensus 41 P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~ 72 (74)
=+++|||.-. .+++++||++.|..++.
T Consensus 32 gIt~v~Gn~~-----~~~~~~na~~~l~~~g~ 58 (311)
T PRK10443 32 AVTTSAGNQT-----PEKTLRNALRMLTLLNR 58 (311)
T ss_pred EEEEeCCCCC-----HHHHHHHHHHHHHHhCC
Confidence 3667887543 45789999999998864
No 46
>cd05873 Ig_Sema4D_like Immunoglobulin (Ig)-like domain of the class IV semaphorin Sema4D. Ig_Sema4D_like; Immunoglobulin (Ig)-like domain of Sema4D. Sema4D is a Class IV semaphorin. Semaphorins are classified based on structural features additional to the Sema domain. Sema4D has extracellular Sema and Ig domains, a transmembrane domain, and a short cytoplasmic domain. Sema4D plays a part in the development of GABAergic synapses. Sema4D in addition is an immune semaphorin. It is abundant on resting T cells; its expression is weak on resting B cells and antigen presenting cells (APCs), but is upregulated by various stimuli. The receptor used by Sema4D in the immune system is CD72. Sem4D enhances the activation of B cells and DCs through binding CD72, perhaps by reducing CD72s inhibitory signals. The receptor used by Sema4D in the non-lymphatic tissues is plexin-B1. Sem4D is anchored to the cell surface but its extracellular domain can be released from the cell surface by a metalloproteas
Probab=36.22 E-value=22 Score=22.26 Aligned_cols=14 Identities=21% Similarity=0.292 Sum_probs=11.1
Q ss_pred eccccceeEEEEec
Q psy1572 25 LHSTGQFQALVQLA 38 (74)
Q Consensus 25 ls~~G~yqcLV~LS 38 (74)
.+-.|.|+|.++-.
T Consensus 57 ~~DsG~Y~C~a~e~ 70 (87)
T cd05873 57 EADAGRYQCLSVEK 70 (87)
T ss_pred HHHCEEEEEEEEec
Confidence 34579999999866
No 47
>smart00408 IGc2 Immunoglobulin C-2 Type.
Probab=35.06 E-value=26 Score=17.40 Aligned_cols=11 Identities=9% Similarity=0.604 Sum_probs=8.6
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.+.
T Consensus 49 ~d~G~Y~C~~~ 59 (63)
T smart00408 49 EDSGEYTCVAE 59 (63)
T ss_pred ccCEEEEEEEe
Confidence 45799999874
No 48
>PF05165 GGDN: GGDN family; InterPro: IPR007839 GTP cyclohydrolase III catalyses the formation of 2-amino-5-formylamino-6- ribofuranosylamino-4(3H)-pyrimidinone ribonucleotide monophosphate and inorganic phosphate from GTP. The enzyme also has an independent pyrophosphate phosphohydrolase activity. The proteins are 200-270 amino acids in length.; GO: 0003933 GTP cyclohydrolase activity, 0009058 biosynthetic process; PDB: 2QV6_B.
Probab=35.01 E-value=28 Score=26.78 Aligned_cols=26 Identities=31% Similarity=0.420 Sum_probs=20.4
Q ss_pred EeccCCceEeecCCCCHHHHHHHHHH
Q psy1572 36 QLATYPVAVCHGQGKTQQEAKTVAAH 61 (74)
Q Consensus 36 ~LST~P~~VCHGsG~S~~eA~~~AA~ 61 (74)
..+.-|+.+|=|.|.|+-||+.+|-.
T Consensus 75 ~~~Pv~vs~~ig~g~tP~eA~~~a~~ 100 (246)
T PF05165_consen 75 NRYPVTVSMCIGYGETPYEAQENATE 100 (246)
T ss_dssp HHSSS-EEEEEEEESSHHHHHHHHHH
T ss_pred hcCCceEEEEEecCCCHHHHHHHHHH
Confidence 34566788999999999999988654
No 49
>cd05723 Ig4_Neogenin Fourth immunoglobulin (Ig)-like domain in neogenin and similar proteins. Ig4_Neogenin: fourth immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC, which is deleted in colorectal carcinoma . DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain.
Probab=34.94 E-value=25 Score=19.95 Aligned_cols=11 Identities=18% Similarity=0.519 Sum_probs=8.8
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
+-.|.|+|.++
T Consensus 47 ~D~G~Y~C~A~ 57 (71)
T cd05723 47 SDEGFYQCIAE 57 (71)
T ss_pred ccCEEEEEEEE
Confidence 45699999976
No 50
>cd05750 Ig_Pro_neuregulin Immunoglobulin (Ig)-like domain in neuregulins (NRGs). Ig_Pro_neuregulin: immunoglobulin (Ig)-like domain in neuregulins (NRGs). NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. There are four members of the neuregulin gene family (NRG1, -2, -3, and -4). The NRG-1 protein, binds to and activates the tyrosine kinases receptors ErbB3 and ErbB4, initiating signaling cascades. The other NRGs proteins bind one or the other or both of these ErbBs. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitters receptors; in the peripheral nervous system NRG-1 regulates processes such as target cell differentiation, and Schwann cell surv
Probab=34.62 E-value=27 Score=19.27 Aligned_cols=11 Identities=18% Similarity=0.667 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-+|.|.|.++
T Consensus 53 ~D~G~Y~C~a~ 63 (75)
T cd05750 53 ADSGEYTCVVE 63 (75)
T ss_pred ccCeEEEEEEE
Confidence 56799999876
No 51
>TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH. This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown, but is proposed in ref 2. This enzyme belongs to the alpha/beta hydrolase fold family (pfam model pfam00561). Members of this family are restricted to the Proteobacteria.
Probab=33.68 E-value=45 Score=20.78 Aligned_cols=19 Identities=16% Similarity=0.305 Sum_probs=14.3
Q ss_pred CCceEeecCCCCHHHHHHH
Q psy1572 40 YPVAVCHGQGKTQQEAKTV 58 (74)
Q Consensus 40 ~P~~VCHGsG~S~~eA~~~ 58 (74)
.|+.++||.|.+...-+..
T Consensus 5 ~~iv~~HG~~~~~~~~~~~ 23 (245)
T TIGR01738 5 VHLVLIHGWGMNAEVFRCL 23 (245)
T ss_pred ceEEEEcCCCCchhhHHHH
Confidence 4789999999987754443
No 52
>cd05848 Ig1_Contactin-5 First Ig domain of contactin-5. Ig1_Contactin-5: First Ig domain of the neural cell adhesion molecule contactin-5. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains, anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. In rats, a lack of contactin-5 (NB-2) results in an impairment of the neuronal activity in the auditory system. Contactin-5 is expressed specifically in the postnatal nervous system, peaking at about 3 weeks postnatal. Contactin-5 is highly expressed in the adult human brain in the occipital lobe and in the amygdala; lower levels of expression have been detected in the corpus callosum, caudate nucleus, and spinal cord.
Probab=33.59 E-value=26 Score=21.21 Aligned_cols=11 Identities=27% Similarity=0.700 Sum_probs=9.3
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 69 ~D~G~Y~C~A~ 79 (94)
T cd05848 69 KDSGRYQCLAT 79 (94)
T ss_pred CcCEEEEEEEE
Confidence 46799999986
No 53
>cd05738 Ig2_RPTP_IIa_LAR_like Second immunoglobulin (Ig)-like domain of the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. Ig2_RPTP_IIa_LAR_like: domain similar to the second immunoglobulin (Ig)-like domain found in the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. LAR belongs to the RPTP type IIa subfamily. Members of this subfamily are cell adhesion molecule-like proteins involved in central nervous system (CNS) development. They have large extracellular portions, comprised of multiple Ig-like domains and two to nine fibronectin type III (FNIII) domains, and a cytoplasmic portion having two tandem phosphatase domains.
Probab=33.56 E-value=27 Score=19.97 Aligned_cols=12 Identities=8% Similarity=0.412 Sum_probs=9.8
Q ss_pred eccccceeEEEE
Q psy1572 25 LHSTGQFQALVQ 36 (74)
Q Consensus 25 ls~~G~yqcLV~ 36 (74)
.+-.|.|+|.++
T Consensus 47 ~~D~G~Y~C~a~ 58 (74)
T cd05738 47 ESDQGKYECVAT 58 (74)
T ss_pred hhhCEEEEEEEE
Confidence 457899999886
No 54
>cd05850 Ig1_Contactin-2 First Ig domain of contactin-2. Ig1_Contactin-2: First Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between IG domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=33.51 E-value=26 Score=21.14 Aligned_cols=11 Identities=27% Similarity=0.576 Sum_probs=9.6
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 69 ~D~G~Y~C~A~ 79 (94)
T cd05850 69 RDAGSYQCLAI 79 (94)
T ss_pred CcCEEEEEEEE
Confidence 57899999987
No 55
>cd05852 Ig5_Contactin-1 Fifth Ig domain of contactin-1. Ig5_Contactin-1: fifth Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=33.45 E-value=27 Score=20.43 Aligned_cols=11 Identities=9% Similarity=0.446 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 49 ~D~G~Y~C~A~ 59 (73)
T cd05852 49 LDEGSYTCFAE 59 (73)
T ss_pred hHCEEEEEEEE
Confidence 45799999986
No 56
>COG2429 Archaeal GTP cyclohydrolase III [Nucleotide transport and metabolism]
Probab=33.13 E-value=36 Score=26.67 Aligned_cols=23 Identities=35% Similarity=0.628 Sum_probs=17.9
Q ss_pred ceEeecCCCCHHHHHHHHHHHHHH
Q psy1572 42 VAVCHGQGKTQQEAKTVAAHNAIE 65 (74)
Q Consensus 42 ~~VCHGsG~S~~eA~~~AA~NALq 65 (74)
|.+|=|+|.|+-||+.+|- +||+
T Consensus 84 Vs~~ig~g~TP~dA~~~A~-~aL~ 106 (250)
T COG2429 84 VSMCIGTGRTPLDAQENAG-IALQ 106 (250)
T ss_pred EEEEeecCCCcHHHHHHHH-HhHh
Confidence 3469999999999998764 4554
No 57
>PF13927 Ig_3: Immunoglobulin domain; PDB: 2D3V_A 1G0X_A 1VDG_A 1P7Q_D 3D2U_H 1UFU_A 1UGN_A 3VH8_H 3OQ3_B 4DKD_C ....
Probab=33.09 E-value=33 Score=18.07 Aligned_cols=12 Identities=8% Similarity=0.387 Sum_probs=9.3
Q ss_pred eccccceeEEEE
Q psy1572 25 LHSTGQFQALVQ 36 (74)
Q Consensus 25 ls~~G~yqcLV~ 36 (74)
.+-+|.|+|.++
T Consensus 63 ~~~~g~y~C~a~ 74 (75)
T PF13927_consen 63 RSDNGTYTCIAS 74 (75)
T ss_dssp GGGTEEEEEEEE
T ss_pred HHhCcEEEEEEE
Confidence 457788999875
No 58
>cd05867 Ig4_L1-CAM_like Fourth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig4_L1-CAM_like: fourth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Probab=32.56 E-value=29 Score=19.80 Aligned_cols=13 Identities=15% Similarity=0.138 Sum_probs=9.9
Q ss_pred ccccceeEEEEec
Q psy1572 26 HSTGQFQALVQLA 38 (74)
Q Consensus 26 s~~G~yqcLV~LS 38 (74)
.-+|.|+|.++=+
T Consensus 51 ~D~G~Y~C~a~N~ 63 (76)
T cd05867 51 SDTAVYQCEARNR 63 (76)
T ss_pred hhCEEEEEEEECC
Confidence 4579999998643
No 59
>cd05895 Ig_Pro_neuregulin-1 Immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. Ig_Pro_neuregulin-1: immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. There are many NRG-1 isoforms which arise from the alternative splicing of mRNA. NRG-1 belongs to the neuregulin gene family, which is comprised of four genes. This group represents NRG-1. NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, and heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. The NRG-1 protein binds to and activates the tyrosine kinases receptors ErbB3 and ErbB4, initiating signaling cascades. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitters receptors; in the peripheral nervous system NRG-1 regulates process
Probab=32.54 E-value=29 Score=19.62 Aligned_cols=12 Identities=25% Similarity=0.722 Sum_probs=9.7
Q ss_pred eccccceeEEEE
Q psy1572 25 LHSTGQFQALVQ 36 (74)
Q Consensus 25 ls~~G~yqcLV~ 36 (74)
..-+|.|.|.++
T Consensus 53 ~~DsG~Y~C~a~ 64 (76)
T cd05895 53 LADNGEYKCMVS 64 (76)
T ss_pred cccCEEEEEEEE
Confidence 456799999986
No 60
>KOG1817|consensus
Probab=32.44 E-value=48 Score=28.46 Aligned_cols=24 Identities=33% Similarity=0.461 Sum_probs=22.0
Q ss_pred eecCCCCHHHHHHHHHHHHHHHHH
Q psy1572 45 CHGQGKTQQEAKTVAAHNAIEYLR 68 (74)
Q Consensus 45 CHGsG~S~~eA~~~AA~NALqyLk 68 (74)
=-|.|++..+|--.||-+||+-++
T Consensus 479 at~~G~nik~Ae~rAA~~ALe~~~ 502 (533)
T KOG1817|consen 479 ATGVGSNIKQAEMRAAMQALENLK 502 (533)
T ss_pred eeccCchHhHHHHHHHHHHHHHHH
Confidence 358999999999999999999887
No 61
>cd05722 Ig1_Neogenin First immunoglobulin (Ig)-like domain in neogenin and similar proteins. Ig1_Neogenin: first immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC, which is deleted in colorectal carcinoma . DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain.
Probab=31.86 E-value=29 Score=20.52 Aligned_cols=13 Identities=23% Similarity=0.332 Sum_probs=10.3
Q ss_pred ccccceeEEEEec
Q psy1572 26 HSTGQFQALVQLA 38 (74)
Q Consensus 26 s~~G~yqcLV~LS 38 (74)
.-.|.|+|.++=.
T Consensus 69 ~D~G~Y~C~a~N~ 81 (95)
T cd05722 69 PDEGFYQCVAQND 81 (95)
T ss_pred CcCEEEEEEEECC
Confidence 4579999998754
No 62
>cd05849 Ig1_Contactin-1 First Ig domain of contactin-1. Ig1_Contactin-1: First Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may, through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=31.60 E-value=30 Score=21.15 Aligned_cols=11 Identities=18% Similarity=0.616 Sum_probs=9.5
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 68 ~D~G~Y~C~A~ 78 (93)
T cd05849 68 KDAGRYVCIVS 78 (93)
T ss_pred CCCEEEEEEEE
Confidence 57899999987
No 63
>cd04970 Ig6_Contactin_like Sixth Ig domain of contactin. Ig6_Contactin_like: Sixth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=31.27 E-value=31 Score=20.01 Aligned_cols=11 Identities=18% Similarity=0.492 Sum_probs=9.1
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 57 ~D~G~Y~C~a~ 67 (85)
T cd04970 57 KHAGKYTCTAQ 67 (85)
T ss_pred HhCeeeEEEEe
Confidence 45799999986
No 64
>cd05740 Ig_CEACAM_D4 Fourth immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM). Ig_CEACAM_D4: immunoglobulin (Ig)-like domain 4 in carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) protein subfamily. The CEA family is a group of anchored or secreted glycoproteins, expressed by epithelial cells, leukocytes, endothelial cells and placenta. The CEA family is divided into the CEACAM and pregnancy-specific glycoprotein (PSG) subfamilies. This group represents the CEACAM subfamily. CEACAM1 has many important cellular functions, it is a cell adhesion molecule, and a signaling molecule that regulates the growth of tumor cells, it is an angiogenic factor, and is a receptor for bacterial and viral pathogens, including mouse hepatitis virus (MHV). In mice, four isoforms of CEACAM1 generated by alternative splicing have either two [D1, D4] or four [D1-D4] Ig-like domains on the cell surface. This family corresponds to the
Probab=31.10 E-value=30 Score=21.02 Aligned_cols=11 Identities=27% Similarity=0.498 Sum_probs=8.8
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.+.
T Consensus 65 ~D~G~Y~C~a~ 75 (91)
T cd05740 65 SDTGHYQCEAS 75 (91)
T ss_pred hhCEEEEEEEE
Confidence 35699999985
No 65
>cd05857 Ig2_FGFR Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. Ig2_FGFR: second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. FGF receptors bind FGF signaling polypeptides. FGFs participate in multiple processes such as morphogenesis, development, and angiogenesis. FGFs bind to four FGF receptor tyrosine kinases (FGFR1, -2, -3, -4). Receptor diversity is controlled by alternative splicing producing splice variants with different ligand binding characteristics and different expression patterns. FGFRs have an extracellular region comprised of three IG-like domains, a single transmembrane helix, and an intracellular tyrosine kinase domain. Ligand binding and specificity reside in the Ig-like domains 2 and 3, and the linker region that connects these two. FGFR activation and signaling depend on FGF-induced dimerization, a process involving cell surface heparin or heparin sulfate proteoglycans.
Probab=31.00 E-value=32 Score=19.57 Aligned_cols=11 Identities=18% Similarity=0.613 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 61 ~D~G~Y~C~a~ 71 (85)
T cd05857 61 SDKGNYTCVVE 71 (85)
T ss_pred ccCEEEEEEEE
Confidence 35799999987
No 66
>cd02649 nuc_hydro_CeIAG nuc_hydro_CeIAG: Nucleoside hydrolases similar to the inosine-adenosine-guanosine-preferring nucleoside hydrolase from Caenorhabditis elegans. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains eukaryotic, bacterial and archeal proteins similar to the purine-preferring nucleoside hydrolase (IAG-NH) from C. elegans and the salivary purine nucleosidase from Aedes aegypti. C. elegans IAG-NH exhibits a high affinity for the substrate analogue p-nitrophenylriboside (p-NPR).
Probab=30.66 E-value=69 Score=24.15 Aligned_cols=26 Identities=19% Similarity=0.340 Sum_probs=18.4
Q ss_pred ceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572 42 VAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK 72 (74)
Q Consensus 42 ~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~ 72 (74)
+++++|... .+++++||+..|..|.+
T Consensus 31 It~v~GN~~-----~~~~~~na~~~l~~~g~ 56 (306)
T cd02649 31 ITCVHGNTN-----VEQVVKNALRVLEACGR 56 (306)
T ss_pred EEEccCCcC-----HHHHHHHHHHHHHHhCC
Confidence 445666443 45679999999998864
No 67
>PF02464 CinA: Competence-damaged protein; InterPro: IPR008136 CinA is the first gene in the competence-inducible (cin) operon, and is thought to be specifically required at some stage in the process of transformation []. This is a C-terminal region of putative competence-damaged proteins from the cin operon.; PDB: 2A9S_A.
Probab=30.35 E-value=1.8e+02 Score=19.88 Aligned_cols=39 Identities=21% Similarity=0.217 Sum_probs=23.8
Q ss_pred ceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572 30 QFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRL 69 (74)
Q Consensus 30 ~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLki 69 (74)
-|.++..=.-.-+...+-.| ++++-+..|+..||++|..
T Consensus 114 v~iai~~~~~~~~~~~~~~g-~R~~ir~~~~~~al~~L~~ 152 (154)
T PF02464_consen 114 VYIAIAYRGGVWVREVQFPG-DREEIRQRAVNQALDLLRR 152 (154)
T ss_dssp EEEEEEETTSEEEEEEE--S--HHHHHHHHHHHHHHHHH-
T ss_pred EEEEEEeCCceEEEEEECCC-CHHHHHHHHHHHHHHHHHh
Confidence 34444433332233445556 9999999999999999875
No 68
>PRK02240 GTP cyclohydrolase III; Provisional
Probab=29.61 E-value=63 Score=25.05 Aligned_cols=25 Identities=28% Similarity=0.380 Sum_probs=19.8
Q ss_pred CceEeecCCCCHHHHHHHHHHHHHHH
Q psy1572 41 PVAVCHGQGKTQQEAKTVAAHNAIEY 66 (74)
Q Consensus 41 P~~VCHGsG~S~~eA~~~AA~NALqy 66 (74)
++.+|=|.|.|+.+|...|. .+||.
T Consensus 83 ~ls~GIG~g~TP~eA~~~At-~~Le~ 107 (254)
T PRK02240 83 TVSMGIGTAETPYEAQKLAT-EALQE 107 (254)
T ss_pred ceEEEEEecCCHHHHHHHHH-HHHHH
Confidence 55999999999999997764 35554
No 69
>cd04971 Ig_TrKABC_d5 Fifth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d5: the fifth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrkB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor foun
Probab=29.33 E-value=36 Score=20.39 Aligned_cols=10 Identities=10% Similarity=0.491 Sum_probs=8.4
Q ss_pred cccceeEEEE
Q psy1572 27 STGQFQALVQ 36 (74)
Q Consensus 27 ~~G~yqcLV~ 36 (74)
-+|.|.|.++
T Consensus 57 D~G~YtC~A~ 66 (81)
T cd04971 57 NNGNYTLVAS 66 (81)
T ss_pred cCeEEEEEEE
Confidence 4699999986
No 70
>cd05876 Ig3_L1-CAM Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig3_L1-CAM: third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Probab=29.23 E-value=35 Score=19.30 Aligned_cols=10 Identities=10% Similarity=0.448 Sum_probs=8.4
Q ss_pred cccceeEEEE
Q psy1572 27 STGQFQALVQ 36 (74)
Q Consensus 27 ~~G~yqcLV~ 36 (74)
-+|.|.|.++
T Consensus 47 D~G~Y~C~a~ 56 (71)
T cd05876 47 DDGEYVCTAE 56 (71)
T ss_pred hCEEEEEEEE
Confidence 5699999876
No 71
>cd05731 Ig3_L1-CAM_like Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig3_L1-CAM_like: domain similar to the third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM and human neurofascin.
Probab=28.77 E-value=38 Score=18.44 Aligned_cols=12 Identities=8% Similarity=0.343 Sum_probs=9.0
Q ss_pred ccccceeEEEEe
Q psy1572 26 HSTGQFQALVQL 37 (74)
Q Consensus 26 s~~G~yqcLV~L 37 (74)
.-+|.|.|.+.=
T Consensus 46 ~D~G~Y~C~a~N 57 (71)
T cd05731 46 EDDGEYRCTASN 57 (71)
T ss_pred HHCEEEEEEEEe
Confidence 346999998763
No 72
>cd05724 Ig2_Robo Second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Ig2_Robo: domain similar to the second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antago
Probab=28.22 E-value=38 Score=19.16 Aligned_cols=11 Identities=9% Similarity=0.480 Sum_probs=9.1
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-+|.|+|.++
T Consensus 61 ~D~G~Y~C~a~ 71 (86)
T cd05724 61 SDEGTYKCVAT 71 (86)
T ss_pred ccCEEEEEEEE
Confidence 46799999886
No 73
>smart00409 IG Immunoglobulin.
Probab=28.12 E-value=43 Score=17.38 Aligned_cols=11 Identities=9% Similarity=0.410 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|.|.+.
T Consensus 61 ~d~G~Y~C~v~ 71 (86)
T smart00409 61 EDSGTYTCAAT 71 (86)
T ss_pred ccCeEEEEEEE
Confidence 34599999998
No 74
>smart00410 IG_like Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
Probab=28.12 E-value=43 Score=17.38 Aligned_cols=11 Identities=9% Similarity=0.410 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|.|.+.
T Consensus 61 ~d~G~Y~C~v~ 71 (86)
T smart00410 61 EDSGTYTCAAT 71 (86)
T ss_pred ccCeEEEEEEE
Confidence 34599999998
No 75
>cd07693 Ig1_Robo First immunoglobulin (Ig)-like domain in Robo (roundabout) receptors and similar proteins. Ig1_Robo: domain similar to the first immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit res
Probab=28.03 E-value=38 Score=19.21 Aligned_cols=12 Identities=8% Similarity=0.288 Sum_probs=10.2
Q ss_pred eccccceeEEEE
Q psy1572 25 LHSTGQFQALVQ 36 (74)
Q Consensus 25 ls~~G~yqcLV~ 36 (74)
..-.|.|+|.++
T Consensus 73 ~~D~G~Y~C~a~ 84 (100)
T cd07693 73 RSDEGVYVCVAH 84 (100)
T ss_pred cCcCEEEEEEEE
Confidence 567899999986
No 76
>cd05745 Ig3_Peroxidasin Third immunoglobulin (Ig)-like domain of peroxidasin. Ig3_Peroxidasin: the third immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells which have undergone programmed cell death, and protection of the organism against non-self.
Probab=27.95 E-value=39 Score=19.15 Aligned_cols=11 Identities=27% Similarity=0.610 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 50 ~D~G~Y~C~A~ 60 (74)
T cd05745 50 HDQGQYECQAV 60 (74)
T ss_pred HhCEEEEEEEE
Confidence 45799999986
No 77
>PF00047 ig: Immunoglobulin domain The Prosite family only concerns antibodies and MHCs.; InterPro: IPR013151 Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involved in protein-protein and protein-ligand interactions. The Pfam alignments do not include the first and last strand of the immunoglobulin-like domain.; PDB: 1B6U_A 3O4O_C 3VH8_H 1BIH_A 2C9A_A 2V5Y_A 1BQH_K 2ATP_A 3B9K_A 1NEZ_H ....
Probab=27.80 E-value=35 Score=18.25 Aligned_cols=9 Identities=22% Similarity=0.719 Sum_probs=7.5
Q ss_pred cccceeEEE
Q psy1572 27 STGQFQALV 35 (74)
Q Consensus 27 ~~G~yqcLV 35 (74)
-.|.|.|.|
T Consensus 56 d~G~Y~C~v 64 (64)
T PF00047_consen 56 DSGTYTCVV 64 (64)
T ss_dssp GTEEEEEEE
T ss_pred HCEEEEEEC
Confidence 579999976
No 78
>cd05763 Ig_1 Subgroup of the immunoglobulin (Ig) superfamily. Ig_1: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=27.75 E-value=39 Score=19.05 Aligned_cols=11 Identities=27% Similarity=0.480 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-+|.|+|.++
T Consensus 50 ~D~G~Y~C~A~ 60 (75)
T cd05763 50 EDTGVYSCTAQ 60 (75)
T ss_pred ccCEEEEEEEE
Confidence 45799999985
No 79
>TIGR02296 HpaC 4-hydroxyphenylacetate 3-monooxygenase, reductase component. These reductases catalyze the reduction of free flavins by NADPH. The flavin is then utilized by the large subunit of the monooxygenase.
Probab=27.64 E-value=94 Score=21.07 Aligned_cols=40 Identities=25% Similarity=0.294 Sum_probs=27.0
Q ss_pred eeeeEEeeeCCCCCceeeeccccceeEEEEeccCCceEeecCCC
Q psy1572 7 LDTRVCLISTNGSNSIAFLHSTGQFQALVQLATYPVAVCHGQGK 50 (74)
Q Consensus 7 ~~~~v~~~~~~~~n~ieels~~G~yqcLV~LST~P~~VCHGsG~ 50 (74)
|-+-|+++.+.++...-.+|++ .+.++|.+||.|.-....
T Consensus 8 ~a~gV~VVT~~~~~~~~G~Tvs----s~~svS~~PP~v~v~l~~ 47 (154)
T TIGR02296 8 LAAAVNIITTNGIAGKCGITAT----AVCSVTDTPPTVMVCINR 47 (154)
T ss_pred cCCceEEEEecCCCCceeEEEE----eEEEeecCCCEEEEEECC
Confidence 4456778877655555666664 478999999877654443
No 80
>KOG4528|consensus
Probab=27.48 E-value=64 Score=24.03 Aligned_cols=23 Identities=30% Similarity=0.341 Sum_probs=20.0
Q ss_pred eecCCCCHHHHHHHHHHHHHHHH
Q psy1572 45 CHGQGKTQQEAKTVAAHNAIEYL 67 (74)
Q Consensus 45 CHGsG~S~~eA~~~AA~NALqyL 67 (74)
-|+.|+|.+||-+.++....-|.
T Consensus 42 lH~wgstl~EafE~c~~aMfgym 64 (168)
T KOG4528|consen 42 LHAWGSTLEEAFEQCAVAMFGYM 64 (168)
T ss_pred EEecccchHHHHHHHHHHHHhhe
Confidence 49999999999999998877663
No 81
>PF07897 DUF1675: Protein of unknown function (DUF1675); InterPro: IPR012463 The members of this family are sequences derived from hypothetical plant proteins of unknown function. One member of this family (Q9SFV5 from SWISSPROT) is annotated as a putative RNA-binding protein, but no evidence was found to support this.
Probab=27.13 E-value=45 Score=26.08 Aligned_cols=18 Identities=39% Similarity=0.473 Sum_probs=14.7
Q ss_pred ceEeecCCCCHHHHHHHH
Q psy1572 42 VAVCHGQGKTQQEAKTVA 59 (74)
Q Consensus 42 ~~VCHGsG~S~~eA~~~A 59 (74)
|.||||+=-|.+|=-.+|
T Consensus 254 ~c~chg~~~~~~efv~h~ 271 (284)
T PF07897_consen 254 VCVCHGSFLSPAEFVKHA 271 (284)
T ss_pred EEEecCCCCCHHHHHHhc
Confidence 689999999998865554
No 82
>cd00455 nuc_hydro nuc_hydro: Nucleoside hydrolases. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains eukaryotic, bacterial and archeal proteins similar to the inosine-uridine preferring nucleoside hydrolase from Crithidia fasciculata, the xanthosine-inosine-uridine-adenosine-preferring nucleoside hydrolase RihC from Salmonella enterica serovar Typhimurium, the purine-specific inosine-adenosine-guanosine-preferring nucleoside hydrolase from Trypanosoma vivax and, pyrimidine-specific uridine-cytidine preferring nucleoside hydrolases such as URH1 from Saccharomyces cerevisiae, RihA and RihB from Escherichia coli. Nucleoside hydrolases are of interest as a target for antiprotozoan drugs as, no nucleoside hydrolase activity or genes encoding these enzymes have been detected in humans and, parasitic protozoans lack de novo purine synthesis relying on nucleosid
Probab=26.88 E-value=66 Score=23.77 Aligned_cols=27 Identities=22% Similarity=0.238 Sum_probs=19.4
Q ss_pred CceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572 41 PVAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK 72 (74)
Q Consensus 41 P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~ 72 (74)
=++++||... -+++++||+..|..|.+
T Consensus 28 gIt~~~Gn~~-----~~~~~~n~~~~l~~~g~ 54 (295)
T cd00455 28 GIVATYGNVT-----LEQATQNAAYLLELLGR 54 (295)
T ss_pred EEEeccCCcc-----HHHHHHHHHHHHHHhCC
Confidence 3556776553 26789999999998864
No 83
>cd07706 IgV_TCR_delta Immunoglobulin (Ig) variable (V) domain of T-cell receptor (TCR) delta chain. IgV_TCR_delta: immunoglobulin (Ig) variable (V) domain of the delta chain of gamma/delta T-cell receptors (TCRs). TCRs mediate antigen recognition by T lymphocytes, and are heterodimers consisting of alpha and beta chains or gamma and delta chains. Each chain contains a variable (V) and a constant (C) region. The majority of T cells contain alpha/beta TCRs but a small subset contain gamma/delta TCRs. Alpha/beta TCRs recognize antigen as peptide fragments presented by major histocompatibility complex (MHC) molecules. Gamma/delta TCRs recognize intact protein antigens; they recognize protein antigens directly and without antigen processing, and MHC independently of the bound peptide. Gamma/delta T cells can also be stimulated by non-peptide antigens such as small phosphate- or amine-containing compounds. The variable domain of gamma/delta TCRs is responsible for antigen recognition and is
Probab=26.81 E-value=58 Score=20.01 Aligned_cols=28 Identities=11% Similarity=0.214 Sum_probs=18.4
Q ss_pred eeccccceeEEEEeccC----CceEeecCCCC
Q psy1572 24 FLHSTGQFQALVQLATY----PVAVCHGQGKT 51 (74)
Q Consensus 24 els~~G~yqcLV~LST~----P~~VCHGsG~S 51 (74)
+.+=+|.|.|-+...-. +-.++.|.|..
T Consensus 80 ~~~Dsg~Y~C~~~~~~~~~~~~~~~~fg~gt~ 111 (116)
T cd07706 80 QLEDSAKYFCALSTLGMGSYDTDKLIFGKGTR 111 (116)
T ss_pred CHHHCEEEEEEEECCCCCcccccCCEECCCeE
Confidence 34567999999984211 22467888864
No 84
>PHA02447 hypothetical protein
Probab=26.81 E-value=36 Score=22.67 Aligned_cols=45 Identities=29% Similarity=0.271 Sum_probs=40.3
Q ss_pred eeeeccccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHH
Q psy1572 22 IAFLHSTGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEY 66 (74)
Q Consensus 22 ieels~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqy 66 (74)
|.--.+.|+|.--|+=.|+.-..--|--.|.+.|...||-.|-+|
T Consensus 35 inpsdvagdfrwsvedntcarvlawgyeetldaamaaaaaaaaey 79 (86)
T PHA02447 35 INPSDVAGDFRWSVEDNTCARVLAWGYEETLDAAMAAAAAAAAEY 79 (86)
T ss_pred cChhhcccceeeeeccchHHHHHHhhhHHHHHHHHHHHHHHHHHH
Confidence 444568899999999999999999999999999999999999998
No 85
>COG2820 Udp Uridine phosphorylase [Nucleotide transport and metabolism]
Probab=26.57 E-value=37 Score=26.46 Aligned_cols=25 Identities=36% Similarity=0.658 Sum_probs=19.9
Q ss_pred ceeeeccccceeEEEEeccCCceEe-ecCCCC
Q psy1572 21 SIAFLHSTGQFQALVQLATYPVAVC-HGQGKT 51 (74)
Q Consensus 21 ~ieels~~G~yqcLV~LST~P~~VC-HGsG~S 51 (74)
+.|+.+..|.|- ..|+.|| ||.|.-
T Consensus 45 ~Ref~~~~g~~~------g~~v~v~StGIGgP 70 (248)
T COG2820 45 NREFRTYTGTYN------GKPVTVCSTGIGGP 70 (248)
T ss_pred ccceEEEEEEEc------CeEEEEEecCCCCc
Confidence 567888888765 8999999 898854
No 86
>cd08342 HPPD_N_like N-terminal domain of 4-hydroxyphenylpyruvate dioxygenase (HPPD) and hydroxymandelate Synthase (HmaS). HppD and HmaS are non-heme iron-dependent dioxygenases, which modify a common substrate, 4-hydroxyphenylpyruvate (HPP), but yield different products. HPPD catalyzes the second reaction in tyrosine catabolism, the conversion of HPP to homogentisate (2,5-dihydroxyphenylacetic acid, HG). HmaS converts HPP to 4-hydroxymandelate, a committed step in the formation of hydroxyphenylglycerine, a structural component of nonproteinogenic macrocyclic peptide antibiotics, such as vancomycin. If the emphasis is on catalytic chemistry, HPPD and HmaS are classified as members of a large family of alpha-keto acid dependent mononuclear non-heme iron oxygenases most of which require Fe(II), molecular oxygen, and an alpha-keto acid (typically alpha-ketoglutarate) to either oxygenate or oxidize a third substrate. Both enzymes are exceptions in that they require two, instead of three, su
Probab=26.46 E-value=72 Score=19.94 Aligned_cols=23 Identities=13% Similarity=0.312 Sum_probs=18.9
Q ss_pred eeeEEeeeCCCCCceeeeccccc
Q psy1572 8 DTRVCLISTNGSNSIAFLHSTGQ 30 (74)
Q Consensus 8 ~~~v~~~~~~~~n~ieels~~G~ 30 (74)
..+++|+..|.||.|+....+|.
T Consensus 104 ~~~~~~i~dp~G~~ie~~~~~~~ 126 (136)
T cd08342 104 ELKIAAIKGYGDSLHTLVDRKGY 126 (136)
T ss_pred eEEEEEEeccCCcEEEEEecCCC
Confidence 56899999999999998765553
No 87
>cd04978 Ig4_L1-NrCAM_like Fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). Ig4_L1-NrCAM_like: fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). These proteins belong to the L1 subfamily of cell adhesion molecules (CAMs) and are comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. These molecules are primarily expressed in the nervous system. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.
Probab=26.26 E-value=44 Score=18.50 Aligned_cols=11 Identities=18% Similarity=0.268 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 51 ~D~G~Y~C~A~ 61 (76)
T cd04978 51 NDTAVYQCNAS 61 (76)
T ss_pred hhCEEEEEEEE
Confidence 46799999885
No 88
>cd05737 Ig_Myomesin_like_C C-temrinal immunoglobulin (Ig)-like domain of myomesin and M-protein. Ig_Myomesin_like_C: domain similar to the C-temrinal immunoglobulin (Ig)-like domain of myomesin and M-protein. Myomesin and M-protein are both structural proteins localized to the M-band, a transverse structure in the center of the sarcomere, and are candidates for M-band bridges. Both proteins are modular, consisting mainly of repetitive Ig-like and fibronectin type III (FnIII) domains. Myomesin is expressed in all types of vertebrate striated muscle; M-protein has a muscle-type specific expression pattern. Myomesin is present in both slow and fast fibers; M-protein is present only in fast fibers. It has been suggested that myomesin acts as a molecular spring with alternative splicing as a means of modifying its elasticity.
Probab=25.56 E-value=44 Score=19.78 Aligned_cols=11 Identities=18% Similarity=0.622 Sum_probs=9.1
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-+|.|+|.++
T Consensus 68 ~D~G~Y~C~a~ 78 (92)
T cd05737 68 EDSGKYGIVVK 78 (92)
T ss_pred hhCEEEEEEEE
Confidence 45799999886
No 89
>cd05727 Ig2_Contactin-2-like Second Ig domain of the neural cell adhesion molecule contactin-2 and similar proteins. Ig2_Contactin-2-like: second Ig domain of the neural cell adhesion molecule contactin-2. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (aliases TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. The first four Ig domains form the intermolecular binding fragment which arranges as a compact U-shaped module by contacts between Ig domains 1 and 4, and domains 2 and 3. It has been proposed that a linear zipper-like array forms, from contactin-2 molecules alternatively provided by the two apposed membranes.
Probab=25.55 E-value=78 Score=20.41 Aligned_cols=26 Identities=15% Similarity=0.278 Sum_probs=15.9
Q ss_pred eeeCCCCCce---eeeccccceeEEEEec
Q psy1572 13 LISTNGSNSI---AFLHSTGQFQALVQLA 38 (74)
Q Consensus 13 ~~~~~~~n~i---eels~~G~yqcLV~LS 38 (74)
+++.+.||-+ -+.+=.|.|+|.++-.
T Consensus 54 ~~~~~~G~L~fs~v~~~D~g~Y~C~A~n~ 82 (96)
T cd05727 54 FVSQTNGNLYIAKVEASDRGNYSCFVSSP 82 (96)
T ss_pred EEeCCCCcEEEeecCHhhCceeEEEEEec
Confidence 3444555532 3344569999999754
No 90
>PF00388 PI-PLC-X: Phosphatidylinositol-specific phospholipase C, X domain This entry is for the whole phospholipase C protein; InterPro: IPR000909 Phosphatidylinositol-specific phospholipase C (3.1.4.11 from EC), a eukaryotic intracellular enzyme, plays an important role in signal transduction processes []. It catalyzes the hydrolysis of 1-phosphatidyl-D-myo-inositol-3,4,5-triphosphate into the second messenger molecules diacylglycerol and inositol-1,4,5-triphosphate. This catalytic process is tightly regulated by reversible phosphorylation and binding of regulatory proteins [, , ]. In mammals, there are at least 6 different isoforms of PI-PLC, they differ in their domain structure, their regulation, and their tissue distribution. Lower eukaryotes also possess multiple isoforms of PI-PLC. All eukaryotic PI-PLCs contain two regions of homology, sometimes referred to as the 'X-box' and 'Y-box'. The order of these two regions is always the same (NH2-X-Y-COOH), but the spacing is variable. In most isoforms, the distance between these two regions is only 50-100 residues but in the gamma isoforms one PH domain, two SH2 domains, and one SH3 domain are inserted between the two PLC-specific domains. The two conserved regions have been shown to be important for the catalytic activity. By profile analysis, we could show that sequences with significant similarity to the X-box domain occur also in prokaryotic and trypanosome PI-specific phospholipases C. Apart from this region, the prokaryotic enzymes show no similarity to their eukaryotic counterparts.; GO: 0004629 phospholipase C activity, 0006629 lipid metabolic process, 0035556 intracellular signal transduction; PDB: 2FJU_B 2ZKM_X 3V18_A 3V1H_A 3V16_A 3QR1_D 3EA3_A 3EA1_A 2OR2_A 1T6M_B ....
Probab=25.50 E-value=78 Score=20.62 Aligned_cols=7 Identities=43% Similarity=0.738 Sum_probs=5.1
Q ss_pred ceEeecC
Q psy1572 42 VAVCHGQ 48 (74)
Q Consensus 42 ~~VCHGs 48 (74)
+.||||.
T Consensus 55 ~~v~Hg~ 61 (146)
T PF00388_consen 55 LVVYHGI 61 (146)
T ss_dssp EEEEETT
T ss_pred eEEEeCC
Confidence 6777776
No 91
>cd05748 Ig_Titin_like Immunoglobulin (Ig)-like domain of titin and similar proteins. Ig_Titin_like: immunoglobulin (Ig)-like domain found in titin-like proteins. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic, depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone. It appears to function similarly to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching. Within the sarcomere, titin is also attached to or is associated with myosin binding protein C (MyBP-C). MyBP-C appears to contribute to the generation of passive tension by titin, and similar to titin has repeated Ig-like and FN-
Probab=25.45 E-value=46 Score=18.59 Aligned_cols=11 Identities=9% Similarity=0.480 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-+|.|.|.++
T Consensus 50 ~D~G~Y~C~a~ 60 (74)
T cd05748 50 SDSGKYTLTLK 60 (74)
T ss_pred CcCEEEEEEEE
Confidence 46799999886
No 92
>cd04974 Ig3_FGFR Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig3_FGFR: third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs.
Probab=25.41 E-value=45 Score=19.80 Aligned_cols=11 Identities=18% Similarity=0.561 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.=.|.|+|.++
T Consensus 65 ~D~G~Y~C~A~ 75 (90)
T cd04974 65 DDAGEYTCLAG 75 (90)
T ss_pred ccCcEEEEEee
Confidence 35799999986
No 93
>cd05736 Ig2_Follistatin_like Second immunoglobulin (Ig)-like domain of a follistatin-like molecule encoded by the Mahya gene and similar proteins. Ig2_Follistatin_like: domain similar to the second immunoglobulin (Ig)-like domain found in a follistatin-like molecule encoded by the CNS-related Mahya gene. Mahya genes have been retained in certain Bilaterian branches during evolution. They are conserved in Hymenoptera and Deuterostomes, but are absent from other metazoan species such as fruit fly and nematode. Mahya proteins are secretory, with a follistatin-like domain (Kazal-type serine/threonine protease inhibitor domain and EF-hand calcium-binding domain), two Ig-like domains, and a novel C-terminal domain. Mahya may be involved in learning and memory and in processing of sensory information in Hymenoptera and vertebrates. Follistatin is a secreted, multidomain protein that binds activins with high affinity and antagonizes their signaling.
Probab=25.37 E-value=46 Score=18.72 Aligned_cols=11 Identities=18% Similarity=0.600 Sum_probs=9.1
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-+|.|.|.++
T Consensus 49 ~D~G~Y~C~a~ 59 (76)
T cd05736 49 EDTGAYTCIAK 59 (76)
T ss_pred ccCEEEEEEEE
Confidence 46799999886
No 94
>TIGR00702 uncharacterized domain. This uncharacterized domain comprises the whole of a protein in Methanococcus jannaschii and Methanobacterium thermoautotrophicum, all but the N-terminal 60 residues from a protein of Mycobacterium tuberculosis, and all but the C-terminal 180 residues from a protein in Haemophilus influenzae and Escherichia coli, among proteins from published complete genomes.
Probab=25.07 E-value=94 Score=24.09 Aligned_cols=26 Identities=23% Similarity=0.415 Sum_probs=23.9
Q ss_pred CceEeecCCCCHHHHHHHHHHHHHHH
Q psy1572 41 PVAVCHGQGKTQQEAKTVAAHNAIEY 66 (74)
Q Consensus 41 P~~VCHGsG~S~~eA~~~AA~NALqy 66 (74)
|..|..|-|.|.++|+-.|.--|+|-
T Consensus 47 ~~~~~~GKG~t~~~A~~SAlmEaiER 72 (377)
T TIGR00702 47 ALSISNGKGASKKAALASALMEAFER 72 (377)
T ss_pred CceeccCCCCCHHHHHHHHHHHHHHH
Confidence 78899999999999999999999883
No 95
>TIGR03266 methan_mark_1 putative methanogenesis marker protein 1. Members of this protein family represent a distinct clade among the larger set of proteins that belong to families TIGR00702 and pfam02624. Proteins from this clade are found in genome sequence if and only if the species sequenced is one of the methanogens. All methanogens belong to the archaea; some but not all of those sequenced are hyperthermophiles. This protein family was detected by the method of partial phylogenetic profiling (see Haft, et al., 2006).
Probab=24.80 E-value=1e+02 Score=23.89 Aligned_cols=24 Identities=29% Similarity=0.509 Sum_probs=22.7
Q ss_pred ceEeecCCCCHHHHHHHHHHHHHH
Q psy1572 42 VAVCHGQGKTQQEAKTVAAHNAIE 65 (74)
Q Consensus 42 ~~VCHGsG~S~~eA~~~AA~NALq 65 (74)
++|+-|-|.|.++|+-.|--.|+|
T Consensus 52 ~~v~~GKG~t~~~A~~sAlmEaiE 75 (376)
T TIGR03266 52 ISIYAGKGATEIQARVSAIMEAFE 75 (376)
T ss_pred eeEecCCCCCHHHHHHHHHHHHHH
Confidence 779999999999999999999998
No 96
>PRK15486 hpaC 4-hydroxyphenylacetate 3-monooxygenase reductase subunit; Provisional
Probab=24.59 E-value=1e+02 Score=21.63 Aligned_cols=39 Identities=21% Similarity=0.264 Sum_probs=25.9
Q ss_pred eeeeEEeeeCCCCCceeeeccccceeEEEEeccCCceEeecCC
Q psy1572 7 LDTRVCLISTNGSNSIAFLHSTGQFQALVQLATYPVAVCHGQG 49 (74)
Q Consensus 7 ~~~~v~~~~~~~~n~ieels~~G~yqcLV~LST~P~~VCHGsG 49 (74)
|-+-|+++.+..+...-.+|+| .+.++|.+||.|.-...
T Consensus 17 ~a~GV~VVTt~~~~~~~G~Tvs----s~~SvSldPPlvlv~l~ 55 (170)
T PRK15486 17 LSAAVNIVTTAGDAGRCGITAT----AVCSVTDTPPSVMVCIN 55 (170)
T ss_pred cCCceEEEEEecCCCcEEEEEE----EEEEeEcCCCEEEEEEC
Confidence 4566888876544333456655 47899999998765544
No 97
>KOG3075|consensus
Probab=24.50 E-value=1e+02 Score=24.28 Aligned_cols=30 Identities=37% Similarity=0.348 Sum_probs=25.9
Q ss_pred EEEEeccCCceEeecCCCCHHHHHHHHHHHHHH
Q psy1572 33 ALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIE 65 (74)
Q Consensus 33 cLV~LST~P~~VCHGsG~S~~eA~~~AA~NALq 65 (74)
-.+. ||.|.++ -+..+.+++..-||++|-+
T Consensus 8 ~~~~-s~~~~~~--~~~~~~e~~kr~Aa~~avd 37 (261)
T KOG3075|consen 8 GTGS-STSPRAV--ESLSPQEEAKRLAAYKAVD 37 (261)
T ss_pred eeec-cCCCCcc--cccCchHHHHHHHHhhhhh
Confidence 3455 6999999 8899999999999999975
No 98
>PRK06806 fructose-bisphosphate aldolase; Provisional
Probab=23.83 E-value=60 Score=24.54 Aligned_cols=17 Identities=29% Similarity=0.450 Sum_probs=13.8
Q ss_pred ceEeec-CCCCHHHHHHH
Q psy1572 42 VAVCHG-QGKTQQEAKTV 58 (74)
Q Consensus 42 ~~VCHG-sG~S~~eA~~~ 58 (74)
|.|.|| ||.+.++.+.-
T Consensus 202 PlV~hG~SGI~~e~~~~~ 219 (281)
T PRK06806 202 PLVLHGGSGISPEDFKKC 219 (281)
T ss_pred CEEEECCCCCCHHHHHHH
Confidence 579998 89999987653
No 99
>cd05712 Ig_Siglec_N Immunoglobulin (Ig) domain at the N terminus of Siglec (sialic acid-binding Ig-like lectins). Ig_Siglec_N: immunoglobulin (Ig) domain at the N terminus of Siglec (sialic acid-binding Ig-like lectins). Siglec refers to a structurally related protein family that specifically recognizes sialic acid in oligosaccharide chains of glycoproteins and glycolipids. Siglecs are type I transmembrane proteins, organized as an extracellular module composed of Ig-like domains (an N-terminal variable set of Ig-like carbohydrate recognition domains, and 1 to 16 constant Ig-like domains), followed by transmembrane and short cytoplasmic domains. Human siglecs are classified into two subgroups, one subgroup is comprised of sialoadhesin (Siglec-1), CD22 (Siglec-2), and MAG, the other subgroup is comprised of CD33-related Siglecs which include CD33 (Siglec-3) and human Siglecs 5-11.
Probab=23.64 E-value=98 Score=19.39 Aligned_cols=26 Identities=12% Similarity=0.074 Sum_probs=18.2
Q ss_pred eeccccceeEEEEeccCCceEeecCCC
Q psy1572 24 FLHSTGQFQALVQLATYPVAVCHGQGK 50 (74)
Q Consensus 24 els~~G~yqcLV~LST~P~~VCHGsG~ 50 (74)
+.+=+|.|.|.++.+. +...=.|.|.
T Consensus 88 ~~~Dsg~Y~C~~~~~~-~~~~~f~~gt 113 (119)
T cd05712 88 QPEDSGKYFFRVELGD-SNKYNFLKNT 113 (119)
T ss_pred ChhhCeEEEEEEEcCC-ccEEEecCCe
Confidence 3455799999999973 4455566664
No 100
>cd04972 Ig_TrkABC_d4 Fourth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d4: the fourth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrKB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor fo
Probab=23.61 E-value=51 Score=19.34 Aligned_cols=9 Identities=0% Similarity=0.003 Sum_probs=7.9
Q ss_pred ccceeEEEE
Q psy1572 28 TGQFQALVQ 36 (74)
Q Consensus 28 ~G~yqcLV~ 36 (74)
.|.|+|.++
T Consensus 68 ~g~Y~C~A~ 76 (90)
T cd04972 68 QTTVTCTAE 76 (90)
T ss_pred CEEEEEEEE
Confidence 699999886
No 101
>cd05752 Ig1_FcgammaR_like Frst immunoglobulin (Ig)-like domain of Fcgamma-receptors (FcgammaRs) and similar proteins. Ig1_FcgammaR_like: domain similar to the first immunoglobulin (Ig)-like domain of Fcgamma-receptors (FcgammaRs). Interactions between IgG and FcgammaR are important to the initiation of cellular and humoral response. IgG binding to FcgammaR leads to a cascade of signals and ultimately to functions such as antibody-dependent-cellular-cytotoxicity (ADCC), endocytosis, phagocytosis, release of inflammatory mediators, etc. FcgammaR has two Ig-like domains. This group also contains FcepsilonRI, which binds IgE with high affinity.
Probab=23.37 E-value=31 Score=20.60 Aligned_cols=12 Identities=17% Similarity=0.465 Sum_probs=9.7
Q ss_pred eccccceeEEEE
Q psy1572 25 LHSTGQFQALVQ 36 (74)
Q Consensus 25 ls~~G~yqcLV~ 36 (74)
..-+|.|+|-+.
T Consensus 55 ~~dsG~Y~C~a~ 66 (78)
T cd05752 55 NNDSGEYRCQTQ 66 (78)
T ss_pred ccCCEEeEEECC
Confidence 457899999874
No 102
>cd04969 Ig5_Contactin_like Fifth Ig domain of contactin. Ig5_Contactin_like: Fifth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=23.35 E-value=53 Score=18.44 Aligned_cols=11 Identities=9% Similarity=0.537 Sum_probs=8.8
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 49 ~D~G~Y~C~a~ 59 (73)
T cd04969 49 SDEGKYTCFAE 59 (73)
T ss_pred hHCEEEEEEEE
Confidence 35699999985
No 103
>cd05746 Ig4_Peroxidasin Fourth immunoglobulin (Ig)-like domain of peroxidasin. Ig4_Peroxidasin: the fourth immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted, and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells, which have undergone programmed cell death, and protection of the organism against non-self.
Probab=23.32 E-value=53 Score=18.39 Aligned_cols=10 Identities=10% Similarity=0.644 Sum_probs=8.2
Q ss_pred cccceeEEEE
Q psy1572 27 STGQFQALVQ 36 (74)
Q Consensus 27 ~~G~yqcLV~ 36 (74)
-.|.|.|.++
T Consensus 47 D~G~Y~C~a~ 56 (69)
T cd05746 47 DQGRYECVAR 56 (69)
T ss_pred hCEEEEEEEE
Confidence 4699999985
No 104
>cd05747 Ig5_Titin_like M5, fifth immunoglobulin (Ig)-like domain of human titin C terminus and similar proteins. Ig5_Titin_like: domain similar to the M5, fifth immunoglobulin (Ig)-like domain from the human titin C terminus. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic; depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone, and appears to function similar to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching.
Probab=23.15 E-value=53 Score=19.36 Aligned_cols=11 Identities=18% Similarity=0.595 Sum_probs=8.9
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
+-.|.|+|.++
T Consensus 69 ~D~G~Y~C~a~ 79 (92)
T cd05747 69 SDEGNYTVVVE 79 (92)
T ss_pred ccCEeEEEEEE
Confidence 56799999865
No 105
>cd04973 Ig1_FGFR First immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig1_FGFR: The first immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for all FGFs.
Probab=23.13 E-value=51 Score=19.09 Aligned_cols=12 Identities=8% Similarity=0.354 Sum_probs=9.5
Q ss_pred eccccceeEEEE
Q psy1572 25 LHSTGQFQALVQ 36 (74)
Q Consensus 25 ls~~G~yqcLV~ 36 (74)
..-+|.|+|.+.
T Consensus 54 ~~DsG~Y~C~a~ 65 (79)
T cd04973 54 PRDSGLYACVTS 65 (79)
T ss_pred hhhCEEEEEEEe
Confidence 346899999985
No 106
>cd05856 Ig2_FGFRL1-like Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal peptide, three extracellular Ig-like modules, a transmembrane segment, and a short intracellular domain. FGFRL1 is expressed preferentially in skeletal tissues. Similar to FGF receptors, the expressed protein interacts specifically with heparin and with FGF2. FGFRL1 does not have a protein tyrosine kinase domain at its C terminus; neither does its cytoplasmic domain appear to interact with a signaling partner. It has been suggested that FGFRL1 may not have any direct signaling function, but instead acts as a decoy receptor trapping FGFs and preventing them from binding other receptors.
Probab=23.00 E-value=55 Score=18.05 Aligned_cols=11 Identities=18% Similarity=0.540 Sum_probs=8.9
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-+|.|.|.++
T Consensus 58 ~D~G~Y~C~a~ 68 (82)
T cd05856 58 EDSGKYTCHVS 68 (82)
T ss_pred hhCEEEEEEEE
Confidence 35799999886
No 107
>PF13827 DUF4189: Domain of unknown function (DUF4189)
Probab=22.95 E-value=83 Score=19.46 Aligned_cols=18 Identities=39% Similarity=0.446 Sum_probs=13.9
Q ss_pred eecCCCCHHHHHHHHHHH
Q psy1572 45 CHGQGKTQQEAKTVAAHN 62 (74)
Q Consensus 45 CHGsG~S~~eA~~~AA~N 62 (74)
=-|.|+|.++|+++|-.+
T Consensus 66 ~~~~~~t~~~A~~~Al~~ 83 (100)
T PF13827_consen 66 GWGTGPTREEAEQDALAN 83 (100)
T ss_pred EEecCCCHHHHHHHHHHH
Confidence 678899999888776543
No 108
>PRK01022 hypothetical protein; Provisional
Probab=22.72 E-value=45 Score=23.89 Aligned_cols=20 Identities=15% Similarity=0.213 Sum_probs=17.4
Q ss_pred eCCCCCceeeeccccceeEEEE
Q psy1572 15 STNGSNSIAFLHSTGQFQALVQ 36 (74)
Q Consensus 15 ~~~~~n~ieels~~G~yqcLV~ 36 (74)
..|||..|++ -+|.|.|-|.
T Consensus 126 ~FPGG~~V~~--~~g~yYCPVk 145 (167)
T PRK01022 126 PFPGGFKVEE--KNGVYYCPVK 145 (167)
T ss_pred CCCCCeEEEe--ECCEEeCccc
Confidence 3699999999 8999999874
No 109
>cd05756 Ig1_IL1R_like First immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R) and similar proteins. Ig1_IL1R_like: domain similar to the first immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R). IL-1 alpha and IL-1 beta are cytokines which participate in the regulation of inflammation, immune responses, and hematopoiesis. These cytokines bind to the IL-1 receptor type 1 (IL1R1), which is activated on additional association with an accessory protein, IL1RAP. IL-1 also binds a second receptor designated type II (IL1R2). Mature IL1R1 consists of three Ig-like domains, a transmembrane domain, and a large cytoplasmic domain. Mature IL1R2 is organized similarly except that it has a short cytoplasmic domain. The latter does not initiate signal transduction. A naturally occurring cytokine IL-1RA (IL-1 receptor antagonist) is widely expressed and binds to IL-1 receptors, inhibiting the binding of IL-1 alpha and IL-1 beta.
Probab=22.69 E-value=48 Score=20.37 Aligned_cols=15 Identities=27% Similarity=0.541 Sum_probs=11.1
Q ss_pred eccccceeEEEEecc
Q psy1572 25 LHSTGQFQALVQLAT 39 (74)
Q Consensus 25 ls~~G~yqcLV~LST 39 (74)
..-+|.|+|.++=++
T Consensus 69 ~~DsG~Y~C~~~N~~ 83 (94)
T cd05756 69 LEDSGLYTCVVRNST 83 (94)
T ss_pred cccCeEEEEEEcCCC
Confidence 345799999987544
No 110
>cd05743 Ig_Perlecan_D2_like Immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2. Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2. Perlecan consists of five domains. Domain I has three putative heparan sulfate attachment sites; domain II has four LDL receptor-like repeats, and one Ig-like repeat; domain III resembles the short arm of laminin chains; domain IV has multiple Ig-like repeats (21 repeats in human perlecan); and domain V resembles the globular G domain of the laminin A chain and internal repeats of EGF. Perlecan may participate in a variety of biological functions including cell binding, LDL-metabolism, basement membrane assembly and selective permeability, calcium binding, and growth- and neurite-promoting activities.
Probab=22.48 E-value=56 Score=18.73 Aligned_cols=10 Identities=10% Similarity=0.265 Sum_probs=8.1
Q ss_pred cccceeEEEE
Q psy1572 27 STGQFQALVQ 36 (74)
Q Consensus 27 ~~G~yqcLV~ 36 (74)
-.|.|.|.++
T Consensus 53 D~G~Y~C~a~ 62 (78)
T cd05743 53 DQGAYTCEAI 62 (78)
T ss_pred HCEEEEEEEE
Confidence 4699999975
No 111
>PF07679 I-set: Immunoglobulin I-set domain; InterPro: IPR013098 The basic structure of immunoglobulin (Ig) molecules is a tetramer of two light chains and two heavy chains linked by disulphide bonds. There are two types of light chains: kappa and lambda, each composed of a constant domain (CL) and a variable domain (VL). There are five types of heavy chains: alpha, delta, epsilon, gamma and mu, all consisting of a variable domain (VH) and three (in alpha, delta and gamma) or four (in epsilon and mu) constant domains (CH1 to CH4). Ig molecules are highly modular proteins, in which the variable and constant domains have clear, conserved sequence patterns. The domains in Ig and Ig-like molecules are grouped into four types: V-set (variable; IPR013106 from INTERPRO), C1-set (constant-1; IPR003597 from INTERPRO), C2-set (constant-2; IPR008424 from INTERPRO) and I-set (intermediate; IPR013098 from INTERPRO) []. Structural studies have shown that these domains share a common core Greek-key beta-sandwich structure, with the types differing in the number of strands in the beta-sheets as well as in their sequence patterns [, ]. Immunoglobulin-like domains that are related in both sequence and structure can be found in several diverse protein families. Ig-like domains are involved in a variety of functions, including cell-cell recognition, cell-surface receptors, muscle structure and the immune system []. This entry represents I-set domains, which are found in several cell adhesion molecules, including vascular (VCAM), intercellular (ICAM), neural (NCAM) and mucosal addressin (MADCAM) cell adhesion molecules, as well as junction adhesion molecules (JAM). I-set domains are also present in several other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1 [], and the signalling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis [].; PDB: 3MTR_A 2EDK_A 3DMK_B 1KOA_A 3NCM_A 2NCM_A 2V9Q_A 2CR3_A 3QQN_A 3QR2_A ....
Probab=22.27 E-value=58 Score=18.38 Aligned_cols=14 Identities=7% Similarity=0.368 Sum_probs=10.9
Q ss_pred ccccceeEEEEecc
Q psy1572 26 HSTGQFQALVQLAT 39 (74)
Q Consensus 26 s~~G~yqcLV~LST 39 (74)
.-.|.|.|.++-+.
T Consensus 66 ~D~G~Y~C~~~n~~ 79 (90)
T PF07679_consen 66 EDAGTYTCVASNSS 79 (90)
T ss_dssp GGSEEEEEEEEETT
T ss_pred hhCEEEEEEEEECC
Confidence 45799999998653
No 112
>cd05854 Ig6_Contactin-2 Sixth Ig domain of contactin-2. Ig6_Contactin-2: Sixth Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between IG domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=22.20 E-value=56 Score=19.46 Aligned_cols=12 Identities=17% Similarity=0.351 Sum_probs=9.3
Q ss_pred ccccceeEEEEe
Q psy1572 26 HSTGQFQALVQL 37 (74)
Q Consensus 26 s~~G~yqcLV~L 37 (74)
+-.|.|.|.++=
T Consensus 57 ~D~G~YtC~A~n 68 (85)
T cd05854 57 SHAGTYTCTAQT 68 (85)
T ss_pred hhCeEEEEEEec
Confidence 357999999764
No 113
>cd05730 Ig3_NCAM-1_like Third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). Ig3_NCAM-1_like: domain similar to the third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-non-NCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1,and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the
Probab=22.09 E-value=57 Score=19.12 Aligned_cols=11 Identities=0% Similarity=0.332 Sum_probs=8.7
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|.|.++
T Consensus 68 ~D~G~Y~C~a~ 78 (95)
T cd05730 68 LDEAEYTCIAE 78 (95)
T ss_pred hhCEEEEEEEE
Confidence 45789999875
No 114
>cd02651 nuc_hydro_IU_UC_XIUA nuc_hydro_IU_UC_XIUA: inosine-uridine preferring, xanthosine-inosine-uridine-adenosine-preferring and, uridine-cytidine preferring nucleoside hydrolases. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains proteins similar to nucleoside hydrolases which hydrolyze both pyrimidine and purine ribonucleosides: the inosine-uridine preferring nucleoside hydrolase from Crithidia fasciculata, the inosine-uridine-xanthosine preferring nucleoside hydrolase RihC from Escherichia coli and the xanthosine-inosine-uridine-adenosine-preferring nucleoside hydrolase RihC from Salmonella enterica serovar Typhimurium. This group also contains proteins similar to the pyrimidine-specific uridine-cytidine preferring nucleoside hydrolases URH1 from Saccharomyces cerevisiae, E. coli RihA and E. coli RihB. E. coli RihA is equally efficient with uridine a
Probab=21.83 E-value=1.1e+02 Score=22.57 Aligned_cols=26 Identities=19% Similarity=0.342 Sum_probs=18.3
Q ss_pred ceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572 42 VAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK 72 (74)
Q Consensus 42 ~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~ 72 (74)
++.++|.-. .+++++||+..|..+..
T Consensus 30 It~v~Gn~~-----~~~~~~na~~ll~~~g~ 55 (302)
T cd02651 30 ITTVAGNVP-----LEKTTRNALKLLTLLGR 55 (302)
T ss_pred EEeccCeec-----HHHHHHHHHHHHHHhCC
Confidence 445565443 46789999999988864
No 115
>cd05891 Ig_M-protein_C C-terminal immunoglobulin (Ig)-like domain of M-protein (also known as myomesin-2). Ig_M-protein_C: the C-terminal immunoglobulin (Ig)-like domain of M-protein (also known as myomesin-2). M-protein is a structural protein localized to the M-band, a transverse structure in the center of the sarcomere, and is a candidate for M-band bridges. M-protein is modular consisting mainly of repetitive IG-like and fibronectin type III (FnIII) domains, and has a muscle-type specific expression pattern. M-protein is present in fast fibers.
Probab=21.67 E-value=58 Score=19.53 Aligned_cols=11 Identities=18% Similarity=0.513 Sum_probs=9.4
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-.|.|+|.++
T Consensus 68 ~D~G~Y~C~a~ 78 (92)
T cd05891 68 EDSGKYSINVK 78 (92)
T ss_pred hhCEEEEEEEE
Confidence 56899999887
No 116
>TIGR03693 ocin_ThiF_like putative thiazole-containing bacteriocin maturation protein. Members of this protein family are found in a three-gene operon in Bacillus anthracis and related Bacillus species, where the other two genes are clearly identified with maturation of a putative thiazole-containing bacteriocin precursor. While there is no detectable pairwise sequence similarity between members of this family and the proposed cyclodehydratases such as SagC of Streptococcus pyogenes (see family TIGR03603), both families show similarity through PSI-BLAST to ThiF, a protein involved in biosynthesis of the thiazole moiety for thiamine biosynthesis. This family, therefore, may contribute to cyclodehydratase function in heterocycle-containing bacteriocin biosyntheses. In Bacillus licheniformis ATCC 14580, the bacteriocin precursor gene is adjacent to the gene for this protein.
Probab=21.25 E-value=71 Score=27.83 Aligned_cols=45 Identities=31% Similarity=0.365 Sum_probs=31.2
Q ss_pred eeccccceeEEEEec---------cCCceEeecCCCCHHHHHHHHHHHHHHH-HHHh
Q psy1572 24 FLHSTGQFQALVQLA---------TYPVAVCHGQGKTQQEAKTVAAHNAIEY-LRLM 70 (74)
Q Consensus 24 els~~G~yqcLV~LS---------T~P~~VCHGsG~S~~eA~~~AA~NALqy-LkiM 70 (74)
+++.--.-||-|+.. ..|..|| +|-|-+|||.+|+.-++|. ...|
T Consensus 383 ~l~QlPL~~~~~~~~~p~~~gp~~~l~~~~~--~g~th~earreagl~gie~y~~~~ 437 (637)
T TIGR03693 383 DLHQLPLAQCFIQTADPLSDGPAELLPEITC--SGLTHNEARREAGLTGIEMYAAEI 437 (637)
T ss_pred cccccchhhceEEecCCCCCCcccccCCcee--cccchHHHHHhhchhhHHHHHHHH
Confidence 344444457777742 3566788 5789999999999999984 4443
No 117
>PF06675 DUF1177: Protein of unknown function (DUF1177); InterPro: IPR009561 This family consists of several hypothetical archaeal and bacterial proteins of around 300 residues in length. The function of this family is unknown.
Probab=21.18 E-value=1.6e+02 Score=23.43 Aligned_cols=29 Identities=17% Similarity=0.185 Sum_probs=26.1
Q ss_pred ecCCCCHHHHHHHHHHHHHHHHHHhhcCC
Q psy1572 46 HGQGKTQQEAKTVAAHNAIEYLRLMTKSK 74 (74)
Q Consensus 46 HGsG~S~~eA~~~AA~NALqyLkiM~~~k 74 (74)
.++|.|-+.--+.|+|-++|..|.+++++
T Consensus 220 caTGash~~di~~A~RF~iEvAK~fg~g~ 248 (276)
T PF06675_consen 220 CATGASHEVDIEHAVRFCIEVAKEFGRGK 248 (276)
T ss_pred cccccCCHHHHHHHHHHHHHHHHHHcCCC
Confidence 58999999999999999999999988654
No 118
>cd05853 Ig6_Contactin-4 Sixth Ig domain of contactin-4. Ig6_Contactin-4: sixth Ig domain of the neural cell adhesion molecule contactin-4. Contactins are neural cell adhesion molecules, and are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. Highest expresson of contactin-4 is in testes, thyroid, small intestine, uterus and brain. Contactin-4 plays a role in the response of neuroblastoma cells to differentiating agents, such as retinoids. The contactin 4 gene is associated with cerebellar degeneration in spinocerebellar ataxia type 16.
Probab=21.14 E-value=59 Score=20.18 Aligned_cols=11 Identities=27% Similarity=0.613 Sum_probs=8.9
Q ss_pred cccceeEEEEe
Q psy1572 27 STGQFQALVQL 37 (74)
Q Consensus 27 ~~G~yqcLV~L 37 (74)
-.|.|.|.++=
T Consensus 58 dsG~YtC~a~n 68 (85)
T cd05853 58 HAGKYVCMVQT 68 (85)
T ss_pred HCEEEEEEEEc
Confidence 46999999874
No 119
>cd05900 Ig_Aggrecan Immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), aggrecan. Ig_Aggrecan: immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), aggrecan. These aggregates contribute to the tissue's load bearing properties. In CSPGs, the Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggrecan has a wide distribution in connective tissue and extracellular matrices. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
Probab=21.09 E-value=1.2e+02 Score=19.73 Aligned_cols=27 Identities=22% Similarity=0.385 Sum_probs=17.6
Q ss_pred eEEeee---CCCCCce--eee--ccccceeEEEE
Q psy1572 10 RVCLIS---TNGSNSI--AFL--HSTGQFQALVQ 36 (74)
Q Consensus 10 ~v~~~~---~~~~n~i--eel--s~~G~yqcLV~ 36 (74)
||.|.. .++..++ ..+ +=+|.|+|-|.
T Consensus 61 R~sl~~~~~~~~~asL~I~nl~~sDsG~Y~C~V~ 94 (112)
T cd05900 61 RVSLPNYPAIPSDATLEITELRSNDSGTYRCEVM 94 (112)
T ss_pred eEEecccCCCCCCcEEEEeecccccCEEEEEEEe
Confidence 777765 3455554 333 45799999987
No 120
>cd05878 Ig_Aggrecan_like Immunoglobulin (Ig)-like domain of the aggrecan-like chondroitin sulfate proteoglycan core protein (CSPG). Ig_Aggrecan_like: immunoglobulin (Ig)-like domain of the aggrecan-like chondroitin sulfate proteoglycan core protein (CSPG)s. Included in this group are the Ig domains of other CSPGs: versican, and neurocan. In CSPGs this Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggrecan and versican have a wide distribution in connective tissue and extracellular matrices. Neurocan is localized almost exclusively in nervous tissue. Aggregates having other CSPGs substi
Probab=21.00 E-value=1.4e+02 Score=18.77 Aligned_cols=32 Identities=19% Similarity=0.213 Sum_probs=17.5
Q ss_pred eeeeEEeeeCCCCCc-----ee--eeccccceeEEEEec
Q psy1572 7 LDTRVCLISTNGSNS-----IA--FLHSTGQFQALVQLA 38 (74)
Q Consensus 7 ~~~~v~~~~~~~~n~-----ie--els~~G~yqcLV~LS 38 (74)
+.-|+.|...+..+. |. ..+=+|.|.|-|.-.
T Consensus 56 f~gR~~~~~~~~~~~~asL~I~~l~~sD~G~Y~C~v~~~ 94 (110)
T cd05878 56 YQGRVSLPNYPENRGDASLEISRLRSSDSGVYRCEVMHG 94 (110)
T ss_pred ccCeEEeccCCCCCCcEEEEECCCChhhCEEEEEEEEeC
Confidence 334777754433222 22 223468999998753
No 121
>cd05858 Ig3_FGFR-2 Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor 2 (FGFR2). Ig3_FGFR-2-like; domain similar to the third immunoglobulin (Ig)-like domain of human fibroblast growth factor receptor 2 (FGFR2). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs. FGFR2 is required for male sex determination.
Probab=20.74 E-value=62 Score=19.04 Aligned_cols=10 Identities=20% Similarity=0.604 Sum_probs=8.2
Q ss_pred cccceeEEEE
Q psy1572 27 STGQFQALVQ 36 (74)
Q Consensus 27 ~~G~yqcLV~ 36 (74)
-.|.|+|.+.
T Consensus 66 D~G~Y~C~A~ 75 (90)
T cd05858 66 DAGEYTCLAG 75 (90)
T ss_pred HCEEEEEEEE
Confidence 3699999985
No 122
>cd04979 Ig_Semaphorin_C Immunoglobulin (Ig)-like domain of semaphorin. Ig_Semaphorin_C; Immunoglobulin (Ig)-like domain in semaphorins. Semaphorins are transmembrane protein that have important roles in a variety of tissues. Functionally, semaphorins were initially characterized for their importance in the development of the nervous system and in axonal guidance. Later they have been found to be important for the formation and functioning of the cardiovascular, endocrine, gastrointestinal, hepatic, immune, musculoskeletal, renal, reproductive, and respiratory systems. Semaphorins function through binding to their receptors and transmembrane semaphorins also serves as receptors themselves. Although molecular mechanism of semaphorins is poorly understood, the Ig-like domains may involve in ligand binding or dimerization.
Probab=20.69 E-value=62 Score=18.86 Aligned_cols=14 Identities=7% Similarity=0.026 Sum_probs=10.7
Q ss_pred eccccceeEEEEec
Q psy1572 25 LHSTGQFQALVQLA 38 (74)
Q Consensus 25 ls~~G~yqcLV~LS 38 (74)
..-+|.|+|.++-+
T Consensus 60 ~~D~G~Y~C~a~~~ 73 (89)
T cd04979 60 PADAGVYTCQSVEH 73 (89)
T ss_pred HHHCEEEEEEEecC
Confidence 35679999998743
No 123
>PF04101 Glyco_tran_28_C: Glycosyltransferase family 28 C-terminal domain; InterPro: IPR007235 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 28 GT28 from CAZY comprises enzymes with a number of known activities; 1,2-diacylglycerol 3-beta-galactosyltransferase (2.4.1.46 from EC); 1,2-diacylglycerol 3-beta-glucosyltransferase (2.4.1.157 from EC); beta-N-acetylglucosamine transferase (2.4.1 from EC). Structural analysis suggests the C-terminal domain contains the UDP-GlcNAc binding site.; GO: 0016758 transferase activity, transferring hexosyl groups, 0030246 carbohydrate binding, 0005975 carbohydrate metabolic process, 0030259 lipid glycosylation; PDB: 2KS6_A 2JZC_A 1NLM_B 1F0K_B.
Probab=20.63 E-value=43 Score=21.60 Aligned_cols=14 Identities=29% Similarity=0.392 Sum_probs=11.0
Q ss_pred ceEeecCCCCHHHH
Q psy1572 42 VAVCHGQGKTQQEA 55 (74)
Q Consensus 42 ~~VCHGsG~S~~eA 55 (74)
+.||||.+.|..|+
T Consensus 75 lvIs~aG~~Ti~E~ 88 (167)
T PF04101_consen 75 LVISHAGAGTIAEA 88 (167)
T ss_dssp EEEECS-CHHHHHH
T ss_pred EEEeCCCccHHHHH
Confidence 79999999887765
No 124
>PRK05428 HPr kinase/phosphorylase; Provisional
Probab=20.11 E-value=3.8e+02 Score=21.06 Aligned_cols=66 Identities=15% Similarity=0.094 Sum_probs=48.4
Q ss_pred ccceeeeEEeeeCCCCCceeeeccccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHHhh
Q psy1572 4 SGLLDTRVCLISTNGSNSIAFLHSTGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRLMT 71 (74)
Q Consensus 4 s~l~~~~v~~~~~~~~n~ieels~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~ 71 (74)
++-+|.-|.+..++.....|.+-...+|..+.-.+.--+.+.==.|-....--+.||+|=. ||.|+
T Consensus 223 ~~~I~lvV~L~~~~~~~~~dRLg~~~~~~~ilgv~Ip~i~ipv~~Grn~a~iiE~Aa~n~~--lk~~G 288 (308)
T PRK05428 223 KKRIQLVVELEEWDEDKEYDRLGLDEETEEILGVKIPKITIPVRPGRNLAVIIEAAARNYR--LKLMG 288 (308)
T ss_pred CcEEEEEEEeecCCCCCccCCCCCchheeEEcCceeeEEEeCCCCCCcHHHHHHHHHHHHH--HHHcC
Confidence 3567788888888777788888888888877444443333333458999999999999965 88875
No 125
>cd05851 Ig3_Contactin-1 Third Ig domain of contactin-1. Ig3_Contactin-1: Third Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=20.10 E-value=66 Score=19.11 Aligned_cols=10 Identities=10% Similarity=0.444 Sum_probs=8.4
Q ss_pred cccceeEEEE
Q psy1572 27 STGQFQALVQ 36 (74)
Q Consensus 27 ~~G~yqcLV~ 36 (74)
-+|.|+|.++
T Consensus 64 D~G~Y~C~A~ 73 (88)
T cd05851 64 DEGTYECEAE 73 (88)
T ss_pred hCEEEEEEEE
Confidence 5699999976
No 126
>cd04968 Ig3_Contactin_like Third Ig domain of contactin. Ig3_Contactin_like: Third Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=20.09 E-value=67 Score=18.59 Aligned_cols=11 Identities=9% Similarity=0.416 Sum_probs=9.0
Q ss_pred ccccceeEEEE
Q psy1572 26 HSTGQFQALVQ 36 (74)
Q Consensus 26 s~~G~yqcLV~ 36 (74)
.-+|.|+|.+.
T Consensus 63 ~D~G~Y~C~a~ 73 (88)
T cd04968 63 EDEGTYECEAE 73 (88)
T ss_pred ccCEEEEEEEE
Confidence 45799999986
Done!