Query         psy1572
Match_columns 74
No_of_seqs    56 out of 58
Neff          2.7 
Searched_HMMs 46136
Date          Fri Aug 16 19:42:12 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1572.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1572hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3732|consensus               99.5 3.3E-15 7.1E-20  117.0  -0.5   54   21-74    286-339 (339)
  2 PF00035 dsrm:  Double-stranded  98.0 6.2E-05 1.4E-09   42.7   6.9   36   30-67     32-67  (67)
  3 smart00358 DSRM Double-strande  97.7 0.00017 3.6E-09   40.5   5.2   37   29-67     30-66  (67)
  4 cd00048 DSRM Double-stranded R  97.4 0.00057 1.2E-08   38.0   4.9   37   29-67     32-68  (68)
  5 PRK00102 rnc ribonuclease III;  95.9   0.049 1.1E-06   37.9   6.7   38   29-68    190-227 (229)
  6 TIGR02191 RNaseIII ribonucleas  95.0   0.062 1.3E-06   36.9   4.7   37   29-67    184-220 (220)
  7 PHA03103 double-strand RNA-bin  94.1    0.11 2.4E-06   38.1   4.6   39   28-68    138-176 (183)
  8 PRK12371 ribonuclease III; Rev  93.7    0.16 3.4E-06   37.2   4.8   41   27-69    191-231 (235)
  9 PHA02701 ORF020 dsRNA-binding   93.3    0.15 3.2E-06   37.7   4.2   38   29-68    139-176 (183)
 10 PF14709 DND1_DSRM:  double str  93.1    0.31 6.6E-06   30.6   4.8   39   30-68     34-80  (80)
 11 PF03368 Dicer_dimer:  Dicer di  91.8    0.79 1.7E-05   29.0   5.6   46   26-71     28-75  (90)
 12 PRK12372 ribonuclease III; Rev  90.1     1.1 2.3E-05   36.7   6.2   40   28-68    182-221 (413)
 13 PRK14718 ribonuclease III; Pro  87.5     1.4   3E-05   36.7   5.3   37   30-68    184-221 (467)
 14 COG0571 Rnc dsRNA-specific rib  87.5     1.2 2.7E-05   33.2   4.5   28   42-69    204-231 (235)
 15 KOG3732|consensus               86.8     1.9 4.1E-05   34.7   5.5   45   23-70     63-107 (339)
 16 PF14954 LIX1:  Limb expression  86.0    0.99 2.1E-05   35.2   3.4   32   30-64     58-92  (252)
 17 KOG3769|consensus               79.9     2.6 5.6E-05   34.0   3.7   32   37-68    261-302 (333)
 18 PF04694 Corona_3:  Coronavirus  77.7     2.9 6.3E-05   26.5   2.7   25   35-59     29-53  (60)
 19 KOG2777|consensus               72.6     4.4 9.5E-05   34.3   3.3   27   43-69    128-154 (542)
 20 PF02169 LPP20:  LPP20 lipoprot  66.7     5.6 0.00012   23.7   2.1   23   46-68     17-39  (92)
 21 PF12909 DUF3832:  Protein of u  66.5     6.1 0.00013   26.0   2.4   37   27-68     15-51  (89)
 22 KOG4334|consensus               65.7     5.6 0.00012   34.4   2.6   28   40-67    414-441 (650)
 23 cd00099 IgV Immunoglobulin var  62.1       9  0.0002   22.5   2.4   26   26-51     77-102 (105)
 24 PF01413 C4:  C-terminal tandem  60.9     4.7  0.0001   27.7   1.1   18   32-49     48-65  (114)
 25 PF03681 UPF0150:  Uncharacteri  59.8     7.8 0.00017   21.4   1.7   16   44-59     23-38  (48)
 26 cd05733 Ig6_L1-CAM_like Sixth   59.3     9.6 0.00021   22.1   2.1    9   28-36     55-63  (77)
 27 smart00111 C4 C-terminal tande  58.6     4.4 9.6E-05   27.8   0.7   21   32-53     48-68  (114)
 28 PF12695 Abhydrolase_5:  Alpha/  57.5      13 0.00029   22.2   2.6   16   41-56      1-16  (145)
 29 COG1598 Predicted nuclease of   56.8       9 0.00019   23.4   1.8   16   44-59     25-40  (73)
 30 cd04967 Ig1_Contactin First Ig  54.4     7.9 0.00017   22.9   1.2   11   26-36     69-79  (91)
 31 PRK04930 glutathione-regulated  51.8      26 0.00056   25.0   3.7   35   35-69    143-177 (184)
 32 PRK03661 hypothetical protein;  50.9      59  0.0013   22.9   5.4   40   30-70    120-160 (164)
 33 cd02647 nuc_hydro_TvIAG nuc_hy  50.8      23  0.0005   26.9   3.5   28   42-72     31-58  (312)
 34 PRK03657 hypothetical protein;  46.9      62  0.0014   23.1   5.0   42   30-72    126-167 (170)
 35 cd05874 Ig6_NrCAM Sixth immuno  46.8      21 0.00045   21.0   2.2   10   28-37     55-64  (77)
 36 cd05739 Ig3_RPTP_IIa_LAR_like   44.1      17 0.00037   21.0   1.6   15   22-36     42-56  (69)
 37 PF04098 Rad52_Rad22:  Rad52/22  41.9      56  0.0012   22.3   4.0   39   27-65     72-121 (154)
 38 PF09888 DUF2115:  Uncharacteri  40.9      14 0.00031   26.2   1.0   29   15-45    124-159 (163)
 39 cd05875 Ig6_hNeurofascin_like   40.4      31 0.00067   20.2   2.3    9   28-36     55-63  (77)
 40 cd05760 Ig2_PTK7 Second immuno  39.2      19 0.00042   20.8   1.2   11   26-36     48-58  (77)
 41 KOG2938|consensus               39.2      37  0.0008   27.1   3.2   27   42-73     52-78  (350)
 42 TIGR03056 bchO_mg_che_rel puta  38.2   1E+02  0.0022   20.3   4.6   32   26-57     12-46  (278)
 43 cd05868 Ig4_NrCAM Fourth immun  37.9      21 0.00045   20.7   1.2   12   25-36     50-61  (76)
 44 KOG0286|consensus               36.4      68  0.0015   26.2   4.3   45    4-48    290-334 (343)
 45 PRK10443 rihA ribonucleoside h  36.3      49  0.0011   24.9   3.3   27   41-72     32-58  (311)
 46 cd05873 Ig_Sema4D_like Immunog  36.2      22 0.00048   22.3   1.2   14   25-38     57-70  (87)
 47 smart00408 IGc2 Immunoglobulin  35.1      26 0.00057   17.4   1.2   11   26-36     49-59  (63)
 48 PF05165 GGDN:  GGDN family;  I  35.0      28 0.00061   26.8   1.8   26   36-61     75-100 (246)
 49 cd05723 Ig4_Neogenin Fourth im  34.9      25 0.00054   20.0   1.2   11   26-36     47-57  (71)
 50 cd05750 Ig_Pro_neuregulin Immu  34.6      27 0.00058   19.3   1.3   11   26-36     53-63  (75)
 51 TIGR01738 bioH putative pimelo  33.7      45 0.00097   20.8   2.3   19   40-58      5-23  (245)
 52 cd05848 Ig1_Contactin-5 First   33.6      26 0.00057   21.2   1.2   11   26-36     69-79  (94)
 53 cd05738 Ig2_RPTP_IIa_LAR_like   33.6      27 0.00058   20.0   1.2   12   25-36     47-58  (74)
 54 cd05850 Ig1_Contactin-2 First   33.5      26 0.00057   21.1   1.2   11   26-36     69-79  (94)
 55 cd05852 Ig5_Contactin-1 Fifth   33.4      27 0.00058   20.4   1.2   11   26-36     49-59  (73)
 56 COG2429 Archaeal GTP cyclohydr  33.1      36 0.00078   26.7   2.2   23   42-65     84-106 (250)
 57 PF13927 Ig_3:  Immunoglobulin   33.1      33 0.00072   18.1   1.5   12   25-36     63-74  (75)
 58 cd05867 Ig4_L1-CAM_like Fourth  32.6      29 0.00062   19.8   1.2   13   26-38     51-63  (76)
 59 cd05895 Ig_Pro_neuregulin-1 Im  32.5      29 0.00063   19.6   1.3   12   25-36     53-64  (76)
 60 KOG1817|consensus               32.4      48   0.001   28.5   2.9   24   45-68    479-502 (533)
 61 cd05722 Ig1_Neogenin First imm  31.9      29 0.00064   20.5   1.2   13   26-38     69-81  (95)
 62 cd05849 Ig1_Contactin-1 First   31.6      30 0.00064   21.2   1.2   11   26-36     68-78  (93)
 63 cd04970 Ig6_Contactin_like Six  31.3      31 0.00067   20.0   1.2   11   26-36     57-67  (85)
 64 cd05740 Ig_CEACAM_D4 Fourth im  31.1      30 0.00066   21.0   1.2   11   26-36     65-75  (91)
 65 cd05857 Ig2_FGFR Second immuno  31.0      32 0.00069   19.6   1.2   11   26-36     61-71  (85)
 66 cd02649 nuc_hydro_CeIAG nuc_hy  30.7      69  0.0015   24.1   3.3   26   42-72     31-56  (306)
 67 PF02464 CinA:  Competence-dama  30.4 1.8E+02  0.0039   19.9   6.4   39   30-69    114-152 (154)
 68 PRK02240 GTP cyclohydrolase II  29.6      63  0.0014   25.0   3.0   25   41-66     83-107 (254)
 69 cd04971 Ig_TrKABC_d5 Fifth dom  29.3      36 0.00078   20.4   1.3   10   27-36     57-66  (81)
 70 cd05876 Ig3_L1-CAM Third immun  29.2      35 0.00077   19.3   1.2   10   27-36     47-56  (71)
 71 cd05731 Ig3_L1-CAM_like Third   28.8      38 0.00082   18.4   1.3   12   26-37     46-57  (71)
 72 cd05724 Ig2_Robo Second immuno  28.2      38 0.00081   19.2   1.2   11   26-36     61-71  (86)
 73 smart00409 IG Immunoglobulin.   28.1      43 0.00093   17.4   1.4   11   26-36     61-71  (86)
 74 smart00410 IG_like Immunoglobu  28.1      43 0.00093   17.4   1.4   11   26-36     61-71  (86)
 75 cd07693 Ig1_Robo First immunog  28.0      38 0.00083   19.2   1.2   12   25-36     73-84  (100)
 76 cd05745 Ig3_Peroxidasin Third   27.9      39 0.00083   19.2   1.2   11   26-36     50-60  (74)
 77 PF00047 ig:  Immunoglobulin do  27.8      35 0.00077   18.2   1.0    9   27-35     56-64  (64)
 78 cd05763 Ig_1 Subgroup of the i  27.8      39 0.00084   19.0   1.2   11   26-36     50-60  (75)
 79 TIGR02296 HpaC 4-hydroxyphenyl  27.6      94   0.002   21.1   3.3   40    7-50      8-47  (154)
 80 KOG4528|consensus               27.5      64  0.0014   24.0   2.6   23   45-67     42-64  (168)
 81 PF07897 DUF1675:  Protein of u  27.1      45 0.00097   26.1   1.8   18   42-59    254-271 (284)
 82 cd00455 nuc_hydro nuc_hydro: N  26.9      66  0.0014   23.8   2.6   27   41-72     28-54  (295)
 83 cd07706 IgV_TCR_delta Immunogl  26.8      58  0.0013   20.0   2.0   28   24-51     80-111 (116)
 84 PHA02447 hypothetical protein   26.8      36 0.00079   22.7   1.1   45   22-66     35-79  (86)
 85 COG2820 Udp Uridine phosphoryl  26.6      37 0.00079   26.5   1.2   25   21-51     45-70  (248)
 86 cd08342 HPPD_N_like N-terminal  26.5      72  0.0016   19.9   2.4   23    8-30    104-126 (136)
 87 cd04978 Ig4_L1-NrCAM_like Four  26.3      44 0.00094   18.5   1.2   11   26-36     51-61  (76)
 88 cd05737 Ig_Myomesin_like_C C-t  25.6      44 0.00095   19.8   1.2   11   26-36     68-78  (92)
 89 cd05727 Ig2_Contactin-2-like S  25.6      78  0.0017   20.4   2.5   26   13-38     54-82  (96)
 90 PF00388 PI-PLC-X:  Phosphatidy  25.5      78  0.0017   20.6   2.5    7   42-48     55-61  (146)
 91 cd05748 Ig_Titin_like Immunogl  25.5      46   0.001   18.6   1.3   11   26-36     50-60  (74)
 92 cd04974 Ig3_FGFR Third immunog  25.4      45 0.00097   19.8   1.2   11   26-36     65-75  (90)
 93 cd05736 Ig2_Follistatin_like S  25.4      46 0.00099   18.7   1.2   11   26-36     49-59  (76)
 94 TIGR00702 uncharacterized doma  25.1      94   0.002   24.1   3.2   26   41-66     47-72  (377)
 95 TIGR03266 methan_mark_1 putati  24.8   1E+02  0.0022   23.9   3.4   24   42-65     52-75  (376)
 96 PRK15486 hpaC 4-hydroxyphenyla  24.6   1E+02  0.0023   21.6   3.2   39    7-49     17-55  (170)
 97 KOG3075|consensus               24.5   1E+02  0.0022   24.3   3.3   30   33-65      8-37  (261)
 98 PRK06806 fructose-bisphosphate  23.8      60  0.0013   24.5   1.9   17   42-58    202-219 (281)
 99 cd05712 Ig_Siglec_N Immunoglob  23.6      98  0.0021   19.4   2.6   26   24-50     88-113 (119)
100 cd04972 Ig_TrkABC_d4 Fourth do  23.6      51  0.0011   19.3   1.2    9   28-36     68-76  (90)
101 cd05752 Ig1_FcgammaR_like Frst  23.4      31 0.00067   20.6   0.3   12   25-36     55-66  (78)
102 cd04969 Ig5_Contactin_like Fif  23.3      53  0.0011   18.4   1.2   11   26-36     49-59  (73)
103 cd05746 Ig4_Peroxidasin Fourth  23.3      53  0.0011   18.4   1.2   10   27-36     47-56  (69)
104 cd05747 Ig5_Titin_like M5, fif  23.2      53  0.0011   19.4   1.2   11   26-36     69-79  (92)
105 cd04973 Ig1_FGFR First immunog  23.1      51  0.0011   19.1   1.2   12   25-36     54-65  (79)
106 cd05856 Ig2_FGFRL1-like Second  23.0      55  0.0012   18.1   1.2   11   26-36     58-68  (82)
107 PF13827 DUF4189:  Domain of un  23.0      83  0.0018   19.5   2.2   18   45-62     66-83  (100)
108 PRK01022 hypothetical protein;  22.7      45 0.00098   23.9   1.0   20   15-36    126-145 (167)
109 cd05756 Ig1_IL1R_like First im  22.7      48   0.001   20.4   1.1   15   25-39     69-83  (94)
110 cd05743 Ig_Perlecan_D2_like Im  22.5      56  0.0012   18.7   1.2   10   27-36     53-62  (78)
111 PF07679 I-set:  Immunoglobulin  22.3      58  0.0012   18.4   1.2   14   26-39     66-79  (90)
112 cd05854 Ig6_Contactin-2 Sixth   22.2      56  0.0012   19.5   1.2   12   26-37     57-68  (85)
113 cd05730 Ig3_NCAM-1_like Third   22.1      57  0.0012   19.1   1.2   11   26-36     68-78  (95)
114 cd02651 nuc_hydro_IU_UC_XIUA n  21.8 1.1E+02  0.0024   22.6   2.9   26   42-72     30-55  (302)
115 cd05891 Ig_M-protein_C C-termi  21.7      58  0.0013   19.5   1.2   11   26-36     68-78  (92)
116 TIGR03693 ocin_ThiF_like putat  21.2      71  0.0015   27.8   2.1   45   24-70    383-437 (637)
117 PF06675 DUF1177:  Protein of u  21.2 1.6E+02  0.0035   23.4   3.9   29   46-74    220-248 (276)
118 cd05853 Ig6_Contactin-4 Sixth   21.1      59  0.0013   20.2   1.2   11   27-37     58-68  (85)
119 cd05900 Ig_Aggrecan Immunoglob  21.1 1.2E+02  0.0025   19.7   2.6   27   10-36     61-94  (112)
120 cd05878 Ig_Aggrecan_like Immun  21.0 1.4E+02   0.003   18.8   2.9   32    7-38     56-94  (110)
121 cd05858 Ig3_FGFR-2 Third immun  20.7      62  0.0013   19.0   1.2   10   27-36     66-75  (90)
122 cd04979 Ig_Semaphorin_C Immuno  20.7      62  0.0013   18.9   1.2   14   25-38     60-73  (89)
123 PF04101 Glyco_tran_28_C:  Glyc  20.6      43 0.00094   21.6   0.6   14   42-55     75-88  (167)
124 PRK05428 HPr kinase/phosphoryl  20.1 3.8E+02  0.0082   21.1   5.7   66    4-71    223-288 (308)
125 cd05851 Ig3_Contactin-1 Third   20.1      66  0.0014   19.1   1.2   10   27-36     64-73  (88)
126 cd04968 Ig3_Contactin_like Thi  20.1      67  0.0015   18.6   1.3   11   26-36     63-73  (88)

No 1  
>KOG3732|consensus
Probab=99.48  E-value=3.3e-15  Score=116.96  Aligned_cols=54  Identities=48%  Similarity=0.697  Sum_probs=51.2

Q ss_pred             ceeeeccccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHHhhcCC
Q psy1572          21 SIAFLHSTGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRLMTKSK   74 (74)
Q Consensus        21 ~ieels~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~~k   74 (74)
                      .|++.|++|++||+|++||.|++||||+++|+..|+.+||++||+||+||.+.+
T Consensus       286 d~~~~t~sg~~~~~~e~s~~~~~~~~~s~~t~~~s~~~aa~~~L~yL~~~~~f~  339 (339)
T KOG3732|consen  286 DIEELTLSGQNQCLVELSTSPATVCHGSSITPPSAQSEAAHNALEYLKIMAGFK  339 (339)
T ss_pred             hhHHhhhhhHHHHHHHhcCCCceeeeccCCCCCcccchhHHHHHHHHHHHhcCC
Confidence            368899999999999999999999999999999999999999999999999764


No 2  
>PF00035 dsrm:  Double-stranded RNA binding motif;  InterPro: IPR001159 The DsRBD domain is found in a variety of RNA-binding proteins with different structures and exhibiting a diversity of functions []. It is involved in localisation of at least five different mRNAs in the early Drosophila embryo and by interferon-induced protein kinase in humans, which is part of the cellular response to dsRNA.; GO: 0003725 double-stranded RNA binding, 0005622 intracellular; PDB: 1EKZ_A 1STU_A 1QU6_A 2L2M_A 3ADJ_A 1WHN_A 3LLH_B 2B7V_A 2L3J_A 1UHZ_A ....
Probab=97.98  E-value=6.2e-05  Score=42.66  Aligned_cols=36  Identities=31%  Similarity=0.406  Sum_probs=30.5

Q ss_pred             ceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572          30 QFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYL   67 (74)
Q Consensus        30 ~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyL   67 (74)
                      .|.|-|.+  ....+..|.|.|-.+|+..||..||++|
T Consensus        32 ~f~~~~~i--~~~~~~~g~g~sKk~Ak~~AA~~al~~L   67 (67)
T PF00035_consen   32 RFICTVYI--DGKEYGEGEGSSKKEAKQQAAKKALQKL   67 (67)
T ss_dssp             EEEEEEEE--TTEEEEEEEESSHHHHHHHHHHHHHHHH
T ss_pred             eEEEEEEE--CCEEEeEeccCCHHHHHHHHHHHHHHhC
Confidence            45554444  7788889999999999999999999998


No 3  
>smart00358 DSRM Double-stranded RNA binding motif.
Probab=97.67  E-value=0.00017  Score=40.45  Aligned_cols=37  Identities=30%  Similarity=0.403  Sum_probs=32.4

Q ss_pred             cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572          29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYL   67 (74)
Q Consensus        29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyL   67 (74)
                      ..|.|-|.+.-.  .+.-|.|.|-++|+.+||++||++|
T Consensus        30 ~~f~~~v~i~~~--~~~~g~g~sKk~Ak~~AA~~al~~L   66 (67)
T smart00358       30 PRFTVTVKVGGE--YTGEGEGSSKKEAKQRAAEAALRSL   66 (67)
T ss_pred             CcEEEEEEECCE--EEEEeccCCHHHHHHHHHHHHHHhc
Confidence            589999998543  6678999999999999999999987


No 4  
>cd00048 DSRM Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
Probab=97.39  E-value=0.00057  Score=37.95  Aligned_cols=37  Identities=30%  Similarity=0.394  Sum_probs=32.1

Q ss_pred             cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572          29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYL   67 (74)
Q Consensus        29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyL   67 (74)
                      ..|.|-|.+.-  ..++.|.|.|.++|+..||..||+.|
T Consensus        32 ~~f~~~v~i~~--~~~~~g~g~sKk~Ak~~AA~~al~~L   68 (68)
T cd00048          32 PRFTVEVTVGG--KITGEGEGSSKKEAKQNAAEAALRKL   68 (68)
T ss_pred             CeEEEEEEECC--EEEEEeecCCHHHHHHHHHHHHHHhC
Confidence            57888888844  68889999999999999999999865


No 5  
>PRK00102 rnc ribonuclease III; Reviewed
Probab=95.87  E-value=0.049  Score=37.88  Aligned_cols=38  Identities=26%  Similarity=0.430  Sum_probs=30.9

Q ss_pred             cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572          29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk   68 (74)
                      ..|.|-|.+.  --.+..|.|.|-++|+..||.+||+.|+
T Consensus       190 ~~f~~~v~i~--~~~~~~g~g~skk~Ae~~AA~~Al~~l~  227 (229)
T PRK00102        190 KEFTVEVTVN--GKELGEGTGSSKKEAEQAAAKQALKKLK  227 (229)
T ss_pred             ceEEEEEEEC--CEEEEEeeeCCHHHHHHHHHHHHHHHHh
Confidence            4566666654  3568899999999999999999999885


No 6  
>TIGR02191 RNaseIII ribonuclease III, bacterial. This family consists of bacterial examples of ribonuclease III. This enzyme cleaves double-stranded rRNA. It is involved in processing ribosomal RNA precursors. It is found even in minimal genones such as Mycoplasma genitalium and Buchnera aphidicola, and in some cases has been shown to be an essential gene. These bacterial proteins contain a double-stranded RNA binding motif (pfam00035) and a ribonuclease III domain (pfam00636). Eukaryotic homologs tend to be much longer proteins with additional domains, localized to the nucleus, and not included in this family.
Probab=94.95  E-value=0.062  Score=36.95  Aligned_cols=37  Identities=32%  Similarity=0.432  Sum_probs=31.3

Q ss_pred             cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572          29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYL   67 (74)
Q Consensus        29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyL   67 (74)
                      ..|.|-|.+.=  -.+..|+|.|-++|+..||..||+-|
T Consensus       184 ~~f~~~v~~~~--~~~~~g~g~skk~A~~~AA~~Al~~l  220 (220)
T TIGR02191       184 KEFTVEVSVNG--EPYGEGKGKSKKEAEQNAAKAALEKL  220 (220)
T ss_pred             ceEEEEEEECC--EEEEEeeeCCHHHHHHHHHHHHHHhC
Confidence            47888888863  55889999999999999999999753


No 7  
>PHA03103 double-strand RNA-binding protein; Provisional
Probab=94.09  E-value=0.11  Score=38.12  Aligned_cols=39  Identities=28%  Similarity=0.306  Sum_probs=30.9

Q ss_pred             ccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572          28 TGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        28 ~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk   68 (74)
                      +-.|.+-|.+.-.+..  .|.|.|-.+|+.+||+.||+.|.
T Consensus       138 ~p~F~v~V~I~g~~~g--~G~G~SKKeAEQ~AAk~AL~~L~  176 (183)
T PHA03103        138 SPTFTASVIISGIKFK--PAIGSTKKEAKNNAAKLAMDKIL  176 (183)
T ss_pred             CceEEEEEEECCEEEE--EeeeCCHHHHHHHHHHHHHHHHH
Confidence            3456677777664444  59999999999999999998874


No 8  
>PRK12371 ribonuclease III; Reviewed
Probab=93.67  E-value=0.16  Score=37.20  Aligned_cols=41  Identities=22%  Similarity=0.381  Sum_probs=32.7

Q ss_pred             cccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572          27 STGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRL   69 (74)
Q Consensus        27 ~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLki   69 (74)
                      .+..|.|-|.+.=..  +-.|.|.|..+|...||..||+.|+.
T Consensus       191 h~~~F~v~v~v~~~~--~~~g~G~sKK~Ae~~AA~~al~~~~~  231 (235)
T PRK12371        191 HDPRFTVEVEVKGFA--PETGEGRSKRAAEQVAAEKMLEREGV  231 (235)
T ss_pred             CCCeEEEEEEECCEE--EEEeeeCCHHHHHHHHHHHHHHHhhh
Confidence            345677777775433  66899999999999999999988764


No 9  
>PHA02701 ORF020 dsRNA-binding PKR inhibitor; Provisional
Probab=93.33  E-value=0.15  Score=37.69  Aligned_cols=38  Identities=18%  Similarity=0.259  Sum_probs=30.1

Q ss_pred             cceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572          29 GQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        29 G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk   68 (74)
                      ..|-+-|.++  .-.+.-|.|.|-.+|..+||+.||+.|.
T Consensus       139 ~~Ftv~V~V~--g~~~g~G~G~SKKeAEQ~AAk~AL~~L~  176 (183)
T PHA02701        139 PLFTCTIVVS--GKVVATASGCSKKLARHAACADALTILI  176 (183)
T ss_pred             ceEEEEEEEC--CEEEEEEEeCCHHHHHHHHHHHHHHHHH
Confidence            3455556655  4556689999999999999999998875


No 10 
>PF14709 DND1_DSRM:  double strand RNA binding domain from DEAD END PROTEIN 1
Probab=93.06  E-value=0.31  Score=30.59  Aligned_cols=39  Identities=31%  Similarity=0.389  Sum_probs=28.8

Q ss_pred             ceeEEEEeccCCc-----eEeec---CCCCHHHHHHHHHHHHHHHHH
Q psy1572          30 QFQALVQLATYPV-----AVCHG---QGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        30 ~yqcLV~LST~P~-----~VCHG---sG~S~~eA~~~AA~NALqyLk   68 (74)
                      .|.|-|.+.+.+.     .-|.|   .-.+-++|++.||..||++|.
T Consensus        34 ~F~ykV~i~~~~~~~~~~~~~~~p~~~~~~~k~Ak~~AA~~~L~~Lg   80 (80)
T PF14709_consen   34 LFLYKVVIPGLEYPFEGSIECFGPTKPSSTKKEAKESAAQQALQALG   80 (80)
T ss_pred             EEEEEEEEcCCCCCCcceEEEccCCCcCccHHHHHHHHHHHHHHhcC
Confidence            5667777766555     35555   235789999999999999983


No 11 
>PF03368 Dicer_dimer:  Dicer dimerisation domain;  InterPro: IPR005034  This domain is found in members of the Dicer protein family of dsRNA nucleases. This entry represents a dsRNA-binding domain. RNA interference (RNAi) is an ancient gene-silencing process that plays a fundamental role in diverse eukaryotic functions including viral defence, chromatin remodelling, genome rearrangement, developmental timing, brain morphogenesis, and stem cell maintenance. All RNAi pathways require the multidomain ribonuclease Dicer, which initiates RNAi by cleaving double-stranded RNA (dsRNA) substrates into small fragments ~25 nuleotides in length. A typical eukaryotic Dicer consists of a helicase domain (PDOC51192 from PROSITEDOC), a domain of unknown function, and a PAZ domain (PDOC50821 from PROSITEDOC) at the amino (N)-terminus as well as two ribonuclease III domains (PDOC00448 from PROSITEDOC) and a dsRNA-binding domain (dsRBD) (PDOC50137 from PROSITEDOC) at the carboxy (C)-terminus. The domain of unknown function of ~100 amino acids is predicted to adopt the canonical alpha-beta-beta-beta-alpha-fold found in all dsRBDs [, , , ].; GO: 0016891 endoribonuclease activity, producing 5'-phosphomonoesters; PDB: 2KOU_A.
Probab=91.82  E-value=0.79  Score=28.98  Aligned_cols=46  Identities=24%  Similarity=0.229  Sum_probs=31.1

Q ss_pred             ccccceeEEEEeccC-CceEeecC-CCCHHHHHHHHHHHHHHHHHHhh
Q psy1572          26 HSTGQFQALVQLATY-PVAVCHGQ-GKTQQEAKTVAAHNAIEYLRLMT   71 (74)
Q Consensus        26 s~~G~yqcLV~LST~-P~~VCHGs-G~S~~eA~~~AA~NALqyLkiM~   71 (74)
                      ...+.|+|-|.|-.+ |+..--|. -.|...|+..||..|.+.|-.|+
T Consensus        28 ~~~~~~~c~v~LP~~~pi~~i~g~~~~sk~~AK~sAAf~Ac~~L~~~g   75 (90)
T PF03368_consen   28 KIGSGFICTVILPINSPIRSIEGPPMRSKKLAKRSAAFEACKKLHEAG   75 (90)
T ss_dssp             E--G-EEEEEE--TT-SS--EEEE--SSHHHHHHHHHHHHHHHHHHH-
T ss_pred             EcCCcEEEEEECCCCCCCCeEEccccccHHHHHHHHHHHHHHHHHHcC
Confidence            345589999999985 65555554 46889999999999999998876


No 12 
>PRK12372 ribonuclease III; Reviewed
Probab=90.15  E-value=1.1  Score=36.67  Aligned_cols=40  Identities=23%  Similarity=0.378  Sum_probs=30.3

Q ss_pred             ccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572          28 TGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        28 ~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk   68 (74)
                      +..|.+-|.+.-..+. -.|.|.|.++|+..||+.||+.|+
T Consensus       182 ~~~F~V~V~v~g~~~~-g~G~G~SKKeAEQ~AAr~AL~kL~  221 (413)
T PRK12372        182 NQQFEVECTVPKLDVK-VSGSGASRRAAEQAAAKKALDEVM  221 (413)
T ss_pred             CceEEEEEEECCeEEE-EEEEeCCHHHHHHHHHHHHHHHHh
Confidence            3466666666432222 389999999999999999999887


No 13 
>PRK14718 ribonuclease III; Provisional
Probab=87.54  E-value=1.4  Score=36.71  Aligned_cols=37  Identities=22%  Similarity=0.410  Sum_probs=28.0

Q ss_pred             ceeEEEEeccCCceE-eecCCCCHHHHHHHHHHHHHHHHH
Q psy1572          30 QFQALVQLATYPVAV-CHGQGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        30 ~yqcLV~LST~P~~V-CHGsG~S~~eA~~~AA~NALqyLk   68 (74)
                      .|.+-|.+  .-..+ ..|.|.|-.+|+..||+.||+.|+
T Consensus       184 ~F~V~V~v--~g~~~~G~G~G~SKKeAEQ~AAk~AL~kL~  221 (467)
T PRK14718        184 QFEVECTV--PKLDIKVSGSGASRRAAEQAAAKKALDEVT  221 (467)
T ss_pred             eEEEEEEE--CCeeeEEEEEcCCHHHHHHHHHHHHHHHhc
Confidence            44444444  33223 789999999999999999999887


No 14 
>COG0571 Rnc dsRNA-specific ribonuclease [Transcription]
Probab=87.50  E-value=1.2  Score=33.22  Aligned_cols=28  Identities=29%  Similarity=0.501  Sum_probs=25.1

Q ss_pred             ceEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572          42 VAVCHGQGKTQQEAKTVAAHNAIEYLRL   69 (74)
Q Consensus        42 ~~VCHGsG~S~~eA~~~AA~NALqyLki   69 (74)
                      -.+.-|.|.|..+|...||.+||+-|.+
T Consensus       204 ~~~g~G~G~skk~AEq~AA~~al~~l~~  231 (235)
T COG0571         204 KELGTGKGRSKKEAEQAAAEQALKKLGV  231 (235)
T ss_pred             eeEEEecccCHHHHHHHHHHHHHHHhcc
Confidence            6678899999999999999999987754


No 15 
>KOG3732|consensus
Probab=86.79  E-value=1.9  Score=34.67  Aligned_cols=45  Identities=22%  Similarity=0.220  Sum_probs=39.2

Q ss_pred             eeeccccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHHh
Q psy1572          23 AFLHSTGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRLM   70 (74)
Q Consensus        23 eels~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM   70 (74)
                      +.--..+.|.|-|++.   ..+|=|-|.|-..|+-.||.++|+-|+-.
T Consensus        63 eGp~H~~~fv~rvtvg---~~~a~GeG~sKK~AKh~AA~~~L~~lk~l  107 (339)
T KOG3732|consen   63 EGPPHMPNFVFRVTVG---EITATGEGKSKKLAKHRAAEALLKELKKL  107 (339)
T ss_pred             cCCccCCCeEEEEEEe---eeEEecCCCchhHHHHHHHHHHHHHHhcC
Confidence            3344678899999998   89999999999999999999999988753


No 16 
>PF14954 LIX1:  Limb expression 1
Probab=86.01  E-value=0.99  Score=35.17  Aligned_cols=32  Identities=34%  Similarity=0.525  Sum_probs=26.1

Q ss_pred             ceeEEEEeccCCceEeecC---CCCHHHHHHHHHHHHH
Q psy1572          30 QFQALVQLATYPVAVCHGQ---GKTQQEAKTVAAHNAI   64 (74)
Q Consensus        30 ~yqcLV~LST~P~~VCHGs---G~S~~eA~~~AA~NAL   64 (74)
                      -|.|.|+|   |=.-|+|+   =.|.+||+.+||.-||
T Consensus        58 pyVcyVTL---PGGSCFGnfq~C~tkAEARR~AAKiAL   92 (252)
T PF14954_consen   58 PYVCYVTL---PGGSCFGNFQNCPTKAEARRSAAKIAL   92 (252)
T ss_pred             CeEEEEeC---CCCCccCccccCCcHHHHHhhhHHHHH
Confidence            47888854   77788886   4688999999999988


No 17 
>KOG3769|consensus
Probab=79.89  E-value=2.6  Score=34.02  Aligned_cols=32  Identities=34%  Similarity=0.363  Sum_probs=25.3

Q ss_pred             eccCCceEee----------cCCCCHHHHHHHHHHHHHHHHH
Q psy1572          37 LATYPVAVCH----------GQGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        37 LST~P~~VCH----------GsG~S~~eA~~~AA~NALqyLk   68 (74)
                      .|-.|+-|-|          |-|.|...|.+.||++||.-|=
T Consensus       261 ~S~~PvyvVgiYs~kkllGqG~Gesl~~A~e~AA~dAL~k~y  302 (333)
T KOG3769|consen  261 NSAEPVYVVGIYSGKKLLGQGQGESLKLAEEQAARDALIKLY  302 (333)
T ss_pred             CccCceEEEEeecCchhhccCcchHHHHHHHHHHHHHHHHHH
Confidence            4556666654          8999999999999999996543


No 18 
>PF04694 Corona_3:  Coronavirus ORF3 protein;  InterPro: IPR006784 This family represents the Coronavirus ORF3 protein, also known as the X2A protein.
Probab=77.72  E-value=2.9  Score=26.50  Aligned_cols=25  Identities=40%  Similarity=0.470  Sum_probs=22.5

Q ss_pred             EEeccCCceEeecCCCCHHHHHHHH
Q psy1572          35 VQLATYPVAVCHGQGKTQQEAKTVA   59 (74)
Q Consensus        35 V~LST~P~~VCHGsG~S~~eA~~~A   59 (74)
                      |+..|...-||-|.|-|..||+..|
T Consensus        29 vef~tgkllVcigFGDTl~~A~~ka   53 (60)
T PF04694_consen   29 VEFKTGKLLVCIGFGDTLFEAEQKA   53 (60)
T ss_pred             EEEecCcEEEEEecchHHHHHHHhh
Confidence            6778999999999999999998765


No 19 
>KOG2777|consensus
Probab=72.60  E-value=4.4  Score=34.31  Aligned_cols=27  Identities=33%  Similarity=0.381  Sum_probs=24.0

Q ss_pred             eEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572          43 AVCHGQGKTQQEAKTVAAHNAIEYLRL   69 (74)
Q Consensus        43 ~VCHGsG~S~~eA~~~AA~NALqyLki   69 (74)
                      .+|-|.|.|-.+|+.+||..||+||--
T Consensus       128 ~~~~~~~~sKk~ak~~aa~~al~~l~~  154 (542)
T KOG2777|consen  128 RWFEGGGRSKKEAKQEAAMAALQVLFK  154 (542)
T ss_pred             EEccCCCcchHHHHHHHHHHHHHHHHh
Confidence            467888999999999999999999853


No 20 
>PF02169 LPP20:  LPP20 lipoprotein;  InterPro: IPR002217 A major antigen has been recognised in Helicobacter pylori, a protein with an apparent molecular weight of 20,000 and mass 18,283 kDa []. DNA sequence analysis revealed a 525 bp gene, encoding a 175-amino acid residue product with a typical 21-residue lipoprotein signal peptide and consensus prolipoprotein processing site []. Results of experimental work with Lpp20 are consistent with it being a nonessential lipoprotein []. Prokaryotic membrane lipoproteins are synthesised with precursor signal peptides that are cleaved by specific peptidases (signal peptidase II). The enzyme recognises a conserved sequence, cutting upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached [].; GO: 0009279 cell outer membrane
Probab=66.73  E-value=5.6  Score=23.72  Aligned_cols=23  Identities=30%  Similarity=0.397  Sum_probs=18.7

Q ss_pred             ecCCCCHHHHHHHHHHHHHHHHH
Q psy1572          46 HGQGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        46 HGsG~S~~eA~~~AA~NALqyLk   68 (74)
                      -|.|.+.+.|..+|-++..+.|.
T Consensus        17 ~G~~~~~~~A~~~A~~~la~~i~   39 (92)
T PF02169_consen   17 VGSGSSREQAKQDALANLAEQIS   39 (92)
T ss_pred             EEcccChHHHHHHHHHHHHHhee
Confidence            38899999999999888877553


No 21 
>PF12909 DUF3832:  Protein of unknown function (DUF3832);  InterPro: IPR024308 This domain found in bacteria and Acanthamoeba polyphaga mimivirus (APMV) L591 protein has no known function. In Q0AZ30 from SWISSPROT and other uncharacterised proteins this domain is found C-terminal, while the N terminus shows remote homology to the bacterial toxin/antitoxin 'addiction module' (PF12910 from PFAM).; PDB: 3K6Q_C.
Probab=66.49  E-value=6.1  Score=26.00  Aligned_cols=37  Identities=22%  Similarity=0.256  Sum_probs=28.9

Q ss_pred             cccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHH
Q psy1572          27 STGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        27 ~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLk   68 (74)
                      -.|.+  -++|--.|   -+|.|.|.++|-++.+-+.++|=.
T Consensus        15 ~dg~~--~~~l~~ld---i~~~g~T~eeA~~~lie~l~dYAe   51 (89)
T PF12909_consen   15 DDGSI--TVSLPELD---IYANGPTLEEAIEDLIEDLRDYAE   51 (89)
T ss_dssp             ETTEE--EEEECCGT---EEEEESSHHHHHHHHHHHHHHHHH
T ss_pred             CCCeE--EEEecCee---EEecCCCHHHHHHHHHHHHHHHHH
Confidence            34555  55666666   589999999999999999999843


No 22 
>KOG4334|consensus
Probab=65.74  E-value=5.6  Score=34.44  Aligned_cols=28  Identities=32%  Similarity=0.398  Sum_probs=23.5

Q ss_pred             CCceEeecCCCCHHHHHHHHHHHHHHHH
Q psy1572          40 YPVAVCHGQGKTQQEAKTVAAHNAIEYL   67 (74)
Q Consensus        40 ~P~~VCHGsG~S~~eA~~~AA~NALqyL   67 (74)
                      +-+.+=-|.|+|-..|+..|||.+|+.|
T Consensus       414 d~~~yGsG~g~sKK~Ak~~AAR~tLeiL  441 (650)
T KOG4334|consen  414 DLFPYGSGVGASKKTAKLVAARDTLEIL  441 (650)
T ss_pred             cccccccccccchHHHHHHHHHHHHHHh
Confidence            3355666888999999999999999886


No 23 
>cd00099 IgV Immunoglobulin variable domain (IgV). IgV: Immunoglobulin variable domain (IgV). Members of the IgV family are components of immunoglobulin (Ig) and T cell receptors. The basic structure of Ig molecules is a tetramer of two light chains and two heavy chains linked by disulfide bonds. In Ig, each chain is composed of one variable domain (IgV) and one or more constant domains (IgC); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is the disulfide bridge connecting 2 beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=62.12  E-value=9  Score=22.51  Aligned_cols=26  Identities=19%  Similarity=0.300  Sum_probs=18.8

Q ss_pred             ccccceeEEEEeccCCceEeecCCCC
Q psy1572          26 HSTGQFQALVQLATYPVAVCHGQGKT   51 (74)
Q Consensus        26 s~~G~yqcLV~LST~P~~VCHGsG~S   51 (74)
                      +=+|.|.|-+.-..-+-...+|.|..
T Consensus        77 ~D~G~Y~C~v~~~~~~~~~~~~~gt~  102 (105)
T cd00099          77 EDSAVYYCAVSLSGGTYKLYFGQGTR  102 (105)
T ss_pred             HHCEEEEEEEecCCCceeeEECCCEE
Confidence            34699999997444334888998864


No 24 
>PF01413 C4:  C-terminal tandem repeated domain in type 4 procollagen;  InterPro: IPR001442  Collagens are major components of the extracellular matrices of all metazoan life and play crucial roles in developmental processes and tissue homeostasis. Collagens are composed of three polypeptide chains (alpha chains) that fold together to form the characteristic triple helical collagenous domain. Some types of triple helical protomers contain genetically identical alpha chains forming homotrimers, whereas others contain two or three different alpha chains forming heterotrimers. The sequences required to form a collagenous domain are Gly-X-Y repeats in which the X and Y positions are frequently proline and hydroxyproline. Glycine is required every third residue as it is the only amino acid small enough to pack into the central core of the triple helix. The triple helix-forming parts are surrounded by non-collagenous (NC) domains of variable sequence, size, and shape. Even if the triple helical parts represent the most striking feature of collagens, tissue specificity as well as defined binding of non-collagens seem to be encoded in the NC domains. The terminal NC domains are excised, modified, or incorporated directly into the final suprastructure, depending on protomer type and function [, ].   Type IV collagen is one of the major constituents of basement membranes, a specialised form of extracellular matrix underlying epithelia that compartmentalises tissues and provides molecular signals for influencing cell behaviour. Each type IV chain contains a long triple-helical collagenous domain flanked by a short 7S domain of 25 residues and a globular non-collagenous NC1 domain of ~230 residues at the N and C terminus, respectively. In protomer assembly, the NC1 domains (monomers) of three chains interact, forming an NC1 trimer, to select and register chains for triple helix formation. In network assembly, the NC1 trimers of two protomers interact, forming a NC1 hexamer structure, to select and connect protomers [, , ].   The collagen IV NC1 domain contains 12 cysteines, and all of them are involved in disulphide bonds. It folds into a tertiary structure with predominantly beta-strands. The collagen IV NC1 domain is composed of two similarly folded subdomains stabilised by 3 intrachain dissulphide bonds involving the following pairs: C1-C6, C2-C5, and C3-C4. Each subdomain represents a compact disulphide-stabilised triangular structure, from which a finger-like hairpin loop projects into an incompletely formed six-stranded beta-sheet of an adjacent subdomain of the same or of an adjacent chain clamping the subdomains tightly together [, , ].  This duplicated domain is present at the C-terminal of type 4 collagen, the major structural component of glomerular basement membranes (GMB) forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome.; GO: 0005201 extracellular matrix structural constituent, 0005581 collagen; PDB: 1T61_F 1M3D_L 1LI1_D.
Probab=60.87  E-value=4.7  Score=27.67  Aligned_cols=18  Identities=33%  Similarity=0.748  Sum_probs=15.4

Q ss_pred             eEEEEeccCCceEeecCC
Q psy1572          32 QALVQLATYPVAVCHGQG   49 (74)
Q Consensus        32 qcLV~LST~P~~VCHGsG   49 (74)
                      -||-..+|.|...|||.|
T Consensus        48 SCL~~F~~~Pfi~C~~~g   65 (114)
T PF01413_consen   48 SCLERFRTMPFIECNGRG   65 (114)
T ss_dssp             GEESSEESS-EEEEETTS
T ss_pred             hhHHHHhcCcceecCCCC
Confidence            489999999999999966


No 25 
>PF03681 UPF0150:  Uncharacterised protein family (UPF0150);  InterPro: IPR005357 This family of small proteins is uncharacterised. In Q9A3L8 from SWISSPROT this domain is found next to a DNA binding helix-turn-helix domain IPR002145 from INTERPRO, which suggests that this is some kind of ligand binding domain.; PDB: 2DSY_C 2YZT_A 3KWR_A.
Probab=59.78  E-value=7.8  Score=21.36  Aligned_cols=16  Identities=44%  Similarity=0.488  Sum_probs=13.3

Q ss_pred             EeecCCCCHHHHHHHH
Q psy1572          44 VCHGQGKTQQEAKTVA   59 (74)
Q Consensus        44 VCHGsG~S~~eA~~~A   59 (74)
                      =|+..|.|.+||.++|
T Consensus        23 g~~t~G~t~eea~~~~   38 (48)
T PF03681_consen   23 GCFTQGDTLEEALENA   38 (48)
T ss_dssp             TCEEEESSHHHHHHHH
T ss_pred             ChhhcCCCHHHHHHHH
Confidence            4778899999998775


No 26 
>cd05733 Ig6_L1-CAM_like Sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig6_L1-CAM_like: domain similar to the sixth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM).  L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains NrCAM [Ng(neuronglia)CAM-related cell adhesion molecule], which is primarily expressed in the nervous system, and human neurofascin.
Probab=59.30  E-value=9.6  Score=22.12  Aligned_cols=9  Identities=22%  Similarity=0.623  Sum_probs=7.9

Q ss_pred             ccceeEEEE
Q psy1572          28 TGQFQALVQ   36 (74)
Q Consensus        28 ~G~yqcLV~   36 (74)
                      .|.|+|.++
T Consensus        55 ~G~Y~C~A~   63 (77)
T cd05733          55 EGEYQCYAS   63 (77)
T ss_pred             CEEEEEEEE
Confidence            799999986


No 27 
>smart00111 C4 C-terminal tandem repeated domain in type 4 procollagens. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome.
Probab=58.58  E-value=4.4  Score=27.83  Aligned_cols=21  Identities=29%  Similarity=0.615  Sum_probs=17.7

Q ss_pred             eEEEEeccCCceEeecCCCCHH
Q psy1572          32 QALVQLATYPVAVCHGQGKTQQ   53 (74)
Q Consensus        32 qcLV~LST~P~~VCHGsG~S~~   53 (74)
                      -||-..++.|...|||.| ++.
T Consensus        48 SCL~~F~~~Pfi~C~~~~-~C~   68 (114)
T smart00111       48 SCLERFRTMPFIECNGRG-VCN   68 (114)
T ss_pred             cchhhhccCCeEEECCCC-eec
Confidence            589999999999999965 443


No 28 
>PF12695 Abhydrolase_5:  Alpha/beta hydrolase family; PDB: 3D0K_B 2I3D_B 3DOH_B 3DOI_B 3PFB_A 3S2Z_B 3PFC_A 3QM1_A 3PF8_B 3PF9_A ....
Probab=57.51  E-value=13  Score=22.24  Aligned_cols=16  Identities=25%  Similarity=0.499  Sum_probs=12.8

Q ss_pred             CceEeecCCCCHHHHH
Q psy1572          41 PVAVCHGQGKTQQEAK   56 (74)
Q Consensus        41 P~~VCHGsG~S~~eA~   56 (74)
                      |+.+|||.|.+..+-.
T Consensus         1 ~vv~~HG~~~~~~~~~   16 (145)
T PF12695_consen    1 VVVLLHGWGGSRRDYQ   16 (145)
T ss_dssp             EEEEECTTTTTTHHHH
T ss_pred             CEEEECCCCCCHHHHH
Confidence            6789999999876543


No 29 
>COG1598 Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]
Probab=56.77  E-value=9  Score=23.40  Aligned_cols=16  Identities=50%  Similarity=0.663  Sum_probs=13.8

Q ss_pred             EeecCCCCHHHHHHHH
Q psy1572          44 VCHGQGKTQQEAKTVA   59 (74)
Q Consensus        44 VCHGsG~S~~eA~~~A   59 (74)
                      =||-.|.|.+||..++
T Consensus        25 gc~s~G~T~eea~~n~   40 (73)
T COG1598          25 GCHSQGETLEEALQNA   40 (73)
T ss_pred             CccccCCCHHHHHHHH
Confidence            4888999999998875


No 30 
>cd04967 Ig1_Contactin First Ig domain of contactin. Ig1_Contactin: First Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 weeks postnata
Probab=54.44  E-value=7.9  Score=22.94  Aligned_cols=11  Identities=27%  Similarity=0.679  Sum_probs=9.6

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-+|.|+|.++
T Consensus        69 ~d~G~Y~C~a~   79 (91)
T cd04967          69 KDAGRYQCLAS   79 (91)
T ss_pred             CCCEEEEEEEE
Confidence            46899999997


No 31 
>PRK04930 glutathione-regulated potassium-efflux system ancillary protein KefG; Provisional
Probab=51.80  E-value=26  Score=25.04  Aligned_cols=35  Identities=9%  Similarity=-0.011  Sum_probs=28.9

Q ss_pred             EEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572          35 VQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRL   69 (74)
Q Consensus        35 V~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLki   69 (74)
                      .-+.-.|+.+|||.....++.....|....++|..
T Consensus       143 ~Gm~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~  177 (184)
T PRK04930        143 CRMHWLSPIIIYWARRQSPEELASHARAYGDWLAN  177 (184)
T ss_pred             cCCeEcCcEEEecCCCCCHHHHHHHHHHHHHHHhh
Confidence            36777899999999888888888888888888764


No 32 
>PRK03661 hypothetical protein; Validated
Probab=50.89  E-value=59  Score=22.87  Aligned_cols=40  Identities=10%  Similarity=-0.011  Sum_probs=28.8

Q ss_pred             ceeEEEEecc-CCceEeecCCCCHHHHHHHHHHHHHHHHHHh
Q psy1572          30 QFQALVQLAT-YPVAVCHGQGKTQQEAKTVAAHNAIEYLRLM   70 (74)
Q Consensus        30 ~yqcLV~LST-~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM   70 (74)
                      -|.++..=.. .-...+|..| ++++-+..|+..||+.|..+
T Consensus       120 v~i~i~~~~~~~~~~~~~~~g-~R~~ir~~~~~~AL~~L~~~  160 (164)
T PRK03661        120 VWFGFASASGEGITRRECFSG-DRDAVRRQATAYALQTLWQQ  160 (164)
T ss_pred             EEEEEEeCCCcEEEEEEecCC-CHHHHHHHHHHHHHHHHHHH
Confidence            4666654222 1244688888 89999999999999988754


No 33 
>cd02647 nuc_hydro_TvIAG nuc_hydro_ TvIAG:  Nucleoside hydrolases similar to the Inosine-adenosine-guanosine-preferring nucleoside hydrolase from Trypanosoma vivax.   Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. Nucleoside hydrolases vary in their substrate specificity. This group contains eukaryotic and bacterial proteins similar to the purine specific inosine-adenosine-guanosine-preferring nucleoside hydrolase (IAG-NH) from T.  vivax.  T. vivax IAG-NH is of the order of a thousand to ten thousand fold more specific towards the naturally occurring purine nucleosides, than towards the pyrimidine nucleosides.
Probab=50.75  E-value=23  Score=26.92  Aligned_cols=28  Identities=7%  Similarity=0.006  Sum_probs=21.7

Q ss_pred             ceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572          42 VAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK   72 (74)
Q Consensus        42 ~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~   72 (74)
                      ++++||.|...   .+++++||+..|..|.+
T Consensus        31 It~v~~~GN~~---~~~~~~na~~ll~~~g~   58 (312)
T cd02647          31 IGVSGIDADCY---VEPAVSVTRKLIDRLGQ   58 (312)
T ss_pred             EEEecCcCCcc---HHHHHHHHHHHHHHhCC
Confidence            56675667663   47899999999999875


No 34 
>PRK03657 hypothetical protein; Validated
Probab=46.87  E-value=62  Score=23.14  Aligned_cols=42  Identities=17%  Similarity=-0.026  Sum_probs=28.8

Q ss_pred             ceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572          30 QFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK   72 (74)
Q Consensus        30 ~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~   72 (74)
                      -|.++..=...-...+|-.| .+++-+..|+..||+.|...-+
T Consensus       126 V~iai~~~~~~~~~~~~~~g-~R~~ir~~a~~~al~~L~~~l~  167 (170)
T PRK03657        126 VWFAWNIKGQTYTARMHFAG-DCETVLAKAVRFALAQLLQLLL  167 (170)
T ss_pred             EEEEEEcCCcEEEEEEecCC-CHHHHHHHHHHHHHHHHHHHHH
Confidence            35555432111244678888 7999999999999998865443


No 35 
>cd05874 Ig6_NrCAM Sixth immunoglobulin (Ig)-like domain of NrCAM (Ng (neuronglia) CAM-related cell adhesion molecule). Ig6_NrCAM: sixth immunoglobulin (Ig)-like domain of NrCAM (Ng (neuronglia) CAM-related cell adhesion molecule). NrCAM belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region, and an intracellular domain. NrCAM is primarily expressed in the nervous system.
Probab=46.84  E-value=21  Score=20.99  Aligned_cols=10  Identities=20%  Similarity=0.355  Sum_probs=8.2

Q ss_pred             ccceeEEEEe
Q psy1572          28 TGQFQALVQL   37 (74)
Q Consensus        28 ~G~yqcLV~L   37 (74)
                      .|.|+|.++=
T Consensus        55 ~G~Y~C~A~N   64 (77)
T cd05874          55 EGVYQCTARN   64 (77)
T ss_pred             CEEEEEEEEc
Confidence            5999999863


No 36 
>cd05739 Ig3_RPTP_IIa_LAR_like Third immunoglobulin (Ig)-like domain of the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. Ig3_RPTP_IIa_LAR_like: domain similar to the third immunoglobulin (Ig)-like domain found in the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. LAR belongs to the RPTP type IIa subfamily. Members of this subfamily are cell adhesion molecule-like proteins involved in central nervous system (CNS) development. They have large extracellular portions, comprised of multiple IG-like domains and two to nine fibronectin type III (FNIII) domains, and a cytoplasmic portion having two tandem phosphatase domains. Included in this group is Drosophila LAR (DLAR).
Probab=44.11  E-value=17  Score=20.96  Aligned_cols=15  Identities=0%  Similarity=0.211  Sum_probs=11.7

Q ss_pred             eeeeccccceeEEEE
Q psy1572          22 IAFLHSTGQFQALVQ   36 (74)
Q Consensus        22 ieels~~G~yqcLV~   36 (74)
                      |..++-+|.|.|.++
T Consensus        42 i~~~~d~G~Y~C~A~   56 (69)
T cd05739          42 LTNIYESANYTCVAI   56 (69)
T ss_pred             EeccccCeeEEEEEE
Confidence            455667899999976


No 37 
>PF04098 Rad52_Rad22:  Rad52/22 family double-strand break repair protein;  InterPro: IPR007232 The DNA single-strand annealing proteins (SSAPs), such as RecT, Red-beta, ERF and Rad52, function in RecA-dependent and RecA-independent DNA recombination pathways. This family includes proteins related to Rad52. These proteins contain two helix-hairpin-helix motifs [].; GO: 0006281 DNA repair, 0006310 DNA recombination; PDB: 1KN0_I 1H2I_U.
Probab=41.94  E-value=56  Score=22.28  Aligned_cols=39  Identities=18%  Similarity=0.228  Sum_probs=24.0

Q ss_pred             cccceeEEEEeccCCce----EeecCCC-------CHHHHHHHHHHHHHH
Q psy1572          27 STGQFQALVQLATYPVA----VCHGQGK-------TQQEAKTVAAHNAIE   65 (74)
Q Consensus        27 ~~G~yqcLV~LST~P~~----VCHGsG~-------S~~eA~~~AA~NALq   65 (74)
                      .+-.|.|.|++......    |.+|...       +.++|...|.-+||+
T Consensus        72 ~~~~~~~~vritLkdg~~~e~~G~g~~~~~~~k~~a~~~akK~AvtdAlK  121 (154)
T PF04098_consen   72 YSVGYSAIVRITLKDGTYREDIGYGWSENMDGKGMAIEKAKKGAVTDALK  121 (154)
T ss_dssp             EEEEEEEEEEEEETTS-EEEEEEEEEEESES-HHHHHHHHHHHHHHHHHH
T ss_pred             EEEEEEEEEEEEEeCCceeeecceEEEecCCccchhhhhhHHHHHHHHHH
Confidence            45568899999887776    3334333       345666666666664


No 38 
>PF09888 DUF2115:  Uncharacterized protein conserved in archaea (DUF2115);  InterPro: IPR019215  This entry represents various hypothetical archaeal proteins, has no known function. 
Probab=40.93  E-value=14  Score=26.20  Aligned_cols=29  Identities=14%  Similarity=0.165  Sum_probs=22.1

Q ss_pred             eCCCCCceeeeccccceeEEE-------EeccCCceEe
Q psy1572          15 STNGSNSIAFLHSTGQFQALV-------QLATYPVAVC   45 (74)
Q Consensus        15 ~~~~~n~ieels~~G~yqcLV-------~LST~P~~VC   45 (74)
                      ..|||..|++  -+|.|.|-|       ..|.++-.||
T Consensus       124 ~FPGG~~V~~--~~g~YYCPVKdkq~d~~~aLC~FC~a  159 (163)
T PF09888_consen  124 PFPGGFKVEE--KNGNYYCPVKDKQKDVPFALCNFCPA  159 (163)
T ss_pred             CCCCCeEEEE--ECCEEeCccchhhcCCCCccCCCcCc
Confidence            4799999999  699999987       3455555554


No 39 
>cd05875 Ig6_hNeurofascin_like Sixth immunoglobulin (Ig)-like domain of human neurofascin (NF). Ig6_hNeurofascin_like:  the sixth immunoglobulin (Ig)-like domain of human neurofascin (NF). NF belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region, and a cytoplasmic domain. NF has many alternatively spliced isoforms having different temporal expression patterns during development. NF participates in axon subcellular targeting and synapse formation, however little is known of the functions of the different isoforms.
Probab=40.38  E-value=31  Score=20.21  Aligned_cols=9  Identities=22%  Similarity=0.744  Sum_probs=7.8

Q ss_pred             ccceeEEEE
Q psy1572          28 TGQFQALVQ   36 (74)
Q Consensus        28 ~G~yqcLV~   36 (74)
                      .|.|+|.++
T Consensus        55 ~G~Y~C~A~   63 (77)
T cd05875          55 EGEYQCFAR   63 (77)
T ss_pred             CEEEEEEEE
Confidence            699999975


No 40 
>cd05760 Ig2_PTK7 Second immunoglobulin (Ig)-like domain of protein tyrosine kinase (PTK) 7, also known as CCK4. Ig2_PTK7: domain similar to the second immunoglobulin (Ig)-like domain in protein tyrosine kinase (PTK) 7, also known as CCK4. PTK7 is a subfamily of the receptor protein tyrosine kinase family, and is referred to as an RPTK-like molecule. RPTKs transduce extracellular signals across the cell membrane, and play important roles in regulating cell proliferation, migration, and differentiation. PTK7 is organized as an extracellular portion having seven Ig-like domains, a single transmembrane region, and a cytoplasmic tyrosine kinase-like domain. PTK7 is considered a pseudokinase as it has several unusual residues in some of the highly conserved tyrosine kinase (TK) motifs; it is predicted to lack TK activity. PTK7 may function as a cell-adhesion molecule. PTK7 mRNA is expressed at high levels in placenta, melanocytes, liver, lung, pancreas, and kidney. PTK7 is overexpressed in s
Probab=39.18  E-value=19  Score=20.81  Aligned_cols=11  Identities=9%  Similarity=0.338  Sum_probs=9.1

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        48 ~D~G~Y~C~a~   58 (77)
T cd05760          48 DDSGLYYCCAH   58 (77)
T ss_pred             ccCEEEEEEEE
Confidence            45799999986


No 41 
>KOG2938|consensus
Probab=39.16  E-value=37  Score=27.12  Aligned_cols=27  Identities=26%  Similarity=0.452  Sum_probs=21.1

Q ss_pred             ceEeecCCCCHHHHHHHHHHHHHHHHHHhhcC
Q psy1572          42 VAVCHGQGKTQQEAKTVAAHNAIEYLRLMTKS   73 (74)
Q Consensus        42 ~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~~   73 (74)
                      ++=|||. .+.+    ++++||++.|+.+.+.
T Consensus        52 lTtv~Gn-~~~~----~t~~NA~~~L~l~~r~   78 (350)
T KOG2938|consen   52 LTTVHGN-VTVE----DTDRNALDLLSLLGRL   78 (350)
T ss_pred             eeEeeCC-ccHh----hhhhhHHHHHHhcCCc
Confidence            4558998 5554    7899999999998753


No 42 
>TIGR03056 bchO_mg_che_rel putative magnesium chelatase accessory protein. Members of this family belong to the alpha/beta fold family hydrolases (PFAM model pfam00561). Members are found in bacterial genomes if and only if they encoded for anoxygenic photosynthetic systems similar to that of Rhodobacter capsulatus and other alpha-Proteobacteria. Members often are encoded in the same operon as subunits of the protoporphyrin IX magnesium chelatase, and were once designated BchO. No literature supports a role as an actual subunit of magnesium chelatase, but an accessory role is possible, as suggested by placement by its probable hydrolase activity.
Probab=38.21  E-value=1e+02  Score=20.28  Aligned_cols=32  Identities=9%  Similarity=0.114  Sum_probs=20.0

Q ss_pred             ccccceeEEEEecc---CCceEeecCCCCHHHHHH
Q psy1572          26 HSTGQFQALVQLAT---YPVAVCHGQGKTQQEAKT   57 (74)
Q Consensus        26 s~~G~yqcLV~LST---~P~~VCHGsG~S~~eA~~   57 (74)
                      +++|...+..+...   .|+.+|||.|.+...-+.
T Consensus        12 ~~~~~~~~~~~~g~~~~~~vv~~hG~~~~~~~~~~   46 (278)
T TIGR03056        12 TVGPFHWHVQDMGPTAGPLLLLLHGTGASTHSWRD   46 (278)
T ss_pred             eECCEEEEEEecCCCCCCeEEEEcCCCCCHHHHHH
Confidence            34444444444432   478899999988776544


No 43 
>cd05868 Ig4_NrCAM Fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). Ig4_ NrCAM: fourth immunoglobulin (Ig)-like domain of NrCAM (NgCAM-related cell adhesion molecule). NrCAM belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six IG-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. NrCAM is primarily expressed in the nervous system.
Probab=37.88  E-value=21  Score=20.70  Aligned_cols=12  Identities=17%  Similarity=0.171  Sum_probs=9.2

Q ss_pred             eccccceeEEEE
Q psy1572          25 LHSTGQFQALVQ   36 (74)
Q Consensus        25 ls~~G~yqcLV~   36 (74)
                      .+-.|.|+|.++
T Consensus        50 ~~D~G~Y~C~A~   61 (76)
T cd05868          50 ERSSAVYQCNAS   61 (76)
T ss_pred             HhhCEEEEEEEE
Confidence            345899999875


No 44 
>KOG0286|consensus
Probab=36.40  E-value=68  Score=26.21  Aligned_cols=45  Identities=16%  Similarity=0.149  Sum_probs=40.1

Q ss_pred             ccceeeeEEeeeCCCCCceeeeccccceeEEEEeccCCceEeecC
Q psy1572           4 SGLLDTRVCLISTNGSNSIAFLHSTGQFQALVQLATYPVAVCHGQ   48 (74)
Q Consensus         4 s~l~~~~v~~~~~~~~n~ieels~~G~yqcLV~LST~P~~VCHGs   48 (74)
                      .+--|+.++..|+-|+-.+--|+-++--.+-+.+|.+-.+||-|+
T Consensus       290 agy~d~~c~vWDtlk~e~vg~L~GHeNRvScl~~s~DG~av~TgS  334 (343)
T KOG0286|consen  290 AGYDDFTCNVWDTLKGERVGVLAGHENRVSCLGVSPDGMAVATGS  334 (343)
T ss_pred             eeecCCceeEeeccccceEEEeeccCCeeEEEEECCCCcEEEecc
Confidence            345689999999999999999998888888899999999999886


No 45 
>PRK10443 rihA ribonucleoside hydrolase 1; Provisional
Probab=36.25  E-value=49  Score=24.88  Aligned_cols=27  Identities=22%  Similarity=0.379  Sum_probs=20.1

Q ss_pred             CceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572          41 PVAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK   72 (74)
Q Consensus        41 P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~   72 (74)
                      =+++|||.-.     .+++++||++.|..++.
T Consensus        32 gIt~v~Gn~~-----~~~~~~na~~~l~~~g~   58 (311)
T PRK10443         32 AVTTSAGNQT-----PEKTLRNALRMLTLLNR   58 (311)
T ss_pred             EEEEeCCCCC-----HHHHHHHHHHHHHHhCC
Confidence            3667887543     45789999999998864


No 46 
>cd05873 Ig_Sema4D_like Immunoglobulin (Ig)-like domain of the class IV semaphorin Sema4D. Ig_Sema4D_like; Immunoglobulin (Ig)-like domain of Sema4D. Sema4D is a Class IV semaphorin. Semaphorins are classified based on structural features additional to the Sema domain. Sema4D has extracellular Sema and Ig domains, a transmembrane domain, and a short cytoplasmic domain. Sema4D plays a part in the development of GABAergic synapses. Sema4D in addition is an immune semaphorin. It is abundant on resting T cells; its expression is weak on resting B cells and antigen presenting cells (APCs), but is upregulated by various stimuli. The receptor used by Sema4D in the immune system is CD72. Sem4D enhances the activation of B cells and DCs through binding CD72, perhaps by reducing CD72s inhibitory signals. The receptor used by Sema4D in the non-lymphatic tissues is plexin-B1. Sem4D is anchored to the cell surface but its extracellular domain can be released from the cell surface by a metalloproteas
Probab=36.22  E-value=22  Score=22.26  Aligned_cols=14  Identities=21%  Similarity=0.292  Sum_probs=11.1

Q ss_pred             eccccceeEEEEec
Q psy1572          25 LHSTGQFQALVQLA   38 (74)
Q Consensus        25 ls~~G~yqcLV~LS   38 (74)
                      .+-.|.|+|.++-.
T Consensus        57 ~~DsG~Y~C~a~e~   70 (87)
T cd05873          57 EADAGRYQCLSVEK   70 (87)
T ss_pred             HHHCEEEEEEEEec
Confidence            34579999999866


No 47 
>smart00408 IGc2 Immunoglobulin C-2 Type.
Probab=35.06  E-value=26  Score=17.40  Aligned_cols=11  Identities=9%  Similarity=0.604  Sum_probs=8.6

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.+.
T Consensus        49 ~d~G~Y~C~~~   59 (63)
T smart00408       49 EDSGEYTCVAE   59 (63)
T ss_pred             ccCEEEEEEEe
Confidence            45799999874


No 48 
>PF05165 GGDN:  GGDN family;  InterPro: IPR007839 GTP cyclohydrolase III catalyses the formation of 2-amino-5-formylamino-6- ribofuranosylamino-4(3H)-pyrimidinone ribonucleotide monophosphate and inorganic phosphate from GTP. The enzyme also has an independent pyrophosphate phosphohydrolase activity. The proteins are 200-270 amino acids in length.; GO: 0003933 GTP cyclohydrolase activity, 0009058 biosynthetic process; PDB: 2QV6_B.
Probab=35.01  E-value=28  Score=26.78  Aligned_cols=26  Identities=31%  Similarity=0.420  Sum_probs=20.4

Q ss_pred             EeccCCceEeecCCCCHHHHHHHHHH
Q psy1572          36 QLATYPVAVCHGQGKTQQEAKTVAAH   61 (74)
Q Consensus        36 ~LST~P~~VCHGsG~S~~eA~~~AA~   61 (74)
                      ..+.-|+.+|=|.|.|+-||+.+|-.
T Consensus        75 ~~~Pv~vs~~ig~g~tP~eA~~~a~~  100 (246)
T PF05165_consen   75 NRYPVTVSMCIGYGETPYEAQENATE  100 (246)
T ss_dssp             HHSSS-EEEEEEEESSHHHHHHHHHH
T ss_pred             hcCCceEEEEEecCCCHHHHHHHHHH
Confidence            34566788999999999999988654


No 49 
>cd05723 Ig4_Neogenin Fourth immunoglobulin (Ig)-like domain in neogenin and similar proteins. Ig4_Neogenin: fourth immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin  is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC, which is deleted in colorectal carcinoma . DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain.
Probab=34.94  E-value=25  Score=19.95  Aligned_cols=11  Identities=18%  Similarity=0.519  Sum_probs=8.8

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      +-.|.|+|.++
T Consensus        47 ~D~G~Y~C~A~   57 (71)
T cd05723          47 SDEGFYQCIAE   57 (71)
T ss_pred             ccCEEEEEEEE
Confidence            45699999976


No 50 
>cd05750 Ig_Pro_neuregulin Immunoglobulin (Ig)-like domain in neuregulins (NRGs). Ig_Pro_neuregulin: immunoglobulin (Ig)-like domain in neuregulins (NRGs). NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. There are four members of the neuregulin gene family (NRG1, -2, -3, and -4). The NRG-1 protein, binds to and activates the tyrosine kinases receptors ErbB3 and ErbB4, initiating signaling cascades. The other NRGs proteins bind one or the other or both of these ErbBs. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitters receptors; in the peripheral nervous system NRG-1 regulates processes such as target cell differentiation, and Schwann cell surv
Probab=34.62  E-value=27  Score=19.27  Aligned_cols=11  Identities=18%  Similarity=0.667  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-+|.|.|.++
T Consensus        53 ~D~G~Y~C~a~   63 (75)
T cd05750          53 ADSGEYTCVVE   63 (75)
T ss_pred             ccCeEEEEEEE
Confidence            56799999876


No 51 
>TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH. This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown, but is proposed in ref 2. This enzyme belongs to the alpha/beta hydrolase fold family (pfam model pfam00561). Members of this family are restricted to the Proteobacteria.
Probab=33.68  E-value=45  Score=20.78  Aligned_cols=19  Identities=16%  Similarity=0.305  Sum_probs=14.3

Q ss_pred             CCceEeecCCCCHHHHHHH
Q psy1572          40 YPVAVCHGQGKTQQEAKTV   58 (74)
Q Consensus        40 ~P~~VCHGsG~S~~eA~~~   58 (74)
                      .|+.++||.|.+...-+..
T Consensus         5 ~~iv~~HG~~~~~~~~~~~   23 (245)
T TIGR01738         5 VHLVLIHGWGMNAEVFRCL   23 (245)
T ss_pred             ceEEEEcCCCCchhhHHHH
Confidence            4789999999987754443


No 52 
>cd05848 Ig1_Contactin-5 First Ig domain of contactin-5. Ig1_Contactin-5: First Ig domain of the neural cell adhesion molecule contactin-5. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains, anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. In rats, a lack of contactin-5 (NB-2) results in an impairment of the neuronal activity in the auditory system. Contactin-5 is expressed specifically in the postnatal nervous system, peaking at about 3 weeks postnatal. Contactin-5 is highly expressed in the adult human brain in the occipital lobe and in the amygdala; lower levels of expression have been detected in the corpus callosum, caudate nucleus, and spinal cord.
Probab=33.59  E-value=26  Score=21.21  Aligned_cols=11  Identities=27%  Similarity=0.700  Sum_probs=9.3

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        69 ~D~G~Y~C~A~   79 (94)
T cd05848          69 KDSGRYQCLAT   79 (94)
T ss_pred             CcCEEEEEEEE
Confidence            46799999986


No 53 
>cd05738 Ig2_RPTP_IIa_LAR_like Second immunoglobulin (Ig)-like domain of  the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. Ig2_RPTP_IIa_LAR_like: domain similar to the second immunoglobulin (Ig)-like domain found in the receptor protein tyrosine phosphatase (RPTP)-F, also known as LAR. LAR belongs to the RPTP type IIa subfamily. Members of this subfamily are cell adhesion molecule-like proteins involved in central nervous system (CNS) development. They have large extracellular portions, comprised of multiple Ig-like domains and two to nine fibronectin type III (FNIII) domains, and a cytoplasmic portion having two tandem phosphatase domains.
Probab=33.56  E-value=27  Score=19.97  Aligned_cols=12  Identities=8%  Similarity=0.412  Sum_probs=9.8

Q ss_pred             eccccceeEEEE
Q psy1572          25 LHSTGQFQALVQ   36 (74)
Q Consensus        25 ls~~G~yqcLV~   36 (74)
                      .+-.|.|+|.++
T Consensus        47 ~~D~G~Y~C~a~   58 (74)
T cd05738          47 ESDQGKYECVAT   58 (74)
T ss_pred             hhhCEEEEEEEE
Confidence            457899999886


No 54 
>cd05850 Ig1_Contactin-2 First Ig domain of contactin-2. Ig1_Contactin-2: First Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between IG domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=33.51  E-value=26  Score=21.14  Aligned_cols=11  Identities=27%  Similarity=0.576  Sum_probs=9.6

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        69 ~D~G~Y~C~A~   79 (94)
T cd05850          69 RDAGSYQCLAI   79 (94)
T ss_pred             CcCEEEEEEEE
Confidence            57899999987


No 55 
>cd05852 Ig5_Contactin-1 Fifth Ig domain of contactin-1. Ig5_Contactin-1: fifth Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=33.45  E-value=27  Score=20.43  Aligned_cols=11  Identities=9%  Similarity=0.446  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        49 ~D~G~Y~C~A~   59 (73)
T cd05852          49 LDEGSYTCFAE   59 (73)
T ss_pred             hHCEEEEEEEE
Confidence            45799999986


No 56 
>COG2429 Archaeal GTP cyclohydrolase III [Nucleotide transport and    metabolism]
Probab=33.13  E-value=36  Score=26.67  Aligned_cols=23  Identities=35%  Similarity=0.628  Sum_probs=17.9

Q ss_pred             ceEeecCCCCHHHHHHHHHHHHHH
Q psy1572          42 VAVCHGQGKTQQEAKTVAAHNAIE   65 (74)
Q Consensus        42 ~~VCHGsG~S~~eA~~~AA~NALq   65 (74)
                      |.+|=|+|.|+-||+.+|- +||+
T Consensus        84 Vs~~ig~g~TP~dA~~~A~-~aL~  106 (250)
T COG2429          84 VSMCIGTGRTPLDAQENAG-IALQ  106 (250)
T ss_pred             EEEEeecCCCcHHHHHHHH-HhHh
Confidence            3469999999999998764 4554


No 57 
>PF13927 Ig_3:  Immunoglobulin domain; PDB: 2D3V_A 1G0X_A 1VDG_A 1P7Q_D 3D2U_H 1UFU_A 1UGN_A 3VH8_H 3OQ3_B 4DKD_C ....
Probab=33.09  E-value=33  Score=18.07  Aligned_cols=12  Identities=8%  Similarity=0.387  Sum_probs=9.3

Q ss_pred             eccccceeEEEE
Q psy1572          25 LHSTGQFQALVQ   36 (74)
Q Consensus        25 ls~~G~yqcLV~   36 (74)
                      .+-+|.|+|.++
T Consensus        63 ~~~~g~y~C~a~   74 (75)
T PF13927_consen   63 RSDNGTYTCIAS   74 (75)
T ss_dssp             GGGTEEEEEEEE
T ss_pred             HHhCcEEEEEEE
Confidence            457788999875


No 58 
>cd05867 Ig4_L1-CAM_like Fourth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig4_L1-CAM_like:  fourth immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Probab=32.56  E-value=29  Score=19.80  Aligned_cols=13  Identities=15%  Similarity=0.138  Sum_probs=9.9

Q ss_pred             ccccceeEEEEec
Q psy1572          26 HSTGQFQALVQLA   38 (74)
Q Consensus        26 s~~G~yqcLV~LS   38 (74)
                      .-+|.|+|.++=+
T Consensus        51 ~D~G~Y~C~a~N~   63 (76)
T cd05867          51 SDTAVYQCEARNR   63 (76)
T ss_pred             hhCEEEEEEEECC
Confidence            4579999998643


No 59 
>cd05895 Ig_Pro_neuregulin-1 Immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. Ig_Pro_neuregulin-1: immunoglobulin (Ig)-like domain found in neuregulin (NRG)-1. There are many NRG-1 isoforms which arise from the alternative splicing of mRNA. NRG-1 belongs to the neuregulin gene family, which is comprised of four genes. This group represents NRG-1. NRGs are signaling molecules, which participate in cell-cell interactions in the nervous system, breast, and heart, and other organ systems, and are implicated in the pathology of diseases including schizophrenia, multiple sclerosis, and breast cancer. The NRG-1 protein binds to and activates the tyrosine kinases receptors ErbB3 and ErbB4, initiating signaling cascades. NRG-1 has multiple functions; for example, in the brain it regulates various processes such as radial glia formation and neuronal migration, dendritic development, and expression of neurotransmitters receptors; in the peripheral nervous system NRG-1 regulates process
Probab=32.54  E-value=29  Score=19.62  Aligned_cols=12  Identities=25%  Similarity=0.722  Sum_probs=9.7

Q ss_pred             eccccceeEEEE
Q psy1572          25 LHSTGQFQALVQ   36 (74)
Q Consensus        25 ls~~G~yqcLV~   36 (74)
                      ..-+|.|.|.++
T Consensus        53 ~~DsG~Y~C~a~   64 (76)
T cd05895          53 LADNGEYKCMVS   64 (76)
T ss_pred             cccCEEEEEEEE
Confidence            456799999986


No 60 
>KOG1817|consensus
Probab=32.44  E-value=48  Score=28.46  Aligned_cols=24  Identities=33%  Similarity=0.461  Sum_probs=22.0

Q ss_pred             eecCCCCHHHHHHHHHHHHHHHHH
Q psy1572          45 CHGQGKTQQEAKTVAAHNAIEYLR   68 (74)
Q Consensus        45 CHGsG~S~~eA~~~AA~NALqyLk   68 (74)
                      =-|.|++..+|--.||-+||+-++
T Consensus       479 at~~G~nik~Ae~rAA~~ALe~~~  502 (533)
T KOG1817|consen  479 ATGVGSNIKQAEMRAAMQALENLK  502 (533)
T ss_pred             eeccCchHhHHHHHHHHHHHHHHH
Confidence            358999999999999999999887


No 61 
>cd05722 Ig1_Neogenin First immunoglobulin (Ig)-like domain in neogenin and similar proteins. Ig1_Neogenin: first immunoglobulin (Ig)-like domain in neogenin and related proteins. Neogenin  is a cell surface protein which is expressed in the developing nervous system of vertebrate embryos in the growing nerve cells. It is also expressed in other embryonic tissues, and may play a general role in developmental processes such as cell migration, cell-cell recognition, and tissue growth regulation. Included in this group is the tumor suppressor protein DCC, which is deleted in colorectal carcinoma . DCC and neogenin each have four Ig-like domains followed by six fibronectin type III domains, a transmembrane domain, and an intracellular domain.
Probab=31.86  E-value=29  Score=20.52  Aligned_cols=13  Identities=23%  Similarity=0.332  Sum_probs=10.3

Q ss_pred             ccccceeEEEEec
Q psy1572          26 HSTGQFQALVQLA   38 (74)
Q Consensus        26 s~~G~yqcLV~LS   38 (74)
                      .-.|.|+|.++=.
T Consensus        69 ~D~G~Y~C~a~N~   81 (95)
T cd05722          69 PDEGFYQCVAQND   81 (95)
T ss_pred             CcCEEEEEEEECC
Confidence            4579999998754


No 62 
>cd05849 Ig1_Contactin-1 First Ig domain of contactin-1. Ig1_Contactin-1: First Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may, through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=31.60  E-value=30  Score=21.15  Aligned_cols=11  Identities=18%  Similarity=0.616  Sum_probs=9.5

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        68 ~D~G~Y~C~A~   78 (93)
T cd05849          68 KDAGRYVCIVS   78 (93)
T ss_pred             CCCEEEEEEEE
Confidence            57899999987


No 63 
>cd04970 Ig6_Contactin_like Sixth Ig domain of contactin. Ig6_Contactin_like: Sixth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=31.27  E-value=31  Score=20.01  Aligned_cols=11  Identities=18%  Similarity=0.492  Sum_probs=9.1

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        57 ~D~G~Y~C~a~   67 (85)
T cd04970          57 KHAGKYTCTAQ   67 (85)
T ss_pred             HhCeeeEEEEe
Confidence            45799999986


No 64 
>cd05740 Ig_CEACAM_D4 Fourth immunoglobulin (Ig)-like domain of carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM). Ig_CEACAM_D4:  immunoglobulin (Ig)-like domain 4 in carcinoembryonic antigen (CEA) related cell adhesion molecule (CEACAM) protein subfamily. The CEA family is a group of anchored or secreted glycoproteins, expressed by epithelial cells, leukocytes, endothelial cells and placenta. The CEA family is divided into the CEACAM and pregnancy-specific glycoprotein (PSG) subfamilies. This group represents the CEACAM subfamily. CEACAM1 has many important cellular functions, it is a cell adhesion molecule, and a signaling molecule that regulates the growth of tumor cells, it is an angiogenic factor, and is a receptor for bacterial and viral pathogens, including mouse hepatitis virus (MHV). In mice, four isoforms of CEACAM1 generated by alternative splicing have either two [D1, D4] or four [D1-D4] Ig-like domains on the cell surface. This family corresponds to the
Probab=31.10  E-value=30  Score=21.02  Aligned_cols=11  Identities=27%  Similarity=0.498  Sum_probs=8.8

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.+.
T Consensus        65 ~D~G~Y~C~a~   75 (91)
T cd05740          65 SDTGHYQCEAS   75 (91)
T ss_pred             hhCEEEEEEEE
Confidence            35699999985


No 65 
>cd05857 Ig2_FGFR Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. Ig2_FGFR: second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor. FGF receptors bind FGF signaling polypeptides. FGFs participate in multiple processes such as morphogenesis, development, and angiogenesis. FGFs bind to four FGF receptor tyrosine kinases (FGFR1, -2, -3, -4). Receptor diversity is controlled by alternative splicing producing splice variants with different ligand binding characteristics and different expression patterns. FGFRs have an extracellular region comprised of three IG-like domains, a single transmembrane helix, and an intracellular tyrosine kinase domain. Ligand binding and specificity reside in the Ig-like domains 2 and 3, and the linker region that connects these two. FGFR activation and signaling depend on FGF-induced dimerization, a process involving cell surface heparin or heparin sulfate proteoglycans.
Probab=31.00  E-value=32  Score=19.57  Aligned_cols=11  Identities=18%  Similarity=0.613  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        61 ~D~G~Y~C~a~   71 (85)
T cd05857          61 SDKGNYTCVVE   71 (85)
T ss_pred             ccCEEEEEEEE
Confidence            35799999987


No 66 
>cd02649 nuc_hydro_CeIAG nuc_hydro_CeIAG: Nucleoside hydrolases similar to the inosine-adenosine-guanosine-preferring nucleoside hydrolase from Caenorhabditis elegans.  Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains eukaryotic, bacterial and archeal proteins similar to the purine-preferring nucleoside hydrolase (IAG-NH) from C. elegans and the salivary purine nucleosidase from Aedes aegypti.  C. elegans IAG-NH exhibits a high affinity for the substrate analogue p-nitrophenylriboside (p-NPR).
Probab=30.66  E-value=69  Score=24.15  Aligned_cols=26  Identities=19%  Similarity=0.340  Sum_probs=18.4

Q ss_pred             ceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572          42 VAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK   72 (74)
Q Consensus        42 ~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~   72 (74)
                      +++++|...     .+++++||+..|..|.+
T Consensus        31 It~v~GN~~-----~~~~~~na~~~l~~~g~   56 (306)
T cd02649          31 ITCVHGNTN-----VEQVVKNALRVLEACGR   56 (306)
T ss_pred             EEEccCCcC-----HHHHHHHHHHHHHHhCC
Confidence            445666443     45679999999998864


No 67 
>PF02464 CinA:  Competence-damaged protein;  InterPro: IPR008136 CinA is the first gene in the competence-inducible (cin) operon, and is thought to be specifically required at some stage in the process of transformation []. This is a C-terminal region of putative competence-damaged proteins from the cin operon.; PDB: 2A9S_A.
Probab=30.35  E-value=1.8e+02  Score=19.88  Aligned_cols=39  Identities=21%  Similarity=0.217  Sum_probs=23.8

Q ss_pred             ceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHH
Q psy1572          30 QFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRL   69 (74)
Q Consensus        30 ~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLki   69 (74)
                      -|.++..=.-.-+...+-.| ++++-+..|+..||++|..
T Consensus       114 v~iai~~~~~~~~~~~~~~g-~R~~ir~~~~~~al~~L~~  152 (154)
T PF02464_consen  114 VYIAIAYRGGVWVREVQFPG-DREEIRQRAVNQALDLLRR  152 (154)
T ss_dssp             EEEEEEETTSEEEEEEE--S--HHHHHHHHHHHHHHHHH-
T ss_pred             EEEEEEeCCceEEEEEECCC-CHHHHHHHHHHHHHHHHHh
Confidence            34444433332233445556 9999999999999999875


No 68 
>PRK02240 GTP cyclohydrolase III; Provisional
Probab=29.61  E-value=63  Score=25.05  Aligned_cols=25  Identities=28%  Similarity=0.380  Sum_probs=19.8

Q ss_pred             CceEeecCCCCHHHHHHHHHHHHHHH
Q psy1572          41 PVAVCHGQGKTQQEAKTVAAHNAIEY   66 (74)
Q Consensus        41 P~~VCHGsG~S~~eA~~~AA~NALqy   66 (74)
                      ++.+|=|.|.|+.+|...|. .+||.
T Consensus        83 ~ls~GIG~g~TP~eA~~~At-~~Le~  107 (254)
T PRK02240         83 TVSMGIGTAETPYEAQKLAT-EALQE  107 (254)
T ss_pred             ceEEEEEecCCHHHHHHHHH-HHHHH
Confidence            55999999999999997764 35554


No 69 
>cd04971 Ig_TrKABC_d5 Fifth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d5: the fifth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrkB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor foun
Probab=29.33  E-value=36  Score=20.39  Aligned_cols=10  Identities=10%  Similarity=0.491  Sum_probs=8.4

Q ss_pred             cccceeEEEE
Q psy1572          27 STGQFQALVQ   36 (74)
Q Consensus        27 ~~G~yqcLV~   36 (74)
                      -+|.|.|.++
T Consensus        57 D~G~YtC~A~   66 (81)
T cd04971          57 NNGNYTLVAS   66 (81)
T ss_pred             cCeEEEEEEE
Confidence            4699999986


No 70 
>cd05876 Ig3_L1-CAM Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig3_L1-CAM:  third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM.
Probab=29.23  E-value=35  Score=19.30  Aligned_cols=10  Identities=10%  Similarity=0.448  Sum_probs=8.4

Q ss_pred             cccceeEEEE
Q psy1572          27 STGQFQALVQ   36 (74)
Q Consensus        27 ~~G~yqcLV~   36 (74)
                      -+|.|.|.++
T Consensus        47 D~G~Y~C~a~   56 (71)
T cd05876          47 DDGEYVCTAE   56 (71)
T ss_pred             hCEEEEEEEE
Confidence            5699999876


No 71 
>cd05731 Ig3_L1-CAM_like Third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). Ig3_L1-CAM_like: domain similar to the third immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. This group also contains the chicken neuron-glia cell adhesion molecule, Ng-CAM and human neurofascin.
Probab=28.77  E-value=38  Score=18.44  Aligned_cols=12  Identities=8%  Similarity=0.343  Sum_probs=9.0

Q ss_pred             ccccceeEEEEe
Q psy1572          26 HSTGQFQALVQL   37 (74)
Q Consensus        26 s~~G~yqcLV~L   37 (74)
                      .-+|.|.|.+.=
T Consensus        46 ~D~G~Y~C~a~N   57 (71)
T cd05731          46 EDDGEYRCTASN   57 (71)
T ss_pred             HHCEEEEEEEEe
Confidence            346999998763


No 72 
>cd05724 Ig2_Robo Second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Ig2_Robo: domain similar to the second immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit responsiveness, antago
Probab=28.22  E-value=38  Score=19.16  Aligned_cols=11  Identities=9%  Similarity=0.480  Sum_probs=9.1

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-+|.|+|.++
T Consensus        61 ~D~G~Y~C~a~   71 (86)
T cd05724          61 SDEGTYKCVAT   71 (86)
T ss_pred             ccCEEEEEEEE
Confidence            46799999886


No 73 
>smart00409 IG Immunoglobulin.
Probab=28.12  E-value=43  Score=17.38  Aligned_cols=11  Identities=9%  Similarity=0.410  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|.|.+.
T Consensus        61 ~d~G~Y~C~v~   71 (86)
T smart00409       61 EDSGTYTCAAT   71 (86)
T ss_pred             ccCeEEEEEEE
Confidence            34599999998


No 74 
>smart00410 IG_like Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
Probab=28.12  E-value=43  Score=17.38  Aligned_cols=11  Identities=9%  Similarity=0.410  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|.|.+.
T Consensus        61 ~d~G~Y~C~v~   71 (86)
T smart00410       61 EDSGTYTCAAT   71 (86)
T ss_pred             ccCeEEEEEEE
Confidence            34599999998


No 75 
>cd07693 Ig1_Robo First immunoglobulin (Ig)-like domain in Robo (roundabout) receptors and similar proteins. Ig1_Robo: domain similar to the first immunoglobulin (Ig)-like domain in Robo (roundabout) receptors. Robo receptors play a role in the development of the central nervous system (CNS), and are receptors of Slit protein. Slit is a repellant secreted by the neural cells in the midline. Slit acts through Robo to prevent most neurons from crossing the midline from either side. Three mammalian Robo homologs (robo1, -2, and -3), and three mammalian Slit homologs (Slit-1,-2, -3), have been identified. Commissural axons, which cross the midline, express low levels of Robo; longitudinal axons, which avoid the midline, express high levels of Robo. robo1, -2, and -3 are expressed by commissural neurons in the vertebrate spinal cord and Slits 1, -2, -3 are expressed at the ventral midline. Robo-3 is a divergent member of the Robo family which instead of being a positive regulator of slit res
Probab=28.03  E-value=38  Score=19.21  Aligned_cols=12  Identities=8%  Similarity=0.288  Sum_probs=10.2

Q ss_pred             eccccceeEEEE
Q psy1572          25 LHSTGQFQALVQ   36 (74)
Q Consensus        25 ls~~G~yqcLV~   36 (74)
                      ..-.|.|+|.++
T Consensus        73 ~~D~G~Y~C~a~   84 (100)
T cd07693          73 RSDEGVYVCVAH   84 (100)
T ss_pred             cCcCEEEEEEEE
Confidence            567899999986


No 76 
>cd05745 Ig3_Peroxidasin Third immunoglobulin (Ig)-like domain of peroxidasin. Ig3_Peroxidasin: the third immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells which have undergone programmed cell death, and protection of the organism against non-self.
Probab=27.95  E-value=39  Score=19.15  Aligned_cols=11  Identities=27%  Similarity=0.610  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        50 ~D~G~Y~C~A~   60 (74)
T cd05745          50 HDQGQYECQAV   60 (74)
T ss_pred             HhCEEEEEEEE
Confidence            45799999986


No 77 
>PF00047 ig:  Immunoglobulin domain The Prosite family only concerns antibodies and MHCs.;  InterPro: IPR013151 Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involved in protein-protein and protein-ligand interactions. The Pfam alignments do not include the first and last strand of the immunoglobulin-like domain.; PDB: 1B6U_A 3O4O_C 3VH8_H 1BIH_A 2C9A_A 2V5Y_A 1BQH_K 2ATP_A 3B9K_A 1NEZ_H ....
Probab=27.80  E-value=35  Score=18.25  Aligned_cols=9  Identities=22%  Similarity=0.719  Sum_probs=7.5

Q ss_pred             cccceeEEE
Q psy1572          27 STGQFQALV   35 (74)
Q Consensus        27 ~~G~yqcLV   35 (74)
                      -.|.|.|.|
T Consensus        56 d~G~Y~C~v   64 (64)
T PF00047_consen   56 DSGTYTCVV   64 (64)
T ss_dssp             GTEEEEEEE
T ss_pred             HCEEEEEEC
Confidence            579999976


No 78 
>cd05763 Ig_1 Subgroup of the immunoglobulin (Ig) superfamily. Ig_1: subgroup of the immunoglobulin (Ig) domain found in the Ig superfamily. The Ig superfamily is a heterogenous group of proteins, built on a common fold comprised of a sandwich of two beta sheets. Members of the Ig superfamily are components of immunoglobulin, neuroglia, cell surface glycoproteins, such as T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting the two beta-sheets with a tryptophan residue packed against the disulfide bond.
Probab=27.75  E-value=39  Score=19.05  Aligned_cols=11  Identities=27%  Similarity=0.480  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-+|.|+|.++
T Consensus        50 ~D~G~Y~C~A~   60 (75)
T cd05763          50 EDTGVYSCTAQ   60 (75)
T ss_pred             ccCEEEEEEEE
Confidence            45799999985


No 79 
>TIGR02296 HpaC 4-hydroxyphenylacetate 3-monooxygenase, reductase component. These reductases catalyze the reduction of free flavins by NADPH. The flavin is then utilized by the large subunit of the monooxygenase.
Probab=27.64  E-value=94  Score=21.07  Aligned_cols=40  Identities=25%  Similarity=0.294  Sum_probs=27.0

Q ss_pred             eeeeEEeeeCCCCCceeeeccccceeEEEEeccCCceEeecCCC
Q psy1572           7 LDTRVCLISTNGSNSIAFLHSTGQFQALVQLATYPVAVCHGQGK   50 (74)
Q Consensus         7 ~~~~v~~~~~~~~n~ieels~~G~yqcLV~LST~P~~VCHGsG~   50 (74)
                      |-+-|+++.+.++...-.+|++    .+.++|.+||.|.-....
T Consensus         8 ~a~gV~VVT~~~~~~~~G~Tvs----s~~svS~~PP~v~v~l~~   47 (154)
T TIGR02296         8 LAAAVNIITTNGIAGKCGITAT----AVCSVTDTPPTVMVCINR   47 (154)
T ss_pred             cCCceEEEEecCCCCceeEEEE----eEEEeecCCCEEEEEECC
Confidence            4456778877655555666664    478999999877654443


No 80 
>KOG4528|consensus
Probab=27.48  E-value=64  Score=24.03  Aligned_cols=23  Identities=30%  Similarity=0.341  Sum_probs=20.0

Q ss_pred             eecCCCCHHHHHHHHHHHHHHHH
Q psy1572          45 CHGQGKTQQEAKTVAAHNAIEYL   67 (74)
Q Consensus        45 CHGsG~S~~eA~~~AA~NALqyL   67 (74)
                      -|+.|+|.+||-+.++....-|.
T Consensus        42 lH~wgstl~EafE~c~~aMfgym   64 (168)
T KOG4528|consen   42 LHAWGSTLEEAFEQCAVAMFGYM   64 (168)
T ss_pred             EEecccchHHHHHHHHHHHHhhe
Confidence            49999999999999998877663


No 81 
>PF07897 DUF1675:  Protein of unknown function (DUF1675);  InterPro: IPR012463 The members of this family are sequences derived from hypothetical plant proteins of unknown function. One member of this family (Q9SFV5 from SWISSPROT) is annotated as a putative RNA-binding protein, but no evidence was found to support this. 
Probab=27.13  E-value=45  Score=26.08  Aligned_cols=18  Identities=39%  Similarity=0.473  Sum_probs=14.7

Q ss_pred             ceEeecCCCCHHHHHHHH
Q psy1572          42 VAVCHGQGKTQQEAKTVA   59 (74)
Q Consensus        42 ~~VCHGsG~S~~eA~~~A   59 (74)
                      |.||||+=-|.+|=-.+|
T Consensus       254 ~c~chg~~~~~~efv~h~  271 (284)
T PF07897_consen  254 VCVCHGSFLSPAEFVKHA  271 (284)
T ss_pred             EEEecCCCCCHHHHHHhc
Confidence            689999999998865554


No 82 
>cd00455 nuc_hydro nuc_hydro: Nucleoside hydrolases. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity.  This group contains eukaryotic, bacterial and archeal proteins similar to the inosine-uridine preferring nucleoside hydrolase from Crithidia fasciculata,  the xanthosine-inosine-uridine-adenosine-preferring nucleoside hydrolase RihC from Salmonella enterica serovar Typhimurium, the purine-specific  inosine-adenosine-guanosine-preferring nucleoside hydrolase from Trypanosoma vivax and, pyrimidine-specific uridine-cytidine preferring nucleoside hydrolases such as URH1 from Saccharomyces cerevisiae, RihA and RihB from Escherichia coli. Nucleoside hydrolases are of interest as a target for antiprotozoan drugs as, no nucleoside hydrolase activity or genes encoding these enzymes have been detected in humans and, parasitic protozoans lack de novo purine synthesis relying on nucleosid
Probab=26.88  E-value=66  Score=23.77  Aligned_cols=27  Identities=22%  Similarity=0.238  Sum_probs=19.4

Q ss_pred             CceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572          41 PVAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK   72 (74)
Q Consensus        41 P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~   72 (74)
                      =++++||...     -+++++||+..|..|.+
T Consensus        28 gIt~~~Gn~~-----~~~~~~n~~~~l~~~g~   54 (295)
T cd00455          28 GIVATYGNVT-----LEQATQNAAYLLELLGR   54 (295)
T ss_pred             EEEeccCCcc-----HHHHHHHHHHHHHHhCC
Confidence            3556776553     26789999999998864


No 83 
>cd07706 IgV_TCR_delta Immunoglobulin (Ig) variable (V) domain of T-cell receptor (TCR) delta chain. IgV_TCR_delta: immunoglobulin (Ig) variable (V) domain of the delta chain of gamma/delta T-cell receptors (TCRs). TCRs mediate antigen recognition by T lymphocytes, and are heterodimers consisting of alpha and beta chains or gamma and delta chains.  Each chain contains a variable (V) and a constant (C) region. The majority of T cells contain alpha/beta TCRs but a small subset contain gamma/delta TCRs. Alpha/beta TCRs recognize antigen as peptide fragments presented by major histocompatibility complex (MHC) molecules. Gamma/delta TCRs recognize intact protein antigens; they recognize protein antigens directly and without antigen processing, and MHC independently of the bound peptide. Gamma/delta T cells can also be stimulated by non-peptide antigens such as small phosphate- or amine-containing compounds. The variable domain of gamma/delta TCRs is responsible for antigen recognition and is
Probab=26.81  E-value=58  Score=20.01  Aligned_cols=28  Identities=11%  Similarity=0.214  Sum_probs=18.4

Q ss_pred             eeccccceeEEEEeccC----CceEeecCCCC
Q psy1572          24 FLHSTGQFQALVQLATY----PVAVCHGQGKT   51 (74)
Q Consensus        24 els~~G~yqcLV~LST~----P~~VCHGsG~S   51 (74)
                      +.+=+|.|.|-+...-.    +-.++.|.|..
T Consensus        80 ~~~Dsg~Y~C~~~~~~~~~~~~~~~~fg~gt~  111 (116)
T cd07706          80 QLEDSAKYFCALSTLGMGSYDTDKLIFGKGTR  111 (116)
T ss_pred             CHHHCEEEEEEEECCCCCcccccCCEECCCeE
Confidence            34567999999984211    22467888864


No 84 
>PHA02447 hypothetical protein
Probab=26.81  E-value=36  Score=22.67  Aligned_cols=45  Identities=29%  Similarity=0.271  Sum_probs=40.3

Q ss_pred             eeeeccccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHH
Q psy1572          22 IAFLHSTGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEY   66 (74)
Q Consensus        22 ieels~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqy   66 (74)
                      |.--.+.|+|.--|+=.|+.-..--|--.|.+.|...||-.|-+|
T Consensus        35 inpsdvagdfrwsvedntcarvlawgyeetldaamaaaaaaaaey   79 (86)
T PHA02447         35 INPSDVAGDFRWSVEDNTCARVLAWGYEETLDAAMAAAAAAAAEY   79 (86)
T ss_pred             cChhhcccceeeeeccchHHHHHHhhhHHHHHHHHHHHHHHHHHH
Confidence            444568899999999999999999999999999999999999998


No 85 
>COG2820 Udp Uridine phosphorylase [Nucleotide transport and metabolism]
Probab=26.57  E-value=37  Score=26.46  Aligned_cols=25  Identities=36%  Similarity=0.658  Sum_probs=19.9

Q ss_pred             ceeeeccccceeEEEEeccCCceEe-ecCCCC
Q psy1572          21 SIAFLHSTGQFQALVQLATYPVAVC-HGQGKT   51 (74)
Q Consensus        21 ~ieels~~G~yqcLV~LST~P~~VC-HGsG~S   51 (74)
                      +.|+.+..|.|-      ..|+.|| ||.|.-
T Consensus        45 ~Ref~~~~g~~~------g~~v~v~StGIGgP   70 (248)
T COG2820          45 NREFRTYTGTYN------GKPVTVCSTGIGGP   70 (248)
T ss_pred             ccceEEEEEEEc------CeEEEEEecCCCCc
Confidence            567888888765      8999999 898854


No 86 
>cd08342 HPPD_N_like N-terminal domain of 4-hydroxyphenylpyruvate dioxygenase (HPPD) and hydroxymandelate Synthase (HmaS). HppD and HmaS are non-heme iron-dependent dioxygenases, which modify a common substrate, 4-hydroxyphenylpyruvate (HPP), but yield different products. HPPD catalyzes the second reaction in tyrosine catabolism, the conversion of HPP to homogentisate (2,5-dihydroxyphenylacetic acid, HG). HmaS converts HPP to 4-hydroxymandelate, a committed step in the formation of hydroxyphenylglycerine, a structural component of nonproteinogenic macrocyclic peptide antibiotics, such as vancomycin. If the emphasis is on catalytic chemistry, HPPD and HmaS are classified as members of a large family of alpha-keto acid dependent mononuclear non-heme iron oxygenases most of which require Fe(II), molecular oxygen, and an alpha-keto acid (typically alpha-ketoglutarate) to either oxygenate or oxidize a third substrate. Both enzymes are exceptions in that they require two, instead of three, su
Probab=26.46  E-value=72  Score=19.94  Aligned_cols=23  Identities=13%  Similarity=0.312  Sum_probs=18.9

Q ss_pred             eeeEEeeeCCCCCceeeeccccc
Q psy1572           8 DTRVCLISTNGSNSIAFLHSTGQ   30 (74)
Q Consensus         8 ~~~v~~~~~~~~n~ieels~~G~   30 (74)
                      ..+++|+..|.||.|+....+|.
T Consensus       104 ~~~~~~i~dp~G~~ie~~~~~~~  126 (136)
T cd08342         104 ELKIAAIKGYGDSLHTLVDRKGY  126 (136)
T ss_pred             eEEEEEEeccCCcEEEEEecCCC
Confidence            56899999999999998765553


No 87 
>cd04978 Ig4_L1-NrCAM_like Fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). Ig4_L1-NrCAM_like: fourth immunoglobulin (Ig)-like domain of L1, Ng-CAM (Neuron-glia CAM cell adhesion molecule), and NrCAM (Ng-CAM-related). These proteins belong to the L1 subfamily of cell adhesion molecules (CAMs) and are comprised of an extracellular region having six Ig-like domains and five fibronectin type III domains, a transmembrane region and an intracellular domain. These molecules are primarily expressed in the nervous system. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth.
Probab=26.26  E-value=44  Score=18.50  Aligned_cols=11  Identities=18%  Similarity=0.268  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        51 ~D~G~Y~C~A~   61 (76)
T cd04978          51 NDTAVYQCNAS   61 (76)
T ss_pred             hhCEEEEEEEE
Confidence            46799999885


No 88 
>cd05737 Ig_Myomesin_like_C C-temrinal immunoglobulin (Ig)-like domain of myomesin and M-protein. Ig_Myomesin_like_C: domain similar to the C-temrinal immunoglobulin (Ig)-like domain of myomesin and M-protein. Myomesin and M-protein are both structural proteins localized to the M-band, a transverse structure in the center of the sarcomere, and are candidates for M-band bridges. Both proteins are modular, consisting mainly of repetitive Ig-like and fibronectin type III (FnIII) domains. Myomesin is expressed in all types of vertebrate striated muscle; M-protein has a muscle-type specific expression pattern. Myomesin is present in both slow and fast fibers; M-protein is present only in fast fibers. It has been suggested that myomesin acts as a molecular spring with alternative splicing as a means of modifying its elasticity.
Probab=25.56  E-value=44  Score=19.78  Aligned_cols=11  Identities=18%  Similarity=0.622  Sum_probs=9.1

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-+|.|+|.++
T Consensus        68 ~D~G~Y~C~a~   78 (92)
T cd05737          68 EDSGKYGIVVK   78 (92)
T ss_pred             hhCEEEEEEEE
Confidence            45799999886


No 89 
>cd05727 Ig2_Contactin-2-like Second Ig domain of the neural cell adhesion molecule contactin-2 and similar proteins. Ig2_Contactin-2-like: second Ig domain of the neural cell adhesion molecule contactin-2. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (aliases TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. The first four Ig domains form the intermolecular binding fragment which arranges as a compact U-shaped module by contacts between Ig domains 1 and 4, and domains 2 and 3. It has been proposed that a linear zipper-like array forms, from contactin-2 molecules alternatively provided by the two apposed membranes.
Probab=25.55  E-value=78  Score=20.41  Aligned_cols=26  Identities=15%  Similarity=0.278  Sum_probs=15.9

Q ss_pred             eeeCCCCCce---eeeccccceeEEEEec
Q psy1572          13 LISTNGSNSI---AFLHSTGQFQALVQLA   38 (74)
Q Consensus        13 ~~~~~~~n~i---eels~~G~yqcLV~LS   38 (74)
                      +++.+.||-+   -+.+=.|.|+|.++-.
T Consensus        54 ~~~~~~G~L~fs~v~~~D~g~Y~C~A~n~   82 (96)
T cd05727          54 FVSQTNGNLYIAKVEASDRGNYSCFVSSP   82 (96)
T ss_pred             EEeCCCCcEEEeecCHhhCceeEEEEEec
Confidence            3444555532   3344569999999754


No 90 
>PF00388 PI-PLC-X:  Phosphatidylinositol-specific phospholipase C, X domain This entry is for the whole phospholipase C protein;  InterPro: IPR000909 Phosphatidylinositol-specific phospholipase C (3.1.4.11 from EC), a eukaryotic intracellular enzyme, plays an important role in signal transduction processes []. It catalyzes the hydrolysis of 1-phosphatidyl-D-myo-inositol-3,4,5-triphosphate into the second messenger molecules diacylglycerol and inositol-1,4,5-triphosphate. This catalytic process is tightly regulated by reversible phosphorylation and binding of regulatory proteins [, , ]. In mammals, there are at least 6 different isoforms of PI-PLC, they differ in their domain structure, their regulation, and their tissue distribution. Lower eukaryotes also possess multiple isoforms of PI-PLC. All eukaryotic PI-PLCs contain two regions of homology, sometimes referred to as the 'X-box' and 'Y-box'. The order of these two regions is always the same (NH2-X-Y-COOH), but the spacing is variable. In most isoforms, the distance between these two regions is only 50-100 residues but in the gamma isoforms one PH domain, two SH2 domains, and one SH3 domain are inserted between the two PLC-specific domains. The two conserved regions have been shown to be important for the catalytic activity. By profile analysis, we could show that sequences with significant similarity to the X-box domain occur also in prokaryotic and trypanosome PI-specific phospholipases C. Apart from this region, the prokaryotic enzymes show no similarity to their eukaryotic counterparts.; GO: 0004629 phospholipase C activity, 0006629 lipid metabolic process, 0035556 intracellular signal transduction; PDB: 2FJU_B 2ZKM_X 3V18_A 3V1H_A 3V16_A 3QR1_D 3EA3_A 3EA1_A 2OR2_A 1T6M_B ....
Probab=25.50  E-value=78  Score=20.62  Aligned_cols=7  Identities=43%  Similarity=0.738  Sum_probs=5.1

Q ss_pred             ceEeecC
Q psy1572          42 VAVCHGQ   48 (74)
Q Consensus        42 ~~VCHGs   48 (74)
                      +.||||.
T Consensus        55 ~~v~Hg~   61 (146)
T PF00388_consen   55 LVVYHGI   61 (146)
T ss_dssp             EEEEETT
T ss_pred             eEEEeCC
Confidence            6777776


No 91 
>cd05748 Ig_Titin_like Immunoglobulin (Ig)-like domain of titin and similar proteins. Ig_Titin_like: immunoglobulin (Ig)-like domain found in titin-like proteins. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic, depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone.  It appears to function similarly to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching. Within the sarcomere, titin is also attached to or is associated with myosin binding protein C (MyBP-C). MyBP-C appears to contribute to the generation of passive tension by titin, and similar to titin has repeated Ig-like and FN-
Probab=25.45  E-value=46  Score=18.59  Aligned_cols=11  Identities=9%  Similarity=0.480  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-+|.|.|.++
T Consensus        50 ~D~G~Y~C~a~   60 (74)
T cd05748          50 SDSGKYTLTLK   60 (74)
T ss_pred             CcCEEEEEEEE
Confidence            46799999886


No 92 
>cd04974 Ig3_FGFR Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig3_FGFR: third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs.
Probab=25.41  E-value=45  Score=19.80  Aligned_cols=11  Identities=18%  Similarity=0.561  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .=.|.|+|.++
T Consensus        65 ~D~G~Y~C~A~   75 (90)
T cd04974          65 DDAGEYTCLAG   75 (90)
T ss_pred             ccCcEEEEEee
Confidence            35799999986


No 93 
>cd05736 Ig2_Follistatin_like Second immunoglobulin (Ig)-like domain of a follistatin-like molecule encoded by the Mahya gene and similar proteins. Ig2_Follistatin_like: domain similar to the second immunoglobulin (Ig)-like domain found in a follistatin-like molecule encoded by the CNS-related Mahya gene. Mahya genes have been retained in certain Bilaterian branches during evolution.  They are conserved in Hymenoptera and Deuterostomes, but are absent from other metazoan species such as fruit fly and nematode. Mahya proteins are secretory, with a follistatin-like domain (Kazal-type serine/threonine protease inhibitor domain and EF-hand calcium-binding domain), two Ig-like domains, and a novel C-terminal domain. Mahya may be involved in learning and memory and in processing of sensory information in Hymenoptera and vertebrates. Follistatin is a secreted, multidomain protein that binds activins with high affinity and antagonizes their signaling.
Probab=25.37  E-value=46  Score=18.72  Aligned_cols=11  Identities=18%  Similarity=0.600  Sum_probs=9.1

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-+|.|.|.++
T Consensus        49 ~D~G~Y~C~a~   59 (76)
T cd05736          49 EDTGAYTCIAK   59 (76)
T ss_pred             ccCEEEEEEEE
Confidence            46799999886


No 94 
>TIGR00702 uncharacterized domain. This uncharacterized domain comprises the whole of a protein in Methanococcus jannaschii and Methanobacterium thermoautotrophicum, all but the N-terminal 60 residues from a protein of Mycobacterium tuberculosis, and all but the C-terminal 180 residues from a protein in Haemophilus influenzae and Escherichia coli, among proteins from published complete genomes.
Probab=25.07  E-value=94  Score=24.09  Aligned_cols=26  Identities=23%  Similarity=0.415  Sum_probs=23.9

Q ss_pred             CceEeecCCCCHHHHHHHHHHHHHHH
Q psy1572          41 PVAVCHGQGKTQQEAKTVAAHNAIEY   66 (74)
Q Consensus        41 P~~VCHGsG~S~~eA~~~AA~NALqy   66 (74)
                      |..|..|-|.|.++|+-.|.--|+|-
T Consensus        47 ~~~~~~GKG~t~~~A~~SAlmEaiER   72 (377)
T TIGR00702        47 ALSISNGKGASKKAALASALMEAFER   72 (377)
T ss_pred             CceeccCCCCCHHHHHHHHHHHHHHH
Confidence            78899999999999999999999883


No 95 
>TIGR03266 methan_mark_1 putative methanogenesis marker protein 1. Members of this protein family represent a distinct clade among the larger set of proteins that belong to families TIGR00702 and pfam02624. Proteins from this clade are found in genome sequence if and only if the species sequenced is one of the methanogens. All methanogens belong to the archaea; some but not all of those sequenced are hyperthermophiles. This protein family was detected by the method of partial phylogenetic profiling (see Haft, et al., 2006).
Probab=24.80  E-value=1e+02  Score=23.89  Aligned_cols=24  Identities=29%  Similarity=0.509  Sum_probs=22.7

Q ss_pred             ceEeecCCCCHHHHHHHHHHHHHH
Q psy1572          42 VAVCHGQGKTQQEAKTVAAHNAIE   65 (74)
Q Consensus        42 ~~VCHGsG~S~~eA~~~AA~NALq   65 (74)
                      ++|+-|-|.|.++|+-.|--.|+|
T Consensus        52 ~~v~~GKG~t~~~A~~sAlmEaiE   75 (376)
T TIGR03266        52 ISIYAGKGATEIQARVSAIMEAFE   75 (376)
T ss_pred             eeEecCCCCCHHHHHHHHHHHHHH
Confidence            779999999999999999999998


No 96 
>PRK15486 hpaC 4-hydroxyphenylacetate 3-monooxygenase reductase subunit; Provisional
Probab=24.59  E-value=1e+02  Score=21.63  Aligned_cols=39  Identities=21%  Similarity=0.264  Sum_probs=25.9

Q ss_pred             eeeeEEeeeCCCCCceeeeccccceeEEEEeccCCceEeecCC
Q psy1572           7 LDTRVCLISTNGSNSIAFLHSTGQFQALVQLATYPVAVCHGQG   49 (74)
Q Consensus         7 ~~~~v~~~~~~~~n~ieels~~G~yqcLV~LST~P~~VCHGsG   49 (74)
                      |-+-|+++.+..+...-.+|+|    .+.++|.+||.|.-...
T Consensus        17 ~a~GV~VVTt~~~~~~~G~Tvs----s~~SvSldPPlvlv~l~   55 (170)
T PRK15486         17 LSAAVNIVTTAGDAGRCGITAT----AVCSVTDTPPSVMVCIN   55 (170)
T ss_pred             cCCceEEEEEecCCCcEEEEEE----EEEEeEcCCCEEEEEEC
Confidence            4566888876544333456655    47899999998765544


No 97 
>KOG3075|consensus
Probab=24.50  E-value=1e+02  Score=24.28  Aligned_cols=30  Identities=37%  Similarity=0.348  Sum_probs=25.9

Q ss_pred             EEEEeccCCceEeecCCCCHHHHHHHHHHHHHH
Q psy1572          33 ALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIE   65 (74)
Q Consensus        33 cLV~LST~P~~VCHGsG~S~~eA~~~AA~NALq   65 (74)
                      -.+. ||.|.++  -+..+.+++..-||++|-+
T Consensus         8 ~~~~-s~~~~~~--~~~~~~e~~kr~Aa~~avd   37 (261)
T KOG3075|consen    8 GTGS-STSPRAV--ESLSPQEEAKRLAAYKAVD   37 (261)
T ss_pred             eeec-cCCCCcc--cccCchHHHHHHHHhhhhh
Confidence            3455 6999999  8899999999999999975


No 98 
>PRK06806 fructose-bisphosphate aldolase; Provisional
Probab=23.83  E-value=60  Score=24.54  Aligned_cols=17  Identities=29%  Similarity=0.450  Sum_probs=13.8

Q ss_pred             ceEeec-CCCCHHHHHHH
Q psy1572          42 VAVCHG-QGKTQQEAKTV   58 (74)
Q Consensus        42 ~~VCHG-sG~S~~eA~~~   58 (74)
                      |.|.|| ||.+.++.+.-
T Consensus       202 PlV~hG~SGI~~e~~~~~  219 (281)
T PRK06806        202 PLVLHGGSGISPEDFKKC  219 (281)
T ss_pred             CEEEECCCCCCHHHHHHH
Confidence            579998 89999987653


No 99 
>cd05712 Ig_Siglec_N Immunoglobulin (Ig) domain at the N terminus of Siglec (sialic acid-binding Ig-like lectins). Ig_Siglec_N: immunoglobulin (Ig) domain at the N terminus of Siglec (sialic acid-binding Ig-like lectins). Siglec refers to a structurally related protein family that specifically recognizes sialic acid in oligosaccharide chains of glycoproteins and glycolipids. Siglecs are type I transmembrane proteins, organized as an extracellular module composed of Ig-like domains (an N-terminal variable set of Ig-like carbohydrate recognition domains, and 1 to 16 constant Ig-like domains), followed by transmembrane and short cytoplasmic domains. Human siglecs are classified into two subgroups, one subgroup is comprised of sialoadhesin (Siglec-1), CD22 (Siglec-2), and MAG, the other subgroup is comprised of CD33-related Siglecs which include CD33 (Siglec-3) and human Siglecs 5-11.
Probab=23.64  E-value=98  Score=19.39  Aligned_cols=26  Identities=12%  Similarity=0.074  Sum_probs=18.2

Q ss_pred             eeccccceeEEEEeccCCceEeecCCC
Q psy1572          24 FLHSTGQFQALVQLATYPVAVCHGQGK   50 (74)
Q Consensus        24 els~~G~yqcLV~LST~P~~VCHGsG~   50 (74)
                      +.+=+|.|.|.++.+. +...=.|.|.
T Consensus        88 ~~~Dsg~Y~C~~~~~~-~~~~~f~~gt  113 (119)
T cd05712          88 QPEDSGKYFFRVELGD-SNKYNFLKNT  113 (119)
T ss_pred             ChhhCeEEEEEEEcCC-ccEEEecCCe
Confidence            3455799999999973 4455566664


No 100
>cd04972 Ig_TrkABC_d4 Fourth domain (immunoglobulin-like) of Trk receptors TrkA, TrkB and TrkC. TrkABC_d4: the fourth domain of Trk receptors TrkA, TrkB and TrkC, this is an immunoglobulin (Ig)-like domain which binds to neurotrophin. The Trk family of receptors are tyrosine kinase receptors. They are activated by dimerization, leading to autophosphorylation of intracellular tyrosine residues, and triggering the signal transduction pathway. TrkA, TrkB, and TrkC share significant sequence homology and domain organization. The first three domains are leucine-rich domains. The fourth and fifth domains are Ig-like domains playing a part in ligand binding. TrkA, Band C mediate the trophic effects of the neurotrophin Nerve growth factor (NGF) family. TrkA is recognized by NGF. TrKB is recognized by brain-derived neurotrophic factor (BDNF) and neurotrophin (NT)-4. TrkC is recognized by NT-3. NT-3 is promiscuous as in some cell systems it activates TrkA and TrkB receptors. TrkA is a receptor fo
Probab=23.61  E-value=51  Score=19.34  Aligned_cols=9  Identities=0%  Similarity=0.003  Sum_probs=7.9

Q ss_pred             ccceeEEEE
Q psy1572          28 TGQFQALVQ   36 (74)
Q Consensus        28 ~G~yqcLV~   36 (74)
                      .|.|+|.++
T Consensus        68 ~g~Y~C~A~   76 (90)
T cd04972          68 QTTVTCTAE   76 (90)
T ss_pred             CEEEEEEEE
Confidence            699999886


No 101
>cd05752 Ig1_FcgammaR_like Frst immunoglobulin (Ig)-like domain of  Fcgamma-receptors (FcgammaRs) and similar proteins. Ig1_FcgammaR_like: domain similar to the first immunoglobulin (Ig)-like domain of  Fcgamma-receptors (FcgammaRs). Interactions between IgG and FcgammaR are important to the initiation of cellular and humoral response. IgG binding to FcgammaR leads to a cascade of signals and ultimately to functions such as antibody-dependent-cellular-cytotoxicity (ADCC), endocytosis, phagocytosis, release of inflammatory mediators, etc. FcgammaR has two Ig-like domains. This group also contains FcepsilonRI, which binds IgE with high affinity.
Probab=23.37  E-value=31  Score=20.60  Aligned_cols=12  Identities=17%  Similarity=0.465  Sum_probs=9.7

Q ss_pred             eccccceeEEEE
Q psy1572          25 LHSTGQFQALVQ   36 (74)
Q Consensus        25 ls~~G~yqcLV~   36 (74)
                      ..-+|.|+|-+.
T Consensus        55 ~~dsG~Y~C~a~   66 (78)
T cd05752          55 NNDSGEYRCQTQ   66 (78)
T ss_pred             ccCCEEeEEECC
Confidence            457899999874


No 102
>cd04969 Ig5_Contactin_like Fifth Ig domain of contactin. Ig5_Contactin_like: Fifth Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=23.35  E-value=53  Score=18.44  Aligned_cols=11  Identities=9%  Similarity=0.537  Sum_probs=8.8

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        49 ~D~G~Y~C~a~   59 (73)
T cd04969          49 SDEGKYTCFAE   59 (73)
T ss_pred             hHCEEEEEEEE
Confidence            35699999985


No 103
>cd05746 Ig4_Peroxidasin Fourth immunoglobulin (Ig)-like domain of peroxidasin. Ig4_Peroxidasin: the fourth immunoglobulin (Ig)-like domain in peroxidasin. Peroxidasin has a peroxidase domain and interacting extracellular motifs containing four Ig-like domains. It has been suggested that peroxidasin is secreted, and has functions related to the stabilization of the extracellular matrix. It may play a part in various other important processes such as removal and destruction of cells, which have undergone programmed cell death, and protection of the organism against non-self.
Probab=23.32  E-value=53  Score=18.39  Aligned_cols=10  Identities=10%  Similarity=0.644  Sum_probs=8.2

Q ss_pred             cccceeEEEE
Q psy1572          27 STGQFQALVQ   36 (74)
Q Consensus        27 ~~G~yqcLV~   36 (74)
                      -.|.|.|.++
T Consensus        47 D~G~Y~C~a~   56 (69)
T cd05746          47 DQGRYECVAR   56 (69)
T ss_pred             hCEEEEEEEE
Confidence            4699999985


No 104
>cd05747 Ig5_Titin_like M5, fifth immunoglobulin (Ig)-like domain of human titin C terminus and similar proteins. Ig5_Titin_like: domain similar to the M5, fifth immunoglobulin (Ig)-like domain from the human titin C terminus. Titin (also called connectin) is a fibrous sarcomeric protein specifically found in vertebrate striated muscle. Titin is gigantic; depending on isoform composition it ranges from 2970 to 3700 kDa, and is of a length that spans half a sarcomere. Titin largely consists of multiple repeats of Ig-like and fibronectin type 3 (FN-III)-like domains. Titin connects the ends of myosin thick filaments to Z disks and extends along the thick filament to the H zone, and appears to function similar to an elastic band, keeping the myosin filaments centered in the sarcomere during muscle contraction or stretching.
Probab=23.15  E-value=53  Score=19.36  Aligned_cols=11  Identities=18%  Similarity=0.595  Sum_probs=8.9

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      +-.|.|+|.++
T Consensus        69 ~D~G~Y~C~a~   79 (92)
T cd05747          69 SDEGNYTVVVE   79 (92)
T ss_pred             ccCEeEEEEEE
Confidence            56799999865


No 105
>cd04973 Ig1_FGFR First immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Ig1_FGFR: The first immunoglobulin (Ig)-like domain of fibroblast growth factor receptor (FGFR). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for all FGFs.
Probab=23.13  E-value=51  Score=19.09  Aligned_cols=12  Identities=8%  Similarity=0.354  Sum_probs=9.5

Q ss_pred             eccccceeEEEE
Q psy1572          25 LHSTGQFQALVQ   36 (74)
Q Consensus        25 ls~~G~yqcLV~   36 (74)
                      ..-+|.|+|.+.
T Consensus        54 ~~DsG~Y~C~a~   65 (79)
T cd04973          54 PRDSGLYACVTS   65 (79)
T ss_pred             hhhCEEEEEEEe
Confidence            346899999985


No 106
>cd05856 Ig2_FGFRL1-like Second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). Ig2_FGFRL1-like: second immunoglobulin (Ig)-like domain of fibroblast growth factor (FGF) receptor_like-1(FGFRL1). FGFRL1 is comprised of a signal peptide, three extracellular Ig-like modules, a transmembrane segment, and a short intracellular domain. FGFRL1 is expressed preferentially in skeletal tissues. Similar to FGF receptors, the expressed protein interacts specifically with heparin and with FGF2.  FGFRL1 does not have a protein tyrosine kinase domain at its C terminus; neither does its cytoplasmic domain appear to interact with a signaling partner. It has been suggested that FGFRL1 may not have any direct signaling function, but instead acts as a decoy receptor trapping FGFs and preventing them from binding other receptors.
Probab=23.00  E-value=55  Score=18.05  Aligned_cols=11  Identities=18%  Similarity=0.540  Sum_probs=8.9

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-+|.|.|.++
T Consensus        58 ~D~G~Y~C~a~   68 (82)
T cd05856          58 EDSGKYTCHVS   68 (82)
T ss_pred             hhCEEEEEEEE
Confidence            35799999886


No 107
>PF13827 DUF4189:  Domain of unknown function (DUF4189)
Probab=22.95  E-value=83  Score=19.46  Aligned_cols=18  Identities=39%  Similarity=0.446  Sum_probs=13.9

Q ss_pred             eecCCCCHHHHHHHHHHH
Q psy1572          45 CHGQGKTQQEAKTVAAHN   62 (74)
Q Consensus        45 CHGsG~S~~eA~~~AA~N   62 (74)
                      =-|.|+|.++|+++|-.+
T Consensus        66 ~~~~~~t~~~A~~~Al~~   83 (100)
T PF13827_consen   66 GWGTGPTREEAEQDALAN   83 (100)
T ss_pred             EEecCCCHHHHHHHHHHH
Confidence            678899999888776543


No 108
>PRK01022 hypothetical protein; Provisional
Probab=22.72  E-value=45  Score=23.89  Aligned_cols=20  Identities=15%  Similarity=0.213  Sum_probs=17.4

Q ss_pred             eCCCCCceeeeccccceeEEEE
Q psy1572          15 STNGSNSIAFLHSTGQFQALVQ   36 (74)
Q Consensus        15 ~~~~~n~ieels~~G~yqcLV~   36 (74)
                      ..|||..|++  -+|.|.|-|.
T Consensus       126 ~FPGG~~V~~--~~g~yYCPVk  145 (167)
T PRK01022        126 PFPGGFKVEE--KNGVYYCPVK  145 (167)
T ss_pred             CCCCCeEEEe--ECCEEeCccc
Confidence            3699999999  8999999874


No 109
>cd05756 Ig1_IL1R_like First immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R) and similar proteins. Ig1_IL1R_like: domain similar to the first immunoglobulin (Ig)-like domain of interleukin-1 receptor (IL1R). IL-1 alpha and IL-1 beta are cytokines which participate in the regulation of inflammation, immune responses, and hematopoiesis. These cytokines bind to the IL-1 receptor type 1 (IL1R1), which is activated on additional association with an accessory protein, IL1RAP. IL-1 also binds a second receptor designated type II (IL1R2). Mature IL1R1 consists of three Ig-like domains, a transmembrane domain, and a large cytoplasmic domain. Mature IL1R2 is organized similarly except that it has a short cytoplasmic domain. The latter does not initiate signal transduction. A naturally occurring cytokine IL-1RA (IL-1 receptor antagonist) is widely expressed and binds to IL-1 receptors, inhibiting the binding of IL-1 alpha and IL-1 beta.
Probab=22.69  E-value=48  Score=20.37  Aligned_cols=15  Identities=27%  Similarity=0.541  Sum_probs=11.1

Q ss_pred             eccccceeEEEEecc
Q psy1572          25 LHSTGQFQALVQLAT   39 (74)
Q Consensus        25 ls~~G~yqcLV~LST   39 (74)
                      ..-+|.|+|.++=++
T Consensus        69 ~~DsG~Y~C~~~N~~   83 (94)
T cd05756          69 LEDSGLYTCVVRNST   83 (94)
T ss_pred             cccCeEEEEEEcCCC
Confidence            345799999987544


No 110
>cd05743 Ig_Perlecan_D2_like Immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2. Ig_Perlecan_D2_like: the immunoglobulin (Ig)-like domain II (D2) of the human basement membrane heparan sulfate proteoglycan perlecan, also known as HSPG2. Perlecan consists of five domains. Domain I has three putative heparan sulfate attachment sites; domain II has four LDL receptor-like repeats, and one Ig-like repeat; domain III resembles the short arm of laminin chains; domain IV has multiple Ig-like repeats (21 repeats in human perlecan); and domain V resembles the globular G domain of the laminin A chain and internal repeats of EGF. Perlecan may participate in a variety of biological functions including cell binding, LDL-metabolism, basement membrane assembly and selective permeability, calcium binding, and growth- and neurite-promoting activities.
Probab=22.48  E-value=56  Score=18.73  Aligned_cols=10  Identities=10%  Similarity=0.265  Sum_probs=8.1

Q ss_pred             cccceeEEEE
Q psy1572          27 STGQFQALVQ   36 (74)
Q Consensus        27 ~~G~yqcLV~   36 (74)
                      -.|.|.|.++
T Consensus        53 D~G~Y~C~a~   62 (78)
T cd05743          53 DQGAYTCEAI   62 (78)
T ss_pred             HCEEEEEEEE
Confidence            4699999975


No 111
>PF07679 I-set:  Immunoglobulin I-set domain;  InterPro: IPR013098 The basic structure of immunoglobulin (Ig) molecules is a tetramer of two light chains and two heavy chains linked by disulphide bonds. There are two types of light chains: kappa and lambda, each composed of a constant domain (CL) and a variable domain (VL). There are five types of heavy chains: alpha, delta, epsilon, gamma and mu, all consisting of a variable domain (VH) and three (in alpha, delta and gamma) or four (in epsilon and mu) constant domains (CH1 to CH4). Ig molecules are highly modular proteins, in which the variable and constant domains have clear, conserved sequence patterns. The domains in Ig and Ig-like molecules are grouped into four types: V-set (variable; IPR013106 from INTERPRO), C1-set (constant-1; IPR003597 from INTERPRO), C2-set (constant-2; IPR008424 from INTERPRO) and I-set (intermediate; IPR013098 from INTERPRO) []. Structural studies have shown that these domains share a common core Greek-key beta-sandwich structure, with the types differing in the number of strands in the beta-sheets as well as in their sequence patterns [, ]. Immunoglobulin-like domains that are related in both sequence and structure can be found in several diverse protein families. Ig-like domains are involved in a variety of functions, including cell-cell recognition, cell-surface receptors, muscle structure and the immune system [].  This entry represents I-set domains, which are found in several cell adhesion molecules, including vascular (VCAM), intercellular (ICAM), neural (NCAM) and mucosal addressin (MADCAM) cell adhesion molecules, as well as junction adhesion molecules (JAM). I-set domains are also present in several other diverse protein families, including several tyrosine-protein kinase receptors, the hemolymph protein hemolin, the muscle proteins titin, telokin, and twitchin, the neuronal adhesion molecule axonin-1 [], and the signalling molecule semaphorin 4D that is involved in axonal guidance, immune function and angiogenesis [].; PDB: 3MTR_A 2EDK_A 3DMK_B 1KOA_A 3NCM_A 2NCM_A 2V9Q_A 2CR3_A 3QQN_A 3QR2_A ....
Probab=22.27  E-value=58  Score=18.38  Aligned_cols=14  Identities=7%  Similarity=0.368  Sum_probs=10.9

Q ss_pred             ccccceeEEEEecc
Q psy1572          26 HSTGQFQALVQLAT   39 (74)
Q Consensus        26 s~~G~yqcLV~LST   39 (74)
                      .-.|.|.|.++-+.
T Consensus        66 ~D~G~Y~C~~~n~~   79 (90)
T PF07679_consen   66 EDAGTYTCVASNSS   79 (90)
T ss_dssp             GGSEEEEEEEEETT
T ss_pred             hhCEEEEEEEEECC
Confidence            45799999998653


No 112
>cd05854 Ig6_Contactin-2 Sixth Ig domain of contactin-2. Ig6_Contactin-2: Sixth Ig domain of the neural cell adhesion molecule contactin-2-like. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-2 (TAG-1, axonin-1) facilitates cell adhesion by homophilic binding between molecules in apposed membranes. It may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module by contacts between IG domains 1 and 4, and domains 2 and 3. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-2 is also expressed in retinal amacrine cells in the developing c
Probab=22.20  E-value=56  Score=19.46  Aligned_cols=12  Identities=17%  Similarity=0.351  Sum_probs=9.3

Q ss_pred             ccccceeEEEEe
Q psy1572          26 HSTGQFQALVQL   37 (74)
Q Consensus        26 s~~G~yqcLV~L   37 (74)
                      +-.|.|.|.++=
T Consensus        57 ~D~G~YtC~A~n   68 (85)
T cd05854          57 SHAGTYTCTAQT   68 (85)
T ss_pred             hhCeEEEEEEec
Confidence            357999999764


No 113
>cd05730 Ig3_NCAM-1_like Third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). Ig3_NCAM-1_like: domain similar to the third immunoglobulin (Ig)-like domain of Neural Cell Adhesion Molecule NCAM-1 (NCAM). NCAM plays important roles in the development and regeneration of the central nervous system, in synaptogenesis and neural migration. NCAM mediates cell-cell and cell-substratum recognition and adhesion via homophilic (NCAM-NCAM), and heterophilic (NCAM-non-NCAM), interactions. NCAM is expressed as three major isoforms having different intracellular extensions. The extracellular portion of NCAM has five N-terminal Ig-like domains and two fibronectin type III domains. The double zipper adhesion complex model for NCAM homophilic binding involves Ig1, Ig2, and Ig3. By this model, Ig1,and Ig2 mediate dimerization of NCAM molecules situated on the same cell surface (cis interactions), and Ig3 domains mediate interactions between NCAM molecules expressed on the
Probab=22.09  E-value=57  Score=19.12  Aligned_cols=11  Identities=0%  Similarity=0.332  Sum_probs=8.7

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|.|.++
T Consensus        68 ~D~G~Y~C~a~   78 (95)
T cd05730          68 LDEAEYTCIAE   78 (95)
T ss_pred             hhCEEEEEEEE
Confidence            45789999875


No 114
>cd02651 nuc_hydro_IU_UC_XIUA nuc_hydro_IU_UC_XIUA: inosine-uridine preferring, xanthosine-inosine-uridine-adenosine-preferring and, uridine-cytidine preferring nucleoside hydrolases.  Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains proteins similar to nucleoside hydrolases which hydrolyze both pyrimidine and purine ribonucleosides: the inosine-uridine preferring nucleoside hydrolase from Crithidia fasciculata, the inosine-uridine-xanthosine preferring nucleoside hydrolase RihC from Escherichia coli and the xanthosine-inosine-uridine-adenosine-preferring nucleoside hydrolase RihC from Salmonella enterica serovar Typhimurium. This group also contains proteins similar to the pyrimidine-specific uridine-cytidine preferring nucleoside hydrolases URH1 from Saccharomyces cerevisiae, E. coli RihA and E. coli RihB.  E. coli  RihA is equally efficient with uridine a
Probab=21.83  E-value=1.1e+02  Score=22.57  Aligned_cols=26  Identities=19%  Similarity=0.342  Sum_probs=18.3

Q ss_pred             ceEeecCCCCHHHHHHHHHHHHHHHHHHhhc
Q psy1572          42 VAVCHGQGKTQQEAKTVAAHNAIEYLRLMTK   72 (74)
Q Consensus        42 ~~VCHGsG~S~~eA~~~AA~NALqyLkiM~~   72 (74)
                      ++.++|.-.     .+++++||+..|..+..
T Consensus        30 It~v~Gn~~-----~~~~~~na~~ll~~~g~   55 (302)
T cd02651          30 ITTVAGNVP-----LEKTTRNALKLLTLLGR   55 (302)
T ss_pred             EEeccCeec-----HHHHHHHHHHHHHHhCC
Confidence            445565443     46789999999988864


No 115
>cd05891 Ig_M-protein_C C-terminal immunoglobulin (Ig)-like domain of M-protein (also known as myomesin-2). Ig_M-protein_C: the C-terminal immunoglobulin (Ig)-like domain of M-protein (also known as myomesin-2). M-protein is a structural protein localized to the M-band, a transverse structure in the center of the sarcomere, and is a candidate for M-band bridges. M-protein is modular consisting mainly of repetitive IG-like and fibronectin type III (FnIII) domains, and has a muscle-type specific expression pattern. M-protein is present in fast fibers.
Probab=21.67  E-value=58  Score=19.53  Aligned_cols=11  Identities=18%  Similarity=0.513  Sum_probs=9.4

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-.|.|+|.++
T Consensus        68 ~D~G~Y~C~a~   78 (92)
T cd05891          68 EDSGKYSINVK   78 (92)
T ss_pred             hhCEEEEEEEE
Confidence            56899999887


No 116
>TIGR03693 ocin_ThiF_like putative thiazole-containing bacteriocin maturation protein. Members of this protein family are found in a three-gene operon in Bacillus anthracis and related Bacillus species, where the other two genes are clearly identified with maturation of a putative thiazole-containing bacteriocin precursor. While there is no detectable pairwise sequence similarity between members of this family and the proposed cyclodehydratases such as SagC of Streptococcus pyogenes (see family TIGR03603), both families show similarity through PSI-BLAST to ThiF, a protein involved in biosynthesis of the thiazole moiety for thiamine biosynthesis. This family, therefore, may contribute to cyclodehydratase function in heterocycle-containing bacteriocin biosyntheses. In Bacillus licheniformis ATCC 14580, the bacteriocin precursor gene is adjacent to the gene for this protein.
Probab=21.25  E-value=71  Score=27.83  Aligned_cols=45  Identities=31%  Similarity=0.365  Sum_probs=31.2

Q ss_pred             eeccccceeEEEEec---------cCCceEeecCCCCHHHHHHHHHHHHHHH-HHHh
Q psy1572          24 FLHSTGQFQALVQLA---------TYPVAVCHGQGKTQQEAKTVAAHNAIEY-LRLM   70 (74)
Q Consensus        24 els~~G~yqcLV~LS---------T~P~~VCHGsG~S~~eA~~~AA~NALqy-LkiM   70 (74)
                      +++.--.-||-|+..         ..|..||  +|-|-+|||.+|+.-++|. ...|
T Consensus       383 ~l~QlPL~~~~~~~~~p~~~gp~~~l~~~~~--~g~th~earreagl~gie~y~~~~  437 (637)
T TIGR03693       383 DLHQLPLAQCFIQTADPLSDGPAELLPEITC--SGLTHNEARREAGLTGIEMYAAEI  437 (637)
T ss_pred             cccccchhhceEEecCCCCCCcccccCCcee--cccchHHHHHhhchhhHHHHHHHH
Confidence            344444457777742         3566788  5789999999999999984 4443


No 117
>PF06675 DUF1177:  Protein of unknown function (DUF1177);  InterPro: IPR009561 This family consists of several hypothetical archaeal and bacterial proteins of around 300 residues in length. The function of this family is unknown.
Probab=21.18  E-value=1.6e+02  Score=23.43  Aligned_cols=29  Identities=17%  Similarity=0.185  Sum_probs=26.1

Q ss_pred             ecCCCCHHHHHHHHHHHHHHHHHHhhcCC
Q psy1572          46 HGQGKTQQEAKTVAAHNAIEYLRLMTKSK   74 (74)
Q Consensus        46 HGsG~S~~eA~~~AA~NALqyLkiM~~~k   74 (74)
                      .++|.|-+.--+.|+|-++|..|.+++++
T Consensus       220 caTGash~~di~~A~RF~iEvAK~fg~g~  248 (276)
T PF06675_consen  220 CATGASHEVDIEHAVRFCIEVAKEFGRGK  248 (276)
T ss_pred             cccccCCHHHHHHHHHHHHHHHHHHcCCC
Confidence            58999999999999999999999988654


No 118
>cd05853 Ig6_Contactin-4 Sixth Ig domain of contactin-4. Ig6_Contactin-4: sixth Ig domain of the neural cell adhesion molecule contactin-4. Contactins are neural cell adhesion molecules, and are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The different contactins show different expression patterns in the central nervous system. Highest expresson of contactin-4 is in testes, thyroid, small intestine, uterus and brain. Contactin-4 plays a role in the response of neuroblastoma cells to differentiating agents, such as retinoids. The contactin 4 gene is associated with cerebellar degeneration in spinocerebellar ataxia type 16.
Probab=21.14  E-value=59  Score=20.18  Aligned_cols=11  Identities=27%  Similarity=0.613  Sum_probs=8.9

Q ss_pred             cccceeEEEEe
Q psy1572          27 STGQFQALVQL   37 (74)
Q Consensus        27 ~~G~yqcLV~L   37 (74)
                      -.|.|.|.++=
T Consensus        58 dsG~YtC~a~n   68 (85)
T cd05853          58 HAGKYVCMVQT   68 (85)
T ss_pred             HCEEEEEEEEc
Confidence            46999999874


No 119
>cd05900 Ig_Aggrecan Immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), aggrecan. Ig_Aggrecan: immunoglobulin (Ig)-like domain of the chondroitin sulfate proteoglycan core protein (CSPG), aggrecan. These aggregates contribute to the tissue's load bearing properties. In CSPGs, the Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggrecan has a wide distribution in connective tissue and extracellular matrices. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
Probab=21.09  E-value=1.2e+02  Score=19.73  Aligned_cols=27  Identities=22%  Similarity=0.385  Sum_probs=17.6

Q ss_pred             eEEeee---CCCCCce--eee--ccccceeEEEE
Q psy1572          10 RVCLIS---TNGSNSI--AFL--HSTGQFQALVQ   36 (74)
Q Consensus        10 ~v~~~~---~~~~n~i--eel--s~~G~yqcLV~   36 (74)
                      ||.|..   .++..++  ..+  +=+|.|+|-|.
T Consensus        61 R~sl~~~~~~~~~asL~I~nl~~sDsG~Y~C~V~   94 (112)
T cd05900          61 RVSLPNYPAIPSDATLEITELRSNDSGTYRCEVM   94 (112)
T ss_pred             eEEecccCCCCCCcEEEEeecccccCEEEEEEEe
Confidence            777765   3455554  333  45799999987


No 120
>cd05878 Ig_Aggrecan_like Immunoglobulin (Ig)-like domain of the aggrecan-like chondroitin sulfate proteoglycan core protein (CSPG). Ig_Aggrecan_like: immunoglobulin (Ig)-like domain of the aggrecan-like chondroitin sulfate proteoglycan core protein (CSPG)s. Included in this group are the Ig domains of other CSPGs: versican, and neurocan. In CSPGs this Ig-like domain is followed by hyaluronan (HA)-binding tandem repeats, and a C-terminal region with epidermal growth factor-like, lectin-like, and complement regulatory protein-like domains. Separating these N- and C-terminal regions is a nonhomologous glycosaminoglycan attachment region. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggrecan and versican have a wide distribution in connective tissue and extracellular matrices. Neurocan is localized almost exclusively in nervous tissue. Aggregates having other CSPGs substi
Probab=21.00  E-value=1.4e+02  Score=18.77  Aligned_cols=32  Identities=19%  Similarity=0.213  Sum_probs=17.5

Q ss_pred             eeeeEEeeeCCCCCc-----ee--eeccccceeEEEEec
Q psy1572           7 LDTRVCLISTNGSNS-----IA--FLHSTGQFQALVQLA   38 (74)
Q Consensus         7 ~~~~v~~~~~~~~n~-----ie--els~~G~yqcLV~LS   38 (74)
                      +.-|+.|...+..+.     |.  ..+=+|.|.|-|.-.
T Consensus        56 f~gR~~~~~~~~~~~~asL~I~~l~~sD~G~Y~C~v~~~   94 (110)
T cd05878          56 YQGRVSLPNYPENRGDASLEISRLRSSDSGVYRCEVMHG   94 (110)
T ss_pred             ccCeEEeccCCCCCCcEEEEECCCChhhCEEEEEEEEeC
Confidence            334777754433222     22  223468999998753


No 121
>cd05858 Ig3_FGFR-2 Third immunoglobulin (Ig)-like domain of fibroblast growth factor receptor 2 (FGFR2). Ig3_FGFR-2-like; domain similar to the third immunoglobulin (Ig)-like domain of human fibroblast growth factor receptor 2 (FGFR2). Fibroblast growth factors (FGFs) participate in morphogenesis, development, angiogenesis, and wound healing. These FGF-stimulated processes are mediated by four FGFR tyrosine kinases (FGRF1-4). FGFRs are comprised of an extracellular portion consisting of three Ig-like domains, a transmembrane helix, and a cytoplasmic portion having protein tyrosine kinase activity. The highly conserved Ig-like domains 2 and 3, and the linker region between D2 and D3 define a general binding site for FGFs. FGFR2 is required for male sex determination.
Probab=20.74  E-value=62  Score=19.04  Aligned_cols=10  Identities=20%  Similarity=0.604  Sum_probs=8.2

Q ss_pred             cccceeEEEE
Q psy1572          27 STGQFQALVQ   36 (74)
Q Consensus        27 ~~G~yqcLV~   36 (74)
                      -.|.|+|.+.
T Consensus        66 D~G~Y~C~A~   75 (90)
T cd05858          66 DAGEYTCLAG   75 (90)
T ss_pred             HCEEEEEEEE
Confidence            3699999985


No 122
>cd04979 Ig_Semaphorin_C Immunoglobulin (Ig)-like domain of semaphorin. Ig_Semaphorin_C; Immunoglobulin (Ig)-like domain in semaphorins. Semaphorins are transmembrane protein that have important roles in a variety of tissues. Functionally, semaphorins were initially characterized for their importance in the development of the nervous system and in axonal guidance. Later they have been found to be important for the formation and functioning of the cardiovascular, endocrine, gastrointestinal, hepatic, immune, musculoskeletal, renal, reproductive, and respiratory systems. Semaphorins function through binding to their receptors and transmembrane semaphorins also serves as receptors themselves. Although molecular mechanism of semaphorins is poorly understood, the Ig-like domains may involve in ligand binding or dimerization.
Probab=20.69  E-value=62  Score=18.86  Aligned_cols=14  Identities=7%  Similarity=0.026  Sum_probs=10.7

Q ss_pred             eccccceeEEEEec
Q psy1572          25 LHSTGQFQALVQLA   38 (74)
Q Consensus        25 ls~~G~yqcLV~LS   38 (74)
                      ..-+|.|+|.++-+
T Consensus        60 ~~D~G~Y~C~a~~~   73 (89)
T cd04979          60 PADAGVYTCQSVEH   73 (89)
T ss_pred             HHHCEEEEEEEecC
Confidence            35679999998743


No 123
>PF04101 Glyco_tran_28_C:  Glycosyltransferase family 28 C-terminal domain;  InterPro: IPR007235 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 28 GT28 from CAZY comprises enzymes with a number of known activities; 1,2-diacylglycerol 3-beta-galactosyltransferase (2.4.1.46 from EC); 1,2-diacylglycerol 3-beta-glucosyltransferase (2.4.1.157 from EC); beta-N-acetylglucosamine transferase (2.4.1 from EC). Structural analysis suggests the C-terminal domain contains the UDP-GlcNAc binding site.; GO: 0016758 transferase activity, transferring hexosyl groups, 0030246 carbohydrate binding, 0005975 carbohydrate metabolic process, 0030259 lipid glycosylation; PDB: 2KS6_A 2JZC_A 1NLM_B 1F0K_B.
Probab=20.63  E-value=43  Score=21.60  Aligned_cols=14  Identities=29%  Similarity=0.392  Sum_probs=11.0

Q ss_pred             ceEeecCCCCHHHH
Q psy1572          42 VAVCHGQGKTQQEA   55 (74)
Q Consensus        42 ~~VCHGsG~S~~eA   55 (74)
                      +.||||.+.|..|+
T Consensus        75 lvIs~aG~~Ti~E~   88 (167)
T PF04101_consen   75 LVISHAGAGTIAEA   88 (167)
T ss_dssp             EEEECS-CHHHHHH
T ss_pred             EEEeCCCccHHHHH
Confidence            79999999887765


No 124
>PRK05428 HPr kinase/phosphorylase; Provisional
Probab=20.11  E-value=3.8e+02  Score=21.06  Aligned_cols=66  Identities=15%  Similarity=0.094  Sum_probs=48.4

Q ss_pred             ccceeeeEEeeeCCCCCceeeeccccceeEEEEeccCCceEeecCCCCHHHHHHHHHHHHHHHHHHhh
Q psy1572           4 SGLLDTRVCLISTNGSNSIAFLHSTGQFQALVQLATYPVAVCHGQGKTQQEAKTVAAHNAIEYLRLMT   71 (74)
Q Consensus         4 s~l~~~~v~~~~~~~~n~ieels~~G~yqcLV~LST~P~~VCHGsG~S~~eA~~~AA~NALqyLkiM~   71 (74)
                      ++-+|.-|.+..++.....|.+-...+|..+.-.+.--+.+.==.|-....--+.||+|=.  ||.|+
T Consensus       223 ~~~I~lvV~L~~~~~~~~~dRLg~~~~~~~ilgv~Ip~i~ipv~~Grn~a~iiE~Aa~n~~--lk~~G  288 (308)
T PRK05428        223 KKRIQLVVELEEWDEDKEYDRLGLDEETEEILGVKIPKITIPVRPGRNLAVIIEAAARNYR--LKLMG  288 (308)
T ss_pred             CcEEEEEEEeecCCCCCccCCCCCchheeEEcCceeeEEEeCCCCCCcHHHHHHHHHHHHH--HHHcC
Confidence            3567788888888777788888888888877444443333333458999999999999965  88875


No 125
>cd05851 Ig3_Contactin-1 Third Ig domain of contactin-1. Ig3_Contactin-1: Third Ig domain of the neural cell adhesion molecule contactin-1. Contactins are comprised of six Ig domains followed by four fibronectin type III (FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. Contactin-1 is differentially expressed in tumor tissues and may through a RhoA mechanism, facilitate invasion and metastasis of human lung adenocarcinoma.
Probab=20.10  E-value=66  Score=19.11  Aligned_cols=10  Identities=10%  Similarity=0.444  Sum_probs=8.4

Q ss_pred             cccceeEEEE
Q psy1572          27 STGQFQALVQ   36 (74)
Q Consensus        27 ~~G~yqcLV~   36 (74)
                      -+|.|+|.++
T Consensus        64 D~G~Y~C~A~   73 (88)
T cd05851          64 DEGTYECEAE   73 (88)
T ss_pred             hCEEEEEEEE
Confidence            5699999976


No 126
>cd04968 Ig3_Contactin_like Third Ig domain of contactin. Ig3_Contactin_like: Third Ig domain of contactins. Contactins are neural cell adhesion molecules and are comprised of six Ig domains followed by four fibronectin type III(FnIII) domains anchored to the membrane by glycosylphosphatidylinositol. The first four Ig domains form the intermolecular binding fragment, which arranges as a compact U-shaped module via contacts between Ig domains 1 and 4, and between Ig domains 2 and 3. Contactin-2 (TAG-1, axonin-1) may play a part in the neuronal processes of neurite outgrowth, axon guidance and fasciculation, and neuronal migration. This group also includes contactin-1 and contactin-5. The different contactins show different expression patterns in the central nervous system. During development and in adulthood, contactin-2 is transiently expressed in subsets of central and peripheral neurons. Contactin-5 is expressed specifically in the rat postnatal nervous system, peaking at about 3 week
Probab=20.09  E-value=67  Score=18.59  Aligned_cols=11  Identities=9%  Similarity=0.416  Sum_probs=9.0

Q ss_pred             ccccceeEEEE
Q psy1572          26 HSTGQFQALVQ   36 (74)
Q Consensus        26 s~~G~yqcLV~   36 (74)
                      .-+|.|+|.+.
T Consensus        63 ~D~G~Y~C~a~   73 (88)
T cd04968          63 EDEGTYECEAE   73 (88)
T ss_pred             ccCEEEEEEEE
Confidence            45799999986


Done!