Query psy14887
Match_columns 238
No_of_seqs 139 out of 1708
Neff 11.4
Searched_HMMs 46136
Date Fri Aug 16 23:20:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14887.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14887hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4221|consensus 100.0 1.8E-26 4E-31 191.5 21.9 204 2-216 520-725 (1381)
2 KOG3513|consensus 99.9 4.3E-24 9.4E-29 177.6 25.2 197 2-204 716-914 (1051)
3 KOG0196|consensus 99.9 3.8E-25 8.2E-30 177.5 15.5 195 1-206 330-537 (996)
4 KOG3513|consensus 99.9 5.7E-24 1.2E-28 176.8 21.3 207 2-213 614-825 (1051)
5 KOG4221|consensus 99.9 1.3E-20 2.7E-25 157.3 20.6 194 1-202 614-811 (1381)
6 KOG4222|consensus 99.6 2.8E-14 6E-19 119.5 16.5 185 13-202 652-843 (1281)
7 PF00041 fn3: Fibronectin type 99.5 1.9E-13 4.2E-18 84.5 9.4 84 108-195 2-85 (85)
8 PF00041 fn3: Fibronectin type 99.5 1.3E-12 2.8E-17 80.7 10.1 84 4-92 1-84 (85)
9 KOG4222|consensus 99.3 1.1E-11 2.3E-16 104.4 9.0 197 3-204 527-742 (1281)
10 KOG4258|consensus 99.3 6.8E-12 1.5E-16 102.7 6.8 135 65-204 783-923 (1025)
11 KOG4802|consensus 99.0 4.1E-08 8.9E-13 74.7 16.3 137 63-204 208-353 (516)
12 cd00063 FN3 Fibronectin type 3 99.0 1.7E-08 3.6E-13 62.7 11.4 84 108-196 3-87 (93)
13 cd00063 FN3 Fibronectin type 3 98.9 4.7E-08 1E-12 60.6 10.9 87 3-94 1-87 (93)
14 KOG0196|consensus 98.9 2.9E-08 6.4E-13 81.8 11.3 95 3-103 443-537 (996)
15 smart00060 FN3 Fibronectin typ 98.8 2.6E-07 5.7E-12 55.7 10.5 80 108-191 3-82 (83)
16 KOG4258|consensus 98.6 8.8E-07 1.9E-11 73.7 12.8 137 8-150 493-653 (1025)
17 smart00060 FN3 Fibronectin typ 98.5 4E-06 8.7E-11 50.3 10.1 78 4-88 2-81 (83)
18 PF09294 Interfer-bind: Interf 98.1 1.4E-05 3E-10 51.3 6.3 83 107-195 4-98 (106)
19 PF09294 Interfer-bind: Interf 97.9 0.00017 3.6E-09 46.2 8.2 89 4-97 4-102 (106)
20 PF01108 Tissue_fac: Tissue fa 97.9 0.00017 3.7E-09 46.3 8.0 82 3-90 22-105 (107)
21 PF10179 DUF2369: Uncharacteri 97.8 0.0013 2.8E-08 49.7 12.5 122 64-191 128-285 (300)
22 KOG4367|consensus 97.8 8.2E-05 1.8E-09 57.7 6.3 97 102-205 435-531 (699)
23 KOG4152|consensus 97.5 0.0017 3.7E-08 52.0 9.9 123 69-195 652-789 (830)
24 KOG3632|consensus 97.3 0.0014 2.9E-08 56.3 8.0 173 3-195 672-859 (1335)
25 PF09067 EpoR_lig-bind: Erythr 97.1 0.0056 1.2E-07 38.8 7.7 83 1-89 6-95 (104)
26 KOG4367|consensus 96.8 0.0012 2.5E-08 51.6 3.2 82 14-102 450-531 (699)
27 PF10179 DUF2369: Uncharacteri 96.7 0.16 3.5E-06 38.7 15.9 28 61-88 12-39 (300)
28 COG3401 Fibronectin type 3 dom 96.6 0.1 2.3E-06 39.9 12.0 162 16-197 83-247 (343)
29 PF01108 Tissue_fac: Tissue fa 96.6 0.016 3.4E-07 37.1 6.8 81 105-191 21-104 (107)
30 PF09067 EpoR_lig-bind: Erythr 96.5 0.027 5.8E-07 35.8 7.4 81 105-190 7-94 (104)
31 KOG4802|consensus 96.0 0.15 3.3E-06 40.0 10.3 58 159-217 202-259 (516)
32 PF09240 IL6Ra-bind: Interleuk 95.3 0.38 8.3E-06 30.2 10.1 80 5-89 1-86 (99)
33 PF07495 Y_Y_Y: Y_Y_Y domain; 95.2 0.24 5.1E-06 28.3 7.1 27 167-194 30-56 (66)
34 KOG3632|consensus 95.1 0.049 1.1E-06 47.4 5.2 159 6-188 581-740 (1335)
35 PLN02533 probable purple acid 94.6 1.2 2.6E-05 36.3 11.9 87 105-205 40-135 (427)
36 COG4733 Phage-related protein, 93.8 0.91 2E-05 39.4 9.7 127 61-195 657-786 (952)
37 COG3401 Fibronectin type 3 dom 92.8 1.1 2.4E-05 34.6 8.1 150 36-195 4-153 (343)
38 KOG4806|consensus 92.7 2 4.4E-05 33.4 9.3 119 68-193 288-437 (454)
39 PF09240 IL6Ra-bind: Interleuk 92.2 1.9 4.1E-05 27.1 10.2 83 109-194 2-89 (99)
40 KOG1225|consensus 92.1 0.74 1.6E-05 38.1 6.9 157 5-183 368-524 (525)
41 PLN02533 probable purple acid 91.7 3.1 6.7E-05 34.0 10.1 84 3-101 41-134 (427)
42 PF07353 Uroplakin_II: Uroplak 89.2 4.8 0.0001 27.5 7.5 41 163-205 99-139 (184)
43 PF07495 Y_Y_Y: Y_Y_Y domain; 89.1 2.8 6E-05 23.8 6.7 25 65-90 30-54 (66)
44 KOG4152|consensus 88.3 0.94 2E-05 37.1 4.4 35 170-205 651-685 (830)
45 KOG0613|consensus 88.0 9.5 0.00021 35.1 10.6 178 2-195 139-320 (1205)
46 TIGR00868 hCaCC calcium-activa 86.8 5.1 0.00011 35.8 8.3 88 107-195 757-859 (863)
47 KOG1948|consensus 86.5 24 0.00051 31.6 14.6 117 63-187 945-1064(1165)
48 PF14292 SusE: SusE outer memb 80.2 14 0.0003 24.2 10.0 67 117-186 42-121 (122)
49 KOG0613|consensus 76.2 18 0.00039 33.5 7.8 126 66-195 293-422 (1205)
50 TIGR00868 hCaCC calcium-activa 74.3 26 0.00056 31.6 8.3 41 2-45 755-795 (863)
51 PF11344 DUF3146: Protein of u 74.2 2.8 6E-05 24.6 1.7 16 171-186 65-80 (80)
52 PHA02579 7 baseplate wedge sub 71.6 25 0.00053 30.9 7.2 80 7-94 9-96 (1030)
53 KOG1948|consensus 70.6 44 0.00095 30.0 8.5 26 163-188 943-968 (1165)
54 PF14292 SusE: SusE outer memb 64.9 36 0.00079 22.2 9.9 37 9-48 37-73 (122)
55 PF10342 GPI-anchored: Ser-Thr 62.9 32 0.00069 20.9 7.9 61 119-184 12-77 (93)
56 KOG4806|consensus 61.4 18 0.00038 28.5 4.2 33 157-189 161-193 (454)
57 COG4733 Phage-related protein, 55.8 72 0.0015 28.6 7.2 45 160-205 654-698 (952)
58 TIGR03000 plancto_dom_1 Planct 55.4 24 0.00051 20.9 3.2 27 162-188 25-51 (75)
59 PHA02579 7 baseplate wedge sub 53.9 1.1E+02 0.0024 27.3 7.9 79 108-193 7-92 (1030)
60 KOG3834|consensus 53.0 40 0.00086 27.4 5.0 70 9-81 66-137 (462)
61 PF09423 PhoD: PhoD-like phosp 48.4 24 0.00051 29.2 3.5 34 167-204 65-98 (453)
62 KOG3834|consensus 48.4 56 0.0012 26.7 5.2 67 115-183 71-137 (462)
63 PF13205 Big_5: Bacterial Ig-l 48.3 30 0.00064 21.6 3.3 15 171-185 70-84 (107)
64 PF04775 Bile_Hydr_Trans: Acyl 48.0 35 0.00075 22.6 3.6 24 167-190 6-29 (126)
65 PF07353 Uroplakin_II: Uroplak 45.5 46 0.00099 23.0 3.8 24 63-86 101-124 (184)
66 PF09423 PhoD: PhoD-like phosp 44.9 59 0.0013 26.9 5.3 35 64-101 64-98 (453)
67 KOG3515|consensus 43.4 53 0.0012 29.0 4.8 75 69-150 659-733 (741)
68 PF10333 Pga1: GPI-Mannosyltra 42.9 36 0.00077 24.2 3.2 22 163-184 63-84 (180)
69 KOG4228|consensus 42.1 47 0.001 30.6 4.4 109 6-127 28-136 (1087)
70 PF13750 Big_3_3: Bacterial Ig 38.0 46 0.00099 23.1 3.1 31 170-202 116-146 (158)
71 PF13754 Big_3_4: Bacterial Ig 36.8 73 0.0016 17.3 5.3 27 168-195 16-42 (54)
72 PF00907 T-box: T-box; InterP 36.5 85 0.0018 22.3 4.4 25 64-88 33-57 (184)
73 PF02010 REJ: REJ domain; Int 35.9 12 0.00026 30.6 0.0 32 171-204 267-298 (440)
74 PF14686 fn3_3: Polysaccharide 35.4 43 0.00093 20.9 2.4 21 164-185 48-68 (95)
75 PF00907 T-box: T-box; InterP 34.9 77 0.0017 22.5 3.9 25 165-189 32-56 (184)
76 KOG1378|consensus 34.1 2.8E+02 0.006 23.2 10.0 33 65-102 110-142 (452)
77 KOG4228|consensus 34.0 97 0.0021 28.7 5.0 102 81-195 144-245 (1087)
78 PF14250 AbrB-like: AbrB-like 31.3 49 0.0011 19.2 1.9 41 134-184 24-64 (71)
79 cd05893 Ig_Palladin_C C-termin 29.0 1.2E+02 0.0027 17.5 5.4 26 167-192 42-67 (75)
80 PF08329 ChitinaseA_N: Chitina 27.9 1.9E+02 0.0042 19.4 8.7 20 71-90 79-98 (133)
81 COG3979 Uncharacterized protei 26.2 1.6E+02 0.0035 21.0 4.3 84 109-204 8-91 (181)
82 KOG4680|consensus 25.8 1.5E+02 0.0032 20.1 3.6 25 167-191 109-133 (153)
83 cd00182 TBOX T-box DNA binding 25.8 1.3E+02 0.0027 21.7 3.7 23 166-188 35-57 (188)
84 KOG1225|consensus 25.2 80 0.0017 26.8 3.0 67 6-81 458-524 (525)
85 cd05762 Ig8_MLCK Eighth immuno 24.8 1.8E+02 0.0038 17.9 6.6 31 164-194 54-84 (98)
86 KOG1378|consensus 23.7 4.4E+02 0.0095 22.1 9.3 33 167-205 110-142 (452)
87 COG3269 Predicted RNA-binding 23.5 1.6E+02 0.0034 17.4 3.1 25 171-195 11-35 (73)
88 smart00425 TBOX Domain first f 22.3 1.5E+02 0.0033 21.3 3.6 23 166-188 34-56 (190)
89 cd02848 Chitinase_N_term Chiti 22.2 2.3E+02 0.0049 18.2 9.8 20 71-90 76-95 (106)
90 PF05345 He_PIG: Putative Ig d 22.1 1.2E+02 0.0026 16.1 2.4 12 178-189 37-48 (49)
91 PF07867 DUF1654: Protein of u 20.8 1.6E+02 0.0036 17.4 2.9 20 8-27 52-71 (73)
No 1
>KOG4221|consensus
Probab=99.95 E-value=1.8e-26 Score=191.48 Aligned_cols=204 Identities=25% Similarity=0.410 Sum_probs=169.2
Q ss_pred CCCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEE
Q psy14887 2 PQVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKV 81 (238)
Q Consensus 2 pp~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v 81 (238)
||.+|..+++.+.+...+.+.|++| ...++++++|++.|... +...+ .....+..+++|+||++++.|.++|
T Consensus 520 ~pEgp~~~~a~ats~~ti~v~WepP----~~~n~~I~~yk~~ys~~-~~~~~---~~~~~n~~e~ti~gL~k~TeY~~~v 591 (1381)
T KOG4221|consen 520 QPEGPVQLQAYATSPTTILVTWEPP----PFGNGPITGYKLFYSED-DTGKE---LRVENNATEYTINGLEKYTEYSIRV 591 (1381)
T ss_pred CCCCCccccccccCcceEEEEecCC----CCCCCCceEEEEEEEcC-CCCce---EEEecCccEEEeecCCCccceEEEE
Confidence 4556666888888999999999998 55678999999998554 22222 3344567789999999999999999
Q ss_pred EEEeCCCCCCCCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeee
Q psy14887 82 MAYNNAGEGPESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIP 161 (238)
Q Consensus 82 ~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~ 161 (238)
.|.|..|.|..+..+.++|..+.|..||.++++...++++|.|+|.++++...+|.|.+|+|+|+........ .....
T Consensus 592 vA~N~~G~g~sS~~i~V~Tlsd~PsaPP~Nl~lev~sStsVrVsW~pP~~~t~ng~itgYkIRy~~~~~~~~~--~~t~v 669 (1381)
T KOG4221|consen 592 VAYNSAGSGVSSADITVRTLSDVPSAPPQNLSLEVVSSTSVRVSWLPPPSETQNGQITGYKIRYRKLSREDEV--NETVV 669 (1381)
T ss_pred EEecCCCCCCCCCceEEEeccCCCCCCCcceEEEecCCCeEEEEccCCCcccccceEEEEEEEecccCccccc--ceeec
Confidence 9999999999999999999999999999999999999999999999988889999999999999876655432 23444
Q ss_pred cCCcceEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeeccc--ccCCCCCcee
Q psy14887 162 VGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMGEE--KEFKSGSPAL 216 (238)
Q Consensus 162 ~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~~~--~~~~p~~~~~ 216 (238)
.++.+++++.+|+|++.|.|+|.|.|..|.|+.|+. +.+.|... ++.+|.+|..
T Consensus 670 ~~n~~~~l~~~Lep~T~Y~vrIsa~t~nGtGpaS~w-~~aeT~~~d~~e~vp~~ps~ 725 (1381)
T KOG4221|consen 670 KGNTTQYLFNGLEPNTQYRVRISAMTVNGTGPASEW-VSAETPESDLDERVPGKPSE 725 (1381)
T ss_pred ccchhhhHhhcCCCCceEEEEEEEeccCCCCCcccc-eeccCccccccccCCCCCce
Confidence 568899999999999999999999999999999964 66666443 3456666653
No 2
>KOG3513|consensus
Probab=99.94 E-value=4.3e-24 Score=177.56 Aligned_cols=197 Identities=25% Similarity=0.480 Sum_probs=160.5
Q ss_pred CCCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEec--CCCCCEEEE
Q psy14887 2 PQVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVG--LQPDTKYYV 79 (238)
Q Consensus 2 pp~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~--L~p~~~Y~~ 79 (238)
|-.+|.||.......+.+.|+|++... ...+++-.+|+|.|+..+....+....+.......+++.+ ..|++.|.+
T Consensus 716 P~~~P~nv~g~g~~~~eLvItW~Pl~~--~~qNG~gfgY~Vswr~~g~~~~W~~~~v~~~d~~~~V~~~~st~~~tpyev 793 (1051)
T KOG3513|consen 716 PSVNPSNVKGGGGSPTELVITWEPLPE--EEQNGPGFGYRVSWRPQGADKEWKEVIVSNQDQPRYVVSNESTEPFTPYEV 793 (1051)
T ss_pred CccCCccccccCCCCceEEEEeccCCH--HHccCCCceEEEEEEeCCCCcccceeEecccCCceEEEcCCCCCCcceeEE
Confidence 667789999988899999999999876 4455888999999877766634443333333344566665 667999999
Q ss_pred EEEEEeCCCCCCCCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceee
Q psy14887 80 KVMAYNNAGEGPESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTI 159 (238)
Q Consensus 80 ~v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~ 159 (238)
.|++.|+.|.|+.+....+++.++.|+.+|..+.+...+.+.+.|+|+ ++...+|.+.||+|.|+....... ....+
T Consensus 794 KVqa~N~~GeGp~s~~~v~~S~Ed~P~~ap~~~~~~~~s~s~~~v~W~--~~~~~nG~l~gY~v~Y~~~~~~~~-~~~~~ 870 (1051)
T KOG3513|consen 794 KVQAINDQGEGPESQVTVGYSGEDEPPVAPTKLSAKPLSSSEVNLSWK--PPLWDNGKLTGYEVKYWKINEKEG-SLSRV 870 (1051)
T ss_pred EEEEecCCCCCCCCceEEEEcCCCCCCCCCccceeecccCceEEEEec--CcCccCCccceeEEEEEEcCCCcc-cccce
Confidence 999999999999999999999999999999999999999999999999 556666999999999998876642 22244
Q ss_pred eecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeec
Q psy14887 160 IPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMG 204 (238)
Q Consensus 160 ~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~ 204 (238)
...++.++..|.||+++|.|.|.|+|+|.+|.|+.|.. ...++.
T Consensus 871 ~i~~~~~~~~ltgL~~~T~Y~~~vrA~nsaG~Gp~s~~-~~~tt~ 914 (1051)
T KOG3513|consen 871 QIAGNRTSWRLTGLEPNTKYRFYVRAYTSAGGGPASSE-ENVTTK 914 (1051)
T ss_pred eecCCcceEeeeCCCCCceEEEEEEEecCCCCCCCccc-eecccc
Confidence 55589999999999999999999999999999998854 333433
No 3
>KOG0196|consensus
Probab=99.93 E-value=3.8e-25 Score=177.55 Aligned_cols=195 Identities=25% Similarity=0.340 Sum_probs=149.2
Q ss_pred CCCCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEee------CCCcCceEEEeecc---ccceEEEecC
Q psy14887 1 MPQVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKE------DNPEETAVYYLSRT---TRNWALIVGL 71 (238)
Q Consensus 1 ~pp~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~------~~~~~~~~~~~~~~---~~~~~~~~~L 71 (238)
.||++|.++... ++.+++.|.|.+|.+. .+...+ .|.|...+. ++.|+..+.+.+.. .++...+.+|
T Consensus 330 ~PPSaP~nlis~-vn~Ts~~L~W~~P~d~--GGR~Di-~y~v~Ck~c~~~~~~C~~Cg~~V~f~P~q~gLt~~~V~v~~L 405 (996)
T KOG0196|consen 330 RPPSAPRNLISN-VNGTSLILEWSPPADT--GGREDI-TYNVICKKCGGGRGACEPCGDNVRFTPRQRGLTETSVTVSDL 405 (996)
T ss_pred CCCCccceeeee-cccceEEEEecCCccc--CCCcce-EEEEEeeccCCCCCccccCCCCceECCCCCCcccceEEEecc
Confidence 489999998884 8999999999999864 223334 466654332 23355455554332 4677899999
Q ss_pred CCCCEEEEEEEEEeCCCC-CC---CCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEE
Q psy14887 72 QPDTKYYVKVMAYNNAGE-GP---ESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWE 147 (238)
Q Consensus 72 ~p~~~Y~~~v~a~~~~g~-~~---~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~ 147 (238)
.|.+.|+|.|.|+|+... ++ ....+.+.|.+..| .+...++......+++.|+|. .|++.+|.|..|+|+|++
T Consensus 406 ~ah~~YTFeV~AvNgVS~lsp~~~~~a~vnItt~qa~p-s~V~~~r~~~~~~~sitlsW~--~p~~png~ildYEvky~e 482 (996)
T KOG0196|consen 406 LAHTNYTFEVEAVNGVSDLSPFPRQFASVNITTNQAAP-SPVSVLRQVSRTSDSITLSWS--EPDQPNGVILDYEVKYYE 482 (996)
T ss_pred ccccccEEEEEEeecccccCCCCCcceeEEeeccccCC-CccceEEEeeeccCceEEecC--CCCCCCCcceeEEEEEee
Confidence 999999999999998754 32 23345667766655 455578888999999999999 788999999999999999
Q ss_pred cCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeeccc
Q psy14887 148 VDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMGEE 206 (238)
Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~~~ 206 (238)
++.++... .......+..++.+|+|++.|.|+|+|++.+|.|+.|. .+.|+|...
T Consensus 483 k~~~e~~~---~~~~t~~~~~ti~gL~p~t~YvfqVRarT~aG~G~~S~-~~~fqT~~~ 537 (996)
T KOG0196|consen 483 KDEDERSY---STLKTKTTTATITGLKPGTVYVFQVRARTAAGYGPYSG-KHEFQTLPS 537 (996)
T ss_pred ccccccce---eEEecccceEEeeccCCCcEEEEEEEEecccCCCCCCC-ceeeeecCc
Confidence 97654432 23346778999999999999999999999999999995 477887753
No 4
>KOG3513|consensus
Probab=99.93 E-value=5.7e-24 Score=176.85 Aligned_cols=207 Identities=22% Similarity=0.293 Sum_probs=160.1
Q ss_pred CCCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeecccc--ceEEEecCCCCCEEEE
Q psy14887 2 PQVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTR--NWALIVGLQPDTKYYV 79 (238)
Q Consensus 2 pp~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~--~~~~~~~L~p~~~Y~~ 79 (238)
||.||.+|.+...+.++++|+|++..|+ +++|..|.|+++..-...+......+.... .+.++.+|.|+..|+|
T Consensus 614 pPgpP~~v~~~~i~~t~~~lsW~~g~dn----~SpI~~Y~iq~rt~~~~~W~~v~~vp~~~~~~~sa~vv~L~Pwv~YeF 689 (1051)
T KOG3513|consen 614 PPGPPPDVHVDDISDTTARLSWSPGSDN----NSPIEKYTIQFRTPFPGKWKAVTTVPGNITGDESATVVNLSPWVEYEF 689 (1051)
T ss_pred CCCCCCceeEeeeccceEEEEeecCCCC----CCCceEEeEEecCCCCCcceEeeECCCcccCccceeEEccCCCcceEE
Confidence 8999999999999999999999998653 367999999975442222323332222212 2467889999999999
Q ss_pred EEEEEeCCCCCCCCCcE-EeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCcee
Q psy14887 80 KVMAYNNAGEGPESERY-LERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDT 158 (238)
Q Consensus 80 ~v~a~~~~g~~~~s~~~-~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~ 158 (238)
||.|.|..|.|.+|.+. ..+|.+++|...|.++.......+.+.+.|+++++...||+-.+|+|.|+..+... .|...
T Consensus 690 RV~AvN~iG~gePS~pS~~~rT~ea~P~~~P~nv~g~g~~~~eLvItW~Pl~~~~qNG~gfgY~Vswr~~g~~~-~W~~~ 768 (1051)
T KOG3513|consen 690 RVVAVNSIGIGEPSPPSEKVRTPEAAPSVNPSNVKGGGGSPTELVITWEPLPEEEQNGPGFGYRVSWRPQGADK-EWKEV 768 (1051)
T ss_pred EEEEEcccccCCCCCCccceecCCCCCccCCccccccCCCCceEEEEeccCCHHHccCCCceEEEEEEeCCCCc-cccee
Confidence 99999999999887764 57899999999999999999999999999999999999999999999999988763 24444
Q ss_pred eeecCCcceEEe--cCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeecccccCCCCC
Q psy14887 159 IIPVGNTLDADI--VDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMGEEKEFKSGS 213 (238)
Q Consensus 159 ~~~~~~~~~~~i--~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~~~~~~~p~~ 213 (238)
++.+.....+.+ ..-+|.+.|.++|+|+|..|+|+.|..++.+.-...|+..|..
T Consensus 769 ~v~~~d~~~~V~~~~st~~~tpyevKVqa~N~~GeGp~s~~~v~~S~Ed~P~~ap~~ 825 (1051)
T KOG3513|consen 769 IVSNQDQPRYVVSNESTEPFTPYEVKVQAINDQGEGPESQVTVGYSGEDEPPVAPTK 825 (1051)
T ss_pred EecccCCceEEEcCCCCCCcceeEEEEEEecCCCCCCCCceEEEEcCCCCCCCCCcc
Confidence 554444444444 5566799999999999999999999655555544444333333
No 5
>KOG4221|consensus
Probab=99.88 E-value=1.3e-20 Score=157.32 Aligned_cols=194 Identities=25% Similarity=0.397 Sum_probs=151.0
Q ss_pred CCCCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEE
Q psy14887 1 MPQVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVK 80 (238)
Q Consensus 1 ~pp~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~ 80 (238)
+|-.||.|+++...++++++++|.+|.. ...++.+.+|.|+|+......+-.... ...+...+.+++|+|++.|.|+
T Consensus 614 ~PsaPP~Nl~lev~sStsVrVsW~pP~~--~t~ng~itgYkIRy~~~~~~~~~~~t~-v~~n~~~~l~~~Lep~T~Y~vr 690 (1381)
T KOG4221|consen 614 VPSAPPQNLSLEVVSSTSVRVSWLPPPS--ETQNGQITGYKIRYRKLSREDEVNETV-VKGNTTQYLFNGLEPNTQYRVR 690 (1381)
T ss_pred CCCCCCcceEEEecCCCeEEEEccCCCc--ccccceEEEEEEEecccCcccccceee-cccchhhhHhhcCCCCceEEEE
Confidence 3667788899999999999999999986 455689999999986554443322222 2236677899999999999999
Q ss_pred EEEEeCCCCCCCCCcEEeeccCcCCC----CCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCc
Q psy14887 81 VMAYNNAGEGPESERYLERTYRKAPQ----KPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRAN 156 (238)
Q Consensus 81 v~a~~~~g~~~~s~~~~~~t~~~~p~----~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~ 156 (238)
|.|.+.+|.|+.+++..+.|.+..+. .+|..++ ......++.+.|. |+.+.+-.+.||+|.|+...... ..
T Consensus 691 Isa~t~nGtGpaS~w~~aeT~~~d~~e~vp~~ps~l~-~~~g~~si~vsW~--Pp~~~~~~vrgY~ig~r~g~~~p--~~ 765 (1381)
T KOG4221|consen 691 ISAMTVNGTGPASEWVSAETPESDLDERVPGKPSELH-VHPGSNSIVVSWT--PPPHPNIVVRGYKIGYRPGSGIP--DT 765 (1381)
T ss_pred EEEeccCCCCCcccceeccCccccccccCCCCCceee-eccCceeEEEEeC--CCCChhhhhcceEEeeecccCCC--CC
Confidence 99999999999999999988865432 3454444 3446678999999 67777788999999996443332 22
Q ss_pred eeeeecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEe
Q psy14887 157 DTIIPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQ 202 (238)
Q Consensus 157 ~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~ 202 (238)
..+......+.+.|..|.++..|.|+++|+|..|+|.+.-..+.++
T Consensus 766 ~tIrl~~~~s~y~l~~Le~~~~YvVkL~AfNn~gdG~p~y~~~~tR 811 (1381)
T KOG4221|consen 766 GTIRLDEKVSYYNLEQLEPNRDYVVKLRAFNNHGDGNPIYESVKTR 811 (1381)
T ss_pred ccEEecceeeEEEEEecccCceEEEEEEEeccCCCCcceeeeeeec
Confidence 3567778889999999999999999999999999999874334444
No 6
>KOG4222|consensus
Probab=99.62 E-value=2.8e-14 Score=119.52 Aligned_cols=185 Identities=21% Similarity=0.280 Sum_probs=138.9
Q ss_pred eeccceEEEEeeecccccccccceeeEEEEEEEeeCCCc-CceEEEeeccccceEEEecCCCCCEEEEEEEEEeCCC---
Q psy14887 13 GFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPE-ETAVYYLSRTTRNWALIVGLQPDTKYYVKVMAYNNAG--- 88 (238)
Q Consensus 13 ~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~-~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g--- 88 (238)
.++++++.+.|.... ......+.+|.|.|+...... .+....+......++.+.+|+|++.|+|.++.....+
T Consensus 652 ~L~asslr~~w~~~k---q~~~~~i~g~~I~~r~~~~~~a~~s~~~v~~~t~~s~v~~nl~p~t~ye~f~~Pf~~~~~s~ 728 (1281)
T KOG4222|consen 652 VLNASSLRLGWTKDK---QHGSQYIQGYRISYRSLGSQLAQWSNAGVTVPTPESVVVPNLKPGTNYEFFVRPFFPHGYSI 728 (1281)
T ss_pred ccchhheeeeeeeec---ccCcccccceEEEeccCccccccccccceeccCCcceeccccCCCccceeeccCccCCCcce
Confidence 467889999998865 444578999999986655421 1111122233445688999999999999999988744
Q ss_pred CCCCCCcEEeeccCcCCCCCCcce---EEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCCc
Q psy14887 89 EGPESERYLERTYRKAPQKPPSAV---HIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGNT 165 (238)
Q Consensus 89 ~~~~s~~~~~~t~~~~p~~~p~~~---~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~ 165 (238)
.+..+....+.|.+..|..+|..+ .....+.++..|+|.+++....+|.+.+|+|..-.... .........+...
T Consensus 729 ~g~pS~sk~alt~e~~PSapp~~~~~~s~~~~n~Ta~~Vsw~~pp~d~~ng~~qg~ki~~~~~e~--tr~h~n~t~~a~~ 806 (1281)
T KOG4222|consen 729 QGAPSNSKTALTLEEPPSAPPQGVQHVSKGSYNGTAGSVSWAPPPADVQNGILQGYKIECSGGEK--TRIHINKTTNART 806 (1281)
T ss_pred ecCCcccccccccccCCCCCCCCccccccccCCCceeeEEecCCcccccCCcccceeEEeecCcc--ccccccccccCCC
Confidence 467788888889988898888884 45566788999999987777888999999997643321 1222233445777
Q ss_pred ceEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEe
Q psy14887 166 LDADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQ 202 (238)
Q Consensus 166 ~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~ 202 (238)
.+.+|.+|.++..|.|++.+.+.+|.|..+++..++.
T Consensus 807 ~sv~i~~l~~g~ay~vtv~a~T~aGvG~~s~p~~~~~ 843 (1281)
T KOG4222|consen 807 GSVTIGNLVTGIAYSVTVAARTGAGVGVKSPPQPIVF 843 (1281)
T ss_pred CceEeccccccceEEEEEeeecCCccCCCCCCeeeec
Confidence 8999999999999999999999999999887655443
No 7
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=99.52 E-value=1.9e-13 Score=84.47 Aligned_cols=84 Identities=33% Similarity=0.606 Sum_probs=69.0
Q ss_pred CCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEe
Q psy14887 108 PPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFS 187 (238)
Q Consensus 108 ~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~ 187 (238)
+|.++.+...+.+++.|.|.. +...++.+.+|.|.|+....... ..........+.+++.+|+|++.|.|+|+|.+
T Consensus 2 ~P~~l~v~~~~~~sv~v~W~~--~~~~~~~~~~y~v~~~~~~~~~~--~~~~~~~~~~~~~~i~~L~p~t~Y~~~v~a~~ 77 (85)
T PF00041_consen 2 APENLSVSNISPTSVTVSWKP--PSSGNGPITGYRVEYRSVNSTSD--WQEVTVPGNETSYTITGLQPGTTYEFRVRAVN 77 (85)
T ss_dssp SSEEEEEEEECSSEEEEEEEE--SSSTSSSESEEEEEEEETTSSSE--EEEEEEETTSSEEEEESCCTTSEEEEEEEEEE
T ss_pred cCcCeEEEECCCCEEEEEEEC--CCCCCCCeeEEEEEEEeccccee--eeeeeeeeeeeeeeeccCCCCCEEEEEEEEEe
Confidence 677899999999999999995 33778999999999988766541 22344456667999999999999999999999
Q ss_pred CCCCCCCC
Q psy14887 188 NGGDGRMS 195 (238)
Q Consensus 188 ~~g~g~~s 195 (238)
..|.|++|
T Consensus 78 ~~g~g~~S 85 (85)
T PF00041_consen 78 SDGEGPPS 85 (85)
T ss_dssp TTEEEEEE
T ss_pred CCcCcCCC
Confidence 99987654
No 8
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=99.46 E-value=1.3e-12 Score=80.72 Aligned_cols=84 Identities=29% Similarity=0.488 Sum_probs=66.1
Q ss_pred CCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEEE
Q psy14887 4 VAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVMA 83 (238)
Q Consensus 4 ~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a 83 (238)
+||+++++...+.+++.|+|+.+. ...+.+.+|.|+|+...+...... .........+.+.+|.|++.|.|+|++
T Consensus 1 s~P~~l~v~~~~~~sv~v~W~~~~----~~~~~~~~y~v~~~~~~~~~~~~~-~~~~~~~~~~~i~~L~p~t~Y~~~v~a 75 (85)
T PF00041_consen 1 SAPENLSVSNISPTSVTVSWKPPS----SGNGPITGYRVEYRSVNSTSDWQE-VTVPGNETSYTITGLQPGTTYEFRVRA 75 (85)
T ss_dssp SSSEEEEEEEECSSEEEEEEEESS----STSSSESEEEEEEEETTSSSEEEE-EEEETTSSEEEEESCCTTSEEEEEEEE
T ss_pred CcCcCeEEEECCCCEEEEEEECCC----CCCCCeeEEEEEEEecccceeeee-eeeeeeeeeeeeccCCCCCEEEEEEEE
Confidence 479999999999999999999984 344788999999866655542222 222233447999999999999999999
Q ss_pred EeCCCCCCC
Q psy14887 84 YNNAGEGPE 92 (238)
Q Consensus 84 ~~~~g~~~~ 92 (238)
.+..|.|++
T Consensus 76 ~~~~g~g~~ 84 (85)
T PF00041_consen 76 VNSDGEGPP 84 (85)
T ss_dssp EETTEEEEE
T ss_pred EeCCcCcCC
Confidence 999987754
No 9
>KOG4222|consensus
Probab=99.30 E-value=1.1e-11 Score=104.42 Aligned_cols=197 Identities=18% Similarity=0.252 Sum_probs=132.8
Q ss_pred CCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEE
Q psy14887 3 QVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVM 82 (238)
Q Consensus 3 p~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~ 82 (238)
|++|..-.+..++.++++|+|.+... -+..++.+|.+++...+-.+.+...- .....+.+.|.+|+|++.|.|.++
T Consensus 527 pS~p~~p~v~~v~~~~v~LsW~~~s~---sg~vP~s~yiieafs~~~~etw~~ta-~~v~~t~~~I~gL~P~~sylf~vR 602 (1281)
T KOG4222|consen 527 PSPPGTPGVVNVSRTSVTLSWQPTSP---SGAVPASGYIIEAFSPDLGETWQTTA-GRVKTTTYAIRGLKPNLSYLFLVR 602 (1281)
T ss_pred CCCCCCCccccCCCceEEecccCCCC---CCccccchhHHHHhhhhhcccccccc-cccccceeeecCcCccceeeeeee
Confidence 56666667778889999999998752 34578899999876554443332211 112456689999999999999999
Q ss_pred EEeCCCCCCCCCc-EEeeccCcCCCCC-C-----------cce---EEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEE
Q psy14887 83 AYNNAGEGPESER-YLERTYRKAPQKP-P-----------SAV---HIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVW 146 (238)
Q Consensus 83 a~~~~g~~~~s~~-~~~~t~~~~p~~~-p-----------~~~---~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~ 146 (238)
+.|..|.+..+.. -.++|....+... . ..+ .....+++++.+.|.- ...+....|.+|+|.|+
T Consensus 603 a~n~~Gis~Ps~~S~~vrta~a~~~~a~ad~~k~~~~ls~~l~~l~~~~~L~asslr~~w~~-~kq~~~~~i~g~~I~~r 681 (1281)
T KOG4222|consen 603 AENEQGISDPSTSSDPVRTAPADAAAAGADHQKVQRELSNELLRLSNPNVLNASSLRLGWTK-DKQHGSQYIQGYRISYR 681 (1281)
T ss_pred ccccccccCCcccCCccccCCCChhhhhhhHHHHHHhhcccceeeccccccchhheeeeeee-ecccCcccccceEEEec
Confidence 9999998655432 2233432211000 0 001 1234567889999983 23344678999999999
Q ss_pred EcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEeCCC---CCCCCCCcEEEeec
Q psy14887 147 EVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFSNGG---DGRMSSPAITFQMG 204 (238)
Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g---~g~~s~~~~~~~t~ 204 (238)
..+.....+....+.......+.+.+|.|++.|+|++..+...| .|..|+...+.+..
T Consensus 682 ~~~~~~a~~s~~~v~~~t~~s~v~~nl~p~t~ye~f~~Pf~~~~~s~~g~pS~sk~alt~e 742 (1281)
T KOG4222|consen 682 SLGSQLAQWSNAGVTVPTPESVVVPNLKPGTNYEFFVRPFFPHGYSIQGAPSNSKTALTLE 742 (1281)
T ss_pred cCccccccccccceeccCCcceeccccCCCccceeeccCccCCCcceecCCcccccccccc
Confidence 88774444444445556777889999999999999999988754 46666554444433
No 10
>KOG4258|consensus
Probab=99.29 E-value=6.8e-12 Score=102.71 Aligned_cols=135 Identities=21% Similarity=0.291 Sum_probs=90.3
Q ss_pred eEEEecCCCCCEEEEEEEEEeCCCC----CCCCCcEEeeccCcCCC-CCCcceEEEee-CCcEEEEEEeccCCCCCCCCe
Q psy14887 65 WALIVGLQPDTKYYVKVMAYNNAGE----GPESERYLERTYRKAPQ-KPPSAVHIKGI-NPTTVHVTWRYVSPSFEEEPI 138 (238)
Q Consensus 65 ~~~~~~L~p~~~Y~~~v~a~~~~g~----~~~s~~~~~~t~~~~p~-~~p~~~~~~~~-~~~~~~l~W~~~~~~~~~~~i 138 (238)
++.+.+|+.++.|.+.+.|+|..+. ++.. .+..+|+..... .-+..+..... ..+.+.++|. +|...||.|
T Consensus 783 ~~vi~~Lrh~tlY~i~l~aCnh~~~~~~cS~a~-~v~~RT~~~~~aD~i~g~v~we~~~~~~~v~l~w~--EP~~pNGli 859 (1025)
T KOG4258|consen 783 SYVISGLRHFTLYRIDLQACNHATPKCGCSHAA-FVFARTMPTMGADDIPGPVTWECHIEMNSVILRWL--EPKEPNGLI 859 (1025)
T ss_pred hhhhhccccchhhhhhHhhhcccccccccchhh-hhhhccccccccccCCCceeEecccCcceEEEecC--CCCCCCccE
Confidence 5677789999999999999987765 2211 122334322111 12222333333 6788999999 788999999
Q ss_pred eEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeec
Q psy14887 139 IGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMG 204 (238)
Q Consensus 139 ~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~ 204 (238)
..|.|.|+..+......-.+...-.......+.+|.|| .|.++|+|..-+|.|++.++ ..|...
T Consensus 860 ~~Y~Vk~r~~~~et~v~cvsR~~~~k~~gv~l~~l~~G-~y~~~vratSlaGng~~t~~-~~~~v~ 923 (1025)
T KOG4258|consen 860 LNYEVKYRRNGDETHVECVSRMDYAKAGGVYLKRLNPG-NYSVRVRATSLAGNGSWTES-VIFYVS 923 (1025)
T ss_pred EEEEEEEeeccCcchhhhhhhhhhhhcCceEEEecCCC-cEEEEEEEEeeccCCCCcCC-cceEEe
Confidence 99999999877654322111112234456678888776 89999999999999999865 445443
No 11
>KOG4802|consensus
Probab=99.04 E-value=4.1e-08 Score=74.73 Aligned_cols=137 Identities=21% Similarity=0.270 Sum_probs=88.9
Q ss_pred cceEEEecCCCCCEEEEEEEEEeCCCCCCCCCcEE-ee-ccCcCCCCCCcceEEEeeC---CcEEEEEEeccCCCCCCCC
Q psy14887 63 RNWALIVGLQPDTKYYVKVMAYNNAGEGPESERYL-ER-TYRKAPQKPPSAVHIKGIN---PTTVHVTWRYVSPSFEEEP 137 (238)
Q Consensus 63 ~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~-~~-t~~~~p~~~p~~~~~~~~~---~~~~~l~W~~~~~~~~~~~ 137 (238)
...+....+.|+.-|+|+|.|+|..|.-..+++.. +. +....++.+|.++++..+. .-...|.|. |...+-+
T Consensus 208 e~~~~~t~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~v~~dG~~~~~v~w~---P~~sdlP 284 (516)
T KOG4802|consen 208 ENTYIFTDMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIGVQFDGRYMLKVVWC---PSKSDLP 284 (516)
T ss_pred CceeeeeecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeeeeeecceEEEEEEeC---CCCCCCc
Confidence 33578889999999999999999887633333322 22 2222234577787765443 224567786 4566778
Q ss_pred eeEEEEEEEEcCCCC---CCCceeeeecCCcceEEecCCCCCceEEEEEEEEeCCCCCCC-CCCcEEEeec
Q psy14887 138 IIGYKVRVWEVDQDM---SRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRM-SSPAITFQMG 204 (238)
Q Consensus 138 i~~y~v~~~~~~~~~---~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~-s~~~~~~~t~ 204 (238)
|..|+|.|.-.-+.. ...+........+++ |.+|.|+..|.|.|+|....|.+.. |+..+.|.+.
T Consensus 285 v~~Yki~Ws~~v~s~k~~m~tks~~~k~thq~s--i~~L~Pns~Y~VevqAi~y~g~~rLksek~~~F~~T 353 (516)
T KOG4802|consen 285 VEKYKITWSLYVNSAKASMITKSSYVKDTHQFS--IKELLPNSSYYVEVQAISYLGSRRLKSEKWMLFNTT 353 (516)
T ss_pred ceeeEEEeehhhhhhhhhcccccceeeccchhh--hhhcCCCCeEEEEEEEEEeccCcccccceEEEEeec
Confidence 999999985433221 111222333344334 9999999999999999999887663 3444666554
No 12
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=99.00 E-value=1.7e-08 Score=62.71 Aligned_cols=84 Identities=27% Similarity=0.538 Sum_probs=61.7
Q ss_pred CCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeec-CCcceEEecCCCCCceEEEEEEEE
Q psy14887 108 PPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPV-GNTLDADIVDLSPGKRYHMRVLAF 186 (238)
Q Consensus 108 ~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~L~p~t~Y~~~V~a~ 186 (238)
+|.++.+.....+++.+.|.. +...++.+.+|.|.|+..... .+. ..... .....+.+.+|.|++.|.++|++.
T Consensus 3 ~p~~~~~~~~~~~~~~v~W~~--~~~~~~~~~~y~v~~~~~~~~--~~~-~~~~~~~~~~~~~i~~l~p~~~Y~~~v~a~ 77 (93)
T cd00063 3 PPTNLRVTDVTSTSVTLSWTP--PEDDGGPITGYVVEYREKGSG--DWK-EVEVTPGSETSYTLTGLKPGTEYEFRVRAV 77 (93)
T ss_pred CCCCcEEEEecCCEEEEEECC--CCCCCCcceeEEEEEeeCCCC--CCE-EeeccCCcccEEEEccccCCCEEEEEEEEE
Confidence 555677777778999999994 444446788999998876521 111 11111 367789999999999999999999
Q ss_pred eCCCCCCCCC
Q psy14887 187 SNGGDGRMSS 196 (238)
Q Consensus 187 ~~~g~g~~s~ 196 (238)
+..|.|..+.
T Consensus 78 ~~~~~~~~s~ 87 (93)
T cd00063 78 NGGGESPPSE 87 (93)
T ss_pred CCCccCCCcc
Confidence 9988877663
No 13
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=98.91 E-value=4.7e-08 Score=60.63 Aligned_cols=87 Identities=25% Similarity=0.424 Sum_probs=61.5
Q ss_pred CCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEE
Q psy14887 3 QVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVM 82 (238)
Q Consensus 3 p~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~ 82 (238)
|.+|.++.+......++.|+|+++.. ..+.+..|.+.++..... ..............+.+.+|.|++.|.++|+
T Consensus 1 p~~p~~~~~~~~~~~~~~v~W~~~~~----~~~~~~~y~v~~~~~~~~-~~~~~~~~~~~~~~~~i~~l~p~~~Y~~~v~ 75 (93)
T cd00063 1 PSPPTNLRVTDVTSTSVTLSWTPPED----DGGPITGYVVEYREKGSG-DWKEVEVTPGSETSYTLTGLKPGTEYEFRVR 75 (93)
T ss_pred CcCCCCcEEEEecCCEEEEEECCCCC----CCCcceeEEEEEeeCCCC-CCEEeeccCCcccEEEEccccCCCEEEEEEE
Confidence 45788888888888999999998742 114577899987654411 1111111112456789999999999999999
Q ss_pred EEeCCCCCCCCC
Q psy14887 83 AYNNAGEGPESE 94 (238)
Q Consensus 83 a~~~~g~~~~s~ 94 (238)
+.+..|.+.++.
T Consensus 76 a~~~~~~~~~s~ 87 (93)
T cd00063 76 AVNGGGESPPSE 87 (93)
T ss_pred EECCCccCCCcc
Confidence 998877776554
No 14
>KOG0196|consensus
Probab=98.88 E-value=2.9e-08 Score=81.84 Aligned_cols=95 Identities=25% Similarity=0.360 Sum_probs=75.4
Q ss_pred CCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEE
Q psy14887 3 QVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVM 82 (238)
Q Consensus 3 p~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~ 82 (238)
|++-.+++....+.++++|+|..| ...++.++.|.|+|......+...... ....+..++.+|+|++.|.|+|+
T Consensus 443 ps~V~~~r~~~~~~~sitlsW~~p----~~png~ildYEvky~ek~~~e~~~~~~--~t~~~~~ti~gL~p~t~YvfqVR 516 (996)
T KOG0196|consen 443 PSPVSVLRQVSRTSDSITLSWSEP----DQPNGVILDYEVKYYEKDEDERSYSTL--KTKTTTATITGLKPGTVYVFQVR 516 (996)
T ss_pred CCccceEEEeeeccCceEEecCCC----CCCCCcceeEEEEEeeccccccceeEE--ecccceEEeeccCCCcEEEEEEE
Confidence 445567888888999999999988 446688999999998776543333322 23445689999999999999999
Q ss_pred EEeCCCCCCCCCcEEeeccCc
Q psy14887 83 AYNNAGEGPESERYLERTYRK 103 (238)
Q Consensus 83 a~~~~g~~~~s~~~~~~t~~~ 103 (238)
|+...|.|..+....+.|...
T Consensus 517 arT~aG~G~~S~~~~fqT~~~ 537 (996)
T KOG0196|consen 517 ARTAAGYGPYSGKHEFQTLPS 537 (996)
T ss_pred EecccCCCCCCCceeeeecCc
Confidence 999999999999888888753
No 15
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=98.77 E-value=2.6e-07 Score=55.71 Aligned_cols=80 Identities=23% Similarity=0.434 Sum_probs=51.2
Q ss_pred CCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEe
Q psy14887 108 PPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFS 187 (238)
Q Consensus 108 ~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~ 187 (238)
+|.++.+......++.++|.+ +....+ .+|.+.|......................+.+.+|.|++.|.|+|+|.+
T Consensus 3 ~p~~~~~~~~~~~~~~v~W~~--~~~~~~--~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~~~~~Y~v~v~a~~ 78 (83)
T smart00060 3 PPSNLRVTDVTSTSVTLSWEP--PPDDGI--TGYIVGYRVEYREEGSSWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVN 78 (83)
T ss_pred CCCcEEEEEEeCCEEEEEECC--CCCCCC--CccEEEEEEEEecCCCccEEEEecCCccEEEEeCcCCCCEEEEEEEEEc
Confidence 344566666667799999993 333222 5677776544332211111222223357899999999999999999999
Q ss_pred CCCC
Q psy14887 188 NGGD 191 (238)
Q Consensus 188 ~~g~ 191 (238)
..|.
T Consensus 79 ~~g~ 82 (83)
T smart00060 79 GAGE 82 (83)
T ss_pred ccCC
Confidence 7553
No 16
>KOG4258|consensus
Probab=98.64 E-value=8.8e-07 Score=73.68 Aligned_cols=137 Identities=22% Similarity=0.232 Sum_probs=88.8
Q ss_pred ceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeC--------CC--c---CceEEEe-ec------cccceEE
Q psy14887 8 GVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKED--------NP--E---ETAVYYL-SR------TTRNWAL 67 (238)
Q Consensus 8 ~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~--------~~--~---~~~~~~~-~~------~~~~~~~ 67 (238)
++..+..+.+++.++|..-... .--...+|.+.|...- +. + .+....+ .+ .....+.
T Consensus 493 ~~~~~~~~~dsi~lrW~~~~~~---d~r~llg~~~~yKEaP~qNvT~~dg~~aCg~~~W~~~~v~~~~~~p~~~~~~~~~ 569 (1025)
T KOG4258|consen 493 QFSSTVTSADSILLRWERYQPP---DMRDLLGFLLHYKEAPFQNVTEEDGRDACGSNSWNVVDVDPPDLIPNDGTHPGFL 569 (1025)
T ss_pred eeeeEEeecceeEEEecccCCc---chhhhheeeEeeccCCccccceecCccccccCcceEEeccCCcCCCcccccccee
Confidence 3445556789999999765321 0113567888773211 11 1 1111111 11 1223568
Q ss_pred EecCCCCCEEEEEEEEEeC--CCCC--CCCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEE
Q psy14887 68 IVGLQPDTKYYVKVMAYNN--AGEG--PESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKV 143 (238)
Q Consensus 68 ~~~L~p~~~Y~~~v~a~~~--~g~~--~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v 143 (238)
+.+|+|+|.|-+-|++... .+.. ..|+..-++|..+.| .+|.++-....+++.+.|+|. +|.++||.+..|.|
T Consensus 570 l~~LkP~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~P-spPl~~ls~snsSsqi~l~W~--pP~~pNG~lt~Ylv 646 (1025)
T KOG4258|consen 570 LDGLKPWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIP-SPPLDVLSKSNSSSQILLKWK--PPSQPNGNLTHYLV 646 (1025)
T ss_pred hhcCCccceeEEEEeeeehhhhccccccccceEEEEecCCCC-CCcchhhhccCcchheeEEec--CCCCCCCceeEEEE
Confidence 8899999999999988732 2221 345666678887777 566666656666778999999 88999999999999
Q ss_pred EEEEcCC
Q psy14887 144 RVWEVDQ 150 (238)
Q Consensus 144 ~~~~~~~ 150 (238)
.|.+...
T Consensus 647 ~wer~~~ 653 (1025)
T KOG4258|consen 647 VWERQAE 653 (1025)
T ss_pred EEEeccC
Confidence 9876544
No 17
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=98.49 E-value=4e-06 Score=50.34 Aligned_cols=78 Identities=24% Similarity=0.321 Sum_probs=49.9
Q ss_pred CCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCc--CceEEEeeccccceEEEecCCCCCEEEEEE
Q psy14887 4 VAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPE--ETAVYYLSRTTRNWALIVGLQPDTKYYVKV 81 (238)
Q Consensus 4 ~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~--~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v 81 (238)
.+|.++.+......++.++|+++... . . .+|.+.|....... .... .........+.+.+|.|++.|.++|
T Consensus 2 ~~p~~~~~~~~~~~~~~v~W~~~~~~--~---~-~~y~~~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~L~~~~~Y~v~v 74 (83)
T smart00060 2 SPPSNLRVTDVTSTSVTLSWEPPPDD--G---I-TGYIVGYRVEYREEGSSWKE-VNVTPSSTSYTLTGLKPGTEYEFRV 74 (83)
T ss_pred CCCCcEEEEEEeCCEEEEEECCCCCC--C---C-CccEEEEEEEEecCCCccEE-EEecCCccEEEEeCcCCCCEEEEEE
Confidence 45667888777777999999976431 1 1 45666554332221 1111 1111224578999999999999999
Q ss_pred EEEeCCC
Q psy14887 82 MAYNNAG 88 (238)
Q Consensus 82 ~a~~~~g 88 (238)
++.+..|
T Consensus 75 ~a~~~~g 81 (83)
T smart00060 75 RAVNGAG 81 (83)
T ss_pred EEEcccC
Confidence 9998654
No 18
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=98.10 E-value=1.4e-05 Score=51.31 Aligned_cols=83 Identities=25% Similarity=0.394 Sum_probs=52.1
Q ss_pred CCCcceEEEeeCCcEEEEEEeccCC----CCCCC------CeeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCC
Q psy14887 107 KPPSAVHIKGINPTTVHVTWRYVSP----SFEEE------PIIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPG 176 (238)
Q Consensus 107 ~~p~~~~~~~~~~~~~~l~W~~~~~----~~~~~------~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~ 176 (238)
+||. +.+ ....+.+.|.+.++.. ...+. .-..|.|.|++.+... ...........+.+.+|.|+
T Consensus 4 gPP~-v~v-~~~~~~l~V~i~~P~~~~~~~~~~~~l~~~~~~~~Y~v~~~~~~~~~----~~~~~~~~~~~~~l~~L~p~ 77 (106)
T PF09294_consen 4 GPPS-VNV-SSCGGSLHVTIKPPMTPLRAGGKNSSLRDIYPSLSYNVSYWKNGSNE----KKKEIETKNSSVTLSDLKPG 77 (106)
T ss_dssp -SSE-EEE-EEETTEEEEEEEESEEEEECSSSEEEHHHHHGG-EEEEEEEETTTSC----EEEEEESSSEEEEEES--TT
T ss_pred cCCE-EEE-EECCCEEEEEEECCCcccccCCCCCcHHHhCCCeEEEEEEEeCCCcc----ceEEEeecCCEEEEeCCCCC
Confidence 4554 776 5567789999995320 01110 1256999999987762 13344566778899999999
Q ss_pred ceEEEEEEEEeC--CCCCCCC
Q psy14887 177 KRYHMRVLAFSN--GGDGRMS 195 (238)
Q Consensus 177 t~Y~~~V~a~~~--~g~g~~s 195 (238)
+.|.|+|++... ...|.+|
T Consensus 78 t~YCv~V~~~~~~~~~~s~~S 98 (106)
T PF09294_consen 78 TNYCVSVQAFSPSQNKNSQPS 98 (106)
T ss_dssp SEEEEEEEEEECSSTEEEEEB
T ss_pred CCEEEEEEEEeccCCCcCCCC
Confidence 999999999433 2245665
No 19
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=97.87 E-value=0.00017 Score=46.23 Aligned_cols=89 Identities=22% Similarity=0.260 Sum_probs=54.1
Q ss_pred CCCcceEEeeeccceEEEEeeecccc----ccc----ccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCC
Q psy14887 4 VAPQGVFSRGFNSTAINVTWNPIELI----REN----IRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDT 75 (238)
Q Consensus 4 ~~P~~~~~~~~~~~~~~l~W~~~~~~----~~~----~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~ 75 (238)
+||. +.+ ....+.+.+.+.+|... ... .-+.-..|.|.||+.+..... .........+.+.+|.|++
T Consensus 4 gPP~-v~v-~~~~~~l~V~i~~P~~~~~~~~~~~~l~~~~~~~~Y~v~~~~~~~~~~~---~~~~~~~~~~~l~~L~p~t 78 (106)
T PF09294_consen 4 GPPS-VNV-SSCGGSLHVTIKPPMTPLRAGGKNSSLRDIYPSLSYNVSYWKNGSNEKK---KEIETKNSSVTLSDLKPGT 78 (106)
T ss_dssp -SSE-EEE-EEETTEEEEEEEESEEEEECSSSEEEHHHHHGG-EEEEEEEETTTSCEE---EEEESSSEEEEEES--TTS
T ss_pred cCCE-EEE-EECCCEEEEEEECCCcccccCCCCCcHHHhCCCeEEEEEEEeCCCccce---EEEeecCCEEEEeCCCCCC
Confidence 4565 888 66888999999988610 000 001135799999988776211 2233445567999999999
Q ss_pred EEEEEEEEEeCC--CCCCCCCcEE
Q psy14887 76 KYYVKVMAYNNA--GEGPESERYL 97 (238)
Q Consensus 76 ~Y~~~v~a~~~~--g~~~~s~~~~ 97 (238)
.|-++|++.... ..|.+|+..-
T Consensus 79 ~YCv~V~~~~~~~~~~s~~S~~~C 102 (106)
T PF09294_consen 79 NYCVSVQAFSPSQNKNSQPSEPQC 102 (106)
T ss_dssp EEEEEEEEEECSSTEEEEEBSEEE
T ss_pred CEEEEEEEEeccCCCcCCCCCCEe
Confidence 999999994322 2355555443
No 20
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=97.85 E-value=0.00017 Score=46.25 Aligned_cols=82 Identities=20% Similarity=0.155 Sum_probs=52.8
Q ss_pred CCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCC--CCCEEEEE
Q psy14887 3 QVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQ--PDTKYYVK 80 (238)
Q Consensus 3 p~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~--p~~~Y~~~ 80 (238)
..+|+|+++.+.+-. ..|.|+++... ..-..|.|+|+...+........-.....+.+.++... ....|.++
T Consensus 22 lp~P~nv~~~s~nf~-~iL~W~~~~~~-----~~~~~ytVq~~~~~~~~W~~v~~C~~i~~~~Cdlt~~~~~~~~~Y~~r 95 (107)
T PF01108_consen 22 LPAPQNVTVDSVNFK-HILRWDPGPGS-----PPNVTYTVQYKKYGSSSWKDVPGCQNITETSCDLTDETSDPSESYYAR 95 (107)
T ss_dssp GSSCEEEEEEEETTE-EEEEEEESTTS-----SSTEEEEEEEEESSTSCEEEECCEEEESSSEEECTTCCTTTTSEEEEE
T ss_pred CCCCCeeEEEEECCc-eEEEeCCCCCC-----CCCeEEEEEEEecCCcceeeccceecccccceeCcchhhcCcCCEEEE
Confidence 457999999886555 57999995421 34568999987333333222211122334567777633 68899999
Q ss_pred EEEEeCCCCC
Q psy14887 81 VMAYNNAGEG 90 (238)
Q Consensus 81 v~a~~~~g~~ 90 (238)
|+|..+...+
T Consensus 96 V~A~~~~~~S 105 (107)
T PF01108_consen 96 VRAEVGNQTS 105 (107)
T ss_dssp EEEEETTEEE
T ss_pred EEEEeCCccC
Confidence 9998766544
No 21
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=97.78 E-value=0.0013 Score=49.68 Aligned_cols=122 Identities=18% Similarity=0.284 Sum_probs=72.2
Q ss_pred ceEEEecCCCCCEEEEEEEEEeCCCCCCCCCcEE-eeccCc--CCCCCCcceEEEe----eCCcEEEEEEeccCCCCCCC
Q psy14887 64 NWALIVGLQPDTKYYVKVMAYNNAGEGPESERYL-ERTYRK--APQKPPSAVHIKG----INPTTVHVTWRYVSPSFEEE 136 (238)
Q Consensus 64 ~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~-~~t~~~--~p~~~p~~~~~~~----~~~~~~~l~W~~~~~~~~~~ 136 (238)
..+.+.|+.|+..|.+++...+.... .....+. ..+... .-|.-|.+..+.. ...++++|.|.+ ..+.+
T Consensus 128 ~~f~l~~~~~g~~Yliri~~~~~~e~-~~~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lrtC~SvTIAW~~--s~d~~- 203 (300)
T PF10179_consen 128 RHFRLSGVKPGERYLIRIQISNSDEG-PSTFKVQAASTNPSKQPYPQLPDDTSIKEFNKLRTCNSVTIAWLG--SPDRS- 203 (300)
T ss_pred EEEEECCCCCCCeEEEEEEccCCCCC-ceEEEEEEecCCcccCCCCCCCCCCceeEEcCCcccceEEEEEec--CCCCC-
Confidence 45888999999999999977654332 2222222 223221 1245677766643 346899999994 33322
Q ss_pred CeeEEEEEEEEcCCC--C--------CC-------Cceeee----e--------cCCcceEEecCCCCCceEEEEEEEEe
Q psy14887 137 PIIGYKVRVWEVDQD--M--------SR-------ANDTII----P--------VGNTLDADIVDLSPGKRYHMRVLAFS 187 (238)
Q Consensus 137 ~i~~y~v~~~~~~~~--~--------~~-------~~~~~~----~--------~~~~~~~~i~~L~p~t~Y~~~V~a~~ 187 (238)
..|-|+.++.... + .+ ..+.+. . ...-...+|.+|+||+.|.|.|.+.-
T Consensus 204 --~kYCvy~~~~~~~~~~~~~~~~~n~C~~~~sr~k~e~v~Ck~~~~~n~~~~~~~~v~tetI~~L~PG~~Yl~dV~~~~ 281 (300)
T PF10179_consen 204 --IKYCVYRREEHSNYQERSVSRMPNQCLGPESRKKSEKVLCKYFHSPNSSEDPQRAVTTETIKGLKPGTTYLFDVYVNG 281 (300)
T ss_pred --ceEEEEEEEecCchhhhhhcccCccCCCCCccccceEEEEEEEcCCccccccccccceeecccCCCCcEEEEEEEEec
Confidence 4677766544333 0 00 111111 1 11113457999999999999999987
Q ss_pred CCCC
Q psy14887 188 NGGD 191 (238)
Q Consensus 188 ~~g~ 191 (238)
..|.
T Consensus 282 ~~G~ 285 (300)
T PF10179_consen 282 PSGQ 285 (300)
T ss_pred CCCc
Confidence 6554
No 22
>KOG4367|consensus
Probab=97.77 E-value=8.2e-05 Score=57.67 Aligned_cols=97 Identities=20% Similarity=0.311 Sum_probs=68.9
Q ss_pred CcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEE
Q psy14887 102 RKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHM 181 (238)
Q Consensus 102 ~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~ 181 (238)
..+|..|...+.--....+++++.|+ .|.....+..||.++.-..+.. + -..+..+..+..++.+|.-++.|..
T Consensus 435 s~vpatpilq~~ec~t~nns~t~~wk--qp~~~~~~~dg~~leld~g~~g-~---frevy~g~etmctvdglhfns~y~a 508 (699)
T KOG4367|consen 435 SPVPATPILQLEECCTHNNSATLSWK--QPPLSTVPADGYILELDDGNGG-Q---FREVYVGKETMCTVDGLHFNSTYNA 508 (699)
T ss_pred CCCCCCceeehhhhhccCCceEEEee--cCCCCCCCCcceEEEeecCCCC-c---eeEEEecCceeEEecceecchhHHH
Confidence 33454343333333456778999999 4555566788999886544222 1 1334568889999999999999999
Q ss_pred EEEEEeCCCCCCCCCCcEEEeecc
Q psy14887 182 RVLAFSNGGDGRMSSPAITFQMGE 205 (238)
Q Consensus 182 ~V~a~~~~g~g~~s~~~~~~~t~~ 205 (238)
+|.++|..|.++.|+. +..+|..
T Consensus 509 rvka~n~tg~s~ys~t-l~lqts~ 531 (699)
T KOG4367|consen 509 RVKAFNKTGVSPYSKT-LVLQTSE 531 (699)
T ss_pred HHHHhhccCCCcccce-eEeeecc
Confidence 9999999999999964 5566544
No 23
>KOG4152|consensus
Probab=97.48 E-value=0.0017 Score=52.04 Aligned_cols=123 Identities=22% Similarity=0.291 Sum_probs=78.2
Q ss_pred ecCCCCCEEEEEEEEEeCCCCCCCCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCC-CCCCCeeEEEEEEEE
Q psy14887 69 VGLQPDTKYYVKVMAYNNAGEGPESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPS-FEEEPIIGYKVRVWE 147 (238)
Q Consensus 69 ~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~-~~~~~i~~y~v~~~~ 147 (238)
..|-+++.|.|+|.+.|+.|.++++....+.|...-.+..|..+++..... .+.+.|. ++. ...|.|..|..+..-
T Consensus 652 ~~lv~Gq~yrfrV~aIng~G~gp~s~i~~~kTc~pG~P~apS~~ri~k~~e-Gi~l~we--ppt~p~sg~Iieys~ylAi 728 (830)
T KOG4152|consen 652 TSLVTGQAYRFRVTAINGKGPGPASTILKLKTCAPGKPTAPSGARIKKTIE-GISLVWE--PPTKPGSGTIIEYSPYLAI 728 (830)
T ss_pred cccccccceeeeeeeeeccCCCchhhheeeeeccCCCCCCccccccccccc-ceeeccc--CCCCCCCcceEEeehhhHh
Confidence 357889999999999999999999998888887544446676666544322 5899999 444 446888888876421
Q ss_pred cCC---C--CCCCceeeeecCCcceEE---------ecCCCCCceEEEEEEEEeCCCCCCCC
Q psy14887 148 VDQ---D--MSRANDTIIPVGNTLDAD---------IVDLSPGKRYHMRVLAFSNGGDGRMS 195 (238)
Q Consensus 148 ~~~---~--~~~~~~~~~~~~~~~~~~---------i~~L~p~t~Y~~~V~a~~~~g~g~~s 195 (238)
..+ . .+... -....+..+... +.+.--+-.-.|+|.|.|..|.|+..
T Consensus 729 ~tq~~~~~~sql~f-mr~ycg~~t~c~vt~g~l~tA~~d~t~~paivfri~aknekGyGpat 789 (830)
T KOG4152|consen 729 ITQAGATGVSQLPF-MRSYCGIITTCVVTSGKLITAVPDITTGPAIVFRIVAKNEKGYGPAT 789 (830)
T ss_pred hhhhhccCccccce-eeeecccceeeEEecccceeeccccccCcceEEEEEecCCCCCCcce
Confidence 111 1 11000 011112222211 12333456678999999999988864
No 24
>KOG3632|consensus
Probab=97.30 E-value=0.0014 Score=56.31 Aligned_cols=173 Identities=21% Similarity=0.315 Sum_probs=103.2
Q ss_pred CCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEE
Q psy14887 3 QVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVM 82 (238)
Q Consensus 3 p~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~ 82 (238)
+..|+++++..++.++..++|-+... + .-.+.|-...+.. ........+++-+|.|++.|..+|-
T Consensus 672 ~vvpsqlrv~n~tqtSa~itwvp~ns-----n----~~Hviyln~eE~~------ps~a~~y~ytf~~lrpgt~y~a~ve 736 (1335)
T KOG3632|consen 672 PVVPSQLRVWNATQTSAMITWVPFNS-----N----FLHVIYLNAEEPR------PSVAEMYNYTFMRLRPGTDYWASVE 736 (1335)
T ss_pred cccchhhhhhhhhchhhheeeeecCC-----C----cceeeecCCccCC------CchhhhhHHHHhccCCCCccceecc
Confidence 57899999999999999999987431 1 2233332221111 0112334478889999999999998
Q ss_pred EEeCCC---------CCCCCCcEEeeccCcCCCCCCcceEEEe-eCCcEEEEEEeccCC---CCCCC-CeeEEEEEEEEc
Q psy14887 83 AYNNAG---------EGPESERYLERTYRKAPQKPPSAVHIKG-INPTTVHVTWRYVSP---SFEEE-PIIGYKVRVWEV 148 (238)
Q Consensus 83 a~~~~g---------~~~~s~~~~~~t~~~~p~~~p~~~~~~~-~~~~~~~l~W~~~~~---~~~~~-~i~~y~v~~~~~ 148 (238)
+..... .+.-.....+.|...-|+.+|.++.+.. ..++.+.++|.++.- ...|| .+++|-|+..
T Consensus 737 a~~p~q~pwdl~~v~~etr~atv~f~tLpAGppappldV~vE~g~spg~l~vswrPptldsag~sngv~vtgYavyad-- 814 (1335)
T KOG3632|consen 737 AALPRQEPWDLRMVPMETRQATVLFRTLPAGPPAPPLDVKVETGGSPGRLEVSWRPPTLDSAGCSNGVAVTGYAVYAD-- 814 (1335)
T ss_pred cccCcCCCcccccchhhhhccceeeecccCCCCCCchheeeecCCCCceeeeeccCceeccccccCceeeeeeeeeeC--
Confidence 875411 1112334667787777888999988854 567789999996422 12233 6888988643
Q ss_pred CCCCCCCceeeeecCCcceEEecCCCCC-ceEEEEEEEEeCCCCCCCC
Q psy14887 149 DQDMSRANDTIIPVGNTLDADIVDLSPG-KRYHMRVLAFSNGGDGRMS 195 (238)
Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~i~~L~p~-t~Y~~~V~a~~~~g~g~~s 195 (238)
+. ...+........+.+.+..|+-+ +-=.|.|+.-...|++-.|
T Consensus 815 gq---kv~Evafptagst~VelsQlq~~l~~~~V~vRtms~~gesvds 859 (1335)
T KOG3632|consen 815 GQ---KVEEVAFPTAGSTKVELSQLQDGLYHGAVGVRTMSVPGESVDS 859 (1335)
T ss_pred Cc---eeeeeecccCCceEEEeeeehhhheecceeEEecccCcccccc
Confidence 11 11112222233445556666542 2334555555555654443
No 25
>PF09067 EpoR_lig-bind: Erythropoietin receptor, ligand binding; InterPro: IPR015152 Members of this entry include the growth hormone and erythropoietin receptors. The latter interacts with erythropoietin (EPO), with subsequent initiation of the downstream chain of events associated with binding of EPO to the receptor, including EPO-induced erythroblast proliferation and differentiation through induction of the JAK2/STAT5 signalling cascade. The domain adopts a secondary structure composed of a short amino-terminal helix, followed by two beta-sandwich regions []. ; PDB: 3NCB_B 3NCF_B 3NCE_B 3N0P_B 3NCC_B 1BP3_B 3N06_B 3MZG_B 3D48_R 1F6F_C ....
Probab=97.12 E-value=0.0056 Score=38.82 Aligned_cols=83 Identities=17% Similarity=0.127 Sum_probs=52.7
Q ss_pred CCCCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccce-----EEEe--cCCC
Q psy14887 1 MPQVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNW-----ALIV--GLQP 73 (238)
Q Consensus 1 ~pp~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~-----~~~~--~L~p 73 (238)
+||.-|.++.+.+-....++..|++... .+.+ ..|++.|+...+.... -.........+ +.+. +..-
T Consensus 6 ~~p~~P~~~~C~S~~~etftC~W~~g~~----~~l~-~~y~L~Y~~~~~~~~e-Cp~~~~~~~ns~~~~~C~F~~~~t~l 79 (104)
T PF09067_consen 6 LPPEKPENLKCFSREMETFTCFWEPGSE----GNLP-TNYTLFYKKEGEEWKE-CPDYSTSGPNSTVRHICYFPKSDTSL 79 (104)
T ss_dssp HHCCCCEEEEEEBSSSS-EEEEEEEESS----STST-CEEEEEEEETTSEEEE-ESESSTTETTEEEEEEEEE-CCGCSS
T ss_pred cCCCCCccCccCCCCCCcEEEEeeCCCC----CCCC-CcEEEEEEeCCCCCcc-CCCeEecCCCCceeEEEEcCCCCeEE
Confidence 3788999999999899999999999753 2222 2399988766521111 00111112222 4444 5677
Q ss_pred CCEEEEEEEEEeCCCC
Q psy14887 74 DTKYYVKVMAYNNAGE 89 (238)
Q Consensus 74 ~~~Y~~~v~a~~~~g~ 89 (238)
.+.|.++|.+.+..|.
T Consensus 80 f~~y~i~V~a~~~~~~ 95 (104)
T PF09067_consen 80 FVPYCIQVEATNALGS 95 (104)
T ss_dssp SSEEEEEEEEEETTEE
T ss_pred EEEEEEEEEeccCCCc
Confidence 8999999999987764
No 26
>KOG4367|consensus
Probab=96.83 E-value=0.0012 Score=51.56 Aligned_cols=82 Identities=15% Similarity=0.187 Sum_probs=60.3
Q ss_pred eccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEEEEeCCCCCCCC
Q psy14887 14 FNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVMAYNNAGEGPES 93 (238)
Q Consensus 14 ~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s 93 (238)
....+++++|+.|. ....+..+|.++....++. ...-+.....+-++++||.-++.|..+|.+.|..|.++++
T Consensus 450 t~nns~t~~wkqp~----~~~~~~dg~~leld~g~~g---~frevy~g~etmctvdglhfns~y~arvka~n~tg~s~ys 522 (699)
T KOG4367|consen 450 THNNSATLSWKQPP----LSTVPADGYILELDDGNGG---QFREVYVGKETMCTVDGLHFNSTYNARVKAFNKTGVSPYS 522 (699)
T ss_pred ccCCceEEEeecCC----CCCCCCcceEEEeecCCCC---ceeEEEecCceeEEecceecchhHHHHHHHhhccCCCccc
Confidence 34577999999653 2335678999985332222 2223344556779999999999999999999999999999
Q ss_pred CcEEeeccC
Q psy14887 94 ERYLERTYR 102 (238)
Q Consensus 94 ~~~~~~t~~ 102 (238)
+.....|.+
T Consensus 523 ~tl~lqts~ 531 (699)
T KOG4367|consen 523 KTLVLQTSE 531 (699)
T ss_pred ceeEeeecc
Confidence 887776654
No 27
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=96.75 E-value=0.16 Score=38.65 Aligned_cols=28 Identities=32% Similarity=0.574 Sum_probs=23.8
Q ss_pred cccceEEEecCCCCCEEEEEEEEEeCCC
Q psy14887 61 TTRNWALIVGLQPDTKYYVKVMAYNNAG 88 (238)
Q Consensus 61 ~~~~~~~~~~L~p~~~Y~~~v~a~~~~g 88 (238)
...+.+++.+|+|++.|.|.|.++|...
T Consensus 12 g~~t~~t~~~L~p~t~YyfdVF~vn~~~ 39 (300)
T PF10179_consen 12 GQKTNQTLSGLKPDTTYYFDVFVVNQLT 39 (300)
T ss_pred CCCceEEeccCCCCCeEEEEEEEEECCC
Confidence 3456789999999999999999998743
No 28
>COG3401 Fibronectin type 3 domain-containing protein [General function prediction only]
Probab=96.62 E-value=0.1 Score=39.86 Aligned_cols=162 Identities=14% Similarity=0.031 Sum_probs=85.0
Q ss_pred cceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEEEEeCCCCCCCCCc
Q psy14887 16 STAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVMAYNNAGEGPESER 95 (238)
Q Consensus 16 ~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~ 95 (238)
+..+.+.|.+..| .++-.|.|+....++. ............-.+.-.++.++..|...|.+.+..+.-+.+..
T Consensus 83 ~~~~~~~w~~~~d------~~~~~Y~i~~~~gD~~-f~r~~~~~n~l~~~~i~s~~~~~~~~~~~Iia~~f~~~~sfsf~ 155 (343)
T COG3401 83 PKSVKVFWSPHPD------VSVGKYIIQRQNGDGK-FLRTGLVKNRLFVEFIDSDLGHNEKYMELIIAADFQMGKSFSFT 155 (343)
T ss_pred cceeeecccccCC------CCCCeEEEEEecCchh-hhhhhHHHhccchhheecccccccceeeeEEeecccccceeeee
Confidence 4558999998765 4566899985443333 22222222233344566689999999999999876665333311
Q ss_pred EEeeccCcCCCCCCc--ceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCC
Q psy14887 96 YLERTYRKAPQKPPS--AVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDL 173 (238)
Q Consensus 96 ~~~~t~~~~p~~~p~--~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L 173 (238)
..--+... .|+. ++.+.....+.+.|+|+. ++..+ .|.|+ .....++... .-.....+.......+|
T Consensus 156 gVE~~~~~---~P~ei~~~~~~~d~~~~i~ls~dg--~~~~~----yy~IY-~~~~g~e~~~-~ia~t~~n~y~d~~egl 224 (343)
T COG3401 156 GVEATPKA---EPKEITNVRVSFDLGNNIELSEDG--SEAED----YYRIY-ASDSGNEEYG-FIAQTTENSYYDVKEGL 224 (343)
T ss_pred eeeccccc---CCceeeeeeeecCCCCcceeeccC--ccccc----eEEEe-ccCCcccccc-ceeeccccchhhhhhcc
Confidence 11112111 2222 233334456678999984 22222 66664 3333332211 11111122223334555
Q ss_pred CCCceEEEEEEEEeC-CCCCCCCCC
Q psy14887 174 SPGKRYHMRVLAFSN-GGDGRMSSP 197 (238)
Q Consensus 174 ~p~t~Y~~~V~a~~~-~g~g~~s~~ 197 (238)
|..|...|.+... ..++..+++
T Consensus 225 --ga~~~y~VTtVd~~~~es~lp~~ 247 (343)
T COG3401 225 --GAVEYYKVTTVDNTGFESDLPNE 247 (343)
T ss_pred --CceeEEEEEEEcCCcceeccCCc
Confidence 6667777777764 446666653
No 29
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=96.59 E-value=0.016 Score=37.15 Aligned_cols=81 Identities=19% Similarity=0.216 Sum_probs=51.8
Q ss_pred CCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCcee-eeecCCcceEEecCCC--CCceEEE
Q psy14887 105 PQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDT-IIPVGNTLDADIVDLS--PGKRYHM 181 (238)
Q Consensus 105 p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~L~--p~t~Y~~ 181 (238)
...+|.++++...+- ...|+|++ ......-..|.|+|+..+... +... ...+-..++..++... +...|.+
T Consensus 21 ~lp~P~nv~~~s~nf-~~iL~W~~---~~~~~~~~~ytVq~~~~~~~~--W~~v~~C~~i~~~~Cdlt~~~~~~~~~Y~~ 94 (107)
T PF01108_consen 21 SLPAPQNVTVDSVNF-KHILRWDP---GPGSPPNVTYTVQYKKYGSSS--WKDVPGCQNITETSCDLTDETSDPSESYYA 94 (107)
T ss_dssp SGSSCEEEEEEEETT-EEEEEEEE---STTSSSTEEEEEEEEESSTSC--EEEECCEEEESSSEEECTTCCTTTTSEEEE
T ss_pred cCCCCCeeEEEEECC-ceEEEeCC---CCCCCCCeEEEEEEEecCCcc--eeeccceecccccceeCcchhhcCcCCEEE
Confidence 345888999887654 58899994 122234578999999443332 2111 1223345677776544 6889999
Q ss_pred EEEEEeCCCC
Q psy14887 182 RVLAFSNGGD 191 (238)
Q Consensus 182 ~V~a~~~~g~ 191 (238)
+|+|...+..
T Consensus 95 rV~A~~~~~~ 104 (107)
T PF01108_consen 95 RVRAEVGNQT 104 (107)
T ss_dssp EEEEEETTEE
T ss_pred EEEEEeCCcc
Confidence 9999976543
No 30
>PF09067 EpoR_lig-bind: Erythropoietin receptor, ligand binding; InterPro: IPR015152 Members of this entry include the growth hormone and erythropoietin receptors. The latter interacts with erythropoietin (EPO), with subsequent initiation of the downstream chain of events associated with binding of EPO to the receptor, including EPO-induced erythroblast proliferation and differentiation through induction of the JAK2/STAT5 signalling cascade. The domain adopts a secondary structure composed of a short amino-terminal helix, followed by two beta-sandwich regions []. ; PDB: 3NCB_B 3NCF_B 3NCE_B 3N0P_B 3NCC_B 1BP3_B 3N06_B 3MZG_B 3D48_R 1F6F_C ....
Probab=96.54 E-value=0.027 Score=35.76 Aligned_cols=81 Identities=16% Similarity=0.126 Sum_probs=52.7
Q ss_pred CCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCCcce-----EEe--cCCCCCc
Q psy14887 105 PQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGNTLD-----ADI--VDLSPGK 177 (238)
Q Consensus 105 p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~-----~~i--~~L~p~t 177 (238)
|+..|.++.+...+-.+++..|+ +....+.. ..|.+.|...+... ..+-.......++ ..+ .+..-.+
T Consensus 7 ~p~~P~~~~C~S~~~etftC~W~--~g~~~~l~-~~y~L~Y~~~~~~~--~eCp~~~~~~~ns~~~~~C~F~~~~t~lf~ 81 (104)
T PF09067_consen 7 PPEKPENLKCFSREMETFTCFWE--PGSEGNLP-TNYTLFYKKEGEEW--KECPDYSTSGPNSTVRHICYFPKSDTSLFV 81 (104)
T ss_dssp HCCCCEEEEEEBSSSS-EEEEEE--EESSSTST-CEEEEEEEETTSEE--EEESESSTTETTEEEEEEEEE-CCGCSSSS
T ss_pred CCCCCccCccCCCCCCcEEEEee--CCCCCCCC-CcEEEEEEeCCCCC--ccCCCeEecCCCCceeEEEEcCCCCeEEEE
Confidence 45788899999988899999999 44444422 34999998776221 1111111112223 344 3677889
Q ss_pred eEEEEEEEEeCCC
Q psy14887 178 RYHMRVLAFSNGG 190 (238)
Q Consensus 178 ~Y~~~V~a~~~~g 190 (238)
.|.++|.+.|..|
T Consensus 82 ~y~i~V~a~~~~~ 94 (104)
T PF09067_consen 82 PYCIQVEATNALG 94 (104)
T ss_dssp EEEEEEEEEETTE
T ss_pred EEEEEEEeccCCC
Confidence 9999999999876
No 31
>KOG4802|consensus
Probab=96.03 E-value=0.15 Score=40.04 Aligned_cols=58 Identities=22% Similarity=0.293 Sum_probs=41.0
Q ss_pred eeecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeecccccCCCCCceee
Q psy14887 159 IIPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMGEEKEFKSGSPALY 217 (238)
Q Consensus 159 ~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~~~~~~~p~~~~~~ 217 (238)
+......+.+.+..+.||.=|+|+|.|.|..|.-.+++|..-|.....+. .|.+|...
T Consensus 202 v~~t~~e~~~~~t~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pk-aPp~P~dl 259 (516)
T KOG4802|consen 202 VEKTMEENTYIFTDMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPK-APPSPNDL 259 (516)
T ss_pred eeecCCCceeeeeecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCC-CCcCcccc
Confidence 34444555788899999999999999999988766666666676664443 44444444
No 32
>PF09240 IL6Ra-bind: Interleukin-6 receptor alpha chain, binding; InterPro: IPR015321 Members of this entry adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6 []. ; PDB: 1N26_A 1P9M_C 3LB6_C 1PVH_A 3L5H_A 1BQU_A 1I1R_A 3QT2_B 3BPN_C 3BPO_C ....
Probab=95.34 E-value=0.38 Score=30.22 Aligned_cols=80 Identities=24% Similarity=0.225 Sum_probs=49.4
Q ss_pred CCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCc-eEEEeec-cccceEEEecCCC----CCEEE
Q psy14887 5 APQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEET-AVYYLSR-TTRNWALIVGLQP----DTKYY 78 (238)
Q Consensus 5 ~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~L~p----~~~Y~ 78 (238)
+|+|+.+.-.+...+..+|.+.... ..-..|.+.||........ -...... .....+.+..... ...|.
T Consensus 1 ~~~nlsC~~~~~~~m~CtW~~g~~~-----~~~t~y~L~~~~~~~~~~~eC~~y~~~~~~~~gC~~~~~~~~~~~~~~~~ 75 (99)
T PF09240_consen 1 KPQNLSCFIYNLEYMNCTWEPGKEA-----PPDTQYTLYYWYSPLEEEKECPHYSKDSGTRIGCQFPVSEIDSSEFSQYN 75 (99)
T ss_dssp S-EEEEEEEETTTEEEEEEECCTTC-----STTEEEEEEEEETTSSSEEEESEEEESTSSEEEEEEESCTT-TTTTSEEE
T ss_pred CCeeCEEEEECCEEEEEEECCCCCC-----CCcccEEEEEEcCCCCccccCCCccccCCceeEEEecCCCccccccceEE
Confidence 4789999888899999999986431 2335788888765432211 1112122 2244566665444 35788
Q ss_pred EEEEEEeCCCC
Q psy14887 79 VKVMAYNNAGE 89 (238)
Q Consensus 79 ~~v~a~~~~g~ 89 (238)
+.|.+.+..|.
T Consensus 76 v~V~~ss~~~~ 86 (99)
T PF09240_consen 76 VCVNGSSSAGS 86 (99)
T ss_dssp EEEEEEETTEE
T ss_pred EEEEeccCCCc
Confidence 99988876664
No 33
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=95.24 E-value=0.24 Score=28.35 Aligned_cols=27 Identities=30% Similarity=0.497 Sum_probs=20.2
Q ss_pred eEEecCCCCCceEEEEEEEEeCCCCCCC
Q psy14887 167 DADIVDLSPGKRYHMRVLAFSNGGDGRM 194 (238)
Q Consensus 167 ~~~i~~L~p~t~Y~~~V~a~~~~g~g~~ 194 (238)
.+.+.+|.|| .|+|+|+|.+..|....
T Consensus 30 ~~~~~~L~~G-~Y~l~V~a~~~~~~~~~ 56 (66)
T PF07495_consen 30 SISYTNLPPG-KYTLEVRAKDNNGKWSS 56 (66)
T ss_dssp EEEEES--SE-EEEEEEEEEETTS-B-S
T ss_pred EEEEEeCCCE-EEEEEEEEECCCCCcCc
Confidence 8899999877 79999999999776544
No 34
>KOG3632|consensus
Probab=95.09 E-value=0.049 Score=47.40 Aligned_cols=159 Identities=16% Similarity=0.129 Sum_probs=91.4
Q ss_pred CcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCC-CCEEEEEEEEE
Q psy14887 6 PQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQP-DTKYYVKVMAY 84 (238)
Q Consensus 6 P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p-~~~Y~~~v~a~ 84 (238)
|....+....+.++.+.|++|.. ...-++...|.+... .+.+. .......+...+++|.- ...|.+.|.+.
T Consensus 581 pr~~~~vkqla~sv~vawepps~--pP~~~~~~~~~~~v~--~elrq----~l~~gs~tka~~E~ld~~a~s~~isvq~l 652 (1335)
T KOG3632|consen 581 PRTGAAVKQLAGSVHVAWEPPSS--PPFTAPARMIGATVL--MELRQ----TLYAGSLTKAMQESLDNSAHSGYISVQRL 652 (1335)
T ss_pred ccchhhhhhhccceeeeccCCCC--CCccccceeeeeecc--hhhhh----hcccccchHHHHhhccccCCceeeehhhh
Confidence 34444445678889999999764 222344555655421 11110 01111222233333322 34578888888
Q ss_pred eCCCCCCCCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCC
Q psy14887 85 NNAGEGPESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGN 164 (238)
Q Consensus 85 ~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~ 164 (238)
...|....... .+-+...++ ..|.++++......+..++|-+ ...... -+.|-.. .+. ......
T Consensus 653 tSrGsqd~lrc-~Llvgg~a~-vvpsqlrv~n~tqtSa~itwvp--~nsn~~-----Hviyln~--eE~-----~ps~a~ 716 (1335)
T KOG3632|consen 653 TSRGSQDQLRC-ILLVGGAAP-VVPSQLRVWNATQTSAMITWVP--FNSNFL-----HVIYLNA--EEP-----RPSVAE 716 (1335)
T ss_pred hccCCCCccee-eEecccccc-ccchhhhhhhhhchhhheeeee--cCCCcc-----eeeecCC--ccC-----CCchhh
Confidence 88887655443 333444444 6788999999999999999983 222111 1222221 111 111234
Q ss_pred cceEEecCCCCCceEEEEEEEEeC
Q psy14887 165 TLDADIVDLSPGKRYHMRVLAFSN 188 (238)
Q Consensus 165 ~~~~~i~~L~p~t~Y~~~V~a~~~ 188 (238)
...+++.+|.|++.|.++|-+.-.
T Consensus 717 ~y~ytf~~lrpgt~y~a~vea~~p 740 (1335)
T KOG3632|consen 717 MYNYTFMRLRPGTDYWASVEAALP 740 (1335)
T ss_pred hhHHHHhccCCCCccceecccccC
Confidence 457789999999999999998776
No 35
>PLN02533 probable purple acid phosphatase
Probab=94.63 E-value=1.2 Score=36.33 Aligned_cols=87 Identities=17% Similarity=0.210 Sum_probs=49.8
Q ss_pred CCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCce--eeee-------cCCcceEEecCCCC
Q psy14887 105 PQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRAND--TIIP-------VGNTLDADIVDLSP 175 (238)
Q Consensus 105 p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~--~~~~-------~~~~~~~~i~~L~p 175 (238)
.+..|..+++.-.+++++.|+|... ... . -.|.|............ .... .+....+.|.+|+|
T Consensus 40 ~~~~P~qvhls~~~~~~m~V~W~T~--~~~-~----~~V~yG~~~~~l~~~a~g~~~~~~~~~~~~~g~iH~v~l~~L~p 112 (427)
T PLN02533 40 DPTHPDQVHISLVGPDKMRISWITQ--DSI-P----PSVVYGTVSGKYEGSANGTSSSYHYLLIYRSGQINDVVIGPLKP 112 (427)
T ss_pred CCCCCceEEEEEcCCCeEEEEEECC--CCC-C----CEEEEecCCCCCcceEEEEEEEEeccccccCCeEEEEEeCCCCC
Confidence 3456778888776788999999942 221 1 23556543322211100 0111 11223578999999
Q ss_pred CceEEEEEEEEeCCCCCCCCCCcEEEeecc
Q psy14887 176 GKRYHMRVLAFSNGGDGRMSSPAITFQMGE 205 (238)
Q Consensus 176 ~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~~ 205 (238)
+|.|..+|. ....| ....|+|.+
T Consensus 113 ~T~Y~Yrvg------~~~~s-~~~~F~T~p 135 (427)
T PLN02533 113 NTVYYYKCG------GPSST-QEFSFRTPP 135 (427)
T ss_pred CCEEEEEEC------CCCCc-cceEEECCC
Confidence 999999983 11223 457787653
No 36
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=93.76 E-value=0.91 Score=39.38 Aligned_cols=127 Identities=17% Similarity=0.182 Sum_probs=64.7
Q ss_pred cccceEEEecCCCCCEEEEEEEEEeCCCCC-CCCCcEEeecc-CcCCCCCCcceEEEee-CCcEEEEEEeccCCCCCCCC
Q psy14887 61 TTRNWALIVGLQPDTKYYVKVMAYNNAGEG-PESERYLERTY-RKAPQKPPSAVHIKGI-NPTTVHVTWRYVSPSFEEEP 137 (238)
Q Consensus 61 ~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~-~~s~~~~~~t~-~~~p~~~p~~~~~~~~-~~~~~~l~W~~~~~~~~~~~ 137 (238)
+....+.++++.+ ..|.++|+|.|..|.. .+.....+... ...| +|..+....+ ..-.+.+.|- .|. ..-.
T Consensus 657 t~~~~~~~~gi~~-GqY~i~VrAiN~~g~~~~~a~s~~f~i~g~~~P--pp~~~t~~a~~it~~~~l~v~--dPt-~~~d 730 (952)
T COG4733 657 TSAAGFDVEGIPA-GQYAIRVRAINVFEPNSPDATAYEFALNGKKVP--PPKAMIYDAVIITLVIRLVVG--DPT-GAVD 730 (952)
T ss_pred ccccceeecCcCc-cceEEEEEEeeccCCCCCCcceeEEEecCCCCC--CCcccccceEEEEeeeeEEEe--cCC-cceE
Confidence 3456689999988 6699999999988874 33323333222 2223 2322222111 1224667776 332 1112
Q ss_pred eeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCC
Q psy14887 138 IIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMS 195 (238)
Q Consensus 138 i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s 195 (238)
+..-.+++....+.+.. ....-........=.+|+++..|.|.+++.|..|...+.
T Consensus 731 ~~~sei~~s~~~d~~~~--Ar~LG~~~~~~~~~~~i~~g~~~~F~~R~Vn~vG~~~~~ 786 (952)
T COG4733 731 ITSTEIRSAVIADGNFQ--ARSLGNLNYPGLFSVGIQAGLTFWFRNRNVDLVGNNDKW 786 (952)
T ss_pred EeeeeeeeeccccchhH--HhhhhccccccccccCcCCCceEEEEeeecccccccccc
Confidence 22222222221111110 001101111111226999999999999999998875554
No 37
>COG3401 Fibronectin type 3 domain-containing protein [General function prediction only]
Probab=92.83 E-value=1.1 Score=34.57 Aligned_cols=150 Identities=16% Similarity=0.096 Sum_probs=88.4
Q ss_pred eeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEEEEeCCCCCCCCCcEEeeccCcCCCCCCcceEEE
Q psy14887 36 RLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVMAYNNAGEGPESERYLERTYRKAPQKPPSAVHIK 115 (238)
Q Consensus 36 ~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~ 115 (238)
.+.+|.+.....+.. ....-.......+++...+|.+...|.+.....+....+-.++.....+.-. .+...+...
T Consensus 4 ~~~~~~~~~t~~~~~-~~~i~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~i---~~~~~~~~~ 79 (343)
T COG3401 4 DIDGFVLYRTKKDLK-LKRIGTIKNKYQTHYYDEGLEGEESYPYQEGTTKVDKISYDSERILVKTSFI---ERVRSVFAS 79 (343)
T ss_pred cceeEEEEEEcccch-hhhhhccccchhhhhhhccccccCcceeeecccccceeeecCcceEEEeeec---cccccccch
Confidence 455666653222222 3333333444566788888999888888777666554455555555555411 111122222
Q ss_pred eeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCC
Q psy14887 116 GINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMS 195 (238)
Q Consensus 116 ~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s 195 (238)
..-+..+.+-|++. ++. ....|.|+....+.. ...-....++-...+....|.++..|+..|.+-+..+.=..|
T Consensus 80 ~~~~~~~~~~w~~~-~d~---~~~~Y~i~~~~gD~~--f~r~~~~~n~l~~~~i~s~~~~~~~~~~~Iia~~f~~~~sfs 153 (343)
T COG3401 80 LERPKSVKVFWSPH-PDV---SVGKYIIQRQNGDGK--FLRTGLVKNRLFVEFIDSDLGHNEKYMELIIAADFQMGKSFS 153 (343)
T ss_pred hcCcceeeeccccc-CCC---CCCeEEEEEecCchh--hhhhhHHHhccchhheecccccccceeeeEEeecccccceee
Confidence 23355699999952 222 345788876555443 222223445666677778999999999999999887654444
No 38
>KOG4806|consensus
Probab=92.75 E-value=2 Score=33.37 Aligned_cols=119 Identities=17% Similarity=0.235 Sum_probs=66.2
Q ss_pred EecCCCCCEEEEEEEEEeCCCCCCCCCcEEee---ccCcCCCCCCcceEE--EeeCCcEEEEEEeccCCCCCCCCeeEEE
Q psy14887 68 IVGLQPDTKYYVKVMAYNNAGEGPESERYLER---TYRKAPQKPPSAVHI--KGINPTTVHVTWRYVSPSFEEEPIIGYK 142 (238)
Q Consensus 68 ~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~---t~~~~p~~~p~~~~~--~~~~~~~~~l~W~~~~~~~~~~~i~~y~ 142 (238)
.-.+.++..|.+++...|.+..-.. ..+... |..+. +.-|.+-.+ .....+++++.|...+ +. -..|-
T Consensus 288 Qfvv~~~~~~~~rf~~~ndde~~~~-v~V~aSt~~t~~~~-P~LP~dTtVk~v~r~CSsAtIaW~gs~--d~---~~kyC 360 (454)
T KOG4806|consen 288 QFVVLPGERYLMRFEPSNDDEALQK-VMVAASTEATFRDL-PELPQDTTVKNVRRRCSSATIAWNGSP--DE---ELKYC 360 (454)
T ss_pred EEEecCCCceEEEEEecCCchhhhe-eEeeeecccccCcC-CCCCCCceEeeeccccchheeeeccCc--ch---heeEE
Confidence 4457788888888888776543211 111111 22222 356666444 4456778999999522 21 14566
Q ss_pred EEEEEcCCCCCC---------------Cceee-----------eecCCcceEEecCCCCCceEEEEEEEEeCCCCCC
Q psy14887 143 VRVWEVDQDMSR---------------ANDTI-----------IPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGR 193 (238)
Q Consensus 143 v~~~~~~~~~~~---------------~~~~~-----------~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~ 193 (238)
|+.++..+.+.. .++-+ ....+...-+|.||.||..|.+.|.|....|..-
T Consensus 361 Iy~~~~~~~e~~v~~~~N~C~g~~~~~~s~~v~c~y~hs~~~q~~~~~i~teTI~gL~PgssYlldv~a~~~~g~~l 437 (454)
T KOG4806|consen 361 IYVFNLPQRERSVVDFTNYCMGFVPKRVSQYVYCEYMHSRERQQSPDNIETETILGLMPGSSYLLDVTANLSMGKPL 437 (454)
T ss_pred EEEecccchhhhhhhhhccccCccccceeEEEeEEEecChhhhcchhhhhhhhhcccccCceEEEEEEEcccCCccc
Confidence 655443332110 00000 0112223557899999999999999988766433
No 39
>PF09240 IL6Ra-bind: Interleukin-6 receptor alpha chain, binding; InterPro: IPR015321 Members of this entry adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6 []. ; PDB: 1N26_A 1P9M_C 3LB6_C 1PVH_A 3L5H_A 1BQU_A 1I1R_A 3QT2_B 3BPN_C 3BPO_C ....
Probab=92.18 E-value=1.9 Score=27.05 Aligned_cols=83 Identities=16% Similarity=0.166 Sum_probs=51.5
Q ss_pred CcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeec-CCcceEEecCCCC----CceEEEEE
Q psy14887 109 PSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPV-GNTLDADIVDLSP----GKRYHMRV 183 (238)
Q Consensus 109 p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~L~p----~t~Y~~~V 183 (238)
|.++.+...+...+..+|. +..... .-..|.++|+.........-...... +......+..... ...|.|.|
T Consensus 2 ~~nlsC~~~~~~~m~CtW~--~g~~~~-~~t~y~L~~~~~~~~~~~eC~~y~~~~~~~~gC~~~~~~~~~~~~~~~~v~V 78 (99)
T PF09240_consen 2 PQNLSCFIYNLEYMNCTWE--PGKEAP-PDTQYTLYYWYSPLEEEKECPHYSKDSGTRIGCQFPVSEIDSSEFSQYNVCV 78 (99)
T ss_dssp -EEEEEEEETTTEEEEEEE--CCTTCS-TTEEEEEEEEETTSSSEEEESEEEESTSSEEEEEEESCTT-TTTTSEEEEEE
T ss_pred CeeCEEEEECCEEEEEEEC--CCCCCC-CcccEEEEEEcCCCCccccCCCccccCCceeEEEecCCCccccccceEEEEE
Confidence 5688888888999999998 332222 23689999988764332111111111 2355566666555 24799999
Q ss_pred EEEeCCCCCCC
Q psy14887 184 LAFSNGGDGRM 194 (238)
Q Consensus 184 ~a~~~~g~g~~ 194 (238)
.+.+..|.-..
T Consensus 79 ~~ss~~~~i~~ 89 (99)
T PF09240_consen 79 NGSSSAGSIRS 89 (99)
T ss_dssp EEEETTEEEEC
T ss_pred EeccCCCccCC
Confidence 99988775333
No 40
>KOG1225|consensus
Probab=92.09 E-value=0.74 Score=38.12 Aligned_cols=157 Identities=16% Similarity=0.120 Sum_probs=87.9
Q ss_pred CCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEEEE
Q psy14887 5 APQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVMAY 84 (238)
Q Consensus 5 ~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~ 84 (238)
+|..+.....+...+.+.|.... ....+...|.......... ..........+.+..|.|+..|..+++++
T Consensus 368 ~~~~~~~~~cs~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~-~~r~~~~~~~~~~~~~~~g~~~~~~~~~v 438 (525)
T KOG1225|consen 368 DPSLLLITECSPPSLCIAGVGRR--------RVTHCAGTYCPLGESGGDL-QGRVPGDANSVDIQGLEPGDEYNCSVNTV 438 (525)
T ss_pred CchhhcccccCCCceeecccccc--------ccccccccccccccCCCcc-ceeeccceeeeeeeeecCCcceeeehhhh
Confidence 34445555666777888887221 1223443332222221111 23333445568888999999999999987
Q ss_pred eCCCCCCCCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCC
Q psy14887 85 NNAGEGPESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGN 164 (238)
Q Consensus 85 ~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~ 164 (238)
...-.+.... ....+.. ..+..+.+.....+++.+.|. .+. ....++.+.| .....+-.. .......
T Consensus 439 ~~~~~~~~~~-~~~~~~~----~~~g~~~v~~~~~~s~e~~g~--~~s---~~~~~~~~~~-~~~~~~~~~--~~~~~~~ 505 (525)
T KOG1225|consen 439 AANIGSLPKD-KSETTVL----CWNGGLCVDGETESSLEVGGP--CPS---SGTCGWEVRC-GPCGNDGGV--NAEPPPE 505 (525)
T ss_pred hhhhccCCcc-cccceEe----ecCCceeeeeeeeccccccCC--CCC---ccccceEEEe-eecCccccc--ccCCCCC
Confidence 5543332211 1111111 244456777777889999998 332 2235677776 222222111 1222336
Q ss_pred cceEEecCCCCCceEEEEE
Q psy14887 165 TLDADIVDLSPGKRYHMRV 183 (238)
Q Consensus 165 ~~~~~i~~L~p~t~Y~~~V 183 (238)
.+.+...+|.|++.|.+.+
T Consensus 506 ~~~~~~~~l~~c~~~~~~~ 524 (525)
T KOG1225|consen 506 CTSYDRTGLGPCTEYEVSV 524 (525)
T ss_pred CCCCCccCcccccceeccc
Confidence 6778889999999998865
No 41
>PLN02533 probable purple acid phosphatase
Probab=91.68 E-value=3.1 Score=33.97 Aligned_cols=84 Identities=20% Similarity=0.225 Sum_probs=46.8
Q ss_pred CCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcC-----ceEEEee-----ccccceEEEecCC
Q psy14887 3 QVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEE-----TAVYYLS-----RTTRNWALIVGLQ 72 (238)
Q Consensus 3 p~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~-----~~~~~~~-----~~~~~~~~~~~L~ 72 (238)
+..|+++++.-...++++|+|.-... ..+. |+|-...+... ....+.. ...-....+.+|+
T Consensus 41 ~~~P~qvhls~~~~~~m~V~W~T~~~-----~~~~----V~yG~~~~~l~~~a~g~~~~~~~~~~~~~g~iH~v~l~~L~ 111 (427)
T PLN02533 41 PTHPDQVHISLVGPDKMRISWITQDS-----IPPS----VVYGTVSGKYEGSANGTSSSYHYLLIYRSGQINDVVIGPLK 111 (427)
T ss_pred CCCCceEEEEEcCCCeEEEEEECCCC-----CCCE----EEEecCCCCCcceEEEEEEEEeccccccCCeEEEEEeCCCC
Confidence 45788999877778899999987532 1122 33321111100 0011110 0111346889999
Q ss_pred CCCEEEEEEEEEeCCCCCCCCCcEEeecc
Q psy14887 73 PDTKYYVKVMAYNNAGEGPESERYLERTY 101 (238)
Q Consensus 73 p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~ 101 (238)
|++.|..+|.. ...+....++|.
T Consensus 112 p~T~Y~Yrvg~------~~~s~~~~F~T~ 134 (427)
T PLN02533 112 PNTVYYYKCGG------PSSTQEFSFRTP 134 (427)
T ss_pred CCCEEEEEECC------CCCccceEEECC
Confidence 99999999842 113444566665
No 42
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=89.21 E-value=4.8 Score=27.46 Aligned_cols=41 Identities=20% Similarity=0.273 Sum_probs=28.5
Q ss_pred CCcceEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeecc
Q psy14887 163 GNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMGE 205 (238)
Q Consensus 163 ~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~~ 205 (238)
.....+.+.||.||+.|.|+-..-+. .+.+|+.++..+|.+
T Consensus 99 trlsaYqVtNL~pGTkY~isY~Vtkg--tstESS~~i~msT~n 139 (184)
T PF07353_consen 99 TRLSAYQVTNLQPGTKYYISYLVTKG--TSTESSNEIPMSTLN 139 (184)
T ss_pred ccceeEEeeccCCCcEEEEEEEEecC--ccceecceecccccc
Confidence 34457889999999999998877654 444544455555544
No 43
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=89.06 E-value=2.8 Score=23.78 Aligned_cols=25 Identities=28% Similarity=0.341 Sum_probs=18.8
Q ss_pred eEEEecCCCCCEEEEEEEEEeCCCCC
Q psy14887 65 WALIVGLQPDTKYYVKVMAYNNAGEG 90 (238)
Q Consensus 65 ~~~~~~L~p~~~Y~~~v~a~~~~g~~ 90 (238)
.+.+.+|.||. |+|.|+|.+..|..
T Consensus 30 ~~~~~~L~~G~-Y~l~V~a~~~~~~~ 54 (66)
T PF07495_consen 30 SISYTNLPPGK-YTLEVRAKDNNGKW 54 (66)
T ss_dssp EEEEES--SEE-EEEEEEEEETTS-B
T ss_pred EEEEEeCCCEE-EEEEEEEECCCCCc
Confidence 68889987765 99999999988764
No 44
>KOG4152|consensus
Probab=88.29 E-value=0.94 Score=37.11 Aligned_cols=35 Identities=26% Similarity=0.449 Sum_probs=29.3
Q ss_pred ecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeecc
Q psy14887 170 IVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMGE 205 (238)
Q Consensus 170 i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~~ 205 (238)
-..|.+|+.|+|+|.+.|..|.|++|. +..+.|..
T Consensus 651 k~~lv~Gq~yrfrV~aIng~G~gp~s~-i~~~kTc~ 685 (830)
T KOG4152|consen 651 KTSLVTGQAYRFRVTAINGKGPGPAST-ILKLKTCA 685 (830)
T ss_pred ccccccccceeeeeeeeeccCCCchhh-heeeeecc
Confidence 357899999999999999999999985 56676653
No 45
>KOG0613|consensus
Probab=87.96 E-value=9.5 Score=35.15 Aligned_cols=178 Identities=16% Similarity=0.099 Sum_probs=84.8
Q ss_pred CCCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEE
Q psy14887 2 PQVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKV 81 (238)
Q Consensus 2 pp~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v 81 (238)
|+.+|.+-....+....+...|..+. .+..-...+|.|+.+..++..+. ..+....... .+.... |.+++
T Consensus 139 pd~~~~~~~sv~~~~~~v~~~~~~~~---~d~~~k~~~~iie~~~~g~s~~~-~~l~~~~~~s--~~~~a~----~e~~v 208 (1205)
T KOG0613|consen 139 PDPPPSNRESVLVHRARVLPAWRPPR---EDGGRKREEYIIERRQAGRSPWL-KRLTQPTDDS--TDTDAH----YESRV 208 (1205)
T ss_pred CCCCCCCccceeecCceecccccCCc---cCccceeecceEEEEeccCCCcc-cccccccCCc--ceeccc----cceeE
Confidence 55566665555667788889998875 44555678899985444443322 2121112221 222222 66666
Q ss_pred EEEeCCCCCCCCCcEEeeccCcCCCCCCcceEEEeeCC---cEEEEEEeccCCCCCCCCeeEEEEEEEE-cCCCCCCCce
Q psy14887 82 MAYNNAGEGPESERYLERTYRKAPQKPPSAVHIKGINP---TTVHVTWRYVSPSFEEEPIIGYKVRVWE-VDQDMSRAND 157 (238)
Q Consensus 82 ~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~---~~~~l~W~~~~~~~~~~~i~~y~v~~~~-~~~~~~~~~~ 157 (238)
+.+...|..+.-..... ... ...+...+.+..... --+.+.|.. +.......+.+|.+.-.. .+..... .
T Consensus 209 ~~v~s~gqp~~~~~~~~-~g~--~i~~~~~~~~s~t~e~s~~~i~~~l~~-~~G~~~~~~~~~l~e~~~~~~~~t~~--i 282 (1205)
T KOG0613|consen 209 RAVVSAGQPEPLERWEI-LGE--LIAPGSAPSMSVTLEASDRKIQLTLRA-PRGPYSAHILGYLIEQRKHKGGNTKT--I 282 (1205)
T ss_pred EEEEecCCCccchhhhh-ccc--ccCCCCccccccccchhheeeeEEeec-CCCchHHHHhcchhhhcccCCCcccc--c
Confidence 66655555432221111 111 112222333333333 345566653 222222345566655433 2211111 0
Q ss_pred eeeecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCC
Q psy14887 158 TIIPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMS 195 (238)
Q Consensus 158 ~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s 195 (238)
.-.........+...=..+.+|.+.+...-..|.+.+-
T Consensus 283 ~~~~~~~~~s~t~~~~~~~~q~~~~v~~~~~i~~~~~~ 320 (1205)
T KOG0613|consen 283 QNDGPEPELSWTVADKRQGCQDRFKVTEEAPIGKGKPG 320 (1205)
T ss_pred cccCCccccceeeecccCCcccceeeeEEeeccccccC
Confidence 11112233344444445677888888877776654443
No 46
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=86.76 E-value=5.1 Score=35.75 Aligned_cols=88 Identities=13% Similarity=0.139 Sum_probs=49.1
Q ss_pred CCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCC-CCceee---------eecCCcceEEe--cC--
Q psy14887 107 KPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMS-RANDTI---------IPVGNTLDADI--VD-- 172 (238)
Q Consensus 107 ~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~-~~~~~~---------~~~~~~~~~~i--~~-- 172 (238)
....+|++. .....+.|+|..+.-....|....|.|+|...-.+.. ...... ...++...+.+ .+
T Consensus 757 ~rItDL~~~-~~~~~v~LsWTAPG~d~D~G~a~~y~ir~s~~~~~l~~~f~~a~~vn~~~~~P~~ags~e~~~f~~~~~~ 835 (863)
T TIGR00868 757 SKITDLEAG-FQGDNIILTWTAPGDVLDHGRADRYIIRISTSILDLRDDFNDATQVNTTDLIPKEANSKEVFVFKPEGIP 835 (863)
T ss_pred ccceeeEEe-ecCCEEEEEeeCCCccCCCCccceEEEEecCCHHHHHhhhccccccccCCcCCCCCCceeEEEEeCCccc
Confidence 334445543 4555699999965444567889999999954322210 000000 11122223333 33
Q ss_pred CCCCceEEEEEEEEeCCCC-CCCC
Q psy14887 173 LSPGKRYHMRVLAFSNGGD-GRMS 195 (238)
Q Consensus 173 L~p~t~Y~~~V~a~~~~g~-g~~s 195 (238)
-+.++.|.|.|+|++..|. |..|
T Consensus 836 ~~~~~~~~~ai~a~d~~~~~s~~s 859 (863)
T TIGR00868 836 IENGTDLFIAVQAIDKANLTSEVS 859 (863)
T ss_pred ccCCeEEEEEEEEEcccccccccc
Confidence 2257889999999998763 4443
No 47
>KOG1948|consensus
Probab=86.46 E-value=24 Score=31.57 Aligned_cols=117 Identities=17% Similarity=0.157 Sum_probs=56.9
Q ss_pred cceEEEecCCCCCEEEEEEEEEeCCCCCC--CCCcEEeec-cCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCee
Q psy14887 63 RNWALIVGLQPDTKYYVKVMAYNNAGEGP--ESERYLERT-YRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPII 139 (238)
Q Consensus 63 ~~~~~~~~L~p~~~Y~~~v~a~~~~g~~~--~s~~~~~~t-~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~ 139 (238)
+-.|.+.||.|++.|.+++.+.-+...-+ ......+.. .+++. ....+......-..+.++-. ...+..+.
T Consensus 945 nG~yRiRGL~Pdc~Y~V~vk~~~~n~~iers~P~s~tv~vgneDv~--glnf~af~q~kttdit~~V~----~~~ne~l~ 1018 (1165)
T KOG1948|consen 945 NGTYRIRGLLPDCEYQVHVKSYADNSPIERSFPRSFTVSVGNEDVK--GLNFMAFIQAKTTDITVEVG----MDTNEELQ 1018 (1165)
T ss_pred CCcEEEeccCCCceEEEEEeeccCCCcccccCCceEEEEecccccC--CceEEEEeccceEEEEEEEc----cccccccc
Confidence 44699999999999999999874322211 122222222 22322 11112222222223333322 34455555
Q ss_pred EEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEe
Q psy14887 140 GYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFS 187 (238)
Q Consensus 140 ~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~ 187 (238)
.-+|-..++++++.... . ...+....+--.-+.+|..|.+++.+.-
T Consensus 1019 sl~vv~yKs~nddspv~-s-v~~gql~~ffp~l~~dg~~yvV~l~Stl 1064 (1165)
T KOG1948|consen 1019 SLRVVIYKSNNDDSPVA-S-VVAGQLLHFFPNLPRDGVEYVVRLESTL 1064 (1165)
T ss_pred ceEEEEEecCCCCCcce-E-EeccceeeeccccCCCCceEEEEEeccC
Confidence 55555444444433221 1 1222333333344567888888887653
No 48
>PF14292 SusE: SusE outer membrane protein
Probab=80.19 E-value=14 Score=24.17 Aligned_cols=67 Identities=16% Similarity=0.288 Sum_probs=39.3
Q ss_pred eCCcEEEEEEeccCCCCCCC-CeeEEEEEEEEcCCCCCCCceeeeecC-CcceEEec-----------CCCCCceEEEEE
Q psy14887 117 INPTTVHVTWRYVSPSFEEE-PIIGYKVRVWEVDQDMSRANDTIIPVG-NTLDADIV-----------DLSPGKRYHMRV 183 (238)
Q Consensus 117 ~~~~~~~l~W~~~~~~~~~~-~i~~y~v~~~~~~~~~~~~~~~~~~~~-~~~~~~i~-----------~L~p~t~Y~~~V 183 (238)
.....+++.|. .+..... ....|.++....+.+.... ..+.... ..++..+. ++.|+....+.+
T Consensus 42 ~~~~a~tftW~--~~~~~~~~a~v~Y~lq~~~~~~~F~~~-~~~~~~~~~~~s~~~t~~eLN~~l~~~g~~~~~~~~l~~ 118 (122)
T PF14292_consen 42 ASDNAVTFTWT--AADYGGPDAPVTYTLQFDKKGNDFSSP-VEIVTTDNGSTSVSITVKELNSILLKLGLEPGEAGTLYW 118 (122)
T ss_pred cCCceEEEEEE--CCccCCCCCceEEEEEEeccCCCccCc-EEEEeecCcceeEEecHHHHHHHHHHcCCCCCceEEEEE
Confidence 45568999999 4544433 5678999887766543221 1111111 13444433 678888888877
Q ss_pred EEE
Q psy14887 184 LAF 186 (238)
Q Consensus 184 ~a~ 186 (238)
+..
T Consensus 119 rV~ 121 (122)
T PF14292_consen 119 RVK 121 (122)
T ss_pred EEE
Confidence 753
No 49
>KOG0613|consensus
Probab=76.16 E-value=18 Score=33.53 Aligned_cols=126 Identities=18% Similarity=0.178 Sum_probs=74.5
Q ss_pred EEEecCCCCCEEEEEEEEEeCCCCCCC---CCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCC-CeeEE
Q psy14887 66 ALIVGLQPDTKYYVKVMAYNNAGEGPE---SERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEE-PIIGY 141 (238)
Q Consensus 66 ~~~~~L~p~~~Y~~~v~a~~~~g~~~~---s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~-~i~~y 141 (238)
++...-..+..|.+.+......|.+.+ +....+.+....+...+.+..+.....+.+.+.|.- .....+ ...+|
T Consensus 293 ~t~~~~~~~~q~~~~v~~~~~i~~~~~~p~~~~~aaa~~~~~~v~~~~~~~v~a~d~~~v~m~~~~--~~~~~p~~~~~~ 370 (1205)
T KOG0613|consen 293 WTVADKRQGCQDRFKVTEEAPIGKGKPGPESLQAAAARPPEVPVGLVRNLSVTARDNTLVEMPTAL--SGTQKPDEAQGY 370 (1205)
T ss_pred eeeecccCCcccceeeeEEeeccccccCchhhhhhccCCccccccccccceeccccCcceeecccc--cCCcCCchheee
Confidence 444445667888888888776665443 323334444455666677777777778888888873 222222 23333
Q ss_pred EEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCC
Q psy14887 142 KVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMS 195 (238)
Q Consensus 142 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s 195 (238)
... ........+..............+.++.++..|...+.+.|..|.....
T Consensus 371 ~~~--~~~~~~~~~~~~~~~~~ts~~~~~~~~~~~e~~~~~~~a~n~~g~~~~~ 422 (1205)
T KOG0613|consen 371 HGE--EVSSESLGALPCPVGTVTSHTYTIKDAGPGEGYFTRVPAVNKGGTEQTI 422 (1205)
T ss_pred ccc--ccccccccccccccccccCcchhhhhcCcccccccccceeccCCccccc
Confidence 332 2211111111122223444566778999999999999999998865554
No 50
>TIGR00868 hCaCC calcium-activated chloride channel protein 1. distributions. found a row in 1A13.INFO that was not parsed out
Probab=74.32 E-value=26 Score=31.58 Aligned_cols=41 Identities=20% Similarity=0.146 Sum_probs=29.5
Q ss_pred CCCCCcceEEeeeccceEEEEeeecccccccccceeeEEEEEEE
Q psy14887 2 PQVAPQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYW 45 (238)
Q Consensus 2 pp~~P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~ 45 (238)
||..-.+|++. .....+.|+|..|.+ ....|....|.|++.
T Consensus 755 PP~rItDL~~~-~~~~~v~LsWTAPG~--d~D~G~a~~y~ir~s 795 (863)
T TIGR00868 755 PPSKITDLEAG-FQGDNIILTWTAPGD--VLDHGRADRYIIRIS 795 (863)
T ss_pred CCccceeeEEe-ecCCEEEEEeeCCCc--cCCCCccceEEEEec
Confidence 66666777773 456669999999976 334466788999873
No 51
>PF11344 DUF3146: Protein of unknown function (DUF3146); InterPro: IPR021492 This family of proteins with unknown function appear to be restricted to Cyanobacteria.
Probab=74.21 E-value=2.8 Score=24.56 Aligned_cols=16 Identities=31% Similarity=0.538 Sum_probs=13.7
Q ss_pred cCCCCCceEEEEEEEE
Q psy14887 171 VDLSPGKRYHMRVLAF 186 (238)
Q Consensus 171 ~~L~p~t~Y~~~V~a~ 186 (238)
-.|+||..|.|+|+|.
T Consensus 65 Y~LEpGgdY~Ftirak 80 (80)
T PF11344_consen 65 YQLEPGGDYSFTIRAK 80 (80)
T ss_pred eeccCCCceEEEEecC
Confidence 3689999999999873
No 52
>PHA02579 7 baseplate wedge subunit; Provisional
Probab=71.63 E-value=25 Score=30.92 Aligned_cols=80 Identities=15% Similarity=0.181 Sum_probs=46.6
Q ss_pred cceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCc-----eEEEee--ccccceEEEecCCCCCEEEE
Q psy14887 7 QGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEET-----AVYYLS--RTTRNWALIVGLQPDTKYYV 79 (238)
Q Consensus 7 ~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~-----~~~~~~--~~~~~~~~~~~L~p~~~Y~~ 79 (238)
..++....++..+.|+|+.. |.-..|.|++.+....... ...|.. -+..+..--+.+.|.+.|.+
T Consensus 9 tslrI~kLsaN~v~l~WddV--------G~NFyY~Ve~a~t~~~~g~~ip~~~~~w~nlg~T~~~~wFed~~~p~t~Yk~ 80 (1030)
T PHA02579 9 TSLRIDKLSANQVYLTWDDV--------GANFYYFVELAETRDADGELIPDDELRWINLGYTANNEWFEDKLQPNTYYKF 80 (1030)
T ss_pred cEEEhhhhccceEEEEeecc--------CCceEEEEEEEeeccCCCccCCCccccceecCcccchhhhhhccCCcceEEE
Confidence 34566667889999999876 3445788887543333221 111111 11122222344999999999
Q ss_pred EEEEEeCCCC-CCCCC
Q psy14887 80 KVMAYNNAGE-GPESE 94 (238)
Q Consensus 80 ~v~a~~~~g~-~~~s~ 94 (238)
+|+.....-. +.|..
T Consensus 81 Rv~~~~qGFe~SdWv~ 96 (1030)
T PHA02579 81 RVAVAAQGFEQSDWVY 96 (1030)
T ss_pred EEEeeccCCCccccEE
Confidence 9998754433 44433
No 53
>KOG1948|consensus
Probab=70.64 E-value=44 Score=30.04 Aligned_cols=26 Identities=19% Similarity=0.333 Sum_probs=21.8
Q ss_pred CCcceEEecCCCCCceEEEEEEEEeC
Q psy14887 163 GNTLDADIVDLSPGKRYHMRVLAFSN 188 (238)
Q Consensus 163 ~~~~~~~i~~L~p~t~Y~~~V~a~~~ 188 (238)
..+-.+.|.+|.|++.|.++|.++-.
T Consensus 943 denG~yRiRGL~Pdc~Y~V~vk~~~~ 968 (1165)
T KOG1948|consen 943 DENGTYRIRGLLPDCEYQVHVKSYAD 968 (1165)
T ss_pred ccCCcEEEeccCCCceEEEEEeeccC
Confidence 34457889999999999999999843
No 54
>PF14292 SusE: SusE outer membrane protein
Probab=64.86 E-value=36 Score=22.19 Aligned_cols=37 Identities=11% Similarity=0.232 Sum_probs=22.4
Q ss_pred eEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeC
Q psy14887 9 VFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKED 48 (238)
Q Consensus 9 ~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~ 48 (238)
+.+.......++++|+.+.. ........|.+++...+
T Consensus 37 i~L~~~~~~a~tftW~~~~~---~~~~a~v~Y~lq~~~~~ 73 (122)
T PF14292_consen 37 IVLDEASDNAVTFTWTAADY---GGPDAPVTYTLQFDKKG 73 (122)
T ss_pred EEecccCCceEEEEEECCcc---CCCCCceEEEEEEeccC
Confidence 33433345689999998753 11213457999875544
No 55
>PF10342 GPI-anchored: Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family; InterPro: IPR018466 This entry represents glycoproteins involved in cell wall (1-->6)-beta-glucan assembly. In yeast a null mutation leads to severe growth defects, aberrant multi-budded morphology, and mating defects [, ]. The entry includes DRMIP and Hesp-379, which are involved in both fruiting body formation and in host attack respectively. Hesp-379 is a haustorially expressed secreted protein; the haustorium being the small sucker that penetrates host tissue [].
Probab=62.95 E-value=32 Score=20.87 Aligned_cols=61 Identities=15% Similarity=0.220 Sum_probs=35.1
Q ss_pred CcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeec--C--CcceEEe-cCCCCCceEEEEEE
Q psy14887 119 PTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPV--G--NTLDADI-VDLSPGKRYHMRVL 184 (238)
Q Consensus 119 ~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~--~--~~~~~~i-~~L~p~t~Y~~~V~ 184 (238)
...+.|+|+.. . .....|.|..+............+... . ....+.+ .+|.++..|.|++.
T Consensus 12 g~~~~I~W~~~--~---~~~~~~~I~L~~g~~~~~~~~~~ia~~v~~~~gs~~~~~p~~l~~~~~Y~i~~~ 77 (93)
T PF10342_consen 12 GQPITITWTSD--G---TDPGNVTIYLCNGNNTNLNFVQTIASNVSNSDGSYTWTIPSDLPSGGDYFIQIV 77 (93)
T ss_pred CCcEEEEEeCC--C---CCCcEEEEEEEcCCCCCcceeEEEEecccCCCCEEEEEcCCCCCCCCcEEEEEE
Confidence 35799999932 1 122567887776665322111122211 1 2334455 56999999999988
No 56
>KOG4806|consensus
Probab=61.36 E-value=18 Score=28.49 Aligned_cols=33 Identities=27% Similarity=0.379 Sum_probs=27.9
Q ss_pred eeeeecCCcceEEecCCCCCceEEEEEEEEeCC
Q psy14887 157 DTIIPVGNTLDADIVDLSPGKRYHMRVLAFSNG 189 (238)
Q Consensus 157 ~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~ 189 (238)
..+.-.++.+.+++..|+|.++|.|.|...+.+
T Consensus 161 ~~iiCig~~n~~~v~~L~pdt~Y~~dvFvv~~~ 193 (454)
T KOG4806|consen 161 VHIICIGNKNIFTVSDLKPDTQYYFDVFVVNIN 193 (454)
T ss_pred eeEEEcCCccEEEhhhcCCCceEEEEEEEEecC
Confidence 345556777899999999999999999999874
No 57
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=55.81 E-value=72 Score=28.63 Aligned_cols=45 Identities=18% Similarity=0.245 Sum_probs=32.4
Q ss_pred eecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeecc
Q psy14887 160 IPVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMGE 205 (238)
Q Consensus 160 ~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~~ 205 (238)
........+.+.+|-+ .+|.++|+|.|..|..........|+...
T Consensus 654 a~rt~~~~~~~~gi~~-GqY~i~VrAiN~~g~~~~~a~s~~f~i~g 698 (952)
T COG4733 654 APRTSAAGFDVEGIPA-GQYAIRVRAINVFEPNSPDATAYEFALNG 698 (952)
T ss_pred cccccccceeecCcCc-cceEEEEEEeeccCCCCCCcceeEEEecC
Confidence 3345667889999987 69999999999987544432246666654
No 58
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=55.42 E-value=24 Score=20.91 Aligned_cols=27 Identities=26% Similarity=0.284 Sum_probs=22.4
Q ss_pred cCCcceEEecCCCCCceEEEEEEEEeC
Q psy14887 162 VGNTLDADIVDLSPGKRYHMRVLAFSN 188 (238)
Q Consensus 162 ~~~~~~~~i~~L~p~t~Y~~~V~a~~~ 188 (238)
.+..+.|.-.+|++|..|.++|.+.-.
T Consensus 25 ~G~~R~F~T~~L~~G~~y~Y~v~a~~~ 51 (75)
T TIGR03000 25 TGTVRTFTTPPLEAGKEYEYTVTAEYD 51 (75)
T ss_pred CccEEEEECCCCCCCCEEEEEEEEEEe
Confidence 355678888999999999999998654
No 59
>PHA02579 7 baseplate wedge subunit; Provisional
Probab=53.90 E-value=1.1e+02 Score=27.29 Aligned_cols=79 Identities=19% Similarity=0.185 Sum_probs=48.0
Q ss_pred CCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCC------CCcee-eeecCCcceEEecCCCCCceEE
Q psy14887 108 PPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMS------RANDT-IIPVGNTLDADIVDLSPGKRYH 180 (238)
Q Consensus 108 ~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~------~~~~~-~~~~~~~~~~~i~~L~p~t~Y~ 180 (238)
+...+++...+...+.|.|..+ |.---|.|++.+.....- ...+. .-.......+.-. ++|.+-|.
T Consensus 7 ~vtslrI~kLsaN~v~l~WddV------G~NFyY~Ve~a~t~~~~g~~ip~~~~~w~nlg~T~~~~wFed~-~~p~t~Yk 79 (1030)
T PHA02579 7 SVTSLRIDKLSANQVYLTWDDV------GANFYYFVELAETRDADGELIPDDELRWINLGYTANNEWFEDK-LQPNTYYK 79 (1030)
T ss_pred cccEEEhhhhccceEEEEeecc------CCceEEEEEEEeeccCCCccCCCccccceecCcccchhhhhhc-cCCcceEE
Confidence 3445677777888999999943 333568888865433211 01111 1222344455444 99999999
Q ss_pred EEEEEEeCCCCCC
Q psy14887 181 MRVLAFSNGGDGR 193 (238)
Q Consensus 181 ~~V~a~~~~g~g~ 193 (238)
++|+..-.+-+..
T Consensus 80 ~Rv~~~~qGFe~S 92 (1030)
T PHA02579 80 FRVAVAAQGFEQS 92 (1030)
T ss_pred EEEEeeccCCCcc
Confidence 9999776544433
No 60
>KOG3834|consensus
Probab=53.02 E-value=40 Score=27.45 Aligned_cols=70 Identities=24% Similarity=0.302 Sum_probs=43.0
Q ss_pred eEEeeec--cceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEE
Q psy14887 9 VFSRGFN--STAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKV 81 (238)
Q Consensus 9 ~~~~~~~--~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v 81 (238)
|+....+ .-.+++.|-.+.. +-.+.++++.|+|......++...........+-..+-||.|++.|.+-+
T Consensus 66 Vkltv~n~kt~~~R~v~I~ps~---~wggqllGvsvrFcsf~~A~~~vwHvl~V~p~SPaalAgl~~~~DYivG~ 137 (462)
T KOG3834|consen 66 VKLTVYNSKTQEVRIVEIVPSN---NWGGQLLGVSVRFCSFDGAVESVWHVLSVEPNSPAALAGLRPYTDYIVGI 137 (462)
T ss_pred eEEEEEecccceeEEEEecccc---cccccccceEEEeccCccchhheeeeeecCCCCHHHhcccccccceEecc
Confidence 4444433 4446777877753 33344889999986554554443333333333346677999999998776
No 61
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=48.41 E-value=24 Score=29.19 Aligned_cols=34 Identities=32% Similarity=0.600 Sum_probs=17.2
Q ss_pred eEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeec
Q psy14887 167 DADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMG 204 (238)
Q Consensus 167 ~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~ 204 (238)
.+.+.+|+|++.|.+++..-.. +..+ +.-.|+|.
T Consensus 65 ~v~v~gL~p~t~Y~Y~~~~~~~---~~~s-~~g~~rT~ 98 (453)
T PF09423_consen 65 KVDVTGLQPGTRYYYRFVVDGG---GQTS-PVGRFRTA 98 (453)
T ss_dssp EEEE-S--TT-EEEEEEEE--T---TEE----EEEE--
T ss_pred ecccCCCCCCceEEEEEEEecC---CCCC-CceEEEcC
Confidence 5788999999999999998211 2222 34566655
No 62
>KOG3834|consensus
Probab=48.37 E-value=56 Score=26.67 Aligned_cols=67 Identities=15% Similarity=0.163 Sum_probs=38.7
Q ss_pred EeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEE
Q psy14887 115 KGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRV 183 (238)
Q Consensus 115 ~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V 183 (238)
.......+...|-. +.+...|-+.||.|+|+..+..... .+.+..-..+.-..+.+|.+++.|.+-+
T Consensus 71 ~n~kt~~~R~v~I~-ps~~wggqllGvsvrFcsf~~A~~~-vwHvl~V~p~SPaalAgl~~~~DYivG~ 137 (462)
T KOG3834|consen 71 YNSKTQEVRIVEIV-PSNNWGGQLLGVSVRFCSFDGAVES-VWHVLSVEPNSPAALAGLRPYTDYIVGI 137 (462)
T ss_pred EecccceeEEEEec-ccccccccccceEEEeccCccchhh-eeeeeecCCCCHHHhcccccccceEecc
Confidence 33344456667763 3345556689999999876332211 1122222233355677888888887755
No 63
>PF13205 Big_5: Bacterial Ig-like domain
Probab=48.27 E-value=30 Score=21.64 Aligned_cols=15 Identities=27% Similarity=0.558 Sum_probs=12.9
Q ss_pred cCCCCCceEEEEEEE
Q psy14887 171 VDLSPGKRYHMRVLA 185 (238)
Q Consensus 171 ~~L~p~t~Y~~~V~a 185 (238)
..|.+++.|++.|..
T Consensus 70 ~~L~~~t~Y~v~i~~ 84 (107)
T PF13205_consen 70 QPLKPGTTYTVTIDS 84 (107)
T ss_pred CcCCCCCEEEEEECC
Confidence 579999999999954
No 64
>PF04775 Bile_Hydr_Trans: Acyl-CoA thioester hydrolase/BAAT N-terminal region; InterPro: IPR006862 This entry presents the N-termini of acyl-CoA thioester hydrolase and bile acid-CoA:amino acid N-acetyltransferase (BAAT) []. This region is not thought to contain the active site of either enzyme. Thioesterase isoforms have been identified in peroxisomes, cytoplasm and mitochondria, where they are thought to have distinct functions in lipid metabolism []. For example, in peroxisomes, the hydrolase acts on bile-CoA esters [].; GO: 0016290 palmitoyl-CoA hydrolase activity, 0006629 lipid metabolic process; PDB: 3HLK_B 3K2I_B.
Probab=48.01 E-value=35 Score=22.56 Aligned_cols=24 Identities=25% Similarity=0.509 Sum_probs=16.7
Q ss_pred eEEecCCCCCceEEEEEEEEeCCC
Q psy14887 167 DADIVDLSPGKRYHMRVLAFSNGG 190 (238)
Q Consensus 167 ~~~i~~L~p~t~Y~~~V~a~~~~g 190 (238)
...+.+|.|+..|+++.......|
T Consensus 6 ~I~v~GL~p~~~vtl~a~~~~~~g 29 (126)
T PF04775_consen 6 DIRVSGLPPGQEVTLRARLTDDNG 29 (126)
T ss_dssp EEEEES--TT-EEEEEEEEE-TTS
T ss_pred EEEEeCCCCCCEEEEEEEEEeCCC
Confidence 557899999999999999887644
No 65
>PF07353 Uroplakin_II: Uroplakin II; InterPro: IPR009952 This family contains uroplakin II, which is approximately 180 residues long and seems to be restricted to mammals. Uroplakin II is an integral membrane protein, and is one of the components of the apical plaques of mammalian urothelium formed by the asymmetric unit membrane - this is believed to play a role in strengthening the urothelial apical surface to prevent the cells from rupturing during bladder distension [].; GO: 0016044 cellular membrane organization, 0030176 integral to endoplasmic reticulum membrane
Probab=45.49 E-value=46 Score=22.98 Aligned_cols=24 Identities=29% Similarity=0.484 Sum_probs=20.1
Q ss_pred cceEEEecCCCCCEEEEEEEEEeC
Q psy14887 63 RNWALIVGLQPDTKYYVKVMAYNN 86 (238)
Q Consensus 63 ~~~~~~~~L~p~~~Y~~~v~a~~~ 86 (238)
...|.+.+|.|++.|.++-...++
T Consensus 101 lsaYqVtNL~pGTkY~isY~Vtkg 124 (184)
T PF07353_consen 101 LSAYQVTNLQPGTKYYISYLVTKG 124 (184)
T ss_pred ceeEEeeccCCCcEEEEEEEEecC
Confidence 346899999999999999887655
No 66
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=44.94 E-value=59 Score=26.91 Aligned_cols=35 Identities=31% Similarity=0.359 Sum_probs=17.0
Q ss_pred ceEEEecCCCCCEEEEEEEEEeCCCCCCCCCcEEeecc
Q psy14887 64 NWALIVGLQPDTKYYVKVMAYNNAGEGPESERYLERTY 101 (238)
Q Consensus 64 ~~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~ 101 (238)
....+++|+|++.|..++..... +..+..-.++|.
T Consensus 64 ~~v~v~gL~p~t~Y~Y~~~~~~~---~~~s~~g~~rT~ 98 (453)
T PF09423_consen 64 VKVDVTGLQPGTRYYYRFVVDGG---GQTSPVGRFRTA 98 (453)
T ss_dssp EEEEE-S--TT-EEEEEEEE--T---TEE---EEEE--
T ss_pred eecccCCCCCCceEEEEEEEecC---CCCCCceEEEcC
Confidence 45788999999999999998322 223333455555
No 67
>KOG3515|consensus
Probab=43.37 E-value=53 Score=29.03 Aligned_cols=75 Identities=13% Similarity=0.221 Sum_probs=53.0
Q ss_pred ecCCCCCEEEEEEEEEeCCCCCCCCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEc
Q psy14887 69 VGLQPDTKYYVKVMAYNNAGEGPESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEV 148 (238)
Q Consensus 69 ~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~ 148 (238)
-+.-++..|.+.+.+.|..|.++-+.....-+. +..|.++.+......++.+.|. +....|-...|.+.|...
T Consensus 659 ~~~~~~~~y~~~c~t~n~lg~~~v~~~~~~~t~----~~~~~n~~~~~~~~~~i~l~~~---p~fdgg~~q~f~~~~~~~ 731 (741)
T KOG3515|consen 659 LNHIDGSDYENGCTTQNLLGSDHVSGAIHSGTA----SVGPINLTYDNLTYSTISLEWM---PGFDGGLQQRFFLKYYDL 731 (741)
T ss_pred cCCcchhhhcceeeecccCCCccccceecCCcC----CCCccceEeeeeeeeeeceeee---eccccccccceeeehhhc
Confidence 345567788877777787787766654444443 2577799999889999999998 344456667788887665
Q ss_pred CC
Q psy14887 149 DQ 150 (238)
Q Consensus 149 ~~ 150 (238)
+.
T Consensus 732 ~~ 733 (741)
T KOG3515|consen 732 GT 733 (741)
T ss_pred CC
Confidence 43
No 68
>PF10333 Pga1: GPI-Mannosyltransferase II co-activator; InterPro: IPR019433 Pga1 is found only in yeasts and not in mammals. It localises in the ER as a glycosylated integral membrane protein. It binds to the GPI-mannosyltransferase II subunit of the GPI and it is responsible for the second mannose addition to GPI precursors. The GPI-anchoring complex is a glycolipid that functions as a membrane anchor for many cell-surface proteins [].
Probab=42.89 E-value=36 Score=24.19 Aligned_cols=22 Identities=27% Similarity=0.694 Sum_probs=18.4
Q ss_pred CCcceEEecCCCCCceEEEEEE
Q psy14887 163 GNTLDADIVDLSPGKRYHMRVL 184 (238)
Q Consensus 163 ~~~~~~~i~~L~p~t~Y~~~V~ 184 (238)
+......+.+|++|..|++++.
T Consensus 63 ~~t~~V~L~nl~~~e~y~vKiC 84 (180)
T PF10333_consen 63 GSTTYVELNNLQPGETYQVKIC 84 (180)
T ss_pred CceEEEEeccCCCCCeEEEEEE
Confidence 4556778899999999999986
No 69
>KOG4228|consensus
Probab=42.14 E-value=47 Score=30.60 Aligned_cols=109 Identities=16% Similarity=0.054 Sum_probs=59.2
Q ss_pred CcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEEEEEe
Q psy14887 6 PQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKVMAYN 85 (238)
Q Consensus 6 P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v~a~~ 85 (238)
|....++....++....|..... ....|.+.++..... .... -......+....+.+...+.+ .+.|
T Consensus 28 ~~~~~vt~~~~~~~~~~~~~~~t-------~~~~y~v~~~~~~s~-~~~~---~~~~~~~~s~~~~~~~~~~~~--~evn 94 (1087)
T KOG4228|consen 28 PSTPNVTGSTATSWVQAELLGST-------WPNYYQVIFKALSSG-YGYI---DVDLTTRLSYPCLSPPSFLEL--VEVN 94 (1087)
T ss_pred CCCcccccccccceeehhhccCC-------ChhhhheeeeEeccC-ccee---eccceeecccCCCCCcchhhh--hhcc
Confidence 34445666677788888876542 344566655333222 1111 111122244445666666666 6667
Q ss_pred CCCCCCCCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEe
Q psy14887 86 NAGEGPESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWR 127 (238)
Q Consensus 86 ~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~ 127 (238)
..+.+..+.+..-++.+..+...+..............+.|.
T Consensus 95 t~~~a~~~~~~~~rt~~~~~~~~~~~~~~~~~~a~~~~~~~~ 136 (1087)
T KOG4228|consen 95 TIGRAPFQCPVAPRTGEQATLSLQRDVQALMLSAATAHVNWR 136 (1087)
T ss_pred cccccccccccCCccccccccccccchhhhhhhhhhhccCcc
Confidence 777776666666666655454445444444445555666666
No 70
>PF13750 Big_3_3: Bacterial Ig-like domain (group 3)
Probab=38.03 E-value=46 Score=23.06 Aligned_cols=31 Identities=23% Similarity=0.412 Sum_probs=24.1
Q ss_pred ecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEe
Q psy14887 170 IVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQ 202 (238)
Q Consensus 170 i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~ 202 (238)
++.|..+..|++.|.|...+|.... ..+.|.
T Consensus 116 fpsle~~~~YtLtV~a~D~aGN~~~--~si~F~ 146 (158)
T PF13750_consen 116 FPSLEADDSYTLTVSATDKAGNQST--KSISFS 146 (158)
T ss_pred cCCcCCCCeEEEEEEEEecCCCEEE--EEEEEE
Confidence 5678899999999999999886443 346665
No 71
>PF13754 Big_3_4: Bacterial Ig-like domain (group 3)
Probab=36.81 E-value=73 Score=17.26 Aligned_cols=27 Identities=22% Similarity=0.294 Sum_probs=20.8
Q ss_pred EEecCCCCCceEEEEEEEEeCCCCCCCC
Q psy14887 168 ADIVDLSPGKRYHMRVLAFSNGGDGRMS 195 (238)
Q Consensus 168 ~~i~~L~p~t~Y~~~V~a~~~~g~g~~s 195 (238)
+.+..+ ....|.+++.+...+|.....
T Consensus 16 ~t~~~~-~dG~y~itv~a~D~AGN~s~~ 42 (54)
T PF13754_consen 16 FTVPAL-ADGTYTITVTATDAAGNTSTS 42 (54)
T ss_pred EeCCCC-CCccEEEEEEEEeCCCCCCCc
Confidence 445555 678999999999999875554
No 72
>PF00907 T-box: T-box; InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis []. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding, all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain [] of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator []. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal []. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner []. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues, for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=36.50 E-value=85 Score=22.26 Aligned_cols=25 Identities=20% Similarity=0.222 Sum_probs=19.5
Q ss_pred ceEEEecCCCCCEEEEEEEEEeCCC
Q psy14887 64 NWALIVGLQPDTKYYVKVMAYNNAG 88 (238)
Q Consensus 64 ~~~~~~~L~p~~~Y~~~v~a~~~~g 88 (238)
-.|.+.||.|...|.+.+.-...+.
T Consensus 33 l~y~vsGL~p~~~Y~i~l~~~~~d~ 57 (184)
T PF00907_consen 33 LEYSVSGLDPDSLYSISLHFERVDN 57 (184)
T ss_dssp EEEEEESS-TTSEEEEEEEEEESCS
T ss_pred cEEEecCCCCCcceEEEEEEEEecC
Confidence 3589999999999999998775443
No 73
>PF02010 REJ: REJ domain; InterPro: IPR002859 The REJ (Receptor for Egg Jelly) domain is found in PKD1 P98161 from SWISSPROT and the sperm receptor for egg jelly Q26627 from SWISSPROT. The exact function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains. Sequence similarity between a region of the autosomal dominant polycystic kidney disease (ADPKD) protein, polycystin-1 and a sea urchin sperm glycoprotein involved in fertilization, the receptor for egg jelly (suREJ) has been known for some time. The suREJ protein binds the glycoprotein coat of the egg (egg jelly), triggering the acrosome reaction, which transforms the sperm into a fusogenic cell. The sequence similarity and expression pattern suggests that the predicted human PKDREJ protein is a mammalian equivalent of the suREJ protein and therefore may have a central role in human fertilization [].; PDB: 2E7M_A 2YRL_A.
Probab=35.91 E-value=12 Score=30.61 Aligned_cols=32 Identities=25% Similarity=0.500 Sum_probs=0.0
Q ss_pred cCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeec
Q psy14887 171 VDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMG 204 (238)
Q Consensus 171 ~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~ 204 (238)
..|++|..|.|++.+.+..+....+ .+.|++.
T Consensus 267 ~~l~~g~~Y~~~l~v~~~~~~~~~a--~~~~~~n 298 (440)
T PF02010_consen 267 GVLEPGSTYTFRLTVTDSSGSSGSA--SISFTVN 298 (440)
T ss_dssp ----------------------------------
T ss_pred ccccccccccccccccccccccccc--ccccccc
Confidence 3689999999999999987764433 3556543
No 74
>PF14686 fn3_3: Polysaccharide lyase family 4, domain II; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=35.43 E-value=43 Score=20.88 Aligned_cols=21 Identities=24% Similarity=0.455 Sum_probs=13.9
Q ss_pred CcceEEecCCCCCceEEEEEEE
Q psy14887 164 NTLDADIVDLSPGKRYHMRVLA 185 (238)
Q Consensus 164 ~~~~~~i~~L~p~t~Y~~~V~a 185 (238)
..-.|+|.+++||+ |.+.+.+
T Consensus 48 ~~G~Fti~~V~pGt-Y~L~ay~ 68 (95)
T PF14686_consen 48 SDGNFTIPNVRPGT-YRLYAYA 68 (95)
T ss_dssp TTSEEE---B-SEE-EEEEEEE
T ss_pred CCCcEEeCCeeCcE-eEEEEEE
Confidence 45689999999998 8888887
No 75
>PF00907 T-box: T-box; InterPro: IPR001699 Transcription factors of the T-box family are required both for early cell-fate decisions, such as those necessary for formation of the basic vertebrate body plan, and for differentiation and organogenesis []. The T-box is defined as the minimal region within the T-box protein that is both necessary and sufficient for sequence-specific DNA binding, all members of the family so far examined bind to the DNA consensus sequence TCACACCT. The T-box is a relatively large DNA-binding domain, generally comprising about a third of the entire protein (17-26 kDa). These genes were uncovered on the basis of similarity to the DNA binding domain [] of Mus musculus (Mouse) Brachyury (T) gene product, which similarity is the defining feature of the family. The Brachyury gene is named for its phenotype, which was identified 70 years ago as a mutant mouse strain with a short blunted tail. The gene, and its paralogues, have become a well-studied model for the family, and hence much of what is known about the T-box family is derived from the murine Brachyury gene. Consistent with its nuclear location, Brachyury protein has a sequence-specific DNA-binding activity and can act as a transcriptional regulator []. Homozygous mutants for the gene undergo extensive developmental anomalies, thus rendering the mutation lethal []. The postulated role of Brachyury is as a transcription factor, regulating the specification and differentiation of posterior mesoderm during gastrulation in a dose-dependent manner []. T-box proteins tend to be expressed in specific organs or cell types, especially during development, and they are generally required for the development of those tissues, for example, Brachyury is expressed in posterior mesoderm and in the developing notochord, and it is required for the formation of these cells in mice []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1H6F_B 4A04_A 1XBR_B 2X6V_B 2X6U_A.
Probab=34.91 E-value=77 Score=22.48 Aligned_cols=25 Identities=20% Similarity=0.283 Sum_probs=19.6
Q ss_pred cceEEecCCCCCceEEEEEEEEeCC
Q psy14887 165 TLDADIVDLSPGKRYHMRVLAFSNG 189 (238)
Q Consensus 165 ~~~~~i~~L~p~t~Y~~~V~a~~~~ 189 (238)
.-.+.+.||.|...|.+.+......
T Consensus 32 ~l~y~vsGL~p~~~Y~i~l~~~~~d 56 (184)
T PF00907_consen 32 TLEYSVSGLDPDSLYSISLHFERVD 56 (184)
T ss_dssp -EEEEEESS-TTSEEEEEEEEEESC
T ss_pred ccEEEecCCCCCcceEEEEEEEEec
Confidence 3478899999999999999877653
No 76
>KOG1378|consensus
Probab=34.10 E-value=2.8e+02 Score=23.17 Aligned_cols=33 Identities=30% Similarity=0.314 Sum_probs=24.6
Q ss_pred eEEEecCCCCCEEEEEEEEEeCCCCCCCCCcEEeeccC
Q psy14887 65 WALIVGLQPDTKYYVKVMAYNNAGEGPESERYLERTYR 102 (238)
Q Consensus 65 ~~~~~~L~p~~~Y~~~v~a~~~~g~~~~s~~~~~~t~~ 102 (238)
..++.+|++++.|..++-+.. .+|+...+++..
T Consensus 110 ~~~~~~L~~~t~YyY~~Gs~~-----~wS~~f~F~t~p 142 (452)
T KOG1378|consen 110 DAVMKNLEPNTRYYYQVGSDL-----KWSEIFSFKTPP 142 (452)
T ss_pred eeeecCCCCCceEEEEeCCCC-----CcccceEeECCC
Confidence 457789999999999886532 267777787764
No 77
>KOG4228|consensus
Probab=34.01 E-value=97 Score=28.75 Aligned_cols=102 Identities=19% Similarity=0.224 Sum_probs=60.2
Q ss_pred EEEEeCCCCCCCCCcEEeeccCcCCCCCCcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeee
Q psy14887 81 VMAYNNAGEGPESERYLERTYRKAPQKPPSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTII 160 (238)
Q Consensus 81 v~a~~~~g~~~~s~~~~~~t~~~~p~~~p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~ 160 (238)
+.+.+..+.++.+....+.+....|..+. .......+++.|.+. ....+..|++.+...+... +...
T Consensus 144 ~~~~~~~~~~~~~t~i~i~~~~~~p~~~~-----~~~~~~~it~~w~~~----~~~~~~~ykl~~~~~d~~~----~~~v 210 (1087)
T KOG4228|consen 144 PLAIASPGLGPPSTYIQITTNANSPIQPG-----EEEEYTTITGSWSPP----HAVSLDTYKLLHLDPDTGY----EISV 210 (1087)
T ss_pred ccccCCcccCCCCceEEEeccCCCCCCCC-----cceEEEEEEecCCCC----CcccchhhhhhhcCCcccc----eeee
Confidence 55555566665555566666666554443 444567899999842 2345677887665552221 1223
Q ss_pred ecCCcceEEecCCCCCceEEEEEEEEeCCCCCCCC
Q psy14887 161 PVGNTLDADIVDLSPGKRYHMRVLAFSNGGDGRMS 195 (238)
Q Consensus 161 ~~~~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s 195 (238)
......+....++.|...|.++..+....+.+...
T Consensus 211 ~~~~~~~~~t~~~~~~~~~~~~~ae~~~~~~~~~~ 245 (1087)
T KOG4228|consen 211 TLTPPGSGGTGDLGPPSISRFKCAEPDRGPLGSST 245 (1087)
T ss_pred eccCCCCCcccCCCCcccccccccCccccccCCCC
Confidence 33444455566777777888877777665554443
No 78
>PF14250 AbrB-like: AbrB-like transcriptional regulator
Probab=31.34 E-value=49 Score=19.23 Aligned_cols=41 Identities=15% Similarity=0.261 Sum_probs=24.9
Q ss_pred CCCCeeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEE
Q psy14887 134 EEEPIIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVL 184 (238)
Q Consensus 134 ~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~ 184 (238)
..|.--.|++.....++... ........+|+||.+|++++-
T Consensus 24 ~~GR~~syr~~Vq~NGnLLI----------G~AYT~~m~L~PGdEFeI~Lg 64 (71)
T PF14250_consen 24 RRGRKASYRVSVQGNGNLLI----------GSAYTKQMGLKPGDEFEIKLG 64 (71)
T ss_pred CCCcCceEEEEEecCCCEEE----------cHHHHHHhCCCCCCEEEEEeC
Confidence 34555668777665544321 111223479999999999863
No 79
>cd05893 Ig_Palladin_C C-terminal immunoglobulin (Ig)-like domain of palladin. Ig_Palladin_C: C-terminal immunoglobulin (Ig)-like domain of palladin. Palladin belongs to the palladin-myotilin-myopalladin family. Proteins belonging to this family contain multiple Ig-like domains and function as scaffolds, modulating actin cytoskeleton. Palladin binds to alpha-actinin ezrin, vasodilator-stimulated phosphoprotein VASP, SPIN90 (DIP, mDia interacting protein), and Src. Palladin also binds F-actin directly, via its Ig3 domain. Palladin is expressed as several alternatively spliced isoforms, having various combinations of Ig-like domains, in a cell-type-specific manner. It has been suggested that palladin's different Ig-like domains may be specialized for distinct functions.
Probab=29.00 E-value=1.2e+02 Score=17.53 Aligned_cols=26 Identities=15% Similarity=-0.125 Sum_probs=21.1
Q ss_pred eEEecCCCCCceEEEEEEEEeCCCCC
Q psy14887 167 DADIVDLSPGKRYHMRVLAFSNGGDG 192 (238)
Q Consensus 167 ~~~i~~L~p~t~Y~~~V~a~~~~g~g 192 (238)
.+.|.++++.-.-.++..|.|..|.-
T Consensus 42 ~L~I~~v~~~D~G~Y~C~A~N~~G~~ 67 (75)
T cd05893 42 CLLIQGATKEDAGWYTVSAKNEAGIV 67 (75)
T ss_pred EEEECCCCHHHCEEEEEEEEcCCCEE
Confidence 46788999987778889999998753
No 80
>PF08329 ChitinaseA_N: Chitinase A, N-terminal domain; InterPro: IPR013540 This domain is found in a number of bacterial chitinases and similar viral proteins. It is organised into a fibronectin III module domain-like fold, comprising only beta strands. Its function is not known, but it may be involved in interaction with the enzyme substrate, chitin [, ]. It is separated by a hinge region from the catalytic domain (IPR001223 from INTERPRO); this hinge region is probably mobile, allowing the N-terminal domain to have different relative positions in solution []. ; GO: 0004568 chitinase activity; PDB: 2WLY_A 1EDQ_A 2WM0_A 1X6N_A 1NH6_A 2WK2_A 1EHN_A 2WLZ_A 1EIB_A 1FFR_A ....
Probab=27.90 E-value=1.9e+02 Score=19.42 Aligned_cols=20 Identities=20% Similarity=0.318 Sum_probs=15.8
Q ss_pred CCCCCEEEEEEEEEeCCCCC
Q psy14887 71 LQPDTKYYVKVMAYNNAGEG 90 (238)
Q Consensus 71 L~p~~~Y~~~v~a~~~~g~~ 90 (238)
...+..|+++|..+|..|++
T Consensus 79 ~~~gG~y~~~VeLCN~~GCS 98 (133)
T PF08329_consen 79 VTKGGRYQMQVELCNADGCS 98 (133)
T ss_dssp E-S-EEEEEEEEEEETTEEE
T ss_pred ecCCCEEEEEEEEECCCCcc
Confidence 46788899999999999965
No 81
>COG3979 Uncharacterized protein contain chitin-binding domain type 3 [General function prediction only]
Probab=26.17 E-value=1.6e+02 Score=20.95 Aligned_cols=84 Identities=14% Similarity=0.121 Sum_probs=48.2
Q ss_pred CcceEEEeeCCcEEEEEEeccCCCCCCCCeeEEEEEEEEcCCCCCCCceeeeecCCcceEEecCCCCCceEEEEEEEEeC
Q psy14887 109 PSAVHIKGINPTTVHVTWRYVSPSFEEEPIIGYKVRVWEVDQDMSRANDTIIPVGNTLDADIVDLSPGKRYHMRVLAFSN 188 (238)
Q Consensus 109 p~~~~~~~~~~~~~~l~W~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~i~~L~p~t~Y~~~V~a~~~ 188 (238)
+..++.......++.+.|+.. ..+-.+..|.+.+ ..+ ............+.+|.+.++|.+.+.+.-.
T Consensus 8 ~~~~~~~~~~~~s~~~~~~s~---~~~~~~~~~p~~~-~~d--------~~~~~t~~~~~~~~g~~~et~y~~~~~a~~t 75 (181)
T COG3979 8 PKSLMLTNVTTTSASLAASSS---SDNKASDTYPYVY-AWD--------PKTVYTTGTVVTVEGLAPETEYWTLVEAPDT 75 (181)
T ss_pred ccceeecchhhhhhhhccccc---cccccccCCCccc-ccC--------ccceecccceEEecCcceeeEEEEeeeccCC
Confidence 334566666677788888742 1223344444111 111 1122345567889999999999999999987
Q ss_pred CCCCCCCCCcEEEeec
Q psy14887 189 GGDGRMSSPAITFQMG 204 (238)
Q Consensus 189 ~g~g~~s~~~~~~~t~ 204 (238)
.|.....+......+.
T Consensus 76 ~g~~s~~~~~~~~~~~ 91 (181)
T COG3979 76 SGNWSAWSRLLTVSTS 91 (181)
T ss_pred CCccccceeeeeeccC
Confidence 6653333333433333
No 82
>KOG4680|consensus
Probab=25.82 E-value=1.5e+02 Score=20.08 Aligned_cols=25 Identities=16% Similarity=0.401 Sum_probs=21.5
Q ss_pred eEEecCCCCCceEEEEEEEEeCCCC
Q psy14887 167 DADIVDLSPGKRYHMRVLAFSNGGD 191 (238)
Q Consensus 167 ~~~i~~L~p~t~Y~~~V~a~~~~g~ 191 (238)
+..+.+..|-..|.+.+++++..|.
T Consensus 109 sq~LPg~tPPG~Y~lkm~~~d~~~~ 133 (153)
T KOG4680|consen 109 SQVLPGYTPPGSYVLKMTAYDAKGK 133 (153)
T ss_pred eEeccCcCCCceEEEEEEeecCCCC
Confidence 4467888899999999999999775
No 83
>cd00182 TBOX T-box DNA binding domain of the T-box family of transcriptional regulators. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.
Probab=25.82 E-value=1.3e+02 Score=21.68 Aligned_cols=23 Identities=17% Similarity=0.275 Sum_probs=19.8
Q ss_pred ceEEecCCCCCceEEEEEEEEeC
Q psy14887 166 LDADIVDLSPGKRYHMRVLAFSN 188 (238)
Q Consensus 166 ~~~~i~~L~p~t~Y~~~V~a~~~ 188 (238)
-.+.+.||.|...|.+-+.....
T Consensus 35 l~~~vsGLdp~~~Y~v~l~~~~~ 57 (188)
T cd00182 35 LKVKVSGLDPNALYSVLMDLVPV 57 (188)
T ss_pred eEEEEeCCCcccceEEEEEEEEc
Confidence 37789999999999999987764
No 84
>KOG1225|consensus
Probab=25.23 E-value=80 Score=26.76 Aligned_cols=67 Identities=18% Similarity=0.102 Sum_probs=41.5
Q ss_pred CcceEEeeeccceEEEEeeecccccccccceeeEEEEEEEeeCCCcCceEEEeeccccceEEEecCCCCCEEEEEE
Q psy14887 6 PQGVFSRGFNSTAINVTWNPIELIRENIRGRLIGHRIKYWKEDNPEETAVYYLSRTTRNWALIVGLQPDTKYYVKV 81 (238)
Q Consensus 6 P~~~~~~~~~~~~~~l~W~~~~~~~~~~~~~i~~y~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~p~~~Y~~~v 81 (238)
|..+.+......++.+.|..+.+ ....+.+.| ........ .........+.+...+|.|++.|.+.+
T Consensus 458 ~g~~~v~~~~~~s~e~~g~~~s~-------~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~l~~c~~~~~~~ 524 (525)
T KOG1225|consen 458 NGGLCVDGETESSLEVGGPCPSS-------GTCGWEVRC-GPCGNDGG-VNAEPPPECTSYDRTGLGPCTEYEVSV 524 (525)
T ss_pred CCceeeeeeeeccccccCCCCCc-------cccceEEEe-eecCcccc-cccCCCCCCCCCCccCcccccceeccc
Confidence 45567777888889999998864 234566665 22222211 111112255667888999999998754
No 85
>cd05762 Ig8_MLCK Eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). Ig8_MLCK: the eighth immunoglobulin (Ig)-like domain of human myosin light-chain kinase (MLCK). MLCK is a key regulator of different forms of cell motility involving actin and myosin II. Agonist stimulation of smooth muscle cells increases cytosolic Ca2+, which binds calmodulin. This Ca2+-calmodulin complex in turn binds to and activates MLCK. Activated MLCK leads to the phosphorylation of the 20 kDa myosin regulatory light chain (RLC) of myosin II and the stimulation of actin-activated myosin MgATPase activity. MLCK is widely present in vertebrate tissues; it phosphorylates the 20 kDa RLC of both smooth and nonmuscle myosin II. Phosphorylation leads to the activation of the myosin motor domain and altered structural properties of myosin II. In smooth muscle MLCK it is involved in initiating contraction. In nonmuscle cells, MLCK may participate in cell division and cell motility; it has
Probab=24.83 E-value=1.8e+02 Score=17.94 Aligned_cols=31 Identities=10% Similarity=-0.039 Sum_probs=20.6
Q ss_pred CcceEEecCCCCCceEEEEEEEEeCCCCCCC
Q psy14887 164 NTLDADIVDLSPGKRYHMRVLAFSNGGDGRM 194 (238)
Q Consensus 164 ~~~~~~i~~L~p~t~Y~~~V~a~~~~g~g~~ 194 (238)
....++|.+....-.=.+.+.|.|..|.-..
T Consensus 54 ~~s~L~I~~~~~~D~G~Ytc~a~N~~G~~~~ 84 (98)
T cd05762 54 NSSKLTITEGQQEHCGCYTLEVENKLGSRQA 84 (98)
T ss_pred CeeEEEECCCChhhCEEEEEEEEcCCCceeE
Confidence 3446677777766555666777899886443
No 86
>KOG1378|consensus
Probab=23.69 E-value=4.4e+02 Score=22.08 Aligned_cols=33 Identities=27% Similarity=0.337 Sum_probs=23.4
Q ss_pred eEEecCCCCCceEEEEEEEEeCCCCCCCCCCcEEEeecc
Q psy14887 167 DADIVDLSPGKRYHMRVLAFSNGGDGRMSSPAITFQMGE 205 (238)
Q Consensus 167 ~~~i~~L~p~t~Y~~~V~a~~~~g~g~~s~~~~~~~t~~ 205 (238)
.+.+.+|++++.|..++-.- ..+| .+..|++.+
T Consensus 110 ~~~~~~L~~~t~YyY~~Gs~-----~~wS-~~f~F~t~p 142 (452)
T KOG1378|consen 110 DAVMKNLEPNTRYYYQVGSD-----LKWS-EIFSFKTPP 142 (452)
T ss_pred eeeecCCCCCceEEEEeCCC-----CCcc-cceEeECCC
Confidence 56789999999999988532 2244 467777554
No 87
>COG3269 Predicted RNA-binding protein, contains TRAM domain [General function prediction only]
Probab=23.46 E-value=1.6e+02 Score=17.45 Aligned_cols=25 Identities=20% Similarity=0.442 Sum_probs=21.2
Q ss_pred cCCCCCceEEEEEEEEeCCCCCCCC
Q psy14887 171 VDLSPGKRYHMRVLAFSNGGDGRMS 195 (238)
Q Consensus 171 ~~L~p~t~Y~~~V~a~~~~g~g~~s 195 (238)
..+..|..|.+.|......|+|-..
T Consensus 11 ~PVeeGe~y~V~I~d~g~~GDGiar 35 (73)
T COG3269 11 PPVEEGETYEVEIEDVGDQGDGIAR 35 (73)
T ss_pred CCcccCCEEEEEEEEeccCCCceEE
Confidence 4567799999999999999988764
No 88
>smart00425 TBOX Domain first found in the mice T locus (Brachyury) protein.
Probab=22.31 E-value=1.5e+02 Score=21.29 Aligned_cols=23 Identities=17% Similarity=0.242 Sum_probs=19.7
Q ss_pred ceEEecCCCCCceEEEEEEEEeC
Q psy14887 166 LDADIVDLSPGKRYHMRVLAFSN 188 (238)
Q Consensus 166 ~~~~i~~L~p~t~Y~~~V~a~~~ 188 (238)
-.+.+.||.|...|.+.+.....
T Consensus 34 l~~~vsGLdp~~~Y~v~l~~~~~ 56 (190)
T smart00425 34 LKYKVSGLDPNALYSVLMDLVPV 56 (190)
T ss_pred eEEEEeCCCccCcEEEEEEEEEc
Confidence 37789999999999999987764
No 89
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases. The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=22.18 E-value=2.3e+02 Score=18.23 Aligned_cols=20 Identities=20% Similarity=0.335 Sum_probs=17.2
Q ss_pred CCCCCEEEEEEEEEeCCCCC
Q psy14887 71 LQPDTKYYVKVMAYNNAGEG 90 (238)
Q Consensus 71 L~p~~~Y~~~v~a~~~~g~~ 90 (238)
...+..|.++|..++..|++
T Consensus 76 v~kgG~y~m~V~lCn~dGCS 95 (106)
T cd02848 76 VGKGGRYQMQVALCNGDGCS 95 (106)
T ss_pred eCCCCeEEEEEEEECCCCcc
Confidence 45678899999999999975
No 90
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=22.12 E-value=1.2e+02 Score=16.06 Aligned_cols=12 Identities=25% Similarity=0.343 Sum_probs=7.2
Q ss_pred eEEEEEEEEeCC
Q psy14887 178 RYHMRVLAFSNG 189 (238)
Q Consensus 178 ~Y~~~V~a~~~~ 189 (238)
.|.|.|.+.+..
T Consensus 37 ~y~~~vtatd~~ 48 (49)
T PF05345_consen 37 TYTFTVTATDGS 48 (49)
T ss_pred EEEEEEEEEcCC
Confidence 566666666543
No 91
>PF07867 DUF1654: Protein of unknown function (DUF1654); InterPro: IPR012449 This entry is represented by Bacteriophage F116 (Pseudomonas phage F116), Orf28. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of proteins from the Pseudomonadaceae.
Probab=20.82 E-value=1.6e+02 Score=17.36 Aligned_cols=20 Identities=15% Similarity=0.308 Sum_probs=14.4
Q ss_pred ceEEeeeccceEEEEeeecc
Q psy14887 8 GVFSRGFNSTAINVTWNPIE 27 (238)
Q Consensus 8 ~~~~~~~~~~~~~l~W~~~~ 27 (238)
++.+.....++++|.|..+.
T Consensus 52 gv~v~~~dDGsv~i~W~~~~ 71 (73)
T PF07867_consen 52 GVEVTFNDDGSVRIRWERPK 71 (73)
T ss_pred CeEEEEcCCCeEEEEEEccc
Confidence 45666667778899998753
Done!