Query psy13839
Match_columns 413
No_of_seqs 313 out of 1563
Neff 6.3
Searched_HMMs 46136
Date Fri Aug 16 21:44:19 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13839.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13839hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1692|consensus 100.0 7.3E-40 1.6E-44 296.5 17.0 181 35-413 21-201 (201)
2 KOG1690|consensus 100.0 2.9E-37 6.3E-42 280.6 13.9 188 37-312 21-213 (215)
3 KOG1693|consensus 100.0 3.5E-35 7.7E-40 267.5 18.2 182 37-411 24-205 (209)
4 KOG3287|consensus 100.0 1E-32 2.2E-37 254.0 16.9 202 31-319 30-234 (236)
5 KOG1692|consensus 100.0 8.9E-32 1.9E-36 244.0 12.9 152 2-311 47-198 (201)
6 KOG1691|consensus 100.0 6.5E-30 1.4E-34 235.9 20.4 184 36-413 24-210 (210)
7 KOG1693|consensus 100.0 4.7E-30 1E-34 234.0 10.8 156 1-312 50-205 (209)
8 PF01105 EMP24_GP25L: emp24/gp 99.9 6.1E-28 1.3E-32 218.6 0.9 177 37-309 2-183 (183)
9 KOG1691|consensus 99.8 2.2E-20 4.7E-25 172.8 13.9 74 229-312 135-208 (210)
10 KOG1690|consensus 99.8 1E-19 2.2E-24 166.3 8.2 75 339-413 141-215 (215)
11 PF01105 EMP24_GP25L: emp24/gp 99.4 5.7E-14 1.2E-18 127.0 0.7 71 338-408 113-183 (183)
12 KOG3287|consensus 99.4 2.3E-12 5E-17 119.8 8.6 74 339-412 155-228 (236)
13 PHA02148 hypothetical protein 89.2 4.1 9E-05 33.7 9.0 91 302-400 6-101 (110)
14 PF13897 GOLD_2: Golgi-dynamic 86.9 0.74 1.6E-05 40.7 3.6 31 118-148 104-134 (136)
15 PRK14081 triple tyrosine motif 84.2 16 0.00036 40.5 13.1 122 9-131 125-263 (667)
16 smart00557 IG_FLMN Filamin-typ 82.0 7.6 0.00016 31.6 7.4 45 90-134 30-77 (93)
17 PRK14081 triple tyrosine motif 80.9 31 0.00068 38.4 13.7 138 8-148 221-378 (667)
18 PF00630 Filamin: Filamin/ABP2 79.5 15 0.00033 29.6 8.5 42 92-133 42-90 (101)
19 PF04151 PPC: Bacterial pre-pe 74.2 11 0.00024 28.8 5.9 40 92-131 24-68 (70)
20 PF01835 A2M_N: MG2 domain; I 71.6 28 0.00061 28.2 8.1 44 91-134 35-86 (99)
21 KOG0518|consensus 69.3 58 0.0013 37.9 12.2 125 12-137 791-931 (1113)
22 PF13897 GOLD_2: Golgi-dynamic 60.3 11 0.00024 33.4 3.7 30 37-66 105-134 (136)
23 KOG0518|consensus 60.1 85 0.0018 36.6 11.3 129 3-133 383-526 (1113)
24 PF10779 XhlA: Haemolysin XhlA 58.5 86 0.0019 24.4 8.3 57 341-397 3-68 (71)
25 PF09315 DUF1973: Domain of un 56.1 1.7E+02 0.0037 27.1 13.9 102 3-106 34-157 (179)
26 PF05738 Cna_B: Cna protein B- 55.9 24 0.00053 26.6 4.6 44 94-137 4-49 (70)
27 PF13860 FlgD_ig: FlgD Ig-like 52.4 73 0.0016 25.2 7.0 39 81-120 15-55 (81)
28 PF01835 A2M_N: MG2 domain; I 48.3 88 0.0019 25.2 7.1 47 10-69 36-85 (99)
29 PF11589 DUF3244: Domain of un 48.1 55 0.0012 27.3 5.9 55 79-136 38-96 (106)
30 PF13620 CarboxypepD_reg: Carb 47.4 21 0.00046 27.6 3.1 42 93-134 16-57 (82)
31 PF07495 Y_Y_Y: Y_Y_Y domain; 42.9 1.3E+02 0.0029 22.0 7.7 41 92-134 8-48 (66)
32 PF14524 Wzt_C: Wzt C-terminal 40.6 1.3E+02 0.0028 25.2 7.3 57 91-149 51-107 (142)
33 PF04728 LPP: Lipoprotein leuc 38.7 93 0.002 23.6 5.1 27 339-365 5-31 (56)
34 PF09315 DUF1973: Domain of un 36.5 1.5E+02 0.0032 27.6 7.3 52 93-146 42-97 (179)
35 PRK05842 flgD flagellar basal 35.8 96 0.0021 31.2 6.3 44 90-133 164-220 (295)
36 TIGR03503 conserved hypothetic 34.7 5.5E+02 0.012 26.8 17.2 118 27-145 165-298 (374)
37 PHA02148 hypothetical protein 34.3 95 0.0021 25.9 4.9 54 230-283 39-93 (110)
38 cd01042 DMQH Demethoxyubiquino 33.6 1.4E+02 0.003 27.4 6.5 82 293-380 58-140 (165)
39 PF12690 BsuPI: Intracellular 31.8 1.8E+02 0.0038 23.4 6.2 20 90-109 22-41 (82)
40 smart00557 IG_FLMN Filamin-typ 31.7 1.1E+02 0.0024 24.6 5.1 17 8-24 30-46 (93)
41 PRK12812 flgD flagellar basal 30.8 1.4E+02 0.003 29.5 6.4 44 90-133 139-193 (259)
42 PF10400 Vir_act_alpha_C: Viru 30.4 2.5E+02 0.0055 22.0 7.0 64 335-398 12-76 (90)
43 PF04151 PPC: Bacterial pre-pe 29.1 56 0.0012 24.8 2.7 40 6-45 20-64 (70)
44 PF10805 DUF2730: Protein of u 28.7 1E+02 0.0022 26.1 4.4 13 273-285 7-19 (106)
45 PRK06655 flgD flagellar basal 27.8 1.9E+02 0.004 27.9 6.7 44 90-133 124-178 (225)
46 PF11589 DUF3244: Domain of un 27.6 1.9E+02 0.0042 24.0 6.0 25 10-34 48-72 (106)
47 PRK12634 flgD flagellar basal 27.2 1.7E+02 0.0036 28.1 6.2 44 90-133 120-174 (221)
48 PF04678 DUF607: Protein of un 25.9 3.1E+02 0.0066 25.3 7.5 45 343-387 56-100 (180)
49 PF00630 Filamin: Filamin/ABP2 25.3 2.2E+02 0.0047 22.7 5.8 13 10-22 42-54 (101)
50 PF04728 LPP: Lipoprotein leuc 24.3 1.9E+02 0.0042 21.9 4.7 42 230-271 5-46 (56)
51 PF05738 Cna_B: Cna protein B- 24.2 1.1E+02 0.0024 22.8 3.7 35 11-45 3-39 (70)
52 PRK15396 murein lipoprotein; P 24.0 2.2E+02 0.0047 23.0 5.3 43 339-381 27-69 (78)
53 PF09323 DUF1980: Domain of un 23.9 97 0.0021 28.5 3.8 35 375-409 25-59 (182)
54 PF11368 DUF3169: Protein of u 23.8 6.4E+02 0.014 24.2 9.7 23 340-362 171-193 (248)
55 COG5415 Predicted integral mem 22.8 4.9E+02 0.011 25.1 8.1 35 332-366 10-44 (251)
56 PRK12813 flgD flagellar basal 22.4 2.5E+02 0.0054 27.0 6.4 44 90-134 122-174 (223)
57 PF10528 PA14_2: GLEYA domain; 21.9 70 0.0015 27.4 2.2 24 4-27 79-102 (113)
58 PF10267 Tmemb_cc2: Predicted 21.2 9.3E+02 0.02 25.3 10.7 64 330-399 291-356 (395)
59 PF06156 DUF972: Protein of un 20.5 1.8E+02 0.004 24.7 4.5 40 336-375 7-46 (107)
60 PRK12633 flgD flagellar basal 20.4 2.8E+02 0.006 26.7 6.3 44 90-133 127-181 (230)
No 1
>KOG1692|consensus
Probab=100.00 E-value=7.3e-40 Score=296.54 Aligned_cols=181 Identities=56% Similarity=0.941 Sum_probs=158.7
Q ss_pred eEEEeeecccEEEEeecccCCCcceEEEEEEEEeecCCCCCCCCcceEEEEeeecCceeEEEEEECCCCCeEeeceeecc
Q psy13839 35 YTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEKGLMFEISEGGFLDIDVRITGPDGNIIYQGERESS 114 (413)
Q Consensus 35 ~~f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~~~~~~~~~~~~~~~yqV~~Gg~~dI~v~v~dP~g~~l~~~~~~~~ 114 (413)
|.++.+.++ +.||++.... +++++++|+|.+||.++||+.|++|+|++|+..++.+.
T Consensus 21 ~~is~~ah~--eeCf~e~~~~---------------------gd~~~vsF~v~~gg~~~vd~~I~gP~~~~i~~~~~~ss 77 (201)
T KOG1692|consen 21 YGISLDAHE--EECFFENLEE---------------------GDKLSVSFEVIDGGFLGVDVEITGPDGKIIHKGKRESS 77 (201)
T ss_pred eeEEEccch--hhhHhhhhcc---------------------CCEEEEEEEEecCCccceeEEEECCCCchhhhcccccC
Confidence 334444444 5699998743 56789999999999999999999999999999999999
Q ss_pred ceEEEEeecceeEEEEEeeCCCCCCceEEEEEEEEcccCCCCCccccccccccccccCCCCCCCCccccccCCCccchhc
Q psy13839 115 GKYTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEKGNRYAKNISAKPKGLNPAKDKLYSKPTKFVIR 194 (413)
Q Consensus 115 g~f~fta~~~G~y~~Cf~N~~s~~~~k~V~f~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~ 194 (413)
|+|+|+|+.+|.|++||+|.+|+++||.|+|+++++.....++
T Consensus 78 gk~tF~a~~~G~Y~fCF~N~~s~mtpk~V~F~ihvg~~~~~~d------------------------------------- 120 (201)
T KOG1692|consen 78 GKYTFTAPKKGTYTFCFSNKMSTMTPKTVMFTIHVGHAPQRDD------------------------------------- 120 (201)
T ss_pred ceEEEEecCCceEEEEecCCCCCCCceEEEEEEEEeeccccch-------------------------------------
Confidence 9999999999999999999999999999999999997521100
Q ss_pred ccccccCCCCCCcchhhhhhhccCCCCCCCcCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhh
Q psy13839 195 TADFLLFCPPSFDLGMIKLLFLSTQPSEDDKGEKGEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSRVVW 274 (413)
Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~l~~~l~~L~~~l~~I~~eQ~y~r~RE~~~r~~~Estn~rV~~ 274 (413)
.++|
T Consensus 121 --------------------------~~~d-------------------------------------------------- 124 (201)
T KOG1692|consen 121 --------------------------LAKD-------------------------------------------------- 124 (201)
T ss_pred --------------------------hccc--------------------------------------------------
Confidence 0001
Q ss_pred hhHHhhhheeeccccccCCCCccceeHHHHHHHhhhhcccCCCCCCCCccchhhhhhhhcccCCchhHHHHHHHHHHHHH
Q psy13839 275 WSFFEAVVLVNHAKDKLHSKPTNSSSIKHLKKLLNHAKDKLHSKPTNSSSIKHLEKIVFFFFSGEPNLQDMIRQLGESLT 354 (413)
Q Consensus 275 wSi~qivvli~~~~~~~~~~~~~~~qi~~lk~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~v~~L~~~l~ 354 (413)
+ +.++|+++|++|++.|.
T Consensus 125 ----------------------------------------~----------------------~~~~L~~~I~eL~~al~ 142 (201)
T KOG1692|consen 125 ----------------------------------------A----------------------HQNKLEEMIRELSEALT 142 (201)
T ss_pred ----------------------------------------c----------------------cccHHHHHHHHHHHHHH
Confidence 0 15689999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC
Q psy13839 355 SVKHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVTMTIGQIYYLKRFFEVRRVV 413 (413)
Q Consensus 355 ~I~~eq~y~r~RE~~~r~t~Estn~RV~~wSi~q~~vLi~vs~~Qv~~Lk~fF~~Kkiv 413 (413)
+++.||+|+..||++||.++|+||+||+|||++|.++||++|++|||||||||++|++|
T Consensus 143 ~Vk~EQeY~~~Rer~Hr~~nEntn~RVv~wsife~~vLi~~s~~QVyYLkRfFEvkrvv 201 (201)
T KOG1692|consen 143 SVKHEQEYMEARERIHRNTNENTNSRVVLWSIFEALVLIAMSVLQVYYLKRFFEVKRVV 201 (201)
T ss_pred HhHHHHHHHHHHHHHHHHhhhcccceeehHHHHHHHHHHHHHHHHHHHHHHhheeeecC
Confidence 99999999999999999999999999999999999999999999999999999999975
No 2
>KOG1690|consensus
Probab=100.00 E-value=2.9e-37 Score=280.60 Aligned_cols=188 Identities=24% Similarity=0.388 Sum_probs=157.6
Q ss_pred EEeeecccEEEEeecccCCCcceEEEEEEEEeecCCCCCCCCcceEEEEeeecCceeEEEEEECC--CCCeEeeceeecc
Q psy13839 37 FSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEKGLMFEISEGGFLDIDVRITGP--DGNIIYQGERESS 114 (413)
Q Consensus 37 f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~~~~~~~~~~~~~~~yqV~~Gg~~dI~v~v~dP--~g~~l~~~~~~~~ 114 (413)
+-....|+.++||.+++|+++.+++.|.+.+++++.+. |+ +-.+..|.|.|.+| ++|+|+++...++
T Consensus 21 ~yFy~~~~e~KCF~eelpk~tmv~G~yk~qlyd~~~~~---------y~--~~p~~gm~VeV~e~fdnnh~Vl~q~~ss~ 89 (215)
T KOG1690|consen 21 LYFYIAGTEKKCFIEELPKGTMVTGNYKAQLYDDQLKG---------YG--SYPNIGMHVEVKETFDNNHVVLSQQYSSE 89 (215)
T ss_pred EEEEecCCcccchhhhCCCCcEEEeeeeeeeecchhcc---------cc--cCCCceEEEEeecCCCCceEEEeecCCCC
Confidence 44567788899999999999999999999999886532 55 22344555555555 5569999999999
Q ss_pred ceEEEEeecceeEEEEEeeCCCCCC--ce-EEEEEEEEcccCCCCCccccccccccccccCCCCCCCCccccccCCCccc
Q psy13839 115 GKYTFSAHMSGVYTYCFSNQMSTMT--PK-VVMFNMEVLSTQPSEDDKGEKGNRYAKNISAKPKGLNPAKDKLYSKPTKF 191 (413)
Q Consensus 115 g~f~fta~~~G~y~~Cf~N~~s~~~--~k-~V~f~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~ 191 (413)
|+|+|||+++|+|+||+.+..+.|- .| .|.+++++|+. ++.|++
T Consensus 90 G~ftFta~~~GeH~IC~~s~s~awf~~aklRvhld~qvG~~----------a~l~a~----------------------- 136 (215)
T KOG1690|consen 90 GDFTFTALTPGEHRICIQSNSTAWFNGAKLRVHLDIQVGDH----------ANLDAQ----------------------- 136 (215)
T ss_pred CceEEEccCCCceEEEEecccchhhccceEEEEEEEeeCch----------hhhhhh-----------------------
Confidence 9999999999999999988877662 33 79999999975 222221
Q ss_pred hhcccccccCCCCCCcchhhhhhhccCCCCCCCcCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc
Q psy13839 192 VIRTADFLLFCPPSFDLGMIKLLFLSTQPSEDDKGEKGEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSR 271 (413)
Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~l~~~l~~L~~~l~~I~~eQ~y~r~RE~~~r~~~Estn~r 271 (413)
.+++.+.++.++++|..++.+|++||+|+|.||+++|+++||+|+|
T Consensus 137 ----------------------------------~ke~~k~l~~Rv~~L~~~~~~IrkEQ~~~R~RE~~FR~tSES~NsR 182 (215)
T KOG1690|consen 137 ----------------------------------IKETDKLLEGRVRQLNSRLESIRKEQNLQREREETFRDTSESANSR 182 (215)
T ss_pred ----------------------------------hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcce
Confidence 1223456888999999999999999999999999999999999999
Q ss_pred hhhhhHHhhhheeeccccccCCCCccceeHHHHHHHhhhhc
Q psy13839 272 VVWWSFFEAVVLVNHAKDKLHSKPTNSSSIKHLKKLLNHAK 312 (413)
Q Consensus 272 V~~wSi~qivvli~~~~~~~~~~~~~~~qi~~lk~~f~~~~ 312 (413)
|+|||++|++||++ +|+||+.|||+||.+.|
T Consensus 183 vm~Wsv~Q~vvL~~----------tc~wQmrhL~~FFvkqK 213 (215)
T KOG1690|consen 183 VMWWSVAQLVVLLV----------TCIWQMRHLKSFFVKQK 213 (215)
T ss_pred eeehhHHHHHHHHH----------HHHHHHHHHHHHHHHhh
Confidence 99999999999999 99999999999999887
No 3
>KOG1693|consensus
Probab=100.00 E-value=3.5e-35 Score=267.48 Aligned_cols=182 Identities=31% Similarity=0.627 Sum_probs=158.4
Q ss_pred EEeeecccEEEEeecccCCCcceEEEEEEEEeecCCCCCCCCcceEEEEeeecCceeEEEEEECCCCCeEeeceeeccce
Q psy13839 37 FSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEKGLMFEISEGGFLDIDVRITGPDGNIIYQGERESSGK 116 (413)
Q Consensus 37 f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~~~~~~~~~~~~~~~yqV~~Gg~~dI~v~v~dP~g~~l~~~~~~~~g~ 116 (413)
++...+...+-||+.++++. .+.++++|||+.||++|||++|++|+|++|+.+.++..+.
T Consensus 24 lTfeLp~~aKqC~Y~d~~~~--------------------~~~~~~~fqV~tGG~fDVD~~I~aPdgkvI~~~~kk~~~~ 83 (209)
T KOG1693|consen 24 LTFELPDNAKQCFYEDLKKD--------------------DDTTSFEFQVQTGGHFDVDYDIEAPDGKVIYSEKKKRYDS 83 (209)
T ss_pred EEEEcCCcchhheeeecccC--------------------CceEEEEEEEEeCCceeeEEEEECCCCCEEeecccccccc
Confidence 45555555678999998763 2347889999999999999999999999999999999999
Q ss_pred EEEEeecceeEEEEEeeCCCCCCceEEEEEEEEcccCCCCCccccccccccccccCCCCCCCCccccccCCCccchhccc
Q psy13839 117 YTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEKGNRYAKNISAKPKGLNPAKDKLYSKPTKFVIRTA 196 (413)
Q Consensus 117 f~fta~~~G~y~~Cf~N~~s~~~~k~V~f~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~ 196 (413)
|.|++...|+|++||+|.+|++.+|.|+++++++.+.|.....
T Consensus 84 ~~f~ae~~G~Y~fCFsN~fstf~~Kiv~~~~q~~~~~~~~~~~------------------------------------- 126 (209)
T KOG1693|consen 84 FLFKAEGKGEYTFCFSNEFSTFSHKIVYMDFQVGEEPPLHPAV------------------------------------- 126 (209)
T ss_pred EEEEEecceEEEEEecCccccccceEeeehhhhccccccCccc-------------------------------------
Confidence 9999999999999999999999999999999998763321000
Q ss_pred ccccCCCCCCcchhhhhhhccCCCCCCCcCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhhhh
Q psy13839 197 DFLLFCPPSFDLGMIKLLFLSTQPSEDDKGEKGEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSRVVWWS 276 (413)
Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~l~~~l~~L~~~l~~I~~eQ~y~r~RE~~~r~~~Estn~rV~~wS 276 (413)
.+.+ .
T Consensus 127 ------------------------~~~~-------~-------------------------------------------- 131 (209)
T KOG1693|consen 127 ------------------------SNRD-------T-------------------------------------------- 131 (209)
T ss_pred ------------------------cccc-------h--------------------------------------------
Confidence 0000 0
Q ss_pred HHhhhheeeccccccCCCCccceeHHHHHHHhhhhcccCCCCCCCCccchhhhhhhhcccCCchhHHHHHHHHHHHHHHH
Q psy13839 277 FFEAVVLVNHAKDKLHSKPTNSSSIKHLKKLLNHAKDKLHSKPTNSSSIKHLEKIVFFFFSGEPNLQDMIRQLGESLTSV 356 (413)
Q Consensus 277 i~qivvli~~~~~~~~~~~~~~~qi~~lk~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~v~~L~~~l~~I 356 (413)
.++.+|..+..+.++|+.|
T Consensus 132 -------------------------------------------------------------~~~~mena~~~I~~~L~~I 150 (209)
T KOG1693|consen 132 -------------------------------------------------------------ALTQMENAIVEIHRALNKI 150 (209)
T ss_pred -------------------------------------------------------------HHHHHHHHHHHHHHHHHHH
Confidence 0567899999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Q psy13839 357 KHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVTMTIGQIYYLKRFFEVRR 411 (413)
Q Consensus 357 ~~eq~y~r~RE~~~r~t~Estn~RV~~wSi~q~~vLi~vs~~Qv~~Lk~fF~~Kk 411 (413)
.+.|.|+|.||+|.|.++||||+||+|||++++++++++++.||+.||.||..||
T Consensus 151 ~~~q~y~R~RE~rn~~tv~st~~Rv~~~Sl~e~~~vv~iSi~Qv~ilk~fFt~~r 205 (209)
T KOG1693|consen 151 DDTQTYYRLREARNRSTVESTNSRVTWWSLLEIIAVVVISIAQVFILKFFFTDRR 205 (209)
T ss_pred HHHHHHHHHHHhcCccchhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 9999999999999999999999999999999999999999999999999998876
No 4
>KOG3287|consensus
Probab=100.00 E-value=1e-32 Score=254.04 Aligned_cols=202 Identities=22% Similarity=0.413 Sum_probs=171.5
Q ss_pred ecceeEEEeeecccEEEEeecccCCCcceEEEEEEEEeecCCCCCCCCcceEEEEeeec-CceeEEEEEECCCCCeEeec
Q psy13839 31 SSGKYTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEKGLMFEISEG-GFLDIDVRITGPDGNIIYQG 109 (413)
Q Consensus 31 ~~g~~~f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~~~~~~~~~~~~~~~yqV~~G-g~~dI~v~v~dP~g~~l~~~ 109 (413)
+..++.|++..+.+.+-||+...+. +.+++++|||+.| |+++|++++.+|.|.+|...
T Consensus 30 pa~d~dftv~ipAGk~eCf~Q~v~~---------------------~~tle~eyQVi~G~GDl~i~Ftl~~P~G~~lv~~ 88 (236)
T KOG3287|consen 30 PADDYDFTVMIPAGKTECFYQPVPQ---------------------GATLEVEYQVIDGAGDLDIDFTLLNPAGEVLVSD 88 (236)
T ss_pred cccccceEEEecCCCceeeeeeccC---------------------CeEEEEEEEEEecCCccceeeEEeCCCccEEeec
Confidence 4557778888888889999999754 3457788999999 88999999999999999999
Q ss_pred eeeccceEEEEeecceeEEEEEeeCCCCCCceEEEEEEEEcccCCC--CCccccccccccccccCCCCCCCCccccccCC
Q psy13839 110 ERESSGKYTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPS--EDDKGEKGNRYAKNISAKPKGLNPAKDKLYSK 187 (413)
Q Consensus 110 ~~~~~g~f~fta~~~G~y~~Cf~N~~s~~~~k~V~f~~~v~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~ 187 (413)
.++..|.+++.+.+.|+|++||+|++|+++.|.|+|++.+...+.. .++.|.+ |+
T Consensus 89 q~k~dg~ht~e~~e~GdY~~CfDNsFS~fs~K~Vffeli~~~~g~~~e~~e~w~k---~~-------------------- 145 (236)
T KOG3287|consen 89 QRKVDGVHTVEVTETGDYQVCFDNSFSTFSRKLVFFELILDAHGEFYEGDETWHK---YK-------------------- 145 (236)
T ss_pred ccccCceeEeeccCCcceEEEEcCccccccceEEEEEEEeccccchhccchhHhh---hh--------------------
Confidence 9999999999999999999999999999999999999988765322 1111110 00
Q ss_pred CccchhcccccccCCCCCCcchhhhhhhccCCCCCCCcCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy13839 188 PTKFVIRTADFLLFCPPSFDLGMIKLLFLSTQPSEDDKGEKGEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINES 267 (413)
Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~l~~~l~~L~~~l~~I~~eQ~y~r~RE~~~r~~~Es 267 (413)
.+++....+++++++.+.++..+|+++++.|...|.||+|+|++.||
T Consensus 146 ---------------------------------e~~~~Ld~kl~di~~~i~~i~~nl~k~~~~q~~lRa~EaRDr~L~es 192 (236)
T KOG3287|consen 146 ---------------------------------ERTEQLDVKLDDIEDSIGTIKNNLNKMWQYQALLRAREARDRNLQES 192 (236)
T ss_pred ---------------------------------hhhhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHhc
Confidence 01113456889999999999999999999999999999999999999
Q ss_pred hccchhhhhHHhhhheeeccccccCCCCccceeHHHHHHHhhhhcccCCCCC
Q psy13839 268 TNSRVVWWSFFEAVVLVNHAKDKLHSKPTNSSSIKHLKKLLNHAKDKLHSKP 319 (413)
Q Consensus 268 tn~rV~~wSi~qivvli~~~~~~~~~~~~~~~qi~~lk~~f~~~~~~~~~~~ 319 (413)
+-.||.|||.+|++|||+ ++.+|+++||+||+.+-...+.+|
T Consensus 193 Nf~rVN~WS~vq~~vmi~----------v~~iQVf~lrslFe~~~~~~t~~~ 234 (236)
T KOG3287|consen 193 NFDRVNFWSMVQTLVMIL----------VGIIQVFMLRSLFEVKSKSDTKKR 234 (236)
T ss_pred ccchhhHHHHHHHHHHHH----------HhhhhhhhhHHHhcCCCCcccccc
Confidence 999999999999999999 899999999999997665555544
No 5
>KOG1692|consensus
Probab=99.97 E-value=8.9e-32 Score=243.97 Aligned_cols=152 Identities=55% Similarity=0.897 Sum_probs=135.2
Q ss_pred ceeeecCCceEEEEEECCCCCeEecceeeecceeEEEeeecccEEEEeecccCCCcceEEEEEEEEeecCCCCCCCCcce
Q psy13839 2 FEISEGGFLDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEKG 81 (413)
Q Consensus 2 yqV~~GG~~dIdf~i~~P~g~~i~~~~~~~~g~~~f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~~~~~~~~~~~~~ 81 (413)
|+|++||+++||+.|++|+|++|+...+++.|+|+|+++.+|.|++||+|++|++++|+|.|++++.....
T Consensus 47 F~v~~gg~~~vd~~I~gP~~~~i~~~~~~ssgk~tF~a~~~G~Y~fCF~N~~s~mtpk~V~F~ihvg~~~~--------- 117 (201)
T KOG1692|consen 47 FEVIDGGFLGVDVEITGPDGKIIHKGKRESSGKYTFTAPKKGTYTFCFSNKMSTMTPKTVMFTIHVGHAPQ--------- 117 (201)
T ss_pred EEEecCCccceeEEEECCCCchhhhcccccCceEEEEecCCceEEEEecCCCCCCCceEEEEEEEEeeccc---------
Confidence 89999999999999999999999999999999999999999999999999999999999999998865421
Q ss_pred EEEEeeecCceeEEEEEECCCCCeEeeceeeccceEEEEeecceeEEEEEeeCCCCCCceEEEEEEEEcccCCCCCcccc
Q psy13839 82 LMFEISEGGFLDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGE 161 (413)
Q Consensus 82 ~~yqV~~Gg~~dI~v~v~dP~g~~l~~~~~~~~g~f~fta~~~G~y~~Cf~N~~s~~~~k~V~f~~~v~~~~~~~~~~~~ 161 (413)
.... ..+.
T Consensus 118 ---------------------~~d~------------------------------------------~~d~--------- 125 (201)
T KOG1692|consen 118 ---------------------RDDL------------------------------------------AKDA--------- 125 (201)
T ss_pred ---------------------cchh------------------------------------------cccc---------
Confidence 0000 0000
Q ss_pred ccccccccccCCCCCCCCccccccCCCccchhcccccccCCCCCCcchhhhhhhccCCCCCCCcCCCCcchHHHHHHHHH
Q psy13839 162 KGNRYAKNISAKPKGLNPAKDKLYSKPTKFVIRTADFLLFCPPSFDLGMIKLLFLSTQPSEDDKGEKGEPNLQDMIRQLG 241 (413)
Q Consensus 162 ~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~l~~~l~~L~ 241 (413)
+. ++|+..+++|+
T Consensus 126 --------------~~-----------------------------------------------------~~L~~~I~eL~ 138 (201)
T KOG1692|consen 126 --------------HQ-----------------------------------------------------NKLEEMIRELS 138 (201)
T ss_pred --------------cc-----------------------------------------------------cHHHHHHHHHH
Confidence 00 03788999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccchhhhhHHhhhheeeccccccCCCCccceeHHHHHHHhhhh
Q psy13839 242 ESLTSVKHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVNHAKDKLHSKPTNSSSIKHLKKLLNHA 311 (413)
Q Consensus 242 ~~l~~I~~eQ~y~r~RE~~~r~~~Estn~rV~~wSi~qivvli~~~~~~~~~~~~~~~qi~~lk~~f~~~ 311 (413)
+.|..++.||.|+..||.+||++||+||+||.|||++|.++||+ +++.||++||+||..+
T Consensus 139 ~al~~Vk~EQeY~~~Rer~Hr~~nEntn~RVv~wsife~~vLi~----------~s~~QVyYLkRfFEvk 198 (201)
T KOG1692|consen 139 EALTSVKHEQEYMEARERIHRNTNENTNSRVVLWSIFEALVLIA----------MSVLQVYYLKRFFEVK 198 (201)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHhhhcccceeehHHHHHHHHHHH----------HHHHHHHHHHHhheee
Confidence 99999999999999999999999999999999999999999999 9999999999999864
No 6
>KOG1691|consensus
Probab=99.97 E-value=6.5e-30 Score=235.85 Aligned_cols=184 Identities=28% Similarity=0.472 Sum_probs=151.9
Q ss_pred EEEeeecccEEEEeecccCCCcceEEEEEEEEeecCCCCCCCCcceEEEEeeecCceeEEEEEECCCCCeEeeceeeccc
Q psy13839 36 TFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEKGLMFEISEGGFLDIDVRITGPDGNIIYQGERESSG 115 (413)
Q Consensus 36 ~f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~~~~~~~~~~~~~~~yqV~~Gg~~dI~v~v~dP~g~~l~~~~~~~~g 115 (413)
.|+.+.++ ++|+.+++-....+++.|.+..... ++...+++.|+||.|+.+++.+...+|
T Consensus 24 ~f~v~~~~--~kCi~EeI~~n~lv~g~y~i~~~~~------------------~~~~~~~~~Vts~~G~~~~~~env~~g 83 (210)
T KOG1691|consen 24 RFDVPSKT--TKCISEEIHENVLVVGDYEIINPNG------------------DHSHKLSVKVTSPYGNNLHSKENVTKG 83 (210)
T ss_pred EEEecCCC--CEeehhhhccCeEEEEEEEEecCCC------------------CccceEEEEEEcCCCceeehhhccccc
Confidence 46666666 8999999988777777777643211 122789999999999999999999999
Q ss_pred eEEEEeecceeEEEEEeeCCCCCCc---eEEEEEEEEcccCCCCCccccccccccccccCCCCCCCCccccccCCCccch
Q psy13839 116 KYTFSAHMSGVYTYCFSNQMSTMTP---KVVMFNMEVLSTQPSEDDKGEKGNRYAKNISAKPKGLNPAKDKLYSKPTKFV 192 (413)
Q Consensus 116 ~f~fta~~~G~y~~Cf~N~~s~~~~---k~V~f~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~ 192 (413)
+|+|||.++|.|.+||.|....-.+ ..|.||+..|.+. ++|
T Consensus 84 qFaFta~e~~~y~~Cf~~~~~~~~p~~~~~I~ld~k~Gv~a--------------kdw---------------------- 127 (210)
T KOG1691|consen 84 QFAFTAEESGMYEACFTADVPGHKPETKRSIDLDWKTGVEA--------------KDW---------------------- 127 (210)
T ss_pred eEEEEeccCCcEEEEEecccCCCCCCcceEEEEEeeccccc--------------cch----------------------
Confidence 9999999999999999995433222 4688888777541 111
Q ss_pred hcccccccCCCCCCcchhhhhhhccCCCCCCCcCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccch
Q psy13839 193 IRTADFLLFCPPSFDLGMIKLLFLSTQPSEDDKGEKGEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSRV 272 (413)
Q Consensus 193 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~l~~~l~~L~~~l~~I~~eQ~y~r~RE~~~r~~~Estn~rV 272 (413)
T Consensus 128 -------------------------------------------------------------------------------- 127 (210)
T KOG1691|consen 128 -------------------------------------------------------------------------------- 127 (210)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred hhhhHHhhhheeeccccccCCCCccceeHHHHHHHhhhhcccCCCCCCCCccchhhhhhhhcccCCchhHHHHHHHHHHH
Q psy13839 273 VWWSFFEAVVLVNHAKDKLHSKPTNSSSIKHLKKLLNHAKDKLHSKPTNSSSIKHLEKIVFFFFSGEPNLQDMIRQLGES 352 (413)
Q Consensus 273 ~~wSi~qivvli~~~~~~~~~~~~~~~qi~~lk~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~v~~L~~~ 352 (413)
++|..-| +++|+|.++++|.+.
T Consensus 128 --------------------------------------------------~~IAKke--------klep~E~elrrLed~ 149 (210)
T KOG1691|consen 128 --------------------------------------------------DSIAKKE--------KLEPLEVELRRLEDL 149 (210)
T ss_pred --------------------------------------------------HHHHhhh--------cCcHHHHHHHHHHHH
Confidence 1111112 288999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC
Q psy13839 353 LTSVKHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVTMTIGQIYYLKRFFEVRRVV 413 (413)
Q Consensus 353 l~~I~~eq~y~r~RE~~~r~t~Estn~RV~~wSi~q~~vLi~vs~~Qv~~Lk~fF~~Kkiv 413 (413)
+.+|++|..|+|.||+++|+++|+||+||.|+|++.++|++++++||++|||+||++||+|
T Consensus 150 ~~sI~~e~~YLr~REeemr~~nesTNsrv~~fSi~Sl~v~~~va~~QvlyLK~fF~kKKLI 210 (210)
T KOG1691|consen 150 VESIHEEMYYLREREEEMRNTNESTNSRVAWFSILSLVVLLSVAGWQVLYLKRFFQKKKLI 210 (210)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC
Confidence 9999999999999999999999999999999999999999999999999999999999987
No 7
>KOG1693|consensus
Probab=99.96 E-value=4.7e-30 Score=234.02 Aligned_cols=156 Identities=31% Similarity=0.608 Sum_probs=136.4
Q ss_pred CceeeecCCceEEEEEECCCCCeEecceeeecceeEEEeeecccEEEEeecccCCCcceEEEEEEEEeecCCCCCCCCcc
Q psy13839 1 MFEISEGGFLDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEK 80 (413)
Q Consensus 1 ~yqV~~GG~~dIdf~i~~P~g~~i~~~~~~~~g~~~f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~~~~~~~~~~~~ 80 (413)
+|||+.||++||||.|.+|+|++|+++.+++.++|.|++...|+|++||+|++++.+.|+|++...+.++.+.-
T Consensus 50 ~fqV~tGG~fDVD~~I~aPdgkvI~~~~kk~~~~~~f~ae~~G~Y~fCFsN~fstf~~Kiv~~~~q~~~~~~~~------ 123 (209)
T KOG1693|consen 50 EFQVQTGGHFDVDYDIEAPDGKVIYSEKKKRYDSFLFKAEGKGEYTFCFSNEFSTFSHKIVYMDFQVGEEPPLH------ 123 (209)
T ss_pred EEEEEeCCceeeEEEEECCCCCEEeeccccccccEEEEEecceEEEEEecCccccccceEeeehhhhccccccC------
Confidence 59999999999999999999999999999999999999999999999999999999999999998776543210
Q ss_pred eEEEEeeecCceeEEEEEECCCCCeEeeceeeccceEEEEeecceeEEEEEeeCCCCCCceEEEEEEEEcccCCCCCccc
Q psy13839 81 GLMFEISEGGFLDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKG 160 (413)
Q Consensus 81 ~~~yqV~~Gg~~dI~v~v~dP~g~~l~~~~~~~~g~f~fta~~~G~y~~Cf~N~~s~~~~k~V~f~~~v~~~~~~~~~~~ 160 (413)
.. ++...
T Consensus 124 ------------------------------------------------------------~~------~~~~~------- 130 (209)
T KOG1693|consen 124 ------------------------------------------------------------PA------VSNRD------- 130 (209)
T ss_pred ------------------------------------------------------------cc------ccccc-------
Confidence 00 00110
Q ss_pred cccccccccccCCCCCCCCccccccCCCccchhcccccccCCCCCCcchhhhhhhccCCCCCCCcCCCCcchHHHHHHHH
Q psy13839 161 EKGNRYAKNISAKPKGLNPAKDKLYSKPTKFVIRTADFLLFCPPSFDLGMIKLLFLSTQPSEDDKGEKGEPNLQDMIRQL 240 (413)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~l~~~l~~L 240 (413)
+++ ..++..+..|
T Consensus 131 ------------------~~~-------------------------------------------------~~mena~~~I 143 (209)
T KOG1693|consen 131 ------------------TAL-------------------------------------------------TQMENAIVEI 143 (209)
T ss_pred ------------------hHH-------------------------------------------------HHHHHHHHHH
Confidence 000 1377889999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhhhhHHhhhheeeccccccCCCCccceeHHHHHHHhhhhc
Q psy13839 241 GESLTSVKHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVNHAKDKLHSKPTNSSSIKHLKKLLNHAK 312 (413)
Q Consensus 241 ~~~l~~I~~eQ~y~r~RE~~~r~~~Estn~rV~~wSi~qivvli~~~~~~~~~~~~~~~qi~~lk~~f~~~~ 312 (413)
+++|+.|.+.|+|+|.||++.|.++||||+||+|||+++++++++ +++.|+.-||.||..+.
T Consensus 144 ~~~L~~I~~~q~y~R~RE~rn~~tv~st~~Rv~~~Sl~e~~~vv~----------iSi~Qv~ilk~fFt~~r 205 (209)
T KOG1693|consen 144 HRALNKIDDTQTYYRLREARNRSTVESTNSRVTWWSLLEIIAVVV----------ISIAQVFILKFFFTDRR 205 (209)
T ss_pred HHHHHHHHHHHHHHHHHHhcCccchhcccchhhHHHHHHHHHHHH----------HHHHHHHHHHHHHhccC
Confidence 999999999999999999999999999999999999999999999 99999999999998654
No 8
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=99.93 E-value=6.1e-28 Score=218.62 Aligned_cols=177 Identities=31% Similarity=0.600 Sum_probs=5.5
Q ss_pred EEeeecccEEEEeecccCCCcceEEEEEEEEeecCCCCCCCCcceEEEEeeecC-ceeEEEEEE--CCCCCeEeeceee-
Q psy13839 37 FSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLSTQPSEDDKGEKGLMFEISEGG-FLDIDVRIT--GPDGNIIYQGERE- 112 (413)
Q Consensus 37 f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~~~~~~~~~~~~~~~yqV~~Gg-~~dI~v~v~--dP~g~~l~~~~~~- 112 (413)
+++..+++.+.||..+.+.+ +.+.+.|++.+|+ ..+|++.|+ +|+|++++.....
T Consensus 2 ~~f~l~~g~~~Cf~e~v~~~---------------------~~i~~~y~v~~~~~~~~v~~~i~~~~~~~~~i~~~~~~~ 60 (183)
T PF01105_consen 2 LTFELEPGETECFYEEVPKG---------------------TTIRGSYRVTDGGGAYDVDFTIRDPDPNGEVIYSKSDKE 60 (183)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred EEEEECCCCcEEEEEEcCCC---------------------cEEEEEEEEeeccccceEEEEEEecccCCceeeeecccc
Confidence 45566777899999998653 3456677777777 899999999 6667999998555
Q ss_pred ccceEEEEeecceeEEEEEeeCCCCCCc-eEEEEEEEEcccCCCCCccccccccccccccCCCCCCCCccccccCCCccc
Q psy13839 113 SSGKYTFSAHMSGVYTYCFSNQMSTMTP-KVVMFNMEVLSTQPSEDDKGEKGNRYAKNISAKPKGLNPAKDKLYSKPTKF 191 (413)
Q Consensus 113 ~~g~f~fta~~~G~y~~Cf~N~~s~~~~-k~V~f~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~ 191 (413)
++|+|+|+++++|+|++||+|..+.+++ +.|+|++.++..... +
T Consensus 61 ~~~~f~f~~~~~G~y~iCf~n~~~~~~~~~~v~~~~~~~~~~~~----------~------------------------- 105 (183)
T PF01105_consen 61 SEGSFSFTAKESGEYQICFDNSSSSFSPSKRVSFDIDVGNENKD----------Y------------------------- 105 (183)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred cCCcEEEEeccCCCEEEEEEcCCCCccccEEEEEEEEEeecccc----------h-------------------------
Confidence 5579999999999999999999998777 899999988754110 0
Q ss_pred hhcccccccCCCCCCcchhhhhhhccCCCCCCCcCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc
Q psy13839 192 VIRTADFLLFCPPSFDLGMIKLLFLSTQPSEDDKGEKGEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSR 271 (413)
Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~l~~~l~~L~~~l~~I~~eQ~y~r~RE~~~r~~~Estn~r 271 (413)
...+..+++++++..|.+|...+..|+++|.|++.||++++.++|+++.|
T Consensus 106 ------------------------------~~~~~~~~~~~~~~~l~~l~~~l~~i~~~q~~~~~r~~~~~~~~es~~~~ 155 (183)
T PF01105_consen 106 ------------------------------KNVAKKEHLDPLEESLEKLESNLKEIKDEQKYLREREERHRQLNESTNSR 155 (183)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred ------------------------------hhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhe
Confidence 00112456788999999999999999999999999999999999999999
Q ss_pred hhhhhHHhhhheeeccccccCCCCccceeHHHHHHHhh
Q psy13839 272 VVWWSFFEAVVLVNHAKDKLHSKPTNSSSIKHLKKLLN 309 (413)
Q Consensus 272 V~~wSi~qivvli~~~~~~~~~~~~~~~qi~~lk~~f~ 309 (413)
|+|||++|+++|++ +++||+++||+||+
T Consensus 156 i~~~si~~~~vli~----------~~~~Qv~~lk~~f~ 183 (183)
T PF01105_consen 156 IMWWSIIQIVVLIL----------VSVWQVYYLKKFFK 183 (183)
T ss_dssp --------------------------------HHHHHH
T ss_pred EEhHHHHHHHHHHH----------HHHHHHHHHHHHhC
Confidence 99999999999999 99999999999996
No 9
>KOG1691|consensus
Probab=99.84 E-value=2.2e-20 Score=172.84 Aligned_cols=74 Identities=32% Similarity=0.492 Sum_probs=71.4
Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhhhhHHhhhheeeccccccCCCCccceeHHHHHHHh
Q psy13839 229 GEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVNHAKDKLHSKPTNSSSIKHLKKLL 308 (413)
Q Consensus 229 ~~~~l~~~l~~L~~~l~~I~~eQ~y~r~RE~~~r~~~Estn~rV~~wSi~qivvli~~~~~~~~~~~~~~~qi~~lk~~f 308 (413)
++++++.++++|++.+.+|.++..|+|.||++||++||+||+||.|+|++.++++++ .+.||+.|||+||
T Consensus 135 klep~E~elrrLed~~~sI~~e~~YLr~REeemr~~nesTNsrv~~fSi~Sl~v~~~----------va~~QvlyLK~fF 204 (210)
T KOG1691|consen 135 KLEPLEVELRRLEDLVESIHEEMYYLREREEEMRNTNESTNSRVAWFSILSLVVLLS----------VAGWQVLYLKRFF 204 (210)
T ss_pred cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHH----------HHHHHHHHHHHHH
Confidence 466799999999999999999999999999999999999999999999999999999 8899999999999
Q ss_pred hhhc
Q psy13839 309 NHAK 312 (413)
Q Consensus 309 ~~~~ 312 (413)
+++|
T Consensus 205 ~kKK 208 (210)
T KOG1691|consen 205 QKKK 208 (210)
T ss_pred Hhcc
Confidence 9987
No 10
>KOG1690|consensus
Probab=99.80 E-value=1e-19 Score=166.27 Aligned_cols=75 Identities=39% Similarity=0.613 Sum_probs=72.8
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC
Q psy13839 339 EPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVTMTIGQIYYLKRFFEVRRVV 413 (413)
Q Consensus 339 ~~~le~~v~~L~~~l~~I~~eq~y~r~RE~~~r~t~Estn~RV~~wSi~q~~vLi~vs~~Qv~~Lk~fF~~Kkiv 413 (413)
.+.++.+++.|.+++.+|++||+|+|.||+++|+++||+|+||+|||++|+++|+++|+||+.|||+||.++|+|
T Consensus 141 ~k~l~~Rv~~L~~~~~~IrkEQ~~~R~RE~~FR~tSES~NsRvm~Wsv~Q~vvL~~tc~wQmrhL~~FFvkqKlv 215 (215)
T KOG1690|consen 141 DKLLEGRVRQLNSRLESIRKEQNLQREREETFRDTSESANSRVMWWSVAQLVVLLVTCIWQMRHLKSFFVKQKLV 215 (215)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHhhcC
Confidence 557899999999999999999999999999999999999999999999999999999999999999999999986
No 11
>PF01105 EMP24_GP25L: emp24/gp25L/p24 family/GOLD; InterPro: IPR009038 The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other []. Some proteins known to contain a GOLD domain are listed below: Eukaryotic proteins of the p24 family. Animal Sec14-like proteins. They are involved in secretion. Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3). ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=99.39 E-value=5.7e-14 Score=127.05 Aligned_cols=71 Identities=42% Similarity=0.838 Sum_probs=5.6
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy13839 338 GEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVTMTIGQIYYLKRFFE 408 (413)
Q Consensus 338 ~~~~le~~v~~L~~~l~~I~~eq~y~r~RE~~~r~t~Estn~RV~~wSi~q~~vLi~vs~~Qv~~Lk~fF~ 408 (413)
++++++..|.+|...++.|+++|.|++.|+++|+.++|++++||+||+++++++|+++++||++|||+||+
T Consensus 113 ~~~~~~~~l~~l~~~l~~i~~~q~~~~~r~~~~~~~~es~~~~i~~~si~~~~vli~~~~~Qv~~lk~~f~ 183 (183)
T PF01105_consen 113 HLDPLEESLEKLESNLKEIKDEQKYLREREERHRQLNESTNSRIMWWSIIQIVVLILVSVWQVYYLKKFFK 183 (183)
T ss_dssp -----------------------------------------------------------------HHHHHH
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhheEEhHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 38899999999999999999999999999999999999999999999999999999999999999999996
No 12
>KOG3287|consensus
Probab=99.35 E-value=2.3e-12 Score=119.76 Aligned_cols=74 Identities=26% Similarity=0.500 Sum_probs=71.9
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc
Q psy13839 339 EPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVTMTIGQIYYLKRFFEVRRV 412 (413)
Q Consensus 339 ~~~le~~v~~L~~~l~~I~~eq~y~r~RE~~~r~t~Estn~RV~~wSi~q~~vLi~vs~~Qv~~Lk~fF~~Kki 412 (413)
++++++.+..++.+|+.+...|..+|+||+|.|.+.||+-.||.|||++|+++||+++++||++|||+|+.|+-
T Consensus 155 l~di~~~i~~i~~nl~k~~~~q~~lRa~EaRDr~L~esNf~rVN~WS~vq~~vmi~v~~iQVf~lrslFe~~~~ 228 (236)
T KOG3287|consen 155 LDDIEDSIGTIKNNLNKMWQYQALLRAREARDRNLQESNFDRVNFWSMVQTLVMILVGIIQVFMLRSLFEVKSK 228 (236)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHhcccchhhHHHHHHHHHHHHHhhhhhhhhHHHhcCCCC
Confidence 78999999999999999999999999999999999999999999999999999999999999999999998863
No 13
>PHA02148 hypothetical protein
Probab=89.17 E-value=4.1 Score=33.69 Aligned_cols=91 Identities=15% Similarity=0.150 Sum_probs=74.7
Q ss_pred HHHHHHh--hhhcccCCCCCCCCccc--hhhhhhhhcccCCchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhh
Q psy13839 302 KHLKKLL--NHAKDKLHSKPTNSSSI--KHLEKIVFFFFSGEPNLQDMIRQLGESLTSVK-HEQEYMSVRDRIHRSINES 376 (413)
Q Consensus 302 ~~lk~~f--~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~le~~v~~L~~~l~~I~-~eq~y~r~RE~~~r~t~Es 376 (413)
+.|-++| ++++.+|.-+|.|-+-+ .++|- .+.+|++|-.=+..|.++- +.|+-+|..|..-|...|+
T Consensus 6 ~~~~K~F~~~~~~i~P~v~~~~i~~~k~~A~E~--------FK~iE~RIA~D~~~L~~~A~~R~NEL~~HE~Air~~M~~ 77 (110)
T PHA02148 6 WNLIKMFQMKKTKIEPTVTPLPISDLKVEAFEL--------FKKIEERIAHDQTRLEELAKSRENELRRHEAAIRLAMDS 77 (110)
T ss_pred hHHHHHHHccccccCCCCCCCChhHHHHHHHHH--------HHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3566788 78899999999887776 34555 4688999988888887774 5678899999999999999
Q ss_pred hcchhHHHHHHHHHHHHHHHHHHH
Q psy13839 377 TNSRVVWWSFFEAVVLVTMTIGQI 400 (413)
Q Consensus 377 tn~RV~~wSi~q~~vLi~vs~~Qv 400 (413)
-....--.|+=|.++.=.+-+.|+
T Consensus 78 HS~~M~~LS~~~~LiSDS~K~~~~ 101 (110)
T PHA02148 78 HSIKMDDLSLQEALISDSLKGGKY 101 (110)
T ss_pred HHHHhccchHHHHHHHHHHHHHHH
Confidence 999999999999998888877775
No 14
>PF13897 GOLD_2: Golgi-dynamics membrane-trafficking
Probab=86.86 E-value=0.74 Score=40.75 Aligned_cols=31 Identities=23% Similarity=0.360 Sum_probs=26.8
Q ss_pred EEEeecceeEEEEEeeCCCCCCceEEEEEEE
Q psy13839 118 TFSAHMSGVYTYCFSNQMSTMTPKVVMFNME 148 (413)
Q Consensus 118 ~fta~~~G~y~~Cf~N~~s~~~~k~V~f~~~ 148 (413)
+++.+..|.|.++|+|++|.|..|+|++.+.
T Consensus 104 s~~c~~~GvYvLkFDNSYS~~rsK~l~Y~V~ 134 (136)
T PF13897_consen 104 SHTCPGPGVYVLKFDNSYSWFRSKKLYYRVY 134 (136)
T ss_pred EEECCCCeEEEEEeeCcceeEEeeEEEEEEE
Confidence 4567789999999999999999999988763
No 15
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=84.17 E-value=16 Score=40.50 Aligned_cols=122 Identities=16% Similarity=0.244 Sum_probs=80.0
Q ss_pred CceEEEEEECCCCCeEecceeeecceeEEEeeecccEEE-EeecccCCCc----ceEEEEEEEEeec-------CCCCCC
Q psy13839 9 FLDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTY-CFSNQMSTMT----PKVVMFNMEVLST-------QPSEDD 76 (413)
Q Consensus 9 ~~dIdf~i~~P~g~~i~~~~~~~~g~~~f~~~~~G~y~~-CF~N~~s~~t----~k~V~f~v~v~~~-------~~~~~~ 76 (413)
++--.|+|.+ +|....-.+--+...+++++..+|.|+. |.--++.+.. ...|.|++.-..+ ..+.+.
T Consensus 125 ~~LYKF~I~~-~~~w~~iqDYst~n~lsyt~~~~G~Y~ll~~~Kd~~S~~~fDD~~~v~y~Vk~~~~v~I~~F~~lns~~ 203 (667)
T PRK14081 125 PLMYRYWIKE-DNNWKLIKDYSTENSLSYTANKPGKYELLVECKRIDSTKDFDDFKKVKFKVKEIDKVEITDFKCLNKEL 203 (667)
T ss_pred cEEEEEEEcC-CCcEEEEEecCCcceEEEEecCCCcEEEEEEEecCCCccccCcceEEEEEcccCcceEEEeccccCcce
Confidence 3555677665 5666666677778999999999998886 5544444332 3556666643221 111111
Q ss_pred CCcceEEEEe----eecCceeEEEEEECCCCCeEeeceeeccceEEEEeecceeEEE-EE
Q psy13839 77 KGEKGLMFEI----SEGGFLDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTY-CF 131 (413)
Q Consensus 77 ~~~~~~~yqV----~~Gg~~dI~v~v~dP~g~~l~~~~~~~~g~f~fta~~~G~y~~-Cf 131 (413)
...=.+.|.+ .+|..+--.+.+.+++|.....+.......|++++..+|.|++ |.
T Consensus 204 i~~~eI~f~~~a~~~~g~~~LYKF~~i~~~G~~~~~qdYst~n~~~y~~~~~G~Y~i~~~ 263 (667)
T PRK14081 204 ICDEELVFEVESVYEEDRTILYKFVKIDSDGKQTCIQDYSTKNIVSYKEKKSGDYKLLCL 263 (667)
T ss_pred ecCcEEEEEEEEEeCCCceEEEEEEEECCCCCEEEecCccccceEEEEeCCCccEEEEEE
Confidence 1112333442 2334466677788899998888888999999999999999987 44
No 16
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=81.98 E-value=7.6 Score=31.60 Aligned_cols=45 Identities=24% Similarity=0.479 Sum_probs=31.3
Q ss_pred CceeEEEEEECCCCCeEeec-eeeccceE--EEEeecceeEEEEEeeC
Q psy13839 90 GFLDIDVRITGPDGNIIYQG-ERESSGKY--TFSAHMSGVYTYCFSNQ 134 (413)
Q Consensus 90 g~~dI~v~v~dP~g~~l~~~-~~~~~g~f--~fta~~~G~y~~Cf~N~ 134 (413)
|..++.+.|++|+|+.+.-. .....|.| +|+....|.|.+...-.
T Consensus 30 G~~~~~v~i~~p~g~~~~~~v~d~~dGty~v~y~P~~~G~~~i~V~~~ 77 (93)
T smart00557 30 GGGELEVEVTGPSGKKVPVEVKDNGDGTYTVSYTPTEPGDYTVTVKFG 77 (93)
T ss_pred CCCcEEEEEECCCCCeeEeEEEeCCCCEEEEEEEeCCCEeEEEEEEEC
Confidence 45789999999999655433 33444544 77778888888776543
No 17
>PRK14081 triple tyrosine motif-containing protein; Provisional
Probab=80.93 E-value=31 Score=38.39 Aligned_cols=138 Identities=14% Similarity=0.174 Sum_probs=81.4
Q ss_pred CCceEEEEEECCCCCeEecceeeecceeEEEeeecccEEE-EeecccCCCc----ceEEEEEEEEeec--------CCCC
Q psy13839 8 GFLDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTY-CFSNQMSTMT----PKVVMFNMEVLST--------QPSE 74 (413)
Q Consensus 8 G~~dIdf~i~~P~g~~i~~~~~~~~g~~~f~~~~~G~y~~-CF~N~~s~~t----~k~V~f~v~v~~~--------~~~~ 74 (413)
+.+-=.|.+.+++|....-..--+...|+++...+|.|+. |.--++++.. .....|.|....+ +.+.
T Consensus 221 ~~~LYKF~~i~~~G~~~~~qdYst~n~~~y~~~~~G~Y~i~~~VKD~~S~~eyDD~~~l~y~Vk~~~~vkI~~~~~d~~s 300 (667)
T PRK14081 221 RTILYKFVKIDSDGKQTCIQDYSTKNIVSYKEKKSGDYKLLCLVKDMYSNNEFDDRAVLVYTVKPYKDIKIRNFTTDLSS 300 (667)
T ss_pred ceEEEEEEEECCCCCEEEecCccccceEEEEeCCCccEEEEEEEeccCcccccccceEEEEEEecCCCcEEEEEEEcCCC
Confidence 3566778888999998888888899999999999999884 7777766543 3445555522211 0000
Q ss_pred --CCCCcceEEEEeeecCceeEEEEEECCCCCeEeeceeeccceEEEEeecceeEEEEEeeCCC--C--C-CceEEEEEE
Q psy13839 75 --DDKGEKGLMFEISEGGFLDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTYCFSNQMS--T--M-TPKVVMFNM 147 (413)
Q Consensus 75 --~~~~~~~~~yqV~~Gg~~dI~v~v~dP~g~~l~~~~~~~~g~f~fta~~~G~y~~Cf~N~~s--~--~-~~k~V~f~~ 147 (413)
..+..+.+.=-...|..+.-.+.|. |..+-.........|+|++..+|.|.+=+.=... . + .++.+.|.+
T Consensus 301 ~ql~g~~I~ika~a~GG~~llYrf~I~---G~~~e~~~Y~~~n~~~w~P~~~G~Y~I~v~VKDk~S~~eyD~~~~v~f~i 377 (667)
T PRK14081 301 PQLTDTDIELKAVAEGGKELLYRFIIK---GKESEDSGYIRNNIYTWKPKIAGKYSITLWVKDISSKGEYEDKSSIDYTI 377 (667)
T ss_pred CeecCceEEEEEEecCCCceEEEEEEC---CcEEeeccccccceEEEeeCCCceEEEEEEEEcCcCcccccceEEEEEEE
Confidence 0111111111111222233333332 3334334456677999999999999998754433 2 2 245666655
Q ss_pred E
Q psy13839 148 E 148 (413)
Q Consensus 148 ~ 148 (413)
.
T Consensus 378 ~ 378 (667)
T PRK14081 378 E 378 (667)
T ss_pred c
Confidence 4
No 18
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=79.47 E-value=15 Score=29.61 Aligned_cols=42 Identities=24% Similarity=0.458 Sum_probs=29.7
Q ss_pred eeEEEEEECCCCC----eE-eeceeeccceE--EEEeecceeEEEEEee
Q psy13839 92 LDIDVRITGPDGN----II-YQGERESSGKY--TFSAHMSGVYTYCFSN 133 (413)
Q Consensus 92 ~dI~v~v~dP~g~----~l-~~~~~~~~g~f--~fta~~~G~y~~Cf~N 133 (413)
..+.+.+.+|++. .+ ..-.....|.| +|++...|.|+++..-
T Consensus 42 ~~~~v~i~~p~~~~~~~~~~~~v~~~~~G~y~v~y~p~~~G~y~i~V~~ 90 (101)
T PF00630_consen 42 DEFQVTITSPDGKEEPVPVPVEVIDNGDGTYTVSYTPTEPGKYKISVKI 90 (101)
T ss_dssp SEEEEEEESSSSESS--EEEEEEEEESSSEEEEEEEESSSEEEEEEEEE
T ss_pred ceeEEEEeCCCCCccccccceEEEECCCCEEEEEEEeCccEeEEEEEEE
Confidence 4678899999987 33 23344556654 7888888999888754
No 19
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=74.18 E-value=11 Score=28.78 Aligned_cols=40 Identities=25% Similarity=0.331 Sum_probs=29.3
Q ss_pred eeEEEEEECCCCCeEeeceeec-----cceEEEEeecceeEEEEE
Q psy13839 92 LDIDVRITGPDGNIIYQGERES-----SGKYTFSAHMSGVYTYCF 131 (413)
Q Consensus 92 ~dI~v~v~dP~g~~l~~~~~~~-----~g~f~fta~~~G~y~~Cf 131 (413)
.+.++.+.+++|..+......+ .-..+|++..+|.|.+=.
T Consensus 24 ~d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~~~~GtYyi~V 68 (70)
T PF04151_consen 24 GDADLYLYDSNGNSLASYDDSSQSGGNDESITFTAPAAGTYYIRV 68 (70)
T ss_dssp SSEEEEEEETTSSSCEECCCCTCETTSEEEEEEEESSSEEEEEEE
T ss_pred CCeEEEEEcCCCCchhhheecCCCCCCccEEEEEcCCCEEEEEEE
Confidence 3778999999988777633322 245788999999998743
No 20
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=71.59 E-value=28 Score=28.17 Aligned_cols=44 Identities=20% Similarity=0.309 Sum_probs=30.3
Q ss_pred ceeEEEEEECCCCCeEeecee---eccceEEE--Eeec---ceeEEEEEeeC
Q psy13839 91 FLDIDVRITGPDGNIIYQGER---ESSGKYTF--SAHM---SGVYTYCFSNQ 134 (413)
Q Consensus 91 ~~dI~v~v~dP~g~~l~~~~~---~~~g~f~f--ta~~---~G~y~~Cf~N~ 134 (413)
+..+.+.|.||+|+.+..... ...|.+++ ..+. .|.|.+=....
T Consensus 35 ~~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~~~~G~y~i~~~~~ 86 (99)
T PF01835_consen 35 NSPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDDAPLGTYTIRVKTD 86 (99)
T ss_dssp SEEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS---EEEEEEEEET
T ss_pred CCceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCCCCCEeEEEEEEEc
Confidence 368999999999999988766 24565443 3333 38888887764
No 21
>KOG0518|consensus
Probab=69.30 E-value=58 Score=37.88 Aligned_cols=125 Identities=19% Similarity=0.295 Sum_probs=74.6
Q ss_pred EEEEEECCCCCeEecceeeeccee--EEEeeecccEEEE--eecccCCCcceEEEEEEEE--------eecCCCC-CCCC
Q psy13839 12 IDVRITGPDGNIIYQGERESSGKY--TFSAHMSGVYTYC--FSNQMSTMTPKVVMFNMEV--------LSTQPSE-DDKG 78 (413)
Q Consensus 12 Idf~i~~P~g~~i~~~~~~~~g~~--~f~~~~~G~y~~C--F~N~~s~~t~k~V~f~v~v--------~~~~~~~-~~~~ 78 (413)
+...+.+|+ ++=++-.+...|.+ +|+...+|.|..- |-++.-++....|...-.. ..++++- ..+.
T Consensus 791 Lsi~~~Gps-kvd~~~~d~~dGt~kV~ytPtepG~Y~I~i~Fad~~I~gSPftVkv~~~~~vvesi~~~~~~~~va~~g~ 869 (1113)
T KOG0518|consen 791 LSISVQGPS-KVDLNVEDREDGTCKVSYTPTEPGTYIINIKFADEHIKGSPFTVKVTGESRVVESITRDREAPSVARVGH 869 (1113)
T ss_pred eEEEEeCCc-ccccceeecCCCeEEEEEeCCCCceEEEEEEEcCccCCCCceEEEecCCeeEeeeeeecccccceecccc
Confidence 567788888 34444455555654 5888888888653 5555444444333322211 1111111 1222
Q ss_pred cceEEEEeeecCceeEEEEEECCCCCeEeec-eeeccc--eEEEEeecceeEEEEEeeCCCC
Q psy13839 79 EKGLMFEISEGGFLDIDVRITGPDGNIIYQG-ERESSG--KYTFSAHMSGVYTYCFSNQMST 137 (413)
Q Consensus 79 ~~~~~yqV~~Gg~~dI~v~v~dP~g~~l~~~-~~~~~g--~f~fta~~~G~y~~Cf~N~~s~ 137 (413)
.+++.-...+-+.+|+.+.++||.|+..--. +....| +..|+..+.|.|.+|..+..-.
T Consensus 870 ~~~l~lk~~~~~~~d~ta~vt~PSG~~~~aei~~~~~~~y~vrFtP~e~G~~tl~V~y~~~~ 931 (1113)
T KOG0518|consen 870 TCSLDLKATEASSQDITARVTDPSGRVFEAEIVDLGQGTYQVRFTPKEPGNHTLSVKYKDQH 931 (1113)
T ss_pred eeeeeeecCCCCccceEEEeeCCCCCccccEEEECCCceEEEEecCCCCCceEEEEEecCcc
Confidence 3444444455578999999999999754322 222333 6689999999999999887554
No 22
>PF13897 GOLD_2: Golgi-dynamics membrane-trafficking
Probab=60.35 E-value=11 Score=33.43 Aligned_cols=30 Identities=23% Similarity=0.356 Sum_probs=27.7
Q ss_pred EEeeecccEEEEeecccCCCcceEEEEEEE
Q psy13839 37 FSAHMSGVYTYCFSNQMSTMTPKVVMFNME 66 (413)
Q Consensus 37 f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~ 66 (413)
++.+.+|.|-.+|+|.+|-...|.+.|.|.
T Consensus 105 ~~c~~~GvYvLkFDNSYS~~rsK~l~Y~V~ 134 (136)
T PF13897_consen 105 HTCPGPGVYVLKFDNSYSWFRSKKLYYRVY 134 (136)
T ss_pred EECCCCeEEEEEeeCcceeEEeeEEEEEEE
Confidence 678899999999999999999999999884
No 23
>KOG0518|consensus
Probab=60.13 E-value=85 Score=36.64 Aligned_cols=129 Identities=16% Similarity=0.229 Sum_probs=69.5
Q ss_pred eeeecCCceEEEEEECCCCCeEecc-eeeecc-e--eEEEeeecccEEEEeecccCCCcceEEEEEEEEe-ecCCC---C
Q psy13839 3 EISEGGFLDIDVRITGPDGNIIYQG-ERESSG-K--YTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVL-STQPS---E 74 (413)
Q Consensus 3 qV~~GG~~dIdf~i~~P~g~~i~~~-~~~~~g-~--~~f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~-~~~~~---~ 74 (413)
.+.+.|.-.+...|.||+|+..-.- .+..+| . ++|.....|.|..-++-..-. .--.-|.++.+ +.... .
T Consensus 383 D~~Dageg~levqV~gp~Gk~~~~~V~d~~~~~~h~vsY~pd~~G~y~i~v~~~gd~--i~gSPf~~ra~~dask~~~~~ 460 (1113)
T KOG0518|consen 383 DEGDAGEGLLEVQVVGPEGKEKEVVVRDNGRGGIHIVTYVPDCPGRYLIVVFYGGDP--IPGSPFTARAYPDASKEVLVP 460 (1113)
T ss_pred EccccccceEEEEEECCCCCceeeEEEecCCCceEEEEEcCCCCCceEEEEEECCcc--cCCCceEEEeccCccceEecC
Confidence 3445566669999999999875332 333344 4 445566777775444322211 11122333332 11000 0
Q ss_pred CCCCc--ceEEEEeee--cCceeEEEEEECCCCCeEeec-eeeccc--eEEEEeecceeEEEEEee
Q psy13839 75 DDKGE--KGLMFEISE--GGFLDIDVRITGPDGNIIYQG-ERESSG--KYTFSAHMSGVYTYCFSN 133 (413)
Q Consensus 75 ~~~~~--~~~~yqV~~--Gg~~dI~v~v~dP~g~~l~~~-~~~~~g--~f~fta~~~G~y~~Cf~N 133 (413)
+-... ....+-|.. .+.-.+.+.+..|+|..+-.. ....+| +..||+.+.|+|.+|+.=
T Consensus 461 ~i~~~vg~~~~~~vdak~ag~g~~~~~~~~pdn~n~~~~v~~~~dg~~~iv~ta~~~~~~~i~v~f 526 (1113)
T KOG0518|consen 461 PINAQVGKEANFVVDAKAAGAGPVTCTVVTPDNRNVPADVQKNGDGTFDIVFTAQDPGTYTIDVRF 526 (1113)
T ss_pred CCcccccceeeeEeehhhccCCCceeEEeccccCCCCceeeeccCcceeeEeccCCCCcceEEEEE
Confidence 00011 122222321 122238899999998766543 344566 678999999999999854
No 24
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=58.49 E-value=86 Score=24.39 Aligned_cols=57 Identities=11% Similarity=0.348 Sum_probs=26.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---------hhcchhHHHHHHHHHHHHHHHH
Q psy13839 341 NLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINE---------STNSRVVWWSFFEAVVLVTMTI 397 (413)
Q Consensus 341 ~le~~v~~L~~~l~~I~~eq~y~r~RE~~~r~t~E---------stn~RV~~wSi~q~~vLi~vs~ 397 (413)
.+++++.++...++++.....-+..+.+.+....+ ..|.+-+|+.++.+++..++++
T Consensus 3 ~i~e~l~~ie~~l~~~~~~i~~lE~~~~~~e~~i~~~~~~l~~I~~n~kW~~r~iiGaiI~~i~~~ 68 (71)
T PF10779_consen 3 DIKEKLNRIETKLDNHEERIDKLEKRDAANEKDIKNLNKQLEKIKSNTKWIWRTIIGAIITAIIYL 68 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555444333333333333333322 2334555666666666555443
No 25
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=56.07 E-value=1.7e+02 Score=27.10 Aligned_cols=102 Identities=17% Similarity=0.239 Sum_probs=53.9
Q ss_pred eeeecCCceEEEEEECCCCCeEec-ceeeecceeEEEe---eecccEEEEeecccCCCcceEEEEEEEEeec--------
Q psy13839 3 EISEGGFLDIDVRITGPDGNIIYQ-GERESSGKYTFSA---HMSGVYTYCFSNQMSTMTPKVVMFNMEVLST-------- 70 (413)
Q Consensus 3 qV~~GG~~dIdf~i~~P~g~~i~~-~~~~~~g~~~f~~---~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~~-------- 70 (413)
.|.-.+....++.|++|+|+.... ..........+.+ .+.|...++..|..+.-.. +...+.-.-.
T Consensus 34 ~v~w~~~~~p~i~L~~P~G~~~~~~~~d~~~~~~~i~ipg~ae~G~W~y~i~~~~~~~q~--v~vtVtS~~~~~~~~Pit 111 (179)
T PF09315_consen 34 TVTWQNSSPPSITLTDPSGTVYTTFTTDSNSKTARIQIPGTAEVGTWTYSITNTSSSSQT--VTVTVTSRASSPTVPPIT 111 (179)
T ss_pred EEEECCCCCceEEEECCCCCEEeeeEEcccccEEEEECCCCcccccEEEEEecCCCCcce--EEEEEEeeECCCCCCCEE
Confidence 344344446788999999998866 1112223333333 3466667777555444322 3332221110
Q ss_pred ------CCCCCCCCcceEEEEeeecC----ceeEEEEEECCCCCeE
Q psy13839 71 ------QPSEDDKGEKGLMFEISEGG----FLDIDVRITGPDGNII 106 (413)
Q Consensus 71 ------~~~~~~~~~~~~~yqV~~Gg----~~dI~v~v~dP~g~~l 106 (413)
+.....+.-+.++=+|..|. +.+|...|..|+|...
T Consensus 112 v~~~~~~~~~~~~~p~~IyA~V~qg~~PVlgA~V~A~V~~~~g~~~ 157 (179)
T PF09315_consen 112 VTAWLSQSSVNFPSPVIIYAEVQQGYLPVLGANVTATVESPSGNTV 157 (179)
T ss_pred EEEEECCCCcCCCCcEEEEEEEEECCeeeeCCEEEEEEcCCCCCEE
Confidence 11111223366666666665 3578888888888864
No 26
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=55.89 E-value=24 Score=26.56 Aligned_cols=44 Identities=25% Similarity=0.374 Sum_probs=35.2
Q ss_pred EEEEEECCCCCeEee--ceeeccceEEEEeecceeEEEEEeeCCCC
Q psy13839 94 IDVRITGPDGNIIYQ--GERESSGKYTFSAHMSGVYTYCFSNQMST 137 (413)
Q Consensus 94 I~v~v~dP~g~~l~~--~~~~~~g~f~fta~~~G~y~~Cf~N~~s~ 137 (413)
+.|.+++.++..+.. ..-...|.+.|.-...|.|.+=.......
T Consensus 4 a~f~L~~~~~~~~~~~~~~Td~~G~~~f~~L~~G~Y~l~E~~aP~G 49 (70)
T PF05738_consen 4 ATFELYDEDGNEVIEVTVTTDENGKYTFKNLPPGTYTLKETKAPDG 49 (70)
T ss_dssp EEEEEEETTSEEEEEEEEEGGTTSEEEEEEEESEEEEEEEEETTTT
T ss_pred eEEEEEECCCCEEEEEEEEECCCCEEEEeecCCeEEEEEEEECCCC
Confidence 567788888877775 56678899999999999999988875443
No 27
>PF13860 FlgD_ig: FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=52.39 E-value=73 Score=25.16 Aligned_cols=39 Identities=18% Similarity=0.411 Sum_probs=23.4
Q ss_pred eEEEEeeecCceeEEEEEECCCCCeEeece--eeccceEEEE
Q psy13839 81 GLMFEISEGGFLDIDVRITGPDGNIIYQGE--RESSGKYTFS 120 (413)
Q Consensus 81 ~~~yqV~~Gg~~dI~v~v~dP~g~~l~~~~--~~~~g~f~ft 120 (413)
.+.|.+.. ..-.+.+.|+|.+|++|.+.. ..+.|.+.|+
T Consensus 15 ~~~~~l~~-~a~~v~v~I~d~~G~~V~t~~~~~~~~G~~~~~ 55 (81)
T PF13860_consen 15 SIEYTLPE-DADNVTVTIYDSNGQVVRTISLGSQSAGEHSFT 55 (81)
T ss_dssp EEEEEECS-SCEEEEEEEEETTS-EEEEEEEEECSSEEEEEE
T ss_pred EEEEeCCC-cccEEEEEEEcCCCCEEEEEEcCCcCCceEEEE
Confidence 33343333 345689999999999997643 3344544444
No 28
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=48.26 E-value=88 Score=25.19 Aligned_cols=47 Identities=17% Similarity=0.325 Sum_probs=31.1
Q ss_pred ceEEEEEECCCCCeEeccee---eecceeEEEeeecccEEEEeecccCCCcceEEEEEEEEee
Q psy13839 10 LDIDVRITGPDGNIIYQGER---ESSGKYTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEVLS 69 (413)
Q Consensus 10 ~dIdf~i~~P~g~~i~~~~~---~~~g~~~f~~~~~G~y~~CF~N~~s~~t~k~V~f~v~v~~ 69 (413)
..+.+.|.||+|+.+.+... ...|.++++.+.+.. ...+.|.+.+..
T Consensus 36 ~~~~v~i~dp~g~~v~~~~~~~~~~~G~~~~~~~lp~~-------------~~~G~y~i~~~~ 85 (99)
T PF01835_consen 36 SPVTVTIKDPSGNEVFRWSVNTTNENGIFSGSFQLPDD-------------APLGTYTIRVKT 85 (99)
T ss_dssp EEEEEEEEETTSEEEEEEEEEETTCTTEEEEEEE--SS----------------EEEEEEEEE
T ss_pred CceEEEEECCCCCEEEEEEeeeeCCCCEEEEEEECCCC-------------CCCEeEEEEEEE
Confidence 67999999999999988766 246776666655432 345677776655
No 29
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=48.13 E-value=55 Score=27.30 Aligned_cols=55 Identities=25% Similarity=0.357 Sum_probs=35.4
Q ss_pred cceEEEEeeecCceeEEEEEECCCCCeEeeceeecc--ceEEEEe--ecceeEEEEEeeCCC
Q psy13839 79 EKGLMFEISEGGFLDIDVRITGPDGNIIYQGERESS--GKYTFSA--HMSGVYTYCFSNQMS 136 (413)
Q Consensus 79 ~~~~~yqV~~Gg~~dI~v~v~dP~g~~l~~~~~~~~--g~f~fta--~~~G~y~~Cf~N~~s 136 (413)
.+.+.|.. ..-++.++|+|.+|++|++..-... ...++.. ..+|.|.+=+.+...
T Consensus 38 ~l~I~F~~---~~~~vtI~I~d~~G~vVy~~~~~~~~~~~~~I~L~~~~~G~Y~l~i~~~~g 96 (106)
T PF11589_consen 38 NLSIEFES---PIGDVTITIKDSTGNVVYSETVSNSAGQSITIDLNGLPSGEYTLEITNGNG 96 (106)
T ss_dssp EEEEEESS-----SEEEEEEEETT--EEEEEEESCGGTTEEEEE-TTS-SEEEEEEEEECTC
T ss_pred EEEEEEcC---CCCCEEEEEEeCCCCEEEEEEccCCCCcEEEEEeCCCCCccEEEEEEeCCC
Confidence 35555532 2346899999999999999864433 3466666 457999999988754
No 30
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=47.35 E-value=21 Score=27.56 Aligned_cols=42 Identities=29% Similarity=0.555 Sum_probs=29.4
Q ss_pred eEEEEEECCCCCeEeeceeeccceEEEEeecceeEEEEEeeC
Q psy13839 93 DIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTYCFSNQ 134 (413)
Q Consensus 93 dI~v~v~dP~g~~l~~~~~~~~g~f~fta~~~G~y~~Cf~N~ 134 (413)
+..|.+.+.++.......-.+.|.|.|..-..|.|.+=++-.
T Consensus 16 ~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~g~Y~l~v~~~ 57 (82)
T PF13620_consen 16 GATVTLTDQDGGTVYTTTTDSDGRFSFEGLPPGTYTLRVSAP 57 (82)
T ss_dssp T-EEEET--TTTECCEEE--TTSEEEEEEE-SEEEEEEEEBT
T ss_pred CEEEEEEEeeCCCEEEEEECCCceEEEEccCCEeEEEEEEEC
Confidence 577788887777777777789999999977779999888654
No 31
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=42.93 E-value=1.3e+02 Score=22.04 Aligned_cols=41 Identities=20% Similarity=0.308 Sum_probs=23.5
Q ss_pred eeEEEEEECCCCCeEeeceeeccceEEEEeecceeEEEEEeeC
Q psy13839 92 LDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTYCFSNQ 134 (413)
Q Consensus 92 ~dI~v~v~dP~g~~l~~~~~~~~g~f~fta~~~G~y~~Cf~N~ 134 (413)
....+++.+.+++-+....... .++|+...+|.|++.....
T Consensus 8 ~~Y~Y~l~g~d~~W~~~~~~~~--~~~~~~L~~G~Y~l~V~a~ 48 (66)
T PF07495_consen 8 IRYRYRLEGFDDEWITLGSYSN--SISYTNLPPGKYTLEVRAK 48 (66)
T ss_dssp EEEEEEEETTESSEEEESSTS---EEEEES--SEEEEEEEEEE
T ss_pred eEEEEEEECCCCeEEECCCCcE--EEEEEeCCCEEEEEEEEEE
Confidence 3444445555555444332222 8999999999999887544
No 32
>PF14524 Wzt_C: Wzt C-terminal domain; PDB: 2R5O_B.
Probab=40.64 E-value=1.3e+02 Score=25.25 Aligned_cols=57 Identities=12% Similarity=0.234 Sum_probs=32.6
Q ss_pred ceeEEEEEECCCCCeEeeceeeccceEEEEeecceeEEEEEeeCCCCCCceEEEEEEEE
Q psy13839 91 FLDIDVRITGPDGNIIYQGERESSGKYTFSAHMSGVYTYCFSNQMSTMTPKVVMFNMEV 149 (413)
Q Consensus 91 ~~dI~v~v~dP~g~~l~~~~~~~~g~f~fta~~~G~y~~Cf~N~~s~~~~k~V~f~~~v 149 (413)
...+.+.|.+.+|..|+.. ........+.....|.|.+++.-... +.+-.-++++.+
T Consensus 51 ~~~~~~~i~~~~g~~v~~~-~t~~~~~~~~~~~~g~~~~~~~i~~~-L~~G~Y~i~v~l 107 (142)
T PF14524_consen 51 DPVFGFAIRDSDGQRVFGT-NTYDSGFPIPLSEGGTYEVTFTIPKP-LNPGEYSISVGL 107 (142)
T ss_dssp EEEEEEEEEETT--EEEEE-EHHHHT--EEE-TT-EEEEEEEEE---B-SEEEEEEEEE
T ss_pred ccEEEEEEEcCCCCEEEEE-CccccCccccccCCCEEEEEEEEcCc-cCCCeEEEEEEE
Confidence 4778999999999988873 33333345555557888888877766 666655555555
No 33
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=38.68 E-value=93 Score=23.57 Aligned_cols=27 Identities=7% Similarity=0.182 Sum_probs=20.4
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHH
Q psy13839 339 EPNLQDMIRQLGESLTSVKHEQEYMSV 365 (413)
Q Consensus 339 ~~~le~~v~~L~~~l~~I~~eq~y~r~ 365 (413)
++.|...|..|..++..+..+..-+|.
T Consensus 5 id~Ls~dVq~L~~kvdqLs~dv~~lr~ 31 (56)
T PF04728_consen 5 IDQLSSDVQTLNSKVDQLSSDVNALRA 31 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 667777788888888888777777663
No 34
>PF09315 DUF1973: Domain of unknown function (DUF1973); InterPro: IPR015394 These functionally uncharacterised domains are found in various eukaryotic calcium-dependent chloride channels.
Probab=36.47 E-value=1.5e+02 Score=27.56 Aligned_cols=52 Identities=19% Similarity=0.367 Sum_probs=32.6
Q ss_pred eEEEEEECCCCCeEee-ceeeccc--eEEEE-eecceeEEEEEeeCCCCCCceEEEEE
Q psy13839 93 DIDVRITGPDGNIIYQ-GERESSG--KYTFS-AHMSGVYTYCFSNQMSTMTPKVVMFN 146 (413)
Q Consensus 93 dI~v~v~dP~g~~l~~-~~~~~~g--~f~ft-a~~~G~y~~Cf~N~~s~~~~k~V~f~ 146 (413)
...+.+++|+|+.... ....... .+... +.+.|.+.+++.|..+. .+.+.+.
T Consensus 42 ~p~i~L~~P~G~~~~~~~~d~~~~~~~i~ipg~ae~G~W~y~i~~~~~~--~q~v~vt 97 (179)
T PF09315_consen 42 PPSITLTDPSGTVYTTFTTDSNSKTARIQIPGTAEVGTWTYSITNTSSS--SQTVTVT 97 (179)
T ss_pred CceEEEECCCCCEEeeeEEcccccEEEEECCCCcccccEEEEEecCCCC--cceEEEE
Confidence 4678899999998866 2222223 33332 35689999999776553 3444443
No 35
>PRK05842 flgD flagellar basal body rod modification protein; Reviewed
Probab=35.78 E-value=96 Score=31.20 Aligned_cols=44 Identities=11% Similarity=0.037 Sum_probs=31.5
Q ss_pred CceeEEEEEECCCCCeEeeceee----ccceEEEEe---------ecceeEEEEEee
Q psy13839 90 GFLDIDVRITGPDGNIIYQGERE----SSGKYTFSA---------HMSGVYTYCFSN 133 (413)
Q Consensus 90 g~~dI~v~v~dP~g~~l~~~~~~----~~g~f~fta---------~~~G~y~~Cf~N 133 (413)
....+.+.|+|.+|++|..-.-. ..|.++|+- ..+|.|.|-...
T Consensus 164 ~a~~v~I~I~Da~G~vVrTi~l~~~~~~aG~~~f~WDG~d~~G~~~p~G~Yt~~V~a 220 (295)
T PRK05842 164 SKGVPAIQILNENNELVKTIPLKDYNGQKGYINFEWDGLNEKGEKVPKGNYKIKAEY 220 (295)
T ss_pred cCceEEEEEEcCCCCEEEEEecCcccCCCcceeEEECCCCCCCCcCCCcceEEEEEE
Confidence 34579999999999999875432 357777773 225889888754
No 36
>TIGR03503 conserved hypothetical protein TIGR03503. This set of conserved hypothetical protein has a phylogenetic range that closely matches that of TIGR03501, a putative C-terminal protein targeting signal.
Probab=34.69 E-value=5.5e+02 Score=26.77 Aligned_cols=118 Identities=16% Similarity=0.133 Sum_probs=60.1
Q ss_pred ceeeecceeEEEeee---cccEEEEeecccCCCcceEEEEEEEEee-------cCCCCC-CCCcceEEEE--eeecCcee
Q psy13839 27 GERESSGKYTFSAHM---SGVYTYCFSNQMSTMTPKVVMFNMEVLS-------TQPSED-DKGEKGLMFE--ISEGGFLD 93 (413)
Q Consensus 27 ~~~~~~g~~~f~~~~---~G~y~~CF~N~~s~~t~k~V~f~v~v~~-------~~~~~~-~~~~~~~~yq--V~~Gg~~d 93 (413)
+.+...|.|+-..+. .|.|..-+.-..+........ .+.++- .+...+ ....+.++-. +..-+++.
T Consensus 165 DE~p~DGvFT~~l~l~~~~G~Y~~~v~~~n~vF~R~~~q-~v~V~p~Pi~~~~~~~~~~~~~h~l~v~~d~~~i~~~s~~ 243 (374)
T TIGR03503 165 DERPGDGIFTGEFNLDVAPGEYRPTYQSRNPVFLREVEQ-PVLVYPLPVSYTVIQSEDESGAHQLMVDADAGHIDPGSLV 243 (374)
T ss_pred CCCCCCceEEEEeeccCCCceEEEEEEEcCceEEEEEEE-eEEEECCCeeEEEEccCCCCCceEEEEEcccceeccccEE
Confidence 345566777655443 567777766665544433222 333331 111111 1112222222 34556788
Q ss_pred EEEEEECCCCCeEeeceeeccceEEEEe---ecceeEEEEEeeCCCCCCceEEEE
Q psy13839 94 IDVRITGPDGNIIYQGERESSGKYTFSA---HMSGVYTYCFSNQMSTMTPKVVMF 145 (413)
Q Consensus 94 I~v~v~dP~g~~l~~~~~~~~g~f~fta---~~~G~y~~Cf~N~~s~~~~k~V~f 145 (413)
+.+++++|+|..........++.-.+.. .+-|.|.+-.+--..+.+.+-+.|
T Consensus 244 ~~~~~~~P~g~~~~~~~~~~~~~~~~~l~~~~~~G~Y~i~~~~~~Tt~~GRe~~~ 298 (374)
T TIGR03503 244 IHGELVFPNGQIQQFSIELEEPETRVDLPANYEFGKYRVKGTVFGTTITGRELQL 298 (374)
T ss_pred EEEEEECCCCceEEecccCccCceEEeccCcCCCeEEEEEEEEEEecCCCCEEEE
Confidence 9999999999844444444444444443 245888776554333333333333
No 37
>PHA02148 hypothetical protein
Probab=34.32 E-value=95 Score=25.87 Aligned_cols=54 Identities=15% Similarity=0.124 Sum_probs=44.7
Q ss_pred cchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccchhhhhHHhhhhe
Q psy13839 230 EPNLQDMIRQLGESLTSV-KHEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVL 283 (413)
Q Consensus 230 ~~~l~~~l~~L~~~l~~I-~~eQ~y~r~RE~~~r~~~Estn~rV~~wSi~qivvl 283 (413)
+..|++++..=...|++| .+.|+-+|.+|+.-|..+|+-...+...|+-|.++-
T Consensus 39 FK~iE~RIA~D~~~L~~~A~~R~NEL~~HE~Air~~M~~HS~~M~~LS~~~~LiS 93 (110)
T PHA02148 39 FKKIEERIAHDQTRLEELAKSRENELRRHEAAIRLAMDSHSIKMDDLSLQEALIS 93 (110)
T ss_pred HHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHH
Confidence 456777877777777776 467889999999999999999999999999888653
No 38
>cd01042 DMQH Demethoxyubiquinone hydroxylase, ferritin-like diiron-binding domain. Demethoxyubiquinone hydroxylases (DMQH) are members of the ferritin-like, diiron-carboxylate family which are present in eukaryotes (the CLK-1/CAT5 family) and prokaryotes (the Coq7 family). DMQH participates in one of the last steps of ubiquinone biosysnthesis and is responsible for DMQ hydroxylation, resulting in the formation of hydroxyubiquinone, a precursor of ubiquinone. CLK-1 is a mitochondrial inner membrane protein and Coq7 is a proposed interfacial integral membrane protein. Mutations in the Caenorhabditis elegans gene clk-1 affect biological timing and extend longevity. The conserved residues of a diiron center are present in this domain.
Probab=33.55 E-value=1.4e+02 Score=27.44 Aligned_cols=82 Identities=16% Similarity=0.124 Sum_probs=50.7
Q ss_pred CCCccceeHHHHHHHhhhhcccCCCCCCCCccchhhhhhhhcccCCchhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH
Q psy13839 293 SKPTNSSSIKHLKKLLNHAKDKLHSKPTNSSSIKHLEKIVFFFFSGEPNLQDMIRQLGES-LTSVKHEQEYMSVRDRIHR 371 (413)
Q Consensus 293 ~~~~~~~qi~~lk~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~v~~L~~~-l~~I~~eq~y~r~RE~~~r 371 (413)
.+|+-+.=++++-.|.-..-..+-++...--...+.|..+-. -++.+|+.|... =.++++-..-+|.=|.+|+
T Consensus 58 ~rps~l~PlW~~~gf~lG~~tal~G~~~a~~~~~avE~~V~~------Hy~~ql~~L~~~~d~~l~~~l~~~r~DE~~H~ 131 (165)
T cd01042 58 VRPSLLLPLWYVAGFALGALTALLGKKAAMACTAAVETVVEE------HYNDQLRELPAQPDKELRAIIEQFRDDELEHA 131 (165)
T ss_pred CCCchHHHHHHHHHHHHHHHHHhhChHHHHHHHHHHHHHHHH------HHHHHHHHhhccCCHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444443344333333446777776655 667777777744 3556666677888899999
Q ss_pred HHHhhhcch
Q psy13839 372 SINESTNSR 380 (413)
Q Consensus 372 ~t~Estn~R 380 (413)
+++......
T Consensus 132 d~A~~~ga~ 140 (165)
T cd01042 132 DIAEELGAE 140 (165)
T ss_pred HHHHHCCCC
Confidence 999998877
No 39
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=31.78 E-value=1.8e+02 Score=23.38 Aligned_cols=20 Identities=10% Similarity=0.172 Sum_probs=12.3
Q ss_pred CceeEEEEEECCCCCeEeec
Q psy13839 90 GFLDIDVRITGPDGNIIYQG 109 (413)
Q Consensus 90 g~~dI~v~v~dP~g~~l~~~ 109 (413)
.+...|+.|+|++|+.|+..
T Consensus 22 sgq~~D~~v~d~~g~~vwrw 41 (82)
T PF12690_consen 22 SGQRYDFVVKDKEGKEVWRW 41 (82)
T ss_dssp SS--EEEEEE-TT--EEEET
T ss_pred CCCEEEEEEECCCCCEEEEe
Confidence 34567888889999999986
No 40
>smart00557 IG_FLMN Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
Probab=31.73 E-value=1.1e+02 Score=24.63 Aligned_cols=17 Identities=35% Similarity=0.739 Sum_probs=13.6
Q ss_pred CCceEEEEEECCCCCeE
Q psy13839 8 GFLDIDVRITGPDGNII 24 (413)
Q Consensus 8 G~~dIdf~i~~P~g~~i 24 (413)
|.-++.+.|.+|+|+.+
T Consensus 30 G~~~~~v~i~~p~g~~~ 46 (93)
T smart00557 30 GGGELEVEVTGPSGKKV 46 (93)
T ss_pred CCCcEEEEEECCCCCee
Confidence 45689999999999654
No 41
>PRK12812 flgD flagellar basal body rod modification protein; Reviewed
Probab=30.77 E-value=1.4e+02 Score=29.45 Aligned_cols=44 Identities=16% Similarity=0.132 Sum_probs=31.7
Q ss_pred CceeEEEEEECCCCCeEeece--eeccceEEEEeec---------ceeEEEEEee
Q psy13839 90 GFLDIDVRITGPDGNIIYQGE--RESSGKYTFSAHM---------SGVYTYCFSN 133 (413)
Q Consensus 90 g~~dI~v~v~dP~g~~l~~~~--~~~~g~f~fta~~---------~G~y~~Cf~N 133 (413)
..-.+.+.|+|.+|++|..-. ..+.|.+.|+-.. +|.|.|-..-
T Consensus 139 ~a~~v~v~I~D~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~A 193 (259)
T PRK12812 139 DSDEGTLEIYDSNNKLVEKIDFKEISQGLFTMEWDGRDNDGVYAGDGEYTIKAVY 193 (259)
T ss_pred cCceEEEEEEeCCCCEEEEEecCCCCCcceeEEECCCCCCCCcCCCeeeEEEEEE
Confidence 345799999999999997653 3456766666543 5888888763
No 42
>PF10400 Vir_act_alpha_C: Virulence activator alpha C-term; InterPro: IPR018309 Phenolic acids, also called substituted hydroxycinnamic acids, are abundant in the plant kingdom because they are involved in the structure of plant cell walls and are present in some vacuoles. In plant-soil ecosystems they are released as free acids by hemicellulases produced by several fungi and bacteria. Of these weak acids, the most abundant are p-coumaric, ferulic, and caffeic acids, considered to be natural toxins that inhibit the growth of microorganisms, especially at low pHs. In spite of this chemical stress, some bacteria can use phenolic acids as a sole source of carbon. For other microorganisms, these compounds induce a specific response by which the organism adapts to its environment. The ubiquitous lactic acid bacterium Lactobacillus plantarum exhibits an inducible phenolic acid decarboxylase (PAD) activity which converts these substrates into less-toxic vinyl phenol derivatives. PadR acts as a repressor of padA gene expression in the phenolic acid stress response []. This entry represents the C-terminal domain.; PDB: 1YG2_A.
Probab=30.42 E-value=2.5e+02 Score=21.99 Aligned_cols=64 Identities=14% Similarity=0.116 Sum_probs=41.4
Q ss_pred ccCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hcchhHHHHHHHHHHHHHHHHH
Q psy13839 335 FFSGEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINES-TNSRVVWWSFFEAVVLVTMTIG 398 (413)
Q Consensus 335 ~~~~~~~le~~v~~L~~~l~~I~~eq~y~r~RE~~~r~t~Es-tn~RV~~wSi~q~~vLi~vs~~ 398 (413)
||++.-+.+..+..|.+...........++....+....... ......|+.+++..+...-+.+
T Consensus 12 ff~~~~~~~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ltl~~Gi~~~~~~i 76 (90)
T PF10400_consen 12 FFGGHLDPEEAIELLEERREQHEERLAEYEEIEQEIFSDPDELDPEAFYWYLTLEYGIRYEQAEI 76 (90)
T ss_dssp HGGGTS-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHSS-GGGS-HHHHHHHHHHHHHHHHHHHHH
T ss_pred HhcccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccCCHHHHHHHHHHHHHHHHHHHHH
Confidence 455555668888888888888877777777766554444333 5667777777777666554433
No 43
>PF04151 PPC: Bacterial pre-peptidase C-terminal domain; InterPro: IPR007280 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. This domain is normally found at the C terminus of secreted archaeal and bacterial peptidases, the majority of which belong to MEROPS peptidase families M4 (vibriolysin, IPR001570 from INTERPRO), M9A amd M9B (microbial collangenase, IPR002169 from INTERPRO), M28 (aminopeptidase Ap1, IPR007484 from INTERPRO) and S8 (subtilisin family peptidases, IPR000209 from INTERPRO).; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 4DY5_B 4DXZ_A 4DY3_B 3JQW_A 3JQX_C 1NQJ_B 1NQD_A 2O8O_A 1WMF_A 1WME_A ....
Probab=29.07 E-value=56 Score=24.82 Aligned_cols=40 Identities=33% Similarity=0.518 Sum_probs=23.3
Q ss_pred ecCCceEEEEEECCCCCeEecceeee--cc---eeEEEeeecccE
Q psy13839 6 EGGFLDIDVRITGPDGNIIYQGERES--SG---KYTFSAHMSGVY 45 (413)
Q Consensus 6 ~GG~~dIdf~i~~P~g~~i~~~~~~~--~g---~~~f~~~~~G~y 45 (413)
.|+..|.+..|++++|+.+.+....+ .| ..+|+++.+|.|
T Consensus 20 ~~~~~d~dl~l~~~~g~~~~~~d~~~~~~~~~~~i~~~~~~~GtY 64 (70)
T PF04151_consen 20 SGGSGDADLYLYDSNGNSLASYDDSSQSGGNDESITFTAPAAGTY 64 (70)
T ss_dssp CETTSSEEEEEEETTSSSCEECCCCTCETTSEEEEEEEESSSEEE
T ss_pred cCCCCCeEEEEEcCCCCchhhheecCCCCCCccEEEEEcCCCEEE
Confidence 34444777889999987776633322 12 234555555554
No 44
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=28.68 E-value=1e+02 Score=26.05 Aligned_cols=13 Identities=23% Similarity=0.292 Sum_probs=10.0
Q ss_pred hhhhHHhhhheee
Q psy13839 273 VWWSFFEAVVLVN 285 (413)
Q Consensus 273 ~~wSi~qivvli~ 285 (413)
.||+++..++.++
T Consensus 7 ~~w~ii~a~~~~~ 19 (106)
T PF10805_consen 7 KNWGIIWAVFGIA 19 (106)
T ss_pred hCcHHHHHHHHHH
Confidence 4788888887766
No 45
>PRK06655 flgD flagellar basal body rod modification protein; Reviewed
Probab=27.84 E-value=1.9e+02 Score=27.86 Aligned_cols=44 Identities=25% Similarity=0.311 Sum_probs=31.0
Q ss_pred CceeEEEEEECCCCCeEeece--eeccceEEEEe---------ecceeEEEEEee
Q psy13839 90 GFLDIDVRITGPDGNIIYQGE--RESSGKYTFSA---------HMSGVYTYCFSN 133 (413)
Q Consensus 90 g~~dI~v~v~dP~g~~l~~~~--~~~~g~f~fta---------~~~G~y~~Cf~N 133 (413)
....+.++|+|.+|++|+.-. ..+.|.+.|+- ..+|.|.+=..-
T Consensus 124 ~a~~vti~I~D~~G~~Vrt~~lg~~~aG~~~f~WDG~d~~G~~lp~G~Yt~~V~A 178 (225)
T PRK06655 124 AADNVTVTITDSAGQVVRTIDLGAQSAGVVSFTWDGTDTDGNALPDGNYTIKASA 178 (225)
T ss_pred CCcEEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeEEEEEEE
Confidence 345699999999999997643 35677777743 235788777753
No 46
>PF11589 DUF3244: Domain of unknown function (DUF3244); InterPro: IPR021638 This family of proteins with unknown function appear to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold however this cannot be confirmed. ; PDB: 3D33_B 3SD2_A.
Probab=27.61 E-value=1.9e+02 Score=23.95 Aligned_cols=25 Identities=24% Similarity=0.472 Sum_probs=15.8
Q ss_pred ceEEEEEECCCCCeEecceeeecce
Q psy13839 10 LDIDVRITGPDGNIIYQGERESSGK 34 (413)
Q Consensus 10 ~dIdf~i~~P~g~~i~~~~~~~~g~ 34 (413)
-++.+.|+|-+|+++++..-...+.
T Consensus 48 ~~vtI~I~d~~G~vVy~~~~~~~~~ 72 (106)
T PF11589_consen 48 GDVTITIKDSTGNVVYSETVSNSAG 72 (106)
T ss_dssp SEEEEEEEETT--EEEEEEESCGGT
T ss_pred CCEEEEEEeCCCCEEEEEEccCCCC
Confidence 3577888888888888875444443
No 47
>PRK12634 flgD flagellar basal body rod modification protein; Reviewed
Probab=27.22 E-value=1.7e+02 Score=28.11 Aligned_cols=44 Identities=18% Similarity=0.253 Sum_probs=32.7
Q ss_pred CceeEEEEEECCCCCeEeece--eeccceEEEEeec---------ceeEEEEEee
Q psy13839 90 GFLDIDVRITGPDGNIIYQGE--RESSGKYTFSAHM---------SGVYTYCFSN 133 (413)
Q Consensus 90 g~~dI~v~v~dP~g~~l~~~~--~~~~g~f~fta~~---------~G~y~~Cf~N 133 (413)
....+.+.|+|.+|++|+... ..+.|.+.|+-.. +|.|.+...-
T Consensus 120 ~a~~v~i~I~d~~G~~V~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Yt~~v~a 174 (221)
T PRK12634 120 SAGFVNFEITDANGAFVKQISVPASAAGEVSFAWDGTDANGNRMAAGKYGVTATQ 174 (221)
T ss_pred cCCeEEEEEEcCCCCEEEEEecCCcCCCceeEEECCCCCCCCcCCCeeeEEEEEE
Confidence 446789999999999997753 4566777776533 4889888853
No 48
>PF04678 DUF607: Protein of unknown function, DUF607; InterPro: IPR006769 This entry represents the C-terminal domain of coiled-coil domain containing protein 109.
Probab=25.86 E-value=3.1e+02 Score=25.26 Aligned_cols=45 Identities=18% Similarity=0.239 Sum_probs=36.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhHHHHHH
Q psy13839 343 QDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSRVVWWSFF 387 (413)
Q Consensus 343 e~~v~~L~~~l~~I~~eq~y~r~RE~~~r~t~Estn~RV~~wSi~ 387 (413)
..+.+.|...+..++.+..-+.....+....++....|++|-.++
T Consensus 56 ~~~~~~l~~~l~~~~~el~~le~~k~~id~~A~~~~~~~~w~gl~ 100 (180)
T PF04678_consen 56 NSRERQLRKRLEELRQELAPLEKIKQEIDEKAEKRARRLLWGGLA 100 (180)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 567777888899999999888888888888888888888875543
No 49
>PF00630 Filamin: Filamin/ABP280 repeat; InterPro: IPR017868 The many different actin cross-linking proteins share a common architecture, consisting of a globular actin-binding domain and an extended rod. Whereas their actin-binding domains consist of two calponin homology domains (see IPR001715 from INTERPRO), their rods fall into three families. The rod domain of the family including the Dictyostelium discoideum (Slime mould) gelation factor (ABP120) and human filamin (ABP280) is constructed from tandem repeats of a 100-residue motif that is glycine and proline rich []. The gelation factor's rod contains 6 copies of the repeat, whereas filamin has a rod constructed from 24 repeats. The resolution of the 3D structure of rod repeats from the gelation factor has shown that they consist of a beta-sandwich, formed by two beta-sheets arranged in an immunoglobulin-like fold [, ]. Because conserved residues that form the core of the repeats are preserved in filamin, the repeat structure should be common to the members of the gelation factor/filamin family. The head to tail homodimerisation is crucial to the function of the ABP120 and ABP280 proteins. This interaction involves a small portion at the distal end of the rod domains. For the gelation factor it has been shown that the carboxy-terminal repeat 6 dimerises through a double edge-to-edge extension of the beta-sheet and that repeat 5 contributes to dimerisation to some extent [, , ].; PDB: 2DI9_A 2EEC_A 2DIC_A 2EEA_A 2DMC_A 2EE9_A 2D7O_A 2D7N_A 2K7P_A 2NQC_A ....
Probab=25.32 E-value=2.2e+02 Score=22.67 Aligned_cols=13 Identities=46% Similarity=0.779 Sum_probs=10.3
Q ss_pred ceEEEEEECCCCC
Q psy13839 10 LDIDVRITGPDGN 22 (413)
Q Consensus 10 ~dIdf~i~~P~g~ 22 (413)
-++.+.|.+|++.
T Consensus 42 ~~~~v~i~~p~~~ 54 (101)
T PF00630_consen 42 DEFQVTITSPDGK 54 (101)
T ss_dssp SEEEEEEESSSSE
T ss_pred ceeEEEEeCCCCC
Confidence 3677888999886
No 50
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=24.33 E-value=1.9e+02 Score=21.87 Aligned_cols=42 Identities=10% Similarity=0.163 Sum_probs=23.3
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc
Q psy13839 230 EPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSR 271 (413)
Q Consensus 230 ~~~l~~~l~~L~~~l~~I~~eQ~y~r~RE~~~r~~~Estn~r 271 (413)
++.|...+..|...+..|..+..-+|.=-..-.+-+...|.|
T Consensus 5 id~Ls~dVq~L~~kvdqLs~dv~~lr~~v~~ak~EAaRAN~R 46 (56)
T PF04728_consen 5 IDQLSSDVQTLNSKVDQLSSDVNALRADVQAAKEEAARANQR 46 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666677777777777666666663333333333334444
No 51
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=24.18 E-value=1.1e+02 Score=22.80 Aligned_cols=35 Identities=29% Similarity=0.519 Sum_probs=25.1
Q ss_pred eEEEEEECCCCCeEec--ceeeecceeEEEeeecccE
Q psy13839 11 DIDVRITGPDGNIIYQ--GERESSGKYTFSAHMSGVY 45 (413)
Q Consensus 11 dIdf~i~~P~g~~i~~--~~~~~~g~~~f~~~~~G~y 45 (413)
.+-|.|++.++..+.. ..-...|.+.|+--..|.|
T Consensus 3 Ga~f~L~~~~~~~~~~~~~~Td~~G~~~f~~L~~G~Y 39 (70)
T PF05738_consen 3 GATFELYDEDGNEVIEVTVTTDENGKYTFKNLPPGTY 39 (70)
T ss_dssp TEEEEEEETTSEEEEEEEEEGGTTSEEEEEEEESEEE
T ss_pred CeEEEEEECCCCEEEEEEEEECCCCEEEEeecCCeEE
Confidence 3679999999888776 4555567777776666654
No 52
>PRK15396 murein lipoprotein; Provisional
Probab=24.02 E-value=2.2e+02 Score=22.97 Aligned_cols=43 Identities=12% Similarity=0.179 Sum_probs=29.3
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchh
Q psy13839 339 EPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINESTNSRV 381 (413)
Q Consensus 339 ~~~le~~v~~L~~~l~~I~~eq~y~r~RE~~~r~t~Estn~RV 381 (413)
++.|...|+.|...+..+..+..-+|.--..-..-++..|.|+
T Consensus 27 vd~LssqV~~L~~kvdql~~dv~~~~~~~~~a~~eA~raN~Rl 69 (78)
T PRK15396 27 IDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRL 69 (78)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6778888888888888888888777754444444444455553
No 53
>PF09323 DUF1980: Domain of unknown function (DUF1980); InterPro: IPR015402 Members of this occur in gene pairs with members of PF03773 from PFAM. The N-terminal region contains several predicted transmembrane helix regions while the few invariant residues (G, CxxD, and W) occur in the C-terminal region. Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function has not, as yet, been defined.
Probab=23.89 E-value=97 Score=28.46 Aligned_cols=35 Identities=20% Similarity=0.227 Sum_probs=29.3
Q ss_pred hhhcchhHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy13839 375 ESTNSRVVWWSFFEAVVLVTMTIGQIYYLKRFFEV 409 (413)
Q Consensus 375 Estn~RV~~wSi~q~~vLi~vs~~Qv~~Lk~fF~~ 409 (413)
-=.|-|..+++.+-+++|++++++|++.+-+--.+
T Consensus 25 ~YI~P~~~~~~~~a~i~l~ilai~q~~~~~~~~~~ 59 (182)
T PF09323_consen 25 LYIHPRYIPLLYFAAILLLILAIVQLWRWFRPKRR 59 (182)
T ss_pred HHhCccHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 34678999999999999999999999987665544
No 54
>PF11368 DUF3169: Protein of unknown function (DUF3169); InterPro: IPR021509 Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently there is no known function.
Probab=23.82 E-value=6.4e+02 Score=24.17 Aligned_cols=23 Identities=17% Similarity=0.247 Sum_probs=10.1
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHH
Q psy13839 340 PNLQDMIRQLGESLTSVKHEQEY 362 (413)
Q Consensus 340 ~~le~~v~~L~~~l~~I~~eq~y 362 (413)
+..++-++...++=+.+.-++.|
T Consensus 171 k~~kk~i~s~DEgEk~i~y~~~y 193 (248)
T PF11368_consen 171 KFVKKWIESCDEGEKQINYEASY 193 (248)
T ss_pred HHHHHHHHhCCHHHHHHHHHHHH
Confidence 34444444444444444444443
No 55
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=22.81 E-value=4.9e+02 Score=25.11 Aligned_cols=35 Identities=20% Similarity=0.167 Sum_probs=23.2
Q ss_pred hhcccCCchhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy13839 332 VFFFFSGEPNLQDMIRQLGESLTSVKHEQEYMSVR 366 (413)
Q Consensus 332 ~~~~~~~~~~le~~v~~L~~~l~~I~~eq~y~r~R 366 (413)
++.+=+.+..||.+|.++...+..++....-.+.|
T Consensus 10 ~~~~~~~L~rle~qi~q~~~~~~~~qs~l~~~~~r 44 (251)
T COG5415 10 VTKYTADLSRLESQIHQLDVALKKSQSILSQWQSR 44 (251)
T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444457788888888888887776665544443
No 56
>PRK12813 flgD flagellar basal body rod modification protein; Reviewed
Probab=22.44 E-value=2.5e+02 Score=27.04 Aligned_cols=44 Identities=9% Similarity=0.144 Sum_probs=28.4
Q ss_pred CceeEEEEEECCCCCeEeeceeeccceEEEEee---------cceeEEEEEeeC
Q psy13839 90 GFLDIDVRITGPDGNIIYQGERESSGKYTFSAH---------MSGVYTYCFSNQ 134 (413)
Q Consensus 90 g~~dI~v~v~dP~g~~l~~~~~~~~g~f~fta~---------~~G~y~~Cf~N~ 134 (413)
....+.+.|+|.+|++|+...- +.|.+.|+-. .+|.|.|-..-.
T Consensus 122 ~a~~v~v~I~D~~G~vV~t~~~-~~G~~~f~WDG~d~~G~~l~~G~Yt~~V~A~ 174 (223)
T PRK12813 122 DADKAELVVRDAAGAEVARETV-PVGAGPVEWAGEDADGNPLPNGAYSFVVESY 174 (223)
T ss_pred CCceEEEEEEcCCCCEEEEEee-CCCceeEEeCCcCCCCCcCCCccEEEEEEEE
Confidence 3456899999999999977543 3444444332 247777776543
No 57
>PF10528 PA14_2: GLEYA domain; InterPro: IPR018871 This presumed domain is found in fungal adhesins and is related to the PA14 domain. ; PDB: 4A3X_A.
Probab=21.89 E-value=70 Score=27.40 Aligned_cols=24 Identities=33% Similarity=0.506 Sum_probs=14.7
Q ss_pred eeecCCceEEEEEECCCCCeEecc
Q psy13839 4 ISEGGFLDIDVRITGPDGNIIYQG 27 (413)
Q Consensus 4 V~~GG~~dIdf~i~~P~g~~i~~~ 27 (413)
+..+|....+|.|++|+|+.+...
T Consensus 79 ~N~~g~~~~~~~i~~P~G~~~~~~ 102 (113)
T PF10528_consen 79 ANGGGPGSFDFSITDPDGTVHTDD 102 (113)
T ss_dssp EE-SS-EEEEEEEEETT-S--B--
T ss_pred EcCCCceEEEEEEECCCCcEEecC
Confidence 445788899999999999998775
No 58
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=21.15 E-value=9.3e+02 Score=25.31 Aligned_cols=64 Identities=6% Similarity=0.191 Sum_probs=35.2
Q ss_pred hhhhcccCCchhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHH
Q psy13839 330 KIVFFFFSGEPNLQDMIRQLGESLTSVK--HEQEYMSVRDRIHRSINESTNSRVVWWSFFEAVVLVTMTIGQ 399 (413)
Q Consensus 330 ~~~~~~~~~~~~le~~v~~L~~~l~~I~--~eq~y~r~RE~~~r~t~Estn~RV~~wSi~q~~vLi~vs~~Q 399 (413)
|.....+-...++++.|...+.++..+. ..|.+.. -...+..+.|+++..++.+++-++.-++.
T Consensus 291 K~~Yqs~eRaRdi~E~~Es~qtRisklE~~~~Qq~~q------~e~~~n~~~r~~l~k~inllL~l~~vlLv 356 (395)
T PF10267_consen 291 KMAYQSYERARDIWEVMESCQTRISKLEQQQQQQVVQ------LEGTENSRARALLGKLINLLLTLLTVLLV 356 (395)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhhhhhh------hcccccccHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333336677777777777777776 2222221 23344555777777777665544444333
No 59
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=20.48 E-value=1.8e+02 Score=24.71 Aligned_cols=40 Identities=25% Similarity=0.253 Sum_probs=21.7
Q ss_pred cCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy13839 336 FSGEPNLQDMIRQLGESLTSVKHEQEYMSVRDRIHRSINE 375 (413)
Q Consensus 336 ~~~~~~le~~v~~L~~~l~~I~~eq~y~r~RE~~~r~t~E 375 (413)
|..+..++..+..|...+..++.....+-+--++.+.-++
T Consensus 7 ~~~l~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~ 46 (107)
T PF06156_consen 7 FDRLDQLEQQLGQLLEELEELKKQLQELLEENARLRIENE 46 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3345666666666666666666555554444443333333
No 60
>PRK12633 flgD flagellar basal body rod modification protein; Provisional
Probab=20.39 E-value=2.8e+02 Score=26.74 Aligned_cols=44 Identities=20% Similarity=0.338 Sum_probs=31.0
Q ss_pred CceeEEEEEECCCCCeEeece--eeccceEEEEee---------cceeEEEEEee
Q psy13839 90 GFLDIDVRITGPDGNIIYQGE--RESSGKYTFSAH---------MSGVYTYCFSN 133 (413)
Q Consensus 90 g~~dI~v~v~dP~g~~l~~~~--~~~~g~f~fta~---------~~G~y~~Cf~N 133 (413)
..-.+.+.|+|.+|++|+.-. ..+.|.+.|+-. .+|.|+|-..-
T Consensus 127 ~a~~v~v~I~D~~G~vV~t~~lg~~~aG~~~f~WDG~d~~G~~~~~G~Y~~~V~a 181 (230)
T PRK12633 127 DATKVTVKVLDPSGAVVRTMELGDLKTGVHTLQWDGNNDGGQPLADGKYSITVSA 181 (230)
T ss_pred cCcEEEEEEEeCCCCEEEEEecCCCCCCceeEEECCCCCCCCcCCCcceEEEEEE
Confidence 445799999999999998653 346676766632 24778777753
Done!