Query psy11306
Match_columns 413
No_of_seqs 197 out of 751
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 23:33:56 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11306.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11306hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3838|consensus 100.0 3E-102 6E-107 768.8 31.3 362 25-412 28-398 (497)
2 cd06902 lectin_ERGIC-53_ERGL E 100.0 8.4E-62 1.8E-66 463.5 28.5 225 30-255 1-225 (225)
3 PF03388 Lectin_leg-like: Legu 100.0 1.1E-55 2.5E-60 422.2 25.4 225 30-255 1-229 (229)
4 cd06901 lectin_VIP36_VIPL VIP3 100.0 6.9E-54 1.5E-58 414.3 28.1 219 36-259 6-228 (248)
5 cd06903 lectin_EMP46_EMP47 EMP 100.0 3.5E-50 7.5E-55 381.2 25.7 201 36-253 5-213 (215)
6 cd07308 lectin_leg-like legume 100.0 2.4E-48 5.2E-53 367.7 28.0 216 34-254 3-218 (218)
7 KOG3839|consensus 100.0 1.2E-46 2.6E-51 368.3 18.2 219 35-258 57-279 (351)
8 cd01951 lectin_L-type legume l 100.0 6.5E-31 1.4E-35 248.9 23.8 195 49-252 12-222 (223)
9 cd06899 lectin_legume_LecRK_Ar 100.0 9.2E-28 2E-32 230.5 24.5 195 50-253 12-234 (236)
10 PF00139 Lectin_legB: Legume l 99.9 3.3E-25 7.2E-30 212.4 20.5 197 50-252 12-236 (236)
11 cd06900 lectin_VcfQ VcfQ bacte 99.1 3.3E-08 7.1E-13 95.2 21.7 195 51-251 14-253 (255)
12 KOG3838|consensus 95.0 0.12 2.5E-06 53.6 8.8 113 265-402 263-380 (497)
13 cd06902 lectin_ERGIC-53_ERGL E 76.1 1.1 2.5E-05 43.1 0.9 16 295-310 1-16 (225)
14 PF07889 DUF1664: Protein of u 65.8 11 0.00023 33.6 4.5 66 267-365 61-126 (126)
15 PF06439 DUF1080: Domain of Un 64.3 1.1E+02 0.0023 27.4 13.6 155 48-237 9-174 (185)
16 PF09712 PHA_synth_III_E: Poly 51.6 1.5E+02 0.0032 29.9 10.5 54 342-411 239-292 (293)
17 KOG0249|consensus 49.3 69 0.0015 36.2 8.2 82 265-369 161-243 (916)
18 PF10393 Matrilin_ccoil: Trime 43.7 40 0.00086 25.0 3.7 29 338-366 12-40 (47)
19 PF10168 Nup88: Nuclear pore c 41.4 2E+02 0.0042 32.8 10.5 22 347-368 644-665 (717)
20 KOG4403|consensus 40.2 67 0.0014 34.3 6.1 75 266-366 251-326 (575)
21 PF12248 Methyltransf_FA: Farn 38.0 1.2E+02 0.0027 25.3 6.5 45 185-236 52-96 (102)
22 KOG3192|consensus 37.7 1.5E+02 0.0033 27.5 7.3 34 337-370 98-136 (168)
23 PF07081 DUF1349: Protein of u 35.5 3.6E+02 0.0079 24.9 14.8 115 88-239 48-162 (183)
24 KOG1029|consensus 34.3 1.1E+02 0.0024 35.1 6.9 10 279-288 373-382 (1118)
25 cd09234 V_HD-PTP_like Protein- 34.1 3.1E+02 0.0067 27.9 9.9 37 325-361 235-277 (337)
26 PRK11519 tyrosine kinase; Prov 33.4 2.3E+02 0.005 32.0 9.5 26 263-288 270-295 (719)
27 KOG0994|consensus 33.3 1.7E+02 0.0038 35.1 8.4 79 335-413 1211-1289(1758)
28 PRK10894 lipopolysaccharide tr 32.6 75 0.0016 29.4 4.7 57 7-71 5-66 (180)
29 PRK10803 tol-pal system protei 32.4 1.5E+02 0.0033 29.2 7.1 58 336-410 44-101 (263)
30 PRK13676 hypothetical protein; 32.3 1.7E+02 0.0036 24.9 6.5 72 266-362 35-108 (114)
31 cd09236 V_AnPalA_UmRIM20_like 30.1 1.4E+02 0.003 30.7 6.6 36 325-360 252-293 (353)
32 smart00282 LamG Laminin G doma 30.0 2.4E+02 0.0052 23.7 7.2 22 184-205 61-82 (135)
33 PF10458 Val_tRNA-synt_C: Valy 28.6 2E+02 0.0043 22.2 5.8 23 266-288 3-25 (66)
34 PRK09841 cryptic autophosphory 28.1 3.5E+02 0.0076 30.5 9.9 26 263-288 270-295 (726)
35 KOG0999|consensus 27.4 99 0.0021 34.1 5.0 92 266-375 42-140 (772)
36 PTZ00009 heat shock 70 kDa pro 26.2 91 0.002 34.7 4.8 39 263-301 511-551 (653)
37 PF11153 DUF2931: Protein of u 25.4 62 0.0013 30.7 2.9 35 7-41 3-37 (216)
38 TIGR02302 aProt_lowcomp conser 25.3 1.9E+02 0.0041 33.6 7.1 112 265-406 521-641 (851)
39 PRK10884 SH3 domain-containing 25.3 6.1E+02 0.013 24.3 10.3 24 265-288 91-114 (206)
40 PF03268 DUF267: Caenorhabditi 25.3 1.3E+02 0.0028 31.4 5.2 52 270-360 179-230 (353)
41 PLN03184 chloroplast Hsp70; Pr 25.0 71 0.0015 35.8 3.7 39 263-301 538-578 (673)
42 cd08915 V_Alix_like Protein-in 24.3 2.3E+02 0.005 28.7 6.9 38 325-362 241-284 (342)
43 PF09224 DUF1961: Domain of un 23.7 3.1E+02 0.0066 26.7 7.1 66 47-118 14-87 (218)
44 PF08649 DASH_Dad1: DASH compl 23.4 1.7E+02 0.0036 22.7 4.3 35 328-362 2-36 (58)
45 PF07072 DUF1342: Protein of u 23.1 1.6E+02 0.0034 28.4 5.1 34 333-366 66-99 (211)
46 KOG2314|consensus 23.0 5.7E+02 0.012 28.6 9.6 38 335-372 627-664 (698)
47 PF06133 DUF964: Protein of un 22.9 1.5E+02 0.0033 24.5 4.5 73 266-363 32-106 (108)
48 TIGR01834 PHA_synth_III_E poly 22.5 3E+02 0.0065 28.3 7.2 54 342-411 256-309 (320)
49 PF08372 PRT_C: Plant phosphor 21.3 2.5E+02 0.0053 25.9 5.8 57 314-373 33-90 (156)
50 PF13600 DUF4140: N-terminal d 20.6 4.8E+02 0.011 21.4 8.0 43 244-287 48-90 (104)
51 TIGR01005 eps_transp_fam exopo 20.3 4.7E+02 0.01 29.4 9.0 32 267-322 288-319 (754)
52 PRK10722 hypothetical protein; 20.1 2.5E+02 0.0055 27.8 5.9 35 336-371 177-211 (247)
53 PF05130 FlgN: FlgN protein; 20.1 2.7E+02 0.0059 23.3 5.7 19 392-410 75-93 (143)
No 1
>KOG3838|consensus
Probab=100.00 E-value=2.8e-102 Score=768.84 Aligned_cols=362 Identities=51% Similarity=0.884 Sum_probs=340.9
Q ss_pred CCCCceeeeeeeeccCCcccCCCCCCCCeEEeeeeEeeCCeEEeCCCCCCceeEEEeCCcccccceEEEEEEEEEeCCCC
Q psy11306 25 SQNPVERFEYKYSFKPPYLAQKDGSVPFWEYGGNCIASLENVRVAPSLRSQKGAIWTKQTTNFEWWNVDIVFRVTGRGRI 104 (413)
Q Consensus 25 ~~~~~~r~e~k~Sf~~P~l~~~~~~i~~W~~~GdAii~~~~IRLTp~~~sq~GsiWsk~pi~~~sFe~eftFrI~g~g~~ 104 (413)
..+|||||||||||++|+|++++++||+|.++||||.+.+.|||+|++++++|++|+|..+++++|+||++|||+|.||.
T Consensus 28 ~~~~~rrFEYK~SFk~P~Laq~dgtiPFW~~~GdAIas~eqvRlaPSmrsrkGavWtka~~~fe~weVev~~rVtGrGRi 107 (497)
T KOG3838|consen 28 GTPPHRRFEYKYSFKGPRLAQPDGTIPFWSHHGDAIASSEQVRLAPSMRSRKGAVWTKASVPFENWEVEVQFRVTGRGRI 107 (497)
T ss_pred CCCccceeeeeecccCCccccCCCCcceeeecCcccccccceeeccccccccCceeecccCCcccceEEEEEEecccccc
Confidence 35699999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCeEEEEEecCCCCCCCCcCCCCCCccEEEEEEEccCCCCCCCCCeEEEEecCCcccccCCCCCCcccccceeeccccC
Q psy11306 105 GADGLAFWYTSEKGSYDGEVFGSSDRWKGLGLFFDSFDNDNNHNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNK 184 (413)
Q Consensus 105 ggDGlAFW~t~~~~~~~G~vfG~~~~f~GlaI~FDTf~N~~~~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~ 184 (413)
||||||||||...+ ..|+|||+.+.|+|+|||||+|+||+++|+|+|+++.|||+..|||.+||..+.|++|.+||||+
T Consensus 108 GAdGlaiWYt~~~G-~~GpVfGg~d~WnGigiffDSfdnD~qknnP~Is~~lndGt~~ydh~~DGasQ~LssCqrDFRNk 186 (497)
T KOG3838|consen 108 GADGLAIWYTRGRG-HVGPVFGGLDSWNGIGIFFDSFDNDGQKNNPAISVLLNDGTIPYDHPGDGASQGLSSCQRDFRNK 186 (497)
T ss_pred cCCceEEEEecCCC-cccccccccccccceEEEeecccccCCcCCccEEEEecCCcccccCCCccHHHHHHHhhHHhccC
Confidence 99999999999999 99999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CcceEEEEEEECCEEEEEEEeCccCCCCceeEEEEecccccCCCcEEEEEeeCCCCCccEEEEEEEEeecCCCCCCC---
Q psy11306 185 PYPTRARIQYYMNTLTVWFHNGMTNNEQDIEVCLRVENIYLPKEGYFGVSAATGGLADDHDILHFLTSSLLPPGAKQ--- 261 (413)
Q Consensus 185 ~~p~~~rI~Y~~~~LsV~v~~~~t~~~~~~~~C~~v~~v~LP~~~yfGfSAaTG~lad~HDIls~~~~sL~~p~~~~--- 261 (413)
|+|+++||+|++++|+|+|+++++++ ++|+.|.+++++.||+++|||||||||+++|+|||++|.||+|+.|+++|
T Consensus 187 PyPvRarItY~~nvLtv~innGmtp~-d~yE~C~rve~~~lp~nGyFGvSAATGgLADDHDVl~FltfsL~e~~qeppqP 265 (497)
T KOG3838|consen 187 PYPVRARITYYGNVLTVMINNGMTPS-DDYEFCVRVENLLLPPNGYFGVSAATGGLADDHDVLSFLTFSLSEPGQEPPQP 265 (497)
T ss_pred CCCceEEEEEeccEEEEEEcCCCCCC-CCcceeEeccceeccCCCeeeeeecccccccccceeeeEEeeecCCCCCCCCc
Confidence 99999999999999999999999988 78999999999999999999999999999999999999999999999985
Q ss_pred -cCCChHHHHHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCCCCchhhh---hccchHHH
Q psy11306 262 -QEQVNQEDQKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAHPNEEEWY---ESENQREL 337 (413)
Q Consensus 262 -~~~~~~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~l 337 (413)
.|+.+.|++||++|++.+|.+||++|+ +|+|.|||+++++|+|| |++++|||
T Consensus 266 ~~E~qe~ek~kyqeEfe~~q~elek~k~------------------------efkk~hpd~~~e~ee~~~~yEs~~~Rel 321 (497)
T KOG3838|consen 266 FLEMQELEKAKYQEEFEWAQLELEKRKD------------------------EFKKSHPDAQGEGEELFDLYESLGHREL 321 (497)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHhhhHh------------------------hhccCCchhhcchhhhhhhhhccchHHH
Confidence 467778999999999999999999999 99999999999977777 99999999
Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhccccCC--CccCCCCCCCCCCccchHHHHHhhHHHHHHHHHHhc
Q psy11306 338 RQIFQGQSQLAEWTKAIATGLDALQQKQDRILAVVSQGG--GIPHQVVPGQPMPMINNDALLASQNSLLSTVQELRY 412 (413)
Q Consensus 338 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~~ 412 (413)
||||++||+|++.|++|++|||++++.|+|..+.+...+ +.+.+++.+....+.++|.++.+|+.|+++++|||.
T Consensus 322 rqi~egQn~i~~~l~ql~rql~~il~~Q~~~~~a~~p~asg~~p~~~~~gG~~sqse~d~v~k~~~~l~~qiremkq 398 (497)
T KOG3838|consen 322 RQILEGQNAIHKQLAQLERQLDKILGPQARPVGALAPDASGQIPSTPGRGGHLSQSENDDVAKVQALLLGQIREMKQ 398 (497)
T ss_pred HHHHhhhhHHHHHHHHHHHHHHHHhCccccCCcccCCcccCCCCCCCCCCCccchhhhhHHHHHHHHHHHHHHHHHH
Confidence 999999999999999999999999999999997765543 233333322334668999999999999999999985
No 2
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC). ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) which may act as a regulator of ERGIC-53. In mammals, ERGIC-53 forms a complex with MCFD2 (multi-coagulation factor deficiency 2) which then recruits blood coagulation factors V and VIII. Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=100.00 E-value=8.4e-62 Score=463.51 Aligned_cols=225 Identities=68% Similarity=1.202 Sum_probs=216.4
Q ss_pred eeeeeeeeccCCcccCCCCCCCCeEEeeeeEeeCCeEEeCCCCCCceeEEEeCCcccccceEEEEEEEEEeCCCCCCCeE
Q psy11306 30 ERFEYKYSFKPPYLAQKDGSVPFWEYGGNCIASLENVRVAPSLRSQKGAIWTKQTTNFEWWNVDIVFRVTGRGRIGADGL 109 (413)
Q Consensus 30 ~r~e~k~Sf~~P~l~~~~~~i~~W~~~GdAii~~~~IRLTp~~~sq~GsiWsk~pi~~~sFe~eftFrI~g~g~~ggDGl 109 (413)
|||||||||++||+.+.+..|++|++.|+|++.+++|||||+.++++|+||++.|+..++|+++|+|||+|.++.+||||
T Consensus 1 ~~~~~~~s~~~P~l~~~~~~i~~W~~~G~t~~~~~~IrLTp~~~~~~G~iw~~~~~~~~~w~ie~~Fri~g~~~~~gdG~ 80 (225)
T cd06902 1 RRFEYKYSFKGPHLAQKDGTVPFWSHGGDAIASLEQVRLTPSLRSKKGSVWTKNPFSFENWEVEVTFRVTGRGRIGADGL 80 (225)
T ss_pred CcceeeecccCcccccCCCCCCceEecccEEecCCEEEECCCCCCCEEEEeeCCCcCCCCEEEEEEEEEecCCCCCCCEE
Confidence 79999999999999988889999999999999999999999999999999999999999999999999999888899999
Q ss_pred EEEEecCCCCCCCCcCCCCCCccEEEEEEEccCCCCCCCCCeEEEEecCCcccccCCCCCCcccccceeeccccCCcceE
Q psy11306 110 AFWYTSEKGSYDGEVFGSSDRWKGLGLFFDSFDNDNNHNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNKPYPTR 189 (413)
Q Consensus 110 AFW~t~~~~~~~G~vfG~~~~f~GlaI~FDTf~N~~~~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~~~p~~ 189 (413)
|||||.+++ ..|++||+++.|+||||+|||++|++.+++|+|++++|||++.|++.+|+..+++++|.++|||.+.|++
T Consensus 81 a~W~t~~~~-~~G~~~G~~~~f~Gl~I~~Dt~~n~~~~~~p~i~~~~NDGt~~yd~~~D~~~~~~~~C~~~~rn~~~p~~ 159 (225)
T cd06902 81 AIWYTKERG-EEGPVFGSSDKWNGVGIFFDSFDNDGKKNNPAILVVGNDGTKSYDHQNDGLTQALGSCLRDFRNKPYPVR 159 (225)
T ss_pred EEEEECCCC-CCCCccCCCCcccEEEEEEECCCCCCCCCCcEEEEEECCCCeeccccCCCcccccceEEEeccCCCCCeE
Confidence 999999988 8999999999999999999999998877889999999999999999999999999999999999999999
Q ss_pred EEEEEECCEEEEEEEeCccCCCCceeEEEEecccccCCCcEEEEEeeCCCCCccEEEEEEEEeecC
Q psy11306 190 ARIQYYMNTLTVWFHNGMTNNEQDIEVCLRVENIYLPKEGYFGVSAATGGLADDHDILHFLTSSLL 255 (413)
Q Consensus 190 ~rI~Y~~~~LsV~v~~~~t~~~~~~~~C~~v~~v~LP~~~yfGfSAaTG~lad~HDIls~~~~sL~ 255 (413)
+||+|.++.|+|+++++.+++..+|+.||+++++.||+++||||||+||+++|+|||++|.||+|+
T Consensus 160 ~rI~Y~~~~l~V~~d~~~~~~~~~~~~Cf~~~~v~LP~~~yfGiSA~Tg~l~d~hDIls~~~~sl~ 225 (225)
T cd06902 160 AKITYYQNVLTVSINNGFTPNKDDYELCTRVENMVLPPNGYFGVSAATGGLADDHDVLSFLTFSLT 225 (225)
T ss_pred EEEEEECCeEEEEEeCCcCCCCCcccEEEecCCeeCCCCCEEEEEecCCCCCCcEeEEEEEEeccC
Confidence 999999999999999766677788999999999999999999999999999999999999999875
No 3
>PF03388 Lectin_leg-like: Legume-like lectin family; InterPro: IPR005052 Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first members of the family of animal lectins similar to the leguminous plant lectins []. The alignment for this family is towards the N terminus, where the similarity of VIP36 and ERGIC-53 is greatest. Although they have been identified as a family of animal lectins, this alignment also includes yeast sequences[]. ERGIC-53 is a 53kDa protein, localised to the intermediate region between the endoplasmic reticulum and the Golgi apparatus (ER-Golgi-Intermediate Compartment, ERGIC). It was identified as a calcium-dependent, mannose-specific lectin []. Its dysfunction has been associated with combined factors V and VIII deficiency, suggesting an important and substrate-specific role for ERGIC-53 in the glycoprotein-secreting pathway [,]. The L-type lectin-like domain has an overall globular shape composed of a beta-sandwich of two major twisted antiparallel beta-sheets. The beta-sandwich comprises a major concave beta-sheet and a minor convex beta-sheet, in a variation of the jelly roll fold [, , , ]. ; GO: 0016020 membrane; PDB: 3A4U_A 3LCP_B 2A6Z_A 2A71_C 2A70_B 2A6Y_A 2A6X_A 2A6W_B 2A6V_B 2E6V_B ....
Probab=100.00 E-value=1.1e-55 Score=422.16 Aligned_cols=225 Identities=45% Similarity=0.889 Sum_probs=196.7
Q ss_pred eeeeeeeeccCCcccCCCCCCCCeEEeeeeEeeCCeEEeCCCCCCceeEEEeCCcccccceEEEEEEEEEeCC-CCCCCe
Q psy11306 30 ERFEYKYSFKPPYLAQKDGSVPFWEYGGNCIASLENVRVAPSLRSQKGAIWTKQTTNFEWWNVDIVFRVTGRG-RIGADG 108 (413)
Q Consensus 30 ~r~e~k~Sf~~P~l~~~~~~i~~W~~~GdAii~~~~IRLTp~~~sq~GsiWsk~pi~~~sFe~eftFrI~g~g-~~ggDG 108 (413)
.||+++|||++||+......+++|++.|+|++.+++|||||+.++++|+||++.|+..++|+++++|||+|.+ +.+|||
T Consensus 1 ~~~~~~~S~~~P~~~~~~~~i~~W~~~G~t~i~~~~IrLTp~~~~~~G~iws~~~~~~~~w~i~~~Fri~g~~~~~~g~G 80 (229)
T PF03388_consen 1 ARFKESHSFSPPFLDNGDNEIPNWDIGGSTVITDNFIRLTPDRQSQSGSIWSRKPIPFDNWEIEFTFRISGQEKGLGGDG 80 (229)
T ss_dssp -EEEGGGEEESSSCSCTTSCBTTEEEEET-EEESSEEEEE-SSTTEEEEEEESS-BEESEEEEEEEEEEESS-SSS-S-E
T ss_pred CcccccCCcCCCcCcCCCCccCCEEECCeEEecCCEEEECCCcccCEEEEEEcCCCCccCEEEEEEEEEeccccCcCCCe
Confidence 3789999999999998778999999999999999999999999999999999999999999999999999985 479999
Q ss_pred EEEEEecCCCCCCCCcCCCCCCccEEEEEEEccCCCCC---CCCCeEEEEecCCcccccCCCCCCcccccceeeccccCC
Q psy11306 109 LAFWYTSEKGSYDGEVFGSSDRWKGLGLFFDSFDNDNN---HNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNKP 185 (413)
Q Consensus 109 lAFW~t~~~~~~~G~vfG~~~~f~GlaI~FDTf~N~~~---~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~~ 185 (413)
||||||+++. ..|++||+++.|+||||+||||+|++. ..+|+|++++|||++.|++.+||...++++|.++|||.+
T Consensus 81 ~a~W~t~~~~-~~G~~fG~~~~f~Gl~i~idt~~N~~~~~~~~~p~i~~~~NDGt~~~~~~~dg~~~~~~~C~~~~rn~~ 159 (229)
T PF03388_consen 81 MAFWYTKDPG-SDGPVFGGPDKFDGLGIFIDTYDNDEGGHKRGFPYISAMLNDGTKSYDHDNDGKDQSLGSCSADYRNSD 159 (229)
T ss_dssp EEEEEESSSS-SSCSBTTB-SS-EEEEEEEEES-TTCTTCTSTSSEEEEEEEESSS---GGGTTTTT-SEEEE---BTES
T ss_pred EEEEEEcCcc-ccccccCCCcccceEEEEEEcccCCCcccccccceEEEEecCCCccccccccCcccccccceeccCcCC
Confidence 9999999998 899999999999999999999999763 467999999999999999999999999999999999999
Q ss_pred cceEEEEEEECCEEEEEEEeCccCCCCceeEEEEecccccCCCcEEEEEeeCCCCCccEEEEEEEEeecC
Q psy11306 186 YPTRARIQYYMNTLTVWFHNGMTNNEQDIEVCLRVENIYLPKEGYFGVSAATGGLADDHDILHFLTSSLL 255 (413)
Q Consensus 186 ~p~~~rI~Y~~~~LsV~v~~~~t~~~~~~~~C~~v~~v~LP~~~yfGfSAaTG~lad~HDIls~~~~sL~ 255 (413)
.|+++||+|.++.|+|.++.+...+..+|+.|++++.+.||+++||||||+||.++|+|||+++++++|+
T Consensus 160 ~p~~~ri~Y~~~~l~v~id~~~~~~~~~~~~Cf~~~~v~LP~~~yfGvSA~Tg~~~d~hdi~s~~~~~l~ 229 (229)
T PF03388_consen 160 VPTRIRISYSKNTLTVSIDSNYLKNQDDWELCFTTDGVDLPEGYYFGVSAATGELSDNHDILSVKTYSLD 229 (229)
T ss_dssp SEEEEEEEEETTEEEEEEETSCCSECCTTEEEEEESTEEGGSSBEEEEEEEESSSGGEEEEEEEEEEEEE
T ss_pred CCEEEEEEEECCeEEEEEecccccCCcCCcEEEEcCCeecCCCCEEEEEecCCCCCCcEEEEEEEEeecC
Confidence 9999999999999999999765566788999999999999999999999999999999999999999874
No 4
>cd06901 lectin_VIP36_VIPL VIP36 and VIPL type 1 transmembrane proteins, lectin domain. The vesicular integral protein of 36 kDa (VIP36) is a type 1 transmembrane protein of the mammalian early secretory pathway that acts as a cargo receptor transporting high mannose type glycoproteins between the Golgi and the endoplasmic reticulum (ER). Lectins of the early secretory pathway are involved in the selective transport of newly synthesized glycoproteins from the ER to the ER-Golgi intermediate compartment (ERGIC). The most prominent cycling lectin is the mannose-binding type1 membrane protein ERGIC-53, which functions as a cargo receptor to facilitate export of glycoproteins from the ER. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded she
Probab=100.00 E-value=6.9e-54 Score=414.33 Aligned_cols=219 Identities=37% Similarity=0.735 Sum_probs=205.5
Q ss_pred eeccCCcccCCCCCCCCeEEeeeeEeeCCeEEeCCCCCCceeEEEeCCcccccceEEEEEEEEEeCCC-CCCCeEEEEEe
Q psy11306 36 YSFKPPYLAQKDGSVPFWEYGGNCIASLENVRVAPSLRSQKGAIWTKQTTNFEWWNVDIVFRVTGRGR-IGADGLAFWYT 114 (413)
Q Consensus 36 ~Sf~~P~l~~~~~~i~~W~~~GdAii~~~~IRLTp~~~sq~GsiWsk~pi~~~sFe~eftFrI~g~g~-~ggDGlAFW~t 114 (413)
|||.+||+..... |++|.+.|+|++.+++|||||+.++++|+||++.|+.+.+|+++|+|+|++.++ .+||||||||+
T Consensus 6 ~sL~~Py~~~~~~-i~~w~~~G~a~v~~~~IrLTp~~~~~~G~~w~~~p~~~~~F~~~f~F~I~~~~~~~~GdGlAfw~t 84 (248)
T cd06901 6 HSLIKPYQGVGSS-MPLWDFLGSTMVTSQYIRLTPDHQSKQGSIWNRVPCYLRDWEMHVHFKVHGSGKNLFGDGFAIWYT 84 (248)
T ss_pred ccccCCeecCCCC-CCCEEEcceEEEcCCeEEECCCCCCCEEEEeccCCccCCCEEEEEEEEEeCCCCCCCCCEEEEEEE
Confidence 8999999987444 999999999999999999999999999999999999999999999999998765 78999999999
Q ss_pred cCCCCCCCCcCCCCCCccEEEEEEEccCCCCC---CCCCeEEEEecCCcccccCCCCCCcccccceeeccccCCcceEEE
Q psy11306 115 SEKGSYDGEVFGSSDRWKGLGLFFDSFDNDNN---HNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNKPYPTRAR 191 (413)
Q Consensus 115 ~~~~~~~G~vfG~~~~f~GlaI~FDTf~N~~~---~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~~~p~~~r 191 (413)
++++ +.|++||.++.|+||||+||||+|+.. +.+|+|++++|||++.|++.+||..+++++|.++|||++.|+++|
T Consensus 85 ~~~~-~~G~~fG~~~~f~Gl~I~~Dt~~n~~~~~~~~~P~i~~~~NDGt~~yd~~~Dg~~~~~~~C~~~~rn~~~~t~~r 163 (248)
T cd06901 85 KERM-QPGPVFGSKDNFHGLAIFFDTYSNQNGEHEHVHPYISAMVNNGSLSYDHDRDGTHTELAGCSAPFRNKDHDTFVA 163 (248)
T ss_pred cCCC-ccCcccccCCCCceEEEEEECCCCCCCcccCCCceEEEEEcCCCeeecccCCCchhhcCceeeeccCCCCCeEEE
Confidence 9988 899999999999999999999999864 468999999999999999999999999999999999999999999
Q ss_pred EEEECCEEEEEEEeCccCCCCceeEEEEecccccCCCcEEEEEeeCCCCCccEEEEEEEEeecCCCCC
Q psy11306 192 IQYYMNTLTVWFHNGMTNNEQDIEVCLRVENIYLPKEGYFGVSAATGGLADDHDILHFLTSSLLPPGA 259 (413)
Q Consensus 192 I~Y~~~~LsV~v~~~~t~~~~~~~~C~~v~~v~LP~~~yfGfSAaTG~lad~HDIls~~~~sL~~p~~ 259 (413)
|+|.++.|+|.++. .+..+|+.||+++.+.||+++||||||+||+++|+|||+++++|+|+.|..
T Consensus 164 I~Y~~~~l~v~vd~---~~~~~w~~Cf~~~~v~LP~~~yfGiSA~Tg~~sd~hdIlsv~~~~l~~~~~ 228 (248)
T cd06901 164 IRYSKGRLTVMTDI---DGKNEWKECFDVTGVRLPTGYYFGASAATGDLSDNHDIISMKLYELDVEET 228 (248)
T ss_pred EEEECCeEEEEEec---CCCCceeeeEEeCCeecCCCCEEEEEecCCCCCCcEEEEEEEEecCccccc
Confidence 99999999999886 234679999999999999999999999999999999999999999998764
No 5
>cd06903 lectin_EMP46_EMP47 EMP46 and EMP47 type 1 transmembrane proteins, N-terminal lectin domain. EMP46 and EMP47, N-terminal carbohydrate recognition domain. EMP46 and EMP47 are fungal type-I transmembrane proteins that cycle between the endoplasmic reticulum and the golgi apparatus and are thought to function as cargo receptors that transport newly synthesized glycoproteins. EMP47 is a receptor for EMP46 responsible for the selective transport of EMP46 by forming hetero-oligomerization between the two proteins. EMP46 and EMP47 have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. EMP46 and EMP47 are 45% sequence-identical to one another and have sequence homology to a class of intracellular lectins defined by ERGIC-53 and VIP36. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat s
Probab=100.00 E-value=3.5e-50 Score=381.16 Aligned_cols=201 Identities=23% Similarity=0.505 Sum_probs=185.2
Q ss_pred eeccCCcccC-C--CCCCCCeEEeeeeEeeCCeEEeCCCCCCceeEEEeCCccccc-ceEEEEEEEEEeCCCCCCCeEEE
Q psy11306 36 YSFKPPYLAQ-K--DGSVPFWEYGGNCIASLENVRVAPSLRSQKGAIWTKQTTNFE-WWNVDIVFRVTGRGRIGADGLAF 111 (413)
Q Consensus 36 ~Sf~~P~l~~-~--~~~i~~W~~~GdAii~~~~IRLTp~~~sq~GsiWsk~pi~~~-sFe~eftFrI~g~g~~ggDGlAF 111 (413)
+|| |.+.. . +..|++|++.|+|++.+++|||||+ ++++|+||++.|+.+. +|+++|+|||+|.++.+||||||
T Consensus 5 ~s~--p~l~~~~~~~~~i~~W~~~G~t~v~~~~IrLTp~-~s~~G~iWs~~pl~~~~~w~ie~~Fri~G~~~~~gdGla~ 81 (215)
T cd06903 5 LSL--PNLLKISPNGKLIPNWQTSGNPKLESGRIILTPP-GNQRGSLWLKKPLSLKDEWTIEWTFRSTGPEGRSGGGLNF 81 (215)
T ss_pred ccC--CchhccccCCCCCCCeEEcCcEEeeCCeEEECCC-CCceEeEeeCCcCCCCCCEEEEEEEEecccCCcCCCEEEE
Confidence 677 55544 2 4689999999999999999999999 9999999999999999 99999999999988789999999
Q ss_pred EEecCCCCCCC-CcCCCCCCccEEEEEEEccCCCCCCCCCeEEEEecCCcccccCCCCCCcccccceeeccccCCcceEE
Q psy11306 112 WYTSEKGSYDG-EVFGSSDRWKGLGLFFDSFDNDNNHNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNKPYPTRA 190 (413)
Q Consensus 112 W~t~~~~~~~G-~vfG~~~~f~GlaI~FDTf~N~~~~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~~~p~~~ 190 (413)
|||++++...| ++||+++.|+||||+|||+.|+ .|+|++++|||++.|++.+| ..+++++|.++|||.+.|+++
T Consensus 82 W~t~~~~~~~g~~~fG~~~~f~Gl~I~~Dt~~n~----~p~i~~~~NDGt~~yd~~~d-~~~~~g~C~~~~rn~~~p~~i 156 (215)
T cd06903 82 WLVKDGNADVGTSSIYGPSKFDGLQLLIDNNGGS----GGSLRGFLNDGSKDYKNEDV-DSLAFGSCLFAYQDSGVPSTI 156 (215)
T ss_pred EEECCCcccCCccccCCCCCCcEEEEEEECCCCC----CceEEEEECCCCeeccccCC-cccccceeeEeccCCCCCEEE
Confidence 99998873345 9999999999999999999974 38999999999999999999 889999999999999999999
Q ss_pred EEEEEC--CEEEEEEEeCccCCCCceeEEEEecccccC-CCcEEEEEeeCCCCCccEEEEEEEEee
Q psy11306 191 RIQYYM--NTLTVWFHNGMTNNEQDIEVCLRVENIYLP-KEGYFGVSAATGGLADDHDILHFLTSS 253 (413)
Q Consensus 191 rI~Y~~--~~LsV~v~~~~t~~~~~~~~C~~v~~v~LP-~~~yfGfSAaTG~lad~HDIls~~~~s 253 (413)
||+|.+ +.|+|+|++ +.||+++++.|| .++||||||+||+++|+|||+++++|+
T Consensus 157 ri~Y~~~~~~l~v~vd~---------~~Cf~~~~v~lP~~~y~fGiSAaTg~~~d~hdIl~~~~~~ 213 (215)
T cd06903 157 RLSYDALNSLFKVQVDN---------RLCFQTDKVQLPQGGYRFGITAANADNPESFEILKLKVWN 213 (215)
T ss_pred EEEEECCCCEEEEEECC---------CEEEecCCeecCCCCCEEEEEEcCCCCCCcEEEEEEEEec
Confidence 999999 789999864 699999999999 678899999999999999999999986
No 6
>cd07308 lectin_leg-like legume-like lectins: ERGIC-53, ERGL, VIP36, VIPL, EMP46, and EMP47. The legume-like (leg-like) lectins are eukaryotic intracellular sugar transport proteins with a carbohydrate recognition domain similar to that of the legume lectins. This domain binds high-mannose-type oligosaccharides for transport from the endoplasmic reticulum to the Golgi complex. These leg-like lectins include ERGIC-53, ERGL, VIP36, VIPL, EMP46, EMP47, and the UIP5 (ULP1-interacting protein 5) precursor protein. Leg-like lectins have different intracellular distributions and dynamics in the endoplasmic reticulum-Golgi system of the secretory pathway and interact with N-glycans of glycoproteins in a calcium-dependent manner, suggesting a role in glycoprotein sorting and trafficking. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "ba
Probab=100.00 E-value=2.4e-48 Score=367.66 Aligned_cols=216 Identities=39% Similarity=0.787 Sum_probs=202.3
Q ss_pred eeeeccCCcccCCCCCCCCeEEeeeeEeeCCeEEeCCCCCCceeEEEeCCcccccceEEEEEEEEEeCCCCCCCeEEEEE
Q psy11306 34 YKYSFKPPYLAQKDGSVPFWEYGGNCIASLENVRVAPSLRSQKGAIWTKQTTNFEWWNVDIVFRVTGRGRIGADGLAFWY 113 (413)
Q Consensus 34 ~k~Sf~~P~l~~~~~~i~~W~~~GdAii~~~~IRLTp~~~sq~GsiWsk~pi~~~sFe~eftFrI~g~g~~ggDGlAFW~ 113 (413)
..|||.+||+.+.+..+++|.+.|+|++.+++|||||+.++++|++|++.|+++++|+++|+|+|++.+..+||||||||
T Consensus 3 ~~~Sl~~P~~~~~~~~~~~w~~~G~a~~~~~~i~LT~~~~~~~G~~~~~~pi~~~~F~~~f~F~i~~~~~~~gdG~af~~ 82 (218)
T cd07308 3 SEHSLSPPFLDDNDGEIGNWTVGGSTVITKNYIRLTPDVPSQSGSLWSRVPIPAKDFEIEVEFSIHGGSGLGGDGFAFWY 82 (218)
T ss_pred cccccCCCcccCCCCccCCeEEcCCeEEeCCEEEeCCCCCCCEeEEEeCCCccCCCEEEEEEEEEeCCCCCCCCEEEEEE
Confidence 46999999999877889999999999999999999999999999999999999999999999999987667999999999
Q ss_pred ecCCCCCCCCcCCCCCCccEEEEEEEccCCCCCCCCCeEEEEecCCcccccCCCCCCcccccceeeccccCCcceEEEEE
Q psy11306 114 TSEKGSYDGEVFGSSDRWKGLGLFFDSFDNDNNHNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNKPYPTRARIQ 193 (413)
Q Consensus 114 t~~~~~~~G~vfG~~~~f~GlaI~FDTf~N~~~~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~~~p~~~rI~ 193 (413)
+++++ ..|++||+++.|+||||+|||+.|+ ...+|+|++++|||++.|++.+|+....+++|.+++|+.+.++++||+
T Consensus 83 ~~~~~-~~g~~~G~~~~~~Glai~fdt~~n~-~~~~p~i~~~~Ndg~~~~~~~~d~~~~~~~~c~~~~~~~~~~~~~~I~ 160 (218)
T cd07308 83 TEEPG-SDGPLFGGPDKFKGLAIFFDTYDND-GKGFPSISVFLNDGTKSYDYETDGEKLELASCSLKFRNSNAPTTLRIS 160 (218)
T ss_pred ECCCC-CCCcccccCCCCCEEEEEEEcCCCC-CCCCCeEEEEEeCCCceecccCCCccccccceeEecccCCCCeEEEEE
Confidence 99888 8999999999999999999999997 456799999999999999999999999999999999998889999999
Q ss_pred EECCEEEEEEEeCccCCCCceeEEEEecccccCCCcEEEEEeeCCCCCccEEEEEEEEeec
Q psy11306 194 YYMNTLTVWFHNGMTNNEQDIEVCLRVENIYLPKEGYFGVSAATGGLADDHDILHFLTSSL 254 (413)
Q Consensus 194 Y~~~~LsV~v~~~~t~~~~~~~~C~~v~~v~LP~~~yfGfSAaTG~lad~HDIls~~~~sL 254 (413)
|..+.|+|.++. ....+|..|++++.+.||+++||||||+||.++|+|||++|+++.+
T Consensus 161 y~~~~l~v~i~~---~~~~~~~~c~~~~~~~l~~~~y~G~sA~tg~~~d~~dIls~~~~~~ 218 (218)
T cd07308 161 YLNNTLKVDITY---SEGNNWKECFTVEDVILPSQGYFGFSAQTGDLSDNHDILSVHTYEL 218 (218)
T ss_pred EECCEEEEEEeC---CCCCCccEEEEcCCcccCCCCEEEEEeccCCCcCcEEEEEEEeeeC
Confidence 998899999986 3445799999999999999999999999999999999999999863
No 7
>KOG3839|consensus
Probab=100.00 E-value=1.2e-46 Score=368.33 Aligned_cols=219 Identities=36% Similarity=0.690 Sum_probs=207.8
Q ss_pred eeeccCCcccCCCCCCCCeEEeeeeEeeCCeEEeCCCCCCceeEEEeCCcccccceEEEEEEEEEeCCC-CCCCeEEEEE
Q psy11306 35 KYSFKPPYLAQKDGSVPFWEYGGNCIASLENVRVAPSLRSQKGAIWTKQTTNFEWWNVDIVFRVTGRGR-IGADGLAFWY 113 (413)
Q Consensus 35 k~Sf~~P~l~~~~~~i~~W~~~GdAii~~~~IRLTp~~~sq~GsiWsk~pi~~~sFe~eftFrI~g~g~-~ggDGlAFW~ 113 (413)
+||+..||...+ +..++|++.|+++++.++|||||+.+++.|+||++.|+..++||+.+.|+|++.++ ..|||||+||
T Consensus 57 ehSL~~Py~g~g-~r~~~W~~~Gstvv~~~~irLT~d~qsk~GAv~n~~Pv~s~~wev~v~fkv~~~s~~lfgdG~Aiw~ 135 (351)
T KOG3839|consen 57 EHSLSKPYQGVG-GRQPNWNLSGSTVVTSNYIRLTPDEQSKSGAVWNRQPVFSRDWEVLVHFKVHGQSKNLFGDGMAIWY 135 (351)
T ss_pred hhhccccccccc-ccccCccccccEEEEeeeeeccccccccccccccCCCccccceeEEEEEEEecCCCcccccceEEEe
Confidence 599999999986 88999999999999999999999999999999999999999999999999999875 7999999999
Q ss_pred ecCCCCCCCCcCCCCCCccEEEEEEEccCCCCC---CCCCeEEEEecCCcccccCCCCCCcccccceeeccccCCcceEE
Q psy11306 114 TSEKGSYDGEVFGSSDRWKGLGLFFDSFDNDNN---HNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNKPYPTRA 190 (413)
Q Consensus 114 t~~~~~~~G~vfG~~~~f~GlaI~FDTf~N~~~---~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~~~p~~~ 190 (413)
|.++. ++||+||..++|+||+|++|||.|... +.+|+|++|+|+|+.+|||..||..+.+++|.++|||.++.+.+
T Consensus 136 t~Er~-q~GPvFG~~dkF~GL~vfidtY~n~~g~~~~v~P~isvmv~~gs~sYD~~~Dg~~tel~gc~a~~rn~~~dt~~ 214 (351)
T KOG3839|consen 136 TKERA-QPGPVFGSKDKFTGLAVFIDTYGNHNGPHEHVFPYISVMVNIGSLSYDHSKDGTHTELAGCTANFRNLPHDTLV 214 (351)
T ss_pred ecccc-cCCCCCCCcccceeEEEEEeccCCcCCCcccceeeEEEEeccCCcccccCCCCCccccccceeeeccCCCccee
Confidence 99999 899999999999999999999999753 34799999999999999999999999999999999999999999
Q ss_pred EEEEECCEEEEEEEeCccCCCCceeEEEEecccccCCCcEEEEEeeCCCCCccEEEEEEEEeecCCCC
Q psy11306 191 RIQYYMNTLTVWFHNGMTNNEQDIEVCLRVENIYLPKEGYFGVSAATGGLADDHDILHFLTSSLLPPG 258 (413)
Q Consensus 191 rI~Y~~~~LsV~v~~~~t~~~~~~~~C~~v~~v~LP~~~yfGfSAaTG~lad~HDIls~~~~sL~~p~ 258 (413)
.|+|.+++|++.++. .++.+|..|+.++.+.||.+.|||+||+||+++|+|||+|++.+.|..-.
T Consensus 215 ~iry~~~~l~~~~dl---~~~~~~~~c~~~n~v~lp~g~~fg~SasTGdlSd~HdivS~kl~~L~d~~ 279 (351)
T KOG3839|consen 215 VIRYEKKTLSISIDL---EGPNEWIDCFSLNNVELPLGYFFGVSASTGDLSDSHDIVSLKLFELTDIH 279 (351)
T ss_pred EEEecCCceEEEEec---CCCceeeeeeeecceecccceEEeeeeccCccchhhHHHHhhhhhhhccc
Confidence 999999999999997 45678999999999999999999999999999999999999999988764
No 8
>cd01951 lectin_L-type legume lectins. The L-type (legume-type) lectins are a highly diverse family of carbohydrate binding proteins that generally display no enzymatic activity toward the sugars they bind. This family includes arcelin, concanavalinA, the lectin-like receptor kinases, the ERGIC-53/VIP36/EMP46 type1 transmembrane proteins, and an alpha-amylase inhibitor. L-type lectins have a dome-shaped beta-barrel carbohydrate recognition domain with a curved seven-stranded beta-sheet referred to as the "front face" and a flat six-stranded beta-sheet referred to as the "back face". This domain homodimerizes so that adjacent back sheets form a contiguous 12-stranded sheet and homotetramers occur by a back-to-back association of these homodimers. Though L-type lectins exhibit both sequence and structural similarity to one another, their carbohydrate binding specificities differ widely.
Probab=99.98 E-value=6.5e-31 Score=248.92 Aligned_cols=195 Identities=27% Similarity=0.420 Sum_probs=159.6
Q ss_pred CCCCeEEeeeeEee--CCeEEeCCCCCCceeEEEeCCcccc-cceEEEEEEEEEeCCCCCCCeEEEEEecCCCCC---CC
Q psy11306 49 SVPFWEYGGNCIAS--LENVRVAPSLRSQKGAIWTKQTTNF-EWWNVDIVFRVTGRGRIGADGLAFWYTSEKGSY---DG 122 (413)
Q Consensus 49 ~i~~W~~~GdAii~--~~~IRLTp~~~sq~GsiWsk~pi~~-~sFe~eftFrI~g~g~~ggDGlAFW~t~~~~~~---~G 122 (413)
..++|.+.|+|.+. ++.|||||+..+++|+||++.|+++ .+|+++|+|+|.+.+..+||||||||++++... .|
T Consensus 12 ~~~~~~~~G~A~~~~~~~~l~Lt~~~~~~~G~~~~~~~i~~~~~F~~~F~f~i~~~~~~~gdG~aF~l~~~~~~~~~~~g 91 (223)
T cd01951 12 NQSNWQLNGSATLTTDSGVLRLTPDTGNQAGSAWYKTPIDLSKDFTTTFKFYLGTKGTNGADGIAFVLQNDPAGALGGGG 91 (223)
T ss_pred ChhhcEEcccEEecCCCCEEEECCCCCCcEEEEEECCcEeccCCEEEEEEEEEeCCCCCCCCcEEEEEecCCCCccccCC
Confidence 35689999999988 8999999999999999999999999 999999999999876679999999999876522 22
Q ss_pred --CcCCCCCCccEEEEEEEccCCCC--CCCCCeEEEEecCCcccccCCCCCCcccccceeeccccC-CcceEEEEEEEC-
Q psy11306 123 --EVFGSSDRWKGLGLFFDSFDNDN--NHNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNK-PYPTRARIQYYM- 196 (413)
Q Consensus 123 --~vfG~~~~f~GlaI~FDTf~N~~--~~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~-~~p~~~rI~Y~~- 196 (413)
..+|..+.+++||||||||.|.+ ..+.++|.+.+|.....| + ....++.|....... +..+++||+|+.
T Consensus 92 ~~~~lG~~~~~~~~aVefDT~~N~~~~dp~~~higi~~n~~~~~~-~----~~~~~~~~~~~~~~~~g~~~~v~I~Y~~~ 166 (223)
T cd01951 92 GGGGLGYGGIGNSVAVEFDTYKNDDNNDPNGNHISIDVNGNGNNT-A----LATSLGSASLPNGTGLGNEHTVRITYDPT 166 (223)
T ss_pred CCCccCccccCCeEEEEEeccccCCCCCCCCCEEEEEcCCCCCCc-c----cccccceeeCCCccCCCCEEEEEEEEeCC
Confidence 67888899999999999999975 345678999888776554 1 123567777665544 778999999996
Q ss_pred -CEEEEEEEeCccCCCCceeEEEEecc---cccCCCcEEEEEeeCCCCCccEEEEEEEEe
Q psy11306 197 -NTLTVWFHNGMTNNEQDIEVCLRVEN---IYLPKEGYFGVSAATGGLADDHDILHFLTS 252 (413)
Q Consensus 197 -~~LsV~v~~~~t~~~~~~~~C~~v~~---v~LP~~~yfGfSAaTG~lad~HDIls~~~~ 252 (413)
+.|+|.++.... .+..|+.+.. ..+|+.+||||||+||..++.|+|++|++.
T Consensus 167 ~~~L~v~l~~~~~----~~~~~l~~~~~l~~~~~~~~yvGFTAsTG~~~~~h~V~~wsf~ 222 (223)
T cd01951 167 TNTLTVYLDNGST----LTSLDITIPVDLIQLGPTKAYFGFTASTGGLTNLHDILNWSFT 222 (223)
T ss_pred CCEEEEEECCCCc----cccccEEEeeeecccCCCcEEEEEEcccCCCcceeEEEEEEec
Confidence 579999986321 3567776641 145899999999999999999999999975
No 9
>cd06899 lectin_legume_LecRK_Arcelin_ConA legume lectins, lectin-like receptor kinases, arcelin, concanavalinA, and alpha-amylase inhibitor. This alignment model includes the legume lectins (also known as agglutinins), the arcelin (also known as phytohemagglutinin-L) family of lectin-like defense proteins, the LecRK family of lectin-like receptor kinases, concanavalinA (ConA), and an alpha-amylase inhibitor. Arcelin is a major seed glycoprotein discovered in kidney beans (Phaseolus vulgaris) that has insecticidal properties and protects the seeds from predation by larvae of various bruchids. Arcelin is devoid of monosaccharide binding properties and lacks a key metal-binding loop that is present in other members of this family. Phytohaemagglutinin (PHA) is a lectin found in plants, especially beans, that affects cell metabolism by inducing mitosis and by altering the permeability of the cell membrane to various proteins. PHA agglutinates most mammalian red blood cell types by bindin
Probab=99.96 E-value=9.2e-28 Score=230.51 Aligned_cols=195 Identities=22% Similarity=0.257 Sum_probs=145.8
Q ss_pred CCCeEEeeeeEee-CCeEEeCCCC--CCceeEEEeCCcccc--------cceEEEEEEEEEeCC-CCCCCeEEEEEecCC
Q psy11306 50 VPFWEYGGNCIAS-LENVRVAPSL--RSQKGAIWTKQTTNF--------EWWNVDIVFRVTGRG-RIGADGLAFWYTSEK 117 (413)
Q Consensus 50 i~~W~~~GdAii~-~~~IRLTp~~--~sq~GsiWsk~pi~~--------~sFe~eftFrI~g~g-~~ggDGlAFW~t~~~ 117 (413)
..+|.+.|+|.+. ++.|+||++. .++.|++|++.|+++ .+|+++|+|+|++.. ..+|||||||+++++
T Consensus 12 ~~~l~l~G~A~~~~~~~i~LT~~~~~~~~~G~v~y~~pi~l~~~~~~~~~sFst~F~F~i~~~~~~~~gdGlAF~i~~~~ 91 (236)
T cd06899 12 QSNLTLQGDATISSNGALQLTNDTSPASSVGRALYSKPVRLWDSTTGKVASFSTSFSFSITPPNPSLGGDGLAFFLAPTD 91 (236)
T ss_pred CCCEEEecceEcCCCCeEEecCCCCCCcceEEEEeCCCEEeecCCCCCceeEEEEEEEEEEcCCCCCCCCeEEEEEecCC
Confidence 5689999999987 8999999998 899999999999986 489999999999765 378999999999976
Q ss_pred CC---CCCCcCCCCC-------CccEEEEEEEccCCCCC--CCCCeEEEEecCC-cccccCCCCCCcccccceeeccccC
Q psy11306 118 GS---YDGEVFGSSD-------RWKGLGLFFDSFDNDNN--HNNPYIMAVVNDG-NMAFDHQNDGASQSLAGCLRDFRNK 184 (413)
Q Consensus 118 ~~---~~G~vfG~~~-------~f~GlaI~FDTf~N~~~--~~~P~I~i~~NDG-t~~yd~~~DG~~~~L~~C~~~~rn~ 184 (413)
.. ..|+.+|..+ ...+|||||||+.|.+. .+.+||.+.+|-= +..... +..-..++ ..
T Consensus 92 ~~~~~~~G~~lG~~~~~~~~~~~~~~vAVEFDT~~n~~~~D~~~nHigIdvn~~~S~~~~~--------~~~~~~~l-~~ 162 (236)
T cd06899 92 SLPPASSGGYLGLFNSSNNGNSSNHIVAVEFDTFQNPEFGDPDDNHVGIDVNSLVSVKAGY--------WDDDGGKL-KS 162 (236)
T ss_pred CCCCCCCcceeeeecCCCCCCcccceEEEEeecccCcccCCCCCCeEEEEcCCcccceeec--------cccccccc-cC
Confidence 52 2467777654 34689999999999753 4457888887631 111000 00000111 34
Q ss_pred CcceEEEEEEEC--CEEEEEEEeCccCCCCceeEEEEec-ccccCCCcEEEEEeeCCCCCccEEEEEEEEee
Q psy11306 185 PYPTRARIQYYM--NTLTVWFHNGMTNNEQDIEVCLRVE-NIYLPKEGYFGVSAATGGLADDHDILHFLTSS 253 (413)
Q Consensus 185 ~~p~~~rI~Y~~--~~LsV~v~~~~t~~~~~~~~C~~v~-~v~LP~~~yfGfSAaTG~lad~HDIls~~~~s 253 (413)
+..++|||.|+. +.|+|++............+...++ ...||+.+||||||+||...+.|+|++|.+.+
T Consensus 163 g~~~~v~I~Y~~~~~~L~V~l~~~~~~~~~~~~ls~~vdL~~~l~~~~~vGFSasTG~~~~~h~i~sWsF~s 234 (236)
T cd06899 163 GKPMQAWIDYDSSSKRLSVTLAYSGVAKPKKPLLSYPVDLSKVLPEEVYVGFSASTGLLTELHYILSWSFSS 234 (236)
T ss_pred CCeEEEEEEEcCCCCEEEEEEEeCCCCCCcCCEEEEeccHHHhCCCceEEEEEeEcCCCcceEEEEEEEEEc
Confidence 678999999996 4799999874322222233444444 44679999999999999999999999999865
No 10
>PF00139 Lectin_legB: Legume lectin domain; InterPro: IPR001220 Legume lectins are one of the largest lectin families with more than 70 lectins reported. Leguminous plant lectins resemble each other in their physicochemical properties although they differ in their carbohydrate specificities. They consist of two or four subunits with relative molecular mass of 30 kDa and each subunit has one carbohydrate-binding site. The interaction with sugars requires tightly bound calcium and manganese ions. The structural similarities of these lectins are reported by the primary structural analyses and X-ray crystallographic studies. X-ray studies have shown that the folding of the polypeptide chains in the region of the carbohydrate-binding sites is also similar, despite differences in the primary sequences. The carbohydrate-binding sites of these lectins consist of two conserved amino acids on beta pleated sheets. One of these loops contains transition metals, calcium and manganese, which keep the amino acid residues of the sugar-binding site at the required positions. Amino acid sequences of this loop play an important role in the carbohydrate-binding specificities of these lectins. These lectins bind either glucose/mannose or galactose. The exact function of legume lectins is not known but they may be involved in the attachment of nitrogen-fixing bacteria to legumes and in the protection against pathogens. Some legume lectins are proteolytically processed to produce two chains, beta (which corresponds to the N-terminal) and alpha (C-terminal) (IPR000985 from INTERPRO). The lectin concanavalin A (conA) from jack bean is exceptional in that the two chains are transposed and ligated (by formation of a new peptide bond). The N terminus of mature conA thus corresponds to that of the alpha chain and the C terminus to the beta chain.; GO: 0005488 binding; PDB: 1VLN_B 2GDF_C 2JE9_C 2JEC_C 1DGL_B 2P37_B 2CWM_A 2P34_D 2OW4_A 3IPV_B ....
Probab=99.94 E-value=3.3e-25 Score=212.40 Aligned_cols=197 Identities=21% Similarity=0.315 Sum_probs=143.7
Q ss_pred CCCeEEeeeeEe-eCCeEEeCCCCC-CceeEEEeCCcccc--------cceEEEEEEEEEeC---CCCCCCeEEEEEecC
Q psy11306 50 VPFWEYGGNCIA-SLENVRVAPSLR-SQKGAIWTKQTTNF--------EWWNVDIVFRVTGR---GRIGADGLAFWYTSE 116 (413)
Q Consensus 50 i~~W~~~GdAii-~~~~IRLTp~~~-sq~GsiWsk~pi~~--------~sFe~eftFrI~g~---g~~ggDGlAFW~t~~ 116 (413)
..+|.+.|+|.+ .++.|+||++.. ++.|++|++.|+++ .+|+++|+|+|... ...+||||||+++++
T Consensus 12 ~~~~~l~G~A~~~~~~~l~LT~~~~~~~~G~~~y~~pi~l~d~~~~~~~sF~t~F~f~i~~~~~~~~~~~dGlAFvi~~~ 91 (236)
T PF00139_consen 12 SSNLTLNGDASISSNGSLQLTPDSTNNQAGRAWYNNPIQLWDSTTGNVASFSTSFSFSITNGPGSSNNGGDGLAFVIQPD 91 (236)
T ss_dssp GTTEEEEETEEEETTSEEESSTBETSSEEEEEEESSEEESBETTTTEBEEEEEEEEEEEEESSSSSSS-BEEEEEEEEET
T ss_pred CCceEEEeeEEeccCCeEEcCCCCCCCcEEEEEECCcEEEeCCCCcceeeeeeEEEEEEeccCCCCccCCCceEEEEecC
Confidence 478999999998 799999999998 99999999999998 79999999999643 347899999999998
Q ss_pred CCC---CCCCcCCCCC-------CccEEEEEEEccCCCCC--CCCCeEEEEecCCcccccCCCCCCcccccceeeccccC
Q psy11306 117 KGS---YDGEVFGSSD-------RWKGLGLFFDSFDNDNN--HNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNK 184 (413)
Q Consensus 117 ~~~---~~G~vfG~~~-------~f~GlaI~FDTf~N~~~--~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~ 184 (413)
... ..|+.+|..+ ..+.||||||||.|.+. .+.+||.|.+|.-....... -+. ...-..++ ..
T Consensus 92 ~~~~~~~~g~~lG~~~~~~~~~~~~~~vAVEFDT~~N~~~~d~~~nHIgI~~n~~~s~~~~~-~~~---~~~~~~~l-~~ 166 (236)
T PF00139_consen 92 PNLPGGSSGGYLGLFNSSTDGNGINNSVAVEFDTYKNPEYNDPDDNHIGIDVNSVVSNKTAS-AGY---YSSPSFSL-SD 166 (236)
T ss_dssp TSSTTTSSGGGTTTSSSSSTTGGGGCEEEEEEETSTCGGGTTTSSSEEEEEESSSSESEEEE--------EEEEHHH-GT
T ss_pred cccccCCCCCccCccccccCCCccCcEEEEEEeeeecccccccCCCEEEEECCCCccccccc-ccc---cccccccc-cC
Confidence 741 2556677644 45689999999997543 34578988876422111100 000 00011122 33
Q ss_pred CcceEEEEEEECC--EEEEEEEeCccCCCCceeEEEEec-ccccCCCcEEEEEeeCCCCCccEEEEEEEEe
Q psy11306 185 PYPTRARIQYYMN--TLTVWFHNGMTNNEQDIEVCLRVE-NIYLPKEGYFGVSAATGGLADDHDILHFLTS 252 (413)
Q Consensus 185 ~~p~~~rI~Y~~~--~LsV~v~~~~t~~~~~~~~C~~v~-~v~LP~~~yfGfSAaTG~lad~HDIls~~~~ 252 (413)
+..+++||+|+.. .|+|.++... .++....++..++ .-.||+.+||||||+||...+.|+|++|.+.
T Consensus 167 g~~~~v~I~Yd~~~~~L~V~l~~~~-~~~~~~~l~~~vdL~~~l~~~v~vGFsasTG~~~~~h~I~sW~F~ 236 (236)
T PF00139_consen 167 GKWHTVWIDYDASTKRLSVYLDDNS-SKPSSPVLSVNVDLSAVLPEQVYVGFSASTGGSYQTHDILSWSFS 236 (236)
T ss_dssp TSEEEEEEEEETTTTEEEEEEEETT-TTSEEEEEEEE--HHHHSCSEEEEEEEEEESSSSEEEEEEEEEEE
T ss_pred CcEEEEEEEEcCCccEEEEEEeccc-CCCcceeEEEEEchHHhcCCCcEEEEEeecCCCcceEEEEEEEeC
Confidence 6789999999995 7999998741 2233345555554 3467899999999999999999999999973
No 11
>cd06900 lectin_VcfQ VcfQ bacterial pilus biogenesis protein, lectin domain. This family includes bacterial proteins homologous to the VcfQ (also known as MshQ) bacterial pilus biogenesis protein. VcfQ is encoded by the vcfQ gene of the type IV pilus gene cluster of Vibrio cholerae and is essential for type IV pilus assembly. VcfQ has a Laminin G-like domain as well as an L-type lectin domain.
Probab=99.07 E-value=3.3e-08 Score=95.19 Aligned_cols=195 Identities=23% Similarity=0.281 Sum_probs=120.5
Q ss_pred CCeEEeeee-----EeeCCeEEeCCCCCCceeEEEeCCcccccc--eEEEEEEEEEeCCCCCCCeEEEEEecCCC-CCC-
Q psy11306 51 PFWEYGGNC-----IASLENVRVAPSLRSQKGAIWTKQTTNFEW--WNVDIVFRVTGRGRIGADGLAFWYTSEKG-SYD- 121 (413)
Q Consensus 51 ~~W~~~GdA-----ii~~~~IRLTp~~~sq~GsiWsk~pi~~~s--Fe~eftFrI~g~g~~ggDGlAFW~t~~~~-~~~- 121 (413)
.+|.....- .+.++++|||++..+|.|++.+...++.+. .++||++...+.+..|||||||++++..- ++.
T Consensus 14 ~~W~v~~~~~s~~P~i~~g~LRLT~~~~nqata~~~~~~FPs~~n~v~veFd~yayg~~g~GADGia~vLsDasv~p~~G 93 (255)
T cd06900 14 NDWVVISSGGSFTPQVVNNRLRLTDASGNQATAVTLQRLFPSAGNYVEVEFDYYAYGSGGNGADGVALVLSDASVTPQAG 93 (255)
T ss_pred CceEEEecCCccCCccccCeEEeccCccCcceeEEEeeeeccCCCeEEEEEEEEEecCCCCCCceEEEEEeCCCcCCcCC
Confidence 368776543 256899999999999999999999999864 89999999988767899999999996433 123
Q ss_pred --CCcCCCCCC------ccE--EEEEEEccCCCCC---C--C----CCeEEEEecCCc--ccccCCCCCCcccccceeec
Q psy11306 122 --GEVFGSSDR------WKG--LGLFFDSFDNDNN---H--N----NPYIMAVVNDGN--MAFDHQNDGASQSLAGCLRD 180 (413)
Q Consensus 122 --G~vfG~~~~------f~G--laI~FDTf~N~~~---~--~----~P~I~i~~NDGt--~~yd~~~DG~~~~L~~C~~~ 180 (413)
|..+|+... |.| |||=||-|.|-.. + . -|.--.+-.-|. ..|.... |. ..+.. ..+
T Consensus 94 ~fGGsLGYa~~~~~~~GfaGGwLGiGlDEyGNFsn~~eg~~~~~g~r~~sV~vRGsg~g~~gY~yl~-gt-~~~~~-~id 170 (255)
T cd06900 94 AFGGSLGYAQRNDGVPGFAGGWLGIGLDEYGNFSNPNEGRNGFSGRRPQSVTVRGSGSGYTGYKYIT-GT-GVLPP-GID 170 (255)
T ss_pred CcCcccccccccCCCCccccceEEEEEeccccccCCCCCccCCcccccceEEEECCCCCCcCceEec-cc-CCCCc-ccc
Confidence 334444432 222 7889998887311 1 1 122223333333 3444321 10 01110 112
Q ss_pred cccC--C-cceEEEEEEECC-----EEEEEEEeCccCCCCceeEEE---Eec----ccccCCCcEEEEEeeCCCCCccEE
Q psy11306 181 FRNK--P-YPTRARIQYYMN-----TLTVWFHNGMTNNEQDIEVCL---RVE----NIYLPKEGYFGVSAATGGLADDHD 245 (413)
Q Consensus 181 ~rn~--~-~p~~~rI~Y~~~-----~LsV~v~~~~t~~~~~~~~C~---~v~----~v~LP~~~yfGfSAaTG~lad~HD 245 (413)
.++. + -+++-||+-+.. .|+|.-+.+ +...|...+ .+- .-.+|+..+++||++||.....|+
T Consensus 171 ~~~~~~~~~~hrY~i~Ids~~~~~~~vsV~R~~~---~gg~~~~~I~~~d~~~~~~q~avP~~f~lS~TgSTGgstN~HE 247 (255)
T cd06900 171 NNSTSTPAPGHRYRITIDSTNGDNAWLSVERDIG---NGGAYFVVILTFDALAEQNQDAIPENFYLSFTGSTGGSTNTHE 247 (255)
T ss_pred cCCCCCCCCceEEEEEEecCCCCceEEEEEEEcc---CCceeEEeecceeeccccCCCCCCccEEEEEEecCCCccccee
Confidence 2221 1 135555555543 367765532 222243332 222 245789999999999999999999
Q ss_pred EEEEEE
Q psy11306 246 ILHFLT 251 (413)
Q Consensus 246 Ils~~~ 251 (413)
|-.|++
T Consensus 248 Idnf~V 253 (255)
T cd06900 248 IDNFQV 253 (255)
T ss_pred ecceEe
Confidence 998876
No 12
>KOG3838|consensus
Probab=94.97 E-value=0.12 Score=53.58 Aligned_cols=113 Identities=17% Similarity=0.159 Sum_probs=68.8
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCCCC-chhhhhccchHHHHHHHhh
Q psy11306 265 VNQEDQKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAHPN-EEEWYESENQRELRQIFQG 343 (413)
Q Consensus 265 ~~~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~~~~ 343 (413)
|..-++.-.+|.+.||++++.... ++-|+.-+-++. .|..=|.+.--++.+.+.
T Consensus 263 pqP~~E~qe~ek~kyqeEfe~~q~------------------------elek~k~efkk~hpd~~~e~ee~~~~yEs~~- 317 (497)
T KOG3838|consen 263 PQPFLEMQELEKAKYQEEFEWAQL------------------------ELEKRKDEFKKSHPDAQGEGEELFDLYESLG- 317 (497)
T ss_pred CCchhhhhHHHHHHHHHHHHHHHH------------------------HHhhhHhhhccCCchhhcchhhhhhhhhccc-
Confidence 444445555588889999988776 444432222222 333333222233445555
Q ss_pred hhHHHHHHHH---HHHHHHHHHHHHHhhhhccccCC-CccCCCCCCCCCCccchHHHHHhhHH
Q psy11306 344 QSQLAEWTKA---IATGLDALQQKQDRILAVVSQGG-GIPHQVVPGQPMPMINNDALLASQNS 402 (413)
Q Consensus 344 ~~~~~~~~~~---l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~q~~ 402 (413)
|..+.|.+++ ++.+|+++.+|++++++.-.... +.+.+.+|++|........+-..-+.
T Consensus 318 ~Relrqi~egQn~i~~~l~ql~rql~~il~~Q~~~~~a~~p~asg~~p~~~~~gG~~sqse~d 380 (497)
T KOG3838|consen 318 HRELRQILEGQNAIHKQLAQLERQLDKILGPQARPVGALAPDASGQIPSTPGRGGHLSQSEND 380 (497)
T ss_pred hHHHHHHHhhhhHHHHHHHHHHHHHHHHhCccccCCcccCCcccCCCCCCCCCCCccchhhhh
Confidence 8888888888 99999999999999998743322 34455566666666555555444333
No 13
>cd06902 lectin_ERGIC-53_ERGL ERGIC-53 and ERGL type 1 transmembrane proteins, N-terminal lectin domain. ERGIC-53 and ERGL, N-terminal carbohydrate recognition domain. ERGIC-53 and ERGL are eukaryotic mannose-binding type 1 transmembrane proteins of the early secretory pathway that transport newly synthesized glycoproteins from the endoplasmic reticulum (ER) to the ER-Golgi intermediate compartment (ERGIC). ERGIC-53 and ERGL have an N-terminal lectin-like carbohydrate recognition domain (represented by this alignment model) as well as a C-terminal transmembrane domain. ERGIC-53 functions as a 'cargo receptor' to facilitate the export of glycoproteins with different characteristics from the ER, while the ERGIC-53-like protein (ERGL) which may act as a regulator of ERGIC-53. In mammals, ERGIC-53 forms a complex with MCFD2 (multi-coagulation factor deficiency 2) which then recruits blood coagulation factors V and VIII. Mutations in either MCFD2 or ERGIC-53 cause a mild form of inherite
Probab=76.14 E-value=1.1 Score=43.13 Aligned_cols=16 Identities=81% Similarity=1.244 Sum_probs=14.7
Q ss_pred ccccccccCCCCcccc
Q psy11306 295 ERFEYKYSFKPPYLAQ 310 (413)
Q Consensus 295 ~~~~~~~~~~~~~~~~ 310 (413)
+||||++|++||++..
T Consensus 1 ~~~~~~~s~~~P~l~~ 16 (225)
T cd06902 1 RRFEYKYSFKGPHLAQ 16 (225)
T ss_pred CcceeeecccCccccc
Confidence 6999999999999984
No 14
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=65.79 E-value=11 Score=33.58 Aligned_cols=66 Identities=21% Similarity=0.348 Sum_probs=43.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCCCCchhhhhccchHHHHHHHhhhhH
Q psy11306 267 QEDQKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAHPNEEEWYESENQRELRQIFQGQSQ 346 (413)
Q Consensus 267 ~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 346 (413)
.-+.+|.++++++-.+|+++.++++ .. .++.++ -.+++.+|=..=..
T Consensus 61 ~tKkhLsqRId~vd~klDe~~ei~~---------------------~i----------~~eV~~--v~~dv~~i~~dv~~ 107 (126)
T PF07889_consen 61 STKKHLSQRIDRVDDKLDEQKEISK---------------------QI----------KDEVTE--VREDVSQIGDDVDS 107 (126)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHH---------------------HH----------HHHHHH--HHhhHHHHHHHHHH
Confidence 3455677777777777777766322 01 234433 34556667666778
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy11306 347 LAEWTKAIATGLDALQQKQ 365 (413)
Q Consensus 347 ~~~~~~~l~~~~~~~~~~~ 365 (413)
|++++++|..||++++..|
T Consensus 108 v~~~V~~Le~ki~~ie~~Q 126 (126)
T PF07889_consen 108 VQQMVEGLEGKIDEIEEKQ 126 (126)
T ss_pred HHHHHHHHHHHHHHHhcCC
Confidence 8999999999999887654
No 15
>PF06439 DUF1080: Domain of Unknown Function (DUF1080); InterPro: IPR010496 This is a family of proteins of unknown function.; PDB: 3IMM_B 3NMB_A 3S5Q_A 3OSD_A 3HBK_A 3H3L_A 3U1X_A.
Probab=64.25 E-value=1.1e+02 Score=27.35 Aligned_cols=155 Identities=18% Similarity=0.245 Sum_probs=77.7
Q ss_pred CCCCCeEEeeee------EeeCCeEEeCC-CCCCceeEEEeCCcccccceEEEEEEEEEeCCCCCCCeEEEEEecCCCCC
Q psy11306 48 GSVPFWEYGGNC------IASLENVRVAP-SLRSQKGAIWTKQTTNFEWWNVDIVFRVTGRGRIGADGLAFWYTSEKGSY 120 (413)
Q Consensus 48 ~~i~~W~~~GdA------ii~~~~IRLTp-~~~sq~GsiWsk~pi~~~sFe~eftFrI~g~g~~ggDGlAFW~t~~~~~~ 120 (413)
.....|...+.. .+.++.|+ +. ......|.+.++. .+.+|++++.|++.. .+.-|+.|-.+ .+. .
T Consensus 9 ~~l~gW~~~~~~~~~~~~~v~dG~l~-~~~~~~~~~~~l~~~~--~~~df~l~~d~k~~~---~~~sGi~~r~~-~~~-~ 80 (185)
T PF06439_consen 9 KDLDGWKIYGGGWFEGGWSVKDGVLV-SNGSSGSGGGYLYTDK--KFSDFELEVDFKITP---GGNSGIFFRAQ-SPG-D 80 (185)
T ss_dssp SCGTTEEETTSSSETTTEEEETTEEE--GGGGESSS--EEESS--EBSSEEEEEEEEE-T---T-EEEEEEEES-SEC-C
T ss_pred CCHHHCeeCCCCccccCcEeeCCEEE-ecccCCCCcceEEECC--ccccEEEEEEEEECC---CCCeEEEEEec-ccc-C
Confidence 355678877432 46788777 22 2234567788874 899999999999942 22446666666 111 0
Q ss_pred CCCcCCCCCCccEEEEEEEccCCCCCCCCCeEEEEecCCcccccCCCCCCcccccceeeccccCCcceEEEEEEECCEEE
Q psy11306 121 DGEVFGSSDRWKGLGLFFDSFDNDNNHNNPYIMAVVNDGNMAFDHQNDGASQSLAGCLRDFRNKPYPTRARIQYYMNTLT 200 (413)
Q Consensus 121 ~G~vfG~~~~f~GlaI~FDTf~N~~~~~~P~I~i~~NDGt~~yd~~~DG~~~~L~~C~~~~rn~~~p~~~rI~Y~~~~Ls 200 (413)
..+...|..+-++-........ ...|.. |+... . .....-...+ ..+.=.+++|+-.+++++
T Consensus 81 ------~~~~~~gy~~~i~~~~~~~~~~-------~~~G~~-~~~~~--~-~~~~~~~~~~-~~~~W~~~~I~~~g~~i~ 142 (185)
T PF06439_consen 81 ------GQDWNNGYEFQIDNSGGGTGLP-------NSTGSL-YDEPP--W-QLEPSVNVAI-PPGEWNTVRIVVKGNRIT 142 (185)
T ss_dssp ------SSGGGTSEEEEEE-TTTCSTTT-------TSTTSB-TTTB---T-CB-SSS--S---TTSEEEEEEEEETTEEE
T ss_pred ------CCCcceEEEEEEECCCCccCCC-------CccceE-EEecc--c-cccccccccC-CCCceEEEEEEEECCEEE
Confidence 1234456655555432210000 001111 11000 0 0000000001 113346899999999999
Q ss_pred EEEEeCccCCCCceeEEEEecccccC----CCcEEEEEeeC
Q psy11306 201 VWFHNGMTNNEQDIEVCLRVENIYLP----KEGYFGVSAAT 237 (413)
Q Consensus 201 V~v~~~~t~~~~~~~~C~~v~~v~LP----~~~yfGfSAaT 237 (413)
|.++. ..-.+......+ ..++|||-+..
T Consensus 143 v~vnG---------~~v~~~~d~~~~~~~~~~G~Igl~~~~ 174 (185)
T PF06439_consen 143 VWVNG---------KPVADFTDPSFPYSNPTKGPIGLQTHG 174 (185)
T ss_dssp EEETT---------EEEEEEETTSHHHHHHSSBEEEEECSS
T ss_pred EEECC---------EEEEEEEcCCCCCCCCCceEEEEEeCC
Confidence 99864 222333333333 78999999544
No 16
>PF09712 PHA_synth_III_E: Poly(R)-hydroxyalkanoic acid synthase subunit (PHA_synth_III_E)
Probab=51.58 E-value=1.5e+02 Score=29.88 Aligned_cols=54 Identities=19% Similarity=0.150 Sum_probs=45.7
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhhhhccccCCCccCCCCCCCCCCccchHHHHHhhHHHHHHHHHHh
Q psy11306 342 QGQSQLAEWTKAIATGLDALQQKQDRILAVVSQGGGIPHQVVPGQPMPMINNDALLASQNSLLSTVQELR 411 (413)
Q Consensus 342 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~ 411 (413)
..+.+++.++-+|..+++++..+.-+.|+++ .+.|||+|-....+|=..++.+|
T Consensus 239 ~~~g~~~~a~m~~r~~~~~~~e~~L~~l~lP----------------Tr~evd~l~k~l~eLrre~r~Lk 292 (293)
T PF09712_consen 239 QAYGQLVNALMDLRKQQQEVVEEYLRSLNLP----------------TRSEVDELYKRLHELRREVRALK 292 (293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC----------------CHHHHHHHHHHHHHHHHHHHHhc
Confidence 4567889999999999999999999999887 55799999888888877777765
No 17
>KOG0249|consensus
Probab=49.34 E-value=69 Score=36.24 Aligned_cols=82 Identities=11% Similarity=0.317 Sum_probs=50.5
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCC-CCchhhhhccchHHHHHHHhh
Q psy11306 265 VNQEDQKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAH-PNEEEWYESENQRELRQIFQG 343 (413)
Q Consensus 265 ~~~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~ 343 (413)
-++...|+..+.+++-++|...++ +-+.-.+|..-- ...+++- ..--.|.-+.++.
T Consensus 161 ~eer~~kl~~~~qe~naeL~rarq----------------------reemneeh~~rlsdtvdErl-qlhlkermaAle~ 217 (916)
T KOG0249|consen 161 IEERTRKLEEQLEELNAELQRARQ----------------------REKMNEEHNKRLSDTVDERL-QLHLKERMAALED 217 (916)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----------------------HHHhhhhhccccccccHHHH-HHHHHHHHHHHHH
Confidence 345566777777777777766554 113333333210 0012222 1223578889999
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhh
Q psy11306 344 QSQLAEWTKAIATGLDALQQKQDRIL 369 (413)
Q Consensus 344 ~~~~~~~~~~l~~~~~~~~~~~~~~~ 369 (413)
-+++.+++..+.++|+++...-+|..
T Consensus 218 kn~L~~e~~s~kk~l~~~~~~k~rl~ 243 (916)
T KOG0249|consen 218 KNRLEQELESVKKQLEEMRHDKDKLR 243 (916)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 99999999999999998887776654
No 18
>PF10393 Matrilin_ccoil: Trimeric coiled-coil oligomerisation domain of matrilin; InterPro: IPR019466 This entry represents a short domain found the matrilin (cartilage matrix) proteins. It forms a coiled coil structure and contains a single cysteine residue at its start which is likely to form a di-sulphide bridge with a corresponding cysteine in an upstream EGF domain (IPR006209 from INTERPRO), thereby spanning the VWA domain of the protein (IPR002035 from INTERPRO).This domain is likely to be responsible for protein trimerisation []. ; PDB: 1AQ5_C.
Probab=43.70 E-value=40 Score=24.97 Aligned_cols=29 Identities=14% Similarity=0.280 Sum_probs=23.4
Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHH
Q psy11306 338 RQIFQGQSQLAEWTKAIATGLDALQQKQD 366 (413)
Q Consensus 338 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 366 (413)
+-+..=|.++..+|+.|.++|+++.++-+
T Consensus 12 Eslv~FQ~~v~~~lq~Lt~kL~~vs~RLe 40 (47)
T PF10393_consen 12 ESLVAFQNKVTSALQSLTQKLDAVSKRLE 40 (47)
T ss_dssp THHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555689999999999999999887654
No 19
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=41.36 E-value=2e+02 Score=32.80 Aligned_cols=22 Identities=14% Similarity=0.314 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhh
Q psy11306 347 LAEWTKAIATGLDALQQKQDRI 368 (413)
Q Consensus 347 ~~~~~~~l~~~~~~~~~~~~~~ 368 (413)
|-..|++|...|+++..+++++
T Consensus 644 ~~~~l~~l~~si~~lk~k~~~Q 665 (717)
T PF10168_consen 644 MKDQLQDLKASIEQLKKKLDYQ 665 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3444566666666666655543
No 20
>KOG4403|consensus
Probab=40.23 E-value=67 Score=34.31 Aligned_cols=75 Identities=19% Similarity=0.238 Sum_probs=43.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCCCCchhhh-hccchHHHHHHHhhh
Q psy11306 266 NQEDQKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAHPNEEEWY-ESENQRELRQIFQGQ 344 (413)
Q Consensus 266 ~~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~ 344 (413)
-+++++..+.++++|++|++.++. +..|. .-|.| .|+.. |--.-+|||+=.+-|
T Consensus 251 le~Lq~aEqsl~dlQk~Lekar~e-~rnva------------------vek~~------lerkl~ea~rl~elreg~e~e 305 (575)
T KOG4403|consen 251 LEGLQRAEQSLEDLQKRLEKAREE-QRNVA------------------VEKLD------LERKLDEAPRLSELREGVENE 305 (575)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHh-hhchh------------------hhhhh------HHHHHhhhhhhhhhhcchhHH
Confidence 367778888999999999987761 00000 00111 23322 233346677777666
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH
Q psy11306 345 SQLAEWTKAIATGLDALQQKQD 366 (413)
Q Consensus 345 ~~~~~~~~~l~~~~~~~~~~~~ 366 (413)
.-. ++|.+|...|..++++-+
T Consensus 306 ~~r-kelE~lR~~L~kAEkele 326 (575)
T KOG4403|consen 306 TSR-KELEQLRVALEKAEKELE 326 (575)
T ss_pred HHH-HHHHHHHHHHHHHHHHHH
Confidence 554 677777777766666554
No 21
>PF12248 Methyltransf_FA: Farnesoic acid 0-methyl transferase; InterPro: IPR022041 This domain, found in farnesoic acid O-methyl transferase, is approximately 110 amino acids in length. Farnesoic acid O-methyl transferase (FAMeT) is the enzyme that catalyses the formation of methyl farnesoate (MF) from farnesoic acid (FA) in the biosynthetic pathway of juvenile hormone (JH) [].
Probab=38.01 E-value=1.2e+02 Score=25.28 Aligned_cols=45 Identities=16% Similarity=-0.012 Sum_probs=29.2
Q ss_pred CcceEEEEEEECCEEEEEEEeCccCCCCceeEEEEecccccCCCcEEEEEee
Q psy11306 185 PYPTRARIQYYMNTLTVWFHNGMTNNEQDIEVCLRVENIYLPKEGYFGVSAA 236 (413)
Q Consensus 185 ~~p~~~rI~Y~~~~LsV~v~~~~t~~~~~~~~C~~v~~v~LP~~~yfGfSAa 236 (413)
..+...+|++.++.++|..+.. . ...++.....-++-.|||||+-
T Consensus 52 ~e~~~fwI~~~~G~I~vg~~g~-----~--~pfl~~~Dp~~~~v~yvGft~w 96 (102)
T PF12248_consen 52 SEFRMFWISWRDGTIRVGRGGE-----D--EPFLEWTDPEPIPVNYVGFTGW 96 (102)
T ss_pred CccEEEEEEECCCEEEEEECCC-----c--cEEEEEECCCCCcccEEEEecC
Confidence 4578899998888999987531 1 2333333333455689999653
No 22
>KOG3192|consensus
Probab=37.69 E-value=1.5e+02 Score=27.55 Aligned_cols=34 Identities=12% Similarity=0.122 Sum_probs=29.3
Q ss_pred HHHHHhhhhHHHH-----HHHHHHHHHHHHHHHHHhhhh
Q psy11306 337 LRQIFQGQSQLAE-----WTKAIATGLDALQQKQDRILA 370 (413)
Q Consensus 337 l~~~~~~~~~~~~-----~~~~l~~~~~~~~~~~~~~~~ 370 (413)
|-+|++.|..|-+ .|++|.+++..-+.||+.+++
T Consensus 98 Lmevle~~E~IS~~~De~~l~~lk~q~q~ri~q~~~qlg 136 (168)
T KOG3192|consen 98 LMEVLEYHEAISEMDDEEDLKQLKSQNQERIAQCKQQLG 136 (168)
T ss_pred HHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHH
Confidence 8888998888875 599999999999999998884
No 23
>PF07081 DUF1349: Protein of unknown function (DUF1349); InterPro: IPR009784 This family consists of several hypothetical bacterial proteins but contains one sequence (P40893 from SWISSPROT) from Saccharomyces cerevisiae. Members of this family are typically around 200 residues in length. The function of this family is unknown.; PDB: 3MEP_B 3O12_A.
Probab=35.46 E-value=3.6e+02 Score=24.90 Aligned_cols=115 Identities=15% Similarity=0.233 Sum_probs=63.3
Q ss_pred cceEEEEEEEEEeCCCCCCCeEEEEEecCCCCCCCCcCCCCCCccEEEEEEEccCCCCCCCCCeEEEEecCCcccccCCC
Q psy11306 88 EWWNVDIVFRVTGRGRIGADGLAFWYTSEKGSYDGEVFGSSDRWKGLGLFFDSFDNDNNHNNPYIMAVVNDGNMAFDHQN 167 (413)
Q Consensus 88 ~sFe~eftFrI~g~g~~ggDGlAFW~t~~~~~~~G~vfG~~~~f~GlaI~FDTf~N~~~~~~P~I~i~~NDGt~~yd~~~ 167 (413)
.+|+++.+|.......--..||-++..+ +.|-=++||+.. + ..|.++.++.+|...+.
T Consensus 48 ~dF~~~~~v~~~~~~~YDQaGL~v~~~~-------------~~WiK~giE~~~---~---g~~~l~sV~t~~~SDws--- 105 (183)
T PF07081_consen 48 GDFTAEVKVSGDFKEQYDQAGLMVYQDE-------------DNWIKAGIEYSN---D---GTPRLSSVVTNGYSDWS--- 105 (183)
T ss_dssp SSEEEEEEEEE---STT-EEEEEEEEET-------------TEEEEEEEEE-E---T---TCEEEEEEEEESSEEEE---
T ss_pred CCeEEEEEEEeCCccceeeEEEEEEECC-------------cccEEEEEEEec---C---CCceEEEEeccCccccc---
Confidence 4587777777743332224466666654 267778899842 1 12677777766544322
Q ss_pred CCCcccccceeeccccCCcceEEEEEEECCEEEEEEEeCccCCCCceeEEEEecccccCCCcEEEEEeeCCC
Q psy11306 168 DGASQSLAGCLRDFRNKPYPTRARIQYYMNTLTVWFHNGMTNNEQDIEVCLRVENIYLPKEGYFGVSAATGG 239 (413)
Q Consensus 168 DG~~~~L~~C~~~~rn~~~p~~~rI~Y~~~~LsV~v~~~~t~~~~~~~~C~~v~~v~LP~~~yfGfSAaTG~ 239 (413)
...+-.....+.+||+-.++.|.|.+.. .+..|..--... ...+....+|+-|....
T Consensus 106 ----------~~~~~~~~~~~~lrv~R~g~~~~~~ys~----DG~~w~~~R~~~-~~~~~~~~VG~~A~sP~ 162 (183)
T PF07081_consen 106 ----------LSPLPSDGQSVWLRVERRGDDLWIYYSA----DGKTWTLLRIFH-FPEDWEVQVGVYACSPQ 162 (183)
T ss_dssp ----------EEE--SBTTSEEEEEEEETTEEEEEEES----SSS---EEEEEE---S-S-EEEEEEEE-SS
T ss_pred ----------ccccCCCCCEEEEEEEEeCCEEEEEEEc----CCCEEEEEEEEE-CCCCCcEEEEEEEeCCC
Confidence 1111122457889999999999998864 235565432222 22446689999999865
No 24
>KOG1029|consensus
Probab=34.32 E-value=1.1e+02 Score=35.08 Aligned_cols=10 Identities=50% Similarity=0.780 Sum_probs=4.0
Q ss_pred HHHHHHHhhh
Q psy11306 279 YEKKLEEQKQ 288 (413)
Q Consensus 279 ~~~~~~~~~~ 288 (413)
++++|++|++
T Consensus 373 lekqLerQRe 382 (1118)
T KOG1029|consen 373 LEKQLERQRE 382 (1118)
T ss_pred HHHHHHHHHH
Confidence 3334444443
No 25
>cd09234 V_HD-PTP_like Protein-interacting V-domain of mammalian His-Domain type N23 protein tyrosine phosphatase and related domains. This family contains the V-shaped (V) domain of mammalian His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23) and related domains. It belongs to the V_Alix_like superfamily which includes the V domains of Bro1 and Rim20 (also known as PalA) from Saccharomyces cerevisiae, mammalian Alix (apoptosis-linked gene-2 interacting protein X/ also known as apoptosis-linked gene-2 interacting protein 1, AIP1), and related domains. HD_PTP interacts with the ESCRT (Endosomal Sorting Complexes Required for Transport) system, and participates in cell migration and endosomal trafficking. The related Alix V-domain (belonging to a different family in this superfamily) contains a binding site, partially conserved in the superfamily, for the retroviral late assembly (L) domain YPXnL motif. The Alix V-domain is also a dimerization domain. In addi
Probab=34.12 E-value=3.1e+02 Score=27.88 Aligned_cols=37 Identities=11% Similarity=0.127 Sum_probs=28.1
Q ss_pred chhhhhcc------chHHHHHHHhhhhHHHHHHHHHHHHHHHH
Q psy11306 325 EEEWYESE------NQRELRQIFQGQSQLAEWTKAIATGLDAL 361 (413)
Q Consensus 325 ~~~~~~~~------~~~~l~~~~~~~~~~~~~~~~l~~~~~~~ 361 (413)
.|++|+.+ -+..|++.+..|..|+.+|+.++.+...+
T Consensus 235 ~e~lf~~eL~k~~~~~~~l~~~~~~Q~~ll~~i~~an~~f~~~ 277 (337)
T cd09234 235 MEDLFKEELKKHDQLVNLIEQNLAAQENILKALTEANAKYAPV 277 (337)
T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666532 34678899999999999999999988533
No 26
>PRK11519 tyrosine kinase; Provisional
Probab=33.43 E-value=2.3e+02 Score=31.96 Aligned_cols=26 Identities=23% Similarity=0.194 Sum_probs=20.6
Q ss_pred CCChHHHHHHHHHHHHHHHHHHHhhh
Q psy11306 263 EQVNQEDQKVAQEYAQYEKKLEEQKQ 288 (413)
Q Consensus 263 ~~~~~e~~~l~~~~~~~~~~~~~~~~ 288 (413)
.+.+++..++.+++++.+++|+.=++
T Consensus 270 ~fL~~ql~~l~~~L~~aE~~l~~fr~ 295 (719)
T PRK11519 270 AFLAQQLPEVRSRLDVAENKLNAFRQ 295 (719)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 37778888899999888888877555
No 27
>KOG0994|consensus
Probab=33.33 E-value=1.7e+02 Score=35.10 Aligned_cols=79 Identities=23% Similarity=0.287 Sum_probs=48.0
Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhccccCCCccCCCCCCCCCCccchHHHHHhhHHHHHHHHHHhcC
Q psy11306 335 RELRQIFQGQSQLAEWTKAIATGLDALQQKQDRILAVVSQGGGIPHQVVPGQPMPMINNDALLASQNSLLSTVQELRYH 413 (413)
Q Consensus 335 ~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~~~ 413 (413)
.|+|-||..+|--.+.|.+|..++.++.+|-......+++....-....-..+....+++.|=+.=+.|+++++|+|++
T Consensus 1211 ~~ir~il~~~svs~~~i~~l~~~~~~lr~~l~~~~e~L~~~E~~Lsdi~~~~~~a~~~LesLq~~~~~l~~~~keL~e~ 1289 (1758)
T KOG0994|consen 1211 EEIRAILSAPSVSAEDIAQLASATESLRRQLQALTEDLPQEEETLSDITNSLPLAGKDLESLQREFNGLLTTYKELREQ 1289 (1758)
T ss_pred HHHHHHhcCCCccHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhhccchhhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 5788888888888888888888887777665554444433210000000011122246777777777778888888753
No 28
>PRK10894 lipopolysaccharide transport periplasmic protein LptA; Provisional
Probab=32.59 E-value=75 Score=29.35 Aligned_cols=57 Identities=16% Similarity=0.164 Sum_probs=31.3
Q ss_pred hHHHHHHHHHHHHHHhhcCCCCceeeeeeeeccCCcccCCC-----CCCCCeEEeeeeEeeCCeEEeCCC
Q psy11306 7 WLNLSLSLVLCYLVVLSSSQNPVERFEYKYSFKPPYLAQKD-----GSVPFWEYGGNCIASLENVRVAPS 71 (413)
Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~r~e~k~Sf~~P~l~~~~-----~~i~~W~~~GdAii~~~~IRLTp~ 71 (413)
||+|+|+++++++++.+++.+..+ ..|.....+ ..-.-..+.|++.+..+..+|+.+
T Consensus 5 ~~~~~~~~~ll~~~~~a~A~~~d~--------~~pI~I~AD~~~~~~~~~~~~~tGnV~i~QG~~~L~AD 66 (180)
T PRK10894 5 SLNLLLASSLLAASIPAFALTGDT--------DQPIHIDSDQQSLDMQGNVVTFTGNVVVTQGTIKINAD 66 (180)
T ss_pred HHHHHHHHHHHHHHHHHhhccccc--------CCCEEEEeCceEeeccCCEEEEEeeEEEEECceEEEeE
Confidence 677777777776555444422111 113222111 113346788998888777777755
No 29
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=32.40 E-value=1.5e+02 Score=29.19 Aligned_cols=58 Identities=17% Similarity=0.241 Sum_probs=35.4
Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhccccCCCccCCCCCCCCCCccchHHHHHhhHHHHHHHHHH
Q psy11306 336 ELRQIFQGQSQLAEWTKAIATGLDALQQKQDRILAVVSQGGGIPHQVVPGQPMPMINNDALLASQNSLLSTVQEL 410 (413)
Q Consensus 336 ~l~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~ 410 (413)
.|++++..+++.+ -+|.++|+++.++-.+..|-+- ...++++.+...|+++.+.|.+.
T Consensus 44 ~le~~~~~~~~~~---~~l~~ql~~lq~ev~~LrG~~E--------------~~~~~l~~~~~rq~~~y~dld~r 101 (263)
T PRK10803 44 QLERISNAHSQLL---TQLQQQLSDNQSDIDSLRGQIQ--------------ENQYQLNQVVERQKQIYLQIDSL 101 (263)
T ss_pred HHHHHHHhhhHHH---HHHHHHHHHHHHHHHHHhhHHH--------------HHHHHHHHHHHHHHHHHHHHHHH
Confidence 4777887777543 2344455555444444443330 13368899999999988888764
No 30
>PRK13676 hypothetical protein; Provisional
Probab=32.29 E-value=1.7e+02 Score=24.92 Aligned_cols=72 Identities=11% Similarity=0.227 Sum_probs=42.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCC-CCchhhhhccch-HHHHHHHhh
Q psy11306 266 NQEDQKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAH-PNEEEWYESENQ-RELRQIFQG 343 (413)
Q Consensus 266 ~~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~l~~~~~~ 343 (413)
.++..++.++|...|.++..... .-++++++. .+..+++..... ..+..-++.
T Consensus 35 d~~a~~li~~F~~~q~~~~~~q~-------------------------~g~~~~~e~~~~l~~l~~~i~~n~~i~~y~~A 89 (114)
T PRK13676 35 DEEAKKLFDEFRALQLEIQQKQM-------------------------TGQEITEEEQQKAQELGQKIQQNELLSKLMEA 89 (114)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHH-------------------------cCCCCCHHHHHHHHHHHHHHhcCHHHHHHHHH
Confidence 45777888888888888865443 223444322 223444443322 456666666
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q psy11306 344 QSQLAEWTKAIATGLDALQ 362 (413)
Q Consensus 344 ~~~~~~~~~~l~~~~~~~~ 362 (413)
+.++-..|+.+++-+.+.+
T Consensus 90 e~~l~~ll~~v~~iI~~~i 108 (114)
T PRK13676 90 EQRLSVYINDINKIIFKPL 108 (114)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 7776667776666666554
No 31
>cd09236 V_AnPalA_UmRIM20_like Protein-interacting V-domains of Aspergillus nidulans PalA/RIM20, Ustilago maydis RIM20, and related proteins. This family belongs to the V_Alix_like superfamily which includes the V-shaped (V) domains of Bro1 and Rim20 from Saccharomyces cerevisiae, mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), and related domains. Aspergillus nidulas PalA/RIM20 and Ustilago maydis RIM20, like Saccharomyces cerevisiae Rim20, participate in the response to the external pH via the Pal/Rim101 pathway; however, Saccharomyces cerevisiae Rim20 does not belong to this family. This pathway is a signaling cascade resulting in the activation of the transcription factor PacC/Rim101. The mammalian Alix V-domain (belonging to a different family) contains a binding site, partially conserved in the superfamily, for the retroviral late assembly (L) domain YPXnL motif. Aspergillus nidulas Pa
Probab=30.05 E-value=1.4e+02 Score=30.66 Aligned_cols=36 Identities=8% Similarity=0.129 Sum_probs=26.8
Q ss_pred chhhhhcc------chHHHHHHHhhhhHHHHHHHHHHHHHHH
Q psy11306 325 EEEWYESE------NQRELRQIFQGQSQLAEWTKAIATGLDA 360 (413)
Q Consensus 325 ~~~~~~~~------~~~~l~~~~~~~~~~~~~~~~l~~~~~~ 360 (413)
.|++|+.+ .+..|++....|..|+++|+..++++.+
T Consensus 252 fe~lf~~eL~kf~~~~~~l~~~~~~Q~~ll~~i~~~n~~f~~ 293 (353)
T cd09236 252 FEDLFDKRLAKYDKDLDAVSEEAQEQEEILQQIEVANKAFLQ 293 (353)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46666532 3466788888999999999998887743
No 32
>smart00282 LamG Laminin G domain.
Probab=30.02 E-value=2.4e+02 Score=23.72 Aligned_cols=22 Identities=9% Similarity=-0.028 Sum_probs=17.9
Q ss_pred CCcceEEEEEEECCEEEEEEEe
Q psy11306 184 KPYPTRARIQYYMNTLTVWFHN 205 (413)
Q Consensus 184 ~~~p~~~rI~Y~~~~LsV~v~~ 205 (413)
.+.-.++.|.+..+.+++.+|.
T Consensus 61 dg~WH~v~i~~~~~~~~l~VD~ 82 (135)
T smart00282 61 DGQWHRVAVERNGRRVTLSVDG 82 (135)
T ss_pred CCCEEEEEEEEeCCEEEEEECC
Confidence 3456789999999999888885
No 33
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=28.60 E-value=2e+02 Score=22.17 Aligned_cols=23 Identities=22% Similarity=0.424 Sum_probs=19.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhh
Q psy11306 266 NQEDQKVAQEYAQYEKKLEEQKQ 288 (413)
Q Consensus 266 ~~e~~~l~~~~~~~~~~~~~~~~ 288 (413)
..|..||.++++.++++++.-..
T Consensus 3 ~~E~~rL~Kel~kl~~~i~~~~~ 25 (66)
T PF10458_consen 3 EAEIERLEKELEKLEKEIERLEK 25 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 57899999999999998887555
No 34
>PRK09841 cryptic autophosphorylating protein tyrosine kinase Etk; Provisional
Probab=28.13 E-value=3.5e+02 Score=30.53 Aligned_cols=26 Identities=31% Similarity=0.325 Sum_probs=20.6
Q ss_pred CCChHHHHHHHHHHHHHHHHHHHhhh
Q psy11306 263 EQVNQEDQKVAQEYAQYEKKLEEQKQ 288 (413)
Q Consensus 263 ~~~~~e~~~l~~~~~~~~~~~~~~~~ 288 (413)
++..+++.++.++++..+++|+.=|+
T Consensus 270 ~fL~~qL~~l~~~L~~aE~~l~~fr~ 295 (726)
T PRK09841 270 EFLQRQLPEVRSELDQAEEKLNVYRQ 295 (726)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36778888888888888888877555
No 35
>KOG0999|consensus
Probab=27.44 E-value=99 Score=34.11 Aligned_cols=92 Identities=17% Similarity=0.225 Sum_probs=54.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCCCCch----hhh-hccch--HHHH
Q psy11306 266 NQEDQKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAHPNEE----EWY-ESENQ--RELR 338 (413)
Q Consensus 266 ~~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~-~~~~~--~~l~ 338 (413)
.+|+..|.+++++++.+++..+... -++.-+-..|+..|......++ .+. ||..- -=|+
T Consensus 42 LeeK~~Lkqq~eEleaeyd~~R~El--------------dqtkeal~q~~s~hkk~~~~g~e~EesLLqESaakE~~yl~ 107 (772)
T KOG0999|consen 42 LEEKEDLKQQLEELEAEYDLARTEL--------------DQTKEALGQYRSQHKKVARDGEEREESLLQESAAKEEYYLQ 107 (772)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHhHHHHHH
Confidence 3678888888888888666543300 0111122256666665544332 221 11111 1278
Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhccccC
Q psy11306 339 QIFQGQSQLAEWTKAIATGLDALQQKQDRILAVVSQG 375 (413)
Q Consensus 339 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 375 (413)
+||+-|+ +|+++...|..+..+-+|.....|..
T Consensus 108 kI~elen----eLKq~r~el~~~q~E~erl~~~~sd~ 140 (772)
T KOG0999|consen 108 KILELEN----ELKQLRQELTNVQEENERLEKVHSDL 140 (772)
T ss_pred HHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 8888665 67888888888888888888777544
No 36
>PTZ00009 heat shock 70 kDa protein; Provisional
Probab=26.20 E-value=91 Score=34.71 Aligned_cols=39 Identities=13% Similarity=0.341 Sum_probs=25.2
Q ss_pred CCChHHHHHHHHHHHHHHH--HHHHhhhccCCccccccccc
Q psy11306 263 EQVNQEDQKVAQEYAQYEK--KLEEQKQHSQNPVERFEYKY 301 (413)
Q Consensus 263 ~~~~~e~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 301 (413)
.++.+|.+++.++.+++.+ ++.+++....|.+|.+-|..
T Consensus 511 ~ls~~~i~~~~~~~~~~~~~d~~~~~~~eakN~lEs~Iy~~ 551 (653)
T PTZ00009 511 RLSKADIDRMVNEAEKYKAEDEANRERVEAKNGLENYCYSM 551 (653)
T ss_pred cccHHHHHHHHHHHHHHHHHhHHHHHHHHHHhhhHHHHHHH
Confidence 4778888888888876654 34444444556677665543
No 37
>PF11153 DUF2931: Protein of unknown function (DUF2931); InterPro: IPR021326 Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function.
Probab=25.44 E-value=62 Score=30.67 Aligned_cols=35 Identities=20% Similarity=0.261 Sum_probs=19.2
Q ss_pred hHHHHHHHHHHHHHHhhcCCCCceeeeeeeeccCC
Q psy11306 7 WLNLSLSLVLCYLVVLSSSQNPVERFEYKYSFKPP 41 (413)
Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~r~e~k~Sf~~P 41 (413)
++.+++++|++.+|.-........++++.++|..|
T Consensus 3 ~i~~l~l~lll~~C~~~~~~~~~~~~~W~~~~~~P 37 (216)
T PF11153_consen 3 KILLLLLLLLLTGCSTNPNEPLQPYFEWRFGVAAP 37 (216)
T ss_pred HHHHHHHHHHHHhhcCCCccCCCCCCccEEEEecC
Confidence 44444444444444333323233788888999887
No 38
>TIGR02302 aProt_lowcomp conserved hypothetical protein TIGR02302. Members of this family are long (~850 residue) bacterial proteins from the alpha Proteobacteria. Each has 2-3 predicted transmembrane helices near the N-terminus and a long C-terminal region that includes stretches of Gln/Gly-rich low complexity sequence, predicted by TMHMM to be outside the membrane. In Bradyrhizobium japonicum, two tandem reading frames are together homologous the single members found in other species; the cutoffs scores are set low enough that the longer scores above the trusted cutoff and the shorter above the noise cutoff for this model.
Probab=25.31 E-value=1.9e+02 Score=33.57 Aligned_cols=112 Identities=17% Similarity=0.173 Sum_probs=0.0
Q ss_pred ChHHHHHHHHHH----HHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCCCCchhhhhccchHHHHHH
Q psy11306 265 VNQEDQKVAQEY----AQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAHPNEEEWYESENQRELRQI 340 (413)
Q Consensus 265 ~~~e~~~l~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 340 (413)
+.+|.+||.+++ ++|.+.|.++.+ ++.-... +....+...+=..+-++=|.+|
T Consensus 521 sdeEI~~Lm~eLR~Am~~ym~~LAeq~~----------------------~~~~~~~-~~~~~~~~~l~~~dLq~Mmd~i 577 (851)
T TIGR02302 521 SDEEIKQLTDKLRAAMQTYMRQLAQQLR----------------------NNPQQLA-RPLDPNTKVLRQQDLQNMMDQI 577 (851)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHhh----------------------hCccccc-ccCCccccccCHHHHHHHHHHH
Q ss_pred -----HhhhhHHHHHHHHHHHHHHHHHHHHHhhhhccccCCCccCCCCCCCCCCccchHHHHHhhHHHHHH
Q psy11306 341 -----FQGQSQLAEWTKAIATGLDALQQKQDRILAVVSQGGGIPHQVVPGQPMPMINNDALLASQNSLLST 406 (413)
Q Consensus 341 -----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~ 406 (413)
-++--..-+.|.+|++=|+.+.-.|-.+ ++...++++...+.+.++..+|..|+.|+..
T Consensus 578 eela~~G~~~~A~qlL~qlq~mmenlq~~q~~~-------g~~~~~~~~~~~q~m~~L~e~lr~QQ~L~D~ 641 (851)
T TIGR02302 578 ENLARSGDRDQAKQLLSQLQQMMNNLQMGQPGQ-------GQQMGDQSGDMEQQMNKLGELMRKQQQLRDE 641 (851)
T ss_pred HHHHHcCCHHHHHHHHHHHHHHHHHHhccCcCC-------CcccccchHHHHHHHHHHHHHHHHHHHHHHH
No 39
>PRK10884 SH3 domain-containing protein; Provisional
Probab=25.30 E-value=6.1e+02 Score=24.27 Aligned_cols=24 Identities=8% Similarity=0.159 Sum_probs=17.6
Q ss_pred ChHHHHHHHHHHHHHHHHHHHhhh
Q psy11306 265 VNQEDQKVAQEYAQYEKKLEEQKQ 288 (413)
Q Consensus 265 ~~~e~~~l~~~~~~~~~~~~~~~~ 288 (413)
..+...++++|+++++++|+...+
T Consensus 91 ~~~rlp~le~el~~l~~~l~~~~~ 114 (206)
T PRK10884 91 LRTRVPDLENQVKTLTDKLNNIDN 114 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Confidence 345667888888888888877555
No 40
>PF03268 DUF267: Caenorhabditis protein of unknown function, DUF267; InterPro: IPR004950 This family of proteins, from Caenorhabditis species, have not been characterised though a number are annotated as 'serpentine receptor, class r' proteins.
Probab=25.26 E-value=1.3e+02 Score=31.35 Aligned_cols=52 Identities=17% Similarity=0.285 Sum_probs=38.5
Q ss_pred HHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCCCCchhhhhccchHHHHHHHhhhhHHHH
Q psy11306 270 QKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAHPNEEEWYESENQRELRQIFQGQSQLAE 349 (413)
Q Consensus 270 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 349 (413)
--+.+|++.+.+|||+.++ +=+=.+|+. |+.-=.||+.|++
T Consensus 179 ~al~REi~yFN~ELe~A~k------------------------eK~L~n~~v---------------L~~F~~RQ~eL~~ 219 (353)
T PF03268_consen 179 SALNREIEYFNEELEKASK------------------------EKKLKNPQV---------------LEKFSHRQIELFE 219 (353)
T ss_pred HHHHHHHHHHHHHHHHHHH------------------------hccccChHH---------------HHHHhHHHHHHHH
Confidence 3678999999999999777 111112321 5555678999999
Q ss_pred HHHHHHHHHHH
Q psy11306 350 WTKAIATGLDA 360 (413)
Q Consensus 350 ~~~~l~~~~~~ 360 (413)
..+-+++.|..
T Consensus 220 lv~~~ne~L~~ 230 (353)
T PF03268_consen 220 LVNFANESLSS 230 (353)
T ss_pred HHHHHHHhhhh
Confidence 99999999874
No 41
>PLN03184 chloroplast Hsp70; Provisional
Probab=25.02 E-value=71 Score=35.78 Aligned_cols=39 Identities=18% Similarity=0.302 Sum_probs=24.7
Q ss_pred CCChHHHHHHHHHHHHHHH--HHHHhhhccCCccccccccc
Q psy11306 263 EQVNQEDQKVAQEYAQYEK--KLEEQKQHSQNPVERFEYKY 301 (413)
Q Consensus 263 ~~~~~e~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 301 (413)
.++.+|.+++.++.+++.+ +..+.++...|.+|.+-|..
T Consensus 538 ~ls~eei~~~~~~~~~~~~~D~~~~~~~eakN~lE~~iy~~ 578 (673)
T PLN03184 538 TLPKDEVERMVQEAEKFAKEDKEKRDAVDTKNQADSVVYQT 578 (673)
T ss_pred cccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhHHHHHHHH
Confidence 4788888888888876654 33334333456666665543
No 42
>cd08915 V_Alix_like Protein-interacting V-domain of mammalian Alix and related domains. This superfamily contains the V-shaped (V) domain of mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), Bro1 and Rim20 (also known as PalA) from Saccharomyces cerevisiae, and related domains. Alix, HD-PTP, Bro1, and Rim20 all interact with the ESCRT (Endosomal Sorting Complexes Required for Transport) system. Alix, also known as apoptosis-linked gene-2 interacting protein 1 (AIP1), participates in membrane remodeling processes during the budding of enveloped viruses, vesicle budding inside late endosomal multivesicular bodies (MVBs), and the abscission reactions of mammalian cell division. It also functions in apoptosis. HD-PTP functions in cell migration and endosomal trafficking, Bro1 in endosomal trafficking, and Rim20 in the response to the external pH via the Rim101 pathway. The Alix V-domain contains
Probab=24.32 E-value=2.3e+02 Score=28.68 Aligned_cols=38 Identities=13% Similarity=0.164 Sum_probs=27.9
Q ss_pred chhhhhc------cchHHHHHHHhhhhHHHHHHHHHHHHHHHHH
Q psy11306 325 EEEWYES------ENQRELRQIFQGQSQLAEWTKAIATGLDALQ 362 (413)
Q Consensus 325 ~~~~~~~------~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~ 362 (413)
.+.+|+. ..+..|++.+..|..|+.+|+..+.++.++.
T Consensus 241 ~e~lf~~eL~kf~~~~~~i~~~~~~Q~~ll~~i~~~~~~f~~~~ 284 (342)
T cd08915 241 FEDLFEEHLKKFDKDLTYVEKTKKKQIELIKEIDAANQEFSQVK 284 (342)
T ss_pred hHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4666653 2346688888999999999999888885543
No 43
>PF09224 DUF1961: Domain of unknown function (DUF1961); InterPro: IPR015305 Members of this family are found in a set of hypothetical bacterial proteins. Their exact function has not, as yet, been determined. ; PDB: 1OQ1_C.
Probab=23.70 E-value=3.1e+02 Score=26.72 Aligned_cols=66 Identities=15% Similarity=0.269 Sum_probs=42.7
Q ss_pred CCCCCCeEEeeeeEee--CCeEEeCCCC------CCceeEEEeCCcccccceEEEEEEEEEeCCCCCCCeEEEEEecCCC
Q psy11306 47 DGSVPFWEYGGNCIAS--LENVRVAPSL------RSQKGAIWTKQTTNFEWWNVDIVFRVTGRGRIGADGLAFWYTSEKG 118 (413)
Q Consensus 47 ~~~i~~W~~~GdAii~--~~~IRLTp~~------~sq~GsiWsk~pi~~~sFe~eftFrI~g~g~~ggDGlAFW~t~~~~ 118 (413)
...|.+|.+.|.+.++ ++.++|.... ....=-+|.+.-++ ++..++..|+.... -|||+.|....+
T Consensus 14 ~~dv~~w~~eG~~~~t~~~~~m~l~~~~~~e~~~~~~hfVlW~p~~Fp-~~i~IsweF~p~~e-----pGLamlfF~Aag 87 (218)
T PF09224_consen 14 PADVADWVAEGPAKVTFQNGRMELSGELDPEQFGDAAHFVLWCPEVFP-DDIRISWEFTPLAE-----PGLAMLFFAAAG 87 (218)
T ss_dssp GGGGTT-EEEESEEEE-GGGSEEEE------ESSTT--EEEE-SS-B--SSEEEEEEEEEEE------SEEEEEEEEEEE
T ss_pred HHHhhhhhccCCeEEeecCCcEEEeeeeccccccccccEEEECcccCC-CCeEEEEEEEEcCC-----CCEEEEEeeccc
Confidence 3468999999999775 7889998533 12223578865544 56689999998754 499999987655
No 44
>PF08649 DASH_Dad1: DASH complex subunit Dad1; InterPro: IPR013958 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules []. Throughout the cell cycle Dad1 remains bound to kinetochores and its association is dependent on the Mis6 and Mal2 [].
Probab=23.37 E-value=1.7e+02 Score=22.71 Aligned_cols=35 Identities=14% Similarity=0.347 Sum_probs=29.7
Q ss_pred hhhccchHHHHHHHhhhhHHHHHHHHHHHHHHHHH
Q psy11306 328 WYESENQRELRQIFQGQSQLAEWTKAIATGLDALQ 362 (413)
Q Consensus 328 ~~~~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~ 362 (413)
+||..-++=+++|-..=..|+..|..|++.|..+.
T Consensus 2 ~Fe~qR~~Li~eI~~~~e~vl~nlN~LNRsLE~~i 36 (58)
T PF08649_consen 2 YFERQRDRLIQEISESMESVLNNLNALNRSLESVI 36 (58)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence 56666667789999999999999999999998775
No 45
>PF07072 DUF1342: Protein of unknown function (DUF1342); InterPro: IPR009777 This family consists of several hypothetical bacterial proteins of around 250 residues in length. Members of this family are often known as YacF after the Escherichia coli protein P36680 from SWISSPROT. The function of this family is unknown.; PDB: 2OEZ_A.
Probab=23.10 E-value=1.6e+02 Score=28.40 Aligned_cols=34 Identities=12% Similarity=0.154 Sum_probs=24.0
Q ss_pred chHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHH
Q psy11306 333 NQRELRQIFQGQSQLAEWTKAIATGLDALQQKQD 366 (413)
Q Consensus 333 ~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 366 (413)
++.-|..+++...++.+.|+....|+.+..++-+
T Consensus 66 d~~~l~~~l~~l~~~~~~L~~~~gr~gq~Lrene 99 (211)
T PF07072_consen 66 DQEALDALLQELDQALQALQQAPGRIGQHLRENE 99 (211)
T ss_dssp -HHHHHHHHHHHHHHHHHHHH---STTHHHHC-H
T ss_pred CHHHHHHHHHHHHHHHHHHHhCcchhhhhhhHhH
Confidence 3445888888899999999999989888877654
No 46
>KOG2314|consensus
Probab=23.04 E-value=5.7e+02 Score=28.56 Aligned_cols=38 Identities=13% Similarity=0.079 Sum_probs=28.9
Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcc
Q psy11306 335 RELRQIFQGQSQLAEWTKAIATGLDALQQKQDRILAVV 372 (413)
Q Consensus 335 ~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 372 (413)
|--|++.+++.+++++-+....+.-+-..++|.....+
T Consensus 627 ~AsrElvekrRqlm~~f~~yR~~~~~~~~eek~~r~~l 664 (698)
T KOG2314|consen 627 RASRELVEKRRQLMEQFTEYREKIIEEMAEEKSERIKL 664 (698)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcceeee
Confidence 45678888999999999888888777777776555443
No 47
>PF06133 DUF964: Protein of unknown function (DUF964); InterPro: IPR010368 This entry consists of several relatively short bacterial and archaeal hypothetical sequences. It also includes YlbF and YmcA proteins which are involved in the formation of biofilms []. YlbF regulates sporulation prior to stage II, positively controlling the competence regulator ComK at a post-transcriptional level. It may also modulate the translation, stability or activity of ComS and may work together with YmcA to regulate community development [].; PDB: 2IAZ_C 2OEE_A 2OEQ_D 2PIH_A.
Probab=22.89 E-value=1.5e+02 Score=24.45 Aligned_cols=73 Identities=19% Similarity=0.331 Sum_probs=42.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCC-CCchhhhhccch-HHHHHHHhh
Q psy11306 266 NQEDQKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAH-PNEEEWYESENQ-RELRQIFQG 343 (413)
Q Consensus 266 ~~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~l~~~~~~ 343 (413)
.+|..++.++|..+|+++...+. +-+.+|++. .+...++..... ..+..-+..
T Consensus 32 d~e~~~l~~~f~~~q~~~~~~q~-------------------------~g~~~~~e~~~~l~~~~~~l~~~p~v~~y~~a 86 (108)
T PF06133_consen 32 DPEAQKLIEEFQKLQQELQNAQM-------------------------YGKEPPKEEIEELQELQEELMQNPVVKEYLQA 86 (108)
T ss_dssp SHHHHHHHHHHHHHHHHHHHHHH-------------------------TTSCHHHHHHHHHHHHHHHHHTSHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHHHHHHh-------------------------hccCCcHHHHHHHHHHHHHHHcCHHHHHHHHH
Confidence 36778889999999998866444 122222111 112333332222 556666777
Q ss_pred hhHHHHHHHHHHHHHHHHHH
Q psy11306 344 QSQLAEWTKAIATGLDALQQ 363 (413)
Q Consensus 344 ~~~~~~~~~~l~~~~~~~~~ 363 (413)
|.++-+.|+.+++.+...+.
T Consensus 87 e~~~~~ll~~i~~~I~~~i~ 106 (108)
T PF06133_consen 87 EQALQDLLQDINQIISEAIS 106 (108)
T ss_dssp HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHhhhc
Confidence 77777777777777665543
No 48
>TIGR01834 PHA_synth_III_E poly(R)-hydroxyalkanoic acid synthase, class III, PhaE subunit. This model represents the PhaE subunit of the heterodimeric class (class III) of polymerase for poly(R)-hydroxyalkanoic acids (PHAs), carbon and energy storage polymers of many bacteria. The most common PHA is polyhydroxybutyrate but about 150 different constituent hydroxyalkanoic acids (HAs) have been identified in various species. This model must be designated subfamily to indicate the heterogeneity of PHAs.
Probab=22.52 E-value=3e+02 Score=28.33 Aligned_cols=54 Identities=9% Similarity=0.066 Sum_probs=41.6
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhhhhccccCCCccCCCCCCCCCCccchHHHHHhhHHHHHHHHHHh
Q psy11306 342 QGQSQLAEWTKAIATGLDALQQKQDRILAVVSQGGGIPHQVVPGQPMPMINNDALLASQNSLLSTVQELR 411 (413)
Q Consensus 342 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~ 411 (413)
.-+.++..++-+|.++++++..+.-+.|+++ .+.|||++-....+|=..++.|+
T Consensus 256 k~~G~lvna~m~lr~~~qe~~e~~L~~LnlP----------------TRsElDe~~krL~ELrR~vr~L~ 309 (320)
T TIGR01834 256 KVHGKFINALMRLRIQQQEIVEALLKMLNLP----------------TRSELDEAHQRIQQLRREVKSLK 309 (320)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC----------------CHHHHHHHHHHHHHHHHHHHHHH
Confidence 4467889999999999999999999999888 44577777666666655555554
No 49
>PF08372 PRT_C: Plant phosphoribosyltransferase C-terminal; InterPro: IPR013583 This domain is found at the C terminus of phosphoribosyltransferases and phosphoribosyltransferase-like proteins. It contains putative transmembrane regions. It often appears together with calcium-ion dependent C2 domains (IPR000008 from INTERPRO).
Probab=21.28 E-value=2.5e+02 Score=25.90 Aligned_cols=57 Identities=21% Similarity=0.298 Sum_probs=32.6
Q ss_pred ccCCCCCCCCCchhhhhccch-HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhccc
Q psy11306 314 YQKDHPDAHPNEEEWYESENQ-RELRQIFQGQSQLAEWTKAIATGLDALQQKQDRILAVVS 373 (413)
Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 373 (413)
.-..|||+-.+..|-|.+... .-||+ |..++.+....++..|+++..+=||.-++++
T Consensus 33 ~~~~~~deldEEfD~~ps~~~~~~lr~---Rydrlr~va~rvQ~vlgd~At~gERl~alls 90 (156)
T PF08372_consen 33 ADSAHPDELDEEFDTFPSSRPPDSLRM---RYDRLRSVAGRVQNVLGDVATQGERLQALLS 90 (156)
T ss_pred cccCCcchhhhhhcccccccccHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 344567764444444443222 33555 4445556666666677777777777777763
No 50
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=20.56 E-value=4.8e+02 Score=21.41 Aligned_cols=43 Identities=19% Similarity=0.208 Sum_probs=29.2
Q ss_pred EEEEEEEEeecCCCCCCCcCCChHHHHHHHHHHHHHHHHHHHhh
Q psy11306 244 HDILHFLTSSLLPPGAKQQEQVNQEDQKVAQEYAQYEKKLEEQK 287 (413)
Q Consensus 244 HDIls~~~~sL~~p~~~~~~~~~~e~~~l~~~~~~~~~~~~~~~ 287 (413)
..|+++.+..-..|..+ .+...++..+|.++++.++++++...
T Consensus 48 ~~i~~v~~~~~~~~~~~-~~~~~~~~~~l~~~l~~l~~~~~~~~ 90 (104)
T PF13600_consen 48 VTILSVRFRRDFLPEPE-KESDSPELKELEEELEALEDELAALQ 90 (104)
T ss_pred EEEEEEEEEEeccCccc-cccCcHHHHHHHHHHHHHHHHHHHHH
Confidence 67788777765544322 23456688888888888888776643
No 51
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=20.30 E-value=4.7e+02 Score=29.41 Aligned_cols=32 Identities=13% Similarity=0.208 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccCCccccccccccCCCCcccccCCccCCCCCCC
Q psy11306 267 QEDQKVAQEYAQYEKKLEEQKQHSQNPVERFEYKYSFKPPYLAQKDGYQKDHPDAH 322 (413)
Q Consensus 267 ~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (413)
.....|..++.+++.++..-.. .|...||+.+
T Consensus 288 ~~i~~L~~~l~~l~~~~~~l~~------------------------~y~~~hP~v~ 319 (754)
T TIGR01005 288 DLIQRLRERQAELRATIADLST------------------------TMLANHPRVV 319 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH------------------------hhCCCCHHHH
Confidence 3455666666666666666555 7889999864
No 52
>PRK10722 hypothetical protein; Provisional
Probab=20.15 E-value=2.5e+02 Score=27.76 Aligned_cols=35 Identities=26% Similarity=0.261 Sum_probs=18.7
Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11306 336 ELRQIFQGQSQLAEWTKAIATGLDALQQKQDRILAV 371 (413)
Q Consensus 336 ~l~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 371 (413)
+|...-+-|.+.-..|..+++||..+ ..-+|+||.
T Consensus 177 qlD~lrqq~~~Lq~~L~~t~rKLEnL-TdIERqLSs 211 (247)
T PRK10722 177 ELDALRQQQQRLQYQLELTTRKLENL-TDIERQLSS 211 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhcc
Confidence 34444444555556666666666533 334556654
No 53
>PF05130 FlgN: FlgN protein; InterPro: IPR007809 Flagella synthesis protein FlgN is an export chaperone involved in flagellar synthesis []. This entry represents a FlgN-like domain, consisting of a 4 long helices bundle, where the last helix is shorter than the three others.; GO: 0009296 flagellum assembly, 0019861 flagellum; PDB: 2FUP_A 3OPC_A.
Probab=20.13 E-value=2.7e+02 Score=23.32 Aligned_cols=19 Identities=21% Similarity=0.288 Sum_probs=8.3
Q ss_pred chHHHHHhhHHHHHHHHHH
Q psy11306 392 NNDALLASQNSLLSTVQEL 410 (413)
Q Consensus 392 ~~~~~~~~q~~~~~~~~~~ 410 (413)
.++.++..+..+....+++
T Consensus 75 ~l~~~~~~~~~l~~~~~~l 93 (143)
T PF05130_consen 75 TLSELIEEREELQALWREL 93 (143)
T ss_dssp HHHHHHCCCHHHHHHHHHH
T ss_pred cHHHHHhccHHHHHHHHHH
Confidence 4444444444444443333
Done!