Query psy17488
Match_columns 1446
No_of_seqs 1 out of 3
Neff 1.0
Searched_HMMs 46136
Date Fri Aug 16 23:46:30 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17488.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17488hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF14228 MOR2-PAG1_mid: Cell m 99.4 6.8E-13 1.5E-17 155.6 8.1 151 89-291 114-267 (1120)
2 KOG1825|consensus 97.9 1.1E-05 2.5E-10 99.4 6.3 217 68-294 792-1024(2206)
3 PF13513 HEAT_EZ: HEAT-like re 92.6 0.051 1.1E-06 42.4 0.9 28 223-250 28-55 (55)
4 PF14098 SSPI: Small, acid-sol 81.4 1.5 3.3E-05 39.2 3.2 33 238-277 2-34 (65)
5 TIGR03092 SASP_sspI small, aci 80.0 1.8 4E-05 38.9 3.2 33 238-277 1-33 (65)
6 PF13646 HEAT_2: HEAT repeats; 79.7 1.4 3E-05 35.8 2.2 36 224-259 32-67 (88)
7 PRK02955 small acid-soluble sp 77.4 2.4 5.3E-05 38.4 3.2 34 237-277 3-36 (68)
8 smart00185 ARM Armadillo/beta- 76.1 1.2 2.6E-05 31.9 0.8 29 223-251 12-40 (41)
9 KOG4676|consensus 67.4 4.9 0.00011 45.6 3.4 24 77-102 11-34 (479)
10 PF00514 Arm: Armadillo/beta-c 66.4 2.4 5.3E-05 32.0 0.6 29 223-251 12-40 (41)
11 PF02985 HEAT: HEAT repeat; I 63.0 2.8 6.1E-05 30.7 0.4 25 226-250 3-27 (31)
12 PF03130 HEAT_PBS: PBS lyase H 53.6 16 0.00034 26.7 2.9 24 239-262 1-25 (27)
13 COG5064 SRP1 Karyopherin (impo 49.6 10 0.00023 42.9 2.1 37 223-259 157-195 (526)
14 PF11236 DUF3037: Protein of u 45.3 5.3 0.00011 37.2 -0.8 71 104-174 20-93 (118)
15 PF02969 TAF: TATA box binding 45.0 34 0.00074 30.3 4.1 42 242-283 11-54 (66)
16 smart00567 EZ_HEAT E-Z type HE 42.0 29 0.00063 24.8 2.7 23 238-260 2-25 (30)
17 TIGR01635 tail_comp_S phage vi 41.9 34 0.00073 32.5 3.9 40 234-273 1-40 (144)
18 PF09415 CENP-X: CENP-S associ 40.9 17 0.00038 32.3 1.7 30 249-278 19-48 (72)
19 cd00020 ARM Armadillo/beta-cat 40.2 32 0.00069 28.1 3.1 29 223-251 49-77 (120)
20 PF01934 DUF86: Protein of unk 34.4 46 0.00099 29.4 3.4 36 235-270 81-116 (119)
21 PF13646 HEAT_2: HEAT repeats; 33.8 73 0.0016 25.7 4.2 51 228-278 4-59 (88)
22 PF07112 DUF1368: Protein of u 33.6 17 0.00038 40.9 0.8 21 82-110 28-48 (404)
23 PTZ00033 60S ribosomal protein 33.3 21 0.00044 35.4 1.1 35 93-127 21-66 (125)
24 PF12363 DUF3647: Phage protei 33.0 69 0.0015 29.9 4.4 40 222-261 36-79 (113)
25 PF13513 HEAT_EZ: HEAT-like re 32.6 1.1E+02 0.0025 23.7 5.0 38 238-275 2-42 (55)
26 PRK09687 putative lyase; Provi 32.3 42 0.00092 34.9 3.2 26 225-250 161-186 (280)
27 KOG4843|consensus 31.6 30 0.00065 38.5 2.1 121 91-222 191-333 (355)
28 cd01899 Ygr210 Ygr210 subfamil 29.3 57 0.0012 34.8 3.7 56 228-283 103-159 (318)
29 PF15209 IL31: Interleukin 31 27.9 59 0.0013 32.8 3.2 69 230-303 46-116 (137)
30 PRK09687 putative lyase; Provi 26.2 58 0.0013 33.9 3.0 39 224-262 129-168 (280)
31 PF12972 NAGLU_C: Alpha-N-acet 26.1 74 0.0016 33.3 3.7 88 157-259 16-114 (267)
32 PF13976 gag_pre-integrs: GAG- 24.8 50 0.0011 26.7 1.8 20 241-260 24-44 (67)
33 KOG0414|consensus 24.7 55 0.0012 41.2 2.9 31 226-256 315-345 (1251)
34 KOG2549|consensus 24.5 1.1E+02 0.0024 36.2 5.1 48 246-293 25-80 (576)
35 cd00020 ARM Armadillo/beta-cat 24.4 78 0.0017 25.8 2.9 31 222-252 6-36 (120)
36 PF14123 DUF4290: Domain of un 23.0 1.9E+02 0.0042 30.0 5.9 71 157-249 58-137 (176)
37 PF04484 DUF566: Family of unk 21.4 91 0.002 33.7 3.4 21 228-248 250-273 (311)
38 PF08757 CotH: CotH protein; 20.9 2.4E+02 0.0053 27.8 6.1 31 243-273 261-292 (323)
39 PF12755 Vac14_Fab1_bd: Vacuol 20.8 93 0.002 28.4 2.9 28 224-251 25-55 (97)
40 PF05069 Phage_tail_S: Phage v 20.4 1.4E+02 0.003 28.0 4.1 38 235-274 2-39 (148)
41 TIGR00274 N-acetylmuramic acid 20.3 71 0.0015 33.8 2.3 34 238-271 258-291 (291)
42 TIGR03354 VI_FHA type VI secre 20.2 2.4E+02 0.0051 31.3 6.3 47 233-280 204-254 (396)
No 1
>PF14228 MOR2-PAG1_mid: Cell morphogenesis central region
Probab=99.37 E-value=6.8e-13 Score=155.62 Aligned_cols=151 Identities=33% Similarity=0.573 Sum_probs=116.4
Q ss_pred CCchhhHHHhhhcCceeeechhhhhhcchhhhhhccccccccCCccCCccchhhccccCCCCCCCccchhhHHHHHHHHh
Q psy17488 89 DPWGFCLFQFLQKGRVLTYCPSAVAQAWPIVYSRLHTLFPTIDPTPVSDNRASLLRSSAPPRKPVSERDSFLSLWRYLAI 168 (1446)
Q Consensus 89 dpwgfclfqflqkgrvltycpsavaqawpivysrlhtlfptidptpvsdnrasllrssapprkpvserdsflslwrylai 168 (1446)
..|.-||=.||.- +...||.+||.+|-+|..||-.+-|. |. .. ++. .+++ -|..+..|+--.|
T Consensus 114 tlW~r~fp~lir~--~fe~CP~tval~R~~v~~RL~~i~~~-~l---~~-k~~---~s~~-------~e~~ieQWKlYLI 176 (1120)
T PF14228_consen 114 TLWFRCFPNLIRI--AFERCPFTVALCRSEVCQRLQQITPI-EL---GS-KSS---QSTD-------PEVLIEQWKLYLI 176 (1120)
T ss_pred HHHHHHHHHHHHH--HHHhCcHHHHHHHHHHHHHhccCCch-hc---cC-ccC---CCCC-------ccHHHHHHHHHHH
Confidence 4798888888765 56679999999999999999766542 21 11 111 1222 2789999999999
Q ss_pred hhccccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCcCCCCCCCCCCCCccchhhhhhccccccccchhhHHHHHhc
Q psy17488 169 FSARVVPPVPYPTPRCASPDLSLSWSPENVGGERGGAGVENKPAPTASPPPSPTGLYKLLVPLLRCDTTDVRDAVVHALG 248 (1446)
Q Consensus 169 fsarvvppvpyptprcaspdlslswspenvggerggagvenkpaptaspppsptglykllvpllrcdttdvrdavvhalg 248 (1446)
|.. ++|--+ -. +....+..-||+++||+|+|+.++|+||+|.|||
T Consensus 177 fAC-------------ttp~~s----~s------------------~~~i~SaR~LFk~ivPlLks~~~~~r~AaVlaLG 221 (1120)
T PF14228_consen 177 FAC-------------TTPTDS----RS------------------QQKITSARELFKLIVPLLKSESSSFRDAAVLALG 221 (1120)
T ss_pred HHh-------------cCCCcC----cc------------------ccccCCHHHHHHHHhhhhccCcHHHHHHHHHhcC
Confidence 975 222111 00 1224467889999999999999999999999999
Q ss_pred cCCHHHHHHHHHHHHHHHHHHh---hhhhHHHHHHhhhchhhhccc
Q psy17488 249 NINSEALKDLMEELSVYIREAV---DRKTENMRRRRRRDALRLQSY 291 (1446)
Q Consensus 249 ninsealkdlmeelsvyireav---drktenmrrrrrrdalrlqsy 291 (1446)
+||..+++.||++|+++|.++. +.+..+|++++|+|.||++-.
T Consensus 222 ~~n~~v~~~LleeL~~~i~~~~~e~e~r~~~k~rr~Rrd~LR~ev~ 267 (1120)
T PF14228_consen 222 SINLNVYRTLLEELQSYIEECNSEAESRPKWKRRRRRRDRLRTEVT 267 (1120)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHhcccccccchhhhhhHHHHHH
Confidence 9999999999999999999874 577889999999999998753
No 2
>KOG1825|consensus
Probab=97.93 E-value=1.1e-05 Score=99.43 Aligned_cols=217 Identities=13% Similarity=0.071 Sum_probs=156.3
Q ss_pred ccccCCCCCceeecCCCCCCCCCchhhHHHhhhcCceeeechhhhhhcchhhhhhccccccccCCc-cCCccchhhcccc
Q psy17488 68 IHEDSNRNGSVLNLSPSCSAPDPWGFCLFQFLQKGRVLTYCPSAVAQAWPIVYSRLHTLFPTIDPT-PVSDNRASLLRSS 146 (1446)
Q Consensus 68 ihedsnrngsvlnlspscsapdpwgfclfqflqkgrvltycpsavaqawpivysrlhtlfptidpt-pvsdnrasllrss 146 (1446)
+|.+..-.+.|.+.--+|-.-|+||+|||-+.+.--+++-|+.+++++|++...++-+.-++.|+. |-.|+...+++-.
T Consensus 792 ~~~~i~sd~~~~~~~f~~~~k~kwg~~L~~~~~~~~l~~~c~~~~s~s~~ll~~~~~a~~~~~~~s~~q~d~~s~~~~~~ 871 (2206)
T KOG1825|consen 792 DSDTIPSDSTLWSIIFSRLDKNKWGLLLSELVKYAGLLKPCSVQESKSEILLRRAHSAPSEFGGKSNPQNDLDSKLLRWL 871 (2206)
T ss_pred CCCCCCCcceehhhhhhhhccchhhhhhHHHHHHhcCCCCcchhhhhhhHHHHHHhccchhccCCCccccccchHHHhhh
Confidence 444555556666677778788999999999999999999999999999999999999999999997 5567787777765
Q ss_pred CCCCCC-----CccchhhHHHHHHHHhhhccccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCC-------cCCCCCC
Q psy17488 147 APPRKP-----VSERDSFLSLWRYLAIFSARVVPPVPYPTPRCASPDLSLSWSPENVGGERGGAGV-------ENKPAPT 214 (1446)
Q Consensus 147 apprkp-----vserdsflslwrylaifsarvvppvpyptprcaspdlslswspenvggerggagv-------enkpapt 214 (1446)
--+--- +---+.-+.+|+-.-||...|.++. -.+++---+-.|.+.++.-+|.-- .++|+.+
T Consensus 872 ~~~~~v~fk~~~~~~~~~v~~w~~yl~~~~~~~~~~------~~s~~h~rs~~pt~~~~s~~~~fS~~~~~~s~~tpi~s 945 (2206)
T KOG1825|consen 872 LYAMFVCFKPPDGLDEGSVASWRDYLHLIFGVLRFG------SESGNHLRSNAPTISHTSCEGMFSEVAISFSDETPIKS 945 (2206)
T ss_pred ccccceecccccccchhhhhhhhhhHHHhhhccccc------cccccchhhcCcccccCCccceehhhhhhccccCccch
Confidence 443322 3334556788997777776665432 223343445566766666444321 2233222
Q ss_pred CCCCCCccchhhhhhccccccccchhhHHHHHhccCCHHHHHHHHHHHHH--HHHHHhhhhhH-HHHHHhhhchhhhccc
Q psy17488 215 ASPPPSPTGLYKLLVPLLRCDTTDVRDAVVHALGNINSEALKDLMEELSV--YIREAVDRKTE-NMRRRRRRDALRLQSY 291 (1446)
Q Consensus 215 aspppsptglykllvpllrcdttdvrdavvhalgninsealkdlmeelsv--yireavdrkte-nmrrrrrrdalrlqsy 291 (1446)
+.-.++-++.|++..-..+++.+-+|-..|.+|.-+..-+.++|.- -.++++.|.-+ |..++-|.|.|++|-+
T Consensus 946 ----~~k~~i~~~~~~~~~~~~~~i~~~~v~~~~~~~~~vf~~~~~~l~~r~~~r~sl~R~e~~~~~~~~r~dLl~~~i~ 1021 (2206)
T KOG1825|consen 946 ----KIKKVIRRENVRVHVENIYRITEENVWPFMLARKPVFALLVLRLIERSTRRNSLARSESFHIMAPLRYDLLSVLIF 1021 (2206)
T ss_pred ----HHHHHHhhhhHHHHHHhccccchhheeccccccccchhhhhhhhhhcccchhhhhcchhhhhhhhhhHHHHHHHhc
Confidence 2334566667777777777788888888888888777777777665 56789999887 9999999999999987
Q ss_pred CCC
Q psy17488 292 RPE 294 (1446)
Q Consensus 292 rpe 294 (1446)
+-.
T Consensus 1022 ~~~ 1024 (2206)
T KOG1825|consen 1022 LDP 1024 (2206)
T ss_pred ccH
Confidence 643
No 3
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=92.56 E-value=0.051 Score=42.42 Aligned_cols=28 Identities=46% Similarity=0.676 Sum_probs=24.9
Q ss_pred chhhhhhccccccccchhhHHHHHhccC
Q psy17488 223 GLYKLLVPLLRCDTTDVRDAVVHALGNI 250 (1446)
Q Consensus 223 glykllvpllrcdttdvrdavvhalgni 250 (1446)
.+..+|+++|.-|..+||.+.++|||||
T Consensus 28 ~~~~~L~~~L~d~~~~VR~~A~~aLg~l 55 (55)
T PF13513_consen 28 ELLPALIPLLQDDDDSVRAAAAWALGNL 55 (55)
T ss_dssp HHHHHHHHHTTSSSHHHHHHHHHHHHCH
T ss_pred HHHHHHHHHHcCCCHHHHHHHHHHHhcC
Confidence 4677888999888899999999999997
No 4
>PF14098 SSPI: Small, acid-soluble spore protein I
Probab=81.43 E-value=1.5 Score=39.22 Aligned_cols=33 Identities=33% Similarity=0.573 Sum_probs=27.7
Q ss_pred chhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhhhHHH
Q psy17488 238 DVRDAVVHALGNINSEALKDLMEELSVYIREAVDRKTENM 277 (1446)
Q Consensus 238 dvrdavvhalgninsealkdlmeelsvyireavdrktenm 277 (1446)
|+|.||+|-|-|-+.+.|++. |.+|||.+-|+|
T Consensus 2 dlR~AI~~nv~g~s~~el~~~-------I~daI~sgEE~~ 34 (65)
T PF14098_consen 2 DLRQAIIHNVKGSSKEELKDT-------IEDAIQSGEEKA 34 (65)
T ss_pred CHHHHHHHHccCCCHHHHHHH-------HHHHHhccchhc
Confidence 799999999999998887774 667888877765
No 5
>TIGR03092 SASP_sspI small, acid-soluble spore protein I. This protein family is restricted to a subset of endospore-forming bacteria such as Bacillus subtilis, all of which are in the Firmicutes (low-GC Gram-positive) lineage. It is a minor SASP (small, acid-soluble spore protein) designated SspI. The gene in Bacillus subtilis previously was designated ysfA.
Probab=80.01 E-value=1.8 Score=38.89 Aligned_cols=33 Identities=27% Similarity=0.454 Sum_probs=28.0
Q ss_pred chhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhhhHHH
Q psy17488 238 DVRDAVVHALGNINSEALKDLMEELSVYIREAVDRKTENM 277 (1446)
Q Consensus 238 dvrdavvhalgninsealkdlmeelsvyireavdrktenm 277 (1446)
|+|+||+|-|-|-+.+.||+. |-.|||.+-|+|
T Consensus 1 nlR~Ai~~nv~~~s~~elk~~-------I~daI~sgEEk~ 33 (65)
T TIGR03092 1 NLRRAILANISNNTKEQLEAT-------IVDAIQSGEEKM 33 (65)
T ss_pred CHHHHHHHHhcCCCHHHHHHH-------HHHHHhccchhc
Confidence 789999999999998877775 667888888876
No 6
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=79.65 E-value=1.4 Score=35.75 Aligned_cols=36 Identities=28% Similarity=0.339 Sum_probs=27.4
Q ss_pred hhhhhhccccccccchhhHHHHHhccCCHHHHHHHH
Q psy17488 224 LYKLLVPLLRCDTTDVRDAVVHALGNINSEALKDLM 259 (1446)
Q Consensus 224 lykllvpllrcdttdvrdavvhalgninsealkdlm 259 (1446)
+...|+.+|..+..+||.+++.|||+|......+.|
T Consensus 32 ~~~~L~~~l~d~~~~vr~~a~~aL~~i~~~~~~~~L 67 (88)
T PF13646_consen 32 AIPALIELLKDEDPMVRRAAARALGRIGDPEAIPAL 67 (88)
T ss_dssp HHHHHHHHHTSSSHHHHHHHHHHHHCCHHHHTHHHH
T ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHHhCCHHHHHHH
Confidence 456778888888888999999999999765544433
No 7
>PRK02955 small acid-soluble spore protein SspI; Provisional
Probab=77.44 E-value=2.4 Score=38.39 Aligned_cols=34 Identities=26% Similarity=0.387 Sum_probs=28.4
Q ss_pred cchhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhhhHHH
Q psy17488 237 TDVRDAVVHALGNINSEALKDLMEELSVYIREAVDRKTENM 277 (1446)
Q Consensus 237 tdvrdavvhalgninsealkdlmeelsvyireavdrktenm 277 (1446)
-|+|+||+|-|-|-+.+.|++ +|-+||+.+-|+|
T Consensus 3 ~nlR~Ai~~nv~g~s~eel~~-------~I~daIqsgEEk~ 36 (68)
T PRK02955 3 FNLRGAVLANVSGNSKEELEG-------TIVDAIQSGEEKM 36 (68)
T ss_pred ccHHHHHHHHccCCCHHHHHH-------HHHHHHhccchhc
Confidence 589999999999988877766 5677888888876
No 8
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=76.09 E-value=1.2 Score=31.86 Aligned_cols=29 Identities=31% Similarity=0.449 Sum_probs=25.2
Q ss_pred chhhhhhccccccccchhhHHHHHhccCC
Q psy17488 223 GLYKLLVPLLRCDTTDVRDAVVHALGNIN 251 (1446)
Q Consensus 223 glykllvpllrcdttdvrdavvhalgnin 251 (1446)
|....||.||.++..+|..+++.||.||.
T Consensus 12 g~i~~L~~ll~~~~~~i~~~a~~aL~nl~ 40 (41)
T smart00185 12 GGLPALVELLKSEDEEVVKEAAWALSNLS 40 (41)
T ss_pred CCHHHHHHHHcCCCHHHHHHHHHHHHHHc
Confidence 45678999999988999999999999984
No 9
>KOG4676|consensus
Probab=67.37 E-value=4.9 Score=45.62 Aligned_cols=24 Identities=38% Similarity=0.296 Sum_probs=15.8
Q ss_pred ceeecCCCCCCCCCchhhHHHhhhcC
Q psy17488 77 SVLNLSPSCSAPDPWGFCLFQFLQKG 102 (1446)
Q Consensus 77 svlnlspscsapdpwgfclfqflqkg 102 (1446)
.|.||+|+... --...||.||-|=
T Consensus 11 qvanispsat~--dqm~tlFg~lGkI 34 (479)
T KOG4676|consen 11 QVANISPSATK--DQMQTLFGNLGKI 34 (479)
T ss_pred eecccCchhhH--HHHHHHHhhcccc
Confidence 47788887542 3345788888663
No 10
>PF00514 Arm: Armadillo/beta-catenin-like repeat; InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=66.42 E-value=2.4 Score=31.95 Aligned_cols=29 Identities=34% Similarity=0.518 Sum_probs=25.9
Q ss_pred chhhhhhccccccccchhhHHHHHhccCC
Q psy17488 223 GLYKLLVPLLRCDTTDVRDAVVHALGNIN 251 (1446)
Q Consensus 223 glykllvpllrcdttdvrdavvhalgnin 251 (1446)
|....||.||.++..+|+..++-||+||-
T Consensus 12 g~i~~Lv~ll~~~~~~v~~~a~~al~nl~ 40 (41)
T PF00514_consen 12 GGIPPLVQLLKSPDPEVQEEAAWALGNLA 40 (41)
T ss_dssp THHHHHHHHTTSSSHHHHHHHHHHHHHHH
T ss_pred ccHHHHHHHHcCCCHHHHHHHHHHHHHHh
Confidence 56778999999999999999999999973
No 11
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=62.97 E-value=2.8 Score=30.66 Aligned_cols=25 Identities=36% Similarity=0.578 Sum_probs=19.4
Q ss_pred hhhhccccccccchhhHHHHHhccC
Q psy17488 226 KLLVPLLRCDTTDVRDAVVHALGNI 250 (1446)
Q Consensus 226 kllvpllrcdttdvrdavvhalgni 250 (1446)
-.|+-+|.=+..+||+|++.+||+|
T Consensus 3 p~l~~~l~D~~~~VR~~a~~~l~~i 27 (31)
T PF02985_consen 3 PILLQLLNDPSPEVRQAAAECLGAI 27 (31)
T ss_dssp HHHHHHHT-SSHHHHHHHHHHHHHH
T ss_pred HHHHHHcCCCCHHHHHHHHHHHHHH
Confidence 3556677767789999999999986
No 12
>PF03130 HEAT_PBS: PBS lyase HEAT-like repeat; InterPro: IPR004155 These proteins contain a short bi-helical repeat that is related to HEAT. Cyanobacteria and red algae harvest light energy using macromolecular complexes known as phycobilisomes (PBS), peripherally attached to the photosynthetic membrane. The major components of PBS are the phycobiliproteins. These heterodimeric proteins are covalently attached to phycobilins: open-chain tetrapyrrole chromophores, which function as the photosynthetic light-harvesting pigments. Phycobiliproteins differ in sequence and in the nature and number of attached phycobilins to each of their subunits. These proteins include the lyase enzymes that specifically attach particular phycobilins to apophycobiliprotein subunits. The most comprehensively studied of these is the CpcE/Flyase P31967 from SWISSPROT, P31968 from SWISSPROT, which attaches phycocyanobilin (PCB) to the alpha subunit of apophycocyanin []. Similarly, MpeU/V attaches phycoerythrobilin to phycoerythrin II, while CpeY/Z is thought to be involved in phycoerythrobilin (PEB) attachment to phycoerythrin (PE) I (PEs I and II differ in sequence and in the number of attached molecules of PEB: PE I has five, PE II has six) []. All the reactions of the above lyases involve an apoprotein cysteine SH addition to a terminal delta 3,3'-double bond. Such a reaction is not possible in the case of phycoviolobilin (PVB), the phycobilin of alpha-phycoerythrocyanin (alpha-PEC). It is thought that in this case, PCB, not PVB, is first added to apo-alpha-PEC, and is then isomerized to PVB. The addition reaction has been shown to occur in the presence of either of the components of alpha-PEC-PVB lyase PecE or PecF (or both). The isomerisation reaction occurs only when both PecE and PecF components are present, i.e. the PecE/F phycobiliprotein lyase is also a phycobilin isomerase []. Another member of this family is the NblB protein, whose similarity to the phycobiliprotein lyases was previously noted []. This constitutively expressed protein is not known to have any lyase activity. It is thought to be involved in the coordination of PBS degradation with environmental nutrient limitation. It has been suggested that the similarity of NblB to the phycobiliprotein lyases is due to the ability to bind tetrapyrrole phycobilins via the common repeated motif [].; PDB: 1TE4_A.
Probab=53.61 E-value=16 Score=26.67 Aligned_cols=24 Identities=46% Similarity=0.672 Sum_probs=18.0
Q ss_pred hhhHHHHHhccCCH-HHHHHHHHHH
Q psy17488 239 VRDAVVHALGNINS-EALKDLMEEL 262 (1446)
Q Consensus 239 vrdavvhalgnins-ealkdlmeel 262 (1446)
||-+.+-|||+|.. +|+..|++-|
T Consensus 1 VR~~Aa~aLg~igd~~ai~~L~~~L 25 (27)
T PF03130_consen 1 VRRAAARALGQIGDPRAIPALIEAL 25 (27)
T ss_dssp HHHHHHHHHGGG-SHHHHHHHHHHH
T ss_pred CHHHHHHHHHHcCCHHHHHHHHHHh
Confidence 68889999999988 6666666654
No 13
>COG5064 SRP1 Karyopherin (importin) alpha [Intracellular trafficking and secretion]
Probab=49.61 E-value=10 Score=42.93 Aligned_cols=37 Identities=43% Similarity=0.678 Sum_probs=29.8
Q ss_pred chhhhhhccccccccchhhHHHHHhccC--CHHHHHHHH
Q psy17488 223 GLYKLLVPLLRCDTTDVRDAVVHALGNI--NSEALKDLM 259 (1446)
Q Consensus 223 glykllvpllrcdttdvrdavvhalgni--nsealkdlm 259 (1446)
|..-|++-||---..||++.+|.||||| +|++..|.+
T Consensus 157 ~AVPlfiqlL~s~~~~V~eQavWALGNiAGDS~~~RD~v 195 (526)
T COG5064 157 GAVPLFIQLLSSTEDDVREQAVWALGNIAGDSEGCRDYV 195 (526)
T ss_pred CchHHHHHHHcCchHHHHHHHHHHhccccCCchhHHHHH
Confidence 3344788888888889999999999999 577777754
No 14
>PF11236 DUF3037: Protein of unknown function (DUF3037); InterPro: IPR021398 This bacterial family of proteins has no known function.
Probab=45.25 E-value=5.3 Score=37.22 Aligned_cols=71 Identities=20% Similarity=0.342 Sum_probs=44.4
Q ss_pred eeeechhh--hhhcchhhhhhccccccccCCccCCccchhhccccCCCC-CCCccchhhHHHHHHHHhhhcccc
Q psy17488 104 VLTYCPSA--VAQAWPIVYSRLHTLFPTIDPTPVSDNRASLLRSSAPPR-KPVSERDSFLSLWRYLAIFSARVV 174 (1446)
Q Consensus 104 vltycpsa--vaqawpivysrlhtlfptidptpvsdnrasllrssappr-kpvserdsflslwrylaifsarvv 174 (1446)
||.|||.+ +.-.|-.-..||+.|||.+|..-+.++-.++-..-+--. ......+...+.|++|.---.-||
T Consensus 20 Vvl~~~~~~~l~~r~~~~~~Rl~~f~~~~D~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~f~~lt~~rstvi 93 (118)
T PF11236_consen 20 VVLFCPEQGFLDFRFHLDRKRLRAFFPELDIDLVRAALEAFERECAGIKEAGPIAQLDIAERFRWLTAPRSTVI 93 (118)
T ss_pred EEEEeCCCCeEEEEEeCCHHHHHHhCccCCHHHHHHHHHHHHHHHccCCCCCCcCcccHHHHHHHhhhcccCcE
Confidence 66788843 444555556999999999999988887766533222111 111445557788888764433333
No 15
>PF02969 TAF: TATA box binding protein associated factor (TAF); InterPro: IPR004823 The TATA box binding protein associated factor (TAF) is part of the transcription initiation factor TFIID multimeric protein complex. TFIID plays a central role in mediating promoter responses to various activators and repressors. It binds tightly to TAFII-250 and directly interacts with TAFII-40. TFIID is composed of TATA binding protein (TBP)and a number of TBP-associated factors (TAFS). TAF proteins adopt a histone-like fold.; GO: 0006352 transcription initiation, DNA-dependent, 0005634 nucleus; PDB: 1TAF_B.
Probab=45.01 E-value=34 Score=30.32 Aligned_cols=42 Identities=29% Similarity=0.484 Sum_probs=32.3
Q ss_pred HHHHHh--ccCCHHHHHHHHHHHHHHHHHHhhhhhHHHHHHhhh
Q psy17488 242 AVVHAL--GNINSEALKDLMEELSVYIREAVDRKTENMRRRRRR 283 (1446)
Q Consensus 242 avvhal--gninsealkdlmeelsvyireavdrktenmrrrrrr 283 (1446)
+|.-.| .||+.++++.|.+++..-|++.|..-..-|+.-+|.
T Consensus 11 ~iAes~Gi~~l~de~a~~La~dveyrlreiiq~a~kfm~hskR~ 54 (66)
T PF02969_consen 11 DIAESLGISNLSDEAAKALAEDVEYRLREIIQEALKFMRHSKRT 54 (66)
T ss_dssp HHHHHTT---B-HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-S
T ss_pred HHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 344444 489999999999999999999999999999887765
No 16
>smart00567 EZ_HEAT E-Z type HEAT repeats. Present in subunits of cyanobacterial phycocyanin lyase, and other proteins. Probable scaffolding role.
Probab=42.01 E-value=29 Score=24.84 Aligned_cols=23 Identities=35% Similarity=0.531 Sum_probs=16.2
Q ss_pred chhhHHHHHhccCCH-HHHHHHHH
Q psy17488 238 DVRDAVVHALGNINS-EALKDLME 260 (1446)
Q Consensus 238 dvrdavvhalgnins-ealkdlme 260 (1446)
-||-+++-+||||.. +|+..|++
T Consensus 2 ~vR~~aa~aLg~~~~~~a~~~L~~ 25 (30)
T smart00567 2 LVRHEAAFALGQLGDEEAVPALIK 25 (30)
T ss_pred HHHHHHHHHHHHcCCHhHHHHHHH
Confidence 378889999999954 45544444
No 17
>TIGR01635 tail_comp_S phage virion morphogenesis (putative tail completion) protein. This model describes protein S of phage P2, suggested experimentally to act in tail completion and stable head joining, and related proteins from a number of phage.
Probab=41.88 E-value=34 Score=32.49 Aligned_cols=40 Identities=8% Similarity=0.217 Sum_probs=33.2
Q ss_pred ccccchhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhh
Q psy17488 234 CDTTDVRDAVVHALGNINSEALKDLMEELSVYIREAVDRK 273 (1446)
Q Consensus 234 cdttdvrdavvhalgninsealkdlmeelsvyireavdrk 273 (1446)
.|+.+|.+++-++|.+++...++.||.+|.-+|+..++..
T Consensus 1 ~~~~~~~~~l~~l~~~l~~~~~~~l~~~Ig~~l~~~~~~r 40 (144)
T TIGR01635 1 EDRSYIQVALERLLQVLDPKTRDKMLRRIAAQLASSTRKN 40 (144)
T ss_pred CcHHHHHHHHHHHHHhcChhhHHHHHHHHHHHHHHHHHHH
Confidence 3778899999999999988788899999999888776543
No 18
>PF09415 CENP-X: CENP-S associating Centromere protein X; InterPro: IPR018552 Centromere protein X (CENP-X) is a component of the CENP-S complex. The CENP-S complex is composed of at least of CENP-S and CENP-X and is essential for the stable assembly of the outer kinetchore []. CENP-X is also a DNA-binding component of the Fanconi anemia (FA) core complex involved in DNA damage repair and genome maintenance. The FA complex is composed of CENPS, FANCA, FANCB, FANCC, FANCE, FANCF, FANCG, FANCL/PHF9, FANCM, FAAP24 and CENPX. Interacts with CENPS, FANCM and FAAP24 [, ].; PDB: 4DRB_L 4DRA_H 3V9R_D.
Probab=40.88 E-value=17 Score=32.35 Aligned_cols=30 Identities=40% Similarity=0.556 Sum_probs=25.3
Q ss_pred cCCHHHHHHHHHHHHHHHHHHhhhhhHHHH
Q psy17488 249 NINSEALKDLMEELSVYIREAVDRKTENMR 278 (1446)
Q Consensus 249 ninsealkdlmeelsvyireavdrktenmr 278 (1446)
+|+.+||+-+-+=|-++|+|||-|-.++-+
T Consensus 19 kIs~dal~l~~eyl~iFV~EAv~Ra~~~a~ 48 (72)
T PF09415_consen 19 KISKDALKLSAEYLRIFVREAVARAAEQAE 48 (72)
T ss_dssp EE-CCCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CcCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 688999999999999999999999887543
No 19
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=40.24 E-value=32 Score=28.14 Aligned_cols=29 Identities=34% Similarity=0.470 Sum_probs=26.0
Q ss_pred chhhhhhccccccccchhhHHHHHhccCC
Q psy17488 223 GLYKLLVPLLRCDTTDVRDAVVHALGNIN 251 (1446)
Q Consensus 223 glykllvpllrcdttdvrdavvhalgnin 251 (1446)
|+...|+.+|..+..+|+.+.+-||+||-
T Consensus 49 ~~i~~l~~~l~~~~~~v~~~a~~~L~~l~ 77 (120)
T cd00020 49 GGLPALVQLLKSEDEEVVKAALWALRNLA 77 (120)
T ss_pred CChHHHHHHHhCCCHHHHHHHHHHHHHHc
Confidence 67889999999998999999999999983
No 20
>PF01934 DUF86: Protein of unknown function DUF86; InterPro: IPR008201 This entry describes prokaryotic proteins of unknown function.; PDB: 1YLM_A.
Probab=34.36 E-value=46 Score=29.39 Aligned_cols=36 Identities=19% Similarity=0.355 Sum_probs=24.5
Q ss_pred cccchhhHHHHHhccCCHHHHHHHHHHHHHHHHHHh
Q psy17488 235 DTTDVRDAVVHALGNINSEALKDLMEELSVYIREAV 270 (1446)
Q Consensus 235 dttdvrdavvhalgninsealkdlmeelsvyireav 270 (1446)
..+++|+.+||.-.+||.+.|-+++.+.--.+.++|
T Consensus 81 ~~~g~RN~lvH~Y~~id~~~i~~~i~~~l~~l~~~~ 116 (119)
T PF01934_consen 81 KMVGFRNRLVHDYDSIDDEIIYEIIKEDLPDLEEFI 116 (119)
T ss_dssp HHHTTHHHHHT-GGG--HHHHHHHHHHTHHHHHHHH
T ss_pred HHHHHHHHHccccccCCHHHHHHHHHHHHHHHHHHH
Confidence 356899999999999999999888876444444443
No 21
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=33.76 E-value=73 Score=25.68 Aligned_cols=51 Identities=25% Similarity=0.342 Sum_probs=30.9
Q ss_pred hhccc-cccccchhhHHHHHhccCCHHHHHHHHHHH----HHHHHHHhhhhhHHHH
Q psy17488 228 LVPLL-RCDTTDVRDAVVHALGNINSEALKDLMEEL----SVYIREAVDRKTENMR 278 (1446)
Q Consensus 228 lvpll-rcdttdvrdavvhalgninsealkdlmeel----svyireavdrktenmr 278 (1446)
|+-+| .=...+||..++++||++..+.+-+++-++ ...||.++-.-..+|.
T Consensus 4 L~~~l~~~~~~~vr~~a~~~L~~~~~~~~~~~L~~~l~d~~~~vr~~a~~aL~~i~ 59 (88)
T PF13646_consen 4 LLQLLQNDPDPQVRAEAARALGELGDPEAIPALIELLKDEDPMVRRAAARALGRIG 59 (88)
T ss_dssp HHHHHHTSSSHHHHHHHHHHHHCCTHHHHHHHHHHHHTSSSHHHHHHHHHHHHCCH
T ss_pred HHHHHhcCCCHHHHHHHHHHHHHcCCHhHHHHHHHHHcCCCHHHHHHHHHHHHHhC
Confidence 44455 444468999999999999977554443333 2445554444444443
No 22
>PF07112 DUF1368: Protein of unknown function (DUF1368); InterPro: IPR010778 This family consists of several proteins with seem to be specific to red algae plasmids. Members of this family are typically around 415 residues in length. The function of this family is unknown.
Probab=33.62 E-value=17 Score=40.94 Aligned_cols=21 Identities=43% Similarity=0.977 Sum_probs=17.7
Q ss_pred CCCCCCCCCchhhHHHhhhcCceeeechh
Q psy17488 82 SPSCSAPDPWGFCLFQFLQKGRVLTYCPS 110 (1446)
Q Consensus 82 spscsapdpwgfclfqflqkgrvltycps 110 (1446)
++|||. |-|.|||| ||.||+.
T Consensus 28 ~~S~SE-dDW~ycll-------ilkfi~p 48 (404)
T PF07112_consen 28 SPSPSE-DDWQYCLL-------ILKFINP 48 (404)
T ss_pred CCCCCc-cchhheee-------eeeecCc
Confidence 788886 55999997 7899986
No 23
>PTZ00033 60S ribosomal protein L24; Provisional
Probab=33.34 E-value=21 Score=35.44 Aligned_cols=35 Identities=26% Similarity=0.492 Sum_probs=29.3
Q ss_pred hhHHHhhhcCceeeechhhhh-----------hcchhhhhhccccc
Q psy17488 93 FCLFQFLQKGRVLTYCPSAVA-----------QAWPIVYSRLHTLF 127 (1446)
Q Consensus 93 fclfqflqkgrvltycpsava-----------qawpivysrlhtlf 127 (1446)
|.-|.||..|.|+++|-+-.. .+|-.+|-|+|.--
T Consensus 21 ~Vr~~~~~Dgkv~~F~~sKc~~~~~~krnPRkl~WT~~yRr~~kK~ 66 (125)
T PTZ00033 21 YVPFAFLSTKPVLTFLRPKCFALYMRKKNPRFLPWTRTYRRINRKT 66 (125)
T ss_pred eeecccCCCCCEEEEecHHHHHHHHCcCCCccchHHHHHHHHhCCc
Confidence 566889999999999987543 47999999999875
No 24
>PF12363 DUF3647: Phage protein ; InterPro: IPR024410 Proteins in this entry are frequently annotated as phage proteins, however there is little accompanying literature to back this up or to describe the nature of these phage proteins.
Probab=32.96 E-value=69 Score=29.88 Aligned_cols=40 Identities=28% Similarity=0.416 Sum_probs=31.7
Q ss_pred cchhhhhhccccccccchhhHHHHHh----ccCCHHHHHHHHHH
Q psy17488 222 TGLYKLLVPLLRCDTTDVRDAVVHAL----GNINSEALKDLMEE 261 (1446)
Q Consensus 222 tglykllvpllrcdttdvrdavvhal----gninsealkdlmee 261 (1446)
.|+-.|++-|+.+|+.-+.|++-+|+ .++..++|.+.+++
T Consensus 36 ~G~~~l~~~l~~~d~~al~d~i~~a~~~~~~~~s~~eIe~~ie~ 79 (113)
T PF12363_consen 36 MGLSQLVPGLLQGDPVALADIIYAATAHEKKRPSREEIEDYIED 79 (113)
T ss_pred cCHHHHHHHHHcCCHHHHHHHHHHHhcccCCCCCHHHHHHHHHH
Confidence 47788899999999999999998888 45566666666664
No 25
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=32.60 E-value=1.1e+02 Score=23.67 Aligned_cols=38 Identities=26% Similarity=0.359 Sum_probs=24.4
Q ss_pred chhhHHHHHhccCC---HHHHHHHHHHHHHHHHHHhhhhhH
Q psy17488 238 DVRDAVVHALGNIN---SEALKDLMEELSVYIREAVDRKTE 275 (1446)
Q Consensus 238 dvrdavvhalgnin---sealkdlmeelsvyireavdrkte 275 (1446)
-||.+.+.|||+|. .+.++.++.+|--.+...+.....
T Consensus 2 ~vR~~A~~aLg~l~~~~~~~~~~~~~~~~~~L~~~L~d~~~ 42 (55)
T PF13513_consen 2 RVRRAAAWALGRLAEGCPELLQPYLPELLPALIPLLQDDDD 42 (55)
T ss_dssp HHHHHHHHHHHCTTTTTHHHHHHHHHHHHHHHHHHTTSSSH
T ss_pred HHHHHHHHHHhhHhcccHHHHHHHHHHHHHHHHHHHcCCCH
Confidence 37899999999764 556665555555555555543333
No 26
>PRK09687 putative lyase; Provisional
Probab=32.31 E-value=42 Score=34.88 Aligned_cols=26 Identities=35% Similarity=0.405 Sum_probs=15.5
Q ss_pred hhhhhccccccccchhhHHHHHhccC
Q psy17488 225 YKLLVPLLRCDTTDVRDAVVHALGNI 250 (1446)
Q Consensus 225 ykllvpllrcdttdvrdavvhalgni 250 (1446)
..+|+.+|.-+..+||.+.+.|||+|
T Consensus 161 i~~L~~~L~d~~~~VR~~A~~aLg~~ 186 (280)
T PRK09687 161 IPLLINLLKDPNGDVRNWAAFALNSN 186 (280)
T ss_pred HHHHHHHhcCCCHHHHHHHHHHHhcC
Confidence 45555666555556666666666665
No 27
>KOG4843|consensus
Probab=31.59 E-value=30 Score=38.52 Aligned_cols=121 Identities=24% Similarity=0.264 Sum_probs=78.1
Q ss_pred chhhHHHhhhcC--cee--eechhhhhhcchhhhhhccccccccCCc--------------cCCccchhhccccCCCCCC
Q psy17488 91 WGFCLFQFLQKG--RVL--TYCPSAVAQAWPIVYSRLHTLFPTIDPT--------------PVSDNRASLLRSSAPPRKP 152 (1446)
Q Consensus 91 wgfclfqflqkg--rvl--tycpsavaqawpivysrlhtlfptidpt--------------pvsdnrasllrssapprkp 152 (1446)
-|-|+|..+.-- .-| --|-+.+|..-| |--++|.+|-||.|. .-+|+-+-|.+-.|++-+|
T Consensus 191 ~~e~~f~~ie~~~~~~le~p~~~~~~~~~~p-~~V~i~~~~~~~~pr~~~~~qlwgTdVytddsD~vt~~~~g~~sll~p 269 (355)
T KOG4843|consen 191 VGECMFELIELWKEYDLEHPDKSGDFANNGP-LEVRIPAEHVTATPRQVRGGQLWGTDVYTDDSDLVTGYCRGTASLLPP 269 (355)
T ss_pred chhhhhhhhhccccccccCcccccccccCCc-eEEEeeccceeccccccccceeecCccccccchhhheecccccCCCCC
Confidence 456777654321 111 124455555555 344566666666553 4478888888888888888
Q ss_pred CccchhhHHHHHHHHhhhccccCCCCCCCCCCCCCCCccccCCCC----CCCCCCCCCCcCCCCCCCCCCCCcc
Q psy17488 153 VSERDSFLSLWRYLAIFSARVVPPVPYPTPRCASPDLSLSWSPEN----VGGERGGAGVENKPAPTASPPPSPT 222 (1446)
Q Consensus 153 vserdsflslwrylaifsarvvppvpyptprcaspdlslswspen----vggerggagvenkpaptaspppspt 222 (1446)
|- +.|....+|+|+.-|=|----.---|-+|+-+. -|=--|+.-|+-||+-|.+---+|+
T Consensus 270 t~----------~~l~v~L~vLP~l~~Y~s~~rn~I~SR~WgS~~~~~~dgls~gi~~I~~kp~l~~~~~~~ph 333 (355)
T KOG4843|consen 270 TM----------QELRVTLRVLPSLDYYTSKLRNNIRSRAWGSGIGCSYDGLSKGIGTIELKPSLTHSSTVEPH 333 (355)
T ss_pred cc----------ceeEEEEEecCchhhccchhccceeecccccCCceeecccccccceEEEecccccccccCcc
Confidence 73 456667788999877665545555677898776 5666677778888887766555554
No 28
>cd01899 Ygr210 Ygr210 subfamily. Ygr210 is a member of Obg-like family and present in archaea and fungi. They are characterized by a distinct glycine-rich motif immediately following the Walker B motif. The Ygr210 and YyaF/YchF subfamilies appear to form one major branch of the Obg-like family. Among eukaryotes, the Ygr210 subfamily is represented only in fungi. These fungal proteins form a tight cluster with their archaeal orthologs, which suggests the possibility of horizontal transfer from archaea to fungi.
Probab=29.27 E-value=57 Score=34.80 Aligned_cols=56 Identities=14% Similarity=0.229 Sum_probs=43.0
Q ss_pred hhccccccccchhhHHHHHhccCC-HHHHHHHHHHHHHHHHHHhhhhhHHHHHHhhh
Q psy17488 228 LVPLLRCDTTDVRDAVVHALGNIN-SEALKDLMEELSVYIREAVDRKTENMRRRRRR 283 (1446)
Q Consensus 228 lvpllrcdttdvrdavvhalgnin-sealkdlmeelsvyireavdrktenmrrrrrr 283 (1446)
|+=+++|..++-.+.|.|..|++| .+.|.-|.+||-.++-+.|.++.+.+.++...
T Consensus 103 ii~Vvd~~~~~d~~~~~~~~~~~dp~~d~~~i~~El~~~d~~~~~~~~~~~~~~~~~ 159 (318)
T cd01899 103 LIHVVDASGGTDAEGNGVETGGHDPLEDIEFLENEIDMWIYGILEKNWEKIVRKADA 159 (318)
T ss_pred EEEEEeCCCCcccccccccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 555667754334467889999998 56788889999999999999888888866553
No 29
>PF15209 IL31: Interleukin 31
Probab=27.94 E-value=59 Score=32.84 Aligned_cols=69 Identities=26% Similarity=0.298 Sum_probs=39.4
Q ss_pred ccccccccchhhHHHHHhccCCHHHHHHHHHHHHHHHHH-HhhhhhHHHHHHhhhchhhhcccCCC-Cccccceec
Q psy17488 230 PLLRCDTTDVRDAVVHALGNINSEALKDLMEELSVYIRE-AVDRKTENMRRRRRRDALRLQSYRPE-SGERKSYRL 303 (1446)
Q Consensus 230 pllrcdttdvrdavvhalgninsealkdlmeelsvyire-avdrktenmrrrrrrdalrlqsyrpe-sgerksyrl 303 (1446)
.+|.|=|-|- ||..||||-|+..-++.+...... +++.-++.+-.-+-.|++-.+-=.|. +-|+|+|-|
T Consensus 46 ~~LPCft~d~-----~~~~nI~sSaIl~ylk~Ik~l~~n~~i~~Ii~~L~kl~~q~~pEt~iS~P~d~fE~K~FiL 116 (137)
T PF15209_consen 46 YQLPCFTPDA-----QAPNNINSSAILPYLKAIKRLSNNTVIDEIIEQLDKLKFQNPPETNISVPTDSFECKSFIL 116 (137)
T ss_pred ccCCCCCCCC-----CCCccccHHHHHHHHHHHHHhcccchHHHHHHHHHhcccCCCCCCCCCCCCCcchhhhhHH
Confidence 3567777773 899999999998888877764433 23333444333333444443333332 334555544
No 30
>PRK09687 putative lyase; Provisional
Probab=26.16 E-value=58 Score=33.88 Aligned_cols=39 Identities=31% Similarity=0.210 Sum_probs=26.8
Q ss_pred hhhhhhccccccccchhhHHHHHhccCCH-HHHHHHHHHH
Q psy17488 224 LYKLLVPLLRCDTTDVRDAVVHALGNINS-EALKDLMEEL 262 (1446)
Q Consensus 224 lykllvpllrcdttdvrdavvhalgnins-ealkdlmeel 262 (1446)
+..+|..++.-+...||-++|+|||+|.. +++..|+.-|
T Consensus 129 a~~~l~~~~~D~~~~VR~~a~~aLg~~~~~~ai~~L~~~L 168 (280)
T PRK09687 129 IVEQSQITAFDKSTNVRFAVAFALSVINDEAAIPLLINLL 168 (280)
T ss_pred HHHHHHHHhhCCCHHHHHHHHHHHhccCCHHHHHHHHHHh
Confidence 44556666666677899999999999964 4454444433
No 31
>PF12972 NAGLU_C: Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain; InterPro: IPR024732 Alpha-N-acetylglucosaminidase is a lysosomal enzyme required for the stepwise degradation of heparan sulphate []. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB, or Sanfilippo syndrome type B) characterised by neurological dysfunction but relatively mild somatic manifestations []. The structure shows that the enzyme is composed of three domains. This C-terminal domain has an all alpha helical fold [].; PDB: 2VC9_A 2VCC_A 2VCB_A 2VCA_A 4A4A_A.
Probab=26.13 E-value=74 Score=33.33 Aligned_cols=88 Identities=19% Similarity=0.178 Sum_probs=38.2
Q ss_pred hhhHHHHHHHH--hhhccccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCcCCCCCCCCCCCCccchhhhhhccccc
Q psy17488 157 DSFLSLWRYLA--IFSARVVPPVPYPTPRCASPDLSLSWSPENVGGERGGAGVENKPAPTASPPPSPTGLYKLLVPLLRC 234 (1446)
Q Consensus 157 dsflslwryla--ifsarvvppvpyptprcaspdlslswspenvggerggagvenkpaptaspppsptglykllvpllrc 234 (1446)
+.....|..|. |++.......+....-|+-|.|........ ....-+=.|.-|.+.+--||.|
T Consensus 16 ~~~~~AW~iL~~tvY~~~~~~~~~~~si~~~rPsl~~~~~~~~---------------~~~~~~Yd~~~l~~A~~~ll~a 80 (267)
T PF12972_consen 16 PEAREAWQILLRTVYNNTGGQYGRGESIFCARPSLNGNSASTW---------------GPPGIWYDPADLEKAWRLLLKA 80 (267)
T ss_dssp HHHHHHHHHHHHTTTS---S---SS--GGGS-S-SS---SSTT---------------------S-HHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHhhhCCCCCCCCCCCcCceeeCCCCCcccccCC---------------CCCCCCCCHHHHHHHHHHHHHh
Confidence 34567788775 333332222222355689998874321111 1111223455566666666665
Q ss_pred cc---------cchhhHHHHHhccCCHHHHHHHH
Q psy17488 235 DT---------TDVRDAVVHALGNINSEALKDLM 259 (1446)
Q Consensus 235 dt---------tdvrdavvhalgninsealkdlm 259 (1446)
.. -||-|..-|+|+|.-...+.+|+
T Consensus 81 ~~~l~~~~~yryDlvDvtRQvL~n~~~~~~~~~~ 114 (267)
T PF12972_consen 81 ADELKDSETYRYDLVDVTRQVLSNYADELYQQLV 114 (267)
T ss_dssp HHHHTT-HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHhcCCCchHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 44 25666666777665555444444
No 32
>PF13976 gag_pre-integrs: GAG-pre-integrase domain
Probab=24.84 E-value=50 Score=26.71 Aligned_cols=20 Identities=30% Similarity=0.416 Sum_probs=16.1
Q ss_pred hHHHHH-hccCCHHHHHHHHH
Q psy17488 241 DAVVHA-LGNINSEALKDLME 260 (1446)
Q Consensus 241 davvha-lgninsealkdlme 260 (1446)
-.+.|+ ||-+|..+|+.|+.
T Consensus 24 ~~lWH~RLGH~~~~~l~~l~~ 44 (67)
T PF13976_consen 24 TDLWHRRLGHPSFQVLKKLSK 44 (67)
T ss_pred HHHHHHhcCCCCHHHHHHHhh
Confidence 345564 99999999999887
No 33
>KOG0414|consensus
Probab=24.72 E-value=55 Score=41.23 Aligned_cols=31 Identities=32% Similarity=0.519 Sum_probs=26.8
Q ss_pred hhhhccccccccchhhHHHHHhccCCHHHHH
Q psy17488 226 KLLVPLLRCDTTDVRDAVVHALGNINSEALK 256 (1446)
Q Consensus 226 kllvpllrcdttdvrdavvhalgninsealk 256 (1446)
-+||+||.|..-++|.|||.++||+--.+|.
T Consensus 315 ~~lv~lld~es~~lRnavlei~~n~V~~~l~ 345 (1251)
T KOG0414|consen 315 TLLVDLLDSESYTLRNAVLEICANLVASELR 345 (1251)
T ss_pred HHHHHhcCCchHHHHHHHHHHHHHHHHHHhc
Confidence 4688999999999999999999998666555
No 34
>KOG2549|consensus
Probab=24.47 E-value=1.1e+02 Score=36.19 Aligned_cols=48 Identities=31% Similarity=0.380 Sum_probs=39.0
Q ss_pred HhccCCHHHHHHHHHHHHHHHHHHhhhhhHHHHHHhhh--------chhhhcccCC
Q psy17488 246 ALGNINSEALKDLMEELSVYIREAVDRKTENMRRRRRR--------DALRLQSYRP 293 (1446)
Q Consensus 246 algninsealkdlmeelsvyireavdrktenmrrrrrr--------dalrlqsyrp 293 (1446)
.++|||.||++-|-++|.--|+|+|..-++-|+--.|+ -|||+-.--|
T Consensus 25 Gi~nl~deaa~~La~dv~yrikEI~Q~aaKfm~hskR~kLtv~DV~~ALr~~nVep 80 (576)
T KOG2549|consen 25 GITNLNDEAALLLAEDVEYRIKEIVQDAAKFMVHSKRTKLTVDDVDYALRSLNVEP 80 (576)
T ss_pred CccccCHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcCcHHHHHHHHhhccccc
Confidence 46799999999999999999999999999999876654 2566655444
No 35
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=24.37 E-value=78 Score=25.84 Aligned_cols=31 Identities=29% Similarity=0.417 Sum_probs=26.2
Q ss_pred cchhhhhhccccccccchhhHHHHHhccCCH
Q psy17488 222 TGLYKLLVPLLRCDTTDVRDAVVHALGNINS 252 (1446)
Q Consensus 222 tglykllvpllrcdttdvrdavvhalgnins 252 (1446)
.|+...|+.+|.=+..+++.+.+.+|+||-.
T Consensus 6 ~~~i~~l~~~l~~~~~~~~~~a~~~l~~l~~ 36 (120)
T cd00020 6 AGGLPALVSLLSSSDENVQREAAWALSNLSA 36 (120)
T ss_pred cCChHHHHHHHHcCCHHHHHHHHHHHHHHhc
Confidence 4677888899987778999999999999864
No 36
>PF14123 DUF4290: Domain of unknown function (DUF4290)
Probab=23.01 E-value=1.9e+02 Score=30.03 Aligned_cols=71 Identities=28% Similarity=0.436 Sum_probs=43.7
Q ss_pred hhhHHHHHHHHhhhcc---ccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCcCCCCCCCCCCCCc------cchhhh
Q psy17488 157 DSFLSLWRYLAIFSAR---VVPPVPYPTPRCASPDLSLSWSPENVGGERGGAGVENKPAPTASPPPSP------TGLYKL 227 (1446)
Q Consensus 157 dsflslwrylaifsar---vvppvpyptprcaspdlslswspenvggerggagvenkpaptaspppsp------tglykl 227 (1446)
|----||--|||.|.- |-.|+|-|++ ++ +..||.+-+=|...+ ..+..|
T Consensus 58 Df~hKLWDhL~ImS~FkLDVD~Py~ip~~-------------e~---------l~~kP~~l~Yp~~~i~yRhYG~~i~~m 115 (176)
T PF14123_consen 58 DFKHKLWDHLFIMSDFKLDVDSPYPIPSK-------------EK---------LGKKPEPLPYPQNRIKYRHYGKNIEEM 115 (176)
T ss_pred hHHHHHHHHHHHHhCCCccCCCCCCCCCH-------------HH---------hccCCCCCCCCCCCCCcccccHHHHHH
Confidence 3345799999998754 4455555443 11 123454444333322 134556
Q ss_pred hhccccccccchhhHHHHHhcc
Q psy17488 228 LVPLLRCDTTDVRDAVVHALGN 249 (1446)
Q Consensus 228 lvpllrcdttdvrdavvhalgn 249 (1446)
+--++-+...+.+||+|-+++|
T Consensus 116 i~ka~~~e~geer~~lv~~iAn 137 (176)
T PF14123_consen 116 IDKAKEMEDGEERDALVRLIAN 137 (176)
T ss_pred HHHHHhCCCHHHHHHHHHHHHH
Confidence 6667788889999999988875
No 37
>PF04484 DUF566: Family of unknown function (DUF566) ; InterPro: IPR007573 This is a family of related proteins that is plant specific.
Probab=21.43 E-value=91 Score=33.69 Aligned_cols=21 Identities=19% Similarity=0.487 Sum_probs=14.7
Q ss_pred hhccc---cccccchhhHHHHHhc
Q psy17488 228 LVPLL---RCDTTDVRDAVVHALG 248 (1446)
Q Consensus 228 lvpll---rcdttdvrdavvhalg 248 (1446)
-|||. +.|+.+|.+||.+|+.
T Consensus 250 rlP~~~ga~~d~~~l~~al~sAvd 273 (311)
T PF04484_consen 250 RLPVTGGAKADVQSLKEALSSAVD 273 (311)
T ss_pred cccccCCceecHHHHHHHHHHHHH
Confidence 35554 5677788888888764
No 38
>PF08757 CotH: CotH protein; InterPro: IPR014867 Members of this group include the Bacillus subtilis spore coat protein H (CotH). Assembly of CotH requires both CotE and GerE and is reuired for the correct assembly of both inner and outer layers of the coat. CotH appears to be a structural component of the coat being localised at the interface of the 2 coat layers [, , ].
Probab=20.95 E-value=2.4e+02 Score=27.77 Aligned_cols=31 Identities=23% Similarity=0.370 Sum_probs=24.7
Q ss_pred HHHHh-ccCCHHHHHHHHHHHHHHHHHHhhhh
Q psy17488 243 VVHAL-GNINSEALKDLMEELSVYIREAVDRK 273 (1446)
Q Consensus 243 vvhal-gninsealkdlmeelsvyireavdrk 273 (1446)
+...+ |.++.+.|..++..+...|..|+.+-
T Consensus 261 ~~~l~~~~~~~~~~~~~i~~~~~~i~~~~~~d 292 (323)
T PF08757_consen 261 WKELRNGVFSPERLEARIDDYAALIRPAMERD 292 (323)
T ss_pred HHHHHHccCCHHHHHHHHHHHHHHHHHHHHhC
Confidence 34444 78899999999999999999887654
No 39
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=20.81 E-value=93 Score=28.45 Aligned_cols=28 Identities=39% Similarity=0.553 Sum_probs=21.7
Q ss_pred hhhhhhccccccc---cchhhHHHHHhccCC
Q psy17488 224 LYKLLVPLLRCDT---TDVRDAVVHALGNIN 251 (1446)
Q Consensus 224 lykllvpllrcdt---tdvrdavvhalgnin 251 (1446)
+=++|-|+|.|=+ ..||-+..+||.||-
T Consensus 25 l~~Il~pVL~~~~D~d~rVRy~AcEaL~ni~ 55 (97)
T PF12755_consen 25 LDEILPPVLKCFDDQDSRVRYYACEALYNIS 55 (97)
T ss_pred HHHHHHHHHHHcCCCcHHHHHHHHHHHHHHH
Confidence 4467778888844 248999999999994
No 40
>PF05069 Phage_tail_S: Phage virion morphogenesis family ; InterPro: IPR006522 This entry represents a number of known and suspected phage virion morphogenesis proteins. The S protein from phage P2 is thought to act in tail completion and stable head joining [].
Probab=20.42 E-value=1.4e+02 Score=27.99 Aligned_cols=38 Identities=16% Similarity=0.372 Sum_probs=27.5
Q ss_pred cccchhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhhh
Q psy17488 235 DTTDVRDAVVHALGNINSEALKDLMEELSVYIREAVDRKT 274 (1446)
Q Consensus 235 dttdvrdavvhalgninsealkdlmeelsvyireavdrkt 274 (1446)
|+..|.+++-+++.+ ...+.+||..|..+|+..+++..
T Consensus 2 ~~~~l~~~L~~L~~~--~~~~~~l~~~ig~~l~~~~~~~f 39 (148)
T PF05069_consen 2 DLEQLQRALERLAAQ--PKDRRPLLRKIGEYLRKSTRRRF 39 (148)
T ss_pred cHHHHHHHHHHHHhC--hHhHHHHHHHHHHHHHHHHHHHH
Confidence 455566666555555 78899999999999988775433
No 41
>TIGR00274 N-acetylmuramic acid 6-phosphate etherase. This protein, MurQ, is involved in recycling components of the bacterial murein sacculus turned over during cell growth. The cell wall metabolite anhydro-N-acetylmuramic acid (anhMurNAc) is converted by a kinase, AnmK, to MurNAc-phosphate, then converted to N-acetylglucosamine-phosphate by this etherase, called MurQ. This family of proteins is similar to the C-terminal half of a number of vertebrate glucokinase regulator proteins and contains a Prosite pattern which is shared by this group of proteins in a region of local similarity.
Probab=20.33 E-value=71 Score=33.80 Aligned_cols=34 Identities=26% Similarity=0.475 Sum_probs=25.4
Q ss_pred chhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhh
Q psy17488 238 DVRDAVVHALGNINSEALKDLMEELSVYIREAVD 271 (1446)
Q Consensus 238 dvrdavvhalgninsealkdlmeelsvyireavd 271 (1446)
+|.-|||-+|++++.+..+.++..-.-.||.|+|
T Consensus 258 ~vk~Ai~~~~~~~~~~~a~~~l~~~~g~~~~~l~ 291 (291)
T TIGR00274 258 NVKLAIVMILSTLSASEAKVLLDRHGGFLRQALD 291 (291)
T ss_pred CcHHHHHHHHhCCCHHHHHHHHHHcCCcHHHhhC
Confidence 6777777778888877777777777777777765
No 42
>TIGR03354 VI_FHA type VI secretion system FHA domain protein. Members of this protein family are FHA (forkhead-associated) domain-containing proteins that are part of type VI secretion loci in a considerable number of bacteria, most of which are known pathogens. Species include Pseudomonas aeruginosa PAO1, Aeromonas hydrophila, Yersinia pestis, Burkholderia mallei, etc.
Probab=20.23 E-value=2.4e+02 Score=31.31 Aligned_cols=47 Identities=30% Similarity=0.461 Sum_probs=35.2
Q ss_pred cccccchhhHHHHHhc----cCCHHHHHHHHHHHHHHHHHHhhhhhHHHHHH
Q psy17488 233 RCDTTDVRDAVVHALG----NINSEALKDLMEELSVYIREAVDRKTENMRRR 280 (1446)
Q Consensus 233 rcdttdvrdavvhalg----ninsealkdlmeelsvyireavdrktenmrrr 280 (1446)
.+|.+ +..|+..+|| .++.++..+||++|...++.+|+--.+-++.|
T Consensus 204 ~~~~~-~~~af~~GlG~~~~~~~~~~~~~~~~~lG~~lr~~v~Gl~~lL~~R 254 (396)
T TIGR03354 204 RSDAV-DLTPFLRGLGLPLELLDSQAAEDVLEELGRLLRAMVEGLLQLLRAR 254 (396)
T ss_pred chhhH-HHHHHHHhCCCCcccCChhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55555 5557777776 36677778899999999999998877777654
Done!