Query psy16052
Match_columns 476
No_of_seqs 409 out of 2690
Neff 7.8
Searched_HMMs 46136
Date Fri Aug 16 19:38:38 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16052.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16052hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 TIGR03212 uraD_N-term-dom puta 100.0 2.6E-33 5.6E-38 277.2 20.1 224 181-455 66-295 (297)
2 PRK15394 4-deoxy-4-formamido-L 100.0 3E-27 6.4E-32 233.0 22.6 228 181-462 4-280 (296)
3 TIGR03006 pepcterm_polyde poly 99.9 6.4E-24 1.4E-28 207.2 19.3 210 193-448 29-264 (265)
4 TIGR02764 spore_ybaN_pdaB poly 99.9 9E-23 1.9E-27 190.8 19.1 184 178-448 5-191 (191)
5 TIGR02884 spore_pdaA delta-lac 99.9 5.2E-22 1.1E-26 190.1 20.1 117 178-309 36-155 (224)
6 TIGR02873 spore_ylxY probable 99.9 7.7E-22 1.7E-26 193.0 18.8 182 179-449 85-268 (268)
7 PF01522 Polysacc_deac_1: Poly 99.8 2.2E-21 4.7E-26 167.8 8.9 118 177-310 4-123 (123)
8 COG0726 CDA1 Predicted xylanas 99.8 2.6E-17 5.7E-22 160.2 19.4 115 179-308 65-181 (267)
9 PRK14581 hmsF outer membrane N 99.5 1.4E-13 3.1E-18 148.5 13.6 125 178-314 106-275 (672)
10 PRK14582 pgaB outer membrane N 99.5 2.4E-13 5.2E-18 147.0 13.5 125 178-314 106-275 (671)
11 PF00057 Ldl_recept_a: Low-den 99.0 3.9E-10 8.4E-15 75.9 2.6 36 102-137 2-37 (37)
12 cd00112 LDLa Low Density Lipop 98.9 1.4E-09 2.9E-14 72.5 2.4 35 103-137 1-35 (35)
13 smart00192 LDLa Low-density li 98.5 5.9E-08 1.3E-12 63.7 2.7 32 103-134 2-33 (33)
14 PF09960 DUF2194: Uncharacteri 98.1 0.00016 3.4E-09 78.0 18.4 206 193-452 288-531 (585)
15 PF10096 DUF2334: Uncharacteri 98.0 0.00029 6.3E-09 68.4 15.9 99 194-304 18-143 (243)
16 PF01607 CBM_14: Chitin bindin 97.9 4.8E-06 1E-10 60.4 1.0 53 26-83 1-53 (53)
17 COG3233 Predicted deacetylase 97.7 0.00054 1.2E-08 63.9 11.7 123 181-315 6-143 (233)
18 smart00494 ChtBD2 Chitin-bindi 97.1 0.00044 9.5E-09 50.4 3.3 49 25-78 2-50 (56)
19 PF11959 DUF3473: Domain of un 97.0 0.0034 7.3E-08 55.0 8.2 82 348-449 29-131 (133)
20 PF12999 PRKCSH-like: Glucosid 96.5 0.0029 6.3E-08 57.6 3.8 49 78-134 55-110 (176)
21 PF00057 Ldl_recept_a: Low-den 95.5 0.0062 1.3E-07 40.9 1.2 33 56-92 5-37 (37)
22 cd00112 LDLa Low Density Lipop 95.5 0.0066 1.4E-07 40.2 1.3 32 57-92 4-35 (35)
23 KOG1215|consensus 94.9 0.02 4.4E-07 65.9 3.7 74 56-139 220-296 (877)
24 KOG1215|consensus 94.9 0.067 1.5E-06 61.6 7.8 115 56-175 140-292 (877)
25 PF03065 Glyco_hydro_57: Glyco 94.8 0.071 1.5E-06 54.8 7.0 196 227-453 114-322 (360)
26 COG1449 Alpha-amylase/alpha-ma 94.6 0.6 1.3E-05 51.3 13.7 79 227-309 183-262 (615)
27 PF04748 Polysacc_deac_2: Dive 93.8 2.8 6.1E-05 39.8 15.1 182 211-449 19-213 (213)
28 PF15421 Polysacc_deac_3: Puta 92.5 5.1 0.00011 40.9 15.0 167 226-451 132-303 (425)
29 smart00192 LDLa Low-density li 92.1 0.08 1.7E-06 34.5 1.3 28 58-89 6-33 (33)
30 COG2861 Uncharacterized protei 81.4 53 0.0012 31.7 15.9 204 178-449 28-245 (250)
31 PF12999 PRKCSH-like: Glucosid 77.9 1.6 3.5E-05 39.9 2.3 31 105-135 35-67 (176)
32 KOG2397|consensus 75.3 2.1 4.5E-05 44.8 2.6 49 82-138 65-117 (480)
33 COG4878 Uncharacterized protei 68.4 1.1E+02 0.0024 30.1 12.2 74 230-306 52-130 (309)
34 KOG3509|consensus 63.5 13 0.00029 42.6 5.9 79 55-139 33-112 (964)
35 PF07315 DUF1462: Protein of u 62.0 27 0.00058 28.3 5.7 53 376-434 10-63 (93)
36 COG3603 Uncharacterized conser 46.5 58 0.0012 27.8 5.5 67 167-240 52-127 (128)
37 cd02995 PDI_a_PDI_a'_C PDIa fa 43.4 48 0.001 26.4 4.8 51 394-445 9-59 (104)
38 PF01074 Glyco_hydro_38: Glyco 42.9 1.2E+02 0.0026 29.5 8.3 80 227-314 64-145 (275)
39 COG4837 Uncharacterized protei 36.9 69 0.0015 26.1 4.4 53 376-434 17-70 (106)
40 COG3536 Uncharacterized protei 36.5 22 0.00048 29.7 1.6 81 166-248 2-92 (120)
41 COG1543 Uncharacterized conser 36.4 89 0.0019 33.1 6.3 76 228-310 128-204 (504)
42 PRK05406 LamB/YcsF family prot 35.6 2.9E+02 0.0064 26.8 9.3 82 194-287 22-110 (246)
43 PTZ00062 glutaredoxin; Provisi 33.3 1.1E+02 0.0024 28.7 6.1 41 401-445 15-55 (204)
44 PRK12569 hypothetical protein; 33.1 3.2E+02 0.0069 26.5 9.1 83 194-288 25-114 (245)
45 cd02992 PDI_a_QSOX PDIa family 31.9 1E+02 0.0022 25.7 5.1 50 394-444 10-60 (114)
46 COG1540 Uncharacterized protei 31.6 4.9E+02 0.011 25.2 10.5 83 194-288 22-111 (252)
47 KOG1958|consensus 31.0 9.1E+02 0.02 28.2 15.0 44 227-274 229-273 (1129)
48 cd07286 PX_SNX18 The phosphoin 28.3 1.1E+02 0.0023 26.6 4.6 74 389-468 42-122 (127)
49 PF12340 DUF3638: Protein of u 28.1 3.1E+02 0.0067 26.4 8.1 61 385-449 83-143 (229)
50 KOG1225|consensus 27.0 1.6E+02 0.0035 31.9 6.5 15 149-164 345-360 (525)
51 TIGR01068 thioredoxin thioredo 26.0 1.4E+02 0.0029 23.3 4.7 48 394-444 5-52 (101)
52 KOG3509|consensus 25.6 78 0.0017 36.7 4.1 59 78-140 11-69 (964)
53 PTZ00413 lipoate synthase; Pro 25.0 3.1E+02 0.0068 28.5 7.9 70 383-460 276-345 (398)
54 TIGR01126 pdi_dom protein disu 24.8 1.1E+02 0.0023 24.1 3.9 49 394-444 5-53 (102)
55 cd02956 ybbN ybbN protein fami 24.5 1.2E+02 0.0025 23.9 4.0 47 395-444 3-50 (96)
56 PF07579 DUF1548: Domain of Un 24.1 33 0.00071 29.8 0.7 48 385-432 47-94 (135)
57 COG4637 Predicted ATPase [Gene 23.5 1.9E+02 0.0042 29.3 5.9 61 385-448 273-333 (373)
58 cd02957 Phd_like Phosducin (Ph 23.4 2.1E+02 0.0046 23.5 5.6 37 403-443 24-60 (113)
59 cd02996 PDI_a_ERp44 PDIa famil 22.0 1.8E+02 0.0039 23.5 4.8 40 394-434 10-49 (108)
60 cd02954 DIM1 Dim1 family; Dim1 22.0 2.8E+02 0.006 23.5 5.9 46 393-442 3-50 (114)
61 PLN02701 alpha-mannosidase 21.6 2.7E+02 0.0057 33.1 7.5 75 227-309 106-182 (1050)
62 cd02987 Phd_like_Phd Phosducin 20.6 2.3E+02 0.005 25.8 5.6 37 404-444 84-120 (175)
63 TIGR01358 DAHP_synth_II 3-deox 20.2 6E+02 0.013 26.9 8.9 77 195-274 293-379 (443)
No 1
>TIGR03212 uraD_N-term-dom putative urate catabolism protein. This model represents a protein that is predominantly found just upstream of the UraD protein (OHCU decarboxylase) and in a number of instances as a N-terminal fusion with it. UraD itself catalyzes the last step in the catabolism of urate to allantoate. The function of this protein is presently unknown. It shows homology with the pfam01522 polysaccharide deacetylase domain family.
Probab=100.00 E-value=2.6e-33 Score=277.22 Aligned_cols=224 Identities=18% Similarity=0.257 Sum_probs=181.8
Q ss_pred EEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccC--CChHHHHHHHHcCCeeeeCCCCCCCCccccCCCCH
Q psy16052 181 ITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKY--TNYSAVQELHRKGHEVAVHSITHNDDENFWSNATV 258 (476)
Q Consensus 181 V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~--~~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~ 258 (476)
..+++-+..+..+.++++++|++ ++||+|||+++.. .+++++++|.++|||||+||++|... ..++.
T Consensus 66 ~~~s~~~YG~rvG~~RiLdlL~~-------~gv~aTffv~g~~~e~~P~~v~~i~~~GHEIg~Hg~~H~~~----~~ls~ 134 (297)
T TIGR03212 66 SMESLYEYGSRAGFWRLLRLFTE-------RGIPVTVFGVAMALARNPEAVAAMKEAGWEIASHGLRWIDY----QDMDE 134 (297)
T ss_pred cchhhhhhcchhCHHHHHHHHHH-------cCCCEEEEeEHHHHHHCHHHHHHHHHcCCEEeeccccCccc----ccCCH
Confidence 35556566667778999999999 9999999999864 34599999999999999999999875 46899
Q ss_pred HHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHH-cCCeEeeecccCCCCCCCCCccccccCCCCcCC
Q psy16052 259 DDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEE-QAFLYDSTITAPLSNPPLWPYTMYFRMPHRCHG 337 (476)
Q Consensus 259 e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~-~G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~~ 337 (476)
++.+++|.++.++|+ .++|..|+|||++ ..+.+ ++++|++ .||+|+||+..+ +.|+|..+
T Consensus 135 ~~e~~~i~~s~~~i~---~~tG~~P~G~~~~--~~s~~-T~~LL~e~~Gf~Y~sd~~~d--D~Py~~~~----------- 195 (297)
T TIGR03212 135 AQEREHIAEAIRLHT---EVTGERPLGWYTG--RTSPN-TRRLVAEEGGFLYDADSYAD--DLPYWDEV----------- 195 (297)
T ss_pred HHHHHHHHHHHHHHH---HHhCCCCceEECC--CCChh-HHHHHHHhcCceEeCchhhc--CCCeEeec-----------
Confidence 999999999999999 5889999999975 55665 8999998 999999999976 34555332
Q ss_pred CCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCC---cceEEeecchh
Q psy16052 338 NLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNR---APLGLYFHAAW 414 (476)
Q Consensus 338 ~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r---~p~~i~~H~~w 414 (476)
...+++.||+++..++ ..|.. . ....++++++++|+++|+.+|+..+ ..|.|.+| ++
T Consensus 196 -------~~~~~l~lP~~~~~nD--~~~~~-~---------~~~~~~~~~~~~~~d~fd~l~~eg~~~~~~~~i~lH-p~ 255 (297)
T TIGR03212 196 -------AGRPQLIVPYTLDAND--MRFAT-P---------QGFNTGEQFFTYLRDAFDVLYAEGEGAPKMMSIGLH-CR 255 (297)
T ss_pred -------CCCCeEEEecccccCc--HHHHh-c---------cCCCCHHHHHHHHHHHHHHHHHhCCCCCceEEEecC-cc
Confidence 1124677999875432 11211 0 1125789999999999999997543 35789999 77
Q ss_pred hcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhhCCc
Q psy16052 415 LKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQNPR 455 (476)
Q Consensus 415 ~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~~p~ 455 (476)
++|+|.|+.+|++||+||++ +++|||+|.+||++|++...
T Consensus 256 i~G~p~R~~~L~~~l~~i~~-~~~VW~at~~eIA~~~~~~~ 295 (297)
T TIGR03212 256 LVGRPGRIAALQRFLDYVQS-HDKVWVARRIDIARHWHETH 295 (297)
T ss_pred ccCCHHHHHHHHHHHHHHHh-CCCEEEEcHHHHHHHHHHhC
Confidence 99999999999999999998 99999999999999986543
No 2
>PRK15394 4-deoxy-4-formamido-L-arabinose-phosphoundecaprenol deformylase ArnD; Provisional
Probab=99.96 E-value=3e-27 Score=233.05 Aligned_cols=228 Identities=16% Similarity=0.191 Sum_probs=176.8
Q ss_pred EEEeecC---CCCCccHHHHHHHHhccCCCCCCCCceEEEEecccC----------------------------------
Q psy16052 181 ITITFDD---AINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKY---------------------------------- 223 (476)
Q Consensus 181 V~lTFDd---g~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~---------------------------------- 223 (476)
|+|-.|- -.+..+.++++++|++ ++++||||.+..-
T Consensus 4 ~~l~idvdt~~~~~~g~~~~~~~~~~-------~~~~a~f~~~~gpd~~g~~~~r~~~~~fl~k~~r~~~~~~yg~~~l~ 76 (296)
T PRK15394 4 VGLRIDVDTFRGTREGVPRLLEILSK-------HGIQASFFFSVGPDNMGRHLWRLLKPRFLWKMLRSNAASLYGWDILL 76 (296)
T ss_pred eEEEEEeecccccccCHHHHHHHHHH-------cCCCEEEEeccCCCchhHHHHHHhhhHHHHHHHhcCchhhcCHHHHh
Confidence 5555552 1345678999999999 9999999997310
Q ss_pred -----C-------ChHHHHHHHHcCCeeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCc
Q psy16052 224 -----T-------NYSAVQELHRKGHEVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYL 291 (476)
Q Consensus 224 -----~-------~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l 291 (476)
+ ++++++++.++ ||||+|||+|.........++.++++++|.++.++|+ .++|..++|||+|++
T Consensus 77 ~g~~~PG~~I~~~~P~lvr~i~~~-HEIG~Hg~~H~~wq~~~~~ls~~e~~~eI~ra~~~Le---~itG~~p~g~raPgw 152 (296)
T PRK15394 77 AGTAWPGKEIGKALADIIREAAKA-HEVGLHAWDHHAWQAWSGVWSRQQLIEQIARGVDALE---EIIGQPVTCSAAAGW 152 (296)
T ss_pred hcccCCCchHHHHCHHHHHHHHhc-CEehhcccCccchhcccccCCHHHHHHHHHHHHHHHH---HHhCCCCCEEeCCCc
Confidence 0 23789999999 9999999999953222346899999999999999999 589999999999999
Q ss_pred ccCchHHHHHHHHcCCeEeeecccCCCCCCCCCccccccCCCCcCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCC
Q psy16052 292 RVGGNNQFTMMEEQAFLYDSTITAPLSNPPLWPYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPG 371 (476)
Q Consensus 292 ~~~~~~~~~~l~~~G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g 371 (476)
+.+.+ +++++++.||.|+||+..+ .|++|.+.+ ...+++|||+++..+ |+.. |
T Consensus 153 ~~~~~-tl~ll~e~Gf~Y~Ss~~~d---~Py~~~~~~----------------g~~~~veIP~tl~t~------D~~~-g 205 (296)
T PRK15394 153 RADQR-VVEAKEAFGFRYNSDCRGT---HPFRPLLPD----------------GSLGTVQIPVTLPTW------DEVI-G 205 (296)
T ss_pred cCCHH-HHHHHHHcCCeeecCCCCC---CceeeecCC----------------CcCCeEEEccccccc------hhhh-c
Confidence 99876 8999999999999999753 566654421 223589999997432 2211 1
Q ss_pred CcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHh
Q psy16052 372 CAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWI 451 (476)
Q Consensus 372 ~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~ 451 (476)
...+.+++.+++++.|++ ..+. .++++| ++++|.| ++.+|+++|+++++ .+|||+|++||+.-.
T Consensus 206 --------~~~~~~~f~~~~~d~~~~---~~g~-~V~tiH-P~v~G~~-~~~~fe~lL~~~~~--~gv~F~tl~el~~~~ 269 (296)
T PRK15394 206 --------RDVKAEDFNDFILDRILR---DKGT-PVYTIH-AEVEGIA-YAHNFEDLLKRAAQ--EGITFCPLSELLPDD 269 (296)
T ss_pred --------CCCCHHHHHHHHHHHhcc---ccCc-eEEEEC-CCcCCcc-hHHHHHHHHHHHhc--CCcEEeeHHHHHhhc
Confidence 125678899999777754 2233 489999 5688888 99999999999985 679999999999766
Q ss_pred hCCcchhcccc
Q psy16052 452 QNPRTITEVKN 462 (476)
Q Consensus 452 ~~p~~~~~~~~ 462 (476)
++..|+.++..
T Consensus 270 ~~~~p~~~~~~ 280 (296)
T PRK15394 270 LETLPLGKVVR 280 (296)
T ss_pred hhhCCccceee
Confidence 68888777654
No 3
>TIGR03006 pepcterm_polyde polysaccharide deactylase family protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide deacetylases (pfam01522). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene. The highest scoring homologs below the trusted cutoff for this model are found in several species of Methanosarcina, an archaeal genus.
Probab=99.92 E-value=6.4e-24 Score=207.16 Aligned_cols=210 Identities=22% Similarity=0.361 Sum_probs=145.4
Q ss_pred cHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCCCCHHHHHHHHHHHHH
Q psy16052 193 NIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSNATVDDWAKEMAGMRI 270 (476)
Q Consensus 193 ~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~ 270 (476)
++.+++++|++ ++++|||||++... ..++|++|.++|||||+||++|... ..++.+++++||.++++
T Consensus 29 nt~riL~lL~~-------~gikATFFv~g~~~e~~p~lir~i~~~GhEIgsHg~sH~~l----~~ls~ee~~~eI~~s~~ 97 (265)
T TIGR03006 29 NTDRILDLLDR-------HGVKATFFTLGWVAERYPELVRRIVAAGHELASHGYGHERV----TTQTPEAFRADIRRSKA 97 (265)
T ss_pred hHHHHHHHHHH-------cCCcEEEEEeccchhhCHHHHHHHHHcCCEeeeccccCcCc----hhCCHHHHHHHHHHHHH
Confidence 67899999999 99999999998643 3489999999999999999999975 56899999999999999
Q ss_pred HHHHhhcccCCcccEEecCCcccCchH--HHHHHHHcCCeEeeecc---cCCCCCC---CCCccccccCCCCcCCCCcCC
Q psy16052 271 IAEKYANLSDNSVVGVRAPYLRVGGNN--QFTMMEEQAFLYDSTIT---APLSNPP---LWPYTMYFRMPHRCHGNLQHC 342 (476)
Q Consensus 271 ~l~~~~~i~g~~~~gfR~P~l~~~~~~--~~~~l~~~G~~Ydss~~---~~~~~~p---~~pytl~~~~~~~c~~~~~~c 342 (476)
+|+ .++|..++|||+|++..+..+ .+++|+++||.||||+. .+..+.| .+||.
T Consensus 98 ~Le---~itG~~~~gfRaP~~s~~~~t~~a~~iL~e~Gy~YdsS~~p~~~d~~g~~~~~~~p~~---------------- 158 (265)
T TIGR03006 98 LLE---DLSGQPVRGYRAPSFSIGKKNLWALDVLAEAGYRYSSSIYPVRHDHYGMPDAPRFPFR---------------- 158 (265)
T ss_pred HHH---HHhCCCceEEECCCCCCCCCcHHHHHHHHHCCCEEEEeeccCcCCcCCCCCCCCCcee----------------
Confidence 999 588999999999999887652 37999999999999995 2211122 12221
Q ss_pred CCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchhhcC--ChH
Q psy16052 343 PTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKN--NPE 420 (476)
Q Consensus 343 p~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~--~p~ 420 (476)
....+|+|||++.+. .++..+| +. ..+-.. +-|-.+.+++ +.+.....+.|+++||| +|... +|.
T Consensus 159 -~~~g~l~e~P~s~~~-----~~~~~~p-~~-gg~~~r-~~P~~~~~~~---~~~~~~~~~~~~~~y~H-pwe~dp~~~~ 225 (265)
T TIGR03006 159 -PPNGRLLEVPVTTVR-----LGGRNLP-AA-GGGYFR-LLPYALSRWA---LRRVNGREGRPAIFYFH-PWEIDPDQPR 225 (265)
T ss_pred -cCCCcEEEEccchhh-----hcccccc-cC-CCcchh-cChHHHHHHH---HHHHHhcCCceEEEEEc-hhhcCCCCCC
Confidence 112349999998753 1222233 00 000000 1223333333 33333333689999999 77763 221
Q ss_pred --------------HHHHHHHHHHHHHhhCCCEEEEeHHHHH
Q psy16052 421 --------------FLDAFLYWVDEIIERHKDVYFVTMTQVI 448 (476)
Q Consensus 421 --------------~~~al~~fl~~i~~~~~dV~fvT~~qi~ 448 (476)
.+...+.=|+.++ .|.-|+|++|++
T Consensus 226 ~~~~~~~~~fr~y~~~~~~~~~l~~ll---~~~~~~~~~~~~ 264 (265)
T TIGR03006 226 IAGISMKSRFRHYLNLDRMEGRLRRLL---GDFRWGRMDRVF 264 (265)
T ss_pred CCCCCccceeeeecCHHHHHHHHHHHH---hcCceeeHHHhh
Confidence 1234444456665 488999999986
No 4
>TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB. This model describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reducted sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterized polysaccharide deacetylase; the exact function this protein family is unknown.
Probab=99.90 E-value=9e-23 Score=190.76 Aligned_cols=184 Identities=20% Similarity=0.287 Sum_probs=143.2
Q ss_pred CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCC
Q psy16052 178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSN 255 (476)
Q Consensus 178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~ 255 (476)
.+.|+||||||+....+.+++++|++ ++++|||||++.+. .++++++|.++|||||+|||+|... ..
T Consensus 5 ~k~V~LTFDDgp~~~~t~~~l~~L~~-------~~ikaTfFv~g~~~~~~~~~~~~i~~~Gheig~Ht~~H~~~----~~ 73 (191)
T TIGR02764 5 DKKIALTFDISWGNDYTEPILDTLKE-------YDVKATFFLSGSWAERHPELVKEIVKDGHEIGSHGYRHKNY----TT 73 (191)
T ss_pred CCEEEEEEECCCCcccHHHHHHHHHH-------cCCCEEEEeccHHHHHCHHHHHHHHhCCCEEEECCcCCCCc----cc
Confidence 46899999999998777899999999 99999999999753 3489999999999999999999975 56
Q ss_pred CCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCeEee-ecccCCCCCCCCCccccccCCCC
Q psy16052 256 ATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFLYDS-TITAPLSNPPLWPYTMYFRMPHR 334 (476)
Q Consensus 256 ~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~Yds-s~~~~~~~~p~~pytl~~~~~~~ 334 (476)
++.+++++||.++.+.|+ .+.|..+++||+||+..+.. ++++++++||.+-. ++- ...|-
T Consensus 74 ~~~~~~~~ei~~~~~~l~---~~~g~~~~~fr~P~G~~~~~-~~~~l~~~G~~~v~w~~~-----~~D~~---------- 134 (191)
T TIGR02764 74 LEDEKIKKDILRAQEIIE---KLTGKKPTLFRPPSGAFNKA-VLKAAESLGYTVVHWSVD-----SRDWK---------- 134 (191)
T ss_pred CCHHHHHHHHHHHHHHHH---HHhCCCCCEEECCCcCCCHH-HHHHHHHcCCeEEEecCC-----CCccC----------
Confidence 899999999999999998 57788999999999999876 89999999995211 100 00110
Q ss_pred cCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchh
Q psy16052 335 CHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAW 414 (476)
Q Consensus 335 c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w 414 (476)
..+++++++.+..+. .. + -+|.+|..
T Consensus 135 ----------------------------------------------~~~~~~i~~~~~~~~---~~--g--~Iil~Hd~- 160 (191)
T TIGR02764 135 ----------------------------------------------NPGVESIVDRVVKNT---KP--G--DIILLHAS- 160 (191)
T ss_pred ----------------------------------------------CCCHHHHHHHHHhcC---CC--C--CEEEEeCC-
Confidence 013445554443322 12 2 37889942
Q ss_pred hcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHH
Q psy16052 415 LKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVI 448 (476)
Q Consensus 415 ~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~ 448 (476)
...+.-.++|.++|+++++ .+..|||.+|++
T Consensus 161 -~~~~~t~~~l~~~i~~l~~--~Gy~~vtl~~l~ 191 (191)
T TIGR02764 161 -DSAKQTVKALPTIIKKLKE--KGYEFVTISELI 191 (191)
T ss_pred -CCcHhHHHHHHHHHHHHHH--CCCEEEEHHHhC
Confidence 1245678999999999986 799999999874
No 5
>TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase. Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores in the peptidoglycan wall, or spore cortex. The proteins in this family are PdaA (yfjS), a member of a larger family of polysaccharide deacetylases, and are specificially involved in delta-lactam biosynthesis. PdaA acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the following transpeptidation for lactam ring formation, as heterologous expression in E. coli of CwlD and PdaA together is sufficient for delta-lactam production.
Probab=99.89 E-value=5.2e-22 Score=190.07 Aligned_cols=117 Identities=18% Similarity=0.327 Sum_probs=105.2
Q ss_pred CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCC
Q psy16052 178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSN 255 (476)
Q Consensus 178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~ 255 (476)
.+.|+||||||+.....++++++|++ ++++|||||++.+. +.+++++|+++|||||+|||+|... ..
T Consensus 36 ~k~VaLTFDDGp~~~~t~~lL~~L~~-------~~vkATFFv~G~~~~~~p~~ir~i~~~GheIgnHt~~H~~~----~~ 104 (224)
T TIGR02884 36 KKVIYLTFDNGYENGYTPKILDVLKE-------KKVPAAFFVTGHYIKTQPDLIKRMVDEGHIVGNHSVHHPSL----TA 104 (224)
T ss_pred CCEEEEEEECCCCccchHHHHHHHHH-------cCCCeEEEeechhhHHCHHHHHHHHHcCCEeeecCccCcCc----cc
Confidence 57899999999987777899999999 99999999999764 4589999999999999999999875 56
Q ss_pred CCHHHHHHHHHHHHHHHHHhhcccCCc-ccEEecCCcccCchHHHHHHHHcCCeE
Q psy16052 256 ATVDDWAKEMAGMRIIAEKYANLSDNS-VVGVRAPYLRVGGNNQFTMMEEQAFLY 309 (476)
Q Consensus 256 ~s~e~~~~ei~~~~~~l~~~~~i~g~~-~~gfR~P~l~~~~~~~~~~l~~~G~~Y 309 (476)
++.+++++||.++.++|+ .++|.. +..||+||+..+.. ++++++++||.+
T Consensus 105 ls~~~~~~ei~~~~~~i~---~~~G~~~~~~fR~P~G~~~~~-~~~~l~~~Gy~~ 155 (224)
T TIGR02884 105 VNDEKFKEELTGVEEEFK---KVTGQKEMKYFRPPRGVFSER-TLAYTKELGYYT 155 (224)
T ss_pred CCHHHHHHHHHHHHHHHH---HHhCCCCCCEEeCCCCCcCHH-HHHHHHHcCCcE
Confidence 899999999999999999 477877 89999999999876 899999999963
No 6
>TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family. Members of this protein family are most closely related to TIGR02764, a subset of polysaccharide deacetylase family proteins found in a species if and only if the species forms endospores like those of Bacillus subtilis or Clostridium tetani. This family is likewise restricted to spore-formers, but is not universal among them in having sequences with full-length matches to the model.
Probab=99.88 E-value=7.7e-22 Score=193.01 Aligned_cols=182 Identities=17% Similarity=0.204 Sum_probs=141.6
Q ss_pred CeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCCC
Q psy16052 179 QMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSNA 256 (476)
Q Consensus 179 q~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~ 256 (476)
.+|+||||||+....+++++++|++ ++++||||++|.+. +.+++++++++|||||+|||+|... ..+
T Consensus 85 k~VaLTFDdg~~~~~t~~iL~iLkk-------~~vkATFFv~G~~i~~~p~l~k~i~~~GheIGnHT~sH~~l----~~l 153 (268)
T TIGR02873 85 PMVALLINVAWGNEYLPEILQILKK-------HDVKATFFLEGKWVKENSQLAKMIVEQGHEIGNHAYNHPDM----ATL 153 (268)
T ss_pred CEEEEEEeCCCCcchHHHHHHHHHH-------CCCCEEEEeehHhhhHCHHHHHHHHHCCCEEEecCCcCCCc----ccC
Confidence 6899999999988888999999999 99999999999753 4589999999999999999999885 568
Q ss_pred CHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCeEeeecccCCCCCCCCCccccccCCCCcC
Q psy16052 257 TVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFLYDSTITAPLSNPPLWPYTMYFRMPHRCH 336 (476)
Q Consensus 257 s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~ 336 (476)
+.+++++||.++.++|+ .++|..++.||+||+..+.. ++++++++||..- .| +.+.
T Consensus 154 s~~~~~~Ei~~~~~~i~---~~~G~~p~~fRpP~G~~n~~-~~~~l~~~G~~~v-----------~W--svd~------- 209 (268)
T TIGR02873 154 SKEEIYDQINQTNEIIE---ATIGVTPKWFAPPSGSFNDN-VVQIAADLQMGTI-----------MW--TVDT------- 209 (268)
T ss_pred CHHHHHHHHHHHHHHHH---HHhCCCCCEEECCCCCCCHH-HHHHHHHCCCeEE-----------Ee--ccCC-------
Confidence 99999999999999999 57888999999999998876 8999999998421 11 1110
Q ss_pred CCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchhhc
Q psy16052 337 GNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLK 416 (476)
Q Consensus 337 ~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~ 416 (476)
.|.. ..+++.+.+.+... .. ++ -+|.+|..
T Consensus 210 ----------------------------~Dw~------------~~~~~~i~~~v~~~----~~-~G--~IILmHd~--- 239 (268)
T TIGR02873 210 ----------------------------IDWK------------NPSPSVMVNRVLSK----IH-PG--AMVLMHPT--- 239 (268)
T ss_pred ----------------------------CCCC------------CCCHHHHHHHHHhc----CC-CC--cEEEEcCC---
Confidence 0000 01344444444221 11 22 36788943
Q ss_pred CChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHH
Q psy16052 417 NNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQ 449 (476)
Q Consensus 417 ~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~ 449 (476)
....++|.++|.++++ .+.-|||.+|+++
T Consensus 240 --~~T~~aL~~iI~~Lk~--kGy~fvtl~ell~ 268 (268)
T TIGR02873 240 --ASSTEGLEEMITIIKE--KGYKIGTITELLD 268 (268)
T ss_pred --ccHHHHHHHHHHHHHH--CCCEEEeHHHhhC
Confidence 2357899999999985 8999999999863
No 7
>PF01522 Polysacc_deac_1: Polysaccharide deacetylase; InterPro: IPR002509 This domain is found in polysaccharide deacetylase. This family of polysaccharide deacetylases includes NodB (nodulation protein B from Rhizobium) which is a chitooligosaccharide deacetylase []. It also includes chitin deacetylase from yeast [], and endoxylanases which hydrolyses glucosidic bonds in xylan [].; GO: 0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, 0005975 carbohydrate metabolic process; PDB: 2IW0_A 2CC0_B 2VYO_A 2J13_A 2C71_A 2C79_A 1W1A_1 1W1B_1 1W17_A 1NY1_B ....
Probab=99.85 E-value=2.2e-21 Score=167.79 Aligned_cols=118 Identities=25% Similarity=0.439 Sum_probs=98.3
Q ss_pred CCCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCCC--hHHHHHHHHcCCeeeeCCCCCCCCccccC
Q psy16052 177 VPQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYTN--YSAVQELHRKGHEVAVHSITHNDDENFWS 254 (476)
Q Consensus 177 ~Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~--~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~ 254 (476)
.++.|+||||||+. ....+++++|++ +++++||||++.+.. ++++++|.++|||||+||++|... .
T Consensus 4 ~~~~v~ltfDdg~~-~~~~~~~~~l~~-------~~i~at~fv~~~~~~~~~~~l~~l~~~G~ei~~H~~~H~~~----~ 71 (123)
T PF01522_consen 4 PKKSVALTFDDGYR-DNYDRLLPLLKK-------YGIPATFFVIGSWVERYPDQLRELAAAGHEIGNHGWSHPNL----S 71 (123)
T ss_dssp TSSEEEEEEESHCH-THHHHHHHHHHH-------TT--EEEEE-HHHHHHHHHHHHHHHHTT-EEEEE-SSSSCG----G
T ss_pred CCCEEEEEEecCch-hhHHHHHHHHHh-------cccceeeeecccccccccccchhHHHHHHHHHhcCCccccc----c
Confidence 46899999999994 445789999999 999999999987542 389999999999999999999985 5
Q ss_pred CCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCeEe
Q psy16052 255 NATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFLYD 310 (476)
Q Consensus 255 ~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~Yd 310 (476)
.++.+++++||.++++.|+ .+.|..+.+||+||+..+.+ ++++|+++||.||
T Consensus 72 ~~~~~~~~~ei~~~~~~l~---~~~g~~~~~f~~P~g~~~~~-~~~~l~~~G~~y~ 123 (123)
T PF01522_consen 72 TLSPEELRREIERSREILE---EITGRPPKGFRYPFGSYDDN-TLQALREAGYKYD 123 (123)
T ss_dssp GS-HHHHHHHHHHHHHHHH---HHHSSEESEEE-GGGEECHH-HHHHHHHTT-EEE
T ss_pred cCCHHHHHHHHHHHHHHHH---HHhCCCCcEEECCCCCCCHH-HHHHHHHcCCCcC
Confidence 6899999999999999999 47788999999999999987 9999999999997
No 8
>COG0726 CDA1 Predicted xylanase/chitin deacetylase [Carbohydrate transport and metabolism]
Probab=99.76 E-value=2.6e-17 Score=160.15 Aligned_cols=115 Identities=23% Similarity=0.409 Sum_probs=103.0
Q ss_pred CeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC--ChHHHHHHHHcCCeeeeCCCCCCCCccccCCC
Q psy16052 179 QMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT--NYSAVQELHRKGHEVAVHSITHNDDENFWSNA 256 (476)
Q Consensus 179 q~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~--~~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~ 256 (476)
..|+||||||+...++++++++|++ ++++|||||++.+. ..++++++.+.|||||+|+++|... ..+
T Consensus 65 k~v~lTFDDg~~~~~~~~il~iL~k-------~~i~ATfFv~g~~i~~~p~~~~~~~~~Gheig~H~~~h~~~----~~~ 133 (267)
T COG0726 65 KAVALTFDDGPLDGNTPRILPLLKK-------YGIKATFFVVGSWIERNPDLVKRIAEAGHEIGNHGYDHPDL----QDL 133 (267)
T ss_pred CeEEEEeecCCCCCCcHHHHHHHHH-------cCCceEEEEehHhhHHCHHHHHHHHhccCeehhCcccCCCc----ccC
Confidence 7899999999998677899999999 99999999999653 2489999999999999999999985 568
Q ss_pred CHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCe
Q psy16052 257 TVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFL 308 (476)
Q Consensus 257 s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~ 308 (476)
+.++.+++|.++...|. .++|..+++||+|++..+.. +..++++.|+.
T Consensus 134 ~~~~~~~~i~~~~~~l~---~~~g~~~~~~r~p~g~~~~~-~~~~~~~~g~~ 181 (267)
T COG0726 134 SLEELGAEIARAHDILK---KITGGRPRGFRPPYGSYNES-SLALARRLGYR 181 (267)
T ss_pred CHHHHHHHHHHHHHHHH---HHhCCCCceEECCCCccCHH-HHHHHHhCCCc
Confidence 89999999999999998 58888899999999999865 88899999984
No 9
>PRK14581 hmsF outer membrane N-deacetylase; Provisional
Probab=99.50 E-value=1.4e-13 Score=148.52 Aligned_cols=125 Identities=13% Similarity=0.193 Sum_probs=99.6
Q ss_pred CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccC---------------------CChHHHHHHHHcC
Q psy16052 178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKY---------------------TNYSAVQELHRKG 236 (476)
Q Consensus 178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~---------------------~~~~~v~~l~~~G 236 (476)
++.|+||||||+.. +...++.||++ ++++||||+++.+ .+|++|++|.+.|
T Consensus 106 ~KaV~LTFDDGy~s-ny~~AlPILKk-------yg~pATfFvVg~wid~~~~~~v~~~~~~~~r~~~LTW~qIrEM~~sG 177 (672)
T PRK14581 106 DKAVLLTFDDGYSS-FYRRVYPLLKA-------YKWSAVLAPVGTWIDTATDKKVDFGGLSTDRDRFATWKQITEMSKSG 177 (672)
T ss_pred CCeEEEEEEcCCcc-hHHHHHHHHHH-------cCCCEEEEEechhhccCcccccccccccccccccCCHHHHHHHHHCC
Confidence 58999999999976 45788999999 9999999997643 2457999999999
Q ss_pred -CeeeeCCCCCCCCcc---------------ccC----CCC----HHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcc
Q psy16052 237 -HEVAVHSITHNDDEN---------------FWS----NAT----VDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLR 292 (476)
Q Consensus 237 -HEIg~Hs~~H~~~~~---------------~~~----~~s----~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~ 292 (476)
||||+||++|..... +.. ..+ .+++++||.++++.|+ ..+|..++.|++||+.
T Consensus 178 LvEIGSHT~sHh~~~~~np~g~~~pa~~~~~y~~~~~~yEs~~~~~~rl~~Di~~s~~~Ie---~~lG~~p~~FayPyG~ 254 (672)
T PRK14581 178 LVEIGAHTYASHYGVIANPQGNTEPAAANLQYDPKTKQYETVEAFKQRMEKDVALITQRIV---QATGKQPRVWVWPYGA 254 (672)
T ss_pred CcEEEcCCcccccccccCccccccchhhhccccccccccccHHHHHHHHHHHHHHHHHHHH---HHhCCCCCEEEcCCCC
Confidence 699999999873110 000 012 2346678888999998 5888999999999999
Q ss_pred cCchHHHHHHHHcCCeEeeecc
Q psy16052 293 VGGNNQFTMMEEQAFLYDSTIT 314 (476)
Q Consensus 293 ~~~~~~~~~l~~~G~~Ydss~~ 314 (476)
++.. +.+++++.||.+.-+..
T Consensus 255 yn~~-~~~iak~aGy~~afTt~ 275 (672)
T PRK14581 255 PNGT-VLNILRQHGYQLAMTLD 275 (672)
T ss_pred cCHH-HHHHHHHCCCcEEEECC
Confidence 9886 89999999998776654
No 10
>PRK14582 pgaB outer membrane N-deacetylase; Provisional
Probab=99.48 E-value=2.4e-13 Score=147.02 Aligned_cols=125 Identities=14% Similarity=0.195 Sum_probs=99.7
Q ss_pred CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCC---------------------ChHHHHHHHHcC
Q psy16052 178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYT---------------------NYSAVQELHRKG 236 (476)
Q Consensus 178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~---------------------~~~~v~~l~~~G 236 (476)
++.|+||||||+.... ..++.||++ ++++||||+++.+. +|++|++|.+.|
T Consensus 106 ~K~VaLTFDDGy~s~y-t~A~PILkk-------ygvpATfFlvg~wi~~~~~~~~~~~~~~~~r~~~LtWeqIreM~~sG 177 (671)
T PRK14582 106 EKAVLLTFDDGYSSFY-TRVFPILQA-------FQWPAVWAPVGSWVDTPADQPVKFGGEMVPREYFATWQQVREVARSR 177 (671)
T ss_pred CCeEEEEEEcCCCchH-HHHHHHHHH-------cCCCEEEEEechhhccCccccccccccccccccCCCHHHHHHHHhCC
Confidence 5799999999997655 578999999 99999999986432 357999999999
Q ss_pred -CeeeeCCC-CCCCCcc--------------cc----CCCCH----HHHHHHHHHHHHHHHHhhcccCCcccEEecCCcc
Q psy16052 237 -HEVAVHSI-THNDDEN--------------FW----SNATV----DDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLR 292 (476)
Q Consensus 237 -HEIg~Hs~-~H~~~~~--------------~~----~~~s~----e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~ 292 (476)
||||+||+ +|..... |. .-.+. +.+++||.++++.|+ ..+|..++.|++||+.
T Consensus 178 lvEIGSHT~~sH~~l~anp~g~~~pa~~~r~~~~~~~~yEs~~e~~~ri~~DL~~s~~~Ie---~~tG~~p~~FayPyG~ 254 (671)
T PRK14582 178 LVEIASHTWNSHYGIQANPQGSLLPAAVNRAYFTDHARYETAAEYRERIRLDAVKMTEYIR---TKAGKNPRVWVWPYGE 254 (671)
T ss_pred CeEEEcCCchhccccccccccccccchhcccccccccccccHHHHHHHHHHHHHHHHHHHH---HHhCCCCcEEecCCCC
Confidence 69999999 5874311 00 00122 336789999999999 5889999999999999
Q ss_pred cCchHHHHHHHHcCCeEeeecc
Q psy16052 293 VGGNNQFTMMEEQAFLYDSTIT 314 (476)
Q Consensus 293 ~~~~~~~~~l~~~G~~Ydss~~ 314 (476)
++.. ..+++++.||.+.-+..
T Consensus 255 ~n~~-~~~iakeaGY~~afT~~ 275 (671)
T PRK14582 255 ANGI-ALEELKKLGYDMAFTLE 275 (671)
T ss_pred CCHH-HHHHHHHCCCeEEEEeC
Confidence 9886 89999999998766643
No 11
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=98.95 E-value=3.9e-10 Score=75.92 Aligned_cols=36 Identities=47% Similarity=1.150 Sum_probs=34.1
Q ss_pred CCCCCceecCCCCccCCCcccCCCCCCCCCCCCCCC
Q psy16052 102 LCQDGFLACGDGACIERGLFCNGEKDCNDGSDENTC 137 (476)
Q Consensus 102 ~C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~sDe~~c 137 (476)
.|+..+|+|.+|.||+..++|||..||.|+|||.+|
T Consensus 2 ~C~~~~f~C~~~~CI~~~~~CDg~~DC~dgsDE~~C 37 (37)
T PF00057_consen 2 TCPPGEFRCGNGQCIPKSWVCDGIPDCPDGSDEQNC 37 (37)
T ss_dssp SSSTTEEEETTSSEEEGGGTTSSSCSSSSSTTTSSH
T ss_pred cCcCCeeEcCCCCEEChHHcCCCCCCCCCCcccccC
Confidence 488999999999999999999999999999999876
No 12
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=98.85 E-value=1.4e-09 Score=72.48 Aligned_cols=35 Identities=46% Similarity=1.132 Sum_probs=32.7
Q ss_pred CCCCceecCCCCccCCCcccCCCCCCCCCCCCCCC
Q psy16052 103 CQDGFLACGDGACIERGLFCNGEKDCNDGSDENTC 137 (476)
Q Consensus 103 C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~sDe~~c 137 (476)
|++.+|+|.+|.||+..++|||..||.|+|||..|
T Consensus 1 C~~~~f~C~~~~Ci~~~~~CDg~~DC~dgsDE~~C 35 (35)
T cd00112 1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35 (35)
T ss_pred CCCCeEEcCCCCeeCHHHcCCCccCCCCCcccccC
Confidence 56789999999999999999999999999999876
No 13
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=98.53 E-value=5.9e-08 Score=63.72 Aligned_cols=32 Identities=47% Similarity=1.096 Sum_probs=30.1
Q ss_pred CCCCceecCCCCccCCCcccCCCCCCCCCCCC
Q psy16052 103 CQDGFLACGDGACIERGLFCNGEKDCNDGSDE 134 (476)
Q Consensus 103 C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~sDe 134 (476)
|...+|+|.+|.||+..++|||..||.|+|||
T Consensus 2 C~~~~f~C~~~~Ci~~~~~Cdg~~dC~dgsDE 33 (33)
T smart00192 2 CPPGEFQCDNGRCIPLSWVCDGVDDCSDGSDE 33 (33)
T ss_pred CCCCeEECCCCCEECchhhCCCcCcCcCCCCC
Confidence 66679999999999999999999999999998
No 14
>PF09960 DUF2194: Uncharacterized protein conserved in bacteria (DUF2194); InterPro: IPR018695 This family of prokaryotic proteins has no known function; however it may be a membrane protein.
Probab=98.10 E-value=0.00016 Score=78.04 Aligned_cols=206 Identities=13% Similarity=0.100 Sum_probs=125.2
Q ss_pred cHHHHHHHHhccCCCCCCCCceEEEEecccCCCh-----------------HHHHHHHHcCCeeeeCCCCCCCCccccCC
Q psy16052 193 NIALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY-----------------SAVQELHRKGHEVAVHSITHNDDENFWSN 255 (476)
Q Consensus 193 ~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~-----------------~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~ 255 (476)
=|+.++++-++ +++|.|.++...|.+. ...++|.+.|||||.|||.|+.+.. ..
T Consensus 288 WWpDM~~la~~-------y~ikyTg~~I~~Y~~~~~~pf~~~~~~~~~~~~~~g~~Ll~~ggElG~HGYNHqpL~~--~~ 358 (585)
T PF09960_consen 288 WWPDMKKLAKK-------YGIKYTGVMIENYNDQTNPPFEFLEQEDTDRFIYFGRELLKSGGELGLHGYNHQPLTL--EG 358 (585)
T ss_pred ccHHHHHHHHH-------cCCeEEEEEccccCCCCCCCcccccccchhhHHHHHHHHHhcCCceEEecccCCCCcC--CC
Confidence 44566677777 9999999999876421 3568899999999999999998632 12
Q ss_pred ----------C-CHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHc-C-CeEeeecccCCCCCCC
Q psy16052 256 ----------A-TVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQ-A-FLYDSTITAPLSNPPL 322 (476)
Q Consensus 256 ----------~-s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~-G-~~Ydss~~~~~~~~p~ 322 (476)
+ +.+++.+-+...++.+++ -.+...+..+-+|---++.. -.++|.+. . +..-+|...... -.
T Consensus 359 ~~~~~~~Y~~W~~~~~m~~sl~~l~~f~~~--l~p~~~~~~YVPPSNils~e-G~e~L~~~~P~ik~IaS~Y~~~~--~~ 433 (585)
T PF09960_consen 359 DYGDEYGYKPWPSKEDMAESLKELKRFVKS--LFPNYEPSSYVPPSNILSEE-GREALKKAFPEIKTIASLYFGDD--EE 433 (585)
T ss_pred cccccccCcCCCCHHHHHHHHHHHHHHHHH--hCCCcceEEecCCccccCHH-HHHHHHHhCCCeEEEEEeeecCC--cC
Confidence 2 556666666655555554 34567888899998888776 56666653 1 334444432110 02
Q ss_pred CCccccccCCCCcCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccC
Q psy16052 323 WPYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQN 402 (476)
Q Consensus 323 ~pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~ 402 (476)
-+|...|+.. ..+++|+|=.. .|. -+++.. +...+...-.-
T Consensus 434 ~~y~QEF~~~-------------~dgi~e~PRis-------------SG~----------~~~~~~--~~~~~s~l~~~- 474 (585)
T PF09960_consen 434 GEYVQEFEIA-------------PDGIVEFPRIS-------------SGY----------YPDDYM--LWAIVSALNNH- 474 (585)
T ss_pred CcceEEeeEC-------------CCCeEEeCccc-------------cCC----------ccChHH--HHHHHHHHHhh-
Confidence 2455555431 23588888432 121 011211 11223222110
Q ss_pred CcceEEeecchhhcC--C------hHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhh
Q psy16052 403 RAPLGLYFHAAWLKN--N------PEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQ 452 (476)
Q Consensus 403 r~p~~i~~H~~w~~~--~------p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~ 452 (476)
..+.=.+||.=+.. | .+....|++||.++.++.|-....|.+|.+.-++
T Consensus 475 -Gv~sHFiHPDDvld~dR~~~~~W~~l~~~l~~~~~~i~~~~P~Lr~~T~Se~~~~~~ 531 (585)
T PF09960_consen 475 -GVFSHFIHPDDVLDEDRNAGKGWEELYKSLDKFMSWIKESYPWLRNLTASEAAGALQ 531 (585)
T ss_pred -eeeecccCCccccccccCCCCCHHHHHHHHHHHHHHHHHhCCCcccCcHHHHHHHHH
Confidence 11222346531211 1 2367899999999998899999999999986553
No 15
>PF10096 DUF2334: Uncharacterized protein conserved in bacteria (DUF2334); InterPro: IPR018763 This group of proteins has no known function.
Probab=97.97 E-value=0.00029 Score=68.37 Aligned_cols=99 Identities=17% Similarity=0.290 Sum_probs=73.9
Q ss_pred HHHHHHHHhccCCCCCCCCceEEEEecccCCC---------------hHHHHHHHHcCCeeeeCCCCCCCCc-------c
Q psy16052 194 IALYKEMFNGKRKNPNGCDIKATFFVSHKYTN---------------YSAVQELHRKGHEVAVHSITHNDDE-------N 251 (476)
Q Consensus 194 ~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~---------------~~~v~~l~~~GHEIg~Hs~~H~~~~-------~ 251 (476)
...+.++|.+ .+|+.+.+|+..+.+ ...++.+.++|++|+.||++|.... .
T Consensus 18 l~~i~d~l~~-------~~ipf~v~vIP~~~d~~~~~~~~l~~~~~f~~~L~~~~~~Gg~I~lHGYtHq~~~~~sg~~~e 90 (243)
T PF10096_consen 18 LKEIADYLYK-------YGIPFSVAVIPVYVDPNGGITVNLSDNPEFVEYLRYLQARGGEIVLHGYTHQYGNSVSGDGFE 90 (243)
T ss_pred HHHHHHHHHH-------CCCCEEEEEEecccCCCCcccccchhhHHHHHHHHHHHhcCCEEEEEecceecCCCcccccce
Confidence 3445677877 799988888754321 1578888999999999999999832 2
Q ss_pred ccC-----CCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHH
Q psy16052 252 FWS-----NATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEE 304 (476)
Q Consensus 252 ~~~-----~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~ 304 (476)
+|. ..+.+..++-|.++.+++++ .|-.+.||-+|....+.. +++++++
T Consensus 91 f~~~~~~~~~~~~~a~~ri~~gl~~l~~----~gi~p~~f~aP~y~~s~~-~~~~~~~ 143 (243)
T PF10096_consen 91 FWDSEFDAGDSEEEAKERIEKGLEILAE----LGIYPVGFEAPHYWASPE-TYKVLEK 143 (243)
T ss_pred eccccccccCCHHHHHHHHHHHHHHHHH----CCCcccEEECCccccCHH-HHHHHHh
Confidence 333 25677888888888888874 255899999999998886 6766643
No 16
>PF01607 CBM_14: Chitin binding Peritrophin-A domain; InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=97.86 E-value=4.8e-06 Score=60.43 Aligned_cols=53 Identities=34% Similarity=0.767 Sum_probs=34.8
Q ss_pred cCCCCCCceeeccCCCCCCCCCccccCCCCcccccCCCCcccccccccccCCcccccc
Q psy16052 26 CKDKDAGEWFRLVAGEGDNCRDVIQCTSSGLQAIRCPAGLYFDIEKQTCDWKGAVKNC 83 (476)
Q Consensus 26 C~~~~~~~~~~~~~~~~~~C~~~~~C~~g~~~~~~C~~g~~~~~~~~~C~~~~~~~~C 83 (476)
|.++.++ +|+++. +|+.|+.|.+|.....+|+.|+.||...+.|+++..+..|
T Consensus 1 C~~~~~~-~~~~~~----~C~~Y~~C~~g~~~~~~C~~g~~fd~~~~~C~~~~~~~~C 53 (53)
T PF01607_consen 1 CPGRGDG-FYPHPD----DCRKYYQCVNGQAVEQRCPEGLYFDPSSQRCVPPSNVVQC 53 (53)
T ss_dssp ----SSE-EE--SS-----SSEEEEEETTEEEEEE-TTS-EE-TTTSSEE-TTT-TT-
T ss_pred CCccCCe-eEeCCC----CCCEEEEeeCCcEECCCCcCCCEECcCcCEEcCCccCCCC
Confidence 5666544 777777 8999999999999999999999999999999987664444
No 17
>COG3233 Predicted deacetylase [General function prediction only]
Probab=97.68 E-value=0.00054 Score=63.88 Aligned_cols=123 Identities=14% Similarity=0.139 Sum_probs=85.7
Q ss_pred EEEeecCCC-CCccHHHHHHHHhccCCCCCCCCceEEE-EecccCCC----------hHHHHHHHHcCCeeeeCCCCCCC
Q psy16052 181 ITITFDDAI-NNNNIALYKEMFNGKRKNPNGCDIKATF-FVSHKYTN----------YSAVQELHRKGHEVAVHSITHND 248 (476)
Q Consensus 181 V~lTFDdg~-~~~~~~~~l~ll~~~~~np~~c~i~aTF-Fv~~~~~~----------~~~v~~l~~~GHEIg~Hs~~H~~ 248 (476)
|.|-.|-.+ +..+...+.+++++ . +....|+ +|..++-+ .+.+.+..++|+|++.|||.|.+
T Consensus 6 iillhdVSpv~~~~~~~i~~~ide-~-----~~~~~t~lLViPn~~~~~~l~~d~rf~~~l~~r~e~Gdel~lHGy~h~d 79 (233)
T COG3233 6 IILLHDVSPVYWPTLSNIDAAIDE-Y-----GAQNSTVLLVIPNHANDYPLSKDPRFVDLLTEREEEGDELVLHGYDHID 79 (233)
T ss_pred eEEEEecCcccchhHHHHHHHHHH-h-----CCCCceEEEEeeccCCCCCcccChHHHHHHHHHHhcCCEEEEechhhcc
Confidence 444444433 33344455666665 1 3455666 77654321 27788889999999999999987
Q ss_pred Cc---cccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcCCeEeeeccc
Q psy16052 249 DE---NFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQAFLYDSTITA 315 (476)
Q Consensus 249 ~~---~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G~~Ydss~~~ 315 (476)
.. .| .-+...+.+.-+..+.+.+++ .|-..+||-+|.+.++.. ++++|-+++|+|-++..+
T Consensus 80 ~~~~gEF-~~l~~~eA~~RL~~a~~~l~~----~G~~~~~FvpPrwlmS~g-t~~aL~~~~frv~a~l~~ 143 (233)
T COG3233 80 TKRRGEF-ACLRAHEARLRLMAAIEELEA----LGFPLRGFVPPRWLMSEG-TRQALLENNFRVTADLRG 143 (233)
T ss_pred ccCcccc-ccchHHHHHHHHHHHHHHHHH----cCCcceeecCcceecCHh-HHHHhccCCcEEEecccc
Confidence 21 11 334456666677777777873 355699999999999987 999999999999988863
No 18
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=97.12 E-value=0.00044 Score=50.36 Aligned_cols=49 Identities=33% Similarity=0.717 Sum_probs=42.7
Q ss_pred ccCCCCCCceeeccCCCCCCCCCccccCCCCcccccCCCCcccccccccccCCc
Q psy16052 25 LCKDKDAGEWFRLVAGEGDNCRDVIQCTSSGLQAIRCPAGLYFDIEKQTCDWKG 78 (476)
Q Consensus 25 ~C~~~~~~~~~~~~~~~~~~C~~~~~C~~g~~~~~~C~~g~~~~~~~~~C~~~~ 78 (476)
.|..+. +.+|+++. +|..|+.|..+....+.|+.|+.|+...+.|++..
T Consensus 2 ~C~~~~-~g~~~~~~----~C~~y~~C~~~~~~~~~C~~g~~f~~~~~~C~~~~ 50 (56)
T smart00494 2 ECPGRG-DGLYPHPT----DCSKYYQCSNGRPIVGSCPAGLVFDPATQTCDWPD 50 (56)
T ss_pred cCCCCC-CccccCcc----cCCeeEEcCCCcEEeccCcCCCeECCCCCCcCCcc
Confidence 477754 55998888 89999999999889999999999999999998754
No 19
>PF11959 DUF3473: Domain of unknown function (DUF3473); InterPro: IPR022560 This domain, found in bacteria and archaea, is functionally uncharacterised. It is about 130 amino acids in length and is found C-terminal to PF01522 from PFAM. It contains two completely conserved residues (P and H) that may be functionally important.
Probab=97.01 E-value=0.0034 Score=55.03 Aligned_cols=82 Identities=26% Similarity=0.489 Sum_probs=48.7
Q ss_pred CeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHH-----HHHHHHHhccCCcceEEeecchhhcC--ChH
Q psy16052 348 AVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFL-----NHNFDRHYDQNRAPLGLYFHAAWLKN--NPE 420 (476)
Q Consensus 348 ~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l-----~~~f~~~y~~~r~p~~i~~H~~w~~~--~p~ 420 (476)
.|||||++.+. .+...+| .++..+++++ +..+.+.....+.|+++||| +|... +|.
T Consensus 29 ~l~E~P~st~~-----~~g~~lP-----------~~GGgyfRl~P~~l~~~~~~~~~~~~~~~~~~YfH-PwE~dp~qpr 91 (133)
T PF11959_consen 29 SLVEFPISTSP-----LFGKRLP-----------LGGGGYFRLFPYWLYRWLIRRINRRGGQPAVFYFH-PWEFDPDQPR 91 (133)
T ss_pred eEEEECCcccc-----cCCeeee-----------ecCChhHHHccHHHHHHHHHHHHhCCCcceEEEEe-ceecCCCCcc
Confidence 39999998753 2222333 2233344444 33333333334489999999 87762 221
Q ss_pred --------------HHHHHHHHHHHHHhhCCCEEEEeHHHHHH
Q psy16052 421 --------------FLDAFLYWVDEIIERHKDVYFVTMTQVIQ 449 (476)
Q Consensus 421 --------------~~~al~~fl~~i~~~~~dV~fvT~~qi~~ 449 (476)
.+..+++=|+.++ .+.-|+|+.|+++
T Consensus 92 ~~~~~~~~rf~~y~~l~~~~~rl~~Ll---~~f~f~t~~~~~~ 131 (133)
T PF11959_consen 92 VPGAPLKSRFRHYNNLDRMEKRLDRLL---SDFRFGTMREVLA 131 (133)
T ss_pred cCCCCcceeEEEEcCHHHHHHHHHHHH---hhCceEEHHHHhh
Confidence 1355555566654 4889999999975
No 20
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=96.45 E-value=0.0029 Score=57.56 Aligned_cols=49 Identities=29% Similarity=0.498 Sum_probs=40.0
Q ss_pred cccccccccccccccCCCCCCCCCCCCCCceecCCC----CccCCCcccCCCCC---CCCCCCC
Q psy16052 78 GAVKNCKLKNKERKVKPLLYTDEPLCQDGFLACGDG----ACIERGLFCNGEKD---CNDGSDE 134 (476)
Q Consensus 78 ~~~~~C~~~~de~~c~~~~~~~~~~C~~~~~~C~~g----~Ci~~~~~C~g~~d---C~d~sDe 134 (476)
|+.=||.|+|||..= ..|..+.|.|.|. .-|+.+.+=||.=| |-|||||
T Consensus 55 DdyCDC~DGSDEPGT--------sAC~~~~FyC~N~g~~p~~i~~s~VnDGICDy~~CCDGSDE 110 (176)
T PF12999_consen 55 DDYCDCPDGSDEPGT--------SACSNGKFYCENKGHIPRYIPSSRVNDGICDYDICCDGSDE 110 (176)
T ss_pred CcceeCCCCCCcccc--------ccCcCceEeeccCCCCCceeehhhhcCCcCcccccCCCCCC
Confidence 455579999999732 3488889999764 77888999999999 9999999
No 21
>PF00057 Ldl_recept_a: Low-density lipoprotein receptor domain class A This prints entry is specific to LDL receptor; InterPro: IPR002172 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR class A (cyateine-rich) repeat, which contains 6 disulphide-bound cysteines and a highly conserved cluster of negatively charged amino acids, of which many are clustered on one face of the module []. In LDL receptors, the class A domains form the binding site for LDL and calcium. The acidic residues between the fourth and sixth cysteines are important for high-affinity binding of positively charged sequences in LDLR's ligands. The repeat consists of a beta-hairpin structure followed by a series of beta turns. In the absence of calcium, LDL-A domains are unstructured; the bound calcium ion imparts structural integrity. Following these repeats is a 350 residue domain that resembles part of the epidermal growth factor (EGF) precursor. Numerous familial hypercholestorolemia mutations of the LDL receptor alter the calcium coordinating residue of LDL-A domains or other crucial scaffolding residues. ; GO: 0005515 protein binding; PDB: 2I1P_A 3OJY_A 4E0S_B 3T5O_A 4A5W_B 1JRF_A 1K7B_A 1V9U_5 3DPR_E 2KNY_A ....
Probab=95.54 E-value=0.0062 Score=40.92 Aligned_cols=33 Identities=18% Similarity=0.425 Sum_probs=27.0
Q ss_pred cccccCCCCcccccccccccCCccccccccccccccc
Q psy16052 56 LQAIRCPAGLYFDIEKQTCDWKGAVKNCKLKNKERKV 92 (476)
Q Consensus 56 ~~~~~C~~g~~~~~~~~~C~~~~~~~~C~~~~de~~c 92 (476)
..+|+|.+|.++.. .++| +++.||.+++||.+|
T Consensus 5 ~~~f~C~~~~CI~~-~~~C---Dg~~DC~dgsDE~~C 37 (37)
T PF00057_consen 5 PGEFRCGNGQCIPK-SWVC---DGIPDCPDGSDEQNC 37 (37)
T ss_dssp TTEEEETTSSEEEG-GGTT---SSSCSSSSSTTTSSH
T ss_pred CCeeEcCCCCEECh-HHcC---CCCCCCCCCcccccC
Confidence 56899999886654 5677 899999999999876
No 22
>cd00112 LDLa Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure
Probab=95.52 E-value=0.0066 Score=40.22 Aligned_cols=32 Identities=19% Similarity=0.440 Sum_probs=26.2
Q ss_pred ccccCCCCcccccccccccCCccccccccccccccc
Q psy16052 57 QAIRCPAGLYFDIEKQTCDWKGAVKNCKLKNKERKV 92 (476)
Q Consensus 57 ~~~~C~~g~~~~~~~~~C~~~~~~~~C~~~~de~~c 92 (476)
.+|+|.+|.++.. .+.| +++.||.++|||..|
T Consensus 4 ~~f~C~~~~Ci~~-~~~C---Dg~~DC~dgsDE~~C 35 (35)
T cd00112 4 NEFRCANGRCIPS-SWVC---DGEDDCGDGSDEENC 35 (35)
T ss_pred CeEEcCCCCeeCH-HHcC---CCccCCCCCcccccC
Confidence 5899999877654 4677 889999999999865
No 23
>KOG1215|consensus
Probab=94.90 E-value=0.02 Score=65.85 Aligned_cols=74 Identities=27% Similarity=0.678 Sum_probs=57.2
Q ss_pred cccccCCCC-cccccccccccCCccccccccccccc--ccCCCCCCCCCCCCCCceecCCCCccCCCcccCCCCCCCCCC
Q psy16052 56 LQAIRCPAG-LYFDIEKQTCDWKGAVKNCKLKNKER--KVKPLLYTDEPLCQDGFLACGDGACIERGLFCNGEKDCNDGS 132 (476)
Q Consensus 56 ~~~~~C~~g-~~~~~~~~~C~~~~~~~~C~~~~de~--~c~~~~~~~~~~C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~s 132 (476)
...++|..+ .+.. ..+.| ++..+|.+++||. .|.- ..|...++.|.++.|+...+.|+|..||.||+
T Consensus 220 ~~~~~c~g~~~~i~-~~~~~---Dg~~dc~~~~de~~~~~~~------~~~~~~e~~~~~~~~~~~~~~~~g~~d~pdg~ 289 (877)
T KOG1215|consen 220 IEVTRCDGSSRCIL-ISEVC---DGPRDCVDGPDEGVMNCSD------ATCEAPEIECADGDCSDRQKLCDGDLDCPDGL 289 (877)
T ss_pred eeEEEecCCCcEEe-ehhcc---CCCcccccCCcCceeEeec------cccCCcceeecCCCCccceEEecCccCCCCcc
Confidence 356677764 3322 23445 8899999999995 3422 24777899999999999999999999999999
Q ss_pred CCCCCCC
Q psy16052 133 DENTCDS 139 (476)
Q Consensus 133 De~~c~~ 139 (476)
||..|..
T Consensus 290 de~~~~~ 296 (877)
T KOG1215|consen 290 DEDYCKK 296 (877)
T ss_pred ccccccc
Confidence 9998874
No 24
>KOG1215|consensus
Probab=94.87 E-value=0.067 Score=61.58 Aligned_cols=115 Identities=22% Similarity=0.389 Sum_probs=73.0
Q ss_pred cccccCC--CCcccccccccccCCcccccccccccccccCCCC---C----------------------CCCCCC-----
Q psy16052 56 LQAIRCP--AGLYFDIEKQTCDWKGAVKNCKLKNKERKVKPLL---Y----------------------TDEPLC----- 103 (476)
Q Consensus 56 ~~~~~C~--~g~~~~~~~~~C~~~~~~~~C~~~~de~~c~~~~---~----------------------~~~~~C----- 103 (476)
...|+|. ++.|.. ..+.| ++..+|.+++||..|.... . .++..+
T Consensus 140 ~~~~~c~~~~~~Cip-~~~~c---d~~~~C~dg~de~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~d~~~~~~~~~ 215 (877)
T KOG1215|consen 140 LDKFSCRTGSCKCIP-GDWLC---DGEADCPDGSDELNCAVRRCEPRGASLDCIVAIKVCDIQHDCADDYDESEGRIYWT 215 (877)
T ss_pred CCCCCCcCccccCCC-CceeC---CCCCccccchhhhcccccccCccccccccceeeeecCcccccccccccccCccccc
Confidence 4556666 333322 23445 7777899999998885210 0 001111
Q ss_pred ---CCCceecCC-CCccCCCcccCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCcccCCccccCCCCCCCCCCCCCC
Q psy16052 104 ---QDGFLACGD-GACIERGLFCNGEKDCNDGSDE--NTCDSENDPNRAPPCDPAVCVLPDCFCSEDGTYIPGDLPSK 175 (476)
Q Consensus 104 ---~~~~~~C~~-g~Ci~~~~~C~g~~dC~d~sDe--~~c~~~~~~~~~~~C~~~~C~~p~C~C~~~~~~iP~~l~~~ 175 (476)
....++|.. ++||...+.|||+.||.|++|| ..|....+......|..+.|....-.|..... .|.++...
T Consensus 216 ~~~~~~~~~c~g~~~~i~~~~~~Dg~~dc~~~~de~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~g~~d-~pdg~de~ 292 (877)
T KOG1215|consen 216 DDSRIEVTRCDGSSRCILISEVCDGPRDCVDGPDEGVMNCSDATCEAPEIECADGDCSDRQKLCDGDLD-CPDGLDED 292 (877)
T ss_pred CCcceeEEEecCCCcEEeehhccCCCcccccCCcCceeEeeccccCCcceeecCCCCccceEEecCccC-CCCccccc
Confidence 135788866 5999999999999999999999 46666555555677777777765555665322 35555443
No 25
>PF03065 Glyco_hydro_57: Glycosyl hydrolase family 57; InterPro: IPR004300 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 57 GH57 from CAZY comprises enzymes with two known activities; alpha-amylase (3.2.1.1 from EC) and 4-alpha-glucanotransferase (2.4.1 from EC).; GO: 0003824 catalytic activity, 0005975 carbohydrate metabolic process; PDB: 1K1X_B 1K1W_A 1K1Y_A 3N98_A 3N8T_A 3N92_A 1UFA_A 3P0B_A.
Probab=94.77 E-value=0.071 Score=54.81 Aligned_cols=196 Identities=15% Similarity=0.165 Sum_probs=106.9
Q ss_pred HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCH-HHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHH
Q psy16052 227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATV-DDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEE 304 (476)
Q Consensus 227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~-e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~ 304 (476)
+.++++.++|+ ||-.-+++|...+. ++. ++.+.+|+...+.++ .+.|..|+||..|-..++.. ..++|++
T Consensus 114 ~~~~~l~~~G~iEll~~~~~h~ilpl----~~~~~d~~~Qi~~~~~~~~---~~FG~~P~G~W~pE~a~~~~-l~~~l~~ 185 (360)
T PF03065_consen 114 EAFRELAERGQIELLTSPYYHPILPL----LPDPEDFRAQIEMGREYFK---KHFGRRPRGFWLPECAYSPG-LEEILAE 185 (360)
T ss_dssp HHHHHHHHTTSEEEEEE-TT-B-GGG----SSHHHHHHHHHHHHHHHHH---HHHSS--SBEE-GGG-B-TT-HHHHHHH
T ss_pred HHHHHHHHCCCEEEEeCchHHHHHHH----hCCHHHHHHHHHHHHHHHH---HHhCCCCCceECcccccCHH-HHHHHHH
Confidence 67889999998 99999999998753 566 999999999999988 58899999999999999987 8999999
Q ss_pred cCCeEee---ecccCCCCCCCCCccccccCCCCcCCCCcCCCCCC-CCeEEEeccccccCCCCCCCC-CCCCCcccCCCC
Q psy16052 305 QAFLYDS---TITAPLSNPPLWPYTMYFRMPHRCHGNLQHCPTRS-HAVWEMVMNELDRREDPNFDE-YLPGCAMVDSCS 379 (476)
Q Consensus 305 ~G~~Yds---s~~~~~~~~p~~pytl~~~~~~~c~~~~~~cp~~~-~~lweiP~~~~~~~~~p~~~~-~~~g~~~~d~~~ 379 (476)
.|+.|-- +........|..+|..+. .+... .+|..++.+.-... .-.|.. ..||
T Consensus 186 ~Gi~~~i~d~~~~~~~~~~~~~~y~py~------------~~~~~~~~l~vf~rd~~ls~-~I~f~~~g~~~-------- 244 (360)
T PF03065_consen 186 AGIRYTILDGHQLLGAGSPPNNPYRPYR------------VEDQGGSGLAVFFRDQELSD-DIWFRYSGYPG-------- 244 (360)
T ss_dssp TT--EEEEECHHHHHHT--CGGCSSEEE------------ECE-TTCSEEEEEB-HHHHH-HHHTTTTSGGG--------
T ss_pred cCCEEEEECcHHhCCCCCccCCCCceeE------------eecCCCceEEEEecCHHHHH-hcCCccccccc--------
Confidence 9998732 222111123344444331 11111 23777765531110 000111 0111
Q ss_pred CCCChHHHHHHHHHHHHHHhccCCcce-EEeecchhh-----cCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhhC
Q psy16052 380 NILTGDQFYNFLNHNFDRHYDQNRAPL-GLYFHAAWL-----KNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQN 453 (476)
Q Consensus 380 ~~~~~~~~~~~l~~~f~~~y~~~r~p~-~i~~H~~w~-----~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~~ 453 (476)
..+.+++++.+...+..........+ ++.+-..-+ ......++.|.+.+..+.+ .+.|-++|.+|+++-+..
T Consensus 245 -~~~~~~~~~~l~~~~~~~~~~~~~~~v~i~~DgE~~~~~~~~~~~~~l~~l~~~~~~~~~-~~~i~~~t~se~l~~~~~ 322 (360)
T PF03065_consen 245 -DPAYREFYRYLLYEYASWISYDPGPVVVIALDGENFWGHYWDEGEGFLEFLYQLLEELHS-RGFIETVTPSEYLERHPP 322 (360)
T ss_dssp -STTSBHSSCHHC----HC-BS-TT-EEEECCEGGGCCCCCCCCTCEHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHST
T ss_pred -ccCHHHHHHhhhhhhhhhhccCCCCeEEEeECCcccCCcCCcccCCHHHHHHHHHHHhhc-CCeEEecCHHHHHhhCCC
Confidence 12345566555411333333333333 343322111 1234577778877777765 445899999999986633
No 26
>COG1449 Alpha-amylase/alpha-mannosidase [Carbohydrate transport and metabolism]
Probab=94.56 E-value=0.6 Score=51.34 Aligned_cols=79 Identities=18% Similarity=0.197 Sum_probs=65.9
Q ss_pred HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHc
Q psy16052 227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQ 305 (476)
Q Consensus 227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~ 305 (476)
...|++.+.|+ |+-.+++.|.-.+-+..+--.+++.++|...+++.. .+.|..++|||+|.+.++.+ ..+++++.
T Consensus 183 ~~~r~l~e~g~~e~~~~pyyHsl~pl~~d~gw~~d~~~qv~~~~~~~~---elfG~~p~~~~~tEl~y~~~-i~~~~~e~ 258 (615)
T COG1449 183 EAFRELAESGKVELTASPYYHSLIPLLADDGWYEDFKEQVMMSRELYK---ELFGVWPSGFWNTELAYNDQ-ILEYFEES 258 (615)
T ss_pred HHHHHHHhcCceEEEecccccccchhcccCCchHHHHHHHHHHHHHHH---HHhCCCCccccChhhhccHH-HHHHHHHc
Confidence 56788888887 999999999976544332235688899999988888 47888999999999999987 99999999
Q ss_pred CCeE
Q psy16052 306 AFLY 309 (476)
Q Consensus 306 G~~Y 309 (476)
||.|
T Consensus 259 G~~~ 262 (615)
T COG1449 259 GFSW 262 (615)
T ss_pred CCcc
Confidence 9986
No 27
>PF04748 Polysacc_deac_2: Divergent polysaccharide deacetylase; InterPro: IPR006837 This is a family of uncharacterised proteins that includes YibQ.; PDB: 2QV5_A 2NLY_A.
Probab=93.84 E-value=2.8 Score=39.78 Aligned_cols=182 Identities=15% Similarity=0.191 Sum_probs=88.5
Q ss_pred CCceEEEEecccCCCh-HHHHHHHHcCCeeeeCCC------CCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcc
Q psy16052 211 CDIKATFFVSHKYTNY-SAVQELHRKGHEVAVHSI------THNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSV 283 (476)
Q Consensus 211 c~i~aTFFv~~~~~~~-~~v~~l~~~GHEIg~Hs~------~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~ 283 (476)
-+.+.||-|....+.- +.++..+++||||-.|-- .+.........++.++.+.-+..+ |. .+++ .
T Consensus 19 Lp~pvT~ai~P~~~~~~~~a~~a~~~G~EvllhlPMep~~~~~~gp~~L~~~~~~~~i~~~l~~a---l~---~vp~--a 90 (213)
T PF04748_consen 19 LPFPVTFAILPYAPYSREWAERARAAGHEVLLHLPMEPKGYKDPGPGALLTGMSEEEIRKRLEAA---LA---RVPG--A 90 (213)
T ss_dssp SSTTCEEEEETTSTTHHHHHHHHHHCT-EEEEEEEE--TTTT---TT-B-TTS-HHHHHHHHHHH---HC---CSTT---
T ss_pred CCCCeEEEECCCCCChHHHHHHHHHcCCEEEEeCCCCCCCCCCcccccccCCCCHHHHHHHHHHH---HH---HCCC--c
Confidence 5788899888765543 566777899999999852 111111223356777666665543 22 3443 3
Q ss_pred cEEecCCc-ccCch-----HHHHHHHHcCCeEeeecccCCCCCCCCCccccccCCCCcCCCCcCCCCCCCCeEEEecccc
Q psy16052 284 VGVRAPYL-RVGGN-----NQFTMMEEQAFLYDSTITAPLSNPPLWPYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNEL 357 (476)
Q Consensus 284 ~gfR~P~l-~~~~~-----~~~~~l~~~G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~ 357 (476)
+|....-+ ....+ ..++.|++.|+.|=-|...+ .+.-.... +..++--.=-..+
T Consensus 91 ~GvnNhmGS~~T~~~~~m~~vl~~l~~~gl~FvDS~T~~--------~s~a~~~A------------~~~gvp~~~rdvf 150 (213)
T PF04748_consen 91 VGVNNHMGSRFTSDREAMRWVLEVLKERGLFFVDSRTTP--------RSVAPQVA------------KELGVPAARRDVF 150 (213)
T ss_dssp SEEEEEE-CCHHC-HHHHHHHHHHHHHTT-EEEE-S--T--------T-SHHHHH------------HHCT--EEE-SEE
T ss_pred EEEecCCCccccCCHHHHHHHHHHHHHcCCEEEeCCCCc--------ccHHHHHH------------HHcCCCEEeecee
Confidence 44433222 11111 26778889998664443321 11110000 0111111101111
Q ss_pred ccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCC
Q psy16052 358 DRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHK 437 (476)
Q Consensus 358 ~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~ 437 (476)
. | +..+.+.+.+.|.......... + ..+.--| + +|.-+++|++++..+.+ .
T Consensus 151 L-----------------D---~~~~~~~I~~ql~~~~~~A~~~-G-~aI~Igh-~----~p~Tl~~L~~~~~~l~~--~ 201 (213)
T PF04748_consen 151 L-----------------D---NDQDEAAIRRQLDQAARIARKQ-G-SAIAIGH-P----RPETLEALEEWLPELEA--Q 201 (213)
T ss_dssp T-----------------T---ST-SHHHHHHHHHHHHHHHHCC-S-EEEEEEE------SCCHHHHHHHHHHHHHH--C
T ss_pred c-----------------C---CCCCHHHHHHHHHHHHHhhhhc-C-cEEEEEc-C----CHHHHHHHHHHHhHHhh--C
Confidence 1 1 1134566666665555444332 2 2333446 4 26789999999999975 8
Q ss_pred CEEEEeHHHHHH
Q psy16052 438 DVYFVTMTQVIQ 449 (476)
Q Consensus 438 dV~fvT~~qi~~ 449 (476)
+|-+|+.+++++
T Consensus 202 gi~lVpvS~llr 213 (213)
T PF04748_consen 202 GIELVPVSALLR 213 (213)
T ss_dssp TEEE--HHHH--
T ss_pred CEEEEEhHHhhC
Confidence 999999999874
No 28
>PF15421 Polysacc_deac_3: Putative polysaccharide deacetylase; PDB: 4DWE_A.
Probab=92.47 E-value=5.1 Score=40.91 Aligned_cols=167 Identities=16% Similarity=0.213 Sum_probs=73.9
Q ss_pred hHHHHHHHHcCCeeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHc
Q psy16052 226 YSAVQELHRKGHEVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQ 305 (476)
Q Consensus 226 ~~~v~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~ 305 (476)
|+.+++|+.-|..|+.|...+... ..+.++.+.+..-+..+|.+ .+.|...+-+..|-+ +..+- .+.+
T Consensus 132 W~evkem~nyG~~ia~Hdv~~~~~-----~~t~~~~~~~~~i~~~~i~~--~l~GR~~K~l~ePnG----n~~Yi-~aa~ 199 (425)
T PF15421_consen 132 WDEVKEMLNYGTGIAFHDVNTYDV-----DTTDDQILAHFCIAQNSIFK--NLSGRGCKMLAEPNG----NKKYI-TAAL 199 (425)
T ss_dssp HHHHHHHHTTT-EEEB---SS-GG-----G-SHHHHHHHHHHHHHHHHH--HTTT----EEE--TT-----HHHH-HHHT
T ss_pred HHHHHHHHhcCcceEecccccccc-----cccchhhhhhhhhHHHHHHH--HhcCcccceeeCCCC----CHHHH-HHHH
Confidence 589999999999999999988774 34556666665555555654 677877776666643 32222 1222
Q ss_pred CCeEeeecccCCCCCCCCCccccccCCCCcCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChH
Q psy16052 306 AFLYDSTITAPLSNPPLWPYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGD 385 (476)
Q Consensus 306 G~~Ydss~~~~~~~~p~~pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~ 385 (476)
++.---++..-.+..+.+|+.....+. . ..++-. |. + ....+
T Consensus 200 ~~d~Iqtitaq~G~~kv~pf~~~~dL~---------------k-~~~~R~---------F~---------~----~~~~~ 241 (425)
T PF15421_consen 200 VYDPIQTITAQSGAIKVYPFKVNGDLE---------------K-VVIERG---------FY---------D----SPDND 241 (425)
T ss_dssp TSTT--EEEESSS-B---TTT--S--T---------------T-EEEE-B---------------------------THH
T ss_pred cCCchhhhHhhcCCcceeccccCCchh---------------h-heeeee---------ec---------C----CcchH
Confidence 221111233222344677776443220 0 111111 10 0 11234
Q ss_pred HHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhh-----CCCEEEEeHHHHHHHh
Q psy16052 386 QFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIER-----HKDVYFVTMTQVIQWI 451 (476)
Q Consensus 386 ~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~-----~~dV~fvT~~qi~~w~ 451 (476)
++.+.+.......- .+|.-..|..|.. ...+.. ||.||... .+.|||.+.+|+-++.
T Consensus 242 ~~k~~I~~~l~~p~-e~r~ai~iG~h~t-----d~~W~~---fl~wLn~~YG~~G~D~lWfp~~eEyyEY~ 303 (425)
T PF15421_consen 242 QYKQDIEDQLKNPV-EEREAINIGVHGT-----DSNWED---FLLWLNDTYGKDGDDSLWFPSQEEYYEYN 303 (425)
T ss_dssp HHHHHHHHHHTS-G-GG---EEEEESS-------HHHHH---HHHHHHHHTSTTTT--EEEE-HHHHHHHH
T ss_pred HHHHHHHHHhccch-hchhheeecccCC-----CCcHHH---HHHHHHHHhCcCCCcccccCCHHHHHHHH
Confidence 56555555444322 3466678999942 233434 66666433 4669999999999843
No 29
>smart00192 LDLa Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
Probab=92.13 E-value=0.08 Score=34.46 Aligned_cols=28 Identities=21% Similarity=0.495 Sum_probs=22.8
Q ss_pred cccCCCCcccccccccccCCcccccccccccc
Q psy16052 58 AIRCPAGLYFDIEKQTCDWKGAVKNCKLKNKE 89 (476)
Q Consensus 58 ~~~C~~g~~~~~~~~~C~~~~~~~~C~~~~de 89 (476)
+|+|.+|.++. ..++| +++.+|.+++||
T Consensus 6 ~f~C~~~~Ci~-~~~~C---dg~~dC~dgsDE 33 (33)
T smart00192 6 EFQCDNGRCIP-LSWVC---DGVDDCSDGSDE 33 (33)
T ss_pred eEECCCCCEEC-chhhC---CCcCcCcCCCCC
Confidence 89999887554 55677 888999999987
No 30
>COG2861 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=81.38 E-value=53 Score=31.67 Aligned_cols=204 Identities=15% Similarity=0.183 Sum_probs=103.6
Q ss_pred CCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEe--cccCCChHHHHHHHHcCCeeeeCCCC----CCCCc-
Q psy16052 178 PQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFV--SHKYTNYSAVQELHRKGHEVAVHSIT----HNDDE- 250 (476)
Q Consensus 178 Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv--~~~~~~~~~v~~l~~~GHEIg~Hs~~----H~~~~- 250 (476)
+-.++|-.||-........ ++++- +...|.-+ .+.+. ....+++++.||||-.|.-- +....
T Consensus 28 ~~klaIVIddlG~~~~~~~--~i~~L--------p~eVTlA~~P~~~~~-~~~~~~A~~~G~evlih~PmeP~~~~~~e~ 96 (250)
T COG2861 28 GPKLAIVIDDLGLSQTGTQ--AILAL--------PPEVTLAFAPFAPHA-REWAQKARNAGHEVLIHMPMEPFSYPKIEP 96 (250)
T ss_pred CceEEEEECCccccHHHHH--HHHhC--------CccceEEecCCCchh-HHHHHHHHhcCCEEEEeccCCcccCCCCCC
Confidence 4468888888665555443 44442 44444444 44432 26788999999999999732 22111
Q ss_pred -cccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCC-cccCch--H---HHHHHHHcCCeEeeecccCCCCCCCC
Q psy16052 251 -NFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPY-LRVGGN--N---QFTMMEEQAFLYDSTITAPLSNPPLW 323 (476)
Q Consensus 251 -~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~-l~~~~~--~---~~~~l~~~G~~Ydss~~~~~~~~p~~ 323 (476)
.....++.++..+.|.++...+. . .+|+.--- .+...+ . ..+.|++.|.-|--|-...
T Consensus 97 gtL~~~~s~~e~~~rl~~a~~~v~------~--~~GlnNhmGs~~tsn~~aM~~~m~~Lk~r~l~flDs~T~a------- 161 (250)
T COG2861 97 GTLRPGMSAEEILRRLRKAMNKVP------D--AVGLNNHMGSRFTSNEDAMEKLMEALKERGLYFLDSGTIA------- 161 (250)
T ss_pred CCcccCCCHHHHHHHHHHHHhhCc------c--ceeehhhhhhhhcCcHHHHHHHHHHHHHCCeEEEcccccc-------
Confidence 11124667777777766554443 1 12221110 111112 2 4556667665332222110
Q ss_pred CccccccCCCCcCCCCcCCCCCCCCeEEEeccccccCCCCCCCCCCCCCcccCCCCCCCChHHHHHHHHHHHHHHhccCC
Q psy16052 324 PYTMYFRMPHRCHGNLQHCPTRSHAVWEMVMNELDRREDPNFDEYLPGCAMVDSCSNILTGDQFYNFLNHNFDRHYDQNR 403 (476)
Q Consensus 324 pytl~~~~~~~c~~~~~~cp~~~~~lweiP~~~~~~~~~p~~~~~~~g~~~~d~~~~~~~~~~~~~~l~~~f~~~y~~~r 403 (476)
.++--.+ ..=.+||+-. ++ .| +| +..+...+.+.|.....- -+.|+
T Consensus 162 -~S~a~~i---------------Ak~~gVp~~~--rd---vf---------LD---~e~~~~~V~kql~~~~~~-Ark~G 207 (250)
T COG2861 162 -NSLAGKI---------------AKEIGVPVIK--RD---VF---------LD---DEDTEAAVLKQLDAAEKL-ARKNG 207 (250)
T ss_pred -cchhhhh---------------HhhcCCceee--ee---ee---------ec---CcCCHHHHHHHHHHHHHH-HHhcC
Confidence 0000000 0011222211 10 01 11 123556666666543332 23345
Q ss_pred cceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHH
Q psy16052 404 APLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQ 449 (476)
Q Consensus 404 ~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~ 449 (476)
...+|. | + .+.-+.+++++++.+.+ +++-+|.++.|+.
T Consensus 208 ~ai~IG-h-~----~~~Tv~vl~~~~~~l~~--~gIelV~~s~L~~ 245 (250)
T COG2861 208 SAIGIG-H-P----HKNTVAVLQQWLDELPA--RGIELVPVSALLG 245 (250)
T ss_pred ceEEec-C-C----chhHHHHHHHHHHhCCC--CCeEEecHHHhhc
Confidence 444443 4 3 36789999999999863 8999999999975
No 31
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=77.92 E-value=1.6 Score=39.90 Aligned_cols=31 Identities=39% Similarity=0.622 Sum_probs=26.1
Q ss_pred CCceecCCCC-c-cCCCcccCCCCCCCCCCCCC
Q psy16052 105 DGFLACGDGA-C-IERGLFCNGEKDCNDGSDEN 135 (476)
Q Consensus 105 ~~~~~C~~g~-C-i~~~~~C~g~~dC~d~sDe~ 135 (476)
.+.|.|-+|. = |+...+.|+.=||.|||||-
T Consensus 35 ~~~f~Cl~~~~~~I~~~~iNDdyCDC~DGSDEP 67 (176)
T PF12999_consen 35 NGKFTCLDGSKIVIPFSQINDDYCDCPDGSDEP 67 (176)
T ss_pred CCceEecCCCCceecHHHccCcceeCCCCCCcc
Confidence 4579997763 3 79999999999999999993
No 32
>KOG2397|consensus
Probab=75.32 E-value=2.1 Score=44.81 Aligned_cols=49 Identities=29% Similarity=0.477 Sum_probs=39.1
Q ss_pred cccccccccccCCCCCCCCCCCCCCceecCC----CCccCCCcccCCCCCCCCCCCCCCCC
Q psy16052 82 NCKLKNKERKVKPLLYTDEPLCQDGFLACGD----GACIERGLFCNGEKDCNDGSDENTCD 138 (476)
Q Consensus 82 ~C~~~~de~~c~~~~~~~~~~C~~~~~~C~~----g~Ci~~~~~C~g~~dC~d~sDe~~c~ 138 (476)
||.|++||. + ...|+.+.|.|.| ..=|+..-+=||.-||-|+|||..-.
T Consensus 65 DC~DGsDEP--G------tsACpngkF~C~N~G~~p~~i~ssrV~DGICDCCDgSDE~~Sg 117 (480)
T KOG2397|consen 65 DCLDGSDEP--G------TSACPNGKFYCVNQGHQPKYIPSSRVNDGICDCCDGSDEYLSG 117 (480)
T ss_pred cCCCCCCCC--c------cccCCCCceeeeecCCCceeeechhccCcccccccCCCCccCC
Confidence 689999984 2 1359999999965 36778888999999999999995443
No 33
>COG4878 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=68.44 E-value=1.1e+02 Score=30.09 Aligned_cols=74 Identities=16% Similarity=0.320 Sum_probs=46.7
Q ss_pred HHHHHcCCeeeeCCCCCCCCccccCCCCHHHHHHH--HHHHHHHHHHhh---cccCCcccEEecCCcccCchHHHHHHHH
Q psy16052 230 QELHRKGHEVAVHSITHNDDENFWSNATVDDWAKE--MAGMRIIAEKYA---NLSDNSVVGVRAPYLRVGGNNQFTMMEE 304 (476)
Q Consensus 230 ~~l~~~GHEIg~Hs~~H~~~~~~~~~~s~e~~~~e--i~~~~~~l~~~~---~i~g~~~~gfR~P~l~~~~~~~~~~l~~ 304 (476)
+++...|-|+|.|||.|..++. ...++-|+.| |.++...|..+. -++|....-|-+|-........+.+++.
T Consensus 52 ~el~~~gge~gIhgYNh~sl~p---k~~~knw~see~m~ka~~~L~~y~~~ls~~~y~~~vfVppSnil~q~gk~alvk~ 128 (309)
T COG4878 52 TELTFDGGEKGIHGYNHKSLQP---KDILKNWESEEIMNKANYTLADYGDILSITGYDTFVFVPPSNILLQKGKFALVKQ 128 (309)
T ss_pred cEEEecCCceeEeeccCCCCCc---cccccchhhHHHHHHHHHHHHHHHHHHhccccceEEEeCcccccchhHHHHHHHh
Confidence 3456688999999999998531 2233334332 455555555442 2456678888999887777656666665
Q ss_pred cC
Q psy16052 305 QA 306 (476)
Q Consensus 305 ~G 306 (476)
..
T Consensus 129 ~p 130 (309)
T COG4878 129 AP 130 (309)
T ss_pred CC
Confidence 43
No 34
>KOG3509|consensus
Probab=63.46 E-value=13 Score=42.64 Aligned_cols=79 Identities=23% Similarity=0.435 Sum_probs=56.6
Q ss_pred CcccccCCCCcccccccccccCCcccccccccccccccCCCCCCCCCCCCCCceecCCC-CccCCCcccCCCCCCCCCCC
Q psy16052 55 GLQAIRCPAGLYFDIEKQTCDWKGAVKNCKLKNKERKVKPLLYTDEPLCQDGFLACGDG-ACIERGLFCNGEKDCNDGSD 133 (476)
Q Consensus 55 ~~~~~~C~~g~~~~~~~~~C~~~~~~~~C~~~~de~~c~~~~~~~~~~C~~~~~~C~~g-~Ci~~~~~C~g~~dC~d~sD 133 (476)
++++++|.++.+ -.....| +..-.|...+.+..|.... ....|.+.++.|-+- +|-.++..|+|.+||.|+++
T Consensus 33 ~p~~~~~~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~--~~s~~~~~~~~c~~~~~~~~~~~~~~g~~~~~~~~~ 106 (964)
T KOG3509|consen 33 SPNEFKCNNPRC-VQPEALL---DADSTCGPNSTPSGCNAKP--SASDCKPTETQCRDRLRCNPQSFQCDGTNDCKDGSD 106 (964)
T ss_pred CcchhccCCccc-cCchhhh---ccccccCCCCCcCCccccc--cccccCCcccccccchhcCCccccccCCCCCCccch
Confidence 378899988743 2233445 3334567777777665332 234577889999776 89999999999999999999
Q ss_pred CCCCCC
Q psy16052 134 ENTCDS 139 (476)
Q Consensus 134 e~~c~~ 139 (476)
|..+..
T Consensus 107 ~~~~~~ 112 (964)
T KOG3509|consen 107 EVGCKS 112 (964)
T ss_pred hccccc
Confidence 987754
No 35
>PF07315 DUF1462: Protein of unknown function (DUF1462); InterPro: IPR009190 There are currently no experimental data for members of this group of bacterial proteins or their homologues. A crystal structure of Q7A6J8 from SWISSPROT revealed a thioredoxin-like fold, its core consisting of three layers alpha/beta/alpha.; PDB: 1XG8_A.
Probab=62.02 E-value=27 Score=28.26 Aligned_cols=53 Identities=26% Similarity=0.554 Sum_probs=34.5
Q ss_pred CCCCCCCChHHHHHHHHHHHHHHhccCCcceEE-eecchhhcCChHHHHHHHHHHHHHHh
Q psy16052 376 DSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGL-YFHAAWLKNNPEFLDAFLYWVDEIIE 434 (476)
Q Consensus 376 d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i-~~H~~w~~~~p~~~~al~~fl~~i~~ 434 (476)
.||.+..|..+++++|..++.|.|-. .||.+ |+. +. .|..-+.-++|++.|..
T Consensus 10 ASCVn~PsSkeTyeWL~aal~RKyp~--~~f~~~YiD---i~-~p~~~~~~~~~a~~I~e 63 (93)
T PF07315_consen 10 ASCVNAPSSKETYEWLEAALKRKYPD--QPFEFTYID---IE-NPPENDHDQQFAERILE 63 (93)
T ss_dssp GGGSSS--HHHHHHHHHHHHHHH-TT--S-EEEEEEE---TT-T----HHHHHHHHHHHT
T ss_pred hhhcCCCCchhHHHHHHHHHhCcCCC--CceEEEEEe---cC-CCCccHHHHHHHHHHHh
Confidence 57888899999999999999999874 57755 333 22 23333677889998874
No 36
>COG3603 Uncharacterized conserved protein [Function unknown]
Probab=46.50 E-value=58 Score=27.84 Aligned_cols=67 Identities=22% Similarity=0.285 Sum_probs=38.2
Q ss_pred CCCCCCCCCCCCCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccCCCh---------HHHHHHHHcCC
Q psy16052 167 YIPGDLPSKEVPQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY---------SAVQELHRKGH 237 (476)
Q Consensus 167 ~iP~~l~~~~~Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~---------~~v~~l~~~GH 237 (476)
++|...... +....|+|+.-..-.-...+..+++..+.| +| .-|-++.--++| ..++.|..+||
T Consensus 52 ~vp~~V~~~--~GW~~lk~~gpf~FgltGilasV~~pLsd~----gi-gIFavStydtDhiLVr~~dLekAv~~L~eaGh 124 (128)
T COG3603 52 RVPDVVQIE--KGWSCLKFEGPFDFGLTGILASVSQPLSDN----GI-GIFAVSTYDTDHILVREEDLEKAVKALEEAGH 124 (128)
T ss_pred cCCcceEec--CCeEEEEEeccccCCcchhhhhhhhhHhhC----Cc-cEEEEEeccCceEEEehhhHHHHHHHHHHcCC
Confidence 455544333 578889998766544444444444432222 22 234444433343 67899999999
Q ss_pred eee
Q psy16052 238 EVA 240 (476)
Q Consensus 238 EIg 240 (476)
||.
T Consensus 125 ev~ 127 (128)
T COG3603 125 EVL 127 (128)
T ss_pred ccc
Confidence 985
No 37
>cd02995 PDI_a_PDI_a'_C PDIa family, C-terminal TRX domain (a') subfamily; composed of the C-terminal redox active a' domains of PDI, ERp72, ERp57 (or ERp60) and EFP1. PDI, ERp72 and ERp57 are endoplasmic reticulum (ER)-resident eukaryotic proteins involved in oxidative protein folding. They are oxidases, catalyzing the formation of disulfide bonds of newly synthesized polypeptides in the ER. They also exhibit reductase activity in acting as isomerases to correct any non-native disulfide bonds, as well as chaperone activity to prevent protein aggregation and facilitate the folding of newly synthesized proteins. PDI and ERp57 have the abb'a' domain structure (where a and a' are redox active TRX domains while b and b' are redox inactive TRX-like domains). PDI also contains an acidic region (c domain) after the a' domain that is absent in ERp57. ERp72 has an additional a domain at the N-terminus (a"abb'a' domain structure). ERp57 interacts with the lectin chaperones, calnexin and calreticu
Probab=43.38 E-value=48 Score=26.41 Aligned_cols=51 Identities=16% Similarity=0.241 Sum_probs=32.7
Q ss_pred HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHH
Q psy16052 394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMT 445 (476)
Q Consensus 394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~ 445 (476)
+|+......+.+++|.|+++|-..-......|+++.+.+.. ...+-|+...
T Consensus 9 ~f~~~i~~~~~~~~v~f~~~~C~~C~~~~~~~~~~~~~~~~-~~~~~~~~id 59 (104)
T cd02995 9 NFDEVVLDSDKDVLVEFYAPWCGHCKALAPIYEELAEKLKG-DDNVVIAKMD 59 (104)
T ss_pred hhHHHHhCCCCcEEEEEECCCCHHHHHHhhHHHHHHHHhcC-CCCEEEEEEe
Confidence 45555555557899999999865333456677777666543 4567776543
No 38
>PF01074 Glyco_hydro_38: Glycosyl hydrolases family 38 N-terminal domain; InterPro: IPR000602 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 38 GH38 from CAZY comprises enzymes with only one known activity; alpha-mannosidase (3.2.1.24 from EC) (3.2.1.114 from EC). Lysosomal alpha-mannosidase is necessary for the catabolism of N-linked carbohydrates released during glycoprotein turnover. The enzyme catalyses the hydrolysis of terminal, non-reducing alpha-D-mannose residues in alpha-D-mannosides, and can cleave all known types of alpha-mannosidic linkages. Defects in the gene cause lysosomal alpha-mannosidosis (AM), a lysosomal storage disease characterised by the accumulation of unbranched oligo-saccharide chains.; GO: 0004559 alpha-mannosidase activity, 0005975 carbohydrate metabolic process; PDB: 2WYI_A 2WYH_A 1O7D_A 3CZN_A 2FYV_A 3D50_A 3EJU_A 3EJS_A 3DX3_A 3BVX_A ....
Probab=42.86 E-value=1.2e+02 Score=29.55 Aligned_cols=80 Identities=9% Similarity=0.098 Sum_probs=50.1
Q ss_pred HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEec-CCcccCchHHHHHHHH
Q psy16052 227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRA-PYLRVGGNNQFTMMEE 304 (476)
Q Consensus 227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~-P~l~~~~~~~~~~l~~ 304 (476)
..+++++++|- ||..=+|.-.+. .-.+.|.+.+.|...++.+++ .......++|-+ +|+.... +-++|+.
T Consensus 64 ~~~~~lv~~Gri~~vgg~~~~~D~----~l~~~Eslirql~~G~~~~~~--~fg~~~~~~~~~D~FG~~~~--lP~il~~ 135 (275)
T PF01074_consen 64 KRIKKLVKEGRIEIVGGWYVQPDE----NLPSGESLIRQLLYGHKYLRK--EFGVRPKVAWQPDSFGHSAQ--LPQILKQ 135 (275)
T ss_dssp HHHHHHHHTTSEEESSSBSS-B-S----SSS-HHHHHHHHHHHHHHHHH--HHTG--SEEEESSSSSB-TC--HHHHHHT
T ss_pred HHHHHHHHhceeEEeCceeeeccc----cCCCHHHHHHHHhhhHHHHHH--hcCCCCCeEEeCCCCCCchh--hHHHHhc
Confidence 56688999995 543333433332 236789999999999997764 222334466666 7777543 7899999
Q ss_pred cCCeEeeecc
Q psy16052 305 QAFLYDSTIT 314 (476)
Q Consensus 305 ~G~~Ydss~~ 314 (476)
.|+.|-....
T Consensus 136 ~Gi~~~v~~r 145 (275)
T PF01074_consen 136 FGIKYFVIWR 145 (275)
T ss_dssp TT-SEEEESS
T ss_pred cCcceEEEec
Confidence 9998766654
No 39
>COG4837 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=36.92 E-value=69 Score=26.14 Aligned_cols=53 Identities=28% Similarity=0.562 Sum_probs=39.6
Q ss_pred CCCCCCCChHHHHHHHHHHHHHHhccCCcceEE-eecchhhcCChHHHHHHHHHHHHHHh
Q psy16052 376 DSCSNILTGDQFYNFLNHNFDRHYDQNRAPLGL-YFHAAWLKNNPEFLDAFLYWVDEIIE 434 (476)
Q Consensus 376 d~~~~~~~~~~~~~~l~~~f~~~y~~~r~p~~i-~~H~~w~~~~p~~~~al~~fl~~i~~ 434 (476)
.||.+..|..+++++|...+.|.|- ..||.+ |+. +. +|-.-+...+|++.|.+
T Consensus 17 ASCV~aPtsKdt~eWLeaalkRKyp--~~~F~~~YiD---I~-n~~~e~~~~~~aekI~~ 70 (106)
T COG4837 17 ASCVNAPTSKDTYEWLEAALKRKYP--NQPFKYTYID---IT-NPPLEDHDLQFAEKIEQ 70 (106)
T ss_pred HHhcCCCcchhHHHHHHHHHhccCC--CCCcEEEEEE---cC-CCccHHHHHHHHHHHhc
Confidence 5788888999999999999998886 367754 443 22 34466777888888874
No 40
>COG3536 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=36.52 E-value=22 Score=29.75 Aligned_cols=81 Identities=20% Similarity=0.229 Sum_probs=45.3
Q ss_pred CCCCCCCCCCCCCCeEEEeecCCCCCccHHHHHHHHhccCCCCCCCCceEEEEecccC----------CChHHHHHHHHc
Q psy16052 166 TYIPGDLPSKEVPQMITITFDDAINNNNIALYKEMFNGKRKNPNGCDIKATFFVSHKY----------TNYSAVQELHRK 235 (476)
Q Consensus 166 ~~iP~~l~~~~~Pq~V~lTFDdg~~~~~~~~~l~ll~~~~~np~~c~i~aTFFv~~~~----------~~~~~v~~l~~~ 235 (476)
+.+|..+....--...++||+||-.......++.++-....- -|++..--.-|.|+- -| -.|+-++..
T Consensus 2 ~~~P~~lrvsk~r~~L~l~f~dg~~~~L~ae~LRv~sPsAEv-qghgp~q~v~v~gkr~V~i~~i~p~Gn-Yavri~FdD 79 (120)
T COG3536 2 TQWPTELRVSKDRKVLSLTFADGAPFRLPAEFLRVLSPSAEV-QGHGPGQRVLVPGKRNVQIRDIEPVGN-YAVRILFDD 79 (120)
T ss_pred CCCCceeEEeccCcEEEEEeCCCCcccCCHHHHHhcCchHHh-cccCCCCceeeeccccceEEEeEecCc-eEEEEEecC
Confidence 455666666555678999999976544444566665431110 012221111122211 12 246778899
Q ss_pred CCeeeeCCCCCCC
Q psy16052 236 GHEVAVHSITHND 248 (476)
Q Consensus 236 GHEIg~Hs~~H~~ 248 (476)
||+-|.++|+...
T Consensus 80 gHDsGiy~W~YL~ 92 (120)
T COG3536 80 GHDSGIYTWAYLH 92 (120)
T ss_pred CcccCcccHHHHH
Confidence 9999999987543
No 41
>COG1543 Uncharacterized conserved protein [Function unknown]
Probab=36.39 E-value=89 Score=33.13 Aligned_cols=76 Identities=16% Similarity=0.130 Sum_probs=59.6
Q ss_pred HHHHHHHcC-CeeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEecCCcccCchHHHHHHHHcC
Q psy16052 228 AVQELHRKG-HEVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRAPYLRVGGNNQFTMMEEQA 306 (476)
Q Consensus 228 ~v~~l~~~G-HEIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~P~l~~~~~~~~~~l~~~G 306 (476)
..+.+.+.| -||-.-+.+|-.++-+ +. .+.+.++|..+.+..++ ..|..|.||-.|-..+... .-.+|++.|
T Consensus 128 ~f~~~~~~g~ieilts~athg~lPll--~~-peAi~AQi~~g~~~ye~---~fg~~P~GiWlPEcay~pg-ie~~l~~~G 200 (504)
T COG1543 128 AFKELQRSGGIEILTSAATHGYLPLL--GG-PEAIEAQILTGIELYEE---HFGLAPKGIWLPECAYAPG-IERILKDAG 200 (504)
T ss_pred HHHHHHHcCCceeeeehhhheehhhc--CC-chhhHHHHHHHHHHHHH---HhCCCCCceechhhccccc-hHHHHHhcC
Confidence 456666666 5998888899877642 23 78888999988888874 5667799999998888875 778999999
Q ss_pred CeEe
Q psy16052 307 FLYD 310 (476)
Q Consensus 307 ~~Yd 310 (476)
+.|-
T Consensus 201 i~yf 204 (504)
T COG1543 201 IEYF 204 (504)
T ss_pred ceEE
Confidence 9883
No 42
>PRK05406 LamB/YcsF family protein; Provisional
Probab=35.60 E-value=2.9e+02 Score=26.81 Aligned_cols=82 Identities=15% Similarity=0.116 Sum_probs=53.6
Q ss_pred HHHHHHHHhccCCCCCCCCceEEEEecccCCCh----HHHHHHHHcCCeeeeCCCCCCCCcccc---CCCCHHHHHHHHH
Q psy16052 194 IALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY----SAVQELHRKGHEVAVHSITHNDDENFW---SNATVDDWAKEMA 266 (476)
Q Consensus 194 ~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~----~~v~~l~~~GHEIg~Hs~~H~~~~~~~---~~~s~e~~~~ei~ 266 (476)
...++.++... | --|+.+| .++ ..|+...+.|-.||.|- +-.+...|- -.++.+++++.+.
T Consensus 22 D~~lmp~IssA--N-IACG~HA--------GDp~~M~~tv~lA~~~gV~IGAHP-gypD~~gFGRR~m~~s~~el~~~v~ 89 (246)
T PRK05406 22 DEALLPLVTSA--N-IACGFHA--------GDPAVMRRTVRLAKENGVAIGAHP-GYPDLEGFGRRNMDLSPEELYALVL 89 (246)
T ss_pred HHHHHHHhhhH--H-HhccccC--------CCHHHHHHHHHHHHHcCCeEccCC-CCCccCCCCCCCCCCCHHHHHHHHH
Confidence 45677777660 0 0144444 444 56677788999999985 222222221 1589999999999
Q ss_pred HHHHHHHHhhcccCCcccEEe
Q psy16052 267 GMRIIAEKYANLSDNSVVGVR 287 (476)
Q Consensus 267 ~~~~~l~~~~~i~g~~~~gfR 287 (476)
.+...|+.++...|.++....
T Consensus 90 yQigAL~~~a~~~g~~l~hVK 110 (246)
T PRK05406 90 YQIGALQAIARAAGGRVSHVK 110 (246)
T ss_pred HHHHHHHHHHHHcCCeeEEeC
Confidence 999999988777776555554
No 43
>PTZ00062 glutaredoxin; Provisional
Probab=33.30 E-value=1.1e+02 Score=28.74 Aligned_cols=41 Identities=15% Similarity=0.261 Sum_probs=28.1
Q ss_pred cCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHH
Q psy16052 401 QNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMT 445 (476)
Q Consensus 401 ~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~ 445 (476)
++....+++||++|-. + -+.+..+|..+.+.+++|-|+...
T Consensus 15 ~~~g~~vl~f~a~w~~--~--C~~m~~vl~~l~~~~~~~~F~~V~ 55 (204)
T PTZ00062 15 SNTGKLVLYVKSSKEP--E--YEQLMDVCNALVEDFPSLEFYVVN 55 (204)
T ss_pred cCCCcEEEEEeCCCCc--c--hHHHHHHHHHHHHHCCCcEEEEEc
Confidence 3446689999999964 2 335555666665557888888764
No 44
>PRK12569 hypothetical protein; Provisional
Probab=33.14 E-value=3.2e+02 Score=26.55 Aligned_cols=83 Identities=13% Similarity=0.109 Sum_probs=54.1
Q ss_pred HHHHHHHHhccCCCCCCCCceEEEEecccCCCh----HHHHHHHHcCCeeeeCCCCCCCCcccc---CCCCHHHHHHHHH
Q psy16052 194 IALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY----SAVQELHRKGHEVAVHSITHNDDENFW---SNATVDDWAKEMA 266 (476)
Q Consensus 194 ~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~----~~v~~l~~~GHEIg~Hs~~H~~~~~~~---~~~s~e~~~~ei~ 266 (476)
.+.++.++... | --|+.+| .++ ..|+...+.|-.||.|- +-.+...|- -.++.++.++.+.
T Consensus 25 D~~lmp~Itsa--N-IACG~HA--------GDp~~M~~tv~lA~~~~V~IGAHP-syPD~~gFGRr~m~~s~~el~~~v~ 92 (245)
T PRK12569 25 DEALMPLISSA--N-IATGFHA--------GDPNIMRRTVELAKAHGVGIGAHP-GFRDLVGFGRRHINASPQELVNDVL 92 (245)
T ss_pred HHHHHHHhhhH--H-HhccccC--------CCHHHHHHHHHHHHHcCCEeccCC-CCCcCCCCCCCCCCCCHHHHHHHHH
Confidence 35677777660 0 0144444 444 56677788999999986 223322221 1589999999999
Q ss_pred HHHHHHHHhhcccCCcccEEec
Q psy16052 267 GMRIIAEKYANLSDNSVVGVRA 288 (476)
Q Consensus 267 ~~~~~l~~~~~i~g~~~~gfR~ 288 (476)
.+...|+.++...|.++....+
T Consensus 93 yQigaL~~~~~~~g~~l~hVKP 114 (245)
T PRK12569 93 YQLGALREFARAHGVRLQHVKP 114 (245)
T ss_pred HHHHHHHHHHHHcCCeeEEecC
Confidence 9999999887777766555543
No 45
>cd02992 PDI_a_QSOX PDIa family, Quiescin-sulfhydryl oxidase (QSOX) subfamily; QSOX is a eukaryotic protein containing an N-terminal redox active TRX domain, similar to that of PDI, and a small C-terminal flavin adenine dinucleotide (FAD)-binding domain homologous to the yeast ERV1p protein. QSOX oxidizes thiol groups to disulfides like PDI, however, unlike PDI, this oxidation is accompanied by the reduction of oxygen to hydrogen peroxide. QSOX is localized in high concentrations in cells with heavy secretory load and prefers peptides and proteins as substrates, not monothiols like glutathione. Inside the cell, QSOX is found in the endoplasmic reticulum and Golgi. The flow of reducing equivalents in a QSOX-catalyzed reaction goes from the dithiol substrate - dithiol of the QSOX TRX domain - dithiols of the QSOX ERV1p domain - FAD - oxygen.
Probab=31.94 E-value=1e+02 Score=25.68 Aligned_cols=50 Identities=12% Similarity=0.213 Sum_probs=30.6
Q ss_pred HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCC-EEEEeH
Q psy16052 394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKD-VYFVTM 444 (476)
Q Consensus 394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~d-V~fvT~ 444 (476)
+|......++.|+++.||++|-..-......|+++...+.. .++ |.|+..
T Consensus 10 ~f~~~i~~~~~~vvV~f~a~wC~~C~~~~~~~~~la~~~~~-~~~~v~~~~v 60 (114)
T cd02992 10 SFNSALLGSPSAWLVEFYASWCGHCRAFAPTWKKLARDLRK-WRPVVRVAAV 60 (114)
T ss_pred hHHHHHhcCCCeEEEEEECCCCHHHHHHhHHHHHHHHHHHh-cCCceEEEEE
Confidence 34444455567899999999854223355667776666544 333 666543
No 46
>COG1540 Uncharacterized proteins, homologs of lactam utilization protein B [General function prediction only]
Probab=31.59 E-value=4.9e+02 Score=25.19 Aligned_cols=83 Identities=13% Similarity=0.117 Sum_probs=55.6
Q ss_pred HHHHHHHHhccCCCCCCCCceEEEEecccCCCh----HHHHHHHHcCCeeeeCCCCCCCCcccc---CCCCHHHHHHHHH
Q psy16052 194 IALYKEMFNGKRKNPNGCDIKATFFVSHKYTNY----SAVQELHRKGHEVAVHSITHNDDENFW---SNATVDDWAKEMA 266 (476)
Q Consensus 194 ~~~~l~ll~~~~~np~~c~i~aTFFv~~~~~~~----~~v~~l~~~GHEIg~Hs~~H~~~~~~~---~~~s~e~~~~ei~ 266 (476)
.+.++.++... | --|+.+| .++ ..|+...+.|--||.|- .-.+...|- -.++.++....+.
T Consensus 22 De~~l~lvsSA--N-IACGfHA--------GDp~~M~rtV~lA~e~gV~IGAHP-gyPDl~gFGRr~m~~~~~e~~a~~l 89 (252)
T COG1540 22 DEALLPLVSSA--N-IACGFHA--------GDPLTMRRTVRLAKENGVAIGAHP-GYPDLVGFGRREMALSPEELYAQVL 89 (252)
T ss_pred cHHHHHHHhhh--h-HhhcccC--------CCHHHHHHHHHHHHHcCCeeccCC-CCccccccCccccCCCHHHHHHHHH
Confidence 35677777651 1 1255555 333 56777888999999885 222222221 1578999999999
Q ss_pred HHHHHHHHhhcccCCcccEEec
Q psy16052 267 GMRIIAEKYANLSDNSVVGVRA 288 (476)
Q Consensus 267 ~~~~~l~~~~~i~g~~~~gfR~ 288 (476)
-+.-.|..|+...|..+..+.+
T Consensus 90 YQiGAL~a~~~a~G~~~~hVKp 111 (252)
T COG1540 90 YQIGALQAFARAQGGVVQHVKP 111 (252)
T ss_pred HHHHHHHHHHHhcCCeEEEecc
Confidence 9999999888888877766654
No 47
>KOG1958|consensus
Probab=31.01 E-value=9.1e+02 Score=28.19 Aligned_cols=44 Identities=16% Similarity=0.204 Sum_probs=35.0
Q ss_pred HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHH
Q psy16052 227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEK 274 (476)
Q Consensus 227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~ 274 (476)
..++.+..+|- ||..-||.-.+. .+..+-...+++.....+|+.
T Consensus 229 ~a~k~lv~~GqlEIvtGGWVMpDE----An~Hy~~~i~qliEGh~Wl~~ 273 (1129)
T KOG1958|consen 229 NAVKRLVKNGQLEIVTGGWVMPDE----ANSHYFAMIDQLIEGHQWLKN 273 (1129)
T ss_pred HHHHHHHhcCcEEEEeCccccCcc----cchhHHHHHHHHhhhhhHHhc
Confidence 78999999995 999999986653 345566677888888899985
No 48
>cd07286 PX_SNX18 The phosphoinositide binding Phox Homology domain of Sorting Nexin 18. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX18, like SNX9, contains an N-terminal Src Homology 3 (SH3) domain, a PX domain, and a C-terminal Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature. The PX-BAR structural unit helps determine specific membrane localization. SNX18 is localized to peripheral endosomal structures, and acts in a trafficki
Probab=28.31 E-value=1.1e+02 Score=26.58 Aligned_cols=74 Identities=12% Similarity=0.255 Sum_probs=42.0
Q ss_pred HHHHHHHHHHhccCCcc------eEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhhCC-cchhccc
Q psy16052 389 NFLNHNFDRHYDQNRAP------LGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQNP-RTITEVK 461 (476)
Q Consensus 389 ~~l~~~f~~~y~~~r~p------~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~~p-~~~~~~~ 461 (476)
.+|...+.+.|-+--.| .+-.+.+.|+. .|..+|++||..|+. ++-.+ ...++-.++..+ +...+.+
T Consensus 42 ~~L~~~L~~~~p~~~IPpLP~K~~~g~f~~~FIe---~Rr~~Lq~FL~ria~-hp~L~--~s~~~~~FL~~~~~~~~~~~ 115 (127)
T cd07286 42 DWLYARLAEKFPVISVPHIPEKQATGRFEEDFIS---KRRKGLIWWMDHMCS-HPVLA--RCDAFQHFLTCPSTDEKAWK 115 (127)
T ss_pred HHHHHHHHHHCCCcEeCCCcCCCcCCCCCHHHHH---HHHHHHHHHHHHHHc-Ccccc--cChHHHHHhcCCcchhHHHh
Confidence 45555565555432111 12235555543 589999999999997 77666 335555666655 3444444
Q ss_pred cCCCccc
Q psy16052 462 NFDPWRE 468 (476)
Q Consensus 462 ~~~~~~~ 468 (476)
.+..|.+
T Consensus 116 ~~~~~~e 122 (127)
T cd07286 116 QGKRKAE 122 (127)
T ss_pred hcchHhh
Confidence 4444433
No 49
>PF12340 DUF3638: Protein of unknown function (DUF3638); InterPro: IPR022099 This domain family is found in eukaryotes, and is approximately 230 amino acids in length. There are two conserved sequence motifs: LLE and NMG.
Probab=28.09 E-value=3.1e+02 Score=26.36 Aligned_cols=61 Identities=18% Similarity=0.260 Sum_probs=44.7
Q ss_pred HHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHH
Q psy16052 385 DQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQ 449 (476)
Q Consensus 385 ~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~ 449 (476)
++.++.|...|-..+ +|.-+.+.|+ --..-.+..++.+++.++.++. ..+|.++|.+.+..
T Consensus 83 ~q~~~~L~~~lg~l~--~r~i~~lpFs-R~~~~~~~~~~~~~~l~~~~~~-~~gill~~PEhilS 143 (229)
T PF12340_consen 83 EQMRQMLRSRLGGLL--NRRIYHLPFS-RSTPLTPETLEKIRQLLEECMR-SGGILLATPEHILS 143 (229)
T ss_pred HHHHHHHHHHHHHHh--CCeeEEeccc-CCCCCCHHHHHHHHHHHHHHHH-cCCEEEeChHHHHH
Confidence 566777777776544 4555666666 2233356778899999999987 89999999999964
No 50
>KOG1225|consensus
Probab=26.96 E-value=1.6e+02 Score=31.93 Aligned_cols=15 Identities=33% Similarity=0.906 Sum_probs=9.2
Q ss_pred CCCC-cccCCccccCCC
Q psy16052 149 CDPA-VCVLPDCFCSED 164 (476)
Q Consensus 149 C~~~-~C~~p~C~C~~~ 164 (476)
|..+ +|+.. |.|.+.
T Consensus 345 C~~~g~cv~g-C~C~~G 360 (525)
T KOG1225|consen 345 CSGGGQCVNG-CKCKKG 360 (525)
T ss_pred cCCCceeccC-ceeccC
Confidence 5544 56666 777763
No 51
>TIGR01068 thioredoxin thioredoxin. Several proteins, such as protein disulfide isomerase, have two or more copies of a domain closely related to thioredoxin. This model is designed to recognize authentic thioredoxin, a small protein that should be hit exactly once by this model.
Probab=26.05 E-value=1.4e+02 Score=23.33 Aligned_cols=48 Identities=23% Similarity=0.301 Sum_probs=26.5
Q ss_pred HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeH
Q psy16052 394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTM 444 (476)
Q Consensus 394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~ 444 (476)
+|......++.+++|+||++|-..-......|+++.+.+. +++.|+..
T Consensus 5 ~~~~~~~~~~~~vvi~f~~~~C~~C~~~~~~l~~~~~~~~---~~~~~~~v 52 (101)
T TIGR01068 5 NFDETIASSDKPVLVDFWAPWCGPCKMIAPILEELAKEYE---GKVKFVKL 52 (101)
T ss_pred HHHHHHhhcCCcEEEEEECCCCHHHHHhCHHHHHHHHHhc---CCeEEEEE
Confidence 3444444456789999999985422233444555543332 24666554
No 52
>KOG3509|consensus
Probab=25.61 E-value=78 Score=36.72 Aligned_cols=59 Identities=12% Similarity=0.451 Sum_probs=46.6
Q ss_pred cccccccccccccccCCCCCCCCCCCCCCceecCCCCccCCCcccCCCCCCCCCCCCCCCCCC
Q psy16052 78 GAVKNCKLKNKERKVKPLLYTDEPLCQDGFLACGDGACIERGLFCNGEKDCNDGSDENTCDSE 140 (476)
Q Consensus 78 ~~~~~C~~~~de~~c~~~~~~~~~~C~~~~~~C~~g~Ci~~~~~C~g~~dC~d~sDe~~c~~~ 140 (476)
+...||-+.+|..++... ...|++.+++|.+++|.-..+.||.+.+|..+|+..+|...
T Consensus 11 ~~~~~~~~~~~~~~~~~~----~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 69 (964)
T KOG3509|consen 11 DRQPDCRDRSDVANDPAI----GSACSPNEFKCNNPRCVQPEALLDADSTCGPNSTPSGCNAK 69 (964)
T ss_pred ccchhhHhhcccCCCccc----cccCCcchhccCCccccCchhhhccccccCCCCCcCCcccc
Confidence 445566777777665332 24578899999999999999999999999999988888653
No 53
>PTZ00413 lipoate synthase; Provisional
Probab=24.97 E-value=3.1e+02 Score=28.54 Aligned_cols=70 Identities=13% Similarity=0.013 Sum_probs=40.6
Q ss_pred ChHHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHHHHhhCCcchhcc
Q psy16052 383 TGDQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVIQWIQNPRTITEV 460 (476)
Q Consensus 383 ~~~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~~w~~~p~~~~~~ 460 (476)
+.++.++.|+.. ...|.. + +.+--| .+.|-++..+.+.+.|+.+.+ -+|-++|+.|.++=-+.-.|+.++
T Consensus 276 tYe~sLe~Lr~A-Ke~f~~-g--i~tcSG--iIVGLGET~eEvie~m~dLre--lGVDivtIGQYL~Ps~~h~~V~~y 345 (398)
T PTZ00413 276 SYRQSLKVLEHV-KEFTNG-A--MLTKSS--IMLGLGETEEEVRQTLRDLRT--AGVSAVTLGQYLQPTKTRLKVSRY 345 (398)
T ss_pred CHHHHHHHHHHH-HHHhcC-C--ceEeee--eEecCCCCHHHHHHHHHHHHH--cCCcEEeeccccCCCcccCCceec
Confidence 556667777543 233321 1 111112 133445667778888888875 699999999987544444455443
No 54
>TIGR01126 pdi_dom protein disulfide-isomerase domain. This model describes a domain of eukaryotic protein disulfide isomerases, generally found in two copies. The high cutoff for total score reflects the expectation of finding both copies. The domain is similar to thioredoxin but the redox-active disulfide region motif is APWCGHCK.
Probab=24.84 E-value=1.1e+02 Score=24.12 Aligned_cols=49 Identities=16% Similarity=0.255 Sum_probs=28.2
Q ss_pred HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeH
Q psy16052 394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTM 444 (476)
Q Consensus 394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~ 444 (476)
+|+.... .+.+++|.|+++|-..-......|+++.+.+.. .+.|-|+..
T Consensus 5 ~~~~~~~-~~~~~~i~f~~~~C~~c~~~~~~~~~~~~~~~~-~~~~~~~~~ 53 (102)
T TIGR01126 5 NFDDIVL-SNKDVLVEFYAPWCGHCKNLAPEYEKLAKELKG-DPDIVLAKV 53 (102)
T ss_pred hHHHHhc-cCCcEEEEEECCCCHHHHhhChHHHHHHHHhcc-CCceEEEEE
Confidence 4444444 456789999999864322234556666555433 225777643
No 55
>cd02956 ybbN ybbN protein family; ybbN is a hypothetical protein containing a redox-inactive TRX-like domain. Its gene has been sequenced from several gammaproteobacteria and actinobacteria.
Probab=24.49 E-value=1.2e+02 Score=23.92 Aligned_cols=47 Identities=15% Similarity=0.218 Sum_probs=27.6
Q ss_pred HHHHhc-cCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeH
Q psy16052 395 FDRHYD-QNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTM 444 (476)
Q Consensus 395 f~~~y~-~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~ 444 (476)
|++... ..+.|++|.|+++|-..-......|+++.+... +.+.|+..
T Consensus 3 f~~~i~~~~~~~vlv~f~a~wC~~C~~~~~~~~~~~~~~~---~~~~~~~v 50 (96)
T cd02956 3 FQQVLQESTQVPVVVDFWAPRSPPSKELLPLLERLAEEYQ---GQFVLAKV 50 (96)
T ss_pred hHHHHHhcCCCeEEEEEECCCChHHHHHHHHHHHHHHHhC---CcEEEEEE
Confidence 455553 336789999999996533334555666655542 23544443
No 56
>PF07579 DUF1548: Domain of Unknown Function (DUF1548); InterPro: IPR013044 This domain is found in a small number of Chlamydia proteins of unknown function. It occurs together with IPR011436 from INTERPRO.
Probab=24.11 E-value=33 Score=29.85 Aligned_cols=48 Identities=21% Similarity=0.231 Sum_probs=38.8
Q ss_pred HHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHH
Q psy16052 385 DQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEI 432 (476)
Q Consensus 385 ~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i 432 (476)
..-|+.+...|...|...+..++-++|-..+.+.|+...++..||=..
T Consensus 47 ~~~Y~~l~~~F~~~Y~~S~~~LI~~i~~q~~~sspe~~~~i~~yll~~ 94 (135)
T PF07579_consen 47 HEKYDRLFNQFLNAYRNSGSNLINYILNQILTSSPEQKAAIRNYLLDD 94 (135)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Confidence 445777888999999887888999999888888899888888876443
No 57
>COG4637 Predicted ATPase [General function prediction only]
Probab=23.50 E-value=1.9e+02 Score=29.30 Aligned_cols=61 Identities=18% Similarity=0.177 Sum_probs=43.3
Q ss_pred HHHHHHHHHHHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeHHHHH
Q psy16052 385 DQFYNFLNHNFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTMTQVI 448 (476)
Q Consensus 385 ~~~~~~l~~~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~~qi~ 448 (476)
+..+++|.. ...+-.-+.|-.|.+..|-.+=.|..+.+|.+++.-... +.-||++|++.=+
T Consensus 273 DGTlRfl~l--~t~Llsp~~p~ll~ldEPE~sLHP~lL~~La~~~~sAak-~sQv~VsTHS~rL 333 (373)
T COG4637 273 DGTLRFLAL--ATLLLSPRPPPLLLLDEPETSLHPDLLPALAELMRSAAK-RSQVIVSTHSPRL 333 (373)
T ss_pred ccHHHHHHH--HHHHcCCCCCceeEecCcccccCHhHHHHHHHHHHHhhc-cceEEEEeCCHHH
Confidence 456677743 233444566666666656455568899999999999876 8899999998643
No 58
>cd02957 Phd_like Phosducin (Phd)-like family; composed of Phd and Phd-like proteins (PhLP), characterized as cytosolic regulators of G protein functions. Phd and PhLPs specifically bind G protein betagamma (Gbg)-subunits with high affinity, resulting in the solubilization of Gbg from the plasma membrane and impeding G protein-mediated signal transduction by inhibiting the formation of a functional G protein trimer (G protein alphabetagamma). Phd also inhibits the GTPase activity of G protein alpha. Phd can be phosphorylated by protein kinase A and G protein-coupled receptor kinase 2, leading to its inactivation. Phd was originally isolated from the retina, where it is highly expressed and has been implicated to play an important role in light adaptation. It is also found in the pineal gland, liver, spleen, striated muscle and the brain. The C-terminal domain of Phd adopts a thioredoxin fold, but it does not contain a CXXC motif. Phd interacts with G protein beta mostly through the N-te
Probab=23.43 E-value=2.1e+02 Score=23.50 Aligned_cols=37 Identities=11% Similarity=0.119 Sum_probs=22.0
Q ss_pred CcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEe
Q psy16052 403 RAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVT 443 (476)
Q Consensus 403 r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT 443 (476)
+.|++++|+++|-. + -.++...|+.+...++++.|+.
T Consensus 24 ~~~vvv~F~a~~c~--~--C~~l~~~l~~la~~~~~v~f~~ 60 (113)
T cd02957 24 GTRVVVHFYEPGFP--R--CKILDSHLEELAAKYPETKFVK 60 (113)
T ss_pred CCEEEEEEeCCCCC--c--HHHHHHHHHHHHHHCCCcEEEE
Confidence 36889999999853 3 2234444444443366775554
No 59
>cd02996 PDI_a_ERp44 PDIa family, endoplasmic reticulum protein 44 (ERp44) subfamily; ERp44 is an ER-resident protein, induced during stress, involved in thiol-mediated ER retention. It contains an N-terminal TRX domain, similar to that of PDIa, with a CXFS motif followed by two redox inactive TRX-like domains, homologous to the b and b' domains of PDI. The CXFS motif in the N-terminal domain allows ERp44 to form stable reversible mixed disulfides with its substrates. Through this activity, ERp44 mediates the ER localization of Ero1alpha, a protein that oxidizes protein disulfide isomerases into their active form. ERp44 also prevents the secretion of unassembled cargo protein with unpaired cysteines. It also modulates the activity of inositol 1,4,5-triphosphate type I receptor (IP3R1), an intracellular channel protein that mediates calcium release from the ER to the cytosol.
Probab=22.05 E-value=1.8e+02 Score=23.54 Aligned_cols=40 Identities=20% Similarity=0.201 Sum_probs=24.0
Q ss_pred HHHHHhccCCcceEEeecchhhcCChHHHHHHHHHHHHHHh
Q psy16052 394 NFDRHYDQNRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIE 434 (476)
Q Consensus 394 ~f~~~y~~~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~ 434 (476)
+|+.+.+. ..+.+|.|+++|-..-......++++.+.+.+
T Consensus 10 ~f~~~i~~-~~~vlv~F~a~wC~~C~~~~p~~~~~a~~~~~ 49 (108)
T cd02996 10 NIDDILQS-AELVLVNFYADWCRFSQMLHPIFEEAAAKIKE 49 (108)
T ss_pred hHHHHHhc-CCEEEEEEECCCCHHHHhhHHHHHHHHHHHhh
Confidence 44444433 34688999999864323345666666666543
No 60
>cd02954 DIM1 Dim1 family; Dim1 is also referred to as U5 small nuclear ribonucleoprotein particle (snRNP)-specific 15kD protein. It is a component of U5 snRNP, which pre-assembles with U4/U6 snRNPs to form a [U4/U6:U5] tri-snRNP complex required for pre-mRNA splicing. Dim1 interacts with multiple splicing-associated proteins, suggesting that it functions at multiple control points in the splicing of pre-mRNA as part of a large spliceosomal complex involving many protein-protein interactions. U5 snRNP contains seven core proteins (common to all snRNPs) and nine U5-specific proteins, one of which is Dim1. Dim1 adopts a thioredoxin fold but does not contain the redox active CXXC motif. It is essential for G2/M phase transition, as a consequence to its role in pre-mRNA splicing.
Probab=22.03 E-value=2.8e+02 Score=23.52 Aligned_cols=46 Identities=11% Similarity=0.051 Sum_probs=27.3
Q ss_pred HHHHHHhcc-CCcceEEeecchhhcCChHHHHHHHHHHHHHHhhCCC-EEEE
Q psy16052 393 HNFDRHYDQ-NRAPLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKD-VYFV 442 (476)
Q Consensus 393 ~~f~~~y~~-~r~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~d-V~fv 442 (476)
++|+..... +..|+++.|+++|- .| -.+|..+|+.+....++ |.|+
T Consensus 3 ~~~~~~i~~~~~~~vVV~F~A~WC--gp--Ck~m~P~le~la~~~~~~v~f~ 50 (114)
T cd02954 3 WAVDQAILSEEEKVVVIRFGRDWD--PV--CMQMDEVLAKIAEDVSNFAVIY 50 (114)
T ss_pred HHHHHHHhccCCCEEEEEEECCCC--hh--HHHHHHHHHHHHHHccCceEEE
Confidence 344444432 46689999999995 34 44555566665543454 4444
No 61
>PLN02701 alpha-mannosidase
Probab=21.58 E-value=2.7e+02 Score=33.14 Aligned_cols=75 Identities=12% Similarity=0.141 Sum_probs=54.8
Q ss_pred HHHHHHHHcCC-eeeeCCCCCCCCccccCCCCHHHHHHHHHHHHHHHHHhhcccCCcccEEec-CCcccCchHHHHHHHH
Q psy16052 227 SAVQELHRKGH-EVAVHSITHNDDENFWSNATVDDWAKEMAGMRIIAEKYANLSDNSVVGVRA-PYLRVGGNNQFTMMEE 304 (476)
Q Consensus 227 ~~v~~l~~~GH-EIg~Hs~~H~~~~~~~~~~s~e~~~~ei~~~~~~l~~~~~i~g~~~~gfR~-P~l~~~~~~~~~~l~~ 304 (476)
+.|++++++|. ||..=+|.-.+. ....++.+.++|......|.. .+.....+||.. ||+... . +-.+|+.
T Consensus 106 ~~vk~LV~~GrLE~vgGgwvm~DE----a~~~~esiI~Ql~~G~~~l~~--~fG~~P~~~W~iDpFGhs~-~-~P~Ll~~ 177 (1050)
T PLN02701 106 EAFTKLVKNGQLEIVGGGWVMNDE----ANSHYFAIIEQITEGNMWLND--TIGVAPKNSWAIDPFGYSS-T-MAYLLRR 177 (1050)
T ss_pred HHHHHHHHcCCEEEECCceecccc----cccCHHHHHHHHHhhhHHHHh--hcCCCCCcCccCCCCCCCH-H-HHHHHHh
Confidence 78999999995 788888875553 346789999999988888864 222223456665 888743 3 7788999
Q ss_pred cCCeE
Q psy16052 305 QAFLY 309 (476)
Q Consensus 305 ~G~~Y 309 (476)
.||.+
T Consensus 178 ~G~~~ 182 (1050)
T PLN02701 178 MGFEN 182 (1050)
T ss_pred CCCce
Confidence 99854
No 62
>cd02987 Phd_like_Phd Phosducin (Phd)-like family, Phd subfamily; Phd is a cytosolic regulator of G protein functions. It specifically binds G protein betagamma (Gbg)-subunits with high affinity, resulting in the solubilization of Gbg from the plasma membrane. This impedes the formation of a functional G protein trimer (G protein alphabetagamma), thereby inhibiting G protein-mediated signal transduction. Phd also inhibits the GTPase activity of G protein alpha. Phd can be phosphorylated by protein kinase A and G protein-coupled receptor kinase 2, leading to its inactivation. Phd was originally isolated from the retina, where it is highly expressed and has been implicated to play an important role in light adaptation. It is also found in the pineal gland, liver, spleen, striated muscle and the brain. The C-terminal domain of Phd adopts a thioredoxin fold, but it does not contain a CXXC motif. Phd interacts with G protein beta mostly through the N-terminal helical domain.
Probab=20.56 E-value=2.3e+02 Score=25.77 Aligned_cols=37 Identities=8% Similarity=-0.052 Sum_probs=27.5
Q ss_pred cceEEeecchhhcCChHHHHHHHHHHHHHHhhCCCEEEEeH
Q psy16052 404 APLGLYFHAAWLKNNPEFLDAFLYWVDEIIERHKDVYFVTM 444 (476)
Q Consensus 404 ~p~~i~~H~~w~~~~p~~~~al~~fl~~i~~~~~dV~fvT~ 444 (476)
.+++++|+++|.. | -.+|...|+.|+..+++|-|+..
T Consensus 84 ~~VVV~Fya~wc~--~--Ck~m~~~l~~LA~~~~~vkF~kV 120 (175)
T cd02987 84 TTVVVHIYEPGIP--G--CAALNSSLLCLAAEYPAVKFCKI 120 (175)
T ss_pred cEEEEEEECCCCc--h--HHHHHHHHHHHHHHCCCeEEEEE
Confidence 4789999999863 3 55677777777665888888765
No 63
>TIGR01358 DAHP_synth_II 3-deoxy-7-phosphoheptulonate synthase, class II. Homologs scoring between trusted and noise cutoff include proteins involved in antibiotic biosynthesis; one example is active as this enzyme, while another acts on an amino analog.
Probab=20.25 E-value=6e+02 Score=26.88 Aligned_cols=77 Identities=14% Similarity=0.175 Sum_probs=52.6
Q ss_pred HHHHHHHhccCCCCCCCCceEEEEec-cc--C--CChHHHHHHHHcCCeeee-----CCCCCCCCccccCCCCHHHHHHH
Q psy16052 195 ALYKEMFNGKRKNPNGCDIKATFFVS-HK--Y--TNYSAVQELHRKGHEVAV-----HSITHNDDENFWSNATVDDWAKE 264 (476)
Q Consensus 195 ~~~l~ll~~~~~np~~c~i~aTFFv~-~~--~--~~~~~v~~l~~~GHEIg~-----Hs~~H~~~~~~~~~~s~e~~~~e 264 (476)
+.++++++. .||..-+=+.|+-+= |. . .-+.+|++..+.||.|.- ||-+......+ .+-..+.+.+|
T Consensus 293 ~~l~~L~~~--LnP~~epGRlTLI~RmGa~kV~~~LP~li~aV~~~G~~VvW~cDPMHGNT~~t~~G~-KTR~f~~Il~E 369 (443)
T TIGR01358 293 DELLRLIER--LNPENEPGRLTLISRMGADKIADKLPPLLRAVKAAGRRVVWVCDPMHGNTEEAASGY-KTRRFDDIRSE 369 (443)
T ss_pred HHHHHHHHH--hCCCCCCceEEEEeccCchHHHHhHHHHHHHHHHcCCceEEeecCCCCCceeCCCCc-cCCcHHHHHHH
Confidence 456666653 567655556665552 21 1 123899999999999975 88777654433 45678899999
Q ss_pred HHHHHHHHHH
Q psy16052 265 MAGMRIIAEK 274 (476)
Q Consensus 265 i~~~~~~l~~ 274 (476)
+.+..++.++
T Consensus 370 v~~ff~vh~~ 379 (443)
T TIGR01358 370 VKGFFEVHRA 379 (443)
T ss_pred HHHHHHHHHH
Confidence 9988888774
Done!