Query psy2133
Match_columns 207
No_of_seqs 224 out of 1444
Neff 6.0
Searched_HMMs 46136
Date Fri Aug 16 16:53:11 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2133.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2133hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1881|consensus 100.0 4.3E-52 9.2E-57 394.8 12.0 200 1-202 161-369 (793)
2 PF00498 FHA: FHA domain; Int 99.9 4.3E-21 9.3E-26 133.9 9.8 67 19-93 1-68 (68)
3 KOG1882|consensus 99.7 1.2E-18 2.6E-23 149.6 4.2 101 2-102 179-285 (293)
4 cd00060 FHA Forkhead associate 99.7 8.2E-16 1.8E-20 113.0 11.8 88 7-101 12-100 (102)
5 TIGR03354 VI_FHA type VI secre 99.6 1E-15 2.2E-20 141.0 11.8 86 9-104 17-106 (396)
6 PLN02927 antheraxanthin epoxid 99.6 1.4E-15 3.1E-20 147.8 11.4 80 8-95 546-637 (668)
7 COG1716 FOG: FHA domain [Signa 99.6 1.8E-14 3.9E-19 118.1 10.6 78 17-104 89-167 (191)
8 smart00240 FHA Forkhead associ 99.5 3.9E-14 8.5E-19 93.4 5.8 51 19-76 1-52 (52)
9 KOG1880|consensus 99.4 5.5E-14 1.2E-18 124.0 2.1 97 2-104 23-120 (337)
10 COG3456 Predicted component of 99.3 4E-12 8.6E-17 116.8 8.3 77 18-103 27-106 (430)
11 KOG0615|consensus 98.3 1.4E-06 3E-11 80.8 6.7 94 9-104 57-158 (475)
12 TIGR02500 type_III_yscD type I 97.9 3.4E-05 7.3E-10 71.7 8.1 77 10-99 14-93 (410)
13 KOG2293|consensus 97.5 0.00046 1E-08 65.6 8.2 79 16-102 447-532 (547)
14 KOG0245|consensus 97.5 0.0017 3.6E-08 66.1 12.5 82 12-104 473-557 (1221)
15 KOG1892|consensus 96.1 0.01 2.2E-07 60.4 5.8 85 9-105 371-457 (1629)
16 TIGR01663 PNK-3'Pase polynucle 95.9 0.026 5.7E-07 54.4 7.3 81 9-99 25-105 (526)
17 KOG0241|consensus 94.3 0.15 3.2E-06 52.4 7.4 77 18-105 468-544 (1714)
18 KOG1881|consensus 90.8 0.077 1.7E-06 52.7 0.3 90 18-113 61-150 (793)
19 PF01479 S4: S4 domain; Inter 69.3 4.1 8.9E-05 25.8 2.1 23 66-90 25-48 (48)
20 COG2501 S4-like RNA binding pr 68.9 12 0.00025 27.1 4.5 39 56-99 30-68 (73)
21 TIGR02988 YaaA_near_RecF S4 do 67.8 5.8 0.00013 26.5 2.6 26 66-92 33-58 (59)
22 COG1188 Ribosome-associated he 65.0 10 0.00022 29.0 3.7 36 68-105 35-72 (100)
23 smart00363 S4 S4 RNA-binding d 61.0 14 0.00031 22.9 3.5 27 66-94 25-52 (60)
24 PRK11507 ribosome-associated p 59.3 18 0.00038 25.9 3.9 31 66-97 36-66 (70)
25 PF02626 AHS2: Allophanate hyd 59.0 13 0.00027 33.1 3.8 41 64-104 52-95 (271)
26 PF13275 S4_2: S4 domain; PDB: 58.5 9.4 0.0002 26.8 2.4 31 66-97 32-62 (65)
27 COG5025 Transcription factor o 58.5 5.8 0.00013 39.2 1.8 63 33-102 124-189 (610)
28 PRK15367 type III secretion sy 56.2 65 0.0014 30.3 8.1 81 10-109 18-98 (395)
29 cd00165 S4 S4/Hsp/ tRNA synthe 55.1 16 0.00034 23.5 3.0 28 65-94 24-52 (70)
30 PF14640 TMEM223: Transmembran 53.0 23 0.0005 29.4 4.2 45 20-74 109-159 (170)
31 PRK01777 hypothetical protein; 50.1 19 0.0004 27.0 2.9 30 62-94 46-75 (95)
32 smart00797 AHS2 Allophanate hy 49.0 67 0.0014 28.6 6.8 40 64-103 52-94 (280)
33 PF11012 DUF2850: Protein of u 45.8 51 0.0011 24.0 4.6 38 65-105 22-61 (79)
34 COG1984 DUR1 Allophanate hydro 44.5 70 0.0015 29.2 6.2 44 64-107 73-119 (314)
35 TIGR00724 urea_amlyse_rel biot 42.9 58 0.0013 29.6 5.5 41 63-103 73-117 (314)
36 KOG2963|consensus 41.3 40 0.00086 31.4 4.1 12 36-47 190-201 (405)
37 cd01764 Urm1 Urm1-like ubuitin 40.8 23 0.00049 26.3 2.2 24 69-92 62-87 (94)
38 PRK10348 ribosome-associated h 40.5 43 0.00094 26.7 3.8 37 67-105 34-72 (133)
39 cd00754 MoaD Ubiquitin domain 38.6 31 0.00068 23.7 2.5 26 65-92 48-73 (80)
40 TIGR01687 moaD_arch MoaD famil 38.1 35 0.00077 24.2 2.8 27 66-92 55-81 (88)
41 KOG3438|consensus 37.8 9.1 0.0002 29.4 -0.4 28 19-46 35-65 (105)
42 PF08804 gp32: gp32 DNA bindin 37.6 14 0.00031 27.9 0.6 16 55-70 43-58 (94)
43 cd04486 YhcR_OBF_like YhcR_OBF 36.2 88 0.0019 22.3 4.6 40 55-99 15-60 (78)
44 PF14451 Ub-Mut7C: Mut7-C ubiq 36.1 42 0.0009 24.4 2.9 25 68-94 51-75 (81)
45 COG5131 URM1 Ubiquitin-like pr 35.6 48 0.001 25.0 3.1 41 54-94 47-91 (96)
46 PF14478 DUF4430: Domain of un 33.9 35 0.00076 23.4 2.1 22 71-92 45-67 (68)
47 PF02824 TGS: TGS domain; Int 32.2 40 0.00087 22.7 2.1 24 67-92 35-58 (60)
48 PF00376 MerR: MerR family reg 31.1 26 0.00056 21.7 0.9 13 184-197 12-24 (38)
49 TIGR02712 urea_carbox urea car 30.8 1.8E+02 0.0039 31.2 7.6 41 64-104 523-567 (1201)
50 PF15149 CATSPERB: Cation chan 30.3 75 0.0016 30.9 4.3 18 55-72 129-146 (540)
51 PRK06437 hypothetical protein; 29.3 71 0.0015 22.0 3.1 25 66-92 36-60 (67)
52 cd01666 TGS_DRG_C TGS_DRG_C: 29.1 53 0.0012 23.4 2.4 20 71-92 54-73 (75)
53 cd04089 eRF3_II eRF3_II: domai 28.9 83 0.0018 22.1 3.4 26 67-96 13-38 (82)
54 COG1342 Predicted DNA-binding 28.7 96 0.0021 23.6 3.8 36 89-124 9-44 (99)
55 KOG4460|consensus 28.5 3.1E+02 0.0066 27.4 8.0 83 1-99 140-229 (741)
56 PF02597 ThiS: ThiS family; I 27.9 49 0.0011 22.5 2.0 28 65-92 42-70 (77)
57 COG3145 AlkB Alkylated DNA rep 27.7 1.3E+02 0.0027 25.6 4.8 41 58-99 134-176 (194)
58 cd00565 ThiS ThiaminS ubiquiti 27.1 68 0.0015 21.6 2.6 25 68-92 32-58 (65)
59 smart00524 DWB Domain B in dwa 26.7 2E+02 0.0043 23.9 5.7 21 55-76 70-90 (171)
60 PF09269 DUF1967: Domain of un 25.4 63 0.0014 22.5 2.2 16 84-99 54-69 (69)
61 PRK06488 sulfur carrier protei 25.4 55 0.0012 22.1 1.9 24 69-92 33-58 (65)
62 PHA02550 32 single-stranded DN 25.3 41 0.00089 30.5 1.5 18 54-71 70-87 (304)
63 cd00050 MH2 MH2 domain; C term 25.2 2.5E+02 0.0054 23.4 6.1 22 55-77 75-96 (176)
64 KOG0246|consensus 24.5 81 0.0017 31.4 3.4 36 33-68 415-452 (676)
65 TIGR01683 thiS thiamine biosyn 24.2 79 0.0017 21.2 2.5 24 69-92 32-57 (64)
66 PHA00728 hypothetical protein 23.9 90 0.002 24.9 3.0 26 163-188 38-63 (151)
67 TIGR03595 Obg_CgtA_exten Obg f 22.4 69 0.0015 22.4 1.9 15 84-98 54-68 (69)
68 KOG3159|consensus 22.1 1.3E+02 0.0029 27.5 4.1 46 16-71 48-97 (336)
69 COG0522 RpsD Ribosomal protein 22.1 1.1E+02 0.0023 26.3 3.3 27 69-96 121-147 (205)
70 TIGR01682 moaD molybdopterin c 21.5 1.1E+02 0.0023 21.4 2.8 24 67-92 50-73 (80)
71 PF03021 CM2: Influenza C viru 21.3 57 0.0012 25.1 1.4 19 155-173 115-133 (139)
72 PF12525 DUF3726: Protein of u 21.1 29 0.00063 25.3 -0.2 20 142-161 10-29 (80)
73 PRK08364 sulfur carrier protei 20.5 1.2E+02 0.0026 20.9 2.9 21 70-92 43-63 (70)
74 PF04710 Pellino: Pellino; In 20.5 1.2E+02 0.0026 28.7 3.6 31 18-48 97-144 (416)
75 cd00571 UreE UreE urease acces 20.2 2E+02 0.0043 22.6 4.4 40 64-106 38-77 (136)
76 PRK06944 sulfur carrier protei 20.2 1.1E+02 0.0023 20.4 2.5 24 69-92 33-58 (65)
No 1
>KOG1881|consensus
Probab=100.00 E-value=4.3e-52 Score=394.81 Aligned_cols=200 Identities=48% Similarity=0.802 Sum_probs=176.6
Q ss_pred CccCCeeeeEEEcCCCCeEEEccCCCCcEEeCCCCcCccceEEEEEeecCC---cCCCCeEEEEECCCccceeecCeEcC
Q psy2133 1 ELKSGQIVNTIDLSTRSFYCVGRERNTHLNLLHPTVSRYHAILQYKSTFDE---KDPARGFYVYDLGSTHGTFLNRCKIK 77 (207)
Q Consensus 1 ~lK~G~ii~~i~L~~~~~~~IGR~~~cDv~l~~psVSR~HAvI~~~~~~~~---~~~~~~~~l~Dl~StnGTfVNg~rI~ 77 (207)
+||.|.||++..|....+|+|||.+.||+.+.||+|||+||+|||.+.+.. +..+.+|||+|||||||||+|+.|++
T Consensus 161 vlKeg~iiet~~l~~~~~~~fgr~~~cD~~~eHpsISr~h~vlQy~~~~~~~p~~s~~~g~~i~dlgsThgt~~NK~rvp 240 (793)
T KOG1881|consen 161 VLKEGAIIETEDLKGAAACLFGRLGGCDVALEHPSISRFHAVLQYKASGPDDPCASNGEGWYIYDLGSTHGTFLNKDRVP 240 (793)
T ss_pred hhccCceeeeeecccceeEEecccCCCccccccCcccccceeeeccCCCCCccccCCCCceEEeeccccccceeccccCC
Confidence 589999999999999999999999999999999999999999999876543 34567899999999999999999999
Q ss_pred CCceEEecCCCEEEEcCeeeEEEEeCCCCCccccccCCHHHHHHHHHH--HHHHHHHHH--HHHHHHHhhcchhhhcCCc
Q psy2133 78 PKMYVRIHVGHMLSFGSSTRFFILQGPSEDEEEESELSVSELKEQRRQ--EKEKKEREA--LEKSLEQEAKTEEEDEGIS 153 (207)
Q Consensus 78 ~~~~~~L~~Gd~I~fG~s~~~fil~~p~~~~e~es~lsv~el~~~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~~g~~ 153 (207)
|++|+++++|++++||+++|+|++|||.+|.+++++++++++++.|.+ +++++.... ++.-++ ++++...+||+
T Consensus 241 pk~yir~~Vg~v~~fggsTrl~i~Qgp~eD~Epe~dls~te~~e~r~~~~E~~e~~~~~~~e~r~le--~~q~~~~~g~s 318 (793)
T KOG1881|consen 241 PKVYIRDRVGHVARFGGSTRLYIFQGPEEDEEPESDLSVTELKECREKKKELSDAEVEAAAEERILE--AAQADDEEGCS 318 (793)
T ss_pred CcchhhhhHHHHHHhcCceEEEEeeCCCcCCCchhhcCHHHHHHHHHHHHhhhHhhhhhhHHHhhHH--HHhhhhhcCCc
Confidence 999999999999999999999999999999999999999999999885 344333322 111122 56677899999
Q ss_pred cCCCCCcccccccC--CCCCCCCCcccccccCchhhhhhhhhccCCCccce
Q psy2133 154 WGMGDDAEEETDLS--ENPYASTNNEELYLDDPKKTLRGWFDREGKGFPLF 202 (207)
Q Consensus 154 wGm~eda~ee~~~~--~np~~~~~~e~~~~~dp~k~l~~~f~~eg~~~~~~ 202 (207)
||||+||+|++..+ .+++++.++|++|++||||+|++||+|+|++|+|.
T Consensus 319 wGmgeDa~ed~~~e~et~~~~~~e~E~~y~qdPkk~l~~ffereg~~l~~~ 369 (793)
T KOG1881|consen 319 WGMGEDADEDDADEVETDAEAMEEREATYIQDPKKALLGFFEREGEDLEYE 369 (793)
T ss_pred ccCCcccccccccccccccccccccccccccCHHHHHHHHHHhhhhhhhhh
Confidence 99999999877554 48999999999999999999999999999999995
No 2
>PF00498 FHA: FHA domain; InterPro: IPR000253 The forkhead-associated (FHA) domain [] is a phosphopeptide recognition domain found in many regulatory proteins. It displays specificity for phosphothreonine-containing epitopes but will also recognise phosphotyrosine with relatively high affinity. It spans approximately 80-100 amino acid residues folded into an 11-stranded beta sandwich, which sometimes contain small helical insertions between the loops connecting the strands []. To date, genes encoding FHA-containing proteins have been identified in eubacterial and eukaryotic but not archaeal genomes. The domain is present in a diverse range of proteins, such as kinases, phosphatases, kinesins, transcription factors, RNA-binding proteins and metabolic enzymes which partake in many different cellular processes - DNA repair, signal transduction, vesicular transport and protein degradation are just a few examples.; GO: 0005515 protein binding; PDB: 1LGQ_B 1LGP_A 2CSW_A 2PIE_A 3FM8_A 3MDB_B 3GQS_B 1UHT_A 1WLN_A 3POA_A ....
Probab=99.85 E-value=4.3e-21 Score=133.93 Aligned_cols=67 Identities=36% Similarity=0.679 Sum_probs=62.5
Q ss_pred EEEccCCCCcEEeCCCCcCccceEEEEEeecCCcCCCC-eEEEEECCCccceeecCeEcCCCceEEecCCCEEEEc
Q psy2133 19 YCVGRERNTHLNLLHPTVSRYHAILQYKSTFDEKDPAR-GFYVYDLGSTHGTFLNRCKIKPKMYVRIHVGHMLSFG 93 (207)
Q Consensus 19 ~~IGR~~~cDv~l~~psVSR~HAvI~~~~~~~~~~~~~-~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG 93 (207)
++|||.+.||+++++|+|||.||.|.+. .+ .|+|.|++|+|||||||+++.++.+++|++||+|+||
T Consensus 1 ~~iGR~~~~di~l~~~~iSr~Ha~i~~~--------~~~~~~i~d~~s~ngt~vng~~l~~~~~~~L~~gd~i~~G 68 (68)
T PF00498_consen 1 VTIGRSPDCDIVLPDPSISRRHARISFD--------DDGQFYIEDLGSTNGTFVNGQRLGPGEPVPLKDGDIIRFG 68 (68)
T ss_dssp EEEESSTTSSEEETSTTSSTTSEEEEEE--------TTEEEEEEESSSSS-EEETTEEESSTSEEEE-TTEEEEET
T ss_pred CEEcCCCCCCEEECCHheeeeeeEEEEe--------ceeeEEEEeCCCCCcEEECCEEcCCCCEEECCCCCEEEcC
Confidence 6899999999999999999999999999 45 9999999999999999999999999999999999998
No 3
>KOG1882|consensus
Probab=99.74 E-value=1.2e-18 Score=149.58 Aligned_cols=101 Identities=33% Similarity=0.539 Sum_probs=87.1
Q ss_pred ccCCeeeeEEEcCCCCeEEEccCC-CCcEEeCCCCcCccceEEEEEeecCCcC-----CCCeEEEEECCCccceeecCeE
Q psy2133 2 LKSGQIVNTIDLSTRSFYCVGRER-NTHLNLLHPTVSRYHAILQYKSTFDEKD-----PARGFYVYDLGSTHGTFLNRCK 75 (207)
Q Consensus 2 lK~G~ii~~i~L~~~~~~~IGR~~-~cDv~l~~psVSR~HAvI~~~~~~~~~~-----~~~~~~l~Dl~StnGTfVNg~r 75 (207)
+|++.+++++.+...+++++||.. .+||.++|||.|+.||+|||....-+.+ ..-..||.||+|+||||||..+
T Consensus 179 fk~~e~l~~l~iHrqs~yL~gRerkIaDi~idhpScSKQHaviQyR~v~~~r~dGt~grrvkpYiiDLgS~NgTfLNnk~ 258 (293)
T KOG1882|consen 179 FKCYEVLPVLYIHRQSCYLDGRERKIADIPIDHPSCSKQHAVIQYRLVEFTRADGTVGRRVKPYIIDLGSGNGTFLNNKV 258 (293)
T ss_pred ccCCcccchheeeeeeeeecCceeeeeccCCCCccccccceeeeeeecccccCCCccceeeeeEEEecCCCCcceecCcc
Confidence 789999999999999999999965 4899999999999999999985431111 1235899999999999999999
Q ss_pred cCCCceEEecCCCEEEEcCeeeEEEEe
Q psy2133 76 IKPKMYVRIHVGHMLSFGSSTRFFILQ 102 (207)
Q Consensus 76 I~~~~~~~L~~Gd~I~fG~s~~~fil~ 102 (207)
|+|..|+.|..+|+|+||-+++-|++.
T Consensus 259 IepqRYyEL~ekDvlkfgfs~rEyvll 285 (293)
T KOG1882|consen 259 IEPQRYYELREKDVLKFGFSSREYVLL 285 (293)
T ss_pred cCchheeeeecCceeeeccchHHHHHH
Confidence 999999999999999999888877663
No 4
>cd00060 FHA Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
Probab=99.67 E-value=8.2e-16 Score=112.99 Aligned_cols=88 Identities=32% Similarity=0.560 Sum_probs=76.9
Q ss_pred eeeEEEcCCCCeEEEccCCCC-cEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCceEEec
Q psy2133 7 IVNTIDLSTRSFYCVGRERNT-HLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMYVRIH 85 (207)
Q Consensus 7 ii~~i~L~~~~~~~IGR~~~c-Dv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~ 85 (207)
....+.|..+..++|||.+.| ++.+.+++|||.||+|.+. ....|++.++.|+|||+||+.++.++..+.|.
T Consensus 12 ~~~~~~l~~~~~~~iGr~~~~~~i~l~~~~iS~~H~~i~~~-------~~~~~~~~~~~s~~g~~vn~~~~~~~~~~~l~ 84 (102)
T cd00060 12 SGRRYYLDPGGTYTIGRDSDNCDIVLDDPSVSRRHAVIRYD-------GDGGVVLIDLGSTNGTFVNGQRVSPGEPVRLR 84 (102)
T ss_pred ceeEEEECCCCeEEECcCCCcCCEEcCCCCeeCcceEEEEc-------CCCCEEEEECCCCCCeEECCEECCCCCcEECC
Confidence 456777887577899999999 9999999999999999998 23789999999999999999999987789999
Q ss_pred CCCEEEEcCeeeEEEE
Q psy2133 86 VGHMLSFGSSTRFFIL 101 (207)
Q Consensus 86 ~Gd~I~fG~s~~~fil 101 (207)
+||.|.||.....|.+
T Consensus 85 ~gd~i~ig~~~~~~~~ 100 (102)
T cd00060 85 DGDVIRLGNTSISFRF 100 (102)
T ss_pred CCCEEEECCeEEEEEE
Confidence 9999999964444544
No 5
>TIGR03354 VI_FHA type VI secretion system FHA domain protein. Members of this protein family are FHA (forkhead-associated) domain-containing proteins that are part of type VI secretion loci in a considerable number of bacteria, most of which are known pathogens. Species include Pseudomonas aeruginosa PAO1, Aeromonas hydrophila, Yersinia pestis, Burkholderia mallei, etc.
Probab=99.65 E-value=1e-15 Score=141.03 Aligned_cols=86 Identities=29% Similarity=0.403 Sum_probs=78.1
Q ss_pred eEEEcCCCCeEEEccCCCCcEEeCCCC--cCccceEEEEEeecCCcCCCCeEEEEECCCccceeec--CeEcCCCceEEe
Q psy2133 9 NTIDLSTRSFYCVGRERNTHLNLLHPT--VSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLN--RCKIKPKMYVRI 84 (207)
Q Consensus 9 ~~i~L~~~~~~~IGR~~~cDv~l~~ps--VSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVN--g~rI~~~~~~~L 84 (207)
.++.|.. ..++|||.+.|++++.++. ||+.||+|.+. ++.|+|.|+ |+|||||| +.+|.++.+++|
T Consensus 17 ~~~~f~~-~~~~IGR~~~~d~~l~d~~~~VS~~Ha~I~~~--------~g~~~l~Dl-StNGT~VN~sg~~l~~~~~~~L 86 (396)
T TIGR03354 17 AQKTFGT-NGGTIGRSEDCDWVLPDPERHVSGRHARIRYR--------DGAYLLTDL-STNGVFLNGSGSPLGRGNPVRL 86 (396)
T ss_pred eEEEECC-CCEEEecCCCCCEEeCCCCCCcchhhcEEEEE--------CCEEEEEEC-CCCCeEECCCCCCCCCCCceEc
Confidence 3566766 4579999999999999999 99999999998 789999999 99999999 899999888999
Q ss_pred cCCCEEEEcCeeeEEEEeCC
Q psy2133 85 HVGHMLSFGSSTRFFILQGP 104 (207)
Q Consensus 85 ~~Gd~I~fG~s~~~fil~~p 104 (207)
+.||+|+||.....+.+..|
T Consensus 87 ~~GD~I~iG~~~lrv~~~~~ 106 (396)
T TIGR03354 87 EQGDRLRLGDYEIRVSLGDP 106 (396)
T ss_pred CCCCEEEECCEEEEEEeCCC
Confidence 99999999999998888655
No 6
>PLN02927 antheraxanthin epoxidase/zeaxanthin epoxidase
Probab=99.63 E-value=1.4e-15 Score=147.81 Aligned_cols=80 Identities=29% Similarity=0.501 Sum_probs=71.0
Q ss_pred eeEEEc--CCCCeEEEccCCCCcE-----EeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeE-----
Q psy2133 8 VNTIDL--STRSFYCVGRERNTHL-----NLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCK----- 75 (207)
Q Consensus 8 i~~i~L--~~~~~~~IGR~~~cDv-----~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~r----- 75 (207)
.+.|.| ..+.+++|||.++|++ +|.+|.||+.||.|.++ ++.|||+||+|+||||||+.+
T Consensus 546 ~~~~~l~~~~~~p~~iG~~~~~~~~~~~i~i~~~~vS~~Ha~i~~~--------~~~~~~~Dl~S~nGT~v~~~~~~r~~ 617 (668)
T PLN02927 546 SETLCLTKDEDQPCIVGSEPDQDFPGMRIVIPSSQVSKMHARVIYK--------DGAFFLMDLRSEHGTYVTDNEGRRYR 617 (668)
T ss_pred cceeeeecCCCCCeEecCCCCcCCCCceEEecCCccChhHeEEEEE--------CCEEEEEECCCCCccEEeCCCCceEe
Confidence 456777 2335789999999996 99999999999999999 899999999999999998765
Q ss_pred cCCCceEEecCCCEEEEcCe
Q psy2133 76 IKPKMYVRIHVGHMLSFGSS 95 (207)
Q Consensus 76 I~~~~~~~L~~Gd~I~fG~s 95 (207)
++|+.+++|++||+|+||+.
T Consensus 618 ~~p~~~~~l~~~d~I~~g~~ 637 (668)
T PLN02927 618 ATPNFPARFRSSDIIEFGSD 637 (668)
T ss_pred cCCCCceEeCCCCEEEeCCC
Confidence 67889999999999999994
No 7
>COG1716 FOG: FHA domain [Signal transduction mechanisms]
Probab=99.57 E-value=1.8e-14 Score=118.12 Aligned_cols=78 Identities=27% Similarity=0.429 Sum_probs=69.8
Q ss_pred CeEEEccCCCCcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCceEEecCCCEEEEcCee
Q psy2133 17 SFYCVGRERNTHLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMYVRIHVGHMLSFGSST 96 (207)
Q Consensus 17 ~~~~IGR~~~cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~ 96 (207)
..++|||.+++++++.++.|||+||.|.+. +..|++.|++|+||||||+.++.+ .+.+++||.|+||...
T Consensus 89 ~~~tigr~~~~~i~~~~~~vSR~Ha~l~~~--------~~~~~~~d~~S~nGt~vn~~~v~~--~~~l~~gd~i~i~~~~ 158 (191)
T COG1716 89 PVTTIGRDPDNDIVLDDDVVSRRHAELRRE--------GNEVFLEDLGSTNGTYVNGEKVRQ--RVLLQDGDVIRLGGTL 158 (191)
T ss_pred ceEEeccCCCCCEEcCCCccccceEEEEEe--------CCceEEEECCCCcceEECCeEccC--cEEcCCCCEEEECccc
Confidence 478999999999999999999999999999 788999999999999999999986 3689999999999988
Q ss_pred e-EEEEeCC
Q psy2133 97 R-FFILQGP 104 (207)
Q Consensus 97 ~-~fil~~p 104 (207)
. .+.....
T Consensus 159 ~~~~~~~~~ 167 (191)
T COG1716 159 AERLRIILT 167 (191)
T ss_pred eeeEeeccc
Confidence 7 4444433
No 8
>smart00240 FHA Forkhead associated domain. Found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain.
Probab=99.50 E-value=3.9e-14 Score=93.39 Aligned_cols=51 Identities=41% Similarity=0.731 Sum_probs=46.2
Q ss_pred EEEccCC-CCcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEc
Q psy2133 19 YCVGRER-NTHLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKI 76 (207)
Q Consensus 19 ~~IGR~~-~cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI 76 (207)
++|||.+ .|++++++|.|||.||.|.+.. ++.|+|.|++|+|||||||++|
T Consensus 1 ~~iGr~~~~~~i~~~~~~vs~~H~~i~~~~-------~~~~~i~d~~s~~gt~vng~~v 52 (52)
T smart00240 1 VTIGRSSEDCDIQLPGPSISRRHAEIVYDG-------GGRFYLIDLGSTNGTFVNGKRI 52 (52)
T ss_pred CEeCCCCCCCCEEeCCCCcchhHcEEEECC-------CCeEEEEECCCCCCeeECCEEC
Confidence 4799999 9999999999999999999982 3359999999999999999875
No 9
>KOG1880|consensus
Probab=99.41 E-value=5.5e-14 Score=124.04 Aligned_cols=97 Identities=31% Similarity=0.618 Sum_probs=82.1
Q ss_pred ccCCeeeeEEEcCCCCeEEEccCCC-CcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCc
Q psy2133 2 LKSGQIVNTIDLSTRSFYCVGRERN-THLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKM 80 (207)
Q Consensus 2 lK~G~ii~~i~L~~~~~~~IGR~~~-cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~ 80 (207)
.|+-..|..+.+..+..++|||.+. ||++++|.++||.||.+.+.. ....++|+||+|+||||+...||.+..
T Consensus 23 ~k~d~li~kl~iddkr~y~Fgrn~q~~df~idh~scSrvhaa~vyhk------hl~~~~lidl~s~hgtf~g~~rL~~~~ 96 (337)
T KOG1880|consen 23 VKGDKLIQKLIIDDKRRYLFGRNHQTCDFVIDHASCSRVHAALVYHK------HLSRIFLIDLGSTHGTFLGNERLEPHK 96 (337)
T ss_pred eecchhHHHHHhhhhhhhhhccCCCccceEeecchhhhhHhhhhhhh------ccceEEEEEccCCcceeeeeeeeccCC
Confidence 3433444444555557889999985 999999999999999998873 245699999999999999999999999
Q ss_pred eEEecCCCEEEEcCeeeEEEEeCC
Q psy2133 81 YVRIHVGHMLSFGSSTRFFILQGP 104 (207)
Q Consensus 81 ~~~L~~Gd~I~fG~s~~~fil~~p 104 (207)
++.+..|..+.||.++|.|.+..-
T Consensus 97 p~~l~i~~~~~fgasTr~y~lr~k 120 (337)
T KOG1880|consen 97 PVQLEIGSTFHFGASTRIYLLREK 120 (337)
T ss_pred CccccCCceEEEeccceeeeeecc
Confidence 999999999999999999998654
No 10
>COG3456 Predicted component of the type VI protein secretion system, contains a FHA domain [Intracellular trafficking, secretion, and vesicular transport; Signal transduction mechanisms]
Probab=99.32 E-value=4e-12 Score=116.78 Aligned_cols=77 Identities=22% Similarity=0.314 Sum_probs=70.3
Q ss_pred eEEEccCCCCcEEeCCC--CcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCce-EEecCCCEEEEcC
Q psy2133 18 FYCVGRERNTHLNLLHP--TVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMY-VRIHVGHMLSFGS 94 (207)
Q Consensus 18 ~~~IGR~~~cDv~l~~p--sVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~-~~L~~Gd~I~fG~ 94 (207)
-.+|||.++||.+|++| .||+.||.|.++ +++|||.|. |.+||+|||..++.+.. .+|+.|++|.||.
T Consensus 27 ~g~IGrs~dcdW~i~D~~~~VS~~Hc~I~~~--------dg~f~L~Dt-S~g~l~VNgs~~~~g~~~~RLqqGd~i~iG~ 97 (430)
T COG3456 27 GGVIGRSPDCDWQIDDPERFVSKQHCTISYR--------DGGFCLTDT-SNGGLLVNGSDLPLGEGSARLQQGDEILIGR 97 (430)
T ss_pred CcccccCCCCCccccCcccccchhheEEEec--------CCeEEEEec-CCCceeecccccCCCCCccccccCCEEeecc
Confidence 45899999999999976 699999999999 899999998 79999999999988877 8999999999999
Q ss_pred eeeEEEEeC
Q psy2133 95 STRFFILQG 103 (207)
Q Consensus 95 s~~~fil~~ 103 (207)
..+...+..
T Consensus 98 y~i~V~l~~ 106 (430)
T COG3456 98 YIIRVHLSR 106 (430)
T ss_pred EEEEEEecc
Confidence 887777764
No 11
>KOG0615|consensus
Probab=98.30 E-value=1.4e-06 Score=80.83 Aligned_cols=94 Identities=22% Similarity=0.331 Sum_probs=74.1
Q ss_pred eEEEcCCCCeEEEccCCCCcEEeCCCCcCccceEEEEEee-------cCCcCCCCeEEEEECCCccceeecCeEcCCCce
Q psy2133 9 NTIDLSTRSFYCVGRERNTHLNLLHPTVSRYHAILQYKST-------FDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMY 81 (207)
Q Consensus 9 ~~i~L~~~~~~~IGR~~~cDv~l~~psVSR~HAvI~~~~~-------~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~ 81 (207)
..+.+.. ..|++||.+.||+.+..+.+|..|-.|..... .........+|+.|. |+||||||...|..+..
T Consensus 57 ~~~d~~n-d~f~fGR~~~~d~~ln~~~~s~~~~~i~~~~~~~~~~f~~dr~~~sn~~y~~Dh-S~nGT~VN~e~i~k~~~ 134 (475)
T KOG0615|consen 57 KSIDLAN-DEFTFGRGDSCDAPLNLNNVSNKHFKILLYNKISKIHFRIDRDKNSNRVYLHDH-SRNGTFVNDEMIGKGLS 134 (475)
T ss_pred ccceecc-ceEEecCCCcccccccCccccccchheeeeeeeeeeeecccCCCccceEEEEec-ccCcccccHhHhhcccc
Confidence 3455655 67899999999999999999999988765411 001124678999998 99999999999999988
Q ss_pred EEecCCCEEEEcCee-eEEEEeCC
Q psy2133 82 VRIHVGHMLSFGSST-RFFILQGP 104 (207)
Q Consensus 82 ~~L~~Gd~I~fG~s~-~~fil~~p 104 (207)
..|+.||.|.+|.+. -.|++..+
T Consensus 135 r~lkN~dei~is~p~~~~~v~~~~ 158 (475)
T KOG0615|consen 135 RILKNGDEISISIPALKIFVFEDL 158 (475)
T ss_pred ccccCCCEEEeccchhheeeeecc
Confidence 999999999999875 34555554
No 12
>TIGR02500 type_III_yscD type III secretion apparatus protein, YscD/HrpQ family. This family represents a conserved protein of bacterial type III secretion systems. Gene symbols are variable from species to species. Members are designated YscD in Yersinia, HrpQ in Pseudomonas syringae, and EscD in enteropathogenic Escherichia coli. In the Chlamydiae, this model describes the C-terminal 400 residues of a longer protein.
Probab=97.93 E-value=3.4e-05 Score=71.66 Aligned_cols=77 Identities=10% Similarity=0.136 Sum_probs=61.5
Q ss_pred EEEcCCCCeEEEc-cCCCCcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecC--eEcCCCceEEecC
Q psy2133 10 TIDLSTRSFYCVG-RERNTHLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNR--CKIKPKMYVRIHV 86 (207)
Q Consensus 10 ~i~L~~~~~~~IG-R~~~cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg--~rI~~~~~~~L~~ 86 (207)
.+.|..+. ++|| +.++|||++.++.||+.|+.|.+. ..++.+.+ +..++++|| .....+ ..|..
T Consensus 14 ~~~L~~g~-~~iG~~~~~~di~L~d~~~~~~h~~l~v~--------~~~~~l~~--~~~~~~~~g~~~~~~~g--~~l~~ 80 (410)
T TIGR02500 14 ELPLPEGN-LVLGTDAADCDIVLSDGGIAAVHVSLHVR--------LEGVTLAG--AVEPAWEEGGVLPDEEG--TPLPS 80 (410)
T ss_pred EEECCCCc-eEeccCCCCcEEEeCCCCccchheEEEEc--------CceEEEec--CCcceeECCcccccCCC--CccCC
Confidence 56788754 8999 999999999999999999999998 56777775 466789999 444444 46888
Q ss_pred CCEEEEcCeeeEE
Q psy2133 87 GHMLSFGSSTRFF 99 (207)
Q Consensus 87 Gd~I~fG~s~~~f 99 (207)
+..|..|+..+.|
T Consensus 81 ~~~l~~g~~~~~~ 93 (410)
T TIGR02500 81 GTPLLVAGVAFAL 93 (410)
T ss_pred CCceecceeEEec
Confidence 8888888876544
No 13
>KOG2293|consensus
Probab=97.47 E-value=0.00046 Score=65.61 Aligned_cols=79 Identities=15% Similarity=0.305 Sum_probs=66.1
Q ss_pred CCeEEEccCCC-CcEEeC----C--CCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCceEEecCCC
Q psy2133 16 RSFYCVGRERN-THLNLL----H--PTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMYVRIHVGH 88 (207)
Q Consensus 16 ~~~~~IGR~~~-cDv~l~----~--psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~Gd 88 (207)
++.+++||+.. |.|-|+ . ..|||+.|.|... +++.|+|.++| ..-.+|||..|.+|+.++|....
T Consensus 447 k~EVtlGRat~d~~VDIDLgkegpatKISRRQa~IkL~-------n~GsF~IkNlG-K~~I~vng~~l~~gq~~~L~~nc 518 (547)
T KOG2293|consen 447 KKEVTLGRATGDLKVDIDLGKEGPATKISRRQALIKLK-------NDGSFFIKNLG-KRSILVNGGELDRGQKVILKNNC 518 (547)
T ss_pred CcceEeeccCCCcceeeeccccCccceeeccceeEEec-------cCCcEEeccCc-ceeEEeCCccccCCceEEeccCc
Confidence 46789999974 433333 3 4699999999998 47899999995 67899999999999999999999
Q ss_pred EEEEcCeeeEEEEe
Q psy2133 89 MLSFGSSTRFFILQ 102 (207)
Q Consensus 89 ~I~fG~s~~~fil~ 102 (207)
+|.|-+-+++|.+.
T Consensus 519 lveIrg~~FiF~~N 532 (547)
T KOG2293|consen 519 LVEIRGLRFIFEIN 532 (547)
T ss_pred EEEEccceEEEeec
Confidence 99999988877653
No 14
>KOG0245|consensus
Probab=97.46 E-value=0.0017 Score=66.09 Aligned_cols=82 Identities=15% Similarity=0.232 Sum_probs=62.0
Q ss_pred EcCCCCeEEEccCC---CCcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCceEEecCCC
Q psy2133 12 DLSTRSFYCVGRER---NTHLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMYVRIHVGH 88 (207)
Q Consensus 12 ~L~~~~~~~IGR~~---~cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~Gd 88 (207)
.|+. +...|||.+ ..||+|..+.|--.||.|.-. +++..+..-..--.-|||||+.|.. +..|+.||
T Consensus 473 ~ike-G~TrVG~~~a~~~~DI~LsG~~I~~qHC~i~~~-------~g~~~vtl~p~e~aetyVNGk~v~e--p~qL~~Gd 542 (1221)
T KOG0245|consen 473 YIKE-GETRVGREDASSRQDIVLSGQLIREQHCSIRNE-------GGNDVVTLEPCEDAETYVNGKLVTE--PTQLRSGD 542 (1221)
T ss_pred Eecc-CceecCCCCcccCCceEecchhhhhhceEEEec-------CCCceEEeccCCccceeEccEEcCC--cceeccCC
Confidence 3455 456899986 389999999999999999987 2333333333345579999999974 56899999
Q ss_pred EEEEcCeeeEEEEeCC
Q psy2133 89 MLSFGSSTRFFILQGP 104 (207)
Q Consensus 89 ~I~fG~s~~~fil~~p 104 (207)
.|-||+.. .|.+.-|
T Consensus 543 RiilG~~H-~frfn~P 557 (1221)
T KOG0245|consen 543 RIILGGNH-VFRFNHP 557 (1221)
T ss_pred EEEEcCce-eEEecCH
Confidence 99999964 3666655
No 15
>KOG1892|consensus
Probab=96.09 E-value=0.01 Score=60.39 Aligned_cols=85 Identities=22% Similarity=0.293 Sum_probs=66.4
Q ss_pred eEEEcCCCCeEEEccCCC--CcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCceEEecC
Q psy2133 9 NTIDLSTRSFYCVGRERN--THLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMYVRIHV 86 (207)
Q Consensus 9 ~~i~L~~~~~~~IGR~~~--cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~ 86 (207)
.+|.|.. +..-+|.... +.|.|..|+|-.+||-|.+- ++-+.|.-..-..-|||||.+|..- ..|+.
T Consensus 371 ~ri~L~~-~vtEVGs~~~~~~~iqLfGP~IqprHc~it~m--------eGVvTvTP~~~DA~t~VnGh~isqt--tiL~~ 439 (1629)
T KOG1892|consen 371 KRIRLQL-SVTEVGSEKLDDNSIQLFGPGIQPRHCDITNM--------EGVVTVTPRSMDAETYVNGHRISQT--TILQS 439 (1629)
T ss_pred eeEEecc-CceeccccccCCcceeeeCCCCCccccchhhc--------cceEEecccccchhhhccceecchh--hhhcc
Confidence 4566665 4567888764 57889999999999999987 6778888876667799999999854 58999
Q ss_pred CCEEEEcCeeeEEEEeCCC
Q psy2133 87 GHMLSFGSSTRFFILQGPS 105 (207)
Q Consensus 87 Gd~I~fG~s~~~fil~~p~ 105 (207)
|+.|+||.+.. |.+..++
T Consensus 440 G~~v~fGa~hs-fkF~dss 457 (1629)
T KOG1892|consen 440 GMKVQFGASHS-FKFVDSS 457 (1629)
T ss_pred CCEEEecccee-EEecCCc
Confidence 99999998742 4444443
No 16
>TIGR01663 PNK-3'Pase polynucleotide 5'-kinase 3'-phosphatase. Note that the EC number for the kinase function is: 2.7.1.78
Probab=95.87 E-value=0.026 Score=54.38 Aligned_cols=81 Identities=16% Similarity=0.129 Sum_probs=64.3
Q ss_pred eEEEcCCCCeEEEccCCCCcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCceEEecCCC
Q psy2133 9 NTIDLSTRSFYCVGRERNTHLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMYVRIHVGH 88 (207)
Q Consensus 9 ~~i~L~~~~~~~IGR~~~cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~Gd 88 (207)
..|.|.. ..++|||.|...|. +...||....|+.+. ..+.+.+.-| ..|.+-|||..+.++..+.|++||
T Consensus 25 ~~~~~~~-~~~~~gr~pet~i~--d~~cs~~qv~l~a~~------~~~~v~~k~l-g~np~~~~~~~~~~~~~~~l~~g~ 94 (526)
T TIGR01663 25 HFIHLDA-GALFLGRGPETGIR--DRKCSKRQIELQADL------EKATVALKQL-GVNPCGTGGLELKPGGEGELGHGD 94 (526)
T ss_pred CeeccCC-CceEEccCcccccc--hhhhchhhheeeecc------cCceEEEEEc-cCCCcccCceEecCCCeeeecCCC
Confidence 4567776 45789999998766 789999999999873 2455667778 569999999999999999999999
Q ss_pred EEEEcCeeeEE
Q psy2133 89 MLSFGSSTRFF 99 (207)
Q Consensus 89 ~I~fG~s~~~f 99 (207)
.+.+=....-|
T Consensus 95 ~l~~v~~~~~~ 105 (526)
T TIGR01663 95 LLEIVNGLHPL 105 (526)
T ss_pred EEEEeccccce
Confidence 98874443333
No 17
>KOG0241|consensus
Probab=94.26 E-value=0.15 Score=52.38 Aligned_cols=77 Identities=12% Similarity=0.206 Sum_probs=64.9
Q ss_pred eEEEccCCCCcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCceEEecCCCEEEEcCeee
Q psy2133 18 FYCVGRERNTHLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMYVRIHVGHMLSFGSSTR 97 (207)
Q Consensus 18 ~~~IGR~~~cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~~ 97 (207)
..+||-..+.+|++..-.|-+.||+|.+. .++..++..+.++ .+||||..+... +.|++||.|-.|...+
T Consensus 468 ~tlig~~~~~~i~l~glgi~p~h~vidI~-------~dg~l~~~p~~~~-R~~VNGs~v~~~--t~L~~GdRiLwGnnHF 537 (1714)
T KOG0241|consen 468 HTLIGLFKSQDIQLSGLGIQPKHCVIDIE-------SDGELRLTPLLNA-RSCVNGSLVCST--TQLWHGDRILWGNNHF 537 (1714)
T ss_pred ceeeccccCcceeeecCcccCccceeeec-------cCCcEEecccccc-eeeecCceeccc--cccccCceEEecccce
Confidence 34789888899999999999999999998 3566888888555 999999988754 5799999999999876
Q ss_pred EEEEeCCC
Q psy2133 98 FFILQGPS 105 (207)
Q Consensus 98 ~fil~~p~ 105 (207)
|.+..|.
T Consensus 538 -FrvN~PK 544 (1714)
T KOG0241|consen 538 -FRVNLPK 544 (1714)
T ss_pred -EEecCcc
Confidence 8888774
No 18
>KOG1881|consensus
Probab=90.85 E-value=0.077 Score=52.67 Aligned_cols=90 Identities=12% Similarity=-0.103 Sum_probs=65.3
Q ss_pred eEEEccCCCCcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCceEEecCCCEEEEcCeee
Q psy2133 18 FYCVGRERNTHLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMYVRIHVGHMLSFGSSTR 97 (207)
Q Consensus 18 ~~~IGR~~~cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~~ 97 (207)
..++||-++|.+.+.||+.|+.|+-+.-.... .....-+.-+.++|.+-..+.++..-.-...+..++++.+++
T Consensus 61 ilvvG~~~~~~~l~~hpS~S~~~~~~~S~p~~------~~~~~p~s~e~~~s~v~d~p~P~~Gc~D~~el~e~~~~~t~k 134 (793)
T KOG1881|consen 61 ILVVGDFKDCAELLEHPSESDAAADEFSIPAP------HPKPAPDAPEKHRSEVPDEPPPPEGCGDDEELAEAQIPKTPK 134 (793)
T ss_pred ccCCCCCCcccccCCCcccccccccCcCCCcc------cCCCCCCchhhcCCCCccCCCCCCCccccccccccccCCCcc
Confidence 34789999999999999999999999876432 112223445678888877777776556677888999999999
Q ss_pred EEEEeCCCCCcccccc
Q psy2133 98 FFILQGPSEDEEEESE 113 (207)
Q Consensus 98 ~fil~~p~~~~e~es~ 113 (207)
+++-..+.....++|.
T Consensus 135 is~pa~~~py~vppw~ 150 (793)
T KOG1881|consen 135 ISGPAAAAPYEVPPWS 150 (793)
T ss_pred ccccCCCCcccCCCCC
Confidence 9955544444555443
No 19
>PF01479 S4: S4 domain; InterPro: IPR002942 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yeast protein containing a pseudouridine synthetase and a deaminase domain, bacterial tyrosyl-tRNA synthetases, and a number of uncharacterised, small proteins that may be involved in translation regulation []. The S4 domain probably mediates binding to RNA.; GO: 0003723 RNA binding; PDB: 3BBU_A 1DM9_B 2K6P_A 3U5G_E 3U5C_E 3IZB_D 2XZM_D 2XZN_D 3O30_E 3O2Z_E ....
Probab=69.25 E-value=4.1 Score=25.84 Aligned_cols=23 Identities=17% Similarity=0.265 Sum_probs=18.6
Q ss_pred ccceeecCeEcC-CCceEEecCCCEE
Q psy2133 66 THGTFLNRCKIK-PKMYVRIHVGHML 90 (207)
Q Consensus 66 tnGTfVNg~rI~-~~~~~~L~~Gd~I 90 (207)
.++.+|||+.+. |+ +.+++||+|
T Consensus 25 ~g~V~VNg~~v~~~~--~~v~~~d~I 48 (48)
T PF01479_consen 25 QGRVKVNGKVVKDPS--YIVKPGDVI 48 (48)
T ss_dssp TTTEEETTEEESSTT--SBESTTEEE
T ss_pred CCEEEECCEEEcCCC--CCCCCcCCC
Confidence 467999999998 55 478899886
No 20
>COG2501 S4-like RNA binding protein [Replication, recombination, and repair]
Probab=68.89 E-value=12 Score=27.05 Aligned_cols=39 Identities=5% Similarity=-0.005 Sum_probs=28.2
Q ss_pred CeEEEEECCCccceeecCeEcCCCceEEecCCCEEEEcCeeeEE
Q psy2133 56 RGFYVYDLGSTHGTFLNRCKIKPKMYVRIHVGHMLSFGSSTRFF 99 (207)
Q Consensus 56 ~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~~~f 99 (207)
..|+|. .|.++|||..-..+ -..|++||+|.|.+-....
T Consensus 30 AK~~i~----eg~V~vNGe~EtRR-gkKlr~gd~V~i~~~~~~v 68 (73)
T COG2501 30 AKAFIA----EGEVKVNGEVETRR-GKKLRDGDVVEIPGQRYQV 68 (73)
T ss_pred HHHHHH----CCeEEECCeeeecc-CCEeecCCEEEECCEEEEE
Confidence 345554 46899999865543 3579999999999966443
No 21
>TIGR02988 YaaA_near_RecF S4 domain protein YaaA. This small protein has a single S4 domain (pfam01479), as do bacterial ribosomal protein S4, some pseudouridine synthases, tyrosyl-tRNA synthetases. The S4 domain may bind RNA. Members of this protein family are found almost exclusively in the Firmicutes, and almost invariably just a few nucleotides upstream of the gene for the DNA replication and repair protein RecF. The few members of this family that are not near recF are found instead near dnaA and/or dnaN, the usual neighbors of recF, near the origin of replication. The conserved location suggests a possible role in replication in the Firmicutes lineage.
Probab=67.83 E-value=5.8 Score=26.53 Aligned_cols=26 Identities=8% Similarity=0.132 Sum_probs=19.7
Q ss_pred ccceeecCeEcCCCceEEecCCCEEEE
Q psy2133 66 THGTFLNRCKIKPKMYVRIHVGHMLSF 92 (207)
Q Consensus 66 tnGTfVNg~rI~~~~~~~L~~Gd~I~f 92 (207)
....+|||..+... .++|..||.|.|
T Consensus 33 ~G~V~VNg~~~~~~-~~~l~~Gd~v~i 58 (59)
T TIGR02988 33 ENEVLVNGELENRR-GKKLYPGDVIEI 58 (59)
T ss_pred cCCEEECCEEccCC-CCCCCCCCEEEe
Confidence 35788999988432 358999999976
No 22
>COG1188 Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]
Probab=65.00 E-value=10 Score=29.00 Aligned_cols=36 Identities=19% Similarity=0.295 Sum_probs=25.8
Q ss_pred ceeecCeEcCCCceEEecCCCEEEE--cCeeeEEEEeCCC
Q psy2133 68 GTFLNRCKIKPKMYVRIHVGHMLSF--GSSTRFFILQGPS 105 (207)
Q Consensus 68 GTfVNg~rI~~~~~~~L~~Gd~I~f--G~s~~~fil~~p~ 105 (207)
-+.|||++.+|.. .+++||+|.| |.-.+.+.+....
T Consensus 35 rV~vNG~~aKpS~--~VK~GD~l~i~~~~~~~~v~Vl~~~ 72 (100)
T COG1188 35 RVKVNGQRAKPSK--EVKVGDILTIRFGNKEFTVKVLALG 72 (100)
T ss_pred eEEECCEEccccc--ccCCCCEEEEEeCCcEEEEEEEecc
Confidence 4679999998864 7999999765 5555566555443
No 23
>smart00363 S4 S4 RNA-binding domain.
Probab=61.04 E-value=14 Score=22.94 Aligned_cols=27 Identities=22% Similarity=0.308 Sum_probs=20.8
Q ss_pred ccceeecCeEc-CCCceEEecCCCEEEEcC
Q psy2133 66 THGTFLNRCKI-KPKMYVRIHVGHMLSFGS 94 (207)
Q Consensus 66 tnGTfVNg~rI-~~~~~~~L~~Gd~I~fG~ 94 (207)
..+.+|||.++ .+. ..|..||.|.+-.
T Consensus 25 ~g~i~vng~~~~~~~--~~l~~gd~i~~~~ 52 (60)
T smart00363 25 QGRVKVNGKKVTKPS--YIVKPGDVISVRG 52 (60)
T ss_pred cCCEEECCEEecCCC--eEeCCCCEEEEcc
Confidence 45788999988 554 4789999988754
No 24
>PRK11507 ribosome-associated protein; Provisional
Probab=59.28 E-value=18 Score=25.91 Aligned_cols=31 Identities=16% Similarity=0.186 Sum_probs=23.4
Q ss_pred ccceeecCeEcCCCceEEecCCCEEEEcCeee
Q psy2133 66 THGTFLNRCKIKPKMYVRIHVGHMLSFGSSTR 97 (207)
Q Consensus 66 tnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~~ 97 (207)
.+.+.|||..-..+ ...|++||+|.|.+...
T Consensus 36 eg~V~VNGeve~rR-gkKl~~GD~V~~~g~~~ 66 (70)
T PRK11507 36 EGQVKVDGAVETRK-RCKIVAGQTVSFAGHSV 66 (70)
T ss_pred cCceEECCEEeccc-CCCCCCCCEEEECCEEE
Confidence 36789999754432 35799999999999653
No 25
>PF02626 AHS2: Allophanate hydrolase subunit 2; InterPro: IPR003778 Allophanate hydrolase catalyses the second reaction in an ATP-dependent, two-step degradation of urea to ammonia and C02. This follows the action of the biotin-containing urea carboxylase. Saccharomyces cerevisiae can use urea as a sole nitrogen source via this degradation pathway []. In yeast, the fusion of allophanate hydrolase to urea carboxylase is called urea amidolyase. In bacteria, the second step in the urea degradation pathway is also the ATP-dependent allophanate hydrolase. The gene encoding this enzyme is found adjacent to the urea carboxylase gene []. Allophanate hydrolase has strict substrate specificity, as analogues of allophanate are not hydrolysed by it []. This domain represents subunit 2 of allophanate hydrolase (AHS2) which is found in urea carboxylase.; PDB: 3MML_G 3VA7_A 3OEP_A 3OPF_C 3ORE_B.
Probab=59.02 E-value=13 Score=33.06 Aligned_cols=41 Identities=20% Similarity=0.302 Sum_probs=31.0
Q ss_pred CCccceeecCeEcCCCceEEecCCCEEEEcCe---eeEEEEeCC
Q psy2133 64 GSTHGTFLNRCKIKPKMYVRIHVGHMLSFGSS---TRFFILQGP 104 (207)
Q Consensus 64 ~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s---~~~fil~~p 104 (207)
|..-...|||+.++..+.+.++.||+|+||.. .|.|.-...
T Consensus 52 Ga~~~~~lng~~~~~~~~~~v~~Gd~L~~~~~~~G~r~YLAv~G 95 (271)
T PF02626_consen 52 GADFEATLNGKPVPMWQPFLVKAGDVLKFGPPRSGARAYLAVAG 95 (271)
T ss_dssp ESCEEEEETTEEE-TTSEEEE-TT-EEEEEEESSECEEEEEETT
T ss_pred CCCCceEECCEEccCCEEEEECCCCEEEecCCCCccEEEEEecc
Confidence 45667889999999999999999999999865 477776544
No 26
>PF13275 S4_2: S4 domain; PDB: 1P9K_A.
Probab=58.54 E-value=9.4 Score=26.82 Aligned_cols=31 Identities=6% Similarity=0.029 Sum_probs=18.3
Q ss_pred ccceeecCeEcCCCceEEecCCCEEEEcCeee
Q psy2133 66 THGTFLNRCKIKPKMYVRIHVGHMLSFGSSTR 97 (207)
Q Consensus 66 tnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~~ 97 (207)
.+..+|||.....+ ...|++||+|.|++...
T Consensus 32 ~g~V~VNGe~e~rr-g~Kl~~GD~V~~~~~~~ 62 (65)
T PF13275_consen 32 EGEVKVNGEVETRR-GKKLRPGDVVEIDGEEY 62 (65)
T ss_dssp HHHHEETTB----S-S----SSEEEEETTEEE
T ss_pred cCceEECCEEcccc-CCcCCCCCEEEECCEEE
Confidence 36789999865443 35799999999988664
No 27
>COG5025 Transcription factor of the Forkhead/HNF3 family [Transcription]
Probab=58.51 E-value=5.8 Score=39.17 Aligned_cols=63 Identities=14% Similarity=0.105 Sum_probs=50.0
Q ss_pred CCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCc---eEEecCCCEEEEcCeeeEEEEe
Q psy2133 33 HPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKM---YVRIHVGHMLSFGSSTRFFILQ 102 (207)
Q Consensus 33 ~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~---~~~L~~Gd~I~fG~s~~~fil~ 102 (207)
.+.+||.|++|++.- .-+.|++.+. .++|..|+|+++.-+. ...|..|-....|+....|++.
T Consensus 124 ~k~~~~~~~sIr~Nl------s~~~a~~~i~-g~~g~~~~g~~~~igP~~~~~~l~~g~~~~~~~~~~~~~~p 189 (610)
T COG5025 124 AKVVSRWQNSIRHNL------SLNDAFIKIE-GRNGAKVKGHFWSIGPGHETQFLKSGLRLDGGGKQMMFTLP 189 (610)
T ss_pred ccccchhhhhhhccc------ccCceEEEEe-ccCCccccceeeccCCCccceeeccccccccccccccccCc
Confidence 467899999999984 2578899988 4689999998764443 6789999999988888777764
No 28
>PRK15367 type III secretion system protein SsaD; Provisional
Probab=56.22 E-value=65 Score=30.34 Aligned_cols=81 Identities=14% Similarity=0.181 Sum_probs=55.1
Q ss_pred EEEcCCCCeEEEccCCCCcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCeEcCCCceEEecCCCE
Q psy2133 10 TIDLSTRSFYCVGRERNTHLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRCKIKPKMYVRIHVGHM 89 (207)
Q Consensus 10 ~i~L~~~~~~~IGR~~~cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~rI~~~~~~~L~~Gd~ 89 (207)
.+.|+.+ .+++|= .+|||.++=|. ..-..|..+ ..+.++.- +.--++|||.+...+. +|..+.+
T Consensus 18 El~Lp~G-~~tlG~-~gcDi~lpL~~--~~~~~L~i~--------e~gi~l~~--~~~~vwVnG~~~~~~~--~LPl~q~ 81 (395)
T PRK15367 18 EVWLNEG-NLSLGE-KGCDICIPLTI--NEKIILREQ--------ADSLFVDA--GKARVRVNGRRFNPNK--PLPSSGV 81 (395)
T ss_pred EEecCCC-ceeecC-CCceEEEECCC--CCEEEEEEc--------CCcEEEec--CCceEEECCEEcCCCC--CCCCcch
Confidence 4678874 569998 55999987654 333445555 56666642 3457999999987775 4888999
Q ss_pred EEEcCeeeEEEEeCCCCCcc
Q psy2133 90 LSFGSSTRFFILQGPSEDEE 109 (207)
Q Consensus 90 I~fG~s~~~fil~~p~~~~e 109 (207)
|.+.+..+. + |..++.-
T Consensus 82 Ie~aG~~~v--l-G~~d~~L 98 (395)
T PRK15367 82 LQVAGVAIA--F-GKQDCEL 98 (395)
T ss_pred hhhcceEEE--e-cCccccc
Confidence 999997644 3 5444443
No 29
>cd00165 S4 S4/Hsp/ tRNA synthetase RNA-binding domain; The domain surface is populated by conserved, charged residues that define a likely RNA-binding site; Found in stress proteins, ribosomal proteins and tRNA synthetases; This may imply a hitherto unrecognized functional similarity between these three protein classes.
Probab=55.11 E-value=16 Score=23.51 Aligned_cols=28 Identities=14% Similarity=0.344 Sum_probs=21.3
Q ss_pred CccceeecCeEc-CCCceEEecCCCEEEEcC
Q psy2133 65 STHGTFLNRCKI-KPKMYVRIHVGHMLSFGS 94 (207)
Q Consensus 65 StnGTfVNg~rI-~~~~~~~L~~Gd~I~fG~ 94 (207)
..++.+|||+++ .++ ..++.||.|.+..
T Consensus 24 ~~g~V~vn~~~~~~~~--~~v~~~d~i~i~~ 52 (70)
T cd00165 24 KHGHVLVNGKVVTKPS--YKVKPGDVIEVDG 52 (70)
T ss_pred HcCCEEECCEEccCCc--cCcCCCCEEEEcC
Confidence 356789999988 544 4788999888865
No 30
>PF14640 TMEM223: Transmembrane protein 223
Probab=53.04 E-value=23 Score=29.36 Aligned_cols=45 Identities=29% Similarity=0.414 Sum_probs=28.3
Q ss_pred EEccCCCCcEEeC------CCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCe
Q psy2133 20 CVGRERNTHLNLL------HPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRC 74 (207)
Q Consensus 20 ~IGR~~~cDv~l~------~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~ 74 (207)
.+|+....++++. |.+-++-+--|.+. +..-|||.| .+|+|.|.+
T Consensus 109 ~fG~~~~~~vPl~~vs~~~~r~~~~~~i~lkvk-------g~~fyylld---~~G~f~n~~ 159 (170)
T PF14640_consen 109 PFGRRRDFTVPLNQVSCLTHRSDAPSQIPLKVK-------GRRFYYLLD---KRGEFVNPR 159 (170)
T ss_pred ccCCCCceEEcHHHcccchhccCCCceeEEEEC-------CceEEEEEe---cCCcCCCHH
Confidence 3777666555554 33333334455555 356788998 699999864
No 31
>PRK01777 hypothetical protein; Validated
Probab=50.09 E-value=19 Score=27.04 Aligned_cols=30 Identities=13% Similarity=0.036 Sum_probs=23.9
Q ss_pred ECCCccceeecCeEcCCCceEEecCCCEEEEcC
Q psy2133 62 DLGSTHGTFLNRCKIKPKMYVRIHVGHMLSFGS 94 (207)
Q Consensus 62 Dl~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~ 94 (207)
|+ |++-..|||+.+...+ .|++||.|.|=.
T Consensus 46 ~~-~~~~vgI~Gk~v~~d~--~L~dGDRVeIyr 75 (95)
T PRK01777 46 DL-AKNKVGIYSRPAKLTD--VLRDGDRVEIYR 75 (95)
T ss_pred cc-ccceEEEeCeECCCCC--cCCCCCEEEEec
Confidence 55 5678889999998764 799999987643
No 32
>smart00797 AHS2 Allophanate hydrolase subunit 2. This domain represents subunit 2 of allophanate hydrolase (AHS2).
Probab=49.01 E-value=67 Score=28.64 Aligned_cols=40 Identities=20% Similarity=0.375 Sum_probs=31.9
Q ss_pred CCccceeecCeEcCCCceEEecCCCEEEEcCe---eeEEEEeC
Q psy2133 64 GSTHGTFLNRCKIKPKMYVRIHVGHMLSFGSS---TRFFILQG 103 (207)
Q Consensus 64 ~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s---~~~fil~~ 103 (207)
|..-...||++.++....+.++.||+|+||.. .|.|.-..
T Consensus 52 GA~~~~~ln~~~~~~~~~~~v~~Gd~L~~g~~~~G~R~YLAv~ 94 (280)
T smart00797 52 GADFPATLDGQPVPPWKPFLVRAGQVLSLGAPKAGARAYLAVA 94 (280)
T ss_pred CCCCeeeECCEEcCCCeEEEECCCCEEEeCCCCCccEEEEEEe
Confidence 45566779999999988899999999999964 36676543
No 33
>PF11012 DUF2850: Protein of unknown function (DUF2850); InterPro: IPR021271 This family of proteins with unknown function appear to be restricted to Vibrionaceae.
Probab=45.76 E-value=51 Score=24.05 Aligned_cols=38 Identities=24% Similarity=0.353 Sum_probs=28.2
Q ss_pred CccceeecCeEcCCCceEEecCCCEE--EEcCeeeEEEEeCCC
Q psy2133 65 STHGTFLNRCKIKPKMYVRIHVGHML--SFGSSTRFFILQGPS 105 (207)
Q Consensus 65 StnGTfVNg~rI~~~~~~~L~~Gd~I--~fG~s~~~fil~~p~ 105 (207)
+..|+++||+.|.-. ... +|..| +.|+..+.|.+.+..
T Consensus 22 ~~~GV~~ngrlV~T~--F~f-DG~~l~~~~G~~~~~y~~~~~~ 61 (79)
T PF11012_consen 22 NESGVFRNGRLVATS--FEF-DGKTLEYRTGSGTYRYQISGEN 61 (79)
T ss_pred CCCcEEECCCEEeeE--EEE-CCCEEEEEECCeEEEEEEcCCC
Confidence 678999999998743 233 66664 568888999887654
No 34
>COG1984 DUR1 Allophanate hydrolase subunit 2 [Amino acid transport and metabolism]
Probab=44.50 E-value=70 Score=29.19 Aligned_cols=44 Identities=18% Similarity=0.297 Sum_probs=35.0
Q ss_pred CCccceeecCeEcCCCceEEecCCCEEEEcCe---eeEEEEeCCCCC
Q psy2133 64 GSTHGTFLNRCKIKPKMYVRIHVGHMLSFGSS---TRFFILQGPSED 107 (207)
Q Consensus 64 ~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s---~~~fil~~p~~~ 107 (207)
|..-...||++.|++...+.++.|++|+||.. .|.|......-+
T Consensus 73 Gad~~a~ld~~~i~~~~~~~vk~Gq~L~~g~~~~G~R~YLav~GGi~ 119 (314)
T COG1984 73 GADCEATLDGQEVPPWSPYLVKAGQTLKLGRPKQGMRAYLAVAGGID 119 (314)
T ss_pred CCcccceECCEEcCCCceEEccCCCEEEecCCCCcceEEEEecCccc
Confidence 45667789999999999999999999999975 467766544333
No 35
>TIGR00724 urea_amlyse_rel biotin-dependent carboxylase uncharacterized domain. Urea amidolyase of Saccharomyces cerevisiae is a 1835 amino acid protein with an amidase domain, a biotin/lipoyl cofactor attachment domain, a carbamoyl-phosphate synthase L chain-like domain, and uncharacterized regions. It has both urea carboxylase and allophanate hydrolase activities. This alignment models a domain that represents uncharacterized prokaryotic proteins of about 300 amino acids, regions of prokaryotic urea carboxylase and of the urea carboxylase region of yeast urea amidolyase, and regions of other biotin-containing proteins.
Probab=42.90 E-value=58 Score=29.57 Aligned_cols=41 Identities=22% Similarity=0.313 Sum_probs=32.0
Q ss_pred CCCccceeecCeEcCC-CceEEecCCCEEEEcCe---eeEEEEeC
Q psy2133 63 LGSTHGTFLNRCKIKP-KMYVRIHVGHMLSFGSS---TRFFILQG 103 (207)
Q Consensus 63 l~StnGTfVNg~rI~~-~~~~~L~~Gd~I~fG~s---~~~fil~~ 103 (207)
.|......|||+++.+ ...+.++.||+|.||.. .|.|.-..
T Consensus 73 tGA~~~~~lng~~~~~~~~~~~v~~Gd~L~~g~~~~G~R~YLAv~ 117 (314)
T TIGR00724 73 TGADTDLCLNDGQVIPQWRPYEVKRGQILSLGRLKSGMRGYLAVR 117 (314)
T ss_pred eCCCCcceeCCcccCCCceEEEECCCCEEEeCCCCCccEEEEEEe
Confidence 3556788999999976 77899999999999964 36666543
No 36
>KOG2963|consensus
Probab=41.25 E-value=40 Score=31.42 Aligned_cols=12 Identities=8% Similarity=0.113 Sum_probs=6.0
Q ss_pred cCccceEEEEEe
Q psy2133 36 VSRYHAILQYKS 47 (207)
Q Consensus 36 VSR~HAvI~~~~ 47 (207)
|+=+|-.|++.+
T Consensus 190 IdmRHysI~vvp 201 (405)
T KOG2963|consen 190 IDMRHYSIRVVP 201 (405)
T ss_pred eeeeeeEEEEEe
Confidence 444555555543
No 37
>cd01764 Urm1 Urm1-like ubuitin domain. Urm1 (Ubiquitin-Related Modifier1) The Urm1 fold, like those of two closely related proteins MoaD (molybdopterin synthase) and ThiS (sulfur carrier protein), is similar to that of ubiquitin although there is little or no sequence similarity. The C-terminal glycines of Urm1 are conjugated to an E1-like protein Uba4 as part of a novel conjugation system in yeast. The Urm1 fold is found only in eukaryotes.
Probab=40.76 E-value=23 Score=26.31 Aligned_cols=24 Identities=13% Similarity=0.180 Sum_probs=18.9
Q ss_pred eeecCeEcC--CCceEEecCCCEEEE
Q psy2133 69 TFLNRCKIK--PKMYVRIHVGHMLSF 92 (207)
Q Consensus 69 TfVNg~rI~--~~~~~~L~~Gd~I~f 92 (207)
.+||++.+. .+..+.|++||.|.|
T Consensus 62 VlvN~~di~~l~g~~t~L~dgD~v~i 87 (94)
T cd01764 62 VLINDTDWELLGEEDYILEDGDHVVF 87 (94)
T ss_pred EEECCccccccCCcccCCCCcCEEEE
Confidence 677998764 455678999999987
No 38
>PRK10348 ribosome-associated heat shock protein Hsp15; Provisional
Probab=40.54 E-value=43 Score=26.75 Aligned_cols=37 Identities=19% Similarity=0.315 Sum_probs=26.1
Q ss_pred cceeecCeEcCCCceEEecCCCEEEE--cCeeeEEEEeCCC
Q psy2133 67 HGTFLNRCKIKPKMYVRIHVGHMLSF--GSSTRFFILQGPS 105 (207)
Q Consensus 67 nGTfVNg~rI~~~~~~~L~~Gd~I~f--G~s~~~fil~~p~ 105 (207)
.-++|||++.+|+. .+.+||+|.| |...+.+.+.+.+
T Consensus 34 G~V~vnG~~~Kps~--~V~~gd~l~v~~~~~~~~v~Vl~l~ 72 (133)
T PRK10348 34 GKVHYNGQRSKPSK--IVELNATLTLRQGNDERTVIVKAIT 72 (133)
T ss_pred CCEEECCEECCCCC--ccCCCCEEEEEECCEEEEEEEeECc
Confidence 45788999977764 6789999765 5656666665443
No 39
>cd00754 MoaD Ubiquitin domain of MoaD-like proteins. MoaD family. Members of this family are involved in biosynthesis of the molybdenum cofactor (Moco), an essential cofactor of a diverse group of redox enzymes. Moco biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea and eukaryotes. Moco contains a tricyclic pyranopterin, termed molybdopterin (MPT), that contains the cis-dithiolene group responsible for molybdenum ligation. This dithiolene group is generated by MPT synthase, the second major step in Moco biosynthesis. MPT synthase consists of a large (MoeE) and small (MoaD) subunit. The small subunit is inserted into the lare subunit to form the active site. The small subunit, which is structurally similar to ubiquitin, contains a C-terminal thiocarboxylated glycine residue that serves as a sulfur donor for the synthesis of the MPT dithiolene group.
Probab=38.60 E-value=31 Score=23.73 Aligned_cols=26 Identities=8% Similarity=0.092 Sum_probs=20.0
Q ss_pred CccceeecCeEcCCCceEEecCCCEEEE
Q psy2133 65 STHGTFLNRCKIKPKMYVRIHVGHMLSF 92 (207)
Q Consensus 65 StnGTfVNg~rI~~~~~~~L~~Gd~I~f 92 (207)
...-.+|||+.++.. ..|+.||.|.|
T Consensus 48 ~~~~v~vNg~~v~~~--~~l~~gD~v~i 73 (80)
T cd00754 48 ARVRIAVNGEYVRLD--TPLKDGDEVAI 73 (80)
T ss_pred hcEEEEECCeEcCCC--cccCCCCEEEE
Confidence 345678899888744 57999999876
No 40
>TIGR01687 moaD_arch MoaD family protein, archaeal. Members of this family appear to be archaeal versions of MoaD, subunit 1 of molybdopterin converting factor. This model has been split from the bacterial/eukaryotic equivalog model TIGR01682 because the presence of two members of this family in a substantial number of archaeal species suggests that roles might not be interchangeable.
Probab=38.05 E-value=35 Score=24.23 Aligned_cols=27 Identities=7% Similarity=0.105 Sum_probs=20.0
Q ss_pred ccceeecCeEcCCCceEEecCCCEEEE
Q psy2133 66 THGTFLNRCKIKPKMYVRIHVGHMLSF 92 (207)
Q Consensus 66 tnGTfVNg~rI~~~~~~~L~~Gd~I~f 92 (207)
..-+.||++.+.......|++||.|.|
T Consensus 55 ~~~v~vN~~~v~~~~~~~l~dgdev~i 81 (88)
T TIGR01687 55 NVIILVNGRNVDWGLGTELKDGDVVAI 81 (88)
T ss_pred cEEEEECCEecCccCCCCCCCCCEEEE
Confidence 345678888887654357999999876
No 41
>KOG3438|consensus
Probab=37.78 E-value=9.1 Score=29.38 Aligned_cols=28 Identities=14% Similarity=0.360 Sum_probs=24.1
Q ss_pred EEEccCCC---CcEEeCCCCcCccceEEEEE
Q psy2133 19 YCVGRERN---THLNLLHPTVSRYHAILQYK 46 (207)
Q Consensus 19 ~~IGR~~~---cDv~l~~psVSR~HAvI~~~ 46 (207)
|+|+|.|. |-..++|||-+..|.+||..
T Consensus 35 ~vI~k~peVefcGYtIPHPse~k~niRIQt~ 65 (105)
T KOG3438|consen 35 YVIMKNPEVEFCGYTIPHPSEDKINIRIQTR 65 (105)
T ss_pred HHHhcCCceEEEeccCCCCchhhheEEEEec
Confidence 56888886 77779999999999999987
No 42
>PF08804 gp32: gp32 DNA binding protein like; InterPro: IPR012339 This entry is represented by the Bacteriophage T4, Gp32, single-stranded DNA-binding protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Gp32 is essential for T4 DNA replication, recombination and repair, acting to stimulate replisome processing and accuracy through its binding to ssDNA as the replication fork advances. The crystal structure of Gp32 shows an ssDNA binding cleft comprised of regions from three structural subdomains, through which ssDNA can slide freely []. The structure of Gp32 is similar to other phage ssDNA-binding proteins such as Gp2.5 from bacteriophage T4, and gene V protein, both of which have a nucleic acid-binding OB-type fold. However, Gp32 contains a zinc-finger subdomain at residues 63-111 that is not found in the other two phage proteins.; GO: 0003697 single-stranded DNA binding; PDB: 1GPC_A 2A1K_B 2ATQ_B.
Probab=37.57 E-value=14 Score=27.89 Aligned_cols=16 Identities=38% Similarity=0.804 Sum_probs=13.7
Q ss_pred CCeEEEEECCCcccee
Q psy2133 55 ARGFYVYDLGSTHGTF 70 (207)
Q Consensus 55 ~~~~~l~Dl~StnGTf 70 (207)
.++|||.+..||||.|
T Consensus 43 ~g~WyiEN~~sT~G~~ 58 (94)
T PF08804_consen 43 NGGWYIENCPSTHGDY 58 (94)
T ss_dssp TTEEEEEEEGGGGT-S
T ss_pred CCcEEEecCccccCCC
Confidence 6889999999999985
No 43
>cd04486 YhcR_OBF_like YhcR_OBF_like: A subfamily of OB-fold domains similar to the OB folds of Bacillus subtilis YhcR. YhcR is a sugar-nonspecific nuclease, which is active in the presence of Ca2+ and Mn2+. It cleaves RNA endonucleolytically, producing 3'-monophosphate nucleosides. YhcR appears to be the major Ca2+ activated nuclease of B. subtilis. YhcR may be localized in the cell wall.
Probab=36.24 E-value=88 Score=22.25 Aligned_cols=40 Identities=20% Similarity=0.435 Sum_probs=27.1
Q ss_pred CCeEEEEECC------CccceeecCeEcCCCceEEecCCCEEEEcCeeeEE
Q psy2133 55 ARGFYVYDLG------STHGTFLNRCKIKPKMYVRIHVGHMLSFGSSTRFF 99 (207)
Q Consensus 55 ~~~~~l~Dl~------StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~~~f 99 (207)
..+|||.|.. .+.|.||=... ...+.+||.|++-+..-.|
T Consensus 15 ~~GffiQd~~~d~~~~ts~gifV~~~~-----~~~~~~Gd~V~vtG~v~ey 60 (78)
T cd04486 15 LGGFYIQDEDGDGDPATSEGIFVYTGS-----GADVAVGDLVRVTGTVTEY 60 (78)
T ss_pred cCEEEEEcCCCCCCCcccceEEEecCC-----CCCCCCCCEEEEEEEEEee
Confidence 3789999963 25688883322 2358999999997655444
No 44
>PF14451 Ub-Mut7C: Mut7-C ubiquitin
Probab=36.08 E-value=42 Score=24.39 Aligned_cols=25 Identities=12% Similarity=0.107 Sum_probs=20.3
Q ss_pred ceeecCeEcCCCceEEecCCCEEEEcC
Q psy2133 68 GTFLNRCKIKPKMYVRIHVGHMLSFGS 94 (207)
Q Consensus 68 GTfVNg~rI~~~~~~~L~~Gd~I~fG~ 94 (207)
-.+|||+.+... +++++||.|.+-.
T Consensus 51 ~i~vNG~~v~~~--~~~~~Gd~v~V~P 75 (81)
T PF14451_consen 51 LILVNGRPVDFD--YRLKDGDRVAVYP 75 (81)
T ss_pred EEEECCEECCCc--ccCCCCCEEEEEe
Confidence 467899998875 6899999998754
No 45
>COG5131 URM1 Ubiquitin-like protein [Posttranslational modification, protein turnover, chaperones]
Probab=35.61 E-value=48 Score=25.00 Aligned_cols=41 Identities=15% Similarity=0.216 Sum_probs=29.6
Q ss_pred CCCeEEEEECCCccce--eecCe--EcCCCceEEecCCCEEEEcC
Q psy2133 54 PARGFYVYDLGSTHGT--FLNRC--KIKPKMYVRIHVGHMLSFGS 94 (207)
Q Consensus 54 ~~~~~~l~Dl~StnGT--fVNg~--rI~~~~~~~L~~Gd~I~fG~ 94 (207)
.+..+++.+-.=.+|+ .||++ .+-.+..+.|.+||.|.|=+
T Consensus 47 ~~~sifie~g~lrpGiI~LINd~DWeLleke~y~ledgDiIvfis 91 (96)
T COG5131 47 TRDSIFIEHGELRPGIICLINDMDWELLEKERYPLEDGDIIVFIS 91 (96)
T ss_pred ccceeeecCCCCcccEEEEEcCccHhhhhcccccCCCCCEEEEEe
Confidence 3567788776667775 56887 34555568999999998844
No 46
>PF14478 DUF4430: Domain of unknown function (DUF4430); PDB: 3U7Z_B 2BB5_A.
Probab=33.88 E-value=35 Score=23.41 Aligned_cols=22 Identities=9% Similarity=0.138 Sum_probs=10.7
Q ss_pred ecCeEcCCC-ceEEecCCCEEEE
Q psy2133 71 LNRCKIKPK-MYVRIHVGHMLSF 92 (207)
Q Consensus 71 VNg~rI~~~-~~~~L~~Gd~I~f 92 (207)
|||.....+ ..+.|++||.|.|
T Consensus 45 vNG~~~~~ga~~~~l~~GD~i~~ 67 (68)
T PF14478_consen 45 VNGESANVGAGSYKLKDGDKITW 67 (68)
T ss_dssp ETTEE-SS-CCC-B--TTEEEEE
T ss_pred ECCEEhhcCcceeEeCCCCEEEe
Confidence 455443332 2467999999986
No 47
>PF02824 TGS: TGS domain; InterPro: IPR004095 The TGS domain is present in a number of enzymes, for example, in threonyl-tRNA synthetase (ThrRS), GTPase, and guanosine-3',5'-bis(diphosphate) 3'-pyrophosphohydrolase (SpoT) []. The TGS domain is also present at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochaete Borrelia burgdorferi). TGS is a small domain that consists of ~50 amino acid residues and is predicted to possess a predominantly beta-sheet structure. There is no direct information on the functions of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role []. ; PDB: 1WXQ_A 1NYR_B 1NYQ_B 2KMM_A 1WWT_A 1TKE_A 1TKG_A 1TJE_A 1QF6_A 1TKY_A ....
Probab=32.20 E-value=40 Score=22.68 Aligned_cols=24 Identities=8% Similarity=0.095 Sum_probs=18.2
Q ss_pred cceeecCeEcCCCceEEecCCCEEEE
Q psy2133 67 HGTFLNRCKIKPKMYVRIHVGHMLSF 92 (207)
Q Consensus 67 nGTfVNg~rI~~~~~~~L~~Gd~I~f 92 (207)
-+-.|||+++.-. ++|+.||+|+|
T Consensus 35 ~~A~Vng~~vdl~--~~L~~~d~v~i 58 (60)
T PF02824_consen 35 VAAKVNGQLVDLD--HPLEDGDVVEI 58 (60)
T ss_dssp EEEEETTEEEETT--SBB-SSEEEEE
T ss_pred eEEEEcCEECCCC--CCcCCCCEEEE
Confidence 4567899888765 47999999886
No 48
>PF00376 MerR: MerR family regulatory protein; InterPro: IPR000551 The many bacterial transcription regulation proteins which bind DNA through a 'helix-turn-helix' motif can be classified into subfamilies on the basis of sequence similarities. One of these is the MerR subfamily. MerR, which is found in many bacterial species mediates the mercuric-dependent induction of the mercury resistance operon. In the absence of mercury merR represses transcription by binding tightly, as a dimer, to the 'mer' operator region; when mercury is present the dimeric complex binds a single ion and becomes a potent transcriptional activator, while remaining bound to the mer site. Members of the family include the mercuric resistance operon regulatory protein merR; Bacillus subtilis bltR and bmrR; Bacillus glnR; Streptomyces coelicolor hspR; Bradyrhizobium japonicum nolA; Escherichia coli superoxide response regulator soxR; and Streptomyces lividans transcriptional activator tipA [, , , , , ]. Other members include hypothetical proteins from E. coli, B. subtilis and Haemophilus influenzae. Within this family, the HTH motif is situated towards the N terminus.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 3HH0_A 2DG6_A 1R8D_B 1JBG_A 2VZ4_A 2ZHH_A 2ZHG_A 1Q07_A 1Q06_A 1Q05_B ....
Probab=31.13 E-value=26 Score=21.68 Aligned_cols=13 Identities=46% Similarity=0.984 Sum_probs=10.1
Q ss_pred chhhhhhhhhccCC
Q psy2133 184 PKKTLRGWFDREGK 197 (207)
Q Consensus 184 p~k~l~~~f~~eg~ 197 (207)
+.+||| |||++|+
T Consensus 12 s~~tlR-~ye~~Gl 24 (38)
T PF00376_consen 12 SPRTLR-YYEREGL 24 (38)
T ss_dssp -HHHHH-HHHHTTS
T ss_pred CHHHHH-HHHHCCC
Confidence 568898 8899985
No 49
>TIGR02712 urea_carbox urea carboxylase. Members of this family are ATP-dependent urea carboxylase, including characterized members from Oleomonas sagaranensis (alpha class Proteobacterium) and yeasts such as Saccharomyces cerevisiae. The allophanate hydrolase domain of the yeast enzyme is not included in this model and is represented by an adjacent gene in Oleomonas sagaranensis. The fusion of urea carboxylase and allophanate hydrolase is designated urea amidolyase. The enzyme from Oleomonas sagaranensis was shown to be highly active on acetamide and formamide as well as urea.
Probab=30.78 E-value=1.8e+02 Score=31.21 Aligned_cols=41 Identities=17% Similarity=0.247 Sum_probs=33.9
Q ss_pred CCccceeecCeEcCCCceEEecCCCEEEEcCee----eEEEEeCC
Q psy2133 64 GSTHGTFLNRCKIKPKMYVRIHVGHMLSFGSST----RFFILQGP 104 (207)
Q Consensus 64 ~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~----~~fil~~p 104 (207)
|......|||++|+..+.+.++.|++++||... |.|.....
T Consensus 523 Ga~~~~~l~g~~v~~~~~~~v~~G~~L~~g~~~~~G~r~Ylav~G 567 (1201)
T TIGR02712 523 GAPAPATLDGQPVPQWKPITVKAGSTLSIGKIAGSGCRTYLAIRG 567 (1201)
T ss_pred CCCCcceECCEEcCCCeEEEECCCCEEEecCCCcCceEEEEEecc
Confidence 556678899999999999999999999999554 77776543
No 50
>PF15149 CATSPERB: Cation channel sperm-associated protein subunit beta protein family
Probab=30.31 E-value=75 Score=30.93 Aligned_cols=18 Identities=17% Similarity=0.641 Sum_probs=16.0
Q ss_pred CCeEEEEECCCccceeec
Q psy2133 55 ARGFYVYDLGSTHGTFLN 72 (207)
Q Consensus 55 ~~~~~l~Dl~StnGTfVN 72 (207)
.+.|+|||.|++||+.++
T Consensus 129 ~~~WfLYnFG~~~gr~W~ 146 (540)
T PF15149_consen 129 KDSWFLYNFGQKNGRTWK 146 (540)
T ss_pred cceEEEEeccccCCceEE
Confidence 578999999999999874
No 51
>PRK06437 hypothetical protein; Provisional
Probab=29.33 E-value=71 Score=22.01 Aligned_cols=25 Identities=4% Similarity=-0.023 Sum_probs=19.0
Q ss_pred ccceeecCeEcCCCceEEecCCCEEEE
Q psy2133 66 THGTFLNRCKIKPKMYVRIHVGHMLSF 92 (207)
Q Consensus 66 tnGTfVNg~rI~~~~~~~L~~Gd~I~f 92 (207)
...+.+||..+++. +.|+.||.|.+
T Consensus 36 ~vaV~vNg~iv~~~--~~L~dgD~Vei 60 (67)
T PRK06437 36 EYVVIVNGSPVLED--HNVKKEDDVLI 60 (67)
T ss_pred cEEEEECCEECCCc--eEcCCCCEEEE
Confidence 44566888888754 58999999876
No 52
>cd01666 TGS_DRG_C TGS_DRG_C: DRG (developmentally regulated GTP-binding protein) represents a family of GTP-binding proteins that includes two members, DRG1 and DRG2. DRG1 and DRG2 have a C-terminal TGS domain (named after the ThrRS, GTPase, and SpoT proteins where it occurs) with a predominantly beta-sheet structure. The function of TGS is unknown but its presence in two types of regulatory proteins (the DRG GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role.
Probab=29.06 E-value=53 Score=23.45 Aligned_cols=20 Identities=0% Similarity=-0.012 Sum_probs=16.7
Q ss_pred ecCeEcCCCceEEecCCCEEEE
Q psy2133 71 LNRCKIKPKMYVRIHVGHMLSF 92 (207)
Q Consensus 71 VNg~rI~~~~~~~L~~Gd~I~f 92 (207)
++|+++... ..|++||+|++
T Consensus 54 ~~gq~Vgl~--~~L~d~DvVeI 73 (75)
T cd01666 54 HSPQRVGLD--HVLEDEDVVQI 73 (75)
T ss_pred CCCeECCCC--CEecCCCEEEE
Confidence 588888876 48999999986
No 53
>cd04089 eRF3_II eRF3_II: domain II of the eukaryotic class II release factor (eRF3). In eukaryotes, translation termination is mediated by two interacting release factors, eRF1 and eRF3, which act as class I and II factors, respectively. eRF1 functions as an omnipotent release factor, decoding all three stop codons and triggering the release of the nascent peptide catalyzed by the ribsome. eRF3 is a GTPase, which enhances the termination efficiency by stimulating the eRF1 activity in a GTP-dependent manner. Sequence comparison of class II release factors with elongation factors shows that eRF3 is more similar to eEF1alpha whereas prokaryote RF3 is more similar to EF-G, implying that their precise function may differ. Only eukaryote RF3s are found in this group. Saccharomyces cerevisiae eRF3 (Sup35p) is a translation termination factor which is divided into three regions N, M and a C-terminal eEF1a-like region essential for translation termination. Sup35NM is a non-pathogenic prion-li
Probab=28.91 E-value=83 Score=22.12 Aligned_cols=26 Identities=23% Similarity=0.267 Sum_probs=20.2
Q ss_pred cceeecCeEcCCCceEEecCCCEEEEcCee
Q psy2133 67 HGTFLNRCKIKPKMYVRIHVGHMLSFGSST 96 (207)
Q Consensus 67 nGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~ 96 (207)
.||.+.| +|..+ .|+.||.|.+..+.
T Consensus 13 ~g~vv~G-~v~~G---~i~~G~~v~i~P~~ 38 (82)
T cd04089 13 MGTVVLG-KVESG---TIKKGDKLLVMPNK 38 (82)
T ss_pred CCEEEEE-EEeee---EEecCCEEEEeCCC
Confidence 4888888 56665 69999999988754
No 54
>COG1342 Predicted DNA-binding proteins [General function prediction only]
Probab=28.69 E-value=96 Score=23.64 Aligned_cols=36 Identities=31% Similarity=0.390 Sum_probs=28.4
Q ss_pred EEEEcCeeeEEEEeCCCCCccccccCCHHHHHHHHH
Q psy2133 89 MLSFGSSTRFFILQGPSEDEEEESELSVSELKEQRR 124 (207)
Q Consensus 89 ~I~fG~s~~~fil~~p~~~~e~es~lsv~el~~~~~ 124 (207)
.|.+=...+.|...+++..+.+.-.+|++|+..+|.
T Consensus 9 ~I~~~P~~~~F~P~g~~~~~~~~V~lt~eElEAlRL 44 (99)
T COG1342 9 RISFEPKVRCFKPEGVPLEELEPVILTIEELEALRL 44 (99)
T ss_pred eeccCCceeEeccCCCCcccCcceeecHHHHHHHHH
Confidence 344555667888888888777788999999999876
No 55
>KOG4460|consensus
Probab=28.46 E-value=3.1e+02 Score=27.42 Aligned_cols=83 Identities=19% Similarity=0.261 Sum_probs=46.3
Q ss_pred CccCC---eeeeEEEcCCCCeEEEccCCCCcEEeC--CC-CcCccceEEEEEeecCCcCCCCeEEEEECCCccceeecCe
Q psy2133 1 ELKSG---QIVNTIDLSTRSFYCVGRERNTHLNLL--HP-TVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFLNRC 74 (207)
Q Consensus 1 ~lK~G---~ii~~i~L~~~~~~~IGR~~~cDv~l~--~p-sVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfVNg~ 74 (207)
++++| .+..++.+.. . ++-.++...+.-. || |||..|..|-.. +.-+.|||+.+..-.|+-+
T Consensus 140 ~~eDgk~~v~CRt~~i~~-~--~ftss~~ltl~Qa~WHP~S~~D~hL~iL~s--------dnviRiy~lS~~telylqp- 207 (741)
T KOG4460|consen 140 EFEDGKSTVNCRTTPVAE-R--FFTSSTSLTLKQAAWHPSSILDPHLVLLTS--------DNVIRIYSLSEPTELYLQP- 207 (741)
T ss_pred eecCCCceEEEEeecccc-e--eeccCCceeeeeccccCCccCCceEEEEec--------CcEEEEEecCCcchhhccC-
Confidence 35666 3334444544 1 2333343333322 66 588999999887 7899999997766665532
Q ss_pred EcCCCceEEe-cCCCEEEEcCeeeEE
Q psy2133 75 KIKPKMYVRI-HVGHMLSFGSSTRFF 99 (207)
Q Consensus 75 rI~~~~~~~L-~~Gd~I~fG~s~~~f 99 (207)
+.+.|= ....+..||.....|
T Consensus 208 ----gepgRS~tn~Si~sFGe~~~~~ 229 (741)
T KOG4460|consen 208 ----GEPGRSPTNVSILSFGEEESLV 229 (741)
T ss_pred ----CCcCCCCccceeeccCCcceee
Confidence 211111 122345788765533
No 56
>PF02597 ThiS: ThiS family; InterPro: IPR003749 ThiS (thiaminS) is a 66 aa protein involved in sulphur transfer. ThiS is coded in the thiCEFSGH operon in Escherichia coli. This family of proteins have two conserved Glycines at the COOH terminus. Thiocarboxylate is formed at the last G in the activation process. Sulphur is transferred from ThiI to ThiS in a reaction catalysed by IscS []. MoaD, a protein involved in sulphur transfer during molybdopterin synthesis, is about the same length and shows limited sequence similarity to ThiS. Both have the conserved GG at the COOH end.; PDB: 1JW9_D 1JWB_D 1JWA_D 3BII_D 1NVI_D 1FMA_D 1FM0_D 2QIE_G 2Q5W_D 2K5P_A ....
Probab=27.90 E-value=49 Score=22.47 Aligned_cols=28 Identities=7% Similarity=0.064 Sum_probs=20.0
Q ss_pred CccceeecCeEcCC-CceEEecCCCEEEE
Q psy2133 65 STHGTFLNRCKIKP-KMYVRIHVGHMLSF 92 (207)
Q Consensus 65 StnGTfVNg~rI~~-~~~~~L~~Gd~I~f 92 (207)
...-..||++.++. .....|+.||.|.|
T Consensus 42 ~~~~v~vN~~~v~~~~~~~~l~~gD~V~i 70 (77)
T PF02597_consen 42 DRVAVAVNGEIVPDDGLDTPLKDGDEVAI 70 (77)
T ss_dssp TTEEEEETTEEEGGGTTTSBEETTEEEEE
T ss_pred ccEEEEECCEEcCCccCCcCcCCCCEEEE
Confidence 44567788888776 12357999999876
No 57
>COG3145 AlkB Alkylated DNA repair protein [DNA replication, recombination, and repair]
Probab=27.67 E-value=1.3e+02 Score=25.63 Aligned_cols=41 Identities=22% Similarity=0.396 Sum_probs=29.6
Q ss_pred EEEEECCCccceeecCe--EcCCCceEEecCCCEEEEcCeeeEE
Q psy2133 58 FYVYDLGSTHGTFLNRC--KIKPKMYVRIHVGHMLSFGSSTRFF 99 (207)
Q Consensus 58 ~~l~Dl~StnGTfVNg~--rI~~~~~~~L~~Gd~I~fG~s~~~f 99 (207)
.....|++ +.+|.=+. +-.+...++|.+||+|.+|++++.+
T Consensus 134 v~slSLg~-~~~F~~~~~~r~~~~~~~~L~~Gdvvvm~G~~r~~ 176 (194)
T COG3145 134 VASLSLGA-PCIFRLRGRRRRGPGLRLRLEHGDVVVMGGPSRLA 176 (194)
T ss_pred eEEEecCC-CeEEEeccccCCCCceeEEecCCCEEEecCCcccc
Confidence 45666765 46776442 2466778999999999999987653
No 58
>cd00565 ThiS ThiaminS ubiquitin-like sulfur carrier protein. ThiS (ThiaminS) is a sulfur carrier protein involved in thiamin biosynthesis in bacteria. The ThiS fold, like those of two closely related proteins MoaD and Urm1, is similar to that of ubiquitin although there is little or no sequence similarity.
Probab=27.14 E-value=68 Score=21.59 Aligned_cols=25 Identities=12% Similarity=0.115 Sum_probs=17.7
Q ss_pred ceeecCeEcCCC--ceEEecCCCEEEE
Q psy2133 68 GTFLNRCKIKPK--MYVRIHVGHMLSF 92 (207)
Q Consensus 68 GTfVNg~rI~~~--~~~~L~~Gd~I~f 92 (207)
.+.+||+.+++. ....|+.||.|.|
T Consensus 32 ~V~vNg~~v~~~~~~~~~L~~gD~V~i 58 (65)
T cd00565 32 AVALNGEIVPRSEWASTPLQDGDRIEI 58 (65)
T ss_pred EEEECCEEcCHHHcCceecCCCCEEEE
Confidence 455788777654 1257999999876
No 59
>smart00524 DWB Domain B in dwarfin family proteins.
Probab=26.72 E-value=2e+02 Score=23.94 Aligned_cols=21 Identities=14% Similarity=0.308 Sum_probs=17.3
Q ss_pred CCeEEEEECCCccceeecCeEc
Q psy2133 55 ARGFYVYDLGSTHGTFLNRCKI 76 (207)
Q Consensus 55 ~~~~~l~Dl~StnGTfVNg~rI 76 (207)
++.++|+.+ |.+..||+-.-+
T Consensus 70 ~g~Vw~~~~-s~~~VFVqS~~~ 90 (171)
T smart00524 70 NGDVWLYNR-SDSPIFVQSPYL 90 (171)
T ss_pred CCeEEEEEc-CCCCeEEcCCCc
Confidence 578999998 899999986443
No 60
>PF09269 DUF1967: Domain of unknown function (DUF1967); InterPro: IPR015349 The Obg family comprises a group of ancient P-loop small G proteins (GTPases) belonging to the TRAFAC (for translation factors) class and can be subdivided into several distinct protein subfamilies []. OBG GTPases have been found in both prokaryotes and eukaryotes []. The structure of the OBG GTPase from Thermus thermophilus has been determined []. This entry represents a C-terminal domain found in certain OBG GTPases. This domain contains a four-stranded beta sheet and three alpha helices flanked by an additional beta strand. It is predominantly found in the bacterial GTP-binding protein Obg, and is functionally uncharacterised. ; GO: 0000166 nucleotide binding; PDB: 1UDX_A.
Probab=25.43 E-value=63 Score=22.54 Aligned_cols=16 Identities=13% Similarity=0.210 Sum_probs=10.4
Q ss_pred ecCCCEEEEcCeeeEE
Q psy2133 84 IHVGHMLSFGSSTRFF 99 (207)
Q Consensus 84 L~~Gd~I~fG~s~~~f 99 (207)
++.||+|++|+-.+.|
T Consensus 54 ~~~GD~V~Ig~~eFe~ 69 (69)
T PF09269_consen 54 AKEGDTVRIGDYEFEY 69 (69)
T ss_dssp --TT-EEEETTEEEE-
T ss_pred CCCCCEEEEcCEEEEC
Confidence 7899999999976544
No 61
>PRK06488 sulfur carrier protein ThiS; Validated
Probab=25.40 E-value=55 Score=22.08 Aligned_cols=24 Identities=17% Similarity=0.223 Sum_probs=16.0
Q ss_pred eeecCeEcCCCc--eEEecCCCEEEE
Q psy2133 69 TFLNRCKIKPKM--YVRIHVGHMLSF 92 (207)
Q Consensus 69 TfVNg~rI~~~~--~~~L~~Gd~I~f 92 (207)
.-+|+.-+++.. .+.|+.||.|.|
T Consensus 33 vavN~~iv~~~~~~~~~L~dgD~Iei 58 (65)
T PRK06488 33 TAVNGELVHKEARAQFVLHEGDRIEI 58 (65)
T ss_pred EEECCEEcCHHHcCccccCCCCEEEE
Confidence 345666665422 357999999876
No 62
>PHA02550 32 single-stranded DNA binding protein; Provisional
Probab=25.30 E-value=41 Score=30.46 Aligned_cols=18 Identities=33% Similarity=0.648 Sum_probs=14.9
Q ss_pred CCCeEEEEECCCccceee
Q psy2133 54 PARGFYVYDLGSTHGTFL 71 (207)
Q Consensus 54 ~~~~~~l~Dl~StnGTfV 71 (207)
+.++|||+.+.||||-+-
T Consensus 70 ~~ggWYIENc~sT~Gd~D 87 (304)
T PHA02550 70 GNGGWYIENCSSTHGDYD 87 (304)
T ss_pred cCCceEeecCccccCCCC
Confidence 467899999999999443
No 63
>cd00050 MH2 MH2 domain; C terminal domain of SMAD family proteins, responsible for receptor interaction, transactivation, and homo- and heterooligomerisation; also known as Domain B in dwarfin family proteins
Probab=25.15 E-value=2.5e+02 Score=23.45 Aligned_cols=22 Identities=23% Similarity=0.483 Sum_probs=18.2
Q ss_pred CCeEEEEECCCccceeecCeEcC
Q psy2133 55 ARGFYVYDLGSTHGTFLNRCKIK 77 (207)
Q Consensus 55 ~~~~~l~Dl~StnGTfVNg~rI~ 77 (207)
++.++++.+ |.+..||+.-.+.
T Consensus 75 ~g~Vw~~~~-S~~~VFVqS~~l~ 96 (176)
T cd00050 75 NGEVWAYNL-SDHPIFVQSPTLD 96 (176)
T ss_pred CCeEEEEEc-CCCCEEEcCCCCC
Confidence 588999998 8999999875544
No 64
>KOG0246|consensus
Probab=24.53 E-value=81 Score=31.36 Aligned_cols=36 Identities=28% Similarity=0.325 Sum_probs=22.3
Q ss_pred CCCcCccceEEEEEeecCC-cCCCCeEEEEEC-CCccc
Q psy2133 33 HPTVSRYHAILQYKSTFDE-KDPARGFYVYDL-GSTHG 68 (207)
Q Consensus 33 ~psVSR~HAvI~~~~~~~~-~~~~~~~~l~Dl-~StnG 68 (207)
+..-||-||++|+.-.... .--.|.|-++|| ||--|
T Consensus 415 Ns~SSRSHAvfQIilr~~~~~k~hGKfSlIDLAGnERG 452 (676)
T KOG0246|consen 415 NSNSSRSHAVFQIILRKHGEFKLHGKFSLIDLAGNERG 452 (676)
T ss_pred cccccccceeEeeeeecCCcceeEeEEEEEEccCCccC
Confidence 4556999999998632210 001467999998 44333
No 65
>TIGR01683 thiS thiamine biosynthesis protein ThiS. This model represents ThiS, a small thiamine-biosynthesis protein related to MoaD, a molybdenum cofactor biosynthesis protein. Both proteins are involved in sulfur transfer. ThiS has a conserved Gly-Gly C-terminus that is modified, in reactions requiring ThiI, ThiF, IscS, and a sulfur atom from Cys, into the thiocarboxylate that provides the sulfur for thiazole biosynthesis.
Probab=24.17 E-value=79 Score=21.23 Aligned_cols=24 Identities=8% Similarity=0.037 Sum_probs=16.4
Q ss_pred eeecCeEcCCCc--eEEecCCCEEEE
Q psy2133 69 TFLNRCKIKPKM--YVRIHVGHMLSF 92 (207)
Q Consensus 69 TfVNg~rI~~~~--~~~L~~Gd~I~f 92 (207)
..+|+..|++.. .+.|+.||.|.|
T Consensus 32 v~vN~~iv~~~~~~~~~L~~gD~vei 57 (64)
T TIGR01683 32 VAVNGEIVPRSEWDDTILKEGDRIEI 57 (64)
T ss_pred EEECCEEcCHHHcCceecCCCCEEEE
Confidence 346777765443 357999999876
No 66
>PHA00728 hypothetical protein
Probab=23.95 E-value=90 Score=24.88 Aligned_cols=26 Identities=42% Similarity=0.520 Sum_probs=17.7
Q ss_pred ccccCCCCCCCCCcccccccCchhhh
Q psy2133 163 ETDLSENPYASTNNEELYLDDPKKTL 188 (207)
Q Consensus 163 e~~~~~np~~~~~~e~~~~~dp~k~l 188 (207)
+-...+|||..+++----+-+||.|.
T Consensus 38 ~lqEiEnPYTVTNRaIsElV~PkDTM 63 (151)
T PHA00728 38 ELQEIENPYTVTNRAISELVEPKDTM 63 (151)
T ss_pred hHHHhcCCceehhHHHHHhcCCccce
Confidence 33456899988777655666777653
No 67
>TIGR03595 Obg_CgtA_exten Obg family GTPase CgtA, C-terminal extension. CgtA (see model TIGR02729) is a broadly conserved member of the obg family of GTPases associated with ribosome maturation. This model represents a unique C-terminal domain found in some but not all sequences of CgtA. This region is preceded, and may be followed, by a region of low-complexity sequence.
Probab=22.36 E-value=69 Score=22.37 Aligned_cols=15 Identities=13% Similarity=0.069 Sum_probs=12.7
Q ss_pred ecCCCEEEEcCeeeE
Q psy2133 84 IHVGHMLSFGSSTRF 98 (207)
Q Consensus 84 L~~Gd~I~fG~s~~~ 98 (207)
++.||+|++|+..+-
T Consensus 54 ~~~GD~V~Ig~~eFe 68 (69)
T TIGR03595 54 AKDGDTVRIGDFEFE 68 (69)
T ss_pred CCCCCEEEEccEEEe
Confidence 889999999996653
No 68
>KOG3159|consensus
Probab=22.07 E-value=1.3e+02 Score=27.52 Aligned_cols=46 Identities=20% Similarity=0.299 Sum_probs=32.2
Q ss_pred CCeEEEccCCC----CcEEeCCCCcCccceEEEEEeecCCcCCCCeEEEEECCCccceee
Q psy2133 16 RSFYCVGRERN----THLNLLHPTVSRYHAILQYKSTFDEKDPARGFYVYDLGSTHGTFL 71 (207)
Q Consensus 16 ~~~~~IGR~~~----cDv~l~~psVSR~HAvI~~~~~~~~~~~~~~~~l~Dl~StnGTfV 71 (207)
.++++|||... |+|.+ .+++.=-+|... ++||-++.|+|--|-|++
T Consensus 48 ~p~vVIGRhQNpw~E~nv~~---~~e~~I~liRR~-------SGGGTVyHDlGNLN~S~l 97 (336)
T KOG3159|consen 48 DPCVVIGRHQNPWQEANVAL---LRENNIPLIRRF-------SGGGTVYHDLGNLNYSLL 97 (336)
T ss_pred CceEEEccCCCcceeccHHH---HHhcCCeEEEEe-------cCCceEEEecCceeEEEE
Confidence 37889999865 55543 234444444444 378889999999999988
No 69
>COG0522 RpsD Ribosomal protein S4 and related proteins [Translation, ribosomal structure and biogenesis]
Probab=22.05 E-value=1.1e+02 Score=26.25 Aligned_cols=27 Identities=11% Similarity=0.167 Sum_probs=21.1
Q ss_pred eeecCeEcCCCceEEecCCCEEEEcCee
Q psy2133 69 TFLNRCKIKPKMYVRIHVGHMLSFGSST 96 (207)
Q Consensus 69 TfVNg~rI~~~~~~~L~~Gd~I~fG~s~ 96 (207)
..|||++|.-- .+.|++||+|.+...+
T Consensus 121 I~VnGk~V~iP-Sy~V~~gdei~V~~k~ 147 (205)
T COG0522 121 ILVNGKRVNIP-SYLVSPGDEISVREKS 147 (205)
T ss_pred EEECCEEeccC-cEEecCCCEEEeeecc
Confidence 56899988665 4689999999988544
No 70
>TIGR01682 moaD molybdopterin converting factor, subunit 1, non-archaeal. The C-terminal Gly-Gly of this model is critical to function.
Probab=21.49 E-value=1.1e+02 Score=21.38 Aligned_cols=24 Identities=13% Similarity=0.155 Sum_probs=18.5
Q ss_pred cceeecCeEcCCCceEEecCCCEEEE
Q psy2133 67 HGTFLNRCKIKPKMYVRIHVGHMLSF 92 (207)
Q Consensus 67 nGTfVNg~rI~~~~~~~L~~Gd~I~f 92 (207)
--..||++.+... ..|++||.|.|
T Consensus 50 ~~v~vn~~~v~~~--~~l~dgDevai 73 (80)
T TIGR01682 50 VMVAVNEEYVTDD--ALLNEGDEVAF 73 (80)
T ss_pred eEEEECCEEcCCC--cCcCCCCEEEE
Confidence 3467888887754 58999999876
No 71
>PF03021 CM2: Influenza C virus M2 protein; InterPro: IPR004267 This family represents the matrix protein, M2, of Influenza C virus. The M1 protein is the product of a spliced mRNA (see IPR004271 from INTERPRO). Small quantities of the unspliced mRNA are found in the cell additionally encoding the M2 protein.
Probab=21.31 E-value=57 Score=25.15 Aligned_cols=19 Identities=42% Similarity=0.753 Sum_probs=16.0
Q ss_pred CCCCCcccccccCCCCCCC
Q psy2133 155 GMGDDAEEETDLSENPYAS 173 (207)
Q Consensus 155 Gm~eda~ee~~~~~np~~~ 173 (207)
-+||||.+|+|.+.-||+.
T Consensus 115 dlgedapdetdnspipfsn 133 (139)
T PF03021_consen 115 DLGEDAPDETDNSPIPFSN 133 (139)
T ss_pred cccCCCCCcccCCCCccCC
Confidence 4789999999988888864
No 72
>PF12525 DUF3726: Protein of unknown function (DUF3726) ; InterPro: IPR022201 This domain family is found in bacteria and eukaryotes, and is approximately 80 amino acids in length. There is a single completely conserved residue E that may be functionally important.
Probab=21.07 E-value=29 Score=25.31 Aligned_cols=20 Identities=35% Similarity=0.637 Sum_probs=15.1
Q ss_pred hhcchhhhcCCccCCCCCcc
Q psy2133 142 EAKTEEEDEGISWGMGDDAE 161 (207)
Q Consensus 142 ~~~~~~~~~g~~wGm~eda~ 161 (207)
..+++.+-.|-+||+.||+-
T Consensus 10 ~~~kA~~Gag~~~G~Ae~aa 29 (80)
T PF12525_consen 10 LAKKAFRGAGLSWGEAEEAA 29 (80)
T ss_pred HHHHHHHhcCCCcchHHHHH
Confidence 34456678999999988764
No 73
>PRK08364 sulfur carrier protein ThiS; Provisional
Probab=20.55 E-value=1.2e+02 Score=20.88 Aligned_cols=21 Identities=10% Similarity=-0.001 Sum_probs=15.7
Q ss_pred eecCeEcCCCceEEecCCCEEEE
Q psy2133 70 FLNRCKIKPKMYVRIHVGHMLSF 92 (207)
Q Consensus 70 fVNg~rI~~~~~~~L~~Gd~I~f 92 (207)
.+|+..+++. ..|+.||.|.|
T Consensus 43 ~vNg~iv~~~--~~l~~gD~Vei 63 (70)
T PRK08364 43 KVNGKVALED--DPVKDGDYVEV 63 (70)
T ss_pred EECCEECCCC--cCcCCCCEEEE
Confidence 4677777654 47999999876
No 74
>PF04710 Pellino: Pellino; InterPro: IPR006800 Pellino is involved in Toll-like signalling pathways, and associates with the kinase domain of the Pelle Ser/Thr kinase [, , ].; PDB: 3EGB_B 3EGA_A.
Probab=20.53 E-value=1.2e+02 Score=28.69 Aligned_cols=31 Identities=16% Similarity=0.461 Sum_probs=17.1
Q ss_pred eEEEccCCC--CcEEeC---------------CCCcCccceEEEEEee
Q psy2133 18 FYCVGRERN--THLNLL---------------HPTVSRYHAILQYKST 48 (207)
Q Consensus 18 ~~~IGR~~~--cDv~l~---------------~psVSR~HAvI~~~~~ 48 (207)
-+=|||+.+ .|+++- ..+|||+-|+|..++.
T Consensus 97 MFQIGRSte~~IDFvV~dt~~G~~~~~~~~~~qStISRfACRI~~dR~ 144 (416)
T PF04710_consen 97 MFQIGRSTESPIDFVVMDTVPGGQDNEDTQPTQSTISRFACRILCDRS 144 (416)
T ss_dssp EEEEES--STT-SEE---------------EEE--S-TT-EEEEEESS
T ss_pred hhhhccCCCCCcCEEEeCCCCCCCcCCCCCccccchhheeEEEEeccC
Confidence 456999875 677664 3469999999999864
No 75
>cd00571 UreE UreE urease accessory protein. UreE is a metallochaperone assisting the insertion of a Ni2+ ion in the active site of urease, an important step in the in vivo assembly of urease, an enzyme that hydrolyses urea into ammonia and carbamic acid. The C-terminal region of UreE contains a histidine rich nickel binding site.
Probab=20.24 E-value=2e+02 Score=22.57 Aligned_cols=40 Identities=18% Similarity=0.287 Sum_probs=26.5
Q ss_pred CCccceeecCeEcCCCceEEecCCCEEEEcCeeeEEEEeCCCC
Q psy2133 64 GSTHGTFLNRCKIKPKMYVRIHVGHMLSFGSSTRFFILQGPSE 106 (207)
Q Consensus 64 ~StnGTfVNg~rI~~~~~~~L~~Gd~I~fG~s~~~fil~~p~~ 106 (207)
.|.+|+-+ +-+++.++ .|++||++..-+.....+...|.+
T Consensus 38 ~t~~G~ei-~i~L~r~~--~L~dGdvL~~d~~~~i~V~~~~e~ 77 (136)
T cd00571 38 KSDHGEEL-GIRLPRGQ--VLRDGDVLFLDDGRLVVVRAAPED 77 (136)
T ss_pred ECCCCCEE-EEECCCCC--cCCCCCEEEeCCCCEEEEEECCCc
Confidence 36777776 55666653 699999999988655444444433
No 76
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=20.15 E-value=1.1e+02 Score=20.37 Aligned_cols=24 Identities=13% Similarity=0.087 Sum_probs=16.4
Q ss_pred eeecCeEcCCC--ceEEecCCCEEEE
Q psy2133 69 TFLNRCKIKPK--MYVRIHVGHMLSF 92 (207)
Q Consensus 69 TfVNg~rI~~~--~~~~L~~Gd~I~f 92 (207)
..+|+..+++. ....|+.||.|.|
T Consensus 33 v~vN~~~v~~~~~~~~~L~~gD~vei 58 (65)
T PRK06944 33 VAVNGDFVARTQHAARALAAGDRLDL 58 (65)
T ss_pred EEECCEEcCchhcccccCCCCCEEEE
Confidence 46677766542 2356999999876
Done!