Query psy11622
Match_columns 346
No_of_seqs 176 out of 436
Neff 5.0
Searched_HMMs 46136
Date Fri Aug 16 23:23:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11622.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11622hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2967|consensus 100.0 1.2E-67 2.5E-72 509.1 21.3 266 27-294 13-289 (314)
2 PF01746 tRNA_m1G_MT: tRNA (Gu 100.0 3E-45 6.5E-50 329.4 -16.7 166 117-282 1-186 (186)
3 COG2419 Uncharacterized conser 99.9 1.7E-24 3.6E-29 207.9 13.9 174 104-282 95-274 (336)
4 COG2428 Uncharacterized conser 93.4 0.15 3.3E-06 46.8 5.3 109 144-265 27-138 (196)
5 PF04252 RNA_Me_trans: Predict 90.4 0.45 9.7E-06 44.5 4.9 55 183-237 60-118 (196)
6 smart00250 PLEC Plectin repeat 82.2 1.3 2.7E-05 30.4 2.4 25 209-233 9-33 (38)
7 PF00681 Plectin: Plectin repe 70.4 1.5 3.3E-05 31.1 0.2 25 209-233 9-33 (45)
8 PF01344 Kelch_1: Kelch motif; 67.8 3.3 7.3E-05 28.3 1.5 13 203-215 10-22 (47)
9 PLN03086 PRLI-interacting fact 58.6 29 0.00063 37.4 7.1 101 143-280 112-212 (567)
10 COG1901 Uncharacterized conser 58.3 26 0.00056 32.9 5.8 122 108-265 64-193 (197)
11 PF13964 Kelch_6: Kelch motif 52.0 6.2 0.00013 27.7 0.6 13 204-216 11-23 (50)
12 PRK13679 hypothetical protein; 51.6 41 0.0009 29.7 5.9 77 122-202 10-95 (168)
13 PF13019 Telomere_Sde2: Telome 48.2 35 0.00076 31.1 4.9 33 50-82 124-156 (162)
14 COG5055 RAD52 Recombination DN 46.3 52 0.0011 33.2 6.1 115 116-235 21-141 (375)
15 smart00612 Kelch Kelch domain. 44.8 12 0.00026 24.7 1.1 9 206-214 1-9 (47)
16 TIGR00853 pts-lac PTS system, 42.7 31 0.00066 28.2 3.4 61 170-251 32-95 (95)
17 PF08496 Peptidase_S49_N: Pept 42.5 1.3E+02 0.0029 27.0 7.7 42 114-156 103-144 (155)
18 PF07788 DUF1626: Protein of u 41.8 25 0.00054 27.9 2.6 27 202-233 44-70 (70)
19 PF09805 Nop25: Nucleolar prot 40.6 68 0.0015 28.1 5.4 10 45-54 26-35 (137)
20 PRK11657 dsbG disulfide isomer 39.4 31 0.00067 32.9 3.4 51 185-246 57-113 (251)
21 cd03020 DsbA_DsbC_DsbG DsbA fa 35.9 62 0.0013 29.0 4.6 51 186-246 21-73 (197)
22 PRK12377 putative replication 35.9 2.3E+02 0.005 27.1 8.7 49 110-159 67-115 (248)
23 PF09554 RE_HaeII: HaeII restr 35.1 41 0.00088 33.5 3.4 96 105-215 165-263 (338)
24 KOG1324|consensus 34.9 26 0.00057 32.7 2.0 39 189-235 88-133 (190)
25 PF07646 Kelch_2: Kelch motif; 33.8 20 0.00044 25.0 0.9 10 204-213 11-20 (49)
26 COG5493 Uncharacterized conser 32.0 46 0.00099 31.6 3.1 33 202-235 194-226 (231)
27 PF04122 CW_binding_2: Putativ 31.3 12 0.00025 29.8 -0.8 16 198-213 67-82 (92)
28 KOG0068|consensus 31.2 50 0.0011 34.0 3.4 117 109-232 94-254 (406)
29 PF02112 PDEase_II: cAMP phosp 29.0 70 0.0015 32.3 4.1 47 109-155 245-300 (335)
30 PRK02135 hypothetical protein; 28.8 1.4E+02 0.0031 28.1 5.8 83 171-265 108-195 (201)
31 PLN02850 aspartate-tRNA ligase 28.7 71 0.0015 34.1 4.3 23 44-66 11-33 (530)
32 PRK04171 ribosome biogenesis p 28.5 67 0.0015 30.7 3.7 84 168-265 126-216 (222)
33 PF04298 Zn_peptidase_2: Putat 25.6 1.8E+02 0.004 27.8 6.0 69 125-195 12-81 (222)
34 COG1366 SpoIIAA Anti-anti-sigm 25.5 2.9E+02 0.0062 22.6 6.6 52 106-163 43-94 (117)
35 PF01740 STAS: STAS domain; I 24.6 1.8E+02 0.0039 23.3 5.2 55 107-168 48-102 (117)
36 COG1578 Uncharacterized conser 24.1 3.5E+02 0.0077 26.9 7.7 44 187-233 153-205 (285)
37 PF13854 Kelch_5: Kelch motif 23.9 37 0.0008 23.2 0.8 11 204-214 14-24 (42)
38 KOG2967|consensus 23.9 1.8E+02 0.0038 29.4 5.8 34 40-73 21-54 (314)
39 cd05564 PTS_IIB_chitobiose_lic 23.7 55 0.0012 26.6 1.9 48 183-251 44-91 (96)
40 PF13056 DUF3918: Protein of u 23.3 58 0.0012 23.6 1.7 16 44-59 25-40 (43)
41 KOG0555|consensus 22.4 1.4E+02 0.003 31.5 4.8 61 187-261 289-352 (545)
42 PF13418 Kelch_4: Galactose ox 21.9 57 0.0012 22.4 1.4 16 202-217 10-25 (49)
43 COG1514 LigT 2'-5' RNA ligase 20.5 5.3E+02 0.012 23.5 7.8 120 122-264 10-144 (180)
44 PF06117 DUF957: Enterobacteri 20.4 78 0.0017 24.8 2.0 17 249-265 2-18 (65)
45 cd07043 STAS_anti-anti-sigma_f 20.4 3.8E+02 0.0083 20.2 6.0 52 107-164 38-89 (99)
46 KOG1135|consensus 20.2 27 0.00058 38.5 -0.8 31 106-136 23-53 (764)
No 1
>KOG2967|consensus
Probab=100.00 E-value=1.2e-67 Score=509.07 Aligned_cols=266 Identities=41% Similarity=0.679 Sum_probs=219.5
Q ss_pred cccCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHhhchhHHHHHHHHHHHHHH-HHHHcCCCCCc-chH---HH-HHHh
Q psy11622 27 IEKNVPENEPCDTNTAEPPLSKRAMKRKLKHEKWLKWKPIKRAKEREKLKQKKQ-YAREHNIPLGP-SRK---AL-KLVK 100 (346)
Q Consensus 27 ~~~~~~~~~~~~~~~~~~~lSK~q~KKl~k~~kwe~~K~~rr~~~kekkK~kr~-~~~~~~~~~~~-~rk---~~-k~~~ 100 (346)
.++.++.+.+..... +.++||+|+|++++++.|++.++.+++++|++...++. +.+...+.++. .|. +. .+..
T Consensus 13 ~e~~v~~~~~~~~p~-~e~~sk~q~k~~~k~~~wee~~~~~~~~rr~~er~~k~~~k~~~~i~~g~~~r~~~~~~~~r~~ 91 (314)
T KOG2967|consen 13 SEESVKHQEPGQQPV-AEPMSKKQLKRQKKQAEWEELKKKKKERRREKERLRKKQEKRNELIELGLEVRLRRIRMEKRIL 91 (314)
T ss_pred cHHHhcccCCCCCcc-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhhcccCchhhHHHHHHHHHhH
Confidence 555566666554443 45899999999999999999998885433332222222 12223333332 222 22 2333
Q ss_pred hhcCCCCCeEEEEcCCCcccCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCChhHHHHHHH---hcCCCcc-ceEEec
Q psy11622 101 METSPNKLRVCIDFSFDHLMTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDKCKQEIAR---YNGYEHW-DVHIHE 176 (346)
Q Consensus 101 ~~~~~~~~~IvIDcsFddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~l~~~l~k---~~g~~nW-~v~~~~ 176 (346)
..+.++||+|||||+|+++|+.||+.+|++||++||++||++..||+|+||||.+.+...++. +.||.+| .+.+.+
T Consensus 92 ~~~~~s~~rivlD~sfd~lM~~kei~~l~~Qi~~~y~~Nr~a~~Pf~l~~~n~~~~~~~~~~l~~~~~~~~n~~~~~~~~ 171 (314)
T KOG2967|consen 92 AKAMDSGPRIVLDCSFDELMNEKEIVNLVNQIQRCYSENRRAKHPFHLHFTNFQGDIFKRQDLVKKNPGYENWKRIILYP 171 (314)
T ss_pred HhhhccCCeEEEeccHHHHHhHHHHHHHHHHHHHHhhhcccCCCCeEEEEecCCcchHHHHHHHhcCCCcccceeeeccc
Confidence 356799999999999999999999999999999999999999999999999999877665443 4478899 566677
Q ss_pred chhhhc-cCCCCeEEeCCCCccccccccCCceEEEEeEecCCCChhhhHHHHHHcCCcccccccccccccCCCcccchHH
Q psy11622 177 QAYLDL-FKKEDLVYLTSDSDNIIEELDQSKVYIIGGLVDHNQHKLLTLNKAREENIAHGKLPIDTYLNMKTRQVLAVNH 255 (346)
Q Consensus 177 ~~~~el-f~k~~iVYLSpDS~~~L~eld~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~taRLPI~eyi~l~~rkVLtiN~ 255 (346)
..+..+ |.+++||||||||+++|++++|++|||||||||+|++||+|+.+|.++||+||||||++||+|.+++|||+||
T Consensus 172 ~~~s~~~~kkenlVYLT~ds~~vL~dldp~~vYiIGglVD~n~~k~l~~~kA~~~gi~tarLPi~eyi~~~s~~vLt~nq 251 (314)
T KOG2967|consen 172 ECTSLLPFKKENLVYLTADSENVLEDLDPDKVYIIGGLVDKNRQKGLTLSKAQEYGIRTARLPLDEYIKMESRKVLTLNQ 251 (314)
T ss_pred cccccccCCccceEEECCCCccchhccCCCcEEEEEEEecCCCCcchhHHHHHHcCCCcccCchHHhhhcCCCceeeHHH
Confidence 777777 8899999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHhhhCCCCHHHHHHhhCCCCcccccccCCCCCc
Q psy11622 256 VFQILLAISSDKKSWKDALLETLPERKNAAAKNTGSEDA 294 (346)
Q Consensus 256 V~EILl~~~e~~~DW~eA~~~vIP~RK~~~~~~~~~~~~ 294 (346)
||+||+.|++ ++||.+||+++||+||+..+..+++.+.
T Consensus 252 v~~Il~~~~~-~~dW~~Al~~~ip~RKg~~~~~~~~~~~ 289 (314)
T KOG2967|consen 252 VFEILLKYLE-TGDWKEALLSNIPKRKGAGLKSQESSEK 289 (314)
T ss_pred HHHHHHHHHh-cCCHHHHHHHhCccccccCccchhhhhh
Confidence 9999999999 9999999999999999999988744433
No 2
>PF01746 tRNA_m1G_MT: tRNA (Guanine-1)-methyltransferase; InterPro: IPR016009 In transfer RNA many different modified nucleosides are found, especially in the anticodon region. tRNA (guanine-N1-)-methyltransferase 2.1.1.31 from EC is one of several nucleases operating together with the tRNA-modifying enzymes before the formation of the mature tRNA. It catalyses the reaction: S-adenosyl-L-methionine + tRNA -> S-adenosyl-L-homocysteine + tRNA containing N1-methylguanine methylating guanosine(G) to N1-methylguanine (1-methylguanosine (m1G)) at position 37 of tRNAs that read CUN (leucine), CCN(proline), and CGG (arginine) codons. The presence of m1G improves the cellular growth rate and the polypeptide steptime and also prevents the tRNA from shifting the reading frame []. The mechanism of the trmD3-induced frameshift involving mutant tRNA(Pro) and tRNA(Leu) species has been investigated []. It has been suggested that the conformation of the anticodon loop may be a major determining element for the formation of m1G37 in vivo []. ; PDB: 3IEF_A 3KY7_A 1UAL_A 3AXZ_A 1UAM_A 1UAK_A 1UAJ_A 1OY5_B 3KNU_B 3QUV_A ....
Probab=100.00 E-value=3e-45 Score=329.38 Aligned_cols=166 Identities=43% Similarity=0.642 Sum_probs=134.8
Q ss_pred CcccCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCChhHHHHH--HHh---cCCCccce-----EEecchhhhccCCC
Q psy11622 117 DHLMTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDKCKQEI--ARY---NGYEHWDV-----HIHEQAYLDLFKKE 186 (346)
Q Consensus 117 ddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~l~~~l--~k~---~g~~nW~v-----~~~~~~~~elf~k~ 186 (346)
|++|+++|+.||++|+.+|||.||++.+|+++++||+++.++..+ ... .|+..|.+ .++++++.++|+++
T Consensus 1 ~~~m~~ke~~sl~~Q~~~~y~~nr~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~l~~~~ 80 (186)
T PF01746_consen 1 DDLMTEKEIKSLAKQLLRCYSANRRAKVPDQLYGTGFGMVLKPEPEFRALDSVNGWRVILLSPQGKPFTQKSAEELFPKE 80 (186)
T ss_dssp THHHHHTTSEEEEEEEGGGGSTSTTG-SEE-BTTS-SS-EE-HHHHHHHHHHHHTSEEEEE-TTSEE--HHHHHHHTTSS
T ss_pred CcChhhcCchhhhhhhHHHhhcCCCCCCCCcEEEeCcccccchhhhhHHHHHHhhhccccccccCcchhhHHHHhhccCC
Confidence 579999999999999999999999999999999999998766432 222 16766644 34577899999999
Q ss_pred CeEEeCCCCccccccccCCc--eEEEEeEecCCCChhh----hHHHHHHcCCc-ccccccccccccCC--CcccchHHHH
Q psy11622 187 DLVYLTSDSDNIIEELDQSK--VYIIGGLVDHNQHKLL----TLNKAREENIA-HGKLPIDTYLNMKT--RQVLAVNHVF 257 (346)
Q Consensus 187 ~iVYLSpDS~~~L~eld~~k--vYIIGGIVDrnr~Kgl----t~~kA~e~GI~-taRLPI~eyi~l~~--rkVLtiN~V~ 257 (346)
+||||||||+++|+++++++ +||||||||++++||+ |+.+|...||+ ++||||++|+.+.+ ++|||+||||
T Consensus 81 ~lVyLs~d~e~~le~~~~~~~~vyiIGgiVD~~~~k~~~~~~~~~~a~~~Gi~~~~rLpi~~~l~~~~~~~k~l~in~V~ 160 (186)
T PF01746_consen 81 NLVYLSPDSEGVLERVDPDKDDVYIIGGIVDRNGEKGAMTIADLEKALEPGIRNTARLPIDSFLLEYPHYTKVLTINQVP 160 (186)
T ss_dssp EEEEEE-BTT-BBHHHHHHHTSEEESSSS--SCSHHHHHHHHHHHHTTSTTTSGGGG-SSSCCS----B-BSSSEETE--
T ss_pred CEEEEECCccccccccccccceEEEEccEEccCCcccchhhhhHHHHHccCCCccccCcccccccCCCCCCcCceECchH
Confidence 99999999999999999999 9999999999999999 99999999999 99999999999887 8999999999
Q ss_pred HHHHHhhh-CCCCHHHHHHhhCCCCc
Q psy11622 258 QILLAISS-DKKSWKDALLETLPERK 282 (346)
Q Consensus 258 EILl~~~e-~~~DW~eA~~~vIP~RK 282 (346)
+|||.+.+ .++||++||.++||+||
T Consensus 161 eILl~~~~~~~~~W~~A~~~~ip~Rk 186 (186)
T PF01746_consen 161 EILLSGNHKKIGDWKEALEKTIPKRK 186 (186)
T ss_dssp GGGGSTSHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHhhchhhhchHHHHHHHHhccCC
Confidence 99999976 68999999999999997
No 3
>COG2419 Uncharacterized conserved protein [Function unknown]
Probab=99.92 E-value=1.7e-24 Score=207.86 Aligned_cols=174 Identities=18% Similarity=0.261 Sum_probs=144.1
Q ss_pred CCCCCeEEEEcCCCcccCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCChhHHHHHHHhcCCCccceEEecchhhhcc
Q psy11622 104 SPNKLRVCIDFSFDHLMTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDKCKQEIARYNGYEHWDVHIHEQAYLDLF 183 (346)
Q Consensus 104 ~~~~~~IvIDcsFddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~l~~~l~k~~g~~nW~v~~~~~~~~elf 183 (346)
.|..|.|||||+|++.++++|..+|..||..+||..|++.||-+|.+|+.+++..+++ + .++..-...-....|+...
T Consensus 95 ~P~~P~fVVDl~lwdeH~~~Ek~kl~lQi~vTl~~vR~ylwD~~L~ltn~~~e~~e~~-~-l~~~k~~~~~~~~e~l~~~ 172 (336)
T COG2419 95 LPAYPYFVVDLRLWDEHSDKEKSKLELQIAVTLGTVRRYLWDRHLVLTNTNPEAKERL-K-LPLEKIGYEGKTEEFLKEI 172 (336)
T ss_pred CCCCCEEEEEeechhhcChHhHHHHHHHHHHHHHHHHHHhcCCceEEEecChhhhhhe-e-cccccccccCCHHHHHhhc
Confidence 4678999999999999999999999999999999999999999999999999988877 2 2222100011223455566
Q ss_pred CCCCeEEeCCCCcccccc--ccCCceEEEEeEecCCC-ChhhhHHHHHHcCCcccccccccccccCCC---cccchHHHH
Q psy11622 184 KKEDLVYLTSDSDNIIEE--LDQSKVYIIGGLVDHNQ-HKLLTLNKAREENIAHGKLPIDTYLNMKTR---QVLAVNHVF 257 (346)
Q Consensus 184 ~k~~iVYLSpDS~~~L~e--ld~~kvYIIGGIVDrnr-~Kglt~~kA~e~GI~taRLPI~eyi~l~~r---kVLtiN~V~ 257 (346)
+.+++|.|.|++++.|++ +...++||||||||+++ .||.|.+.|..+|..- .+|-.. |.+.|+ .+..|||++
T Consensus 173 ~idrvVlLdPnade~lse~~~r~~~~FIIGGIVDk~~~~kg~Tari~~~l~~~g-ev~rrk-I~LRGsvvGVPDRIN~I~ 250 (336)
T COG2419 173 GIDRVVLLDPNADELLSEEEIRGAKAFIIGGIVDKKGTKKGATARIGEVLGREG-EVPRRK-IELRGSVVGVPDRINHIV 250 (336)
T ss_pred CcCceEEeCCCCccccchhhhccCceEEEeeeEcCCCCccchHHHhhhhcCCcc-ccceeE-EEEecCccCCchHHHHHH
Confidence 889999999999999975 67789999999999985 7999999999888764 445332 567765 679999999
Q ss_pred HHHHHhhhCCCCHHHHHHhhCCCCc
Q psy11622 258 QILLAISSDKKSWKDALLETLPERK 282 (346)
Q Consensus 258 EILl~~~e~~~DW~eA~~~vIP~RK 282 (346)
||||++++ |++.++|+.++.++|.
T Consensus 251 EILlR~~~-Ge~lEkAIlavQap~~ 274 (336)
T COG2419 251 EILLRMLY-GEPLEKAILAVQAPRD 274 (336)
T ss_pred HHHHHHHc-CccHHHHHHHhcCcHH
Confidence 99999999 9999999999887764
No 4
>COG2428 Uncharacterized conserved protein [Function unknown]
Probab=93.42 E-value=0.15 Score=46.84 Aligned_cols=109 Identities=10% Similarity=0.161 Sum_probs=67.5
Q ss_pred CCCcEEEEcCChhHHHHHHHhcCCCccceEEecchhh-hccCCCCeEEeCCCCccccc--cccCCceEEEEeEecCCCCh
Q psy11622 144 HPLQFYVSSFKDKCKQEIARYNGYEHWDVHIHEQAYL-DLFKKEDLVYLTSDSDNIIE--ELDQSKVYIIGGLVDHNQHK 220 (346)
Q Consensus 144 ~P~~L~lts~~g~l~~~l~k~~g~~nW~v~~~~~~~~-elf~k~~iVYLSpDS~~~L~--eld~~kvYIIGGIVDrnr~K 220 (346)
+-..|.|||..+...+.+.+. |+.- | ..++. ..+....++.|+++|+..|+ +.++++++|+|||.--..-+
T Consensus 27 ~G~~~~vtna~pe~p~vlak~-g~~~--i---~e~~~~~~l~r~rvivLDl~a~~~L~PEdas~~~~ivvGGIlGD~ppr 100 (196)
T COG2428 27 WGDEFIVTNAKPEEPEVLAKI-GLSG--I---PESITRLPLDRSRVIVLDLQAEEELKPEDASEDTYIVVGGILGDHPPR 100 (196)
T ss_pred hchheeeecCCcchhHHHHHh-cccc--C---chhHhhcccCCCCEEEecCCCCCCCCcccCCcccEEEECccccCCCCC
Confidence 455788999988666666553 3321 1 11222 23567899999999999998 67788999999998776667
Q ss_pred hhhHHHHHHcCCcccccccccccccCCCcccchHHHHHHHHHhhh
Q psy11622 221 LLTLNKAREENIAHGKLPIDTYLNMKTRQVLAVNHVFQILLAISS 265 (346)
Q Consensus 221 glt~~kA~e~GI~taRLPI~eyi~l~~rkVLtiN~V~EILl~~~e 265 (346)
|-|+.. .|++.+.-.- .--|.+-++|+--+.+-.-..+
T Consensus 101 gRT~~l------~T~~~~~~~~-R~lG~~q~SiDgA~rta~li~e 138 (196)
T COG2428 101 GRTKEL------ITSKMEGVKV-RHLGPKQFSIDGASRTAKLIAE 138 (196)
T ss_pred Ccchhh------hccccccccc-cccCccceeccHHHHHHHHHHc
Confidence 777743 3333332210 1114455666655444444444
No 5
>PF04252 RNA_Me_trans: Predicted SAM-dependent RNA methyltransferase; InterPro: IPR007364 This family of proteins are predicted to be alpha/beta-knot SAM-dependent RNA methyltransferases [].
Probab=90.41 E-value=0.45 Score=44.46 Aligned_cols=55 Identities=11% Similarity=0.274 Sum_probs=40.8
Q ss_pred cCCCCeEEeCCCCccccc--cccCCceEEEEeEecCCCChhhhHHHHHH--cCCccccc
Q psy11622 183 FKKEDLVYLTSDSDNIIE--ELDQSKVYIIGGLVDHNQHKLLTLNKARE--ENIAHGKL 237 (346)
Q Consensus 183 f~k~~iVYLSpDS~~~L~--eld~~kvYIIGGIVDrnr~Kglt~~kA~e--~GI~taRL 237 (346)
+++++++.|+|.|+..|+ +.+..+++|+|||.--..-++-|...-.. .|+..+||
T Consensus 60 ~~~~~VcLLDP~A~~~L~PeD~~~fd~fvfGGILGD~PPrdRT~eLr~~~~~g~~~R~L 118 (196)
T PF04252_consen 60 IDKSRVCLLDPAAEKELSPEDGEKFDYFVFGGILGDHPPRDRTSELRTKKPKGFEGRRL 118 (196)
T ss_pred CCcCCEEEeCCCCCCCCCccccCcccEEEECcccCCCCCCCchHHHHhhhccCcccccc
Confidence 567899999999999998 66778899999999443334445544433 47776666
No 6
>smart00250 PLEC Plectin repeat.
Probab=82.23 E-value=1.3 Score=30.37 Aligned_cols=25 Identities=24% Similarity=0.502 Sum_probs=22.9
Q ss_pred EEEeEecCCCChhhhHHHHHHcCCc
Q psy11622 209 IIGGLVDHNQHKLLTLNKAREENIA 233 (346)
Q Consensus 209 IIGGIVDrnr~Kglt~~kA~e~GI~ 233 (346)
-+|||+|......+|+..|.+.|+-
T Consensus 9 ~~~Giidp~t~~~lsv~eA~~~gli 33 (38)
T smart00250 9 AIGGIIDPETGQKLSVEEALRRGLI 33 (38)
T ss_pred heeEEEcCCCCCCcCHHHHHHcCCC
Confidence 6899999999999999999999874
No 7
>PF00681 Plectin: Plectin repeat; InterPro: IPR001101 Plectin may have a role in cross-linking intermediate filaments, in inter-linking intermediate filaments with microtubules and microfilaments and in anchoring intermediate filaments to the plasma and nuclear membranes. Plectin is recruited into hemidesmosomes, multiprotein complexes that facilitate adhesion of epithelia to the basement membrane, thereby providing linkage between the intracellular keratin filaments to the laminins of the extracellular matrix. Plectin binds to hemidesmosomes through association of its actin-binding domain with the first pair of fibronectin type III repeats and a small part of the connecting segment of the integrin-beta4 subunit, the latter (integrin-alpha6,beta4) acting as a receptor for the extracellular matrix component laminin-5. The plectin repeat is also seen in the cell adhesion junction plaque proteins, desmoplakin, envoplakin, and bullous pemphigoid antigen. The domains in plakins show considerable sequence homology. The N terminus consists of a plakin domain containing a number of subdomains with high alpha-helical content, while the central coiled-coil domain is composed of heptad repeats involved in the dimerisation of plakin, and the C terminus contains one or more homologous repeat sequences referred to plectin repeats []. This entry represents the plectin repeats found in the C terminus of plakin proteins.; GO: 0005856 cytoskeleton; PDB: 1LM7_A 1LM5_A.
Probab=70.37 E-value=1.5 Score=31.13 Aligned_cols=25 Identities=20% Similarity=0.450 Sum_probs=22.6
Q ss_pred EEEeEecCCCChhhhHHHHHHcCCc
Q psy11622 209 IIGGLVDHNQHKLLTLNKAREENIA 233 (346)
Q Consensus 209 IIGGIVDrnr~Kglt~~kA~e~GI~ 233 (346)
.+|||||-....-++...|...|+=
T Consensus 9 ~~gGiidp~tg~~lsv~~A~~~glI 33 (45)
T PF00681_consen 9 ATGGIIDPETGERLSVEEAIQRGLI 33 (45)
T ss_dssp TTTSEEETTTTEEEEHHHHHHTTSS
T ss_pred eeeeEEeCCCCeEEcHHHHHHCCCc
Confidence 5799999999999999999999963
No 8
>PF01344 Kelch_1: Kelch motif; InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=67.77 E-value=3.3 Score=28.31 Aligned_cols=13 Identities=38% Similarity=0.733 Sum_probs=11.0
Q ss_pred cCCceEEEEeEec
Q psy11622 203 DQSKVYIIGGLVD 215 (346)
Q Consensus 203 d~~kvYIIGGIVD 215 (346)
-.+++|||||.-.
T Consensus 10 ~~~~iyv~GG~~~ 22 (47)
T PF01344_consen 10 VGNKIYVIGGYDG 22 (47)
T ss_dssp ETTEEEEEEEBES
T ss_pred ECCEEEEEeeecc
Confidence 3578999999987
No 9
>PLN03086 PRLI-interacting factor K; Provisional
Probab=58.60 E-value=29 Score=37.44 Aligned_cols=101 Identities=21% Similarity=0.187 Sum_probs=56.1
Q ss_pred CCCCcEEEEcCChhHHHHHHHhcCCCccceEEecchhhhccCCCCeEEeCCCCccccccccCCceEEEEeEecCCCChhh
Q psy11622 143 QHPLQFYVSSFKDKCKQEIARYNGYEHWDVHIHEQAYLDLFKKEDLVYLTSDSDNIIEELDQSKVYIIGGLVDHNQHKLL 222 (346)
Q Consensus 143 ~~P~~L~lts~~g~l~~~l~k~~g~~nW~v~~~~~~~~elf~k~~iVYLSpDS~~~L~eld~~kvYIIGGIVDrnr~Kgl 222 (346)
.+|+-|-|++.....-..+. .+. .-..+--++++.-..+..|||++.=-..|.--++.. |.+|
T Consensus 112 ~~Pm~F~l~~~~~~~~~~~~--~~~---~~~~th~GVlEF~A~EG~v~lP~wm~~~L~~~~~~~----~~~v-------- 174 (567)
T PLN03086 112 KGPLYFRLSVVHQEGSGEMK--DTD---SQKTTHSGVLEFTAEEGSVGLPPHVWSNLFPSDPPD----VPLV-------- 174 (567)
T ss_pred CCCeEEEEeccccccccccc--ccc---CCcEEEEEEEEEEcCCCeEEcCHHHHhhcCCCCCCC----CCeE--------
Confidence 68999999886311000000 000 012333455666667888999887555443111100 0111
Q ss_pred hHHHHHHcCCcccccccccccccCCCcccchHHHHHHHHHhhhCCCCHHHHHHhhCCC
Q psy11622 223 TLNKAREENIAHGKLPIDTYLNMKTRQVLAVNHVFQILLAISSDKKSWKDALLETLPE 280 (346)
Q Consensus 223 t~~kA~e~GI~taRLPI~eyi~l~~rkVLtiN~V~EILl~~~e~~~DW~eA~~~vIP~ 280 (346)
-|..+.||-+.|+++. +++.+ |+. -.||+..|+..+..
T Consensus 175 --------~v~~~~Lpkgt~vklq---P~~~~--------f~d-i~npKavLE~~Lr~ 212 (567)
T PLN03086 175 --------EVRYIWLPKGTYAKLQ---PDGVG--------FSD-LPNHKAVLETALRQ 212 (567)
T ss_pred --------EEEEeecCCCCEEEEe---eccCC--------cCC-cccHHHHHHHHhhc
Confidence 3566799999999986 23322 555 67888888877743
No 10
>COG1901 Uncharacterized conserved protein [Function unknown]
Probab=58.33 E-value=26 Score=32.92 Aligned_cols=122 Identities=16% Similarity=0.201 Sum_probs=69.7
Q ss_pred CeEEEEcCCCcccCHHHHhHHHHHHHHHHHHhhh--cCCCCcEEEEcCChhHHHHHHHhcCCCccceEEecchhhhc---
Q psy11622 108 LRVCIDFSFDHLMTQKDICKCVKQFNWCYSLNRR--AQHPLQFYVSSFKDKCKQEIARYNGYEHWDVHIHEQAYLDL--- 182 (346)
Q Consensus 108 ~~IvIDcsFddlM~~KEi~SL~~QL~~~Ys~NRr--~~~P~~L~lts~~g~l~~~l~k~~g~~nW~v~~~~~~~~el--- 182 (346)
-.|.|+.+--..|++.| .|++.+|..+...+-. ...+++ . ++.....++.++
T Consensus 64 ktI~~~g~~~~~~~pdE-rs~a~~i~kAL~~~~~~~~~~~~~-----------------p-----Gi~V~~~~~e~ll~~ 120 (197)
T COG1901 64 KTIRVEGSELRYLNPDE-RSLAILIKKALDAELGKEQTREVT-----------------P-----GIYVRNGGFEALLAE 120 (197)
T ss_pred EEEEEEcccccccCcch-HHHHHHHHHHHHhhccccceeecC-----------------C-----CEEEecCCHHHHHHH
Confidence 34667777677888875 4677777776655111 001000 0 123323222222
Q ss_pred c-CCCCeEEeCCCCccccc-cccCCceEEEEeEecCCCChhhhHHHHHHcC-CcccccccccccccCCCcccchHHHHHH
Q psy11622 183 F-KKEDLVYLTSDSDNIIE-ELDQSKVYIIGGLVDHNQHKLLTLNKAREEN-IAHGKLPIDTYLNMKTRQVLAVNHVFQI 259 (346)
Q Consensus 183 f-~k~~iVYLSpDS~~~L~-eld~~kvYIIGGIVDrnr~Kglt~~kA~e~G-I~taRLPI~eyi~l~~rkVLtiN~V~EI 259 (346)
+ ....++||.-|+++.-+ ++.++-+||+|-= -|+|.+..+-+- +...+ |.+ |...|-.+||+-|
T Consensus 121 l~~~~~ly~L~E~G~DI~~v~~~~np~FIlGDH------~g~t~e~~k~L~r~~~~~------ISl-GP~~lha~hcit~ 187 (197)
T COG1901 121 LAEGRSLYYLHEDGRDISEVDLIPNPVFILGDH------IGLTEEDEKLLERHAAKK------ISL-GPLSLHADHCITL 187 (197)
T ss_pred HhccCcEEEEccCCccHhhcccCCCceEEeeCC------CCCCHHHHHHHHHhhCce------eEe-CchHHHHHHHHHH
Confidence 2 23579999999988765 5688999999943 344544443210 11111 221 5667889999999
Q ss_pred HHHhhh
Q psy11622 260 LLAISS 265 (346)
Q Consensus 260 Ll~~~e 265 (346)
+-.++.
T Consensus 188 ~h~~LD 193 (197)
T COG1901 188 LHNLLD 193 (197)
T ss_pred HHHHHh
Confidence 887765
No 11
>PF13964 Kelch_6: Kelch motif
Probab=52.02 E-value=6.2 Score=27.66 Aligned_cols=13 Identities=31% Similarity=0.767 Sum_probs=11.2
Q ss_pred CCceEEEEeEecC
Q psy11622 204 QSKVYIIGGLVDH 216 (346)
Q Consensus 204 ~~kvYIIGGIVDr 216 (346)
.+++||+||..+.
T Consensus 11 ~~~iyv~GG~~~~ 23 (50)
T PF13964_consen 11 GGKIYVFGGYDNS 23 (50)
T ss_pred CCEEEEECCCCCC
Confidence 5689999999884
No 12
>PRK13679 hypothetical protein; Provisional
Probab=51.62 E-value=41 Score=29.67 Aligned_cols=77 Identities=16% Similarity=0.280 Sum_probs=48.1
Q ss_pred HHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCC---h----hHHHHHHHhc-CCCccceEEecchhhhccC-CCCeEEeC
Q psy11622 122 QKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFK---D----KCKQEIARYN-GYEHWDVHIHEQAYLDLFK-KEDLVYLT 192 (346)
Q Consensus 122 ~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~---g----~l~~~l~k~~-g~~nW~v~~~~~~~~elf~-k~~iVYLS 192 (346)
++++.....+++..|....+. .|.||+|..++ . .+.+.+.... ++....+.+..- ..|+ ...+|||.
T Consensus 10 p~~~~~~l~~~~~~~~~~~~~-v~pHITL~f~g~~~~~~~~~l~~~l~~~~~~~~pf~l~l~~~---~~F~~~~~vl~l~ 85 (168)
T PRK13679 10 SKKIQDFANSYRKRYDPHYAL-IPPHITLKEPFEISDEQLDSIVEELRAIASETKPFTLHVTKV---SSFAPTNNVIYFK 85 (168)
T ss_pred CHHHHHHHHHHHHhhCccccc-CCCceEEecCCCCCHHHHHHHHHHHHHHHhcCCCEEEEEecc---ccCCCCCCEEEEE
Confidence 567888888888888766444 56699987643 2 2334444432 445445555433 3454 35899999
Q ss_pred CCCccccccc
Q psy11622 193 SDSDNIIEEL 202 (346)
Q Consensus 193 pDS~~~L~el 202 (346)
++....|..|
T Consensus 86 ~~~~~~L~~L 95 (168)
T PRK13679 86 VEKTEELEEL 95 (168)
T ss_pred ccCCHHHHHH
Confidence 9877777654
No 13
>PF13019 Telomere_Sde2: Telomere stability and silencing
Probab=48.22 E-value=35 Score=31.15 Aligned_cols=33 Identities=30% Similarity=0.501 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHhhchhHHHHHHHHHHHHHHHH
Q psy11622 50 AMKRKLKHEKWLKWKPIKRAKEREKLKQKKQYA 82 (346)
Q Consensus 50 q~KKl~k~~kwe~~K~~rr~~~kekkK~kr~~~ 82 (346)
..+-.++.++|.+.+.++.++++|+++++..+.
T Consensus 124 ~v~~~k~l~~~~~~~~er~k~~~e~~~~k~~~l 156 (162)
T PF13019_consen 124 TVNEEKKLAEWLEKKPEREKKEKEKRRKKLEKL 156 (162)
T ss_pred hhHHHHHHHHHHhcChhHHHHHHHHHHHHHHHH
Confidence 345566677899999999888887776555443
No 14
>COG5055 RAD52 Recombination DNA repair protein (RAD52 pathway) [DNA replication, recombination, and repair]
Probab=46.31 E-value=52 Score=33.21 Aligned_cols=115 Identities=17% Similarity=0.235 Sum_probs=63.0
Q ss_pred CCcccCHHHHhHHHHHHHHHHHHhhhcCCCCc-EEEEcCChhHHHHHHHhcCCCccc--eEEecchhhhccCCCCe-EEe
Q psy11622 116 FDHLMTQKDICKCVKQFNWCYSLNRRAQHPLQ-FYVSSFKDKCKQEIARYNGYEHWD--VHIHEQAYLDLFKKEDL-VYL 191 (346)
Q Consensus 116 FddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~-L~lts~~g~l~~~l~k~~g~~nW~--v~~~~~~~~elf~k~~i-VYL 191 (346)
|.+....+=..+|.++|.--|-.+|--..-.. -||-|+ .+-+.-+...||..|. |.-.+-+|.+.|...++ |-+
T Consensus 21 fs~~r~~~lqs~L~rklgpeYis~R~G~g~~~iayIegw--~~I~lANeiFGfnGWss~I~sv~id~~ee~~e~k~svg~ 98 (375)
T COG5055 21 FSVQRIGKLQSKLERKLGPEYISRRSGFGGSSIAYIEGW--KAIELANEIFGFNGWSSEIRSVEIDYCEEFEEKKFSVGA 98 (375)
T ss_pred hhhhHHHHHHHHHHHHhccHhhccccCCCCCceeeechh--HHHHHHHHhhCcCccccceeeeeeeccccccccceeeee
Confidence 33444455556777888888877664321111 122222 1222234456999994 43344567776643332 444
Q ss_pred CCCCcccccc--ccCCceEEEEeEecCCCChhhhHHHHHHcCCccc
Q psy11622 192 TSDSDNIIEE--LDQSKVYIIGGLVDHNQHKLLTLNKAREENIAHG 235 (346)
Q Consensus 192 SpDS~~~L~e--ld~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~ta 235 (346)
|+--.-+|.+ +..+ |==|-|+-.+.|+.||++|++.|+--|
T Consensus 99 saiVRVTLKDGty~Ed---iGyGsien~r~ka~ayek~KKEavtDA 141 (375)
T COG5055 99 SAIVRVTLKDGTYRED---IGYGSIENCRRKAEAYEKAKKEAVTDA 141 (375)
T ss_pred EEEEEEEecCCccccc---cccceeecccccHHHHHHHHhhhhHHH
Confidence 4443444442 1111 112677888899999999999886543
No 15
>smart00612 Kelch Kelch domain.
Probab=44.76 E-value=12 Score=24.71 Aligned_cols=9 Identities=44% Similarity=1.206 Sum_probs=7.4
Q ss_pred ceEEEEeEe
Q psy11622 206 KVYIIGGLV 214 (346)
Q Consensus 206 kvYIIGGIV 214 (346)
++|||||.-
T Consensus 1 ~iyv~GG~~ 9 (47)
T smart00612 1 KIYVVGGFD 9 (47)
T ss_pred CEEEEeCCC
Confidence 589999973
No 16
>TIGR00853 pts-lac PTS system, lactose/cellobiose family IIB component. Bacterial PTS transporters transport and concomitantly phosphorylate their sugar substrates, and typically consist of multiple subunits or protein domains.The Lac family includes several sequenced lactose (b-galactoside) permeases of Gram-positive bacteria as well as those in E. coli. While the Lac family usually consists of two polypeptide components IIA and IICB, the Chb permease of E. coli consists of three IIA, IIB and IIC. This family is specific for the IIB subunit of the Lac PTS family.
Probab=42.69 E-value=31 Score=28.22 Aligned_cols=61 Identities=18% Similarity=0.244 Sum_probs=40.9
Q ss_pred cceEEecchhhh---ccCCCCeEEeCCCCccccccccCCceEEEEeEecCCCChhhhHHHHHHcCCcccccccccccccC
Q psy11622 170 WDVHIHEQAYLD---LFKKEDLVYLTSDSDNIIEELDQSKVYIIGGLVDHNQHKLLTLNKAREENIAHGKLPIDTYLNMK 246 (346)
Q Consensus 170 W~v~~~~~~~~e---lf~k~~iVYLSpDS~~~L~eld~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~taRLPI~eyi~l~ 246 (346)
+.+++..-++.+ ..+.-++|.|+|.-...+.++ .+.+...||..+-+|-..|-.|+
T Consensus 32 i~~~v~a~~~~~~~~~~~~~Dvill~pqi~~~~~~i---------------------~~~~~~~~ipv~~I~~~~Y~~md 90 (95)
T TIGR00853 32 VPVKIAAGSYGAAGEKLDDADVVLLAPQVAYMLPDL---------------------KKETDKKGIPVEVINGAQYGKLT 90 (95)
T ss_pred CcEEEEEecHHHHHhhcCCCCEEEECchHHHHHHHH---------------------HHHhhhcCCCEEEeChhhcccCC
Confidence 345554444433 233456888888766655543 24567789999999999998888
Q ss_pred CCccc
Q psy11622 247 TRQVL 251 (346)
Q Consensus 247 ~rkVL 251 (346)
+.+||
T Consensus 91 g~~vl 95 (95)
T TIGR00853 91 GAGVL 95 (95)
T ss_pred ccccC
Confidence 87775
No 17
>PF08496 Peptidase_S49_N: Peptidase family S49 N-terminal; InterPro: IPR013703 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain is found to the N terminus of bacterial signal peptidases that belong to the MEROPS peptidase family S49 (protease IV family, clan SK) (see also IPR002142 from INTERPRO) [, ]. ; GO: 0004252 serine-type endopeptidase activity, 0005886 plasma membrane
Probab=42.53 E-value=1.3e+02 Score=27.04 Aligned_cols=42 Identities=7% Similarity=0.047 Sum_probs=32.0
Q ss_pred cCCCcccCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCChh
Q psy11622 114 FSFDHLMTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDK 156 (346)
Q Consensus 114 csFddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~ 156 (346)
++|+--|.-.|..+|...|+...+.-+.- .-+=+-+.|-+|-
T Consensus 103 ldF~Gdi~A~~v~~LReeisail~~a~~~-DeV~~rLES~GG~ 144 (155)
T PF08496_consen 103 LDFKGDIKASEVESLREEISAILSVATPE-DEVLVRLESPGGM 144 (155)
T ss_pred EecCCCccHHHHHHHHHHHHHHHHhCCCC-CeEEEEEecCCce
Confidence 45677788999999999999999986655 5556667776653
No 18
>PF07788 DUF1626: Protein of unknown function (DUF1626); InterPro: IPR012431 This is a family consisting of sequences from hypothetical proteins of unknown function expressed by certain species of archaea. One member (Q9YCN7 from SWISSPROT) is thought to be similar to tropomyosin [].
Probab=41.80 E-value=25 Score=27.85 Aligned_cols=27 Identities=15% Similarity=0.238 Sum_probs=22.7
Q ss_pred ccCCceEEEEeEecCCCChhhhHHHHHHcCCc
Q psy11622 202 LDQSKVYIIGGLVDHNQHKLLTLNKAREENIA 233 (346)
Q Consensus 202 ld~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~ 233 (346)
...+.+|||++.||.. ....|+++||.
T Consensus 44 rk~~r~ivVtp~id~~-----a~~~A~~LGIe 70 (70)
T PF07788_consen 44 RKVDRLIVVTPYIDDR-----AKEMAEELGIE 70 (70)
T ss_pred CCcceEEEEEeecCHH-----HHHHHHHhCCC
Confidence 3457899999999977 78899999984
No 19
>PF09805 Nop25: Nucleolar protein 12 (25kDa); InterPro: IPR019186 Nop12 is a novel nucleolar protein required for pre-large subunit rRNA processing and in yeast normal rates of cell growth at low temperatures [].
Probab=40.63 E-value=68 Score=28.14 Aligned_cols=10 Identities=30% Similarity=0.507 Sum_probs=4.6
Q ss_pred CCCHHHHHHH
Q psy11622 45 PLSKRAMKRK 54 (346)
Q Consensus 45 ~lSK~q~KKl 54 (346)
|++|+-+.|.
T Consensus 26 GFhKRK~~Rr 35 (137)
T PF09805_consen 26 GFHKRKKQRR 35 (137)
T ss_pred HHHHHHHHHH
Confidence 4555443333
No 20
>PRK11657 dsbG disulfide isomerase/thiol-disulfide oxidase; Provisional
Probab=39.40 E-value=31 Score=32.89 Aligned_cols=51 Identities=20% Similarity=0.154 Sum_probs=36.2
Q ss_pred CCCeEEeCCCCccccccccCCceEEEEeEecCCCChhhhHHHHHHcC------CcccccccccccccC
Q psy11622 185 KEDLVYLTSDSDNIIEELDQSKVYIIGGLVDHNQHKLLTLNKAREEN------IAHGKLPIDTYLNMK 246 (346)
Q Consensus 185 k~~iVYLSpDS~~~L~eld~~kvYIIGGIVDrnr~Kglt~~kA~e~G------I~taRLPI~eyi~l~ 246 (346)
...++|+|+|+. .+|.|-|+|-+ .+++|-....+.. ...+.||....|...
T Consensus 57 ~~~i~Y~t~dg~----------y~i~G~l~d~~-~~nlT~~~~~~~~~~~~~~~~~~~l~~~~~i~~g 113 (251)
T PRK11657 57 MGVTIYLTPDGK----------HAISGYMYDEK-GENLSEALLEKEVYAPMGREMWQRLEQSHWILDG 113 (251)
T ss_pred CceEEEEcCCCC----------EEEEEEEEcCC-CCccCHHHHHHHhcCCccHHHHHHhhccCCcccc
Confidence 455999999876 47899999986 4688877766532 224577777666653
No 21
>cd03020 DsbA_DsbC_DsbG DsbA family, DsbC and DsbG subfamily; V-shaped homodimeric proteins containing a redox active CXXC motif imbedded in a TRX fold. They function as protein disulfide isomerases and chaperones in the bacterial periplasm to correct non-native disulfide bonds formed by DsbA and prevent aggregation of incorrectly folded proteins. DsbC and DsbG are kept in their reduced state by the cytoplasmic membrane protein DsbD, which utilizes the TRX/TRX reductase system in the cytosol as a source of reducing equivalents. DsbG differ from DsbC in that it has a more limited substrate specificity, and it may preferentially act later in the folding process to catalyze disulfide rearrangements in folded or partially folded proteins. Also included in the alignment is the predicted protein TrbB, whose gene was sequenced from the enterohemorrhagic E. coli type IV pilus gene cluster, which is required for efficient plasmid transfer.
Probab=35.91 E-value=62 Score=29.04 Aligned_cols=51 Identities=25% Similarity=0.323 Sum_probs=34.1
Q ss_pred CCeEEeCCCCccccccccCCceEEEEeEecCCCCh-hhhHHHHH-HcCCcccccccccccccC
Q psy11622 186 EDLVYLTSDSDNIIEELDQSKVYIIGGLVDHNQHK-LLTLNKAR-EENIAHGKLPIDTYLNMK 246 (346)
Q Consensus 186 ~~iVYLSpDS~~~L~eld~~kvYIIGGIVDrnr~K-glt~~kA~-e~GI~taRLPI~eyi~l~ 246 (346)
..++|+|+|+. ..|+|-++|-...+ .+|..... ......++||....+.+.
T Consensus 21 ~~~~y~~~dg~----------~~i~G~l~d~~~~~~~~t~~~~~~~~~~~~~~l~~~~~i~~g 73 (197)
T cd03020 21 GGVLYTDDDGR----------YLIQGNLYDAKGRKDDLTEARLAQLNAIDLSALPLDDAIVYG 73 (197)
T ss_pred CEEEEEcCCCC----------EEEEeEEEEccCCCCChhHHHHHHhhhhhhhhCCcccCeEEc
Confidence 67999999874 57889999965544 45543332 234566788877666653
No 22
>PRK12377 putative replication protein; Provisional
Probab=35.89 E-value=2.3e+02 Score=27.11 Aligned_cols=49 Identities=14% Similarity=-0.011 Sum_probs=26.6
Q ss_pred EEEEcCCCcccCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCChhHHH
Q psy11622 110 VCIDFSFDHLMTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDKCKQ 159 (346)
Q Consensus 110 IvIDcsFddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~l~~ 159 (346)
..-+++|+++....+-...+......|+.+... ..-.|+|+|-.|--+.
T Consensus 67 ~~~~~tFdnf~~~~~~~~~a~~~a~~~a~~~~~-~~~~l~l~G~~GtGKT 115 (248)
T PRK12377 67 LHRKCSFANYQVQNDGQRYALSQAKSIADELMT-GCTNFVFSGKPGTGKN 115 (248)
T ss_pred ccccCCcCCcccCChhHHHHHHHHHHHHHHHHh-cCCeEEEECCCCCCHH
Confidence 447788988875333332333344444444332 2457999997554333
No 23
>PF09554 RE_HaeII: HaeII restriction endonuclease; InterPro: IPR019058 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This family includes Type-2 restriction enzyme NgoBI, and HaeII. They recognise the double-stranded sequence RGCGCY and cleaves after C-5.
Probab=35.12 E-value=41 Score=33.49 Aligned_cols=96 Identities=13% Similarity=0.184 Sum_probs=70.8
Q ss_pred CCCCeEEEEcCCCcccCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCChhHHHHHHHhcCCCccceEEecchhhhccC
Q psy11622 105 PNKLRVCIDFSFDHLMTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDKCKQEIARYNGYEHWDVHIHEQAYLDLFK 184 (346)
Q Consensus 105 ~~~~~IvIDcsFddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~l~~~l~k~~g~~nW~v~~~~~~~~elf~ 184 (346)
.-+..|=||.++..+.--+|-..++++|.-.-+.|-....+.++|=+|+....-.-|+. |.||+..
T Consensus 165 ~l~v~v~i~~~~~n~~Ll~Ef~dft~kil~l~~~~~~~~~~Aki~RVGVTNAADRGLDM---waNfG~a----------- 230 (338)
T PF09554_consen 165 QLNVKVEIDLPENNYDLLKEFEDFTKKILNLNPSNPSLTLNAKIYRVGVTNAADRGLDM---WANFGPA----------- 230 (338)
T ss_pred HhCcEEEEecCchhhHHHHHHHHHHHHHhccCCCCccccccceeEEeccccchhcchhH---HhccChh-----------
Confidence 35678999999999999999999999999888888888899999988875443333432 5566422
Q ss_pred CCCeEEeCCCC---ccccccccCCceEEEEeEec
Q psy11622 185 KEDLVYLTSDS---DNIIEELDQSKVYIIGGLVD 215 (346)
Q Consensus 185 k~~iVYLSpDS---~~~L~eld~~kvYIIGGIVD 215 (346)
=+|=+||-|. ++..+.+..+.++|++-=.+
T Consensus 231 -iQiKhLSL~~~laE~IV~sissdrIvIVCkdae 263 (338)
T PF09554_consen 231 -IQIKHLSLDEELAENIVSSISSDRIVIVCKDAE 263 (338)
T ss_pred -eeeeeecccHHHHHHHHhhcccCeEEEEecchh
Confidence 2355666654 46677888888888775444
No 24
>KOG1324|consensus
Probab=34.94 E-value=26 Score=32.68 Aligned_cols=39 Identities=23% Similarity=0.435 Sum_probs=26.8
Q ss_pred EEeCCCCcccccccc-C------CceEEEEeEecCCCChhhhHHHHHHcCCccc
Q psy11622 189 VYLTSDSDNIIEELD-Q------SKVYIIGGLVDHNQHKLLTLNKAREENIAHG 235 (346)
Q Consensus 189 VYLSpDS~~~L~eld-~------~kvYIIGGIVDrnr~Kglt~~kA~e~GI~ta 235 (346)
||++.-=+..|+-|. | ++|||||| |-=|..|...++..+
T Consensus 88 ~~~~~slesAl~lL~~pp~~~~ve~vfvIGG--------~~vy~~al~~p~~~~ 133 (190)
T KOG1324|consen 88 VFLSSSLESALDLLEEPPSSNSVEMVFVIGG--------SEVYSEALNSPRCDA 133 (190)
T ss_pred EEEeccHHHHHHhhcCCccccceeEEEEEcC--------HHHHHHHHcCcCcce
Confidence 677666565665432 2 68999999 477888887775543
No 25
>PF07646 Kelch_2: Kelch motif; InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=33.80 E-value=20 Score=25.04 Aligned_cols=10 Identities=40% Similarity=1.089 Sum_probs=9.2
Q ss_pred CCceEEEEeE
Q psy11622 204 QSKVYIIGGL 213 (346)
Q Consensus 204 ~~kvYIIGGI 213 (346)
.+++||+||.
T Consensus 11 ~~kiyv~GG~ 20 (49)
T PF07646_consen 11 DGKIYVFGGY 20 (49)
T ss_pred CCEEEEECCc
Confidence 6899999999
No 26
>COG5493 Uncharacterized conserved protein containing a coiled-coil domain [Function unknown]
Probab=32.03 E-value=46 Score=31.64 Aligned_cols=33 Identities=15% Similarity=0.054 Sum_probs=26.5
Q ss_pred ccCCceEEEEeEecCCCChhhhHHHHHHcCCccc
Q psy11622 202 LDQSKVYIIGGLVDHNQHKLLTLNKAREENIAHG 235 (346)
Q Consensus 202 ld~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~ta 235 (346)
+.++++|||||.+|- ++...-..-|+.+||..+
T Consensus 194 vki~~vivitpFihd-r~p~~~kAmAe~mGIeii 226 (231)
T COG5493 194 VKINKVIVITPFIHD-RYPDRVKAMAERMGIEII 226 (231)
T ss_pred CccceEEEEcccccc-cChHHHHHHHHHcCceec
Confidence 678999999999974 445667778899999864
No 27
>PF04122 CW_binding_2: Putative cell wall binding repeat 2; InterPro: IPR007253 This repeat is found in multiple tandem copies in proteins including amidase enhancers [] and adhesins [].
Probab=31.29 E-value=12 Score=29.78 Aligned_cols=16 Identities=50% Similarity=0.883 Sum_probs=12.7
Q ss_pred ccccccCCceEEEEeE
Q psy11622 198 IIEELDQSKVYIIGGL 213 (346)
Q Consensus 198 ~L~eld~~kvYIIGGI 213 (346)
.|..+...+||||||-
T Consensus 67 ~l~~~~~~~v~iiGg~ 82 (92)
T PF04122_consen 67 FLKSLNIKKVYIIGGE 82 (92)
T ss_pred HHHHcCCCEEEEECCC
Confidence 4556678899999985
No 28
>KOG0068|consensus
Probab=31.15 E-value=50 Score=33.97 Aligned_cols=117 Identities=17% Similarity=0.258 Sum_probs=70.6
Q ss_pred eEEEEcCCCcccCH-----HHHhHHHHHHHHHHHHhhhcCCCCcEE-----------EEcC---ChhHHHHHHHhc----
Q psy11622 109 RVCIDFSFDHLMTQ-----KDICKCVKQFNWCYSLNRRAQHPLQFY-----------VSSF---KDKCKQEIARYN---- 165 (346)
Q Consensus 109 ~IvIDcsFddlM~~-----KEi~SL~~QL~~~Ys~NRr~~~P~~L~-----------lts~---~g~l~~~l~k~~---- 165 (346)
-+|+.-=+.+-.+- -.+-||++||.++|...|.-.|-..-+ +-|+ +.++..++....
T Consensus 94 i~Vvn~P~~Ns~saAEltigli~SLaR~i~~A~~s~k~g~wnr~~~~G~el~GKTLgvlG~GrIGseVA~r~k~~gm~vI 173 (406)
T KOG0068|consen 94 ILVVNTPTANSRSAAELTIGLILSLARQIGQASASMKEGKWNRVKYLGWELRGKTLGVLGLGRIGSEVAVRAKAMGMHVI 173 (406)
T ss_pred eEEEeCCCCChHHHHHHHHHHHHHHhhhcchhheeeecCceeecceeeeEEeccEEEEeecccchHHHHHHHHhcCceEE
Confidence 35555555544444 446899999999999988877665433 4444 244555655422
Q ss_pred CCCcc---------ceEEecchhhhccCCCCeE----EeCCCCccccccc----cCCceEEE----EeEecCCCChhhhH
Q psy11622 166 GYEHW---------DVHIHEQAYLDLFKKEDLV----YLTSDSDNIIEEL----DQSKVYII----GGLVDHNQHKLLTL 224 (346)
Q Consensus 166 g~~nW---------~v~~~~~~~~elf~k~~iV----YLSpDS~~~L~el----d~~kvYII----GGIVDrnr~Kglt~ 224 (346)
+|+.. +|++. ++.++.++.+.| =|||+.++.|.+- -..-|||| ||+||.. .+
T Consensus 174 ~~dpi~~~~~~~a~gvq~v--sl~Eil~~ADFitlH~PLtP~T~~lin~~tfA~mKkGVriIN~aRGGvVDe~-----AL 246 (406)
T KOG0068|consen 174 GYDPITPMALAEAFGVQLV--SLEEILPKADFITLHVPLTPSTEKLLNDETFAKMKKGVRIINVARGGVVDEP-----AL 246 (406)
T ss_pred eecCCCchHHHHhccceee--eHHHHHhhcCEEEEccCCCcchhhccCHHHHHHhhCCcEEEEecCCceechH-----HH
Confidence 23211 11221 234555666654 4689999888842 24459999 9999963 45
Q ss_pred HHHHHcCC
Q psy11622 225 NKAREENI 232 (346)
Q Consensus 225 ~kA~e~GI 232 (346)
=+|.+.|+
T Consensus 247 v~Al~sG~ 254 (406)
T KOG0068|consen 247 VRALDSGQ 254 (406)
T ss_pred HHHHhcCc
Confidence 56666664
No 29
>PF02112 PDEase_II: cAMP phosphodiesterases class-II; InterPro: IPR000396 Cyclic-AMP phosphodiesterase (3.1.4.17 from EC) (PDE) catalyses the hydrolysis of cAMP to the corresponding nucleoside 5' monophosphate. On the basis of sequence similarity, most PDEs can be grouped together [], but some enzymes lie apart from the main family and represent a second distinct class [] that includes PDEs from Dictyostelium and yeast. This entry contains class-II cyclic-AMP phosphodiesterases.; GO: 0004115 3',5'-cyclic-AMP phosphodiesterase activity, 0006198 cAMP catabolic process
Probab=28.99 E-value=70 Score=32.26 Aligned_cols=47 Identities=15% Similarity=0.199 Sum_probs=36.8
Q ss_pred eEEEEcCCCcc---------cCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCCh
Q psy11622 109 RVCIDFSFDHL---------MTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKD 155 (346)
Q Consensus 109 ~IvIDcsFddl---------M~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g 155 (346)
.|+|.|||.+. |+|+-+...-.+|...|+.......-+++.|+.+.+
T Consensus 245 aI~IEcS~~~~~~d~~LyGHLtP~~Li~EL~~L~~~~~~~~~~L~gL~VIItHIK~ 300 (335)
T PF02112_consen 245 AIFIECSYPNSQPDSQLYGHLTPKHLIEELKVLASKVGQTSPPLKGLNVIITHIKP 300 (335)
T ss_pred EEEEEeCCCCCCCchHhhccCCHHHHHHHHHHHHhccccccCCCCCCeEEEEEeCC
Confidence 59999999974 567777777777777777666677888999999754
No 30
>PRK02135 hypothetical protein; Provisional
Probab=28.79 E-value=1.4e+02 Score=28.14 Aligned_cols=83 Identities=19% Similarity=0.397 Sum_probs=53.8
Q ss_pred ceEEecchhhhcc----CCCCeEEeCCCCccccc-cccCCceEEEEeEecCCCChhhhHHHHHHcCCccccccccccccc
Q psy11622 171 DVHIHEQAYLDLF----KKEDLVYLTSDSDNIIE-ELDQSKVYIIGGLVDHNQHKLLTLNKAREENIAHGKLPIDTYLNM 245 (346)
Q Consensus 171 ~v~~~~~~~~elf----~k~~iVYLSpDS~~~L~-eld~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~taRLPI~eyi~l 245 (346)
+|.+...+|.++. ...+++||.-|+...-+ ++.++-+||+| ||....--..+.=.+.|. .|+ .+
T Consensus 108 Gi~V~~~~fe~ll~~~~e~~~l~~L~e~G~~i~~~~~~~~~~FvLg---DH~~~~~ee~~~L~~~ga--~~i------Sl 176 (201)
T PRK02135 108 GIYVRRRGFEDLLEELAEGKTLYYLHEDGEDIRDVEFPENPVFVLG---DHIGFTEEEENLLKRLGA--EKI------SL 176 (201)
T ss_pred CEEEecCCHHHHHHHHhcCCcEEEEeCCCCchhhccCCCCCEEEEe---CCCCCCHHHHHHHHHhCC--eEE------Ee
Confidence 4555555555443 23589999999998875 57778899999 765532222222233322 222 22
Q ss_pred CCCcccchHHHHHHHHHhhh
Q psy11622 246 KTRQVLAVNHVFQILLAISS 265 (346)
Q Consensus 246 ~~rkVLtiN~V~EILl~~~e 265 (346)
|.+.|--+|++-|+..++.
T Consensus 177 -GP~~l~AshcI~~vhn~LD 195 (201)
T PRK02135 177 -GPKMLHADHCITLIHNELD 195 (201)
T ss_pred -CcHHHHHHHHHHHHHHHHh
Confidence 6678888999999988887
No 31
>PLN02850 aspartate-tRNA ligase
Probab=28.69 E-value=71 Score=34.11 Aligned_cols=23 Identities=22% Similarity=0.287 Sum_probs=17.9
Q ss_pred CCCCHHHHHHHHHHHHHHhhchh
Q psy11622 44 PPLSKRAMKRKLKHEKWLKWKPI 66 (346)
Q Consensus 44 ~~lSK~q~KKl~k~~kwe~~K~~ 66 (346)
..+||+++||+.+++++++.+++
T Consensus 11 ~~~~~~~~~k~~~~~~~~~~~~~ 33 (530)
T PLN02850 11 EKISKKAAKKAAAKAEKLRREAT 33 (530)
T ss_pred CCcCHHHHHHHHHHHHHHHHHHH
Confidence 34999999999998887765433
No 32
>PRK04171 ribosome biogenesis protein; Provisional
Probab=28.55 E-value=67 Score=30.70 Aligned_cols=84 Identities=17% Similarity=0.232 Sum_probs=48.7
Q ss_pred CccceEEecchhhhccCC---CCeEEeCCCCccc-cccc---cCCceEEEEeEecCCCChhhhHHHHHHcCCcccccccc
Q psy11622 168 EHWDVHIHEQAYLDLFKK---EDLVYLTSDSDNI-IEEL---DQSKVYIIGGLVDHNQHKLLTLNKAREENIAHGKLPID 240 (346)
Q Consensus 168 ~nW~v~~~~~~~~elf~k---~~iVYLSpDS~~~-L~el---d~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~taRLPI~ 240 (346)
.+..+.+...++.++++. ..+|.||..++-+ +.++ +.+-++||||.- |..-..-....| +
T Consensus 126 ~~~Llkv~k~~l~~~l~~~~~~~~i~lS~~g~~~~~~~~~~~~~~~~~vIGaf~-hG~f~~~~~~~~------------~ 192 (222)
T PRK04171 126 GEPLLEIENKSLEELLEEIGPDRIILLSEKGELVKPKELGKENENIAVGIGGFP-HGDFSEKVLELA------------K 192 (222)
T ss_pred CcchheeecCcHHHhccccCCCcEEEECCCCcccCHHHHhhccCCcEEEEcccc-CCCcchhhHhhc------------C
Confidence 344566667777888776 6899999887733 2333 567899999983 221111111111 1
Q ss_pred cccccCCCcccchHHHHHHHHHhhh
Q psy11622 241 TYLNMKTRQVLAVNHVFQILLAISS 265 (346)
Q Consensus 241 eyi~l~~rkVLtiN~V~EILl~~~e 265 (346)
+++.+ ++..|+.-+|+.-++.-++
T Consensus 193 ~~iSI-s~~pLsa~~v~~ri~~a~E 216 (222)
T PRK04171 193 KKYSI-YGEPLTAWTVVCRVIAAYE 216 (222)
T ss_pred eEEEE-eCCChHHHHHHHHHHHHHH
Confidence 22333 3567777777766665554
No 33
>PF04298 Zn_peptidase_2: Putative neutral zinc metallopeptidase; InterPro: IPR007395 Members of this family of bacterial proteins are described as hypothetical proteins or zinc-dependent proteases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=25.64 E-value=1.8e+02 Score=27.84 Aligned_cols=69 Identities=14% Similarity=0.113 Sum_probs=46.2
Q ss_pred HhHHHHHHHHHHHHhhhcCCCCcEEEEcCChhHHHHHHHhcCCCccceEEecchhhhccCC-CCeEEeCCCC
Q psy11622 125 ICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDKCKQEIARYNGYEHWDVHIHEQAYLDLFKK-EDLVYLTSDS 195 (346)
Q Consensus 125 i~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~l~~~l~k~~g~~nW~v~~~~~~~~elf~k-~~iVYLSpDS 195 (346)
+-++.-|..--...+|-+.-|...-+|| .++.+.+-...|-.+-.|+..+....|.|++ ++.|.||+|.
T Consensus 12 ~l~~~aq~~vk~~~~kYs~v~~~~g~TG--ae~Ar~iL~~~gl~~V~Ve~~~G~LtDHYdP~~k~vrLS~~v 81 (222)
T PF04298_consen 12 LLSLWAQWWVKSTFKKYSQVRSSSGMTG--AEVARHILDRNGLSDVRVERVPGELTDHYDPRNKVVRLSEDV 81 (222)
T ss_pred HHHHHHHHHHHHHHHhhCcCCCCCCCCH--HHHHHHHHHHCCCCCeeEEEeCCCCCCCcCCCCCEEEeCCcc
Confidence 3455566665555666666665444444 5677766666677666777777778888865 7899999983
No 34
>COG1366 SpoIIAA Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) [Signal transduction mechanisms]
Probab=25.55 E-value=2.9e+02 Score=22.64 Aligned_cols=52 Identities=10% Similarity=0.085 Sum_probs=38.4
Q ss_pred CCCeEEEEcCCCcccCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCChhHHHHHHH
Q psy11622 106 NKLRVCIDFSFDHLMTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDKCKQEIAR 163 (346)
Q Consensus 106 ~~~~IvIDcsFddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~l~~~l~k 163 (346)
....|||||+--++|.---+..|...+..+- .. -..+.++|.++.+++-|..
T Consensus 43 ~~~~ivIDls~v~~~dS~gl~~L~~~~~~~~----~~--g~~~~l~~i~p~v~~~~~~ 94 (117)
T COG1366 43 GARGLVIDLSGVDFMDSAGLGVLVALLKSAR----LR--GVELVLVGIQPEVARTLEL 94 (117)
T ss_pred CCcEEEEECCCCceechHHHHHHHHHHHHHH----hc--CCeEEEEeCCHHHHHHHHH
Confidence 3445999999999998776666665544443 22 3679999999999887764
No 35
>PF01740 STAS: STAS domain; InterPro: IPR002645 The STAS (Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function. The establishment of differential gene expression in sporulating Bacillus subtilis involves four protein components one of which is SpoIIAA (P10727 from SWISSPROT). The four components regulate the sporulation sigma factor F. Early in sporulation, SpoIIAA is in the phosphorylated state (SpoIIAA-P), as a result of the activity of the ATP-dependent protein kinase SpoIIAB (P10728 from SWISSPROT). The site at which this protein is a conserved serine. SpoIIAB is an anti-sigma factor that in its free form inhibits F by binding to it. Competition by SpoIIAA (the anti-anti-sigma factor) for binding to SpoIIAB releases Sigma F activity []. The STAS domain is found in the anti-sigma factor antagonist SpoIIAA.; PDB: 3T6O_B 3LKL_B 1H4Z_A 1H4Y_B 1H4X_B 3NY7_A 3OIZ_A 1T6R_A 1VC1_B 1SBO_A ....
Probab=24.63 E-value=1.8e+02 Score=23.35 Aligned_cols=55 Identities=15% Similarity=0.131 Sum_probs=42.1
Q ss_pred CCeEEEEcCCCcccCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCChhHHHHHHHhcCCC
Q psy11622 107 KLRVCIDFSFDHLMTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDKCKQEIARYNGYE 168 (346)
Q Consensus 107 ~~~IvIDcsFddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~l~~~l~k~~g~~ 168 (346)
.-.|||||+--..|.---+..|..-...+.. .-..++|+++.+.+...|... |+.
T Consensus 48 ~~~vIlD~s~v~~iDssgi~~L~~~~~~~~~------~g~~~~l~~~~~~v~~~l~~~-~~~ 102 (117)
T PF01740_consen 48 IKNVILDMSGVSFIDSSGIQALVDIIKELRR------RGVQLVLVGLNPDVRRILERS-GLI 102 (117)
T ss_dssp SSEEEEEETTESEESHHHHHHHHHHHHHHHH------TTCEEEEESHHHHHHHHHHHT-TGH
T ss_pred ceEEEEEEEeCCcCCHHHHHHHHHHHHHHHH------CCCEEEEEECCHHHHHHHHHc-CCC
Confidence 4689999999999988888777655555442 467899999999988877653 443
No 36
>COG1578 Uncharacterized conserved protein [Function unknown]
Probab=24.11 E-value=3.5e+02 Score=26.91 Aligned_cols=44 Identities=30% Similarity=0.479 Sum_probs=37.1
Q ss_pred CeEEeCCCCccc---------cccccCCceEEEEeEecCCCChhhhHHHHHHcCCc
Q psy11622 187 DLVYLTSDSDNI---------IEELDQSKVYIIGGLVDHNQHKLLTLNKAREENIA 233 (346)
Q Consensus 187 ~iVYLSpDS~~~---------L~eld~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~ 233 (346)
+++||+-.+=+. |.++..+-|||++| .-...-.|.+-|++.||.
T Consensus 153 ~VlYl~DNaGEi~FD~vlie~ik~~~~~vv~vVrg---~PIlnDaT~EDak~~~i~ 205 (285)
T COG1578 153 SVLYLTDNAGEIVFDKVLIEVIKELGKKVVVVVRG---GPILNDATMEDAKEAGID 205 (285)
T ss_pred cEEEEecCCccHHHHHHHHHHHHhcCCceEEEEcC---CceechhhHHHHHHcCcc
Confidence 999999888653 56788889999999 666778999999999975
No 37
>PF13854 Kelch_5: Kelch motif
Probab=23.88 E-value=37 Score=23.22 Aligned_cols=11 Identities=36% Similarity=0.872 Sum_probs=9.7
Q ss_pred CCceEEEEeEe
Q psy11622 204 QSKVYIIGGLV 214 (346)
Q Consensus 204 ~~kvYIIGGIV 214 (346)
.+.+||+||..
T Consensus 14 ~~~iyi~GG~~ 24 (42)
T PF13854_consen 14 GNNIYIFGGYS 24 (42)
T ss_pred CCEEEEEcCcc
Confidence 37899999998
No 38
>KOG2967|consensus
Probab=23.87 E-value=1.8e+02 Score=29.38 Aligned_cols=34 Identities=21% Similarity=0.259 Sum_probs=26.0
Q ss_pred CCCCCCCCHHHHHHHHHHHHHHhhchhHHHHHHH
Q psy11622 40 NTAEPPLSKRAMKRKLKHEKWLKWKPIKRAKERE 73 (346)
Q Consensus 40 ~~~~~~lSK~q~KKl~k~~kwe~~K~~rr~~~ke 73 (346)
....++..+.--|+++|+++..+..++.++++++
T Consensus 21 ~~~~~p~~e~~sk~q~k~~~k~~~wee~~~~~~~ 54 (314)
T KOG2967|consen 21 EPGQQPVAEPMSKKQLKRQKKQAEWEELKKKKKE 54 (314)
T ss_pred CCCCCcccchhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 3345577777888888888888888888877777
No 39
>cd05564 PTS_IIB_chitobiose_lichenan PTS_IIB_chitobiose_lichenan: subunit IIB of enzyme II (EII) of the N,N-diacetylchitobiose-specific and lichenan-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In these systems, EII is either a lichenan- or an N,N-diacetylchitobiose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. In the chitobiose system, these subunits are expressed as separate proteins from chbA, chbB, and chbC of the chb operon (formerly the cel (cellulose) operon). In the lichenan system, these subunits are expressed from licA, licB, and licC of the lic operon. The lic operon of Bacillus subtilis is required for the transport and degradation of oligomeric beta-glucosides, which are produced by extracellular enzymes on substrates such as lichenan or barley glucan. The lic operon is transcribed from a gammaA-dependent promoter and is inducible by lichenan, lichenan hydrolysate, and cellobiose. The IIB d
Probab=23.68 E-value=55 Score=26.57 Aligned_cols=48 Identities=25% Similarity=0.360 Sum_probs=33.4
Q ss_pred cCCCCeEEeCCCCccccccccCCceEEEEeEecCCCChhhhHHHHHHcCCcccccccccccccCCCccc
Q psy11622 183 FKKEDLVYLTSDSDNIIEELDQSKVYIIGGLVDHNQHKLLTLNKAREENIAHGKLPIDTYLNMKTRQVL 251 (346)
Q Consensus 183 f~k~~iVYLSpDS~~~L~eld~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~taRLPI~eyi~l~~rkVL 251 (346)
.+.-++|.|+|.-...+.++. +.+...|+..+-+|...|-.|++.++|
T Consensus 44 ~~~~Diil~~Pqv~~~~~~i~---------------------~~~~~~~~pv~~I~~~~Y~~~dg~~il 91 (96)
T cd05564 44 IDDADVVLLGPQVRYMLDEVK---------------------KKAAEYGIPVAVIDMMDYGMMNGEKVL 91 (96)
T ss_pred cCCCCEEEEChhHHHHHHHHH---------------------HHhccCCCcEEEcChHhcccCCHHHHH
Confidence 345679999998777666542 234556888888888888777766554
No 40
>PF13056 DUF3918: Protein of unknown function (DUF3918)
Probab=23.34 E-value=58 Score=23.61 Aligned_cols=16 Identities=31% Similarity=0.322 Sum_probs=13.1
Q ss_pred CCCCHHHHHHHHHHHH
Q psy11622 44 PPLSKRAMKRKLKHEK 59 (346)
Q Consensus 44 ~~lSK~q~KKl~k~~k 59 (346)
.-+|++|+||.+|+-.
T Consensus 25 ~m~n~R~MKKmrkrv~ 40 (43)
T PF13056_consen 25 DMMNKRQMKKMRKRVM 40 (43)
T ss_pred cccchHHHHHHHHHHH
Confidence 4589999999998753
No 41
>KOG0555|consensus
Probab=22.42 E-value=1.4e+02 Score=31.49 Aligned_cols=61 Identities=16% Similarity=0.285 Sum_probs=36.2
Q ss_pred CeEEeCCCCccccccccC--CceEEEEeEecCCCChhhhHHHHHHcCCcccccccccccccCC-CcccchHHHHHHHH
Q psy11622 187 DLVYLTSDSDNIIEELDQ--SKVYIIGGLVDHNQHKLLTLNKAREENIAHGKLPIDTYLNMKT-RQVLAVNHVFQILL 261 (346)
Q Consensus 187 ~iVYLSpDS~~~L~eld~--~kvYIIGGIVDrnr~Kglt~~kA~e~GI~taRLPI~eyi~l~~-rkVLtiN~V~EILl 261 (346)
+=-|||-.|.--|+..-| +.||-|- -||..-+. +| |--+.||--... -..||+++.+++|=
T Consensus 289 EeAyLTQSSQLYLEtclpAlgdvy~I~----------~SyRAEkS---rT-RRHLsEytHVEaE~afltfd~ll~~iE 352 (545)
T KOG0555|consen 289 EEAYLTQSSQLYLETCLPALGDVYCIQ----------QSYRAEKS---RT-RRHLSEYTHVEAECAFLTFDDLLDRIE 352 (545)
T ss_pred chhhccchhHHHHHHhhhhcCceeEec----------Hhhhhhhh---hh-hhhhhhheeeeeecccccHHHHHHHHH
Confidence 337999999988988766 5899874 24432222 12 223556633222 35677777666653
No 42
>PF13418 Kelch_4: Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=21.86 E-value=57 Score=22.39 Aligned_cols=16 Identities=19% Similarity=0.596 Sum_probs=9.5
Q ss_pred ccCCceEEEEeEecCC
Q psy11622 202 LDQSKVYIIGGLVDHN 217 (346)
Q Consensus 202 ld~~kvYIIGGIVDrn 217 (346)
+..+.+||+||.-...
T Consensus 10 ~~~~~i~v~GG~~~~~ 25 (49)
T PF13418_consen 10 IGDNSIYVFGGRDSSG 25 (49)
T ss_dssp E-TTEEEEE--EEE-T
T ss_pred EeCCeEEEECCCCCCC
Confidence 3457899999997754
No 43
>COG1514 LigT 2'-5' RNA ligase [Translation, ribosomal structure and biogenesis]
Probab=20.55 E-value=5.3e+02 Score=23.47 Aligned_cols=120 Identities=8% Similarity=0.124 Sum_probs=76.0
Q ss_pred HHHHhHHHHHHHHHHHHhh--hcCCCCcEEEE--cCC---h----hHHHHHHHhc-CCCccceEEecchhhhccC---CC
Q psy11622 122 QKDICKCVKQFNWCYSLNR--RAQHPLQFYVS--SFK---D----KCKQEIARYN-GYEHWDVHIHEQAYLDLFK---KE 186 (346)
Q Consensus 122 ~KEi~SL~~QL~~~Ys~NR--r~~~P~~L~lt--s~~---g----~l~~~l~k~~-g~~nW~v~~~~~~~~elf~---k~ 186 (346)
+.++.....|+..-|...+ +.-.|-++||| =++ + .+.+.+.+.. ++ .+.+++..-+ .|+ ..
T Consensus 10 p~~i~~~i~~~~~~~~~~~~~k~v~~en~HiTL~flGev~e~~~~~l~~~l~~i~~~~-~f~i~l~g~g---~F~~~~~~ 85 (180)
T COG1514 10 PAEIAERLARIRARLKGARAIKWVEPENLHITLKFLGEVDEDKADELIEALARIAAPE-PFPITLDGAG---SFPNPRRP 85 (180)
T ss_pred CHHHHHHHHHHHHhcCcccccccccccCceEEEEccCCcCchHHHHHHHHHHHhhcCC-ceEEEEeeEc---ccCCCCCC
Confidence 6788888888888888875 33344444444 332 2 2333444432 33 5556655433 566 37
Q ss_pred CeEEeCCCCccccccccCCceEEEEeEecCCCChhhhHHHHHHcCCcccccccccccccCCCcccchHHHHHHHHHhh
Q psy11622 187 DLVYLTSDSDNIIEELDQSKVYIIGGLVDHNQHKLLTLNKAREENIAHGKLPIDTYLNMKTRQVLAVNHVFQILLAIS 264 (346)
Q Consensus 187 ~iVYLSpDS~~~L~eld~~kvYIIGGIVDrnr~Kglt~~kA~e~GI~taRLPI~eyi~l~~rkVLtiN~V~EILl~~~ 264 (346)
.+||+.+...+.|..+. ......+...|++....|..-||.+.-.+. ...+.+++..+.
T Consensus 86 rvi~~~v~~~~~L~~L~-----------------~~l~~~~~~~g~~~~~r~F~PHvTl~r~k~--~~~~~~~~~~~~ 144 (180)
T COG1514 86 RVIWVGVEETEELRALA-----------------EELERALARLGLRPEERPFVPHVTLARVKS--KDKLVEALGEFK 144 (180)
T ss_pred cEEEEcCCCcHHHHHHH-----------------HHHHHHHHhcCCCCCCCCcCCCEEEEeecc--cchhhhhhhhhc
Confidence 89999999877788653 356667788999988888888877643333 555555555443
No 44
>PF06117 DUF957: Enterobacterial protein of unknown function (DUF957); InterPro: IPR009301 This family consists of several hypothetical proteins from Escherichia coli, Salmonella typhi, Shigella flexneri and Proteus vulgaris. The function of this family is unknown.
Probab=20.42 E-value=78 Score=24.82 Aligned_cols=17 Identities=29% Similarity=0.491 Sum_probs=15.6
Q ss_pred cccchHHHHHHHHHhhh
Q psy11622 249 QVLAVNHVFQILLAISS 265 (346)
Q Consensus 249 kVLtiN~V~EILl~~~e 265 (346)
+.||+-++++||+.|++
T Consensus 2 ~~lt~~~~L~iLi~WLe 18 (65)
T PF06117_consen 2 QILTTEEALEILIAWLE 18 (65)
T ss_pred ccchHHHHHHHHHHHHH
Confidence 46899999999999998
No 45
>cd07043 STAS_anti-anti-sigma_factors Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation. Anti-anti-sigma factors play an important role in the regulation of several sigma factors and their corresponding anti-sigma factors. Upon dephosphorylation they bind the anti-sigma factor and induce the release of the sigma factor from the anti-sigma factor. In a feedback mechanism the anti-anti-sigma factor can be inactivated via phosphorylation by the anti-sigma factor. Well studied examples from Bacillus subtilis are SpoIIAA (regulating sigmaF and sigmaC which play an important role in sporulation) and RsbV (regulating sigmaB involved in the general stress response). The STAS domain is also found in the C- terminal region of sulphate transporters and stressosomes.
Probab=20.41 E-value=3.8e+02 Score=20.21 Aligned_cols=52 Identities=6% Similarity=0.068 Sum_probs=37.2
Q ss_pred CCeEEEEcCCCcccCHHHHhHHHHHHHHHHHHhhhcCCCCcEEEEcCChhHHHHHHHh
Q psy11622 107 KLRVCIDFSFDHLMTQKDICKCVKQFNWCYSLNRRAQHPLQFYVSSFKDKCKQEIARY 164 (346)
Q Consensus 107 ~~~IvIDcsFddlM~~KEi~SL~~QL~~~Ys~NRr~~~P~~L~lts~~g~l~~~l~k~ 164 (346)
...|+|||+--..|.---+.-|..-+..+. ....++.|+++++.+.+.|...
T Consensus 38 ~~~viid~~~v~~iDs~g~~~L~~l~~~~~------~~g~~v~i~~~~~~~~~~l~~~ 89 (99)
T cd07043 38 PRRLVLDLSGVTFIDSSGLGVLLGAYKRAR------AAGGRLVLVNVSPAVRRVLELT 89 (99)
T ss_pred CCEEEEECCCCCEEcchhHHHHHHHHHHHH------HcCCeEEEEcCCHHHHHHHHHh
Confidence 468999999999997666555554333222 2357899999999988877753
No 46
>KOG1135|consensus
Probab=20.17 E-value=27 Score=38.52 Aligned_cols=31 Identities=16% Similarity=0.127 Sum_probs=26.8
Q ss_pred CCCeEEEEcCCCcccCHHHHhHHHHHHHHHH
Q psy11622 106 NKLRVCIDFSFDHLMTQKDICKCVKQFNWCY 136 (346)
Q Consensus 106 ~~~~IvIDcsFddlM~~KEi~SL~~QL~~~Y 136 (346)
.+.+|.||||+++.|....+..|-.||-.+=
T Consensus 23 D~~~iLiDcGwd~~f~~~~i~~l~~~i~~iD 53 (764)
T KOG1135|consen 23 DGVRILIDCGWDESFDMSMIKELKPVIPTID 53 (764)
T ss_pred cCeEEEEeCCCcchhccchhhhhhccccccc
Confidence 5789999999999999999999998876543
Done!