Query 010423
Match_columns 511
No_of_seqs 140 out of 284
Neff 5.6
Searched_HMMs 46136
Date Fri Mar 29 00:21:36 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/010423.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/010423hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2636 Splicing factor 3a, su 100.0 3E-157 6E-162 1206.3 37.6 489 3-511 1-497 (497)
2 COG5188 PRP9 Splicing factor 3 100.0 1E-112 3E-117 852.9 29.7 448 4-511 2-470 (470)
3 PF11931 DUF3449: Domain of un 100.0 1.6E-92 3.6E-97 673.5 5.8 182 329-510 1-196 (196)
4 PF13297 Telomere_Sde2_2: Telo 99.7 4.9E-17 1.1E-21 127.1 4.9 60 246-305 1-60 (60)
5 KOG2827 Uncharacterized conser 98.8 2.1E-09 4.5E-14 107.4 4.6 61 245-305 261-321 (322)
6 PF12108 SF3a60_bindingd: Spli 98.8 1.8E-09 3.9E-14 72.7 2.7 23 84-106 6-28 (28)
7 PF12874 zf-met: Zinc-finger o 94.5 0.012 2.5E-07 38.0 0.3 25 416-441 1-25 (25)
8 PF12171 zf-C2H2_jaz: Zinc-fin 90.5 0.12 2.5E-06 34.3 0.8 26 416-442 2-27 (27)
9 PF13894 zf-C2H2_4: C2H2-type 88.4 0.18 3.8E-06 31.3 0.5 21 416-437 1-21 (24)
10 PF00096 zf-C2H2: Zinc finger, 87.8 0.17 3.7E-06 31.8 0.1 21 416-437 1-21 (23)
11 smart00451 ZnF_U1 U1-like zinc 87.4 0.29 6.3E-06 33.8 1.1 31 415-446 3-33 (35)
12 PF06397 Desulfoferrod_N: Desu 84.9 0.21 4.5E-06 36.0 -0.7 13 413-425 4-16 (36)
13 COG4481 Uncharacterized protei 80.6 0.68 1.5E-05 36.3 0.7 32 405-436 24-55 (60)
14 PF12171 zf-C2H2_jaz: Zinc-fin 80.6 0.29 6.4E-06 32.3 -1.2 25 249-279 2-26 (27)
15 PLN02748 tRNA dimethylallyltra 80.5 0.72 1.5E-05 50.8 1.1 36 413-448 416-451 (468)
16 smart00355 ZnF_C2H2 zinc finge 79.8 0.82 1.8E-05 28.4 0.8 21 416-437 1-21 (26)
17 PF09943 DUF2175: Uncharacteri 79.5 0.76 1.6E-05 40.4 0.7 17 414-430 1-17 (101)
18 TIGR00319 desulf_FeS4 desulfof 75.3 0.88 1.9E-05 31.9 -0.1 14 413-426 5-18 (34)
19 cd00974 DSRD Desulforedoxin (D 73.0 1.1 2.4E-05 31.4 -0.1 13 414-426 3-15 (34)
20 PF13912 zf-C2H2_6: C2H2-type 66.6 2 4.3E-05 27.9 0.1 21 415-436 1-21 (27)
21 KOG2636 Splicing factor 3a, su 59.3 4.7 0.0001 43.9 1.4 80 177-277 215-294 (497)
22 cd00729 rubredoxin_SM Rubredox 58.7 3.1 6.6E-05 29.4 -0.1 17 415-432 2-18 (34)
23 PF13909 zf-H2C2_5: C2H2-type 57.2 4.5 9.7E-05 25.6 0.5 20 416-437 1-20 (24)
24 PF12756 zf-C2H2_2: C2H2 type 48.1 7.7 0.00017 32.1 0.7 28 415-443 50-77 (100)
25 PF13913 zf-C2HC_2: zinc-finge 44.8 10 0.00022 24.9 0.7 20 416-437 3-22 (25)
26 cd00350 rubredoxin_like Rubred 41.3 9.3 0.0002 26.6 0.1 14 416-430 2-15 (33)
27 PF13465 zf-H2C2_2: Zinc-finge 40.3 14 0.00029 24.3 0.8 16 408-423 7-22 (26)
28 KOG2608 Endoplasmic reticulum 39.5 3.8 8.2E-05 44.6 -3.1 48 429-476 316-371 (469)
29 PHA02768 hypothetical protein; 38.9 15 0.00033 29.0 1.0 34 415-451 5-38 (55)
30 PF15056 NRN1: Neuritin protei 37.4 29 0.00062 30.0 2.5 20 463-482 55-74 (89)
31 PRK12496 hypothetical protein; 36.6 8 0.00017 36.8 -1.1 27 404-430 125-158 (164)
32 PF14379 Myb_CC_LHEQLE: MYB-CC 34.0 1.1E+02 0.0023 23.9 4.9 13 6-18 12-24 (51)
33 COG4105 ComL DNA uptake lipopr 32.3 61 0.0013 33.2 4.3 41 137-177 86-127 (254)
34 PF10146 zf-C4H2: Zinc finger- 29.8 1.1E+02 0.0025 30.7 5.7 24 5-28 51-74 (230)
35 TIGR00320 dfx_rbo desulfoferro 28.9 18 0.00039 33.0 -0.1 13 413-425 5-17 (125)
36 COG4847 Uncharacterized protei 28.2 22 0.00048 31.1 0.3 17 414-430 5-21 (103)
37 COG5112 UFD2 U1-like Zn-finger 27.4 74 0.0016 28.5 3.4 31 249-286 56-86 (126)
38 KOG3408 U1-like Zn-finger-cont 27.3 27 0.00058 31.9 0.7 27 249-281 58-84 (129)
39 PF04194 PDCD2_C: Programmed c 26.9 12 0.00026 35.4 -1.6 31 396-433 78-110 (164)
40 cd00730 rubredoxin Rubredoxin; 26.7 22 0.00047 27.4 0.0 14 415-429 1-14 (50)
41 PF13319 DUF4090: Protein of u 26.2 29 0.00064 29.2 0.7 16 394-409 12-27 (84)
42 PF07864 DUF1651: Protein of u 25.8 49 0.0011 27.2 2.0 28 449-476 39-66 (75)
43 PF06107 DUF951: Bacterial pro 24.9 31 0.00067 27.5 0.6 29 409-437 25-53 (57)
44 KOG0324 Uncharacterized conser 24.6 27 0.00058 34.8 0.2 21 398-418 125-145 (214)
45 PHA00732 hypothetical protein 24.4 37 0.00079 28.5 1.0 21 415-436 1-21 (79)
46 PF09026 CENP-B_dimeris: Centr 24.3 25 0.00055 30.8 0.0 9 398-406 39-47 (101)
47 PF04502 DUF572: Family of unk 24.1 26 0.00056 36.9 0.0 34 416-449 41-82 (324)
48 PF13824 zf-Mss51: Zinc-finger 23.8 42 0.00092 26.5 1.2 28 411-438 10-37 (55)
49 PF07754 DUF1610: Domain of un 23.5 32 0.0007 22.7 0.4 13 411-423 12-24 (24)
50 PF00301 Rubredoxin: Rubredoxi 23.5 23 0.00049 27.0 -0.4 30 416-459 2-31 (47)
51 PF06160 EzrA: Septation ring 23.4 5.5E+02 0.012 29.1 10.4 89 5-104 342-433 (560)
52 TIGR00270 conserved hypothetic 22.4 30 0.00064 32.7 0.1 11 418-429 3-13 (154)
53 PF06147 DUF968: Protein of un 22.4 53 0.0011 32.3 1.8 19 401-423 117-135 (200)
54 PRK08359 transcription factor; 22.0 31 0.00068 33.3 0.2 12 417-429 8-19 (176)
55 PF02132 RecR: RecR protein; 21.8 26 0.00056 25.6 -0.4 9 417-425 19-27 (41)
56 PHA00733 hypothetical protein 20.5 52 0.0011 30.0 1.3 23 413-436 71-93 (128)
57 COG1439 Predicted nucleic acid 20.5 21 0.00046 34.6 -1.4 34 392-425 125-163 (177)
58 PRK07708 hypothetical protein; 20.3 1.1E+02 0.0023 30.6 3.5 51 452-508 16-66 (219)
No 1
>KOG2636 consensus Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=100.00 E-value=2.6e-157 Score=1206.26 Aligned_cols=489 Identities=54% Similarity=0.890 Sum_probs=450.2
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHhhcCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHcccchhhHHHHHHccCCCCC
Q 010423 3 STLLEVTRAAHEEVERLERLVVKDLQTEPNSNKDRLVQSHRVRNMIDTITDTTERLIEIYADKDNARKDEIAALGGQTAT 82 (511)
Q Consensus 3 ~~~LE~~R~~hEeiErlE~ai~~~~~~~p~~~k~~l~q~h~i~~~ld~~~~~~~~L~~~y~d~dg~r~~Ei~~l~g~~~~ 82 (511)
+++||+||++|||+|||+++||++++++|.+.|++|.+.|+|+.|++++.+.+.+|+++|+|+||+|+.||.+|+|
T Consensus 1 etlLEt~R~lhEE~ERl~~~ive~~~~~p~~~k~ri~~~hrv~~~~~~~~~ss~~l~~~yedkdg~r~~e~~~l~g---- 76 (497)
T KOG2636|consen 1 ETLLETQRRLHEEMERLENAIVEREQANPPGKKDRINSEHRVRSFLERYRSSSIKLRKLYEDKDGLRKREIAALSG---- 76 (497)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhCCCchHHHHhHHhhHHHHHHHHHHHHHHHHHHHhhccchhHHHHHHhcC----
Confidence 4699999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred CCchHHHHHHHHHHHHHhhhhCCCCccccCchhhHHh----hhcc----CCCCcccccccCccccchHHHHHHHhcCCCC
Q 010423 83 GTNVFSSFYDRLKEIREYHRRHPSARVAVDASEDYEN----LLKE----EPLVEFSGEEAYGRYLDLHELYNQYINSKFG 154 (511)
Q Consensus 83 ~~~~f~~Fy~~l~~Ike~h~~~p~~~~~~~~~~~~~~----~~~~----~~~~~Fs~eE~yGryLDL~~~y~~ylNl~~~ 154 (511)
+|+|.+||++|++|++||+++|++ ++++....+.. ..++ .+.+.|||+|+||||||||.+|.+||||+.+
T Consensus 77 -~n~f~EfY~rLk~I~~~hk~~p~e-~~~p~~v~~~~~~e~~~~~~~~~~~l~~Fs~ee~yGrfldL~d~y~kyinl~~~ 154 (497)
T KOG2636|consen 77 -PNDFAEFYKRLKEINEFHKKHPDE-KDEPKSVRFLELYEARLSPEDENEVLVEFSGEEGYGRFLDLHDCYRKYINLKNV 154 (497)
T ss_pred -chhHHHHHHHHHHHhHHHhcCccc-cccchhHHHHHHHHhhcCccccchhhHhhcccccccccccHHHHHHHHhhhhhh
Confidence 799999999999999999999986 33555554433 3333 2557899999999999999999999999999
Q ss_pred CccchhHHhhhhcCCCCCccccccchhHHHHHHHHHHHHHHHHHhccCCCchHHHHHHHHHHHHHHHhhCCCCCCcccCc
Q 010423 155 KEIEYSAYLDVFSRPHEIPRKLKMTRQYREYIEKLLEYLIYFFQRTEPLQDLDRIFSKVVADFEEQWVTSTLQGWETEGQ 234 (511)
Q Consensus 155 ~~i~Yl~YL~~f~~f~~ip~~~k~~~~Y~~Yl~~L~~YL~~F~~R~~PL~d~~~~~~~~~~~Fe~~w~~g~~~gW~~~~~ 234 (511)
.+++|++||.+|++|.+||+ .+++..|..||+.|.+||.+|++|++||.|++++++++..+|+.+|.+|.+|||....+
T Consensus 155 ~r~~Y~~yL~~fd~~~~ip~-~~k~~~Y~~Yi~~L~eYL~~F~~r~~Pl~d~~~ll~~~~~~f~~~~~aG~lpg~~~~et 233 (497)
T KOG2636|consen 155 ERVDYLEYLKNFDQLDDIPK-EKKNREYLNYIEELNEYLVSFIDRTEPLLDLDKLLAKVPKEFERAWAAGTLPGWKYKET 233 (497)
T ss_pred hhhhHHHHHHHHhhhcccch-hhhhHHHHHHHHHHHHHHHHHHHhcccchhhHhHhcchhhHHHHHHHhCCCCCcccccc
Confidence 99999999999999999999 67799999999999999999999999999999999999999999999999999994322
Q ss_pred CCCCCCCccCccCccccchHHHHHhhhhhhhhHHHHhccccCCCchHHHHHHhhhhcCCCccchhhhhhhccCCCCCCCC
Q 010423 235 ENGHVPAQHSELDLDYYSTVEELMEVGSERLKEELAAKGLKSGGTLQQRAERLFLTKHTPLDKLDKKHFAKGARGKEQNG 314 (511)
Q Consensus 235 ~~~~~~~~~~~~d~~~~~s~eklf~~g~~~lke~l~~~gLk~gg~lk~ra~rlf~~k~~~~e~~~~~~~ak~~~~~~~~~ 314 (511)
..+ ...+++..+++++|+++||+||++++.+.|++||||+++||+|+|++++.+.+.+++++++++.+.+
T Consensus 234 ~~~------~~~dl~~~~s~Eel~~~g~erlk~al~alglk~gGt~~~ra~rlf~Tk~~~l~~L~~~~~~kn~s~~---- 303 (497)
T KOG2636|consen 234 FSA------KALDLSGASSVEELYCLGCERLKSALTALGLKCGGTLHERAQRLFSTKSKSLSHLDTKLFAKNPSKK---- 303 (497)
T ss_pred ccc------cccccchhhHHHHHHhhchhHHHHHHHHHHHhcCCeecHHHHhhhhhcCcchhhhhhhhhccCcccc----
Confidence 111 2368899999999999999999999999999999999999999999999999999999999877654
Q ss_pred CCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHhhhhhcCCCCchhhhhccCCCCCC
Q 010423 315 VAPATQEVGNLKDIALMEAKMKKLCDLLSETIERTIQNVQKKQALTYEEMEAEREEQEETQVDTESDDEEQQIYNPLKLP 394 (511)
Q Consensus 315 ~~~~~~~~~~~k~ia~~E~~i~~l~~~L~~~~~~T~~~veRk~a~T~~Ere~E~e~~~~~~~~~e~~d~e~~~yNplnLP 394 (511)
+......+.++||+.|++|.+++.+|+++|.+|++||.|||++|+.|++.|.+++. +..+++++|+++.||||+|||
T Consensus 304 --~~~~~~~~~keia~tEa~v~k~~~iL~eeR~~t~env~rKq~~ta~e~E~E~~eq~-~~~~e~~~de~~~~ynp~~lP 380 (497)
T KOG2636|consen 304 --GHRREKERNKEIARTEALVKKLLAILAEERKATRENVVRKQARTAEEREEEEEEQS-DSDEESDDDEEELIYNPKNLP 380 (497)
T ss_pred --hhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhhhhh-ccccccccchhhccCCcccCC
Confidence 12334566899999999999999999999999999999999999999977765443 334444556677899999999
Q ss_pred CCCCCCchhHHHHHHhcCCCcccceeecCCcccchhhhhhhcchhhhhhcccccCCCCCcCccccccHHHHHHHHHHHHH
Q 010423 395 MGWDGKPIPYWLYKLHGLGQEFKCEICGNYSYWGRRAFERHFKEWRHQHGMRCLGIPNTKNFNEITSIEEAKELWKKIQE 474 (511)
Q Consensus 395 LGwDGkPIPyWLYKLhGL~~ey~CEICGN~~Y~GRkaFekHF~E~RH~~GmrcLGIpnt~~F~~IT~I~dA~~Lw~klk~ 474 (511)
|||||||||||||||||||++|+||||||+||||||||+|||+||||+|||||||||||+||++||+|+||+.||+|||.
T Consensus 381 LGwDGkPiPyWLyKLHGL~~ey~CEICGNy~Y~GrkaF~RHF~EwRH~hGmrCLGIpnt~~F~~IT~I~eA~~LW~k~k~ 460 (497)
T KOG2636|consen 381 LGWDGKPIPYWLYKLHGLDIEYNCEICGNYVYKGRKAFDRHFNEWRHAHGMRCLGIPNTSVFKGITKIEEALELWKKMKE 460 (497)
T ss_pred CCCCCCcCchHHHhhcCCCcccceeeccCccccCcHHHHHHhHHHHHhhcceecCCCCcHHhcccccHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhcCCCCCCCCCceeeccCCCccchhhhHHHhhccCC
Q 010423 475 RQGGIKWRPELEEEYEDKEGNIYNKKTYTDLQRQGLI 511 (511)
Q Consensus 475 ~~~~~~~~~~~~eE~ED~~GNVmskK~YeDLkrQGLl 511 (511)
++....|.++.++||||++|||||+|||+||||||||
T Consensus 461 q~~~~kw~~~~eeE~ED~eGNV~~kKtYeDLKrQGLl 497 (497)
T KOG2636|consen 461 QSQSEKWPPDLEEEYEDEEGNVMNKKTYEDLKRQGLL 497 (497)
T ss_pred hhhhccCCchhHhhhhccccCcccHHhHHHHHHccCC
Confidence 9999999999999999999999999999999999997
No 2
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=100.00 E-value=1.3e-112 Score=852.93 Aligned_cols=448 Identities=27% Similarity=0.437 Sum_probs=395.8
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHhhcCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHcccchhhHHHHHHccCCCCCC
Q 010423 4 TLLEVTRAAHEEVERLERLVVKDLQTEPNSNKDRLVQSHRVRNMIDTITDTTERLIEIYADKDNARKDEIAALGGQTATG 83 (511)
Q Consensus 4 ~~LE~~R~~hEeiErlE~ai~~~~~~~p~~~k~~l~q~h~i~~~ld~~~~~~~~L~~~y~d~dg~r~~Ei~~l~g~~~~~ 83 (511)
++||+.|++|||+|+||+|||+|+++||+-.|+++...|.|+.|.......++.++--.+-.+|++.+++..|... .
T Consensus 2 nlLET~R~~~EEmE~ienAIaeR~~~NPK~Pr~~lrle~qi~~f~n~~R~~~q~~lv~hE~~~~lkDq~~~rinr~---~ 78 (470)
T COG5188 2 NLLETRRSLLEEMEIIENAIAERIQRNPKLPRDELRLERQIRIFENMERISNQIWLVEHERPTGLKDQMMKRINRS---I 78 (470)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHhHHHHHHHHHHHhhhhhhhcccccchhHHHHHHHHHHH---h
Confidence 4999999999999999999999999999999999999999999999999999999999999999999999998621 1
Q ss_pred CchHHHHHHHHHHHHHhhhhCCCCccccCchhhHHhhhc----cCCCC--cccccccCccccchHHHHHHHhcCCCCCcc
Q 010423 84 TNVFSSFYDRLKEIREYHRRHPSARVAVDASEDYENLLK----EEPLV--EFSGEEAYGRYLDLHELYNQYINSKFGKEI 157 (511)
Q Consensus 84 ~~~f~~Fy~~l~~Ike~h~~~p~~~~~~~~~~~~~~~~~----~~~~~--~Fs~eE~yGryLDL~~~y~~ylNl~~~~~i 157 (511)
++...+||..|.+|..+|+.+|+..| ......+..... .+..+ .|+|+|+||+|+||++||..|+|+....+|
T Consensus 79 d~dl~~fykkLg~l~~e~K~~~e~~v-k~l~~l~~~~ss~p~~~dlD~~~~F~g~e~YG~~meLe~~~~~y~nv~~~~~~ 157 (470)
T COG5188 79 DRDLYGFYKKLGALNVEGKLDGEIEV-KGLRDLGYYESSAPKARDLDVEAAFKGSELYGDGMELERIFRKYANVHLCSDC 157 (470)
T ss_pred hhhhhHHHHHHHHHHHHhccCccccc-cchhhhhccccCCCCcccccHHHHhcchHhhcchhhHHHHHHHHhhHHhhccc
Confidence 45699999999999999999997555 343332211111 11233 699999999999999999999999999999
Q ss_pred chhHHhhhhcCCCCCccccccchhHHHHHHHHHHHHHHHHHhccCCCchHHHHHHHHHHHHHHHhhCCCCCCcccCcCCC
Q 010423 158 EYSAYLDVFSRPHEIPRKLKMTRQYREYIEKLLEYLIYFFQRTEPLQDLDRIFSKVVADFEEQWVTSTLQGWETEGQENG 237 (511)
Q Consensus 158 ~Yl~YL~~f~~f~~ip~~~k~~~~Y~~Yl~~L~~YL~~F~~R~~PL~d~~~~~~~~~~~Fe~~w~~g~~~gW~~~~~~~~ 237 (511)
+|++||..+..|.-+|+.. +|..|..||..|.+||.+||.+++||.+.+++.+.+.++|+.+|+.| ++||.....
T Consensus 158 sylefLk~le~fd~~~~p~-Kn~rY~~yl~~L~eYl~~F~~~~ypL~~~~kv~a~~~~~f~~a~~rG-~~~~~~~~g--- 232 (470)
T COG5188 158 SYLEFLKKLERFDLTTEPS-KNFRYLEYLSELNEYLGRFIKVKYPLKMFRKVVASAPKIFSRAEARG-FGKKNGMEG--- 232 (470)
T ss_pred hHHHHHHHHHHhhccCCcc-cchhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHhchhHhHHHHHcc-CCcccccch---
Confidence 9999999999998675444 47899999999999999999999999999999999999999999998 888873211
Q ss_pred CCCCccCccCccccchHHHHHhhhhhhhhHHHHhccccCCCchHHHHHHhhhhcCCCccchhhhhhhccCCCCCCCCCCC
Q 010423 238 HVPAQHSELDLDYYSTVEELMEVGSERLKEELAAKGLKSGGTLQQRAERLFLTKHTPLDKLDKKHFAKGARGKEQNGVAP 317 (511)
Q Consensus 238 ~~~~~~~~~d~~~~~s~eklf~~g~~~lke~l~~~gLk~gg~lk~ra~rlf~~k~~~~e~~~~~~~ak~~~~~~~~~~~~ 317 (511)
. .....+||..|.++|+ +.+|+..||+++.|.++-.
T Consensus 233 -~----~~~~~~YC~~C~r~f~------~~~VFe~Hl~gK~H~k~~~--------------------------------- 268 (470)
T COG5188 233 -A----EWFPKVYCVKCGREFS------RSKVFEYHLEGKRHCKEGQ--------------------------------- 268 (470)
T ss_pred -h----hhccceeeHhhhhHhh------hhHHHHHHHhhhhhhhhhh---------------------------------
Confidence 1 1223489999999999 9999999999999888431
Q ss_pred cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHhhh---------------hhcCCCCch
Q 010423 318 ATQEVGNLKDIALMEAKMKKLCDLLSETIERTIQNVQKKQALTYEEMEAEREEQEE---------------TQVDTESDD 382 (511)
Q Consensus 318 ~~~~~~~~k~ia~~E~~i~~l~~~L~~~~~~T~~~veRk~a~T~~Ere~E~e~~~~---------------~~~~~e~~d 382 (511)
+...++..||.|++|+.+|.+++.+|+++|+|++|+|+.||.+|++.+.. +++| .+.+
T Consensus 269 ------~~~~~v~~Ey~l~r~~kyl~d~~s~trs~V~r~la~ta~ER~aei~~l~r~~~~~at~S~e~EGaeq~d-~eQ~ 341 (470)
T COG5188 269 ------GKEEFVYSEYVLHRYLKYLGDPVSETRSLVLRSLAITAKERKAEISLLSRRKKQPATKSSEKEGAEQVD-GEQR 341 (470)
T ss_pred ------hhhHHHHHHHHHHHHHHHhCChhHHHHHHHHHHHHHHHHHHHHHhHHHHHHhhccCCCchhhccccccc-cccc
Confidence 13459999999999999999999999999999999999999999975432 1122 2345
Q ss_pred hhhhccCCCCCCCCCCCCchhHHHHHHhcCCCcccceeecCCcccchhhhhhhcchhhhhhcccccCCCCCcCccccccH
Q 010423 383 EEQQIYNPLKLPMGWDGKPIPYWLYKLHGLGQEFKCEICGNYSYWGRRAFERHFKEWRHQHGMRCLGIPNTKNFNEITSI 462 (511)
Q Consensus 383 ~e~~~yNplnLPLGwDGkPIPyWLYKLhGL~~ey~CEICGN~~Y~GRkaFekHF~E~RH~~GmrcLGIpnt~~F~~IT~I 462 (511)
|++.+|||++|||||||+|||||||||||||++|+||||||+||+||++|+|||+|-||+|||+||||.+++.|++||+|
T Consensus 342 DE~~~~k~fdmPLG~DG~PmP~WL~klhgLd~ef~CEICgNyvy~GR~~FdrHF~E~rHiygl~clGi~ps~vfkgIT~I 421 (470)
T COG5188 342 DEHVSGKSFDMPLGPDGLPMPRWLCKLHGLDIEFECEICGNYVYYGRDRFDRHFEEDRHIYGLECLGIKPSRVFKGITRI 421 (470)
T ss_pred chhhccCcccCCCCCCCCCCchHHHHhcCCCcceeeeecccccccchHHHHhhhhhhhhhhheeeccccchHHHhhhhhH
Confidence 67889999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHhhcCCCCCCCCCceeeccCCCccchhhhHHHhhccCC
Q 010423 463 EEAKELWKKIQERQGGIKWRPELEEEYEDKEGNIYNKKTYTDLQRQGLI 511 (511)
Q Consensus 463 ~dA~~Lw~klk~~~~~~~~~~~~~eE~ED~~GNVmskK~YeDLkrQGLl 511 (511)
.+|++||++++.++++-....+..+|+||.+|||||+|||+||||||||
T Consensus 422 ~ea~~lw~~m~~~ss~~kv~~e~~~E~EDeEGNVmskkvY~dLK~qgLi 470 (470)
T COG5188 422 GEAMKLWNRMEESSSSLKVPTEYSEEFEDEEGNVMSKKVYEDLKRQGLI 470 (470)
T ss_pred HHHHHHHHHhhhhhhhcccchhhhhhhhccccccchHHHHHHHHHccCC
Confidence 9999999999999877666677899999999999999999999999997
No 3
>PF11931 DUF3449: Domain of unknown function (DUF3449); InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=100.00 E-value=1.6e-92 Score=673.55 Aligned_cols=182 Identities=62% Similarity=1.098 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHhhh--------------hhcCCCCchhhhhccCCCCCC
Q 010423 329 ALMEAKMKKLCDLLSETIERTIQNVQKKQALTYEEMEAEREEQEE--------------TQVDTESDDEEQQIYNPLKLP 394 (511)
Q Consensus 329 a~~E~~i~~l~~~L~~~~~~T~~~veRk~a~T~~Ere~E~e~~~~--------------~~~~~e~~d~e~~~yNplnLP 394 (511)
|+.|++|++|+++|++++++|++|||||||+|++||++|...... ...+++++|+++++|||+|||
T Consensus 1 ~~~E~~i~~~~~~L~~~~~~T~~~verk~a~T~~E~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~np~~lP 80 (196)
T PF11931_consen 1 ARREYKIHKLCELLSEEREDTKENVERKQARTEEERQAEEEYEEEIYSEDEYEEEEEEEESEEDSDDDEEEKIYNPLNLP 80 (196)
T ss_dssp -HHHHHHHHHHHHTHHHHHHHHHHHHHHHT--HHHHHHHHHHTS-SS-TT--SS--B-----------------------
T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccHHHHHHHHhhhhhhhccccccccccccccccccccccccccCCcccCC
Confidence 678999999999999999999999999999999999997532110 112223456677899999999
Q ss_pred CCCCCCchhHHHHHHhcCCCcccceeecCCcccchhhhhhhcchhhhhhcccccCCCCCcCccccccHHHHHHHHHHHHH
Q 010423 395 MGWDGKPIPYWLYKLHGLGQEFKCEICGNYSYWGRRAFERHFKEWRHQHGMRCLGIPNTKNFNEITSIEEAKELWKKIQE 474 (511)
Q Consensus 395 LGwDGkPIPyWLYKLhGL~~ey~CEICGN~~Y~GRkaFekHF~E~RH~~GmrcLGIpnt~~F~~IT~I~dA~~Lw~klk~ 474 (511)
|||||||||||||||||||++|+||||||+||||||||+|||+||||+|||||||||||+||++||+|+||++||++|++
T Consensus 81 LG~DGkPIPyWLYKLhGL~~ey~CEICGN~~Y~GrkaFekHF~E~rH~~GlrcLGI~nt~~F~~IT~I~dA~~Lw~kl~~ 160 (196)
T PF11931_consen 81 LGWDGKPIPYWLYKLHGLGVEYKCEICGNQSYKGRKAFEKHFQEWRHAYGLRCLGIPNTKHFKGITKIEDALELWEKLKK 160 (196)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred CCCCCCcccHHHHHHhCCCCeeeeEeCCCcceecHHHHHHhcChhHHHccChhcCCCCcHHHcCcCcHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhcCCCCCCCCCceeeccCCCccchhhhHHHhhccC
Q 010423 475 RQGGIKWRPELEEEYEDKEGNIYNKKTYTDLQRQGL 510 (511)
Q Consensus 475 ~~~~~~~~~~~~eE~ED~~GNVmskK~YeDLkrQGL 510 (511)
+++...|.++++|||||++|||||+|||+|||||||
T Consensus 161 ~~~~~~~~~~~~eE~ED~eGNVm~~k~Y~dLkkQGL 196 (196)
T PF11931_consen 161 QKKRKRFEPDNEEEVEDSEGNVMSKKTYEDLKKQGL 196 (196)
T ss_dssp ------------------------------------
T ss_pred HhhhccCCCccceEeecCCCCCcCHHHHHHHHHccC
Confidence 999999999999999999999999999999999998
No 4
>PF13297 Telomere_Sde2_2: Telomere stability C-terminal
Probab=99.67 E-value=4.9e-17 Score=127.13 Aligned_cols=60 Identities=60% Similarity=0.873 Sum_probs=57.2
Q ss_pred cCccccchHHHHHhhhhhhhhHHHHhccccCCCchHHHHHHhhhhcCCCccchhhhhhhc
Q 010423 246 LDLDYYSTVEELMEVGSERLKEELAAKGLKSGGTLQQRAERLFLTKHTPLDKLDKKHFAK 305 (511)
Q Consensus 246 ~d~~~~~s~eklf~~g~~~lke~l~~~gLk~gg~lk~ra~rlf~~k~~~~e~~~~~~~ak 305 (511)
+|+..|+|+++++++|+||||++|+++|||||||+++||+|||++||++++++|+++|||
T Consensus 1 ldL~~f~sa~eLe~lGldrLK~~L~a~GLKcGGTl~ERA~RLfs~kg~~~~~~d~~l~AK 60 (60)
T PF13297_consen 1 LDLDAFSSAEELEALGLDRLKSALMALGLKCGGTLQERAARLFSVKGLPLEEIDKKLFAK 60 (60)
T ss_pred CcchhcCCHHHHHHhCHHHHHHHHHHcCCccCCCHHHHHHHHHHhcCCChhhCCHHHhcC
Confidence 366789999999999999999999999999999999999999999999999999999885
No 5
>KOG2827 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.85 E-value=2.1e-09 Score=107.37 Aligned_cols=61 Identities=54% Similarity=0.837 Sum_probs=58.2
Q ss_pred ccCccccchHHHHHhhhhhhhhHHHHhccccCCCchHHHHHHhhhhcCCCccchhhhhhhc
Q 010423 245 ELDLDYYSTVEELMEVGSERLKEELAAKGLKSGGTLQQRAERLFLTKHTPLDKLDKKHFAK 305 (511)
Q Consensus 245 ~~d~~~~~s~eklf~~g~~~lke~l~~~gLk~gg~lk~ra~rlf~~k~~~~e~~~~~~~ak 305 (511)
+++++.|.|...++-|||+|||.+|..+|||||||+.+||+|||++|++|++++|++++++
T Consensus 261 p~~~ddf~s~~d~e~lg~e~lk~~l~~rglkcgg~l~eraarl~~~k~~~~~~~pk~~l~~ 321 (322)
T KOG2827|consen 261 PLNFDDFNSPADMEVLGMERLKTELQSRGLKCGGTLRERAARLFLLKSTPLDKLPKKLLAK 321 (322)
T ss_pred CccccccCCHHHHHHhhHHHHHHHHHhcCCcccccHHHHHhhhhhhcCCChhhhhHhhccC
Confidence 6778889999999999999999999999999999999999999999999999999998875
No 6
>PF12108 SF3a60_bindingd: Splicing factor SF3a60 binding domain; InterPro: IPR021966 This domain is found in eukaryotes. This domain is about 30 amino acids in length. This domain has a single completely conserved residue Y that may be functionally important. SF3a60 makes up the SF3a complex with SF3a66 and SF3a120. This domain is the binding site of SF3a60 for SF3a120. The SF3a complex is part of the spliceosome, a protein complex involved in splicing mRNA after transcription. ; PDB: 2DT7_A.
Probab=98.84 E-value=1.8e-09 Score=72.73 Aligned_cols=23 Identities=65% Similarity=1.180 Sum_probs=18.6
Q ss_pred CchHHHHHHHHHHHHHhhhhCCC
Q 010423 84 TNVFSSFYDRLKEIREYHRRHPS 106 (511)
Q Consensus 84 ~~~f~~Fy~~l~~Ike~h~~~p~ 106 (511)
+|+|++||+||++|||||+||||
T Consensus 6 ~d~f~eFY~rlk~Ike~Hrr~Pn 28 (28)
T PF12108_consen 6 GDPFSEFYERLKEIKEYHRRYPN 28 (28)
T ss_dssp --HHHHHHHHHHHHHHHHHS--S
T ss_pred CChHHHHHHHHHHHHHHHHhCCC
Confidence 79999999999999999999996
No 7
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.54 E-value=0.012 Score=38.02 Aligned_cols=25 Identities=32% Similarity=0.833 Sum_probs=23.5
Q ss_pred ccceeecCCcccchhhhhhhcchhhh
Q 010423 416 FKCEICGNYSYWGRRAFERHFKEWRH 441 (511)
Q Consensus 416 y~CEICGN~~Y~GRkaFekHF~E~RH 441 (511)
|.|.|| |.++.++.+|+.|++.-+|
T Consensus 1 ~~C~~C-~~~f~s~~~~~~H~~s~~H 25 (25)
T PF12874_consen 1 FYCDIC-NKSFSSENSLRQHLRSKKH 25 (25)
T ss_dssp EEETTT-TEEESSHHHHHHHHTTHHH
T ss_pred CCCCCC-CCCcCCHHHHHHHHCcCCC
Confidence 789999 6999999999999999888
No 8
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=90.47 E-value=0.12 Score=34.27 Aligned_cols=26 Identities=23% Similarity=0.654 Sum_probs=24.1
Q ss_pred ccceeecCCcccchhhhhhhcchhhhh
Q 010423 416 FKCEICGNYSYWGRRAFERHFKEWRHQ 442 (511)
Q Consensus 416 y~CEICGN~~Y~GRkaFekHF~E~RH~ 442 (511)
|.|++|+ ..+....+|+.|...-+|.
T Consensus 2 ~~C~~C~-k~f~~~~~~~~H~~sk~Hk 27 (27)
T PF12171_consen 2 FYCDACD-KYFSSENQLKQHMKSKKHK 27 (27)
T ss_dssp CBBTTTT-BBBSSHHHHHCCTTSHHHH
T ss_pred CCcccCC-CCcCCHHHHHHHHccCCCC
Confidence 8899999 9999999999999998883
No 9
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=88.39 E-value=0.18 Score=31.29 Aligned_cols=21 Identities=33% Similarity=0.988 Sum_probs=17.2
Q ss_pred ccceeecCCcccchhhhhhhcc
Q 010423 416 FKCEICGNYSYWGRRAFERHFK 437 (511)
Q Consensus 416 y~CEICGN~~Y~GRkaFekHF~ 437 (511)
|.|++|| .+|..+.++.+|..
T Consensus 1 ~~C~~C~-~~~~~~~~l~~H~~ 21 (24)
T PF13894_consen 1 FQCPICG-KSFRSKSELRQHMR 21 (24)
T ss_dssp EE-SSTS--EESSHHHHHHHHH
T ss_pred CCCcCCC-CcCCcHHHHHHHHH
Confidence 7899998 89999999999974
No 10
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=87.76 E-value=0.17 Score=31.81 Aligned_cols=21 Identities=38% Similarity=1.016 Sum_probs=18.6
Q ss_pred ccceeecCCcccchhhhhhhcc
Q 010423 416 FKCEICGNYSYWGRRAFERHFK 437 (511)
Q Consensus 416 y~CEICGN~~Y~GRkaFekHF~ 437 (511)
|+|++|| .+|.-+..+.+|-.
T Consensus 1 y~C~~C~-~~f~~~~~l~~H~~ 21 (23)
T PF00096_consen 1 YKCPICG-KSFSSKSNLKRHMR 21 (23)
T ss_dssp EEETTTT-EEESSHHHHHHHHH
T ss_pred CCCCCCC-CccCCHHHHHHHHh
Confidence 7999999 88999999999854
No 11
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=87.36 E-value=0.29 Score=33.78 Aligned_cols=31 Identities=23% Similarity=0.626 Sum_probs=26.5
Q ss_pred cccceeecCCcccchhhhhhhcchhhhhhccc
Q 010423 415 EFKCEICGNYSYWGRRAFERHFKEWRHQHGMR 446 (511)
Q Consensus 415 ey~CEICGN~~Y~GRkaFekHF~E~RH~~Gmr 446 (511)
.|.|++|+ .++.+..++..|.+.++|...++
T Consensus 3 ~~~C~~C~-~~~~~~~~~~~H~~gk~H~~~~~ 33 (35)
T smart00451 3 GFYCKLCN-VTFTDEISVEAHLKGKKHKKNVK 33 (35)
T ss_pred CeEccccC-CccCCHHHHHHHHChHHHHHHHH
Confidence 48899998 56779999999999999987654
No 12
>PF06397 Desulfoferrod_N: Desulfoferrodoxin, N-terminal domain; InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=84.89 E-value=0.21 Score=35.99 Aligned_cols=13 Identities=54% Similarity=1.173 Sum_probs=7.9
Q ss_pred CCcccceeecCCc
Q 010423 413 GQEFKCEICGNYS 425 (511)
Q Consensus 413 ~~ey~CEICGN~~ 425 (511)
..-|+|++|||.+
T Consensus 4 ~~~YkC~~CGniV 16 (36)
T PF06397_consen 4 GEFYKCEHCGNIV 16 (36)
T ss_dssp TEEEE-TTT--EE
T ss_pred ccEEEccCCCCEE
Confidence 4569999999975
No 13
>COG4481 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=80.64 E-value=0.68 Score=36.31 Aligned_cols=32 Identities=34% Similarity=0.631 Sum_probs=26.7
Q ss_pred HHHHHhcCCCcccceeecCCcccchhhhhhhc
Q 010423 405 WLYKLHGLGQEFKCEICGNYSYWGRRAFERHF 436 (511)
Q Consensus 405 WLYKLhGL~~ey~CEICGN~~Y~GRkaFekHF 436 (511)
|=-=--|-++.-+|+=||-.+-.+|..|||-.
T Consensus 24 wkIiRvGaDIkikC~nC~h~vm~pR~~Ferkl 55 (60)
T COG4481 24 WKIIRVGADIKIKCENCGHSVMMPRYDFERKL 55 (60)
T ss_pred EEEEEecCcEEEEecCCCcEEEecHHHHHHHH
Confidence 33334588999999999999999999999854
No 14
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=80.56 E-value=0.29 Score=32.30 Aligned_cols=25 Identities=16% Similarity=0.226 Sum_probs=23.2
Q ss_pred cccchHHHHHhhhhhhhhHHHHhccccCCCc
Q 010423 249 DYYSTVEELMEVGSERLKEELAAKGLKSGGT 279 (511)
Q Consensus 249 ~~~~s~eklf~~g~~~lke~l~~~gLk~gg~ 279 (511)
.+|..|++.|. ++..+..|+++++|
T Consensus 2 ~~C~~C~k~f~------~~~~~~~H~~sk~H 26 (27)
T PF12171_consen 2 FYCDACDKYFS------SENQLKQHMKSKKH 26 (27)
T ss_dssp CBBTTTTBBBS------SHHHHHCCTTSHHH
T ss_pred CCcccCCCCcC------CHHHHHHHHccCCC
Confidence 58999999999 99999999998766
No 15
>PLN02748 tRNA dimethylallyltransferase
Probab=80.55 E-value=0.72 Score=50.83 Aligned_cols=36 Identities=28% Similarity=0.563 Sum_probs=32.6
Q ss_pred CCcccceeecCCcccchhhhhhhcchhhhhhccccc
Q 010423 413 GQEFKCEICGNYSYWGRRAFERHFKEWRHQHGMRCL 448 (511)
Q Consensus 413 ~~ey~CEICGN~~Y~GRkaFekHF~E~RH~~GmrcL 448 (511)
-+.|.|||||+.+-.|....+-|++.-||-..++-+
T Consensus 416 ~~~~~Ce~C~~~~~~G~~eW~~Hlksr~Hk~~~~~~ 451 (468)
T PLN02748 416 WTQYVCEACGNKVLRGAHEWEQHKQGRGHRKRVQRL 451 (468)
T ss_pred cccccccCCCCcccCCHHHHHHHhcchHHHHHHhHH
Confidence 578999999999999999999999999999887743
No 16
>smart00355 ZnF_C2H2 zinc finger.
Probab=79.83 E-value=0.82 Score=28.39 Aligned_cols=21 Identities=24% Similarity=0.824 Sum_probs=19.2
Q ss_pred ccceeecCCcccchhhhhhhcc
Q 010423 416 FKCEICGNYSYWGRRAFERHFK 437 (511)
Q Consensus 416 y~CEICGN~~Y~GRkaFekHF~ 437 (511)
|.|..|| .++.++..+.+|..
T Consensus 1 ~~C~~C~-~~f~~~~~l~~H~~ 21 (26)
T smart00355 1 YRCPECG-KVFKSKSALKEHMR 21 (26)
T ss_pred CCCCCCc-chhCCHHHHHHHHH
Confidence 7899999 88899999999976
No 17
>PF09943 DUF2175: Uncharacterized protein conserved in archaea (DUF2175); InterPro: IPR018686 This family of various hypothetical archaeal proteins has no known function.
Probab=79.47 E-value=0.76 Score=40.44 Aligned_cols=17 Identities=41% Similarity=0.896 Sum_probs=15.0
Q ss_pred CcccceeecCCcccchh
Q 010423 414 QEFKCEICGNYSYWGRR 430 (511)
Q Consensus 414 ~ey~CEICGN~~Y~GRk 430 (511)
.+++|=||||.+|||-+
T Consensus 1 ~kWkC~iCg~~I~~gql 17 (101)
T PF09943_consen 1 KKWKCYICGKPIYEGQL 17 (101)
T ss_pred CceEEEecCCeeeecce
Confidence 36899999999999965
No 18
>TIGR00319 desulf_FeS4 desulfoferrodoxin FeS4 iron-binding domain. Neelaredoxin, a monomeric blue non-heme iron protein, lacks this domain.
Probab=75.27 E-value=0.88 Score=31.86 Aligned_cols=14 Identities=57% Similarity=1.222 Sum_probs=11.6
Q ss_pred CCcccceeecCCcc
Q 010423 413 GQEFKCEICGNYSY 426 (511)
Q Consensus 413 ~~ey~CEICGN~~Y 426 (511)
..-|+|++|||.+-
T Consensus 5 ~~~ykC~~Cgniv~ 18 (34)
T TIGR00319 5 GQVYKCEVCGNIVE 18 (34)
T ss_pred CcEEEcCCCCcEEE
Confidence 45799999999873
No 19
>cd00974 DSRD Desulforedoxin (DSRD) domain; a small non-heme iron domain present in the desulforedoxin (rubredoxin oxidoreductase) and desulfoferrodoxin proteins of some archaeal and bacterial methanogens and sulfate/sulfur reducers. Desulforedoxin is a small, single-domain homodimeric protein; each subunit contains an iron atom bound to four cysteinyl sulfur atoms, Fe(S-Cys)4, in a distorted tetrahedral coordination. Its metal center is similar to that found in rubredoxin type proteins. Desulforedoxin is regarded as a potential redox partner for rubredoxin. Desulfoferrodoxin forms a homodimeric protein, with each protomer comprised of two domains, the N-terminal DSRD domain and C-terminal superoxide reductase-like (SORL) domain. Each domain has a distinct iron center: the DSRD iron center I, Fe(S-Cys)4; and the SORL iron center II, Fe[His4Cys(Glu)].
Probab=72.96 E-value=1.1 Score=31.42 Aligned_cols=13 Identities=54% Similarity=1.145 Sum_probs=10.9
Q ss_pred CcccceeecCCcc
Q 010423 414 QEFKCEICGNYSY 426 (511)
Q Consensus 414 ~ey~CEICGN~~Y 426 (511)
.-|+|++|||.+=
T Consensus 3 ~~ykC~~CGniv~ 15 (34)
T cd00974 3 EVYKCEICGNIVE 15 (34)
T ss_pred cEEEcCCCCcEEE
Confidence 4699999999874
No 20
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=66.62 E-value=2 Score=27.94 Aligned_cols=21 Identities=29% Similarity=0.705 Sum_probs=18.7
Q ss_pred cccceeecCCcccchhhhhhhc
Q 010423 415 EFKCEICGNYSYWGRRAFERHF 436 (511)
Q Consensus 415 ey~CEICGN~~Y~GRkaFekHF 436 (511)
.|.|.+|| .+|....+|.+|=
T Consensus 1 ~~~C~~C~-~~F~~~~~l~~H~ 21 (27)
T PF13912_consen 1 PFECDECG-KTFSSLSALREHK 21 (27)
T ss_dssp SEEETTTT-EEESSHHHHHHHH
T ss_pred CCCCCccC-CccCChhHHHHHh
Confidence 48999999 7899999999985
No 21
>KOG2636 consensus Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=59.30 E-value=4.7 Score=43.90 Aligned_cols=80 Identities=11% Similarity=0.038 Sum_probs=56.6
Q ss_pred ccchhHHHHHHHHHHHHHHHHHhccCCCchHHHHHHHHHHHHHHHhhCCCCCCcccCcCCCCCCCccCccCccccchHHH
Q 010423 177 KMTRQYREYIEKLLEYLIYFFQRTEPLQDLDRIFSKVVADFEEQWVTSTLQGWETEGQENGHVPAQHSELDLDYYSTVEE 256 (511)
Q Consensus 177 k~~~~Y~~Yl~~L~~YL~~F~~R~~PL~d~~~~~~~~~~~Fe~~w~~g~~~gW~~~~~~~~~~~~~~~~~d~~~~~s~ek 256 (511)
..+.+|...+=-...|-+.|...+.-|.+.+.+..-+...|+....++.+-|-.. .+.+|.-|++
T Consensus 215 ~f~~~~~aG~lpg~~~~et~~~~~~dl~~~~s~Eel~~~g~erlk~al~alglk~---------------gGt~~~ra~r 279 (497)
T KOG2636|consen 215 EFERAWAAGTLPGWKYKETFSAKALDLSGASSVEELYCLGCERLKSALTALGLKC---------------GGTLHERAQR 279 (497)
T ss_pred HHHHHHHhCCCCCccccccccccccccchhhHHHHHHhhchhHHHHHHHHHHHhc---------------CCeecHHHHh
Confidence 3457788887778888888999987777777777777777877777764443221 1278999999
Q ss_pred HHhhhhhhhhHHHHhccccCC
Q 010423 257 LMEVGSERLKEELAAKGLKSG 277 (511)
Q Consensus 257 lf~~g~~~lke~l~~~gLk~g 277 (511)
||+ ..+..-.||..+
T Consensus 280 lf~------Tk~~~l~~L~~~ 294 (497)
T KOG2636|consen 280 LFS------TKSKSLSHLDTK 294 (497)
T ss_pred hhh------hcCcchhhhhhh
Confidence 999 666665555543
No 22
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=58.69 E-value=3.1 Score=29.41 Aligned_cols=17 Identities=35% Similarity=0.739 Sum_probs=12.2
Q ss_pred cccceeecCCcccchhhh
Q 010423 415 EFKCEICGNYSYWGRRAF 432 (511)
Q Consensus 415 ey~CEICGN~~Y~GRkaF 432 (511)
.|.|.+|| ++|.|..+-
T Consensus 2 ~~~C~~CG-~i~~g~~~p 18 (34)
T cd00729 2 VWVCPVCG-YIHEGEEAP 18 (34)
T ss_pred eEECCCCC-CEeECCcCC
Confidence 47888888 777776543
No 23
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=57.22 E-value=4.5 Score=25.64 Aligned_cols=20 Identities=40% Similarity=0.918 Sum_probs=15.5
Q ss_pred ccceeecCCcccchhhhhhhcc
Q 010423 416 FKCEICGNYSYWGRRAFERHFK 437 (511)
Q Consensus 416 y~CEICGN~~Y~GRkaFekHF~ 437 (511)
|+|..|. ++-. +..+.+|..
T Consensus 1 y~C~~C~-y~t~-~~~l~~H~~ 20 (24)
T PF13909_consen 1 YKCPHCS-YSTS-KSNLKRHLK 20 (24)
T ss_dssp EE-SSSS--EES-HHHHHHHHH
T ss_pred CCCCCCC-CcCC-HHHHHHHHH
Confidence 7999999 7778 999999954
No 24
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=48.07 E-value=7.7 Score=32.11 Aligned_cols=28 Identities=25% Similarity=0.741 Sum_probs=24.5
Q ss_pred cccceeecCCcccchhhhhhhcchhhhhh
Q 010423 415 EFKCEICGNYSYWGRRAFERHFKEWRHQH 443 (511)
Q Consensus 415 ey~CEICGN~~Y~GRkaFekHF~E~RH~~ 443 (511)
.|.|-+||-. +..+.++..|...-.|..
T Consensus 50 ~~~C~~C~~~-f~s~~~l~~Hm~~~~H~~ 77 (100)
T PF12756_consen 50 SFRCPYCNKT-FRSREALQEHMRSKHHKK 77 (100)
T ss_dssp SEEBSSSS-E-ESSHHHHHHHHHHTTTTC
T ss_pred CCCCCccCCC-CcCHHHHHHHHcCccCCC
Confidence 8999999955 999999999999887765
No 25
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=44.76 E-value=10 Score=24.90 Aligned_cols=20 Identities=35% Similarity=0.801 Sum_probs=16.1
Q ss_pred ccceeecCCcccchhhhhhhcc
Q 010423 416 FKCEICGNYSYWGRRAFERHFK 437 (511)
Q Consensus 416 y~CEICGN~~Y~GRkaFekHF~ 437 (511)
.+|.+||.. | +..++++|..
T Consensus 3 ~~C~~CgR~-F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRK-F-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCE-E-CHHHHHHHHH
Confidence 479999954 4 8999999964
No 26
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=41.26 E-value=9.3 Score=26.60 Aligned_cols=14 Identities=43% Similarity=1.227 Sum_probs=11.6
Q ss_pred ccceeecCCcccchh
Q 010423 416 FKCEICGNYSYWGRR 430 (511)
Q Consensus 416 y~CEICGN~~Y~GRk 430 (511)
|.|-+|| ++|.|.+
T Consensus 2 ~~C~~CG-y~y~~~~ 15 (33)
T cd00350 2 YVCPVCG-YIYDGEE 15 (33)
T ss_pred EECCCCC-CEECCCc
Confidence 7899999 7888775
No 27
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=40.27 E-value=14 Score=24.28 Aligned_cols=16 Identities=31% Similarity=0.825 Sum_probs=12.7
Q ss_pred HHhcCCCcccceeecC
Q 010423 408 KLHGLGQEFKCEICGN 423 (511)
Q Consensus 408 KLhGL~~ey~CEICGN 423 (511)
+.|-=.+.|+|.+||-
T Consensus 7 ~~H~~~k~~~C~~C~k 22 (26)
T PF13465_consen 7 RTHTGEKPYKCPYCGK 22 (26)
T ss_dssp HHHSSSSSEEESSSSE
T ss_pred hhcCCCCCCCCCCCcC
Confidence 3566678999999984
No 28
>KOG2608 consensus Endoplasmic reticulum membrane-associated oxidoreductin involved in disulfide bond formation [Posttranslational modification, protein turnover, chaperones; Intracellular trafficking, secretion, and vesicular transport]
Probab=39.48 E-value=3.8 Score=44.56 Aligned_cols=48 Identities=31% Similarity=0.466 Sum_probs=35.5
Q ss_pred hhhhhhhcchhhhhhcc---cccCCCCCcCccccccHHHHHH-----HHHHHHHhh
Q 010423 429 RRAFERHFKEWRHQHGM---RCLGIPNTKNFNEITSIEEAKE-----LWKKIQERQ 476 (511)
Q Consensus 429 RkaFekHF~E~RH~~Gm---rcLGIpnt~~F~~IT~I~dA~~-----Lw~klk~~~ 476 (511)
.++|.+||.|..-=.|= +.|.=.--+||++|+.|=|.+. ||-|||-+.
T Consensus 316 i~~~p~hFdE~~~f~gd~~a~~lKe~fr~hFrnISrIMDCVgCdKCRLWGKlQt~G 371 (469)
T KOG2608|consen 316 IKAFPKHFDEAELFAGDSEAPALKEEFRKHFRNISRIMDCVGCDKCRLWGKLQTQG 371 (469)
T ss_pred HhhCccccchHhhhcccccchhHHHHHHHHHHHHHHHHhhcCcchhhhhhhhhhhh
Confidence 46699999996555554 2222224589999999999985 999999874
No 29
>PHA02768 hypothetical protein; Provisional
Probab=38.93 E-value=15 Score=28.98 Aligned_cols=34 Identities=24% Similarity=0.642 Sum_probs=26.1
Q ss_pred cccceeecCCcccchhhhhhhcchhhhhhcccccCCC
Q 010423 415 EFKCEICGNYSYWGRRAFERHFKEWRHQHGMRCLGIP 451 (511)
Q Consensus 415 ey~CEICGN~~Y~GRkaFekHF~E~RH~~GmrcLGIp 451 (511)
-|.|++|| ..|-=+.++.+|=.- |.-+-+|.+-.
T Consensus 5 ~y~C~~CG-K~Fs~~~~L~~H~r~--H~k~~kc~~C~ 38 (55)
T PHA02768 5 GYECPICG-EIYIKRKSMITHLRK--HNTNLKLSNCK 38 (55)
T ss_pred ccCcchhC-CeeccHHHHHHHHHh--cCCcccCCccc
Confidence 48999999 567777888888654 77777886654
No 30
>PF15056 NRN1: Neuritin protein family
Probab=37.42 E-value=29 Score=29.95 Aligned_cols=20 Identities=20% Similarity=0.685 Sum_probs=17.8
Q ss_pred HHHHHHHHHHHHhhcCCCCC
Q 010423 463 EEAKELWKKIQERQGGIKWR 482 (511)
Q Consensus 463 ~dA~~Lw~klk~~~~~~~~~ 482 (511)
++|-++|++|+.++++-+|.
T Consensus 55 eeAa~iWEsLrqESrk~~f~ 74 (89)
T PF15056_consen 55 EEAAAIWESLRQESRKMQFQ 74 (89)
T ss_pred HHHHHHHHHHHHHHHcCCCC
Confidence 78999999999999987765
No 31
>PRK12496 hypothetical protein; Provisional
Probab=36.59 E-value=8 Score=36.78 Aligned_cols=27 Identities=26% Similarity=0.518 Sum_probs=22.5
Q ss_pred HHHHHHhcCCCccc-------ceeecCCcccchh
Q 010423 404 YWLYKLHGLGQEFK-------CEICGNYSYWGRR 430 (511)
Q Consensus 404 yWLYKLhGL~~ey~-------CEICGN~~Y~GRk 430 (511)
.|-|.=.|=+.+|+ |+|||+..-+-+.
T Consensus 125 ~w~~~C~gC~~~~~~~~~~~~C~~CG~~~~r~~~ 158 (164)
T PRK12496 125 KWRKVCKGCKKKYPEDYPDDVCEICGSPVKRKMV 158 (164)
T ss_pred eeeEECCCCCccccCCCCCCcCCCCCChhhhcch
Confidence 49999999999994 9999998755443
No 32
>PF14379 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE motif
Probab=34.02 E-value=1.1e+02 Score=23.94 Aligned_cols=13 Identities=46% Similarity=0.703 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHHH
Q 010423 6 LEVTRAAHEEVER 18 (511)
Q Consensus 6 LE~~R~~hEeiEr 18 (511)
+|-||.+||.+|+
T Consensus 12 mEvQrrLhEQLEv 24 (51)
T PF14379_consen 12 MEVQRRLHEQLEV 24 (51)
T ss_pred HHHHHHHHHHHHH
Confidence 7999999999993
No 33
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=32.27 E-value=61 Score=33.19 Aligned_cols=41 Identities=17% Similarity=0.172 Sum_probs=24.6
Q ss_pred cccchHHHHHHHhcCCC-CCccchhHHhhhhcCCCCCccccc
Q 010423 137 RYLDLHELYNQYINSKF-GKEIEYSAYLDVFSRPHEIPRKLK 177 (511)
Q Consensus 137 ryLDL~~~y~~ylNl~~-~~~i~Yl~YL~~f~~f~~ip~~~k 177 (511)
.|-+=-..-++|+.+-. ...++|+.||..+..|..||...+
T Consensus 86 ~y~~A~~~~drFi~lyP~~~n~dY~~YlkgLs~~~~i~~~~r 127 (254)
T COG4105 86 EYDLALAYIDRFIRLYPTHPNADYAYYLKGLSYFFQIDDVTR 127 (254)
T ss_pred cHHHHHHHHHHHHHhCCCCCChhHHHHHHHHHHhccCCcccc
Confidence 33334444556666533 356788888777777776665544
No 34
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=29.79 E-value=1.1e+02 Score=30.74 Aligned_cols=24 Identities=17% Similarity=0.336 Sum_probs=20.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhh
Q 010423 5 LLEVTRAAHEEVERLERLVVKDLQ 28 (511)
Q Consensus 5 ~LE~~R~~hEeiErlE~ai~~~~~ 28 (511)
.+|..|.+|+||..||..|.+.-.
T Consensus 51 h~eeLrqI~~DIn~lE~iIkqa~~ 74 (230)
T PF10146_consen 51 HVEELRQINQDINTLENIIKQAES 74 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 578999999999999999987433
No 35
>TIGR00320 dfx_rbo desulfoferrodoxin. This protein is described in some articles as rubredoxin oxidoreductase (rbo), and its gene shares an operon with the rubredoxin gene in Desulfovibrio vulgaris Hildenborough.
Probab=28.90 E-value=18 Score=32.99 Aligned_cols=13 Identities=54% Similarity=1.104 Sum_probs=11.1
Q ss_pred CCcccceeecCCc
Q 010423 413 GQEFKCEICGNYS 425 (511)
Q Consensus 413 ~~ey~CEICGN~~ 425 (511)
..-|+|++|||.+
T Consensus 5 ~~fYkC~~CGniv 17 (125)
T TIGR00320 5 LQVYKCEVCGNIV 17 (125)
T ss_pred CcEEECCCCCcEE
Confidence 3469999999988
No 36
>COG4847 Uncharacterized protein conserved in archaea [Function unknown]
Probab=28.25 E-value=22 Score=31.12 Aligned_cols=17 Identities=35% Similarity=0.870 Sum_probs=14.8
Q ss_pred CcccceeecCCcccchh
Q 010423 414 QEFKCEICGNYSYWGRR 430 (511)
Q Consensus 414 ~ey~CEICGN~~Y~GRk 430 (511)
.+|+|=|||+.+--|-|
T Consensus 5 kewkC~VCg~~iieGqk 21 (103)
T COG4847 5 KEWKCYVCGGTIIEGQK 21 (103)
T ss_pred ceeeEeeeCCEeeeccE
Confidence 58999999999888865
No 37
>COG5112 UFD2 U1-like Zn-finger-containing protein [General function prediction only]
Probab=27.36 E-value=74 Score=28.52 Aligned_cols=31 Identities=16% Similarity=0.127 Sum_probs=26.1
Q ss_pred cccchHHHHHhhhhhhhhHHHHhccccCCCchHHHHHH
Q 010423 249 DYYSTVEELMEVGSERLKEELAAKGLKSGGTLQQRAER 286 (511)
Q Consensus 249 ~~~~s~eklf~~g~~~lke~l~~~gLk~gg~lk~ra~r 286 (511)
.||-.|.+.|. .+.+..-|++++-|.+ |++.
T Consensus 56 hYCieCaryf~------t~~aL~~HkkgkvHkR-R~Ke 86 (126)
T COG5112 56 HYCIECARYFI------TEKALMEHKKGKVHKR-RAKE 86 (126)
T ss_pred eeeehhHHHHH------HHHHHHHHhccchhHH-HHHH
Confidence 79999999999 9998888999888766 4443
No 38
>KOG3408 consensus U1-like Zn-finger-containing protein, probable role in RNA processing/splicing [RNA processing and modification]
Probab=27.30 E-value=27 Score=31.92 Aligned_cols=27 Identities=7% Similarity=-0.007 Sum_probs=26.0
Q ss_pred cccchHHHHHhhhhhhhhHHHHhccccCCCchH
Q 010423 249 DYYSTVEELMEVGSERLKEELAAKGLKSGGTLQ 281 (511)
Q Consensus 249 ~~~~s~eklf~~g~~~lke~l~~~gLk~gg~lk 281 (511)
+||-.|.+.|. .+.++..|++++.|++
T Consensus 58 fyCi~CaRyFi------~~~~l~~H~ktK~HKr 84 (129)
T KOG3408|consen 58 FYCIECARYFI------DAKALKTHFKTKVHKR 84 (129)
T ss_pred eehhhhhhhhc------chHHHHHHHhccHHHH
Confidence 89999999999 9999999999999987
No 39
>PF04194 PDCD2_C: Programmed cell death protein 2, C-terminal putative domain ; InterPro: IPR007320 PDCD2 is localized predominantly in the cytosol of cells situated at the opposite pole of the germinal centre from the centroblasts as well as in cells in the mantle zone. It has been shown to interact with BCL6, an evolutionarily conserved Kruppel-type zinc finger protein that functions as a strong transcriptional repressor and is required for germinal centre development. The rat homologue, Rp8, is associated with programmed cell death in thymocytes.; GO: 0005737 cytoplasm
Probab=26.93 E-value=12 Score=35.41 Aligned_cols=31 Identities=39% Similarity=0.688 Sum_probs=20.6
Q ss_pred CCCCCchhHHHHHHhcCC--CcccceeecCCcccchhhhh
Q 010423 396 GWDGKPIPYWLYKLHGLG--QEFKCEICGNYSYWGRRAFE 433 (511)
Q Consensus 396 GwDGkPIPyWLYKLhGL~--~ey~CEICGN~~Y~GRkaFe 433 (511)
.+.|+|+ |.....-.. ..-+|+.|| |+|.||
T Consensus 78 ~~gG~PL--w~s~~~~~~~~~ip~C~~Cg-----~~R~FE 110 (164)
T PF04194_consen 78 CRGGKPL--WISSTPIPPESDIPKCENCG-----SPRVFE 110 (164)
T ss_pred CCCCeEE--EecCCCCCccccCCCCccCC-----CccEEE
Confidence 5678855 665433222 256899999 788887
No 40
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=26.66 E-value=22 Score=27.43 Aligned_cols=14 Identities=43% Similarity=1.139 Sum_probs=11.6
Q ss_pred cccceeecCCcccch
Q 010423 415 EFKCEICGNYSYWGR 429 (511)
Q Consensus 415 ey~CEICGN~~Y~GR 429 (511)
.|.|-+|| ++|-..
T Consensus 1 ~y~C~~Cg-yiYd~~ 14 (50)
T cd00730 1 KYECRICG-YIYDPA 14 (50)
T ss_pred CcCCCCCC-eEECCC
Confidence 48999999 999754
No 41
>PF13319 DUF4090: Protein of unknown function (DUF4090)
Probab=26.17 E-value=29 Score=29.24 Aligned_cols=16 Identities=38% Similarity=0.671 Sum_probs=11.2
Q ss_pred CCCCCCCchhHHHHHH
Q 010423 394 PMGWDGKPIPYWLYKL 409 (511)
Q Consensus 394 PLGwDGkPIPyWLYKL 409 (511)
-+..||.|||-=.-.|
T Consensus 12 GiDlDGspIP~~~L~L 27 (84)
T PF13319_consen 12 GIDLDGSPIPPAMLEL 27 (84)
T ss_pred CcCCCCCcCCHHHHHH
Confidence 3567999999754433
No 42
>PF07864 DUF1651: Protein of unknown function (DUF1651); InterPro: IPR012447 The proteins in this entry have not been characterised.
Probab=25.76 E-value=49 Score=27.17 Aligned_cols=28 Identities=36% Similarity=0.547 Sum_probs=22.4
Q ss_pred CCCCCcCccccccHHHHHHHHHHHHHhh
Q 010423 449 GIPNTKNFNEITSIEEAKELWKKIQERQ 476 (511)
Q Consensus 449 GIpnt~~F~~IT~I~dA~~Lw~klk~~~ 476 (511)
|-|+...-.-.-.|++|.++|..|.++.
T Consensus 39 g~pp~lk~rr~l~~~~A~e~W~~L~~~G 66 (75)
T PF07864_consen 39 GEPPLLKTRRRLTREEARELWKELQKTG 66 (75)
T ss_pred CCCCcceEEEEEEHHHHHHHHHHHHHcC
Confidence 6666666666669999999999999863
No 43
>PF06107 DUF951: Bacterial protein of unknown function (DUF951); InterPro: IPR009296 This family consists of several short hypothetical bacterial proteins of unknown function.
Probab=24.91 E-value=31 Score=27.49 Aligned_cols=29 Identities=31% Similarity=0.511 Sum_probs=25.5
Q ss_pred HhcCCCcccceeecCCcccchhhhhhhcc
Q 010423 409 LHGLGQEFKCEICGNYSYWGRRAFERHFK 437 (511)
Q Consensus 409 LhGL~~ey~CEICGN~~Y~GRkaFekHF~ 437 (511)
-=|-++-.+|.=||-.+-.-|..|||...
T Consensus 25 R~GaDikikC~gCg~~imlpR~~feK~~K 53 (57)
T PF06107_consen 25 RIGADIKIKCLGCGRQIMLPRSKFEKRLK 53 (57)
T ss_pred EccCcEEEEECCCCCEEEEeHHHHHHHHH
Confidence 34788899999999999999999999753
No 44
>KOG0324 consensus Uncharacterized conserved protein [Function unknown]
Probab=24.56 E-value=27 Score=34.79 Aligned_cols=21 Identities=38% Similarity=0.727 Sum_probs=17.9
Q ss_pred CCCchhHHHHHHhcCCCcccc
Q 010423 398 DGKPIPYWLYKLHGLGQEFKC 418 (511)
Q Consensus 398 DGkPIPyWLYKLhGL~~ey~C 418 (511)
-|||||-|.-.|.-++..+.|
T Consensus 125 tgk~IP~winrLa~~~~~~~~ 145 (214)
T KOG0324|consen 125 TGKKIPSWVNRLARAGLCSLC 145 (214)
T ss_pred cCCCccHHHHHHHHHhhhhHH
Confidence 699999999999988876444
No 45
>PHA00732 hypothetical protein
Probab=24.39 E-value=37 Score=28.52 Aligned_cols=21 Identities=38% Similarity=0.757 Sum_probs=17.3
Q ss_pred cccceeecCCcccchhhhhhhc
Q 010423 415 EFKCEICGNYSYWGRRAFERHF 436 (511)
Q Consensus 415 ey~CEICGN~~Y~GRkaFekHF 436 (511)
+|+|.+|| .++.-..+..+|=
T Consensus 1 py~C~~Cg-k~F~s~s~Lk~H~ 21 (79)
T PHA00732 1 MFKCPICG-FTTVTLFALKQHA 21 (79)
T ss_pred CccCCCCC-CccCCHHHHHHHh
Confidence 48999999 5577788899884
No 46
>PF09026 CENP-B_dimeris: Centromere protein B dimerisation domain; InterPro: IPR015115 Centromere protein B (CENP-B) interacts with centromeric heterochromatin in chromosomes and binds to a specific subset of alphoid satellite DNA, called the CENP-B box. CENP-B may organise arrays of centromere satellite DNA into a higher order structure, which then directs centromere formation and kinetochore assembly in mammalian chromosomes. The CENP-B dimerisation domain is composed of two alpha-helices, which are folded into an antiparallel configuration. Dimerisation of CENP-B is mediated by this domain, in which monomers dimerise to form a symmetrical, antiparallel, four-helix bundle structure with a large hydrophobic patch in which 23 residues of one monomer form van der Waals contacts with the other monomer. This CENP-B dimer configuration may be suitable for capturing two distant CENP-B boxes during centromeric heterochromatin formation. ; GO: 0003677 DNA binding, 0003682 chromatin binding, 0006355 regulation of transcription, DNA-dependent, 0000775 chromosome, centromeric region, 0005634 nucleus; PDB: 1UFI_A.
Probab=24.34 E-value=25 Score=30.85 Aligned_cols=9 Identities=33% Similarity=0.630 Sum_probs=0.9
Q ss_pred CCCchhHHH
Q 010423 398 DGKPIPYWL 406 (511)
Q Consensus 398 DGkPIPyWL 406 (511)
|+-|||-.=
T Consensus 39 de~p~p~fg 47 (101)
T PF09026_consen 39 DEVPVPEFG 47 (101)
T ss_dssp -------HH
T ss_pred ccccchhHH
Confidence 677787553
No 47
>PF04502 DUF572: Family of unknown function (DUF572) ; InterPro: IPR007590 This entry represents eukaryotic proteins with undetermined function belonging to the CWC16 family.
Probab=24.13 E-value=26 Score=36.87 Aligned_cols=34 Identities=26% Similarity=0.517 Sum_probs=23.3
Q ss_pred ccceeecCCcccchh--------hhhhhcchhhhhhcccccC
Q 010423 416 FKCEICGNYSYWGRR--------AFERHFKEWRHQHGMRCLG 449 (511)
Q Consensus 416 y~CEICGN~~Y~GRk--------aFekHF~E~RH~~GmrcLG 449 (511)
-.|.=||+++|+|.| --|+.++=.=+.|-|||=.
T Consensus 41 i~C~~C~~~I~kG~rFNA~Ke~v~~E~Yls~~I~rF~~kC~~ 82 (324)
T PF04502_consen 41 IWCNTCGEYIYKGVRFNARKEKVGNEKYLSTPIYRFYIKCPR 82 (324)
T ss_pred CcCCCCccccccceeeeeeeEecCCCccccceEEEEEEEcCC
Confidence 369999999999976 1244555555666677643
No 48
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=23.84 E-value=42 Score=26.53 Aligned_cols=28 Identities=21% Similarity=0.462 Sum_probs=23.5
Q ss_pred cCCCcccceeecCCcccchhhhhhhcch
Q 010423 411 GLGQEFKCEICGNYSYWGRRAFERHFKE 438 (511)
Q Consensus 411 GL~~ey~CEICGN~~Y~GRkaFekHF~E 438 (511)
=..+.|.|..||=.+|--+.+.+.-+++
T Consensus 10 ~~~v~~~Cp~cGipthcS~ehw~~D~e~ 37 (55)
T PF13824_consen 10 PAHVNFECPDCGIPTHCSEEHWEDDYEE 37 (55)
T ss_pred ccccCCcCCCCCCcCccCHHHHHHhHHH
Confidence 4579999999999999999888766554
No 49
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=23.52 E-value=32 Score=22.72 Aligned_cols=13 Identities=31% Similarity=0.659 Sum_probs=10.9
Q ss_pred cCCCcccceeecC
Q 010423 411 GLGQEFKCEICGN 423 (511)
Q Consensus 411 GL~~ey~CEICGN 423 (511)
+.+++|+|.-||.
T Consensus 12 ~~~v~f~CPnCG~ 24 (24)
T PF07754_consen 12 EQAVPFPCPNCGF 24 (24)
T ss_pred ccCceEeCCCCCC
Confidence 4589999999993
No 50
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer, sometimes replacing ferredoxin as an electron carrier. The 3-D structures of a number of rubredoxins have been solved. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=23.49 E-value=23 Score=26.99 Aligned_cols=30 Identities=33% Similarity=0.812 Sum_probs=19.3
Q ss_pred ccceeecCCcccchhhhhhhcchhhhhhcccccCCCCCcCcccc
Q 010423 416 FKCEICGNYSYWGRRAFERHFKEWRHQHGMRCLGIPNTKNFNEI 459 (511)
Q Consensus 416 y~CEICGN~~Y~GRkaFekHF~E~RH~~GmrcLGIpnt~~F~~I 459 (511)
|.|.+|| ++|-.. .|-.--|||+-..|.++
T Consensus 2 y~C~~Cg-yvYd~~-------------~Gd~~~~i~pGt~F~~L 31 (47)
T PF00301_consen 2 YQCPVCG-YVYDPE-------------KGDPENGIPPGTPFEDL 31 (47)
T ss_dssp EEETTTS-BEEETT-------------TBBGGGTB-TT--GGGS
T ss_pred cCCCCCC-EEEcCC-------------cCCcccCcCCCCCHHHC
Confidence 8899999 999654 34445578766667665
No 51
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation.; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=23.42 E-value=5.5e+02 Score=29.06 Aligned_cols=89 Identities=18% Similarity=0.267 Sum_probs=60.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH---hhcCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHcccchhhHHHHHHccCCCC
Q 010423 5 LLEVTRAAHEEVERLERLVVKD---LQTEPNSNKDRLVQSHRVRNMIDTITDTTERLIEIYADKDNARKDEIAALGGQTA 81 (511)
Q Consensus 5 ~LE~~R~~hEeiErlE~ai~~~---~~~~p~~~k~~l~q~h~i~~~ld~~~~~~~~L~~~y~d~dg~r~~Ei~~l~g~~~ 81 (511)
-++.+|.+.+.|+.|+.....- +......-- ....++..+.++.....+...++...-+++|++|..+-
T Consensus 342 e~~~~~~l~~~l~~l~~~~~~~~~~i~~~~~~yS---~i~~~l~~~~~~l~~ie~~q~~~~~~l~~L~~dE~~Ar----- 413 (560)
T PF06160_consen 342 ELEIVRELEKQLKELEKRYEDLEERIEEQQVPYS---EIQEELEEIEEQLEEIEEEQEEINESLQSLRKDEKEAR----- 413 (560)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----
Confidence 4678888888888877665443 222222211 22345666667777777777777777788888898774
Q ss_pred CCCchHHHHHHHHHHHHHhhhhC
Q 010423 82 TGTNVFSSFYDRLKEIREYHRRH 104 (511)
Q Consensus 82 ~~~~~f~~Fy~~l~~Ike~h~~~ 104 (511)
.....|-..|..|+-+-.+.
T Consensus 414 ---~~l~~~~~~l~~ikR~lek~ 433 (560)
T PF06160_consen 414 ---EKLQKLKQKLREIKRRLEKS 433 (560)
T ss_pred ---HHHHHHHHHHHHHHHHHHHc
Confidence 34889999999999877664
No 52
>TIGR00270 conserved hypothetical protein TIGR00270.
Probab=22.41 E-value=30 Score=32.68 Aligned_cols=11 Identities=55% Similarity=1.235 Sum_probs=8.2
Q ss_pred ceeecCCcccch
Q 010423 418 CEICGNYSYWGR 429 (511)
Q Consensus 418 CEICGN~~Y~GR 429 (511)
|||||.-+. |+
T Consensus 3 CEiCG~~i~-~~ 13 (154)
T TIGR00270 3 CEICGRKIK-GK 13 (154)
T ss_pred cccCCCccC-CC
Confidence 999996663 54
No 53
>PF06147 DUF968: Protein of unknown function (DUF968); InterPro: IPR010373 This is a family of uncharacterised prophage proteins that are also found in bacteria and humans.
Probab=22.36 E-value=53 Score=32.26 Aligned_cols=19 Identities=32% Similarity=0.697 Sum_probs=15.3
Q ss_pred chhHHHHHHhcCCCcccceeecC
Q 010423 401 PIPYWLYKLHGLGQEFKCEICGN 423 (511)
Q Consensus 401 PIPyWLYKLhGL~~ey~CEICGN 423 (511)
-+|.|||.+ +.=+|-|||.
T Consensus 117 ~~~~yl~~v----~~~~C~iCGk 135 (200)
T PF06147_consen 117 ESEKYLYWV----KSRPCVICGK 135 (200)
T ss_pred HHHHHHhhh----ccCccccCCC
Confidence 368999984 4678999994
No 54
>PRK08359 transcription factor; Validated
Probab=22.00 E-value=31 Score=33.35 Aligned_cols=12 Identities=50% Similarity=0.949 Sum_probs=9.6
Q ss_pred cceeecCCcccch
Q 010423 417 KCEICGNYSYWGR 429 (511)
Q Consensus 417 ~CEICGN~~Y~GR 429 (511)
.|||||.-+. |+
T Consensus 8 ~CEiCG~~i~-g~ 19 (176)
T PRK08359 8 YCEICGAEIR-GP 19 (176)
T ss_pred eeecCCCccC-CC
Confidence 4999998884 66
No 55
>PF02132 RecR: RecR protein; InterPro: IPR023628 The bacterial protein RecR seems to play a role in a recombinational process of DNA repair. It may act with RecF and RecO. RecR's structure consists of a N-terminal helix-hairpin-helix (HhH) motif, followed by a Cys4 zinc-finger motif, a Toprim domain and a Walker B motif. This entry represents the C4-type zinc finger.; PDB: 1VDD_D 2V1C_B.
Probab=21.76 E-value=26 Score=25.59 Aligned_cols=9 Identities=67% Similarity=1.350 Sum_probs=3.5
Q ss_pred cceeecCCc
Q 010423 417 KCEICGNYS 425 (511)
Q Consensus 417 ~CEICGN~~ 425 (511)
.|++|||.+
T Consensus 19 ~C~~C~nls 27 (41)
T PF02132_consen 19 FCSICGNLS 27 (41)
T ss_dssp E-SSS--EE
T ss_pred ccCCCCCcC
Confidence 466666654
No 56
>PHA00733 hypothetical protein
Probab=20.51 E-value=52 Score=29.95 Aligned_cols=23 Identities=13% Similarity=0.489 Sum_probs=13.8
Q ss_pred CCcccceeecCCcccchhhhhhhc
Q 010423 413 GQEFKCEICGNYSYWGRRAFERHF 436 (511)
Q Consensus 413 ~~ey~CEICGN~~Y~GRkaFekHF 436 (511)
..+|.|++|| .+|..+....+|-
T Consensus 71 ~kPy~C~~Cg-k~Fss~s~L~~H~ 93 (128)
T PHA00733 71 VSPYVCPLCL-MPFSSSVSLKQHI 93 (128)
T ss_pred CCCccCCCCC-CcCCCHHHHHHHH
Confidence 3457777776 4466666666554
No 57
>COG1439 Predicted nucleic acid-binding protein, consists of a PIN domain and a Zn-ribbon module [General function prediction only]
Probab=20.48 E-value=21 Score=34.57 Aligned_cols=34 Identities=26% Similarity=0.529 Sum_probs=27.3
Q ss_pred CCCCCCCCCchhHHHHHHhcCCCccc-----ceeecCCc
Q 010423 392 KLPMGWDGKPIPYWLYKLHGLGQEFK-----CEICGNYS 425 (511)
Q Consensus 392 nLPLGwDGkPIPyWLYKLhGL~~ey~-----CEICGN~~ 425 (511)
+.+++.-.+-+=-|-|.=||=.+.|+ |+|||..+
T Consensus 125 ~~~~~~~I~~v~~w~~rC~GC~~~f~~~~~~Cp~CG~~~ 163 (177)
T COG1439 125 SISYKGKIKKVRKWRLRCHGCKRIFPEPKDFCPICGSPL 163 (177)
T ss_pred eeeccCccceEeeeeEEEecCceecCCCCCcCCCCCCce
Confidence 34555555666789999999999999 99999764
No 58
>PRK07708 hypothetical protein; Validated
Probab=20.30 E-value=1.1e+02 Score=30.56 Aligned_cols=51 Identities=22% Similarity=0.242 Sum_probs=41.0
Q ss_pred CCcCccccccHHHHHHHHHHHHHhhcCCCCCCCCCceeeccCCCccchhhhHHHhhc
Q 010423 452 NTKNFNEITSIEEAKELWKKIQERQGGIKWRPELEEEYEDKEGNIYNKKTYTDLQRQ 508 (511)
Q Consensus 452 nt~~F~~IT~I~dA~~Lw~klk~~~~~~~~~~~~~eE~ED~~GNVmskK~YeDLkrQ 508 (511)
.|..+-+=..+++|+.|.+.+.+..+. .+.++.|.+|+..++|.-..|-+|
T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~d~~~~~~~~k~~~~~~~~ 66 (219)
T PRK07708 16 QTELTSDWMNIEEALQLAEDFEKTGRV------KELEFYDEMDTEWSLKELKKLSKE 66 (219)
T ss_pred eeEEEeccccHHHHHHHHHHHhhcCCc------eeEEEecCCCCEeeHHHHhhhhhh
Confidence 345556777899999999999887653 478999999999999987776553
Done!