Query psy447
Match_columns 417
No_of_seqs 233 out of 849
Neff 3.8
Searched_HMMs 46136
Date Fri Aug 16 17:52:07 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy447.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/447hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3563|consensus 100.0 2.1E-72 4.5E-77 557.5 18.4 379 6-416 33-453 (454)
2 KOG3562|consensus 100.0 2.8E-38 6.1E-43 299.1 4.5 110 114-223 4-113 (277)
3 PF00250 Fork_head: Fork head 100.0 5.6E-38 1.2E-42 263.0 2.6 96 123-218 1-96 (96)
4 cd00059 FH Forkhead (FH), also 100.0 1.3E-36 2.8E-41 247.0 7.0 78 123-200 1-78 (78)
5 smart00339 FH FORKHEAD. FORKHE 100.0 1.4E-35 3E-40 245.8 6.7 86 123-208 1-86 (89)
6 KOG2294|consensus 100.0 2.2E-35 4.7E-40 294.9 -1.7 108 119-226 126-236 (454)
7 KOG4385|consensus 99.9 8.3E-23 1.8E-27 209.9 0.9 79 120-203 365-443 (581)
8 COG5025 Transcription factor o 99.8 5.4E-21 1.2E-25 202.8 2.9 101 120-220 334-434 (610)
9 COG5025 Transcription factor o 99.5 3.2E-15 6.8E-20 159.3 -1.0 93 118-210 81-173 (610)
10 PF09354 HNF_C: HNF3 C-termina 98.1 6.8E-07 1.5E-11 71.6 -0.1 29 352-393 1-32 (65)
11 PF08430 Fork_head_N: Forkhead 96.8 0.0047 1E-07 55.8 7.5 20 102-122 117-136 (136)
12 PF00538 Linker_histone: linke 93.3 0.12 2.5E-06 41.7 3.9 33 127-159 4-37 (77)
13 smart00526 H15 Domain in histo 90.4 0.59 1.3E-05 36.4 4.8 32 127-158 6-38 (66)
14 cd00073 H15 linker histone 1 a 87.1 0.94 2E-05 37.6 4.2 34 127-160 6-40 (88)
15 PF05066 HARE-HTH: HB1, ASXL, 82.3 0.88 1.9E-05 35.9 1.8 57 126-184 1-62 (72)
16 PF14338 Mrr_N: Mrr N-terminal 73.8 1.9 4.2E-05 35.6 1.6 73 127-204 1-85 (92)
17 KOG4012|consensus 54.2 15 0.00032 36.7 3.7 37 126-162 45-82 (243)
18 COG2958 Uncharacterized protei 51.7 18 0.00038 36.9 3.9 60 126-186 5-76 (307)
19 PF12872 OST-HTH: OST-HTH/LOTU 35.4 32 0.00069 26.5 2.3 41 127-167 5-47 (74)
20 cd06649 PKc_MEK2 Catalytic dom 28.7 34 0.00074 33.0 1.8 48 128-180 277-327 (331)
21 cd05071 PTKc_Src Catalytic dom 27.3 56 0.0012 30.0 2.9 34 125-158 229-262 (262)
22 PF08047 His_leader: Histidine 26.7 24 0.00052 21.6 0.2 8 306-313 6-13 (16)
23 PF08430 Fork_head_N: Forkhead 23.1 1.6E+02 0.0035 26.9 4.9 14 11-25 16-30 (136)
24 KOG1528|consensus 22.5 38 0.00082 35.3 0.9 32 173-204 109-141 (351)
25 cd08229 STKc_Nek7 Catalytic do 21.8 97 0.0021 28.1 3.3 30 127-156 236-265 (267)
26 PF12317 IFT46_B_C: Intraflage 20.4 70 0.0015 31.6 2.1 63 145-207 16-81 (214)
No 1
>KOG3563|consensus
Probab=100.00 E-value=2.1e-72 Score=557.50 Aligned_cols=379 Identities=43% Similarity=0.604 Sum_probs=256.7
Q ss_pred ccCCCCCCCCCCCcccccccc------cccCCCCCCCCCCC----CCCCCcccc-------c--------ccccCCCCCC
Q psy447 6 SHNTSLGSPGMTPSYSMNSVI------NSCSPQSGSTGTPL----GGFSSNMHS-------M--------GSINGMNGPN 60 (417)
Q Consensus 6 ~mn~~~~~~~~~~~y~~~~~~------~~~~~~~~~~~~~~----~~~~~~~~~-------~--------~~~~~~~~~~ 60 (417)
-||.+++++.|+..|.||+|+ +.+++.++++++-. ++++.+... . +.+|+| | |
T Consensus 33 N~~s~~~~~~~nsymtMn~~~~~sg~~g~~~aas~~~S~~d~A~~a~~sp~~~agms~~~nag~~gs~~aAg~~~m-g-~ 110 (454)
T KOG3563|consen 33 NMNSMLPENKMNSYMTMNPMGTPSGTSGNMPAASFNSSSLDTAMGAGMSPAAPAGMSFGANAGMNGSSGAAGVNAM-G-N 110 (454)
T ss_pred CccccCchhhhhhheeeccccCCCCCCCCcCCcccccccchhhhccCCCcccccCCCccccccccCccchhhcccC-C-c
Confidence 367888888888844499984 23344444444431 222221000 0 124555 4 6
Q ss_pred CCCCCCCCCCCCCcccc--ccCCCccCcccccccccccCCCCCchhhhhhhhccccccCcCCCCCCCCCHHHHHHHHHhc
Q psy447 61 CMSTGSMGYSSNMNAAC--MGGINSYNTALTTGRAELAENSPSPGALQRVARADKTYRRSYTHAKPPYSYISLITMAIQN 138 (417)
Q Consensus 61 ~~~~~~~g~~~~~~~~~--~~~l~s~~~~~~s~~~~l~~~sp~~~~~~~~~r~~k~~rr~~~~~KPPySYa~LI~~AI~s 138 (417)
|++.+.|+-.+.+.++. .+||.++..+.. +.......+|..-.-.| ....|+|||++.+.||||||+.||+|||+.
T Consensus 111 ~lsPs~msplGa~aAasma~~gl~~yg~~~~-~s~s~~a~sp~~l~rsr-g~~pktyRRsY~haKPPYSYISLITMAIQ~ 188 (454)
T KOG3563|consen 111 GLSPSNMSPLGAQAAASMATSGLGPYGAMNG-RSMSPMAYSPADLGRSR-GQDPKTYRRSYTHAKPPYSYISLITMAIQQ 188 (454)
T ss_pred ccCccccCcchhhhhhhhhccccccccccCC-CccCccccCcccccccc-CCCCceeeccccCCCCChhHHHHHHHHHHh
Confidence 66666553223332222 234433332221 11111222333322222 335689999999999999999999999999
Q ss_pred CCCCCcCHHHHHHHHHhhCccccccccccccccccccccccccccccCCCCCCCCccceEecCCCcccccccchhhcccc
Q psy447 139 SPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPGKGSFWTLHKDSGNMFENGCYLRRQKR 218 (417)
Q Consensus 139 SP~krLTLsEIY~wI~~~fPYyr~~~~gWKNSIRHNLSLNkcFvKVpR~~~~pGKGs~WtLdPe~~~~fenG~~~rRrkr 218 (417)
+|.|+|||+|||+||++.|||||.+.++|||||||.||+|+|||||+|.+|+||||+||+|+|+++++||||||+||+||
T Consensus 189 ~pskmLTLSEIYqwIMDLFPyYrqNQQRWQNSIRHSLSFNDCFVKVaRSPDKPGKGSfWTLHpdsGNMFENGCYLRRQKR 268 (454)
T KOG3563|consen 189 APSKMLTLSEIYQWIMDLFPYYRQNQQRWQNSIRHSLSFNDCFVKVARSPDKPGKGSFWTLHPDSGNMFENGCYLRRQKR 268 (454)
T ss_pred CCccceeHHHHHHHHHHhhhHhhhhHHHHHhhhhhhccccceeeeccCCCCCCCCccceeecCCcCcccccchhehhhhh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccchhHHHHHhhccCCCCCCCCCCCCCCCCCCCcccCCCCCCCCCCCCCCCCCCCCCCCCChhhhhhcccchhhhhhhhc
Q psy447 219 FKDDKKEVIRQTHHKSVSPTHHHPTHHHHHHSHHTHHNSHNTHSHQSHDAGGKKSPETKPHLSELAMLGLTKEDAMSAGL 298 (417)
Q Consensus 219 ~kr~k~e~~Rq~~~~s~s~~~~~~~~~~~~~~ss~~~~~~~~hs~~~~d~~~~~r~~tss~aS~la~l~~~~~d~~~~~~ 298 (417)
||.+|++.++...-.+-+............ .......++..+--.+. +..++.++....+|+. ++.+.+ .
T Consensus 269 FK~ekk~~~~~~~gas~s~~~~~~~~~k~~-----~~~~~~t~p~~~sp~Ga---pap~~~ae~~~~~ha~-~a~~ga-~ 338 (454)
T KOG3563|consen 269 FKCEKKEASKGGSGASGSGLNGTGGGPKST-----TDAATATSPPQPSPAGA---PAPSPEAEPKEGLHAV-DATGGA-L 338 (454)
T ss_pred hhhhhhhccccCCCCccccccccCCCcccc-----cCcccCCCCCCCCCCCC---CCCCcccchhhhhhhh-hhcccc-c
Confidence 999999887433222222211111111100 00000011111111111 5566678888889988 577776 3
Q ss_pred cccccccccccccchh-hhHHHHHHHhcCCcccCcCCCCccc-cccccccCCCCCCCCcccccccCchhhhHHHhhhhcc
Q psy447 299 LHSDFQQNSYHQSFHQ-QDSELAAMVASGRCHPHLIPPDHYN-HHLKQEYGASAVNHPFPVSRLLPSASAMSAAMAMSAD 376 (417)
Q Consensus 299 ~~~~~~~~~~h~~hh~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (417)
.++..+-+ +++++|+ +++++.|+++ ++||+++|. +|-. +|||+|++|+ .+|||+|++|||+.+-
T Consensus 339 ~l~~~pas-~t~~~~~~~~pg~~~s~~-~~~~~~~~~-~h~sq~~lk~~~~Ya-fnhpfsinnlmss~q~---------- 404 (454)
T KOG3563|consen 339 ELGLTPAS-STPPPTGLSQPGSVASSA-ATLGAHGLA-PHTSQAHLKLDPHYA-FNHPFSINNLMSSSQQ---------- 404 (454)
T ss_pred ccCCCCcc-cCCCCcCCCCcchhhhhh-hccCCCcCC-CCccccccccCCCcc-ccCchhhhhhcccccc----------
Confidence 44444443 3344444 5788888886 999999999 4655 5999999999 9999999999998332
Q ss_pred ccccccccceeecc---cCcc-CCC---------CCCcccccCccccCCCCCC
Q psy447 377 SAKSIDLCKMYEMT---GGYS-TGG---------NNDSYYQSSSLYHHSTTTS 416 (417)
Q Consensus 377 ~~~~~~~~~~~~~~---~~~~-~~~---------~~~~~~~~~~~~~~~~~~~ 416 (417)
. -|.|+|||||+ .||. +++ .+++||| ++||.+..+|
T Consensus 405 -~-~kld~k~Y~~~~qY~gyg~a~gp~tn~~p~~~~~syYQ--g~Ysrp~~nt 453 (454)
T KOG3563|consen 405 -A-HKLDVKAYEQTSQYPGYGSALGPATNGEPSALEGSYYQ--GLYSRPLLNT 453 (454)
T ss_pred -c-cccchhhhhhhcccCCcCcccCccccCCccccccceee--eeccccCCCC
Confidence 1 38999999996 5663 333 6789999 4599998876
No 2
>KOG3562|consensus
Probab=100.00 E-value=2.8e-38 Score=299.08 Aligned_cols=110 Identities=68% Similarity=1.211 Sum_probs=106.0
Q ss_pred cccCcCCCCCCCCCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCccccccccccccccccccccccccccccCCCCCCCC
Q psy447 114 TYRRSYTHAKPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPGK 193 (417)
Q Consensus 114 ~~rr~~~~~KPPySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fPYyr~~~~gWKNSIRHNLSLNkcFvKVpR~~~~pGK 193 (417)
+.|.++.++||||||+.|.+|||+++|+|.|.|+|||.||+++|||||.+.+.||||+|||||+|+||+||||..+++||
T Consensus 4 P~k~sY~dQKPPYSYIsLTaMAI~~SpekmLPLseIYkfImDrFPfYRkNTQrWQNSLRHNLSFNDCFIKiPRr~drpGK 83 (277)
T KOG3562|consen 4 PGKNSYGDQKPPYSYISLTAMAIQSSPEKMLPLSEIYKFIMDRFPFYRKNTQRWQNSLRHNLSFNDCFIKIPRRPDRPGK 83 (277)
T ss_pred CccccccccCCCceeeehhhHHhhcChhhcCcHHHHHHHHHhhCchhhhchHHHHHHhhccccccceeeecCCCCCCCCC
Confidence 55678889999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccceEecCCCcccccccchhhccccccchh
Q psy447 194 GSFWTLHKDSGNMFENGCYLRRQKRFKDDK 223 (417)
Q Consensus 194 Gs~WtLdPe~~~~fenG~~~rRrkr~kr~k 223 (417)
|+||+|+|.+.+|||||.++|||||++..+
T Consensus 84 GsyWalHP~a~dMFENGS~LRRrKRFrv~k 113 (277)
T KOG3562|consen 84 GSYWALHPSAFDMFENGSLLRRRKRFRVLK 113 (277)
T ss_pred ccceeeccchhhhcccchHHHHhhhhhhcc
Confidence 999999999999999999999999997554
No 3
>PF00250 Fork_head: Fork head domain; InterPro: IPR001766 The fork head protein of Drosophila melanogaster, a transcription factor that promotes terminal rather than segmental development, contains neither homeodomains nor zinc-fingers characteristic of other transcription factors []. Instead, it contains a distinct type of DNA-binding region, containing around 100 amino acids, which has since been identified in a number of transcription factors (including D. melanogaster FD1-5, mammalian HNF-3, human HTLF, Saccharomyces cerevisiae HCM1, etc.). This is referred to as the fork head domain but is also known as a 'winged helix' [, , ]. The fork head domain binds B-DNA as a monomer [], but shows no similarity to previously identified DNA-binding motifs. Although the domain is found in several different transcription factors, a common function is their involvement in early developmental decisions of cell fates during embryogenesis [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2UZK_A 2K86_A 1JXS_A 2C6Y_A 2A3S_A 2D2W_A 2KIU_A 1VTN_C 2A07_J 2AS5_F ....
Probab=100.00 E-value=5.6e-38 Score=263.02 Aligned_cols=96 Identities=59% Similarity=1.116 Sum_probs=90.1
Q ss_pred CCCCCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCccccccccccccccccccccccccccccCCCCCCCCccceEecCC
Q psy447 123 KPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPGKGSFWTLHKD 202 (417)
Q Consensus 123 KPPySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fPYyr~~~~gWKNSIRHNLSLNkcFvKVpR~~~~pGKGs~WtLdPe 202 (417)
||||||++||++||+++|+++|||+|||+||+++||||+.+.++||||||||||+|+||+||+|..+++|||+||+|+|+
T Consensus 1 KPp~sY~~LI~~Ai~~sp~~~Ltl~eIy~~i~~~~pyyr~~~~~WknSIRHnLS~~~~F~kv~~~~~~~gkg~~W~i~~~ 80 (96)
T PF00250_consen 1 KPPYSYATLIAMAILSSPDKRLTLSEIYEWIEENFPYYRDASKGWKNSIRHNLSLNKCFVKVPRDPSEPGKGSYWTIDPE 80 (96)
T ss_dssp -SSS-HHHHHHHHHHTSTTSEBEHHHHHHHHHHHCGHHHCTHCHHHHHHHHHHHHSTTEEEESCCTSSSSSSEEEEE-CT
T ss_pred CCCCcHHHHHHHHHHhCCCCCccHHHHHHHHHHhhccccccchhhhhHHhhhccccceeeecCcCCCCCCCceeEEeCHH
Confidence 89999999999999999999999999999999999999999999999999999999999999998888999999999999
Q ss_pred Ccccccccchhhcccc
Q psy447 203 SGNMFENGCYLRRQKR 218 (417)
Q Consensus 203 ~~~~fenG~~~rRrkr 218 (417)
....|+++.+++||+|
T Consensus 81 ~~~~~~~~~~~~~~~~ 96 (96)
T PF00250_consen 81 AIEEFEKGRFKRRRKR 96 (96)
T ss_dssp HHHHHHHSCCSSSSSS
T ss_pred HHHHHhcchhhhhhcc
Confidence 9999999998888875
No 4
>cd00059 FH Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
Probab=100.00 E-value=1.3e-36 Score=246.99 Aligned_cols=78 Identities=69% Similarity=1.252 Sum_probs=76.5
Q ss_pred CCCCCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCccccccccccccccccccccccccccccCCCCCCCCccceEec
Q psy447 123 KPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPGKGSFWTLH 200 (417)
Q Consensus 123 KPPySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fPYyr~~~~gWKNSIRHNLSLNkcFvKVpR~~~~pGKGs~WtLd 200 (417)
||||||++||++||+++|+++|||+|||+||+++|||||.+.+|||||||||||+|+||+||+|..+++|||+||+||
T Consensus 1 kP~~sY~~LI~~Ai~~sp~~~lTL~eIy~~I~~~~pyyr~~~~gWknSIRHnLS~n~~F~kv~r~~~~~gkg~~W~i~ 78 (78)
T cd00059 1 KPPYSYSALIAMAIQSSPEKRLTLSEIYKWISDNFPYFRDAPAGWQNSIRHNLSLNKCFVKVPREPDEPGKGSYWTLD 78 (78)
T ss_pred CCCCCHHHHHHHHHHhCCCCCeeHHHHHHHHHHhCCccccCCCCCccceeEecccccceEECCCCCCCCCCCCeeeeC
Confidence 899999999999999999999999999999999999999999999999999999999999999998889999999996
No 5
>smart00339 FH FORKHEAD. FORKHEAD, also known as a "winged helix"
Probab=100.00 E-value=1.4e-35 Score=245.77 Aligned_cols=86 Identities=71% Similarity=1.251 Sum_probs=82.1
Q ss_pred CCCCCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCccccccccccccccccccccccccccccCCCCCCCCccceEecCC
Q psy447 123 KPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPGKGSFWTLHKD 202 (417)
Q Consensus 123 KPPySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fPYyr~~~~gWKNSIRHNLSLNkcFvKVpR~~~~pGKGs~WtLdPe 202 (417)
||||||++||++||+++|+++|||+|||+||+++|||||.+.++||||||||||+|+||+||+|..+.+|||+||+|+|+
T Consensus 1 kP~~sY~~lI~~ai~~sp~~~ltl~~Iy~~I~~~~pyy~~~~~~WknSIRHnLS~n~~F~kv~~~~~~~gkg~~W~i~~~ 80 (89)
T smart00339 1 KPPYSYIALIAMAILSSPDKRLTLSEIYKWIEDNFPYYRENRAGWQNSIRHNLSLNDCFVKVPREGDRPGKGSYWTLDPD 80 (89)
T ss_pred CCCCCHHHHHHHHHHhCCCCCeeHHHHHHHHHHhCchhhcCccccccccccccchhhceeecCcCCCCCCCCCEEEECcc
Confidence 79999999999999999999999999999999999999999999999999999999999999998888899999999999
Q ss_pred Cccccc
Q psy447 203 SGNMFE 208 (417)
Q Consensus 203 ~~~~fe 208 (417)
+..+|+
T Consensus 81 ~~~~~~ 86 (89)
T smart00339 81 AENMFE 86 (89)
T ss_pred HHHHHh
Confidence 876444
No 6
>KOG2294|consensus
Probab=100.00 E-value=2.2e-35 Score=294.86 Aligned_cols=108 Identities=51% Similarity=0.955 Sum_probs=99.6
Q ss_pred CCCCCCCCCHHHHHHHHHhcCCCCCcCHHHHHHHHHh-hCccccccc-cccccccccccccccccccccCCCCCCCCccc
Q psy447 119 YTHAKPPYSYISLITMAIQNSPTKMLTLSEIYQFIMD-LFPFYRQNQ-QRWQNSIRHSLSFNDCFVKVPRTPDKPGKGSF 196 (417)
Q Consensus 119 ~~~~KPPySYa~LI~~AI~sSP~krLTLsEIY~wI~~-~fPYyr~~~-~gWKNSIRHNLSLNkcFvKVpR~~~~pGKGs~ 196 (417)
....||||||++||+|||+.+|+|+|||+|||+||.. +|||||++. .|||||||||||||+||+||+|..+++|||+|
T Consensus 126 ~~~~kpp~SY~~LI~mAI~~Sp~krLtLs~Iy~~i~~~~fpyyr~~~~~gWqNSIRHNLSLn~cF~Kvpr~~~~~gKg~~ 205 (454)
T KOG2294|consen 126 GAGVKPPYSYSGLIAMAIRLSPGKRLTLGGIYFYILLEHFPYYRDANMPGWKNSIRHNLSLNKCFVKVPRERGRPGKGNY 205 (454)
T ss_pred CCCcCCccccccccccccccCCCcceeecCeeEEEEeccccccccCCCccchhhcccccccccCccccCCcccCCCCCCc
Confidence 3579999999999999999999999999999999995 999999999 99999999999999999999999889999999
Q ss_pred eEecCCCcc-cccccchhhccccccchhHHH
Q psy447 197 WTLHKDSGN-MFENGCYLRRQKRFKDDKKEV 226 (417)
Q Consensus 197 WtLdPe~~~-~fenG~~~rRrkr~kr~k~e~ 226 (417)
|+|||+..+ +|++|.++||++++++.....
T Consensus 206 W~ldP~~~~~~~~~g~~~rr~~~~~~~~~~~ 236 (454)
T KOG2294|consen 206 WTLDPDDENNMFDNGSFRRRRRSKSNGRSKS 236 (454)
T ss_pred cccCcchhccccccccccccccccccccccc
Confidence 999999998 999999999988765544333
No 7
>KOG4385|consensus
Probab=99.85 E-value=8.3e-23 Score=209.85 Aligned_cols=79 Identities=43% Similarity=0.914 Sum_probs=75.3
Q ss_pred CCCCCCCCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCccccccccccccccccccccccccccccCCCCCCCCccceEe
Q psy447 120 THAKPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPGKGSFWTL 199 (417)
Q Consensus 120 ~~~KPPySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fPYyr~~~~gWKNSIRHNLSLNkcFvKVpR~~~~pGKGs~WtL 199 (417)
.+.||||+|+.||.+||.++|++.|||+|||.|+.+.|.|||.+...|||+|||||||+|||++|+. -||..|++
T Consensus 365 ~dvRPPfTYAsLIRqAIlESpdkQLTLNEIY~WFTrtFAYFRRNaATWKnAVRHNLSLHKCF~RVEn-----vkgavwtv 439 (581)
T KOG4385|consen 365 ADVRPPFTYASLIRQAILESPDKQLTLNEIYNWFTRTFAYFRRNAATWKNAVRHNLSLHKCFVRVEN-----VKGAVWTV 439 (581)
T ss_pred cccCCchhHHHHHHHHHhcCchhhccHHHHHHHHHHHHHHHhccchhHhHHHhhhhHHHHHHHHHHH-----Hhcceeee
Confidence 5899999999999999999999999999999999999999999999999999999999999999965 58999999
Q ss_pred cCCC
Q psy447 200 HKDS 203 (417)
Q Consensus 200 dPe~ 203 (417)
|+.+
T Consensus 440 De~e 443 (581)
T KOG4385|consen 440 DERE 443 (581)
T ss_pred ehhh
Confidence 9764
No 8
>COG5025 Transcription factor of the Forkhead/HNF3 family [Transcription]
Probab=99.81 E-value=5.4e-21 Score=202.81 Aligned_cols=101 Identities=44% Similarity=0.730 Sum_probs=93.2
Q ss_pred CCCCCCCCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCccccccccccccccccccccccccccccCCCCCCCCccceEe
Q psy447 120 THAKPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPGKGSFWTL 199 (417)
Q Consensus 120 ~~~KPPySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fPYyr~~~~gWKNSIRHNLSLNkcFvKVpR~~~~pGKGs~WtL 199 (417)
...||+++|+.+|+.||+.+|+++|+|.+||.||..+|||||.++.+|+||||||||+|++|.||||....+|||+||.|
T Consensus 334 ~~~kP~~sya~~iT~ail~s~~~kmsls~Iy~~i~s~~pyYr~~p~~w~nSiRhnlSlNksf~kvP~sa~~pGKg~fw~i 413 (610)
T COG5025 334 RTSKPAFSYANSITQAILSSPSGKMTLSEIYSWISSNLPYYRHKPTAWQNSIRHNLSLNKSFEKVPRSASQPGKGCFWKI 413 (610)
T ss_pred cCccCCcchhhhhHHHhhcCcccccccccccccccccchhhccCCcccCchhhhhhhhccccccCCCCcCCCCCcccCcc
Confidence 46899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCCcccccccchhhcccccc
Q psy447 200 HKDSGNMFENGCYLRRQKRFK 220 (417)
Q Consensus 200 dPe~~~~fenG~~~rRrkr~k 220 (417)
++++...|+.-+.+++.+...
T Consensus 414 ~~s~~~~~~sk~~~~s~~~~~ 434 (610)
T COG5025 414 DYSYIYEKESKRNPRSPKKSP 434 (610)
T ss_pred ChhhhhhhcccccccCccccc
Confidence 999998887766555554443
No 9
>COG5025 Transcription factor of the Forkhead/HNF3 family [Transcription]
Probab=99.48 E-value=3.2e-15 Score=159.27 Aligned_cols=93 Identities=43% Similarity=0.726 Sum_probs=86.7
Q ss_pred cCCCCCCCCCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCccccccccccccccccccccccccccccCCCCCCCCccce
Q psy447 118 SYTHAKPPYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLSFNDCFVKVPRTPDKPGKGSFW 197 (417)
Q Consensus 118 ~~~~~KPPySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fPYyr~~~~gWKNSIRHNLSLNkcFvKVpR~~~~pGKGs~W 197 (417)
.+.--||||+|+.+|.++|..+++++|||..||.||-..|+||.....+|+|+|||||+++++|.||.+..+..+||.||
T Consensus 81 ~~t~~v~~~s~a~gr~~~il~s~~~~lt~s~~~~~~~~~~~~~~k~~~~~~~sIr~Nls~~~a~~~i~g~~g~~~~g~~~ 160 (610)
T COG5025 81 NWTYYVPPYSYATGRGLAILNSPDKPLTLSKIYTWIHNTFFYYAKVVSRWQNSIRHNLSLNDAFIKIEGRNGAKVKGHFW 160 (610)
T ss_pred CcccccCCeeeeeccccccccCcccCCcccceeeeeeecccccccccchhhhhhhcccccCceEEEEeccCCccccceee
Confidence 34568999999999999999999999999999999999999999999999999999999999999999998889999999
Q ss_pred EecCCCccccccc
Q psy447 198 TLHKDSGNMFENG 210 (417)
Q Consensus 198 tLdPe~~~~fenG 210 (417)
.|.|+....|.-.
T Consensus 161 ~igP~~~~~~l~~ 173 (610)
T COG5025 161 SIGPGHETQFLKS 173 (610)
T ss_pred ccCCCccceeecc
Confidence 9999987766543
No 10
>PF09354 HNF_C: HNF3 C-terminal domain; InterPro: IPR018533 This presumed domain is found in the C-terminal region of Hepatocyte Nuclear Factor 3 alpha and beta chains. Its specific function is uncertain. The N-terminal region of this presumed domain contains an EH1 (engrailed homology 1) motif, that is characterised by the FxIxxIL sequence [].
Probab=98.08 E-value=6.8e-07 Score=71.55 Aligned_cols=29 Identities=41% Similarity=0.687 Sum_probs=23.3
Q ss_pred CCCcccccccCchhhhHHHhhhhccccccccccceeecc---cCc
Q psy447 352 NHPFPVSRLLPSASAMSAAMAMSADSAKSIDLCKMYEMT---GGY 393 (417)
Q Consensus 352 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 393 (417)
|||||||||++..+. .|.|+||||+. .||
T Consensus 1 nHPFSI~nLms~eqq-------------~k~dlK~Ye~~~~Ys~Y 32 (65)
T PF09354_consen 1 NHPFSINNLMSSEQQ-------------HKMDLKMYEQMMQYSGY 32 (65)
T ss_pred CCCcchhhhcccccc-------------cchhHHHHHHHhccCCC
Confidence 799999999987321 38999999885 366
No 11
>PF08430 Fork_head_N: Forkhead N-terminal region; InterPro: IPR013638 The region described in this entry is found towards the N terminus of various eukaryotic fork head/HNF-3-related transcription factors (which contain the IPR001766 from INTERPRO domain). These proteins play key roles in embryogenesis, maintenance of differentiated cell states, and tumorigenesis []. ; GO: 0008134 transcription factor binding, 0019904 protein domain specific binding
Probab=96.80 E-value=0.0047 Score=55.79 Aligned_cols=20 Identities=75% Similarity=1.031 Sum_probs=15.8
Q ss_pred chhhhhhhhccccccCcCCCC
Q psy447 102 PGALQRVARADKTYRRSYTHA 122 (417)
Q Consensus 102 ~~~~~~~~r~~k~~rr~~~~~ 122 (417)
...++| +|.+|.|||.++|+
T Consensus 117 ~~~l~R-~R~~KtYRRSYTHA 136 (136)
T PF08430_consen 117 QSALNR-ARDPKTYRRSYTHA 136 (136)
T ss_pred cccccc-ccCCccccccCCCC
Confidence 445666 88999999998874
No 12
>PF00538 Linker_histone: linker histone H1 and H5 family; InterPro: IPR005818 Histone proteins have central roles in both chromatin organisation (as structural units of the nucleosome) and gene regulation (as dynamic components that have a direct impact on DNA transcription and replication). Eukaryotic DNA wraps around a histone octamer to form a nucleosome, the first order of compaction of eukaryotic chromatin. The core histone octamer is composed of a central H3-H4 tetramer and two flanking H2A-H2B dimers. Each of the core histone contains a common structural motif, called the histone fold, which facilitates the interactions between the individual core histones. In addition to the core histones, there is a "linker histone" called H1 (or H5 in avian species). The linker histones present in all multicellular eukaryotes are the most divergent group of histones, with numerous cell type- and stage-specific variant. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures. Histone H5 performs the same function as histone H1, and replaces H1 in certain cells. The structure of GH5, the globular domain of the linker histone H5 is known [, ]. The fold is similar to the DNA-binding domain of the catabolite gene activator protein, CAP, thus providing a possible model for the binding of GH5 to DNA. The linker histones, which do not contain the histone fold motif, are critical to the higher-order compaction of chromatin, because they bind to internucleosomal DNA and facilitate interactions between individual nucleosomes. In addition, H1 variants have been shown to be involved in the regulation of developmental genes. A common feature of this protein family is a tripartite structure in which a globular (H15) domain of about 80 amino acids is flanked by two less structured N- and C-terminal tails. The H15 domain is also characterised by high sequence homology among the family of linker histones. The highly conserved H15 domain is essential for the binding of H1 or H5 to the nucleosome. It consists of a three helix bundle (I-III), with a beta-hairpin at the C terminus. There is also a short three-residue stretch between helices I and II that is in the beta-strand conformation. Together with the C-terminal beta-hairpin, this strand forms the third strand of an antiparallel beta-sheet [, , , ]. Proteins known to contain a H15 domain are: - Eukaryotic histone H1. The histones H1 constitute a family with many variants, differing in their affinity for chromatin. Several variants are simultaneously present in a single cell. For example, the nucleated erythrocytes of birds contain both H1 and H5, the latter being an extreme variant of H1. - Eukaryotic MHYST family of histone acetyltransferase. Histone acetyltransferases transfer an acetyl group from acetyl-CoA to the epsylon- amino group of lysine within the basic NH2-termini of histones, which bind the acidic phosphates of DNA []. This entry represents the H15 domain.; GO: 0003677 DNA binding, 0006334 nucleosome assembly, 0000786 nucleosome, 0005634 nucleus; PDB: 2LSO_A 2RQP_A 1UHM_A 1UST_A 1GHC_A 1HST_A 1YQA_A 1USS_A.
Probab=93.28 E-value=0.12 Score=41.71 Aligned_cols=33 Identities=24% Similarity=0.323 Sum_probs=28.9
Q ss_pred CHHHHHHHHHhcC-CCCCcCHHHHHHHHHhhCcc
Q psy447 127 SYISLITMAIQNS-PTKMLTLSEIYQFIMDLFPF 159 (417)
Q Consensus 127 SYa~LI~~AI~sS-P~krLTLsEIY~wI~~~fPY 159 (417)
+|..||..||.+. ..+..++..|..||+++|+.
T Consensus 4 ~y~~mI~eAI~~l~er~GsS~~aI~kyI~~~y~~ 37 (77)
T PF00538_consen 4 PYSDMILEAIKALKERKGSSLQAIKKYIKAKYKV 37 (77)
T ss_dssp CHHHHHHHHHHHCCSSSSEEHHHHHHHHHHHSSC
T ss_pred CHHHHHHHHHHHcCCCCCCCHHHHHHHHHHhcCc
Confidence 4999999999874 45789999999999999974
No 13
>smart00526 H15 Domain in histone families 1 and 5.
Probab=90.39 E-value=0.59 Score=36.42 Aligned_cols=32 Identities=28% Similarity=0.338 Sum_probs=27.9
Q ss_pred CHHHHHHHHHhcCC-CCCcCHHHHHHHHHhhCc
Q psy447 127 SYISLITMAIQNSP-TKMLTLSEIYQFIMDLFP 158 (417)
Q Consensus 127 SYa~LI~~AI~sSP-~krLTLsEIY~wI~~~fP 158 (417)
+|..||..||.... .+..++..|..||+.+|+
T Consensus 6 ~~~~mI~eAI~~l~er~GsS~~aI~kyi~~~~~ 38 (66)
T smart00526 6 PYSEMITEAISALKERKGSSLQAIKKYIEANYK 38 (66)
T ss_pred CHHHHHHHHHHHcCCCCCCCHHHHHHHHHHhCC
Confidence 69999999998754 466899999999999966
No 14
>cd00073 H15 linker histone 1 and histone 5 domains; the basic subunit of chromatin is the nucleosome, consisting of an octamer of core histones, two full turns of DNA, a linker histone (H1 or H5) and a variable length of linker DNA; H1/H5 are chromatin-associated proteins that bind to the exterior of nucleosomes and dramatically stabilize the highly condensed states of chromatin fibers; stabilization of higher order folding occurs through electrostatic neutralization of the linker DNA segments, through a highly positively charged carboxy- terminal domain known as the AKP helix (Ala, Lys, Pro); thought to be involved in specific protein-protein and protein-DNA interactions and play a role in suppressing core histone tail domain acetylation in the chromatin fiber
Probab=87.10 E-value=0.94 Score=37.55 Aligned_cols=34 Identities=24% Similarity=0.327 Sum_probs=29.5
Q ss_pred CHHHHHHHHHhc-CCCCCcCHHHHHHHHHhhCccc
Q psy447 127 SYISLITMAIQN-SPTKMLTLSEIYQFIMDLFPFY 160 (417)
Q Consensus 127 SYa~LI~~AI~s-SP~krLTLsEIY~wI~~~fPYy 160 (417)
+|..||.+||.. .+.+..++..|..||+.+|+.-
T Consensus 6 ~y~~MI~eAI~~l~er~GsS~~aI~kyI~~~y~~~ 40 (88)
T cd00073 6 PYSEMVTEAIKALKERKGSSLQAIKKYIEAKYKVD 40 (88)
T ss_pred CHHHHHHHHHHHcCCCCCcCHHHHHHHHHHHCCcc
Confidence 699999999986 4567789999999999998854
No 15
>PF05066 HARE-HTH: HB1, ASXL, restriction endonuclease HTH domain; InterPro: IPR007759 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. The delta protein is a dispensable subunit of Bacillus subtilis RNA polymerase (RNAP) that has major effects on the biochemical properties of the purified enzyme. In the presence of delta, RNAP displays an increased specificity of transcription, a decreased affinity for nucleic acids, and an increased efficiency of RNA synthesis because of enhanced recycling []. The delta protein, contains two distinct regions, an N-terminal domain and a glutamate and aspartate residue-rich C-terminal region [].; GO: 0003677 DNA binding, 0006351 transcription, DNA-dependent; PDB: 2KRC_A.
Probab=82.31 E-value=0.88 Score=35.88 Aligned_cols=57 Identities=18% Similarity=0.232 Sum_probs=36.7
Q ss_pred CCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCccccccccccccccccccc-----cccccccc
Q psy447 126 YSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQNQQRWQNSIRHSLS-----FNDCFVKV 184 (417)
Q Consensus 126 ySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fPYyr~~~~gWKNSIRHNLS-----LNkcFvKV 184 (417)
+||..++...|+... +.|+.+||++.|++.--|=.. .+...++|+-.|. .+..|++|
T Consensus 1 mt~~eaa~~vL~~~~-~pm~~~eI~~~i~~~~~~~~~-~k~p~~~i~a~ly~~~~~~d~~F~~v 62 (72)
T PF05066_consen 1 MTFKEAAYEVLEEAG-RPMTFKEIWEEIQERGLYKKS-GKTPEATIAAQLYTDIKNEDSRFVKV 62 (72)
T ss_dssp S-HHHHHHHHHHHH--S-EEHHHHHHHHHHHHTS----GGGGGHHHH-HHHHHHH-T-SS-EES
T ss_pred CCHHHHHHHHHHhcC-CCcCHHHHHHHHHHhCCCCcc-cCCHHHHHHHHHHHHcccCCCCEEEe
Confidence 478888888888876 789999999999977654322 4556667763333 44589988
No 16
>PF14338 Mrr_N: Mrr N-terminal domain
Probab=73.79 E-value=1.9 Score=35.55 Aligned_cols=73 Identities=22% Similarity=0.402 Sum_probs=46.8
Q ss_pred CHHHHHHHHHhc--CCCCCcCHHHHHHHHHhhCcccc--------ccc--cccccccccccccccccccccCCCCCCCCc
Q psy447 127 SYISLITMAIQN--SPTKMLTLSEIYQFIMDLFPFYR--------QNQ--QRWQNSIRHSLSFNDCFVKVPRTPDKPGKG 194 (417)
Q Consensus 127 SYa~LI~~AI~s--SP~krLTLsEIY~wI~~~fPYyr--------~~~--~gWKNSIRHNLSLNkcFvKVpR~~~~pGKG 194 (417)
+|-.|+...|+. .-.+.++.+||++.|.++|..=. ... ..|+|.|+=.++--++.--|.+ +.+|
T Consensus 1 ~~~~~~~piL~~L~~~g~~~~~~ei~~~v~~~~~ls~e~~~~~~~sg~~~~~~~~ri~Wa~~~L~~aGli~~----~~rG 76 (92)
T PF14338_consen 1 TYDELMPPILEALKDLGGSASRKEIYERVAERFGLSDEERNERLPSGQGYSRFKNRIRWARSYLKKAGLIER----PKRG 76 (92)
T ss_pred CHHHHHHHHHHHHHHcCCCcCHHHHHHHHHHHhCCCHHHHHHHcccCCcchhHHHhHHHHHHHHHHCCCccC----CCCC
Confidence 355566555443 22578999999999999997431 111 2599998876665444444433 3444
Q ss_pred cceEecCCCc
Q psy447 195 SFWTLHKDSG 204 (417)
Q Consensus 195 s~WtLdPe~~ 204 (417)
+|.|.+...
T Consensus 77 -~~~iT~~G~ 85 (92)
T PF14338_consen 77 -IWRITEKGR 85 (92)
T ss_pred -ceEECHhHH
Confidence 999998754
No 17
>KOG4012|consensus
Probab=54.18 E-value=15 Score=36.71 Aligned_cols=37 Identities=27% Similarity=0.271 Sum_probs=31.5
Q ss_pred CCHHHHHHHHHhcC-CCCCcCHHHHHHHHHhhCccccc
Q psy447 126 YSYISLITMAIQNS-PTKMLTLSEIYQFIMDLFPFYRQ 162 (417)
Q Consensus 126 ySYa~LI~~AI~sS-P~krLTLsEIY~wI~~~fPYyr~ 162 (417)
-+|..||..||... ......+.-|+.||..+|+.|.-
T Consensus 45 P~~~~mi~eAi~a~keR~GsS~aAikK~i~~~Y~g~~v 82 (243)
T KOG4012|consen 45 PPYSEMITEAISALKERKGSSLAAIKKYIAANYPGDDV 82 (243)
T ss_pred CcHHHHHHHHHHHhhhcccchHHHHHHHHhhcCccchh
Confidence 45999999999875 45678999999999999987764
No 18
>COG2958 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=51.71 E-value=18 Score=36.91 Aligned_cols=60 Identities=20% Similarity=0.264 Sum_probs=43.2
Q ss_pred CCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCcc------ccccccc------cccccccccccccccccccC
Q psy447 126 YSYISLITMAIQNSPTKMLTLSEIYQFIMDLFPF------YRQNQQR------WQNSIRHSLSFNDCFVKVPR 186 (417)
Q Consensus 126 ySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fPY------yr~~~~g------WKNSIRHNLSLNkcFvKVpR 186 (417)
.+|..+|-..++. .+..+|..|||+|+.+.||- |....+. =--+.|-||-.+--|.||..
T Consensus 5 l~~~e~vl~~Lq~-~~ep~t~~eI~eka~e~~~~~~~~kk~~s~g~tp~q~va~~iy~~~~lq~~~pflKvte 76 (307)
T COG2958 5 LNLIEIVLSVLQT-SEEPFTAREIAEKAFETFPAECQEKKFDSGGDTPQQLVAEIIYTRPNLQEKHPFLKVTE 76 (307)
T ss_pred hhhHHHHHHHHHc-cCCCCcHHHHHHHHHHhccHHHHHHhhccccccHHHHHHHHHhcCccccccCccccccc
Confidence 3477777766766 78889999999999999983 2211111 22456788888899999843
No 19
>PF12872 OST-HTH: OST-HTH/LOTUS domain; PDB: 2KPM_A 3S93_B 3RCO_A 2KZV_A.
Probab=35.38 E-value=32 Score=26.54 Aligned_cols=41 Identities=12% Similarity=0.201 Sum_probs=29.7
Q ss_pred CHHHHHHHHHhcCC--CCCcCHHHHHHHHHhhCcccccccccc
Q psy447 127 SYISLITMAIQNSP--TKMLTLSEIYQFIMDLFPFYRQNQQRW 167 (417)
Q Consensus 127 SYa~LI~~AI~sSP--~krLTLsEIY~wI~~~fPYyr~~~~gW 167 (417)
.-..+|..+|.+.+ ++.+.|++|-.++.+.||-|....=|.
T Consensus 5 ~~~~~l~~ll~~~~~~~g~v~ls~l~~~~~~~~~~f~~~~yG~ 47 (74)
T PF12872_consen 5 ELKKLLRELLESQKGEDGWVSLSQLGQEYKKKYPDFDPRDYGF 47 (74)
T ss_dssp HHHHHHHHHHHHTCTTTSSEEHHHHHHHHHHHHTT--TCCTTS
T ss_pred HHHHHHHHHHHhCcCCCceEEHHHHHHHHHHHCCCCCccccCC
Confidence 35678888885544 347999999999999999887654443
No 20
>cd06649 PKc_MEK2 Catalytic domain of the dual-specificity Protein Kinase, MAP/ERK Kinase 2. Protein kinases (PKs), MAP/ERK Kinase (MEK) 2 subfamily, catalytic (c) domain. PKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine or tyrosine residues on protein substrates. The MEK subfamily is part of a larger superfamily that includes the catalytic domains of other protein serine/threonine kinases, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. The mitogen-activated protein (MAP) kinase signaling pathways are important mediators of cellular responses to extracellular signals. The pathways involve a triple kinase core cascade comprising the MAP kinase (MAPK), which is phosphorylated and activated by a MAPK kinase (MAPKK or MKK), which itself is phosphorylated and activated by a MAPK kinase kinase (MAPKKK or MKKK). MEK2 is a dual-specificity PK that phosphorylates and activates the downst
Probab=28.69 E-value=34 Score=33.03 Aligned_cols=48 Identities=10% Similarity=0.310 Sum_probs=38.2
Q ss_pred HHHHHHHHHhcCCCCCcCHHHHHHHHHhhCcccccc---ccccccccccccccccc
Q psy447 128 YISLITMAIQNSPTKMLTLSEIYQFIMDLFPFYRQN---QQRWQNSIRHSLSFNDC 180 (417)
Q Consensus 128 Ya~LI~~AI~sSP~krLTLsEIY~wI~~~fPYyr~~---~~gWKNSIRHNLSLNkc 180 (417)
+..||...++-.|++|.|..||.+ .|||+.. ...|++.|.-+|+++.-
T Consensus 277 l~~li~~~L~~~P~~Rpt~~ell~-----h~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (331)
T cd06649 277 FQEFVNKCLIKNPAERADLKMLMN-----HTFIKRSEVEEVDFAGWLCKTLRLNQP 327 (331)
T ss_pred HHHHHHHHccCCcccCCCHHHHhc-----ChHHhhcccccccHHHHHHHhhccccc
Confidence 689999999999999999999963 4666543 45688888888887743
No 21
>cd05071 PTKc_Src Catalytic domain of the Protein Tyrosine Kinase, Src. Protein Tyrosine Kinase (PTK) family; Src kinase; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Src (or c-Src) is a cytoplasmic (or non-receptor) tyr kinase, containing an N-terminal SH4 domain with a myristoylation site, followed by SH3 and SH2 domains, a tyr kinase domain, and a regulatory C-terminal region with a conserved tyr. It is activated by autophosphorylation at the tyr kinase domain, and is negatively regulated by phosphorylation at the C-terminal tyr by Csk (C-terminal Src Kinase). c-Src is the vertebrate homolog of the oncogenic protein (v-Src) from Rous sarcoma virus. Together with other Src subfamily proteins, it is invo
Probab=27.26 E-value=56 Score=30.03 Aligned_cols=34 Identities=24% Similarity=0.382 Sum_probs=30.1
Q ss_pred CCCHHHHHHHHHhcCCCCCcCHHHHHHHHHhhCc
Q psy447 125 PYSYISLITMAIQNSPTKMLTLSEIYQFIMDLFP 158 (417)
Q Consensus 125 PySYa~LI~~AI~sSP~krLTLsEIY~wI~~~fP 158 (417)
|-....||..++...|++|.|..||.++|.+-|+
T Consensus 229 ~~~l~~li~~~l~~~p~~Rp~~~~~~~~l~~~~~ 262 (262)
T cd05071 229 PESLHDLMCQCWRKEPEERPTFEYLQAFLEDYFT 262 (262)
T ss_pred CHHHHHHHHHHccCCcccCCCHHHHHHHHHHhcC
Confidence 4567899999999999999999999999998764
No 22
>PF08047 His_leader: Histidine operon leader peptide; InterPro: IPR012565 This family consists of the leader peptide of the histidine (his) operon. The his operon contains all the genes necessary for histidine biosynthesis. The region corresponding to the untranslated 5'-end of the transcript, named the his leader region, displays the typical features of the T box transcriptional attenuation mechanism which is involved in the regulation of many amino acid biosynthetic operons [].; GO: 0000105 histidine biosynthetic process
Probab=26.72 E-value=24 Score=21.56 Aligned_cols=8 Identities=25% Similarity=0.322 Sum_probs=3.2
Q ss_pred ccccccch
Q psy447 306 NSYHQSFH 313 (417)
Q Consensus 306 ~~~h~~hh 313 (417)
|-+||+||
T Consensus 6 fk~hhhhh 13 (16)
T PF08047_consen 6 FKHHHHHH 13 (16)
T ss_pred eccccccc
Confidence 33444443
No 23
>PF08430 Fork_head_N: Forkhead N-terminal region; InterPro: IPR013638 The region described in this entry is found towards the N terminus of various eukaryotic fork head/HNF-3-related transcription factors (which contain the IPR001766 from INTERPRO domain). These proteins play key roles in embryogenesis, maintenance of differentiated cell states, and tumorigenesis []. ; GO: 0008134 transcription factor binding, 0019904 protein domain specific binding
Probab=23.08 E-value=1.6e+02 Score=26.87 Aligned_cols=14 Identities=36% Similarity=0.420 Sum_probs=8.3
Q ss_pred CCCCCCCCccc-cccc
Q psy447 11 LGSPGMTPSYS-MNSV 25 (417)
Q Consensus 11 ~~~~~~~~~y~-~~~~ 25 (417)
.+|++|+. |. +|++
T Consensus 16 ~sMnsM~s-~~~~n~~ 30 (136)
T PF08430_consen 16 GSMNSMNS-YMSMNSM 30 (136)
T ss_pred Cccccccc-ccccccc
Confidence 44666666 65 5555
No 24
>KOG1528|consensus
Probab=22.54 E-value=38 Score=35.27 Aligned_cols=32 Identities=31% Similarity=0.553 Sum_probs=25.1
Q ss_pred cccccccccccccCCCCCCC-CccceEecCCCc
Q psy447 173 HSLSFNDCFVKVPRTPDKPG-KGSFWTLHKDSG 204 (417)
Q Consensus 173 HNLSLNkcFvKVpR~~~~pG-KGs~WtLdPe~~ 204 (417)
.-||..+...-|.|...+.| ||.+|.|||-.+
T Consensus 109 ~~ls~~dvl~aID~G~s~GG~kGrhWvLDPIDG 141 (351)
T KOG1528|consen 109 SPLSSDDVLKAIDRGNSEGGPKGRHWVLDPIDG 141 (351)
T ss_pred CCCCHHHHHHHHhcccccCCCCCceEEeccCCC
Confidence 45778888888888765554 899999999875
No 25
>cd08229 STKc_Nek7 Catalytic domain of the Protein Serine/Threonine Kinase, Never In Mitosis gene A-related kinase 7. Serine/Threonine Kinases (STKs), Never In Mitosis gene A (NIMA)-related kinase 7 (Nek7) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The Nek7 subfamily is one of a family of 11 different Neks (Nek1-11) that are involved in cell cycle control. The Nek family is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Nek7 is required for mitotic spindle formation and cytokinesis. It is enriched in the centrosome and is critical for microtubule nucleation. Nek7 is activated by Nek9 during mitosis, and may regulate the p70 ribosomal S6 kinase.
Probab=21.77 E-value=97 Score=28.14 Aligned_cols=30 Identities=20% Similarity=0.392 Sum_probs=26.1
Q ss_pred CHHHHHHHHHhcCCCCCcCHHHHHHHHHhh
Q psy447 127 SYISLITMAIQNSPTKMLTLSEIYQFIMDL 156 (417)
Q Consensus 127 SYa~LI~~AI~sSP~krLTLsEIY~wI~~~ 156 (417)
....||...+...|++|.|+.+||+-+.+.
T Consensus 236 ~~~~li~~~l~~~p~~Rpt~~~i~~~~~~~ 265 (267)
T cd08229 236 ELRQLVNMCINPDPEKRPDITYVYDVAKRM 265 (267)
T ss_pred HHHHHHHHhcCCCcccCCCHHHHHHHHhhh
Confidence 467888888889999999999999988764
No 26
>PF12317 IFT46_B_C: Intraflagellar transport complex B protein 46 C terminal; InterPro: IPR022088 This entry represents proteins is found in eukaryotes. Proteins are typically between 298 and 416 amino acids in length. It is thought to be a flagellar protein of complex B and like all IFT proteins, it is required for transport of IFT particles into the flagella [].
Probab=20.41 E-value=70 Score=31.56 Aligned_cols=63 Identities=25% Similarity=0.360 Sum_probs=38.8
Q ss_pred CHHHHHHHHHhhCccccccccccccccc---cccccccccccccCCCCCCCCccceEecCCCcccc
Q psy447 145 TLSEIYQFIMDLFPFYRQNQQRWQNSIR---HSLSFNDCFVKVPRTPDKPGKGSFWTLHKDSGNMF 207 (417)
Q Consensus 145 TLsEIY~wI~~~fPYyr~~~~gWKNSIR---HNLSLNkcFvKVpR~~~~pGKGs~WtLdPe~~~~f 207 (417)
-++|++++|.+--|-=-.-...+|-=|- -.+.-=++|+||+|..+.+-+-.+|.||+-+.++-
T Consensus 16 EikeLF~yI~~YtPq~iel~~kLkPFiPdyiPAVGdiDaFiKVpRPD~~~d~LGL~vLDEP~~~QS 81 (214)
T PF12317_consen 16 EIKELFQYITRYTPQEIELDTKLKPFIPDYIPAVGDIDAFIKVPRPDGKPDNLGLTVLDEPSANQS 81 (214)
T ss_pred HHHHHHHHHhhcCCccccCCCcccccCccccccccCcccceeccCCCCCccccCCEEecCCCcccC
Confidence 5778888888866521111111221111 01122369999999988887788999999876543
Done!