Query psy2634
Match_columns 749
No_of_seqs 463 out of 1539
Neff 5.1
Searched_HMMs 46136
Date Fri Aug 16 21:12:12 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2634.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2634hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3547|consensus 100.0 2E-111 5E-116 925.6 31.3 375 1-388 1-376 (450)
2 PF01062 Bestrophin: Bestrophi 100.0 2.9E-54 6.4E-59 453.8 24.6 282 1-332 1-292 (293)
3 COG3781 Predicted membrane pro 100.0 1.2E-52 2.6E-57 427.6 23.2 272 16-334 7-288 (306)
4 PF00856 SET: SET domain; Int 99.1 4.9E-11 1.1E-15 111.1 4.8 52 681-732 109-162 (162)
5 smart00317 SET SET (Su(var)3-9 98.9 1.1E-09 2.3E-14 98.3 3.7 46 686-731 69-116 (116)
6 KOG2084|consensus 97.9 1.9E-05 4E-10 88.0 6.4 65 682-748 197-264 (482)
7 KOG4442|consensus 97.6 4.2E-05 9.2E-10 88.9 4.2 40 692-731 195-236 (729)
8 KOG2589|consensus 97.6 4.1E-05 8.8E-10 83.5 3.3 43 691-734 197-239 (453)
9 KOG1080|consensus 97.4 9.1E-05 2E-09 90.6 4.0 43 691-733 940-984 (1005)
10 COG2940 Proteins containing SE 96.3 0.0016 3.4E-08 74.8 1.8 45 690-734 405-451 (480)
11 KOG1082|consensus 95.6 0.0072 1.6E-07 67.2 2.9 44 691-734 273-322 (364)
12 KOG1083|consensus 94.1 0.039 8.4E-07 67.4 3.7 41 692-732 1252-1294(1306)
13 KOG1079|consensus 92.1 0.12 2.5E-06 61.0 3.5 43 691-733 666-710 (739)
14 KOG1085|consensus 91.4 0.13 2.9E-06 55.3 2.8 45 691-735 334-380 (392)
15 KOG1141|consensus 90.4 0.12 2.6E-06 62.0 1.3 44 691-734 1190-1239(1262)
16 KOG1337|consensus 90.4 0.22 4.7E-06 57.4 3.4 49 685-734 231-279 (472)
17 KOG1081|consensus 72.6 1.5 3.3E-05 50.8 0.9 49 685-733 365-416 (463)
18 PF14023 DUF4239: Protein of u 60.1 2E+02 0.0044 29.3 13.8 54 79-134 9-66 (209)
19 PF02937 COX6C: Cytochrome c o 44.3 85 0.0018 27.8 6.5 42 24-65 8-49 (73)
20 KOG1338|consensus 41.7 19 0.00042 41.0 2.7 50 681-732 209-259 (466)
21 COG1422 Predicted membrane pro 40.1 2.2E+02 0.0048 29.9 9.8 66 190-261 82-148 (201)
22 TIGR02059 swm_rep_I cyanobacte 35.9 50 0.0011 31.0 4.0 30 706-735 70-99 (101)
23 PF10166 DUF2368: Uncharacteri 35.3 24 0.00052 34.4 1.9 19 74-92 78-96 (131)
24 PF05297 Herpes_LMP1: Herpesvi 33.2 14 0.00031 40.4 0.0 41 78-118 139-188 (381)
25 KOG4544|consensus 32.8 26 0.00056 34.5 1.7 50 35-92 50-99 (144)
26 TIGR00355 purH phosphoribosyla 29.7 41 0.00088 39.7 2.9 71 455-527 428-507 (511)
27 PF10947 DUF2628: Protein of u 26.8 75 0.0016 29.2 3.6 37 17-53 39-75 (108)
28 PF07234 DUF1426: Protein of u 23.7 1.2E+02 0.0027 28.4 4.2 57 19-101 8-65 (117)
29 PF11464 Rbsn: Rabenosyn Rab b 22.9 94 0.002 24.8 2.9 32 189-220 5-36 (42)
30 KOG2461|consensus 22.0 76 0.0017 36.4 3.2 25 711-735 122-146 (396)
31 PF07763 FEZ: FEZ-like protein 21.3 37 0.00081 36.4 0.5 16 507-522 113-128 (244)
No 1
>KOG3547|consensus
Probab=100.00 E-value=2.3e-111 Score=925.63 Aligned_cols=375 Identities=47% Similarity=0.889 Sum_probs=354.8
Q ss_pred CceecccccccccCchhHHHHHHHhcCchHhHHHHHHHHHHHHHHHHHHHHHhhcChhhhhhHHHHHhhhcccCCcccee
Q psy2634 1 MTVTYTAEVASCTGFGCFLKLLARWRGSIYKLVWPDLMAYLTLYYALNLTYRYQLNPEQKKTFESIVEYCDTYSNLIPLS 80 (749)
Q Consensus 1 MTvsY~~~vat~~~~~~flrLLfrWRGSI~K~Iw~eLLv~l~ly~~vs~iYr~~L~~~~k~~Fe~i~~y~~~~t~~IpLs 80 (749)
|||+|+.+|+| +.+++|+||||||||||||+||+||++|+++|++|+++||++|++.||++||+++.||+.+.++|||+
T Consensus 1 MTv~Y~~~v~t-~~~~~f~kll~rWkGSiwKai~~el~~~l~~Y~~i~~iYR~~l~~~qq~~Fe~~~~~~d~~~~~iPLt 79 (450)
T KOG3547|consen 1 MTVSYNLDVAT-SYSFTFLKLLLRWKGSIWKAIWKELLIWLILYYIISVIYRFALTESQQETFEKFVMYCDSRLDFIPLT 79 (450)
T ss_pred CeeeehHHhhh-hccHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHhcccccccch
Confidence 99999999999 66899999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eeehhhHhHHHHHHHHHHhcCCcchhhHHHHHHhcCCchhhhHHHHHHHHHHHHHHHHHHhhccchhhhccCCCchhHHH
Q psy2634 81 FVLGFYISIIMTRWWDQYNNIPWPDAMAVYVSSQVHGQDERGRMMRRTIMRYVCVCITMVLANISPRVKKRFPSLDHFVE 160 (749)
Q Consensus 81 f~LGF~vn~vy~RWWe~r~~ig~~d~lA~~v~~~i~g~der~rliRRtI~RYl~La~vL~~R~IS~~lRkRfPt~~~Lv~ 160 (749)
|+|||||++|++|||+++.++||||++|..++++++|.+|++|++||+|+||++|+++|+|||||.+|||||||++++|+
T Consensus 80 FmLGFfVt~Vv~RW~~~f~nig~id~~a~~vs~~i~g~d~~~r~~RRtIiRY~vLsqvLv~RdIS~~VrkRFPt~d~lv~ 159 (450)
T KOG3547|consen 80 FMLGFFVTIVVDRWWKQFRNIGWIDNLALLVSAYIPGTDERARLIRRTIIRYLVLSQVLVFRDISMKVRKRFPTLDHLVE 159 (450)
T ss_pred HhhhhhHHHHHHHHHHHHhccCchHHHHHHHHHHccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hcCCCHHHHHHHHhcccCCCCCCCchHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHhhHhhcCCcccCchHH
Q psy2634 161 VGLLLENEKNIISAINDKFPGPPKHWMPIVWAASVVTRARKEGRVRDDFSVKSIIDELNRIRGQCGKLLSYDTVSVPLVY 240 (749)
Q Consensus 161 aGlLteeE~~~le~~~~~~p~~~k~wvpi~W~~~ll~~arkeGrI~~~~~l~~ll~eL~~fR~~CerL~~yDwtPIPLaY 240 (749)
||||+++|+++++++...+ +.|||+||+|+.+++++|+++|+|.++..++.+++||.+||.+|+.|++|||+||||||
T Consensus 160 AG~mt~~E~~~~~~~~~~~--~~kyw~Pi~Wa~sl~~~a~~~g~i~~d~~~~~~~~ei~~fR~~l~~L~~yDwVpIPL~Y 237 (450)
T KOG3547|consen 160 AGLMTEEELDILENIVDSK--YMKYWVPINWAFSLLRKARKEGRIESDYLYNSLLGEIKKFRTGLSLLCNYDWVPIPLVY 237 (450)
T ss_pred hcCCcHHHHHHHHhccccc--ccceeecHHHHHHHHHHHHHcCCccccHHHHHHHHHHHHHHHhhccccCCCceeccccc
Confidence 9999999999999987643 34999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCcccccccccccchhHHHHHHHHHHHHHHHHHHHcCCCCCCCCchhHH
Q psy2634 241 TQVVTLAVYSFLITTIMGRQWLDVPAESDTSHPLRIHNNYIDLYFPIFTILQFFFYMGWLKVAEVLINPFGDDDEDFEVN 320 (749)
Q Consensus 241 tQvVtlaVY~Yfl~~lfgrQ~~~~~~~~~~~~~~~~~~~~id~y~Pi~T~Lqffff~GwlKVAe~LiNPFGeDDDDfeln 320 (749)
+|||++|||+||++|+||||+++.+.+ ..+.+|+|||+||++||+||||||||||.|+|||||||||||||
T Consensus 238 pQvV~lAVr~YF~~cL~~RQ~l~~~~~---------~~~~id~~fPi~T~lQFiF~vGWmKVaE~LlNP~GEDDDDFE~N 308 (450)
T KOG3547|consen 238 PQVVFLAVRSYFFICLFGRQYLDRDDS---------KKTEIDLYFPIMTILQFIFYVGWLKVAEVLLNPLGEDDDDFECN 308 (450)
T ss_pred cceeeeehHHhHHHHhhccccCCcccc---------cccccceeehhHHHHHHHHHhhHHHHHHHHhCCCCCCcchhhHH
Confidence 999999999999999999999997641 25789999999999999999999999999999999999999999
Q ss_pred HHHHHHhhhheeeccCccCCCCCcccccCCCCCCCCCCCccccccccc-cCCCCCccceeeeccccccc
Q psy2634 321 WMIDRNIQVSYLIVDGMHHEHPDLVRDQYWDDVFPSELPYTSAAEPFR-EKIPQPSTANIAVDACQVEY 388 (749)
Q Consensus 321 ~LIdRNievslliVDe~~~~~P~l~~D~yw~~~~~~~lPyt~~s~~~~-~~~~~~s~~~~~~~~~~~~~ 388 (749)
||||||++++|+|||++|+++|+|++|+||++..+. +.|+.+++..+ .++++||++++++.+.+++.
T Consensus 309 ~liDRNl~vgl~IVD~~~~~~P~l~~D~f~d~~~~~-p~y~~~s~~~~~~~~~~gS~~~~~~~~~~~~~ 376 (450)
T KOG3547|consen 309 YLIDRNLTVGLAIVDEMHDDHPELEKDQFWDDVDVL-PLYSEASADSKNNHALVGSAANVSLAKEDEEN 376 (450)
T ss_pred HhhhhhHHHHHHHhhhhhccCccccccccCCCCCcC-CCCchhhhhccccccCCCChhhhhcccccccc
Confidence 999999999999999999999999999999998775 44776654444 44599999999988765433
No 2
>PF01062 Bestrophin: Bestrophin, RFP-TM, chloride channel; InterPro: IPR021134 Bestrophin is a 68kDa basolateral plasma membrane protein expressed in retinal pigment epithelial cells (RPE). It is encoded by the VMD2 gene, which is mutated in Best macular dystrophy, a disease characterised by a depressed light peak in the electrooculogram []. VMD2 encodes a 585-amino acid protein with an approximate mass of 68 kDa which has been designated bestrophin. Bestrophin shares homology with the Caenorhabditis elegans RFP gene family, named for the presence of a conserved arginine (R), phenylalanine (F), proline (P), amino acid sequence motif. Bestrophin is a plasma membrane protein, localised to the basolateral surface of RPE cells consistent with a role for bestrophin in the generation or regulation of the EOG light peak. Bestrophin and other RFP family members represent a new class of chloride channels, indicating a direct role for bestrophin in generating the light peak []. The VMD2 gene underlying Best disease was shown to represent the first human member of the RFP-TM protein family. More than 97% of the disease-causing mutations are located in the N-terminal RFP-TM domain implying important functional properties []. This entry also includes a number of hypothetical proteins belonging to protein family UPF0187.
Probab=100.00 E-value=2.9e-54 Score=453.79 Aligned_cols=282 Identities=30% Similarity=0.562 Sum_probs=237.9
Q ss_pred CceecccccccccCchhHHHHHHHhcCchHhHHHHHHHHHHHHHHHHHHHHHhhcChhhhhhHHHHHhhhcccCC--c--
Q psy2634 1 MTVTYTAEVASCTGFGCFLKLLARWRGSIYKLVWPDLMAYLTLYYALNLTYRYQLNPEQKKTFESIVEYCDTYSN--L-- 76 (749)
Q Consensus 1 MTvsY~~~vat~~~~~~flrLLfrWRGSI~K~Iw~eLLv~l~ly~~vs~iYr~~L~~~~k~~Fe~i~~y~~~~t~--~-- 76 (749)
|||+|+.+|.+. .+|+++||+|||||||.||+++++++++++++.++|.+ .....++.+.+. +
T Consensus 1 m~~~y~~~~~~~---~~~~~~l~~~~gsv~~~i~~~ll~~~~~~~~v~~~~~~----------~~~~~~~~~~~~~~l~g 67 (293)
T PF01062_consen 1 MTVSYNQDVRTS---RSWWRLLFRWRGSVLKRIWPPLLVFLALAAAVALLYIF----------EPICWLSLPSSPFSLLG 67 (293)
T ss_pred CCCccccccCCc---ccHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhh----------cccccccccccchhHHH
Confidence 999999999998 47999999999999999999999999999999999833 223344455544 3
Q ss_pred cceeeeehhhHhHHHHHHHHHHhcCCcc----hhhHHHHHHhcCCchhhhHHHHHHHHHHHHHHHHHHhhccchhhhccC
Q psy2634 77 IPLSFVLGFYISIIMTRWWDQYNNIPWP----DAMAVYVSSQVHGQDERGRMMRRTIMRYVCVCITMVLANISPRVKKRF 152 (749)
Q Consensus 77 IpLsf~LGF~vn~vy~RWWe~r~~ig~~----d~lA~~v~~~i~g~der~rliRRtI~RYl~La~vL~~R~IS~~lRkRf 152 (749)
+.|+|+||||+|++|+||||+|+.||.+ +++++++++++++++++.+++|++++ |+.+...+ +|+.+...+
T Consensus 68 ~~l~~lL~Fr~n~ay~Rwwe~r~~wg~l~~~~r~l~~~~~~~~~~~~~~~~l~r~~ia-~~~~~~~~-lR~~~~~~~--- 142 (293)
T PF01062_consen 68 IALSFLLVFRTNTAYDRWWEARKLWGNLVNWSRNLARQICAFVPDDDERRRLLRRLIA-FAVALKAH-LRGESTSEE--- 142 (293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhHHHHHHHHHH-HHHHHHHH-hcCCChhhh---
Confidence 5699999999999999999999997775 68999999999998889999999999 99988888 888877666
Q ss_pred CCchhHHHhcCCCHHHHHHHHhcccCCCCCCCchHHHHHHHHHHHHHHHcCCCCCHH--HHHHHHHHHHHHHHHhhHhhc
Q psy2634 153 PSLDHFVEVGLLLENEKNIISAINDKFPGPPKHWMPIVWAASVVTRARKEGRVRDDF--SVKSIIDELNRIRGQCGKLLS 230 (749)
Q Consensus 153 Pt~~~Lv~aGlLteeE~~~le~~~~~~p~~~k~wvpi~W~~~ll~~arkeGrI~~~~--~l~~ll~eL~~fR~~CerL~~ 230 (749)
+++++++| ++++|. .++... +.+.+++..|...+ ++++++|.+++.. .+.+.+.++.+..|+||||.
T Consensus 143 --~~~ll~~g-~~~~~~-~l~~~~-----~~p~~i~~~~~~~l-~~~~~~g~~~~~~~~~l~~~l~~l~~~~g~~eri~- 211 (293)
T PF01062_consen 143 --LEDLLEAG-LTEEER-RLESAD-----NPPNWILIQWSQRL-REARREGLIDSFQLAQLDNELNALRDAQGGCERIK- 211 (293)
T ss_pred --hhhhhhcc-chHHHH-HHHhhc-----CcHHHHHHHHHHHH-HHHHHcCCCChHHHHHHHHHHHHHHHHHhhhheeC-
Confidence 78888999 888888 666422 34567777777777 9999999965443 44555555555555555555
Q ss_pred CCcccCchHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCcccccccccccchhHHHHHHHHHHHHHHHHHHHcCCC
Q psy2634 231 YDTVSVPLVYTQVVTLAVYSFLITTIMGRQWLDVPAESDTSHPLRIHNNYIDLYFPIFTILQFFFYMGWLKVAEVLINPF 310 (749)
Q Consensus 231 yDwtPIPLaYtQvVtlaVY~Yfl~~lfgrQ~~~~~~~~~~~~~~~~~~~~id~y~Pi~T~Lqffff~GwlKVAe~LiNPF 310 (749)
+||||++|+|++++++|+||+++|||+ +.+++|++|+++++++++|+||++||++|+|||
T Consensus 212 --~TPiP~~Y~~~~~~~l~~y~~~lPf~l------------------~~~~~~~~~~~~~l~~~~~~gl~~ig~~ledPF 271 (293)
T PF01062_consen 212 --NTPIPFAYTQHLSRFLYIYLLLLPFGL------------------VDSLGWLTPPITFLVSFFFLGLEEIGEELEDPF 271 (293)
T ss_pred --CCcCCHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 899999999999999999999999999 457899999999999999999999999999999
Q ss_pred CCCCCchhHHHHHHHHhhhhee
Q psy2634 311 GDDDEDFEVNWMIDRNIQVSYL 332 (749)
Q Consensus 311 GeDDDDfeln~LIdRNievsll 332 (749)
|.|+||||++.+ +++|+..+.
T Consensus 272 g~d~~dlpl~~~-~~~ie~~l~ 292 (293)
T PF01062_consen 272 GNDPNDLPLDAI-CRTIERNLR 292 (293)
T ss_pred CcCCCcCcHHHH-HHHHHHHhc
Confidence 999999999999 777776653
No 3
>COG3781 Predicted membrane protein [Function unknown]
Probab=100.00 E-value=1.2e-52 Score=427.56 Aligned_cols=272 Identities=18% Similarity=0.355 Sum_probs=240.2
Q ss_pred hhHHHHHHHhcCchHhHHHHHHHHHHHHHHHHHHHHHhhcChhhhhhHHHHHhhhcccCC-ccceeeeehhhHhHHHHHH
Q psy2634 16 GCFLKLLARWRGSIYKLVWPDLMAYLTLYYALNLTYRYQLNPEQKKTFESIVEYCDTYSN-LIPLSFVLGFYISIIMTRW 94 (749)
Q Consensus 16 ~~flrLLfrWRGSI~K~Iw~eLLv~l~ly~~vs~iYr~~L~~~~k~~Fe~i~~y~~~~t~-~IpLsf~LGF~vn~vy~RW 94 (749)
.+|++++|.|||||.+.|+++++++++++.++.++|.+.....++ |. ..+|+. .|.|++|||||||++|+||
T Consensus 7 ~hw~r~~f~~rGSVl~~i~~rll~~~~fS~~v~~~~~w~~~~~i~--~~-----~tpf~i~giaLai~L~FRnNaaYdR~ 79 (306)
T COG3781 7 QHWLRLIFVWRGSVLPKILPRLLLCFLFSLVVTAVLPWYTPLSIP--FT-----LTPFSIPGIALAIFLGFRNNAAYDRY 79 (306)
T ss_pred HHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccce--ec-----cCcchhHHHHHHHHHhhccchHHHHH
Confidence 579999999999999999999999999999999999998765552 22 255655 3899999999999999999
Q ss_pred HHHHhcCCcc----hhhHHHHHHhcCCc---hhhhHHHHHHHHHHHHHHHHHHhhccchhhhccCCCchhHHHhcCCCHH
Q psy2634 95 WDQYNNIPWP----DAMAVYVSSQVHGQ---DERGRMMRRTIMRYVCVCITMVLANISPRVKKRFPSLDHFVEVGLLLEN 167 (749)
Q Consensus 95 We~r~~ig~~----d~lA~~v~~~i~g~---der~rliRRtI~RYl~La~vL~~R~IS~~lRkRfPt~~~Lv~aGlLtee 167 (749)
||+||.||-+ .|++|++.+.++.. .+...++|-.+ +++++++ .++||.-++ +++ +.++.++
T Consensus 80 wEaRKlWG~lvn~~Rnl~R~i~t~v~e~~d~a~~~~~l~llv----AFahalr-----~~LR~qp~~-~~l--~a~l~~~ 147 (306)
T COG3781 80 WEARKLWGQLVNAIRNLARQIWTLVPETLDSADVREFLRLLV----AFAHALR-----LQLRKQPQN-EDL--AALLPTS 147 (306)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCcccChHHHHHHHHHHH----HHHHHHH-----HHHhCCCch-HHH--HHhcCHH
Confidence 9999999997 48999999999843 23333444444 5888887 788865444 554 4699998
Q ss_pred HHHHHHhcccCCCCCCCchHHHHHHHHHHHHHHHcCCCCCHH--HHHHHHHHHHHHHHHhhHhhcCCcccCchHHHHHHH
Q psy2634 168 EKNIISAINDKFPGPPKHWMPIVWAASVVTRARKEGRVRDDF--SVKSIIDELNRIRGQCGKLLSYDTVSVPLVYTQVVT 245 (749)
Q Consensus 168 E~~~le~~~~~~p~~~k~wvpi~W~~~ll~~arkeGrI~~~~--~l~~ll~eL~~fR~~CerL~~yDwtPIPLaYtQvVt 245 (749)
.+++++..+ |++.....|+++++++.+++|.+++.+ +++++++.|.++.|+||||.+ |||||||+.++|
T Consensus 148 ~~~kv~a~~------npp~ei~~wmGe~l~~q~r~g~l~~~~~~sl~~~L~~~s~vlggCERI~~---TPiPfAYtl~lh 218 (306)
T COG3781 148 DYEKVLASN------NPPLEIALWMGEWLQQQRRNGQLDAIQFTSLDRRLNSISAVLGGCERIAY---TPIPFAYTLHLH 218 (306)
T ss_pred HHHHHHhcc------CCHHHHHHHHHHHHHHHHhcCCccHHHHHHHHHHHHHHHHHHHhHHHHhc---CCccHHHHHHHH
Confidence 899998864 678899999999999999999998886 899999999999999999995 999999999999
Q ss_pred HHHHHHHHHHHhhccccCCCCCCCCCCCcccccccccccchhHHHHHHHHHHHHHHHHHHHcCCCCCCCCchhHHHHHHH
Q psy2634 246 LAVYSFLITTIMGRQWLDVPAESDTSHPLRIHNNYIDLYFPIFTILQFFFYMGWLKVAEVLINPFGDDDEDFEVNWMIDR 325 (749)
Q Consensus 246 laVY~Yfl~~lfgrQ~~~~~~~~~~~~~~~~~~~~id~y~Pi~T~Lqffff~GwlKVAe~LiNPFGeDDDDfeln~LIdR 325 (749)
++||+||+++|||+ +.+++|++|++++++.|+|||++++|++++||||.|+||++||+| |+
T Consensus 219 rtvyl~C~~LPF~l------------------V~tlgw~Tp~~s~lIayTffgleaia~EiEdPFGte~NDLpLD~i-c~ 279 (306)
T COG3781 219 RTVYLFCFLLPFGL------------------VETLGWATPFFSVLIAYTFFGLEAIAEEIEDPFGTEANDLPLDAI-CN 279 (306)
T ss_pred HHHHHHHHHhHHHH------------------HHHcccchHHHHHHHHHHHHHHHHHHHHHhCccCCCCCcCcHHHH-HH
Confidence 99999999999998 789999999999999999999999999999999999999999998 99
Q ss_pred Hhhhheeec
Q psy2634 326 NIQVSYLIV 334 (749)
Q Consensus 326 NievslliV 334 (749)
.||+++...
T Consensus 280 ~ie~~l~dl 288 (306)
T COG3781 280 TIEINLLDL 288 (306)
T ss_pred HHHHHHHHH
Confidence 999998876
No 4
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.12 E-value=4.9e-11 Score=111.07 Aligned_cols=52 Identities=46% Similarity=0.678 Sum_probs=44.9
Q ss_pred CCcceEeecccccccCCCCCCceeeee--CCccEEEEEEecCCCCCCeEEEecC
Q psy2634 681 NNEGSALYKLQSSSNHSCVPNAEIKFP--YSNFVLNMVATRDIASGDEICISYM 732 (749)
Q Consensus 681 ~~~G~glyp~~S~~NHSC~PNa~~~f~--~~~~~~~v~A~r~I~~GEEItISYi 732 (749)
...+.++||.++++||||.|||.+.|+ ..+..+.++|.|+|++||||+|||.
T Consensus 109 ~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~isYG 162 (162)
T PF00856_consen 109 DRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFISYG 162 (162)
T ss_dssp EEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEEST
T ss_pred cccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEEEC
Confidence 456789999999999999999999987 3457999999999999999999994
No 5
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=98.89 E-value=1.1e-09 Score=98.28 Aligned_cols=46 Identities=37% Similarity=0.530 Sum_probs=40.5
Q ss_pred EeecccccccCCCCCCceeeeeCCcc--EEEEEEecCCCCCCeEEEec
Q psy2634 686 ALYKLQSSSNHSCVPNAEIKFPYSNF--VLNMVATRDIASGDEICISY 731 (749)
Q Consensus 686 glyp~~S~~NHSC~PNa~~~f~~~~~--~~~v~A~r~I~~GEEItISY 731 (749)
.+++.++++||||.|||...+...++ .+.++|+|+|++|||||++|
T Consensus 69 ~~~~~~~~iNHsc~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y 116 (116)
T smart00317 69 RKGNIARFINHSCEPNCELLFVEVNGDSRIVIFALRDIKPGEELTIDY 116 (116)
T ss_pred ccCcHHHeeCCCCCCCEEEEEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence 37899999999999999988754333 59999999999999999999
No 6
>KOG2084|consensus
Probab=97.87 E-value=1.9e-05 Score=87.97 Aligned_cols=65 Identities=32% Similarity=0.464 Sum_probs=55.4
Q ss_pred CcceEeecccccccCCCCCCceeeeeCCccEEEEEEecCCCCCC-eEEEecCCCC--hHHHHHHHHHhhc
Q psy2634 682 NEGSALYKLQSSSNHSCVPNAEIKFPYSNFVLNMVATRDIASGD-EICISYMDED--LLERGREKRNNYL 748 (749)
Q Consensus 682 ~~G~glyp~~S~~NHSC~PNa~~~f~~~~~~~~v~A~r~I~~GE-EItISYid~~--~~~Rqr~Lr~~Yl 748 (749)
..|.|+||..+++||||.||+...|++. ...+++...+.+++ ||+++|++.. +..|+++|++.|+
T Consensus 197 ~~~~~l~~~~~~~~hsC~pn~~~~~~~~--~~~~~~~~~~~~~~~~l~~~y~~~~~~~~~r~~~l~~~~~ 264 (482)
T KOG2084|consen 197 FLGRGLFPGSSLFNHSCFPNISVIFDGR--GLALLVPAGIDAGEEELTISYTDPLLSTASRQKQLRQSKL 264 (482)
T ss_pred cceeeecccchhcccCCCCCeEEEECCc--eeEEEeecccCCCCCEEEEeecccccCHHHHHHHHhhccc
Confidence 4899999999999999999999888864 46667777777776 9999999974 6789999998864
No 7
>KOG4442|consensus
Probab=97.61 E-value=4.2e-05 Score=88.88 Aligned_cols=40 Identities=35% Similarity=0.639 Sum_probs=33.2
Q ss_pred ccccCCCCCCceee-ee-CCccEEEEEEecCCCCCCeEEEec
Q psy2634 692 SSSNHSCVPNAEIK-FP-YSNFVLNMVATRDIASGDEICISY 731 (749)
Q Consensus 692 S~~NHSC~PNa~~~-f~-~~~~~~~v~A~r~I~~GEEItISY 731 (749)
-++||||+|||.+. |. .+..++-++|.+.|++|||||+.|
T Consensus 195 RFiNHSC~PNa~~~KWtV~~~lRvGiFakk~I~~GEEITFDY 236 (729)
T KOG4442|consen 195 RFINHSCDPNAEVQKWTVPDELRVGIFAKKVIKPGEEITFDY 236 (729)
T ss_pred HhhcCCCCCCceeeeeeeCCeeEEEEeEecccCCCceeeEec
Confidence 35899999999765 43 234588899999999999999999
No 8
>KOG2589|consensus
Probab=97.58 E-value=4.1e-05 Score=83.46 Aligned_cols=43 Identities=28% Similarity=0.336 Sum_probs=36.8
Q ss_pred cccccCCCCCCceeeeeCCccEEEEEEecCCCCCCeEEEecCCC
Q psy2634 691 QSSSNHSCVPNAEIKFPYSNFVLNMVATRDIASGDEICISYMDE 734 (749)
Q Consensus 691 ~S~~NHSC~PNa~~~f~~~~~~~~v~A~r~I~~GEEItISYid~ 734 (749)
++++||+|.|||...-.++ .++.|+++|||++|||||--|.+.
T Consensus 197 aafINHDCrpnCkFvs~g~-~tacvkvlRDIePGeEITcFYgs~ 239 (453)
T KOG2589|consen 197 AAFINHDCRPNCKFVSTGR-DTACVKVLRDIEPGEEITCFYGSG 239 (453)
T ss_pred HHhhcCCCCCCceeecCCC-ceeeeehhhcCCCCceeEEeeccc
Confidence 5789999999998775543 589999999999999999999653
No 9
>KOG1080|consensus
Probab=97.44 E-value=9.1e-05 Score=90.59 Aligned_cols=43 Identities=33% Similarity=0.449 Sum_probs=36.1
Q ss_pred cccccCCCCCCceeee--eCCccEEEEEEecCCCCCCeEEEecCC
Q psy2634 691 QSSSNHSCVPNAEIKF--PYSNFVLNMVATRDIASGDEICISYMD 733 (749)
Q Consensus 691 ~S~~NHSC~PNa~~~f--~~~~~~~~v~A~r~I~~GEEItISYid 733 (749)
+.++||||.|||.... .++..+|+|+|.|+|.+|||||..|--
T Consensus 940 Ar~InHsC~PNCyakvi~V~g~~~IvIyakr~I~~~EElTYDYkF 984 (1005)
T KOG1080|consen 940 ARFINHSCNPNCYAKVITVEGDKRIVIYSKRDIAAGEELTYDYKF 984 (1005)
T ss_pred hheeecccCCCceeeEEEecCeeEEEEEEecccccCceeeeeccc
Confidence 6789999999996542 334569999999999999999999943
No 10
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=96.33 E-value=0.0016 Score=74.80 Aligned_cols=45 Identities=33% Similarity=0.458 Sum_probs=37.9
Q ss_pred ccccccCCCCCCceeeeeCCcc--EEEEEEecCCCCCCeEEEecCCC
Q psy2634 690 LQSSSNHSCVPNAEIKFPYSNF--VLNMVATRDIASGDEICISYMDE 734 (749)
Q Consensus 690 ~~S~~NHSC~PNa~~~f~~~~~--~~~v~A~r~I~~GEEItISYid~ 734 (749)
.+.++||||.||+.......++ .+.++|++||++||||++.|-..
T Consensus 405 ~~r~~nHS~~pN~~~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~ 451 (480)
T COG2940 405 VARFINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPS 451 (480)
T ss_pred ccceeecCCCCCcceecccccccceeeecccccchhhhhhccccccc
Confidence 3568999999999888655433 78899999999999999999654
No 11
>KOG1082|consensus
Probab=95.62 E-value=0.0072 Score=67.19 Aligned_cols=44 Identities=30% Similarity=0.481 Sum_probs=34.7
Q ss_pred cccccCCCCCCceeee--eCC----ccEEEEEEecCCCCCCeEEEecCCC
Q psy2634 691 QSSSNHSCVPNAEIKF--PYS----NFVLNMVATRDIASGDEICISYMDE 734 (749)
Q Consensus 691 ~S~~NHSC~PNa~~~f--~~~----~~~~~v~A~r~I~~GEEItISYid~ 734 (749)
+-++||||.||+.+.. .+. -..+.++|+++|.+|+|||..|...
T Consensus 273 ~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~I~p~~ELT~dYg~~ 322 (364)
T KOG1082|consen 273 ARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRDISPGEELTLDYGKA 322 (364)
T ss_pred cccccCCCCccceeeeeeecCCccchheeeeeeccccCCCcccchhhccc
Confidence 3468999999997663 221 1367899999999999999999754
No 12
>KOG1083|consensus
Probab=94.14 E-value=0.039 Score=67.39 Aligned_cols=41 Identities=32% Similarity=0.677 Sum_probs=33.1
Q ss_pred ccccCCCCCCceee-eeC-CccEEEEEEecCCCCCCeEEEecC
Q psy2634 692 SSSNHSCVPNAEIK-FPY-SNFVLNMVATRDIASGDEICISYM 732 (749)
Q Consensus 692 S~~NHSC~PNa~~~-f~~-~~~~~~v~A~r~I~~GEEItISYi 732 (749)
-.+||+|.|||... |.- +..++.|+|.|||.+||||+.-|-
T Consensus 1252 RfinhscKPNc~~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN 1294 (1306)
T KOG1083|consen 1252 RFINHSCKPNCEMQKWSVNGEYRVGLFALRDLPKGEELTYDYN 1294 (1306)
T ss_pred cccccccCCCCccccccccceeeeeeeecCCCCCCceEEEecc
Confidence 35799999999654 433 235889999999999999999983
No 13
>KOG1079|consensus
Probab=92.15 E-value=0.12 Score=61.00 Aligned_cols=43 Identities=28% Similarity=0.407 Sum_probs=36.1
Q ss_pred cccccCCCCCCceee--eeCCccEEEEEEecCCCCCCeEEEecCC
Q psy2634 691 QSSSNHSCVPNAEIK--FPYSNFVLNMVATRDIASGDEICISYMD 733 (749)
Q Consensus 691 ~S~~NHSC~PNa~~~--f~~~~~~~~v~A~r~I~~GEEItISYid 733 (749)
+-..|||=.|||... +..+..++-++|.|.|.+|||||+.|-.
T Consensus 666 ~rFANHS~nPNCYAkvm~V~GdhRIGifAkRaIeagEELffDYrY 710 (739)
T KOG1079|consen 666 IRFANHSFNPNCYAKVMMVAGDHRIGIFAKRAIEAGEELFFDYRY 710 (739)
T ss_pred hhhccCCCCCCcEEEEEEecCCcceeeeehhhcccCceeeeeecc
Confidence 446799999999654 4556679999999999999999999954
No 14
>KOG1085|consensus
Probab=91.43 E-value=0.13 Score=55.30 Aligned_cols=45 Identities=36% Similarity=0.373 Sum_probs=36.5
Q ss_pred cccccCCCCCCceeeee--CCccEEEEEEecCCCCCCeEEEecCCCC
Q psy2634 691 QSSSNHSCVPNAEIKFP--YSNFVLNMVATRDIASGDEICISYMDED 735 (749)
Q Consensus 691 ~S~~NHSC~PNa~~~f~--~~~~~~~v~A~r~I~~GEEItISYid~~ 735 (749)
..++|||-.+|+..... .+...+++.|.|+|.+||||+..|.|-.
T Consensus 334 GRLINHS~~gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGDRS 380 (392)
T KOG1085|consen 334 GRLINHSVRGNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGDRS 380 (392)
T ss_pred hhhhcccccCcceeeEEEecCCceEEEEeccccccchhhhhhccccc
Confidence 45899999999965432 2345899999999999999999998854
No 15
>KOG1141|consensus
Probab=90.39 E-value=0.12 Score=61.95 Aligned_cols=44 Identities=25% Similarity=0.422 Sum_probs=33.4
Q ss_pred cccccCCCCCCceee--eeCC-c---cEEEEEEecCCCCCCeEEEecCCC
Q psy2634 691 QSSSNHSCVPNAEIK--FPYS-N---FVLNMVATRDIASGDEICISYMDE 734 (749)
Q Consensus 691 ~S~~NHSC~PNa~~~--f~~~-~---~~~~v~A~r~I~~GEEItISYid~ 734 (749)
..++||||.||..+. |..+ + ..+.++|.+-|++|.|||-.|-..
T Consensus 1190 GRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Ye 1239 (1262)
T KOG1141|consen 1190 GRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQYE 1239 (1262)
T ss_pred hhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeecccc
Confidence 457899999998654 3321 1 256678999999999999999653
No 16
>KOG1337|consensus
Probab=90.38 E-value=0.22 Score=57.38 Aligned_cols=49 Identities=29% Similarity=0.394 Sum_probs=41.0
Q ss_pred eEeecccccccCCCCCCceeeeeCCccEEEEEEecCCCCCCeEEEecCCC
Q psy2634 685 SALYKLQSSSNHSCVPNAEIKFPYSNFVLNMVATRDIASGDEICISYMDE 734 (749)
Q Consensus 685 ~glyp~~S~~NHSC~PNa~~~f~~~~~~~~v~A~r~I~~GEEItISYid~ 734 (749)
.++.|..-++||+|.+ +..-++..+..+.+.+.++|++||||+|+|...
T Consensus 231 ~~L~P~~D~~NH~~~~-~~~~~~~~d~~~~l~~~~~v~~geevfi~YG~~ 279 (472)
T KOG1337|consen 231 EALAPLIDLLNHSPEV-IKAGYNQEDEAVELVAERDVSAGEEVFINYGPK 279 (472)
T ss_pred hhhhhhHHhhccCchh-ccccccCCCCcEEEEEeeeecCCCeEEEecCCC
Confidence 4889999999999999 434444445589999999999999999999774
No 17
>KOG1081|consensus
Probab=72.59 E-value=1.5 Score=50.75 Aligned_cols=49 Identities=29% Similarity=0.399 Sum_probs=38.0
Q ss_pred eEeeccccc-ccCCCCCCceeeeeC--CccEEEEEEecCCCCCCeEEEecCC
Q psy2634 685 SALYKLQSS-SNHSCVPNAEIKFPY--SNFVLNMVATRDIASGDEICISYMD 733 (749)
Q Consensus 685 ~glyp~~S~-~NHSC~PNa~~~f~~--~~~~~~v~A~r~I~~GEEItISYid 733 (749)
.+.+...+. +||+|.||+...-.. ++....++|.+.|++|||+|.+|.-
T Consensus 365 ~~~~~n~sr~~nh~~~~~v~~~k~~~~~~t~~~~~a~~~i~~g~e~t~~~n~ 416 (463)
T KOG1081|consen 365 AGPKGNYSRFLNHSCQPNVETEKWQVIGDTRVGLFAPRQIEAGEELTFNYNG 416 (463)
T ss_pred cccccchhhhhcccCCCceeechhheecccccccccccccccchhhhheeec
Confidence 456666665 799999999755211 2347889999999999999999954
No 18
>PF14023 DUF4239: Protein of unknown function (DUF4239)
Probab=60.05 E-value=2e+02 Score=29.25 Aligned_cols=54 Identities=15% Similarity=0.215 Sum_probs=35.4
Q ss_pred eeeeehhhHhHHHHHHHHHHhcCCcc----hhhHHHHHHhcCCchhhhHHHHHHHHHHHH
Q psy2634 79 LSFVLGFYISIIMTRWWDQYNNIPWP----DAMAVYVSSQVHGQDERGRMMRRTIMRYVC 134 (749)
Q Consensus 79 Lsf~LGF~vn~vy~RWWe~r~~ig~~----d~lA~~v~~~i~g~der~rliRRtI~RYl~ 134 (749)
.+++|||-++.+++|+-+.+...--= ..+.+.+..+ ++.++ ..-+|..+.+|+.
T Consensus 9 ~al~laf~~~~~~~~~~~a~~~v~~EA~al~~l~~~~~~l-p~~~~-~~~~r~~l~~Y~~ 66 (209)
T PF14023_consen 9 FALLLAFTISSAWSRYDNARQAVNQEANALGTLYRLLASL-PEPPD-RDEIRALLRAYTR 66 (209)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCch-hHHHHHHHHHHHH
Confidence 57799999999999999888865332 3455555555 42233 4456666666554
No 19
>PF02937 COX6C: Cytochrome c oxidase subunit VIc; InterPro: IPR004204 Cytochrome c oxidase, a 13 subunit complex, 1.9.3.1 from EC is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc.; GO: 0004129 cytochrome-c oxidase activity; PDB: 3AG4_I 2DYS_V 3ASO_I 2EIK_V 2EIM_I 1OCC_V 1V54_V 1OCO_V 3ASN_V 2EIL_I ....
Probab=44.30 E-value=85 Score=27.76 Aligned_cols=42 Identities=17% Similarity=0.367 Sum_probs=35.2
Q ss_pred HhcCchHhHHHHHHHHHHHHHHHHHHHHHhhcChhhhhhHHH
Q psy2634 24 RWRGSIYKLVWPDLMAYLTLYYALNLTYRYQLNPEQKKTFES 65 (749)
Q Consensus 24 rWRGSI~K~Iw~eLLv~l~ly~~vs~iYr~~L~~~~k~~Fe~ 65 (749)
..||-..+.+++.+.+=+++.+++.++|.++..+..|+..+.
T Consensus 8 ~mrgll~~~l~~~i~~a~~ls~~~~~~~kf~v~~pRKk~Yad 49 (73)
T PF02937_consen 8 QMRGLLAKRLKRHIVVAFVLSLGVAAAYKFGVAEPRKKAYAD 49 (73)
T ss_dssp --BSHHHHHHHHHHHHHHHHHHHHHHHHHHHTHHHHHHHHHH
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Confidence 579999999999999999999999999999887766665555
No 20
>KOG1338|consensus
Probab=41.67 E-value=19 Score=40.97 Aligned_cols=50 Identities=30% Similarity=0.399 Sum_probs=41.5
Q ss_pred CCcceEeecccccccCCCC-CCceeeeeCCccEEEEEEecCCCCCCeEEEecC
Q psy2634 681 NNEGSALYKLQSSSNHSCV-PNAEIKFPYSNFVLNMVATRDIASGDEICISYM 732 (749)
Q Consensus 681 ~~~G~glyp~~S~~NHSC~-PNa~~~f~~~~~~~~v~A~r~I~~GEEItISYi 732 (749)
.+.|-..-|.+-++||+-. -|+...+.. +.+.+.|.|+|.+|+|+.-+|.
T Consensus 209 e~ngk~m~p~ad~lNhd~~k~nanl~y~~--NcL~mva~r~iekgdev~n~dg 259 (466)
T KOG1338|consen 209 ECNGKLMTPIADFLNHDGLKANANLRYED--NCLEMVADRNIEKGDEVDNSDG 259 (466)
T ss_pred ccCcccccchhhhhccchhhcccceeccC--cceeeeecCCCCCccccccccc
Confidence 4566677899999999876 677666653 5889999999999999999995
No 21
>COG1422 Predicted membrane protein [Function unknown]
Probab=40.12 E-value=2.2e+02 Score=29.88 Aligned_cols=66 Identities=15% Similarity=0.188 Sum_probs=43.8
Q ss_pred HHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHH-HHHhhHhhcCCcccCchHHHHHHHHHHHHHHHHHHhhccc
Q psy2634 190 VWAASVVTRARKEGRVRDDFSVKSIIDELNRI-RGQCGKLLSYDTVSVPLVYTQVVTLAVYSFLITTIMGRQW 261 (749)
Q Consensus 190 ~W~~~ll~~arkeGrI~~~~~l~~ll~eL~~f-R~~CerL~~yDwtPIPLaYtQvVtlaVY~Yfl~~lfgrQ~ 261 (749)
.-......+|+++| |+..++++-++=+++ +.+.|..+. +-=|.+|+-++++.++.+...-+.+.-+
T Consensus 82 ~efq~e~~eA~~~~---d~~~lkkLq~~qmem~~~Q~elmk~---qfkPM~~~~v~tI~~F~Wl~~~~~~~~~ 148 (201)
T COG1422 82 KEFQKEFREAQESG---DMKKLKKLQEKQMEMMDDQRELMKM---QFKPMLYISVLTIPFFAWLRWFVGTGGY 148 (201)
T ss_pred HHHHHHHHHHHHhC---CHHHHHHHHHHHHHHHHHHHHHHHH---hhhhHHHHHHHHHHHHHHHHHHHccCcc
Confidence 44556677888887 556555555443333 333444442 4459999999999999988877766533
No 22
>TIGR02059 swm_rep_I cyanobacterial long protein repeat. This domain appears in 29 copies in a large (10000 amino protein in Synechococcus sp. WH8102 associated with a novel flagellar system, as one of three different repeats. Similar domains are found in two different large (<3500) proteins of Synechocystis PCC6803.
Probab=35.94 E-value=50 Score=30.99 Aligned_cols=30 Identities=23% Similarity=0.440 Sum_probs=24.8
Q ss_pred eeCCccEEEEEEecCCCCCCeEEEecCCCC
Q psy2634 706 FPYSNFVLNMVATRDIASGDEICISYMDED 735 (749)
Q Consensus 706 f~~~~~~~~v~A~r~I~~GEEItISYid~~ 735 (749)
.++....+.+.-.+.|..||+++++|.++.
T Consensus 70 V~~s~ktVTLTL~~~V~~Gq~VTVsYt~ps 99 (101)
T TIGR02059 70 LGGSNTTITLTLAQVVEDGDEVTLSYTKNS 99 (101)
T ss_pred EcCcccEEEEEecccccCCCEEEEEeeCCC
Confidence 344445889999999999999999998864
No 23
>PF10166 DUF2368: Uncharacterised conserved protein (DUF2368); InterPro: IPR019319 This family is conserved from nematodes to humans. The function is not known.
Probab=35.27 E-value=24 Score=34.44 Aligned_cols=19 Identities=37% Similarity=0.738 Sum_probs=15.7
Q ss_pred CCccceeeeehhhHhHHHH
Q psy2634 74 SNLIPLSFVLGFYISIIMT 92 (749)
Q Consensus 74 t~~IpLsf~LGF~vn~vy~ 92 (749)
..++||+|++|+....+|.
T Consensus 78 ~PlvPL~fv~~Yq~D~ayG 96 (131)
T PF10166_consen 78 IPLVPLTFVLGYQYDMAYG 96 (131)
T ss_pred hhHHHHHHHHHHHHHHHhh
Confidence 4458999999998888775
No 24
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=33.16 E-value=14 Score=40.37 Aligned_cols=41 Identities=27% Similarity=0.585 Sum_probs=0.0
Q ss_pred ceeeeehhhHhHHH--------HHHHHHHhcCCcch-hhHHHHHHhcCCc
Q psy2634 78 PLSFVLGFYISIIM--------TRWWDQYNNIPWPD-AMAVYVSSQVHGQ 118 (749)
Q Consensus 78 pLsf~LGF~vn~vy--------~RWWe~r~~ig~~d-~lA~~v~~~i~g~ 118 (749)
-|+|+|.|+...+. +-||....-+=|+- -+|..|..|+|+.
T Consensus 139 iLaFcLAF~LaivlLIIAv~L~qaWfT~L~dL~WL~LFlaiLIWlY~H~~ 188 (381)
T PF05297_consen 139 ILAFCLAFLLAIVLLIIAVLLHQAWFTILVDLYWLLLFLAILIWLYVHDQ 188 (381)
T ss_dssp --------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Confidence 36777777766543 67999998888875 5888888888853
No 25
>KOG4544|consensus
Probab=32.85 E-value=26 Score=34.45 Aligned_cols=50 Identities=12% Similarity=0.278 Sum_probs=30.0
Q ss_pred HHHHHHHHHHHHHHHHHHhhcChhhhhhHHHHHhhhcccCCccceeeeehhhHhHHHH
Q psy2634 35 PDLMAYLTLYYALNLTYRYQLNPEQKKTFESIVEYCDTYSNLIPLSFVLGFYISIIMT 92 (749)
Q Consensus 35 ~eLLv~l~ly~~vs~iYr~~L~~~~k~~Fe~i~~y~~~~t~~IpLsf~LGF~vn~vy~ 92 (749)
+|++-|+.++++++++--.+-...-|+. .-...++||+|++|.+...+|.
T Consensus 50 RE~f~w~~~f~~~avv~laa~~~~~Kr~--------~~liPIvPL~f~~gYqyd~ayG 99 (144)
T KOG4544|consen 50 REKFNWIACFGSLAVVLLAASSFHHKRL--------LHLIPIVPLAFFIGYQYDFAYG 99 (144)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcc--------chheechhhHhhhhheeecccc
Confidence 6777788877776654332221111220 2234468999999998877664
No 26
>TIGR00355 purH phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase. Involved in purine ribonucleotide biosynthesis. The IMP cyclohydrolase activity is in the N-terminal region.
Probab=29.69 E-value=41 Score=39.68 Aligned_cols=71 Identities=18% Similarity=0.114 Sum_probs=48.6
Q ss_pred ccccccccchHHHHHHHhcCCCCCCCCC-cCCCcccCCCCccccc-----ccccCCcceeehhHHHHhhccc---ceecc
Q psy2634 455 NSLNKISGGSMQTLRKIFSGSNDNKLTC-ANDGVAMKPASQSSDN-----LAKSAGTSMRITDQVIEELDEQ---MTITA 525 (749)
Q Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 525 (749)
|.++||.+ +--++++--....+.++.. +.|+-|+|+-+---.. -.-++|||+|= ++||+..||. |-.|.
T Consensus 428 GQ~sRV~s-~riA~~kA~~~~~~~~G~vlASDAFFPF~D~ve~aa~aGi~aIiQPGGSiRD-~evI~aa~e~giaMvfTg 505 (511)
T TIGR00355 428 GQMSRVGS-AKIAGIKADDEGLEAKGSSLASDAFFPFRDGVEEAAAAGITCIIQPGGSMRD-EDSIWAADEHGIVMVFTG 505 (511)
T ss_pred CCccHHHH-HHHHHHHHHhhCCCccCcEEEeccccCCCccHHHHHHcCCEEEEcCCCCCCc-HHHHHHHHHhCCEEEECC
Confidence 79999999 8888877532222323332 4688888886642222 23579999997 8999999998 55555
Q ss_pred cC
Q psy2634 526 SG 527 (749)
Q Consensus 526 ~~ 527 (749)
++
T Consensus 506 ~R 507 (511)
T TIGR00355 506 MR 507 (511)
T ss_pred CC
Confidence 43
No 27
>PF10947 DUF2628: Protein of unknown function (DUF2628) ; InterPro: IPR024399 Some members in this family of proteins have been annotated as YigF. Their function is currently unknown.
Probab=26.78 E-value=75 Score=29.23 Aligned_cols=37 Identities=16% Similarity=0.263 Sum_probs=28.0
Q ss_pred hHHHHHHHhcCchHhHHHHHHHHHHHHHHHHHHHHHh
Q psy2634 17 CFLKLLARWRGSIYKLVWPDLMAYLTLYYALNLTYRY 53 (749)
Q Consensus 17 ~flrLLfrWRGSI~K~Iw~eLLv~l~ly~~vs~iYr~ 53 (749)
||.-.+|-+==-.|+..|...++++++.+++.++.-.
T Consensus 39 nw~Af~f~~~w~l~r~mw~~~~~~~~~~~~~~~~~~~ 75 (108)
T PF10947_consen 39 NWWAFFFGPLWLLYRKMWLYAIIFLALLVALAIILIL 75 (108)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6777888665577889999998888877777766544
No 28
>PF07234 DUF1426: Protein of unknown function (DUF1426); InterPro: IPR009871 This family consists of several Banana bunchy top virus proteins of around 120 residues in length. Q9IGU4 from SWISSPROT is annotated a movement protein whereas most other family members are hypothetical. The function of this family is unknown.
Probab=23.72 E-value=1.2e+02 Score=28.40 Aligned_cols=57 Identities=25% Similarity=0.491 Sum_probs=34.2
Q ss_pred HHHHHHhcCchHhHHHHHHHHHHHHHHHHHHHHHhh-cChhhhhhHHHHHhhhcccCCccceeeeehhhHhHHHHHHHHH
Q psy2634 19 LKLLARWRGSIYKLVWPDLMAYLTLYYALNLTYRYQ-LNPEQKKTFESIVEYCDTYSNLIPLSFVLGFYISIIMTRWWDQ 97 (749)
Q Consensus 19 lrLLfrWRGSI~K~Iw~eLLv~l~ly~~vs~iYr~~-L~~~~k~~Fe~i~~y~~~~t~~IpLsf~LGF~vn~vy~RWWe~ 97 (749)
.|++|.| +|+|..+|++|+++|-.+ +--+..++...++.|.-+| ..=.|=|-|
T Consensus 8 VKlfFEw-----------FLF~~AIFiAItIlYILLalL~EvPkYIK~~VrYlVEY---------------lTRRRVWmq 61 (117)
T PF07234_consen 8 VKLFFEW-----------FLFFGAIFIAITILYILLALLFEVPKYIKELVRYLVEY---------------LTRRRVWMQ 61 (117)
T ss_pred HHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH---------------HHHHHHHHH
Confidence 5677776 677777788888887543 2122233445566665443 123477888
Q ss_pred HhcC
Q psy2634 98 YNNI 101 (749)
Q Consensus 98 r~~i 101 (749)
++.+
T Consensus 62 rTQL 65 (117)
T PF07234_consen 62 RTQL 65 (117)
T ss_pred HhHh
Confidence 8865
No 29
>PF11464 Rbsn: Rabenosyn Rab binding domain; InterPro: IPR021565 Rabenosyn-5 (Rbsn) is a multivalent effector with interacts with the Rab family.Rsbn contains distinct Rab4 and Rab5 binding sites within residues 264-500 and 627-784 respectively []. Rab proteins are GTPases involved in the regulation of all stages of membrane trafficking []. ; PDB: 1Z0K_B 1YZM_A 1Z0J_B.
Probab=22.86 E-value=94 Score=24.84 Aligned_cols=32 Identities=19% Similarity=0.242 Sum_probs=25.1
Q ss_pred HHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHH
Q psy2634 189 IVWAASVVTRARKEGRVRDDFSVKSIIDELNR 220 (749)
Q Consensus 189 i~W~~~ll~~arkeGrI~~~~~l~~ll~eL~~ 220 (749)
+.-+-+.+.+|+++||+++-..+..++.||..
T Consensus 5 i~~I~~~I~qAk~~~r~dEV~~L~~NL~EL~~ 36 (42)
T PF11464_consen 5 INIIESYIKQAKAARRFDEVATLEENLRELQD 36 (42)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhcCcHHHHHHHHHHHHHHH
Confidence 34567889999999999877788888888754
No 30
>KOG2461|consensus
Probab=22.02 E-value=76 Score=36.40 Aligned_cols=25 Identities=24% Similarity=0.242 Sum_probs=22.3
Q ss_pred cEEEEEEecCCCCCCeEEEecCCCC
Q psy2634 711 FVLNMVATRDIASGDEICISYMDED 735 (749)
Q Consensus 711 ~~~~v~A~r~I~~GEEItISYid~~ 735 (749)
..|..+|+|+|.+||||.+-|.+..
T Consensus 122 ~~Ifyrt~r~I~p~eELlVWY~~e~ 146 (396)
T KOG2461|consen 122 ENIFYRTIRDIRPNEELLVWYGSEY 146 (396)
T ss_pred CceEEEecccCCCCCeEEEEeccch
Confidence 4689999999999999999998753
No 31
>PF07763 FEZ: FEZ-like protein; InterPro: IPR011680 This is a family of eukaryotic proteins thought to be involved in axonal outgrowth and fasciculation []. The N-terminal regions of these sequences are less conserved than the C-terminal regions, and are highly acidic []. The Caenorhabditis elegans homolog, UNC-76 (Q7JNU9 from SWISSPROT), may play structural and signalling roles in the control of axonal extension and adhesion (particularly in the presence of adjacent neuronal cells []) and these roles have also been postulated for other FEZ family proteins []. Certain homologs have been definitively found to interact with the N-terminal variable region (V1) of PKC-zeta, and this interaction causes cytoplasmic translocation of the FEZ family protein in mammalian neuronal cells []. The C-terminal region probably participates in the association with the regulatory domain of PKC-zeta []. The members of this family are predicted to form coiled-coil structures [, ], which may interact with members of the RhoA family of signalling proteins [], but are not thought to contain other characteristic protein motifs []. Certain members of this family are expressed almost exclusively in the brain, whereas others (such as FEZ2, Q76LN0 from SWISSPROT) are expressed in other tissues, and are thought to perform similar but unknown functions in these tissues [].
Probab=21.32 E-value=37 Score=36.41 Aligned_cols=16 Identities=56% Similarity=0.848 Sum_probs=13.9
Q ss_pred eeehhHHHHhhcccce
Q psy2634 507 MRITDQVIEELDEQMT 522 (749)
Q Consensus 507 ~~~~~~~~~~~~~~~~ 522 (749)
+.-|||||||+||=|-
T Consensus 113 l~TAdqVIeEIdemMq 128 (244)
T PF07763_consen 113 LQTADQVIEEIDEMMQ 128 (244)
T ss_pred chhHHHHHHHHHHHHh
Confidence 6779999999999773
Done!