Query psy3120
Match_columns 711
No_of_seqs 312 out of 811
Neff 3.9
Searched_HMMs 46136
Date Fri Aug 16 16:57:42 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3120.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3120hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1595|consensus 100.0 4.8E-58 1E-62 499.0 16.1 272 150-446 17-301 (528)
2 KOG1595|consensus 99.9 1.5E-24 3.2E-29 237.9 4.3 116 146-282 175-297 (528)
3 COG5063 CTH1 CCCH-type Zn-fing 99.5 4.3E-14 9.4E-19 147.6 9.5 209 210-441 115-343 (351)
4 COG5063 CTH1 CCCH-type Zn-fing 99.5 1.3E-13 2.8E-18 144.2 8.9 185 158-404 108-342 (351)
5 KOG1677|consensus 99.4 9E-14 1.9E-18 145.6 6.2 103 335-443 87-208 (332)
6 KOG1677|consensus 99.2 2.1E-11 4.4E-16 128.0 6.7 111 162-281 82-206 (332)
7 KOG1040|consensus 98.9 9.8E-10 2.1E-14 117.4 5.6 140 212-443 47-189 (325)
8 KOG1040|consensus 98.6 9.3E-08 2E-12 102.5 8.2 144 211-442 78-231 (325)
9 KOG1492|consensus 98.4 5.2E-07 1.1E-11 92.5 5.8 133 211-441 207-339 (377)
10 KOG2494|consensus 98.0 5.9E-06 1.3E-10 88.3 6.0 64 207-281 34-98 (331)
11 COG5084 YTH1 Cleavage and poly 98.0 4.8E-05 1E-09 80.7 11.5 148 213-443 77-240 (285)
12 PF00642 zf-CCCH: Zinc finger 98.0 1.2E-06 2.7E-11 62.1 -0.6 27 410-436 1-27 (27)
13 PF00642 zf-CCCH: Zinc finger 97.5 2.8E-05 6.2E-10 55.2 0.4 27 248-276 1-27 (27)
14 COG5084 YTH1 Cleavage and poly 97.4 0.0007 1.5E-08 72.1 9.7 57 207-276 102-158 (285)
15 smart00356 ZnF_C3H1 zinc finge 97.2 0.00021 4.5E-09 49.4 2.2 26 410-436 2-27 (27)
16 KOG4791|consensus 97.1 0.00042 9.1E-09 77.5 4.2 81 341-443 11-91 (667)
17 KOG1492|consensus 97.0 0.00044 9.6E-09 71.4 3.4 135 251-471 207-342 (377)
18 KOG2494|consensus 96.6 0.00054 1.2E-08 73.7 0.2 62 371-443 38-100 (331)
19 KOG4791|consensus 96.4 0.0017 3.7E-08 72.8 2.7 84 166-280 4-88 (667)
20 smart00356 ZnF_C3H1 zinc finge 96.4 0.0018 4E-08 44.6 1.8 26 248-276 2-27 (27)
21 KOG2333|consensus 96.4 0.0015 3.3E-08 73.6 1.9 65 369-439 75-143 (614)
22 KOG1763|consensus 96.1 0.0011 2.3E-08 70.8 -0.8 70 164-235 91-190 (343)
23 KOG2333|consensus 95.4 0.0059 1.3E-07 69.0 1.3 57 211-276 77-138 (614)
24 KOG3702|consensus 94.8 0.072 1.6E-06 62.2 7.5 70 348-437 597-666 (681)
25 COG5252 Uncharacterized conser 94.6 0.0067 1.4E-07 63.3 -1.1 72 162-235 82-175 (299)
26 KOG1763|consensus 93.3 0.014 3.1E-07 62.5 -1.6 31 333-363 91-122 (343)
27 PF14608 zf-CCCH_2: Zinc finge 89.1 0.28 6.1E-06 32.7 1.8 19 336-355 1-19 (19)
28 KOG2185|consensus 87.0 0.36 7.7E-06 54.1 2.1 31 243-276 133-163 (486)
29 COG5252 Uncharacterized conser 86.8 0.14 3E-06 53.9 -1.1 30 333-362 84-114 (299)
30 PF05297 Herpes_LMP1: Herpesvi 86.3 0.22 4.7E-06 53.8 0.0 8 11-18 4-11 (381)
31 COG5152 Uncharacterized conser 85.8 0.33 7.2E-06 50.1 1.0 33 404-441 138-170 (259)
32 COG5152 Uncharacterized conser 85.4 0.34 7.4E-06 50.0 0.9 28 205-234 137-164 (259)
33 PF08006 DUF1700: Protein of u 84.4 17 0.00036 36.0 12.1 11 39-49 54-64 (181)
34 PF14608 zf-CCCH_2: Zinc finge 83.7 0.8 1.7E-05 30.5 1.7 19 414-435 1-19 (19)
35 KOG2185|consensus 82.7 0.73 1.6E-05 51.7 2.1 32 405-437 133-164 (486)
36 KOG1813|consensus 82.0 0.45 9.8E-06 51.4 0.2 29 204-234 181-209 (313)
37 KOG3702|consensus 81.9 5 0.00011 47.6 8.4 85 334-437 544-646 (681)
38 PF05656 DUF805: Protein of un 81.7 16 0.00035 33.2 10.1 25 65-89 3-27 (120)
39 PF09972 DUF2207: Predicted me 79.0 15 0.00033 40.4 10.7 14 44-57 363-376 (511)
40 PF08006 DUF1700: Protein of u 72.8 44 0.00096 33.0 11.0 12 43-54 44-55 (181)
41 TIGR01654 bact_immun_7tm bacte 68.1 67 0.0015 38.4 13.1 22 5-26 535-556 (679)
42 PF09972 DUF2207: Predicted me 66.1 24 0.00052 38.9 8.4 18 35-52 365-382 (511)
43 KOG1813|consensus 65.5 2.1 4.5E-05 46.6 0.1 30 405-439 184-213 (313)
44 COG4758 Predicted membrane pro 64.7 61 0.0013 34.6 10.4 7 266-272 188-194 (235)
45 KOG0153|consensus 51.0 10 0.00022 42.3 2.3 23 211-235 162-184 (377)
46 COG3152 Predicted membrane pro 48.6 2E+02 0.0042 27.9 10.2 25 64-88 11-35 (125)
47 PRK13862 putative crown gall t 44.7 7.1 0.00015 39.7 -0.1 32 659-690 71-103 (201)
48 PF13908 Shisa: Wnt and FGF in 43.8 10 0.00022 37.5 0.9 7 62-68 50-56 (179)
49 TIGR01654 bact_immun_7tm bacte 38.2 1.4E+02 0.003 35.8 9.1 23 32-54 130-152 (679)
50 PF14257 DUF4349: Domain of un 36.4 1.1E+02 0.0024 32.0 7.1 45 2-56 138-185 (262)
51 PF10650 zf-C3H1: Putative zin 35.6 21 0.00045 25.5 1.1 20 167-189 2-22 (23)
52 smart00793 AgrB Accessory gene 35.4 4.8E+02 0.01 26.2 11.2 21 40-60 9-29 (184)
53 KOG0860|consensus 34.8 23 0.00049 34.1 1.6 21 37-57 30-54 (116)
54 COG3887 Predicted signaling pr 34.6 99 0.0021 37.2 6.9 8 659-666 501-508 (655)
55 KOG1039|consensus 33.4 16 0.00035 40.6 0.5 24 251-277 9-32 (344)
56 PF03208 PRA1: PRA1 family pro 33.1 3.2E+02 0.0069 26.1 9.1 26 25-50 12-38 (153)
57 KOG0153|consensus 30.5 29 0.00064 38.8 1.8 36 359-401 150-185 (377)
58 KOG1039|consensus 30.5 19 0.00042 40.1 0.4 25 413-438 9-33 (344)
59 PF02009 Rifin_STEVOR: Rifin/s 30.2 53 0.0011 36.0 3.6 25 47-71 172-196 (299)
60 COG4709 Predicted membrane pro 29.5 2.3E+02 0.005 29.7 7.8 8 8-15 10-17 (195)
61 COG4758 Predicted membrane pro 29.4 5.2E+02 0.011 27.9 10.5 6 160-165 101-106 (235)
62 KOG0810|consensus 28.6 30 0.00065 37.8 1.5 19 64-82 260-278 (297)
63 PRK05978 hypothetical protein; 27.1 3E+02 0.0065 27.6 7.9 16 57-72 48-63 (148)
64 PF01486 K-box: K-box region; 26.6 75 0.0016 28.8 3.5 48 3-53 19-66 (100)
65 COG4652 Uncharacterized protei 26.2 4.6E+02 0.0099 32.1 10.5 24 137-160 631-654 (657)
66 PF07181 VirC2: VirC2 protein; 25.3 16 0.00035 37.5 -1.1 28 663-690 76-103 (202)
67 PF00664 ABC_membrane: ABC tra 24.7 4.1E+02 0.009 25.3 8.3 19 39-57 98-116 (275)
68 PF06170 DUF983: Protein of un 21.8 1.6E+02 0.0035 26.7 4.6 16 57-72 4-19 (86)
69 PF00957 Synaptobrevin: Synapt 21.4 36 0.00078 29.9 0.4 12 4-15 4-15 (89)
70 PRK13899 type IV secretion sys 21.0 2.3E+02 0.005 26.2 5.5 18 153-170 65-82 (97)
71 PHA03093 EEV glycoprotein; Pro 21.0 64 0.0014 33.3 2.1 11 224-234 105-115 (185)
72 PF05399 EVI2A: Ectropic viral 20.8 80 0.0017 33.4 2.8 7 65-71 119-125 (227)
73 PRK01100 putative accessory ge 20.7 4.8E+02 0.01 26.9 8.4 9 52-60 33-41 (210)
74 PF09889 DUF2116: Uncharacteri 20.3 35 0.00077 29.2 0.1 8 61-68 3-10 (59)
75 COG5353 Uncharacterized protei 20.1 66 0.0014 32.4 1.9 9 148-156 27-35 (161)
No 1
>KOG1595|consensus
Probab=100.00 E-value=4.8e-58 Score=499.03 Aligned_cols=272 Identities=56% Similarity=1.084 Sum_probs=247.8
Q ss_pred cccccchhhhhcccccccccccccCcccCCCCCcccCCCCCCCCCCCCCCCCCCCccccccccccccccccCCCCCCCC-
Q psy3120 150 DLIFDFTFMYLKEFRVEQCPLFLQHKCTQHRPFTCFHWHFMNQRRRRPVRKRDGSFNYSPDTYCTKYDETTGLCPDGDE- 228 (711)
Q Consensus 150 ~~~k~~h~~yL~~FKTk~C~~f~qg~C~~g~p~~C~FAH~~ed~RR~P~~r~dGt~nYst~~lCpkf~e~tG~Cp~GD~- 228 (711)
+.++++|+.|+++|+++. +.|.+|+|.+|+|++|+.||+.+++||+|.++.+|+|+|+++.+|.+|+|.++.||.|+.
T Consensus 17 e~~~~~~~~~~k~~~~e~-~~~~~~~~~~~r~~~~~k~~~~~qR~~~~v~~~~Gs~~~~~~i~~~~~~e~~~~C~~~~~~ 95 (528)
T KOG1595|consen 17 EFEAPNDYAYLKEFRVEQ-PLFLQHKCLQHRPFVCFKWHFLNQRRRRPVARRDGSFNYSPDIYCTKYDEVTGICPDGDEH 95 (528)
T ss_pred hhccccHHHHHHHHHHhc-hhhhhhhhcccccchhhhhhhhccccccchhhhcCccccccceeecchhhccccCCCCccc
Confidence 889999999999999999 999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCccCCCcccccccccccccccccccccccccccccccccCCCCCCCCCCCCCCchHHHhhhcCCCCCCCCCCCCCCC
Q psy3120 229 CPFLHRTAGDTERRYHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGNPDLRPPVYDIKEIQALENPEGDPNSNSNGPNA 308 (711)
Q Consensus 229 C~FAHs~~G~~E~~yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~~ELR~P~~~~~Elqal~~~~~~~~~~~~Gp~~ 308 (711)
|.++|.+.++.|+.||...|||..|.. ..+|+|+|.+.+|.|||++.++|.|+.+..|++. +....+ +.
T Consensus 96 C~~~g~s~~~~e~~~hL~~~k~~~~~t--da~g~~~~~v~~~~~~~~~~~~r~~~~~l~e~~~-----~~~~~~----~e 164 (528)
T KOG1595|consen 96 CAVLGRSVGDTERTYHLRYYKTLPCVT--DARGNCVKNVLHCAFAHGPNDLRPPVEDLLELQG-----GSGLPD----DE 164 (528)
T ss_pred chhcccccCCcceeEeccccccccCcc--ccCCCcccCcccccccCCccccccHHHHHHhccc-----ccCccC----CC
Confidence 999999999999999999999999984 4579999999999999999999999877666641 000001 11
Q ss_pred cccccc----cccCCCCCCC-------ccceeccccccccCCCCCCCCCCCCCCCCC-CCcCccCCCCccccCCCCCCCC
Q psy3120 309 LDKEGK----LLTEDPKWQD-------TNYVLSNYKTEQCKRPPRLCRQGYACPQFH-NSRDKRRSPRTFKYRSTPCPNV 376 (711)
Q Consensus 309 ld~~~k----lm~edPkw~D-------ddfvL~~YKTklC~~~~G~C~~G~~CpFAH-n~kDlRR~P~~~kYkT~LCP~f 376 (711)
.+.+.+ .+.+||+|+| |+|+|+.|||++|++. .||+|.+||||| +++++||+|++|+|..++||.|
T Consensus 165 ~~~~~~~~~~~y~~Dp~~pdi~~~ys~DeFrMy~fKir~C~R~--~shDwteCPf~HpgEkARRRDPRkyhYs~tpCPef 242 (528)
T KOG1595|consen 165 PEVESKLDVTEYPEDPSWPDINGIYSSDEFRMYSFKIRRCSRP--RSHDWTECPFAHPGEKARRRDPRKYHYSSTPCPEF 242 (528)
T ss_pred cccccccccccccCCCCcccccccccccceEEEeeeecccCCc--cCCCcccCCccCCCcccccCCcccccccCccCccc
Confidence 122344 8899999999 8899999999999996 899999999999 8999999999999999999999
Q ss_pred cCCCCCCCCCCCCCCCCcccccCccccccccccccccccccCccccCCCCCCCCCCCCcchhhhccccCC
Q psy3120 377 KHGDEWGEPANCESGDLCQYCHTRTEQQFHPEIYKSTKCNDVQQAGYCPRGVFCAFAHVDTEMANAREMA 446 (711)
Q Consensus 377 kkG~~w~~~G~Cp~GdsC~FAHg~~El~yHP~~YKTklC~~fq~tG~Cp~G~~C~FAHgeaELR~~~~~~ 446 (711)
++| .|++||+|.||||++|.|+||.+|||++|++. |+|++ .+|+|||.++|||......
T Consensus 243 rkG-------~C~rGD~CEyaHgvfEcwLHPa~YRT~~CkDg---~~C~R-rvCfFAH~~eqLR~l~~s~ 301 (528)
T KOG1595|consen 243 RKG-------SCERGDSCEYAHGVFECWLHPARYRTRKCKDG---GYCPR-RVCFFAHSPEQLRPLPPST 301 (528)
T ss_pred ccC-------CCCCCCccccccceehhhcCHHHhccccccCC---CCCcc-ceEeeecChHHhcccCCCC
Confidence 985 99999999999999999999999999999998 69999 8999999999999988743
No 2
>KOG1595|consensus
Probab=99.90 E-value=1.5e-24 Score=237.87 Aligned_cols=116 Identities=31% Similarity=0.536 Sum_probs=100.8
Q ss_pred Hhhhcccccc-------hhhhhcccccccccccccCcccCCCCCcccCCCCCCCCCCCCCCCCCCCcccccccccccccc
Q psy3120 146 IIIIDLIFDF-------TFMYLKEFRVEQCPLFLQHKCTQHRPFTCFHWHFMNQRRRRPVRKRDGSFNYSPDTYCTKYDE 218 (711)
Q Consensus 146 ~~~~~~~k~~-------h~~yL~~FKTk~C~~f~qg~C~~g~p~~C~FAH~~ed~RR~P~~r~dGt~nYst~~lCpkf~e 218 (711)
....|..||+ .++.|+.|||++|++-+.|+|+ .|||||..+..|||+++| |.|+. ++||.|
T Consensus 175 ~y~~Dp~~pdi~~~ys~DeFrMy~fKir~C~R~~shDwt-----eCPf~HpgEkARRRDPRk----yhYs~-tpCPef-- 242 (528)
T KOG1595|consen 175 EYPEDPSWPDINGIYSSDEFRMYSFKIRRCSRPRSHDWT-----ECPFAHPGEKARRRDPRK----YHYSS-TPCPEF-- 242 (528)
T ss_pred cccCCCCcccccccccccceEEEeeeecccCCccCCCcc-----cCCccCCCcccccCCccc----ccccC-ccCccc--
Confidence 3456777777 6789999999999999888877 699999666666666664 99998 799999
Q ss_pred ccCCCCCCCCCCCCccCCCcccccccccccccccccccccccccccccccccCCCCCCCCCCCC
Q psy3120 219 TTGLCPDGDECPFLHRTAGDTERRYHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGNPDLRPP 282 (711)
Q Consensus 219 ~tG~Cp~GD~C~FAHs~~G~~E~~yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~~ELR~P 282 (711)
++|.|++||.|.|||+ .+|.|+||..|||++|++. |+|+ + ..|+|||.+++||.-
T Consensus 243 rkG~C~rGD~CEyaHg---vfEcwLHPa~YRT~~CkDg----~~C~-R-rvCfFAH~~eqLR~l 297 (528)
T KOG1595|consen 243 RKGSCERGDSCEYAHG---VFECWLHPARYRTRKCKDG----GYCP-R-RVCFFAHSPEQLRPL 297 (528)
T ss_pred ccCCCCCCCccccccc---eehhhcCHHHhccccccCC----CCCc-c-ceEeeecChHHhccc
Confidence 5799999999999995 7999999999999999986 6796 4 999999999999963
No 3
>COG5063 CTH1 CCCH-type Zn-finger protein [General function prediction only]
Probab=99.50 E-value=4.3e-14 Score=147.65 Aligned_cols=209 Identities=19% Similarity=0.233 Sum_probs=142.1
Q ss_pred cccccccccccCCCCCCCCCCCCccCCCcccccc--cccccccccccccccccccccccccccCCCCCCCCCCCCCCchH
Q psy3120 210 DTYCTKYDETTGLCPDGDECPFLHRTAGDTERRY--HLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGNPDLRPPVYDIK 287 (711)
Q Consensus 210 ~~lCpkf~e~tG~Cp~GD~C~FAHs~~G~~E~~y--HP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~~ELR~P~~~~~ 287 (711)
|.+|..-. ..+.|.+++.|+|||+ ..+..+ .+.+|||+-|.-+.+ -|+|+ ++.+|.|-|-+----.+
T Consensus 115 t~~l~ss~-~~~~~~~p~~n~fahs---~~issl~~~~~K~kt~slev~in-~~~vp-~s~~~~~~slP~t~~~~----- 183 (351)
T COG5063 115 TEMLRSST-EIPYCRYPDKNPFAHS---KAISSLAQTHPKYKTESLEVFIN-PGYVP-YSKRCCFISLPLTDINL----- 183 (351)
T ss_pred chhhhccc-cccccccCCCCcCCCc---cccccccccCccccccceeEEec-CCccc-cccccccccccccccCc-----
Confidence 36776432 2478999999999997 344433 345799999999887 48898 89999999954322221
Q ss_pred HHhhhcCCCCC----CCCCCCCCCCcccccccccCCCCCCCccceeccccc--cccCCC--CCCCCC---CCCCCCC---
Q psy3120 288 EIQALENPEGD----PNSNSNGPNALDKEGKLLTEDPKWQDTNYVLSNYKT--EQCKRP--PRLCRQ---GYACPQF--- 353 (711)
Q Consensus 288 Elqal~~~~~~----~~~~~~Gp~~ld~~~klm~edPkw~DddfvL~~YKT--klC~~~--~G~C~~---G~~CpFA--- 353 (711)
+.+...+.. ..+...+-+.-+.+.... .++-...-+..||| .+|.-| .|.|++ |..|.|+
T Consensus 184 --q~l~~rkpks~~~~~s~t~~kes~a~P~~~~----~~~~~e~n~~L~kt~~~lc~~ft~kg~~p~~~sG~~~q~a~~~ 257 (351)
T COG5063 184 --QPLSQRKPKSGKNCTSYTLGKESDAHPHDEL----IYQKQEQNKPLYKTNPELCESFTRKGTCPYWISGVKCQFACRG 257 (351)
T ss_pred --chhhccCcccCcCccccccccccccCchhhh----hhhhhhccchhhcCCHHHhhccCcCCCCccccccccccccccc
Confidence 111110100 011111111101100000 01111223567899 999998 599999 9999999
Q ss_pred CCCcCccCCCCccccCCCCCCCCcCCCCCCCCCCCCCCCCcccccCccccc---ccc-ccccccccccCccccCCCCCCC
Q psy3120 354 HNSRDKRRSPRTFKYRSTPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQ---FHP-EIYKSTKCNDVQQAGYCPRGVF 429 (711)
Q Consensus 354 Hn~kDlRR~P~~~kYkT~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~---yHP-~~YKTklC~~fq~tG~Cp~G~~ 429 (711)
|+..+....-..-.|+|++|-+|.+. |.|++|.+|.|+||-.+.. .|+ ..|+...|+.+.+.|.||+|-.
T Consensus 258 HGlN~l~~k~k~~~frTePcinwe~s------Gyc~yg~Rc~F~hgd~~~ie~~~~~~~~y~~~~crt~~~~g~~p~g~~ 331 (351)
T COG5063 258 HGLNELKSKKKKQNFRTEPCINWEKS------GYCPYGLRCCFKHGDDSDIEMYEEASLGYLDGPCRTRAKGGAFPSGGA 331 (351)
T ss_pred cccccccccccccccccCCccchhhc------ccCccccccccccCChhhccccccccccccccccccccccCccCCCCc
Confidence 99876655444456899999998754 8999999999999988764 344 4799999999999999999999
Q ss_pred CCCCCcchhhhc
Q psy3120 430 CAFAHVDTEMAN 441 (711)
Q Consensus 430 C~FAHgeaELR~ 441 (711)
|.|-|.+.-+..
T Consensus 332 ~c~~~dkkn~~~ 343 (351)
T COG5063 332 VCKSFDKKNLDF 343 (351)
T ss_pred hhhccccchhhh
Confidence 999998776654
No 4
>COG5063 CTH1 CCCH-type Zn-finger protein [General function prediction only]
Probab=99.46 E-value=1.3e-13 Score=144.20 Aligned_cols=185 Identities=21% Similarity=0.291 Sum_probs=139.2
Q ss_pred hhhccccccccccc-ccCcccCCCCCcccCCCCCCCCCCCCCCCCCCCccccccccccccccccCCCCCCCCCCCCccCC
Q psy3120 158 MYLKEFRVEQCPLF-LQHKCTQHRPFTCFHWHFMNQRRRRPVRKRDGSFNYSPDTYCTKYDETTGLCPDGDECPFLHRTA 236 (711)
Q Consensus 158 ~yL~~FKTk~C~~f-~qg~C~~g~p~~C~FAH~~ed~RR~P~~r~dGt~nYst~~lCpkf~e~tG~Cp~GD~C~FAHs~~ 236 (711)
.-...|||.+|.-. .-+.|+..+ .|+|+|+....|-.+.. ..|++ +-|.-+- ..+.||++.+|-|.|= +
T Consensus 108 ~p~g~~kt~~l~ss~~~~~~~~p~--~n~fahs~~issl~~~~-----~K~kt-~slev~i-n~~~vp~s~~~~~~sl-P 177 (351)
T COG5063 108 KPYGLYKTEMLRSSTEIPYCRYPD--KNPFAHSKAISSLAQTH-----PKYKT-ESLEVFI-NPGYVPYSKRCCFISL-P 177 (351)
T ss_pred CccccccchhhhccccccccccCC--CCcCCCccccccccccC-----ccccc-cceeEEe-cCCccccccccccccc-c
Confidence 33567999999764 257899865 89999998887754432 45776 6787775 6789999988877762 2
Q ss_pred Ccc--------------------------------cccc-----ccccccc--cccccccccccccccc---ccccCCC-
Q psy3120 237 GDT--------------------------------ERRY-----HLRYYKT--CMCVHDTDARGLCVKN---GAHCAFA- 273 (711)
Q Consensus 237 G~~--------------------------------E~~y-----HP~~YKT--klCk~~~~~kG~C~K~---G~~C~FA- 273 (711)
... |+.| |-..||| .+|..|.. +|+|+ + |..|.||
T Consensus 178 ~t~~~~q~l~~rkpks~~~~~s~t~~kes~a~P~~~~~~~~~e~n~~L~kt~~~lc~~ft~-kg~~p-~~~sG~~~q~a~ 255 (351)
T COG5063 178 LTDINLQPLSQRKPKSGKNCTSYTLGKESDAHPHDELIYQKQEQNKPLYKTNPELCESFTR-KGTCP-YWISGVKCQFAC 255 (351)
T ss_pred ccccCcchhhccCcccCcCccccccccccccCchhhhhhhhhhccchhhcCCHHHhhccCc-CCCCc-cccccccccccc
Confidence 111 1112 2357999 89999986 69997 9 9999999
Q ss_pred --CCCCCCCCCCCchHHHhhhcCCCCCCCCCCCCCCCcccccccccCCCCCCCccceeccccccccCCCC--CCCCCCCC
Q psy3120 274 --HGNPDLRPPVYDIKEIQALENPEGDPNSNSNGPNALDKEGKLLTEDPKWQDTNYVLSNYKTEQCKRPP--RLCRQGYA 349 (711)
Q Consensus 274 --Hg~~ELR~P~~~~~Elqal~~~~~~~~~~~~Gp~~ld~~~klm~edPkw~DddfvL~~YKTklC~~~~--G~C~~G~~ 349 (711)
||..|+..-- .-..|||++|.++. |.|+||.+
T Consensus 256 ~~HGlN~l~~k~--------------------------------------------k~~~frTePcinwe~sGyc~yg~R 291 (351)
T COG5063 256 RGHGLNELKSKK--------------------------------------------KKQNFRTEPCINWEKSGYCPYGLR 291 (351)
T ss_pred cccccccccccc--------------------------------------------cccccccCCccchhhcccCccccc
Confidence 9988887521 02458999999995 89999999
Q ss_pred CCCCCCCcCccCCCCcc--ccCCCCCCCCcCCCCCCCCCCCCCCCCcccccCccccc
Q psy3120 350 CPQFHNSRDKRRSPRTF--KYRSTPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQ 404 (711)
Q Consensus 350 CpFAHn~kDlRR~P~~~--kYkT~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~ 404 (711)
|+|+|+.+..-..+... .|+...|..+.++ |.|+.|..|.+.|...-+.
T Consensus 292 c~F~hgd~~~ie~~~~~~~~y~~~~crt~~~~------g~~p~g~~~c~~~dkkn~~ 342 (351)
T COG5063 292 CCFKHGDDSDIEMYEEASLGYLDGPCRTRAKG------GAFPSGGAVCKSFDKKNLD 342 (351)
T ss_pred cccccCChhhcccccccccccccccccccccc------CccCCCCchhhccccchhh
Confidence 99999996544444332 6889999999876 8999999999999765443
No 5
>KOG1677|consensus
Probab=99.44 E-value=9e-14 Score=145.63 Aligned_cols=103 Identities=25% Similarity=0.573 Sum_probs=84.4
Q ss_pred cccCCC--CCCCCCCCCCCCCCCCcCccCCCC--------ccccCCCCCCCCcCCCCCCCCCCCCC-CCCcccccCcccc
Q psy3120 335 EQCKRP--PRLCRQGYACPQFHNSRDKRRSPR--------TFKYRSTPCPNVKHGDEWGEPANCES-GDLCQYCHTRTEQ 403 (711)
Q Consensus 335 klC~~~--~G~C~~G~~CpFAHn~kDlRR~P~--------~~kYkT~LCP~fkkG~~w~~~G~Cp~-GdsC~FAHg~~El 403 (711)
..|..+ .+.|.++..|+|.|...+.|..+. .-.|++.+|..|... +.|.+ |++|+|+|+.+|+
T Consensus 87 ~~~~~~~~~~~~~~~s~~~~~~p~~~~~~~~~~~~~~~~~p~~~kt~lc~~~~~~------g~c~y~ge~crfah~~~e~ 160 (332)
T KOG1677|consen 87 GDCSAYLRTGVCGYGSSCRYNHPDLRLRPRPVRRSRGERKPERYKTPLCRSFRKS------GTCKYRGEQCRFAHGLEEL 160 (332)
T ss_pred cccccccccCCCCCCCCCCccCcccccccCCccccccccCcccccCCcceeeecC------ccccccCchhhhcCCcccc
Confidence 567666 368888899999999744433321 124789999998754 89999 9999999999999
Q ss_pred c--------cccccccccccccCccccCCCCCCCCCCCCcchhhhccc
Q psy3120 404 Q--------FHPEIYKSTKCNDVQQAGYCPRGVFCAFAHVDTEMANAR 443 (711)
Q Consensus 404 ~--------yHP~~YKTklC~~fq~tG~Cp~G~~C~FAHgeaELR~~~ 443 (711)
+ ..|.+|||++|.+|+++|+|+||.+|+|+|+..|.+...
T Consensus 161 r~~~~~~~~~~~~~~kt~lC~~f~~tG~C~yG~rC~F~H~~~~~~~~~ 208 (332)
T KOG1677|consen 161 RLPSSENQVGNPPKYKTKLCPKFQKTGLCKYGSRCRFIHGEPEDRASN 208 (332)
T ss_pred cccccchhhcCCCCCCCcCCCccccCCCCCCCCcCeecCCCccccccc
Confidence 8 346679999999999999999999999999999888743
No 6
>KOG1677|consensus
Probab=99.19 E-value=2.1e-11 Score=127.99 Aligned_cols=111 Identities=32% Similarity=0.592 Sum_probs=86.1
Q ss_pred cccc-ccccccc-cCcccCCCCCcccCCCCCCCCCCCCC---CCCCCCccccccccccccccccCCCCC-CCCCCCCccC
Q psy3120 162 EFRV-EQCPLFL-QHKCTQHRPFTCFHWHFMNQRRRRPV---RKRDGSFNYSPDTYCTKYDETTGLCPD-GDECPFLHRT 235 (711)
Q Consensus 162 ~FKT-k~C~~f~-qg~C~~g~p~~C~FAH~~ed~RR~P~---~r~dGt~nYst~~lCpkf~e~tG~Cp~-GD~C~FAHs~ 235 (711)
.+.+ ..|..+. .+.|..+. .|+|.|...+.|..+. ......-.|++ .+|..|. .+|.|.+ |++|+|+|..
T Consensus 82 ~~~~~~~~~~~~~~~~~~~~s--~~~~~~p~~~~~~~~~~~~~~~~~p~~~kt-~lc~~~~-~~g~c~y~ge~crfah~~ 157 (332)
T KOG1677|consen 82 ERSGEGDCSAYLRTGVCGYGS--SCRYNHPDLRLRPRPVRRSRGERKPERYKT-PLCRSFR-KSGTCKYRGEQCRFAHGL 157 (332)
T ss_pred CCCCccccccccccCCCCCCC--CCCccCcccccccCCccccccccCcccccC-Ccceeee-cCccccccCchhhhcCCc
Confidence 3344 6788776 58888876 8999999866665552 22222335776 7999996 8999999 9999999964
Q ss_pred CCccccc--------ccccccccccccccccccccccccccccCCCCCCCCCCC
Q psy3120 236 AGDTERR--------YHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGNPDLRP 281 (711)
Q Consensus 236 ~G~~E~~--------yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~~ELR~ 281 (711)
.|.+ ..+..|||++|.+|... |.|+ ||.+|.|+|+..+.+.
T Consensus 158 ---~e~r~~~~~~~~~~~~~~kt~lC~~f~~t-G~C~-yG~rC~F~H~~~~~~~ 206 (332)
T KOG1677|consen 158 ---EELRLPSSENQVGNPPKYKTKLCPKFQKT-GLCK-YGSRCRFIHGEPEDRA 206 (332)
T ss_pred ---ccccccccchhhcCCCCCCCcCCCccccC-CCCC-CCCcCeecCCCccccc
Confidence 4444 45778999999999864 9995 9999999999998885
No 7
>KOG1040|consensus
Probab=98.92 E-value=9.8e-10 Score=117.37 Aligned_cols=140 Identities=30% Similarity=0.619 Sum_probs=104.9
Q ss_pred cccccccccCCCCCCCCCCCCccCCCcccccccccccccccccccccccccccccccccCCCCCCCCCCCCCCchHHHhh
Q psy3120 212 YCTKYDETTGLCPDGDECPFLHRTAGDTERRYHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGNPDLRPPVYDIKEIQA 291 (711)
Q Consensus 212 lCpkf~e~tG~Cp~GD~C~FAHs~~G~~E~~yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~~ELR~P~~~~~Elqa 291 (711)
.|..+......|.+|-.|+..|. +.+ +.+++.+|++|. +|.| +.|+.|.|+|.. ++.
T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~~---~~~-----~~~~~~vcK~~l--~glC-~kgD~C~Flhe~-~~~----------- 103 (325)
T KOG1040|consen 47 TCEFNESREKPCERGPICPKSHN---DVS-----DSRGKVVCKHWL--RGLC-KKGDQCEFLHEY-DLT----------- 103 (325)
T ss_pred hhcccccCCCCccCCCCCccccC---Ccc-----ccCCceeehhhh--hhhh-hccCcCcchhhh-hhc-----------
Confidence 46655322267999999999996 332 468899999998 5899 589999999966 333
Q ss_pred hcCCCCCCCCCCCCCCCcccccccccCCCCCCCccceeccccccccCCC--CCCCCCCCCCCCCCCCcCccCCCCccccC
Q psy3120 292 LENPEGDPNSNSNGPNALDKEGKLLTEDPKWQDTNYVLSNYKTEQCKRP--PRLCRQGYACPQFHNSRDKRRSPRTFKYR 369 (711)
Q Consensus 292 l~~~~~~~~~~~~Gp~~ld~~~klm~edPkw~DddfvL~~YKTklC~~~--~G~C~~G~~CpFAHn~kDlRR~P~~~kYk 369 (711)
|...|.+| .+.|..+.+|+|+|..- . -+
T Consensus 104 -----------------------------------------k~rec~ff~~~g~c~~~~~c~y~h~dp------q---t~ 133 (325)
T KOG1040|consen 104 -----------------------------------------KMRECKFFSLFGECTNGKDCPYLHGDP------Q---TA 133 (325)
T ss_pred -----------------------------------------ccccccccccccccccccCCcccCCCh------h---hh
Confidence 22334444 25667789999999852 1 12
Q ss_pred CCCCCCCcCCCCCCCCCCCCCCCCcccccCccccccccccccccccccCccccCCCCCCC-CCCCCcchhhhccc
Q psy3120 370 STPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQFHPEIYKSTKCNDVQQAGYCPRGVF-CAFAHVDTEMANAR 443 (711)
Q Consensus 370 T~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~yHP~~YKTklC~~fq~tG~Cp~G~~-C~FAHgeaELR~~~ 443 (711)
...|+.+ ..|.|+.|.+|++.|-.. .+|+.|. .|.|+.|.. |-+.|.+.++....
T Consensus 134 ~k~c~~~-------~~g~c~~g~~c~~~h~~~-----------~~c~~y~-~gfC~~g~q~c~~~hp~~~~~~~~ 189 (325)
T KOG1040|consen 134 IKKCKWY-------KEGFCRGGPSCKKRHERK-----------VLCPPYN-AGFCPKGPQRCDMLHPEFQQPPFH 189 (325)
T ss_pred hhccchh-------hhccCCCcchhhhhhhcc-----------cCCCchh-hhhccCCCCcccccCCCCCCChhh
Confidence 4567754 456999999999999654 5799997 699999999 99999988877643
No 8
>KOG1040|consensus
Probab=98.60 E-value=9.3e-08 Score=102.45 Aligned_cols=144 Identities=26% Similarity=0.502 Sum_probs=106.7
Q ss_pred ccccccccccCCCCCCCCCCCCccCCCcccccccccccccccccccccccccccccccccCCCCCCCCCCCCCCchHHHh
Q psy3120 211 TYCTKYDETTGLCPDGDECPFLHRTAGDTERRYHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGNPDLRPPVYDIKEIQ 290 (711)
Q Consensus 211 ~lCpkf~e~tG~Cp~GD~C~FAHs~~G~~E~~yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~~ELR~P~~~~~Elq 290 (711)
.+|..| ..|.|..||.|.|+|.. +.-|-..|.+|.. .|.|. ++..|.|-|++-+-
T Consensus 78 ~vcK~~--l~glC~kgD~C~Flhe~----------~~~k~rec~ff~~-~g~c~-~~~~c~y~h~dpqt----------- 132 (325)
T KOG1040|consen 78 VVCKHW--LRGLCKKGDQCEFLHEY----------DLTKMRECKFFSL-FGECT-NGKDCPYLHGDPQT----------- 132 (325)
T ss_pred eeehhh--hhhhhhccCcCcchhhh----------hhccccccccccc-ccccc-cccCCcccCCChhh-----------
Confidence 699988 57999999999999951 2345556888864 79997 89999999976411
Q ss_pred hhcCCCCCCCCCCCCCCCcccccccccCCCCCCCccceeccccccccCCC-CCCCCCCCCCCCCCCCcCccCCCCccccC
Q psy3120 291 ALENPEGDPNSNSNGPNALDKEGKLLTEDPKWQDTNYVLSNYKTEQCKRP-PRLCRQGYACPQFHNSRDKRRSPRTFKYR 369 (711)
Q Consensus 291 al~~~~~~~~~~~~Gp~~ld~~~klm~edPkw~DddfvL~~YKTklC~~~-~G~C~~G~~CpFAHn~kDlRR~P~~~kYk 369 (711)
+.+.|+++ .|.|+.|..|.+.|..+
T Consensus 133 ------------------------------------------~~k~c~~~~~g~c~~g~~c~~~h~~~------------ 158 (325)
T KOG1040|consen 133 ------------------------------------------AIKKCKWYKEGFCRGGPSCKKRHERK------------ 158 (325)
T ss_pred ------------------------------------------hhhccchhhhccCCCcchhhhhhhcc------------
Confidence 13346665 37788899999999742
Q ss_pred CCCCCCCcCCCCCCCCCCCCCCCC-cccccCcccccc-------c-cccccccccccCccccCCCCCCCCCCCCcchhhh
Q psy3120 370 STPCPNVKHGDEWGEPANCESGDL-CQYCHTRTEQQF-------H-PEIYKSTKCNDVQQAGYCPRGVFCAFAHVDTEMA 440 (711)
Q Consensus 370 T~LCP~fkkG~~w~~~G~Cp~Gds-C~FAHg~~El~y-------H-P~~YKTklC~~fq~tG~Cp~G~~C~FAHgeaELR 440 (711)
.+|+.|.. +.|+.|.. |.+.|...++-- | |....--.|..+. ...|.....|-+.+...|-.
T Consensus 159 -~~c~~y~~-------gfC~~g~q~c~~~hp~~~~~~~~~~~~q~~~~~~~~~~~~~~~-~~~~~k~~~~~~~~~~~~~~ 229 (325)
T KOG1040|consen 159 -VLCPPYNA-------GFCPKGPQRCDMLHPEFQQPPFHQGSVQHPPQRQSQQPGKISG-EEKIHKLLQDKQLPQQHEEK 229 (325)
T ss_pred -cCCCchhh-------hhccCCCCcccccCCCCCCChhhccccCCCCCCCCcCcccccc-ccccchhhhhhhhhHHHHHH
Confidence 67898764 59999998 999999887642 3 2334444455554 35688888999999888876
Q ss_pred cc
Q psy3120 441 NA 442 (711)
Q Consensus 441 ~~ 442 (711)
+.
T Consensus 230 q~ 231 (325)
T KOG1040|consen 230 QK 231 (325)
T ss_pred Hh
Confidence 65
No 9
>KOG1492|consensus
Probab=98.35 E-value=5.2e-07 Score=92.49 Aligned_cols=133 Identities=23% Similarity=0.531 Sum_probs=99.3
Q ss_pred ccccccccccCCCCCCCCCCCCccCCCcccccccccccccccccccccccccccccccccCCCCCCCCCCCCCCchHHHh
Q psy3120 211 TYCTKYDETTGLCPDGDECPFLHRTAGDTERRYHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGNPDLRPPVYDIKEIQ 290 (711)
Q Consensus 211 ~lCpkf~e~tG~Cp~GD~C~FAHs~~G~~E~~yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~~ELR~P~~~~~Elq 290 (711)
.+|..|. ..|.|-+|..|+|.|. | -+..+|..|.+ |.|. ....|...|..++.|.|.+
T Consensus 207 vycryyn-angicgkgaacrfvhe----------p--trkticpkfln--grcn-kaedcnlsheldprripac------ 264 (377)
T KOG1492|consen 207 VYCRYYN-ANGICGKGAACRFVHE----------P--TRKTICPKFLN--GRCN-KAEDCNLSHELDPRRIPAC------ 264 (377)
T ss_pred eEEEEec-CCCcccCCceeeeecc----------c--cccccChHHhc--CccC-chhcCCcccccCccccchh------
Confidence 6899886 7899999999999995 1 23347999984 8896 6899999999988887742
Q ss_pred hhcCCCCCCCCCCCCCCCcccccccccCCCCCCCccceeccccccccCCCCCCCCCCCCCCCCCCCcCccCCCCccccCC
Q psy3120 291 ALENPEGDPNSNSNGPNALDKEGKLLTEDPKWQDTNYVLSNYKTEQCKRPPRLCRQGYACPQFHNSRDKRRSPRTFKYRS 370 (711)
Q Consensus 291 al~~~~~~~~~~~~Gp~~ld~~~klm~edPkw~DddfvL~~YKTklC~~~~G~C~~G~~CpFAHn~kDlRR~P~~~kYkT 370 (711)
.+|-...|.+ ..|.|.|-.. ....
T Consensus 265 ---------------------------------------ryfllgkcnn--------pncryvhihy---------sena 288 (377)
T KOG1492|consen 265 ---------------------------------------RYFLLGKCNN--------PNCRYVHIHY---------SENA 288 (377)
T ss_pred ---------------------------------------hhhhhccCCC--------CCceEEEEee---------cCCC
Confidence 1223333433 6899998642 1123
Q ss_pred CCCCCCcCCCCCCCCCCCCCCCCcccccCccccccccccccccccccCccccCCCCCCCCCCCCcchhhhc
Q psy3120 371 TPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQFHPEIYKSTKCNDVQQAGYCPRGVFCAFAHVDTEMAN 441 (711)
Q Consensus 371 ~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~yHP~~YKTklC~~fq~tG~Cp~G~~C~FAHgeaELR~ 441 (711)
.+|=.|. ..|.|..|.+|+.-|-. .|.+|.+.|.|.. +.|..+|+.-....
T Consensus 289 picfefa------kygfcelgtscknqhil-------------qctdyamfgscnn-pqcslyhgavsadv 339 (377)
T KOG1492|consen 289 PICFEFA------KYGFCELGTSCKNQHIL-------------QCTDYAMFGSCNN-PQCSLYHGAVSADV 339 (377)
T ss_pred ceeeeeh------hcceeccccccccceee-------------eecchhhhcCCCC-CcceeecceeccCC
Confidence 4576654 35899999999998843 5999999999997 89999998654444
No 10
>KOG2494|consensus
Probab=98.04 E-value=5.9e-06 Score=88.32 Aligned_cols=64 Identities=20% Similarity=0.582 Sum_probs=50.9
Q ss_pred ccccccccccccccCCCCCCCC-CCCCccCCCcccccccccccccccccccccccccccccccccCCCCCCCCCCC
Q psy3120 207 YSPDTYCTKYDETTGLCPDGDE-CPFLHRTAGDTERRYHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGNPDLRP 281 (711)
Q Consensus 207 Yst~~lCpkf~e~tG~Cp~GD~-C~FAHs~~G~~E~~yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~~ELR~ 281 (711)
|.+++.|..| ..+.|.+||. |+|+|-- + ..+-..-+-..|.++. +|.|. .+.|+|.|.+.+++.
T Consensus 34 wl~~eVCReF--~rn~C~R~d~~CkfaHP~----~-~~~V~~g~v~aC~Ds~--kgrCs--R~nCkylHpp~hlkd 98 (331)
T KOG2494|consen 34 WLTLEVCREF--LRNTCSRGDRECKFAHPP----K-NCQVSNGRVIACFDSQ--KGRCS--RENCKYLHPPQHLKD 98 (331)
T ss_pred hhHHHHHHHH--HhccccCCCccccccCCC----C-CCCccCCeEEEEeccc--cCccC--cccceecCCChhhhh
Confidence 5667899999 5899999999 9999942 1 2334455667799986 58896 688999999988885
No 11
>COG5084 YTH1 Cleavage and polyadenylation specificity factor (CPSF) Clipper subunit and related makorin family Zn-finger proteins [General function prediction only]
Probab=97.99 E-value=4.8e-05 Score=80.68 Aligned_cols=148 Identities=21% Similarity=0.437 Sum_probs=101.1
Q ss_pred ccccccccCCCCCCCCCCCCccCCCcccccccccccccccccccccccccccccccccCCCCCCCCCCCCCCchHHHhhh
Q psy3120 213 CTKYDETTGLCPDGDECPFLHRTAGDTERRYHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGNPDLRPPVYDIKEIQAL 292 (711)
Q Consensus 213 Cpkf~e~tG~Cp~GD~C~FAHs~~G~~E~~yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~~ELR~P~~~~~Elqal 292 (711)
|..+. ..+.+-..-.|++-|.. |..+....|++|. .|.|+ -+..|.|+|+.+-++. .
T Consensus 77 ~~~~~-~~~~~~~~~s~~~~~~~---------~~~~s~V~c~~~~--~g~c~-s~~~c~~lh~~d~~~s-~--------- 133 (285)
T COG5084 77 CISRN-FNSIRGSRLSTPNNHVN---------PVLSSSVVCKFFL--RGLCK-SGFSCEFLHEYDLRSS-Q--------- 133 (285)
T ss_pred ccccc-ccCCccccccCCccccC---------ccccCCcccchhc--cccCc-CCCccccccCCCcccc-c---------
Confidence 55443 33333334467777741 2257788999997 58996 8999999998765442 1
Q ss_pred cCCCCCCCCCCCCCCCcccccccccCCCCCCCccceeccccccccCCC--CCCCCCCCCCCCCCCCcCccCCCCccccCC
Q psy3120 293 ENPEGDPNSNSNGPNALDKEGKLLTEDPKWQDTNYVLSNYKTEQCKRP--PRLCRQGYACPQFHNSRDKRRSPRTFKYRS 370 (711)
Q Consensus 293 ~~~~~~~~~~~~Gp~~ld~~~klm~edPkw~DddfvL~~YKTklC~~~--~G~C~~G~~CpFAHn~kDlRR~P~~~kYkT 370 (711)
+-.|+.| .|.|..|..|+|.|.. |+.+ .
T Consensus 134 -----------------------------------------~~~c~~Fs~~G~cs~g~~c~~~h~d------p~~~---~ 163 (285)
T COG5084 134 -----------------------------------------GPPCRSFSLKGSCSSGPSCGYSHID------PDSF---A 163 (285)
T ss_pred -----------------------------------------CCCcccccccceeccCCCCCccccC------cccc---c
Confidence 2246666 4788899999999986 4332 2
Q ss_pred CCCCCCcCCCCCCCCCCCCCCCCcccccCccccccccccccccccccCccccCCCCCCC--------------CCCCCcc
Q psy3120 371 TPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQFHPEIYKSTKCNDVQQAGYCPRGVF--------------CAFAHVD 436 (711)
Q Consensus 371 ~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~yHP~~YKTklC~~fq~tG~Cp~G~~--------------C~FAHge 436 (711)
..|..+... ..+.|+.|..|++.|+...+-|- +.+|.++. .+.||.|.. |...|..
T Consensus 164 ~~~~~~~~~----~~~f~p~g~~c~~~H~~~~~~~~-----~~p~~~y~-~~fsP~g~~~~~~~~~~~~~~~~~~~~~ps 233 (285)
T COG5084 164 GNCDQYSGA----TYGFCPLGASCKFSHTLKRVSYG-----SSPCGNYT-PPFSPPGTPSESVSSWGYGKGTSCSLSHPS 233 (285)
T ss_pred ccccccCcc----cccccCCCCcccccccccccccc-----ccccccCc-CCcCCCCCCccccccccccccccccCCCcc
Confidence 345554321 24699999999999998755444 44899997 478888877 8888877
Q ss_pred hhhhccc
Q psy3120 437 TEMANAR 443 (711)
Q Consensus 437 aELR~~~ 443 (711)
.+.....
T Consensus 234 ~~~~iq~ 240 (285)
T COG5084 234 LNIDIQQ 240 (285)
T ss_pred ccCCcCC
Confidence 6655543
No 12
>PF00642 zf-CCCH: Zinc finger C-x8-C-x5-C-x3-H type (and similar); InterPro: IPR000571 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents C-x8-C-x5-C-x3-H (CCCH) type Zinc finger (Znf) domains. Proteins containing CCCH Znf domains include Znf proteins from eukaryotes involved in cell cycle or growth phase-related regulation, e.g. human TIS11B (butyrate response factor 1), a probable regulatory protein involved in regulating the response to growth factors, and the mouse TTP growth factor-inducible nuclear protein, which has the same function. The mouse TTP protein is induced by growth factors. Another protein containing this domain is the human splicing factor U2AF 35kDa subunit, which plays a critical role in both constitutive and enhancer-dependent splicing by mediating essential protein-protein interactions and protein-RNA interactions required for 3' splice site selection. It has been shown that different CCCH-type Znf proteins interact with the 3'-untranslated region of various mRNA [, ]. This type of Znf is very often present in two copies. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding; PDB: 1M9O_A 1RGO_A 2CQE_A 2FC6_A 2D9M_A 2E5S_A 2RHK_C 2D9N_A 3D2S_A 3D2Q_C ....
Probab=97.96 E-value=1.2e-06 Score=62.11 Aligned_cols=27 Identities=41% Similarity=0.811 Sum_probs=22.6
Q ss_pred ccccccccCccccCCCCCCCCCCCCcc
Q psy3120 410 YKSTKCNDVQQAGYCPRGVFCAFAHVD 436 (711)
Q Consensus 410 YKTklC~~fq~tG~Cp~G~~C~FAHge 436 (711)
|||.+|.+|.+.|.|++|+.|.|+|++
T Consensus 1 ~k~~~C~~f~~~g~C~~G~~C~f~H~~ 27 (27)
T PF00642_consen 1 YKTKLCRFFMRTGTCPFGDKCRFAHGE 27 (27)
T ss_dssp TTSSB-HHHHHTS--TTGGGSSSBSSG
T ss_pred CccccChhhccCCccCCCCCcCccCCC
Confidence 799999999999999999999999984
No 13
>PF00642 zf-CCCH: Zinc finger C-x8-C-x5-C-x3-H type (and similar); InterPro: IPR000571 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents C-x8-C-x5-C-x3-H (CCCH) type Zinc finger (Znf) domains. Proteins containing CCCH Znf domains include Znf proteins from eukaryotes involved in cell cycle or growth phase-related regulation, e.g. human TIS11B (butyrate response factor 1), a probable regulatory protein involved in regulating the response to growth factors, and the mouse TTP growth factor-inducible nuclear protein, which has the same function. The mouse TTP protein is induced by growth factors. Another protein containing this domain is the human splicing factor U2AF 35kDa subunit, which plays a critical role in both constitutive and enhancer-dependent splicing by mediating essential protein-protein interactions and protein-RNA interactions required for 3' splice site selection. It has been shown that different CCCH-type Znf proteins interact with the 3'-untranslated region of various mRNA [, ]. This type of Znf is very often present in two copies. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding; PDB: 1M9O_A 1RGO_A 2CQE_A 2FC6_A 2D9M_A 2E5S_A 2RHK_C 2D9N_A 3D2S_A 3D2Q_C ....
Probab=97.49 E-value=2.8e-05 Score=55.21 Aligned_cols=27 Identities=44% Similarity=0.862 Sum_probs=21.1
Q ss_pred cccccccccccccccccccccccCCCCCC
Q psy3120 248 YKTCMCVHDTDARGLCVKNGAHCAFAHGN 276 (711)
Q Consensus 248 YKTklCk~~~~~kG~C~K~G~~C~FAHg~ 276 (711)
|||++|++|.. .|.|+ +|+.|.|+|++
T Consensus 1 ~k~~~C~~f~~-~g~C~-~G~~C~f~H~~ 27 (27)
T PF00642_consen 1 YKTKLCRFFMR-TGTCP-FGDKCRFAHGE 27 (27)
T ss_dssp TTSSB-HHHHH-TS--T-TGGGSSSBSSG
T ss_pred CccccChhhcc-CCccC-CCCCcCccCCC
Confidence 79999999985 59996 99999999974
No 14
>COG5084 YTH1 Cleavage and polyadenylation specificity factor (CPSF) Clipper subunit and related makorin family Zn-finger proteins [General function prediction only]
Probab=97.41 E-value=0.0007 Score=72.05 Aligned_cols=57 Identities=30% Similarity=0.615 Sum_probs=44.1
Q ss_pred ccccccccccccccCCCCCCCCCCCCccCCCcccccccccccccccccccccccccccccccccCCCCCC
Q psy3120 207 YSPDTYCTKYDETTGLCPDGDECPFLHRTAGDTERRYHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGN 276 (711)
Q Consensus 207 Yst~~lCpkf~e~tG~Cp~GD~C~FAHs~~G~~E~~yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~ 276 (711)
|+. ..|+.|+ -|.|+.|+.|.|+|+. + +.++ -.-.|+.|. ..|.|. .|..|.+.|.+
T Consensus 102 ~s~-V~c~~~~--~g~c~s~~~c~~lh~~--d---~~~s---~~~~c~~Fs-~~G~cs-~g~~c~~~h~d 158 (285)
T COG5084 102 SSS-VVCKFFL--RGLCKSGFSCEFLHEY--D---LRSS---QGPPCRSFS-LKGSCS-SGPSCGYSHID 158 (285)
T ss_pred cCC-cccchhc--cccCcCCCccccccCC--C---cccc---cCCCccccc-ccceec-cCCCCCccccC
Confidence 454 8999995 7999999999999973 1 1111 255699985 379995 99999999987
No 15
>smart00356 ZnF_C3H1 zinc finger.
Probab=97.22 E-value=0.00021 Score=49.38 Aligned_cols=26 Identities=46% Similarity=0.922 Sum_probs=23.7
Q ss_pred ccccccccCccccCCCCCCCCCCCCcc
Q psy3120 410 YKSTKCNDVQQAGYCPRGVFCAFAHVD 436 (711)
Q Consensus 410 YKTklC~~fq~tG~Cp~G~~C~FAHge 436 (711)
||+.+|.+| .+|.|++|+.|+|.|..
T Consensus 2 ~k~~~C~~~-~~g~C~~g~~C~~~H~~ 27 (27)
T smart00356 2 YKTELCKFF-KRGYCPYGDRCKFAHPL 27 (27)
T ss_pred CCCCcCcCc-cCCCCCCCCCcCCCCcC
Confidence 789999999 68999999999999973
No 16
>KOG4791|consensus
Probab=97.10 E-value=0.00042 Score=77.47 Aligned_cols=81 Identities=22% Similarity=0.504 Sum_probs=57.9
Q ss_pred CCCCCCCCCCCCCCCCcCccCCCCccccCCCCCCCCcCCCCCCCCCCCCCCCCcccccCccccccccccccccccccCcc
Q psy3120 341 PRLCRQGYACPQFHNSRDKRRSPRTFKYRSTPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQFHPEIYKSTKCNDVQQ 420 (711)
Q Consensus 341 ~G~C~~G~~CpFAHn~kDlRR~P~~~kYkT~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~yHP~~YKTklC~~fq~ 420 (711)
+..|.+++.|+|.|++..+. .-..|.+|..+ -.|.. .|+|.|+..... ..-.+|..|.+
T Consensus 11 ys~cKk~d~c~~rh~E~al~--------n~t~C~~w~~~------~~C~k--~C~YRHSe~~~k-----r~e~~CYwe~~ 69 (667)
T KOG4791|consen 11 YSTCKKGDSCPFRHCEAALG--------NETVCTLWQEG------RCCRK--VCRYRHSEIDKK-----RSEIPCYWENQ 69 (667)
T ss_pred hhhhhccCcCcchhhHHHhc--------Ccchhhhhhhc------Ccccc--cccchhhHHhhh-----cCcccceeecC
Confidence 35677789999999975431 13679998754 46764 899999975443 23458988875
Q ss_pred ccCCCCCCCCCCCCcchhhhccc
Q psy3120 421 AGYCPRGVFCAFAHVDTEMANAR 443 (711)
Q Consensus 421 tG~Cp~G~~C~FAHgeaELR~~~ 443 (711)
...|-. ++|-|-|....|...-
T Consensus 70 p~gC~k-~~CgfRH~~pPLkg~l 91 (667)
T KOG4791|consen 70 PTGCQK-LNCGFRHNRPPLKGVL 91 (667)
T ss_pred CCccCC-CccccccCCCchhhhc
Confidence 444997 8999999876666543
No 17
>KOG1492|consensus
Probab=97.03 E-value=0.00044 Score=71.45 Aligned_cols=135 Identities=22% Similarity=0.500 Sum_probs=89.3
Q ss_pred ccccccccccccccccccccCCCCCCCCCCCCCCchHHHhhhcCCCCCCCCCCCCCCCcccccccccCCCCCCCccceec
Q psy3120 251 CMCVHDTDARGLCVKNGAHCAFAHGNPDLRPPVYDIKEIQALENPEGDPNSNSNGPNALDKEGKLLTEDPKWQDTNYVLS 330 (711)
Q Consensus 251 klCk~~~~~kG~C~K~G~~C~FAHg~~ELR~P~~~~~Elqal~~~~~~~~~~~~Gp~~ld~~~klm~edPkw~DddfvL~ 330 (711)
..|++|.. .|.|- .|..|+|.|.+ .
T Consensus 207 vycryyna-ngicg-kgaacrfvhep------t----------------------------------------------- 231 (377)
T KOG1492|consen 207 VYCRYYNA-NGICG-KGAACRFVHEP------T----------------------------------------------- 231 (377)
T ss_pred eEEEEecC-CCccc-CCceeeeeccc------c-----------------------------------------------
Confidence 34988865 69997 79999999932 1
Q ss_pred cccccccCCC-CCCCCCCCCCCCCCCCcCccCCCCccccCCCCCCCCcCCCCCCCCCCCCCCCCcccccCcccccccccc
Q psy3120 331 NYKTEQCKRP-PRLCRQGYACPQFHNSRDKRRSPRTFKYRSTPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQFHPEI 409 (711)
Q Consensus 331 ~YKTklC~~~-~G~C~~G~~CpFAHn~kDlRR~P~~~kYkT~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~yHP~~ 409 (711)
+...||+| .|.|...++|...|... .||-|. |+.|.- |.|. ..+|+|-|-....
T Consensus 232 --rkticpkflngrcnkaedcnlsheld-prripa--------cryfll-------gkcn-npncryvhihyse------ 286 (377)
T KOG1492|consen 232 --RKTICPKFLNGRCNKAEDCNLSHELD-PRRIPA--------CRYFLL-------GKCN-NPNCRYVHIHYSE------ 286 (377)
T ss_pred --ccccChHHhcCccCchhcCCcccccC-ccccch--------hhhhhh-------ccCC-CCCceEEEEeecC------
Confidence 13358887 58899899999999864 455443 888764 5894 4789998843210
Q ss_pred ccccccccCccccCCCCCCCCCCCCcchhhhccccCCCCCCCCCCCCCCCccccccCCCCCC
Q psy3120 410 YKSTKCNDVQQAGYCPRGVFCAFAHVDTEMANAREMAGGLGPLDGTPGSGLSDILAGGLPNS 471 (711)
Q Consensus 410 YKTklC~~fq~tG~Cp~G~~C~FAHgeaELR~~~~~~~g~g~l~~s~~~s~~~~~s~~~Pp~ 471 (711)
..-.|..|.+.|.|.-|..|.-.|-..-. ++.- .|.-...+-+.+.+++|..||--
T Consensus 287 -napicfefakygfcelgtscknqhilqct----dyam-fgscnnpqcslyhgavsadvpeq 342 (377)
T KOG1492|consen 287 -NAPICFEFAKYGFCELGTSCKNQHILQCT----DYAM-FGSCNNPQCSLYHGAVSADVPEQ 342 (377)
T ss_pred -CCceeeeehhcceeccccccccceeeeec----chhh-hcCCCCCcceeecceeccCCccc
Confidence 13369999999999999999998864321 1110 11111223344555666666653
No 18
>KOG2494|consensus
Probab=96.62 E-value=0.00054 Score=73.72 Aligned_cols=62 Identities=23% Similarity=0.525 Sum_probs=49.4
Q ss_pred CCCCCCcCCCCCCCCCCCCCCCC-cccccCccccccccccccccccccCccccCCCCCCCCCCCCcchhhhccc
Q psy3120 371 TPCPNVKHGDEWGEPANCESGDL-CQYCHTRTEQQFHPEIYKSTKCNDVQQAGYCPRGVFCAFAHVDTEMANAR 443 (711)
Q Consensus 371 ~LCP~fkkG~~w~~~G~Cp~Gds-C~FAHg~~El~yHP~~YKTklC~~fq~tG~Cp~G~~C~FAHgeaELR~~~ 443 (711)
+.|+.|.++ .|.+||. |+|+|--.-. +-..-+-..|.|++ +|.|.+ +.|.|.|...+++...
T Consensus 38 eVCReF~rn-------~C~R~d~~CkfaHP~~~~--~V~~g~v~aC~Ds~-kgrCsR-~nCkylHpp~hlkdql 100 (331)
T KOG2494|consen 38 EVCREFLRN-------TCSRGDRECKFAHPPKNC--QVSNGRVIACFDSQ-KGRCSR-ENCKYLHPPQHLKDQL 100 (331)
T ss_pred HHHHHHHhc-------cccCCCccccccCCCCCC--CccCCeEEEEeccc-cCccCc-ccceecCCChhhhhhh
Confidence 679999876 8999999 9999975522 22334566799998 589999 7899999999987743
No 19
>KOG4791|consensus
Probab=96.45 E-value=0.0017 Score=72.83 Aligned_cols=84 Identities=21% Similarity=0.575 Sum_probs=59.5
Q ss_pred ccccccccCcccCCCCCcccCCCCCCCCCCCCCCCCCCCccccccccccccccccCCCCCCCCCCCCccCCCcccccccc
Q psy3120 166 EQCPLFLQHKCTQHRPFTCFHWHFMNQRRRRPVRKRDGSFNYSPDTYCTKYDETTGLCPDGDECPFLHRTAGDTERRYHL 245 (711)
Q Consensus 166 k~C~~f~qg~C~~g~p~~C~FAH~~ed~RR~P~~r~dGt~nYst~~lCpkf~e~tG~Cp~GD~C~FAHs~~G~~E~~yHP 245 (711)
.-|.+|.++.|.+++ .|+|.|...-+ ++.+ .|..|. ..-.|.+ .|+|.|+ .+-++
T Consensus 4 ~dcyff~ys~cKk~d--~c~~rh~E~al------------~n~t--~C~~w~-~~~~C~k--~C~YRHS---e~~~k--- 58 (667)
T KOG4791|consen 4 EDCYFFFYSTCKKGD--SCPFRHCEAAL------------GNET--VCTLWQ-EGRCCRK--VCRYRHS---EIDKK--- 58 (667)
T ss_pred ccchhhhhhhhhccC--cCcchhhHHHh------------cCcc--hhhhhh-hcCcccc--cccchhh---HHhhh---
Confidence 358889999999997 99999985422 2333 699886 3455764 9999997 23222
Q ss_pred ccccccccccccccccc-ccccccccCCCCCCCCCC
Q psy3120 246 RYYKTCMCVHDTDARGL-CVKNGAHCAFAHGNPDLR 280 (711)
Q Consensus 246 ~~YKTklCk~~~~~kG~-C~K~G~~C~FAHg~~ELR 280 (711)
..-.+|.+|.+ ++ |. ...|-|-|.--+|.
T Consensus 59 --r~e~~CYwe~~--p~gC~--k~~CgfRH~~pPLk 88 (667)
T KOG4791|consen 59 --RSEIPCYWENQ--PTGCQ--KLNCGFRHNRPPLK 88 (667)
T ss_pred --cCcccceeecC--CCccC--CCccccccCCCchh
Confidence 34568988864 45 97 59999999654444
No 20
>smart00356 ZnF_C3H1 zinc finger.
Probab=96.42 E-value=0.0018 Score=44.64 Aligned_cols=26 Identities=46% Similarity=0.858 Sum_probs=22.7
Q ss_pred cccccccccccccccccccccccCCCCCC
Q psy3120 248 YKTCMCVHDTDARGLCVKNGAHCAFAHGN 276 (711)
Q Consensus 248 YKTklCk~~~~~kG~C~K~G~~C~FAHg~ 276 (711)
+|+.+|.+|. .|.|. +|+.|+|+|..
T Consensus 2 ~k~~~C~~~~--~g~C~-~g~~C~~~H~~ 27 (27)
T smart00356 2 YKTELCKFFK--RGYCP-YGDRCKFAHPL 27 (27)
T ss_pred CCCCcCcCcc--CCCCC-CCCCcCCCCcC
Confidence 7899999994 58996 99999999974
No 21
>KOG2333|consensus
Probab=96.38 E-value=0.0015 Score=73.60 Aligned_cols=65 Identities=31% Similarity=0.753 Sum_probs=48.7
Q ss_pred CCCCCCCCcCCCCCCCCCCCCCCCCcccccCccccccc--cccccccccccCccccCCCCCCCCCCC--Ccchhh
Q psy3120 369 RSTPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQFH--PEIYKSTKCNDVQQAGYCPRGVFCAFA--HVDTEM 439 (711)
Q Consensus 369 kT~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~yH--P~~YKTklC~~fq~tG~Cp~G~~C~FA--HgeaEL 439 (711)
...+||.+..+ +...|++|++|+|-|..+..+.. |+. --.|.-|...|.||+|..|+|+ |-..+-
T Consensus 75 ~n~LCPsli~g----~~~~C~f~d~Crf~HDi~ayLatK~~Di--g~~Cp~f~s~G~Cp~G~~CRFl~aHld~~g 143 (614)
T KOG2333|consen 75 QNRLCPSLIQG----DISKCSFGDNCRFVHDIEAYLATKAPDI--GPSCPVFESLGFCPYGFKCRFLGAHLDIEG 143 (614)
T ss_pred hhccChHhhcC----CCccCcccccccccccHHHHHhccCccc--CCccceeeccccCCccceeehhhcccCccc
Confidence 36899999765 44699999999999998754422 111 1349999999999999999996 544433
No 22
>KOG1763|consensus
Probab=96.15 E-value=0.0011 Score=70.80 Aligned_cols=70 Identities=26% Similarity=0.498 Sum_probs=51.5
Q ss_pred ccccccccccCcccCCCCCcccCCCCCCCCCCCCCCC---------------------CCCCcccccccccccccc----
Q psy3120 164 RVEQCPLFLQHKCTQHRPFTCFHWHFMNQRRRRPVRK---------------------RDGSFNYSPDTYCTKYDE---- 218 (711)
Q Consensus 164 KTk~C~~f~qg~C~~g~p~~C~FAH~~ed~RR~P~~r---------------------~dGt~nYst~~lCpkf~e---- 218 (711)
|+..|.+|.+|.|++|+ .|.|+|..+..|+.+-.. ..|...-.++..|..|.+
T Consensus 91 KSvvCafFk~g~C~KG~--kCKFsHdl~~~~k~eK~dly~d~rdemWD~~kl~~vv~~K~~k~k~~tdiVCKfFLeAvE~ 168 (343)
T KOG1763|consen 91 KSVVCAFFKQGTCTKGD--KCKFSHDLAVERKKEKIDLYPDTRDEMWDEEKLEEVVLKKHGKPKPTTDIVCKFFLEAVEN 168 (343)
T ss_pred hHHHHHHHhccCCCCCC--cccccchHHHhhhccchhccccchhhhhhHHHHHHHHHhhccCCCCchhHHHHHHHHHHhc
Confidence 67789999999999997 999999999999877421 111122234568987753
Q ss_pred ----ccCCCCCC-CCCCCCccC
Q psy3120 219 ----TTGLCPDG-DECPFLHRT 235 (711)
Q Consensus 219 ----~tG~Cp~G-D~C~FAHs~ 235 (711)
--..||.| +.|.|.|.+
T Consensus 169 ~kYGWfW~CPnGg~~C~YrHaL 190 (343)
T KOG1763|consen 169 GKYGWFWECPNGGDKCIYRHAL 190 (343)
T ss_pred CCccceeECCCCCCeeeeeecC
Confidence 12479996 699999974
No 23
>KOG2333|consensus
Probab=95.44 E-value=0.0059 Score=69.05 Aligned_cols=57 Identities=25% Similarity=0.459 Sum_probs=41.0
Q ss_pred cccccccccc--CCCCCCCCCCCCccCCCcccccc---cccccccccccccccccccccccccccCCCCCC
Q psy3120 211 TYCTKYDETT--GLCPDGDECPFLHRTAGDTERRY---HLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGN 276 (711)
Q Consensus 211 ~lCpkf~e~t--G~Cp~GD~C~FAHs~~G~~E~~y---HP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~ 276 (711)
.+||..- .. ..|++||+|+|.|+ .|... .|+.- --|.-|. +.|.|+ ||-.|+|+-+.
T Consensus 77 ~LCPsli-~g~~~~C~f~d~Crf~HD----i~ayLatK~~Dig--~~Cp~f~-s~G~Cp-~G~~CRFl~aH 138 (614)
T KOG2333|consen 77 RLCPSLI-QGDISKCSFGDNCRFVHD----IEAYLATKAPDIG--PSCPVFE-SLGFCP-YGFKCRFLGAH 138 (614)
T ss_pred ccChHhh-cCCCccCccccccccccc----HHHHHhccCcccC--Cccceee-ccccCC-ccceeehhhcc
Confidence 6999885 44 68999999999996 45421 11111 2398886 479998 99999997543
No 24
>KOG3702|consensus
Probab=94.78 E-value=0.072 Score=62.18 Aligned_cols=70 Identities=24% Similarity=0.421 Sum_probs=45.5
Q ss_pred CCCCCCCCCcCccCCCCccccCCCCCCCCcCCCCCCCCCCCCCCCCcccccCccccccccccccccccccCccccCCCCC
Q psy3120 348 YACPQFHNSRDKRRSPRTFKYRSTPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQFHPEIYKSTKCNDVQQAGYCPRG 427 (711)
Q Consensus 348 ~~CpFAHn~kDlRR~P~~~kYkT~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~yHP~~YKTklC~~fq~tG~Cp~G 427 (711)
.+|+|.|... +.|.. |++.+|+.=.. ...+-|.|+-.|+.- .=..|||. .|++.. +|+..
T Consensus 597 sDC~~sH~~~---~~pvq--~t~ip~~~~~~----ti~~~CrY~pnCrnm---~C~F~HPk-----~cRf~~---~c~~~ 656 (681)
T KOG3702|consen 597 SDCNYSHAGR---RIPVQ--PTRIPPPFPGG----TIRGLCRYRPNCRNM---QCKFYHPK-----TCRFNT---NCPNN 656 (681)
T ss_pred ccCcccccCC---CCCCc--cccCCCCCCCC----CccccceeccCcCCc---cccccCCc-----cccccc---cCCCC
Confidence 7999999863 44554 55666664311 234567777766521 11136665 688876 68877
Q ss_pred CCCCCCCcch
Q psy3120 428 VFCAFAHVDT 437 (711)
Q Consensus 428 ~~C~FAHgea 437 (711)
..|-|+|...
T Consensus 657 ~sc~fYh~r~ 666 (681)
T KOG3702|consen 657 PSCTFYHERP 666 (681)
T ss_pred cccccccCCc
Confidence 8999999865
No 25
>COG5252 Uncharacterized conserved protein, contains CCCH-type Zn-finger protein [General function prediction only]
Probab=94.60 E-value=0.0067 Score=63.31 Aligned_cols=72 Identities=24% Similarity=0.484 Sum_probs=52.0
Q ss_pred ccccccccccccCcccCCCCCcccCCCCCCCCCCCCCC------------CCCCCcc-cccccccccccc--------cc
Q psy3120 162 EFRVEQCPLFLQHKCTQHRPFTCFHWHFMNQRRRRPVR------------KRDGSFN-YSPDTYCTKYDE--------TT 220 (711)
Q Consensus 162 ~FKTk~C~~f~qg~C~~g~p~~C~FAH~~ed~RR~P~~------------r~dGt~n-Yst~~lCpkf~e--------~t 220 (711)
.=|+..|..|..+.|.+|+ .|.|+|..++.|+.--. .+.+... -.++..|.+|-+ -.
T Consensus 82 dpK~~vcalF~~~~c~kg~--~ckF~h~~ee~r~~eK~DLYsDvRd~~ed~pl~krP~intd~VCkffieA~e~GkYgw~ 159 (299)
T COG5252 82 DPKTVVCALFLNKTCAKGD--ACKFAHGKEEARKTEKPDLYSDVRDKEEDVPLGKRPWINTDRVCKFFIEAMESGKYGWG 159 (299)
T ss_pred CchhHHHHHhccCccccCc--hhhhhcchHHHhhhcccchhhhhhhhhccCCcccCCCCChhHHHHHHHHHHhcCCccce
Confidence 3478899999999999997 89999999998875421 0111111 125689987743 22
Q ss_pred CCCCCC-CCCCCCccC
Q psy3120 221 GLCPDG-DECPFLHRT 235 (711)
Q Consensus 221 G~Cp~G-D~C~FAHs~ 235 (711)
..||+| +.|-|.|.+
T Consensus 160 W~CPng~~~C~y~H~L 175 (299)
T COG5252 160 WTCPNGNMRCSYIHKL 175 (299)
T ss_pred eeCCCCCceeeeeecc
Confidence 479996 899999975
No 26
>KOG1763|consensus
Probab=93.29 E-value=0.014 Score=62.48 Aligned_cols=31 Identities=23% Similarity=0.378 Sum_probs=25.9
Q ss_pred cccccCCCC-CCCCCCCCCCCCCCCcCccCCC
Q psy3120 333 KTEQCKRPP-RLCRQGYACPQFHNSRDKRRSP 363 (711)
Q Consensus 333 KTklC~~~~-G~C~~G~~CpFAHn~kDlRR~P 363 (711)
|+..|-+|. |.|..|+.|.|+|+....|+.+
T Consensus 91 KSvvCafFk~g~C~KG~kCKFsHdl~~~~k~e 122 (343)
T KOG1763|consen 91 KSVVCAFFKQGTCTKGDKCKFSHDLAVERKKE 122 (343)
T ss_pred hHHHHHHHhccCCCCCCcccccchHHHhhhcc
Confidence 788999985 8999999999999986555554
No 27
>PF14608 zf-CCCH_2: Zinc finger C-x8-C-x5-C-x3-H type
Probab=89.10 E-value=0.28 Score=32.68 Aligned_cols=19 Identities=42% Similarity=0.910 Sum_probs=15.3
Q ss_pred ccCCCCCCCCCCCCCCCCCC
Q psy3120 336 QCKRPPRLCRQGYACPQFHN 355 (711)
Q Consensus 336 lC~~~~G~C~~G~~CpFAHn 355 (711)
+|+++.+ |.++++|+|+|.
T Consensus 1 ~Ck~~~~-C~~~~~C~f~HP 19 (19)
T PF14608_consen 1 PCKFGPN-CTNGDNCPFSHP 19 (19)
T ss_pred CCcCcCC-CCCCCcCccCCc
Confidence 4776655 999999999994
No 28
>KOG2185|consensus
Probab=87.05 E-value=0.36 Score=54.12 Aligned_cols=31 Identities=26% Similarity=0.461 Sum_probs=26.6
Q ss_pred ccccccccccccccccccccccccccccCCCCCC
Q psy3120 243 YHLRYYKTCMCVHDTDARGLCVKNGAHCAFAHGN 276 (711)
Q Consensus 243 yHP~~YKTklCk~~~~~kG~C~K~G~~C~FAHg~ 276 (711)
.||+.---++|++|+ .|.| +++..|+|.||.
T Consensus 133 l~PTh~sMkpC~ffL--eg~C-RF~enCRfSHG~ 163 (486)
T KOG2185|consen 133 LTPTHESMKPCKFFL--EGRC-RFGENCRFSHGL 163 (486)
T ss_pred ecCcchhhccchHhh--cccc-ccCcccccccCc
Confidence 468777788999998 4889 599999999996
No 29
>COG5252 Uncharacterized conserved protein, contains CCCH-type Zn-finger protein [General function prediction only]
Probab=86.79 E-value=0.14 Score=53.85 Aligned_cols=30 Identities=30% Similarity=0.547 Sum_probs=24.8
Q ss_pred cccccCCCC-CCCCCCCCCCCCCCCcCccCC
Q psy3120 333 KTEQCKRPP-RLCRQGYACPQFHNSRDKRRS 362 (711)
Q Consensus 333 KTklC~~~~-G~C~~G~~CpFAHn~kDlRR~ 362 (711)
||..|..|. +.|..|+.|.|+|..++-|+.
T Consensus 84 K~~vcalF~~~~c~kg~~ckF~h~~ee~r~~ 114 (299)
T COG5252 84 KTVVCALFLNKTCAKGDACKFAHGKEEARKT 114 (299)
T ss_pred hhHHHHHhccCccccCchhhhhcchHHHhhh
Confidence 788898884 889999999999997665543
No 30
>PF05297 Herpes_LMP1: Herpesvirus latent membrane protein 1 (LMP1); InterPro: IPR007961 This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4). LMP1 of HHV-4 is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N terminus and a long cytoplasmic carboxy tail of 200 amino acids. HHV-4 virus latent membrane protein 1 (LMP1) is essential for HHV-4 mediated transformation and has been associated with several cases of malignancies. HHV-4-like viruses in Macaca fascicularis (Cynomolgus monkeys) have been associated with high lymphoma rates in immunosuppressed monkeys [].; GO: 0019087 transformation of host cell by virus, 0016021 integral to membrane; PDB: 1CZY_E 1ZMS_B.
Probab=86.33 E-value=0.22 Score=53.82 Aligned_cols=8 Identities=25% Similarity=0.501 Sum_probs=0.0
Q ss_pred HHHhccCC
Q psy3120 11 EISRLSGT 18 (711)
Q Consensus 11 elE~l~~~ 18 (711)
++|+=+++
T Consensus 4 ~~~rg~~g 11 (381)
T PF05297_consen 4 DLNRGRGG 11 (381)
T ss_dssp --------
T ss_pred ccccCCCC
Confidence 34443333
No 31
>COG5152 Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers [General function prediction only]
Probab=85.84 E-value=0.33 Score=50.08 Aligned_cols=33 Identities=30% Similarity=0.780 Sum_probs=28.5
Q ss_pred ccccccccccccccCccccCCCCCCCCCCCCcchhhhc
Q psy3120 404 QFHPEIYKSTKCNDVQQAGYCPRGVFCAFAHVDTEMAN 441 (711)
Q Consensus 404 ~yHP~~YKTklC~~fq~tG~Cp~G~~C~FAHgeaELR~ 441 (711)
.|+|. .|.+|..+|+|-||+-|.|.|..++...
T Consensus 138 D~qpd-----VCKdyk~TGYCGYGDsCKflH~R~D~Kt 170 (259)
T COG5152 138 DTQPD-----VCKDYKETGYCGYGDSCKFLHDRSDFKT 170 (259)
T ss_pred ecCcc-----cccchhhcccccCCchhhhhhhhhhhhc
Confidence 46776 6999999999999999999999887654
No 32
>COG5152 Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers [General function prediction only]
Probab=85.44 E-value=0.34 Score=49.99 Aligned_cols=28 Identities=43% Similarity=0.994 Sum_probs=24.6
Q ss_pred ccccccccccccccccCCCCCCCCCCCCcc
Q psy3120 205 FNYSPDTYCTKYDETTGLCPDGDECPFLHR 234 (711)
Q Consensus 205 ~nYst~~lCpkf~e~tG~Cp~GD~C~FAHs 234 (711)
..|.+ ..|..|. .+|.|-|||.|.|+|.
T Consensus 137 iD~qp-dVCKdyk-~TGYCGYGDsCKflH~ 164 (259)
T COG5152 137 IDTQP-DVCKDYK-ETGYCGYGDSCKFLHD 164 (259)
T ss_pred eecCc-ccccchh-hcccccCCchhhhhhh
Confidence 35777 4899998 7999999999999996
No 33
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=84.42 E-value=17 Score=35.99 Aligned_cols=11 Identities=18% Similarity=0.283 Sum_probs=5.8
Q ss_pred HHHHHHHHhhH
Q psy3120 39 KCIQSQLRGDL 49 (711)
Q Consensus 39 ~slQsQLr~dL 49 (711)
+.+-+|+..|.
T Consensus 54 ~~iA~~i~~~~ 64 (181)
T PF08006_consen 54 KEIAREILAEY 64 (181)
T ss_pred HHHHHHHHHhh
Confidence 44555555553
No 34
>PF14608 zf-CCCH_2: Zinc finger C-x8-C-x5-C-x3-H type
Probab=83.68 E-value=0.8 Score=30.53 Aligned_cols=19 Identities=32% Similarity=0.683 Sum_probs=14.6
Q ss_pred ccccCccccCCCCCCCCCCCCc
Q psy3120 414 KCNDVQQAGYCPRGVFCAFAHV 435 (711)
Q Consensus 414 lC~~fq~tG~Cp~G~~C~FAHg 435 (711)
+|++|. .|.+++.|.|+|.
T Consensus 1 ~Ck~~~---~C~~~~~C~f~HP 19 (19)
T PF14608_consen 1 PCKFGP---NCTNGDNCPFSHP 19 (19)
T ss_pred CCcCcC---CCCCCCcCccCCc
Confidence 477775 4888889999884
No 35
>KOG2185|consensus
Probab=82.73 E-value=0.73 Score=51.74 Aligned_cols=32 Identities=25% Similarity=0.421 Sum_probs=26.4
Q ss_pred cccccccccccccCccccCCCCCCCCCCCCcch
Q psy3120 405 FHPEIYKSTKCNDVQQAGYCPRGVFCAFAHVDT 437 (711)
Q Consensus 405 yHP~~YKTklC~~fq~tG~Cp~G~~C~FAHgea 437 (711)
.||.---.++|.+|. .|.|.++..|+|.||..
T Consensus 133 l~PTh~sMkpC~ffL-eg~CRF~enCRfSHG~~ 164 (486)
T KOG2185|consen 133 LTPTHESMKPCKFFL-EGRCRFGENCRFSHGLD 164 (486)
T ss_pred ecCcchhhccchHhh-ccccccCcccccccCcc
Confidence 356655678899998 58999999999999964
No 36
>KOG1813|consensus
Probab=82.04 E-value=0.45 Score=51.44 Aligned_cols=29 Identities=48% Similarity=1.127 Sum_probs=25.8
Q ss_pred CccccccccccccccccCCCCCCCCCCCCcc
Q psy3120 204 SFNYSPDTYCTKYDETTGLCPDGDECPFLHR 234 (711)
Q Consensus 204 t~nYst~~lCpkf~e~tG~Cp~GD~C~FAHs 234 (711)
...|.++ +|..|. .+|.|-+||.|.|.|.
T Consensus 181 ~~d~qpD-icKdyk-eTgycg~gdSckFlh~ 209 (313)
T KOG1813|consen 181 RIDYQPD-ICKDYK-ETGYCGYGDSCKFLHD 209 (313)
T ss_pred eeecCch-hhhhhH-hhCcccccchhhhhhh
Confidence 4678885 999997 7999999999999996
No 37
>KOG3702|consensus
Probab=81.92 E-value=5 Score=47.61 Aligned_cols=85 Identities=22% Similarity=0.442 Sum_probs=48.8
Q ss_pred ccccCCCCCCCCCCCCCCCCCCCcC--ccCCCC-----ccccCCCCCCCCcCCCCCCCCCCCCCCCCcccccCccccccc
Q psy3120 334 TEQCKRPPRLCRQGYACPQFHNSRD--KRRSPR-----TFKYRSTPCPNVKHGDEWGEPANCESGDLCQYCHTRTEQQFH 406 (711)
Q Consensus 334 TklC~~~~G~C~~G~~CpFAHn~kD--lRR~P~-----~~kYkT~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~El~yH 406 (711)
..+|+++. .|. +..|-|+|..-. ...-|. ...|....|.+= ..|.. ..|.|+|+..-.
T Consensus 544 l~~Cky~~-~Ct-~a~Ce~~HPtaa~~~~s~p~k~fa~~~~ks~p~Ck~~---------~kCta-sDC~~sH~~~~~--- 608 (681)
T KOG3702|consen 544 LTRCKYGP-ACT-SAECEFAHPTAAENAKSLPNKKFASKCLKSHPGCKFG---------KKCTA-SDCNYSHAGRRI--- 608 (681)
T ss_pred eccccCCC-cCC-chhhhhcCCcchhhhhccccccccccceecccccccc---------ccccc-ccCcccccCCCC---
Confidence 45677653 355 789999998732 222222 122333444432 35643 568999987543
Q ss_pred ccccc-----------ccccccCccccCCCCCCCCCCCCcch
Q psy3120 407 PEIYK-----------STKCNDVQQAGYCPRGVFCAFAHVDT 437 (711)
Q Consensus 407 P~~YK-----------TklC~~fq~tG~Cp~G~~C~FAHgea 437 (711)
|-.+| +.+|+++ ++|.. ..|+|.|...
T Consensus 609 pvq~t~ip~~~~~~ti~~~CrY~---pnCrn-m~C~F~HPk~ 646 (681)
T KOG3702|consen 609 PVQPTRIPPPFPGGTIRGLCRYR---PNCRN-MQCKFYHPKT 646 (681)
T ss_pred CCccccCCCCCCCCCccccceec---cCcCC-ccccccCCcc
Confidence 22223 3456665 35775 7999999754
No 38
>PF05656 DUF805: Protein of unknown function (DUF805); InterPro: IPR008523 This entry is represented by Lactobacillus phage LBR48, DUF805. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.; GO: 0016021 integral to membrane
Probab=81.65 E-value=16 Score=33.24 Aligned_cols=25 Identities=20% Similarity=0.300 Sum_probs=15.7
Q ss_pred ccccccccccceEehhHHHHHHHHH
Q psy3120 65 MVCEFVRKRSKYIIIIIIIIIIIII 89 (711)
Q Consensus 65 ~~C~e~~~~~~~~~~~~~~~~~~~~ 89 (711)
...+.+..|..|....++.+++.++
T Consensus 3 f~f~GR~~R~~fw~~~l~~~~~~~~ 27 (120)
T PF05656_consen 3 FNFKGRISRKEFWWFFLINILIFIL 27 (120)
T ss_pred ccCcCCcCHHHHHHHHHHHHHHHHH
Confidence 3556788888888666554444333
No 39
>PF09972 DUF2207: Predicted membrane protein (DUF2207); InterPro: IPR018702 This domain has no known function.
Probab=78.99 E-value=15 Score=40.37 Aligned_cols=14 Identities=7% Similarity=0.147 Sum_probs=5.5
Q ss_pred HHHhhHHHHHHHHh
Q psy3120 44 QLRGDLEKVDKILY 57 (711)
Q Consensus 44 QLr~dLE~VdkViy 57 (711)
.++..+++..+.+.
T Consensus 363 ~~~~~~~~~~~~i~ 376 (511)
T PF09972_consen 363 RFYKAFKKWQEAIK 376 (511)
T ss_pred HHHHHHHHHHHHHH
Confidence 33334444443333
No 40
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=72.83 E-value=44 Score=33.01 Aligned_cols=12 Identities=8% Similarity=0.244 Sum_probs=5.9
Q ss_pred HHHHhhHHHHHH
Q psy3120 43 SQLRGDLEKVDK 54 (711)
Q Consensus 43 sQLr~dLE~Vdk 54 (711)
+|...+|..-++
T Consensus 44 eeii~~LG~P~~ 55 (181)
T PF08006_consen 44 EEIIAELGSPKE 55 (181)
T ss_pred HHHHHHcCCHHH
Confidence 455555544444
No 41
>TIGR01654 bact_immun_7tm bacteriocin-associated integral membrane (putative immunity) protein. This model represents a family of integral membrane proteins, most of which are about 650 residues in size and predicted to span the membrane seven times. Nearly half of the members of this family are found in association with a member of the lactococcin 972 family of bacteriocins (TIGR01653). Others may be associated with uncharacterized proteins that may also act as bacteriocins. Although this protein is suggested to be an immunity protein, and the bacteriocin is suggested to be exported by a Sec-dependent process, the role of this protein is unclear.
Probab=68.12 E-value=67 Score=38.39 Aligned_cols=22 Identities=18% Similarity=0.040 Sum_probs=11.4
Q ss_pred HHHHHHHHHhccCCCCCcccCc
Q psy3120 5 VKQYEKEISRLSGTPSIVCLSS 26 (711)
Q Consensus 5 ~k~LqeelE~l~~~~~~~~L~~ 26 (711)
.++|++.++.+...+....+++
T Consensus 535 ~~~l~~~l~~~~~~~~~~~~~s 556 (679)
T TIGR01654 535 YQALLPLLKKYLLDGNISHLSS 556 (679)
T ss_pred HHHHHHHHHHcCCccccceeee
Confidence 4556666666654344433444
No 42
>PF09972 DUF2207: Predicted membrane protein (DUF2207); InterPro: IPR018702 This domain has no known function.
Probab=66.14 E-value=24 Score=38.90 Aligned_cols=18 Identities=22% Similarity=0.309 Sum_probs=10.6
Q ss_pred hHHHHHHHHHHHhhHHHH
Q psy3120 35 ISALKCIQSQLRGDLEKV 52 (711)
Q Consensus 35 L~~L~slQsQLr~dLE~V 52 (711)
-..++..|.+++..+++-
T Consensus 365 ~~~~~~~~~~i~~~~~~~ 382 (511)
T PF09972_consen 365 YKAFKKWQEAIKKELKER 382 (511)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 445666666666666443
No 43
>KOG1813|consensus
Probab=65.47 E-value=2.1 Score=46.56 Aligned_cols=30 Identities=33% Similarity=0.903 Sum_probs=26.3
Q ss_pred cccccccccccccCccccCCCCCCCCCCCCcchhh
Q psy3120 405 FHPEIYKSTKCNDVQQAGYCPRGVFCAFAHVDTEM 439 (711)
Q Consensus 405 yHP~~YKTklC~~fq~tG~Cp~G~~C~FAHgeaEL 439 (711)
|+|. +|++|..+|+|-||+-|.|-|...+.
T Consensus 184 ~qpD-----icKdykeTgycg~gdSckFlh~r~Dy 213 (313)
T KOG1813|consen 184 YQPD-----ICKDYKETGYCGYGDSCKFLHDRSDY 213 (313)
T ss_pred cCch-----hhhhhHhhCcccccchhhhhhhhhhc
Confidence 5665 79999999999999999999988765
No 44
>COG4758 Predicted membrane protein [Function unknown]
Probab=64.69 E-value=61 Score=34.60 Aligned_cols=7 Identities=29% Similarity=0.283 Sum_probs=3.3
Q ss_pred cccccCC
Q psy3120 266 NGAHCAF 272 (711)
Q Consensus 266 ~G~~C~F 272 (711)
||+.=.|
T Consensus 188 ~Gdv~~f 194 (235)
T COG4758 188 YGDVKLF 194 (235)
T ss_pred eeeeeee
Confidence 5554444
No 45
>KOG0153|consensus
Probab=50.99 E-value=10 Score=42.29 Aligned_cols=23 Identities=35% Similarity=0.966 Sum_probs=20.3
Q ss_pred ccccccccccCCCCCCCCCCCCccC
Q psy3120 211 TYCTKYDETTGLCPDGDECPFLHRT 235 (711)
Q Consensus 211 ~lCpkf~e~tG~Cp~GD~C~FAHs~ 235 (711)
.+|.+| ..|.|++|++|+|-|..
T Consensus 162 ~Icsf~--v~geckRG~ec~yrhEk 184 (377)
T KOG0153|consen 162 HICSFF--VKGECKRGAECPYRHEK 184 (377)
T ss_pred ccccce--eeccccccccccccccC
Confidence 689988 47899999999999974
No 46
>COG3152 Predicted membrane protein [Function unknown]
Probab=48.57 E-value=2e+02 Score=27.93 Aligned_cols=25 Identities=20% Similarity=0.228 Sum_probs=17.4
Q ss_pred cccccccccccceEehhHHHHHHHH
Q psy3120 64 CMVCEFVRKRSKYIIIIIIIIIIII 88 (711)
Q Consensus 64 C~~C~e~~~~~~~~~~~~~~~~~~~ 88 (711)
...=..+.+|+.|...+...+|+.+
T Consensus 11 yf~f~GR~~R~~fW~~~L~~~ii~~ 35 (125)
T COG3152 11 YFSFSGRIGRSEFWYVILFNLIIGV 35 (125)
T ss_pred hcCccCccchHHHHHHHHHHHHHHH
Confidence 4555678899999877666555444
No 47
>PRK13862 putative crown gall tumor protein VirC2; Provisional
Probab=44.69 E-value=7.1 Score=39.67 Aligned_cols=32 Identities=44% Similarity=0.650 Sum_probs=24.7
Q ss_pred CCCCCCC-CCcccccccCCCCCCCCCccccccc
Q psy3120 659 NNFSPTT-SSPLSHFLSSRPPAPGFGTTADNLN 690 (711)
Q Consensus 659 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 690 (711)
+-.|+++ +--+--|||.||||||...+-|||-
T Consensus 71 dalss~~~PEKvQVFLSArpPAp~vSk~yD~Li 103 (201)
T PRK13862 71 DALSSTTAPEKVQVFLSARPPAPEVSKIYDNLI 103 (201)
T ss_pred hhccCCCCchheEEeeecCCCCccHHHHHHHHH
Confidence 3344433 3467889999999999999999984
No 48
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=43.80 E-value=10 Score=37.46 Aligned_cols=7 Identities=14% Similarity=0.358 Sum_probs=3.4
Q ss_pred ccccccc
Q psy3120 62 TKCMVCE 68 (711)
Q Consensus 62 ~kC~~C~ 68 (711)
..|..-+
T Consensus 50 ~~C~~~~ 56 (179)
T PF13908_consen 50 GSCDNYD 56 (179)
T ss_pred ccccccc
Confidence 3555544
No 49
>TIGR01654 bact_immun_7tm bacteriocin-associated integral membrane (putative immunity) protein. This model represents a family of integral membrane proteins, most of which are about 650 residues in size and predicted to span the membrane seven times. Nearly half of the members of this family are found in association with a member of the lactococcin 972 family of bacteriocins (TIGR01653). Others may be associated with uncharacterized proteins that may also act as bacteriocins. Although this protein is suggested to be an immunity protein, and the bacteriocin is suggested to be exported by a Sec-dependent process, the role of this protein is unclear.
Probab=38.19 E-value=1.4e+02 Score=35.82 Aligned_cols=23 Identities=30% Similarity=0.329 Sum_probs=11.4
Q ss_pred cCchHHHHHHHHHHHhhHHHHHH
Q psy3120 32 SMSISALKCIQSQLRGDLEKVDK 54 (711)
Q Consensus 32 ~LpL~~L~slQsQLr~dLE~Vdk 54 (711)
++....|-+-+.|+-.+.+++++
T Consensus 130 ~l~~~~~~~~l~~~~~~~~~~~~ 152 (679)
T TIGR01654 130 RLSITKLQDYLNQLGSKFAKDRK 152 (679)
T ss_pred CccHHHHHHHHhhhhhhhhhccc
Confidence 34445555555555544444444
No 50
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=36.45 E-value=1.1e+02 Score=32.03 Aligned_cols=45 Identities=24% Similarity=0.371 Sum_probs=27.9
Q ss_pred hhHHHHHHHHHHhccCCCCCcccCcccccccCchHHHHHHHHH---HHhhHHHHHHHH
Q psy3120 2 LNQVKQYEKEISRLSGTPSIVCLSSNADLNSMSISALKCIQSQ---LRGDLEKVDKIL 56 (711)
Q Consensus 2 ~~q~k~LqeelE~l~~~~~~~~L~~~~dL~~LpL~~L~slQsQ---Lr~dLE~VdkVi 56 (711)
+++++.|+++.++|.. .|.+-. .++-+..++.+ .|.+||..++-+
T Consensus 138 ~arl~~l~~~~~rl~~-----ll~ka~-----~~~d~l~ie~~L~~v~~eIe~~~~~~ 185 (262)
T PF14257_consen 138 EARLKNLEAEEERLLE-----LLEKAK-----TVEDLLEIERELSRVRSEIEQLEGQL 185 (262)
T ss_pred HHHHHHHHHHHHHHHH-----HHHhcC-----CHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5678888888888742 122211 55655666655 577888777744
No 51
>PF10650 zf-C3H1: Putative zinc-finger domain; InterPro: IPR019607 This domain is conserved in fungi and might be a zinc-finger domain as it contains three conserved Cs and an H in the C-x8-C-x5-C-x3-H conformation typical of a zinc-finger.
Probab=35.63 E-value=21 Score=25.46 Aligned_cols=20 Identities=35% Similarity=0.735 Sum_probs=16.6
Q ss_pred cccccccC-cccCCCCCcccCCCC
Q psy3120 167 QCPLFLQH-KCTQHRPFTCFHWHF 189 (711)
Q Consensus 167 ~C~~f~qg-~C~~g~p~~C~FAH~ 189 (711)
+|++...| .|+.. +|.|-|.
T Consensus 2 lC~yEl~Gg~Cnd~---~C~~QHf 22 (23)
T PF10650_consen 2 LCPYELTGGVCNDP---DCEFQHF 22 (23)
T ss_pred CCccccCCCeeCCC---CCCcccc
Confidence 58988877 99865 6999996
No 52
>smart00793 AgrB Accessory gene regulator B. The accessory gene regulator (agr) of Staphylococcus aureus is the central regulatory system that controls the gene expression for a large set of virulence factors. The arg locus consists of two transcripts: RNAII and RNAIII. RNAII encodes four genes (agrA, B, C, and D) whose gene products assemble a quorum sensing system. At low cell density, the agr genes are continuously expressed at basal levels. A signal molecule, autoinducing peptide (AIP), produced and secreted by the bacteria, accumulates outside of the cells. When the cell density increases and the AIP concentration reaches a threshold, it activates the agr response, i.e. activation of secreted protein gene expression and subsequent repression of cell wall-associated protein genes. AgrB and AgrD are essential for the production of the autoinducing peptide which functions as a signal for quorum sensing. AgrB is a transmembrane protein PUBMED:11195102. AgrB is involved in the proteolyt
Probab=35.43 E-value=4.8e+02 Score=26.25 Aligned_cols=21 Identities=24% Similarity=0.093 Sum_probs=11.6
Q ss_pred HHHHHHHhhHHHHHHHHhhhh
Q psy3120 40 CIQSQLRGDLEKVDKILYLQT 60 (711)
Q Consensus 40 slQsQLr~dLE~VdkViy~~~ 60 (711)
.+++++-.|=|+.|++.|-.+
T Consensus 9 ~l~~~~~~~~~~~e~~~YGle 29 (184)
T smart00793 9 YLQKRNNLDHIQLLKIKYGLE 29 (184)
T ss_pred HHHHhcCCCHHHHHHHHHHHH
Confidence 344455555566666666544
No 53
>KOG0860|consensus
Probab=34.83 E-value=23 Score=34.10 Aligned_cols=21 Identities=33% Similarity=0.419 Sum_probs=11.9
Q ss_pred HHHHHHHHHHhhH----HHHHHHHh
Q psy3120 37 ALKCIQSQLRGDL----EKVDKILY 57 (711)
Q Consensus 37 ~L~slQsQLr~dL----E~VdkViy 57 (711)
+|..+|+|+-.=. +-||||+=
T Consensus 30 k~~~tq~QvdeVv~IMr~NV~KVlE 54 (116)
T KOG0860|consen 30 KLQQTQAQVDEVVDIMRENVEKVLE 54 (116)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHH
Confidence 6777777764321 34555553
No 54
>COG3887 Predicted signaling protein consisting of a modified GGDEF domain and a DHH domain [Signal transduction mechanisms]
Probab=34.55 E-value=99 Score=37.18 Aligned_cols=8 Identities=38% Similarity=0.563 Sum_probs=3.4
Q ss_pred CCCCCCCC
Q psy3120 659 NNFSPTTS 666 (711)
Q Consensus 659 ~~~~~~~~ 666 (711)
+||+-.|+
T Consensus 501 KnFt~rTg 508 (655)
T COG3887 501 KNFTLRTG 508 (655)
T ss_pred ccceeecc
Confidence 44444443
No 55
>KOG1039|consensus
Probab=33.42 E-value=16 Score=40.65 Aligned_cols=24 Identities=33% Similarity=0.751 Sum_probs=21.3
Q ss_pred ccccccccccccccccccccCCCCCCC
Q psy3120 251 CMCVHDTDARGLCVKNGAHCAFAHGNP 277 (711)
Q Consensus 251 klCk~~~~~kG~C~K~G~~C~FAHg~~ 277 (711)
.+|+++. +|+| ++|..|+|.|...
T Consensus 9 tic~~~~--~g~c-~~g~~cr~~h~~~ 32 (344)
T KOG1039|consen 9 TICKYYQ--KGNC-KFGDLCRLSHSLP 32 (344)
T ss_pred hhhhhcc--cccc-cccceeeeeccCc
Confidence 6899986 6999 5999999999876
No 56
>PF03208 PRA1: PRA1 family protein; InterPro: IPR004895 This family includes yeast hypothetical proteins and the uncharacterised rat prenylated rab acceptor protein PRA1.
Probab=33.13 E-value=3.2e+02 Score=26.14 Aligned_cols=26 Identities=15% Similarity=0.277 Sum_probs=18.0
Q ss_pred CcccccccCchH-HHHHHHHHHHhhHH
Q psy3120 25 SSNADLNSMSIS-ALKCIQSQLRGDLE 50 (711)
Q Consensus 25 ~~~~dL~~LpL~-~L~slQsQLr~dLE 50 (711)
+.+=|.++..+| +...+..+++..|.
T Consensus 12 ~eF~~~~~fs~P~~~~~~~~Ri~~Nl~ 38 (153)
T PF03208_consen 12 REFFDTSRFSVPSSFSEAKSRIKRNLS 38 (153)
T ss_pred HHHhCccCCCCCCCHHHHHHHHHHHHH
Confidence 344455677787 67888888877764
No 57
>KOG0153|consensus
Probab=30.49 E-value=29 Score=38.84 Aligned_cols=36 Identities=28% Similarity=0.635 Sum_probs=26.6
Q ss_pred ccCCCCccccCCCCCCCCcCCCCCCCCCCCCCCCCcccccCcc
Q psy3120 359 KRRSPRTFKYRSTPCPNVKHGDEWGEPANCESGDLCQYCHTRT 401 (711)
Q Consensus 359 lRR~P~~~kYkT~LCP~fkkG~~w~~~G~Cp~GdsC~FAHg~~ 401 (711)
.|+.|..-+-.+..|..|.+| .|.+|+.|.|.|-..
T Consensus 150 ~rt~p~ykrn~p~Icsf~v~g-------eckRG~ec~yrhEkp 185 (377)
T KOG0153|consen 150 QRTTPYYKRNRPHICSFFVKG-------ECKRGAECPYRHEKP 185 (377)
T ss_pred hccCccccCCCCccccceeec-------cccccccccccccCC
Confidence 456664333457789988654 899999999999754
No 58
>KOG1039|consensus
Probab=30.45 E-value=19 Score=40.06 Aligned_cols=25 Identities=32% Similarity=0.634 Sum_probs=22.5
Q ss_pred cccccCccccCCCCCCCCCCCCcchh
Q psy3120 413 TKCNDVQQAGYCPRGVFCAFAHVDTE 438 (711)
Q Consensus 413 klC~~fq~tG~Cp~G~~C~FAHgeaE 438 (711)
+.|++|+ .|+|.+|..|+|.|....
T Consensus 9 tic~~~~-~g~c~~g~~cr~~h~~~~ 33 (344)
T KOG1039|consen 9 TICKYYQ-KGNCKFGDLCRLSHSLPD 33 (344)
T ss_pred hhhhhcc-cccccccceeeeeccCch
Confidence 6899998 699999999999999874
No 59
>PF02009 Rifin_STEVOR: Rifin/stevor family; InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=30.16 E-value=53 Score=36.03 Aligned_cols=25 Identities=16% Similarity=0.263 Sum_probs=15.0
Q ss_pred hhHHHHHHHHhhhhccccccccccc
Q psy3120 47 GDLEKVDKILYLQTATKCMVCEFVR 71 (711)
Q Consensus 47 ~dLE~VdkViy~~~a~kC~~C~e~~ 71 (711)
++.-.+=+.+|......|....-..
T Consensus 172 ~~~~~l~~~i~~~~~~~c~~~~~~~ 196 (299)
T PF02009_consen 172 SNVSTLVGIIYAKYNTMCASSGAGS 196 (299)
T ss_pred CcHHHHHHHHHHHhhccccccccCC
Confidence 3444555666777777787665433
No 60
>COG4709 Predicted membrane protein [Function unknown]
Probab=29.49 E-value=2.3e+02 Score=29.65 Aligned_cols=8 Identities=25% Similarity=0.351 Sum_probs=2.9
Q ss_pred HHHHHHhc
Q psy3120 8 YEKEISRL 15 (711)
Q Consensus 8 LqeelE~l 15 (711)
|+.-||+|
T Consensus 10 L~~yL~~L 17 (195)
T COG4709 10 LEQYLEGL 17 (195)
T ss_pred HHHHHHhC
Confidence 33333333
No 61
>COG4758 Predicted membrane protein [Function unknown]
Probab=29.40 E-value=5.2e+02 Score=27.90 Aligned_cols=6 Identities=17% Similarity=0.346 Sum_probs=2.6
Q ss_pred hccccc
Q psy3120 160 LKEFRV 165 (711)
Q Consensus 160 L~~FKT 165 (711)
+-.+|+
T Consensus 101 lv~~~~ 106 (235)
T COG4758 101 LVNLKK 106 (235)
T ss_pred hhhccc
Confidence 334444
No 62
>KOG0810|consensus
Probab=28.61 E-value=30 Score=37.83 Aligned_cols=19 Identities=32% Similarity=0.411 Sum_probs=10.4
Q ss_pred cccccccccccceEehhHH
Q psy3120 64 CMVCEFVRKRSKYIIIIII 82 (711)
Q Consensus 64 C~~C~e~~~~~~~~~~~~~ 82 (711)
=++=|..+|+-|.|+||+.
T Consensus 260 Av~~qkkaRK~k~i~ii~~ 278 (297)
T KOG0810|consen 260 AVKYQKKARKWKIIIIIIL 278 (297)
T ss_pred HHHHHHHhhhceeeeehHH
Confidence 3455666666665554443
No 63
>PRK05978 hypothetical protein; Provisional
Probab=27.12 E-value=3e+02 Score=27.55 Aligned_cols=16 Identities=19% Similarity=0.482 Sum_probs=10.7
Q ss_pred hhhhcccccccccccc
Q psy3120 57 YLQTATKCMVCEFVRK 72 (711)
Q Consensus 57 y~~~a~kC~~C~e~~~ 72 (711)
|.+-+.+|.+|++.-.
T Consensus 48 ~Lkv~~~C~~CG~~~~ 63 (148)
T PRK05978 48 FLKPVDHCAACGEDFT 63 (148)
T ss_pred ccccCCCccccCCccc
Confidence 5566777888876533
No 64
>PF01486 K-box: K-box region; InterPro: IPR002487 MADS genes in plants encode key developmental regulators of vegetative and reproductive development. The majority of the plant MADS proteins share a stereotypical MIKC structure. It comprises (from N- to C-terminal) an N-terminal domain, which is, however, present only in a minority of proteins; a MADS domain (see PDOC00302 from PROSITEDOC, IPR002100 from INTERPRO), which is the major determinant of DNA-binding but which also performs dimerisation and accessory factor binding functions; a weakly conserved intervening (I) domain, which constitutes a key molecular determinant for the selective formation of DNA-binding dimers; a keratin-like (K-box) domain, which promotes protein dimerisation; and a C-terminal (C) domain, which is involved in transcriptional activation or in the formation of ternary or quaternary protein complexes. The 80-amino acid K-box domain was originally identified as a region with low but significant similarity to a region of keratin, which is part of the coiled-coil sequence constituting the central rod-shaped domain of keratin [, , ]. The K-box protein-protein interaction domain which mediates heterodimerization of MIKC-type MADS proteins contains several heptad repeats in which the first and the fourth positions are occupied by hydrophobic amino acids suggesting that the K-box domain forms three amphipathic alpha-helices referred to as K1, K2, and K3 [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=26.60 E-value=75 Score=28.77 Aligned_cols=48 Identities=27% Similarity=0.435 Sum_probs=34.2
Q ss_pred hHHHHHHHHHHhccCCCCCcccCcccccccCchHHHHHHHHHHHhhHHHHH
Q psy3120 3 NQVKQYEKEISRLSGTPSIVCLSSNADLNSMSISALKCIQSQLRGDLEKVD 53 (711)
Q Consensus 3 ~q~k~LqeelE~l~~~~~~~~L~~~~dL~~LpL~~L~slQsQLr~dLE~Vd 53 (711)
.++..|+++++.|.. ....+.. .||+.|++.-|..|..||-.-|..|-
T Consensus 19 ~e~~~L~~~~~~L~~--~~R~~~G-edL~~Ls~~eL~~LE~~Le~aL~~VR 66 (100)
T PF01486_consen 19 QEIAKLRKENESLQK--ELRHLMG-EDLESLSLKELQQLEQQLESALKRVR 66 (100)
T ss_pred HHHHHHHHHHHHHHH--HHhcccc-ccccccchHHHHHHHHhhhhhHHHHH
Confidence 356677777777762 1111221 49999999999999999998887764
No 65
>COG4652 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.23 E-value=4.6e+02 Score=32.07 Aligned_cols=24 Identities=8% Similarity=0.115 Sum_probs=11.8
Q ss_pred HHHHHHHHHHhhhcccccchhhhh
Q psy3120 137 IIIIIIIIIIIIIDLIFDFTFMYL 160 (711)
Q Consensus 137 ~~~~~~~~~~~~~~~~k~~h~~yL 160 (711)
+.++.+++.+.+++.++.....-|
T Consensus 631 f~~~~llil~~qmk~e~~~~~~vL 654 (657)
T COG4652 631 FLLEQLLILLVQMKKEEKGFMNVL 654 (657)
T ss_pred HHHHHHHHHHHHHHHHhhhhhHHh
Confidence 334444555566666554443333
No 66
>PF07181 VirC2: VirC2 protein; InterPro: IPR009841 This family consists of several VirC2 proteins which seem to be found exclusively in Agrobacterium species and Rhizobium etli. VirC2 is known to be involved in virulence in Agrobacterium species but its exact function is unclear [, ].; PDB: 2RH3_A.
Probab=25.27 E-value=16 Score=37.52 Aligned_cols=28 Identities=43% Similarity=0.761 Sum_probs=14.8
Q ss_pred CCCCCcccccccCCCCCCCCCccccccc
Q psy3120 663 PTTSSPLSHFLSSRPPAPGFGTTADNLN 690 (711)
Q Consensus 663 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 690 (711)
|+..+-+--|||-||||||...+-|||-
T Consensus 76 ~~~p~KvqVflSarpPap~VS~~yD~Li 103 (202)
T PF07181_consen 76 PASPEKVQVFLSARPPAPGVSKIYDALI 103 (202)
T ss_dssp -------EEEEEE----TTS-HHHHHHT
T ss_pred CCCCcceEEEEecCCCCCcccHHHHHHH
Confidence 4555668889999999999999999985
No 67
>PF00664 ABC_membrane: ABC transporter transmembrane region; InterPro: IPR001140 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energise diverse biological systems. ABC transporters minimally consist of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain []. The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyse ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarise the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site [, , ]. The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis [, , , , , ]. The ATP-Binding Cassette (ABC) superfamily forms one of the largest of all protein families with a diversity of physiological functions []. Several studies have shown that there is a correlation between the functional characterisation and the phylogenetic classification of the ABC cassette [, ]. More than 50 subfamilies have been described based on a phylogenetic and functional classification [, , ]; (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). A variety of ATP-binding transport proteins have a six transmembrane helical region. They are all integral membrane proteins involved in a variety of transport systems. Members of this family include; the cystic fibrosis transmembrane conductance regulator (CFTR), bacterial leukotoxin secretion ATP-binding protein, multidrug resistance proteins, the yeast leptomycin B resistance protein, the mammalian sulphonylurea receptor and antigen peptide transporter 2. Many of these proteins have two such regions.; GO: 0005524 ATP binding, 0042626 ATPase activity, coupled to transmembrane movement of substances, 0006810 transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3G61_B 3G5U_B 3G60_A 3B60_D 3QF4_B 2HYD_A 2ONJ_A 4A82_B 4AA3_A 2YL4_A.
Probab=24.70 E-value=4.1e+02 Score=25.29 Aligned_cols=19 Identities=21% Similarity=0.459 Sum_probs=11.3
Q ss_pred HHHHHHHHhhHHHHHHHHh
Q psy3120 39 KCIQSQLRGDLEKVDKILY 57 (711)
Q Consensus 39 ~slQsQLr~dLE~VdkViy 57 (711)
-.+.+++..|.++|+..+.
T Consensus 98 g~l~~~i~~d~~~i~~~~~ 116 (275)
T PF00664_consen 98 GELLSRITNDIEQIENFLS 116 (275)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred ccccccccccccccccccc
Confidence 3455666666666665543
No 68
>PF06170 DUF983: Protein of unknown function (DUF983); InterPro: IPR009325 This family consists of several bacterial proteins of unknown function.
Probab=21.75 E-value=1.6e+02 Score=26.74 Aligned_cols=16 Identities=31% Similarity=0.702 Sum_probs=12.5
Q ss_pred hhhhcccccccccccc
Q psy3120 57 YLQTATKCMVCEFVRK 72 (711)
Q Consensus 57 y~~~a~kC~~C~e~~~ 72 (711)
|.+-+.+|.+|.+.-.
T Consensus 4 ~Lk~~~~C~~CG~d~~ 19 (86)
T PF06170_consen 4 YLKVAPRCPHCGLDYS 19 (86)
T ss_pred cccCCCcccccCCccc
Confidence 6677889999997643
No 69
>PF00957 Synaptobrevin: Synaptobrevin; InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=21.36 E-value=36 Score=29.91 Aligned_cols=12 Identities=8% Similarity=0.462 Sum_probs=6.6
Q ss_pred HHHHHHHHHHhc
Q psy3120 4 QVKQYEKEISRL 15 (711)
Q Consensus 4 q~k~LqeelE~l 15 (711)
++.++|++++..
T Consensus 4 kl~~i~~~v~~v 15 (89)
T PF00957_consen 4 KLEQIQEQVEEV 15 (89)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHH
Confidence 455556655544
No 70
>PRK13899 type IV secretion system protein VirB3; Provisional
Probab=21.02 E-value=2.3e+02 Score=26.18 Aligned_cols=18 Identities=17% Similarity=0.246 Sum_probs=10.1
Q ss_pred ccchhhhhcccccccccc
Q psy3120 153 FDFTFMYLKEFRVEQCPL 170 (711)
Q Consensus 153 k~~h~~yL~~FKTk~C~~ 170 (711)
+..+.+.+...|.+.|++
T Consensus 65 ~D~~~f~l~~~~~~~~~r 82 (97)
T PRK13899 65 KEPLFIELFLNKMQKCSK 82 (97)
T ss_pred cCHHHHHHHHHHHhhcCC
Confidence 344445555557777753
No 71
>PHA03093 EEV glycoprotein; Provisional
Probab=20.98 E-value=64 Score=33.31 Aligned_cols=11 Identities=18% Similarity=0.395 Sum_probs=9.2
Q ss_pred CCCCCCCCCcc
Q psy3120 224 PDGDECPFLHR 234 (711)
Q Consensus 224 p~GD~C~FAHs 234 (711)
++|..|--.|.
T Consensus 105 ~~~~~C~~~~~ 115 (185)
T PHA03093 105 VYDGSCYIFHS 115 (185)
T ss_pred ecCCEeEEecC
Confidence 77889988885
No 72
>PF05399 EVI2A: Ectropic viral integration site 2A protein (EVI2A); InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=20.82 E-value=80 Score=33.42 Aligned_cols=7 Identities=29% Similarity=0.496 Sum_probs=5.1
Q ss_pred ccccccc
Q psy3120 65 MVCEFVR 71 (711)
Q Consensus 65 ~~C~e~~ 71 (711)
-+|+|+.
T Consensus 119 ~~CEen~ 125 (227)
T PF05399_consen 119 EICEENN 125 (227)
T ss_pred hhhhcCc
Confidence 4799874
No 73
>PRK01100 putative accessory gene regulator protein; Provisional
Probab=20.75 E-value=4.8e+02 Score=26.94 Aligned_cols=9 Identities=22% Similarity=0.032 Sum_probs=4.7
Q ss_pred HHHHHhhhh
Q psy3120 52 VDKILYLQT 60 (711)
Q Consensus 52 VdkViy~~~ 60 (711)
++++.|-.+
T Consensus 33 ~~~i~YGl~ 41 (210)
T PRK01100 33 YLKVKYGLE 41 (210)
T ss_pred HHHHHHHHH
Confidence 335666554
No 74
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=20.30 E-value=35 Score=29.16 Aligned_cols=8 Identities=38% Similarity=0.962 Sum_probs=3.6
Q ss_pred cccccccc
Q psy3120 61 ATKCMVCE 68 (711)
Q Consensus 61 a~kC~~C~ 68 (711)
.+.|.+|+
T Consensus 3 HkHC~~CG 10 (59)
T PF09889_consen 3 HKHCPVCG 10 (59)
T ss_pred CCcCCcCC
Confidence 34444444
No 75
>COG5353 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.08 E-value=66 Score=32.45 Aligned_cols=9 Identities=0% Similarity=0.036 Sum_probs=4.5
Q ss_pred hhcccccch
Q psy3120 148 IIDLIFDFT 156 (711)
Q Consensus 148 ~~~~~k~~h 156 (711)
.....+|-|
T Consensus 27 ~~~s~~P~~ 35 (161)
T COG5353 27 FWKSMKPYH 35 (161)
T ss_pred HhHhcCccc
Confidence 344555555
Done!