Query psy647
Match_columns 190
No_of_seqs 105 out of 219
Neff 5.8
Searched_HMMs 46136
Date Fri Aug 16 18:55:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy647.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/647hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4071|consensus 100.0 4.7E-63 1E-67 402.8 13.8 177 1-181 3-179 (187)
2 COG5093 Uncharacterized conser 100.0 1.8E-47 3.9E-52 304.8 13.2 170 1-175 11-182 (185)
3 PTZ00362 hypothetical protein; 100.0 6.1E-43 1.3E-47 317.1 15.0 161 10-176 282-459 (479)
4 PF05916 Sld5: GINS complex pr 99.8 2.4E-20 5.3E-25 137.3 9.5 104 43-154 1-108 (108)
5 KOG1106|consensus 99.0 1E-09 2.2E-14 89.9 7.6 143 8-160 20-163 (177)
6 COG5138 Uncharacterized conser 98.3 1.8E-06 3.9E-11 69.1 5.9 115 7-131 8-124 (168)
7 COG1711 DNA replication initia 88.0 3.1 6.7E-05 35.7 7.8 88 67-158 1-102 (223)
8 COG3924 Predicted membrane pro 60.6 7.2 0.00016 28.1 2.1 20 81-102 35-54 (80)
9 PF10792 DUF2605: Protein of u 49.1 58 0.0012 24.6 5.4 41 90-130 19-59 (98)
10 PF03233 Cauli_AT: Aphid trans 43.6 1.5E+02 0.0032 24.4 7.4 77 59-136 51-130 (163)
11 PF08542 Rep_fac_C: Replicatio 41.9 42 0.00091 23.4 3.6 33 107-139 1-33 (89)
12 COG3248 Tsx Nucleoside-binding 35.0 19 0.00041 31.6 1.0 39 15-54 101-165 (284)
13 PF01090 Ribosomal_S19e: Ribos 32.0 23 0.0005 28.3 1.0 40 49-100 15-58 (139)
14 PF15614 WHIM3: WSTF, HB1, Itc 29.9 1.1E+02 0.0024 20.0 3.8 29 109-137 5-36 (46)
15 PTZ00095 40S ribosomal protein 27.1 55 0.0012 27.1 2.4 63 26-100 11-83 (169)
16 PF14801 GCD14_N: tRNA methylt 24.6 27 0.00058 23.6 0.2 12 34-45 3-14 (54)
17 PF01788 PsbJ: PsbJ; InterPro 24.6 31 0.00067 21.9 0.4 7 44-50 7-13 (40)
18 PF11568 Med29: Mediator compl 24.6 2.4E+02 0.0052 22.8 5.7 33 106-138 2-34 (148)
19 PF12166 DUF3595: Protein of u 24.4 60 0.0013 29.6 2.5 24 106-129 371-394 (422)
20 PRK09333 30S ribosomal protein 23.8 75 0.0016 25.7 2.6 40 49-100 16-59 (150)
21 PF03711 OKR_DC_1_C: Orn/Lys/A 22.6 1.3E+02 0.0028 23.7 3.7 43 1-46 51-94 (136)
22 COG1480 Predicted membrane-ass 22.5 2E+02 0.0043 28.9 5.7 29 141-169 658-686 (700)
23 PF01531 Glyco_transf_11: Glyc 22.4 31 0.00068 30.1 0.2 26 48-73 260-285 (298)
24 PF08690 GET2: GET complex sub 21.2 19 0.00042 32.1 -1.4 12 121-132 11-22 (302)
25 smart00545 JmjN Small domain f 20.6 55 0.0012 20.8 1.0 15 52-66 25-39 (42)
No 1
>KOG4071|consensus
Probab=100.00 E-value=4.7e-63 Score=402.82 Aligned_cols=177 Identities=47% Similarity=0.884 Sum_probs=171.6
Q ss_pred CCHHHHHHhhcCceEEEEecccccccccccccccCCCCCCCccchhHHHHHHhcccCcceeeCCCCCCHHHHHHHHHHhh
Q psy647 1 MDPAEIEFIAEDQNVEIIPKFNHLNLIHLISGDVGPFRAGIPAKVPLWLAINLKQRQKCRLVLPTWMNLETLTEIKEEEK 80 (190)
Q Consensus 1 m~p~E~eFlaE~e~I~IvP~f~~~~~~~li~g~~gPf~pg~~~~VPLWLAi~Lkk~~~C~I~~P~Wl~~~~L~~~~~~E~ 80 (190)
|+|+|+||+|++++|+|+|++++ ++++||+|++|||.||.|+.||||+|+.||||++|+|+||.||+++.|+++.+.|+
T Consensus 3 ~~peeiefia~~eli~I~P~~S~-~~l~li~~d~gp~~p~~~~rVpLWlAllLKkq~kc~Iv~P~wm~~~~L~~i~d~E~ 81 (187)
T KOG4071|consen 3 MSPEEIEFIAENELIEIIPNISM-DQLHLISGDIGPFNPGVPCRVPLWLALLLKKQNKCRIVPPEWMDVDELEKIRDVEK 81 (187)
T ss_pred CCHHHHHHHhcccceeeccCCcc-hhhhhhcCCCcccCCCCcceehHHHHHHHhhccCceEeCcccccHHHHHHHHHHhh
Confidence 79999999999999999999999 99999999999999999999999999999999999999999999999999999999
Q ss_pred ccCCCCCCCCccHHHHHHHHHhcCCCCCCChHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCceeeeCCCCHHHHHhhhHh
Q psy647 81 KSRFFIKMPSDHYMEMSHIILDIGADDIPNVDLIRTLIKDIWDLRISKLRSSIDTFVKSGGGHATLNHLTQFEINSIRNI 160 (190)
Q Consensus 81 ~~~~f~~LP~~~y~Eia~~Ll~~a~Ddi~~~~~ir~li~Di~~~R~~Klr~~~~~~~~~~~~~~~l~nLt~~Ein~iR~f 160 (190)
.+++|++|| +||+|+|++|+++|.||||+.++||++|+||||+|.+|+|+|++.|..+ .|+++||||+||+|++|||
T Consensus 82 e~~tfs~lp-~~~~ela~lL~n~a~Ddi~~~eeiR~lV~DI~d~R~aKlr~s~~~f~~~--~hakldNltLmEiN~~rpf 158 (187)
T KOG4071|consen 82 ESETFSKLP-FYYFELATLLLNFARDDIPDIEEIRTLVKDIRDIRLAKLRTSLDSFNRQ--AHAKLDNLTLMEINEIRPF 158 (187)
T ss_pred cccccccCC-hHHHHHHHHHHHhccccchhHHHHHHHHHHHHHHHHHHHHHhHHHHhhH--hhcCcccceeeeeccchHH
Confidence 988999999 7999999999999999999999999999999999999999999998543 7999999999999999999
Q ss_pred HHHHHHHHHhhhcccCccccc
Q psy647 161 LCDVLDTMSSLKDKQCDVDVT 181 (190)
Q Consensus 161 ~~~~~~~~~~l~~~~~~~~~~ 181 (190)
++++||++++++++..+.+.+
T Consensus 159 l~~~l~~~~ki~~~s~~~~~~ 179 (187)
T KOG4071|consen 159 LTEALDHGRKIRQTSKDQQEE 179 (187)
T ss_pred HHHHHHHHHHHhhhhhhcccc
Confidence 999999999999888877766
No 2
>COG5093 Uncharacterized conserved protein [Function unknown]
Probab=100.00 E-value=1.8e-47 Score=304.84 Aligned_cols=170 Identities=29% Similarity=0.532 Sum_probs=162.4
Q ss_pred CCHHHHHHhhcCceEEEEecccccccccccc-cccCCCCCCCccchhHHHHHHhcccCcceeeCCCCCCHHHHHHHHHHh
Q psy647 1 MDPAEIEFIAEDQNVEIIPKFNHLNLIHLIS-GDVGPFRAGIPAKVPLWLAINLKQRQKCRLVLPTWMNLETLTEIKEEE 79 (190)
Q Consensus 1 m~p~E~eFlaE~e~I~IvP~f~~~~~~~li~-g~~gPf~pg~~~~VPLWLAi~Lkk~~~C~I~~P~Wl~~~~L~~~~~~E 79 (190)
.+|+|++|+|++|+|.|+|+.++ +.+.|++ .+|+|+.|...+.||||+|+.|||+++|+|++|.||+.+.|+..+..|
T Consensus 11 fSPEEi~fia~nE~i~I~P~~t~-~qlrL~~~~t~p~m~p~~~~rvpLw~ALlLKkqn~c~Iv~P~wl~~~~L~~~i~~e 89 (185)
T COG5093 11 FSPEEILFIAYNELIEIEPMTTI-PQLRLLERATYPPMMPLDIARVPLWAALLLKKQNMCKIVLPSWLQLESLKMSIDVE 89 (185)
T ss_pred CCHHHHHHHhcccceeeeccccc-chhhhhhhhcCCCCCCchhhhhHHHHHHHHHhcccccccChhHhhHHHHHhhhhHH
Confidence 47999999999999999999999 9999999 599999999999999999999999999999999999999999988887
Q ss_pred hc-cCCCCCCCCccHHHHHHHHHhcCCCCCCChHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCceeeeCCCCHHHHHhhh
Q psy647 80 KK-SRFFIKMPSDHYMEMSHIILDIGADDIPNVDLIRTLIKDIWDLRISKLRSSIDTFVKSGGGHATLNHLTQFEINSIR 158 (190)
Q Consensus 80 ~~-~~~f~~LP~~~y~Eia~~Ll~~a~Ddi~~~~~ir~li~Di~~~R~~Klr~~~~~~~~~~~~~~~l~nLt~~Ein~iR 158 (190)
+. .+.|+.|| |||++.|+.||+.|.||+.+.+++|..|+||+++|++|.++|++.+ ++.++++||||.+|+|++|
T Consensus 90 ~~~~d~fSelp-~~wl~lA~~Lln~~~ddvediee~r~iv~dirEiRq~K~lkGlk~l---ne~~L~ldNl~l~EiNEir 165 (185)
T COG5093 90 IEKADEFSELP-PYWLPLATELLNENCDDVEDIEESRMIVEDIREIRQAKTLKGLKCL---NEKALNLDNLTLFEINEIR 165 (185)
T ss_pred HhcccccccCC-chHHHHHHHHHHhccchHhHHHHHHHHHHHHHHHHHHHHHhhHhhh---hhhhcCccccchhhhhhhH
Confidence 76 79999999 8999999999999999999999999999999999999999999764 8899999999999999999
Q ss_pred HhHHHHHHHHHhhhccc
Q psy647 159 NILCDVLDTMSSLKDKQ 175 (190)
Q Consensus 159 ~f~~~~~~~~~~l~~~~ 175 (190)
|++.++||.+|+|.+..
T Consensus 166 p~i~e~md~~R~I~~~s 182 (185)
T COG5093 166 PLILESMDVGRRIEDLS 182 (185)
T ss_pred HHHHHHHHHHHHHHhhh
Confidence 99999999999997654
No 3
>PTZ00362 hypothetical protein; Provisional
Probab=100.00 E-value=6.1e-43 Score=317.12 Aligned_cols=161 Identities=19% Similarity=0.323 Sum_probs=141.5
Q ss_pred hcCceEEEEeccccccc--cccccc-ccCCCCCCCccchhHHHHHHhcccCcceeeCCCCCCHHHHHHHHHHh-hccCCC
Q psy647 10 AEDQNVEIIPKFNHLNL--IHLISG-DVGPFRAGIPAKVPLWLAINLKQRQKCRLVLPTWMNLETLTEIKEEE-KKSRFF 85 (190)
Q Consensus 10 aE~e~I~IvP~f~~~~~--~~li~g-~~gPf~pg~~~~VPLWLAi~Lkk~~~C~I~~P~Wl~~~~L~~~~~~E-~~~~~f 85 (190)
|-+|+|-|-..+.. +. +.+|+| ++|||+||.|++||||||++||++++|+|+||.||+++.|++++++| ++.+.|
T Consensus 282 al~eLvVvKalvDi-p~~~L~lI~G~digpfkag~~~~VPLWLAi~LKq~~KC~IvpP~Wm~ie~Lk~I~eeE~~~~~~F 360 (479)
T PTZ00362 282 ALDELVVVKALVDI-PYIDLSEIEGFDFKEMKSGERQWLPIYIAKELSHFGLVTVEFPFWFYIKNLKNIYEREFEDQNEL 360 (479)
T ss_pred HhcceEEEEEecCC-CCCcceeeeccccCCcCCCCeeEEeeeehhhhcccCceEEeCcccccHHHHHHHHHHHhhCcccc
Confidence 55788887665555 32 478999 99999999999999999999999999999999999999999999999 678999
Q ss_pred CCCCCccHHHHHHHHHhc-------CCCCCCC------hHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCceeeeCCCCHH
Q psy647 86 IKMPSDHYMEMSHIILDI-------GADDIPN------VDLIRTLIKDIWDLRISKLRSSIDTFVKSGGGHATLNHLTQF 152 (190)
Q Consensus 86 ~~LP~~~y~Eia~~Ll~~-------a~Ddi~~------~~~ir~li~Di~~~R~~Klr~~~~~~~~~~~~~~~l~nLt~~ 152 (190)
++||++||+|||++|+++ |+||+++ +++||++|+|||++|.+|+|+|++++ ...+.|++++|||.|
T Consensus 361 t~LP~p~y~EIa~lLl~~~~F~~~~a~edi~~~d~~~~~~kVR~LIeDIrd~R~~KIr~~Le~~-~~~ss~i~LdNLslm 439 (479)
T PTZ00362 361 TDLPSPYFFEISSMFLENNIFKKVTPIETIGQRTVYKYISKVAGLVEDIRYKRLKKIMNHLENQ-DVHSSIIYIDNLQIS 439 (479)
T ss_pred ccCCchhHHHHHHHHHhcccccccCChhhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHhc-cccccEEEECCccHH
Confidence 999988999999999999 6777665 69999999999999999999999997 445689999999999
Q ss_pred HHHhhhHhHHHHHHHHHhhhcccC
Q psy647 153 EINSIRNILCDVLDTMSSLKDKQC 176 (190)
Q Consensus 153 Ein~iR~f~~~~~~~~~~l~~~~~ 176 (190)
|++.+|+ +|+.+.+...+.+
T Consensus 440 Ei~~Vn~----~Ls~f~k~d~~~~ 459 (479)
T PTZ00362 440 ETYCVNQ----LLSVFFQYDKNLN 459 (479)
T ss_pred HHHHHHH----HHHHHHHhccccc
Confidence 9999999 5666666644443
No 4
>PF05916 Sld5: GINS complex protein; InterPro: IPR021151 DNA replication in eukaryotes results from a highly coordinated interaction between proteins, often as part of protein complexes, and the DNA template. One of the key early steps leading to DNA replication is formation of the pre- replication complex, or pre-RC. The pre-RC is formed by the sequential binding of the origin recognition complex (ORC), Cdc6 and Cdt1 proteins, and the MCM complex. Activation of the pre-RC into the initiation complex (IC) is achieved via the action of S-phase kinases, eventually leading to the loading of the replication machinery. Recently, a novel replication complex, GINS (for Go, Ichi, Nii, and San; five, one, two, and three in Japanese), has been identified [, ]. The precise function of GINS is not known. However, genetic and two-hybrid interactions indicate that it mediates the loading of the enzymatic replication machinery at a step after the action of the S-phase kinases []. Furthermore, GINS may be a part of the replication machinery itself, since it is found associated with replicating DNA [, ]. Electron microscopy of GINS shows that it forms a ring-like structure [], reminiscent of the structure of PCNA [], the DNA polymerase delta replication clamp.This observation, coupled with the observed interactions for GINS, indicates that the complex may represent the replication clamp for DNA polymerase epsilon []. The GINS complex is essential for initiation of DNA replication in Xenopus egg extracts []. This 100 kDa stable complex includes Sld5, Psf1, Psf2, and Psf3. Homologues of these components are found also in other eukaryotes. ; PDB: 2EHO_L 2E9X_G 2Q9Q_H 3ANW_B.
Probab=99.83 E-value=2.4e-20 Score=137.29 Aligned_cols=104 Identities=27% Similarity=0.522 Sum_probs=90.4
Q ss_pred cchhHHHHHHhcccCcceeeCCCCCCHHHHHHHHHHhhcc---CCCCCCCCccHHHHHHHHHhcCCCCCCChHHHHHHHH
Q psy647 43 AKVPLWLAINLKQRQKCRLVLPTWMNLETLTEIKEEEKKS---RFFIKMPSDHYMEMSHIILDIGADDIPNVDLIRTLIK 119 (190)
Q Consensus 43 ~~VPLWLAi~Lkk~~~C~I~~P~Wl~~~~L~~~~~~E~~~---~~f~~LP~~~y~Eia~~Ll~~a~Ddi~~~~~ir~li~ 119 (190)
++||+|+|..|++++.|+|++|+|+++++++.+.++|++. ..|+++| +||+|.|.++++.+ +.++.++.
T Consensus 1 v~lP~w~a~~L~~~~~~~i~~p~~~~~~~~~~l~~e~~~~~~~~~l~~l~-~~f~~~~~~l~~~~-------~~ik~~l~ 72 (108)
T PF05916_consen 1 VELPLWLAKELKKRNLVEIQPPDWFSIEYLNALEAEEKNNSNKVDLSKLP-PYFYELGEMLLELL-------ERIKRLLR 72 (108)
T ss_dssp EEEEHHHHHHHHHTTSEEEE-TCCHCHHHHHHHHHHHCCCS-HCSCCTSS-TTHHHHHHHHHHHH-------HHHHHHHH
T ss_pred CcCCcchHHHHHHcCCeeecCcHhhhHHHHHHHHhhhccccchhhhhcCC-hHHHHHHHHHHHHH-------HHHHHHHH
Confidence 5899999999999999999999999999977777777663 5999999 89999999999988 68999999
Q ss_pred HHHHHHHHHHHhhHHHHH-hcCCceeeeCCCCHHHH
Q psy647 120 DIWDLRISKLRSSIDTFV-KSGGGHATLNHLTQFEI 154 (190)
Q Consensus 120 Di~~~R~~Klr~~~~~~~-~~~~~~~~l~nLt~~Ei 154 (190)
+++..|..||++++.... ..+.....++|||.+|.
T Consensus 73 ~~~~~R~~KI~~~~~~~~~~~~~~~~~~~~Ls~~Er 108 (108)
T PF05916_consen 73 DYLRIRLKKILSLAWHLLSGNDDPQEILENLSEEER 108 (108)
T ss_dssp HHHHHHHHHHHHHHCHHHHHCTTCHHHHTTS-HHHH
T ss_pred HHHHHHHHHHHHHHHHhccccCCcHHHHccCCHhhC
Confidence 999999999999998752 33455668999999995
No 5
>KOG1106|consensus
Probab=99.01 E-value=1e-09 Score=89.87 Aligned_cols=143 Identities=14% Similarity=0.223 Sum_probs=107.6
Q ss_pred HhhcCceEEEEecccccccccccccccCCCCCCCccchhHHHHHHhcccCcceeeCCCCCCHHHHHHHHHHhhccCCCCC
Q psy647 8 FIAEDQNVEIIPKFNHLNLIHLISGDVGPFRAGIPAKVPLWLAINLKQRQKCRLVLPTWMNLETLTEIKEEEKKSRFFIK 87 (190)
Q Consensus 8 FlaE~e~I~IvP~f~~~~~~~li~g~~gPf~pg~~~~VPLWLAi~Lkk~~~C~I~~P~Wl~~~~L~~~~~~E~~~~~f~~ 87 (190)
++|..+.|.+....++ .++.+++..--+..+|.+|++|+|||..|+..+-.+|.+|+.++...+..+.++ .+.-.+..
T Consensus 20 Ila~~~ki~ct~e~~i-~~lgfl~~~~~~~~~g~kVelP~wl~~~L~~~~~~si~lP~~fs~~~rn~i~Ad-p~~vdL~s 97 (177)
T KOG1106|consen 20 ILASGEKIPCTFETKI-PNLGFLQAPPRELPAGRKVELPYWLAEELAINQFVSILLPDIFSEKVRNVIKAD-PNKVDLRS 97 (177)
T ss_pred HHhcCCcccceeeecC-CCCCccCCCcccCCCCCcccchHHHHHHHhhccceeecCchHHHHHHHHHHhcC-cceeeccc
Confidence 6899999999999999 999999877777899999999999999999999999999999998877666655 33345556
Q ss_pred CCCccHHHHHHHHHhcCCCCCCChHHHHHHHHHHHHHHHHHHHhhHHHHHhcCC-ceeeeCCCCHHHHHhhhHh
Q psy647 88 MPSDHYMEMSHIILDIGADDIPNVDLIRTLIKDIWDLRISKLRSSIDTFVKSGG-GHATLNHLTQFEINSIRNI 160 (190)
Q Consensus 88 LP~~~y~Eia~~Ll~~a~Ddi~~~~~ir~li~Di~~~R~~Klr~~~~~~~~~~~-~~~~l~nLt~~Ein~iR~f 160 (190)
+- +|||+.|.+|+..-+++- +..++-...-.|..-+.....+ +.+. ..-.+-.|+.||.+.++.-
T Consensus 98 ~~-phFY~fg~kl~~l~s~~~-----l~~~~se~l~~R~~~~l~~~~s--~~~~~~s~~~~rLD~~E~~l~~~~ 163 (177)
T KOG1106|consen 98 LC-PHFYEFGMKLLPLDSGEN-----LGIILSETLRSRVREILDSSLS--ATNTQTSKFVERLDEMERKLFEAG 163 (177)
T ss_pred cc-cHHHHHHHHHhhcccCcc-----hhHHHHHHHHHHHHHHHHHHhh--hhhHHHHHHHHHHHHHHHHHHHHH
Confidence 65 899999999999888773 3333445555565555554433 1111 1125677888888776643
No 6
>COG5138 Uncharacterized conserved protein [Function unknown]
Probab=98.27 E-value=1.8e-06 Score=69.10 Aligned_cols=115 Identities=12% Similarity=0.147 Sum_probs=91.5
Q ss_pred HHhhcCceEEEEecccccccccccccccCCCCCCCccchhHHHHHHhcccCcceeeCCCCCCHHHHHHHHHHhhccCCCC
Q psy647 7 EFIAEDQNVEIIPKFNHLNLIHLISGDVGPFRAGIPAKVPLWLAINLKQRQKCRLVLPTWMNLETLTEIKEEEKKSRFFI 86 (190)
Q Consensus 7 eFlaE~e~I~IvP~f~~~~~~~li~g~~gPf~pg~~~~VPLWLAi~Lkk~~~C~I~~P~Wl~~~~L~~~~~~E~~~~~f~ 86 (190)
..|+++..|.+....+. +++++++..=.|.-.|.++++|+|||-.|-...-..+.+|..|+-.-+.++.++ +.=.
T Consensus 8 diL~~~s~ipctf~~~I-pglG~l~~~gr~~p~n~kvelP~wLa~~Lainsfv~~l~p~~fs~~v~naika~----P~tv 82 (168)
T COG5138 8 DILLGESRIPCTFKHKI-PGLGFLGPRGRILPENRKVELPYWLASFLAINSFVCELGPGFFSESVLNAIKAK----PSTV 82 (168)
T ss_pred HHhcCCCccceeeEeec-CCcccCCCCCcccCCCCeeechHHHHHHHHHhhhHHhcCcchHHHHHHHHHhcC----Ccee
Confidence 47899999999999999 999988866667888999999999999999988889999999999888888776 4434
Q ss_pred CCCC--ccHHHHHHHHHhcCCCCCCChHHHHHHHHHHHHHHHHHHHh
Q psy647 87 KMPS--DHYMEMSHIILDIGADDIPNVDLIRTLIKDIWDLRISKLRS 131 (190)
Q Consensus 87 ~LP~--~~y~Eia~~Ll~~a~Ddi~~~~~ir~li~Di~~~R~~Klr~ 131 (190)
+|.+ +|||-+|.+.+-.-+|+ ++-..+-...--|...+-+
T Consensus 83 dlrsI~~hff~lA~K~~mL~se~-----sLv~~~~e~l~~Ra~~~~~ 124 (168)
T COG5138 83 DLRSICPHFFYLAEKYAMLLSEG-----SLVAGLPEFLYERAGDYSS 124 (168)
T ss_pred ehhhcchHHHHHHHHHHHHhcCc-----hHhhhhHHHHHHHHHhhhc
Confidence 4433 79999999888777777 4444455555556555544
No 7
>COG1711 DNA replication initiation complex subunit, GINS family [Replication, recombination, and repair]
Probab=88.01 E-value=3.1 Score=35.71 Aligned_cols=88 Identities=22% Similarity=0.275 Sum_probs=61.0
Q ss_pred CCHHHHHHHHHHhhccCCCCCCCCccHHHHHHHHH---hcC-----CCCCCC------hHHHHHHHHHHHHHHHHHHHhh
Q psy647 67 MNLETLTEIKEEEKKSRFFIKMPSDHYMEMSHIIL---DIG-----ADDIPN------VDLIRTLIKDIWDLRISKLRSS 132 (190)
Q Consensus 67 l~~~~L~~~~~~E~~~~~f~~LP~~~y~Eia~~Ll---~~a-----~Ddi~~------~~~ir~li~Di~~~R~~Klr~~ 132 (190)
|+++.|+.++..|+.+.+++.++--+|.+++.-+= ..+ +.|+.. .+.+.+..+-|+..|+.|+-+.
T Consensus 1 m~i~eLR~~~~kErS~~~L~~Id~dFY~~v~~~I~eLe~~~~~~~~~~D~e~~~~~~~~et~~~~~r~ifqrR~~Kiv~~ 80 (223)
T COG1711 1 MDIEELRRALYKERSQDGLTEIDEDFYRDVRSFIKELEDEAGRAEEARDIEKYLLTDRIETAKSDARSIFQRRYGKIVSR 80 (223)
T ss_pred CCHHHHHHHHHHHhhhhhhhhcCHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67899999999999999999999446656555432 222 233321 2467778889999999999875
Q ss_pred HHHHHhcCCceeeeCCCCHHHHHhhh
Q psy647 133 IDTFVKSGGGHATLNHLTQFEINSIR 158 (190)
Q Consensus 133 ~~~~~~~~~~~~~l~nLt~~Ein~iR 158 (190)
..- ... .-.+.|||..|-..+-
T Consensus 81 A~~--~~~--~~~~~~Lt~eEk~ly~ 102 (223)
T COG1711 81 AIY--DVP--GETISNLTPEEKELYE 102 (223)
T ss_pred HHH--hcc--ccchhcCCHHHHHHHH
Confidence 432 222 2348999999965443
No 8
>COG3924 Predicted membrane protein [Function unknown]
Probab=60.65 E-value=7.2 Score=28.06 Aligned_cols=20 Identities=30% Similarity=0.539 Sum_probs=17.1
Q ss_pred ccCCCCCCCCccHHHHHHHHHh
Q psy647 81 KSRFFIKMPSDHYMEMSHIILD 102 (190)
Q Consensus 81 ~~~~f~~LP~~~y~Eia~~Ll~ 102 (190)
++++|+.+| +|||.|++++.
T Consensus 35 ~t~G~~gfP--~WFE~aCi~lP 54 (80)
T COG3924 35 NTPGFTGFP--LWFEMACILLP 54 (80)
T ss_pred CCCCCCCCh--HHHHHHHHHHH
Confidence 368999997 89999999875
No 9
>PF10792 DUF2605: Protein of unknown function (DUF2605); InterPro: IPR019728 This entry represents a protein conserved in Cyanobacteria. The function is not known.
Probab=49.10 E-value=58 Score=24.62 Aligned_cols=41 Identities=7% Similarity=0.022 Sum_probs=33.6
Q ss_pred CccHHHHHHHHHhcCCCCCCChHHHHHHHHHHHHHHHHHHH
Q psy647 90 SDHYMEMSHIILDIGADDIPNVDLIRTLIKDIWDLRISKLR 130 (190)
Q Consensus 90 ~~~y~Eia~~Ll~~a~Ddi~~~~~ir~li~Di~~~R~~Klr 130 (190)
+.|||+=|+-||+.+++++-++++=..|+.-+.+.+..-.-
T Consensus 19 F~yWF~Rs~~LLe~e~~~fls~~eQ~~Ll~Rv~qaq~ev~a 59 (98)
T PF10792_consen 19 FQYWFSRSRELLETEPISFLSPEEQSDLLERVKQAQQEVNA 59 (98)
T ss_pred HHHHHHHHHHHHhcCCcccCCHHHHHHHHHHHHHHHHHHHH
Confidence 46999999999999999998888888888777776655443
No 10
>PF03233 Cauli_AT: Aphid transmission protein; InterPro: IPR004917 This protein is found in various caulimoviruses. It codes for an 18 kDa protein (PII), which is dispensable for infection but which is required for aphid transmission of the virus []. This protein interacts with the PIII protein []. ; GO: 0019089 transmission of virus
Probab=43.59 E-value=1.5e+02 Score=24.38 Aligned_cols=77 Identities=21% Similarity=0.277 Sum_probs=56.6
Q ss_pred ceeeCCCCCCHHHHHHHHHHhhc-cCCCCCCCCc--cHHHHHHHHHhcCCCCCCChHHHHHHHHHHHHHHHHHHHhhHHH
Q psy647 59 CRLVLPTWMNLETLTEIKEEEKK-SRFFIKMPSD--HYMEMSHIILDIGADDIPNVDLIRTLIKDIWDLRISKLRSSIDT 135 (190)
Q Consensus 59 C~I~~P~Wl~~~~L~~~~~~E~~-~~~f~~LP~~--~y~Eia~~Ll~~a~Ddi~~~~~ir~li~Di~~~R~~Klr~~~~~ 135 (190)
=.|+.=-||.+.+|...+-.|+. +.+++.=|+| -||--+..++....+|.......-..++++-+ |..||......
T Consensus 51 N~i~~r~~l~~~kl~sylGleKD~Se~~S~~K~Pf~~~~k~~~~ifkegg~d~~k~~~~l~~L~e~sn-ki~kLe~~~k~ 129 (163)
T PF03233_consen 51 NEIVGRNWLKLSKLLSYLGLEKDPSEGLSKSKSPFESFFKDLSKIFKEGGGDKQKQLKLLPTLEEISN-KIRKLETEVKK 129 (163)
T ss_pred HHHHHHHHHHHHHHHHHhccccCCccccccCCCcHHHHHHHHHHHHHhcCCchhhHHHHHHHHHHHHH-HHHHHHHHHHh
Confidence 34666779999999999999988 6767655544 78888888888888887665333333555655 88888887766
Q ss_pred H
Q psy647 136 F 136 (190)
Q Consensus 136 ~ 136 (190)
+
T Consensus 130 L 130 (163)
T PF03233_consen 130 L 130 (163)
T ss_pred H
Confidence 4
No 11
>PF08542 Rep_fac_C: Replication factor C C-terminal domain; InterPro: IPR013748 Replication factor C (RFC) is a multimeric AAA+ protein complex that loads the DNA polymerase processivity clamp PCNA (Proliferating Cell Nuclear Antigen) onto DNA using ATP to drive the reaction []. PCNA functions at multiple levels in directing DNA metabolic pathways []. When bound to DNA, PCNA organises various proteins involved in DNA replication, DNA repair, DNA modification, and chromatin modelling. Replication factor C consists of five subunits in a spiral arrangement: Rfc1, Rfc2, Rfc3, Rfc4, and Rfc5 subunits. Rfc1 and Rfc2 load the PCNA sliding clamp onto DNA, while Rfc3 binds ATP and also acts as a checkpoint sensor. The RFC complex contains four ATP sites (sites A, B, C, and D) located at subunit interfaces. In each ATP site, an arginine residue from one subunit is located near the gamma-phosphate of ATP bound in the adjacent subunit. These arginine residues act as "arginine fingers" that can potentially perform two functions: sensing that ATP is bound and catalyzing ATP hydrolysis []. This entry represents the core domain found in Rfc1-5.; GO: 0003689 DNA clamp loader activity, 0005524 ATP binding, 0006260 DNA replication, 0005663 DNA replication factor C complex; PDB: 1SXJ_B 2CHG_B 2CHV_F 2CHQ_C 1IQP_A.
Probab=41.87 E-value=42 Score=23.41 Aligned_cols=33 Identities=12% Similarity=0.282 Sum_probs=27.3
Q ss_pred CCCChHHHHHHHHHHHHHHHHHHHhhHHHHHhc
Q psy647 107 DIPNVDLIRTLIKDIWDLRISKLRSSIDTFVKS 139 (190)
Q Consensus 107 di~~~~~ir~li~Di~~~R~~Klr~~~~~~~~~ 139 (190)
|.|.++.|+.+++.+......+++..+..++..
T Consensus 1 ~~p~~~~i~~i~~~~~~~~~~~~~~~~~~l~~~ 33 (89)
T PF08542_consen 1 DWPPPEVIEEILESCLNGDFKEARKKLYELLVE 33 (89)
T ss_dssp TS--HHHHHHHHHHHHHTCHHHHHHHHHHHHHT
T ss_pred CCCCHHHHHHHHHHHHhCCHHHHHHHHHHHHHc
Confidence 567889999999999999999999999888754
No 12
>COG3248 Tsx Nucleoside-binding outer membrane protein [Cell envelope biogenesis, outer membrane]
Probab=35.01 E-value=19 Score=31.55 Aligned_cols=39 Identities=26% Similarity=0.556 Sum_probs=29.1
Q ss_pred EEEEecccccccccccccccCCCC--------------------------CCCccchhHHHHHHhc
Q psy647 15 VEIIPKFNHLNLIHLISGDVGPFR--------------------------AGIPAKVPLWLAINLK 54 (190)
Q Consensus 15 I~IvP~f~~~~~~~li~g~~gPf~--------------------------pg~~~~VPLWLAi~Lk 54 (190)
-+|.|||+. +++.-.+-+||||+ +|.-+.+|+-.-+.|.
T Consensus 101 mei~PR~S~-dklTgaDlSFGPvkevy~A~~ye~d~gd~ka~~~s~~~mGlG~Di~~p~f~~~~LN 165 (284)
T COG3248 101 MEIEPRFSI-DKLTGADLSFGPVKEVYFANNYEYDMGDNKAGRQSTWYMGLGTDIDTPLFMSLSLN 165 (284)
T ss_pred EEeccceee-ccccccccccccceeEEEeceeEEcCCcccccceeeEEeccCcccccCCcceEEEE
Confidence 589999999 98766666788875 4666777777666554
No 13
>PF01090 Ribosomal_S19e: Ribosomal protein S19e; InterPro: IPR001266 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This family includes a number of eukaryotic and archaebacterial ribosomal proteins; mammalian S19, Drosophila S19, Ascaris lumbricoides S19g (ALEP-1) and S19s, yeast YS16 (RP55A and RP55B), Aspergillus S16 and Haloarcula marismortui HS12.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3IZ6_S 3U5G_T 3U5C_T 3O30_M 3O2Z_M 3IZB_S 2XZN_T 2XZM_T 2V7F_A.
Probab=31.95 E-value=23 Score=28.30 Aligned_cols=40 Identities=23% Similarity=0.406 Sum_probs=24.5
Q ss_pred HHHHhcccCcceeeCCCCCCHHHHHHHHHHhhccCCCCCCCC----ccHHHHHHHH
Q psy647 49 LAINLKQRQKCRLVLPTWMNLETLTEIKEEEKKSRFFIKMPS----DHYMEMSHII 100 (190)
Q Consensus 49 LAi~Lkk~~~C~I~~P~Wl~~~~L~~~~~~E~~~~~f~~LP~----~~y~Eia~~L 100 (190)
+|.+||+.+ .|.+|+|.++- ++..+-++|+ ++|.--|.++
T Consensus 15 ~A~~LK~~g--ki~~P~w~d~v----------KTg~~kE~~P~~~DW~Y~RaASil 58 (139)
T PF01090_consen 15 LAEFLKKSG--KIEPPEWADIV----------KTGSHKELAPYDPDWWYIRAASIL 58 (139)
T ss_dssp HHHHHTCSS--TS--TSCGGGT----------SSSSS-SSTTCHHTHHHHHHHHHH
T ss_pred HHHHHHHcC--CcCCcchHHHH----------hhcccccCCCCCCCeeeeeHHHHH
Confidence 688898885 68899998753 2356667764 4555555544
No 14
>PF15614 WHIM3: WSTF, HB1, Itc1p, MBD9 motif 3
Probab=29.88 E-value=1.1e+02 Score=19.95 Aligned_cols=29 Identities=24% Similarity=0.322 Sum_probs=22.4
Q ss_pred CChHHHHHHHH---HHHHHHHHHHHhhHHHHH
Q psy647 109 PNVDLIRTLIK---DIWDLRISKLRSSIDTFV 137 (190)
Q Consensus 109 ~~~~~ir~li~---Di~~~R~~Klr~~~~~~~ 137 (190)
.+++++..|++ |-+.+|..||++.+.+..
T Consensus 5 ~~~e~ld~L~~aL~~prG~RE~~L~~~L~~~~ 36 (46)
T PF15614_consen 5 DDPEELDELLKALENPRGKRESKLKKELDKHR 36 (46)
T ss_pred cCHHHHHHHHHHHcCcccHhHHHHHHHHHHHh
Confidence 34567777776 559999999999887754
No 15
>PTZ00095 40S ribosomal protein S19; Provisional
Probab=27.07 E-value=55 Score=27.06 Aligned_cols=63 Identities=16% Similarity=0.237 Sum_probs=40.6
Q ss_pred cccccccccCCCCCCCc-cchhHH-----HHHHhcccCcceeeCCCCCCHHHHHHHHHHhhccCCCCCCCC----ccHHH
Q psy647 26 LIHLISGDVGPFRAGIP-AKVPLW-----LAINLKQRQKCRLVLPTWMNLETLTEIKEEEKKSRFFIKMPS----DHYME 95 (190)
Q Consensus 26 ~~~li~g~~gPf~pg~~-~~VPLW-----LAi~Lkk~~~C~I~~P~Wl~~~~L~~~~~~E~~~~~f~~LP~----~~y~E 95 (190)
++.-|++.+|-+++|.- -+||-+ +|-+||+.+ .|.+|+|.++-.. ..+-++|+ ++|.-
T Consensus 11 ~~~~~~~~~~~~~~~~tvkDVpa~~fI~~~A~~LK~~g--ki~~P~W~d~vKT----------g~~KElaP~d~DW~YiR 78 (169)
T PTZ00095 11 KIKRIGKPVPYRRKNATLKDVSPWRFIKAFAQHFKLEG--KIFVPKCTEIVKT----------SHGRELAPQNPDWYYIR 78 (169)
T ss_pred HHHHhcCCCCcccCCCceeeCCHHHHHHHHHHHHHHcC--CCCCCcchhhhcc----------cccccCCCCCccHHHHH
Confidence 45567777777777765 468877 567788554 6899999875432 44445543 46665
Q ss_pred HHHHH
Q psy647 96 MSHII 100 (190)
Q Consensus 96 ia~~L 100 (190)
.|.++
T Consensus 79 aASil 83 (169)
T PTZ00095 79 CAAVL 83 (169)
T ss_pred HHHHH
Confidence 55544
No 16
>PF14801 GCD14_N: tRNA methyltransferase complex GCD14 subunit N-term; PDB: 1I9G_A.
Probab=24.65 E-value=27 Score=23.62 Aligned_cols=12 Identities=50% Similarity=0.877 Sum_probs=5.7
Q ss_pred cCCCCCCCccch
Q psy647 34 VGPFRAGIPAKV 45 (190)
Q Consensus 34 ~gPf~pg~~~~V 45 (190)
-|||++|.+|.+
T Consensus 3 ~Gpf~~GdrVQl 14 (54)
T PF14801_consen 3 RGPFRAGDRVQL 14 (54)
T ss_dssp --S--TT-EEEE
T ss_pred cCCCCCCCEEEE
Confidence 489999998864
No 17
>PF01788 PsbJ: PsbJ; InterPro: IPR002682 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbJ found in PSII. PsbJ is one of the most hydrophobic proteins in the thylakoid membrane, and is located in a gene cluster with PsbE, PsbF and PsbL (PsbEFJL). Both PsbJ and PsbL (IPR003372 from INTERPRO) are essential for proper assembly of the OEC. Mutations in PsbJ cause the light-harvesting antenna to remain detached from the PSII dimers []. In addition, both PsbJ and PsbL are involved in the unidirectional flow of electrons, where PsbJ regulates the forward electron flow from D2 (Qa) to the plastoquinone pool, and PsbL prevents the reduction of PSII by back electron flow from plastoquinol protecting PSII from photo-inactivation [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_J 3ARC_J 3A0B_J 3KZI_J 2AXT_J 3PRQ_J 4FBY_b 3BZ2_J 1S5L_j 3PRR_J ....
Probab=24.61 E-value=31 Score=21.92 Aligned_cols=7 Identities=57% Similarity=1.697 Sum_probs=4.8
Q ss_pred chhHHHH
Q psy647 44 KVPLWLA 50 (190)
Q Consensus 44 ~VPLWLA 50 (190)
++||||.
T Consensus 7 RIPLWlV 13 (40)
T PF01788_consen 7 RIPLWLV 13 (40)
T ss_dssp SS-HHHH
T ss_pred cccchHH
Confidence 6899984
No 18
>PF11568 Med29: Mediator complex subunit 29; InterPro: IPR021018 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med29, along with Med11 and Med28, in mammals, is part of the core head-region of the complex. Med29 is the apparent orthologue of the Drosophila melanogaster Intersex protein, which interacts directly with, and functions as a transcriptional coactivator for, the DNA-binding transcription factor Doublesex, so it is likely that mammalian Med29 serves as a target for one or more DNA-binding transcriptional activators []. ; GO: 0016592 mediator complex
Probab=24.57 E-value=2.4e+02 Score=22.78 Aligned_cols=33 Identities=3% Similarity=0.262 Sum_probs=27.3
Q ss_pred CCCCChHHHHHHHHHHHHHHHHHHHhhHHHHHh
Q psy647 106 DDIPNVDLIRTLIKDIWDLRISKLRSSIDTFVK 138 (190)
Q Consensus 106 Ddi~~~~~ir~li~Di~~~R~~Klr~~~~~~~~ 138 (190)
+++-+.+.+|.||.+|++-=..=++.+...|..
T Consensus 2 e~~D~I~kvK~Lv~~LreSl~~~~k~AA~~l~q 34 (148)
T PF11568_consen 2 EKLDPISKVKSLVGPLRESLSNLMKTAAQNLQQ 34 (148)
T ss_pred CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345567899999999999999999998888654
No 19
>PF12166 DUF3595: Protein of unknown function (DUF3595); InterPro: IPR021999 This family of proteins is functionally uncharacterised.This family of proteins is found in eukaryotes. Proteins in this family are typically between 578 and 2525 amino acids in length.
Probab=24.38 E-value=60 Score=29.62 Aligned_cols=24 Identities=29% Similarity=0.419 Sum_probs=21.4
Q ss_pred CCCCChHHHHHHHHHHHHHHHHHH
Q psy647 106 DDIPNVDLIRTLIKDIWDLRISKL 129 (190)
Q Consensus 106 Ddi~~~~~ir~li~Di~~~R~~Kl 129 (190)
+|+|++|.+-.|.+||+=+|..+-
T Consensus 371 eelp~~d~ll~lc~dI~~aRe~g~ 394 (422)
T PF12166_consen 371 EELPNPDRLLKLCQDIYLAREEGD 394 (422)
T ss_pred cCCCChHHHHHHHHHHHHHHhcCc
Confidence 689999999999999999996543
No 20
>PRK09333 30S ribosomal protein S19e; Provisional
Probab=23.78 E-value=75 Score=25.72 Aligned_cols=40 Identities=23% Similarity=0.384 Sum_probs=24.3
Q ss_pred HHHHhcccCcceeeCCCCCCHHHHHHHHHHhhccCCCCCCCC----ccHHHHHHHH
Q psy647 49 LAINLKQRQKCRLVLPTWMNLETLTEIKEEEKKSRFFIKMPS----DHYMEMSHII 100 (190)
Q Consensus 49 LAi~Lkk~~~C~I~~P~Wl~~~~L~~~~~~E~~~~~f~~LP~----~~y~Eia~~L 100 (190)
+|..||+.+ .|.+|+|.+.-. +..+-++|+ ++|.--|.+|
T Consensus 16 ~A~~LK~~~--ki~~P~W~~~vK----------Tg~~kE~~P~~~DW~Y~R~AsIl 59 (150)
T PRK09333 16 LAEYLKNVD--EIKPPEWAAFVK----------TGVHKERPPEQEDWWYVRAASIL 59 (150)
T ss_pred HHHHHHhcC--CcCCCcchhhhc----------ccccccCCCCCCchHHHHHHHHH
Confidence 577788544 689999986532 245555553 4555555443
No 21
>PF03711 OKR_DC_1_C: Orn/Lys/Arg decarboxylase, C-terminal domain; InterPro: IPR008286 Pyridoxal-dependent decarboxylases are bacterial proteins acting on ornithine, lysine, arginine and related substrates []. One of the regions of sequence similarity contains a conserved lysine residue, which is the site of attachment of the pyridoxal-phosphate group.; GO: 0003824 catalytic activity; PDB: 1C4K_A 1ORD_A 3Q16_C 3N75_A 2X3L_A 2VYC_D.
Probab=22.59 E-value=1.3e+02 Score=23.69 Aligned_cols=43 Identities=33% Similarity=0.384 Sum_probs=23.8
Q ss_pred CCHHHHHHhhcCceEEEEeccccccccccccc-ccCCCCCCCccchh
Q psy647 1 MDPAEIEFIAEDQNVEIIPKFNHLNLIHLISG-DVGPFRAGIPAKVP 46 (190)
Q Consensus 1 m~p~E~eFlaE~e~I~IvP~f~~~~~~~li~g-~~gPf~pg~~~~VP 46 (190)
|+|.+...-+-....+-||--.. .+-|++ .+-|+-||.|+-+|
T Consensus 51 ~~P~~a~~~~~~~~~e~v~l~~~---~GrIsae~i~pYPPGIPll~p 94 (136)
T PF03711_consen 51 MTPREAYREAVRGEKERVPLEEA---VGRISAEFIVPYPPGIPLLVP 94 (136)
T ss_dssp S-HHHHHHHHHTT-EEEEEGGG----TTSBBSS-BEECTTTS-SB-T
T ss_pred eCcHHHHHHHHhhcceEEEcccc---CCeEEEeeeeecCCCCcEECC
Confidence 57777665444444555665544 233554 68999999999887
No 22
>COG1480 Predicted membrane-associated HD superfamily hydrolase [General function prediction only]
Probab=22.54 E-value=2e+02 Score=28.88 Aligned_cols=29 Identities=21% Similarity=0.267 Sum_probs=24.3
Q ss_pred CceeeeCCCCHHHHHhhhHhHHHHHHHHH
Q psy647 141 GGHATLNHLTQFEINSIRNILCDVLDTMS 169 (190)
Q Consensus 141 ~~~~~l~nLt~~Ein~iR~f~~~~~~~~~ 169 (190)
++-+.=.+||..|+..|+..|+..+.+.+
T Consensus 658 DgQl~~s~lT~~El~~I~~~fv~tl~g~~ 686 (700)
T COG1480 658 DGQLSESGLTLKELEIIADSFVQTLQGIY 686 (700)
T ss_pred cCCccccCCcHHHHHHHHHHHHHHHHHHH
Confidence 34456688999999999999999999764
No 23
>PF01531 Glyco_transf_11: Glycosyl transferase family 11; InterPro: IPR002516 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 11 GT11 from CAZY comprises enzymes with only one known activity; galactoside 2-L-fucosyltransferase (2.4.1.69 from EC). Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Galactoside 2-L-fucosyltransferase 1 (2.4.1.69 from EC) and Galactoside 2-L-fucosyltransferase 2 (2.4.1.69 from EC) belong to the Hh blood group system and are associated with H/h and Se/se antigens.; GO: 0008107 galactoside 2-alpha-L-fucosyltransferase activity, 0005975 carbohydrate metabolic process, 0016020 membrane
Probab=22.36 E-value=31 Score=30.06 Aligned_cols=26 Identities=27% Similarity=0.496 Sum_probs=22.2
Q ss_pred HHHHHhcccCcceeeCCCCCCHHHHH
Q psy647 48 WLAINLKQRQKCRLVLPTWMNLETLT 73 (190)
Q Consensus 48 WLAi~Lkk~~~C~I~~P~Wl~~~~L~ 73 (190)
|+|-.|.+.++-.|.||.|.+-+..+
T Consensus 260 wW~a~L~~~~~i~i~p~~w~~~~~~~ 285 (298)
T PF01531_consen 260 WWAAYLSKNDKIVIAPIKWFNDEFMK 285 (298)
T ss_pred HHHHHHCCCCCEEEECCcccCccccc
Confidence 88888988888889999999866655
No 24
>PF08690 GET2: GET complex subunit GET2; InterPro: IPR014802 This family corresponds to the GET complex subunit GET2. The GET complex is involved in the retrieval of ER resident proteins from the Golgi []. ; PDB: 3SJD_D 3ZS9_C.
Probab=21.22 E-value=19 Score=32.12 Aligned_cols=12 Identities=17% Similarity=0.274 Sum_probs=10.2
Q ss_pred HHHHHHHHHHhh
Q psy647 121 IWDLRISKLRSS 132 (190)
Q Consensus 121 i~~~R~~Klr~~ 132 (190)
++|+|++||++|
T Consensus 11 rRERReAKi~~G 22 (302)
T PF08690_consen 11 RRERREAKIKAG 22 (302)
T ss_dssp HHHHHHHHHHCC
T ss_pred HHHHHHHHHHCC
Confidence 579999999875
No 25
>smart00545 JmjN Small domain found in the jumonji family of transcription factors. To date, this domain always co-occurs with the JmjC domain (although the reverse is not true).
Probab=20.57 E-value=55 Score=20.76 Aligned_cols=15 Identities=20% Similarity=0.419 Sum_probs=11.7
Q ss_pred HhcccCcceeeCCCC
Q psy647 52 NLKQRQKCRLVLPTW 66 (190)
Q Consensus 52 ~Lkk~~~C~I~~P~W 66 (190)
.-++-+.|+|+||+=
T Consensus 25 ~~~~yGi~KIvPP~~ 39 (42)
T smart00545 25 QAEKYGICKVVPPKS 39 (42)
T ss_pred HHhhCCEEEEECCCC
Confidence 356778999999973
Done!