Query psy8451
Match_columns 110
No_of_seqs 142 out of 861
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 19:12:41 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8451.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8451hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3780|consensus 100.0 4.3E-28 9.4E-33 181.5 11.6 109 2-110 101-209 (427)
2 PF00339 Arrestin_N: Arrestin 99.8 1.9E-21 4.1E-26 126.5 4.0 61 2-62 89-149 (149)
3 KOG3865|consensus 99.4 1E-12 2.2E-17 94.7 6.8 100 5-110 106-216 (402)
4 PF13002 LDB19: Arrestin_N ter 99.2 9.8E-11 2.1E-15 79.5 8.6 58 2-59 46-115 (191)
5 PF03643 Vps26: Vacuolar prote 98.6 9.7E-07 2.1E-11 63.6 11.4 92 3-107 93-184 (275)
6 PF08737 Rgp1: Rgp1; InterPro 98.0 3.2E-05 6.9E-10 58.7 7.5 57 1-57 113-173 (415)
7 KOG2717|consensus 97.0 0.00094 2E-08 47.2 3.8 61 3-63 84-151 (313)
8 PF07070 Spo0M: SpoOM protein; 97.0 0.0027 6E-08 44.4 6.1 52 2-57 81-133 (218)
9 PF02752 Arrestin_C: Arrestin 96.2 0.025 5.4E-07 35.4 6.1 53 5-57 78-133 (136)
10 KOG3063|consensus 95.8 0.041 8.8E-07 39.2 6.0 64 8-75 104-167 (301)
11 COG4326 Spo0M Sporulation cont 92.0 0.39 8.5E-06 33.6 4.6 51 2-56 101-152 (270)
12 cd00917 PG-PI_TP The phosphati 87.7 2.2 4.8E-05 26.9 5.3 32 3-42 79-110 (122)
13 PF04425 Bul1_N: Bul1 N termin 84.6 3.4 7.4E-05 32.0 5.7 17 2-18 255-272 (438)
14 PF02221 E1_DerP2_DerF2: ML do 83.6 5.7 0.00012 24.8 5.7 32 3-42 88-120 (134)
15 cd00258 GM2-AP GM2 activator p 78.8 10 0.00022 25.5 5.6 37 3-43 109-147 (162)
16 cd08544 Reeler Reeler, the N-t 78.0 2.6 5.6E-05 26.9 2.7 22 89-110 19-40 (135)
17 KOG4469|consensus 77.2 8.9 0.00019 27.4 5.3 44 2-45 104-148 (391)
18 smart00737 ML Domain involved 75.4 12 0.00027 22.9 5.2 31 3-41 74-105 (118)
19 PF08694 UFC1: Ubiquitin-fold 68.2 6.3 0.00014 26.0 2.7 19 6-24 73-91 (161)
20 PF02014 Reeler: Reeler domain 65.6 5.7 0.00012 25.2 2.1 21 90-110 20-40 (132)
21 cd02859 AMPKbeta_GBD_like AMP- 63.6 3.9 8.5E-05 23.7 1.0 13 2-14 42-54 (79)
22 PF07703 A2M_N_2: Alpha-2-macr 62.3 7.5 0.00016 24.3 2.2 23 88-110 94-116 (136)
23 PF07610 DUF1573: Protein of u 59.1 12 0.00026 19.2 2.3 21 90-110 23-43 (45)
24 PF08737 Rgp1: Rgp1; InterPro 55.2 33 0.00071 26.4 4.9 32 9-40 381-412 (415)
25 cd00912 ML The ML (MD-2-relate 53.0 17 0.00037 22.7 2.7 31 3-41 82-114 (127)
26 PF08845 SymE_toxin: Toxin Sym 48.6 6.5 0.00014 21.7 0.2 17 94-110 32-48 (57)
27 PF04729 ASF1_hist_chap: ASF1 48.0 24 0.00053 23.5 2.9 39 3-41 65-119 (154)
28 PF03643 Vps26: Vacuolar prote 46.4 68 0.0015 23.3 5.2 32 9-40 237-271 (275)
29 cd02861 E_set_proteins_like E 45.8 15 0.00032 21.2 1.5 11 3-13 44-54 (82)
30 TIGR03000 plancto_dom_1 Planct 44.0 66 0.0014 18.8 4.0 26 30-56 41-66 (75)
31 PF01835 A2M_N: MG2 domain; I 43.7 67 0.0015 18.8 5.6 24 9-40 63-86 (99)
32 PF07495 Y_Y_Y: Y_Y_Y domain; 42.8 15 0.00033 19.8 1.2 11 3-13 36-46 (66)
33 KOG3063|consensus 41.8 94 0.002 22.5 5.1 52 9-60 244-298 (301)
34 PF06688 DUF1187: Protein of u 41.3 43 0.00093 18.7 2.7 14 33-46 1-14 (61)
35 PF09608 Alph_Pro_TM: Putative 41.0 1.1E+02 0.0024 21.7 5.5 42 8-57 155-196 (236)
36 PF08750 CNP1: CNP1-like famil 40.6 50 0.0011 21.6 3.4 15 26-40 47-61 (139)
37 COG3126 Uncharacterized protei 40.5 57 0.0012 21.8 3.7 35 4-44 87-121 (158)
38 KOG3780|consensus 40.2 1.2E+02 0.0027 22.7 6.1 50 6-56 263-316 (427)
39 PF08875 DUF1833: Domain of un 39.2 64 0.0014 21.0 3.9 10 9-18 53-62 (150)
40 PF14347 DUF4399: Domain of un 37.8 25 0.00054 21.0 1.6 14 1-14 57-70 (87)
41 PF08111 Pea-VEAacid: Pea-VEAa 34.7 15 0.00033 14.3 0.2 8 2-9 2-9 (15)
42 PF10342 GPI-anchored: Ser-Thr 34.0 95 0.0021 17.7 3.8 20 12-38 58-77 (93)
43 PF05688 DUF824: Salmonella re 33.8 24 0.00052 18.7 0.9 11 100-110 9-19 (47)
44 KOG3262|consensus 32.4 96 0.0021 21.5 3.9 22 87-108 102-127 (215)
45 PRK09750 hypothetical protein; 31.4 59 0.0013 18.2 2.2 13 32-44 3-15 (64)
46 KOG1584|consensus 30.6 34 0.00074 25.3 1.6 25 12-36 107-132 (297)
47 PF01345 DUF11: Domain of unkn 30.5 82 0.0018 17.6 3.0 26 85-110 22-47 (76)
48 KOG0420|consensus 30.5 51 0.0011 22.5 2.3 17 4-20 69-88 (184)
49 TIGR02567 YscW type III secret 28.5 1E+02 0.0023 19.7 3.4 25 30-55 81-105 (124)
50 COG2373 Large extracellular al 27.5 47 0.001 30.2 2.2 22 89-110 751-772 (1621)
51 PRK02939 lipoprotein; Reviewed 27.3 2.4E+02 0.0051 20.2 7.1 26 18-43 47-72 (236)
52 TIGR02186 alph_Pro_TM conserve 26.6 2.6E+02 0.0055 20.4 5.5 40 10-57 182-221 (261)
53 KOG1366|consensus 26.4 1.1E+02 0.0025 27.6 4.2 28 83-110 109-136 (1436)
54 PF12857 TOBE_3: TOBE-like dom 26.4 92 0.002 16.7 2.6 24 85-109 29-56 (58)
55 PF05132 RNA_pol_Rpc4: RNA pol 25.1 54 0.0012 20.8 1.7 14 12-25 6-19 (131)
56 TIGR03769 P_ac_wall_RPT actino 24.9 94 0.002 15.7 2.2 12 4-15 11-22 (41)
57 PF12761 End3: Actin cytoskele 24.3 15 0.00032 25.4 -1.1 24 13-36 18-43 (195)
58 TIGR01451 B_ant_repeat conserv 23.4 54 0.0012 17.4 1.2 13 96-108 4-16 (53)
59 PF13584 BatD: Oxygen toleranc 23.3 1E+02 0.0023 23.7 3.2 26 85-110 124-149 (484)
60 PF13584 BatD: Oxygen toleranc 22.3 89 0.0019 24.1 2.7 24 87-110 271-294 (484)
61 PF02303 Phage_DNA_bind: Helix 22.2 57 0.0012 19.5 1.2 15 6-20 38-52 (86)
62 PF14734 DUF4469: Domain of un 21.6 1.7E+02 0.0038 17.9 3.4 22 12-41 66-87 (102)
63 PF05078 DUF679: Protein of un 21.3 66 0.0014 21.8 1.5 16 22-37 58-73 (170)
64 smart00736 CADG Dystroglycan-t 21.1 1.8E+02 0.0039 17.1 3.4 25 11-38 14-39 (97)
65 KOG3265|consensus 21.0 1.5E+02 0.0033 21.2 3.3 37 3-39 65-117 (250)
No 1
>KOG3780|consensus
Probab=99.95 E-value=4.3e-28 Score=181.51 Aligned_cols=109 Identities=42% Similarity=0.827 Sum_probs=104.2
Q ss_pred eecCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEccccCCCCCCCCCceeEEEeeeEEE
Q psy8451 2 EIPQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYDLNTDMRAKEPINQNVSKNFRF 81 (110)
Q Consensus 2 ~l~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~~~~~p~~~~p~~~~~~~~~~~ 81 (110)
.|++|.|+|||+|+||.+||+||+|.+|+|||.|+|.++|+|+.+....+.|+|++.+|+|..|....|.....++.++|
T Consensus 101 ~l~~G~~~~pF~~~LP~~~P~Sfeg~~G~irY~vk~~idr~~~~~~~~~~~~~V~~~~~ln~~p~~~~~~~~~~~k~~~~ 180 (427)
T KOG3780|consen 101 VLPPGNYEFPFSFTLPLNLPPSFEGKFGHVRYFVKAEIDRPWKLNKKNRKPFTVIETVDLNSSPSLLEPIISKASKKLGC 180 (427)
T ss_pred ecCCCceEEeEeccCCCCCCCceeeCCceEEEEEEEEEecCCCCCccceeeEEEecccccccCccccCcchhhhhheeeE
Confidence 48899999999999999999999999999999999999999999999999999999999998888899999999999999
Q ss_pred EEccCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451 82 LWRQSGPLTLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 82 ~~~~~g~v~~~~~l~k~~y~pGe~i~i~v 110 (110)
||+.+|.+.+++++++.+|++||.|++.+
T Consensus 181 ~~~~~g~v~~~~~ip~~~~~~ge~i~~~~ 209 (427)
T KOG3780|consen 181 VCFSSGPVSLELTIPKTGYVPGETIPVTL 209 (427)
T ss_pred EEecCCcEEEEEEcccccCcCCccEEEEE
Confidence 99999999999999999999999998763
No 2
>PF00339 Arrestin_N: Arrestin (or S-antigen), N-terminal domain; InterPro: IPR011021 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ]. The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin. The N-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=99.84 E-value=1.9e-21 Score=126.52 Aligned_cols=61 Identities=49% Similarity=0.931 Sum_probs=47.2
Q ss_pred eecCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEccccCC
Q psy8451 2 EIPQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYDLN 62 (110)
Q Consensus 2 ~l~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~~~ 62 (110)
.|++|+|+|||+|+||.+||+||++.+|+|+|.|+|.+++|+..+.+++++|+|++.+|+|
T Consensus 89 ~l~~G~~~fpF~f~LP~~lP~S~~~~~g~I~Y~l~a~l~~~~~~~~~~~~~~~v~~~id~n 149 (149)
T PF00339_consen 89 ILPPGEYEFPFEFQLPSNLPSSFEGSHGSIRYKLKATLDRPGKKDHKAKREFTVVEPIDLN 149 (149)
T ss_dssp ---C-TTEEEEEE---TTS--SEEEE-SEEEEEEEEEESSTTSE--CGGEEEEEEEEEEB-
T ss_pred cccCCCEEEEEEEECCCCCCceEeccCcCEEEEEEEEEECCCCCCcEEEEEEEEECcccCC
Confidence 4789999999999999999999999999999999999999999999999999999999986
No 3
>KOG3865|consensus
Probab=99.39 E-value=1e-12 Score=94.69 Aligned_cols=100 Identities=27% Similarity=0.428 Sum_probs=72.0
Q ss_pred CeeeEEEeEEECCCCCCCcee--------cCCeEEEEEEEEEEeCCC-CCCc-eEEEEEEEEccccCCCCCCCCCce-eE
Q psy8451 5 QGSHVYPFSYTLPPTIPSSFD--------GEHGRVRYKVTARLDRPW-KFDK-TTEAVFSVVTPYDLNTDMRAKEPI-NQ 73 (110)
Q Consensus 5 ~G~h~fpF~f~LP~~lPsSf~--------~~~g~I~Y~i~a~i~~~~-~~~~-~~~~~~~V~~~~~~~~~p~~~~p~-~~ 73 (110)
-|.+.|||.|++|.++|+|+. |+-+.|.|.|+|.+.... ...+ +.+-.+. ++.+. ..|.-..|+ ..
T Consensus 106 LG~nAyPF~f~~pp~~P~SVtLQp~p~D~gKpcGVdyevkaF~~~s~edk~hKr~sVrL~-IRKvq--yAP~~~GpqP~~ 182 (402)
T KOG3865|consen 106 LGSNAYPFTFEFPPNLPCSVTLQPGPEDTGKPCGVDYEVKAFVADSEEDKIHKRNSVRLV-IRKVQ--YAPLEPGPQPSA 182 (402)
T ss_pred hCCCCCceEEeCCCCCCceEEeccCCccCCCcccceEEEEEEecCCcccccccccceeee-eeeee--ecCCCCCCCchh
Confidence 389999999999999999995 566899999999996442 1122 2222333 33332 234333343 23
Q ss_pred EEeeeEEEEEccCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451 74 NVSKNFRFLWRQSGPLTLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 74 ~~~~~~~~~~~~~g~v~~~~~l~k~~y~pGe~i~i~v 110 (110)
+..+.| .++.|+++++++|||.-|..||.|.++|
T Consensus 183 ~v~k~F---lmS~~~lhLevsLDkEiYyHGE~isvnV 216 (402)
T KOG3865|consen 183 EVSKQF---LMSDGPLHLEVSLDKEIYYHGEPISVNV 216 (402)
T ss_pred HhhHhh---ccCCCceEEEEEecchheecCCceeEEE
Confidence 445655 6899999999999999999999999986
No 4
>PF13002 LDB19: Arrestin_N terminal like; InterPro: IPR024391 This entry represents a predicted Ig-like beta sandwich domain found towards the N terminus of protein LDB19 []. It is also found in other sequences and is related to the arrestin N-terminal fold [].
Probab=99.22 E-value=9.8e-11 Score=79.50 Aligned_cols=58 Identities=28% Similarity=0.502 Sum_probs=46.5
Q ss_pred eecCeeeEEEeEEECCCCCCCcee---cCCeEEEEEEEEEEeC--CC------CC-CceEEEEEEEEccc
Q psy8451 2 EIPQGSHVYPFSYTLPPTIPSSFD---GEHGRVRYKVTARLDR--PW------KF-DKTTEAVFSVVTPY 59 (110)
Q Consensus 2 ~l~~G~h~fpF~f~LP~~lPsSf~---~~~g~I~Y~i~a~i~~--~~------~~-~~~~~~~~~V~~~~ 59 (110)
.|..|.|.|||++-||.+||+|.+ +..++|.|.++|++.. +. +. .++.++.+.|.+.+
T Consensus 46 ~l~~G~h~fPFS~LiPG~LPaS~~lgs~~l~~I~Yel~A~a~~~~~~~~~~~~~~~~~~~~~pl~V~Rsi 115 (191)
T PF13002_consen 46 TLTKGSHAFPFSYLIPGHLPASMDLGSTPLVSIKYELKAEATYKDPRRGSSSSKPRVLKLKRPLPVKRSI 115 (191)
T ss_pred ccCCCcccCCeeEECCCCCccccccCCCCcEEEEEEEEEEEEEccCccccCCCcceeEEEeeeEEEEEec
Confidence 478999999999999999999999 7889999999999976 21 11 13455667776654
No 5
>PF03643 Vps26: Vacuolar protein sorting-associated protein 26 ; InterPro: IPR005377 The movement of lipid and protein components between intracellular organelles requires the regulated interactions of many molecules. Vacuolar protein sorting-associated protein (Vps)5 is a yeast protein that is a subunit of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. Sorting nexin (SNX) 1 and SNX2 are its mammalian orthologs []. To carry out its biological functions, Vps5 forms the retromer complex with at least four other proteins: Vps17, Vps26, Vps29, and Vps35 []. This family of Vps26-proteins also contains Down syndrome critical region 3/A.; GO: 0007034 vacuolar transport, 0030904 retromer complex; PDB: 3LHA_A 3LH9_A 2R51_A 3LH8_B 2FAU_A.
Probab=98.63 E-value=9.7e-07 Score=63.57 Aligned_cols=92 Identities=21% Similarity=0.357 Sum_probs=54.5
Q ss_pred ecCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEccccCCCCCCCCCceeEEEeeeEEEE
Q psy8451 3 IPQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYDLNTDMRAKEPINQNVSKNFRFL 82 (110)
Q Consensus 3 l~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~~~~~p~~~~p~~~~~~~~~~~~ 82 (110)
|+.|.- |||+|.+-..--.||.|.+-+|+|.|+|.+.|++ .+...+.+|.|... ...|+...|...+.
T Consensus 93 l~~~~t-~pFeF~~~~k~yETY~G~~v~i~Y~lrv~v~R~~-~~i~k~~ef~V~~~---~~~p~~~~~ik~ev------- 160 (275)
T PF03643_consen 93 LPEGKT-FPFEFPLVEKPYETYHGVNVNIRYFLRVTVKRSY-KDISKEQEFWVQNF---SITPESNQPIKMEV------- 160 (275)
T ss_dssp E-S-EE-EEEEE-SB---S--EE-SSEEEEEEEEEEE--SS-S-EEEEEEEEEE-E---B--------EEEEE-------
T ss_pred cCCCcE-EeeEeCCCCCCCccEeeeEEEEEEEEEEEEEccC-CCcceEEEEEEEec---cCCCCCCCCccccc-------
Confidence 566666 9999988544478999999999999999999999 78899999999843 22255556655543
Q ss_pred EccCccEEEEEEeCCeeeeCCCeEE
Q psy8451 83 WRQSGPLTLSVSAPFSGFVPGQTIP 107 (110)
Q Consensus 83 ~~~~g~v~~~~~l~k~~y~pGe~i~ 107 (110)
--.+.++++..++++.|.-.|.|.
T Consensus 161 -gie~~lhief~~~k~~~~l~d~i~ 184 (275)
T PF03643_consen 161 -GIEDCLHIEFEYDKSKYHLKDVIT 184 (275)
T ss_dssp -CETTTEEEEEEES-SEEETT-EEE
T ss_pred -CCCccEEEEEEEcccceECCCCEE
Confidence 124566888888888888777653
No 6
>PF08737 Rgp1: Rgp1; InterPro: IPR014848 Rgp1 forms heterodimer with Ric1 (IPR009771 from INTERPRO) which associates with Golgi membranes and functions as a guanyl-nucleotide exchange factor [].
Probab=97.99 E-value=3.2e-05 Score=58.73 Aligned_cols=57 Identities=23% Similarity=0.469 Sum_probs=44.2
Q ss_pred CeecCeee-EEEeEEECCCCCCCceecCCeEEEEEEEEEEeCC---CCCCceEEEEEEEEc
Q psy8451 1 MEIPQGSH-VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRP---WKFDKTTEAVFSVVT 57 (110)
Q Consensus 1 ~~l~~G~h-~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~---~~~~~~~~~~~~V~~ 57 (110)
+.|.||+- +|-|++.||.+||+||.|+.-+|.|.+.+.+.+. ........-+|.|..
T Consensus 113 l~L~pge~k~f~~~~~lP~~lPPsy~g~~i~~~Y~l~vg~~~~~~~~~~~~~~~~Pi~v~~ 173 (415)
T PF08737_consen 113 LRLAPGESKSFHFSFTLPKDLPPSYRGKAIKISYSLVVGTQRGSSVNSKVKSIRVPIRVLP 173 (415)
T ss_pred eEECCCCcEEEEEEEeCCCCCCCCCcCcEEEEEEEEEEEEEEcCCCCcceEEEEeCEEEee
Confidence 46788764 7999999999999999999999999999999982 222333444566654
No 7
>KOG2717|consensus
Probab=97.02 E-value=0.00094 Score=47.24 Aligned_cols=61 Identities=26% Similarity=0.458 Sum_probs=48.6
Q ss_pred ecCeeeEEEeEEECC-----CCCCCceecCCeEEEEEEEEEEeCCC-CCCceEEEEEEEEc-cccCCC
Q psy8451 3 IPQGSHVYPFSYTLP-----PTIPSSFDGEHGRVRYKVTARLDRPW-KFDKTTEAVFSVVT-PYDLNT 63 (110)
Q Consensus 3 l~~G~h~fpF~f~LP-----~~lPsSf~~~~g~I~Y~i~a~i~~~~-~~~~~~~~~~~V~~-~~~~~~ 63 (110)
+++|.-++||++.|- .-|-.+|.|.+-+|.|.+++.+.|.. ..+.....+|.|.. ++|+..
T Consensus 84 ~p~G~tEipFelpL~~kge~~~lYETyHGvfiNiqY~LtcdikR~~L~K~ltkt~eFiv~s~pv~l~e 151 (313)
T KOG2717|consen 84 IPPGTTEIPFELPLREKGEGEKLYETYHGVFINIQYLLTCDIKRGYLHKPLTKTMEFIVESGPVDLPE 151 (313)
T ss_pred CCCCceeeeeeeeeccCCCccEeeeeecceEEEEEEEEEEecccchhcCchhhhheeeeccCCccccc
Confidence 789999999999984 45677888888899999999999984 55667778888854 555543
No 8
>PF07070 Spo0M: SpoOM protein; InterPro: IPR009776 This family consists of several bacterial SpoOM proteins which are thought to control sporulation in Bacillus subtilis.Spo0M exerts certain negative effects on sporulation and its gene expression is controlled by sigmaH [].
Probab=97.01 E-value=0.0027 Score=44.41 Aligned_cols=52 Identities=27% Similarity=0.328 Sum_probs=41.8
Q ss_pred eecCe-eeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEc
Q psy8451 2 EIPQG-SHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVT 57 (110)
Q Consensus 2 ~l~~G-~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~ 57 (110)
+++|| ++++||+|+||-++|-|. +..+|.++..+|.++..|..-.-++.|..
T Consensus 81 ~I~~ge~~~iPF~~~lP~etPiT~----~~~~v~l~T~LdI~~avD~~D~D~i~V~P 133 (218)
T PF07070_consen 81 TIEPGEEKEIPFSFPLPWETPITE----GGMRVWLRTGLDIAGAVDPGDLDPIEVEP 133 (218)
T ss_pred EECCCCEEEEeEEEECCCCCCccC----CCcEEEEEEEEEeCCCCCCCCceeEEEeC
Confidence 45565 469999999999999887 78999999999988766666666677754
No 9
>PF02752 Arrestin_C: Arrestin (or S-antigen), C-terminal domain; InterPro: IPR011022 G protein-coupled receptors are a large family of signalling molecules that respond to a wide variety of extracellular stimuli. The receptors relay the information encoded by the ligand through the activation of heterotrimeric G proteins and intracellular effector molecules. To ensure the appropriate regulation of the signalling cascade, it is vital to properly inactivate the receptor. This inactivation is achieved, in part, by the binding of a soluble protein, arrestin, which uncouples the receptor from the downstream G protein after the receptors are phosphorylated by G protein-coupled receptor kinases. In addition to the inactivation of G protein-coupled receptors, arrestins have also been implicated in the endocytosis of receptors and cross talk with other signalling pathways. Arrestin (retinal S-antigen) is a major protein of the retinal rod outer segments. It interacts with photo-activated phosphorylated rhodopsin, inhibiting or 'arresting' its ability to interact with transducin []. The protein binds calcium, and shows similarity in its C terminus to alpha-transducin and other purine nucleotide-binding proteins. In mammals, arrestin is associated with autoimmune uveitis. Arrestins comprise a family of closely-related proteins that includes beta-arrestin-1 and -2, which regulate the function of beta-adrenergic receptors by binding to their phosphorylated forms, impairing their capacity to activate G(S) proteins; Cone photoreceptors C-arrestin (arrestin-X) [], which could bind to phosphorylated red/green opsins; and Drosophila phosrestins I and II, which undergo light-induced phosphorylation, and probably play a role in photoreceptor transduction [, , ]. The crystal structure of bovine retinal arrestin comprises two domains of antiparallel beta-sheets connected through a hinge region and one short alpha-helix on the back of the amino-terminal fold []. The binding region for phosphorylated light-activated rhodopsin is located at the N-terminal domain, as indicated by the docking of the photoreceptor to the three-dimensional structure of arrestin. The C-terminal domain consists of an immunoglobulin-like beta-sandwich structure. This entry represents proteins with immunoglobulin-like domains that are similar to those found in arrestin.; PDB: 1SUJ_A 3UGX_A 1CF1_B 1AYR_A 3UGU_A 3P2D_B 1ZSH_A 2WTR_B 3GC3_A 1G4R_A ....
Probab=96.21 E-value=0.025 Score=35.43 Aligned_cols=53 Identities=19% Similarity=0.321 Sum_probs=32.1
Q ss_pred CeeeEEEeEEECCCCCCCce--ecCCeEEEEEEEEEEeCC-CCCCceEEEEEEEEc
Q psy8451 5 QGSHVYPFSYTLPPTIPSSF--DGEHGRVRYKVTARLDRP-WKFDKTTEAVFSVVT 57 (110)
Q Consensus 5 ~G~h~fpF~f~LP~~lPsSf--~~~~g~I~Y~i~a~i~~~-~~~~~~~~~~~~V~~ 57 (110)
.+.+.+..++.||.++++|. .+..-+|.|.|++.+.-. ...+...+-+++|..
T Consensus 78 ~~~~~~~~~l~lP~~~~~s~~~~~~~i~v~Y~l~v~~~~~~~~~~~~~~~PI~I~~ 133 (136)
T PF02752_consen 78 SGSFEFNIQLQLPSNLPPSTSTNSRLIQVEYQLEVTVKLSGCTSDLRLELPITIGS 133 (136)
T ss_dssp TEEEEEEEEE-----B-----CGGGSEEEEEEEEEEEEEETTSEEEEEEEEEEEEB
T ss_pred CceEEEEEEEcCCCccCcccccCCcEEEEEEEEEEEEEECCceeEEEEEccEEEEe
Confidence 34566669999998888887 678889999999999976 344556666666643
No 10
>KOG3063|consensus
Probab=95.80 E-value=0.041 Score=39.17 Aligned_cols=64 Identities=22% Similarity=0.311 Sum_probs=46.0
Q ss_pred eEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEccccCCCCCCCCCceeEEE
Q psy8451 8 HVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYDLNTDMRAKEPINQNV 75 (110)
Q Consensus 8 h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~~~~~p~~~~p~~~~~ 75 (110)
.+|||.|.==..=-.|+.|++-+.||.++|++.|... ++...+.|.|..- ...|+...|..++.
T Consensus 104 ~~fpFeF~~vekpyEsY~G~NV~lrY~lkvTv~Rr~~-di~ke~d~~V~~~---~~~P~~nn~IkmeV 167 (301)
T KOG3063|consen 104 QSFPFEFPHVEKPYESYIGKNVRLRYFLKVTVSRRLT-DIVKEKDLVVHNL---STYPEINNSIKMEV 167 (301)
T ss_pred ccCCccccccccchhhhcCcceEEEEEEEEEEEechh-hhhhhhheeeEec---ccCCCCCCceeEee
Confidence 4678877644444478889999999999999999865 6677777877654 23366666666543
No 11
>COG4326 Spo0M Sporulation control protein [General function prediction only]
Probab=92.01 E-value=0.39 Score=33.64 Aligned_cols=51 Identities=27% Similarity=0.492 Sum_probs=34.5
Q ss_pred eecCe-eeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEE
Q psy8451 2 EIPQG-SHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVV 56 (110)
Q Consensus 2 ~l~~G-~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~ 56 (110)
+++|| +|.|||++.||.+.|=|+ |++.-.+.--+|.....|..-+..++|.
T Consensus 101 TIqpgEe~~fpf~l~lP~~tPvT~----G~~~V~v~TgLDI~~aidp~D~D~l~Vr 152 (270)
T COG4326 101 TIQPGEERNFPFELSLPWNTPVTI----GDAKVWVETGLDIALAIDPTDKDILTVR 152 (270)
T ss_pred EecCCceEeccEEEecCCCCceee----cceeEEEEeccchhccCCCcccceEEEe
Confidence 34555 689999999999999776 6666666666666544343334445554
No 12
>cd00917 PG-PI_TP The phosphatidylinositol/phosphatidylglycerol transfer protein (PG/PI-TP) has been shown to bind phosphatidylglycerol and phosphatidylinositol, but the biological significance of this is still obscure. These proteins belong to the ML domain family.
Probab=87.74 E-value=2.2 Score=26.87 Aligned_cols=32 Identities=31% Similarity=0.429 Sum_probs=23.9
Q ss_pred ecCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCC
Q psy8451 3 IPQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRP 42 (110)
Q Consensus 3 l~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~ 42 (110)
+++|.+.+..+..||...|+ ++|.+++.+...
T Consensus 79 i~~G~~~~~~~~~ip~~~P~--------g~y~v~~~l~d~ 110 (122)
T cd00917 79 IEPGDKFLTKLVDLPGEIPP--------GKYTVSARAYTK 110 (122)
T ss_pred cCCCcEEEEEEeeCCCCCCC--------ceEEEEEEEECC
Confidence 56788888888888866665 468888887654
No 13
>PF04425 Bul1_N: Bul1 N terminus; InterPro: IPR007519 This domain is the N terminus of Saccharomyces cerevisiae (Baker's yeast) Bul1. Bul1 binds the ubiquitin ligase Rsp5, via an N-terminal PPSY motif (157-160 in P48524 from SWISSPROT) []. The complex containing Bul1 and Rsp5 is involved in intracellular trafficking of the general amino acid permease Gap1 [], degradation of Rog1 in cooperation with Bul2 and GSK-3 [], and mitochondrial inheritance []. Bul1 may contain HEAT repeats. The C terminus is IPR007520 from INTERPRO.
Probab=84.61 E-value=3.4 Score=32.00 Aligned_cols=17 Identities=18% Similarity=0.391 Sum_probs=13.8
Q ss_pred eecCe-eeEEEeEEECCC
Q psy8451 2 EIPQG-SHVYPFSYTLPP 18 (110)
Q Consensus 2 ~l~~G-~h~fpF~f~LP~ 18 (110)
.|.|| .|..+|.|+||.
T Consensus 255 ~l~p~~~Yk~fF~FkiP~ 272 (438)
T PF04425_consen 255 ILEPGVKYKKFFTFKIPE 272 (438)
T ss_pred eecCCCeEeceeEEeCCc
Confidence 45554 699999999996
No 14
>PF02221 E1_DerP2_DerF2: ML domain; InterPro: IPR003172 The MD-2-related lipid-recognition (ML) domain is implicated in lipid recognition, particularly in the recognition of pathogen related products. It has an immunoglobulin-like beta-sandwich fold similar to that of E-set Ig domains. This domain is present in the following proteins: Epididymal secretory protein E1 (also known as Niemann-Pick C2 protein), which is known to bind cholesterol. Niemann-Pick disease type C2 is a fatal hereditary disease characterised by accumulation of low-density lipoprotein-derived cholesterol in lysosomes []. House-dust mite allergen proteins such as Der f 2 from Dermatophagoides farinae and Der p 2 from Dermatophagoides pteronyssinus []. ; PDB: 2AG9_B 1G13_B 2AG2_B 2AG4_A 1TJJ_C 1PU5_C 1PUB_A 2AF9_A 3T6Q_D 3M7O_B ....
Probab=83.57 E-value=5.7 Score=24.83 Aligned_cols=32 Identities=22% Similarity=0.572 Sum_probs=20.6
Q ss_pred ecCeee-EEEeEEECCCCCCCceecCCeEEEEEEEEEEeCC
Q psy8451 3 IPQGSH-VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRP 42 (110)
Q Consensus 3 l~~G~h-~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~ 42 (110)
+.+|++ .|.+++.+|..+|.- .|.+++.+...
T Consensus 88 i~~G~~~~~~~~~~i~~~~p~~--------~~~i~~~l~d~ 120 (134)
T PF02221_consen 88 IKAGEYYTYTYTIPIPKIYPPG--------KYTIQWKLTDQ 120 (134)
T ss_dssp BTTTEEEEEEEEEEESTTSSSE--------EEEEEEEEEET
T ss_pred cCCCcEEEEEEEEEcccceeeE--------EEEEEEEEEeC
Confidence 456766 777777777666643 66666666544
No 15
>cd00258 GM2-AP GM2 activator protein (GM2-AP) is a non-enzymatic lysosomal protein that acts as cofactor in the sequential degradation of gangliosides. GM2A is an essential cofactor for beta-hexosaminidase A (Hex A) in the enzymatic hydrolysis of GM2 ganglioside to GM3. Mutation of the gene results in the AB variant of Tay-Sachs disease. GM2-AP and similar proteins belong to the ML domain family.
Probab=78.78 E-value=10 Score=25.52 Aligned_cols=37 Identities=30% Similarity=0.503 Sum_probs=26.1
Q ss_pred ecCeeeEEEe-EEECC-CCCCCceecCCeEEEEEEEEEEeCCC
Q psy8451 3 IPQGSHVYPF-SYTLP-PTIPSSFDGEHGRVRYKVTARLDRPW 43 (110)
Q Consensus 3 l~~G~h~fpF-~f~LP-~~lPsSf~~~~g~I~Y~i~a~i~~~~ 43 (110)
+++|+|+.|= +|.|| .+||+... .| .|++++.++...
T Consensus 109 ~~~G~y~lp~s~f~lP~~~LPs~l~--~G--~Y~i~~~l~~~g 147 (162)
T cd00258 109 FKEGVYSLPDSTFTLPNVDLPSWLT--NG--NYRITGILMADG 147 (162)
T ss_pred CCCcceEccceeeecccccCCCccC--CC--cEEEEEEECCCC
Confidence 5789999964 55999 68897664 23 378888886433
No 16
>cd08544 Reeler Reeler, the N-terminal domain of reelin, F-spondin, and a variety of other proteins. This domain is found at the N-terminus of F-spondin, a protein attached to the extracellular matrix, which plays roles in neuronal development and vascular remodelling. The F-spondin reeler domain has been reported to bind heparin. The reeler domain is also found at the N-terminus of reelin, an extracellular glycoprotein involved in the development of the brain cortex, and in a variety of other eukaryotic proteins with different domain architectures, including the animal ferric-chelate reductase 1 or stromal cell-derived receptor 2, a member of the cytochrome B561 family, which reduces ferric iron before its transport from the endosome to the cytoplasm. Also included is the insect putative defense protein 1, which is expressed upon bacterial infection and appears to contain a single reeler domain.
Probab=78.04 E-value=2.6 Score=26.85 Aligned_cols=22 Identities=23% Similarity=0.440 Sum_probs=17.9
Q ss_pred EEEEEEeCCeeeeCCCeEEEEC
Q psy8451 89 LTLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 89 v~~~~~l~k~~y~pGe~i~i~v 110 (110)
.-+.+.++...|.|||.+.|+|
T Consensus 19 ~py~i~~~~~~y~pG~~~~Vtl 40 (135)
T cd08544 19 SPYSITISGNSYVPGETYTVTL 40 (135)
T ss_pred CCEEEEeCCCEECCCCEEEEEE
Confidence 3577788888999999998875
No 17
>KOG4469|consensus
Probab=77.24 E-value=8.9 Score=27.43 Aligned_cols=44 Identities=30% Similarity=0.423 Sum_probs=35.2
Q ss_pred eecCee-eEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCC
Q psy8451 2 EIPQGS-HVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKF 45 (110)
Q Consensus 2 ~l~~G~-h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~ 45 (110)
.|.+|+ .+|.++-.||-.=|+||.|..-+-.|++..-+.|-+.+
T Consensus 104 rldpgesksysysevlpiegppsfrgqsvkyvykltigcqrvnsp 148 (391)
T KOG4469|consen 104 RLDPGESKSYSYSEVLPIEGPPSFRGQSVKYVYKLTIGCQRVNSP 148 (391)
T ss_pred ecCCCccccccceeeeeccCCCccCCceeEEEEEEEeeeEecCCc
Confidence 456665 47889999999999999998888888888888876654
No 18
>smart00737 ML Domain involved in innate immunity and lipid metabolism. ML (MD-2-related lipid-recognition) is a novel domain identified in MD-1, MD-2, GM2A, Npc2 and multiple proteins of unknown function in plants, animals and fungi. These single-domain proteins were predicted to form a beta-rich fold containing multiple strands, and to mediate diverse biological functions through interacting with specific lipids.
Probab=75.40 E-value=12 Score=22.95 Aligned_cols=31 Identities=32% Similarity=0.485 Sum_probs=15.9
Q ss_pred ecCeee-EEEeEEECCCCCCCceecCCeEEEEEEEEEEeC
Q psy8451 3 IPQGSH-VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDR 41 (110)
Q Consensus 3 l~~G~h-~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~ 41 (110)
+.+|++ .|..++.+|..+| .+.|.+++++..
T Consensus 74 l~~G~~~~~~~~~~v~~~~P--------~~~~~v~~~l~d 105 (118)
T smart00737 74 IEKGETVNYTNSLTVPGIFP--------PGKYTVKWELTD 105 (118)
T ss_pred CCCCeeEEEEEeeEccccCC--------CeEEEEEEEEEc
Confidence 345554 2555555555444 235666666643
No 19
>PF08694 UFC1: Ubiquitin-fold modifier-conjugating enzyme 1; InterPro: IPR014806 Ubiquitin-like (UBL) post-translational modifiers are covalently linked to most, if not all, target protein(s) through an enzymatic cascade analogous to ubiquitylation, consisting of E1 (activating), E2 (conjugating), and E3 (ligating) enzymes. Ubiquitin-fold modifier 1 (Ufm1) a ubiquitin-like protein is activated by a novel E1-like enzyme, Uba5, by forming a high-energy thioester bond. Activated Ufm1 is then transferred to its cognate E2-like enzyme, Ufc1, in a similar thioester linkage. This family represents the E2-like enzyme [].; PDB: 2Z6P_A 2K07_A 2Z6O_A 3EVX_D 3KPA_A.
Probab=68.19 E-value=6.3 Score=25.99 Aligned_cols=19 Identities=16% Similarity=0.387 Sum_probs=13.3
Q ss_pred eeeEEEeEEECCCCCCCce
Q psy8451 6 GSHVYPFSYTLPPTIPSSF 24 (110)
Q Consensus 6 G~h~fpF~f~LP~~lPsSf 24 (110)
-.|.|.|+|.+|...|.+=
T Consensus 73 ~kYEF~~eFdIP~tYP~t~ 91 (161)
T PF08694_consen 73 LKYEFDLEFDIPVTYPTTA 91 (161)
T ss_dssp EEEEEEEEEE--TTTTTS-
T ss_pred eeEEEeeecCCCccCCCCC
Confidence 3689999999998888653
No 20
>PF02014 Reeler: Reeler domain Schematic picture including Reeler domain; InterPro: IPR002861 Extracellular matrix (ECM) proteins play an important role in early cortical development, specifically in the formation of neural connections and in controlling the cyto-architecture of the central nervous system. The product of the reeler gene in mouse is reelin,a large extracellular protein secreted by pioneer neurons that coordinates cell positioning during neurodevelopment []. F-spondin and mindin are a family of matrix-attached adhesion molecules that share structural similarities and overlapping domains of expression. Both F-spondin and mindin promote adhesion and outgrowth of hippocampal embryonic neurons and bind to a putative receptor(s) expressed on both hippocampal and sensory neurons []. This domain of unknown function is found at the N terminus of reelin and F-spondin.; PDB: 2ZOT_B 2ZOU_B 3COO_A.
Probab=65.60 E-value=5.7 Score=25.20 Aligned_cols=21 Identities=33% Similarity=0.494 Sum_probs=14.9
Q ss_pred EEEEEeCCeeeeCCCeEEEEC
Q psy8451 90 TLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 90 ~~~~~l~k~~y~pGe~i~i~v 110 (110)
-+.+.++...|.||+.+.|+|
T Consensus 20 py~i~~~~~~y~pg~~~~Vtl 40 (132)
T PF02014_consen 20 PYSISVSPSSYEPGQTYTVTL 40 (132)
T ss_dssp SEEEEET-SSB-TTBEEEEEE
T ss_pred CEEEEeCCCeEcCCCEEEEEE
Confidence 456666799999999998875
No 21
>cd02859 AMPKbeta_GBD_like AMP-activated protein kinase (AMPK) beta subunit glycogen binding domain (GBD). AMPK is a metabolic stress sensing protein that senses AMP/ATP and has recently been found to act as a glycogen sensor as well. The protein functions as a alpha-beta-gamma heterotrimer. This domain is the glycogen binding domain of the beta subunit.
Probab=63.58 E-value=3.9 Score=23.72 Aligned_cols=13 Identities=31% Similarity=0.639 Sum_probs=9.1
Q ss_pred eecCeeeEEEeEE
Q psy8451 2 EIPQGSHVYPFSY 14 (110)
Q Consensus 2 ~l~~G~h~fpF~f 14 (110)
.|++|.|.|.|-.
T Consensus 42 ~L~~g~y~YkF~V 54 (79)
T cd02859 42 RLPPGKYQYKFIV 54 (79)
T ss_pred EcCCCCEEEEEEE
Confidence 5777887777743
No 22
>PF07703 A2M_N_2: Alpha-2-macroglobulin family N-terminal region; InterPro: IPR011625 This is a domain of the alpha-2-macroglobulin family. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; PDB: 2QKI_D 3L3O_D 3NMS_A 2ICF_A 2A73_A 2ICE_D 2HR0_A 2A74_A 2XWJ_G 3OHX_A ....
Probab=62.25 E-value=7.5 Score=24.32 Aligned_cols=23 Identities=22% Similarity=0.447 Sum_probs=19.3
Q ss_pred cEEEEEEeCCeeeeCCCeEEEEC
Q psy8451 88 PLTLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 88 ~v~~~~~l~k~~y~pGe~i~i~v 110 (110)
..+++++.++..|.|||++.++|
T Consensus 94 ~~~v~l~~~~~~~~Pg~~~~~~i 116 (136)
T PF07703_consen 94 ELKVELTASPDEYKPGEEVTLRI 116 (136)
T ss_dssp SSSEEEEESSSSBTTTSEEEEEE
T ss_pred cceEEEEEecceeCCCCEEEEEE
Confidence 45777888899999999998875
No 23
>PF07610 DUF1573: Protein of unknown function (DUF1573); InterPro: IPR011467 These hypothetical proteins from bacteria, such as Rhodopirellula baltica, Bacteroides thetaiotaomicron and Porphyromonas gingivalis, share a region of conserved sequence towards their N termini.
Probab=59.10 E-value=12 Score=19.21 Aligned_cols=21 Identities=10% Similarity=0.120 Sum_probs=17.4
Q ss_pred EEEEEeCCeeeeCCCeEEEEC
Q psy8451 90 TLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 90 ~~~~~l~k~~y~pGe~i~i~v 110 (110)
...+..++....|||+..|.|
T Consensus 23 Ct~~~~~~~~i~PGes~~i~v 43 (45)
T PF07610_consen 23 CTTAEYSKKPIAPGESGKIKV 43 (45)
T ss_pred CEEeeCCcceECCCCEEEEEE
Confidence 357788999999999988864
No 24
>PF08737 Rgp1: Rgp1; InterPro: IPR014848 Rgp1 forms heterodimer with Ric1 (IPR009771 from INTERPRO) which associates with Golgi membranes and functions as a guanyl-nucleotide exchange factor [].
Probab=55.17 E-value=33 Score=26.40 Aligned_cols=32 Identities=13% Similarity=0.288 Sum_probs=28.6
Q ss_pred EEEeEEECCCCCCCceecCCeEEEEEEEEEEe
Q psy8451 9 VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLD 40 (110)
Q Consensus 9 ~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~ 40 (110)
+-+|.+.+|.+.+++|.-....++|.|+....
T Consensus 381 ~~~f~l~IP~~~tp~F~T~~v~lkW~LrfeFv 412 (415)
T PF08737_consen 381 RTSFSLPIPLSATPQFQTSGVSLKWRLRFEFV 412 (415)
T ss_pred eEEEEeeCCCCCCCceEeCCEEEEEEEEEEEE
Confidence 67899999999999999988899999988764
No 25
>cd00912 ML The ML (MD-2-related lipid-recognition) domain is present in MD-1, MD-2, GM2 activator protein, Niemann-Pick type C2 (Npc2) protein, phosphatidylinositol/phosphatidylglycerol transfer protein (PG/PI-TP), mite allergen Der p 2 and several proteins of unknown function in plants, animals and fungi. These single-domain proteins form two anti-parallel beta-pleated sheets stabilized by three disulfide bonds and with an accessible central hydrophobic cavity, and are predicted to mediate diverse biological functions through interaction with specific lipids.
Probab=53.00 E-value=17 Score=22.75 Aligned_cols=31 Identities=26% Similarity=0.472 Sum_probs=21.8
Q ss_pred ecCe-eeEEEeEEECCC-CCCCceecCCeEEEEEEEEEEeC
Q psy8451 3 IPQG-SHVYPFSYTLPP-TIPSSFDGEHGRVRYKVTARLDR 41 (110)
Q Consensus 3 l~~G-~h~fpF~f~LP~-~lPsSf~~~~g~I~Y~i~a~i~~ 41 (110)
|.+| .|.|..++.||. ..| .+.|.++..+..
T Consensus 82 l~~G~~~~~~~~~~v~~~~~P--------~~~~~v~~~l~~ 114 (127)
T cd00912 82 LRKGQQYSYAKTVNVPEFTIP--------TIEYQVVLEDVT 114 (127)
T ss_pred cCCCCEEEEEEEEecCcccCC--------CeeEEEEEEEEc
Confidence 5788 899999998886 555 345666666554
No 26
>PF08845 SymE_toxin: Toxin SymE, type I toxin-antitoxin system; InterPro: IPR014944 This entry represents a SOS-induced gene whose product shows homology to the antitoxin MazE (SymE), the coding region contains a cis-encoded antisense RNA. The small antisense RNA and the gene have the all the hallmarks of a toxin-antitoxin module. The synthesis of the SymE is tightly repressed at multiple levels; at the transcriptional level by the LexA repressor, at the level of mRNA stability and translation by the SymR RNA and at the level of protein stability by the Lon protease. SymE co-purifies with ribosomes and overproduction of the protein leads to cell growth inhibition, decreased protein synthesis and increased RNA degradation. These properties are shared with several RNA endonuclease toxins of toxin-antitoxin modules. It seems probable that the SymE protein represents an evolutionary derivative of a toxin containing the AbrB fold, whose representatives are typically antitoxins. The SymE promoted cleavage of RNA cleavage may be important for the recycling of RNAs damaged under SOS-inducing conditions []. ; GO: 0003723 RNA binding, 0016788 hydrolase activity, acting on ester bonds, 0016070 RNA metabolic process, 0005737 cytoplasm
Probab=48.60 E-value=6.5 Score=21.67 Aligned_cols=17 Identities=18% Similarity=0.290 Sum_probs=13.7
Q ss_pred EeCCeeeeCCCeEEEEC
Q psy8451 94 SAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 94 ~l~k~~y~pGe~i~i~v 110 (110)
+|...||.+|+.+.|+|
T Consensus 32 WL~~aGF~~G~~v~V~v 48 (57)
T PF08845_consen 32 WLEEAGFTIGDPVKVRV 48 (57)
T ss_pred hhHHhCCCCCCEEEEEE
Confidence 67788888888888764
No 27
>PF04729 ASF1_hist_chap: ASF1 like histone chaperone; InterPro: IPR006818 This family includes the yeast and human ASF1 protein. These proteins have histone chaperone activity []. ASF1 participates in both the replication-dependent and replication-independent pathways. The structure three-dimensional has been determined as a compact immunoglobulin-like beta sandwich fold topped by three helical linkers [].; GO: 0006333 chromatin assembly or disassembly, 0005634 nucleus; PDB: 2YGV_C 1WG3_A 2HUE_A 2IDC_A 4EO5_A 1ROC_A 1TEY_A 2IO5_A 3AAD_D 2IIJ_A ....
Probab=47.98 E-value=24 Score=23.48 Aligned_cols=39 Identities=33% Similarity=0.579 Sum_probs=24.7
Q ss_pred ecCeeeEEEeEEECC--CCCCCc-------------eec-CCeEEEEEEEEEEeC
Q psy8451 3 IPQGSHVYPFSYTLP--PTIPSS-------------FDG-EHGRVRYKVTARLDR 41 (110)
Q Consensus 3 l~~G~h~fpF~f~LP--~~lPsS-------------f~~-~~g~I~Y~i~a~i~~ 41 (110)
++.|.|+|-|+-.-| ..+|.+ +++ ++-+|-|.+......
T Consensus 65 i~~G~~kF~~~adpPd~~~Ip~~dllGvTvillt~sY~~~EFiRVGYYVnney~d 119 (154)
T PF04729_consen 65 IPVGVNKFVFEADPPDPSKIPPEDLLGVTVILLTCSYRGQEFIRVGYYVNNEYPD 119 (154)
T ss_dssp -ESEEEEEEEEE----GGGSSCCHHSEEEEEEEEEEETTEEEEEEEEEEEEEESS
T ss_pred CcceeEEEEEEccCCChhhCChhhccccEEEEEEEEECCeEEEEEeeEEEeeeCC
Confidence 578999999999998 577742 222 345777888777654
No 28
>PF03643 Vps26: Vacuolar protein sorting-associated protein 26 ; InterPro: IPR005377 The movement of lipid and protein components between intracellular organelles requires the regulated interactions of many molecules. Vacuolar protein sorting-associated protein (Vps)5 is a yeast protein that is a subunit of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. Sorting nexin (SNX) 1 and SNX2 are its mammalian orthologs []. To carry out its biological functions, Vps5 forms the retromer complex with at least four other proteins: Vps17, Vps26, Vps29, and Vps35 []. This family of Vps26-proteins also contains Down syndrome critical region 3/A.; GO: 0007034 vacuolar transport, 0030904 retromer complex; PDB: 3LHA_A 3LH9_A 2R51_A 3LH8_B 2FAU_A.
Probab=46.38 E-value=68 Score=23.32 Aligned_cols=32 Identities=19% Similarity=0.287 Sum_probs=22.5
Q ss_pred EEEeEEECC-CCCCCcee--cCCeEEEEEEEEEEe
Q psy8451 9 VYPFSYTLP-PTIPSSFD--GEHGRVRYKVTARLD 40 (110)
Q Consensus 9 ~fpF~f~LP-~~lPsSf~--~~~g~I~Y~i~a~i~ 40 (110)
..||.+.|| -++=+|+. .+.-+|+|.+.+.+.
T Consensus 237 ~IPirl~l~~l~l~Pt~~~~~~~FsV~y~lnlvli 271 (275)
T PF03643_consen 237 SIPIRLFLPRLFLCPTYKNVNNKFSVEYELNLVLI 271 (275)
T ss_dssp EEEEEEECCCT-----EEEECTTEEEEEEEEEEEE
T ss_pred eeeEEEEcCCcccCCcchhcCCcEEEEEEEEEEEE
Confidence 579999999 57777777 456899999998874
No 29
>cd02861 E_set_proteins_like E or "early" set-like proteins. These alpha amylase-like sugar utilizing enzymes which may be related to the immunoglobulin and/or fibronectin type III superfamilies are associated with different types of catalytic domains at either the N-terminal or C-terminal end. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=45.85 E-value=15 Score=21.20 Aligned_cols=11 Identities=27% Similarity=0.522 Sum_probs=5.3
Q ss_pred ecCeeeEEEeE
Q psy8451 3 IPQGSHVYPFS 13 (110)
Q Consensus 3 l~~G~h~fpF~ 13 (110)
|++|.|.|.|.
T Consensus 44 l~~G~y~Ykf~ 54 (82)
T cd02861 44 LRPGRYEYKFV 54 (82)
T ss_pred CCCCcEEEEEE
Confidence 44555544443
No 30
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=43.97 E-value=66 Score=18.80 Aligned_cols=26 Identities=31% Similarity=0.181 Sum_probs=17.2
Q ss_pred EEEEEEEEEEeCCCCCCceEEEEEEEE
Q psy8451 30 RVRYKVTARLDRPWKFDKTTEAVFSVV 56 (110)
Q Consensus 30 ~I~Y~i~a~i~~~~~~~~~~~~~~~V~ 56 (110)
.=+|.++|.+++.. .....++.+.+.
T Consensus 41 ~y~Y~v~a~~~~dG-~~~t~~~~V~vr 66 (75)
T TIGR03000 41 EYEYTVTAEYDRDG-RILTRTRTVVVR 66 (75)
T ss_pred EEEEEEEEEEecCC-cEEEEEEEEEEc
Confidence 34599999999877 344455555554
No 31
>PF01835 A2M_N: MG2 domain; InterPro: IPR002890 The proteinase-binding alpha-macroglobulins (A2M) [] are large glycoproteins found in the plasma of vertebrates, in the hemolymph of some invertebrates and in reptilian and avian egg white. A2M-like proteins are able to inhibit all four classes of proteinases by a 'trapping' mechanism. They have a peptide stretch, called the 'bait region', which contains specific cleavage sites for different proteinases. When a proteinase cleaves the bait region, a conformational change is induced in the protein, thus trapping the proteinase. The entrapped enzyme remains active against low molecular weight substrates, whilst its activity toward larger substrates is greatly reduced, due to steric hindrance. Following cleavage in the bait region, a thiol ester bond, formed between the side chains of a cysteine and a glutamine, is cleaved and mediates the covalent binding of the A2M-like protein to the proteinase. This family includes the N-terminal region of the alpha-2-macroglobulin family. The inhibitor domains belong to MEROPS inhibitor family I39.; GO: 0004866 endopeptidase inhibitor activity; PDB: 2B39_B 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 4ACQ_C 2P9R_B ....
Probab=43.67 E-value=67 Score=18.81 Aligned_cols=24 Identities=29% Similarity=0.690 Sum_probs=11.0
Q ss_pred EEEeEEECCCCCCCceecCCeEEEEEEEEEEe
Q psy8451 9 VYPFSYTLPPTIPSSFDGEHGRVRYKVTARLD 40 (110)
Q Consensus 9 ~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~ 40 (110)
.|.++|+||.+.+. | .|.|+|..+
T Consensus 63 ~~~~~~~lp~~~~~------G--~y~i~~~~~ 86 (99)
T PF01835_consen 63 IFSGSFQLPDDAPL------G--TYTIRVKTD 86 (99)
T ss_dssp EEEEEEE--SS---------E--EEEEEEEET
T ss_pred EEEEEEECCCCCCC------E--eEEEEEEEc
Confidence 34556777766542 2 366666665
No 32
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=42.82 E-value=15 Score=19.83 Aligned_cols=11 Identities=18% Similarity=0.428 Sum_probs=4.2
Q ss_pred ecCeeeEEEeE
Q psy8451 3 IPQGSHVYPFS 13 (110)
Q Consensus 3 l~~G~h~fpF~ 13 (110)
|++|+|+|--.
T Consensus 36 L~~G~Y~l~V~ 46 (66)
T PF07495_consen 36 LPPGKYTLEVR 46 (66)
T ss_dssp --SEEEEEEEE
T ss_pred CCCEEEEEEEE
Confidence 45555554433
No 33
>KOG3063|consensus
Probab=41.76 E-value=94 Score=22.54 Aligned_cols=52 Identities=12% Similarity=0.075 Sum_probs=37.4
Q ss_pred EEEeEEECC-CCCCCcee--cCCeEEEEEEEEEEeCCCCCCceEEEEEEEEcccc
Q psy8451 9 VYPFSYTLP-PTIPSSFD--GEHGRVRYKVTARLDRPWKFDKTTEAVFSVVTPYD 60 (110)
Q Consensus 9 ~fpF~f~LP-~~lPsSf~--~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~~~~ 60 (110)
+.|-...|- -+|-+|+. .+.-+|+|.+...+..-........++++.-+.-|
T Consensus 244 sIPiRlFLagYdlTPtmrdinkkFsVkyyLnLVlvDeedRRYFKQqEItLwR~~d 298 (301)
T KOG3063|consen 244 SIPIRLFLAGYDLTPTMRDINKKFSVKYYLNLVLVDEEDRRYFKQQEITLWRKAD 298 (301)
T ss_pred eeeeEEEecccCCCcchhhhcceeeeeeEEEEEEEchhhhhhhhheeEEEEEecc
Confidence 567777776 79999998 57789999999998654444444566777765533
No 34
>PF06688 DUF1187: Protein of unknown function (DUF1187); InterPro: IPR009572 This family consists of several short, hypothetical bacterial proteins of around 62 residues in length. Members of this family are found in Escherichia coli and Salmonella typhi. The function of this family is unknown.
Probab=41.29 E-value=43 Score=18.69 Aligned_cols=14 Identities=36% Similarity=0.524 Sum_probs=11.6
Q ss_pred EEEEEEEeCCCCCC
Q psy8451 33 YKVTARLDRPWKFD 46 (110)
Q Consensus 33 Y~i~a~i~~~~~~~ 46 (110)
|+|+|+|..|....
T Consensus 1 YkItAtI~KpG~~P 14 (61)
T PF06688_consen 1 YKITATIIKPGNTP 14 (61)
T ss_pred CceEEEEEcCCCCC
Confidence 89999999986543
No 35
>PF09608 Alph_Pro_TM: Putative transmembrane protein (Alph_Pro_TM); InterPro: IPR019088 This entry consists of predicted transmembrane proteins of about 270 amino acids. They are found predominantly, though not exclusively, in alphaproteobacteria, generally only once in each genome.
Probab=40.98 E-value=1.1e+02 Score=21.72 Aligned_cols=42 Identities=21% Similarity=0.324 Sum_probs=31.9
Q ss_pred eEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEc
Q psy8451 8 HVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVT 57 (110)
Q Consensus 8 h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~ 57 (110)
--|.=++.||.++|.- .|++++.+-+..+.-...+..|.|.+
T Consensus 155 ~lFra~i~LPanvp~G--------~Y~v~v~l~rdG~vv~~~~~~l~V~K 196 (236)
T PF09608_consen 155 TLFRARIPLPANVPPG--------DYTVRVYLFRDGQVVASQETPLRVRK 196 (236)
T ss_pred CeEEEEeEcCCCCCcc--------eEEEEEEEEECCEEEEEEeeEEEEEE
Confidence 4577788999999954 48888988888776666677777754
No 36
>PF08750 CNP1: CNP1-like family; InterPro: IPR014861 This group of proteins are likely to be lipoproteins. CNP1 (cryptic neisserial protein) has been expressed in Escherichia coli and shown to be localised periplasmicly [].
Probab=40.61 E-value=50 Score=21.57 Aligned_cols=15 Identities=40% Similarity=0.687 Sum_probs=12.1
Q ss_pred cCCeEEEEEEEEEEe
Q psy8451 26 GEHGRVRYKVTARLD 40 (110)
Q Consensus 26 ~~~g~I~Y~i~a~i~ 40 (110)
|.+|.|||.+.++=.
T Consensus 47 g~DgvVRY~lv~~S~ 61 (139)
T PF08750_consen 47 GPDGVVRYTLVVRSP 61 (139)
T ss_pred CCCCcEEEEEEEcCC
Confidence 678999999887654
No 37
>COG3126 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.49 E-value=57 Score=21.79 Aligned_cols=35 Identities=34% Similarity=0.617 Sum_probs=23.3
Q ss_pred cCeeeEEEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCC
Q psy8451 4 PQGSHVYPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWK 44 (110)
Q Consensus 4 ~~G~h~fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~ 44 (110)
+.| .-||.|.||-| |+.++.. -||.+.|.|...++
T Consensus 87 ~~G--q~P~~F~L~fd-p~~i~p~---~ryalsArI~~~gk 121 (158)
T COG3126 87 TEG--QVPFPFVLPFD-PSDIQPN---HRYALSARITVNGK 121 (158)
T ss_pred ccC--ccceeEEeccC-hhhCCCC---cEEEEEEEEEECCE
Confidence 445 67888888732 2333222 68999999987765
No 38
>KOG3780|consensus
Probab=40.23 E-value=1.2e+02 Score=22.72 Aligned_cols=50 Identities=18% Similarity=0.215 Sum_probs=35.3
Q ss_pred eeeEEEeEEECCCCCCCceec--CCeEEEEEEEEEEeCCC--CCCceEEEEEEEE
Q psy8451 6 GSHVYPFSYTLPPTIPSSFDG--EHGRVRYKVTARLDRPW--KFDKTTEAVFSVV 56 (110)
Q Consensus 6 G~h~fpF~f~LP~~lPsSf~~--~~g~I~Y~i~a~i~~~~--~~~~~~~~~~~V~ 56 (110)
..+.|-.+|+||..+| |+.. ..-++.|.+++.+.-.. ..+....-.+.|-
T Consensus 263 ~~~~~~~~~~iP~~~P-s~~~~~~~i~v~y~l~v~~~~~~~~~~~~~l~~pi~ig 316 (427)
T KOG3780|consen 263 SEDKFEKELRIPPVPP-SILPDTPIIRVEYELKVTLKTSSLRHSELALELPIIIG 316 (427)
T ss_pred CccccceEEEcCCCCC-ccCCCCceEEEEEEEEEEEecCcccccceeeeeceEEe
Confidence 5677788888998888 6663 55699999999998663 3344444455554
No 39
>PF08875 DUF1833: Domain of unknown function (DUF1833); InterPro: IPR014974 This entry is represented by Bacteriophage D3, Orf22. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=39.24 E-value=64 Score=21.02 Aligned_cols=10 Identities=50% Similarity=0.913 Sum_probs=8.2
Q ss_pred EEEeEEECCC
Q psy8451 9 VYPFSYTLPP 18 (110)
Q Consensus 9 ~fpF~f~LP~ 18 (110)
-+||.+.||.
T Consensus 53 a~~f~i~lP~ 62 (150)
T PF08875_consen 53 ACPFDITLPD 62 (150)
T ss_pred EEEEEEEcCc
Confidence 4799999994
No 40
>PF14347 DUF4399: Domain of unknown function (DUF4399)
Probab=37.77 E-value=25 Score=21.02 Aligned_cols=14 Identities=29% Similarity=0.745 Sum_probs=9.3
Q ss_pred CeecCeeeEEEeEE
Q psy8451 1 MEIPQGSHVYPFSY 14 (110)
Q Consensus 1 ~~l~~G~h~fpF~f 14 (110)
|.|+||+|+.--.|
T Consensus 57 I~L~PG~htLtl~~ 70 (87)
T PF14347_consen 57 IELPPGKHTLTLQL 70 (87)
T ss_pred EEeCCCCEEEEEEe
Confidence 46788888765544
No 41
>PF08111 Pea-VEAacid: Pea-VEAacid family; InterPro: IPR012593 This family consists of the PEA-VEAacid neuropeptides family. These neuropeptides are isolated from the abdominal perisympathetic organs of the American cockroach. These peptides are found together with Pea-YLS-amide and Pea-SKNacid, giving a unique neuropeptide pattern in abdominal perisympathetic organs. The functions of these neuropeptides are unknown [].; GO: 0005184 neuropeptide hormone activity, 0007218 neuropeptide signaling pathway, 0005576 extracellular region
Probab=34.69 E-value=15 Score=14.33 Aligned_cols=8 Identities=50% Similarity=0.866 Sum_probs=4.8
Q ss_pred eecCeeeE
Q psy8451 2 EIPQGSHV 9 (110)
Q Consensus 2 ~l~~G~h~ 9 (110)
+|.||.|.
T Consensus 2 tltpgshv 9 (15)
T PF08111_consen 2 TLTPGSHV 9 (15)
T ss_pred cccCccch
Confidence 45666664
No 42
>PF10342 GPI-anchored: Ser-Thr-rich glycosyl-phosphatidyl-inositol-anchored membrane family; InterPro: IPR018466 This entry represents glycoproteins involved in cell wall (1-->6)-beta-glucan assembly. In yeast a null mutation leads to severe growth defects, aberrant multi-budded morphology, and mating defects [, ]. The entry includes DRMIP and Hesp-379, which are involved in both fruiting body formation and in host attack respectively. Hesp-379 is a haustorially expressed secreted protein; the haustorium being the small sucker that penetrates host tissue [].
Probab=33.95 E-value=95 Score=17.74 Aligned_cols=20 Identities=25% Similarity=0.605 Sum_probs=10.5
Q ss_pred eEEECCCCCCCceecCCeEEEEEEEEE
Q psy8451 12 FSYTLPPTIPSSFDGEHGRVRYKVTAR 38 (110)
Q Consensus 12 F~f~LP~~lPsSf~~~~g~I~Y~i~a~ 38 (110)
|.+.+|.+++++= .|.|++.
T Consensus 58 ~~~~~p~~l~~~~-------~Y~i~~~ 77 (93)
T PF10342_consen 58 YTWTIPSDLPSGG-------DYFIQIV 77 (93)
T ss_pred EEEEcCCCCCCCC-------cEEEEEE
Confidence 4445555555322 3666666
No 43
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=33.78 E-value=24 Score=18.66 Aligned_cols=11 Identities=36% Similarity=0.534 Sum_probs=7.9
Q ss_pred eeCCCeEEEEC
Q psy8451 100 FVPGQTIPCMI 110 (110)
Q Consensus 100 y~pGe~i~i~v 110 (110)
..-||+|+++|
T Consensus 9 aK~Ge~I~ltV 19 (47)
T PF05688_consen 9 AKVGETIPLTV 19 (47)
T ss_pred eecCCeEEEEE
Confidence 34588888875
No 44
>KOG3262|consensus
Probab=32.38 E-value=96 Score=21.53 Aligned_cols=22 Identities=23% Similarity=0.392 Sum_probs=14.5
Q ss_pred ccEEEEEEeCC----eeeeCCCeEEE
Q psy8451 87 GPLTLSVSAPF----SGFVPGQTIPC 108 (110)
Q Consensus 87 g~v~~~~~l~k----~~y~pGe~i~i 108 (110)
..+.++|+++. +.|.+||++.|
T Consensus 102 ~d~~fsIK~~dgv~assfk~g~k~fi 127 (215)
T KOG3262|consen 102 NDVHFSIKPSDGVQASSFKPGDKLFI 127 (215)
T ss_pred cccEEEEecCCCceeecccCCCeEEe
Confidence 44566666655 66778887765
No 45
>PRK09750 hypothetical protein; Provisional
Probab=31.43 E-value=59 Score=18.16 Aligned_cols=13 Identities=31% Similarity=0.506 Sum_probs=11.3
Q ss_pred EEEEEEEEeCCCC
Q psy8451 32 RYKVTARLDRPWK 44 (110)
Q Consensus 32 ~Y~i~a~i~~~~~ 44 (110)
.|.|.|+|..|..
T Consensus 3 kY~I~Ati~KpGg 15 (64)
T PRK09750 3 MYKITATIEKEGG 15 (64)
T ss_pred eeEEEEEEECCCC
Confidence 6999999999864
No 46
>KOG1584|consensus
Probab=30.55 E-value=34 Score=25.27 Aligned_cols=25 Identities=28% Similarity=0.511 Sum_probs=20.3
Q ss_pred eEEECC-CCCCCceecCCeEEEEEEE
Q psy8451 12 FSYTLP-PTIPSSFDGEHGRVRYKVT 36 (110)
Q Consensus 12 F~f~LP-~~lPsSf~~~~g~I~Y~i~ 36 (110)
|+=.|| ..||.|+..+.++|-|..+
T Consensus 107 ~kTHlP~~lLp~s~~~~~cKvVYv~R 132 (297)
T KOG1584|consen 107 FKTHLPFQLLPESLKESKCKVVYVCR 132 (297)
T ss_pred eeccCChhhcchhhhcCCCcEEEEec
Confidence 444578 7899999999999999654
No 47
>PF01345 DUF11: Domain of unknown function DUF11; InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins. In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=30.53 E-value=82 Score=17.56 Aligned_cols=26 Identities=23% Similarity=0.271 Sum_probs=19.6
Q ss_pred cCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451 85 QSGPLTLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 85 ~~g~v~~~~~l~k~~y~pGe~i~i~v 110 (110)
....+.+.-+.++....+||.+..+|
T Consensus 22 ~~~~~~~~k~~~~~~~~~Gd~v~yti 47 (76)
T PF01345_consen 22 AIPDLSITKTVNPSTANPGDTVTYTI 47 (76)
T ss_pred CCCCEEEEEecCCCcccCCCEEEEEE
Confidence 34567777788888889999887654
No 48
>KOG0420|consensus
Probab=30.46 E-value=51 Score=22.48 Aligned_cols=17 Identities=18% Similarity=0.528 Sum_probs=12.4
Q ss_pred cCeeeE---EEeEEECCCCC
Q psy8451 4 PQGSHV---YPFSYTLPPTI 20 (110)
Q Consensus 4 ~~G~h~---fpF~f~LP~~l 20 (110)
..|-|. |-|+|++|..-
T Consensus 69 dEGyY~gGkf~F~~~v~~~Y 88 (184)
T KOG0420|consen 69 DEGYYQGGKFRFKFKVPNAY 88 (184)
T ss_pred CcceecCceEEEEEECCCCC
Confidence 457787 88888888543
No 49
>TIGR02567 YscW type III secretion system chaperone YscW. This family of proteins is found within type III secretion operons. The protein has been characterized as a chaperone for the outer membrane pore component YscC (TIGR02516). YscW is a lipoprotein which is itself localized to the outer membrane and, it is believed, facilitates the oligomerization and localization of YscC.
Probab=28.45 E-value=1e+02 Score=19.72 Aligned_cols=25 Identities=16% Similarity=0.046 Sum_probs=14.2
Q ss_pred EEEEEEEEEEeCCCCCCceEEEEEEE
Q psy8451 30 RVRYKVTARLDRPWKFDKTTEAVFSV 55 (110)
Q Consensus 30 ~I~Y~i~a~i~~~~~~~~~~~~~~~V 55 (110)
...| |.|.+.-..+.-..+++.+.|
T Consensus 81 g~~Y-v~A~L~~~g~~~~q~~~~~~v 105 (124)
T TIGR02567 81 GELY-LRGKLSWQGQRAVQAEAQQIF 105 (124)
T ss_pred CceE-EEEEEEECCEEEEEEeeeeEE
Confidence 3456 888877665544444444444
No 50
>COG2373 Large extracellular alpha-helical protein [General function prediction only]
Probab=27.52 E-value=47 Score=30.21 Aligned_cols=22 Identities=23% Similarity=0.327 Sum_probs=19.4
Q ss_pred EEEEEEeCCeeeeCCCeEEEEC
Q psy8451 89 LTLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 89 v~~~~~l~k~~y~pGe~i~i~v 110 (110)
=.+++.++|..|.+||++.+.|
T Consensus 751 d~l~i~ldK~~Y~~Gdtakl~I 772 (1621)
T COG2373 751 DVLKLKLDKPEYQPGDTAKLLI 772 (1621)
T ss_pred cceEEecCccccCCCceEEEEe
Confidence 3488999999999999999875
No 51
>PRK02939 lipoprotein; Reviewed
Probab=27.34 E-value=2.4e+02 Score=20.24 Aligned_cols=26 Identities=19% Similarity=0.290 Sum_probs=21.1
Q ss_pred CCCCCceecCCeEEEEEEEEEEeCCC
Q psy8451 18 PTIPSSFDGEHGRVRYKVTARLDRPW 43 (110)
Q Consensus 18 ~~lPsSf~~~~g~I~Y~i~a~i~~~~ 43 (110)
+++-++....+|+|.|.+.+++++..
T Consensus 47 K~~tqt~~ne~g~v~~~v~~~~~~eG 72 (236)
T PRK02939 47 KDFTQTLMDEQGEVTKRVSGTLSEEG 72 (236)
T ss_pred eeeEEEEEcCCCcEEEEEEEEEcCCC
Confidence 35556667789999999999998765
No 52
>TIGR02186 alph_Pro_TM conserved hypothetical protein. This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.
Probab=26.58 E-value=2.6e+02 Score=20.36 Aligned_cols=40 Identities=28% Similarity=0.364 Sum_probs=30.6
Q ss_pred EEeEEECCCCCCCceecCCeEEEEEEEEEEeCCCCCCceEEEEEEEEc
Q psy8451 10 YPFSYTLPPTIPSSFDGEHGRVRYKVTARLDRPWKFDKTTEAVFSVVT 57 (110)
Q Consensus 10 fpF~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~~~~~~~~~~~~~~V~~ 57 (110)
|.=++.||.|+|.- .|.+++.+-+..+.-...+..|.|.+
T Consensus 182 Fra~i~LPAnvp~G--------~Y~v~v~L~r~G~vv~~~~t~l~V~K 221 (261)
T TIGR02186 182 FRATLRLPANVPNG--------THEVRAYLFRGGVFIARTELALEIVK 221 (261)
T ss_pred EEEeeecCCCCCCc--------eEEEEEEEEeCCEEEEEEEeEEEEEE
Confidence 66778899999953 48899999888776666677777754
No 53
>KOG1366|consensus
Probab=26.42 E-value=1.1e+02 Score=27.57 Aligned_cols=28 Identities=14% Similarity=0.150 Sum_probs=24.4
Q ss_pred EccCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451 83 WRQSGPLTLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 83 ~~~~g~v~~~~~l~k~~y~pGe~i~i~v 110 (110)
.+.+-...+-++.||.-|.||+++...|
T Consensus 109 ~~~~~~~~vfIqTDkpiY~pgqtV~~Rv 136 (1436)
T KOG1366|consen 109 LFSQRRGSVFIQTDKPIYKPGQTVRYRV 136 (1436)
T ss_pred eeecccceEEEecCCCccCCCCEEEEEE
Confidence 4667788999999999999999998764
No 54
>PF12857 TOBE_3: TOBE-like domain; InterPro: IPR024765 The TOBE (transport-associated OB) domain [] always occurs as a dimer and it is found in ABC transporters immediately after the ATPase domain. This entry represents a TOBE-like domain, found in the C terminus of ATPase subunit CysA. CysA is part of the CysATWP ABC transporter complex, involved in sulphate/thiosulphate import [, ].
Probab=26.39 E-value=92 Score=16.67 Aligned_cols=24 Identities=21% Similarity=0.401 Sum_probs=17.2
Q ss_pred cCccEEEEEEeCCee----eeCCCeEEEE
Q psy8451 85 QSGPLTLSVSAPFSG----FVPGQTIPCM 109 (110)
Q Consensus 85 ~~g~v~~~~~l~k~~----y~pGe~i~i~ 109 (110)
++|. .+++.|++.. ..+||++.+.
T Consensus 29 ~~~~-~iEvel~~~~~~l~l~~G~~V~l~ 56 (58)
T PF12857_consen 29 DDGE-PIEVELPRERRQLGLQPGDRVYLR 56 (58)
T ss_pred CCCC-EEEEEeCHhHHhcCCCCCCEEEEE
Confidence 4555 7777777776 7889988764
No 55
>PF05132 RNA_pol_Rpc4: RNA polymerase III RPC4; InterPro: IPR007811 This family comprises a specific subunit for Pol III, the tRNA specific polymerase.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006383 transcription from RNA polymerase III promoter, 0005666 DNA-directed RNA polymerase III complex
Probab=25.15 E-value=54 Score=20.84 Aligned_cols=14 Identities=36% Similarity=0.790 Sum_probs=11.4
Q ss_pred eEEECCCCCCCcee
Q psy8451 12 FSYTLPPTIPSSFD 25 (110)
Q Consensus 12 F~f~LP~~lPsSf~ 25 (110)
|=||||..||.--.
T Consensus 6 ~lfQlP~~LP~~~~ 19 (131)
T PF05132_consen 6 FLFQLPPVLPPFKP 19 (131)
T ss_pred EEEECcCCCCCCCC
Confidence 66999999997654
No 56
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=24.90 E-value=94 Score=15.71 Aligned_cols=12 Identities=17% Similarity=0.163 Sum_probs=8.6
Q ss_pred cCeeeEEEeEEE
Q psy8451 4 PQGSHVYPFSYT 15 (110)
Q Consensus 4 ~~G~h~fpF~f~ 15 (110)
.||.|+..|+..
T Consensus 11 ~PG~Y~l~~~a~ 22 (41)
T TIGR03769 11 KPGTYTLTVQAT 22 (41)
T ss_pred CCeEEEEEEEEE
Confidence 467777777765
No 57
>PF12761 End3: Actin cytoskeleton-regulatory complex protein END3
Probab=24.28 E-value=15 Score=25.45 Aligned_cols=24 Identities=25% Similarity=0.721 Sum_probs=19.8
Q ss_pred EEECCCCCCCceec--CCeEEEEEEE
Q psy8451 13 SYTLPPTIPSSFDG--EHGRVRYKVT 36 (110)
Q Consensus 13 ~f~LP~~lPsSf~~--~~g~I~Y~i~ 36 (110)
-+.||..+|+|... ....|.|-|.
T Consensus 18 G~rIPr~vPasLrasf~k~~i~Ydl~ 43 (195)
T PF12761_consen 18 GYRIPREVPASLRASFEKEQIDYDLN 43 (195)
T ss_pred CCcCCccCCHHHHHHHhcCCcCcccc
Confidence 36899999999986 4578999887
No 58
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=23.38 E-value=54 Score=17.36 Aligned_cols=13 Identities=23% Similarity=0.077 Sum_probs=6.0
Q ss_pred CCeeeeCCCeEEE
Q psy8451 96 PFSGFVPGQTIPC 108 (110)
Q Consensus 96 ~k~~y~pGe~i~i 108 (110)
++....+||.|..
T Consensus 4 d~~~~~~Gd~v~Y 16 (53)
T TIGR01451 4 DKTVATIGDTITY 16 (53)
T ss_pred CccccCCCCEEEE
Confidence 3444445555443
No 59
>PF13584 BatD: Oxygen tolerance
Probab=23.34 E-value=1e+02 Score=23.73 Aligned_cols=26 Identities=8% Similarity=0.109 Sum_probs=23.1
Q ss_pred cCccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451 85 QSGPLTLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 85 ~~g~v~~~~~l~k~~y~pGe~i~i~v 110 (110)
.++.+.+++.+++...+.||.+.+++
T Consensus 124 ~~~~~~l~~~v~~~~~Yvge~v~lt~ 149 (484)
T PF13584_consen 124 ADDDVFLEAEVSKKSVYVGEPVILTL 149 (484)
T ss_pred ccccEEEEEEeCCCceecCCcEEEEE
Confidence 46789999999999999999998874
No 60
>PF13584 BatD: Oxygen tolerance
Probab=22.34 E-value=89 Score=24.11 Aligned_cols=24 Identities=25% Similarity=0.253 Sum_probs=21.7
Q ss_pred ccEEEEEEeCCeeeeCCCeEEEEC
Q psy8451 87 GPLTLSVSAPFSGFVPGQTIPCMI 110 (110)
Q Consensus 87 g~v~~~~~l~k~~y~pGe~i~i~v 110 (110)
|.++++.++++..+..||.|.++|
T Consensus 271 g~f~l~~~~~~~~~~~Ge~vt~ti 294 (484)
T PF13584_consen 271 GNFSLSQSWDPTEVKVGEPVTRTI 294 (484)
T ss_pred eEEEEEEEcCcccccCCCeEEEEE
Confidence 789999999999999999998764
No 61
>PF02303 Phage_DNA_bind: Helix-destabilising protein; InterPro: IPR003512 This entry is represented by the Bacteriophage M13, G5P, DNA-binding protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. G5P is the bacteriophage helix-destabilising protein, or single-stranded DNA binding protein, required for DNA synthesis. The protein binds to DNA in a highly cooperative manner without pronounced sequence specificity. In the presence of single-stranded DNA it binds cooperatively to form a helical protein-DNA complex. It prevents the conversion during synthesis of the single-stranded (progeny) viral DNA back into the double-stranded replicative form.; GO: 0003697 single-stranded DNA binding, 0006260 DNA replication; PDB: 1VQI_A 1VQG_A 1AE3_A 1VQA_A 1VQD_A 1VQJ_A 1VQE_A 1YHB_A 1VQF_A 1VQC_A ....
Probab=22.23 E-value=57 Score=19.54 Aligned_cols=15 Identities=20% Similarity=0.197 Sum_probs=8.9
Q ss_pred eeeEEEeEEECCCCC
Q psy8451 6 GSHVYPFSYTLPPTI 20 (110)
Q Consensus 6 G~h~fpF~f~LP~~l 20 (110)
|.|-.+|.++|-.+-
T Consensus 38 G~fp~~f~v~Le~gq 52 (86)
T PF02303_consen 38 GKFPVEFKVRLEDGQ 52 (86)
T ss_dssp SSS-EEEEEEE-TTS
T ss_pred CCccEEEEEEcCCCC
Confidence 567777888776553
No 62
>PF14734 DUF4469: Domain of unknown function (DUF4469) with IG-like fold
Probab=21.57 E-value=1.7e+02 Score=17.91 Aligned_cols=22 Identities=23% Similarity=0.578 Sum_probs=12.4
Q ss_pred eEEECCCCCCCceecCCeEEEEEEEEEEeC
Q psy8451 12 FSYTLPPTIPSSFDGEHGRVRYKVTARLDR 41 (110)
Q Consensus 12 F~f~LP~~lPsSf~~~~g~I~Y~i~a~i~~ 41 (110)
..|.||++|+ .|. |.|+++=..
T Consensus 66 l~~~lPa~L~------~G~--Y~l~V~Tq~ 87 (102)
T PF14734_consen 66 LIFILPADLA------AGE--YTLEVRTQY 87 (102)
T ss_pred EEEECcCccC------ceE--EEEEEEEEe
Confidence 3455555554 455 777666553
No 63
>PF05078 DUF679: Protein of unknown function (DUF679); InterPro: IPR007770 This family contains uncharacterised plant proteins of unknown function.
Probab=21.32 E-value=66 Score=21.81 Aligned_cols=16 Identities=38% Similarity=0.740 Sum_probs=13.0
Q ss_pred CceecCCeEEEEEEEE
Q psy8451 22 SSFDGEHGRVRYKVTA 37 (110)
Q Consensus 22 sSf~~~~g~I~Y~i~a 37 (110)
-||.+.+|+++|-+--
T Consensus 58 DS~~~~dGkvyYG~aT 73 (170)
T PF05078_consen 58 DSFRGSDGKVYYGFAT 73 (170)
T ss_pred ceeECCCCCEEEEEEE
Confidence 4778899999998754
No 64
>smart00736 CADG Dystroglycan-type cadherin-like domains. Cadherin-homologous domains present in metazoan dystroglycans and alpha/epsilon sarcoglycans, yeast Axl2p and in a very large protein from magnetotactic bacteria. Likely to bind calcium ions.
Probab=21.09 E-value=1.8e+02 Score=17.09 Aligned_cols=25 Identities=32% Similarity=0.691 Sum_probs=16.8
Q ss_pred EeEEECCCCCCCceecCC-eEEEEEEEEE
Q psy8451 11 PFSYTLPPTIPSSFDGEH-GRVRYKVTAR 38 (110)
Q Consensus 11 pF~f~LP~~lPsSf~~~~-g~I~Y~i~a~ 38 (110)
+|+|++|.+ +|.+.. ..+.|.+...
T Consensus 14 ~f~~~i~~~---tF~d~d~~~lty~~~~~ 39 (97)
T smart00736 14 SFSYTIPSS---TFTDADGDTLTYSATLS 39 (97)
T ss_pred eEEEEcChh---ceECCCCCeEEEEEEeC
Confidence 788888866 555433 4788887643
No 65
>KOG3265|consensus
Probab=20.99 E-value=1.5e+02 Score=21.25 Aligned_cols=37 Identities=38% Similarity=0.688 Sum_probs=25.2
Q ss_pred ecCeeeEEEeEEECC--CCCCC-------------ceec-CCeEEEEEEEEEE
Q psy8451 3 IPQGSHVYPFSYTLP--PTIPS-------------SFDG-EHGRVRYKVTARL 39 (110)
Q Consensus 3 l~~G~h~fpF~f~LP--~~lPs-------------Sf~~-~~g~I~Y~i~a~i 39 (110)
+|.|.|.|-|+-.=| ..+|- ||.| .+-+|-|.|....
T Consensus 65 VP~G~~~FVf~AD~Pd~~kIP~~d~vGVTviLltC~Y~gQEFIRvGYyVnNeY 117 (250)
T KOG3265|consen 65 VPVGRHKFVFQADAPDPSKIPEDDIVGVTVILLTCSYRGQEFIRVGYYVNNEY 117 (250)
T ss_pred ccccceEEEEecCCCCcccCcccceeeeEEEEEEEEEcCceeEEEEEEecCCC
Confidence 578999999998887 45552 3344 3457777776554
Done!