Query psy2167
Match_columns 79
No_of_seqs 104 out of 195
Neff 5.5
Searched_HMMs 46136
Date Fri Aug 16 18:01:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2167.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2167hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3547|consensus 100.0 2.4E-33 5.1E-38 218.2 5.9 75 5-79 175-250 (450)
2 PF01062 Bestrophin: Bestrophi 99.0 1.1E-10 2.3E-15 84.4 1.1 72 5-78 160-231 (293)
3 COG3781 Predicted membrane pro 82.2 2.8 6E-05 32.1 4.4 58 16-76 164-223 (306)
4 PF08828 DSX_dimer: Doublesex 71.5 2.3 4.9E-05 25.6 1.1 26 42-67 4-29 (62)
5 PF02284 COX5A: Cytochrome c o 65.0 7 0.00015 25.9 2.5 30 38-67 13-49 (108)
6 PF13624 SurA_N_3: SurA N-term 63.2 15 0.00033 23.7 3.9 32 19-50 87-118 (154)
7 PF11848 DUF3368: Domain of un 62.7 14 0.0003 20.4 3.1 24 19-45 6-29 (48)
8 PF11464 Rbsn: Rabenosyn Rab b 58.8 11 0.00023 21.0 2.1 28 18-45 8-35 (42)
9 PF14394 DUF4423: Domain of un 48.4 21 0.00046 24.6 2.8 23 33-55 130-152 (171)
10 PF09491 RE_AlwI: AlwI restric 46.0 38 0.00083 26.4 4.2 45 10-54 56-112 (435)
11 cd01421 IMPCH Inosine monophos 44.0 54 0.0012 23.4 4.3 33 20-52 126-169 (187)
12 KOG1562|consensus 43.8 57 0.0012 25.5 4.7 56 18-74 218-278 (337)
13 PF04677 CwfJ_C_1: Protein sim 42.7 19 0.0004 23.6 1.7 17 39-55 65-81 (121)
14 KOG4077|consensus 42.4 13 0.00028 25.8 1.0 21 46-66 67-87 (149)
15 PF13310 Virulence_RhuM: Virul 42.2 57 0.0012 24.6 4.4 37 16-52 47-94 (260)
16 PF00681 Plectin: Plectin repe 40.5 33 0.00071 18.5 2.3 14 22-35 24-37 (45)
17 PF09202 Rio2_N: Rio2, N-termi 39.3 28 0.00061 21.5 2.0 19 47-65 12-30 (82)
18 PF06424 PRP1_N: PRP1 splicing 39.1 19 0.00041 24.5 1.3 27 39-65 101-127 (133)
19 KOG2477|consensus 38.0 21 0.00046 29.7 1.7 19 37-55 460-478 (628)
20 PF02787 CPSase_L_D3: Carbamoy 37.7 65 0.0014 21.1 3.7 37 19-57 62-102 (123)
21 cd00923 Cyt_c_Oxidase_Va Cytoc 35.9 19 0.00041 23.7 0.9 23 45-67 24-46 (103)
22 PF11892 DUF3412: Domain of un 35.8 32 0.00069 23.3 2.0 20 41-60 2-21 (123)
23 PRK12275 hypothetical protein; 33.5 80 0.0017 19.9 3.6 35 20-55 78-112 (116)
24 COG3462 Predicted membrane pro 32.9 54 0.0012 22.0 2.7 10 13-22 53-62 (117)
25 PF07017 PagP: Antimicrobial p 32.8 15 0.00032 25.5 0.1 13 56-68 105-117 (147)
26 PF09111 SLIDE: SLIDE; InterP 31.5 74 0.0016 20.9 3.2 33 21-59 60-92 (118)
27 COG1422 Predicted membrane pro 30.7 19 0.0004 26.2 0.3 49 22-75 88-136 (201)
28 TIGR02147 Fsuc_second hypothet 30.3 57 0.0012 24.3 2.8 22 33-54 228-249 (271)
29 PF10303 DUF2408: Protein of u 29.9 1.2E+02 0.0026 20.2 4.1 26 30-55 100-125 (134)
30 PF14767 RPA_interact_M: Repli 29.6 56 0.0012 20.3 2.3 24 38-61 6-33 (83)
31 KOG3326|consensus 29.4 55 0.0012 22.9 2.4 24 19-42 61-84 (154)
32 PF08671 SinI: Anti-repressor 29.3 49 0.0011 17.0 1.7 13 19-31 6-18 (30)
33 PF10845 DUF2576: Protein of u 28.4 41 0.00089 19.2 1.4 23 34-56 10-32 (48)
34 KOG4095|consensus 26.9 74 0.0016 22.5 2.7 27 36-62 17-51 (165)
35 PF10474 DUF2451: Protein of u 26.2 1E+02 0.0022 22.3 3.5 44 13-56 67-113 (234)
36 COG3969 Predicted phosphoadeno 25.4 1.2E+02 0.0026 24.2 3.9 44 23-66 49-100 (407)
37 PF07026 DUF1317: Protein of u 25.1 94 0.002 18.6 2.5 23 31-53 32-54 (60)
38 PF05635 23S_rRNA_IVP: 23S rRN 25.0 1.6E+02 0.0034 18.2 3.8 33 20-53 77-109 (110)
39 TIGR02436 conserved hypothetic 25.0 1.4E+02 0.0031 19.0 3.6 35 21-56 72-106 (111)
40 PF04437 RINT1_TIP1: RINT-1 / 24.5 2.5E+02 0.0054 22.1 5.6 50 15-64 68-125 (494)
41 PF09779 Ima1_N: Ima1 N-termin 23.0 68 0.0015 21.2 1.9 22 33-54 98-119 (131)
42 KOG2351|consensus 22.6 93 0.002 21.3 2.5 21 28-49 112-132 (134)
43 cd07311 terB_like_1 tellurium 22.3 1.5E+02 0.0033 20.1 3.5 21 14-34 23-43 (150)
44 PTZ00233 variable surface prot 22.3 62 0.0014 26.5 1.8 34 22-58 67-100 (509)
45 PF04714 BCL_N: BCL7, N-termin 21.7 1.4E+02 0.0029 17.4 2.7 27 36-62 16-50 (52)
46 PF00458 WHEP-TRS: WHEP-TRS do 21.6 1.8E+02 0.0038 16.8 4.3 42 20-63 10-51 (56)
47 COG2089 SpsE Sialic acid synth 21.4 37 0.00081 26.6 0.4 52 12-64 86-139 (347)
48 PF05791 Bacillus_HBL: Bacillu 20.9 99 0.0021 21.4 2.4 18 36-53 125-142 (184)
49 PLN03075 nicotianamine synthas 20.8 1.3E+02 0.0029 22.7 3.3 35 18-56 38-72 (296)
50 KOG2787|consensus 20.8 1.6E+02 0.0034 23.5 3.7 47 16-66 164-210 (403)
51 COG3943 Virulence protein [Gen 20.5 2.7E+02 0.0058 21.4 4.8 40 15-54 103-151 (329)
52 PF14637 FNIP_M: Folliculin-in 20.4 2.2E+02 0.0048 20.9 4.3 39 19-57 51-89 (232)
No 1
>KOG3547|consensus
Probab=99.98 E-value=2.4e-33 Score=218.20 Aligned_cols=75 Identities=37% Similarity=0.652 Sum_probs=71.5
Q ss_pred CCCCC-CceeeHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEeeecccC
Q psy2167 5 RSPMA-KYWMPLVWATNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDAPTKHY 79 (79)
Q Consensus 5 ~~~~~-kywvPi~Wa~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV~~Yf~ 79 (79)
..+++ |||+||+|+.+|+++|+++|+|++|..++.+.+||++||++|++|++||||||||||||||+||||+||+
T Consensus 175 ~~~~~~kyw~Pi~Wa~sl~~~a~~~g~i~~d~~~~~~~~ei~~fR~~l~~L~~yDwVpIPL~YpQvV~lAVr~YF~ 250 (450)
T KOG3547|consen 175 VDSKYMKYWVPINWAFSLLRKARKEGRIESDYLYNSLLGEIKKFRTGLSLLCNYDWVPIPLVYPQVVFLAVRSYFF 250 (450)
T ss_pred ccccccceeecHHHHHHHHHHHHHcCCccccHHHHHHHHHHHHHHHhhccccCCCceeccccccceeeeehHHhHH
Confidence 34444 9999999999999999999999999999999999999999999999999999999999999999999995
No 2
>PF01062 Bestrophin: Bestrophin, RFP-TM, chloride channel; InterPro: IPR021134 Bestrophin is a 68kDa basolateral plasma membrane protein expressed in retinal pigment epithelial cells (RPE). It is encoded by the VMD2 gene, which is mutated in Best macular dystrophy, a disease characterised by a depressed light peak in the electrooculogram []. VMD2 encodes a 585-amino acid protein with an approximate mass of 68 kDa which has been designated bestrophin. Bestrophin shares homology with the Caenorhabditis elegans RFP gene family, named for the presence of a conserved arginine (R), phenylalanine (F), proline (P), amino acid sequence motif. Bestrophin is a plasma membrane protein, localised to the basolateral surface of RPE cells consistent with a role for bestrophin in the generation or regulation of the EOG light peak. Bestrophin and other RFP family members represent a new class of chloride channels, indicating a direct role for bestrophin in generating the light peak []. The VMD2 gene underlying Best disease was shown to represent the first human member of the RFP-TM protein family. More than 97% of the disease-causing mutations are located in the N-terminal RFP-TM domain implying important functional properties []. This entry also includes a number of hypothetical proteins belonging to protein family UPF0187.
Probab=98.99 E-value=1.1e-10 Score=84.37 Aligned_cols=72 Identities=21% Similarity=0.287 Sum_probs=66.2
Q ss_pred CCCCCCceeeHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEeeeccc
Q psy2167 5 RSPMAKYWMPLVWATNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDAPTKH 78 (79)
Q Consensus 5 ~~~~~kywvPi~Wa~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV~~Yf 78 (79)
+.++...++|..|+..+ ++++++|. .++.....+.+++.+|++.++.+..++++|||++|+|.+...+++|+
T Consensus 160 ~~~~~p~~i~~~~~~~l-~~~~~~g~-~~~~~~~~l~~~l~~l~~~~g~~eri~~TPiP~~Y~~~~~~~l~~y~ 231 (293)
T PF01062_consen 160 SADNPPNWILIQWSQRL-REARREGL-IDSFQLAQLDNELNALRDAQGGCERIKNTPIPFAYTQHLSRFLYIYL 231 (293)
T ss_pred hhcCcHHHHHHHHHHHH-HHHHHcCC-CChHHHHHHHHHHHHHHHHHhhhheeCCCcCCHHHHHHHHHHHHHHH
Confidence 34677899999999999 99999999 55668999999999999999999999999999999999999998886
No 3
>COG3781 Predicted membrane protein [Function unknown]
Probab=82.21 E-value=2.8 Score=32.06 Aligned_cols=58 Identities=14% Similarity=0.213 Sum_probs=43.3
Q ss_pred HHHHHHHHHHHHhcCCCCHH--HHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEeeec
Q psy2167 16 VWATNIINRARKEALISSDH--VVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDAPT 76 (79)
Q Consensus 16 ~Wa~~ll~~ar~eg~i~~~~--~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV~~ 76 (79)
-|.-.-++.-|++|.+++-. .+++.++.|.+--++|+.+-+ .|||++|+=.+.=.||.
T Consensus 164 ~wmGe~l~~q~r~g~l~~~~~~sl~~~L~~~s~vlggCERI~~---TPiPfAYtl~lhrtvyl 223 (306)
T COG3781 164 LWMGEWLQQQRRNGQLDAIQFTSLDRRLNSISAVLGGCERIAY---TPIPFAYTLHLHRTVYL 223 (306)
T ss_pred HHHHHHHHHHHhcCCccHHHHHHHHHHHHHHHHHHHhHHHHhc---CCccHHHHHHHHHHHHH
Confidence 48888999999999987542 456677777788888888765 89999998544433433
No 4
>PF08828 DSX_dimer: Doublesex dimerisation domain; InterPro: IPR014932 Doublesex (DSX) is a transcription factor that regulates somatic sexual differences in Drosophila. The structure has revealed a novel dimeric arrangement of ubiquitin-associated folds that has not previously been identified in a transcription factor []. ; PDB: 1ZV1_B 2JZ0_A 2JZ1_B.
Probab=71.55 E-value=2.3 Score=25.63 Aligned_cols=26 Identities=35% Similarity=0.404 Sum_probs=20.9
Q ss_pred HHHHHHHHHhcccccCcceecchhhe
Q psy2167 42 YELSEHRRKLGSIISYDTVCVPLVYT 67 (79)
Q Consensus 42 ~ei~~FR~~l~~L~~yDWVpIPLvY~ 67 (79)
+++.++=++|-.-+.|-|--+||+|.
T Consensus 4 e~Ll~~cqkLlEkf~YpWEmmpLmyV 29 (62)
T PF08828_consen 4 EELLERCQKLLEKFRYPWEMMPLMYV 29 (62)
T ss_dssp HHHHHHHHHHHHHTT--GGGHHHHHH
T ss_pred HHHHHHHHHHHHHhCCCHHHHHHHHH
Confidence 67788888888899999999999994
No 5
>PF02284 COX5A: Cytochrome c oxidase subunit Va; InterPro: IPR003204 Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen []. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane. In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits. One of these subunits is known as Va.; GO: 0004129 cytochrome-c oxidase activity; PDB: 2DYR_R 3AG1_E 3ABL_E 1V54_R 2EIJ_R 1OCR_E 2DYS_E 2EIM_E 2OCC_E 3ASN_R ....
Probab=65.04 E-value=7 Score=25.87 Aligned_cols=30 Identities=27% Similarity=0.355 Sum_probs=20.9
Q ss_pred HHHHHHHH-------HHHHHhcccccCcceecchhhe
Q psy2167 38 QTLLYELS-------EHRRKLGSIISYDTVCVPLVYT 67 (79)
Q Consensus 38 ~~l~~ei~-------~FR~~l~~L~~yDWVpIPLvY~ 67 (79)
.+-.+..+ +.|++|.+|+.||.||=|=+..
T Consensus 13 ary~~~F~~~~iD~we~rrglN~l~~~DlVP~P~ii~ 49 (108)
T PF02284_consen 13 ARYEKYFNRPDIDGWELRRGLNNLFGYDLVPEPKIIE 49 (108)
T ss_dssp HHHHHHHH-TT--HHHHHHHHHHHTTSSB---HHHHH
T ss_pred HHHHHHhCCccccHHHHHHHHHHHhccccCCChHHHH
Confidence 44455555 8999999999999999987654
No 6
>PF13624 SurA_N_3: SurA N-terminal domain; PDB: 3NRK_A.
Probab=63.18 E-value=15 Score=23.68 Aligned_cols=32 Identities=16% Similarity=0.265 Sum_probs=21.0
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHH
Q psy2167 19 TNIINRARKEALISSDHVVQTLLYELSEHRRK 50 (79)
Q Consensus 19 ~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~ 50 (79)
--+..+|++.|.--||..++..+++++.|..+
T Consensus 87 ~ll~q~A~~~gi~vsd~ev~~~i~~~~~f~~~ 118 (154)
T PF13624_consen 87 KLLLQEAKKLGISVSDAEVDDAIKQIPAFQEN 118 (154)
T ss_dssp HHHHHHHHHTT----HHHHHHHHHH--HHHHH
T ss_pred HHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHC
Confidence 34678899999988999999999998888654
No 7
>PF11848 DUF3368: Domain of unknown function (DUF3368); InterPro: IPR021799 This domain is functionally uncharacterised. This domain is found in bacteria and archaea. This presumed domain is about 50 amino acids in length.
Probab=62.73 E-value=14 Score=20.40 Aligned_cols=24 Identities=38% Similarity=0.452 Sum_probs=17.4
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy2167 19 TNIINRARKEALISSDHVVQTLLYELS 45 (79)
Q Consensus 19 ~~ll~~ar~eg~i~~~~~~~~l~~ei~ 45 (79)
+-++.+|.++|.|+ ....+++++.
T Consensus 6 lGiL~~Ak~~GlI~---~~~~~l~~l~ 29 (48)
T PF11848_consen 6 LGILLLAKRRGLIS---EVKPLLDRLQ 29 (48)
T ss_pred HHHHHHHHHcCChh---hHHHHHHHHH
Confidence 35788899999998 3556666663
No 8
>PF11464 Rbsn: Rabenosyn Rab binding domain; InterPro: IPR021565 Rabenosyn-5 (Rbsn) is a multivalent effector with interacts with the Rab family.Rsbn contains distinct Rab4 and Rab5 binding sites within residues 264-500 and 627-784 respectively []. Rab proteins are GTPases involved in the regulation of all stages of membrane trafficking []. ; PDB: 1Z0K_B 1YZM_A 1Z0J_B.
Probab=58.77 E-value=11 Score=21.03 Aligned_cols=28 Identities=18% Similarity=0.191 Sum_probs=18.4
Q ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy2167 18 ATNIINRARKEALISSDHVVQTLLYELS 45 (79)
Q Consensus 18 a~~ll~~ar~eg~i~~~~~~~~l~~ei~ 45 (79)
.-+.+..|++.|+++.-..+..-+.||.
T Consensus 8 I~~~I~qAk~~~r~dEV~~L~~NL~EL~ 35 (42)
T PF11464_consen 8 IESYIKQAKAARRFDEVATLEENLRELQ 35 (42)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHHH
Confidence 3467889999999886555555555543
No 9
>PF14394 DUF4423: Domain of unknown function (DUF4423)
Probab=48.42 E-value=21 Score=24.63 Aligned_cols=23 Identities=26% Similarity=0.314 Sum_probs=19.7
Q ss_pred CHHHHHHHHHHHHHHHHHhcccc
Q psy2167 33 SDHVVQTLLYELSEHRRKLGSII 55 (79)
Q Consensus 33 ~~~~~~~l~~ei~~FR~~l~~L~ 55 (79)
|+..+.++.+||.+||+++..+-
T Consensus 130 s~~~~~ki~~~i~~fRk~i~~i~ 152 (171)
T PF14394_consen 130 SREDYEKIKKEIREFRKKIIAIA 152 (171)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHH
Confidence 45689999999999999987654
No 10
>PF09491 RE_AlwI: AlwI restriction endonuclease; InterPro: IPR018573 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry includes AlwI (recognises GGATC), Bsp6I (recognises GC^NGC), BstNBI (recognises GASTC), PleI(recognises GAGTC) and MlyI (recognises GAGTC) restriction endonucleases. ; PDB: 2EWF_A 2P14_A.
Probab=46.01 E-value=38 Score=26.40 Aligned_cols=45 Identities=16% Similarity=0.366 Sum_probs=31.7
Q ss_pred CceeeHHHHHHHHHHHHHhcCCC------------CHHHHHHHHHHHHHHHHHhccc
Q psy2167 10 KYWMPLVWATNIINRARKEALIS------------SDHVVQTLLYELSEHRRKLGSI 54 (79)
Q Consensus 10 kywvPi~Wa~~ll~~ar~eg~i~------------~~~~~~~l~~ei~~FR~~l~~L 54 (79)
.+.-|+.+++.++.+-++++.+. ++..++.++++|.+||..-...
T Consensus 56 ~~~~Pfr~lL~ll~~l~~~~~ls~~E~~~~v~~~~~~~~~~~lv~~I~~fR~~~~~~ 112 (435)
T PF09491_consen 56 FYIYPFRFLLKLLNELELDSYLSKEEFAYFVQTISDDNDIEELVNKILDFRKERENK 112 (435)
T ss_dssp -BB-HHHHHHHHHHHH---S-B-HHHHHHHTTT--BGGGHHHHHHHHHHHHHHHHHS
T ss_pred eeeehHHHHHHHHHHHhcccccChhhheeEEEEccchhhHHHHHHHHHHHHHhhhhh
Confidence 68899999999999998777653 3456788999999999876543
No 11
>cd01421 IMPCH Inosine monophosphate cyclohydrolase domain. This is the N-terminal domain in the purine biosynthesis pathway protein ATIC (purH). The bifunctional ATIC protein contains a C-terminal ATIC formylase domain that formylates 5-aminoimidazole-4-carboxamide-ribonucleotide. The IMPCH domain then converts the formyl-5-aminoimidazole-4-carboxamide-ribonucleotide to inosine monophosphate. This is the final step in de novo purine production.
Probab=44.01 E-value=54 Score=23.43 Aligned_cols=33 Identities=27% Similarity=0.352 Sum_probs=25.6
Q ss_pred HHHHHHHHhcC----CCCHHHHHHHHHHHHH-------HHHHhc
Q psy2167 20 NIINRARKEAL----ISSDHVVQTLLYELSE-------HRRKLG 52 (79)
Q Consensus 20 ~ll~~ar~eg~----i~~~~~~~~l~~ei~~-------FR~~l~ 52 (79)
+++|-|.|-.. +.++..+..+++||++ +|.+|+
T Consensus 126 smlRaAAKN~~~V~vv~dp~dY~~v~~~l~~~g~~~~~~R~~lA 169 (187)
T cd01421 126 SLLRAAAKNYKDVTVLVDPADYQKVLEELKSNGSISEETRRRLA 169 (187)
T ss_pred HHHHHHHhcCCCeEEEcCHHHHHHHHHHHHhcCCCCHHHHHHHH
Confidence 56776666552 5688899999999986 888875
No 12
>KOG1562|consensus
Probab=43.77 E-value=57 Score=25.49 Aligned_cols=56 Identities=13% Similarity=0.038 Sum_probs=43.0
Q ss_pred HHHHHHHHHHhcCC-----CCHHHHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEee
Q psy2167 18 ATNIINRARKEALI-----SSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDA 74 (79)
Q Consensus 18 a~~ll~~ar~eg~i-----~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV 74 (79)
.+.++.+|++.|=+ ++-....+++.|+++|=.....+..|-|..+| -||=..+.-+
T Consensus 218 ~~~~v~~aLk~dgv~~~q~ec~wl~~~~i~e~r~~~~~~f~~t~ya~ttvP-Typsg~igf~ 278 (337)
T KOG1562|consen 218 YFGLVLDALKGDGVVCTQGECMWLHLDYIKEGRSFCYVIFDLTAYAITTVP-TYPSGRIGFM 278 (337)
T ss_pred HHHHHHHhhCCCcEEEEecceehHHHHHHHHHHHhHHHhcCccceeeecCC-CCccceEEEE
Confidence 35677788876632 34457789999999999999999999999999 6775554433
No 13
>PF04677 CwfJ_C_1: Protein similar to CwfJ C-terminus 1; InterPro: IPR006768 This group of sequences contain a conserved C-terminal domain which is found in the Schizosaccharomyces pombe (Fission yeast) protein Cwf19 (Q09909 from SWISSPROT) and its homologues. Cwf19 is part of the Cdc5p complex involved in mRNA splicing []. This domain is found in association with IPR006767 from INTERPRO, which is generally C-terminal and adjacent to this domain.
Probab=42.66 E-value=19 Score=23.64 Aligned_cols=17 Identities=12% Similarity=0.319 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHhcccc
Q psy2167 39 TLLYELSEHRRKLGSII 55 (79)
Q Consensus 39 ~l~~ei~~FR~~l~~L~ 55 (79)
+.++||++||++|...+
T Consensus 65 ~~~~Ei~~f~~~L~~mf 81 (121)
T PF04677_consen 65 EVWEEIRNFQKSLRKMF 81 (121)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 78999999999998764
No 14
>KOG4077|consensus
Probab=42.35 E-value=13 Score=25.80 Aligned_cols=21 Identities=38% Similarity=0.575 Sum_probs=18.5
Q ss_pred HHHHHhcccccCcceecchhh
Q psy2167 46 EHRRKLGSIISYDTVCVPLVY 66 (79)
Q Consensus 46 ~FR~~l~~L~~yDWVpIPLvY 66 (79)
+.|+.|.+|..||-||=|=+-
T Consensus 67 EvrkglN~l~~yDlVP~pkvI 87 (149)
T KOG4077|consen 67 EVRKGLNNLFDYDLVPSPKVI 87 (149)
T ss_pred HHHHHHHhhhccccCCChHHH
Confidence 679999999999999988654
No 15
>PF13310 Virulence_RhuM: Virulence protein RhuM family
Probab=42.24 E-value=57 Score=24.60 Aligned_cols=37 Identities=19% Similarity=0.346 Sum_probs=28.2
Q ss_pred HHHHHHHHHHHHhcCCCCH-----------HHHHHHHHHHHHHHHHhc
Q psy2167 16 VWATNIINRARKEALISSD-----------HVVQTLLYELSEHRRKLG 52 (79)
Q Consensus 16 ~Wa~~ll~~ar~eg~i~~~-----------~~~~~l~~ei~~FR~~l~ 52 (79)
.||.+.|++-..+|..-+| ..+..+++.|++.|.+=.
T Consensus 47 ~WAt~~Lkey~~KGf~~d~erLk~~~~~~~dyf~ell~rIr~IRaSEr 94 (260)
T PF13310_consen 47 QWATKVLKEYLIKGFVLDDERLKNGGVFGKDYFDELLERIRDIRASER 94 (260)
T ss_pred HHHHHhHHHHHHhhhhhhHHHHHccCcccHHHHHHHHHHHHhhHHHHH
Confidence 5999999988877765443 356788899998887643
No 16
>PF00681 Plectin: Plectin repeat; InterPro: IPR001101 Plectin may have a role in cross-linking intermediate filaments, in inter-linking intermediate filaments with microtubules and microfilaments and in anchoring intermediate filaments to the plasma and nuclear membranes. Plectin is recruited into hemidesmosomes, multiprotein complexes that facilitate adhesion of epithelia to the basement membrane, thereby providing linkage between the intracellular keratin filaments to the laminins of the extracellular matrix. Plectin binds to hemidesmosomes through association of its actin-binding domain with the first pair of fibronectin type III repeats and a small part of the connecting segment of the integrin-beta4 subunit, the latter (integrin-alpha6,beta4) acting as a receptor for the extracellular matrix component laminin-5. The plectin repeat is also seen in the cell adhesion junction plaque proteins, desmoplakin, envoplakin, and bullous pemphigoid antigen. The domains in plakins show considerable sequence homology. The N terminus consists of a plakin domain containing a number of subdomains with high alpha-helical content, while the central coiled-coil domain is composed of heptad repeats involved in the dimerisation of plakin, and the C terminus contains one or more homologous repeat sequences referred to plectin repeats []. This entry represents the plectin repeats found in the C terminus of plakin proteins.; GO: 0005856 cytoskeleton; PDB: 1LM7_A 1LM5_A.
Probab=40.53 E-value=33 Score=18.48 Aligned_cols=14 Identities=36% Similarity=0.522 Sum_probs=10.8
Q ss_pred HHHHHHhcCCCCHH
Q psy2167 22 INRARKEALISSDH 35 (79)
Q Consensus 22 l~~ar~eg~i~~~~ 35 (79)
+.+|.++|.|+++.
T Consensus 24 v~~A~~~glId~~~ 37 (45)
T PF00681_consen 24 VEEAIQRGLIDSDT 37 (45)
T ss_dssp HHHHHHTTSS-HHH
T ss_pred HHHHHHCCCcCHHH
Confidence 46899999999874
No 17
>PF09202 Rio2_N: Rio2, N-terminal; InterPro: IPR015285 This N-terminal domain is found in RIO2 kinases, and is structurally homologous to the winged helix (wHTH) domain. It adopts a structure consisting of four alpha helices followed by two beta strands and a fifth alpha helix. The domain confers DNA binding properties to the protein, as per other winged helix domains []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1TQP_A 1ZAR_A 1TQI_A 1ZAO_A 1TQM_A.
Probab=39.28 E-value=28 Score=21.48 Aligned_cols=19 Identities=16% Similarity=0.023 Sum_probs=11.6
Q ss_pred HHHHhcccccCcceecchh
Q psy2167 47 HRRKLGSIISYDTVCVPLV 65 (79)
Q Consensus 47 FR~~l~~L~~yDWVpIPLv 65 (79)
.+.=-....+|.|||+++.
T Consensus 12 L~aiE~gmk~hE~VP~~~I 30 (82)
T PF09202_consen 12 LRAIEMGMKNHEWVPLELI 30 (82)
T ss_dssp HHHHHTTTTT-SSEEHHHH
T ss_pred HHHHHHcccCCccCCHHHH
Confidence 3333445778889888764
No 18
>PF06424 PRP1_N: PRP1 splicing factor, N-terminal; InterPro: IPR010491 This domain is specific to the N-terminal part of the prp1 splicing factor, which is involved in mRNA splicing (and possibly also poly(A)+ RNA nuclear export and cell cycle progression). This domain is specific to the N terminus of the RNA splicing factor encoded by prp1 []. It is involved in mRNA splicing and possibly also poly(A)and RNA nuclear export and cell cycle progression.; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005634 nucleus
Probab=39.11 E-value=19 Score=24.48 Aligned_cols=27 Identities=11% Similarity=0.261 Sum_probs=24.1
Q ss_pred HHHHHHHHHHHHhcccccCcceecchh
Q psy2167 39 TLLYELSEHRRKLGSIISYDTVCVPLV 65 (79)
Q Consensus 39 ~l~~ei~~FR~~l~~L~~yDWVpIPLv 65 (79)
+|-+.+.+++.+|+.+..=+|-+||=|
T Consensus 101 kI~~QFaDLKR~La~VS~eeW~~IPE~ 127 (133)
T PF06424_consen 101 KIQQQFADLKRSLATVSEEEWENIPEA 127 (133)
T ss_pred hHHHHHHHHHHHHccCCHHHHhcCCcc
Confidence 577889999999999999999999943
No 19
>KOG2477|consensus
Probab=37.98 E-value=21 Score=29.70 Aligned_cols=19 Identities=16% Similarity=0.258 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHhcccc
Q psy2167 37 VQTLLYELSEHRRKLGSII 55 (79)
Q Consensus 37 ~~~l~~ei~~FR~~l~~L~ 55 (79)
=...+|||++||++|.+++
T Consensus 460 DEdvWDEIrnfrKcL~~Mf 478 (628)
T KOG2477|consen 460 DEDVWDEIRNFRKCLALMF 478 (628)
T ss_pred chHHHHHHHHHHHHHHHHH
Confidence 3578999999999998765
No 20
>PF02787 CPSase_L_D3: Carbamoyl-phosphate synthetase large chain, oligomerisation domain; InterPro: IPR005480 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates [, ]. CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate []. The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain []. CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites []. The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein []. The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP []. There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia []. CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains []. This entry represents the oligomerisation domain found in the large subunit of carbamoyl phosphate synthases as well as in certain other carboxy phsophate domain-containing enzymes.; GO: 0006807 nitrogen compound metabolic process; PDB: 1M6V_C 1CS0_C 1C30_E 1C3O_G 1BXR_A 1T36_E 1A9X_A 1KEE_G 1CE8_A 1JDB_H ....
Probab=37.67 E-value=65 Score=21.13 Aligned_cols=37 Identities=24% Similarity=0.427 Sum_probs=25.1
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHH----HHHHHHHHHhcccccC
Q psy2167 19 TNIINRARKEALISSDHVVQTLL----YELSEHRRKLGSIISY 57 (79)
Q Consensus 19 ~~ll~~ar~eg~i~~~~~~~~l~----~ei~~FR~~l~~L~~y 57 (79)
-.++++|.+.| -||.....++ +|+++.|.+.+..-.|
T Consensus 62 ~~~L~~aK~~G--FsD~~IA~l~~~~e~~vr~~R~~~~i~p~y 102 (123)
T PF02787_consen 62 PELLRKAKRLG--FSDRQIARLWGVSEEEVRELRKEHGIVPVY 102 (123)
T ss_dssp HHHHHHHHHTT----HHHHHHHHTS-HHHHHHHHHHHT---EE
T ss_pred HHHHHHHHHcC--CCHHHHHhccCCCHHHHHHHHHHcCCceee
Confidence 45889999999 5676665554 5899999998876555
No 21
>cd00923 Cyt_c_Oxidase_Va Cytochrome c oxidase subunit Va. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Va is one of three mammalian subunits that lacks a transmembrane region. Subunit Va is located on the matrix side of the membrane and binds thyroid hormone T2, releasing allosteric inhibition caused by the binding of ATP to subunit IV and allowing high turnover at elevated intramitochondrial ATP/ADP ratios.
Probab=35.95 E-value=19 Score=23.69 Aligned_cols=23 Identities=39% Similarity=0.492 Sum_probs=19.5
Q ss_pred HHHHHHhcccccCcceecchhhe
Q psy2167 45 SEHRRKLGSIISYDTVCVPLVYT 67 (79)
Q Consensus 45 ~~FR~~l~~L~~yDWVpIPLvY~ 67 (79)
=+.|++|.+|+.||-||=|=+-.
T Consensus 24 we~rr~mN~l~~~DlVP~P~ii~ 46 (103)
T cd00923 24 WELRRGLNNLFGYDLVPEPKVIE 46 (103)
T ss_pred HHHHHHHHHHhccccCCCcHHHH
Confidence 36799999999999999986543
No 22
>PF11892 DUF3412: Domain of unknown function (DUF3412); InterPro: IPR021826 This presumed domain is functionally uncharacterised. This domain is found in bacteria. This domain is about 120 amino acids in length. This domain is found associated with PF03641 from PFAM. ; PDB: 3BQ9_A 2PMB_D 3GH1_D.
Probab=35.76 E-value=32 Score=23.27 Aligned_cols=20 Identities=25% Similarity=0.381 Sum_probs=15.5
Q ss_pred HHHHHHHHHHhcccccCcce
Q psy2167 41 LYELSEHRRKLGSIISYDTV 60 (79)
Q Consensus 41 ~~ei~~FR~~l~~L~~yDWV 60 (79)
+++++++|...+.-+.|.|-
T Consensus 2 m~~V~~~R~~~~Day~fNW~ 21 (123)
T PF11892_consen 2 MEEVKEYRRETGDAYYFNWS 21 (123)
T ss_dssp HHHHHHHHHHTT--SSS-TS
T ss_pred chHHHHHHHhcCCceeeeee
Confidence 68999999999999999993
No 23
>PRK12275 hypothetical protein; Reviewed
Probab=33.45 E-value=80 Score=19.90 Aligned_cols=35 Identities=20% Similarity=0.320 Sum_probs=24.6
Q ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccc
Q psy2167 20 NIINRARKEALISSDHVVQTLLYELSEHRRKLGSII 55 (79)
Q Consensus 20 ~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~ 55 (79)
..++-|++.|+|+.+ .+..+.+++.+...-++.+.
T Consensus 78 ~~L~la~~~~~i~~~-~~~~l~~~~~ei~kml~~~i 112 (116)
T PRK12275 78 TQLYIAKELGYITKE-QYESILQEYDEIAKMLNGLI 112 (116)
T ss_pred HHHHHHHHcCCCCHH-HHHHHHHHHHHHHHHHHHHH
Confidence 456778889998754 56777777777766665543
No 24
>COG3462 Predicted membrane protein [Function unknown]
Probab=32.88 E-value=54 Score=21.96 Aligned_cols=10 Identities=50% Similarity=1.198 Sum_probs=5.5
Q ss_pred eeHHHHHHHH
Q psy2167 13 MPLVWATNII 22 (79)
Q Consensus 13 vPi~Wa~~ll 22 (79)
+||-|+.+++
T Consensus 53 mpI~~~vvli 62 (117)
T COG3462 53 MPIFWAVVLI 62 (117)
T ss_pred HHHHHHHHHH
Confidence 4566665543
No 25
>PF07017 PagP: Antimicrobial peptide resistance and lipid A acylation protein PagP; InterPro: IPR009746 This family consists of several bacterial antimicrobial peptide resistance and lipid A acylation (PagP) proteins. The bacterial outer membrane enzyme PagP transfers a palmitate chain from a phospholipid to lipid A. In a number of pathogenic Gram-negative bacteria, PagP confers resistance to certain cationic antimicrobial peptides produced during the host innate immune response.; PDB: 1MM5_A 1MM4_A 1THQ_A 3GP6_A.
Probab=32.82 E-value=15 Score=25.53 Aligned_cols=13 Identities=15% Similarity=0.383 Sum_probs=9.7
Q ss_pred cCcceecchhhee
Q psy2167 56 SYDTVCVPLVYTQ 68 (79)
Q Consensus 56 ~yDWVpIPLvY~Q 68 (79)
.|++||+|++.|=
T Consensus 105 d~~yiP~P~~LPl 117 (147)
T PF07017_consen 105 DYHYIPFPAILPL 117 (147)
T ss_dssp CCTT-EEEEEEEE
T ss_pred cccccccceeeee
Confidence 5789999998773
No 26
>PF09111 SLIDE: SLIDE; InterPro: IPR015195 The SLIDE domain adopts a secondary structure comprising a main core of three alpha-helices. It has a role in DNA binding, contacting DNA target sites similar to c-Myb (IPR014778 from INTERPRO) repeats or homeodomains []. ; GO: 0003676 nucleic acid binding, 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006338 chromatin remodeling, 0005634 nucleus; PDB: 2NOG_A 2Y9Y_A 2Y9Z_A 1OFC_X.
Probab=31.49 E-value=74 Score=20.91 Aligned_cols=33 Identities=12% Similarity=0.199 Sum_probs=23.7
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcc
Q psy2167 21 IINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDT 59 (79)
Q Consensus 21 ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDW 59 (79)
|+....+-|+ .+|..+..+-++|++ .-++.|||
T Consensus 60 Ll~~~~~~G~-~~~~~~e~Ik~~Ir~-----~p~FrFDw 92 (118)
T PF09111_consen 60 LLCMLYKYGY-DAEGNWEKIKQEIRE-----SPLFRFDW 92 (118)
T ss_dssp HHHHHHHHTT-TSTTHHHHHHHHHHH------CGGCT-H
T ss_pred HHHHHHHhCC-CCCchHHHHHHHHHh-----CCCcccch
Confidence 5667778998 445588888888875 56777887
No 27
>COG1422 Predicted membrane protein [Function unknown]
Probab=30.70 E-value=19 Score=26.23 Aligned_cols=49 Identities=10% Similarity=0.102 Sum_probs=33.7
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEeee
Q psy2167 22 INRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDAP 75 (79)
Q Consensus 22 l~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV~ 75 (79)
.++|+++| ++..++++-+|=++.+..-..+.+-..- |.+|+=++++-++
T Consensus 88 ~~eA~~~~---d~~~lkkLq~~qmem~~~Q~elmk~qfk--PM~~~~v~tI~~F 136 (201)
T COG1422 88 FREAQESG---DMKKLKKLQEKQMEMMDDQRELMKMQFK--PMLYISVLTIPFF 136 (201)
T ss_pred HHHHHHhC---CHHHHHHHHHHHHHHHHHHHHHHHHhhh--hHHHHHHHHHHHH
Confidence 35677777 6778888888777777776666665555 7777766665443
No 28
>TIGR02147 Fsuc_second hypothetical protein, TIGR02147. This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterized.
Probab=30.29 E-value=57 Score=24.30 Aligned_cols=22 Identities=18% Similarity=0.410 Sum_probs=19.0
Q ss_pred CHHHHHHHHHHHHHHHHHhccc
Q psy2167 33 SDHVVQTLLYELSEHRRKLGSI 54 (79)
Q Consensus 33 ~~~~~~~l~~ei~~FR~~l~~L 54 (79)
++..+.++.+||.+||+++..+
T Consensus 228 ~~~~~~~i~~~i~~fRk~i~~i 249 (271)
T TIGR02147 228 SEEAYKEIVKKIQEFRKEVLAI 249 (271)
T ss_pred CHHHHHHHHHHHHHHHHHHHHH
Confidence 4568999999999999998754
No 29
>PF10303 DUF2408: Protein of unknown function (DUF2408); InterPro: IPR018810 This entry represents a family of proteins conserved in fungi whose function is unknown.
Probab=29.93 E-value=1.2e+02 Score=20.22 Aligned_cols=26 Identities=19% Similarity=0.319 Sum_probs=21.6
Q ss_pred CCCCHHHHHHHHHHHHHHHHHhcccc
Q psy2167 30 LISSDHVVQTLLYELSEHRRKLGSII 55 (79)
Q Consensus 30 ~i~~~~~~~~l~~ei~~FR~~l~~L~ 55 (79)
-..-+..++.+.+.+.+.|++|.+|.
T Consensus 100 ~~~~~~~l~~iY~~L~~ik~~LE~L~ 125 (134)
T PF10303_consen 100 EIEVDPSLQPIYDQLIDIKNTLENLL 125 (134)
T ss_pred cccccHHHHHHHHHHHHHHHHHHhhh
Confidence 44556689999999999999999874
No 30
>PF14767 RPA_interact_M: Replication protein A interacting middle
Probab=29.59 E-value=56 Score=20.27 Aligned_cols=24 Identities=21% Similarity=0.227 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHH---Hhc-ccccCccee
Q psy2167 38 QTLLYELSEHRR---KLG-SIISYDTVC 61 (79)
Q Consensus 38 ~~l~~ei~~FR~---~l~-~L~~yDWVp 61 (79)
.-+-+|++++|+ +.. .|++|||.+
T Consensus 6 eVm~eE~~~L~~~~~~~~d~lw~~~~~~ 33 (83)
T PF14767_consen 6 EVMEEEWKALQSSSSGLPDSLWEYDGLP 33 (83)
T ss_pred HHHHHHHHHHHhcCCCchhhhhccccch
Confidence 345679999999 444 899999975
No 31
>KOG3326|consensus
Probab=29.38 E-value=55 Score=22.92 Aligned_cols=24 Identities=13% Similarity=0.403 Sum_probs=19.3
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHH
Q psy2167 19 TNIINRARKEALISSDHVVQTLLY 42 (79)
Q Consensus 19 ~~ll~~ar~eg~i~~~~~~~~l~~ 42 (79)
..|+++.|++|..+.|..+....+
T Consensus 61 aRLlYqSrKRGiLEnDLLLs~Fa~ 84 (154)
T KOG3326|consen 61 ARLLYQSRKRGILENDLLLSNFAA 84 (154)
T ss_pred HHHHHHHhhcchhHHHHHHHHHHH
Confidence 579999999999999976655443
No 32
>PF08671 SinI: Anti-repressor SinI; InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=29.35 E-value=49 Score=16.96 Aligned_cols=13 Identities=23% Similarity=0.465 Sum_probs=9.7
Q ss_pred HHHHHHHHHhcCC
Q psy2167 19 TNIINRARKEALI 31 (79)
Q Consensus 19 ~~ll~~ar~eg~i 31 (79)
..|+.+|++.|.=
T Consensus 6 ~~Li~eA~~~Gls 18 (30)
T PF08671_consen 6 VELIKEAKESGLS 18 (30)
T ss_dssp HHHHHHHHHTT--
T ss_pred HHHHHHHHHcCCC
Confidence 4789999999953
No 33
>PF10845 DUF2576: Protein of unknown function (DUF2576); InterPro: IPR022556 The function of this viral family of proteins is unknown. The entry contains Orf5 from Autographa californica nuclear polyhedrosis virus (AcMNPV).
Probab=28.42 E-value=41 Score=19.17 Aligned_cols=23 Identities=26% Similarity=0.283 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHHhccccc
Q psy2167 34 DHVVQTLLYELSEHRRKLGSIIS 56 (79)
Q Consensus 34 ~~~~~~l~~ei~~FR~~l~~L~~ 56 (79)
|+.-.++-.|+++.|.++..||.
T Consensus 10 dydreqlrrelnsLR~~vhelct 32 (48)
T PF10845_consen 10 DYDREQLRRELNSLRRSVHELCT 32 (48)
T ss_pred ccCHHHHHHHHHHHHHHHHHHHH
Confidence 55667899999999999988864
No 34
>KOG4095|consensus
Probab=26.89 E-value=74 Score=22.46 Aligned_cols=27 Identities=22% Similarity=0.206 Sum_probs=19.7
Q ss_pred HHHHHHHHHHHHHHH--------hcccccCcceec
Q psy2167 36 VVQTLLYELSEHRRK--------LGSIISYDTVCV 62 (79)
Q Consensus 36 ~~~~l~~ei~~FR~~--------l~~L~~yDWVpI 62 (79)
..+++++.|.+.|.- =-.|.-|-||||
T Consensus 17 DIKkVMaaiEKVRrWEKKwVtvgDTslRIyKWVPV 51 (165)
T KOG4095|consen 17 DIKKVMAAIEKVRRWEKKWVTVGDTSLRIYKWVPV 51 (165)
T ss_pred HHHHHHHHHHHHHHHhhheEeecccceEEEEeeec
Confidence 477777777777753 246788999997
No 35
>PF10474 DUF2451: Protein of unknown function C-terminus (DUF2451); InterPro: IPR019514 This protein is found in eukaryotes but its function is not known. The N-terminal domain of some members is PF10475 from PFAM (DUF2450).
Probab=26.18 E-value=1e+02 Score=22.27 Aligned_cols=44 Identities=23% Similarity=0.309 Sum_probs=30.8
Q ss_pred eeHHHHHHHHHHHH---HhcCCCCHHHHHHHHHHHHHHHHHhccccc
Q psy2167 13 MPLVWATNIINRAR---KEALISSDHVVQTLLYELSEHRRKLGSIIS 56 (79)
Q Consensus 13 vPi~Wa~~ll~~ar---~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~ 56 (79)
+++....+.+...+ +|--......+..+++|+..|+.+|..+..
T Consensus 67 l~~~~i~~~Ia~vKWdvkev~~qhs~YVd~l~~~~~~f~~rL~~i~~ 113 (234)
T PF10474_consen 67 LDLEQILNSIANVKWDVKEVMSQHSSYVDQLVQEFQQFSERLDEISK 113 (234)
T ss_pred cCHHHHHHHHHHcCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHHHh
Confidence 45566666665554 333344456889999999999999977763
No 36
>COG3969 Predicted phosphoadenosine phosphosulfate sulfotransferase [General function prediction only]
Probab=25.35 E-value=1.2e+02 Score=24.24 Aligned_cols=44 Identities=23% Similarity=0.280 Sum_probs=31.0
Q ss_pred HHHHHhcC-------CCCHHHHHHHHHHHHHHHHHhcccccC-cceecchhh
Q psy2167 23 NRARKEAL-------ISSDHVVQTLLYELSEHRRKLGSIISY-DTVCVPLVY 66 (79)
Q Consensus 23 ~~ar~eg~-------i~~~~~~~~l~~ei~~FR~~l~~L~~y-DWVpIPLvY 66 (79)
.-||+-|+ |+=+..+..-.|-+.+-|++....... =||++||.-
T Consensus 49 ~~ar~~~~~~i~VlfiD~E~QYs~TidyV~em~~~~~dv~~~~yWvcLPl~t 100 (407)
T COG3969 49 EVARENGRDKISVLFIDWEAQYSCTIDYVQEMRESYHDVIETFYWVCLPLTT 100 (407)
T ss_pred HHHHHhCCCceEEEEEcchhhhhhHHHHHHHHHhcccCccccceEEEeehhc
Confidence 44555565 444556777788888888876666664 899999964
No 37
>PF07026 DUF1317: Protein of unknown function (DUF1317); InterPro: IPR009750 This entry is represented by Bacteriophage lambda, Xis. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=25.12 E-value=94 Score=18.55 Aligned_cols=23 Identities=26% Similarity=0.247 Sum_probs=19.1
Q ss_pred CCCHHHHHHHHHHHHHHHHHhcc
Q psy2167 31 ISSDHVVQTLLYELSEHRRKLGS 53 (79)
Q Consensus 31 i~~~~~~~~l~~ei~~FR~~l~~ 53 (79)
++++.-.+++-+|+++++.++..
T Consensus 32 i~NPlkAqR~AE~~n~~~~~l~~ 54 (60)
T PF07026_consen 32 ITNPLKAQRLAEELNSKQVNLCD 54 (60)
T ss_pred EcCHHHHHHHHHHHHhhHhhhhh
Confidence 46677789999999999998864
No 38
>PF05635 23S_rRNA_IVP: 23S rRNA-intervening sequence protein; InterPro: IPR008815 This family consists of bacterial proteins encoded within an intervening sequence present within some 23S rRNA genes[]. The function of these proteins is not known, but a structural study indicates that each momonmer folds into an antiparallel four-helix bundle, while the overall protein is a homopentamer with a toroid-shaped structure containing a tapered central channel [].; PDB: 2GSC_E 2RLD_D.
Probab=25.04 E-value=1.6e+02 Score=18.23 Aligned_cols=33 Identities=27% Similarity=0.266 Sum_probs=22.6
Q ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcc
Q psy2167 20 NIINRARKEALISSDHVVQTLLYELSEHRRKLGS 53 (79)
Q Consensus 20 ~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~ 53 (79)
..+.-|++.|.|..+ .+..+.+|+.+....+..
T Consensus 77 ~~L~~a~~~~~i~~~-~~~~l~~~~~ei~~~L~~ 109 (110)
T PF05635_consen 77 YWLELARDLGYISEE-EYEELKKELEEISKMLNG 109 (110)
T ss_dssp HHHHHHHHTTSS-HH-HHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHcCCCCHH-HHHHHHHHHHHHHHHHcC
Confidence 456778888888854 567788877777766543
No 39
>TIGR02436 conserved hypothetical protein TIGR02436. This family consists of a few small, well-conserved proteins found so far in Bacteroides thetaiotaomicron VPI-5482, Nostoc sp. PCC 7120, Clostridium tetani E88, Chlorobium tepidum TLS, and Prevotella ruminicola 23. The function is unknown.
Probab=24.97 E-value=1.4e+02 Score=18.95 Aligned_cols=35 Identities=14% Similarity=0.116 Sum_probs=25.6
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhccccc
Q psy2167 21 IINRARKEALISSDHVVQTLLYELSEHRRKLGSIIS 56 (79)
Q Consensus 21 ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~ 56 (79)
-+.-|++.|+++.+ .+..+.+|+.+..+-|..+.+
T Consensus 72 ~L~la~~~~~l~~~-~~~~l~~e~~ei~~~L~~li~ 106 (111)
T TIGR02436 72 WIRLLIQTELIPEN-RYESIKGDCLELLKILAATVV 106 (111)
T ss_pred HHHHHHHcCCCCHH-HHHHHHHHHHHHHHHHHHHHH
Confidence 45668889999755 578888888887777765543
No 40
>PF04437 RINT1_TIP1: RINT-1 / TIP-1 family; InterPro: IPR007528 This family includes RINT-1, a Rad50 interacting protein which participates in radiation induced checkpoint control [], that interacts with Rad50 only during late S and G2/M phases. RINT1 also functions in membrane trafficking from the endoplasmic reticulum(ER) to the Golgi complex in interphase cells [, , ]. In addition to this, the TIP-1 protein, which is involved in the retrograde transport from the Golgi to the ER []. They share a similar domain organisation with an N-terminal leucine heptad repeat rich coiled coil and an ~500-residue C-terminal RINT1/TIP20 domain, which might be a protein-protein interaction module necessary for the formation of functional complexes.; PDB: 3FHN_A.
Probab=24.51 E-value=2.5e+02 Score=22.09 Aligned_cols=50 Identities=16% Similarity=0.286 Sum_probs=30.7
Q ss_pred HHHHHHHHHHHHHh------cCCCCHHHHHHHHHHHHHHHHHhcccccCc--ceecch
Q psy2167 15 LVWATNIINRARKE------ALISSDHVVQTLLYELSEHRRKLGSIISYD--TVCVPL 64 (79)
Q Consensus 15 i~Wa~~ll~~ar~e------g~i~~~~~~~~l~~ei~~FR~~l~~L~~yD--WVpIPL 64 (79)
..|+..|+.-++++ ....++..+..+++|+..|-+.+....+|+ |.|+++
T Consensus 68 ~~fi~~ll~~~~~Kl~~~l~~~~~~~~~l~HlI~e~~~FD~~L~~~~~y~~d~~~~~~ 125 (494)
T PF04437_consen 68 EEFIRGLLPPVREKLRSDLPELLDDPSLLSHLIDEILSFDKELRSLYGYPGDWQGSTL 125 (494)
T ss_dssp HHHHHHHHHHHHHHHHHHH--TTS-HHHHHHHHHHHHHHHHHHHHTS---S------C
T ss_pred HHHHHHHHHHHHHHHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHcCCCCccchhHH
Confidence 45666666655532 245677789999999999999999999886 344443
No 41
>PF09779 Ima1_N: Ima1 N-terminal domain; InterPro: IPR018617 Members of this family of uncharacterised novel proteins have no known function.
Probab=23.01 E-value=68 Score=21.20 Aligned_cols=22 Identities=32% Similarity=0.237 Sum_probs=18.3
Q ss_pred CHHHHHHHHHHHHHHHHHhccc
Q psy2167 33 SDHVVQTLLYELSEHRRKLGSI 54 (79)
Q Consensus 33 ~~~~~~~l~~ei~~FR~~l~~L 54 (79)
++..+....+|+.+||.+|...
T Consensus 98 e~p~y~~~~~e~~~Yr~~LE~r 119 (131)
T PF09779_consen 98 EDPEYANYEEELPEYRRSLEQR 119 (131)
T ss_pred CCccHHHHHHHHHHHHHHHHHH
Confidence 3457889999999999999864
No 42
>KOG2351|consensus
Probab=22.56 E-value=93 Score=21.31 Aligned_cols=21 Identities=33% Similarity=0.455 Sum_probs=16.1
Q ss_pred hcCCCCHHHHHHHHHHHHHHHH
Q psy2167 28 EALISSDHVVQTLLYELSEHRR 49 (79)
Q Consensus 28 eg~i~~~~~~~~l~~ei~~FR~ 49 (79)
+++|++| .++.++++|..-|+
T Consensus 112 ~nkidD~-~le~iL~dls~lr~ 132 (134)
T KOG2351|consen 112 ENKIDDD-ELEQILKDLSTLRT 132 (134)
T ss_pred ccccCHH-HHHHHHHHHHHHHh
Confidence 4566644 89999999988774
No 43
>cd07311 terB_like_1 tellurium resistance terB-like protein, subgroup 1. This family includes several uncharacterized bacterial proteins. The prototype of this CD is tellurite resistance protein from Nostoc punctiforme that belongs to COG3793. Its precise biological function and its mechanism responsible for tellurium resistance still remains rather poorly understood.
Probab=22.26 E-value=1.5e+02 Score=20.06 Aligned_cols=21 Identities=19% Similarity=0.409 Sum_probs=18.7
Q ss_pred eHHHHHHHHHHHHHhcCCCCH
Q psy2167 14 PLVWATNIINRARKEALISSD 34 (79)
Q Consensus 14 Pi~Wa~~ll~~ar~eg~i~~~ 34 (79)
|..|..+|+.-|+..|.++..
T Consensus 23 ~~~~~~~Ll~iAkADG~Vse~ 43 (150)
T cd07311 23 KLAYLKALLVCAKGDGVISPE 43 (150)
T ss_pred HHHHHHHHHHHHHcCCCCCHH
Confidence 578999999999999999854
No 44
>PTZ00233 variable surface protein Vir18; Provisional
Probab=22.26 E-value=62 Score=26.53 Aligned_cols=34 Identities=12% Similarity=0.210 Sum_probs=25.7
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCc
Q psy2167 22 INRARKEALISSDHVVQTLLYELSEHRRKLGSIISYD 58 (79)
Q Consensus 22 l~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yD 58 (79)
|+.|...|+|..+ +... +.|++|+.+|-..-++-
T Consensus 67 LK~Cy~~~~i~~~-Li~D--d~IK~F~~kC~~~~~C~ 100 (509)
T PTZ00233 67 LKHCYDNGYINSD-LNND--EKIKNFKSKCNNKGTCN 100 (509)
T ss_pred hhhhhhcCccccc-ccch--HhHHHHHHhCCCCccCC
Confidence 5778888998887 3444 89999999998654443
No 45
>PF04714 BCL_N: BCL7, N-terminal conserver region; InterPro: IPR006804 The members of this group of sequences contain a conserved N-terminal domain which is found in the BCL7 family. The function of BCL7 proteins is unknown, though they may be involved in early development. Notably, BCL7B is commonly hemizygously deleted in patients with Williams syndrome [].
Probab=21.72 E-value=1.4e+02 Score=17.38 Aligned_cols=27 Identities=19% Similarity=0.180 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHH--------hcccccCcceec
Q psy2167 36 VVQTLLYELSEHRRK--------LGSIISYDTVCV 62 (79)
Q Consensus 36 ~~~~l~~ei~~FR~~--------l~~L~~yDWVpI 62 (79)
.++.++..|...|.- =..|.-|-||||
T Consensus 16 diKrv~~~iekVr~wEKKWVtv~dtslriyKWVPv 50 (52)
T PF04714_consen 16 DIKRVMASIEKVRKWEKKWVTVGDTSLRIYKWVPV 50 (52)
T ss_pred HHHHHHHHHHHHHHHhhceEEeccceEEEEEEEec
Confidence 467777777666653 245778999997
No 46
>PF00458 WHEP-TRS: WHEP-TRS domain; InterPro: IPR000738 A conserved domain of 46 amino acids, called WHEP-TRS has been shown [] to exist in a number of higher eukaryote aminoacyl-transfer RNA synthetases. This domain is present one to six times in the several enzymes. There are three copies in mammalian multifunctional aminoacyl-tRNA synthetase in a region that separates the N-terminal glutamyl-tRNA synthetase domain from the C-terminal prolyl-tRNA synthetase domain, and six copies in the intercatalytic region of the Drosophila enzyme. The domain is found at the N-terminal extremity of the mammalian tryptophanyl- tRNA synthetase and histidyl-tRNA synthetase, and the mammalian, insect, nematode and plant glycyl- tRNA synthetases []. This domain could contain a central alpha-helical region and may play a role in the association of tRNA-synthetases into multienzyme complexes.; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding, 0006418 tRNA aminoacylation for protein translation; PDB: 1R1B_A 1D2D_A 1X59_A 1FYJ_A 2ZT8_A 2ZXF_A 2ZT7_A 2ZT6_A 2PME_A 2ZT5_A ....
Probab=21.65 E-value=1.8e+02 Score=16.76 Aligned_cols=42 Identities=12% Similarity=0.160 Sum_probs=31.9
Q ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecc
Q psy2167 20 NIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVP 63 (79)
Q Consensus 20 ~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIP 63 (79)
+.+|+-..+| .+......-+++|.+.+..+..+..-||.|=.
T Consensus 10 ~~VR~LKa~k--a~k~~i~~aV~~Ll~LK~~~~~~tg~~~~~~~ 51 (56)
T PF00458_consen 10 DKVRKLKAEK--ADKEEIDAAVAKLLELKAELKELTGKDYKPGK 51 (56)
T ss_dssp HHHHHHHHTT----HHHHHHHHHHHHHHHHHHHHHCSSSTTSSS
T ss_pred HHHHHHHccC--CcHHHHHHHHHHHHHHHHHHHHhhCCCCCCCC
Confidence 4566666666 34446899999999999999999999987743
No 47
>COG2089 SpsE Sialic acid synthase [Cell envelope biogenesis, outer membrane]
Probab=21.41 E-value=37 Score=26.61 Aligned_cols=52 Identities=29% Similarity=0.270 Sum_probs=34.8
Q ss_pred eeeHHHHHHHHHHHHHhcCC--CCHHHHHHHHHHHHHHHHHhcccccCcceecch
Q psy2167 12 WMPLVWATNIINRARKEALI--SSDHVVQTLLYELSEHRRKLGSIISYDTVCVPL 64 (79)
Q Consensus 12 wvPi~Wa~~ll~~ar~eg~i--~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPL 64 (79)
=+|..|-..|...||++|.+ .|+ .=..-++++.++--..-.+=+++-...||
T Consensus 86 ~~p~e~~~~Lke~a~~~Gi~~~SSP-fd~~svd~l~~~~~~ayKIaS~E~~~~pl 139 (347)
T COG2089 86 ETPLEWHAQLKEYARKRGIIFFSSP-FDLTAVDLLESLNPPAYKIASGEINDLPL 139 (347)
T ss_pred cCCHHHHHHHHHHHHHcCeEEEecC-CCHHHHHHHHhcCCCeEEecCccccChHH
Confidence 37999999999999999975 333 33445566666555555555555555554
No 48
>PF05791 Bacillus_HBL: Bacillus haemolytic enterotoxin (HBL); InterPro: IPR008414 This family consists of several Bacillus haemolytic enterotoxins (HblC, HblD, HblA, NheA, and NheB), which can cause food poisoning in humans []. Haemolysin BL (encoded by HBL) and non-haemolytic enterotoxin (encoded by NHE), represent the major enterotoxins produced by Bacillus cereus. Most of the cytotoxic activity of B. cereus isolates has been attributed to the level of Nhe, which may indicate a highly diarrheic potential []. The exact mechanism by which B. cereus causes diarrhoea is unknown. Hbl, cytotoxin K (CytK) and Nhe are all putative causes. Both Hbl and Nhe are three-component cytotoxins and maximal cytotoxicity of Nhe against epithelia is dependent on all three components. Nhe has haemolytic activity against erythrocytes from a variety of species. It is possible that the common structural and functional properties of these toxins indicate that the Hbl/Nhe and ClyA families of toxins constitute a superfamily of pore-forming cytotoxins []. The high virulence of some strains is thought to be due to the greater cytotoxic activity of CytK-1 compared to CytK-2, and to a high level of cytK expression []. Haemolysin BL and non-haemolytic enterotoxin production are both influenced by pH and micro []. This entry is found in cytotoxic proteins that form part of the enterotoxin complex and bind to erythrocytes. HblA is composed of a binding component, B, and two lytic components, L1 and L2. All three subunits act synergically to cause hemolysis.; GO: 0009405 pathogenesis, 0016020 membrane; PDB: 2NRJ_A.
Probab=20.87 E-value=99 Score=21.40 Aligned_cols=18 Identities=39% Similarity=0.560 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHhcc
Q psy2167 36 VVQTLLYELSEHRRKLGS 53 (79)
Q Consensus 36 ~~~~l~~ei~~FR~~l~~ 53 (79)
..+.+++||++||.+|..
T Consensus 125 ~~~~~i~~L~~f~~~l~~ 142 (184)
T PF05791_consen 125 KVQALINELNDFKDKLQK 142 (184)
T ss_dssp HHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 577889999999998753
No 49
>PLN03075 nicotianamine synthase; Provisional
Probab=20.82 E-value=1.3e+02 Score=22.72 Aligned_cols=35 Identities=23% Similarity=0.217 Sum_probs=24.3
Q ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhccccc
Q psy2167 18 ATNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIIS 56 (79)
Q Consensus 18 a~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~ 56 (79)
+.+|+..+.... +.....+.+||.++|.+|..||+
T Consensus 38 f~~Lv~~c~~~~----~~~~~~l~~~i~~~~~~l~~l~~ 72 (296)
T PLN03075 38 FTQLVSTCIPPS----SIDVTKLCEEIQEMRSKLIKLCG 72 (296)
T ss_pred HHHHHHHhCCCC----cchHHHhhHHHHHHHHHHHHHHH
Confidence 344555554333 23456899999999999998886
No 50
>KOG2787|consensus
Probab=20.75 E-value=1.6e+02 Score=23.53 Aligned_cols=47 Identities=21% Similarity=0.416 Sum_probs=37.3
Q ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecchhh
Q psy2167 16 VWATNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVY 66 (79)
Q Consensus 16 ~Wa~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY 66 (79)
-|+.-.+++-.-+|.|.+| .+..++++|-+==..++.=-+ -|=||.|
T Consensus 164 L~a~lflNk~ig~~ti~~~-~i~~i~~~I~~sGr~~a~k~~---~~cPLmY 210 (403)
T KOG2787|consen 164 LWACLFLNKYIGQETIPDD-DIRSIVQAILTSGRELAKKEN---SPCPLMY 210 (403)
T ss_pred HHHHHHHHhhcCCCcCCHH-HHHHHHHHHHHHHHHHHhcCC---CCCCchh
Confidence 5999999999999999876 789999998766555665554 6677776
No 51
>COG3943 Virulence protein [General function prediction only]
Probab=20.50 E-value=2.7e+02 Score=21.40 Aligned_cols=40 Identities=15% Similarity=0.295 Sum_probs=31.7
Q ss_pred HHHHHHHHHHHHHhcCCCCHH---------HHHHHHHHHHHHHHHhccc
Q psy2167 15 LVWATNIINRARKEALISSDH---------VVQTLLYELSEHRRKLGSI 54 (79)
Q Consensus 15 i~Wa~~ll~~ar~eg~i~~~~---------~~~~l~~ei~~FR~~l~~L 54 (79)
-+||..-+.+---+|...+|- ....+++-|++.|.+-..+
T Consensus 103 rqwatq~lqeyl~kgfvmdderlk~~~~gdy~~emlerirdiraser~v 151 (329)
T COG3943 103 RQWATQTLQEYLIKGFVMDDERLKNPPVGDYFKEMLERIRDIRASERRV 151 (329)
T ss_pred HHHHHHHHHHHHHhhcccchhhccCCCcchHHHHHHHHHHHHHHHHHHH
Confidence 379999999988888776652 5678999999999886543
No 52
>PF14637 FNIP_M: Folliculin-interacting protein middle domain
Probab=20.40 E-value=2.2e+02 Score=20.91 Aligned_cols=39 Identities=10% Similarity=0.165 Sum_probs=30.7
Q ss_pred HHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccC
Q psy2167 19 TNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISY 57 (79)
Q Consensus 19 ~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~y 57 (79)
++=++.+.++.....-..++.+.+++.+||+.+..|.+-
T Consensus 51 m~rL~~~i~~a~~~~~~f~~~i~~A~~~f~~~i~~L~~a 89 (232)
T PF14637_consen 51 MSRLRSAIEKAYSRRQQFLSIIYEAWEEFRQSICDLYTA 89 (232)
T ss_pred HHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 344556667777776778999999999999999988754
Done!