Query         046217
Match_columns 288
No_of_seqs    147 out of 721
Neff          6.3 
Searched_HMMs 46136
Date          Fri Mar 29 09:50:30 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/046217.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/046217hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PLN03160 uncharacterized prote 100.0 4.7E-30   1E-34  230.7  25.6  178   87-287    33-216 (219)
  2 PF03168 LEA_2:  Late embryogen  99.2 4.9E-11 1.1E-15   92.7   9.4   98  147-262     1-100 (101)
  3 smart00769 WHy Water Stress an  98.0 0.00011 2.4E-09   58.0  10.9   62  139-205    12-73  (100)
  4 PF07092 DUF1356:  Protein of u  97.9  0.0016 3.6E-08   59.4  17.9   83  117-204    97-181 (238)
  5 PF12751 Vac7:  Vacuolar segreg  96.5  0.0071 1.5E-07   58.7   7.0   40  139-180   345-384 (387)
  6 KOG1924 RhoA GTPase effector D  96.2   0.013 2.9E-07   61.1   7.3   13   39-51    558-570 (1102)
  7 COG5608 LEA14-like dessication  95.4     1.3 2.9E-05   38.0  15.1   80  119-207    31-110 (161)
  8 KOG1924 RhoA GTPase effector D  94.5    0.07 1.5E-06   56.0   6.0   17   38-54    563-579 (1102)
  9 KOG4849 mRNA cleavage factor I  94.1    0.19 4.2E-06   48.4   7.6   39   38-78    283-339 (498)
 10 KOG2199 Signal transducing ada  89.9    0.67 1.4E-05   45.4   5.8   18   34-51    419-436 (462)
 11 PF02162 XYPPX:  XYPPX repeat (  89.2    0.28   6E-06   25.8   1.4    9   31-39      5-13  (15)
 12 COG5178 PRP8 U5 snRNP spliceos  84.5     0.7 1.5E-05   50.7   2.7   21   15-35      7-27  (2365)
 13 PLN03160 uncharacterized prote  83.9     4.1 8.8E-05   36.8   7.1   81   83-167    33-126 (219)
 14 TIGR02588 conserved hypothetic  74.6      35 0.00076   28.3   9.1   52   97-155     9-62  (122)
 15 PF12868 DUF3824:  Domain of un  67.1      17 0.00036   30.7   5.8    7   39-45     89-95  (137)
 16 PF10907 DUF2749:  Protein of u  59.1      11 0.00023   27.8   2.8   16  104-119    13-28  (66)
 17 PRK13150 cytochrome c-type bio  59.1   1E+02  0.0022   26.7   9.3   40  143-182    77-116 (159)
 18 PRK10893 lipopolysaccharide ex  59.0      47   0.001   29.3   7.5   38  117-155    37-74  (192)
 19 KOG2893 Zn finger protein [Gen  57.5      27 0.00059   32.3   5.8   14   48-61    159-172 (341)
 20 COG5178 PRP8 U5 snRNP spliceos  57.1     9.7 0.00021   42.4   3.2   23   37-59     11-33  (2365)
 21 KOG2192 PolyC-binding hnRNP-K   55.8      17 0.00038   34.1   4.3    8   30-37    236-243 (390)
 22 PF02124 Marek_A:  Marek's dise  54.6      31 0.00067   31.2   5.6   19   96-115   190-208 (211)
 23 COG2332 CcmE Cytochrome c-type  53.5 1.5E+02  0.0032   25.6   9.2   40  143-182    71-110 (153)
 24 PF11322 DUF3124:  Protein of u  53.4      72  0.0016   26.6   7.1   60  136-201    17-76  (125)
 25 KOG3950 Gamma/delta sarcoglyca  52.6      78  0.0017   29.5   7.8   21  139-159   105-125 (292)
 26 PF05399 EVI2A:  Ectropic viral  48.1 1.4E+02   0.003   27.2   8.6   11  172-184   176-186 (227)
 27 PF03302 VSP:  Giardia variant-  47.4      12 0.00025   36.9   1.9   28   92-119   369-396 (397)
 28 PF03100 CcmE:  CcmE;  InterPro  45.1      28  0.0006   28.8   3.6   37  145-181    72-108 (131)
 29 KOG2893 Zn finger protein [Gen  43.8      70  0.0015   29.7   6.1   23   47-69    148-171 (341)
 30 KOG4849 mRNA cleavage factor I  42.9 1.4E+02   0.003   29.3   8.3    8   29-36    221-228 (498)
 31 PF15145 DUF4577:  Domain of un  41.0      18 0.00038   29.6   1.7   26   92-118    63-88  (128)
 32 PF10873 DUF2668:  Protein of u  40.9      68  0.0015   27.5   5.2   11   38-48    119-129 (155)
 33 KOG1923 Rac1 GTPase effector F  40.3   1E+02  0.0022   33.1   7.4   23   17-39    280-302 (830)
 34 PF13179 DUF4006:  Family of un  39.6      23 0.00051   26.2   2.0   27   87-113     4-33  (66)
 35 PF06024 DUF912:  Nucleopolyhed  38.8      40 0.00086   26.7   3.4   25   92-117    63-88  (101)
 36 PF05170 AsmA:  AsmA family;  I  36.5 1.8E+02  0.0039   29.6   8.6   95  120-240   439-534 (604)
 37 PF04478 Mid2:  Mid2 like cell   36.2      22 0.00048   30.6   1.7   24   99-123    59-82  (154)
 38 COG1580 FliL Flagellar basal b  36.1 1.4E+02  0.0029   25.8   6.5    8  109-116    35-42  (159)
 39 PF09624 DUF2393:  Protein of u  35.5 1.6E+02  0.0034   24.4   6.8   56  109-172    34-91  (149)
 40 PF06092 DUF943:  Enterobacteri  34.9      24 0.00053   30.4   1.7   18  100-118    11-28  (157)
 41 PF08113 CoxIIa:  Cytochrome c   34.2      19 0.00041   23.1   0.7   15   99-114    10-24  (34)
 42 PF07787 DUF1625:  Protein of u  34.2      28 0.00061   31.7   2.1   10  109-118   239-248 (248)
 43 PRK13159 cytochrome c-type bio  33.8 1.4E+02   0.003   25.8   6.1   37  146-182    74-110 (155)
 44 PRK05529 cell division protein  33.8      56  0.0012   30.1   4.0   43  120-163    58-128 (255)
 45 PRK06531 yajC preprotein trans  33.7      19 0.00042   29.3   0.9   10  111-120    15-24  (113)
 46 PF01708 Gemini_mov:  Geminivir  33.1      30 0.00066   27.1   1.8    7   72-78     15-21  (91)
 47 PF01102 Glycophorin_A:  Glycop  32.8      13 0.00027   30.8  -0.4   26  101-127    75-101 (122)
 48 PF05568 ASFV_J13L:  African sw  32.4      23 0.00049   30.4   1.1   39   92-130    30-68  (189)
 49 PF14927 Neurensin:  Neurensin   32.4      41 0.00089   28.5   2.6   22   93-114    46-67  (140)
 50 PF12877 DUF3827:  Domain of un  32.2      40 0.00087   35.3   3.0   37   87-124   266-302 (684)
 51 PF04046 PSP:  PSP;  InterPro:   31.5      25 0.00054   24.4   1.0    7   49-55     31-37  (48)
 52 PRK07021 fliL flagellar basal   31.1 1.3E+02  0.0027   25.7   5.6   14  159-172    78-91  (162)
 53 PHA03164 hypothetical protein;  30.4      44 0.00095   25.5   2.2   21   87-107    54-75  (88)
 54 COG4698 Uncharacterized protei  30.2      41 0.00089   29.8   2.3   22  109-130    27-48  (197)
 55 PF11837 DUF3357:  Domain of un  29.3      18 0.00039   28.9   0.0   24   87-111    24-48  (106)
 56 PRK13254 cytochrome c-type bio  28.7 1.8E+02  0.0039   24.8   6.0   39  144-182    71-109 (148)
 57 PTZ00382 Variant-specific surf  28.4      33 0.00071   27.0   1.3   15  102-117    78-93  (96)
 58 PF02009 Rifin_STEVOR:  Rifin/s  28.3      20 0.00043   34.1   0.1   11  106-117   270-280 (299)
 59 COG5353 Uncharacterized protei  28.3      45 0.00098   28.7   2.2   23   96-119    12-34  (161)
 60 KOG2375 Protein interacting wi  27.9 1.4E+02   0.003   32.0   6.1   28   22-51    558-585 (756)
 61 PF11395 DUF2873:  Protein of u  27.5      24 0.00052   23.3   0.3    9  109-117    25-33  (43)
 62 PF12606 RELT:  Tumour necrosis  27.3      25 0.00055   24.6   0.4   30   96-126     5-34  (50)
 63 PF06667 PspB:  Phage shock pro  26.9      21 0.00045   27.1  -0.0    9  109-117    17-25  (75)
 64 PF06072 Herpes_US9:  Alphaherp  26.7      14 0.00031   26.8  -0.9   11  104-114    49-59  (60)
 65 PF11906 DUF3426:  Protein of u  26.3 3.6E+02  0.0079   22.0   8.6   59  123-182    48-107 (149)
 66 PF14155 DUF4307:  Domain of un  26.0 3.5E+02  0.0075   21.7  12.9   38  109-152    19-57  (112)
 67 PF15012 DUF4519:  Domain of un  26.0      73  0.0016   22.8   2.6   11  109-119    46-56  (56)
 68 COG1589 FtsQ Cell division sep  26.0      57  0.0012   30.1   2.7   33  100-133    38-70  (269)
 69 KOG0119 Splicing factor 1/bran  25.5 1.8E+02  0.0038   29.8   6.1   17   39-55    440-458 (554)
 70 PF06637 PV-1:  PV-1 protein (P  24.9      43 0.00092   33.0   1.6   15  139-153    83-97  (442)
 71 PF10164 DUF2367:  Uncharacteri  24.5 1.8E+02  0.0039   23.2   4.8   19   89-107    62-81  (98)
 72 PF02468 PsbN:  Photosystem II   24.4      81  0.0018   21.4   2.5   23   95-118     7-29  (43)
 73 KOG0810 SNARE protein Syntaxin  23.8      53  0.0012   31.2   2.1    8   92-99    272-279 (297)
 74 cd01324 cbb3_Oxidase_CcoQ Cyto  23.5      38 0.00083   23.3   0.8   12  109-120    24-35  (48)
 75 PF12355 Dscam_C:  Down syndrom  23.4      54  0.0012   26.7   1.7   10   15-24     61-70  (124)
 76 KOG1923 Rac1 GTPase effector F  23.1 2.7E+02  0.0058   30.1   7.1    8  111-118   374-381 (830)
 77 KOG2322 N-methyl-D-aspartate r  22.7 1.1E+02  0.0023   28.3   3.7   16   88-103    44-60  (237)
 78 PHA03269 envelope glycoprotein  22.6 1.7E+02  0.0037   30.0   5.3   21   96-117   543-563 (566)
 79 PF04881 Adeno_GP19K:  Adenovir  22.4      57  0.0012   27.3   1.7   25   92-120   106-130 (139)
 80 PF05545 FixQ:  Cbb3-type cytoc  21.9      34 0.00074   23.2   0.3   12  109-120    23-34  (49)
 81 PF06835 LptC:  Lipopolysacchar  21.9   4E+02  0.0087   21.7   6.9   49  120-170    32-80  (176)
 82 PHA03271 envelope glycoprotein  21.6   2E+02  0.0044   28.9   5.5   17   99-116   464-480 (490)
 83 KOG3294 WW domain binding prot  21.4 3.3E+02  0.0071   25.4   6.5   11   16-26    172-182 (261)
 84 PRK13183 psbN photosystem II r  21.3      91   0.002   21.5   2.2   22   96-118    11-32  (46)
 85 KOG0811 SNARE protein PEP12/VA  21.2      43 0.00094   31.4   0.9   23   94-117   246-268 (269)
 86 PF04790 Sarcoglycan_1:  Sarcog  20.9      45 0.00098   31.1   1.0   18  139-156    83-100 (264)
 87 CHL00020 psbN photosystem II p  20.7   1E+02  0.0022   20.9   2.3   22   96-118     8-29  (43)
 88 PF01034 Syndecan:  Syndecan do  20.0      35 0.00075   25.1   0.0   15  103-118    22-36  (64)

No 1  
>PLN03160 uncharacterized protein; Provisional
Probab=99.97  E-value=4.7e-30  Score=230.72  Aligned_cols=178  Identities=15%  Similarity=0.251  Sum_probs=142.3

Q ss_pred             CCCcE-EehhHHHHHHHHHHHHHHheeeEEeecCCCEEEEeeEEEeeeecCC-----CCceeEEEEEEEEEeCCCeeeEE
Q 046217           87 PPSSF-AILTTMMLFVFFTFIVGLVLMFLIARPKFPIFHVETLAVTNFSVAN-----DDFTSAWEINLVVENRNFGANVH  160 (288)
Q Consensus        87 ~r~~~-~l~~~li~lvlllgii~~lIlwLvlrPk~P~f~V~s~svs~fnvs~-----~~L~a~l~~~Lsv~NPN~k~~i~  160 (288)
                      +|+|+ ||+++++++++|++++ +.++|++||||+|+|+|++++|++|++++     ..+|++++++++++|||. ++|+
T Consensus        33 r~~~~~c~~~~~a~~l~l~~v~-~~l~~~vfrPk~P~~~v~~v~l~~~~~~~~~~~~~~~n~tl~~~v~v~NPN~-~~~~  110 (219)
T PLN03160         33 RRNCIKCCGCITATLLILATTI-LVLVFTVFRVKDPVIKMNGVTVTKLELINNTTLRPGTNITLIADVSVKNPNV-ASFK  110 (219)
T ss_pred             cccceEEHHHHHHHHHHHHHHH-HheeeEEEEccCCeEEEEEEEEeeeeeccCCCCceeEEEEEEEEEEEECCCc-eeEE
Confidence            44555 5545554555555555 88999999999999999999999999864     257888899999999999 8999


Q ss_pred             EcceEEEEEECCceeeeEeeecCCCcceEecCCceEEEEEEEEeccccccCCCCChHHHHHHHhhhcCCeEEEEEEEEEE
Q 046217          161 FDRIQSFIYFNTDDLLASSHFDHPTKDVTVKANEKGTIDIKLSTADYKQSVPAPGKRVMEEMRNIRETGVVPFSLRAFAW  240 (288)
Q Consensus       161 Yd~~~~~v~Y~g~~~lg~a~vp~p~~~F~q~~k~tt~v~v~l~~~~~~~~~~~l~~~~~~~l~~d~~~G~V~~~v~v~~r  240 (288)
                      |+++++.++|+|+ .+|++.+| +   |+|++++++.+++++.....+    .+.   ..+|.+|.++|.++|+++++.+
T Consensus       111 Y~~~~~~v~Y~g~-~vG~a~~p-~---g~~~ar~T~~l~~tv~~~~~~----~~~---~~~L~~D~~~G~v~l~~~~~v~  178 (219)
T PLN03160        111 YSNTTTTIYYGGT-VVGEARTP-P---GKAKARRTMRMNVTVDIIPDK----ILS---VPGLLTDISSGLLNMNSYTRIG  178 (219)
T ss_pred             EcCeEEEEEECCE-EEEEEEcC-C---cccCCCCeEEEEEEEEEEece----ecc---chhHHHHhhCCeEEEEEEEEEE
Confidence            9999999999887 89999999 8   999999999999998766542    122   2468889999999999877777


Q ss_pred             EEEEEeEEEEccceeEEEEEeCceEEeeeCCCCCcccccCCCCceec
Q 046217          241 SQIETGLFYTETKHIQSRIICEDLKVHFEGDTGNGKLNGKSKDCVVD  287 (288)
Q Consensus       241 vr~kvG~~~t~~~~~~~~v~C~~l~V~l~~~~~~G~l~~~~~~C~v~  287 (288)
                      .+.++|.+++  +++.++++|+ +.|.+.+    ..  .+..+|+.+
T Consensus       179 gkVkv~~i~k--~~v~~~v~C~-v~V~~~~----~~--i~~~~C~~~  216 (219)
T PLN03160        179 GKVKILKIIK--KHVVVKMNCT-MTVNITS----QA--IQGQKCKRH  216 (219)
T ss_pred             EEEEEEEEEE--EEEEEEEEeE-EEEECCC----CE--EeccEeccc
Confidence            7777777766  8999999999 8886622    22  245569865


No 2  
>PF03168 LEA_2:  Late embryogenesis abundant protein;  InterPro: IPR004864 Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress [, ]. The function of these proteins is unknown. ; PDB: 3BUT_A 1XO8_A 1YYC_A.
Probab=99.24  E-value=4.9e-11  Score=92.72  Aligned_cols=98  Identities=20%  Similarity=0.348  Sum_probs=70.6

Q ss_pred             EEEEEeCCCeeeEEEcceEEEEEECCceeee-EeeecCCCcceEecCCceEEEEEEEEeccccccCCCCChHHHHHHHhh
Q 046217          147 NLVVENRNFGANVHFDRIQSFIYFNTDDLLA-SSHFDHPTKDVTVKANEKGTIDIKLSTADYKQSVPAPGKRVMEEMRNI  225 (288)
Q Consensus       147 ~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg-~a~vp~p~~~F~q~~k~tt~v~v~l~~~~~~~~~~~l~~~~~~~l~~d  225 (288)
                      +|+++|||. ++++|+++++.++|+|. .+| ....+ +   |.|++++++.+.+.+..+..     .+    .+.+.++
T Consensus         1 ~l~v~NPN~-~~i~~~~~~~~v~~~g~-~v~~~~~~~-~---~~i~~~~~~~v~~~v~~~~~-----~l----~~~l~~~   65 (101)
T PF03168_consen    1 TLSVRNPNS-FGIRYDSIEYDVYYNGQ-RVGTGGSLP-P---FTIPARSSTTVPVPVSVDYS-----DL----PRLLKDL   65 (101)
T ss_dssp             EEEEEESSS-S-EEEEEEEEEEEESSS-EEEEEEECE-----EEESSSCEEEEEEEEEEEHH-----HH----HHHHHHH
T ss_pred             CEEEECCCc-eeEEEeCEEEEEEECCE-EEECccccC-C---eEECCCCcEEEEEEEEEcHH-----HH----HHHHHhh
Confidence            589999999 99999999999999887 788 55566 7   99999999999998887654     22    4556666


Q ss_pred             hcCCeEEEEEEEEEEEEEEE-eEEEEccceeEEEEEeC
Q 046217          226 RETGVVPFSLRAFAWSQIET-GLFYTETKHIQSRIICE  262 (288)
Q Consensus       226 ~~~G~V~~~v~v~~rvr~kv-G~~~t~~~~~~~~v~C~  262 (288)
                      . +|...+++.++++.++++ +.+..  ++..+.++|+
T Consensus        66 ~-~~~~~~~v~~~~~g~~~v~~~~~~--~~~~v~~~~~  100 (101)
T PF03168_consen   66 L-AGRVPFDVTYRIRGTFKVLGTPIF--GSVRVPVSCE  100 (101)
T ss_dssp             H-HTTSCEEEEEEEEEEEE-EE-TTT--SCEEEEEEEE
T ss_pred             h-ccccceEEEEEEEEEEEEccccee--eeEEEeEEeE
Confidence            6 556666666777777774 43333  4555555554


No 3  
>smart00769 WHy Water Stress and Hypersensitive response.
Probab=97.98  E-value=0.00011  Score=57.97  Aligned_cols=62  Identities=16%  Similarity=0.225  Sum_probs=56.4

Q ss_pred             CceeEEEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeecCCCcceEecCCceEEEEEEEEec
Q 046217          139 DFTSAWEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFDHPTKDVTVKANEKGTIDIKLSTA  205 (288)
Q Consensus       139 ~L~a~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp~p~~~F~q~~k~tt~v~v~l~~~  205 (288)
                      .++.++.+.|.+.|||. +.+.++.++..++++|. .+|++..+.+   +..++++++.+.+.+..+
T Consensus        12 ~~~~~~~l~l~v~NPN~-~~l~~~~~~y~l~~~g~-~v~~g~~~~~---~~ipa~~~~~v~v~~~~~   73 (100)
T smart00769       12 GLEIEIVLKVKVQNPNP-FPIPVNGLSYDLYLNGV-ELGSGEIPDS---GTLPGNGRTVLDVPVTVN   73 (100)
T ss_pred             ceEEEEEEEEEEECCCC-CccccccEEEEEEECCE-EEEEEEcCCC---cEECCCCcEEEEEEEEee
Confidence            56789999999999999 99999999999999886 8999998646   999999999999998884


No 4  
>PF07092 DUF1356:  Protein of unknown function (DUF1356);  InterPro: IPR009790 This family consists of several hypothetical mammalian proteins of around 250 residues in length. The function of this family is unknown.
Probab=97.88  E-value=0.0016  Score=59.36  Aligned_cols=83  Identities=10%  Similarity=0.127  Sum_probs=60.9

Q ss_pred             ecCCCEEEEeeEEEee--eecCCCCceeEEEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeecCCCcceEecCCc
Q 046217          117 RPKFPIFHVETLAVTN--FSVANDDFTSAWEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFDHPTKDVTVKANE  194 (288)
Q Consensus       117 rPk~P~f~V~s~svs~--fnvs~~~L~a~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp~p~~~F~q~~k~  194 (288)
                      -||.-.++-.++....  |+-.++.+..+++-.|.++|+|. ..+...++.+.+.|... .+|.+... .  ....++++
T Consensus        97 fPRsV~v~~~gv~s~~V~f~~~~~~v~l~itn~lNIsN~NF-y~V~Vt~~s~qv~~~~~-VVG~~~~~-~--~~~I~Prs  171 (238)
T PF07092_consen   97 FPRSVTVSPVGVKSVTVSFNPDKSTVQLNITNTLNISNPNF-YPVTVTNLSIQVLYMKT-VVGKGKNS-N--ITVIGPRS  171 (238)
T ss_pred             eCcEEEEecCcEEEEEEEEeCCCCEEEEEEEEEEEccCCCE-EEEEEEeEEEEEEEEEe-EEeeeEec-c--eEEecccC
Confidence            4776444433333222  33344568899999999999999 99999999999999765 89988776 3  26788888


Q ss_pred             eEEEEEEEEe
Q 046217          195 KGTIDIKLST  204 (288)
Q Consensus       195 tt~v~v~l~~  204 (288)
                      .+.+..++..
T Consensus       172 ~~q~~~tV~t  181 (238)
T PF07092_consen  172 SKQVNYTVKT  181 (238)
T ss_pred             CceEEEEeeE
Confidence            8877776665


No 5  
>PF12751 Vac7:  Vacuolar segregation subunit 7;  InterPro: IPR024260 Vac7 is localised at the vacuole membrane, a location which is consistent with its involvement in vacuole morphology and inheritance []. Vac7 has been shown to function as an upstream regulator of the Fab1 lipid kinase pathway []. The Fab1 lipid pathway is important for correct regulation of membrane trafficking events.
Probab=96.50  E-value=0.0071  Score=58.65  Aligned_cols=40  Identities=13%  Similarity=0.248  Sum_probs=32.6

Q ss_pred             CceeEEEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEee
Q 046217          139 DFTSAWEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSH  180 (288)
Q Consensus       139 ~L~a~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~  180 (288)
                      .--.-|++++.+.|||. +.|..++.++.|+-+.. .+++-.
T Consensus       345 ~qELmfdl~V~A~NPn~-~~V~I~d~dldIFAKS~-yvg~D~  384 (387)
T PF12751_consen  345 EQELMFDLTVEAFNPNW-FTVTIDDMDLDIFAKSR-YVGTDS  384 (387)
T ss_pred             cceEEEeeEEEEECCCe-EEEEeccceeeeEecCC-ccCccc
Confidence            34567889999999999 99999999999998775 555543


No 6  
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=96.18  E-value=0.013  Score=61.07  Aligned_cols=13  Identities=38%  Similarity=0.672  Sum_probs=6.6

Q ss_pred             CCCCCCCCCCCCC
Q 046217           39 PPAMGYPPAMGYP   51 (288)
Q Consensus        39 ~~~~~~~~~~~~~   51 (288)
                      ||+++-|+-+|-|
T Consensus       558 PPPpplPg~aG~P  570 (1102)
T KOG1924|consen  558 PPPPPLPGIAGGP  570 (1102)
T ss_pred             ccCCCCCcccCCC
Confidence            4445555555544


No 7  
>COG5608 LEA14-like dessication related protein [Defense mechanisms]
Probab=95.41  E-value=1.3  Score=38.02  Aligned_cols=80  Identities=18%  Similarity=0.140  Sum_probs=60.7

Q ss_pred             CCCEEEEeeEEEeeeecCCCCceeEEEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeecCCCcceEecCCceEEE
Q 046217          119 KFPIFHVETLAVTNFSVANDDFTSAWEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFDHPTKDVTVKANEKGTI  198 (288)
Q Consensus       119 k~P~f~V~s~svs~fnvs~~~L~a~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp~p~~~F~q~~k~tt~v  198 (288)
                      +.|.+.--.+..-...    .....+-.++.++|||. +.|-...++..++-++ ..+|.+....+   +..++++..++
T Consensus        31 ~~p~ve~~ka~wGkvt----~s~~EiV~t~KiyNPN~-fPipVtgl~y~vymN~-Iki~eG~~~k~---~~v~p~S~~tv  101 (161)
T COG5608          31 KKPGVESMKAKWGKVT----NSETEIVGTLKIYNPNP-FPIPVTGLQYAVYMND-IKIGEGEILKG---TTVPPNSRETV  101 (161)
T ss_pred             CCCCceEEEEEEEEEe----ccceEEEEEEEecCCCC-cceeeeceEEEEEEcc-eEeeccccccc---eEECCCCeEEE
Confidence            3454444444443332    23467889999999999 9999999999998866 58999987645   99999999999


Q ss_pred             EEEEEeccc
Q 046217          199 DIKLSTADY  207 (288)
Q Consensus       199 ~v~l~~~~~  207 (288)
                      ++.+..+..
T Consensus       102 dv~l~~d~~  110 (161)
T COG5608         102 DVPLRLDNS  110 (161)
T ss_pred             EEEEEEehH
Confidence            999887654


No 8  
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=94.53  E-value=0.07  Score=55.97  Aligned_cols=17  Identities=47%  Similarity=0.731  Sum_probs=7.2

Q ss_pred             CCCCCCCCCCCCCCCCC
Q 046217           38 LPPAMGYPPAMGYPPGY   54 (288)
Q Consensus        38 ~~~~~~~~~~~~~~~~~   54 (288)
                      .|+-+|-||.-+.|||.
T Consensus       563 lPg~aG~PPpPppppg~  579 (1102)
T KOG1924|consen  563 LPGIAGGPPPPPPPPGG  579 (1102)
T ss_pred             CCcccCCCCccCCCCCC
Confidence            33344444444444443


No 9  
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=94.13  E-value=0.19  Score=48.38  Aligned_cols=39  Identities=44%  Similarity=0.686  Sum_probs=23.6

Q ss_pred             CCCCCCCCCC---------CCCCCCCCCCCCccc--cCCCCCC-------CCCccccCC
Q 046217           38 LPPAMGYPPA---------MGYPPGYRNGYDQQH--YQQGPPP-------SPDMYYAPP   78 (288)
Q Consensus        38 ~~~~~~~~~~---------~~~~~~~~~~~~~~~--~~~~p~~-------~~~~y~~~~   78 (288)
                      .+|+||.||-         ||.|||-++  +.-|  .++.|||       +|+|||.-.
T Consensus       283 p~pp~~~ppRP~m~l~phl~gppPga~p--Paph~NpaffpPP~~~s~GppP~~~~~~~  339 (498)
T KOG4849|consen  283 PQPPMGQPPRPMMQLSPHLMGPPPGAGP--PAPHNNPAFFPPPQLGSMGPPPQMNTAMR  339 (498)
T ss_pred             CCCCCCCCCCcccccCcccCCCCCCCCC--CCcccCcccCCCCCcCcCCCCCCCccCCC
Confidence            4666677766         677777653  2223  4555555       569999654


No 10 
>KOG2199 consensus Signal transducing adaptor protein STAM/STAM2 [Signal transduction mechanisms]
Probab=89.91  E-value=0.67  Score=45.44  Aligned_cols=18  Identities=22%  Similarity=0.414  Sum_probs=12.8

Q ss_pred             CCCCCCCCCCCCCCCCCC
Q 046217           34 PFPPLPPAMGYPPAMGYP   51 (288)
Q Consensus        34 ~~~~~~~~~~~~~~~~~~   51 (288)
                      ||+.+|-.++|.|+.+|+
T Consensus       419 Pp~~~~~~p~~~~q~~Y~  436 (462)
T KOG2199|consen  419 PPNSNPATPQQQPQLSYS  436 (462)
T ss_pred             CCccCCCCCCCCcccccc
Confidence            555566667788888887


No 11 
>PF02162 XYPPX:  XYPPX repeat (two copies);  InterPro: IPR006031 This repeat is found in a wide variety of proteins and generally consists of the motif XYPPX where X can be any amino acid. The family includes annexin VII ANX7_DICDI, the carboxy tail of certain rhodopsins OPSD_LOLSU. This family also includes plaque matrix proteins, however this motif is embedded in a ten residue repeat in FP1_MYTED. The molecular function of this repeat is unknown. It is also not clear is all the members of this family share a common evolutionary ancestor due to its short length and biased amino acid composition. 
Probab=89.22  E-value=0.28  Score=25.78  Aligned_cols=9  Identities=33%  Similarity=0.316  Sum_probs=3.4

Q ss_pred             CCCCCCCCC
Q 046217           31 QPAPFPPLP   39 (288)
Q Consensus        31 ~~~~~~~~~   39 (288)
                      |++|||.+|
T Consensus         5 qG~pPQ~~P   13 (15)
T PF02162_consen    5 QGYPPQGYP   13 (15)
T ss_pred             cCCCCCCCC
Confidence            333333333


No 12 
>COG5178 PRP8 U5 snRNP spliceosome subunit [RNA processing and modification]
Probab=84.47  E-value=0.7  Score=50.74  Aligned_cols=21  Identities=43%  Similarity=0.872  Sum_probs=15.6

Q ss_pred             CCCCCCCCCCCCCCCCCCCCC
Q 046217           15 EIPEPPPPTRQYFEPQQPAPF   35 (288)
Q Consensus        15 ~~~~~~~~~~~~~~~~~~~~~   35 (288)
                      +-|.||||+|+|-||+||+||
T Consensus         7 g~ppppppppg~epps~pppP   27 (2365)
T COG5178           7 GNPPPPPPPPGFEPPSQPPPP   27 (2365)
T ss_pred             CCCcccccCCCCCCCCCCCCc
Confidence            347888888888888875543


No 13 
>PLN03160 uncharacterized protein; Provisional
Probab=83.90  E-value=4.1  Score=36.81  Aligned_cols=81  Identities=10%  Similarity=0.093  Sum_probs=48.8

Q ss_pred             CCCCCCCcEEehhHHHHHHHHHHHHHHheeeEEeecC--CCEEEEeeEEEee-------eecCC----CCceeEEEEEEE
Q 046217           83 QDPGPPSSFAILTTMMLFVFFTFIVGLVLMFLIARPK--FPIFHVETLAVTN-------FSVAN----DDFTSAWEINLV  149 (288)
Q Consensus        83 ~~~~~r~~~~l~~~li~lvlllgii~~lIlwLvlrPk--~P~f~V~s~svs~-------fnvs~----~~L~a~l~~~Ls  149 (288)
                      ++.+-++|.|+|+++++++++++++ +++++-.=.|+  .-.++|+++.++.       +|++-    ..-|-|. +.+.
T Consensus        33 r~~~~~c~~~~~a~~l~l~~v~~~l-~~~vfrPk~P~~~v~~v~l~~~~~~~~~~~~~~~n~tl~~~v~v~NPN~-~~~~  110 (219)
T PLN03160         33 RRNCIKCCGCITATLLILATTILVL-VFTVFRVKDPVIKMNGVTVTKLELINNTTLRPGTNITLIADVSVKNPNV-ASFK  110 (219)
T ss_pred             cccceEEHHHHHHHHHHHHHHHHhe-eeEEEEccCCeEEEEEEEEeeeeeccCCCCceeEEEEEEEEEEEECCCc-eeEE
Confidence            3556778888888888877777777 66666656664  4556666666532       23221    0123444 4444


Q ss_pred             EEeCCCeeeEEEcceEEE
Q 046217          150 VENRNFGANVHFDRIQSF  167 (288)
Q Consensus       150 v~NPN~k~~i~Yd~~~~~  167 (288)
                      ..  |..+.++|+...+.
T Consensus       111 Y~--~~~~~v~Y~g~~vG  126 (219)
T PLN03160        111 YS--NTTTTIYYGGTVVG  126 (219)
T ss_pred             Ec--CeEEEEEECCEEEE
Confidence            43  45578888776553


No 14 
>TIGR02588 conserved hypothetical protein TIGR02588. The function of this protein is unknown. It is always found as part of a two-gene operon with TIGR02587, a protein that appears to span the membrane seven times. It is found in Nostoc sp. PCC 7120, Agrobacterium tumefaciens, Sinorhizobium meliloti, and Gloeobacter violaceus, so far, all of which are bacterial.
Probab=74.63  E-value=35  Score=28.27  Aligned_cols=52  Identities=10%  Similarity=0.114  Sum_probs=31.7

Q ss_pred             HHHHHHHHHHHHHheeeE--EeecCCCEEEEeeEEEeeeecCCCCceeEEEEEEEEEeCCC
Q 046217           97 MMLFVFFTFIVGLVLMFL--IARPKFPIFHVETLAVTNFSVANDDFTSAWEINLVVENRNF  155 (288)
Q Consensus        97 li~lvlllgii~~lIlwL--vlrPk~P~f~V~s~svs~fnvs~~~L~a~l~~~Lsv~NPN~  155 (288)
                      .+.++++++++ .+++|.  .-+.+.|.+++......+-.      ...+-+-++++|--.
T Consensus         9 ~Is~~ill~vi-glv~y~~l~~~~~pp~l~v~~~~~~r~~------~gqyyVpF~V~N~gg   62 (122)
T TIGR02588         9 GISTLILAAMF-GLVAYDWLRYSNKAAVLEVAPAEVERMQ------TGQYYVPFAIHNLGG   62 (122)
T ss_pred             HHHHHHHHHHH-HHHHHHhhccCCCCCeEEEeehheeEEe------CCEEEEEEEEEeCCC
Confidence            33445555555 555554  45678899998887665421      234666677777655


No 15 
>PF12868 DUF3824:  Domain of unknwon function (DUF3824);  InterPro: IPR024436 This repeating domain is proline-rich but its function is unknown.
Probab=67.06  E-value=17  Score=30.73  Aligned_cols=7  Identities=57%  Similarity=1.279  Sum_probs=2.6

Q ss_pred             CCCCCCC
Q 046217           39 PPAMGYP   45 (288)
Q Consensus        39 ~~~~~~~   45 (288)
                      ++...||
T Consensus        89 ~~~~~~p   95 (137)
T PF12868_consen   89 PPPPPYP   95 (137)
T ss_pred             CCCCCCC
Confidence            3333333


No 16 
>PF10907 DUF2749:  Protein of unknown function (DUF2749);  InterPro: IPR024475 This bacterial family of proteins represent the TrbJ and TrbK genes of the Ti plasmid conjugative transfer operon [].
Probab=59.14  E-value=11  Score=27.85  Aligned_cols=16  Identities=13%  Similarity=0.266  Sum_probs=12.1

Q ss_pred             HHHHHHheeeEEeecC
Q 046217          104 TFIVGLVLMFLIARPK  119 (288)
Q Consensus       104 lgii~~lIlwLvlrPk  119 (288)
                      +++.+..+.|++++|+
T Consensus        13 vaa~a~~atwviVq~~   28 (66)
T PF10907_consen   13 VAAAAGAATWVIVQPR   28 (66)
T ss_pred             HHhhhceeEEEEECCC
Confidence            4444478899999998


No 17 
>PRK13150 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=59.08  E-value=1e+02  Score=26.72  Aligned_cols=40  Identities=8%  Similarity=0.247  Sum_probs=28.1

Q ss_pred             EEEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeec
Q 046217          143 AWEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFD  182 (288)
Q Consensus       143 ~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp  182 (288)
                      .+.+.+.+..-+..+.+.|+.+-=.++=.|+..++++.+.
T Consensus        77 ~~~v~F~vtD~~~~v~V~Y~GilPDlFrEG~gVVveG~~~  116 (159)
T PRK13150         77 SLKVNFSLYDAEGSVTVSYEGILPDLFREGQGVVVQGTLE  116 (159)
T ss_pred             CcEEEEEEEcCCcEEEEEEeccCCccccCCCeEEEEEEEC
Confidence            4556666666677788888877666666666677777775


No 18 
>PRK10893 lipopolysaccharide exporter periplasmic protein; Provisional
Probab=59.05  E-value=47  Score=29.34  Aligned_cols=38  Identities=5%  Similarity=0.029  Sum_probs=25.7

Q ss_pred             ecCCCEEEEeeEEEeeeecCCCCceeEEEEEEEEEeCCC
Q 046217          117 RPKFPIFHVETLAVTNFSVANDDFTSAWEINLVVENRNF  155 (288)
Q Consensus       117 rPk~P~f~V~s~svs~fnvs~~~L~a~l~~~Lsv~NPN~  155 (288)
                      .++.|.|.+++++...|+.. +.++..++..=.-+=|+.
T Consensus        37 ~~~~Pdy~~~~~~~~~yd~~-G~l~y~l~a~~~~Hy~~~   74 (192)
T PRK10893         37 NNNDPTYQSQHTDTVVYNPE-GALSYKLVAQHVEYYSDQ   74 (192)
T ss_pred             CCCCCCEEEeccEEEEECCC-CCEEEEEEecceEEcCCC
Confidence            46789999999999999875 355555554443333444


No 19 
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=57.47  E-value=27  Score=32.34  Aligned_cols=14  Identities=21%  Similarity=0.247  Sum_probs=5.9

Q ss_pred             CCCCCCCCCCCCcc
Q 046217           48 MGYPPGYRNGYDQQ   61 (288)
Q Consensus        48 ~~~~~~~~~~~~~~   61 (288)
                      |-.|+|||+|-.+-
T Consensus       159 ~ppp~g~pp~~~pg  172 (341)
T KOG2893|consen  159 YPPPRGYPPAPAPG  172 (341)
T ss_pred             CCCCCCCCCCCCCc
Confidence            33334555444333


No 20 
>COG5178 PRP8 U5 snRNP spliceosome subunit [RNA processing and modification]
Probab=57.12  E-value=9.7  Score=42.43  Aligned_cols=23  Identities=35%  Similarity=0.701  Sum_probs=17.8

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCC
Q 046217           37 PLPPAMGYPPAMGYPPGYRNGYD   59 (288)
Q Consensus        37 ~~~~~~~~~~~~~~~~~~~~~~~   59 (288)
                      |.||++||-|..+.||-||+||+
T Consensus        11 ppppppg~epps~pppPppPg~~   33 (2365)
T COG5178          11 PPPPPPGFEPPSQPPPPPPPGVN   33 (2365)
T ss_pred             ccccCCCCCCCCCCCCccCCCcc
Confidence            35677788777788888889986


No 21 
>KOG2192 consensus PolyC-binding hnRNP-K protein HRB57A/hnRNP, contains KH domain [RNA processing and modification; General function prediction only]
Probab=55.81  E-value=17  Score=34.15  Aligned_cols=8  Identities=38%  Similarity=0.455  Sum_probs=3.6

Q ss_pred             CCCCCCCC
Q 046217           30 QQPAPFPP   37 (288)
Q Consensus        30 ~~~~~~~~   37 (288)
                      -|++|+++
T Consensus       236 gqgpp~~~  243 (390)
T KOG2192|consen  236 GQGPPPPR  243 (390)
T ss_pred             CCCCCCCC
Confidence            34444444


No 22 
>PF02124 Marek_A:  Marek's disease glycoprotein A;  InterPro: IPR001038  Equid herpesvirus 1 (Equine herpesvirus 1, EHV-1) glycoprotein 13 (EHV-1 gp13) has the characteristic features of a membrane-spanning protein: an N-terminal signal sequence; a hydrophobic membrane anchor region; a charged C-terminal cytoplasmic tail; and an exterior domain with nine potential N-glycosylation sites []. EHV-1 gp13 is the structural homologue of the gC-like glycoproteins of the Human herpesvirus 1 (HHV-1) and Human herpesvirus 2 (HHV-2) (gC-1 and gC-2 respectively), Pseudorabies virus (strain Indiana-Funkhauser/Becker) (PRV) (gIII) and Human herpesvirus 3 (HHV-3) (gp66).  Secretory glycoprotein GP57-65 precursor (glycoprotein A - GA) is similar to Herpesvirus glycoprotein C, and belongs to the immunoglobulin gene superfamily [, ]. GA is thought to play an immunoevasive role in the pathogenesis of Marek's disease. It is a candidate for causing the early-stage immunosuppression that occurs after MDHV infection.
Probab=54.62  E-value=31  Score=31.24  Aligned_cols=19  Identities=5%  Similarity=-0.115  Sum_probs=10.4

Q ss_pred             HHHHHHHHHHHHHHheeeEE
Q 046217           96 TMMLFVFFTFIVGLVLMFLI  115 (288)
Q Consensus        96 ~li~lvlllgii~~lIlwLv  115 (288)
                      ++++.++++|++ ++|+-+.
T Consensus       190 ~Vi~g~~~~g~~-~~i~alc  208 (211)
T PF02124_consen  190 GVICGAGALGLL-VLIAALC  208 (211)
T ss_pred             HHHHHHHHHHHH-HhhEEEE
Confidence            555556666655 5554443


No 23 
>COG2332 CcmE Cytochrome c-type biogenesis protein CcmE [Posttranslational modification, protein turnover, chaperones]
Probab=53.55  E-value=1.5e+02  Score=25.57  Aligned_cols=40  Identities=15%  Similarity=0.347  Sum_probs=29.4

Q ss_pred             EEEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeec
Q 046217          143 AWEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFD  182 (288)
Q Consensus       143 ~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp  182 (288)
                      .+++.+.+.--|.++.|.|.-+-=.++=.|+..++++.+.
T Consensus        71 ~~~v~F~vtD~~~~v~V~Y~GiLPDLFREGQgVVa~G~~~  110 (153)
T COG2332          71 SLKVSFVVTDGNKSVTVSYEGILPDLFREGQGVVAEGQLQ  110 (153)
T ss_pred             CcEEEEEEecCCceEEEEEeccCchhhhcCCeEEEEEEec
Confidence            4555556667788888999887777777777778888775


No 24 
>PF11322 DUF3124:  Protein of unknown function (DUF3124);  InterPro: IPR021471  This bacterial family of proteins has no known function. 
Probab=53.38  E-value=72  Score=26.55  Aligned_cols=60  Identities=18%  Similarity=0.300  Sum_probs=40.6

Q ss_pred             CCCCceeEEEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeecCCCcceEecCCceEEEEEE
Q 046217          136 ANDDFTSAWEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFDHPTKDVTVKANEKGTIDIK  201 (288)
Q Consensus       136 s~~~L~a~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp~p~~~F~q~~k~tt~v~v~  201 (288)
                      .+.....+|+++|++||.+.+-.|.-.+++   +|+.+-.+-...+..|   ...++-.+..+-+.
T Consensus        17 ~~~~~~~~Lt~tLSiRNtd~~~~i~i~~v~---Yydt~G~lvr~yl~~P---i~L~Pl~t~~~vV~   76 (125)
T PF11322_consen   17 GNKHRPFNLTATLSIRNTDPTDPIYITSVD---YYDTDGKLVRSYLDKP---IYLKPLATTEFVVE   76 (125)
T ss_pred             cCCCceEeEEEEEEEEcCCCCCCEEEEEEE---EECCCCeEhHHhcCCC---eEcCCCceEEEEEe
Confidence            344678899999999999987777765442   5544333444455556   77887777776543


No 25 
>KOG3950 consensus Gamma/delta sarcoglycan [Cytoskeleton]
Probab=52.62  E-value=78  Score=29.48  Aligned_cols=21  Identities=19%  Similarity=0.074  Sum_probs=16.5

Q ss_pred             CceeEEEEEEEEEeCCCeeeE
Q 046217          139 DFTSAWEINLVVENRNFGANV  159 (288)
Q Consensus       139 ~L~a~l~~~Lsv~NPN~k~~i  159 (288)
                      .+...=++++.+||||.++.=
T Consensus       105 ~~~S~rnvtvnarn~~g~v~~  125 (292)
T KOG3950|consen  105 YLQSARNVTVNARNPNGKVTG  125 (292)
T ss_pred             EEEeccCeeEEccCCCCceee
Confidence            456677899999999997643


No 26 
>PF05399 EVI2A:  Ectropic viral integration site 2A protein (EVI2A);  InterPro: IPR008608 This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours [, ].; GO: 0016021 integral to membrane
Probab=48.06  E-value=1.4e+02  Score=27.20  Aligned_cols=11  Identities=55%  Similarity=0.588  Sum_probs=7.6

Q ss_pred             CceeeeEeeecCC
Q 046217          172 TDDLLASSHFDHP  184 (288)
Q Consensus       172 g~~~lg~a~vp~p  184 (288)
                      || +|++..++ |
T Consensus       176 GD-FLASSgLW-P  186 (227)
T PF05399_consen  176 GD-FLASSGLW-P  186 (227)
T ss_pred             cc-eeeccccC-c
Confidence            44 77777777 5


No 27 
>PF03302 VSP:  Giardia variant-specific surface protein;  InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=47.44  E-value=12  Score=36.92  Aligned_cols=28  Identities=21%  Similarity=0.305  Sum_probs=17.8

Q ss_pred             EehhHHHHHHHHHHHHHHheeeEEeecC
Q 046217           92 AILTTMMLFVFFTFIVGLVLMFLIARPK  119 (288)
Q Consensus        92 ~l~~~li~lvlllgii~~lIlwLvlrPk  119 (288)
                      +.-+.+.++|++.||+++|.-|+|.|=|
T Consensus       369 IaGIsvavvvvVgglvGfLcWwf~crgk  396 (397)
T PF03302_consen  369 IAGISVAVVVVVGGLVGFLCWWFICRGK  396 (397)
T ss_pred             eeeeeehhHHHHHHHHHHHhhheeeccc
Confidence            3334555566777788666667777744


No 28 
>PF03100 CcmE:  CcmE;  InterPro: IPR004329 CcmE is the product of one of a cluster of Ccm genes that are necessary for cytochrome c biosynthesis in eubacteria. Expression of these proteins is induced when the organisms are grown under anaerobic conditions with nitrate or nitrite as the final electron acceptor.; GO: 0017003 protein-heme linkage, 0017004 cytochrome complex assembly, 0005886 plasma membrane; PDB: 1SR3_A 2KCT_A 1J6Q_A 1LM0_A.
Probab=45.13  E-value=28  Score=28.79  Aligned_cols=37  Identities=3%  Similarity=0.125  Sum_probs=16.7

Q ss_pred             EEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeee
Q 046217          145 EINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHF  181 (288)
Q Consensus       145 ~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~v  181 (288)
                      .+++.+..-+..+.+.|+..-=..+=.++..+.++.+
T Consensus        72 ~~~F~i~D~~~~i~V~Y~G~~Pd~F~eg~~VVv~G~~  108 (131)
T PF03100_consen   72 TLTFTITDGGKEIPVVYTGPLPDLFREGQGVVVEGRL  108 (131)
T ss_dssp             EEEEEEE-SS-EEEEEEES--CTT--TTSEEEEEEEE
T ss_pred             EEEEEEEECCcEEEEEECCCCCccccCCCeEEEEEEE
Confidence            4444455556667777755433333344445555555


No 29 
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=43.77  E-value=70  Score=29.73  Aligned_cols=23  Identities=30%  Similarity=0.609  Sum_probs=10.2

Q ss_pred             CCCCCCCCCCCCCccc-cCCCCCC
Q 046217           47 AMGYPPGYRNGYDQQH-YQQGPPP   69 (288)
Q Consensus        47 ~~~~~~~~~~~~~~~~-~~~~p~~   69 (288)
                      .|+.|+|-|..|++-- |++|++|
T Consensus       148 ~~~~p~gmp~~~ppp~g~pp~~~p  171 (341)
T KOG2893|consen  148 GMMPPRGMPGAYPPPRGYPPAPAP  171 (341)
T ss_pred             CCCCCCCCCCCCCCCCCCCCCCCC
Confidence            3444455444444333 4444433


No 30 
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=42.86  E-value=1.4e+02  Score=29.31  Aligned_cols=8  Identities=50%  Similarity=0.567  Sum_probs=3.1

Q ss_pred             CCCCCCCC
Q 046217           29 PQQPAPFP   36 (288)
Q Consensus        29 ~~~~~~~~   36 (288)
                      .+|-.|-+
T Consensus       221 ~~Q~~P~P  228 (498)
T KOG4849|consen  221 MQQVRPTP  228 (498)
T ss_pred             cccCCCCC
Confidence            34443333


No 31 
>PF15145 DUF4577:  Domain of unknown function (DUF4577)
Probab=41.00  E-value=18  Score=29.63  Aligned_cols=26  Identities=15%  Similarity=0.360  Sum_probs=15.8

Q ss_pred             EehhHHHHHHHHHHHHHHheeeEEeec
Q 046217           92 AILTTMMLFVFFTFIVGLVLMFLIARP  118 (288)
Q Consensus        92 ~l~~~li~lvlllgii~~lIlwLvlrP  118 (288)
                      .+...++++++-++++ .|++||+++-
T Consensus        63 ffvglii~LivSLaLV-sFvIFLiiQT   88 (128)
T PF15145_consen   63 FFVGLIIVLIVSLALV-SFVIFLIIQT   88 (128)
T ss_pred             hHHHHHHHHHHHHHHH-HHHHHheeec
Confidence            3444455555666666 7777777653


No 32 
>PF10873 DUF2668:  Protein of unknown function (DUF2668);  InterPro: IPR022640  Members in this family of proteins are annotated as cysteine and tyrosine-rich protein 1, however currently no function is known []. 
Probab=40.93  E-value=68  Score=27.46  Aligned_cols=11  Identities=36%  Similarity=0.824  Sum_probs=4.8

Q ss_pred             CCCCCCCCCCC
Q 046217           38 LPPAMGYPPAM   48 (288)
Q Consensus        38 ~~~~~~~~~~~   48 (288)
                      |---|.|++-+
T Consensus       119 y~~e~~~~~d~  129 (155)
T PF10873_consen  119 YDHEMEYPPDL  129 (155)
T ss_pred             ccccccccCCC
Confidence            44444444444


No 33 
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton]
Probab=40.32  E-value=1e+02  Score=33.05  Aligned_cols=23  Identities=35%  Similarity=0.541  Sum_probs=11.0

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCC
Q 046217           17 PEPPPPTRQYFEPQQPAPFPPLP   39 (288)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~~~~~~   39 (288)
                      ++|||+.+...-+++.++|.|.|
T Consensus       280 s~ppppap~p~~~~~~a~p~p~~  302 (830)
T KOG1923|consen  280 SGPPPPAPLPHTAQSDAAPSPLR  302 (830)
T ss_pred             CCCCCCCCCCCcccccCCCCCCC
Confidence            34444445444555555444433


No 34 
>PF13179 DUF4006:  Family of unknown function (DUF4006)
Probab=39.61  E-value=23  Score=26.20  Aligned_cols=27  Identities=19%  Similarity=0.342  Sum_probs=17.1

Q ss_pred             CCCcE---EehhHHHHHHHHHHHHHHheee
Q 046217           87 PPSSF---AILTTMMLFVFFTFIVGLVLMF  113 (288)
Q Consensus        87 ~r~~~---~l~~~li~lvlllgii~~lIlw  113 (288)
                      .|+.|   .+...+++.++++++++.|-+|
T Consensus         4 ~r~~f~LnGi~G~LIAvvLLLsIl~~lt~~   33 (66)
T PF13179_consen    4 NRSVFGLNGITGMLIAVVLLLSILAFLTYW   33 (66)
T ss_pred             ccceeeecchHhHHHHHHHHHHHHHHHHHH
Confidence            46666   6666677777777777344344


No 35 
>PF06024 DUF912:  Nucleopolyhedrovirus protein of unknown function (DUF912);  InterPro: IPR009261 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf78; it is a family of uncharacterised viral proteins.
Probab=38.81  E-value=40  Score=26.71  Aligned_cols=25  Identities=24%  Similarity=0.576  Sum_probs=12.4

Q ss_pred             EehhHHHHHHHHHHHHHHheeeEE-ee
Q 046217           92 AILTTMMLFVFFTFIVGLVLMFLI-AR  117 (288)
Q Consensus        92 ~l~~~li~lvlllgii~~lIlwLv-lr  117 (288)
                      +++.+++++++++.++ .+|.|.+ +|
T Consensus        63 iili~lls~v~IlVil-y~IyYFVILR   88 (101)
T PF06024_consen   63 IILISLLSFVCILVIL-YAIYYFVILR   88 (101)
T ss_pred             chHHHHHHHHHHHHHH-hhheEEEEEe
Confidence            4444555555444444 4444554 55


No 36 
>PF05170 AsmA:  AsmA family;  InterPro: IPR007844 The AsmA protein is involved in the assembly of outer membrane proteins in Escherichia coli []. AsmA mutations were isolated as extragenic suppressors of an OmpF assembly mutant []. AsmA may have a role in LPS biogenesis [].
Probab=36.51  E-value=1.8e+02  Score=29.64  Aligned_cols=95  Identities=11%  Similarity=0.153  Sum_probs=49.9

Q ss_pred             CCEEEEeeEEEeeeecCCCCceeEEEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeecCCCcceEecCCceEEEE
Q 046217          120 FPIFHVETLAVTNFSVANDDFTSAWEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFDHPTKDVTVKANEKGTID  199 (288)
Q Consensus       120 ~P~f~V~s~svs~fnvs~~~L~a~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp~p~~~F~q~~k~tt~v~  199 (288)
                      +-++++.++++.++.+++      +.+.+++  -+.+  +.-+.+.+.+ |+|. +-+.+.+. .       ......++
T Consensus       439 ~~~l~~~~l~~~~l~i~~------~~~~~~~--~~G~--l~l~~l~~~l-~~G~-~~~~~~ld-~-------~~~~~~~~  498 (604)
T PF05170_consen  439 DLTLSAGSLKANGLPISN------LKLQLKA--KDGL--LTLDPLSAKL-YGGS-LSGSASLD-A-------RQDPPQYS  498 (604)
T ss_pred             eEEEEhhheEECCceecc------cEEEEEe--cCCe--EEEeeeeEec-CCcE-EEEEEEEe-c-------cCCCccEE
Confidence            445666666666666552      3333333  3333  3445666666 6565 55666655 2       23345566


Q ss_pred             EEEEeccccccCCCCChHHHHHHHh-hhcCCeEEEEEEEEEE
Q 046217          200 IKLSTADYKQSVPAPGKRVMEEMRN-IRETGVVPFSLRAFAW  240 (288)
Q Consensus       200 v~l~~~~~~~~~~~l~~~~~~~l~~-d~~~G~V~~~v~v~~r  240 (288)
                      ...++.++     .+.+-. ..+.. +.-.|...+++.++++
T Consensus       499 ~~~~~~~v-----~l~~Ll-~~~~~~~~l~G~~~~~~~l~g~  534 (604)
T PF05170_consen  499 LNLNLRGV-----QLQPLL-QDLALPDPLSGTGDLNLDLTGQ  534 (604)
T ss_pred             EeeeeCCc-----chHHHH-hhhccccCceEEEEEEEEEEeC
Confidence            66667666     443321 22222 3346666666666553


No 37 
>PF04478 Mid2:  Mid2 like cell wall stress sensor;  InterPro: IPR007567 This family represents a region near the C terminus of Mid2, which contains a transmembrane region. The remainder of the protein sequence is serine-rich and of low complexity, and is therefore impossible to align accurately. Mid2 is thought to act as a mechanosensor of cell wall stress. The C-terminal cytoplasmic region of Mid2 is known to interact with Rom2, a guanine nucleotide exchange factor (GEF) for Rho1, which is part of the cell wall integrity signalling pathway [].
Probab=36.19  E-value=22  Score=30.56  Aligned_cols=24  Identities=17%  Similarity=0.044  Sum_probs=16.1

Q ss_pred             HHHHHHHHHHHheeeEEeecCCCEE
Q 046217           99 LFVFFTFIVGLVLMFLIARPKFPIF  123 (288)
Q Consensus        99 ~lvlllgii~~lIlwLvlrPk~P~f  123 (288)
                      ...+|++++ +||+|+.+|.+.=.|
T Consensus        59 Gg~ill~il-~lvf~~c~r~kktdf   82 (154)
T PF04478_consen   59 GGPILLGIL-ALVFIFCIRRKKTDF   82 (154)
T ss_pred             cHHHHHHHH-HhheeEEEecccCcc
Confidence            345555666 778888888887443


No 38 
>COG1580 FliL Flagellar basal body-associated protein [Cell motility and secretion]
Probab=36.11  E-value=1.4e+02  Score=25.83  Aligned_cols=8  Identities=0%  Similarity=0.015  Sum_probs=5.1

Q ss_pred             HheeeEEe
Q 046217          109 LVLMFLIA  116 (288)
Q Consensus       109 ~lIlwLvl  116 (288)
                      +.++|+..
T Consensus        35 ~~~~~~~~   42 (159)
T COG1580          35 GYFFWFGS   42 (159)
T ss_pred             HHHHhhhc
Confidence            66666665


No 39 
>PF09624 DUF2393:  Protein of unknown function (DUF2393);  InterPro: IPR013417  The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Anabaena sp. (strain PCC 7120), Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.
Probab=35.52  E-value=1.6e+02  Score=24.42  Aligned_cols=56  Identities=9%  Similarity=0.064  Sum_probs=33.6

Q ss_pred             HheeeEEeec--CCCEEEEeeEEEeeeecCCCCceeEEEEEEEEEeCCCeeeEEEcceEEEEEECC
Q 046217          109 LVLMFLIARP--KFPIFHVETLAVTNFSVANDDFTSAWEINLVVENRNFGANVHFDRIQSFIYFNT  172 (288)
Q Consensus       109 ~lIlwLvlrP--k~P~f~V~s~svs~fnvs~~~L~a~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g  172 (288)
                      .+++|.++.-  +.+..++.+..-  ++.     +-.+.+..+++|-.+ ..+..=.+++.+..++
T Consensus        34 ~~~~~~~l~~~~~~~~~~~~~~~~--l~~-----~~~~~v~g~V~N~g~-~~i~~c~i~~~l~~~~   91 (149)
T PF09624_consen   34 PFFGYYWLDKYLKKIELTLTSQKR--LQY-----SESFYVDGTVTNTGK-FTIKKCKITVKLYNDK   91 (149)
T ss_pred             HHHHHHHHhhhcCCceEEEeeeee--eee-----ccEEEEEEEEEECCC-CEeeEEEEEEEEEeCC
Confidence            4445555443  456666555443  333     346888899999887 5566656666666633


No 40 
>PF06092 DUF943:  Enterobacterial putative membrane protein (DUF943);  InterPro: IPR010351 This family consists of several hypothetical proteins from Escherichia coli, Yersinia pestis and Salmonella typhi.
Probab=34.88  E-value=24  Score=30.44  Aligned_cols=18  Identities=28%  Similarity=0.606  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHheeeEEeec
Q 046217          100 FVFFTFIVGLVLMFLIARP  118 (288)
Q Consensus       100 lvlllgii~~lIlwLvlrP  118 (288)
                      .+++++++ +.++|+.+||
T Consensus        11 ~l~l~~~~-~y~~W~~~rp   28 (157)
T PF06092_consen   11 ALFLLACI-LYFLWLTLRP   28 (157)
T ss_pred             HHHHHHHH-HHhhhhccCC
Confidence            34444545 4889999999


No 41 
>PF08113 CoxIIa:  Cytochrome c oxidase subunit IIa family;  InterPro: IPR012538 This family consists of the cytochrome c oxidase subunit IIa family. The bax-type cytochrome c oxidase from Thermus thermophilus is known as a two subunit enzyme. From its crystal structure, it was discovered that an additional transmembrane helix, subunit IIa, spans the membrane. This subunit consists of 34 residues forming one helix across the membrane. The presence of this subunit seems to be important for the function of cytochrome c oxidases [].; PDB: 2QPD_C 3QJR_C 3EH5_C 3BVD_C 3S39_C 3QJU_C 3QJS_C 4EV3_C 3QJT_C 4FA7_C ....
Probab=34.21  E-value=19  Score=23.10  Aligned_cols=15  Identities=7%  Similarity=0.248  Sum_probs=9.1

Q ss_pred             HHHHHHHHHHHheeeE
Q 046217           99 LFVFFTFIVGLVLMFL  114 (288)
Q Consensus        99 ~lvlllgii~~lIlwL  114 (288)
                      .++.+++++ +|++|+
T Consensus        10 ~vv~iLt~~-ILvFWf   24 (34)
T PF08113_consen   10 GVVMILTAF-ILVFWF   24 (34)
T ss_dssp             HHHHHHHHH-HHHHHH
T ss_pred             eeHHHHHHH-HHHHHH
Confidence            445555666 677775


No 42 
>PF07787 DUF1625:  Protein of unknown function (DUF1625);  InterPro: IPR012430 Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species. The region in question is approximately 250 residues long. 
Probab=34.20  E-value=28  Score=31.73  Aligned_cols=10  Identities=40%  Similarity=0.830  Sum_probs=8.4

Q ss_pred             HheeeEEeec
Q 046217          109 LVLMFLIARP  118 (288)
Q Consensus       109 ~lIlwLvlrP  118 (288)
                      +.+.|+.+||
T Consensus       239 Ia~aW~~yRP  248 (248)
T PF07787_consen  239 IALAWLFYRP  248 (248)
T ss_pred             HHHhheeeCc
Confidence            6778999987


No 43 
>PRK13159 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=33.82  E-value=1.4e+02  Score=25.82  Aligned_cols=37  Identities=14%  Similarity=0.373  Sum_probs=21.1

Q ss_pred             EEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeec
Q 046217          146 INLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFD  182 (288)
Q Consensus       146 ~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp  182 (288)
                      +.+.+.--+..+.+.|+-+-=.++=.|+..++++.+.
T Consensus        74 v~F~vtD~~~~v~V~Y~GilPDlFrEGqgVVaeG~~~  110 (155)
T PRK13159         74 VSFTVIDKNAATQVEYTGILPDLFRDNQSVIANGRMQ  110 (155)
T ss_pred             EEEEEEcCCcEEEEEEccCCCccccCCCeEEEEEEEc
Confidence            3334444455677777766555555555566665544


No 44 
>PRK05529 cell division protein FtsQ; Provisional
Probab=33.81  E-value=56  Score=30.07  Aligned_cols=43  Identities=9%  Similarity=0.089  Sum_probs=28.0

Q ss_pred             CCEEEEeeEEEeeeecCC--------------C--------------CceeEEEEEEEEEeCCCeeeEEEcc
Q 046217          120 FPIFHVETLAVTNFSVAN--------------D--------------DFTSAWEINLVVENRNFGANVHFDR  163 (288)
Q Consensus       120 ~P~f~V~s~svs~fnvs~--------------~--------------~L~a~l~~~Lsv~NPN~k~~i~Yd~  163 (288)
                      .|.|.|+++.|+.-...+              +              .+-.-=+++++-+.||. +.|+..+
T Consensus        58 Sp~~~v~~I~V~Gn~~vs~~eI~~~~~~~~g~~l~~vd~~~~~~~l~~~P~V~sa~V~r~~P~t-l~I~V~E  128 (255)
T PRK05529         58 SPLLALRSIEVAGNMRVKPQDIVAALRDQFGKPLPLVDPETVRKKLAAFPLIRSYSVESKPPGT-IVVRVVE  128 (255)
T ss_pred             CCceEEEEEEEECCccCCHHHHHHHhcccCCCcceeECHHHHHHHHhcCCCEeEEEEEEeCCCE-EEEEEEE
Confidence            589999999997643110              1              11222367788889998 7777654


No 45 
>PRK06531 yajC preprotein translocase subunit YajC; Validated
Probab=33.69  E-value=19  Score=29.34  Aligned_cols=10  Identities=30%  Similarity=0.683  Sum_probs=6.8

Q ss_pred             eeeEEeecCC
Q 046217          111 LMFLIARPKF  120 (288)
Q Consensus       111 IlwLvlrPk~  120 (288)
                      ++|+++||+.
T Consensus        15 i~yf~iRPQk   24 (113)
T PRK06531         15 LIFFMQRQQK   24 (113)
T ss_pred             HHHheechHH
Confidence            3466799964


No 46 
>PF01708 Gemini_mov:  Geminivirus putative movement protein ;  InterPro: IPR002621 This family consists of putative movement proteins from Maize streak virus and Wheat dwarf virus [].; GO: 0046740 spread of virus in host, cell to cell, 0016021 integral to membrane
Probab=33.09  E-value=30  Score=27.13  Aligned_cols=7  Identities=29%  Similarity=0.439  Sum_probs=3.4

Q ss_pred             CccccCC
Q 046217           72 DMYYAPP   78 (288)
Q Consensus        72 ~~y~~~~   78 (288)
                      ..||-|.
T Consensus        15 ~~y~~P~   21 (91)
T PF01708_consen   15 YSYQTPR   21 (91)
T ss_pred             ccccCCC
Confidence            4455554


No 47 
>PF01102 Glycophorin_A:  Glycophorin A;  InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=32.85  E-value=13  Score=30.85  Aligned_cols=26  Identities=23%  Similarity=0.184  Sum_probs=9.6

Q ss_pred             HHHHHHHHHheeeEEeec-CCCEEEEee
Q 046217          101 VFFTFIVGLVLMFLIARP-KFPIFHVET  127 (288)
Q Consensus       101 vlllgii~~lIlwLvlrP-k~P~f~V~s  127 (288)
                      +-++|+| +||+|++-|= |+....+..
T Consensus        75 aGvIg~I-lli~y~irR~~Kk~~~~~~p  101 (122)
T PF01102_consen   75 AGVIGII-LLISYCIRRLRKKSSSDVQP  101 (122)
T ss_dssp             HHHHHHH-HHHHHHHHHHS---------
T ss_pred             HHHHHHH-HHHHHHHHHHhccCCCCCCC
Confidence            3345556 6666776542 444444443


No 48 
>PF05568 ASFV_J13L:  African swine fever virus J13L protein;  InterPro: IPR008385 This family consists of several African swine fever virus (ASFV) j13L proteins [, , ].
Probab=32.38  E-value=23  Score=30.42  Aligned_cols=39  Identities=15%  Similarity=0.260  Sum_probs=20.3

Q ss_pred             EehhHHHHHHHHHHHHHHheeeEEeecCCCEEEEeeEEE
Q 046217           92 AILTTMMLFVFFTFIVGLVLMFLIARPKFPIFHVETLAV  130 (288)
Q Consensus        92 ~l~~~li~lvlllgii~~lIlwLvlrPk~P~f~V~s~sv  130 (288)
                      -+|.+|+.+++++.+|.++|.|..-|=|+---.++...+
T Consensus        30 hm~tILiaIvVliiiiivli~lcssRKkKaaAAi~eedi   68 (189)
T PF05568_consen   30 HMYTILIAIVVLIIIIIVLIYLCSSRKKKAAAAIEEEDI   68 (189)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHhhhhhhcc
Confidence            455555555544443427777776666544444444333


No 49 
>PF14927 Neurensin:  Neurensin
Probab=32.35  E-value=41  Score=28.51  Aligned_cols=22  Identities=23%  Similarity=0.238  Sum_probs=14.6

Q ss_pred             ehhHHHHHHHHHHHHHHheeeE
Q 046217           93 ILTTMMLFVFFTFIVGLVLMFL  114 (288)
Q Consensus        93 l~~~li~lvlllgii~~lIlwL  114 (288)
                      ++.++-++++++|++++.+-|+
T Consensus        46 V~~i~g~l~Ll~Gi~~l~vgY~   67 (140)
T PF14927_consen   46 VGFISGLLLLLLGIVALTVGYL   67 (140)
T ss_pred             HHHHHHHHHHHHHHHHHHhhcc
Confidence            3445556778888885666666


No 50 
>PF12877 DUF3827:  Domain of unknown function (DUF3827);  InterPro: IPR024606 The function of the proteins in this entry is not currently known, but one of the human proteins (Q9HCM3 from SWISSPROT) has been implicated in pilocytic astrocytomas [, , ]. In the majority of cases of pilocytic astrocytomas a tandem duplication produces an in-frame fusion of the gene encoding this protein and the BRAF oncogene. The resulting fusion protein has constitutive BRAF kinase activity and is capable of transforming cells. 
Probab=32.20  E-value=40  Score=35.34  Aligned_cols=37  Identities=16%  Similarity=0.373  Sum_probs=20.7

Q ss_pred             CCCcEEehhHHHHHHHHHHHHHHheeeEEeecCCCEEE
Q 046217           87 PPSSFAILTTMMLFVFFTFIVGLVLMFLIARPKFPIFH  124 (288)
Q Consensus        87 ~r~~~~l~~~li~lvlllgii~~lIlwLvlrPk~P~f~  124 (288)
                      ++..++|..+++-+++++.|| +++.|...|=++-.|.
T Consensus       266 ~~NlWII~gVlvPv~vV~~Ii-iil~~~LCRk~K~eFq  302 (684)
T PF12877_consen  266 PNNLWIIAGVLVPVLVVLLII-IILYWKLCRKNKLEFQ  302 (684)
T ss_pred             CCCeEEEehHhHHHHHHHHHH-HHHHHHHhcccccCCC
Confidence            334456666655555555555 6666666665544443


No 51 
>PF04046 PSP:  PSP;  InterPro: IPR006568 PSP is a proline-rich domain of unknown function found in spliceosome associated proteins.
Probab=31.50  E-value=25  Score=24.42  Aligned_cols=7  Identities=86%  Similarity=2.048  Sum_probs=4.6

Q ss_pred             CCCCCCC
Q 046217           49 GYPPGYR   55 (288)
Q Consensus        49 ~~~~~~~   55 (288)
                      ||||||+
T Consensus        31 G~PP~y~   37 (48)
T PF04046_consen   31 GYPPGYP   37 (48)
T ss_pred             CCCCCCc
Confidence            6666664


No 52 
>PRK07021 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=31.15  E-value=1.3e+02  Score=25.67  Aligned_cols=14  Identities=0%  Similarity=-0.190  Sum_probs=7.4

Q ss_pred             EEEcceEEEEEECC
Q 046217          159 VHFDRIQSFIYFNT  172 (288)
Q Consensus       159 i~Yd~~~~~v~Y~g  172 (288)
                      .+|=.+++.+-+.+
T Consensus        78 ~rylkv~i~L~~~~   91 (162)
T PRK07021         78 DRVLYVGLTLRLPD   91 (162)
T ss_pred             ceEEEEEEEEEECC
Confidence            45555555555543


No 53 
>PHA03164 hypothetical protein; Provisional
Probab=30.37  E-value=44  Score=25.52  Aligned_cols=21  Identities=19%  Similarity=0.396  Sum_probs=10.8

Q ss_pred             CCCcE-EehhHHHHHHHHHHHH
Q 046217           87 PPSSF-AILTTMMLFVFFTFIV  107 (288)
Q Consensus        87 ~r~~~-~l~~~li~lvlllgii  107 (288)
                      +|+.| .+..+.+++..|++++
T Consensus        54 rRktftFlvLtgLaIamILfii   75 (88)
T PHA03164         54 RRKTFTFLVLTGLAIAMILFII   75 (88)
T ss_pred             hhheeehHHHHHHHHHHHHHHH
Confidence            45566 4444444555555554


No 54 
>COG4698 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.23  E-value=41  Score=29.82  Aligned_cols=22  Identities=14%  Similarity=0.333  Sum_probs=15.6

Q ss_pred             HheeeEEeecCCCEEEEeeEEE
Q 046217          109 LVLMFLIARPKFPIFHVETLAV  130 (288)
Q Consensus       109 ~lIlwLvlrPk~P~f~V~s~sv  130 (288)
                      ++|+..++.|+.+..++.+.+=
T Consensus        27 ~~i~~~vlsp~ee~t~~~~a~~   48 (197)
T COG4698          27 VLIALFVLSPREEPTHLEDASE   48 (197)
T ss_pred             HHhheeeccCCCCCchhhccCc
Confidence            5666678899987777666543


No 55 
>PF11837 DUF3357:  Domain of unknown function (DUF3357);  InterPro: IPR021792  This entry represents the N-terminal domain of beta-fructofuranosidase, whcih is involved in the hydrolysis of terminal non-reducing beta-D-fructofuranoside residues in beta-D-fructofuranosides. ; GO: 0004564 beta-fructofuranosidase activity, 0004575 sucrose alpha-glucosidase activity; PDB: 3UGG_A 3UGH_B 3UGF_B.
Probab=29.30  E-value=18  Score=28.95  Aligned_cols=24  Identities=8%  Similarity=0.264  Sum_probs=0.0

Q ss_pred             CCCcE-EehhHHHHHHHHHHHHHHhe
Q 046217           87 PPSSF-AILTTMMLFVFFTFIVGLVL  111 (288)
Q Consensus        87 ~r~~~-~l~~~li~lvlllgii~~lI  111 (288)
                      +|+.+ |+.++++.+++|+.++ +||
T Consensus        24 ~rR~~k~~~~i~~s~~~ll~lv-al~   48 (106)
T PF11837_consen   24 RRRPLKCLAAIFSSLLFLLSLV-ALI   48 (106)
T ss_dssp             --------------------------
T ss_pred             cCCcchhHHHHHHHHHHHHHHH-HHH
Confidence            34434 6666666666666555 444


No 56 
>PRK13254 cytochrome c-type biogenesis protein CcmE; Reviewed
Probab=28.67  E-value=1.8e+02  Score=24.77  Aligned_cols=39  Identities=15%  Similarity=0.271  Sum_probs=24.4

Q ss_pred             EEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeec
Q 046217          144 WEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFD  182 (288)
Q Consensus       144 l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp  182 (288)
                      ..+++.+.--+..+.+.|+..-=.++=.|+..+..+.+.
T Consensus        71 ~~~~F~ltD~~~~i~V~Y~G~lPd~F~eg~~VVv~G~~~  109 (148)
T PRK13254         71 LTVRFVVTDGNATVPVVYTGILPDLFREGQGVVAEGRLQ  109 (148)
T ss_pred             CEEEEEEEeCCeEEEEEECCCCCccccCCCEEEEEEEEC
Confidence            344455555567788888766555555555577777765


No 57 
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=28.37  E-value=33  Score=27.02  Aligned_cols=15  Identities=20%  Similarity=0.226  Sum_probs=7.0

Q ss_pred             HHHHHHHHheeeE-Eee
Q 046217          102 FFTFIVGLVLMFL-IAR  117 (288)
Q Consensus       102 lllgii~~lIlwL-vlr  117 (288)
                      ++.+|+ .+++|+ ++|
T Consensus        78 ~v~~lv-~~l~w~f~~r   93 (96)
T PTZ00382         78 VVGGLV-GFLCWWFVCR   93 (96)
T ss_pred             HHHHHH-HHHhheeEEe
Confidence            333445 444455 455


No 58 
>PF02009 Rifin_STEVOR:  Rifin/stevor family;  InterPro: IPR002858 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see []. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens [].
Probab=28.32  E-value=20  Score=34.14  Aligned_cols=11  Identities=36%  Similarity=1.029  Sum_probs=7.9

Q ss_pred             HHHHheeeEEee
Q 046217          106 IVGLVLMFLIAR  117 (288)
Q Consensus       106 ii~~lIlwLvlr  117 (288)
                      ++ .+|+||+||
T Consensus       270 LI-MvIIYLILR  280 (299)
T PF02009_consen  270 LI-MVIIYLILR  280 (299)
T ss_pred             HH-HHHHHHHHH
Confidence            45 678888875


No 59 
>COG5353 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=28.31  E-value=45  Score=28.69  Aligned_cols=23  Identities=9%  Similarity=0.410  Sum_probs=15.4

Q ss_pred             HHHHHHHHHHHHHHheeeEEeecC
Q 046217           96 TMMLFVFFTFIVGLVLMFLIARPK  119 (288)
Q Consensus        96 ~li~lvlllgii~~lIlwLvlrPk  119 (288)
                      ++++|+.|++.+ ++++|....|.
T Consensus        12 i~viflai~~s~-~~~~~~s~~P~   34 (161)
T COG5353          12 ILVIFLAIILSI-ALFFWKSMKPY   34 (161)
T ss_pred             HHHHHHHHHHHH-HHHHhHhcCcc
Confidence            334455555556 88899988884


No 60 
>KOG2375 consensus Protein interacting with poly(A)-binding protein [RNA processing and modification]
Probab=27.86  E-value=1.4e+02  Score=32.03  Aligned_cols=28  Identities=32%  Similarity=0.478  Sum_probs=14.6

Q ss_pred             CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q 046217           22 PTRQYFEPQQPAPFPPLPPAMGYPPAMGYP   51 (288)
Q Consensus        22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   51 (288)
                      +.++++-+.+..+++.  ++|+|.|+||-+
T Consensus       558 ~~~~tp~q~~~~~~~~--~~~~~~P~~g~~  585 (756)
T KOG2375|consen  558 PMAMTPFQNSQHNPPG--PVPGYNPQMGNP  585 (756)
T ss_pred             cCCCCCCccccCCCCC--CCCCCCcccCCC
Confidence            3344444444443333  557777777666


No 61 
>PF11395 DUF2873:  Protein of unknown function (DUF2873);  InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=27.50  E-value=24  Score=23.25  Aligned_cols=9  Identities=11%  Similarity=0.538  Sum_probs=5.0

Q ss_pred             HheeeEEee
Q 046217          109 LVLMFLIAR  117 (288)
Q Consensus       109 ~lIlwLvlr  117 (288)
                      ++|+|.++.
T Consensus        25 liif~f~le   33 (43)
T PF11395_consen   25 LIIFWFSLE   33 (43)
T ss_pred             HHHHHHHHh
Confidence            456666543


No 62 
>PF12606 RELT:  Tumour necrosis factor receptor superfamily member 19;  InterPro: IPR022248 The members of tumor necrosis factor receptor (TNFR) superfamily have been designated as the "guardians of the immune system" due to their roles in immune cell proliferation, differentiation, activation, and death (apoptosis).  RELT (receptor expressed in lymphoid tissues) is a member of the TNFR superfamily. The messenger RNA of RELT is especially abundant in hematologic tissues such as spleen, lymph node, and peripheral blood leukocytes as well as in leukemias and lymphomas. RELT is able to activate the NF-kappaB pathway and selectively binds tumor necrosis factor receptor-associated factor 1 []. RELT like proteins 1 and 2 (RELL1 and RELL2) are two RELT homologues that bind to RELT. The expression of RELL1 at the mRNA level is ubiquitous, whereas expression of RELL2 mRNA is more restricted to particular tissues [].
Probab=27.26  E-value=25  Score=24.57  Aligned_cols=30  Identities=3%  Similarity=-0.088  Sum_probs=16.6

Q ss_pred             HHHHHHHHHHHHHHheeeEEeecCCCEEEEe
Q 046217           96 TMMLFVFFTFIVGLVLMFLIARPKFPIFHVE  126 (288)
Q Consensus        96 ~li~lvlllgii~~lIlwLvlrPk~P~f~V~  126 (288)
                      +++.+++++|++ .+.+|.+++=+.=+.+++
T Consensus         5 ~iV~i~iv~~lL-g~~I~~~~K~ygYkht~d   34 (50)
T PF12606_consen    5 LIVSIFIVMGLL-GLSICTTLKAYGYKHTVD   34 (50)
T ss_pred             HHHHHHHHHHHH-HHHHHHHhhccccccccC
Confidence            445556666666 555666666554444443


No 63 
>PF06667 PspB:  Phage shock protein B;  InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=26.86  E-value=21  Score=27.06  Aligned_cols=9  Identities=22%  Similarity=0.486  Sum_probs=5.1

Q ss_pred             HheeeEEee
Q 046217          109 LVLMFLIAR  117 (288)
Q Consensus       109 ~lIlwLvlr  117 (288)
                      +..+||+++
T Consensus        17 Vap~WL~lH   25 (75)
T PF06667_consen   17 VAPIWLILH   25 (75)
T ss_pred             HHHHHHHHH
Confidence            555666643


No 64 
>PF06072 Herpes_US9:  Alphaherpesvirus tegument protein US9;  InterPro: IPR009278 This family consists of several US9 and related proteins from the Alphaherpesviruses. The function of the US9 protein is unknown although in Bovine herpesvirus 5 Us9 is essential for the anterograde spread of the virus from the olfactory mucosa to the bulb [].; GO: 0019033 viral tegument
Probab=26.70  E-value=14  Score=26.75  Aligned_cols=11  Identities=9%  Similarity=0.111  Sum_probs=5.8

Q ss_pred             HHHHHHheeeE
Q 046217          104 TFIVGLVLMFL  114 (288)
Q Consensus       104 lgii~~lIlwL  114 (288)
                      .+++++++.|+
T Consensus        49 S~~lG~~~~~~   59 (60)
T PF06072_consen   49 SGGLGALVAWH   59 (60)
T ss_pred             HHHHHHHhhcc
Confidence            33445666665


No 65 
>PF11906 DUF3426:  Protein of unknown function (DUF3426);  InterPro: IPR021834  This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 262 to 463 amino acids in length. 
Probab=26.26  E-value=3.6e+02  Score=22.02  Aligned_cols=59  Identities=12%  Similarity=0.088  Sum_probs=38.9

Q ss_pred             EEEeeEEEeeeecCC-CCceeEEEEEEEEEeCCCeeeEEEcceEEEEEECCceeeeEeeec
Q 046217          123 FHVETLAVTNFSVAN-DDFTSAWEINLVVENRNFGANVHFDRIQSFIYFNTDDLLASSHFD  182 (288)
Q Consensus       123 f~V~s~svs~fnvs~-~~L~a~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y~g~~~lg~a~vp  182 (288)
                      -.++.+++.+..+.. ..-+-.+.++.+++|... ....|-.++++++=.++..+++-.+.
T Consensus        48 ~~~~~l~i~~~~~~~~~~~~~~l~v~g~i~N~~~-~~~~~P~l~l~L~D~~g~~l~~r~~~  107 (149)
T PF11906_consen   48 RDIDALKIESSDLRPVPDGPGVLVVSGTIRNRAD-FPQALPALELSLLDAQGQPLARRVFT  107 (149)
T ss_pred             cCcceEEEeeeeEEeecCCCCEEEEEEEEEeCCC-CcccCceEEEEEECCCCCEEEEEEEC
Confidence            355555554443332 134677899999999988 77888899888873333367666663


No 66 
>PF14155 DUF4307:  Domain of unknown function (DUF4307)
Probab=26.03  E-value=3.5e+02  Score=21.70  Aligned_cols=38  Identities=24%  Similarity=0.255  Sum_probs=16.6

Q ss_pred             HheeeEEee-cCCCEEEEeeEEEeeeecCCCCceeEEEEEEEEEe
Q 046217          109 LVLMFLIAR-PKFPIFHVETLAVTNFSVANDDFTSAWEINLVVEN  152 (288)
Q Consensus       109 ~lIlwLvlr-Pk~P~f~V~s~svs~fnvs~~~L~a~l~~~Lsv~N  152 (288)
                      +++.|+.+. -..+.++   ..+..|.+.+.   .++++++.+.-
T Consensus        19 ~~~~w~~~~~~~~~~v~---~~~~gf~vv~d---~~v~v~f~Vtr   57 (112)
T PF14155_consen   19 AVVAWFGYSQFGSPPVS---AEVIGFEVVDD---STVEVTFDVTR   57 (112)
T ss_pred             HHHhHhhhhhccCCCce---EEEEEEEECCC---CEEEEEEEEEE
Confidence            445555554 3444443   33444444321   23444444443


No 67 
>PF15012 DUF4519:  Domain of unknown function (DUF4519)
Probab=26.03  E-value=73  Score=22.85  Aligned_cols=11  Identities=36%  Similarity=0.951  Sum_probs=8.4

Q ss_pred             HheeeEEeecC
Q 046217          109 LVLMFLIARPK  119 (288)
Q Consensus       109 ~lIlwLvlrPk  119 (288)
                      ++++|+.-||+
T Consensus        46 vv~vy~kTRP~   56 (56)
T PF15012_consen   46 VVFVYLKTRPR   56 (56)
T ss_pred             hheeEEeccCC
Confidence            67788888875


No 68 
>COG1589 FtsQ Cell division septal protein [Cell envelope biogenesis, outer membrane]
Probab=25.95  E-value=57  Score=30.10  Aligned_cols=33  Identities=18%  Similarity=0.550  Sum_probs=25.2

Q ss_pred             HHHHHHHHHHheeeEEeecCCCEEEEeeEEEeee
Q 046217          100 FVFFTFIVGLVLMFLIARPKFPIFHVETLAVTNF  133 (288)
Q Consensus       100 lvlllgii~~lIlwLvlrPk~P~f~V~s~svs~f  133 (288)
                      +++++++. ++++|...-++.|.|.+..+.|++=
T Consensus        38 ~~~~~~~~-~~~~~~~~~~~~~~~~i~~v~v~Gn   70 (269)
T COG1589          38 YLVLLLLV-LVVLWVLILLSLPYFPIRKVSVSGN   70 (269)
T ss_pred             HHHHHHHH-HHHHheehhhhcCCccceEEEEecC
Confidence            34444445 6788888899999999999999764


No 69 
>KOG0119 consensus Splicing factor 1/branch point binding protein (RRM superfamily) [RNA processing and modification]
Probab=25.49  E-value=1.8e+02  Score=29.84  Aligned_cols=17  Identities=47%  Similarity=1.001  Sum_probs=12.1

Q ss_pred             CCCCCC--CCCCCCCCCCC
Q 046217           39 PPAMGY--PPAMGYPPGYR   55 (288)
Q Consensus        39 ~~~~~~--~~~~~~~~~~~   55 (288)
                      +|+++|  |++.|||+|-|
T Consensus       440 ~Pp~~~~~p~~~~~ppg~p  458 (554)
T KOG0119|consen  440 YPPTSYAPPPQSGQPPGIP  458 (554)
T ss_pred             CCCccCCCCCccCCCCCCC
Confidence            477778  77888876644


No 70 
>PF06637 PV-1:  PV-1 protein (PLVAP);  InterPro: IPR009538 This family consists of several PV-1 (PLVAP) proteins, which seem to be specific to mammals. PV-1 is a novel protein component of the endothelial fenestral and stomatal diaphragms []. The function of this family is unknown.
Probab=24.86  E-value=43  Score=33.02  Aligned_cols=15  Identities=7%  Similarity=0.222  Sum_probs=8.7

Q ss_pred             CceeEEEEEEEEEeC
Q 046217          139 DFTSAWEINLVVENR  153 (288)
Q Consensus       139 ~L~a~l~~~Lsv~NP  153 (288)
                      .|+..++++...++-
T Consensus        83 NLtkeLN~t~~~K~~   97 (442)
T PF06637_consen   83 NLTKELNLTTRAKDA   97 (442)
T ss_pred             HHHHHhchhHHHHHH
Confidence            455566666666554


No 71 
>PF10164 DUF2367:  Uncharacterized conserved protein (DUF2367);  InterPro: IPR019317  This is a highly conserved set of proteins which contains three pairs of cysteine residues within a length of 42 amino acids and is rich in proline residues towards the N terminus. It includes a membrane protein that has been found to be highly expressed in the mouse brain and consequently, several members have been assigned as brain protein i3 (Bri3). Their function is unknown.
Probab=24.50  E-value=1.8e+02  Score=23.18  Aligned_cols=19  Identities=26%  Similarity=0.252  Sum_probs=9.4

Q ss_pred             CcE-EehhHHHHHHHHHHHH
Q 046217           89 SSF-AILTTMMLFVFFTFIV  107 (288)
Q Consensus        89 ~~~-~l~~~li~lvlllgii  107 (288)
                      ..| |+-+++.++++=+||+
T Consensus        62 ~~ft~~Gi~~AI~fFPiGil   81 (98)
T PF10164_consen   62 DSFTCCGILCAIFFFPIGIL   81 (98)
T ss_pred             ccccHHHHHHHHHHHhhHHH
Confidence            344 4434444455556665


No 72 
>PF02468 PsbN:  Photosystem II reaction centre N protein (psbN);  InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection [].   This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=24.39  E-value=81  Score=21.41  Aligned_cols=23  Identities=9%  Similarity=0.171  Sum_probs=16.6

Q ss_pred             hHHHHHHHHHHHHHHheeeEEeec
Q 046217           95 TTMMLFVFFTFIVGLVLMFLIARP  118 (288)
Q Consensus        95 ~~li~lvlllgii~~lIlwLvlrP  118 (288)
                      .++.+.+++++++ ..-+|..+.|
T Consensus         7 ~~i~i~~~lv~~T-gy~iYtaFGp   29 (43)
T PF02468_consen    7 LAIFISCLLVSIT-GYAIYTAFGP   29 (43)
T ss_pred             HHHHHHHHHHHHH-hhhhhheeCC
Confidence            3445556677778 7888988876


No 73 
>KOG0810 consensus SNARE protein Syntaxin 1 and related proteins [Intracellular trafficking, secretion, and vesicular transport]
Probab=23.81  E-value=53  Score=31.20  Aligned_cols=8  Identities=13%  Similarity=0.538  Sum_probs=3.3

Q ss_pred             EehhHHHH
Q 046217           92 AILTTMML   99 (288)
Q Consensus        92 ~l~~~li~   99 (288)
                      |+|+++++
T Consensus       272 ~i~ii~~i  279 (297)
T KOG0810|consen  272 IIIIIILI  279 (297)
T ss_pred             eeeehHHH
Confidence            44444333


No 74 
>cd01324 cbb3_Oxidase_CcoQ Cytochrome cbb oxidase CcoQ.  Cytochrome cbb3 oxidase, the terminal oxidase in the respiratory chains of proteobacteria, is a multi-chain transmembrane protein located in the cell membrane. Like other cytochrome oxidases, it catalyzes the reduction of O2 and simultaneously pumps protons across the membrane.  Found exclusively in proteobacteria, cbb3 is believed to be a modern enzyme that has evolved independently to perform a specialized function in microaerobic energy metabolism. The cbb3 operon contains four genes (ccoNOQP or fixNOQP), with ccoN coding for subunit I.  Instead of a CuA-containing subunit II analogous to other cytochrome oxidases, cbb3 utilizes subunits ccoO and ccoP, which contain one and two hemes, respectively, to transfer electrons to the binuclear center.  ccoQ, the fourth subunit, is a single transmembrane helix protein.  It has been shown to protect the core complex from proteolytic degradation by serine proteases.  See cd00919, cd01322
Probab=23.52  E-value=38  Score=23.28  Aligned_cols=12  Identities=25%  Similarity=0.235  Sum_probs=7.1

Q ss_pred             HheeeEEeecCC
Q 046217          109 LVLMFLIARPKF  120 (288)
Q Consensus       109 ~lIlwLvlrPk~  120 (288)
                      +.|++.+++|+.
T Consensus        24 igiv~wa~~p~~   35 (48)
T cd01324          24 LGVVVWAFRPGR   35 (48)
T ss_pred             HHHHHHHhCCCc
Confidence            445556777753


No 75 
>PF12355 Dscam_C:  Down syndrome cell adhesion molecule C terminal ;  InterPro: IPR021012  This entry is specific to the insecta, predominantly Drosophila spp. This entry is found in association with PF00047 from PFAM, PF07679 from PFAM and PF00041 from PFAM. The Down syndrome cell adhesion molecule (Dscam) belongs to a family of cell membrane molecules involved in the differentiation of the nervous system. This is the C-terminal cytoplasmic tail region of Dscam. In Drosophila melanogaster (Fruit fly) the gene has at least 59 different transcripts. Dscam may play a role in the nervous and immune systems [].
Probab=23.44  E-value=54  Score=26.74  Aligned_cols=10  Identities=60%  Similarity=1.271  Sum_probs=5.4

Q ss_pred             CCCCCCCCCC
Q 046217           15 EIPEPPPPTR   24 (288)
Q Consensus        15 ~~~~~~~~~~   24 (288)
                      ++||||||+|
T Consensus        61 gSPePPpPPP   70 (124)
T PF12355_consen   61 GSPEPPPPPP   70 (124)
T ss_pred             CCCCCCCcCC
Confidence            3466665544


No 76 
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton]
Probab=23.07  E-value=2.7e+02  Score=30.06  Aligned_cols=8  Identities=38%  Similarity=0.737  Sum_probs=3.7

Q ss_pred             eeeEEeec
Q 046217          111 LMFLIARP  118 (288)
Q Consensus       111 IlwLvlrP  118 (288)
                      +=|+.++|
T Consensus       374 lnW~alKP  381 (830)
T KOG1923|consen  374 LNWLALKP  381 (830)
T ss_pred             ccccccCc
Confidence            34444444


No 77 
>KOG2322 consensus N-methyl-D-aspartate receptor glutamate-binding subunit [Signal transduction mechanisms]
Probab=22.70  E-value=1.1e+02  Score=28.27  Aligned_cols=16  Identities=6%  Similarity=0.181  Sum_probs=6.5

Q ss_pred             CCcE-EehhHHHHHHHH
Q 046217           88 PSSF-AILTTMMLFVFF  103 (288)
Q Consensus        88 r~~~-~l~~~li~lvll  103 (288)
                      |..| |=...++.+=++
T Consensus        44 R~~FiRKVYsIl~~QLl   60 (237)
T KOG2322|consen   44 RWGFIRKVYSILSIQLL   60 (237)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            3444 443444443333


No 78 
>PHA03269 envelope glycoprotein C; Provisional
Probab=22.56  E-value=1.7e+02  Score=29.96  Aligned_cols=21  Identities=10%  Similarity=0.135  Sum_probs=10.7

Q ss_pred             HHHHHHHHHHHHHHheeeEEee
Q 046217           96 TMMLFVFFTFIVGLVLMFLIAR  117 (288)
Q Consensus        96 ~li~lvlllgii~~lIlwLvlr  117 (288)
                      +++++++++|++ ++|.-+.|+
T Consensus       543 gV~~~~~algli-ilItAlCfy  563 (566)
T PHA03269        543 GIISIIGILGLI-ALIATLCIR  563 (566)
T ss_pred             HHHHHHHHHHHH-HhheeEEEE
Confidence            333444455665 566655543


No 79 
>PF04881 Adeno_GP19K:  Adenovirus GP19K;  InterPro: IPR006965 This 19 kDa glycoprotein binds the major histocompatibility (MHC) class I antigens in the endoplasmic reticulum (ER). The ER retention signal at the C terminus of Gp19K causes retention of the complex in the ER, preventing lysis of the cell by cytotoxic T-lymphocytes [].; GO: 0005537 mannose binding, 0050690 regulation of defense response to virus by virus
Probab=22.39  E-value=57  Score=27.34  Aligned_cols=25  Identities=12%  Similarity=0.317  Sum_probs=13.9

Q ss_pred             EehhHHHHHHHHHHHHHHheeeEEeecCC
Q 046217           92 AILTTMMLFVFFTFIVGLVLMFLIARPKF  120 (288)
Q Consensus        92 ~l~~~li~lvlllgii~~lIlwLvlrPk~  120 (288)
                      |+|++++..++++++    .+++.+||+.
T Consensus       106 ~~~aclit~l~~~~i----~~~i~~kpR~  130 (139)
T PF04881_consen  106 CICACLITALLCVCI----HLLIKIKPRN  130 (139)
T ss_pred             HHHHHHHHHHHHHHH----hhheeecccc
Confidence            555555555444433    3445688874


No 80 
>PF05545 FixQ:  Cbb3-type cytochrome oxidase component FixQ;  InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=21.91  E-value=34  Score=23.24  Aligned_cols=12  Identities=17%  Similarity=0.365  Sum_probs=6.8

Q ss_pred             HheeeEEeecCC
Q 046217          109 LVLMFLIARPKF  120 (288)
Q Consensus       109 ~lIlwLvlrPk~  120 (288)
                      +.|+|.+++|++
T Consensus        23 ~gi~~w~~~~~~   34 (49)
T PF05545_consen   23 IGIVIWAYRPRN   34 (49)
T ss_pred             HHHHHHHHcccc
Confidence            344455667764


No 81 
>PF06835 LptC:  Lipopolysaccharide-assembly, LptC-related;  InterPro: IPR010664 This family consists of several related groups of proteins one of which is the LptC family. LptC is involved in lipopolysaccharide-assembly on the outer membrane of Gram-negative organisms.  The cell envelope of Gram-negative bacteria consists of an inner (IM) and an outer membrane (OM) separated by an aqueous compartment, the periplasm, which contains the peptidoglycan layer. The OM is an asymmetric bilayer, with phospholipids in the inner leaflet and lipopolysaccharides (LPS) facing outward [, ]. The OM is an effective permeability barrier that protects the cells from toxic compounds, such as antibiotics and detergents, thus allowing bacteria to inhabit several different and often hostile environments. LPS is responsible for the permeability properties of the OM. LPS consists of the lipid A moiety (a glucosamine-based phospholipid) linked to the short core oligosaccharide and the distal O-antigen polysaccharide chain. The core oligosaccharide can be further divided into an inner core, composed of 3-deoxy-D-mannooctulosanate (KDO) and heptose, and an outer core, which has a somewhat variable structure. LPS is essential in most Gram-negative bacteria, with the notable exception of Neisseria meningitidis. The biogenesis of the OM implies that the individual components are transported from the site of synthesis to their final destination outside the IM by crossing both hydrophilic and hydrophobic compartments. The machinery and the energy source that drive this process are not yet fully understood. The lipid A-core moiety and the O-antigen repeat units are synthesized at the cytoplasmic face of the IM and are separately exported via two independent transport systems, namely, the O-antigen transporter Wzx (RfbX) [, ] and the ATP binding cassette (ABC) transporter MsbA that flips the lipid A-core moiety from the inner leaflet to the outer leaflet of the IM [, , ]. O-antigen repeat units are then polymerised in the periplasm by the Wzy polymerase and ligated to the lipid A-core moiety by the WaaL ligase [see, , ]. The LPS transport machinery is composed of LptA, LptB, LptC, LptD, LptE. This supported by the fact, that depletion of any of one of these proteins blocks the LPS assembly pathway and results in very similar OM biogenesis defects. Moreover, the location of at least one of these five proteins in every cellular compartment suggests a model for how the LPS assembly pathway is organised and ordered in space []. Required for the translocation of lipopolysaccharide (LPS) from the inner membrane to the outer membrane [].; PDB: 3MY2_A.
Probab=21.87  E-value=4e+02  Score=21.72  Aligned_cols=49  Identities=14%  Similarity=0.195  Sum_probs=14.2

Q ss_pred             CCEEEEeeEEEeeeecCCCCceeEEEEEEEEEeCCCeeeEEEcceEEEEEE
Q 046217          120 FPIFHVETLAVTNFSVANDDFTSAWEINLVVENRNFGANVHFDRIQSFIYF  170 (288)
Q Consensus       120 ~P~f~V~s~svs~fnvs~~~L~a~l~~~Lsv~NPN~k~~i~Yd~~~~~v~Y  170 (288)
                      .|.+.++++++..++-. +.+...++..=..+++|.+. ++.+...+.++-
T Consensus        32 ~~~~~~~~~~~~~~~~~-G~~~~~l~A~~~~~~~~~~~-~~l~~p~~~~~~   80 (176)
T PF06835_consen   32 DPDYSIENFTLTQYDED-GKLQWKLTAERAEHYPNSDT-VELEDPSLIIYD   80 (176)
T ss_dssp             ----------------------EEEE-SSEEEETTTTE-EEEES-EEEEE-
T ss_pred             CCcEEEEeeEEEEECCC-CCEEEEEEEeEEEEecCCCc-EEEeccEEEEEe
Confidence            34555555555544322 23555555555567776643 455555555544


No 82 
>PHA03271 envelope glycoprotein C; Provisional
Probab=21.55  E-value=2e+02  Score=28.90  Aligned_cols=17  Identities=12%  Similarity=0.022  Sum_probs=8.2

Q ss_pred             HHHHHHHHHHHheeeEEe
Q 046217           99 LFVFFTFIVGLVLMFLIA  116 (288)
Q Consensus        99 ~lvlllgii~~lIlwLvl  116 (288)
                      .+.++++++ ++|.-|.|
T Consensus       464 g~avlla~~-ilItALCf  480 (490)
T PHA03271        464 GLALILGMG-IILTALCF  480 (490)
T ss_pred             HHHHHHHHH-HhheeEEE
Confidence            333344444 55555554


No 83 
>KOG3294 consensus WW domain binding protein WBP-2, contains GRAM domain [Signal transduction mechanisms]
Probab=21.44  E-value=3.3e+02  Score=25.37  Aligned_cols=11  Identities=55%  Similarity=1.289  Sum_probs=5.5

Q ss_pred             CCCCCCCCCCC
Q 046217           16 IPEPPPPTRQY   26 (288)
Q Consensus        16 ~~~~~~~~~~~   26 (288)
                      -+++||++-.|
T Consensus       172 y~~ppp~P~~y  182 (261)
T KOG3294|consen  172 YPVPPPPPNGY  182 (261)
T ss_pred             CCCCCCCcccc
Confidence            34555555555


No 84 
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=21.28  E-value=91  Score=21.48  Aligned_cols=22  Identities=5%  Similarity=0.164  Sum_probs=16.0

Q ss_pred             HHHHHHHHHHHHHHheeeEEeec
Q 046217           96 TMMLFVFFTFIVGLVLMFLIARP  118 (288)
Q Consensus        96 ~li~lvlllgii~~lIlwLvlrP  118 (288)
                      ++.+..++++++ ..-+|..+-|
T Consensus        11 ~i~i~~lL~~~T-gyaiYtaFGp   32 (46)
T PRK13183         11 AITILAILLALT-GFGIYTAFGP   32 (46)
T ss_pred             HHHHHHHHHHHh-hheeeeccCC
Confidence            444556677777 8888888877


No 85 
>KOG0811 consensus SNARE protein PEP12/VAM3/Syntaxin 7/Syntaxin 17 [Intracellular trafficking, secretion, and vesicular transport]
Probab=21.17  E-value=43  Score=31.37  Aligned_cols=23  Identities=13%  Similarity=0.025  Sum_probs=10.3

Q ss_pred             hhHHHHHHHHHHHHHHheeeEEee
Q 046217           94 LTTMMLFVFFTFIVGLVLMFLIAR  117 (288)
Q Consensus        94 ~~~li~lvlllgii~~lIlwLvlr  117 (288)
                      |+.++++++++.++ .+++|++++
T Consensus       246 ~~ll~v~~~v~lii-~l~i~~~~~  268 (269)
T KOG0811|consen  246 CILLLVGGPVGLII-GLIIAGIAA  268 (269)
T ss_pred             hhhhHHHHHHHHHH-HHHHHHhhc
Confidence            33333333333334 566665543


No 86 
>PF04790 Sarcoglycan_1:  Sarcoglycan complex subunit protein;  InterPro: IPR006875 The dystrophin glycoprotein complex (DGC) is a membrane-spanning complex that links the interior cytoskeleton to the extracellular matrix in muscle. The sarcoglycan complex is a subcomplex within the DGC and is composed of several muscle-specific, transmembrane proteins (alpha-, beta-, gamma-, delta- and zeta-sarcoglycan). The sarcoglycans are asparagine-linked glycosylated proteins with single transmembrane domains. This family contains beta, gamma and delta members [, ].; GO: 0007010 cytoskeleton organization, 0016012 sarcoglycan complex, 0016021 integral to membrane
Probab=20.90  E-value=45  Score=31.10  Aligned_cols=18  Identities=22%  Similarity=0.183  Sum_probs=12.3

Q ss_pred             CceeEEEEEEEEEeCCCe
Q 046217          139 DFTSAWEINLVVENRNFG  156 (288)
Q Consensus       139 ~L~a~l~~~Lsv~NPN~k  156 (288)
                      .+..+=++++.++|.|.+
T Consensus        83 ~i~s~~~v~~~~r~~~g~  100 (264)
T PF04790_consen   83 VIQSSRNVTLNARNENGS  100 (264)
T ss_pred             EEEecCceEEEEecCCCc
Confidence            344555677788888876


No 87 
>CHL00020 psbN photosystem II protein N
Probab=20.68  E-value=1e+02  Score=20.94  Aligned_cols=22  Identities=0%  Similarity=-0.010  Sum_probs=15.7

Q ss_pred             HHHHHHHHHHHHHHheeeEEeec
Q 046217           96 TMMLFVFFTFIVGLVLMFLIARP  118 (288)
Q Consensus        96 ~li~lvlllgii~~lIlwLvlrP  118 (288)
                      ++.+..++++++ ..-+|..+-|
T Consensus         8 ~i~i~~ll~~~T-gy~iYtaFGp   29 (43)
T CHL00020          8 AIFISGLLVSFT-GYALYTAFGQ   29 (43)
T ss_pred             HHHHHHHHHHhh-heeeeeccCC
Confidence            444556677777 7888888877


No 88 
>PF01034 Syndecan:  Syndecan domain;  InterPro: IPR001050 The syndecans are transmembrane proteoglycans which are involved in the organisation of cytoskeleton and/or actin microfilaments, and have important roles as cell surface receptors during cell-cell and/or cell-matrix interactions [, ]. Structurally, these proteins consist of four separate domains:   A signal sequence; An extracellular domain (ectodomain) of variable length whose sequence is not evolutionary conserved in the various forms of syndecans. The ectodomain contains the sites of attachment of the heparan sulphate glycosaminoglycan side chains;  A transmembrane region;  A highly conserved cytoplasmic domain of about 30 to 35 residues, which could interact with cytoskeletal proteins.    The proteins known to belong to this family are:    Syndecan 1.  Syndecan 2 or fibroglycan.  Syndecan 3 or neuroglycan or N-syndecan.  Syndecan 4 or amphiglycan or ryudocan.  Drosophila syndecan.   Caenorhabditis elegans probable syndecan (F57C7.3).    Syndecan-4, a transmembrane heparan sulphate proteoglycan, is a coreceptor with integrins in cell adhesion. It has been suggested to form a ternary signalling complex with protein kinase Calpha and phosphatidylinositol 4,5-bisphosphate (PIP2). Structural studies have demonstrated that the cytoplasmic domain undergoes a conformational transition and forms a symmetric dimer in the presence of phospholipid activator PIP2, and whose overall structure in solution exhibits a twisted clamp shape having a cavity in the centre of dimeric interface. In addition, it has been observed that the syndecan-4 variable domain interacts, strongly, not only with fatty acyl groups but also the anionic head group of PIP2. These findings indicate that PIP2 promotes oligomerisation of the syndecan-4 cytoplasmic domain for transmembrane signalling and cell-matrix adhesion [, ].; GO: 0008092 cytoskeletal protein binding, 0016020 membrane; PDB: 1EJQ_B 1EJP_B 1YBO_C 1OBY_Q.
Probab=20.05  E-value=35  Score=25.14  Aligned_cols=15  Identities=40%  Similarity=0.811  Sum_probs=0.0

Q ss_pred             HHHHHHHheeeEEeec
Q 046217          103 FTFIVGLVLMFLIARP  118 (288)
Q Consensus       103 llgii~~lIlwLvlrP  118 (288)
                      +++++ +||+++++|=
T Consensus        22 ll~ai-lLIlf~iyR~   36 (64)
T PF01034_consen   22 LLFAI-LLILFLIYRM   36 (64)
T ss_dssp             ----------------
T ss_pred             HHHHH-HHHHHHHHHH
Confidence            33444 5666677663


Done!