RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15665
         (107 letters)



>gnl|CDD|240177 cd05156, ChoK_euk, Choline Kinase (ChoK) in eukaryotes. The ChoK
           subfamily is part of a larger superfamily that includes
           the catalytic domains of other kinases, such as the
           typical serine/threonine/tyrosine protein kinases (PKs),
           RIO kinases, actin-fragmin kinase (AFK), and
           phosphoinositide 3-kinase (PI3K). It is composed of
           bacterial and eukaryotic choline kinases, as well as
           eukaryotic ethanolamine kinase. ChoK catalyzes the
           transfer of the gamma-phosphoryl group from ATP (or CTP)
           to its substrate, choline, producing phosphorylcholine
           (PCho), a precursor to the biosynthesis of two major
           membrane phospholipids, phosphatidylcholine (PC) and
           sphingomyelin (SM). Although choline is the preferred
           substrate, ChoK also shows substantial activity towards
           ethanolamine and its N-methylated derivatives. ChoK
           plays an important role in cell signaling pathways and
           the regulation of cell growth. Along with PCho, it is
           involved in malignant transformation through Ras
           oncogenes in various human cancers such as breast, lung,
           colon, prostate, neuroblastoma, and hepatic lymphoma. In
           mammalian cells, there are three ChoK isoforms (A-1,
           A-2, and B) which are active in homo or heterodimeric
           forms.
          Length = 302

 Score =  108 bits (271), Expect = 4e-30
 Identities = 42/97 (43%), Positives = 50/97 (51%), Gaps = 18/97 (18%)

Query: 5   KIKSPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNY 64
           +  SPV FCHNDLQEGNIL     +                      LV+IDFEY SYNY
Sbjct: 174 ESGSPVVFCHNDLQEGNILLLNPSSETK------------------KLVLIDFEYASYNY 215

Query: 65  RAFDIANHFVESVYDYSYKHFPHYTVKRENYPSYSLR 101
           R FDIANHF E +YDY     P + +  + YP+   R
Sbjct: 216 RGFDIANHFCEWMYDYHDPEPPFFKIHEDKYPTEEQR 252


>gnl|CDD|216617 pfam01633, Choline_kinase, Choline/ethanolamine kinase.  Choline
           kinase catalyzes the committed step in the synthesis of
           phosphatidylcholine by the CDP-choline pathway. This
           alignment covers the protein kinase portion of the
           protein. The divergence of this family makes it very
           difficult to create a model that specifically predicts
           choline/ethanolamine kinases only. However if [add Pfam
           ID here for Choline_kinase_C] is also present then it is
           definitely a member of this family.
          Length = 206

 Score = 85.4 bits (212), Expect = 4e-22
 Identities = 41/98 (41%), Positives = 49/98 (50%), Gaps = 23/98 (23%)

Query: 1   KHLLKIKSPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYC 60
           K L  ++SP+ FCHNDLQ GNIL                      +N+   LV+IDFEY 
Sbjct: 130 KLLENLESPIVFCHNDLQPGNILL---------------------DNSTNRLVLIDFEYA 168

Query: 61  SYNYRAFDIANHFVESVYDYSYKHFPHYTVKRENYPSY 98
           SYNYRAFDIANHF E   DY       +      YP+ 
Sbjct: 169 SYNYRAFDIANHFCEWAGDYHEPEP--FKCDYSLYPTE 204


>gnl|CDD|240350 PTZ00296, PTZ00296, choline kinase; Provisional.
          Length = 442

 Score = 69.9 bits (171), Expect = 2e-15
 Identities = 36/92 (39%), Positives = 47/92 (51%), Gaps = 23/92 (25%)

Query: 10  VTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYRAFDI 69
           + FCHNDLQE NI+                       N N  L +IDFEY  YN+ A DI
Sbjct: 284 IVFCHNDLQENNII-----------------------NTNKCLRLIDFEYSGYNFLATDI 320

Query: 70  ANHFVESVYDYSYKHFPHYTVKRENYPSYSLR 101
           AN F+E+  DYS  H+P + + ++ Y SY  R
Sbjct: 321 ANFFIETTIDYSVSHYPFFAIDKKKYISYENR 352


>gnl|CDD|240178 cd05157, ETNK_euk, Ethanolamine kinase (ETNK) in eukaryotes. ETNK
           is part of a larger superfamily that includes the
           catalytic domains of other kinases, such as the typical
           serine/threonine/tyrosine protein kinases (PKs), RIO
           kinases, actin-fragmin kinase (AFK), and
           phosphoinositide 3-kinase (PI3K). ETNK catalyzes the
           transfer of the gamma-phosphoryl group from CTP to
           ethanolamine (Etn), the first step in the CDP-Etn
           pathway for the formation of the major phospholipid,
           phosphatidylethanolamine (PtdEtn). Unlike ChoK, ETNK
           shows specific activity for its substrate, and displays
           negligible activity towards N-methylated derivatives of
           Etn. The Drosophila ETNK is implicated in development
           and neuronal function. Mammals contain two ETNK
           proteins, ETNK1 and ETNK2. ETNK1 selectively increases
           Etn uptake and phosphorylation, as well as PtdEtn
           synthesis. ETNK2 is found primarily in the liver and
           reproductive tissues. It plays a critical role in
           regulating placental hemostasis to support late
           embryonic development. It may also have a role in
           testicular maturation.
          Length = 235

 Score = 67.7 bits (166), Expect = 4e-15
 Identities = 32/75 (42%), Positives = 39/75 (52%), Gaps = 21/75 (28%)

Query: 1   KHLLKIKSPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYC 60
           + L  + SP+ FCHNDL  GNI+Y E  N+                     +  ID+EY 
Sbjct: 161 ELLSALNSPIVFCHNDLLSGNIIYNEEKNS---------------------VKFIDYEYA 199

Query: 61  SYNYRAFDIANHFVE 75
            YNYRAFDIANHF E
Sbjct: 200 GYNYRAFDIANHFNE 214


>gnl|CDD|177880 PLN02236, PLN02236, choline kinase.
          Length = 344

 Score = 65.5 bits (160), Expect = 6e-14
 Identities = 34/93 (36%), Positives = 41/93 (44%), Gaps = 29/93 (31%)

Query: 12  FCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYRAFDIAN 71
           FCHNDLQ GNI+  E                         + +ID+EY SYN  A+DIAN
Sbjct: 202 FCHNDLQYGNIMIDEETRA---------------------ITIIDYEYASYNPVAYDIAN 240

Query: 72  HFVESVYDYSYKHFPH---YTVKRENYPSYSLR 101
           HF E   DY +   PH   Y+     YP    R
Sbjct: 241 HFCEMAADY-HSETPHILDYS----KYPGEEER 268


>gnl|CDD|215231 PLN02421, PLN02421, phosphotransferase, alcohol group as
           acceptor/kinase.
          Length = 330

 Score = 59.4 bits (144), Expect = 8e-12
 Identities = 30/82 (36%), Positives = 39/82 (47%), Gaps = 22/82 (26%)

Query: 6   IKSPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYR 65
           +K+PV F HNDL  GN++                      N +   L  IDFEY SY+YR
Sbjct: 178 LKAPVVFAHNDLLSGNLML---------------------NEDEGKLYFIDFEYGSYSYR 216

Query: 66  AFDIANHFVESV-YDYSYKHFP 86
            +DI NHF E   +D  Y  +P
Sbjct: 217 GYDIGNHFNEYAGFDCDYSLYP 238


>gnl|CDD|240172 cd05151, ChoK, Choline Kinase (ChoK). The ChoK subfamily is part of
           a larger superfamily that includes the catalytic domains
           of other kinases, such as the typical
           serine/threonine/tyrosine protein kinases (PKs), RIO
           kinases, actin-fragmin kinase (AFK), and
           phosphoinositide 3-kinase (PI3K). It is composed of
           bacterial and eukaryotic choline kinases, as well as
           eukaryotic ethanolamine kinase. ChoK catalyzes the
           transfer of the gamma-phosphoryl group from ATP (or CTP)
           to its substrate, choline, producing phosphorylcholine
           (PCho), a precursor to the biosynthesis of two major
           membrane phospholipids, phosphatidylcholine (PC), and
           sphingomyelin (SM). Although choline is the preferred
           substrate, ChoK also shows substantial activity towards
           ethanolamine and its N-methylated derivatives. Bacterial
           ChoK is also referred to as licA protein. ETNK catalyzes
           the transfer of the gamma-phosphoryl group from CTP to
           ethanolamine (Etn), the first step in the CDP-Etn
           pathway for the formation of the major phospholipid,
           phosphatidylethanolamine (PtdEtn). Unlike ChoK, ETNK
           shows specific activity for its substrate and displays
           negligible activity towards N-methylated derivatives of
           Etn. ChoK plays an important role in cell signaling
           pathways and the regulation of cell growth.
          Length = 170

 Score = 53.0 bits (128), Expect = 5e-10
 Identities = 20/74 (27%), Positives = 27/74 (36%), Gaps = 23/74 (31%)

Query: 9   PVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYRAFD 68
            +  CHNDL  GN L                      ++  + L  ID+EY   N   FD
Sbjct: 106 DLVPCHNDLLPGNFLL---------------------DDGRLWL--IDWEYAGMNDPLFD 142

Query: 69  IANHFVESVYDYSY 82
           +AN F E+      
Sbjct: 143 LANFFSEAKLSEDQ 156


>gnl|CDD|240159 cd05120, APH_ChoK_like, Aminoglycoside 3'-phosphotransferase (APH)
           and Choline Kinase (ChoK) family. The APH/ChoK family is
           part of a larger superfamily that includes the catalytic
           domains of other kinases, such as the typical
           serine/threonine/tyrosine protein kinases (PKs), RIO
           kinases, actin-fragmin kinase (AFK), and
           phosphoinositide 3-kinase (PI3K). The family is composed
           of APH, ChoK, ethanolamine kinase (ETNK), macrolide
           2'-phosphotransferase (MPH2'), an unusual homoserine
           kinase, and uncharacterized proteins with similarity to
           the N-terminal domain of acyl-CoA dehydrogenase 10
           (ACAD10). The members of this family catalyze the
           transfer of the gamma-phosphoryl group from ATP (or CTP)
           to small molecule substrates such as aminoglycosides,
           macrolides, choline, ethanolamine, and homoserine.
           Phosphorylation of the antibiotics, aminoglycosides and
           macrolides, leads to their inactivation and to bacterial
           antibiotic resistance. Phosphorylation of choline,
           ethanolamine, and homoserine serves as precursors to the
           synthesis of important biological compounds, such as the
           major phospholipids, phosphatidylcholine and
           phosphatidylethanolamine and the amino acids, threonine,
           methionine, and isoleucine.
          Length = 155

 Score = 48.2 bits (115), Expect = 3e-08
 Identities = 21/72 (29%), Positives = 28/72 (38%), Gaps = 23/72 (31%)

Query: 2   HLLKIKSPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCS 61
            L ++   V  CH DL  GNIL                      ++  I L +ID+EY  
Sbjct: 101 KLHQLPLLV-LCHGDLHPGNILV---------------------DDGKI-LGIIDWEYAG 137

Query: 62  YNYRAFDIANHF 73
           Y   AFD+A   
Sbjct: 138 YGPPAFDLAAAL 149


>gnl|CDD|223584 COG0510, ycfN, Thiamine kinase and related kinases [Coenzyme
           transport and metabolism].
          Length = 269

 Score = 43.5 bits (103), Expect = 3e-06
 Identities = 20/81 (24%), Positives = 32/81 (39%), Gaps = 23/81 (28%)

Query: 1   KHLLKIKSPV-TFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEY 59
           + L ++       CHNDL  GN+L   +                        L +ID+EY
Sbjct: 143 RALEEVPKDDLVPCHNDLNPGNLLL--TDKG--------------------GLFLIDWEY 180

Query: 60  CSYNYRAFDIANHFVESVYDY 80
              N  AFD+A    E ++++
Sbjct: 181 AGLNDPAFDLAALLSEYIFNH 201


>gnl|CDD|216618 pfam01636, APH, Phosphotransferase enzyme family.  This family
           consists of bacterial antibiotic resistance proteins,
           which confer resistance to various aminoglycosides they
           include: aminoglycoside 3'-phosphotransferase or
           kanamycin kinase / neomycin-kanamycin phosphotransferase
           and streptomycin 3''-kinase or streptomycin
           3''-phosphotransferase. The aminoglycoside
           phosphotransferases inactivate aminoglycoside
           antibiotics via phosphorylation. This family also
           includes homoserine kinase. This family is related to
           fructosamine kinase pfam03881.
          Length = 238

 Score = 41.8 bits (98), Expect = 1e-05
 Identities = 15/66 (22%), Positives = 20/66 (30%), Gaps = 21/66 (31%)

Query: 8   SPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYRAF 67
            P+   H DL  GN+L                              VIDFE       A+
Sbjct: 163 LPLVLVHGDLHPGNLLVDP---------------------GGRVTGVIDFEDAGLGDPAY 201

Query: 68  DIANHF 73
           D+A+  
Sbjct: 202 DLASLL 207


>gnl|CDD|225213 COG2334, COG2334, Putative homoserine kinase type II (protein
           kinase fold) [General function prediction only].
          Length = 331

 Score = 38.1 bits (89), Expect = 3e-04
 Identities = 11/80 (13%), Positives = 20/80 (25%), Gaps = 26/80 (32%)

Query: 13  CHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYRAFDIANH 72
            H DL   N+L+ +  +                         IDF+   Y +  +D+A  
Sbjct: 200 IHGDLHPDNVLFDDDTDV---------------------SGFIDFDDAGYGWFIYDLAI- 237

Query: 73  FVESVYDYSYKHFPHYTVKR 92
                 +             
Sbjct: 238 ----ALNAWNGDEADPRAAI 253


>gnl|CDD|168775 PRK07034, PRK07034, hypothetical protein; Provisional.
          Length = 536

 Score = 35.1 bits (80), Expect = 0.003
 Identities = 15/26 (57%), Positives = 19/26 (73%)

Query: 23  LYRESPNNNNSSNNNNNNNNNNNNNN 48
           LY++ PN N+S  N NNNN N +NNN
Sbjct: 82  LYQQQPNANDSYPNGNNNNPNGDNNN 107



 Score = 26.7 bits (58), Expect = 2.7
 Identities = 13/44 (29%), Positives = 21/44 (47%)

Query: 2   HLLKIKSPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNN 45
           ++ +  +P    +      N  Y    NNN + +NNN N +NNN
Sbjct: 71  NMARTSAPPVTLYQQQPNANDSYPNGNNNNPNGDNNNPNGSNNN 114


>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
          Provisional.
          Length = 522

 Score = 33.5 bits (77), Expect = 0.011
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 26 ESPNNNNSSNNNNNNNNNNNNNNNID 51
           +  ++N +NNNNNNN  N+NN N +
Sbjct: 11 HNNADDNYNNNNNNNNQINSNNPNNN 36



 Score = 32.4 bits (74), Expect = 0.029
 Identities = 15/36 (41%), Positives = 25/36 (69%)

Query: 14 HNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNN 49
          + +L+E   ++  + +N N++NNNNN  N+NN NNN
Sbjct: 1  NVNLREHLSVHNNADDNYNNNNNNNNQINSNNPNNN 36



 Score = 32.4 bits (74), Expect = 0.029
 Identities = 17/58 (29%), Positives = 26/58 (44%), Gaps = 3/58 (5%)

Query: 29 NNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYRAFDIANH---FVESVYDYSYK 83
           +NN+ +N NNNNNNNN  N+ +         S   R      +    ++ VY  + K
Sbjct: 10 VHNNADDNYNNNNNNNNQINSNNPNNNGNNQASKLPRGKKKQENPFNKIDEVYYQNEK 67



 Score = 32.0 bits (73), Expect = 0.044
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 26 ESPNNNNSSNNNNNNNNNNNNNNNI 50
             + +  +N ++N NNNNNNNN I
Sbjct: 4  LREHLSVHNNADDNYNNNNNNNNQI 28



 Score = 28.9 bits (65), Expect = 0.41
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 14 HNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNN 49
          H  +         + NNNN+  N+NN NNN NN  +
Sbjct: 7  HLSVHNNADDNYNNNNNNNNQINSNNPNNNGNNQAS 42



 Score = 26.2 bits (58), Expect = 3.7
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 29 NNNNSSNNNNNNNNNNNNNNNI 50
          N     + +NN ++N NNNNN 
Sbjct: 3  NLREHLSVHNNADDNYNNNNNN 24


>gnl|CDD|173576 PTZ00384, PTZ00384, choline kinase; Provisional.
          Length = 383

 Score = 33.2 bits (76), Expect = 0.013
 Identities = 25/91 (27%), Positives = 36/91 (39%), Gaps = 24/91 (26%)

Query: 6   IKSPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYR 65
           I + V FCHNDL   NIL                       + N  +  IDF++  +NY 
Sbjct: 223 ITNSVLFCHNDLFFTNIL-----------------------DFNQGIYFIDFDFAGFNYV 259

Query: 66  AFDIANHFVESVYDYSYKHFPHYTVKRENYP 96
            ++IAN FV+    Y     P Y    ++  
Sbjct: 260 GWEIANFFVKLYIVYD-PPTPPYFNSDDSLA 289


>gnl|CDD|131952 TIGR02906, spore_CotS, spore coat protein, CotS family.  Members of
           this family include the spore coat proteins CotS and
           YtaA from Bacillus subtilis and, from other
           endospore-forming bacteria, homologs that are more
           closely related to these two than to the spore coat
           proteins YutH and YsxE. The CotS family is more broadly
           distributed than YutH or YsxE, but still is not
           universal among spore-formers [Cellular processes,
           Sporulation and germination].
          Length = 313

 Score = 31.5 bits (72), Expect = 0.057
 Identities = 15/60 (25%), Positives = 21/60 (35%), Gaps = 23/60 (38%)

Query: 12  FCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYRAFDIAN 71
           FCH D    NIL                       +N +   VIDF+YC+ +    D+  
Sbjct: 189 FCHQDYAYHNILL---------------------KDNEVY--VIDFDYCTIDLPVRDLRK 225


>gnl|CDD|219500 pfam07655, Secretin_N_2, Secretin N-terminal domain.  This is a
          short domain found in bacterial type II/III secretory
          system proteins. The architecture of these proteins
          suggest that this family may be functionally analogous
          to pfam03958.
          Length = 95

 Score = 28.4 bits (64), Expect = 0.27
 Identities = 5/23 (21%), Positives = 18/23 (78%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNN 49
            N+++SS+N++N  +++++++ 
Sbjct: 30 GSNSSSSSSNSSNGGSSSSSSSG 52



 Score = 28.1 bits (63), Expect = 0.37
 Identities = 6/24 (25%), Positives = 19/24 (79%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNNI 50
          S + +NSS++++N++N  +++++ 
Sbjct: 27 SSSGSNSSSSSSNSSNGGSSSSSS 50



 Score = 27.3 bits (61), Expect = 0.83
 Identities = 4/23 (17%), Positives = 19/23 (82%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNN 49
          S +++ S+++++++N++N  +++
Sbjct: 25 SVSSSGSNSSSSSSNSSNGGSSS 47



 Score = 26.1 bits (58), Expect = 2.2
 Identities = 5/23 (21%), Positives = 19/23 (82%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNN 49
          S ++++SSN++N  +++++++ +
Sbjct: 31 SNSSSSSSNSSNGGSSSSSSSGD 53



 Score = 26.1 bits (58), Expect = 2.2
 Identities = 5/23 (21%), Positives = 19/23 (82%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNN 49
          S +++NSSN  +++++++ ++++
Sbjct: 34 SSSSSNSSNGGSSSSSSSGDSSS 56



 Score = 26.1 bits (58), Expect = 2.5
 Identities = 3/23 (13%), Positives = 18/23 (78%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNN 49
          S ++N+S+  +++++++ ++++ 
Sbjct: 35 SSSSNSSNGGSSSSSSSGDSSSG 57



 Score = 25.0 bits (55), Expect = 5.4
 Identities = 2/23 (8%), Positives = 19/23 (82%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNN 49
          + ++++S+++N  +++++++ ++
Sbjct: 32 NSSSSSSNSSNGGSSSSSSSGDS 54



 Score = 25.0 bits (55), Expect = 5.8
 Identities = 3/23 (13%), Positives = 19/23 (82%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNN 49
          S +++++S+N  +++++++ +++
Sbjct: 33 SSSSSSNSSNGGSSSSSSSGDSS 55



 Score = 24.6 bits (54), Expect = 7.7
 Identities = 4/24 (16%), Positives = 17/24 (70%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNNI 50
          S N++N  +++++++ ++++   I
Sbjct: 37 SSNSSNGGSSSSSSSGDSSSGTRI 60


>gnl|CDD|225798 COG3259, FrhA, Coenzyme F420-reducing hydrogenase, alpha subunit
           [Energy production and conversion].
          Length = 441

 Score = 29.3 bits (66), Expect = 0.30
 Identities = 14/51 (27%), Positives = 18/51 (35%), Gaps = 2/51 (3%)

Query: 48  NNIDLVVIDFEYCSYNYRAFDIANHFVESVYDYSYKHFPHYTVKRENYPSY 98
                  I        +   +  +   ESVY YSY   P+Y  K   YP Y
Sbjct: 221 PEPYDGSIRAYDDREKFDPDEYEDIIPESVYPYSYLKHPYY--KDLGYPDY 269


>gnl|CDD|191179 pfam05053, Menin, Menin.  MEN1, the gene responsible for multiple
           endocrine neoplasia type 1, is a tumour suppressor gene
           that encodes a protein called Menin which may be an
           atypical GTPase stimulated by nm23.
          Length = 618

 Score = 29.2 bits (65), Expect = 0.38
 Identities = 15/23 (65%), Positives = 18/23 (78%)

Query: 26  ESPNNNNSSNNNNNNNNNNNNNN 48
           ESPN    +NNNN+N+NNNNNN 
Sbjct: 496 ESPNPELPANNNNSNSNNNNNNG 518



 Score = 25.7 bits (56), Expect = 6.8
 Identities = 19/44 (43%), Positives = 24/44 (54%), Gaps = 10/44 (22%)

Query: 16  DLQEGN--ILYRES----PNNNNSSN----NNNNNNNNNNNNNN 49
           + +EG      RES    P+   S N     NNNN+N+NNNNNN
Sbjct: 474 EAREGRRRGPRRESKSQEPSGGESPNPELPANNNNSNSNNNNNN 517


>gnl|CDD|240339 PTZ00265, PTZ00265, multidrug resistance protein (mdr1);
           Provisional.
          Length = 1466

 Score = 28.8 bits (64), Expect = 0.47
 Identities = 15/26 (57%), Positives = 21/26 (80%)

Query: 25  RESPNNNNSSNNNNNNNNNNNNNNNI 50
           +++  NNN +N ++NNNNNNNNNN I
Sbjct: 672 KDNKENNNKNNKDDNNNNNNNNNNKI 697



 Score = 28.1 bits (62), Expect = 1.1
 Identities = 17/32 (53%), Positives = 22/32 (68%)

Query: 25  RESPNNNNSSNNNNNNNNNNNNNNNIDLVVID 56
           +E+ N NN  +NNNNNNNNNN  NN    +I+
Sbjct: 675 KENNNKNNKDDNNNNNNNNNNKINNAGSYIIE 706



 Score = 28.1 bits (62), Expect = 1.1
 Identities = 14/24 (58%), Positives = 17/24 (70%)

Query: 26  ESPNNNNSSNNNNNNNNNNNNNNN 49
           E P  +N  NNN NN ++NNNNNN
Sbjct: 668 EDPTKDNKENNNKNNKDDNNNNNN 691



 Score = 26.5 bits (58), Expect = 3.5
 Identities = 13/29 (44%), Positives = 20/29 (68%)

Query: 21  NILYRESPNNNNSSNNNNNNNNNNNNNNN 49
           +I+  +   +N  +NN NN ++NNNNNNN
Sbjct: 664 DIIGEDPTKDNKENNNKNNKDDNNNNNNN 692


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 28.3 bits (64), Expect = 0.66
 Identities = 0/25 (0%), Positives = 4/25 (16%)

Query: 25  RESPNNNNSSNNNNNNNNNNNNNNN 49
                +     +        +  + 
Sbjct: 202 DRDRRDRREQGDRREERGRRDGGDR 226



 Score = 27.9 bits (63), Expect = 0.93
 Identities = 1/25 (4%), Positives = 6/25 (24%)

Query: 25  RESPNNNNSSNNNNNNNNNNNNNNN 49
           R+  +     +        +  +  
Sbjct: 203 RDRRDRREQGDRREERGRRDGGDRR 227


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 28.2 bits (62), Expect = 0.79
 Identities = 10/23 (43%), Positives = 16/23 (69%)

Query: 27  SPNNNNSSNNNNNNNNNNNNNNN 49
           +P+ N ++N N+N N N+N N N
Sbjct: 332 TPSKNTNTNTNSNTNTNSNTNAN 354



 Score = 28.2 bits (62), Expect = 0.85
 Identities = 10/23 (43%), Positives = 12/23 (52%)

Query: 27  SPNNNNSSNNNNNNNNNNNNNNN 49
           S N N +  N N N N NN N +
Sbjct: 309 STNTNANKTNTNTNTNTNNTNTS 331



 Score = 27.1 bits (59), Expect = 2.1
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 29  NNNNSSNNNNNNNNNNNNNNN 49
           NN N+S  + N N N N+N N
Sbjct: 326 NNTNTSTPSKNTNTNTNSNTN 346



 Score = 26.3 bits (57), Expect = 3.7
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 29  NNNNSSNNNNNNNNNNNNNNN 49
            N NS+ N N+N N N  ++N
Sbjct: 339 TNTNSNTNTNSNTNANQGSSN 359



 Score = 25.9 bits (56), Expect = 5.4
 Identities = 8/23 (34%), Positives = 14/23 (60%)

Query: 27  SPNNNNSSNNNNNNNNNNNNNNN 49
           + N NN++ +  + N N N N+N
Sbjct: 322 NTNTNNTNTSTPSKNTNTNTNSN 344



 Score = 25.9 bits (56), Expect = 5.5
 Identities = 12/24 (50%), Positives = 15/24 (62%), Gaps = 3/24 (12%)

Query: 29  NNNNSSNNNNNNNNNNN---NNNN 49
           N N +SN N N+N N N   +NNN
Sbjct: 338 NTNTNSNTNTNSNTNANQGSSNNN 361



 Score = 25.9 bits (56), Expect = 5.5
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 27  SPNNNNSSNNNNNNNNNNNNNNN 49
           +P+ N ++N  N N N N NN N
Sbjct: 307 APSTNTNANKTNTNTNTNTNNTN 329



 Score = 25.5 bits (55), Expect = 6.9
 Identities = 7/21 (33%), Positives = 12/21 (57%)

Query: 29  NNNNSSNNNNNNNNNNNNNNN 49
            NN +++  + N N N N+N 
Sbjct: 325 TNNTNTSTPSKNTNTNTNSNT 345


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 27.3 bits (61), Expect = 1.4
 Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 4/39 (10%)

Query: 5   KIKSPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNN 43
           +++S +TF  ND   G I     P NN+ +N  ++  +N
Sbjct: 309 QVRS-LTFVFND---GTIRTVPRPRNNSQTNGGDSGKSN 343


>gnl|CDD|219312 pfam07150, DUF1390, Protein of unknown function (DUF1390).  This
          family consists of several Paramecium bursaria
          chlorella virus 1 (PBCV-1) proteins of around 250
          residues in length. The function of this family is
          unknown.
          Length = 229

 Score = 27.4 bits (61), Expect = 1.4
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 30 NNNSSNNNNNNNNNNNNNNNIDLVVID 56
             ++N NN N   N  N  I+L V D
Sbjct: 62 GVVNNNVNNINKTINTKNITINLTVPD 88


>gnl|CDD|240174 cd05153, HomoserineK_II, Homoserine Kinase, type II. Homoserine
           kinase is part of a larger superfamily that includes the
           catalytic domains of other kinases, such as the typical
           serine/threonine/tyrosine protein kinases (PKs), RIO
           kinases, actin-fragmin kinase (AFK), and
           phosphoinositide 3-kinase (PI3K). This subfamily is
           composed of unusual homoserine kinases, from a subset of
           bacteria, which have a PK fold. These proteins do not
           bear any similarity to the GHMP family homoserine
           kinases present in most bacteria and eukaryotes.
           Homoserine kinase catalyzes the transfer of the
           gamma-phosphoryl group from ATP to L-homoserine
           producing L-homoserine phosphate, an intermediate in the
           production of the amino acids threonine, methionine, and
           isoleucine.
          Length = 296

 Score = 27.2 bits (61), Expect = 1.4
 Identities = 6/17 (35%), Positives = 10/17 (58%)

Query: 54  VIDFEYCSYNYRAFDIA 70
           VIDF +   +   +D+A
Sbjct: 197 VIDFYFACTDAFLYDLA 213


>gnl|CDD|214734 smart00587, CHK, ZnF_C4 abd HLH domain containing kinases domain.
           subfamily of choline kinases.
          Length = 196

 Score = 26.9 bits (60), Expect = 1.9
 Identities = 12/66 (18%), Positives = 21/66 (31%), Gaps = 17/66 (25%)

Query: 5   KIKSPVTFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNY 64
                    H DL   NI+++                  ++     D+ +IDF+   Y  
Sbjct: 115 DEGEFNVLNHGDLWANNIMFKY-----------------DDEGKPEDVALIDFQLSHYGS 157

Query: 65  RAFDIA 70
            A D+ 
Sbjct: 158 PAEDLH 163


>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
          Length = 459

 Score = 26.9 bits (59), Expect = 2.1
 Identities = 10/23 (43%), Positives = 18/23 (78%)

Query: 28  PNNNNSSNNNNNNNNNNNNNNNI 50
              ++SS+  NN+NNNN++N+N+
Sbjct: 187 EVESSSSSKINNSNNNNHSNSNL 209


>gnl|CDD|214408 MTH00036, ATP8, ATP synthase F0 subunit 8; Validated.
          Length = 54

 Score = 25.4 bits (56), Expect = 2.2
 Identities = 9/26 (34%), Positives = 18/26 (69%)

Query: 22 ILYRESPNNNNSSNNNNNNNNNNNNN 47
           L   +  + N+SN+++N +++NNNN
Sbjct: 26 YLLLNNWPSTNASNSSSNTSSSNNNN 51


>gnl|CDD|233667 TIGR01982, UbiB, 2-polyprenylphenol 6-hydroxylase.  This model
           represents the enzyme (UbiB) which catalyzes the first
           hydroxylation step in the ubiquinone biosynthetic
           pathway in bacteria. It is believed that the reaction is
           2-polyprenylphenol -> 6-hydroxy-2-polyprenylphenol. This
           model finds hits primarily in the proteobacteria. The
           gene is also known as AarF in certain species
           [Biosynthesis of cofactors, prosthetic groups, and
           carriers, Menaquinone and ubiquinone].
          Length = 437

 Score = 26.5 bits (59), Expect = 2.6
 Identities = 7/18 (38%), Positives = 8/18 (44%)

Query: 12  FCHNDLQEGNILYRESPN 29
           F H DL  GNI   +   
Sbjct: 278 FFHADLHPGNIFVLKDGK 295


>gnl|CDD|131944 TIGR02898, spore_YhcN_YlaJ, sporulation lipoprotein, YhcN/YlaJ
          family.  YhcN and YlaJ are predicted lipoproteins that
          have been detected as spore proteins but not vegetative
          proteins in Bacillus subtilis. Both appear to be
          expressed under control of the RNA polymerase sigma-G
          factor. The YlaJ-like members of this family have a
          low-complexity, strongly acidic 40-residue C-terminal
          domain that is not included in the seed alignment for
          this model. A portion of the low-complexity region
          between the lipoprotein signal sequence and the main
          conserved region of the protein family was also excised
          from the seed alignment [Cellular processes,
          Sporulation and germination].
          Length = 158

 Score = 25.8 bits (57), Expect = 4.4
 Identities = 13/31 (41%), Positives = 16/31 (51%), Gaps = 4/31 (12%)

Query: 26 ESPNNN----NSSNNNNNNNNNNNNNNNIDL 52
            PNNN    N  + NNN  N  N+ N+ DL
Sbjct: 22 NPPNNNVGETNVMSKNNNGMNTTNDTNDGDL 52


>gnl|CDD|153137 cd06269, PBP1_glutamate_receptors_like, Family C G-protein
          couples receptors (GPCRs), membrane bound guanylyl
          cyclases such as the family of natriuretic peptide
          receptors (NPRs), and the N-terminal
          leucine/isoleucine/valine- binding protein
          (LIVBP)-like domain of the ionotropic glutamate
          receptors.  This CD represents the ligand-binding
          domain of the family C G-protein couples receptors
          (GPCRs), membrane bound guanylyl cyclases such as the
          family of natriuretic peptide receptors (NPRs), and the
          N-terminal leucine/isoleucine/valine- binding protein
          (LIVBP)-like domain of the ionotropic glutamate
          receptors, all of which are structurally similar and
          related to the periplasmic-binding fold type I family.
          The family C GPCRs consist of metabotropic glutamate
          receptor (mGluR) receptors, a calcium-sensing receptor
          (CaSR), gamma-aminobutyric receptors (GABAb), the
          promiscuous L-alpha-amino acid receptor GPR6A, families
          of taste and pheromone receptors, and orphan receptors.
          Truncated splicing variants of the orphan receptors are
          not included in this CD. The family C GPCRs are
          activated by endogenous agonists such as amino acids,
          ions, and sugar based molecules. Their amino terminal
          ligand-binding region is homologous to the bacterial
          leucine-isoleucine-valine binding protein (LIVBP) and a
          leucine binding protein (LBP). The ionotropic glutamate
          receptors (iGluRs) have an integral ion channel and are
          subdivided into three major groups based on their
          pharmacology and structural similarities: NMDA
          receptors, AMPA receptors, and kainate receptors. The
          family of membrane bound guanylyl cyclases is further
          divided into three subfamilies: the ANP receptor
          (GC-A)/C-type natriuretic peptide receptor (GC-B), the
          heat-stable enterotoxin receptor (GC-C)/sensory organ
          specific membrane GCs such as retinal receptors (GC-E,
          GC-F), and olfactory receptors (GC-D and GC-G).
          Length = 298

 Score = 25.9 bits (57), Expect = 4.7
 Identities = 8/39 (20%), Positives = 12/39 (30%)

Query: 39 NNNNNNNNNNNIDLVVIDFEYCSYNYRAFDIANHFVESV 77
               NN+  N  L    ++ C     AF  A      +
Sbjct: 28 AVEEINNDLPNTTLGYEIYDSCCSPSDAFSAALDLCSLL 66


>gnl|CDD|217303 pfam02958, EcKinase, Ecdysteroid kinase.  This family includes
           ecdysteroid 22-kinase, an enzyme responsible for the
           phosphorylation of ecdysteroids (insect growth and
           moulting hormones) at C-22, to form physiologically
           inactive ecdysteroid 22-phosphates.
          Length = 293

 Score = 25.3 bits (56), Expect = 7.2
 Identities = 12/63 (19%), Positives = 23/63 (36%), Gaps = 18/63 (28%)

Query: 11  TFCHNDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYNYRAFDIA 70
              H DL   NI+++                  +      D++++DF+   Y   A D+ 
Sbjct: 215 VLNHGDLWVNNIMFK-----------------YDEEGEPEDVILVDFQLSRYGSPALDL- 256

Query: 71  NHF 73
            +F
Sbjct: 257 LYF 259


>gnl|CDD|222446 pfam13903, Claudin_2, PMP-22/EMP/MP20/Claudin tight junction.
          Members of this family are claudins, that form tight
          junctions between cells.
          Length = 170

 Score = 25.0 bits (55), Expect = 8.0
 Identities = 6/37 (16%), Positives = 12/37 (32%), Gaps = 6/37 (16%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNNIDLVVIDFEYCSYN 63
           P   NS  +N   N    ++  +      +  C  +
Sbjct: 20 DPRRTNSLGSNGAVNTGLFSHLGL------WRICFTS 50


>gnl|CDD|133910 PHA00540, PHA00540, hypothetical protein.
          Length = 715

 Score = 25.4 bits (55), Expect = 8.7
 Identities = 13/43 (30%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 54  VIDFEYC-SYNYRAFDIANHFVESVYDYSYKHFPHYTVKRENY 95
           + DF Y  +  Y+ FD++   V++V  Y   H+P   +K++N+
Sbjct: 143 LTDFIYAFAVKYKMFDVSKEIVDNVDRYKENHYPRVKLKQDNW 185


>gnl|CDD|220296 pfam09580, Spore_YhcN_YlaJ, Sporulation lipoprotein YhcN/YlaJ
          (Spore_YhcN_YlaJ).  This entry contains YhcN and YlaJ,
          which are predicted lipoproteins that have been
          detected as spore proteins but not vegetative proteins
          in Bacillus subtilis. Both appear to be expressed under
          control of the RNA polymerase sigma-G factor. The
          YlaJ-like members of this family have a low-complexity,
          strongly acidic, 40-residue C-terminal domain.
          Length = 169

 Score = 25.0 bits (55), Expect = 8.9
 Identities = 10/37 (27%), Positives = 13/37 (35%)

Query: 15 NDLQEGNILYRESPNNNNSSNNNNNNNNNNNNNNNID 51
                N     + NN N+ NNNN       NN   +
Sbjct: 20 RQNVTDNDGNTNNVNNRNNRNNNNGYGRLTKNNVTGE 56


>gnl|CDD|177251 MTH00208, ND5, NADH dehydrogenase subunit 5; Provisional.
          Length = 628

 Score = 24.9 bits (55), Expect = 9.5
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNNI 50
          S +N +   NNNNN+  N NN+N 
Sbjct: 34 STSNASLGGNNNNNSIINYNNSNF 57


>gnl|CDD|237183 PRK12728, fliE, flagellar hook-basal body protein FliE;
          Provisional.
          Length = 102

 Score = 24.5 bits (54), Expect = 9.9
 Identities = 8/24 (33%), Positives = 9/24 (37%)

Query: 27 SPNNNNSSNNNNNNNNNNNNNNNI 50
          SPN      N +N  N NN     
Sbjct: 7  SPNQKVFDQNGSNKTNKNNPAEAQ 30


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.131    0.403 

Gapped
Lambda     K      H
   0.267   0.0767    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,359,654
Number of extensions: 445219
Number of successful extensions: 1803
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1278
Number of HSP's successfully gapped: 273
Length of query: 107
Length of database: 10,937,602
Length adjustment: 72
Effective length of query: 35
Effective length of database: 7,744,114
Effective search space: 271043990
Effective search space used: 271043990
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 53 (24.1 bits)