RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11799
         (483 letters)



>gnl|CDD|221695 pfam12662, cEGF, Complement Clr-like EGF-like.  cEGF, or complement
           Clr-like EGF, domains have six conserved cysteine
           residues disulfide-bonded into the characteristic
           pattern 'ababcc'. They are found in blood coagulation
           proteins such as fibrillin, Clr and Cls, thrombomodulin,
           and the LDL receptor. The core fold of the EGF domain
           consists of two small beta-hairpins packed against each
           other. Two major structural variants have been
           identified based on the structural context of the
           C-terminal cysteine residue of disulfide 'c' in the
           C-terminal hairpin: hEGFs and cEGFs. In cEGFs the
           C-terminal thiol resides on the C-terminal beta-sheet,
           resulting in long loop-lengths between the cysteine
           residues of disulfide 'c', typically C[10+]XC. These
           longer loop-lengths may have arisen by selective
           cysteine loss from a four-disulfide EGF template such as
           laminin or integrin. Tandem cEGF domains have five
           linking residues between terminal cysteines of adjacent
           domains. cEGF domains may or may not bind calcium in the
           linker region. cEGF domains with the consensus motif
           CXN4X[F,Y]XCXC are hydroxylated exclusively on the
           asparagine residue.
          Length = 24

 Score = 42.8 bits (102), Expect = 4e-06
 Identities = 14/24 (58%), Positives = 17/24 (70%)

Query: 211 SFRCICPYGYALAPDGRHCIDINE 234
           S+ C CP GY L+ DGR C DI+E
Sbjct: 1   SYTCSCPPGYQLSGDGRTCEDIDE 24



 Score = 33.2 bits (77), Expect = 0.011
 Identities = 16/25 (64%), Positives = 18/25 (72%), Gaps = 1/25 (4%)

Query: 252 SYTCGCHPGFELSSDGSGTCLDSDE 276
           SYTC C PG++LS DG  TC D DE
Sbjct: 1   SYTCSCPPGYQLSGDG-RTCEDIDE 24



 Score = 32.0 bits (74), Expect = 0.024
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 151 SFRCLCPRGFKLDRTGTFCLDQDE 174
           S+ C CP G++L   G  C D DE
Sbjct: 1   SYTCSCPPGYQLSGDGRTCEDIDE 24



 Score = 32.0 bits (74), Expect = 0.024
 Identities = 11/24 (45%), Positives = 14/24 (58%)

Query: 448 SFRCLCPRGFKLDRTGTFCLDQDE 471
           S+ C CP G++L   G  C D DE
Sbjct: 1   SYTCSCPPGYQLSGDGRTCEDIDE 24



 Score = 29.0 bits (66), Expect = 0.35
 Identities = 9/24 (37%), Positives = 10/24 (41%)

Query: 293 GYKCGCPEGYVRHPFSNECIDDNE 316
            Y C CP GY        C D +E
Sbjct: 1   SYTCSCPPGYQLSGDGRTCEDIDE 24


>gnl|CDD|219496 pfam07645, EGF_CA, Calcium-binding EGF domain. 
          Length = 42

 Score = 42.7 bits (101), Expect = 6e-06
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 3/45 (6%)

Query: 422 DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC 466
           D+DEC    +G   CP    C+NT+GSF C+CP G++ +  GT C
Sbjct: 1   DVDEC---ADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 41.2 bits (97), Expect = 2e-05
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 3/45 (6%)

Query: 125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRTGTFC 169
           D+DEC    +G   CP    C+NT+GSF C+CP G++ +  GT C
Sbjct: 1   DVDEC---ADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 37.3 bits (87), Expect = 5e-04
 Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 2/42 (4%)

Query: 190 DINECLELSNQC--AFRCHNVPGSFRCICPYGYALAPDGRHC 229
           D++EC + ++ C     C N  GSF C+CP GY    DG +C
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 37.3 bits (87), Expect = 5e-04
 Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 3/42 (7%)

Query: 273 DSDECATGRHICQQ--IC-NLLGGYKCGCPEGYVRHPFSNEC 311
           D DECA G H C    +C N +G ++C CP+GY  +     C
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGTNC 42



 Score = 35.8 bits (83), Expect = 0.002
 Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 2/40 (5%)

Query: 231 DINECEQFPGLCAH--TCTNTEGSYTCGCHPGFELSSDGS 268
           D++EC      C     C NT GS+ C C  G+E + DG+
Sbjct: 1   DVDECADGTHNCPANTVCVNTIGSFECVCPDGYENNEDGT 40



 Score = 31.2 bits (71), Expect = 0.090
 Identities = 15/42 (35%), Positives = 20/42 (47%), Gaps = 10/42 (23%)

Query: 313 DDNECA--RNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352
           D +ECA   + C             C NT GS+ C CPDG++
Sbjct: 1   DVDECADGTHNCP--------ANTVCVNTIGSFECVCPDGYE 34


>gnl|CDD|214542 smart00179, EGF_CA, Calcium-binding EGF-like domain. 
          Length = 39

 Score = 42.6 bits (101), Expect = 6e-06
 Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 4/41 (9%)

Query: 422 DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRT 462
           DIDEC         C   GTC+NT+GS+RC CP G+   R 
Sbjct: 1   DIDECASG----NPCQNGGTCVNTVGSYRCECPPGYTDGRN 37



 Score = 40.7 bits (96), Expect = 3e-05
 Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 4/41 (9%)

Query: 125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDRT 165
           D+DEC         C   GTC+NT+GS+RC CP G+   R 
Sbjct: 1   DIDECASG----NPCQNGGTCVNTVGSYRCECPPGYTDGRN 37



 Score = 39.9 bits (94), Expect = 6e-05
 Identities = 16/34 (47%), Positives = 19/34 (55%), Gaps = 3/34 (8%)

Query: 231 DINECEQFPGLCAH--TCTNTEGSYTCGCHPGFE 262
           DI+EC      C +  TC NT GSY C C PG+ 
Sbjct: 1   DIDECAS-GNPCQNGGTCVNTVGSYRCECPPGYT 33



 Score = 39.5 bits (93), Expect = 9e-05
 Identities = 19/42 (45%), Positives = 23/42 (54%), Gaps = 4/42 (9%)

Query: 190 DINECLELSN-QCAFRCHNVPGSFRCICPYGYALAPDGRHCI 230
           DI+EC   +  Q    C N  GS+RC CP GY    DGR+C 
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCECPPGYT---DGRNCE 39



 Score = 36.5 bits (85), Expect = 0.001
 Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 9/41 (21%)

Query: 313 DDNECA-RNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352
           D +ECA  NPC+         GG C NT GSYRC CP G+ 
Sbjct: 1   DIDECASGNPCQN--------GGTCVNTVGSYRCECPPGYT 33



 Score = 30.3 bits (69), Expect = 0.15
 Identities = 17/33 (51%), Positives = 21/33 (63%), Gaps = 4/33 (12%)

Query: 273 DSDECATGRHICQQ--IC-NLLGGYKCGCPEGY 302
           D DECA+G + CQ    C N +G Y+C CP GY
Sbjct: 1   DIDECASG-NPCQNGGTCVNTVGSYRCECPPGY 32


>gnl|CDD|238011 cd00054, EGF_CA, Calcium-binding EGF-like domain, present in a
           large number of membrane-bound and extracellular (mostly
           animal) proteins. Many of these proteins require calcium
           for their biological function and calcium-binding sites
           have been found to be located at the N-terminus of
           particular EGF-like domains; calcium-binding may be
           crucial for numerous protein-protein interactions. Six
           conserved core cysteines form three disulfide bridges as
           in non calcium-binding EGF domains, whose structures are
           very similar. EGF_CA can be found in tandem repeat
           arrangements.
          Length = 38

 Score = 42.2 bits (100), Expect = 9e-06
 Identities = 18/40 (45%), Positives = 22/40 (55%), Gaps = 4/40 (10%)

Query: 422 DIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDR 461
           DIDEC         C   GTC+NT+GS+RC CP G+    
Sbjct: 1   DIDECASG----NPCQNGGTCVNTVGSYRCSCPPGYTGRN 36



 Score = 40.3 bits (95), Expect = 4e-05
 Identities = 17/40 (42%), Positives = 22/40 (55%), Gaps = 4/40 (10%)

Query: 125 DMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFKLDR 164
           D+DEC         C   GTC+NT+GS+RC CP G+    
Sbjct: 1   DIDECASG----NPCQNGGTCVNTVGSYRCSCPPGYTGRN 36



 Score = 38.0 bits (89), Expect = 2e-04
 Identities = 16/35 (45%), Positives = 19/35 (54%), Gaps = 3/35 (8%)

Query: 231 DINECEQFPGLCAH--TCTNTEGSYTCGCHPGFEL 263
           DI+EC      C +  TC NT GSY C C PG+  
Sbjct: 1   DIDECAS-GNPCQNGGTCVNTVGSYRCSCPPGYTG 34



 Score = 35.7 bits (83), Expect = 0.002
 Identities = 20/41 (48%), Positives = 23/41 (56%), Gaps = 9/41 (21%)

Query: 313 DDNECA-RNPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352
           D +ECA  NPC+         GG C NT GSYRC CP G+ 
Sbjct: 1   DIDECASGNPCQN--------GGTCVNTVGSYRCSCPPGYT 33



 Score = 35.3 bits (82), Expect = 0.003
 Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 5/42 (11%)

Query: 190 DINECLELSN-QCAFRCHNVPGSFRCICPYGYALAPDGRHCI 230
           DI+EC   +  Q    C N  GS+RC CP GY     GR+C 
Sbjct: 1   DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYT----GRNCE 38



 Score = 30.3 bits (69), Expect = 0.15
 Identities = 17/33 (51%), Positives = 21/33 (63%), Gaps = 4/33 (12%)

Query: 273 DSDECATGRHICQ--QIC-NLLGGYKCGCPEGY 302
           D DECA+G + CQ    C N +G Y+C CP GY
Sbjct: 1   DIDECASG-NPCQNGGTCVNTVGSYRCSCPPGY 32


>gnl|CDD|238752 cd01475, vWA_Matrilin, VWA_Matrilin: In cartilaginous plate,
           extracellular matrix molecules mediate cell-matrix and
           matrix-matrix interactions thereby providing tissue
           integrity. Some members of the matrilin family are
           expressed specifically in developing cartilage
           rudiments. The matrilin family consists of at least four
           members. All the members of the matrilin family contain
           VWA domains, EGF-like domains and a heptad repeat
           coiled-coiled domain at the carboxy terminus which is
           responsible for the oligomerization of the matrilins.
           The VWA domains have been shown to be essential for
           matrilin network formation by interacting with matrix
           ligands.
          Length = 224

 Score = 43.1 bits (102), Expect = 1e-04
 Identities = 17/34 (50%), Positives = 21/34 (61%), Gaps = 1/34 (2%)

Query: 271 CLDSDECATGRHICQQIC-NLLGGYKCGCPEGYV 303
           C+  D CAT  H+CQQ+C +  G Y C C EGY 
Sbjct: 184 CVVPDLCATLSHVCQQVCISTPGSYLCACTEGYA 217



 Score = 42.8 bits (101), Expect = 2e-04
 Identities = 14/42 (33%), Positives = 20/42 (47%)

Query: 226 GRHCIDINECEQFPGLCAHTCTNTEGSYTCGCHPGFELSSDG 267
           G+ C+  + C     +C   C +T GSY C C  G+ L  D 
Sbjct: 181 GKICVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDN 222



 Score = 38.9 bits (91), Expect = 0.003
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query: 188 CEDINECLELSNQCAFRCHNVPGSFRCICPYGYALAPDGR 227
           C   + C  LS+ C   C + PGS+ C C  GYAL  D +
Sbjct: 184 CVVPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNK 223



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 20/61 (32%), Positives = 28/61 (45%), Gaps = 6/61 (9%)

Query: 103 LFYVIIFYVISLKKKKFY-SVFLDMDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFK 161
           +FYV  F  I    KKF   + +  D C    +    C +   CI+T GS+ C C  G+ 
Sbjct: 163 VFYVEDFSTIEELTKKFQGKICVVPDLCATLSHV---CQQV--CISTPGSYLCACTEGYA 217

Query: 162 L 162
           L
Sbjct: 218 L 218


>gnl|CDD|238010 cd00053, EGF, Epidermal growth factor domain, found in epidermal
           growth factor (EGF) presents in a large number of
           proteins, mostly animal; the list of proteins currently
           known to contain one or more copies of an EGF-like
           pattern is large and varied; the functional significance
           of EGF-like domains in what appear to be unrelated
           proteins is not yet clear; a common feature is that
           these repeats are found in the extracellular domain of
           membrane-bound proteins or in proteins known to be
           secreted (exception: prostaglandin G/H synthase); the
           domain includes six cysteine residues which have been
           shown to be involved in disulfide bonds; the main
           structure is a two-stranded beta-sheet followed by a
           loop to a C-terminal short two-stranded sheet;
           Subdomains between the conserved cysteines vary in
           length; the region between the 5th and 6th cysteine
           contains two conserved glycines of which at  least  one 
           is  present  in  most EGF-like domains; a subset of
           these bind calcium.
          Length = 36

 Score = 34.0 bits (78), Expect = 0.008
 Identities = 15/27 (55%), Positives = 20/27 (74%)

Query: 139 CPKPGTCINTMGSFRCLCPRGFKLDRT 165
           C   GTC+NT GS+RC+CP G+  DR+
Sbjct: 8   CSNGGTCVNTPGSYRCVCPPGYTGDRS 34



 Score = 34.0 bits (78), Expect = 0.008
 Identities = 15/27 (55%), Positives = 20/27 (74%)

Query: 436 CPKPGTCINTMGSFRCLCPRGFKLDRT 462
           C   GTC+NT GS+RC+CP G+  DR+
Sbjct: 8   CSNGGTCVNTPGSYRCVCPPGYTGDRS 34



 Score = 30.5 bits (69), Expect = 0.14
 Identities = 11/21 (52%), Positives = 12/21 (57%)

Query: 244 HTCTNTEGSYTCGCHPGFELS 264
            TC NT GSY C C PG+   
Sbjct: 12  GTCVNTPGSYRCVCPPGYTGD 32



 Score = 29.4 bits (66), Expect = 0.29
 Identities = 19/38 (50%), Positives = 20/38 (52%), Gaps = 9/38 (23%)

Query: 316 ECAR-NPCEPSSNTTGERGGKCFNTAGSYRCGCPDGFQ 352
           ECA  NPC          GG C NT GSYRC CP G+ 
Sbjct: 1   ECAASNPCSN--------GGTCVNTPGSYRCVCPPGYT 30



 Score = 27.4 bits (61), Expect = 1.3
 Identities = 14/35 (40%), Positives = 16/35 (45%), Gaps = 5/35 (14%)

Query: 188 CEDINECLELSNQCAFRCHNVPGSFRCICPYGYAL 222
           C   N C          C N PGS+RC+CP GY  
Sbjct: 2   CAASNPCSNGG-----TCVNTPGSYRCVCPPGYTG 31


>gnl|CDD|214544 smart00181, EGF, Epidermal growth factor-like domain. 
          Length = 35

 Score = 32.1 bits (73), Expect = 0.034
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 142 PGTCINTMGSFRCLCPRGFKLDRT 165
            GTCINT GS+ C CP G+  D+ 
Sbjct: 10  NGTCINTPGSYTCSCPPGYTGDKR 33



 Score = 32.1 bits (73), Expect = 0.034
 Identities = 13/24 (54%), Positives = 16/24 (66%)

Query: 439 PGTCINTMGSFRCLCPRGFKLDRT 462
            GTCINT GS+ C CP G+  D+ 
Sbjct: 10  NGTCINTPGSYTCSCPPGYTGDKR 33



 Score = 30.2 bits (68), Expect = 0.17
 Identities = 16/32 (50%), Positives = 19/32 (59%), Gaps = 2/32 (6%)

Query: 234 ECEQFPGLCAH-TCTNTEGSYTCGCHPGFELS 264
           EC    G C++ TC NT GSYTC C PG+   
Sbjct: 1   ECAS-GGPCSNGTCINTPGSYTCSCPPGYTGD 31



 Score = 27.9 bits (62), Expect = 1.0
 Identities = 10/22 (45%), Positives = 11/22 (50%)

Query: 204 RCHNVPGSFRCICPYGYALAPD 225
            C N PGS+ C CP GY     
Sbjct: 12  TCINTPGSYTCSCPPGYTGDKR 33



 Score = 27.5 bits (61), Expect = 1.4
 Identities = 11/19 (57%), Positives = 12/19 (63%)

Query: 334 GKCFNTAGSYRCGCPDGFQ 352
           G C NT GSY C CP G+ 
Sbjct: 11  GTCINTPGSYTCSCPPGYT 29


>gnl|CDD|205157 pfam12947, EGF_3, EGF domain.  This family includes a variety of
           EGF-like domain homologues. This family includes the
           C-terminal domain of the malaria parasite MSP1 protein.
          Length = 36

 Score = 30.6 bits (70), Expect = 0.12
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 245 TCTNTEGSYTCGCHPGFE 262
           TCTNT GS+TC C  G+ 
Sbjct: 13  TCTNTGGSFTCTCKSGYT 30



 Score = 29.8 bits (68), Expect = 0.24
 Identities = 13/26 (50%), Positives = 14/26 (53%), Gaps = 2/26 (7%)

Query: 144 TCINTMGSFRCLCPRGFKLDRTGTFC 169
           TC NT GSF C C  G+  D  G  C
Sbjct: 13  TCTNTGGSFTCTCKSGYTGD--GVTC 36



 Score = 29.8 bits (68), Expect = 0.24
 Identities = 13/26 (50%), Positives = 14/26 (53%), Gaps = 2/26 (7%)

Query: 441 TCINTMGSFRCLCPRGFKLDRTGTFC 466
           TC NT GSF C C  G+  D  G  C
Sbjct: 13  TCTNTGGSFTCTCKSGYTGD--GVTC 36



 Score = 29.0 bits (66), Expect = 0.41
 Identities = 12/25 (48%), Positives = 12/25 (48%), Gaps = 2/25 (8%)

Query: 205 CHNVPGSFRCICPYGYALAPDGRHC 229
           C N  GSF C C  GY    DG  C
Sbjct: 14  CTNTGGSFTCTCKSGYTG--DGVTC 36


>gnl|CDD|224596 COG1682, TagG, ABC-type polysaccharide/polyol phosphate export
           systems, permease component [Carbohydrate transport and
           metabolism / Cell envelope biogenesis, outer membrane].
          Length = 263

 Score = 32.9 bits (76), Expect = 0.31
 Identities = 20/107 (18%), Positives = 34/107 (31%), Gaps = 32/107 (29%)

Query: 22  FLINMLAGIFVWSLASQWTGEILTNVSPYPTSVNMASVSTRKGL-TGVSVTRDTSYTMAI 80
           FL  +LAG+ +W   S+   E               SV     L   ++           
Sbjct: 68  FLAYLLAGLILWFFFSEAISEG------------AGSVVANAALIKKINFPPL------- 108

Query: 81  LNVEQDKQGILIFSMLQTRYFNLFY---VIIFYVISLKKKKFYSVFL 124
                    IL  +   +R FN      +I+ ++I L  +  +   L
Sbjct: 109 ---------ILPVARTLSRLFNFLIHLIIILIFLIILGVEPSWHWLL 146


>gnl|CDD|240151 cd05024, S-100A10, S-100A10: A subgroup of the S-100A10 domain
           found in proteins similar to S100A10. S100A10 is a
           member of the S100 family of EF-hand superfamily of
           calcium-binding proteins. Note that the S-100 hierarchy,
           to which this S-100A10 group belongs, contains only
           S-100 EF-hand domains, other EF-hands have been modeled
           separately. S100 proteins are expressed exclusively in
           vertebrates, and are implicated in intracellular and
           extracellular regulatory activities. A unique feature of
           S100A10 is that it contains mutation in both of the
           calcium binding sites, making it calcium insensitive.
           S100A10 has been detected in brain, heart,
           gastrointestinal tract, kidney, liver, lung, spleen,
           testes, epidermis, aorta, and thymus. Structural data
           supports the homo- and hetero-dimeric as well as
           hetero-tetrameric nature of the protein. S100A10 has
           multiple binding partners in its calcium free state and
           is therefore involved in many diverse biological
           functions.
          Length = 91

 Score = 30.6 bits (69), Expect = 0.42
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 273 DSDECATGRHICQQICNLLGGYKCGCPEGYVRH 305
           D D+C  G+   Q   +L+ G    C + YV+H
Sbjct: 56  DLDDCRDGKVGFQSFFSLIAGLLIACNDYYVKH 88


>gnl|CDD|233450 TIGR01528, NMN_trans_PnuC, nicotinamide mononucleotide transporter
           PnuC.  The PnuC protein of E. coli is membrane protein
           responsible for nicotinamide mononucleotide transport,
           subject to regulation by interaction with the NadR (also
           called NadI) protein (see TIGR01526). This model defines
           a region corresponding to most of the length of PnuC,
           found primarily in pathogens. The extreme N- and
           C-terminal regions are poorly conserved and not included
           in the alignment and model [Transport and binding
           proteins, Other, Biosynthesis of cofactors, prosthetic
           groups, and carriers, Pyridine nucleotides].
          Length = 189

 Score = 32.0 bits (73), Expect = 0.43
 Identities = 25/134 (18%), Positives = 45/134 (33%), Gaps = 33/134 (24%)

Query: 15  ELLIELLFLINMLAGIFVWSLASQWTGEILTNVSPYPTSVN------------MASVSTR 62
           E+L++L F+I  + G+F W   SQ     +T+                     +      
Sbjct: 49  EVLLQLFFVIANIYGLFAWKKNSQQEENGVTSGLKKLKGKGWTIILSTSVVLTVVIALIL 108

Query: 63  KGLTGVSVTRDTSYTMAILNVEQDKQGILIFSMLQTRYF-----NLFYVIIF-------Y 110
           K +       D S T  I  V Q    IL+    +  +      N+  +I++        
Sbjct: 109 KFIGDNFPWLD-SITFVIGIVAQ----ILMVRRYRENWILWLLSNVVSIILWFLAFTGSL 163

Query: 111 VISLKKKKFYSVFL 124
            + L     Y ++L
Sbjct: 164 AMLLT----YIIYL 173


>gnl|CDD|238537 cd01117, YbiR_permease, Putative anion permease YbiR.  Based on
           sequence similarity, YbiR proteins are predicted to
           function as anion translocating permeases in eubacteria,
           archaea and plants. They belong to ArsB/NhaD superfamily
           of permeases that have been shown to translocate sodium,
           sulfate, arsenite and organic anions. A typical
           ArsB/NhaD permease is composed of 8-13 transmembrane
           domains.
          Length = 384

 Score = 31.5 bits (72), Expect = 1.0
 Identities = 17/62 (27%), Positives = 25/62 (40%), Gaps = 6/62 (9%)

Query: 1   MFLLKARK-LETFDRELLIELLFLINMLAGIFVWSLASQWTGEILTNVSPYPTSVNMASV 59
           +  LK R+ L+  D  LL   LF   +   I V   A     E L+ + P   SV    +
Sbjct: 241 LTRLKPRRVLKKVDWGLL---LFFAGLF--ILVGGFAQGLPLEGLSRLLPLLDSVLSLFL 295

Query: 60  ST 61
            +
Sbjct: 296 VS 297


>gnl|CDD|178194 PLN02582, PLN02582, 1-deoxy-D-xylulose-5-phosphate synthase.
          Length = 677

 Score = 30.6 bits (69), Expect = 2.0
 Identities = 14/29 (48%), Positives = 17/29 (58%)

Query: 380 GISGFSCECPSGYQRIGQGHCLATISSGL 408
           G+SGF+    S Y   G GH   TIS+GL
Sbjct: 126 GLSGFTKRAESEYDCFGTGHSSTTISAGL 154


>gnl|CDD|214379 CHL00161, secY, preprotein translocase subunit SecY; Validated.
          Length = 417

 Score = 30.0 bits (68), Expect = 2.8
 Identities = 30/138 (21%), Positives = 50/138 (36%), Gaps = 24/138 (17%)

Query: 16  LLIELLFLINMLAGIFVW---------SLASQWTGEILTNVSPYPTSVNMASVSTRKGLT 66
           LL+  LF I ++  IF+          S         L + S  P  +N A         
Sbjct: 204 LLLLFLFFITLIGIIFLQEAIRKIPLISAKQLGKSSRLASNSYLPLRLNQA--------- 254

Query: 67  GVSVTRDTSYTMAILN--VEQDKQGIL---IFSMLQTRYFNLFYVIIFYVISLKKKKFYS 121
           GV      S  + +            L   +   LQ   F + Y+++++V+ L    FYS
Sbjct: 255 GVMPIIFASALLVLPGYLTNILLNQFLLPLLSLFLQLASFKILYLVLYFVLILFFSYFYS 314

Query: 122 VF-LDMDECKEEINGVAV 138
              L+  +  E +  +AV
Sbjct: 315 TIVLNPKDISENLQKMAV 332


>gnl|CDD|232977 TIGR00440, glnS, glutaminyl-tRNA synthetase.  This protein is a
           relatively rare aminoacyl-tRNA synthetase, found in the
           cytosolic compartment of eukaryotes, in E. coli and a
           number of other Gram-negative Bacteria, and in
           Deinococcus radiodurans. In contrast, the pathway to
           Gln-tRNA in mitochondria, Archaea, Gram-positive
           Bacteria, and a number of other lineages is by
           misacylation with Glu followed by transamidation to
           correct the aminoacylation to Gln. This enzyme is a
           class I tRNA synthetase (hit by the pfam model
           tRNA-synt_1c) and is quite closely related to
           glutamyl-tRNA synthetases [Protein synthesis, tRNA
           aminoacylation].
          Length = 522

 Score = 30.3 bits (68), Expect = 2.9
 Identities = 23/75 (30%), Positives = 35/75 (46%), Gaps = 6/75 (8%)

Query: 403 TISSGLYPGFRSYER----PNLGDIDECKEEINGVAVCPKPGTCINTMGSFRCLCPRGFK 458
           + SS      R Y+R    PN G  D+    IN  ++  K G   +++G    +  + F+
Sbjct: 434 SASSKYPTETRLYDRLFKVPNPGAPDDFLSVINPESLVIKQGFMEHSLGD--AVANKRFQ 491

Query: 459 LDRTGTFCLDQDECT 473
            +R G FCLD  E T
Sbjct: 492 FEREGYFCLDSKEST 506


>gnl|CDD|188093 TIGR00864, PCC, polycystin cation channel protein.  The Polycystin
           Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a
           huge protein of 4303aas. Its repeated leucine-rich (LRR)
           segment is found in many proteins. It contains 16
           polycystic kidney disease (PKD) domains, one
           LDL-receptor class A domain, one C-type lectin family
           domain, and 16-18 putative TMSs in positions between
           residues 2200 and 4100. Polycystin-L has been shown to
           be a cation (Na+, K+ and Ca2+) channel that is activated
           by Ca2+. Two members of the PCC family (polycystin 1 and
           2) are mutated in autosomal dominant polycystic kidney
           disease, and polycystin-L is deleted in mice with renal
           and retinal defects. Note: this model is restricted to
           the amino half for technical reasons.
          Length = 2740

 Score = 29.3 bits (65), Expect = 6.2
 Identities = 23/123 (18%), Positives = 33/123 (26%), Gaps = 32/123 (26%)

Query: 259 PGFELSSDGSGTCLD--SDECATGRHICQQICNLLGGYK-CGCPEGYVRHPFSNECIDDN 315
           P    S     +  +  + E     HI  QI  L       G P                
Sbjct: 494 PALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLRCRLPGAGGPA--------------- 538

Query: 316 ECARNP-CEPSSNTTGERGGKCFNTAGSYRCGCPDGFQFEAALQLCTQLSSSCLSSPCSF 374
            C     C P  N + +               C  G Q+     +C  L + C    C+ 
Sbjct: 539 -CGPEAECRPPDNRSADAPA------------CMKGEQWCPFAHICLPLDAPCHPQACAN 585

Query: 375 GCT 377
           GC+
Sbjct: 586 GCS 588


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.140    0.469 

Gapped
Lambda     K      H
   0.267   0.0632    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 23,320,799
Number of extensions: 2141487
Number of successful extensions: 1789
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1767
Number of HSP's successfully gapped: 100
Length of query: 483
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 382
Effective length of database: 6,457,848
Effective search space: 2466897936
Effective search space used: 2466897936
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.5 bits)