RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15592
         (294 letters)



>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
           40 amino acid long tandemly repeated sequence motif
           first identified in the Drosophila segment polarity gene
           armadillo; these repeats were also found in the
           mammalian armadillo homolog beta-catenin, the junctional
           plaque protein plakoglobin, the adenomatous polyposis
           coli (APC) tumor suppressor protein, and a number of
           other proteins. ARM has been implicated in mediating
           protein-protein interactions, but no common features
           among the target proteins recognized by the ARM repeats
           have been identified; related to the HEAT domain; three
           consecutive copies of the repeat are represented by this
           alignment model.
          Length = 120

 Score = 97.4 bits (243), Expect = 2e-25
 Identities = 41/113 (36%), Positives = 68/113 (60%), Gaps = 3/113 (2%)

Query: 38  EVISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNAC 97
            ++S LS+ +  ++  AA  L +L   ++ N Q     GG+P LV+LL  E  +V + A 
Sbjct: 11  ALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAAL 70

Query: 98  GALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSS 150
            ALRNL+ G   ++NK  +  AGG+P L+NLL  +++ ++++  TG L NL+S
Sbjct: 71  WALRNLAAG--PEDNKLIVLEAGGVPKLVNLLD-SSNEDIQKNATGALSNLAS 120



 Score = 75.8 bits (187), Expect = 3e-17
 Identities = 46/130 (35%), Positives = 68/130 (52%), Gaps = 18/130 (13%)

Query: 76  GGIPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADA 135
           GG+P LV LL     +V R A  AL NLS G  N++N +A+  AGG+P L+ LL K+ D 
Sbjct: 7   GGLPALVSLLSSSDENVQREAAWALSNLSAG--NNDNIQAVVEAGGLPALVQLL-KSEDE 63

Query: 136 EVKELVTGVLWNLSSCEDLKKSIIDD--GLQVVVNHIIIPHSGWDPVSAGETCWSTIFRN 193
           EV +     L NL++  +  K I+ +  G+  +VN  ++  S  D           I +N
Sbjct: 64  EVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVN--LLDSSNED-----------IQKN 110

Query: 194 TSGVLRNASS 203
            +G L N +S
Sbjct: 111 ATGALSNLAS 120



 Score = 38.1 bits (89), Expect = 0.001
 Identities = 31/141 (21%), Positives = 56/141 (39%), Gaps = 27/141 (19%)

Query: 115 AIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSSC-EDLKKSIID-DGLQVVVNHIII 172
           A+  AGG+P L++LL  + D  V+      L NLS+   D  +++++  GL  +V  +  
Sbjct: 2   AVIQAGGLPALVSLLSSS-DENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60

Query: 173 PHSGWDPVSAGETCWSTIFRNTSGVLRNASSAGEYARKKLRECEGL--IDSLLYVVKSAI 230
                            + +     LRN ++  E  +  + E  G+  + +LL      I
Sbjct: 61  ED-------------EEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDI 107

Query: 231 EKSNIGNKSVENCVCILRNLS 251
                     +N    L NL+
Sbjct: 108 ---------QKNATGALSNLA 119


>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats.  Approx. 40
           amino acid repeat. Tandem repeats form superhelix of
           helices that is proposed to mediate interaction of
           beta-catenin with its ligands. Involved in transducing
           the Wingless/Wnt signal. In plakoglobin arm repeats bind
           alpha-catenin and N-cadherin.
          Length = 41

 Score = 52.0 bits (126), Expect = 2e-09
 Identities = 20/41 (48%), Positives = 24/41 (58%)

Query: 65  DDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGALRNLSY 105
           DD NKQ     GG+P LV+LL  E  +V + A  AL NLS 
Sbjct: 1   DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41



 Score = 45.5 bits (109), Expect = 4e-07
 Identities = 20/42 (47%), Positives = 28/42 (66%), Gaps = 1/42 (2%)

Query: 109 NDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSS 150
           +DENK+A+ +AGG+P L+ LL+   D EV +     L NLSS
Sbjct: 1   DDENKQAVVDAGGLPALVELLKSE-DEEVVKEAAWALSNLSS 41


>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat.  Approx. 40
           amino acid repeat. Tandem repeats form super-helix of
           helices that is proposed to mediate interaction of
           beta-catenin with its ligands. CAUTION: This family does
           not contain all known armadillo repeats.
          Length = 41

 Score = 50.5 bits (122), Expect = 5e-09
 Identities = 16/40 (40%), Positives = 20/40 (50%)

Query: 65  DDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGALRNLS 104
              NKQ     G +PPLV+LL     +V   A  AL NL+
Sbjct: 1   SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNLA 40



 Score = 44.4 bits (106), Expect = 9e-07
 Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 109 NDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSS 150
           + ENK+A+  AG +P L+ LL  + D EV+E     L NL++
Sbjct: 1   SPENKQAVIEAGAVPPLVQLL-SSPDEEVQEEAAWALSNLAA 41


>gnl|CDD|215629 PLN03200, PLN03200, cellulose synthase-interactive protein;
           Provisional.
          Length = 2102

 Score = 55.9 bits (135), Expect = 1e-08
 Identities = 36/90 (40%), Positives = 49/90 (54%), Gaps = 3/90 (3%)

Query: 39  VISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHE-SPDVFRNAC 97
           ++  LS+ N+  +ANAA+ L  L    + +  K    G +  L+KLLG      V   A 
Sbjct: 194 LVKLLSSGNSDAQANAASLLARLMMAFESSISKVLDAGAVKQLLKLLGQGNEVSVRAEAA 253

Query: 98  GALRNLSYGRQNDENKRAIKNAGGIPLLIN 127
           GAL  LS   Q+ E K+AI +AGGIP LIN
Sbjct: 254 GALEALS--SQSKEAKQAIADAGGIPALIN 281



 Score = 47.0 bits (112), Expect = 9e-06
 Identities = 41/121 (33%), Positives = 54/121 (44%), Gaps = 12/121 (9%)

Query: 40  ISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRS---LGG-IPPLVKLLGHESPDVFRN 95
           +S L +     K NAAA L  LC  +D      R    LGG IPPL+ LL   S +  + 
Sbjct: 64  VSLLRSGTLGAKVNAAAVLGVLCKEED-----LRVKVLLGGCIPPLLSLLKSGSAEAQKA 118

Query: 96  ACGALRNLSYGRQNDENKRAI-KNAGGIPLLINLL--RKTADAEVKELVTGVLWNLSSCE 152
           A  A+  +S G  +D     I    G +P L + L      D  V+ L+TG L NL    
Sbjct: 119 AAEAIYAVSSGGLSDHVGSKIFSTEGVVPSLWDQLQPGNKQDKVVEGLLTGALRNLCGST 178

Query: 153 D 153
           D
Sbjct: 179 D 179



 Score = 45.1 bits (107), Expect = 3e-05
 Identities = 34/87 (39%), Positives = 45/87 (51%), Gaps = 7/87 (8%)

Query: 73  RSLG---GIPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLL 129
            +LG   G+  L+ LLG  S      A   L  L+   + DE+K AI  AGGIP L+ LL
Sbjct: 440 EALGGREGVQLLISLLGLSSEQQQEYAVALLAILT--DEVDESKWAITAAGGIPPLVQLL 497

Query: 130 RKTADAEVKELVTGVLWNL-SSCEDLK 155
            +T   + KE    VLWNL    ED++
Sbjct: 498 -ETGSQKAKEDSATVLWNLCCHSEDIR 523



 Score = 44.3 bits (105), Expect = 6e-05
 Identities = 37/142 (26%), Positives = 65/142 (45%), Gaps = 4/142 (2%)

Query: 34  PNLTEVISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVF 93
           P+L + +   +  + V++      L++LC   D     T   GG+  LVKLL   + D  
Sbjct: 147 PSLWDQLQPGNKQDKVVEGLLTGALRNLCGSTDGFWSATLEAGGVDILVKLLSSGNSDAQ 206

Query: 94  RNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSS-CE 152
            NA   L  L      + +   + +AG +  L+ LL +  +  V+    G L  LSS  +
Sbjct: 207 ANAASLLARLM--MAFESSISKVLDAGAVKQLLKLLGQGNEVSVRAEAAGALEALSSQSK 264

Query: 153 DLKKSIID-DGLQVVVNHIIIP 173
           + K++I D  G+  ++N  + P
Sbjct: 265 EAKQAIADAGGIPALINATVAP 286



 Score = 39.7 bits (93), Expect = 0.002
 Identities = 31/92 (33%), Positives = 46/92 (50%), Gaps = 4/92 (4%)

Query: 40  ISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGA 99
           IS L   +   +  A A L  L    D +K    + GGIPPLV+LL   S     ++   
Sbjct: 452 ISLLGLSSEQQQEYAVALLAILTDEVDESKWAITAAGGIPPLVQLLETGSQKAKEDSATV 511

Query: 100 LRNLSYGRQNDENKRA-IKNAGGIPLLINLLR 130
           L NL     + E+ RA +++AG +P L+ LL+
Sbjct: 512 LWNLC---CHSEDIRACVESAGAVPALLWLLK 540



 Score = 35.5 bits (82), Expect = 0.037
 Identities = 23/83 (27%), Positives = 37/83 (44%), Gaps = 6/83 (7%)

Query: 40   ISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGA 99
            I  L +P+  ++  AA  L HL   +   +  T     + PLV+L G     + + A  A
Sbjct: 1534 IPLLESPSQAVQQLAAELLSHLLAEEHFQQDITTQNA-VVPLVRLAGIGILSLQQRAVKA 1592

Query: 100  LRNLSYGRQNDENKRAIKNAGGI 122
            L ++S          A+ +AGGI
Sbjct: 1593 LESISLSWPK-----AVADAGGI 1610



 Score = 30.5 bits (69), Expect = 1.3
 Identities = 34/134 (25%), Positives = 53/134 (39%), Gaps = 18/134 (13%)

Query: 36  LTEVISFLSNPNNVIKANAAAYL-------QHLCYMDDPNKQKTRSLGGIPPLVKLLGHE 88
           L  +I  LS+     +  AA+ L       Q LC  +     +      I P +KLL + 
Sbjct: 611 LRTLIQLLSSSKEETQEKAASVLADIFSSRQDLC--ESLATDEI-----INPCIKLLTNN 663

Query: 89  SPDVFRNACGALRNLSYG-RQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWN 147
           +  V   +  AL  LS   ++N   K +      I  LI  L K++  EV E     L N
Sbjct: 664 TEAVATQSARALAALSRSIKEN--RKVSYAAEDAIKPLIK-LAKSSSIEVAEQAVCALAN 720

Query: 148 LSSCEDLKKSIIDD 161
           L S  ++    + +
Sbjct: 721 LLSDPEVAAEALAE 734


>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats.  This family includes multiple
           HEAT repeats.
          Length = 88

 Score = 41.2 bits (97), Expect = 4e-05
 Identities = 25/98 (25%), Positives = 40/98 (40%), Gaps = 24/98 (24%)

Query: 42  FLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGALR 101
            LS+P+  ++A AA  L  L   +            +P L++LL    P+V R A  AL 
Sbjct: 8   LLSDPDPEVRAAAARALGELGDPE-----------ALPALLELLKDPDPEVRRAAAEALG 56

Query: 102 NLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKE 139
                         + +   +P L+ LL+   DA V+ 
Sbjct: 57  K-------------LGDPEALPALLELLQDDDDAVVRA 81


>gnl|CDD|218281 pfam04826, Arm_2, Armadillo-like.  This domain contains
           armadillo-like repeats. Proteins containing this domain
           interact with numerous other proteins, through these
           interactions they are involved in a wide variety of
           processes including carcinogenesis, control of cellular
           ageing and survival, regulation of circadian rhythm and
           lysosomal sorting of G protein-coupled receptors.
          Length = 254

 Score = 36.2 bits (84), Expect = 0.012
 Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 18/86 (20%)

Query: 32  RDPNLTEVISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPD 91
            DP + E+         +   N+AAY          N+   R LGGIP +  LL + +P+
Sbjct: 25  EDPFIHEIAL-------ITLGNSAAY--------SFNQDIIRDLGGIPIIANLLSNPNPE 69

Query: 92  VFRNACGALRNLSYGRQNDENKRAIK 117
           +   A  AL NLS    N EN++ IK
Sbjct: 70  IKEKALNALNNLS---VNVENQKKIK 92


>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
           trafficking and secretion].
          Length = 526

 Score = 34.5 bits (79), Expect = 0.055
 Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 4/119 (3%)

Query: 42  FLSNPNNV-IKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGAL 100
            LS+  ++ +  NA   L +LC   +P    +     +P L KL+    P+V  +AC A+
Sbjct: 208 LLSSAIHISMLRNATWTLSNLCRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDACWAI 267

Query: 101 RNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLSSCEDLKKSII 159
             LS G    E  +A+ + G    L+ LL     A+++      + N+ +  D +  +I
Sbjct: 268 SYLSDGPN--EKIQAVLDVGIPGRLVELL-SHESAKIQTPALRSVGNIVTGSDDQTQVI 323



 Score = 33.7 bits (77), Expect = 0.11
 Identities = 25/96 (26%), Positives = 42/96 (43%), Gaps = 4/96 (4%)

Query: 55  AAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFR-NACGALRNLSYGRQNDENK 113
             + + L     P  Q     G +P  V+ +     D+ +  A  AL N++ G       
Sbjct: 93  YKFRKLLSKETSPPIQPVIDAGVVPRFVEFMDEIQRDMLQFEAAWALTNIASGTTQ--QT 150

Query: 114 RAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLS 149
           + + +AG +PL I LL  T D +V+E     L N++
Sbjct: 151 KVVVDAGAVPLFIQLLSSTED-DVREQAVWALGNIA 185



 Score = 30.6 bits (69), Expect = 0.90
 Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 3/91 (3%)

Query: 41  SFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGAL 100
           S LS+P   I+  A   + ++   +    Q       IPPL+ LL      + + AC A+
Sbjct: 334 SLLSSPKENIRKEACWTISNITAGNTEQIQAVIDANLIPPLIHLLSSAEYKIKKEACWAI 393

Query: 101 RNLSYG--RQNDENKRAIKNAGGIPLLINLL 129
            N + G   + D  +  +   G I  L +LL
Sbjct: 394 SNATSGGLNRPDIIRYLVSQ-GFIKPLCDLL 423


>gnl|CDD|202500 pfam02985, HEAT, HEAT repeat.  The HEAT repeat family is related to
           armadillo/beta-catenin-like repeats (see pfam00514).
          Length = 31

 Score = 29.4 bits (67), Expect = 0.17
 Identities = 10/27 (37%), Positives = 15/27 (55%)

Query: 78  IPPLVKLLGHESPDVFRNACGALRNLS 104
           +P L++LL    P+V   A  AL  L+
Sbjct: 2   LPLLLELLNDPDPEVREAAAEALGALA 28


>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat.  The HEAT repeat family is
           related to armadillo/beta-catenin-like repeats (see
           pfam00514). These EZ repeats are found in subunits of
           cyanobacterial phycocyanin lyase and other proteins and
           probably carry out a scaffolding role.
          Length = 55

 Score = 29.0 bits (65), Expect = 0.47
 Identities = 8/26 (30%), Positives = 13/26 (50%)

Query: 78  IPPLVKLLGHESPDVFRNACGALRNL 103
           +P L+ LL  +  +V   A  AL  +
Sbjct: 30  LPALLPLLKDDDDEVREAAAWALGRI 55


>gnl|CDD|147776 pfam05804, KAP, Kinesin-associated protein (KAP).  This family
           consists of several eukaryotic kinesin-associated (KAP)
           proteins. Kinesins are intracellular multimeric
           transport motor proteins that move cellular cargo on
           microtubule tracks. It has been shown that the sea
           urchin KRP85/95 holoenzyme associates with a KAP115
           non-motor protein, forming a heterotrimeric complex in
           vitro, called the Kinesin-II.
          Length = 708

 Score = 31.0 bits (70), Expect = 0.80
 Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 8/102 (7%)

Query: 57  YLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAI 116
           +L+ L   D+ NK +    G I  L+KL   +  D+       L NLS+   +   +  +
Sbjct: 309 FLKKLSIFDE-NKNEMEENGIIEKLLKLFPCQHEDLLNITLRLLFNLSF---DTGLRPKM 364

Query: 117 KNAGGIPLLINLLRKTADAEVKELVTGVLWNLSSCEDLKKSI 158
            N G +P L++LL    +     +   VL++L SC+D  KS+
Sbjct: 365 VNGGLLPKLVSLL---DNDNHHGIALCVLYHL-SCDDKAKSM 402


>gnl|CDD|227427 COG5096, COG5096, Vesicle coat complex, various subunits
           [Intracellular trafficking and secretion].
          Length = 757

 Score = 30.5 bits (69), Expect = 1.4
 Identities = 26/98 (26%), Positives = 37/98 (37%), Gaps = 17/98 (17%)

Query: 43  LSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGG--IPPLVKLLGHESPDVFRNACGAL 100
           L +PN  I+  A   L+ L  +          L G  I P+ KLL      V + A  A+
Sbjct: 101 LQDPNEEIRGFA---LRTLSLLRVKE------LLGNIIDPIKKLLTDPHAYVRKTAALAV 151

Query: 101 RNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVK 138
             L Y      +K      G I +L  L+  + D  V 
Sbjct: 152 AKL-YRL----DKDLYHELGLIDILKELVADS-DPIVI 183


>gnl|CDD|224286 COG1367, COG1367, CRISPR system related protein, RAMP superfamily
           [Defense mechanisms].
          Length = 393

 Score = 29.6 bits (66), Expect = 1.8
 Identities = 13/57 (22%), Positives = 25/57 (43%)

Query: 93  FRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEVKELVTGVLWNLS 149
            R     L++  YG     +++AI           ++ + A+ EVK+ +  +L  LS
Sbjct: 103 KRFTDDRLKDTKYGLYGLYDRKAIYEYDQEGKTFTIILQYANFEVKKEILSLLKLLS 159


>gnl|CDD|220789 pfam10508, Proteasom_PSMB, Proteasome non-ATPase 26S subunit.  The
           26S proteasome, a eukaryotic ATP-dependent, dumb-bell
           shaped, protease complex with a molecular mass of approx
           20kDa consists of a central 20S proteasome,functioning
           as a catalytic machine, and two large V-shaped terminal
           modules, having possible regulatory roles,composed of
           multiple subunits of 25- 110 kDa attached to the central
           portion in opposite orientations. It is responsible for
           degradation of abnormal intracellular proteins,
           including oxidatively damaged proteins, and may play a
           role as a component of a cellular anti-oxidative system.
           Expression of catalytic core subunits including PSMB5
           and peptidase activities of the proteasome were elevated
           following incubation with 3-methylcholanthrene. The 20S
           proteasome comprises a cylindrical stack of four rings,
           two outer rings formed by seven alpha-subunits
           (alpha1-alpha7) and two inner rings of seven
           beta-subunits (beta1-beta7). Two outer rings of alpha
           subunits maintain structure, while the central beta
           rings contain the proteolytic active core subunits beta1
           (PSMB6), beta2 (PSMB7), and beta5 (PSMB5). Expression of
           PSMB5 can be altered by chemical reactants, such as
           3-methylcholanthrene.
          Length = 504

 Score = 29.7 bits (67), Expect = 1.8
 Identities = 20/93 (21%), Positives = 36/93 (38%), Gaps = 7/93 (7%)

Query: 78  IPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEV 137
              L + L H +P V       L  +   +  +     +KN   + L+ + LR   D  V
Sbjct: 83  QVVLQEGLPHPTPAVRALVLKELLRIV--QSKEGALELLKNEELLSLIADCLRDE-DTSV 139

Query: 138 KELVTGVLWNLSS----CEDLKKSIIDDGLQVV 166
            E    +L  LS      ++L  S +   L+++
Sbjct: 140 GEAAINLLSKLSKSEPGLDELFDSNLQTKLKIL 172


>gnl|CDD|153216 cd01745, GATase1_2, Subgroup of proteins having the Type 1
           glutamine amidotransferase (GATase1) domain.  This group
           contains a subgroup of proteins having the Type 1
           glutamine amidotransferase (GATase1) domain. GATase
           activity catalyses the transfer of ammonia from the
           amide side chain of glutamine to an acceptor substrate.
           Glutamine amidotransferases (GATase) includes the triad
           family of amidotransferases which have a conserved
           Cys-His-Glu catalytic triad in the glutaminase active
           site. In this subgroup this triad is conserved. GATase
           activity can be found in a range of biosynthetic
           enzymes, including: glutamine amidotransferase,
           formylglycinamide ribonucleotide, GMP synthetase ,
           anthranilate synthase component II, glutamine-dependent
           carbamoyl phosphate synthase, cytidine triphosphate
           synthetase, gamma-glutamyl hydrolase, imidazole glycerol
           phosphate synthase and, cobyric acid synthase. Glutamine
           amidotransferase (GATase) domains can occur either as
           single polypeptides, as in glutamine amidotransferases,
           or as domains in a much larger multifunctional synthase
           protein, such as CPSase.
          Length = 189

 Score = 29.1 bits (66), Expect = 2.2
 Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 5/63 (7%)

Query: 93  FRNACGALRNLSYGRQNDENKRAIKNAGGIPLLI-NLLRKTADAEVKELVTGVLWNLSSC 151
            R   G      Y  Q      A++ AGG+P+L+  +  +    +  EL+ G+L  L+  
Sbjct: 7   LREEEGGYERRDYLNQY--YVDAVRKAGGLPVLLPPVDDEEDLEQYLELLDGLL--LTGG 62

Query: 152 EDL 154
            D+
Sbjct: 63  GDV 65


>gnl|CDD|236672 PRK10318, PRK10318, hypothetical protein; Provisional.
          Length = 121

 Score = 28.2 bits (63), Expect = 2.7
 Identities = 13/60 (21%), Positives = 18/60 (30%), Gaps = 2/60 (3%)

Query: 180 VSAGETCWSTIFRNTSGVLRNASSAGEYARKKLRECEGLIDSLLYVVKSAIEKSNIGNKS 239
               +    T  RN  G       A  + R KL      ID+    +      S+I  K 
Sbjct: 33  EGLAQKKDCTFVRN--GDEHTCKEAVSHLRLKLGNTRNRIDTAEQFIDKVASSSSISGKP 90


>gnl|CDD|217434 pfam03224, V-ATPase_H_N, V-ATPase subunit H.  The yeast
           Saccharomyces cerevisiae vacuolar H+-ATPase (V-ATPase)
           is a multisubunit complex responsible for acidifying
           organelles. It functions as an ATP dependent proton pump
           that transports protons across a lipid bilayer. This
           domain corresponds to the N terminal domain of the H
           subunit of V-ATPase. The N-terminal domain is required
           for the activation of the complex whereas the C-terminal
           domain is required for coupling ATP hydrolysis to proton
           translocation.
          Length = 312

 Score = 28.8 bits (65), Expect = 3.0
 Identities = 14/51 (27%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 122 IPLLINLLRKTADAEVKELVTGVLWNLSSCE--DLKKSIIDDGLQVVVNHI 170
           IPLL+++L+ +   +V  +    L NL S    D  + ++D+GL  ++ ++
Sbjct: 241 IPLLLDILKTSIKEKVVRVSLATLRNLLSKAVKDFIEVMVDNGLLKLLQNL 291


>gnl|CDD|130022 TIGR00947, 2A73, putative bicarbonate transporter, IctB family.
           This family of proteins is suggested to transport
           inorganic carbon (HCO3-), based on the phenotype of a
           mutant of IctB in Synechococcus sp. strain PCC 7942.
           Bicarbonate uptake is used by many photosynthetic
           organisms including cyanobacteria. These organisms are
           able to concentrate CO2/HCO3- against a greater than
           ten-fold concentration gradient. Cyanobacteria may have
           several such carriers operating with different
           efficiencies. Note that homology to various O-antigen
           ligases, with possible implications for mutant cell
           envelope structure, might allow alternatives to the
           interpretation of IctB as a bicarbonate transport
           protein [Transport and binding proteins, Carbohydrates,
           organic alcohols, and acids].
          Length = 425

 Score = 28.9 bits (65), Expect = 3.3
 Identities = 13/34 (38%), Positives = 15/34 (44%), Gaps = 10/34 (29%)

Query: 30  RWRDPN-----LTEVISFLSNPNNVIKANAAAYL 58
            W DP       T V S+L NPN +     A YL
Sbjct: 150 TWVDPESVLAGTTRVYSYLGNPNLL-----AGYL 178


>gnl|CDD|185766 cd09243, BRO1_Brox_like, Protein-interacting Bro1-like domain of
          human Brox1 and related proteins.  This family contains
          the Bro1-like domain of a single-domain protein, human
          Brox, and related domains. It belongs to the
          BRO1_Alix_like superfamily which also includes the
          Bro1-like domains of mammalian Alix (apoptosis-linked
          gene-2 interacting protein X), His-Domain type N23
          protein tyrosine phosphatase (HD-PTP, also known as
          PTPN23), RhoA-binding proteins Rhophilin-1 and -2, Bro1
          and Rim20 (also known as PalA) from Saccharomyces
          cerevisiae, Ustilago maydis Rim23 (also known as PalC),
          and related domains. Alix, HD-PTP, Brox, Bro1, Rim20,
          and Rim23, interact with the ESCRT (Endosomal Sorting
          Complexes Required for Transport) system. Bro1-like
          domains are boomerang-shaped, and part of the domain is
          a tetratricopeptide repeat (TPR)-like structure.
          Bro1-like domains bind components of the ESCRT-III
          complex: CHMP4 in the case of Brox. Human Brox can bind
          to human immunodeficiency virus type 1 (HIV-1)
          nucleocapsid. In addition to a Bro1-like domain, Brox
          also has a C-terminal thioester-linkage site for
          isoprenoid lipids (CaaX motif). This family lacks the
          V-shaped (V) domain found in many members of the
          BRO1_Alix_like superfamily.
          Length = 353

 Score = 28.8 bits (65), Expect = 3.3
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 2/46 (4%)

Query: 17 SPSLNKKCRNSSLRWRDPNLTEVISFLSNPNNVIKANAAAYLQHLC 62
          +P+ +K C  S LR     L E++S  SN  + +K    AYL  L 
Sbjct: 23 TPAASKLC--SDLRTARARLLELLSDPSNDVDTVKTAFNAYLSLLQ 66


>gnl|CDD|213993 cd07438, PHP_HisPPase_AMP, Polymerase and Histidinol Phosphatase
           domain of Histidinol phosphate phosphatase (HisPPase)
           AMP bound.  The PHP domain of this HisPPase family has
           an unknown function. It has a second domain inserted in
           the middle that binds adenosine 5-monophosphate (AMP).
           The PHP (also called histidinol phosphatase-2/HIS2)
           domain is associated with several types of DNA
           polymerases, such as PolIIIA and family X DNA
           polymerases, stand alone histidinol phosphate
           phosphatases (HisPPases), and a number of
           uncharacterized protein families. HisPPase catalyzes the
           eighth step of histidine biosynthesis, in which
           L-histidinol phosphate undergoes dephosphorylation to
           give histidinol. The PHP domain has four conserved
           sequence motifs and contains an invariant histidine that
           is involved in metal ion coordination. The PHP domain of
           HisPPase is structurally homologous to the other members
           of the PHP family that have a distorted (beta/alpha)7
           barrel fold with a trinuclear metal site on the
           C-terminal side of the barrel.
          Length = 155

 Score = 27.7 bits (63), Expect = 4.1
 Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 5/33 (15%)

Query: 114 RAIKNAGGI-----PLLINLLRKTADAEVKELV 141
             I  AGG+     P L  L RK  +  ++EL 
Sbjct: 82  ELIHAAGGVAVLAHPGLYKLSRKKLEELIEELK 114


>gnl|CDD|240521 cd13394, Syo1_like, Fungal symportin 1 (syo1) and similar proteins.
            This family of eukaryotic proteins includes
           Saccharomyces cerevisiae Ydl063c and Chaetomium
           thermophilum Syo1, which mediate the co-import of two
           ribosomal proteins, Rpl5 and Rpl11 (which both interact
           with 5S rRNA) into the nucleus. Import precedes their
           association with rRNA and subsequent ribosome assembly
           in the nucleolus. The primary structure of syo1 is a
           mixture of Armadillo- (ARM, N-terminal part of syo1) and
           HEAT-repeats (C-terminal part of syo1).
          Length = 597

 Score = 28.8 bits (65), Expect = 4.5
 Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 3/75 (4%)

Query: 78  IPPLVKLLGHESPDVFRNACGALRNLSYGRQNDENKRAIKNAGGIPLLINLLRKTADAEV 137
           I PL++ L   S +    A GA+ NL    ++   ++ +     + +LI  L   ++ EV
Sbjct: 39  ILPLLEKLQSPSANDRSMALGAISNLI---EDPSMRKLLLKEKLVQILIERLLDDSNTEV 95

Query: 138 KELVTGVLWNLSSCE 152
                G L NL+  E
Sbjct: 96  VVEAFGALRNLAIEE 110


>gnl|CDD|224982 COG2071, COG2071, Predicted glutamine amidotransferases [General
           function prediction only].
          Length = 243

 Score = 28.1 bits (63), Expect = 4.6
 Identities = 13/33 (39%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 114 RAIKNAGGIPLLINLLRKTADA-EVKELVTGVL 145
            AI  AGGIP+L+  L    DA +  +L+ G++
Sbjct: 33  DAIIKAGGIPILLPALEDPEDARQYLDLIDGLI 65


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 28.2 bits (63), Expect = 4.7
 Identities = 18/50 (36%), Positives = 22/50 (44%), Gaps = 10/50 (20%)

Query: 215 CEGLIDSLLYVVKSAIEKSNIGNKSVENCVCILRNLSFRCQEVEDPNYDK 264
           CE  +D LL+V  S          S   C   LR  +FR Q  EDP  +K
Sbjct: 31  CESCVD-LLFVRGSG---------SCPECDTPLRKNNFRVQLFEDPTVEK 70


>gnl|CDD|219535 pfam07722, Peptidase_C26, Peptidase C26.  These peptidases have
           gamma-glutamyl hydrolase activity; that is they catalyze
           the cleavage of the gamma-glutamyl bond in
           poly-gamma-glutamyl substrates. They are structurally
           related to pfam00117, but contain extensions in four
           loops and at the C terminus.
          Length = 219

 Score = 28.0 bits (63), Expect = 4.7
 Identities = 13/33 (39%), Positives = 22/33 (66%), Gaps = 1/33 (3%)

Query: 114 RAIKNAGGIPLLINLLRKTADAEVK-ELVTGVL 145
           +A++ AGG+P+L+ +L    D E   +LV G+L
Sbjct: 31  KAVEGAGGLPVLLPILEDPEDLEALLDLVDGLL 63


>gnl|CDD|204007 pfam08625, Utp13, Utp13 specific WD40 associated domain.  Utp13 is
           a component of the five protein Pwp2 complex that forms
           part of a stable particle subunit independent of the U3
           small nucleolar ribonucleoprotein that is essential for
           the initial assembly steps of the 90S pre-ribosome. Pwp2
           is capable of interacting directly with the 35 S
           pre-rRNA 5' end.
          Length = 138

 Score = 27.1 bits (61), Expect = 7.4
 Identities = 18/55 (32%), Positives = 25/55 (45%), Gaps = 4/55 (7%)

Query: 198 LRNASSAGEYARKKLRECEGL--IDSLLYVVKSAIEKSN-IGNKSVENCVCILRN 249
           L N    G+Y  + L     L     LL V+KS IE  + IG+  +E  +  LR 
Sbjct: 4   LSNLLREGDY-SEALLLALTLDHPHRLLNVLKSVIEDEDSIGSDELEETIGRLRK 57


>gnl|CDD|185056 PRK15101, PRK15101, protease3; Provisional.
          Length = 961

 Score = 27.6 bits (62), Expect = 8.9
 Identities = 11/22 (50%), Positives = 13/22 (59%)

Query: 40  ISFLSNPNNVIKANAAAYLQHL 61
           ISF +N NN +  NA  Y Q L
Sbjct: 592 ISFSTNANNGLMVNANGYTQRL 613


>gnl|CDD|218698 pfam05695, DUF825, Plant protein of unknown function (DUF825).  This
            family consists of several plant proteins greater than
            1000 residues in length. The function of this family is
            unknown.
          Length = 1127

 Score = 27.9 bits (62), Expect = 9.1
 Identities = 13/42 (30%), Positives = 23/42 (54%), Gaps = 3/42 (7%)

Query: 19   SLNKKCRNSSLRWRDPNLTEVISFLSNPNNVIKANAAAYLQH 60
            SLN+KC+N ++   D  L   ISF+ N   +  + +  ++ H
Sbjct: 1087 SLNRKCKNMNINGIDEIL---ISFIPNEKRISFSESTRHVSH 1125


>gnl|CDD|224331 COG1413, COG1413, FOG: HEAT repeat [Energy production and
           conversion].
          Length = 335

 Score = 27.6 bits (61), Expect = 9.6
 Identities = 22/119 (18%), Positives = 41/119 (34%), Gaps = 30/119 (25%)

Query: 32  RDPNLTEVISFLSNPNNVIKANAAAYLQHLCYMDDPNKQKTRSLGGIPPLVKLLGHESPD 91
                 E++  L + + +++ +AA  L  L         +      +P L +LL  E P 
Sbjct: 41  APEAADELLKLLEDEDLLVRLSAAVALGEL-------GSEEA----VPLLRELLSDEDPR 89

Query: 92  VFRNA---------CGALRNLSYGRQNDENK----------RAIKNAGGIPLLINLLRK 131
           V   A           A+  L    +NDEN+            + +   +  L+  L+ 
Sbjct: 90  VRDAAADALGELGDPEAVPPLVELLENDENEGVRAAAARALGKLGDERALDPLLEALQD 148


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.133    0.400 

Gapped
Lambda     K      H
   0.267   0.0877    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,607,246
Number of extensions: 1354505
Number of successful extensions: 1121
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1105
Number of HSP's successfully gapped: 45
Length of query: 294
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 198
Effective length of database: 6,679,618
Effective search space: 1322564364
Effective search space used: 1322564364
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.2 bits)