RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3460
         (153 letters)



>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper. 
          Length = 65

 Score = 57.2 bits (139), Expect = 5e-12
 Identities = 26/65 (40%), Positives = 40/65 (61%)

Query: 37  EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLV 96
           E+  K+ RR+ +N+ +A+ SR +KK  ++ LERKVE L++EN   KK I  L      L 
Sbjct: 1   EEDEKRRRRRERNREAARRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLK 60

Query: 97  SQLNK 101
           S+L +
Sbjct: 61  SELEE 65


>gnl|CDD|201054 pfam00170, bZIP_1, bZIP transcription factor.  The Pfam entry
          includes the basic region and the leucine zipper
          region.
          Length = 64

 Score = 55.4 bits (134), Expect = 2e-11
 Identities = 24/63 (38%), Positives = 39/63 (61%)

Query: 37 EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLV 96
          EK LK+ RR+ KN+ +A+ SR +KK Y++ LE KV+ L++EN   +  +  L+     L 
Sbjct: 1  EKELKRERRRQKNREAARRSRLRKKAYIEELEEKVKELEAENKTLRSELERLKKECAKLK 60

Query: 97 SQL 99
          S+ 
Sbjct: 61 SEN 63


>gnl|CDD|213400 cd12193, B_zip1, basic leucine zipper DNA-binding and
          multimerization region of GCN4 and related proteins.
          Basic leucine zipper (bZIP) transcription factors act
          in networks of homo- and hetero-dimers in the
          regulation in a diverse set of cellular pathways.
          Classical leucine zippers have alpha helices with
          leucine residues 7 amino acids apart, which stabilize
          dimerization with a parallel leucine zipper domain.
          Dimerization creates a pair of basic regions that bind
          DNA and undergo conformational change. GCN4 was
          identified in Saccharomyces cerevisiae from mutations
          in a deficiency in activation with the general amino
          acid control pathway. GCN4 encodes a trans-activator of
          amino acid biosynthetic genes containing 2 acidic
          activation domains and a C-terminal bZIP domain,
          comprised of a basic alpha-helical DNA-binding region
          and a coiled-coil dimerization region.
          Length = 54

 Score = 44.2 bits (105), Expect = 3e-07
 Identities = 19/51 (37%), Positives = 30/51 (58%)

Query: 40 LKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLED 90
            + R++ +N+ +A+ SR +K E M  LE KVE L SEN D +  +  L+ 
Sbjct: 1  DPRARKRARNREAARRSRARKMERMKQLEDKVEELTSENSDLEDEVDRLKK 51


>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper. 
          Length = 54

 Score = 36.4 bits (85), Expect = 3e-04
 Identities = 17/53 (32%), Positives = 31/53 (58%)

Query: 38 KSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLED 90
          K  +   R+ +N  +A+ SR KKK+  + LE +V+ L+ EN   ++++  LE 
Sbjct: 1  KDDEYRDRRRRNNEAARRSREKKKQREEELEERVKELEEENAQLRQKVEQLEK 53


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 33.4 bits (77), Expect = 0.039
 Identities = 20/76 (26%), Positives = 39/76 (51%), Gaps = 5/76 (6%)

Query: 42  KIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNK 101
           K RR   N++   E R  ++E  + L+RK+E L  +  + +K+   L +   +L  +  +
Sbjct: 72  KERR---NELQRLERRLLQRE--ETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEE 126

Query: 102 LQAIIARHTAQLSNVS 117
           L+ +IA    +L  +S
Sbjct: 127 LEELIAEQREELERIS 142


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 32.0 bits (73), Expect = 0.11
 Identities = 29/127 (22%), Positives = 48/127 (37%), Gaps = 19/127 (14%)

Query: 36   EEKSLKKIRRKIKNKISAQESRRKKK--EYMDALERKVEILKSENCDYKKRISNLEDTNY 93
            E  S++ ++    NK+   ES+ K+   EY D L+ K  IL  E     K  S L++   
Sbjct: 944  EGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLK-KSTILVRE---GNKANSELKNFKK 999

Query: 94   SL-------------VSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAIIARHTAQL 140
             L               QL +L   +A   +    +S   + +S L  LQ +      + 
Sbjct: 1000 ELAELSKQYGALQESTKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLEN 1059

Query: 141  SNNQNNL 147
            +  Q   
Sbjct: 1060 NQLQARY 1066


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 31.2 bits (71), Expect = 0.18
 Identities = 24/103 (23%), Positives = 53/103 (51%), Gaps = 1/103 (0%)

Query: 12  TEEEKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQESRRKK-KEYMDALERK 70
            EEE ++L+ E   +   L   + + + L++   ++K +++A E   ++ +  ++ LE +
Sbjct: 686 LEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEE 745

Query: 71  VEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQL 113
           +E L+ E  + ++R+  LE+   SL   L KL+  I     + 
Sbjct: 746 LEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKR 788



 Score = 27.8 bits (62), Expect = 2.7
 Identities = 21/134 (15%), Positives = 50/134 (37%), Gaps = 11/134 (8%)

Query: 13  EEEKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQES-----RRKKKEYMDAL 67
           E E   L+     +   L   +   + LK+    +K ++  +E+      +   E  +A 
Sbjct: 301 EGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAK 360

Query: 68  ERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLN 127
           E   E L +   + ++    L +    L ++L +++  +     ++       SL  +L 
Sbjct: 361 EELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIE------SLEERLE 414

Query: 128 KLQAIIARHTAQLS 141
           +L   +     +L 
Sbjct: 415 RLSERLEDLKEELK 428


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 30.8 bits (70), Expect = 0.24
 Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 10/77 (12%)

Query: 38  KSLKKIRRKIKNKISAQESRRKKKEY----------MDALERKVEILKSENCDYKKRISN 87
             L +   K+K +   +E    ++E           +  LE  VE L+ EN + K+ +  
Sbjct: 388 YPLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEE 447

Query: 88  LEDTNYSLVSQLNKLQA 104
           L+     L S+L + + 
Sbjct: 448 LKREIEKLESELERFRR 464



 Score = 29.3 bits (66), Expect = 0.88
 Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 3/71 (4%)

Query: 23  GYPIPTRLPLTKTEEKSLKKIR--RKIKNKISAQESRRKKKEYMD-ALERKVEILKSENC 79
           GYP+   L   K EE+  +K     + + +I+  E R KK E     LE +   LK E  
Sbjct: 387 GYPLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELE 446

Query: 80  DYKKRISNLED 90
           + K+ I  LE 
Sbjct: 447 ELKREIEKLES 457


>gnl|CDD|224520 COG1604, COG1604, CRISPR system related protein, RAMP superfamily
          [Defense    mechanisms].
          Length = 257

 Score = 30.5 bits (69), Expect = 0.27
 Identities = 13/60 (21%), Positives = 23/60 (38%)

Query: 36 EEKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSL 95
          E   L K +    +K +A E    K    +      ++LK     Y++ + N + T  S 
Sbjct: 19 EYNPLLKEQIINLDKKNAFEEEIIKDFVNEIQSYLSKVLKRLGRSYEQYVKNTKITLNSF 78


>gnl|CDD|185098 PRK15176, PRK15176, Vi polysaccharide export inner membrane protein
           VexB; Provisional.
          Length = 264

 Score = 29.2 bits (65), Expect = 0.77
 Identities = 16/49 (32%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 91  TNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAIIARHTAQ 139
           T+ ++VSQ+ K   I +    +  N  I+Y ++S +N L  I +R  AQ
Sbjct: 100 TSLAVVSQVKKRHVIFSLAAIEFVNAVIIYIIISLINFL--IFSRWEAQ 146


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 29.3 bits (66), Expect = 0.81
 Identities = 27/111 (24%), Positives = 59/111 (53%), Gaps = 14/111 (12%)

Query: 36  EEKSLKKIRRKIKNKISAQESRRK-----KKEYMDALERKVEILKSENCDYKKRISNLED 90
             + + +I ++I  ++  +E + K      +E + +LE+++E +KSE  + + RI  LE+
Sbjct: 714 ASRKIGEIEKEI-EQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEE 772

Query: 91  TNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAIIARHTAQLS 141
             + L   LN L+A ++         S +  + ++L+KL+  ++R  A+L 
Sbjct: 773 DLHKLEEALNDLEARLSH--------SRIPEIQAELSKLEEEVSRIEARLR 815



 Score = 28.9 bits (65), Expect = 1.1
 Identities = 30/149 (20%), Positives = 55/149 (36%), Gaps = 23/149 (15%)

Query: 8   VLHLTEEEKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQESRRKKKEY---- 63
            LH  EE    L+A      +R+P  + E   L++   +I+ ++   E +  +       
Sbjct: 773 DLHKLEEALNDLEAR--LSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEY 830

Query: 64  ----MDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIV 119
               +  L+ +   LK +    +K I NL      L  +L +L+A              +
Sbjct: 831 LEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEA-------------AL 877

Query: 120 YSLVSQLNKLQAIIARHTAQLSNNQNNLR 148
             L S+L  L+       AQL   +  + 
Sbjct: 878 RDLESRLGDLKKERDELEAQLRELERKIE 906



 Score = 28.9 bits (65), Expect = 1.1
 Identities = 20/109 (18%), Positives = 48/109 (44%), Gaps = 16/109 (14%)

Query: 37  EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVE-------ILKSENCDYKKRISNLE 89
           E+  ++I  + K +    E   + KE ++ L  ++E         + E  DY++++  L+
Sbjct: 339 EELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLK 398

Query: 90  DTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAIIARHTA 138
                L  +L++LQ  + R + +L++++           +  I A+   
Sbjct: 399 REINELKRELDRLQEELQRLSEELADLN---------AAIAGIEAKINE 438


>gnl|CDD|233922 TIGR02550, flagell_flgL, flagellar hook-associated protein 3.  This
           protein family consists of flagellar hook-associated
           proteins designated FlgL (or HAP3) encoded in bacterial
           flagellar operons. A N-terminal region of about 150
           residues and a C-terminal region of about 85 residues
           are conserved. Members show considerable length
           heterogeneity between these two well-conserved terminal
           regions; the seed alignment 486 columns, while members
           of this family are from 287 to over 500 residues in
           length. This model distinguishes FlgL from the flagellin
           gene product FliC [Cellular processes, Chemotaxis and
           motility].
          Length = 267

 Score = 28.4 bits (64), Expect = 1.3
 Identities = 11/42 (26%), Positives = 19/42 (45%), Gaps = 1/42 (2%)

Query: 64  MDALERKVEILKSENCDYKKRISNLEDTNY-SLVSQLNKLQA 104
           ++ LE     L  +     + +S+LED +    +S LN  Q 
Sbjct: 205 LNRLENLENRLSEQKLQLTEVLSDLEDVDLAEAISDLNMQQT 246


>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
          Length = 390

 Score = 28.4 bits (64), Expect = 1.4
 Identities = 20/81 (24%), Positives = 41/81 (50%), Gaps = 10/81 (12%)

Query: 37  EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLV 96
           EK L ++ +K K K      ++++K  +D L++ +E L+       K + + + T+   +
Sbjct: 160 EKRLDELTKKRKKKKK----KKEEKVELDVLKKVLEWLEE-----GKPVRDGDWTD-KEI 209

Query: 97  SQLNKLQAIIARHTAQLSNVS 117
             LN+ Q + A+    L N+S
Sbjct: 210 EILNEYQLLTAKPMIYLVNMS 230


>gnl|CDD|99913 cd05530, POLBc_B1, DNA polymerase type-B B1 subfamily catalytic
           domain. Archaeal proteins that are involved in DNA
           replication are similar to those from eukaryotes. Some
           archaeal members also possess multiple family B DNA
           polymerases (B1, B2 and B3). So far there is no specific
           function(s) has been assigned for different members of
           the archaea type B DNA polymerases. Phylogenetic
           analyses of eubacterial, archaeal, and eukaryotic family
           B DNA polymerases are support independent gene
           duplications during the evolution of archaeal and
           eukaryotic family B DNA polymerases.
          Length = 372

 Score = 28.5 bits (64), Expect = 1.4
 Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 2/44 (4%)

Query: 41  KKIRRKIKNKISAQESRRKKKEY-MDALERKVEILKSENCDYKK 83
           +K R KI++ +     R KKKEY +D L  KV + K    +Y K
Sbjct: 264 EKAREKIRDIVKGVYKRLKKKEYTLDQLAFKVMLSKPPE-EYTK 306


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 28.1 bits (63), Expect = 1.6
 Identities = 18/103 (17%), Positives = 39/103 (37%), Gaps = 5/103 (4%)

Query: 15  EKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQESR-RKKKEYMD----ALER 69
           E+    A          L K E    K+   +I+ +IS  +    +K+E ++    AL +
Sbjct: 44  EEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQ 103

Query: 70  KVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQ 112
           +   L S +   +KR ++  +     + +       +    A+
Sbjct: 104 RRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNALHSLLAE 146



 Score = 26.9 bits (60), Expect = 5.1
 Identities = 25/123 (20%), Positives = 48/123 (39%), Gaps = 5/123 (4%)

Query: 28  TRLPLTKTEEKSLKKIRRKIKNKISA---QESRRKKKEYMDALERKVEILKSENCDYKKR 84
             L   + +   L     ++K K+       +    K   D L+ +V   K      + R
Sbjct: 20  NSLYELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRAR 79

Query: 85  ISNLEDTNYSLVSQLNKLQAIIA--RHTAQLSNVSIVYSLVSQLNKLQAIIARHTAQLSN 142
           IS L++       ++ +L+  +A  R     ++  +     SQL KLQ  I R  ++L+ 
Sbjct: 80  ISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNA 139

Query: 143 NQN 145
             +
Sbjct: 140 LHS 142


>gnl|CDD|216597 pfam01601, Corona_S2, Coronavirus S2 glycoprotein.  The coronavirus
           spike glycoprotein forms the characteristic 'corona'
           after which the group is named. The Spike glycoprotein
           is translated as a large polypeptide that is
           subsequently cleaved to S1 pfam01600 and S2.
          Length = 609

 Score = 28.0 bits (63), Expect = 2.1
 Identities = 17/70 (24%), Positives = 33/70 (47%), Gaps = 10/70 (14%)

Query: 64  MDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLV 123
            D L+   +++ +    + K + N++D   +  S L+K+Q ++ +    LS       L 
Sbjct: 243 TDVLQENQKLIANS---FNKALGNIQDGFTTTASALSKIQDVVNQQGQALSQ------LT 293

Query: 124 SQL-NKLQAI 132
           +QL N   AI
Sbjct: 294 NQLSNNFGAI 303


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 28.0 bits (63), Expect = 2.2
 Identities = 17/93 (18%), Positives = 35/93 (37%), Gaps = 8/93 (8%)

Query: 61  KEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVY 120
           KE +  LE +++ L+ E    KK +  L       +    + Q      T +        
Sbjct: 333 KEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKE-------- 384

Query: 121 SLVSQLNKLQAIIARHTAQLSNNQNNLRVGVAK 153
            L  +L +L+  +     +L +  +  ++ V K
Sbjct: 385 KLSEELEELEEELKELKEELESLYSEGKISVNK 417


>gnl|CDD|235613 PRK05802, PRK05802, hypothetical protein; Provisional.
          Length = 320

 Score = 27.6 bits (62), Expect = 2.3
 Identities = 13/37 (35%), Positives = 17/37 (45%)

Query: 97  SQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAII 133
           S  N    +IAR   Q   V ++  L S  NK+  II
Sbjct: 169 STKNGKSLVIARGIGQAPGVPVIKKLYSNGNKIIVII 205


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 27.9 bits (63), Expect = 2.4
 Identities = 13/48 (27%), Positives = 20/48 (41%)

Query: 55  ESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKL 102
           E     +E    LE ++  L+ +  D +     LE     L S+L KL
Sbjct: 1   ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKL 48


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 28.0 bits (63), Expect = 2.5
 Identities = 15/95 (15%), Positives = 38/95 (40%), Gaps = 8/95 (8%)

Query: 36  EEKSLKKIRRKIKNKISAQESRRKKKEYMDA------LERKVEILKSENC--DYKKRISN 87
           EE S +++R+         E+  K + Y+         ++KV +   E    D ++ +  
Sbjct: 38  EELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEK 97

Query: 88  LEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSL 122
           +E     L  ++++L+  I     ++  +    + 
Sbjct: 98  IEKEIKELEEEISELENEIKELEQEIERLEPWGNF 132


>gnl|CDD|219955 pfam08657, DASH_Spc34, DASH complex subunit Spc34.  The DASH
           complex is a ~10 subunit microtubule-binding complex
           that is transferred to the kinetochore prior to mitosis.
           In Saccharomyces cerevisiae DASH forms both rings and
           spiral structures on microtubules in vitro. Components
           of the DASH complex, including Dam1, Duo1, Spc34, Dad1
           and Ask1, are essential and connect the centromere to
           the plus end of spindle microtubules.
          Length = 255

 Score = 27.5 bits (61), Expect = 2.8
 Identities = 12/42 (28%), Positives = 20/42 (47%)

Query: 80  DYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYS 121
             +++I  L D    L   + +L+A IA    QL  ++I  S
Sbjct: 176 GAREKIETLRDRYNELSESIAELEAEIAEQKRQLEIMNISSS 217


>gnl|CDD|190534 pfam03131, bZIP_Maf, bZIP Maf transcription factor.  Maf
          transcription factors contain a conserved basic region
          leucine zipper (bZIP) domain, which mediates their
          dimerisation and DNA binding property. Thus, this
          family is probably related to pfam00170.
          Length = 93

 Score = 26.6 bits (59), Expect = 3.1
 Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 7/54 (12%)

Query: 32 LTKTEEKSLKKIRRKIKNKISAQESRRKKKEYMDALE-------RKVEILKSEN 78
          L+K E   LK+ RR +KN+  AQ  R K+ +    LE       ++VE LK E 
Sbjct: 21 LSKEEVIRLKQRRRTLKNRGYAQSCRSKRVQQRHELEKEKSQLQQQVEQLKQEV 74


>gnl|CDD|236345 PRK08870, flgL, flagellar hook-associated protein FlgL; Reviewed.
          Length = 404

 Score = 27.2 bits (61), Expect = 3.4
 Identities = 14/65 (21%), Positives = 26/65 (40%), Gaps = 8/65 (12%)

Query: 64  MDALERKVEILKSENCDYKKRISNLEDTNY-SLVSQLNKLQAIIARHTAQLSNVSIVYSL 122
           ++ L+    + +         +S+LED +Y S +S+    QA        L      +S 
Sbjct: 342 LNELDSAEAVHEDNKLQNTSALSDLEDLDYASAISEFELQQAA-------LQAAQQAFSK 394

Query: 123 VSQLN 127
           V  L+
Sbjct: 395 VQSLS 399


>gnl|CDD|133212 cd05081, PTKc_Jak2_Jak3_rpt2, Catalytic (repeat 2) domain of the
          Protein Tyrosine Kinases, Janus kinases 2 and 3.
          Protein Tyrosine Kinase (PTK) family; Janus kinase 2
          (Jak2) and Jak3; catalytic (c) domain (repeat 2). The
          PTKc family is part of a larger superfamily that
          includes the catalytic domains of other kinases such as
          protein serine/threonine kinases, RIO kinases, and
          phosphoinositide 3-kinase (PI3K). PTKs catalyze the
          transfer of the gamma-phosphoryl group from ATP to
          tyrosine (tyr) residues in protein substrates. Jak2 and
          Jak3 are members of the Janus kinase (Jak) subfamily of
          proteins, which are cytoplasmic (or nonreceptor) tyr
          kinases containing an N-terminal FERM domain, followed
          by a Src homology 2 (SH2) domain, a pseudokinase
          domain, and a C-terminal catalytic tyr kinase domain.
          Jaks are crucial for cytokine receptor signaling. They
          are activated by autophosphorylation upon
          cytokine-induced receptor aggregation, and subsequently
          trigger downstream signaling events such as the
          phosphorylation of signal transducers and activators of
          transcription (STATs). Jak2 is widely expressed in many
          tissues while Jak3 is expressed only in hematopoietic
          cells. Jak2 is essential for the signaling of
          hormone-like cytokines such as growth hormone,
          erythropoietin, thrombopoietin, and prolactin, as well
          as some IFNs and cytokines that signal through the IL-3
          and gp130 receptors. Jak3 binds the shared receptor
          subunit common gamma chain and thus, is essential in
          the signaling of cytokines that use it such as IL-2,
          IL-4, IL-7, IL-9, IL-15, and IL-21. Disruption of Jak2
          in mice results in an embryonic lethal phenotype with
          multiple defects including erythropoietic and cardiac
          abnormalities. It is the only Jak gene that results in
          a lethal phenotype when disrupted in mice. A mutation
          in the pseudokinase domain of Jak2, V617F, is present
          in many myeloproliferative diseases, including almost
          all patients with polycythemia vera, and 50% of
          patients with essential thrombocytosis and
          myelofibrosis. Jak3 is important in lymphoid
          development and myeloid cell differentiation.
          Inactivating mutations in Jak3 have been reported in
          humans with severe combined immunodeficiency (SCID).
          Length = 284

 Score = 27.0 bits (60), Expect = 4.6
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 60 KKEYMDALERKVEILKSENCDY 81
            E++   ER++EILKS   D 
Sbjct: 45 TAEHLRDFEREIEILKSLQHDN 66


>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
          Length = 1047

 Score = 27.1 bits (60), Expect = 4.9
 Identities = 16/80 (20%), Positives = 32/80 (40%), Gaps = 2/80 (2%)

Query: 27  PTRLPLTKTEEKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYK--KR 84
           P R  L     + + + +R  + +++ Q   +++ +   AL    +  K +       K 
Sbjct: 423 PLRQRLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKT 482

Query: 85  ISNLEDTNYSLVSQLNKLQA 104
           I   E     L +Q  +LQA
Sbjct: 483 ICEQEARIKDLEAQRAQLQA 502


>gnl|CDD|221268 pfam11853, DUF3373, Protein of unknown function (DUF3373).  This
          family of proteins are functionally uncharacterized.
          This protein is found in bacteria. Proteins in this
          family are typically between 472 to 574 amino acids in
          length.
          Length = 485

 Score = 27.1 bits (60), Expect = 5.0
 Identities = 10/42 (23%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 53 AQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYS 94
          A     +K   ++AL++++  LK++  D +KR+   E  +  
Sbjct: 25 ADIDLLQK---IEALKKELAELKAQLKDLQKRVDKTEKKSAG 63


>gnl|CDD|131884 TIGR02837, spore_II_R, stage II sporulation protein R.  A
           comparative genome analysis of all sequenced genomes of
           shows a number of proteins conserved strictly among the
           endospore-forming subset of the Firmicutes. This
           protein, a member of this panel, is designated stage II
           sporulation protein R [Cellular processes, Sporulation
           and germination].
          Length = 168

 Score = 26.6 bits (59), Expect = 5.1
 Identities = 18/48 (37%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 40  LKKIRRKIKNKISAQESRRKKKEYMDALERKVE-ILKSENCDYKKRIS 86
           LK+IR  +    S +E+RR  +E +  +ER  E ++K+E  DYK R+ 
Sbjct: 65  LKEIRPWLSGLKSLEEARRVIRENLPEIERIAESVIKAEGADYKVRVE 112


>gnl|CDD|220388 pfam09766, FimP, Fms-interacting protein.  This entry carries part
           of the crucial 144 N-terminal residues of the FmiP
           protein, which is essential for the binding of the
           protein to the cytoplasmic domain of activated
           Fms-molecules in M-CSF induced haematopoietic
           differentiation of macrophages. The C-terminus contains
           a putative nuclear localisation sequence and a leucine
           zipper which suggest further, as yet unknown, nuclear
           functions. The level of FMIP expression might form a
           threshold that determines whether cells differentiate
           into macrophages or into granulocytes.
          Length = 352

 Score = 26.9 bits (60), Expect = 5.3
 Identities = 20/96 (20%), Positives = 37/96 (38%), Gaps = 12/96 (12%)

Query: 8   VLHLTEEEKRTLQAE-GYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQ-----------E 55
             HL +E  R L  +        +PL +   ++ + I R  K  +              E
Sbjct: 42  KSHLEKEINRCLDFKSKDEEIELVPLEEFYAQAPEDISRPEKTPLDEHNLMLARLEWELE 101

Query: 56  SRRKKKEYMDALERKVEILKSENCDYKKRISNLEDT 91
            R++  E +  LE + E L+ +    K+ +S+L   
Sbjct: 102 QRKELAEKLKELESEKEKLQQDIEKKKEFLSSLPPK 137


>gnl|CDD|219638 pfam07914, DUF1679, Protein of unknown function (DUF1679).  The
           region featured in this family is found in a number of
           C. elegans proteins, in one case as a repeat. In many of
           the family members, this region is associated with the
           CHK region described by SMART as being found in ZnF_C4
           and HLH domain-containing kinases. In fact, one member
           of this family is annotated as being a member of the
           nuclear hormone receptor family, and contains regions
           typical of such proteins (Interpro:IPR000536,
           Interpro:IPR008946, and Interpro:IPR001628).
          Length = 413

 Score = 26.5 bits (59), Expect = 6.4
 Identities = 33/135 (24%), Positives = 62/135 (45%), Gaps = 19/135 (14%)

Query: 9   LHLTEEEKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQESRRKKKEYMDALE 68
             L+EEEK++  A G     ++  T   E+ LK I  +++N   A+   +        +E
Sbjct: 189 ESLSEEEKKS--ANGADFLEQMFETFMSEEGLKGIFDQLRNIFGAEYPEK--------VE 238

Query: 69  RKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLN- 127
             V+I +     Y   I   +   Y+ +S++  ++ ++  H   L   +I++SL +    
Sbjct: 239 ELVDIFE----HYGPEILIFK--KYTNLSKVLGIKPVL-VH-GDLWQSNILWSLDNDGKL 290

Query: 128 KLQAIIARHTAQLSN 142
           KL+AII   T  + N
Sbjct: 291 KLKAIIDYQTVHMGN 305


>gnl|CDD|169386 PRK08334, PRK08334, translation initiation factor IF-2B subunit
           beta; Validated.
          Length = 356

 Score = 26.5 bits (58), Expect = 6.9
 Identities = 20/95 (21%), Positives = 40/95 (42%), Gaps = 25/95 (26%)

Query: 58  RKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVS 117
           + K E+MD   +  E LK           N   T  +L   LN+++ ++  H        
Sbjct: 80  KTKDEFMDGFYKAYETLK-----------NTRPTAVNLFWALNRIKKLVEEH-------- 120

Query: 118 IVYSLVSQLNKLQAIIARHTAQLSNN--QNNLRVG 150
               L   L++++ +I     ++++   + NLR+G
Sbjct: 121 ----LEDPLDEIKRLIVEEAQKIADEDVEANLRMG 151


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 26.5 bits (59), Expect = 7.1
 Identities = 20/98 (20%), Positives = 42/98 (42%), Gaps = 5/98 (5%)

Query: 34  KTEEKSLKKIRRKIKNKISAQESRRKK--KEYMDALERKVEILKSENCDYKKRISNLEDT 91
               + +++   KI N    +   +KK     +  L+  ++  + E    +K I++LE  
Sbjct: 371 DAINELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKE 430

Query: 92  NYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKL 129
              L +++  L+  I     QL+N+        ++NKL
Sbjct: 431 IKQLEAEIKALEKEIKELEKQLTNIEP---TADEINKL 465


>gnl|CDD|222818 PHA01075, PHA01075, major capsid protein.
          Length = 408

 Score = 26.1 bits (57), Expect = 7.9
 Identities = 26/134 (19%), Positives = 57/134 (42%), Gaps = 30/134 (22%)

Query: 26  IPTRLPLTKTEEKSLKK-------------IRRKIKNKISAQESRRKKKEYMDALERKVE 72
           +P  + L+K EE  LK+             I +K K  ++  ++R   +   DA    + 
Sbjct: 95  VPINMDLSKNEELMLKRNYPRMATKLYGNGIVKKQKFTLNNNDTRFNFQTLADATNYALG 154

Query: 73  ILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAI 132
           +       YKK+IS++       V +  +++A++  ++    + + V    S+    + +
Sbjct: 155 V-------YKKKISDIN------VLEEKEMRAMLVDYSLNQLSETNVRKATSK----EDL 197

Query: 133 IARHTAQLSNNQNN 146
            ++    + N QNN
Sbjct: 198 ASKVFEAILNLQNN 211


>gnl|CDD|206687 cd01900, YchF, YchF GTPase.  YchF is a member of the Obg family,
           which includes four other subfamilies of GTPases: Obg,
           DRG, Ygr210, and NOG1. Obg is an essential gene that is
           involved in DNA replication in C. crescentus and
           Streptomyces griseus and is associated with the
           ribosome. Several members of the family, including YchF,
           possess the TGS domain related to the RNA-binding
           proteins. Experimental data and genomic analysis suggest
           that YchF may be part of a nucleoprotein complex and may
           function as a GTP-dependent translational factor.
          Length = 274

 Score = 26.3 bits (59), Expect = 8.0
 Identities = 12/53 (22%), Positives = 26/53 (49%), Gaps = 6/53 (11%)

Query: 37  EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLE 89
           EK L+++ +K K+        ++ KE ++ LE+  E L+         +++ E
Sbjct: 137 EKRLERLEKKAKSG------DKEAKEELELLEKIKEHLEEGKPARTLELTDEE 183


>gnl|CDD|132845 cd07206, Pat_TGL3-4-5_SDP1, Triacylglycerol lipase 3, 4, and 5 and
           Sugar-Dependent 1 lipase.  Triacylglycerol lipases are
           involved in triacylglycerol mobilization and
           degradation; they are found in lipid particles. TGL4 is
           30% homologus to TGL3, whereas TGL5 is 26% homologus to
           TGL3. Sugar-Dependent 1 (SDP1) lipase has a patatin-like
           acyl-hydrolase domain that initiates the breakdown of
           storage oil in germinating Arabidopsis seeds. This
           family includes subfamilies of proteins: TGL3, TGL4,
           TGL5, and SDP1.
          Length = 298

 Score = 26.0 bits (58), Expect = 8.2
 Identities = 8/18 (44%), Positives = 13/18 (72%)

Query: 83  KRISNLEDTNYSLVSQLN 100
           KR++ L + N+ +VSQ N
Sbjct: 214 KRLARLYNVNHFIVSQTN 231


>gnl|CDD|200298 TIGR03603, cyclo_dehy_ocin, thiazole/oxazole-forming peptide
           maturase, SagC family component.  Members of this
           protein family include enzymes related to SagC, a
           protein involved in thiazole/oxazole cyclodehydration
           modifications during biosynthesis of streptolysin S in
           Streptococcus pyogenes from the protoxin polypeptide
           (product of the sagA gene). Recent evidence suggests
           that the YcaO/SagD-like component, not this component,
           performs an ATP-dependent cyclodehydration. This protein
           family serves as a marker for widely distributed
           prokaryotic systems for making a general class of
           heterocycle-containing bacteriocins. Note that this
           model does not find all possible examples of bacteriocin
           biosynthesis cyclodehydratases, an in particular misses
           the E. coli plasmid protein McbB of microcin B17
           biosynthesis [Cellular processes, Pathogenesis].
          Length = 319

 Score = 26.4 bits (58), Expect = 8.2
 Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 7/67 (10%)

Query: 43  IRRKIKNKISAQESRRKKK-----EYMDALERKVEI--LKSENCDYKKRISNLEDTNYSL 95
           +R   ++ + +++  + K+       +D LE K  +   K    DY+  I   E  N SL
Sbjct: 108 LRETAESDLYSKDDIQLKRDRLLTSNLDILESKSLVDDKKDTLKDYEMIIVCAEHGNISL 167

Query: 96  VSQLNKL 102
           + +LN+L
Sbjct: 168 LRELNRL 174


>gnl|CDD|149675 pfam08694, UFC1, Ubiquitin-fold modifier-conjugating enzyme 1.
          Ubiquitin-like (UBL) post-translational modifiers are
          covalently linked to most, if not all, target
          protein(s) through an enzymatic cascade analogous to
          ubiquitylation, consisting of E1 (activating), E2
          (conjugating), and E3 (ligating) enzymes.
          Ubiquitin-fold modifier 1 (Ufm1) a ubiquitin-like
          protein is activated by a novel E1-like enzyme, Uba5,
          by forming a high-energy thioester bond. Activated Ufm1
          is then transferred to its cognate E2-like enzyme,
          Ufc1, in a similar thioester linkage. This family
          represents the E2-like enzyme.
          Length = 161

 Score = 25.5 bits (56), Expect = 9.4
 Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 1/36 (2%)

Query: 58 RKKKEYMDALERKVEILKSENCDYKKRISNLEDTNY 93
          R K+EY  AL + VE  KS + D+ +  SN E T +
Sbjct: 28 RLKEEYQ-ALIKYVENNKSADNDWFRLESNKEGTRW 62


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 26.2 bits (58), Expect = 9.6
 Identities = 12/77 (15%), Positives = 32/77 (41%), Gaps = 3/77 (3%)

Query: 34  KTEEKSLKKIRRKI---KNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLED 90
           +  E+ ++++  +I   K +I   E + K+ + +     +   L     +Y   +  +E 
Sbjct: 255 RKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEK 314

Query: 91  TNYSLVSQLNKLQAIIA 107
               L  ++N ++  I 
Sbjct: 315 RLSRLEEEINGIEERIK 331


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 26.0 bits (57), Expect = 10.0
 Identities = 14/67 (20%), Positives = 25/67 (37%)

Query: 39  SLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQ 98
           + K+I   I  +I+   +  KK   +          KS   D   +I  LE       S 
Sbjct: 309 NKKQILSNIDAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSY 368

Query: 99  LNKLQAI 105
           L  ++++
Sbjct: 369 LKSIESL 375


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.126    0.327 

Gapped
Lambda     K      H
   0.267   0.0856    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,292,850
Number of extensions: 648110
Number of successful extensions: 1346
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1304
Number of HSP's successfully gapped: 248
Length of query: 153
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 64
Effective length of database: 6,990,096
Effective search space: 447366144
Effective search space used: 447366144
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 55 (24.7 bits)