RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1763
         (365 letters)



>gnl|CDD|212559 cd11661, SANT_MTA3_like, Myb-Like Dna-Binding Domain of MTA3 and
           related proteins.  Members in this SANT/myb family
           include domains found in mouse metastasis-associated
           protein 3 (MTA3) proteins and arginine-glutamic
           dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2,
           N-CoR and TFIIIB) DNA-binding domains are a diverse set
           of proteins that share a common 3 alpha-helix bundle.
           MTA3 has been shown to interact with nucleosome
           remodeling and deacetylase (NuRD) proteins CHD4 and
           HDAC1, and the core cohesin complex protein RAD21 in the
           ovary, and regulate G2/M progression in proliferating
           granulosa cells. RERE belongs to the atrophin family and
           has been identified as a nuclear receptor corepressor;
           altered expression levels of RERE are associated with
           cancer in humans while mutations of Rere in mice cause
           failure in closing the anterior neural tube and fusion
           of the telencephalic and optic vesicles during
           embryogenesis.
          Length = 46

 Score = 90.0 bits (224), Expect = 5e-23
 Identities = 26/45 (57%), Positives = 35/45 (77%)

Query: 211 WSEEECKNFESGLRMYGKNFYQIHQNKVKTRNVGEIVQFYYLWKK 255
           WSE E K FE GLR YGK+F+ I Q+ +  ++VGE+V+FYY+WKK
Sbjct: 2   WSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46


>gnl|CDD|216509 pfam01448, ELM2, ELM2 domain.  The ELM2 (Egl-27 and MTA1 homology
           2) domain is a small domain of unknown function. It is
           found in the MTA1 protein that is part of the NuRD
           complex. The domain is usually found to the N terminus
           of a myb-like DNA binding domain pfam00249. ELM2 is also
           found associated with an ARID DNA binding domain
           pfam01388 in a member from Arabidopsis thaliana. This
           suggests that ELM2 may also be involved in DNA binding,
           or perhaps is a protein-protein interaction domain.
          Length = 55

 Score = 43.8 bits (104), Expect = 3e-06
 Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 105 IMVGSEYQAWIPEGMCKY-GDILPYENDDKLLWDPNHNLVTVDLVETYLKKSHE 157
           I VG +YQA IPE + +   D    E++D+L+W P++ L +   +E YL  +  
Sbjct: 1   IRVGPDYQAEIPELLPESEKDQDKEEDEDQLVWSPDNEL-SEQKLEEYLLVAKS 53


>gnl|CDD|238096 cd00167, SANT, 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains.
           Tandem copies of the domain bind telomeric DNA tandem
           repeatsas part of the capping complex. Binding is
           sequence dependent for repeats which contain the G/C
           rich motif [C2-3 A (CA)1-6]. The domain is also found in
           regulatory transcriptional repressor complexes where it
           also binds DNA.
          Length = 45

 Score = 36.0 bits (84), Expect = 0.001
 Identities = 9/44 (20%), Positives = 20/44 (45%)

Query: 211 WSEEECKNFESGLRMYGKNFYQIHQNKVKTRNVGEIVQFYYLWK 254
           W+EEE +     ++ YGKN ++    ++  R   +  + +    
Sbjct: 2   WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45


>gnl|CDD|197842 smart00717, SANT, SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding
           domains. 
          Length = 49

 Score = 36.4 bits (85), Expect = 0.001
 Identities = 10/46 (21%), Positives = 21/46 (45%)

Query: 211 WSEEECKNFESGLRMYGKNFYQIHQNKVKTRNVGEIVQFYYLWKKT 256
           W+EEE +     ++ YGKN ++    ++  R   +  + +    K 
Sbjct: 4   WTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLKP 49


>gnl|CDD|215818 pfam00249, Myb_DNA-binding, Myb-like DNA-binding domain.  This
           family contains the DNA binding domains from Myb
           proteins, as well as the SANT domain family.
          Length = 47

 Score = 34.8 bits (81), Expect = 0.004
 Identities = 5/44 (11%), Positives = 15/44 (34%)

Query: 211 WSEEECKNFESGLRMYGKNFYQIHQNKVKTRNVGEIVQFYYLWK 254
           W+ EE +     ++ +G   +      +  R   +    +  + 
Sbjct: 4   WTPEEDELLIEAVKKHGNGNWSKIAKHLPGRTDNQCKNRWNNYL 47


>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
           Members of this family contain a region found
           exclusively in eukaryotic sodium channels or their
           subunits, many of which are voltage-gated. Members very
           often also contain between one and four copies of
           pfam00520 and, less often, one copy of pfam00612.
          Length = 230

 Score = 34.3 bits (79), Expect = 0.072
 Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 6/80 (7%)

Query: 24  GNPLAPPVSDPLMLDEEIEDDDEEEE--EEEKQPVSQLAKLYEEGCSTSSKETALSKALE 81
             P+A   SD   L++E    +      EE K+  ++  KL E+  S+S   T   +  E
Sbjct: 104 SVPIASGESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPE 163

Query: 82  EDDDEYNFFDEHPDDDDDDP 101
           E+++E        +++  +P
Sbjct: 164 EEEEEI----AEEEEEVKEP 179


>gnl|CDD|234681 PRK00182, tatB, sec-independent translocase; Provisional.
          Length = 160

 Score = 32.1 bits (73), Expect = 0.25
 Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 13/75 (17%)

Query: 39  EEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDDD 98
           ++++ D  EE +E ++P++Q+A            +TA++KAL + DD  +F       DD
Sbjct: 49  QQLDGDFGEEFDEFRKPLNQIASY-----RRMGPKTAITKALFDGDD--SFL------DD 95

Query: 99  DDPKKTIMVGSEYQA 113
            DPKK +  G+E +A
Sbjct: 96  FDPKKIMADGTEGEA 110


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 31.6 bits (72), Expect = 0.45
 Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 6/65 (9%)

Query: 36  MLDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPD 95
           +  EE ED DE+    EK+     A+  +E       +    +  EE+++E   FD+  D
Sbjct: 141 LFTEEEEDIDEKLSMLEKKLKELEAEDVDEE------DEKDEEEEEEEEEEDEDFDDDDD 194

Query: 96  DDDDD 100
           DDDDD
Sbjct: 195 DDDDD 199



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 19/67 (28%), Positives = 35/67 (52%), Gaps = 17/67 (25%)

Query: 37  LDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYN---FFDEH 93
             E+++++DE++EEEE++   +     ++               ++DDD+YN   +FD  
Sbjct: 164 EAEDVDEEDEKDEEEEEEEEEEDEDFDDDD--------------DDDDDDYNAENYFDNG 209

Query: 94  PDDDDDD 100
            DDD DD
Sbjct: 210 EDDDYDD 216


>gnl|CDD|215973 pfam00527, E7, E7 protein, Early protein. 
          Length = 92

 Score = 29.9 bits (68), Expect = 0.47
 Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 27 LAPPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLY 63
          L  P    L   E+++D +EEEE+EE +      + Y
Sbjct: 13 LELPEPVDLHCYEQLDDSEEEEEDEEDRGQPPAQQPY 49


>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
          Length = 356

 Score = 31.6 bits (71), Expect = 0.54
 Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 10/80 (12%)

Query: 289 EQEGSINPPASPNVYLMSESSKRQRNSSLGNKT--TPPEETKTDKEGATSTPAAQQPANT 346
            QE +     +PN      +++ + ++S   K      EE K+    AT  P  +Q   T
Sbjct: 69  RQERTPKLEKAPN------TNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTT 122

Query: 347 SQSL-PQTQ-TTPTSSAEEK 364
           ++S  P+T+ TTP S+   +
Sbjct: 123 TESTTPKTKVTTPPSTNTPQ 142


>gnl|CDD|235224 PRK04140, PRK04140, hypothetical protein; Provisional.
          Length = 317

 Score = 31.8 bits (73), Expect = 0.56
 Identities = 18/73 (24%), Positives = 36/73 (49%), Gaps = 4/73 (5%)

Query: 38  DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKAL-EEDDDEYNFFDEHPDD 96
           D+E+  +D EE+  EK+ ++ L +L   G      + A   A+ E++ DE        + 
Sbjct: 194 DDEVTPEDPEEDPFEKEILNVLTRL---GFDVLPTKKAPFDAVSEDEKDENILLTGISEF 250

Query: 97  DDDDPKKTIMVGS 109
           +++  K+  +V S
Sbjct: 251 NEETEKRARIVSS 263


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.9 bits (72), Expect = 0.68
 Identities = 16/43 (37%), Positives = 26/43 (60%)

Query: 38  DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKAL 80
           +EE E+++EEEEEEE++   +  +  EE  S    ET   +A+
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAI 907


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 31.1 bits (71), Expect = 0.94
 Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 23/67 (34%)

Query: 38  DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDD 97
           DEE +DDD EEEEE+                       +  + EE+D+E    D+  D++
Sbjct: 334 DEEDDDDDLEEEEED-----------------------VDLSDEEEDEEDEDSDDEDDEE 370

Query: 98  DDDPKKT 104
           +++ +K 
Sbjct: 371 EEEEEKE 377


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 30.7 bits (70), Expect = 1.5
 Identities = 17/66 (25%), Positives = 33/66 (50%), Gaps = 8/66 (12%)

Query: 38  DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDD 97
           D++I+DDD++E+++E      +    EE      KE   +K LE+  D+ +F  +  D +
Sbjct: 144 DDDIDDDDDDEDDDEDDDDDDVDDEDEEK-----KE---AKELEKLSDDDDFVWDEDDSE 195

Query: 98  DDDPKK 103
                +
Sbjct: 196 ALRQAR 201


>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
          Length = 486

 Score = 30.4 bits (68), Expect = 1.5
 Identities = 15/67 (22%), Positives = 27/67 (40%), Gaps = 3/67 (4%)

Query: 293 SINPPASPNVYLMSESSKRQRNSSLGNKTTP---PEETKTDKEGATSTPAAQQPANTSQS 349
           S +   S N     +S+      +  N T+P     +  T+  G  ST  A QP +   +
Sbjct: 31  SSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPHSHETT 90

Query: 350 LPQTQTT 356
           +  T++ 
Sbjct: 91  ITCTKSL 97


>gnl|CDD|226709 COG4258, COG4258, Predicted exporter [General function prediction
           only].
          Length = 788

 Score = 30.2 bits (68), Expect = 1.8
 Identities = 16/61 (26%), Positives = 22/61 (36%)

Query: 299 SPNVYLMSESSKRQRNSSLGNKTTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPT 358
           S   +L S S +RQR   L   T  P  T      +  T     P      + +   +PT
Sbjct: 521 SNARFLPSNSLQRQREDLLPLPTALPRLTAALVNASGLTAVVFDPFAAPVEVDRWLASPT 580

Query: 359 S 359
           S
Sbjct: 581 S 581


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 29.8 bits (68), Expect = 2.0
 Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 12/66 (18%)

Query: 37  LDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDD 96
           ++E   ++ EE E  EK  V  +   Y        +E       E+DDD   + D+  +D
Sbjct: 332 IEENPREEAEEAEAPEK--VEFMWDDYH-------REQLEEVEEEDDDD---WDDDWDED 379

Query: 97  DDDDPK 102
           DD+  +
Sbjct: 380 DDEGVE 385


>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
           N-terminal domain.  Autophagocytosis is a
           starvation-induced process responsible for transport of
           cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
           ubiquitin like modifier that is topologically similar to
           the canonical E2 enzyme. It catalyzes the conjugation of
           Atg8 and phosphatidylethanolamine.
          Length = 146

 Score = 28.8 bits (65), Expect = 2.7
 Identities = 17/65 (26%), Positives = 28/65 (43%), Gaps = 6/65 (9%)

Query: 37  LDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDD 96
            +E +ED+DE++            K         + E  + +  ++DDD  +  D   DD
Sbjct: 88  AEEIVEDEDEDDGWVTTHGNRDKQK------DDIADEEDIPEIGDDDDDVVDSSDADEDD 141

Query: 97  DDDDP 101
           DDD P
Sbjct: 142 DDDIP 146


>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
          Length = 291

 Score = 29.6 bits (66), Expect = 2.8
 Identities = 20/57 (35%), Positives = 25/57 (43%), Gaps = 3/57 (5%)

Query: 306 SESSKRQRNSSLGNKTTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPTSSAE 362
             S+K +   S  N TTPP    T  E    TP A  P++T    PQ+ TT     E
Sbjct: 45  PPSTKVEAPQSKPNATTPPS---TKVEAPQQTPNATTPSSTKVETPQSPTTKQVPTE 98


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 29.6 bits (66), Expect = 3.1
 Identities = 18/61 (29%), Positives = 29/61 (47%)

Query: 38  DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDD 97
           DE+ +DDDEE++EEE++   ++    +E       E    +  E D  +   F E    D
Sbjct: 159 DEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTDCFKFIEAGAGD 218

Query: 98  D 98
           D
Sbjct: 219 D 219


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 29.6 bits (67), Expect = 3.4
 Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 7/60 (11%)

Query: 43  DDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDDDDDPK 102
           DD + EEE + +P    A    E      +E        +++DE +   E  +DDDDD  
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEE-------SDEEDEESSKSEDDEDDDDDDD 154


>gnl|CDD|234407 TIGR03941, tRNA_deam_assoc, putative tRNA adenosine
           deaminase-associated protein.  This model describes a
           protein family about 200 amino acids in length with only
           five invariant residues, including an Arg, a Ser-Asp
           pair, and two Gly residues. Members always are found
           exclusively in Actinobacteria, and always adjacent to
           homologs of TadA, a tRNA-specific adenosine deaminase
           from Escherichia coli. Homology, phyletic pattern, and
           gene neighborhood together suggest a housekeeping
           function in tRNA metabolism [Unknown function, General].
          Length = 154

 Score = 28.3 bits (64), Expect = 3.7
 Identities = 24/71 (33%), Positives = 32/71 (45%), Gaps = 12/71 (16%)

Query: 26  PLAPPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDD 85
            LA  V D L  D   +DDDEE++ +   P   L  L + G S    E  L   L +D+D
Sbjct: 78  DLAADVLDELDADVPDDDDDEEDDADP-WPEGDLDILADLGLS----EEELG-VLCDDED 131

Query: 86  EYNFFDEHPDD 96
            Y      PD+
Sbjct: 132 LY------PDE 136


>gnl|CDD|178320 PLN02718, PLN02718, Probable galacturonosyltransferase.
          Length = 603

 Score = 29.1 bits (65), Expect = 3.8
 Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 7/79 (8%)

Query: 290 QEGSINPPASPNVYLMSESSK--RQRNSSLGNKTTPPEETK--TDKEGATSTPAAQQPAN 345
           Q+G  N   S   Y  S+ +    +     GN   P EE    + K   +S+   Q  A 
Sbjct: 78  QDGDFN---SVVSYNSSDKNNDSLESEVDGGNNHKPKEEQAQVSQKTTVSSSEEVQISAR 134

Query: 346 TSQSLPQTQTTPTSSAEEK 364
             Q   +TQ  P +   EK
Sbjct: 135 DIQLNHKTQFNPPTVKHEK 153


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 29.1 bits (65), Expect = 3.9
 Identities = 21/104 (20%), Positives = 40/104 (38%), Gaps = 10/104 (9%)

Query: 17  NERSPSHGNPLAPP------VSDPLMLDEEIE----DDDEEEEEEEKQPVSQLAKLYEEG 66
            ER  +      P          P M  +E E     ++EEEEEEE++   Q ++  E  
Sbjct: 409 RERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGE 468

Query: 67  CSTSSKETALSKALEEDDDEYNFFDEHPDDDDDDPKKTIMVGSE 110
                +E       EE+ +  +  D   ++ ++D ++     + 
Sbjct: 469 DEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAG 512


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 28.5 bits (64), Expect = 4.9
 Identities = 10/23 (43%), Positives = 17/23 (73%)

Query: 79  ALEEDDDEYNFFDEHPDDDDDDP 101
           A ++D DEY++ DE  ++DD+ P
Sbjct: 190 APDDDLDEYDYGDEDEEEDDEPP 212


>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413).  This is
           a family of proteins conserved in fungi. The function is
           not known.
          Length = 436

 Score = 29.0 bits (65), Expect = 5.0
 Identities = 15/78 (19%), Positives = 25/78 (32%), Gaps = 1/78 (1%)

Query: 286 FLDEQEGSINPPASPNVYLMSESSKRQRNSSLGNKTTPPEETKTDKEGATSTPAAQQPAN 345
           FLDE E S      P     +      RN    +K T      ++++ A    +A+   +
Sbjct: 37  FLDELEQS-EKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRS 95

Query: 346 TSQSLPQTQTTPTSSAEE 363
           +        T      E 
Sbjct: 96  SHPKSKAPSTESEEEEEP 113


>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22.  The
           mitochondrial protein translocase family, which is
           responsible for movement of nuclear encoded pre-proteins
           into mitochondria, is very complex with at least 19
           components. These proteins include several chaperone
           proteins, four proteins of the outer membrane
           translocase (Tom) import receptor, five proteins of the
           Tom channel complex, five proteins of the inner membrane
           translocase (Tim) and three "motor" proteins. This
           family represents the Tom22 proteins. The N terminal
           region of Tom22 has been shown to have chaperone-like
           activity, and the C terminal region faces the
           intermembrane face.
          Length = 136

 Score = 27.6 bits (62), Expect = 5.2
 Identities = 12/67 (17%), Positives = 25/67 (37%), Gaps = 19/67 (28%)

Query: 39  EEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDDD 98
            E+ED+  +E+    + ++Q                      ++DD++    D    DD 
Sbjct: 5   TEVEDETFQEKPAAPKNLAQEES-------------------DDDDEDDTDTDSDISDDS 45

Query: 99  DDPKKTI 105
           D   +T+
Sbjct: 46  DFENETL 52


>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727).  This
          domain family is found in bacteria and eukaryotes, and
          is approximately 100 amino acids in length.
          Length = 100

 Score = 27.2 bits (61), Expect = 5.4
 Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 3/38 (7%)

Query: 29 PPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLYEEG 66
          P   +  +L+ E EDDD+E+EEEE +    LA  Y E 
Sbjct: 19 PEPEEDEILELEEEDDDDEDEEEEYEL---LASFYHED 53


>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin.  Nucleoplasmins are also
           known as chromatin decondensation proteins. They bind to
           core histones and transfer DNA to them in a reaction
           that requires ATP. This is thought to play a role in the
           assembly of regular nucleosomal arrays.
          Length = 146

 Score = 27.7 bits (62), Expect = 5.5
 Identities = 11/21 (52%), Positives = 17/21 (80%)

Query: 38  DEEIEDDDEEEEEEEKQPVSQ 58
           +++ EDDDE+E EEE+ PV +
Sbjct: 124 EDDEEDDDEDESEEEESPVKK 144


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 28.7 bits (64), Expect = 5.7
 Identities = 10/45 (22%), Positives = 18/45 (40%)

Query: 321 TTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPTSSAEEKT 365
           TTP   T+ ++     + A ++  NT   +  T      S+ E  
Sbjct: 34  TTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETA 78


>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein.  This is a viral
           attachment glycoprotein from region G of metaviruses. It
           is high in serine and threonine suggesting it is highly
           glycosylated.
          Length = 183

 Score = 28.1 bits (62), Expect = 6.0
 Identities = 19/75 (25%), Positives = 32/75 (42%), Gaps = 10/75 (13%)

Query: 291 EGSINPPASPNVYLMSESSKR----QRNSSLGNKTTPPEETKTDKEGATSTPAAQQPANT 346
           E S  P A+   +L +E +       R  ++   TTPP  ++T    A            
Sbjct: 66  ESSTLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHE------KKN 119

Query: 347 SQSLPQTQTTPTSSA 361
           +++  +TQT PT+S 
Sbjct: 120 TRTTSRTQTPPTTST 134


>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
          Length = 234

 Score = 28.3 bits (62), Expect = 6.4
 Identities = 10/52 (19%), Positives = 21/52 (40%), Gaps = 1/52 (1%)

Query: 314 NSSLGNKTTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPTSSAEEKT 365
            +  G  T+    +      +++T A  +  N +   P T ++  +S   KT
Sbjct: 113 ATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAP-TLSSKGTSNATKT 163


>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
           includes archaeal L12p, the protein that is functionally
           equivalent to L7/L12 in bacteria and the P1 and P2
           proteins in eukaryotes. L12p is homologous to P1 and P2
           but is not homologous to bacterial L7/L12. It is located
           in the L12 stalk, with proteins L10, L11, and 23S rRNA.
           L12p is the only protein in the ribosome to occur as
           multimers, always appearing as sets of dimers. Recent
           data indicate that most archaeal species contain six
           copies of L12p (three homodimers), while eukaryotes have
           four copies (two heterodimers), and bacteria may have
           four or six copies (two or three homodimers), depending
           on the species. The organization of proteins within the
           stalk has been characterized primarily in bacteria,
           where L7/L12 forms either two or three homodimers and
           each homodimer binds to the extended C-terminal helix of
           L10. L7/L12 is attached to the ribosome through L10 and
           is the only ribosomal protein that does not directly
           interact with rRNA. Archaeal L12p is believed to
           function in a similar fashion. However, hybrid ribosomes
           containing the large subunit from E. coli with an
           archaeal stalk are able to bind archaeal and eukaryotic
           elongation factors but not bacterial elongation factors.
           In several mesophilic and thermophilic archaeal species,
           the binding of 23S rRNA to protein L11 and to the
           L10/L12p pentameric complex was found to be
           temperature-dependent and cooperative.
          Length = 106

 Score = 27.1 bits (60), Expect = 6.6
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query: 28  APPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLY 63
           A    +     EE +  +EE+EEEE++ ++ L  L+
Sbjct: 71  AAAAEEKAEEKEEEKKKEEEKEEEEEEALAGLGALF 106


>gnl|CDD|220102 pfam09073, BUD22, BUD22.  BUD22 has been shown in yeast to be a
           nuclear protein involved in bud-site selection. It plays
           a role in positioning the proximal bud pole signal. More
           recently it has been shown to be involved in ribosome
           biogenesis.
          Length = 424

 Score = 28.3 bits (63), Expect = 7.1
 Identities = 17/69 (24%), Positives = 26/69 (37%), Gaps = 1/69 (1%)

Query: 38  DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDD 97
             E + DDEEEE+ + +  SQ   +  +       E A S    ED  E    DE   + 
Sbjct: 186 SAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSES-ESDESDSEI 244

Query: 98  DDDPKKTIM 106
            +    +  
Sbjct: 245 SESRSVSDS 253


>gnl|CDD|187787 cd09656, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily
           protein Cmr3.  CRISPR (Clustered Regularly Interspaced
           Short Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           RAMP superfamily protein; This protein is a subunit of
           Cmr complex.
          Length = 318

 Score = 28.2 bits (63), Expect = 7.3
 Identities = 16/90 (17%), Positives = 29/90 (32%), Gaps = 11/90 (12%)

Query: 79  ALEEDDDEYNFFDEHPDDDDDDPK-----KTIMVGSEYQAWIPEGMCKYGDILPYENDDK 133
            LE D+D  + F+   + +    K       I +     + +       G +   E    
Sbjct: 200 KLEVDEDNEDTFNPEIEPEGKYVKFYLLTPAIFIEGGVLSDLK------GLLCEVEIFAV 253

Query: 134 LLWDPNHNLVTVDLVETYLKKSHEMFINGA 163
           L+      +   D VE   K + E    G+
Sbjct: 254 LIDKRKQIVSGWDYVEKKPKPTLEAVPPGS 283


>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
          Length = 325

 Score = 28.0 bits (63), Expect = 7.3
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query: 33 DPLMLDEEIEDDDEEEEEEEKQPVSQLA 60
          +  +++EE  D+D  EEE   Q  +Q  
Sbjct: 24 EKALVEEEPSDNDLAEEELLSQGATQRV 51


>gnl|CDD|233619 TIGR01888, cas_cmr3, CRISPR type III-B/RAMP module-associated
           protein Cmr3.  CRISPR is a term for Clustered Regularly
           Interspaced Short Palidromic Repeats. A number of
           protein families appear only in association with these
           repeats and are designated Cas (CRISPR associated)
           proteins. This highly divergent family is found in at
           least ten different archaeal and bacterial species as
           part of the CRISPR RAMP modulue but is not a member of
           the RAMP superfamily itself. A typical example is TM1793
           from Thermotoga maritima [Mobile and extrachromosomal
           element functions, Other].
          Length = 337

 Score = 28.2 bits (63), Expect = 8.0
 Identities = 18/124 (14%), Positives = 33/124 (26%), Gaps = 19/124 (15%)

Query: 79  ALEEDDDEYNFFDEHPDDDDDDPK-----KTIMVGSEYQAWIPEGMCKYGDILPYENDDK 133
            LE D+D  + F+     +    K       I +     + +   +C     L    +  
Sbjct: 215 KLEVDEDNEDTFNPEILPETKYVKFYLLTPAIFIEGGVLSDLKGLLC--EAKLVEVFNFG 272

Query: 134 LLWDPNHNLVTVDLVETYLKKSHEMFINGA-------GDCLPMGSHSRDDEQALYLLLQC 186
                       D VE   K + E    G+        +        R  +   +  +  
Sbjct: 273 KRKRTVSG---WDYVEKKPKPTLEAVPPGSVYFFKAKEEGPLELLFPRLIKLGKFHKI-- 327

Query: 187 GYNT 190
           GY  
Sbjct: 328 GYGL 331


>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
           Provisional.
          Length = 481

 Score = 28.2 bits (62), Expect = 8.3
 Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 6/55 (10%)

Query: 312 QRNSSLGNKTTPPEETKTDKEGATSTPA-AQQPA-----NTSQSLPQTQTTPTSS 360
           + N++    TT  +ET T ++ A   P  A +PA     NT+ +   T T   ++
Sbjct: 272 KENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTN 326


>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
          Length = 592

 Score = 28.1 bits (62), Expect = 8.5
 Identities = 11/52 (21%), Positives = 17/52 (32%)

Query: 314 NSSLGNKTTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPTSSAEEKT 365
            S+  N T     T T +     T   +Q     Q+    +  P +   EK 
Sbjct: 63  KSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKA 114


>gnl|CDD|224179 COG1259, COG1259, Uncharacterized conserved protein [Function
           unknown].
          Length = 151

 Score = 27.3 bits (61), Expect = 8.7
 Identities = 13/35 (37%), Positives = 18/35 (51%)

Query: 21  PSHGNPLAPPVSDPLMLDEEIEDDDEEEEEEEKQP 55
           PS    LA  V  P+ + EE+ D+ E E E+E   
Sbjct: 107 PSDAIALALRVGAPIYVAEEVLDEAEIEIEDEDDE 141


>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region.  The myc family
           belongs to the basic helix-loop-helix leucine zipper
           class of transcription factors, see pfam00010. Myc forms
           a heterodimer with Max, and this complex regulates cell
           growth through direct activation of genes involved in
           cell replication. Mutations in the C-terminal 20
           residues of this domain cause unique changes in the
           induction of apoptosis, transformation, and G2 arrest.
          Length = 329

 Score = 28.0 bits (62), Expect = 9.1
 Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 3/54 (5%)

Query: 23  HGNPLAPPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLYEEG-CSTSSKETA 75
             N  +   SD     EE E+++EEEEEEE+  V  + K         S+ E+ 
Sbjct: 215 PPNSSSSSGSDSE--SEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKASTSESI 266


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.128    0.372 

Gapped
Lambda     K      H
   0.267   0.0794    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,865,143
Number of extensions: 1818355
Number of successful extensions: 3585
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3105
Number of HSP's successfully gapped: 179
Length of query: 365
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 267
Effective length of database: 6,590,910
Effective search space: 1759772970
Effective search space used: 1759772970
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 60 (26.8 bits)