RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14677
         (483 letters)



>gnl|CDD|238017 cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic
           and prokaryotic proteins. Putative nuclear signalling
           domain. FHA domains may bind phosphothreonine,
           phosphoserine and sometimes phosphotyrosine. In
           eukaryotes, many FHA domain-containing proteins localize
           to the nucleus, where they participate in establishing
           or maintaining cell cycle checkpoints, DNA repair, or
           transcriptional regulation. Members of the FHA family
           include: Dun1, Rad53,  Cds1, Mek1,
           KAPP(kinase-associated protein phosphatase),and Ki-67 (a
           human nuclear protein related to cell proliferation).
          Length = 102

 Score = 92.8 bits (231), Expect = 5e-23
 Identities = 33/104 (31%), Positives = 49/104 (47%), Gaps = 12/104 (11%)

Query: 355 RWRLYPFKGDTQLPVLHIHRQSAFLMGRDRKVADIPVDHPSCSKQHAALQYRLTDYTRED 414
             RL    GD      ++     + +GRD    DI +D PS S++HA ++Y         
Sbjct: 1   VPRLVVLSGDASGRRYYLDPGGTYTIGRDSDNCDIVLDDPSVSRRHAVIRYD-------- 52

Query: 415 GSKGKRIRLYIIDLESANGTFVNNKKIDPKRYVELLEKDVIKFG 458
                   + +IDL S NGTFVN +++ P   V L + DVI+ G
Sbjct: 53  ----GDGGVVLIDLGSTNGTFVNGQRVSPGEPVRLRDGDVIRLG 92


>gnl|CDD|215951 pfam00498, FHA, FHA domain.  The FHA (Forkhead-associated) domain
           is a phosphopeptide binding motif.
          Length = 67

 Score = 65.7 bits (161), Expect = 8e-14
 Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 14/81 (17%)

Query: 378 FLMGRDRKVADIPVDHPSCSKQHAALQYRLTDYTREDGSKGKRIRLYIIDLESANGTFVN 437
             +GR     DI +D PS S++HA ++Y                R Y+ DL S NGTFVN
Sbjct: 1   VTIGRSPD-CDIVLDDPSVSRRHAEIRYDGGG------------RFYLEDLGSTNGTFVN 47

Query: 438 NKKIDPKRYVELLEKDVIKFG 458
            +++ P+  V L + DVI+ G
Sbjct: 48  GQRLGPE-PVRLRDGDVIRLG 67


>gnl|CDD|214578 smart00240, FHA, Forkhead associated domain.  Found in eukaryotic
           and prokaryotic proteins. Putative nuclear signalling
           domain.
          Length = 52

 Score = 54.1 bits (131), Expect = 7e-10
 Identities = 25/61 (40%), Positives = 32/61 (52%), Gaps = 12/61 (19%)

Query: 381 GRDRKVADIPVDHPSCSKQHAALQYRLTDYTREDGSKGKRIRLYIIDLESANGTFVNNKK 440
           GR  +  DI +D PS S++HA + Y                R Y+IDL S NGTFVN K+
Sbjct: 4   GRSSEDCDIQLDGPSISRRHAVIVYDGGG------------RFYLIDLGSTNGTFVNGKR 51

Query: 441 I 441
           I
Sbjct: 52  I 52


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 49.1 bits (118), Expect = 4e-06
 Identities = 21/109 (19%), Positives = 36/109 (33%)

Query: 178 QERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHH 237
           Q  TE       +  +  R E+    D+    ++ E  ER  +  R    D+     R  
Sbjct: 151 QPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRRE 210

Query: 238 DRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRG 286
             + R    R D  + R +   R  +D R + + +            RG
Sbjct: 211 QGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRG 259



 Score = 47.6 bits (114), Expect = 1e-05
 Identities = 22/138 (15%), Positives = 45/138 (32%), Gaps = 1/138 (0%)

Query: 177 KQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRH 236
           +  R   + +      Q     +  ++++     + E   R  + +RQ+  +     RR 
Sbjct: 135 EAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRRE 194

Query: 237 HDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRD-RGPKKPFCDAS 295
               +    +R D R    +  ERG +D    +   R +   ++   D R  +       
Sbjct: 195 ERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDD 254

Query: 296 DEYEWGRKDANNEDNKNN 313
            E   GR+     D    
Sbjct: 255 GEGRGGRRGRRFRDRDRR 272



 Score = 43.7 bits (104), Expect = 2e-04
 Identities = 21/149 (14%), Positives = 49/149 (32%), Gaps = 1/149 (0%)

Query: 127 RKSPKKKHQVVQALRVNDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDR 186
           R + +K  +  +            + +EE  +E + R             ++ R E   R
Sbjct: 139 RGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGR 198

Query: 187 HNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDN 246
                ++R R E+    ++       +   R  +R+R+    ++  E R     +     
Sbjct: 199 DGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGR 258

Query: 247 RHDERNSRQQNNERGSQDRRSNQSHDRPQ 275
               R  R ++ +R  +      +   P+
Sbjct: 259 G-GRRGRRFRDRDRRGRRGGDGGNEREPE 286


>gnl|CDD|236641 PRK10019, PRK10019, nickel/cobalt efflux protein RcnA; Provisional.
          Length = 279

 Score = 45.2 bits (107), Expect = 3e-05
 Identities = 14/44 (31%), Positives = 19/44 (43%)

Query: 164 HHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHR 207
           H H H H H HHH        +     E  +  HE+ H++D  R
Sbjct: 125 HDHDHDHDHEHHHDHGHHHHHEHGATAEEYQDAHERAHANDIKR 168



 Score = 40.9 bits (96), Expect = 7e-04
 Identities = 17/82 (20%), Positives = 27/82 (32%), Gaps = 19/82 (23%)

Query: 159 EEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERE 218
            E+    + HHH H H H  +                 HE  H    H H      +E  
Sbjct: 112 GERNWLENMHHHDHDHDHDHD-----------------HEHHHDHGHHHHHEHGATAEEY 154

Query: 219 TKRNRQSHDDEETNERRHHDRN 240
              + ++H ++   +RR   R 
Sbjct: 155 QDAHERAHANDI--KRRFDGRE 174


>gnl|CDD|177648 PHA03420, PHA03420, E4 protein; Provisional.
          Length = 137

 Score = 43.1 bits (101), Expect = 4e-05
 Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 2/64 (3%)

Query: 141 RVNDSEAVENKIKEEPTEEEK--PRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHE 198
           R+ D  A    I+ E        P  +    H H H H+Q+   L DR +L +    R  
Sbjct: 22  RLLDGRAENQHIQREGGNHRTWDPADYLDRPHHHPHRHQQDDHHLQDRQHLPQQHLQRPH 81

Query: 199 KPHS 202
            P S
Sbjct: 82  HPLS 85



 Score = 28.0 bits (62), Expect = 6.6
 Identities = 10/40 (25%), Positives = 15/40 (37%), Gaps = 3/40 (7%)

Query: 164 HHH---HHHHRHRHHHKQERTELSDRHNLQENQRPRHEKP 200
           H H    HH + R H  Q+  +        + Q+    KP
Sbjct: 57  HRHQQDDHHLQDRQHLPQQHLQRPHHPLSPQCQQNVTGKP 96


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 43.7 bits (103), Expect = 1e-04
 Identities = 24/137 (17%), Positives = 39/137 (28%), Gaps = 19/137 (13%)

Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKT 212
           +E   E EK R         R                   +R   ++    D+HR S + 
Sbjct: 3   EEPDREREKSRGRDRDRSSERP------------------RRRSRDRSRFRDRHRRSRER 44

Query: 213 EPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHD 272
              E    R+R+ +D       R+          R   R+ R     R  + R  + S+ 
Sbjct: 45  SYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHR-RRLRDRSPSNQ 103

Query: 273 RPQSSSNSHHRDRGPKK 289
             +        D  P  
Sbjct: 104 WRKDDKKRSLWDIKPPG 120



 Score = 43.7 bits (103), Expect = 2e-04
 Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 5/109 (4%)

Query: 185 DRHNLQENQRPR-HEKPHSSDKHRHSSKTEPSERETKR---NRQSHDDEETNERRHHDRN 240
           D    +E ++ R  ++  SS++ R  S+     R+  R    R   +D    +RR +D  
Sbjct: 2   DEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSR 61

Query: 241 NRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKK 289
           +         R SR     R S+  RS + H R     +  ++ R   K
Sbjct: 62  SPRSLRYSSVRRSR-DRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDK 109



 Score = 34.1 bits (78), Expect = 0.19
 Identities = 18/103 (17%), Positives = 35/103 (33%), Gaps = 6/103 (5%)

Query: 144 DSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSD-RHNLQENQRPRHEKPHS 202
           + E    + ++  +E  + R       R RH   +ER+   D R   +     R  +   
Sbjct: 8   EREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLR 67

Query: 203 SDKHRHS-----SKTEPSERETKRNRQSHDDEETNERRHHDRN 240
               R S      ++       +  R+  D   +N+ R  D+ 
Sbjct: 68  YSSVRRSRDRPRRRSRSVRSIEQHRRRLRDRSPSNQWRKDDKK 110


>gnl|CDD|224630 COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms].
          Length = 191

 Score = 41.9 bits (98), Expect = 2e-04
 Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 16/95 (16%)

Query: 369 VLHIHRQSAFLMGRDRKVADIPVDHPSCSKQHAALQYRLTDYTREDGSKGKRIRLYIIDL 428
            + +  +    +GRD    DI +D    S++HA L+    +              ++ DL
Sbjct: 82  SVIVLGEPVTTIGRD-PDNDIVLDDDVVSRRHAELRREGNEV-------------FLEDL 127

Query: 429 ESANGTFVNNKKIDPKRYVELLEKDVIKFGFSSRE 463
            S NGT+VN +K+  +  ++    DVI+ G +  E
Sbjct: 128 GSTNGTYVNGEKVRQRVLLQD--GDVIRLGGTLAE 160


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 40.7 bits (95), Expect = 0.002
 Identities = 28/150 (18%), Positives = 47/150 (31%), Gaps = 21/150 (14%)

Query: 158 EEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQ-RPRHEKPHSSDKH---RHSSKTE 213
           E E       HHH H H H          H++  N    +H             HS+K+ 
Sbjct: 252 ELEYSEKGKDHHHSHNHQH----------HSIGINNHHSKHADSKLQTIEVIENHSNKSR 301

Query: 214 PS-----ERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDE--RNSRQQNNERGSQDRR 266
           PS       +   +  S     +   +       ++ N+ +   ++    N    S    
Sbjct: 302 PSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVS 361

Query: 267 SNQSHDRPQSSSNSHHRDRGPKKPFCDASD 296
            N+S  +  S S+S  RD         A+ 
Sbjct: 362 DNESKQKRASKSSSGARDSKKDASGMSANG 391


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 37.7 bits (88), Expect = 0.014
 Identities = 19/85 (22%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 198 EKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTH----DNRHDERNS 253
           E+    ++    ++ +P  ++ +R  + ++  + NERR   R+NRT     +NR + R +
Sbjct: 584 EETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRD-TRDNRTRREGRENREENRRN 642

Query: 254 RQQNNERGSQDRRSNQSHDRPQSSS 278
           R+Q  ++ ++ R S Q+    ++ +
Sbjct: 643 RRQAQQQTAETRESQQAEVTEKART 667



 Score = 35.0 bits (81), Expect = 0.088
 Identities = 29/130 (22%), Positives = 52/130 (40%), Gaps = 13/130 (10%)

Query: 146 EAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDK 205
                   E   +  KPR ++      R   +  RT    R N +EN+R R +    + +
Sbjct: 593 APKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRRQAQQQTAE 652

Query: 206 HRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDR 265
            R S + E +E+      ++ D+++   RR  +R  R    R+DE+  RQ   E  + + 
Sbjct: 653 TRESQQAEVTEKA-----RTQDEQQQAPRR--ERQRR----RNDEK--RQAQQEAKALNV 699

Query: 266 RSNQSHDRPQ 275
                 +  Q
Sbjct: 700 EEQSVQETEQ 709



 Score = 33.5 bits (77), Expect = 0.31
 Identities = 17/54 (31%), Positives = 21/54 (38%), Gaps = 3/54 (5%)

Query: 215 SERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSN 268
              ETK   Q     E    R  DR     +NR D RN R   + R ++ RR  
Sbjct: 582 GGEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRD-RNER--RDTRDNRTRREG 632



 Score = 30.4 bits (69), Expect = 2.5
 Identities = 19/93 (20%), Positives = 36/93 (38%), Gaps = 14/93 (15%)

Query: 229 EETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPK 288
           E   ER+   R  R        +N+R+  NER  +D R N++    + +   + R+R   
Sbjct: 597 EAKPERQQDRRKPR--------QNNRRDRNER--RDTRDNRTRREGRENREENRRNRRQA 646

Query: 289 KPFCDASDEYEWGRKDANNEDNKNNAQQPKQKP 321
           +     + E     + A   +      + +Q P
Sbjct: 647 QQQTAETRE----SQQAEVTEKARTQDEQQQAP 675



 Score = 28.9 bits (65), Expect = 7.8
 Identities = 15/68 (22%), Positives = 25/68 (36%), Gaps = 9/68 (13%)

Query: 249 DERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKKPFCDASDEYEWGRKDANNE 308
           ++   + +      QDRR  + ++R         RDR  ++   D     E       N 
Sbjct: 590 EQPAPKAEAKPERQQDRRKPRQNNR---------RDRNERRDTRDNRTRREGRENREENR 640

Query: 309 DNKNNAQQ 316
            N+  AQQ
Sbjct: 641 RNRRQAQQ 648


>gnl|CDD|114045 pfam05297, Herpes_LMP1, Herpesvirus latent membrane protein 1
           (LMP1).  This family consists of several latent membrane
           protein 1 or LMP1s mostly from Epstein-Barr virus. LMP1
           of EBV is a 62-65 kDa plasma membrane protein possessing
           six membrane spanning regions, a short cytoplasmic
           N-terminus and a long cytoplasmic carboxy tail of 200
           amino acids. EBV latent membrane protein 1 (LMP1) is
           essential for EBV-mediated transformation and has been
           associated with several cases of malignancies. EBV-like
           viruses in Cynomolgus monkeys (Macaca fascicularis) have
           been associated with high lymphoma rates in
           immunosuppressed monkeys.
          Length = 382

 Score = 36.9 bits (85), Expect = 0.018
 Identities = 32/141 (22%), Positives = 54/141 (38%), Gaps = 14/141 (9%)

Query: 162 PRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRH--------EKPHSSDKHRHS---- 209
           PRH   HHH     H Q+ T+ S   +   +   RH        + P    ++  +    
Sbjct: 189 PRHSDEHHHDDSLPHPQQATDDSGHESDSNSNEGRHHLLVSGAGDGPPLCSQNLGAPGGG 248

Query: 210 SKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQ 269
               P + +   +    D + T++    D +N T DN   + ++   N  +   +   N 
Sbjct: 249 PDNGPQDPDNTDDNGPQDPDNTDDNGPQDPDN-TDDNGPQDPDNTADNGPQDPDNTDDNG 307

Query: 270 SHDR-PQSSSNSHHRDRGPKK 289
            HD  P + S+S   D GP K
Sbjct: 308 PHDPLPHNPSDSAGNDGGPPK 328


>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3.  This
           protein, which interacts with both microtubules and
           TRAF3 (tumour necrosis factor receptor-associated factor
           3), is conserved from worms to humans. The N-terminal
           region is the microtubule binding domain and is
           well-conserved; the C-terminal 100 residues, also
           well-conserved, constitute the coiled-coil region which
           binds to TRAF3. The central region of the protein is
           rich in lysine and glutamic acid and carries KKE motifs
           which may also be necessary for tubulin-binding, but
           this region is the least well-conserved.
          Length = 506

 Score = 36.4 bits (84), Expect = 0.033
 Identities = 26/165 (15%), Positives = 49/165 (29%), Gaps = 18/165 (10%)

Query: 126 KRKSPKKKHQVVQALRVNDSEAVEN---KIKEEPTEEEKPRHHHHHHHRHRHHHKQERTE 182
           ++K  K+K +     R    EA E    K KE+  E++                 ++R E
Sbjct: 116 EKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEE-------------PRDREE 162

Query: 183 LSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNR 242
              R  ++   RP+          +     E  +R+  R       EE +     +R   
Sbjct: 163 EKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNE--EREKE 220

Query: 243 THDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGP 287
             D +  E  +     +   Q    ++         +       P
Sbjct: 221 EDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASP 265



 Score = 33.3 bits (76), Expect = 0.32
 Identities = 31/206 (15%), Positives = 66/206 (32%), Gaps = 6/206 (2%)

Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKT 212
            E   EEEK +       + +    +E  +        + +RP  EK    +K     + 
Sbjct: 100 NESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRD 159

Query: 213 EPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHD 272
              E++ +R R         +       N+  +   +E+  +        +    + + +
Sbjct: 160 REEEKKRERVRAK---SRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEE 216

Query: 273 RPQSSSNSHHRDRGPKKPFCDASDEYEWGRKDANNEDNKNNAQQPKQKPDFGLSGKLTED 332
           R +   +   R+        D S +     + +++   K +       P+   S K TE 
Sbjct: 217 REKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPETRESSKRTET 276

Query: 333 ---TNTYNGVVIKYSEPPDARKPKRR 355
              T+         S  P   + KR+
Sbjct: 277 RPRTSLRPPSARPASARPAPPRVKRK 302


>gnl|CDD|222914 PHA02666, PHA02666, hypothetical protein; Provisional.
          Length = 287

 Score = 36.1 bits (82), Expect = 0.033
 Identities = 15/125 (12%), Positives = 36/125 (28%), Gaps = 1/125 (0%)

Query: 164 HHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSE-RETKRN 222
           H    HHR R +  + R +       +  +R        + ++  +      +      +
Sbjct: 25  HESCFHHRRRANSMESRRKSRPSRQHRSAERTPTTASSLTHENNTAPSRHGKQHSCKASS 84

Query: 223 RQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHH 282
           R SH+   T+   +H  +   H + H         +      +       + +    +  
Sbjct: 85  RSSHNRGSTSSSHNHHAHRGPHQSAHRRSKHDAVRDTYQPCPQSPETDLYKGRLPGETER 144

Query: 283 RDRGP 287
               P
Sbjct: 145 HYETP 149



 Score = 32.2 bits (72), Expect = 0.49
 Identities = 45/202 (22%), Positives = 66/202 (32%), Gaps = 20/202 (9%)

Query: 197 HEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQ 256
                S   HR  +    S    +++R S               + TH+N        +Q
Sbjct: 22  DGHHESCFHHRRRAN---SMESRRKSRPSRQHRSAERTPTTAS-SLTHENNTAPSRHGKQ 77

Query: 257 NNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKKPFCDASDEYEWGRKDANNEDNKNNAQQ 316
           ++ + S   RS+ +     SS N HH  RGP +        +   + DA  +  +   Q 
Sbjct: 78  HSCKASS--RSSHNRGSTSSSHN-HHAHRGPHQ------SAHRRSKHDAVRDTYQPCPQS 128

Query: 317 PKQKPDFGLSGKLTEDTNTYNGVVIKYSEPPDARKPKRRWRLYPFKGDTQLPVLHIHRQS 376
           P+     G     TE    Y      Y  P D R           + D  LP LHI    
Sbjct: 129 PETDLYKGRLPGETE--RHYETPDHIYDVPEDVRCAA-----VEPRRDLALPPLHIPSSK 181

Query: 377 AFLMGRDRKVADIPVDHPSCSK 398
                R   + D P+ H S  K
Sbjct: 182 PARRMRPGSMGDFPMKHTSAGK 203


>gnl|CDD|217089 pfam02535, Zip, ZIP Zinc transporter.  The ZIP family consists of
           zinc transport proteins and many putative metal
           transporters. The main contribution to this family is
           from the Arabidopsis thaliana ZIP protein family these
           proteins are responsible for zinc uptake in the plant.
           Also found within this family are C. elegans proteins of
           unknown function which are annotated as being similar to
           human growth arrest inducible gene product, although
           this protein in not found within this family.
          Length = 314

 Score = 35.8 bits (83), Expect = 0.040
 Identities = 12/63 (19%), Positives = 17/63 (26%), Gaps = 7/63 (11%)

Query: 124 NLKRKSPKKKHQVVQALRVNDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTEL 183
           +      +  H V           V +        E     HHHHH  H      + + L
Sbjct: 110 HDHGHDGEHSHTVPDEESGAVESNVVH-------HEHGEGPHHHHHEGHEKGESDKESGL 162

Query: 184 SDR 186
             R
Sbjct: 163 RVR 165



 Score = 34.6 bits (80), Expect = 0.084
 Identities = 11/64 (17%), Positives = 13/64 (20%)

Query: 160 EKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERET 219
            +   H H H     H      E S             E PH      H       E   
Sbjct: 103 GRGHSHGHDHGHDGEHSHTVPDEESGAVESNVVHHEHGEGPHHHHHEGHEKGESDKESGL 162

Query: 220 KRNR 223
           +   
Sbjct: 163 RVRD 166



 Score = 33.9 bits (78), Expect = 0.17
 Identities = 12/62 (19%), Positives = 17/62 (27%)

Query: 164 HHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNR 223
             H H H H H  +   T   +     E+    HE       H H    +    +    R
Sbjct: 104 RGHSHGHDHGHDGEHSHTVPDEESGAVESNVVHHEHGEGPHHHHHEGHEKGESDKESGLR 163

Query: 224 QS 225
             
Sbjct: 164 VR 165



 Score = 33.5 bits (77), Expect = 0.23
 Identities = 16/82 (19%), Positives = 21/82 (25%), Gaps = 17/82 (20%)

Query: 131 KKKHQVVQALRVNDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQ 190
           K +         +D E       EE    E    HH H     HHH +            
Sbjct: 102 KGRGHSHGHDHGHDGEHSHTVPDEESGAVESNVVHHEHGEGPHHHHHEG----------- 150

Query: 191 ENQRPRHEKPHSSDKHRHSSKT 212
                 HEK  S  +     + 
Sbjct: 151 ------HEKGESDKESGLRVRD 166



 Score = 31.9 bits (73), Expect = 0.71
 Identities = 11/61 (18%), Positives = 18/61 (29%), Gaps = 4/61 (6%)

Query: 159 EEKPRHHHHHHHRHRHHHKQERTEL----SDRHNLQENQRPRHEKPHSSDKHRHSSKTEP 214
                H H H H   H H     E     S+  + +  + P H      +K     ++  
Sbjct: 103 GRGHSHGHDHGHDGEHSHTVPDEESGAVESNVVHHEHGEGPHHHHHEGHEKGESDKESGL 162

Query: 215 S 215
            
Sbjct: 163 R 163


>gnl|CDD|218163 pfam04592, SelP_N, Selenoprotein P, N terminal region.  SelP is the
           only known eukaryotic selenoprotein that contains
           multiple selenocysteine (Sec) residues, and accounts for
           more than 50% of the selenium content of rat and human
           plasma. It is thought to be glycosylated. SelP may have
           antioxidant properties. It can attach to epithelial
           cells, and may protect vascular endothelial cells
           against peroxynitrite toxicity. The high selenium
           content of SelP suggests that it may be involved in
           selenium intercellular transport or storage. The
           promoter structure of bovine SelP suggest that it may be
           involved in countering heavy metal intoxication, and may
           also have a developmental function. The N-terminal
           region of SelP can exist independently of the C terminal
           region. Zebrafish selenoprotein Pb lacks the C terminal
           Sec-rich region, and a protein encoded by the rat SelP
           gene and lacking this region has also been reported.
           N-terminal region contains a conserved SecxxCys motif,
           which is similar to the CysxxCys found in thioredoxins.
           It is speculated that the N terminal region may adopt a
           thioredoxin fold and catalyze redox reactions. The
           N-terminal region also contains a His-rich region, which
           is thought to mediate heparin binding. Binding to
           heparan proteoglycans could account for the membrane
           binding properties of SelP. The function of the
           bacterial members of this family is uncharcterised.
          Length = 238

 Score = 34.9 bits (80), Expect = 0.056
 Identities = 14/48 (29%), Positives = 18/48 (37%), Gaps = 1/48 (2%)

Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKP 200
             +P E E PR  H HHH H  H           H+   N     ++P
Sbjct: 173 VNKPVEAE-PRQDHPHHHSHHEHQGHAHHHPHGHHHPGSNSHSESQQP 219



 Score = 34.5 bits (79), Expect = 0.097
 Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 4/70 (5%)

Query: 145 SEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRP---RHEKPH 201
              V   ++ EP ++    H HH H  H HHH              E+Q+P   +  +P 
Sbjct: 170 LATVNKPVEAEPRQDHPHHHSHHEHQGHAHHHPHGHHHPGSNS-HSESQQPDPDKPTEPP 228

Query: 202 SSDKHRHSSK 211
           S   H H+ +
Sbjct: 229 SGLHHHHNHR 238


>gnl|CDD|215173 PLN02304, PLN02304, probable pectinesterase.
          Length = 379

 Score = 34.1 bits (78), Expect = 0.15
 Identities = 9/22 (40%), Positives = 13/22 (59%)

Query: 164 HHHHHHHRHRHHHKQERTELSD 185
           HH HHHH H+H    ++  + D
Sbjct: 39  HHSHHHHHHKHPDDGKKVSICD 60



 Score = 32.5 bits (74), Expect = 0.44
 Identities = 8/15 (53%), Positives = 9/15 (60%)

Query: 164 HHHHHHHRHRHHHKQ 178
            H  HHH H HHH +
Sbjct: 34  PHSIHHHSHHHHHHK 48



 Score = 32.1 bits (73), Expect = 0.70
 Identities = 9/18 (50%), Positives = 9/18 (50%)

Query: 164 HHHHHHHRHRHHHKQERT 181
           H  HHH  H HHHK    
Sbjct: 35  HSIHHHSHHHHHHKHPDD 52



 Score = 31.4 bits (71), Expect = 1.2
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 164 HHHHHHHRHRHHHKQERTELS 184
            HHH HH H H H  +  ++S
Sbjct: 37  IHHHSHHHHHHKHPDDGKKVS 57



 Score = 30.6 bits (69), Expect = 2.1
 Identities = 6/16 (37%), Positives = 8/16 (50%)

Query: 164 HHHHHHHRHRHHHKQE 179
           + H  HH   HHH  +
Sbjct: 33  YPHSIHHHSHHHHHHK 48


>gnl|CDD|193258 pfam12782, Innate_immun, Invertebrate innate immunity transcript
           family.  The immune response of the purple sea urchin
           appears to be more complex than previously believed in
           that it uses immune-related gene families homologous to
           vertebrate Toll-like and NOD/NALP-like receptor families
           as well as C-type lectins and a rudimentary complement
           system. In addition, the species also produces this
           unusual family of mRNAs, also known as 185/333, which is
           strongly upregulated in response to pathogen challenge.
          Length = 312

 Score = 33.5 bits (76), Expect = 0.20
 Identities = 14/73 (19%), Positives = 22/73 (30%), Gaps = 1/73 (1%)

Query: 143 NDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHE-KPH 201
           N+S   + +            H        R   ++  TE    HN   +   RH  K  
Sbjct: 167 NESSEEDGRPHPHHHRHHGHNHFGRKPFGGRPFGRRNHTEGHQGHNETGDHPHRHHNKTG 226

Query: 202 SSDKHRHSSKTEP 214
             D+ R   +  P
Sbjct: 227 DGDQDRPMFEMRP 239



 Score = 30.8 bits (69), Expect = 1.4
 Identities = 30/133 (22%), Positives = 50/133 (37%), Gaps = 9/133 (6%)

Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSS--DKHRHSS 210
           + E +EE+   H HHH H   +H    R     R   + N    H+  + +    HRH +
Sbjct: 166 RNESSEEDGRPHPHHHRHHGHNHF--GRKPFGGRPFGRRNHTEGHQGHNETGDHPHRHHN 223

Query: 211 KTEPSER-----ETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDR 265
           KT   ++     E +  R +    +    R  DR N T +        R+    RG    
Sbjct: 224 KTGDGDQDRPMFEMRPFRFNPFGRKPFGDRPFDRRNGTEEGSPRRDGHRRPYGNRGRWGE 283

Query: 266 RSNQSHDRPQSSS 278
             ++  + P + S
Sbjct: 284 NESEEKEHPTTES 296


>gnl|CDD|238501 cd01019, ZnuA, Zinc binding protein ZnuA. These proteins have been
           shown to function as initial receptors in the ABC uptake
           of Zn2+.  They belong to the TroA superfamily of
           periplasmic metal binding proteins that share a distinct
           fold and ligand binding mechanism.  They are comprised
           of two globular subdomains connected by a single helix
           and bind their specific ligands in the cleft between
           these domains.  A typical TroA protein is comprised of
           two globular subdomains connected by a single helix and
           can bind the metal ion in the cleft between these
           domains. In addition, these proteins sometimes have a
           low complexity region containing a metal-binding
           histidine-rich motif (repetitive HDH sequence).
          Length = 286

 Score = 32.3 bits (74), Expect = 0.44
 Identities = 5/25 (20%), Positives = 5/25 (20%)

Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHK 177
                 E    H H  H  H     
Sbjct: 95  ASHGDHEHDHEHAHGEHDGHEEGGL 119



 Score = 28.9 bits (65), Expect = 5.7
 Identities = 10/39 (25%), Positives = 14/39 (35%), Gaps = 2/39 (5%)

Query: 151 KIKE-EPTEEEKPRHHHHHHHRHRHHHKQERTELS-DRH 187
           K+ + +  E+      H H H H H       E   D H
Sbjct: 84  KLIDLKTLEDGASHGDHEHDHEHAHGEHDGHEEGGLDPH 122



 Score = 28.9 bits (65), Expect = 6.7
 Identities = 6/32 (18%), Positives = 8/32 (25%)

Query: 158 EEEKPRHHHHHHHRHRHHHKQERTELSDRHNL 189
            + K       H  H H H+    E       
Sbjct: 86  IDLKTLEDGASHGDHEHDHEHAHGEHDGHEEG 117


>gnl|CDD|236558 PRK09545, znuA, high-affinity zinc transporter periplasmic
           component; Reviewed.
          Length = 311

 Score = 32.3 bits (74), Expect = 0.50
 Identities = 6/26 (23%), Positives = 9/26 (34%)

Query: 151 KIKEEPTEEEKPRHHHHHHHRHRHHH 176
           K   +   ++   H  H      HHH
Sbjct: 115 KGAHDDHHDDDHDHAGHEKSDEDHHH 140



 Score = 31.5 bits (72), Expect = 0.92
 Identities = 5/26 (19%), Positives = 6/26 (23%)

Query: 154 EEPTEEEKPRHHHHHHHRHRHHHKQE 179
                 +    H  H      HH  E
Sbjct: 117 AHDDHHDDDHDHAGHEKSDEDHHHGE 142



 Score = 30.7 bits (70), Expect = 1.5
 Identities = 5/25 (20%), Positives = 8/25 (32%)

Query: 152 IKEEPTEEEKPRHHHHHHHRHRHHH 176
           +K    +     H H  H +    H
Sbjct: 114 MKGAHDDHHDDDHDHAGHEKSDEDH 138



 Score = 28.4 bits (64), Expect = 8.7
 Identities = 3/25 (12%), Positives = 9/25 (36%)

Query: 153 KEEPTEEEKPRHHHHHHHRHRHHHK 177
            ++  +++     H       HH +
Sbjct: 118 HDDHHDDDHDHAGHEKSDEDHHHGE 142


>gnl|CDD|183534 PRK12451, PRK12451, arginyl-tRNA synthetase; Reviewed.
          Length = 562

 Score = 31.3 bits (71), Expect = 1.2
 Identities = 17/59 (28%), Positives = 34/59 (57%), Gaps = 4/59 (6%)

Query: 88  KHIPS--MNLDIRKKSRRKRVIALIQNLTQAQIPNMKRNLKRKSP--KKKHQVVQALRV 142
           +H+P   +  D +K S RK  + L++ + +  I   K+N++ K+P  K+K +V + + V
Sbjct: 358 EHVPFGLILKDGKKMSTRKGRVVLLEEVLEEAIELAKQNIEEKNPNLKQKEEVAKQVGV 416


>gnl|CDD|132860 cd07221, Pat_PNPLA3, Patatin-like phospholipase domain containing
           protein 3.  PNPLA3 is a triacylglycerol lipase that
           mediates triacylglycerol hydrolysis in adipocytes and is
           an indicator of the nutritional state. PNPLA3 is also
           known as adiponutrin (ADPN) or iPLA2-epsilon. Human
           adiponutrins are bound to the cell membrane of
           adipocytes and show transacylase, TG hydrolase, and PLA2
           activity. This family includes patatin-like proteins:
           ADPN (adiponutrin) from mammals, PNPLA3 (Patatin-like
           phospholipase domain-containing protein 3), and
           iPLA2-epsilon (Calcium-independent phospholipase A2)
           from Homo sapiens.
          Length = 252

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 14/81 (17%)

Query: 77  YCAFLFSGNTRKHIPSMNLDIRKKSRRKRVIALIQ---NLTQAQIPNMKRNLKRKSPKKK 133
           +C    SG     I  + +D+ + + R R I ++    NL++    +++  L+R  P   
Sbjct: 45  HCVTFLSGLPLDQILQILMDLVRSA-RSRNIGILHPSFNLSK----HLRDGLQRHLPDNV 99

Query: 134 HQVVQA------LRVNDSEAV 148
           HQ++         RV+D E V
Sbjct: 100 HQLISGKMCISLTRVSDGENV 120


>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
           Provisional.
          Length = 617

 Score = 30.6 bits (69), Expect = 2.4
 Identities = 29/142 (20%), Positives = 45/142 (31%), Gaps = 21/142 (14%)

Query: 193 QRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERN 252
           Q+ R   P  +    ++S    +  E +R+R+ +D       +                 
Sbjct: 168 QQQRLGFPPRAP---YASPASYA-PEQERDREPYDAGRPEYDQR---------------- 207

Query: 253 SRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKKPFCDASDEYEWGRKDANNEDNKN 312
            R  ++ R   DR      DRP+    + H  RG   P           R  A       
Sbjct: 208 RRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQ 267

Query: 313 NAQQPKQ-KPDFGLSGKLTEDT 333
            A  P   +P   L+ K T DT
Sbjct: 268 PAPAPGPGEPTARLNPKYTFDT 289


>gnl|CDD|182628 PRK10663, PRK10663, cytochrome o ubiquinol oxidase subunit III;
           Provisional.
          Length = 204

 Score = 29.7 bits (67), Expect = 2.7
 Identities = 6/12 (50%), Positives = 6/12 (50%)

Query: 164 HHHHHHHRHRHH 175
           H   H H H HH
Sbjct: 8   HATAHAHEHGHH 19


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 30.3 bits (68), Expect = 2.7
 Identities = 22/108 (20%), Positives = 34/108 (31%), Gaps = 18/108 (16%)

Query: 183 LSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEE------------ 230
           L  R  L  + R R     S  K +      P +R +    QS+ DE             
Sbjct: 25  LPSRRRLAVSCRARRRGFSSQQKKKRGKNIAPKQRSSNSKLQSNTDENGQLENTSLRTVM 84

Query: 231 ------TNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHD 272
                 T+    H+R +   D      ++ QQ N +  +     Q  D
Sbjct: 85  ELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDGEQLSDFQLED 132


>gnl|CDD|236005 PRK07376, PRK07376, NAD(P)H-quinone oxidoreductase subunit F;
           Validated.
          Length = 673

 Score = 30.4 bits (69), Expect = 2.8
 Identities = 9/36 (25%), Positives = 11/36 (30%)

Query: 141 RVNDSEAVENKIKEEPTEEEKPRHHHHHHHRHRHHH 176
           R ND E  +  +           HH  H   H H  
Sbjct: 459 RGNDKELQKELLAAADAPGHHEGHHEEHGEHHHHSG 494


>gnl|CDD|177093 CHL00194, ycf39, Ycf39; Provisional.
          Length = 317

 Score = 30.0 bits (68), Expect = 2.8
 Identities = 14/30 (46%), Positives = 16/30 (53%), Gaps = 4/30 (13%)

Query: 10 ALSTVLHQTVLKCSFSALIRYYQYRCLVRN 39
          A  T+  Q V +    AL   YQ RCLVRN
Sbjct: 8  ATGTLGRQIVRQ----ALDEGYQVRCLVRN 33


>gnl|CDD|220096 pfam09052, SipA, Salmonella invasion protein A.  Salmonella
           invasion protein A is an actin-binding protein that
           contributes to host cytoskeletal rearrangements by
           stimulating actin polymerisation and counteracting
           F-actin destabilising proteins. Members of this family
           possess an all-helical fold consisting of eight
           alpha-helices arranged so that six long, amphipathic
           helices form a compact fold that surrounds a final,
           predominantly hydrophobic helix in the middle of the
           molecule.
          Length = 674

 Score = 30.0 bits (67), Expect = 3.1
 Identities = 14/70 (20%), Positives = 23/70 (32%), Gaps = 10/70 (14%)

Query: 226 HDDEETNERRHHDRNNRTHDNRHDE--RNSRQQNNERGSQDRRS--------NQSHDRPQ 275
           H+D  + + R  D    T+         NS  +N+E  +Q   S          S   P 
Sbjct: 307 HNDNRSYDNRVFDNRGDTYLGGARRHYDNSYHENSENDAQSPTSQTNDLSRNGNSLLSPP 366

Query: 276 SSSNSHHRDR 285
           +S  +     
Sbjct: 367 ASPAAGQHAL 376


>gnl|CDD|233496 TIGR01622, SF-CC1, splicing factor, CC1-like family.  This model
           represents a subfamily of RNA splicing factors including
           the Pad-1 protein (N. crassa), CAPER (M. musculus) and
           CC1.3 (H.sapiens). These proteins are characterized by
           an N-terminal arginine-rich, low complexity domain
           followed by three (or in the case of 4 H. sapiens
           paralogs, two) RNA recognition domains (rrm: pfam00706).
           These splicing factors are closely related to the U2AF
           splicing factor family (TIGR01642). A homologous gene
           from Plasmodium falciparum was identified in the course
           of the analysis of that genome at TIGR and was included
           in the seed.
          Length = 457

 Score = 29.9 bits (67), Expect = 3.5
 Identities = 11/85 (12%), Positives = 24/85 (28%)

Query: 151 KIKEEPTEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSS 210
           + +E        R       R R   +        R       R    +  S +++    
Sbjct: 3   RDRERGRLRNDTRRSDKGRERSRRRSRSRDRSRRRRDRDYYRGRRGRSRSRSPNRYYRPR 62

Query: 211 KTEPSERETKRNRQSHDDEETNERR 235
                 R+ +R+ ++  +  T   R
Sbjct: 63  GDRSYRRDDRRSGRNTKEPLTEAER 87


>gnl|CDD|224151 COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion
           transport and metabolism].
          Length = 296

 Score = 29.6 bits (67), Expect = 3.7
 Identities = 7/22 (31%), Positives = 9/22 (40%)

Query: 162 PRHHHHHHHRHRHHHKQERTEL 183
              HH H H H H + +    L
Sbjct: 3   GAAHHSHGHDHAHDNPRNERRL 24


>gnl|CDD|204706 pfam11666, DUF2933, Protein of unknown function (DUF2933).  This
           bacterial family of proteins has no known function.
          Length = 56

 Score = 26.9 bits (60), Expect = 4.1
 Identities = 4/17 (23%), Positives = 4/17 (23%)

Query: 163 RHHHHHHHRHRHHHKQE 179
              H  H  H  H    
Sbjct: 40  HGGHGGHGGHDSHDDDP 56


>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722).  This
           eukaryotic family of proteins has no known function.
          Length = 373

 Score = 29.5 bits (66), Expect = 4.4
 Identities = 20/104 (19%), Positives = 43/104 (41%), Gaps = 6/104 (5%)

Query: 187 HNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDN 246
           H   ++     ++  +S +H    K E  +++ +R + S      +     + +    D+
Sbjct: 254 HKQSKSAPQPSQESMTSFQHIIQWKPESQQKKHRRTKSSSSFGVIDLNSISEASQVNEDD 313

Query: 247 RHDERNSRQQNNERGSQDRRSNQSHDR-----PQSSSNSHHRDR 285
              + +S+++ NE  + D  S  S D       +SSS S   +R
Sbjct: 314 DPPDSDSKERKNEE-NSDPESTPSDDNDDKTCSESSSRSESPNR 356


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 29.5 bits (66), Expect = 4.9
 Identities = 27/147 (18%), Positives = 49/147 (33%), Gaps = 7/147 (4%)

Query: 177 KQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPS------ERETKRNRQSHDDEE 230
           KQ+ TE  +R   QE +R +     SSD  + SS +  S      E E + + +  ++EE
Sbjct: 394 KQDDTEEEERRKRQERER-QGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEE 452

Query: 231 TNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPKKP 290
             E      +        +E    + +N    +   S++     +       R       
Sbjct: 453 EEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAG 512

Query: 291 FCDASDEYEWGRKDANNEDNKNNAQQP 317
               S+  +        E  +    QP
Sbjct: 513 ISRMSEGQQPRGSSVQPESPQEEPLQP 539


>gnl|CDD|220924 pfam10986, DUF2796, Protein of unknown function (DUF2796).  This
           bacterial family of proteins has no known function.
          Length = 161

 Score = 28.4 bits (64), Expect = 5.1
 Identities = 11/31 (35%), Positives = 12/31 (38%)

Query: 157 TEEEKPRHHHHHHHRHRHHHKQERTELSDRH 187
           +      H HH H  H HH   E    SD H
Sbjct: 79  SLFGDHDHDHHDHDHHDHHDDHEHAGHSDFH 109



 Score = 28.4 bits (64), Expect = 5.3
 Identities = 6/26 (23%), Positives = 8/26 (30%)

Query: 154 EEPTEEEKPRHHHHHHHRHRHHHKQE 179
           E     +    HH H H   H   + 
Sbjct: 77  ESSLFGDHDHDHHDHDHHDHHDDHEH 102


>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19. 
           Med19 represents a family of conserved proteins which
           are members of the multi-protein co-activator Mediator
           complex. Mediator is required for activation of RNA
           polymerase II transcription by DNA binding
           transactivators.
          Length = 178

 Score = 28.6 bits (64), Expect = 5.1
 Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 3/73 (4%)

Query: 157 TEEEKPRHHHHHHHRHRHHHKQERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSE 216
             +   + H H H +HR          SD   L+ +++   +K H  DK R   K +  E
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKER---KKKKKE 161

Query: 217 RETKRNRQSHDDE 229
           ++ K+ R S +  
Sbjct: 162 KKKKKKRHSPEHP 174


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 28.8 bits (64), Expect = 5.4
 Identities = 19/111 (17%), Positives = 39/111 (35%), Gaps = 5/111 (4%)

Query: 214 PSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHDR 273
           P  +    N +    EE++ R   +  +    +      +R+ N E   +   S+   D 
Sbjct: 168 PPAQPDVDNEEEERLEESDGRE--EEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDV 225

Query: 274 PQSSSNSHHRDRGPKK---PFCDASDEYEWGRKDANNEDNKNNAQQPKQKP 321
                    + +G ++          E EW  +  + E  +NN + P +K 
Sbjct: 226 VDYEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRNNEESPAKKQ 276


>gnl|CDD|200419 TIGR04168, TIGR04168, TIGR04168 family protein.  Members of this
           uncharacterized protein family are restricted, in 49 of
           50 genomes, to organisms with a family TIGR04167 radical
           SAM protein, which occasionally is a selenoprotein.
          Length = 269

 Score = 28.8 bits (65), Expect = 5.5
 Identities = 17/48 (35%), Positives = 19/48 (39%), Gaps = 4/48 (8%)

Query: 167 HHHHRHRHHHKQERTEL-SDRHN---LQENQRPRHEKPHSSDKHRHSS 210
           H HHR RH   + R  +  DR     L     PR  K     K RH S
Sbjct: 192 HMHHRLRHTKGRLRRTVVRDRQGTVYLNAACVPRIGKDEEGRKLRHFS 239


>gnl|CDD|220184 pfam09332, Mcm10, Mcm10 replication factor.  Mcm10 is a eukaryotic
           DNA replication factor that regulates the stability and
           chromatin association of DNA polymerase alpha.
          Length = 346

 Score = 28.7 bits (64), Expect = 7.1
 Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 2/79 (2%)

Query: 87  RKHIPSMNLDIRKKSRRKRVIALIQNLTQAQIPNMKRNLKRKSPKKKHQVVQALRVNDSE 146
           +   P   L  +  + +   I  ++ L +A   ++KR  KR    K+  + + +  N S 
Sbjct: 123 KSPPPLPKLSAKASAAKLAAILKLKALEKANPNSIKR--KRSESGKRRAIDERVEKNASS 180

Query: 147 AVENKIKEEPTEEEKPRHH 165
           A   K  E+     K R +
Sbjct: 181 AKRQKSPEDEEPAMKKRRY 199


>gnl|CDD|238500 cd01018, ZntC, Metal binding protein ZntC.  These proteins are
           predicted to function as initial receptors in ABC
           transport of metal ions.  They belong to the TroA
           superfamily of helical backbone metal receptor proteins
           that share a distinct fold and ligand binding mechanism.
            They are comprised of two globular subdomains connected
           by a long alpha helix and bind their specific ligands in
           the cleft between these domains.  In addition, many of
           these proteins possess a metal-binding histidine-rich
           motif (repetitive HDH sequence).
          Length = 266

 Score = 28.5 bits (64), Expect = 7.8
 Identities = 8/13 (61%), Positives = 8/13 (61%)

Query: 165 HHHHHHRHRHHHK 177
           HHHHHH    HH 
Sbjct: 94  HHHHHHGEHEHHH 106


>gnl|CDD|237322 PRK13261, ureE, urease accessory protein UreE; Provisional.
          Length = 159

 Score = 28.0 bits (63), Expect = 8.1
 Identities = 6/13 (46%), Positives = 6/13 (46%)

Query: 164 HHHHHHHRHRHHH 176
              HH H H H H
Sbjct: 147 AFRHHGHSHDHSH 159



 Score = 27.6 bits (62), Expect = 8.9
 Identities = 5/13 (38%), Positives = 5/13 (38%)

Query: 164 HHHHHHHRHRHHH 176
                HH H H H
Sbjct: 145 GGAFRHHGHSHDH 157


>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
           Provisional.
          Length = 576

 Score = 28.8 bits (64), Expect = 8.2
 Identities = 20/107 (18%), Positives = 41/107 (38%), Gaps = 8/107 (7%)

Query: 190 QENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEET---NERRHHDRNNRTHDN 246
             N+    + P   +  R ++  EP + E K + Q+         NER   ++N + +  
Sbjct: 400 DSNKEVPEDVPMEPEDDRDNNFNEPKKPENKGDGQNEPVIPKPLDNERDQSNKNKQVNPG 459

Query: 247 RHDERNSRQ-----QNNERGSQDRRSNQSHDRPQSSSNSHHRDRGPK 288
                  R      +NNE  + + +++     P+ S +    D+  K
Sbjct: 460 NRHNSEDRYTRPHGRNNENRNYNNKNSDIPKHPERSEHEQPEDKKKK 506


>gnl|CDD|144541 pfam00985, MSA_2, Merozoite Surface Antigen 2 (MSA-2) family. 
          Length = 171

 Score = 27.9 bits (61), Expect = 8.2
 Identities = 15/80 (18%), Positives = 30/80 (37%)

Query: 178 QERTELSDRHNLQENQRPRHEKPHSSDKHRHSSKTEPSERETKRNRQSHDDEETNERRHH 237
           Q   E  +  N+Q++ + +   P + D    S   +P + E         +    +    
Sbjct: 41  QANKETQNNSNVQQDSQTKSNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPE 100

Query: 238 DRNNRTHDNRHDERNSRQQN 257
           ++    H + H  RN+  QN
Sbjct: 101 NKGTGQHGHMHGSRNNHPQN 120


>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
           subunit G; Reviewed.
          Length = 197

 Score = 28.1 bits (62), Expect = 8.6
 Identities = 11/56 (19%), Positives = 25/56 (44%)

Query: 217 RETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQNNERGSQDRRSNQSHD 272
           R+ +  +   + EE  ER   +R     D R D+    ++  E+  +++  +  H+
Sbjct: 124 RQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHE 179


>gnl|CDD|227489 COG5160, ULP1, Protease, Ulp1 family [Posttranslational
           modification, protein turnover, chaperones].
          Length = 578

 Score = 28.3 bits (63), Expect = 9.5
 Identities = 24/99 (24%), Positives = 36/99 (36%), Gaps = 4/99 (4%)

Query: 200 PHSSDKHRHS--SKTEPSERETKRNRQSHDDEETNERRHHDRNNRTHDNRHDERNSRQQN 257
           PHSS  H+HS  +K    +  TKR   +   +++      D N    D            
Sbjct: 269 PHSSQIHKHSLSNKPTGLQSLTKRVFIASLGKKSLLISIKDSNISPSDFYQSN-GEETLQ 327

Query: 258 NERGSQDRRSNQSHD-RPQSSSNSHHRDRGPKKPFCDAS 295
           +   S+ RRSN+        + N +   R  K  F   S
Sbjct: 328 SHAVSELRRSNRFKKGSDPLNFNDNVPFRDAKTDFQTPS 366


>gnl|CDD|216817 pfam01969, DUF111, Protein of unknown function DUF111.  This
           prokaryotic family has no known function.
          Length = 381

 Score = 28.2 bits (64), Expect = 10.0
 Identities = 9/13 (69%), Positives = 9/13 (69%)

Query: 160 EKPRHHHHHHHRH 172
           E   H HHHHHRH
Sbjct: 60  ELEEHEHHHHHRH 72


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.131    0.389 

Gapped
Lambda     K      H
   0.267   0.0799    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,034,902
Number of extensions: 2287970
Number of successful extensions: 3447
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2940
Number of HSP's successfully gapped: 205
Length of query: 483
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 382
Effective length of database: 6,457,848
Effective search space: 2466897936
Effective search space used: 2466897936
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (27.1 bits)