RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1888
         (386 letters)



>gnl|CDD|217217 pfam02759, RUN, RUN domain.  This domain is present in several
           proteins that are linked to the functions of GTPases in
           the Rap and Rab families. They could hence play
           important roles in multiple Ras-like GTPase signalling
           pathways. The domain is comprises six conserved regions,
           which in some proteins have considerable insertions
           between them. The domain core is thought to take up a
           predominantly alpha fold, with basic amino acids in
           regions A and D possibly playing a functional role in
           interactions with Ras GTPases.
          Length = 128

 Score =  126 bits (320), Expect = 8e-36
 Identities = 48/128 (37%), Positives = 74/128 (57%), Gaps = 5/128 (3%)

Query: 100 HFFIVLEHVLRHGLKPK---KGLLGPKKELWDILQLVEKLNPEASDITASVRDLPTVKTH 156
                LE +L HGLK +      L  ++  W +L+ V KL P A ++  SV++LP + T 
Sbjct: 1   SLCPALEALLSHGLKRRSLDAAGLLRERSFWSLLERVGKLVPPAEELIRSVQELPQIHTP 60

Query: 157 MGRARAWLRLALMQKKLADYLQVLVDHRDDILVEYFETDALLMSDEAIVILGKLLS-LNV 215
            GR RAW+RLAL +K L  +L +L+ ++ ++L +Y+E  ALL   E + IL  LL  L+ 
Sbjct: 61  DGRFRAWIRLALNEKVLESWLSLLLSNK-ELLSKYYEPWALLRDPEFVSILLGLLVGLSA 119

Query: 216 IDCNLCVK 223
           +D NL + 
Sbjct: 120 LDFNLDLD 127


>gnl|CDD|214736 smart00593, RUN, domain involved in Ras-like GTPase signaling. 
          Length = 64

 Score = 57.2 bits (139), Expect = 6e-11
 Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 2/65 (3%)

Query: 160 ARAWLRLALMQKKLADYLQVLVDHRDDILVEYFETDALLMSDEAIVILGKLL-SLNVIDC 218
            RAW+RLAL +K L+ +L +L+    ++L +Y+E  A L   E    L  LL  L+ +D 
Sbjct: 1   FRAWIRLALNEKLLSSWLNLLLSDE-ELLSKYYEPWAFLRDPEEGEQLLGLLVGLSALDF 59

Query: 219 NLCVK 223
           NL V 
Sbjct: 60  NLPVD 64


>gnl|CDD|191114 pfam04859, DUF641, Plant protein of unknown function (DUF641).
           Plant protein of unknown function.
          Length = 132

 Score = 34.6 bits (80), Expect = 0.022
 Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 2/67 (2%)

Query: 319 LKNQLANSSQHTSNLS-VHEV-GLKAEMEVAMKLLERDVHQKQDTIIELRRQLEDIKMIN 376
            + +   + Q     + + E   L    E+ +K LE +V  K   I  LR +LE++ + N
Sbjct: 66  RRKRQKGAPQDAILAAEIQEQRSLLKTYEIMVKKLEAEVRAKDSEIDSLREKLEELLVAN 125

Query: 377 LEMYKKL 383
            ++ K+L
Sbjct: 126 SKLEKRL 132


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 33.1 bits (76), Expect = 0.13
 Identities = 29/132 (21%), Positives = 58/132 (43%), Gaps = 12/132 (9%)

Query: 251 EVTSQVEDNMSLVLDQKN---YIEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEK 307
           EV  + +D  S V   K    ++  L+R+L+ T    + ++  L    A ++E+L+  ++
Sbjct: 31  EVLERSDDGYSRVRTPKGREGWV--LSRYLSDTPS-ARERLPELQQELAELQEELAELQE 87

Query: 308 NLRILIQENENLKNQLANSSQHTSNLSVHEVGLKAEMEVAMKL------LERDVHQKQDT 361
            L  L QEN+ LK +L+        L      +K     A++L      L  ++ + +  
Sbjct: 88  QLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREELAELKQE 147

Query: 362 IIELRRQLEDIK 373
              L  + E ++
Sbjct: 148 NEALEAENERLQ 159


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 33.5 bits (77), Expect = 0.21
 Identities = 16/75 (21%), Positives = 31/75 (41%), Gaps = 5/75 (6%)

Query: 298 MREDLSIAEKNLRILIQENENLKNQLANSSQHTSN-LSVHEVGLKAEMEVAMKLLERDVH 356
           +R+ L+        L  ENE L+ +  +  Q     +         E+   ++ L+ +  
Sbjct: 71  LRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQ----SETQELTKEIEQLKSERQ 126

Query: 357 QKQDTIIELRRQLED 371
           Q Q  I +L+R+L  
Sbjct: 127 QLQGLIDQLQRRLAG 141


>gnl|CDD|225437 COG2882, FliJ, Flagellar biosynthesis chaperone [Cell motility and
           secretion / Intracellular trafficking and secretion /
           Posttranslational modification, protein turnover,
           chaperones].
          Length = 148

 Score = 31.6 bits (72), Expect = 0.34
 Identities = 22/102 (21%), Positives = 45/102 (44%), Gaps = 5/102 (4%)

Query: 285 QTQVESLTTTNALMREDLSIAEKNLRILIQENENLKNQLANSSQHTSNLSVHEVGLKAEM 344
           + + E      + +R +   AE+ L++L       +  L    +  S +S  +       
Sbjct: 15  KKEEEEAAIELSKIRSEKENAEEQLKMLSGYRNEYEQNLN--EKLKSGVSAAQWQ---NY 69

Query: 345 EVAMKLLERDVHQKQDTIIELRRQLEDIKMINLEMYKKLQAL 386
           +  +  LE  + Q+Q  + +LR+Q+E  + I  E   +L+AL
Sbjct: 70  QQFISQLEVAIDQQQSQLSKLRKQVEQKREIWQEKQIELKAL 111


>gnl|CDD|214534 smart00138, MeTrc, Methyltransferase, chemotaxis proteins.
           Methylates methyl-accepting chemotaxis proteins to form
           gamma-glutamyl methyl ester residues.
          Length = 264

 Score = 31.5 bits (72), Expect = 0.63
 Identities = 12/45 (26%), Positives = 21/45 (46%), Gaps = 1/45 (2%)

Query: 149 DLPTVKTHMGRARAWLRL-ALMQKKLADYLQVLVDHRDDILVEYF 192
            L   K  + ++R   RL  L  K  ++YL++L  HR +  +   
Sbjct: 18  VLTDYKRTLLQSRLSRRLRVLGLKDFSEYLELLTSHRGEEELAEL 62


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 31.2 bits (71), Expect = 1.0
 Identities = 22/125 (17%), Positives = 44/125 (35%), Gaps = 7/125 (5%)

Query: 251 EVTSQVEDNMSLVLDQKNYIEELN---RHLNATVGNLQTQVESLTTTNALMREDLSIAEK 307
           E+  ++E+    +    N I  L    + L   + NL+ Q+E L      +   L    +
Sbjct: 278 ELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAE 337

Query: 308 NLRILIQENENLKNQLANSSQHTSNLSVHEVGLKAEMEVAMKLLERDVHQKQDTIIELRR 367
            L     E E    +L    +              E+E  ++ LE  +   +  + +L  
Sbjct: 338 EL----AELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLEL 393

Query: 368 QLEDI 372
           Q+  +
Sbjct: 394 QIASL 398



 Score = 30.0 bits (68), Expect = 2.3
 Identities = 29/136 (21%), Positives = 59/136 (43%), Gaps = 14/136 (10%)

Query: 250 EEVTSQVEDNMSLVLDQKNYIEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKNL 309
           EE+  ++E+    + + +  + EL + L      L+ ++E L      +   +S   K+L
Sbjct: 680 EELEEKIEELEEKIAELEKALAELRKELE----ELEEELEQLRKELEELSRQISALRKDL 735

Query: 310 RILIQENENLKNQLANSSQHTSNLSVHEVGL----------KAEMEVAMKLLERDVHQKQ 359
             L  E E L+ ++A  S+  + L      L           AE E  ++ LE  + Q +
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795

Query: 360 DTIIELRRQLEDIKMI 375
           + +  LR  L++++  
Sbjct: 796 EELKALREALDELRAE 811



 Score = 28.9 bits (65), Expect = 5.3
 Identities = 23/107 (21%), Positives = 48/107 (44%), Gaps = 3/107 (2%)

Query: 270 IEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKNLRILIQENENLKNQLANSSQH 329
           +EE    L+  +  L+ ++E L        E+L+ AE  +  L  + E LK +L    + 
Sbjct: 745 LEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA 804

Query: 330 TSNLS--VHEVGLKA-EMEVAMKLLERDVHQKQDTIIELRRQLEDIK 373
              L   +  +  +A  +   ++ LER +   +  + +L  Q+E++ 
Sbjct: 805 LDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELS 851



 Score = 28.9 bits (65), Expect = 5.3
 Identities = 26/130 (20%), Positives = 56/130 (43%), Gaps = 7/130 (5%)

Query: 249 PEEVTSQVEDNMSLVLDQKNYIEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKN 308
            EE+ S++E     +L+++  +EE    L + +  L  ++  L +  + +R +L    + 
Sbjct: 868 IEELESELE----ALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREK 923

Query: 309 LRILIQENENLKNQLANSSQHTSNLSVHEVGLKAEMEVAMKLLERDVHQKQDTIIELRRQ 368
           L  L    E L+ ++ N       LS        E E     +E D  + +  +  L  +
Sbjct: 924 LAQLELRLEGLEVRIDNL---QERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENK 980

Query: 369 LEDIKMINLE 378
           ++++  +NL 
Sbjct: 981 IKELGPVNLA 990



 Score = 28.5 bits (64), Expect = 8.8
 Identities = 26/140 (18%), Positives = 57/140 (40%), Gaps = 7/140 (5%)

Query: 250 EEVTSQVEDNMSLVLDQKNYIEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKNL 309
           EE+ +Q+E     +   +  ++EL   L           E L      +   ++  E+ L
Sbjct: 785 EELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERL----ESLERRIAATERRL 840

Query: 310 RILIQENENLKNQLANSSQHTSNLSVHEVGLKAEMEVAMKL---LERDVHQKQDTIIELR 366
             L ++ E L   + + +     L      L++E+E  +     LE  +   +  + EL 
Sbjct: 841 EDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELS 900

Query: 367 RQLEDIKMINLEMYKKLQAL 386
            +L +++    E+ ++L+ L
Sbjct: 901 EELRELESKRSELRRELEEL 920


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 29.6 bits (67), Expect = 1.4
 Identities = 16/49 (32%), Positives = 26/49 (53%)

Query: 276 HLNATVGNLQTQVESLTTTNALMREDLSIAEKNLRILIQENENLKNQLA 324
            L  T+  L+TQ+E L    AL++      EK L+ L Q+ +N K ++ 
Sbjct: 68  RLQNTIERLKTQLEDLERELALLQAKERQLEKKLKTLEQKLKNEKEEVQ 116


>gnl|CDD|225616 COG3074, COG3074, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 79

 Score = 28.0 bits (62), Expect = 1.8
 Identities = 14/54 (25%), Positives = 25/54 (46%)

Query: 280 TVGNLQTQVESLTTTNALMREDLSIAEKNLRILIQENENLKNQLANSSQHTSNL 333
           T+  LQ ++E L   N  + +++  A+     L +ENE LK +     +    L
Sbjct: 19  TITLLQMEIEELKEKNNSLSQEVQNAQHQREALERENEQLKEEQNGWQERLRAL 72


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 30.0 bits (67), Expect = 2.7
 Identities = 29/118 (24%), Positives = 51/118 (43%), Gaps = 13/118 (11%)

Query: 242  SACYHDHPEEVTSQVEDNMSLVLDQKNYIEELNRHLN--ATVGNLQTQVESLTTTN---- 295
             A  +D PEE+  ++E+ ++ +  +KN  +E+ + LN  A +   +T +E +   N    
Sbjct: 1161 KAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSYG 1220

Query: 296  ----ALMREDLSIAEKNLRILIQENENLKNQLAN---SSQHTSNLSVHEVGLKAEMEV 346
                 L  E +   +K    +I+  E     L      S    N    E+ +KAEME 
Sbjct: 1221 KNLGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEMET 1278


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 29.9 bits (68), Expect = 2.8
 Identities = 19/79 (24%), Positives = 33/79 (41%), Gaps = 15/79 (18%)

Query: 298 MREDLSIAEKNLRILIQENENLKNQLANSSQHTSNLSVHEVGLKAEMEVAMKLLERDVHQ 357
            R +L   E+ L    Q  E L  ++ +  +   NL   E  L           E+++ +
Sbjct: 74  RRNELQRLERRLL---QREETLDRKMESLDKKEENLEKKEKELSN--------KEKNLDE 122

Query: 358 KQDTIIEL----RRQLEDI 372
           K++ + EL    R +LE I
Sbjct: 123 KEEELEELIAEQREELERI 141


>gnl|CDD|237024 PRK11922, PRK11922, RNA polymerase sigma factor; Provisional.
          Length = 231

 Score = 29.2 bits (66), Expect = 3.0
 Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 5/42 (11%)

Query: 129 ILQLVEKLNPEASDITASVRDLP--TVKTHMGRARAWLRLAL 168
           +L++VE+L+ E    TA    LP  TVKT + RAR  LR +L
Sbjct: 159 VLRVVEELSVEE---TAQALGLPEETVKTRLHRARRLLRESL 197


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 29.7 bits (66), Expect = 3.2
 Identities = 12/21 (57%), Positives = 13/21 (61%)

Query: 33  LDDVEVRVRDPPELRSPSPEP 53
           LD+ E     PP  RSPSPEP
Sbjct: 516 LDEAEEPESPPPPPRSPSPEP 536


>gnl|CDD|240411 PTZ00419, PTZ00419, valyl-tRNA synthetase-like protein;
           Provisional.
          Length = 995

 Score = 29.6 bits (67), Expect = 3.2
 Identities = 29/133 (21%), Positives = 54/133 (40%), Gaps = 10/133 (7%)

Query: 193 ETDALLMSDEAIVI-LGKLLSLNVIDCNLCVKEEDLDCQQGVID--FSLYLRSACYHDHP 249
           E   L+ S E ++  L K+ S++VI       E    C   V+D    +YL    + D  
Sbjct: 869 ELIELIESAENLISTLAKIGSVSVIPPIEEEAEVPKGCGFDVVDNKVIIYLNLDEFIDLK 928

Query: 250 EEVTSQVEDNMSLVLDQKNYIEELNRHLNATVGNLQTQV-ESLTTTNALMREDLSIAEKN 308
           +E+    +    L       +E   + ++    N + +V E +   N    ++L+   K 
Sbjct: 929 KELAKLEKKLAKLQ----KSLESYLKKISIP--NYEDKVPEDVRKLNDEKIDELNEEIKQ 982

Query: 309 LRILIQENENLKN 321
           L   I+E ++L  
Sbjct: 983 LEQAIEELKSLLK 995


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 29.7 bits (67), Expect = 3.6
 Identities = 21/85 (24%), Positives = 41/85 (48%), Gaps = 5/85 (5%)

Query: 302 LSIAEKNLRILIQENENLKNQLANSSQHTSNLSVHEVGLKAEMEVAMKLLERDVHQKQDT 361
           +   EK +  L QE E LK +L    +  S+L      +K+E+    K LE  + + ++ 
Sbjct: 718 IGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSEL----KELEARIEELEED 773

Query: 362 IIELRRQLEDIK-MINLEMYKKLQA 385
           + +L   L D++  ++     ++QA
Sbjct: 774 LHKLEEALNDLEARLSHSRIPEIQA 798



 Score = 28.5 bits (64), Expect = 8.5
 Identities = 20/109 (18%), Positives = 45/109 (41%), Gaps = 5/109 (4%)

Query: 266 QKNYIEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKNLRILIQENENLKNQLAN 325
                 E+++ L A +  L+ ++E        + E+ +  ++ L  L  E E +  + A 
Sbjct: 324 LAKLEAEIDK-LLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAE 382

Query: 326 SSQHTSNLSVHEVGLKAEMEVAMKLLERDVHQKQDTIIELRRQLEDIKM 374
           +     +       LK E+      L+R++ + Q+ +  L  +L D+  
Sbjct: 383 TRDELKDYREKLEKLKREINE----LKRELDRLQEELQRLSEELADLNA 427


>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.
           The binding of the LYPxL motif of late HIV p6Gag and
           EIAV p9Gag to this domain is necessary for viral
           budding.This domain is generally central between an
           N-terminal Bro1 domain, pfam03097 and a C-terminal
           proline-rich domain. The retroviruses thus used this
           domain to hijack the ESCRT system of the cell.
          Length = 292

 Score = 28.8 bits (65), Expect = 4.0
 Identities = 22/101 (21%), Positives = 44/101 (43%), Gaps = 3/101 (2%)

Query: 271 EELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKNLRILIQENENLKNQLANSSQHT 330
            EL   L   +   +  +E  + ++A +R  L   E+NL +L    E L+  L +SS  +
Sbjct: 76  SELTAELREELRKYRGYLEQASESDAQLRSKLEEHEENLELLSGPEEELEALLPSSSP-S 134

Query: 331 SNLSVHEVGLKAEMEVAMKLLERDVHQKQDTIIELRRQLED 371
               V E    + +   +  L     +++  + EL+ + +D
Sbjct: 135 KTPEVSEQ--ISRLRELLNKLNELKAEREKLLEELKEKAQD 173


>gnl|CDD|109649 pfam00601, Flu_NS2, Influenza non-structural protein (NS2).  NS2
           may play a role in promoting normal replication of the
           genomic RNAs by preventing the replication of
           short-length RNA species.
          Length = 94

 Score = 27.2 bits (60), Expect = 4.3
 Identities = 16/57 (28%), Positives = 27/57 (47%)

Query: 290 SLTTTNALMREDLSIAEKNLRILIQENENLKNQLANSSQHTSNLSVHEVGLKAEMEV 346
           SL   N L RE+L    + +R LI E+  +     NS +  + L   ++  + E E+
Sbjct: 30  SLQERNRLWREELYQKFEEIRWLILEHRTVLANTENSFEQITFLQALQLLYEVEDEI 86


>gnl|CDD|235057 PRK02628, nadE, NAD synthetase; Reviewed.
          Length = 679

 Score = 29.1 bits (66), Expect = 4.4
 Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 2/20 (10%)

Query: 98  LQHFFIVLEHVLRHGLKPKK 117
           LQ FF  L + LR+G +P K
Sbjct: 569 LQDFF--LYYFLRYGFRPSK 586


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 29.1 bits (65), Expect = 4.5
 Identities = 26/126 (20%), Positives = 51/126 (40%), Gaps = 5/126 (3%)

Query: 248 HPEEVTSQVEDNMSLVLDQKNYIEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEK 307
             E    Q++      + +   +++    L A V  LQ ++  +T     +R  L+  +K
Sbjct: 368 QLETTLQQLQLERQKAVSEILELKKKLEALKALVRRLQRRLTLVTKERDGLRAILNSYDK 427

Query: 308 NLRILIQENENLKNQLANSSQHTSNLSVHEVGLKAEMEVAMKLLERDVHQKQDTIIELRR 367
            L        ++  QL    +   +L        A+ME  +  LE DV Q++D    L  
Sbjct: 428 ELTET-----SVSGQLMKRLEEAEDLVQKVQSHLAKMENQLSELEEDVGQQKDRNNTLET 482

Query: 368 QLEDIK 373
           +++ +K
Sbjct: 483 EIKLLK 488


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 29.3 bits (66), Expect = 4.8
 Identities = 23/127 (18%), Positives = 53/127 (41%), Gaps = 7/127 (5%)

Query: 254 SQVEDNMSLVLDQKNYIEELNRHLNATVG---NLQTQVESLTTTNALMREDLSIAEKNLR 310
            ++      V + + +++E   H ++      NL  Q+  L ++   ++++L+   K+ +
Sbjct: 788 ERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTK 847

Query: 311 ILIQENENLKNQLANSSQHTSNLSVHEVGLKAEME-VAMKLLERDVHQKQDTIIELRRQL 369
                 + L+ +     +    L     GL+ EM  +A      + +Q + +I E   QL
Sbjct: 848 ---LRRKKLEQERKALEKQLDQLDELLRGLRDEMRQLAELKEPANANQAEGSISERLDQL 904

Query: 370 EDIKMIN 376
           E+ K   
Sbjct: 905 EEFKRKR 911


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 29.2 bits (65), Expect = 4.9
 Identities = 25/122 (20%), Positives = 47/122 (38%), Gaps = 9/122 (7%)

Query: 271 EELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKNLRILIQENENLK-------NQL 323
           +EL       V  + T + +L T N  + E +  A K +   I+             N+ 
Sbjct: 256 QELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMK-ISQKIKTLREKWRALKSDSNKY 314

Query: 324 ANSSQHTSNLSVHEVGLKAEMEVAMKLLERDVHQKQDTIIELRRQLEDIKMINLEMYKKL 383
            N        S    G   +++  ++L E ++   Q  I EL +QL   + I+ E ++ +
Sbjct: 315 ENYVNAMKQKSQEWPGKLEKLKSEIELKEEEIKALQSNIDELHKQLRK-QGISTEQFELM 373

Query: 384 QA 385
             
Sbjct: 374 NQ 375


>gnl|CDD|236082 PRK07737, fliD, flagellar capping protein; Validated.
          Length = 501

 Score = 29.0 bits (65), Expect = 5.5
 Identities = 23/86 (26%), Positives = 37/86 (43%), Gaps = 16/86 (18%)

Query: 286 TQVESLTTTNALMREDLSIAEKNLRILIQENENLKNQLANSSQHTSNLSVHEVGLKAEME 345
            Q+   T++N L    L I E  LR  I+EN +   QL NS    S+ + +E G+     
Sbjct: 363 AQIGITTSSNYLDGGKLEIDETKLRQKIKENPDAVYQLFNSG--GSSSNYNEKGI----- 415

Query: 346 VAMKLLERDVHQKQDTIIELRRQLED 371
                      + +DT+ E  + +E 
Sbjct: 416 ---------ARRLRDTLKETIKSIEQ 432


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
           Synaptonemal complex protein 1 (SCP-1) is the major
           component of the transverse filaments of the
           synaptonemal complex. Synaptonemal complexes are
           structures that are formed between homologous
           chromosomes during meiotic prophase.
          Length = 787

 Score = 28.9 bits (64), Expect = 5.7
 Identities = 31/133 (23%), Positives = 58/133 (43%), Gaps = 10/133 (7%)

Query: 254 SQVEDNMSLVLDQKNYIEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKNLRILI 313
           +Q+E+   L  +    + E   HL   + +++  ++   +T   + EDL IA K +  L 
Sbjct: 271 NQLEEKTKLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLT 330

Query: 314 QENENLKNQLANSSQHTSNLSVHEVGLKAEMEVAMKLLERDVHQKQDTIIELRRQLEDIK 373
           +E E    +L N ++   +  V       E E     LE  +  +Q     L +  + +K
Sbjct: 331 EEKEAQMEEL-NKAKAAHSFVV------TEFEATTCSLEELLRTEQQ---RLEKNEDQLK 380

Query: 374 MINLEMYKKLQAL 386
           +I +E+ KK   L
Sbjct: 381 IITMELQKKSSEL 393


>gnl|CDD|219018 pfam06419, COG6, Conserved oligomeric complex COG6.  COG6 is a
           component of the conserved oligomeric golgi complex,
           which is composed of eight different subunits and is
           required for normal golgi morphology and localisation.
          Length = 615

 Score = 28.8 bits (65), Expect = 5.7
 Identities = 11/64 (17%), Positives = 26/64 (40%), Gaps = 1/64 (1%)

Query: 262 LVLDQKNYIEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKNLRILIQENENLKN 321
           L+      ++E    +   +  +   +  L  T   M+  L+ A+++ + L++E   L  
Sbjct: 29  LLKSNGEILKEF-GPVAEQLKRIGEDLSKLNNTCEEMKSQLNNAKQDTKDLLEEASTLME 87

Query: 322 QLAN 325
           Q   
Sbjct: 88  QKRQ 91


>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal.  This domain is the C-terminal
           region of the CASP family of proteins. It is a Golgi
           membrane protein which is thought to have a role in
           vesicle transport.
          Length = 245

 Score = 28.1 bits (63), Expect = 6.5
 Identities = 14/36 (38%), Positives = 26/36 (72%)

Query: 351 LERDVHQKQDTIIELRRQLEDIKMINLEMYKKLQAL 386
           LE ++ ++  TI  LRR+LE +K  N+++Y+K++ L
Sbjct: 98  LEEELRKQNQTISSLRRELESLKADNIKLYEKIRYL 133


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 28.6 bits (64), Expect = 6.6
 Identities = 21/105 (20%), Positives = 46/105 (43%), Gaps = 7/105 (6%)

Query: 284 LQTQVESLTTTNALMREDLSIAEKNLRILIQE-NENLKNQLANSSQHTSNLSVHE--VGL 340
           LQ Q++ L ++   +  ++    K L   I++ NE L      + +      + +  V L
Sbjct: 275 LQEQIDELESSIEEVLSEI----KALASKIKQVNEELTTVRQENEELEEEYKIKKRTVEL 330

Query: 341 KAEMEVAMKLLERDVHQKQDTIIELRRQLEDIKMINLEMYKKLQA 385
             + E  +  L+  V    + ++EL +Q E  +   ++ Y+ L+ 
Sbjct: 331 LPDAENNVAKLQALVVASSERLLELAQQWEAHRTPLIDEYRSLKE 375


>gnl|CDD|234608 PRK00064, recF, recombination protein F; Reviewed.
          Length = 361

 Score = 28.6 bits (65), Expect = 6.7
 Identities = 19/79 (24%), Positives = 31/79 (39%), Gaps = 12/79 (15%)

Query: 130 LQLVEKLNPEASDI---------TASVRDLPTVKTHMGRARAWLRLALMQKKLADYLQ-- 178
           L+ +E+L P A+            AS+    +V+    +    L  AL + +  D  +  
Sbjct: 192 LEYLERLAPLAAKTHQEISPEFELASLSYQSSVEDDAEKIEEDLLEALAKNRERDRARGR 251

Query: 179 VLVD-HRDDILVEYFETDA 196
            LV  HRDD+        A
Sbjct: 252 TLVGPHRDDLRFRINGLPA 270


>gnl|CDD|133043 cd06421, CESA_CelA_like, CESA_CelA_like are involved in the
           elongation of the glucan chain of cellulose.  Family of
           proteins related to  Agrobacterium tumefaciens CelA and 
           Gluconacetobacter xylinus BscA. These proteins are
           involved in the elongation of the glucan chain of
           cellulose, an aggregate of unbranched polymers of
           beta-1,4-linked glucose residues. They are putative
           catalytic subunit of cellulose synthase, which is a
           glycosyltransferase using UDP-glucose as the substrate.
           The catalytic subunit is an integral membrane protein
           with 6 transmembrane segments and it is postulated that
           the protein is anchored in the membrane at the
           N-terminal end.
          Length = 234

 Score = 27.9 bits (63), Expect = 7.0
 Identities = 10/27 (37%), Positives = 13/27 (48%), Gaps = 3/27 (11%)

Query: 91  LDSDHMPLQHFFIVLEHVLRHGLKPKK 117
           LD+DH+P   F   L   L + L   K
Sbjct: 91  LDADHVPTPDF---LRRTLGYFLDDPK 114


>gnl|CDD|223786 COG0714, COG0714, MoxR-like ATPases [General function prediction
           only].
          Length = 329

 Score = 28.2 bits (63), Expect = 7.4
 Identities = 37/194 (19%), Positives = 61/194 (31%), Gaps = 27/194 (13%)

Query: 37  EVRVRDPPELRSPSPEPPLFVA-RNPVIIERRNLVNISKLIVKELIETSQKHGRMLDSDH 95
           E +V  P       P P + +A +NP   E              L E +    R L   +
Sbjct: 138 ERQVTVPGLTTIRLPPPFIVIATQNPGEYEG----------TYPLPE-ALL-DRFLLRIY 185

Query: 96  MP-LQHFFIVLEHVLRHGLKPKKGLLGPKKELWDILQLVE--------KLNPEASD---- 142
           +            + R G   +  L    K +    +L+          ++ E  D    
Sbjct: 186 VDYPDSEEEERIILARVGGVDELDLESLVKPVLSDEELLRLQKEVKKVPVSDEVIDYIVT 245

Query: 143 ITASVRDLPTVKTHMGRARAWLRLALMQKKLADYLQVLVDHRDDILVEYFETDALLMSDE 202
           + A++R+ P V       RA L L    + LA          DD+        A  +  E
Sbjct: 246 LVAALREAPDVALGAS-PRASLALLAALRALALLDGRDAVIPDDVKALAEPALAHRLILE 304

Query: 203 AIVILGKLLSLNVI 216
               L  L  L+++
Sbjct: 305 LEAKLSGLSVLDIV 318


>gnl|CDD|225104 COG2193, Bfr, Bacterioferritin (cytochrome b1) [Inorganic ion
           transport and metabolism].
          Length = 157

 Score = 27.7 bits (62), Expect = 7.6
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query: 346 VAMKLLERDVHQKQDTIIELRRQLEDIKMINLEMYKKLQA 385
           V+  LLE  +  +++ I  L  QL+ I  I  E Y + Q 
Sbjct: 115 VSRDLLEEILADEEEHIDWLETQLDLIAKIGEENYLQSQM 154


>gnl|CDD|100813 PRK01203, PRK01203, prefoldin subunit alpha; Provisional.
          Length = 130

 Score = 27.2 bits (60), Expect = 8.2
 Identities = 13/51 (25%), Positives = 24/51 (47%)

Query: 263 VLDQKNYIEELNRHLNATVGNLQTQVESLTTTNALMREDLSIAEKNLRILI 313
           V  Q NYIE L   +++ + +L   +  +  T + + ++     K L I I
Sbjct: 5   VEAQLNYIESLISSVDSQIDSLNKTLSEVQQTISFLSDNELDNSKELLISI 55


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.135    0.383 

Gapped
Lambda     K      H
   0.267   0.0724    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,798,522
Number of extensions: 1980851
Number of successful extensions: 2164
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2134
Number of HSP's successfully gapped: 124
Length of query: 386
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 287
Effective length of database: 6,546,556
Effective search space: 1878861572
Effective search space used: 1878861572
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (26.9 bits)