RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7576
         (481 letters)



>gnl|CDD|217838 pfam04004, Leo1, Leo1-like protein.  Members of this family are
           part of the Paf1/RNA polymerase II complex. The Paf1
           complex probably functions during the elongation phase
           of transcription. The Leo1 subunit of the yeast
           Paf1-complex binds RNA and contributes to complex
           recruitment. The subunit acts by co-ordinating
           co-transcriptional chromain modifications and helping
           recruitment of mRNA 3prime-end processing factors.
          Length = 312

 Score =  227 bits (581), Expect = 5e-71
 Identities = 111/297 (37%), Positives = 163/297 (54%), Gaps = 25/297 (8%)

Query: 130 EEGDGEGERRERDEDARSDASRRGSDR------EDDEDKENRTETGEDKEEEEPVETRID 183
           +         +R+  +  D   RG +        +D  + +  E   +   E+ V  +  
Sbjct: 1   DREQDSHRSNDRNSLSGDDDLSRGGNGDDGEINGEDPSQFSEIEEATETFPEKDVAVKFQ 60

Query: 184 VEIPKISCDL---GREMHFVKLPNFLSVETRPFDPETYEDEIEDEETL---DEEGRARLK 237
            E+P  +  L        F K+PNFLSVE +PFDPET+E E+  E+     +   R R+K
Sbjct: 61  TEVPTQNFGLPAGSNPSFFAKIPNFLSVEPKPFDPETFEAEMTKEDAKDEHEAHWRQRIK 120

Query: 238 LKVENTIRWRDTFNKDGDLVKESNARLVKWSDGSLSLHLGSEIFDVYKQPLQGDHNHLYI 297
           L+VENT+RWR+  +KDG   KESNA++V+WSDGS SLH+G+EIFD+  QPL  + NHL +
Sbjct: 121 LRVENTVRWRENKSKDGKTTKESNAQIVRWSDGSYSLHIGNEIFDIMHQPLSVNQNHLTV 180

Query: 298 RQ-GTGLQGQAVFRTKLSFRPHSTESFTHRKMTLSLADRSNKTTGIKILGQVGYDPDANR 356
               T LQ Q V + KL+FRP ST SFTHRK+T +LA R  K+ G+KI   VG DP+  +
Sbjct: 181 SHEQTLLQPQGVIKKKLTFRPSSTNSFTHRKLTRNLATRQRKSGGVKIT-IVGKDPEHEK 239

Query: 357 YEKIKKEEEKLRQSLRKETKTKRTREK-----------GASSRLSAGYLEGDDDDDE 402
            E+ KKEE+KLR   R++ + K   +               ++ +  Y E +D+  +
Sbjct: 240 REREKKEEQKLRARRRRQNREKMKNKPPNRPGHGSGSDSNVAKAATTYSEDEDEGSD 296


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 41.1 bits (96), Expect = 0.001
 Identities = 26/136 (19%), Positives = 56/136 (41%), Gaps = 16/136 (11%)

Query: 56  DHLGRIEVAAD-QLQGQTEVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGDASD 114
           +  G I+   D +++G       +E     +  + D H             +   G+  +
Sbjct: 778 EDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQ-E 836

Query: 115 ISTDDDDDGK--LQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDK 172
           ++ ++  + K   + V+  G  +G   E +E+            E++E++E   E  E++
Sbjct: 837 LNAENQGEAKQDEKGVDGGGGSDGGDSEEEEE------------EEEEEEEEEEEEEEEE 884

Query: 173 EEEEPVETRIDVEIPK 188
           EEEE  E  + +E P+
Sbjct: 885 EEEEENEEPLSLEWPE 900



 Score = 33.0 bits (75), Expect = 0.46
 Identities = 36/172 (20%), Positives = 62/172 (36%), Gaps = 11/172 (6%)

Query: 119 DDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPV 178
            + +G+ ++  +E D +G     + +A        ++ E  ED E   ETGE+ EE E  
Sbjct: 691 GEQEGEGEIEAKEADHKG-----ETEAEEVEHEGETEAEGTED-EGEIETGEEGEEVED- 743

Query: 179 ETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLDEEGRARLKL 238
           E   + E        G      K              +  E EI+  E  + +G    + 
Sbjct: 744 EGEGEAEGKHEVETEGDR----KETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEG 799

Query: 239 KVENTIRWRDTFNKDGDLVKESNARLVKWSDGSLSLHLGSEIFDVYKQPLQG 290
           KVE+          + +   E+ A   +  D +    L +E     KQ  +G
Sbjct: 800 KVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKG 851



 Score = 31.9 bits (72), Expect = 0.93
 Identities = 31/167 (18%), Positives = 57/167 (34%), Gaps = 6/167 (3%)

Query: 71  QTEVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGDASDISTDDDDDGKLQVVEE 130
             E   + E    ++ ++ DH  E     +         G   +   +  ++G+    E 
Sbjct: 686 PAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEG 745

Query: 131 EGDGEGER---RERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPVETRIDVEIP 187
           EG+ EG+     E D           ++ ++DED E   + GED E +        VE  
Sbjct: 746 EGEAEGKHEVETEGDRKETEHEGETEAEGKEDED-EGEIQAGEDGEMKGDEGAEGKVEHE 804

Query: 188 KISCDLGREMHFVKLPNFLSVETRPFDPETYE--DEIEDEETLDEEG 232
             +    ++ H  +            +    E   E + E   DE+G
Sbjct: 805 GETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKG 851


>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
           (nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
           phosphoribosyltransferase) [Coenzyme metabolism].
          Length = 620

 Score = 37.5 bits (87), Expect = 0.016
 Identities = 38/182 (20%), Positives = 57/182 (31%), Gaps = 45/182 (24%)

Query: 86  GEDLDHHLEALVALLLKLNAEALFGDASDISTDDDDD-----------GKLQVVEEEGDG 134
           G+        ++ L      +    D   ++ + DD            G + + EE GD 
Sbjct: 166 GDAPPDSAGKVLDLWRDEIEDKAGEDLDGLAAEIDDQQAFARVVRDMLGSMDMAEETGDD 225

Query: 135 EGERRERDEDARSDASRRGSDRE--------------------DDEDKENRTETGEDKEE 174
             E    +ED   D      D E                    D E +E   +  E  E+
Sbjct: 226 GIEEDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASED 285

Query: 175 EEPVETRIDVEIP-------KISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEET 227
            E  E+  D E P           +L  E+ +        V TR FD     +E+ DE  
Sbjct: 286 SESDESDEDTETPGEDARPATPFTELMEEVDY-------KVFTREFDEIVLAEELCDEAE 338

Query: 228 LD 229
           LD
Sbjct: 339 LD 340


>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 1001

 Score = 34.6 bits (79), Expect = 0.15
 Identities = 21/68 (30%), Positives = 31/68 (45%)

Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
           S D+ D+ + +V E E   + E  E DED  SD S      ++ E+  +  E GED +E 
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986

Query: 176 EPVETRID 183
           E       
Sbjct: 987 ESKAAYDS 994


>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit.  This model
           describes Pseudomonas denitrificans CobT gene product,
           which is a cobalt chelatase subunit that functions in
           cobalamin biosynthesis. Cobalamin (vitamin B12) can be
           synthesized via several pathways, including an aerobic
           pathway (found in Pseudomonas denitrificans) and an
           anaerobic pathway (found in P. shermanii and Salmonella
           typhimurium). These pathways differ in the point of
           cobalt insertion during corrin ring formation. There are
           apparently a number of variations on these two pathways,
           where the major differences seem to be concerned with
           the process of ring contraction. Confusion regarding the
           functions of enzymes found in the aerobic vs. anaerobic
           pathways has arisen because nonhomologous genes in these
           different pathways were given the same gene symbols.
           Thus, cobT in the aerobic pathway (P. denitrificans) is
           not a homolog of cobT in the anaerobic pathway (S.
           typhimurium). It should be noted that E. coli
           synthesizes cobalamin only when it is supplied with the
           precursor cobinamide, which is a complex intermediate.
           Additionally, all E. coli cobalamin synthesis genes
           (cobU, cobS and cobT) were named after their Salmonella
           typhimurium homologs which function in the anaerobic
           cobalamin synthesis pathway. This model describes the
           aerobic cobalamin pathway Pseudomonas denitrificans CobT
           gene product, which is a cobalt chelatase subunit, with
           a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
           heterotrimeric, ATP-dependent enzyme that catalyzes
           cobalt insertion during cobalamin biosynthesis. The
           other two subunits are the P. denitrificans CobS
           (TIGR01650) and CobN (pfam02514 CobN/Magnesium
           Chelatase) proteins. To avoid potential confusion with
           the nonhomologous Salmonella typhimurium/E.coli cobT
           gene product, the P. denitrificans gene symbol is not
           used in the name of this model [Biosynthesis of
           cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 600

 Score = 34.1 bits (78), Expect = 0.17
 Identities = 33/120 (27%), Positives = 51/120 (42%), Gaps = 9/120 (7%)

Query: 111 DASDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGE 170
           ++ D    DDD       EE+G+GEGE +E      S+A+ R S+   +E+     +   
Sbjct: 208 ESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESE-SGEEEMVQSDQDDL 266

Query: 171 DKEEEEPVETRIDVEIP-KISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLD 229
             E ++  ET  +   P +     G E  +        V T  FD     +E+ DEE LD
Sbjct: 267 PDESDDDSETPGEGARPARPFTSTGGEPDY-------KVFTTAFDETVDAEELCDEEELD 319


>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
            (vWA) domain [General function prediction only].
          Length = 4600

 Score = 33.8 bits (77), Expect = 0.26
 Identities = 28/140 (20%), Positives = 49/140 (35%), Gaps = 30/140 (21%)

Query: 114  DISTDDDDDGKLQVVEEEGDGEGERR-----ERDEDARSDASRRGSDREDDEDKENRTE- 167
            D+  +  + G+L+     G G G          D+D    A+   +  + D D+    E 
Sbjct: 3801 DLPQEKSNSGELESGTGLGSGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESEAREL 3860

Query: 168  ----TGE---------DKEEEEPVETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFD 214
                 G          +  + E     +D E+  I  D         L N  S+  + +D
Sbjct: 3861 ESDMNGVTKDSVVSENENSDSEEENQDLDEEVNDIPED---------LSN--SLNEKLWD 3909

Query: 215  PETYEDEIEDEETLDEEGRA 234
                ED +E E+  +E+  A
Sbjct: 3910 EPNEEDLLETEQKSNEQSAA 3929



 Score = 28.8 bits (64), Expect = 8.2
 Identities = 20/86 (23%), Positives = 36/86 (41%), Gaps = 5/86 (5%)

Query: 114  DISTDDDDD--GKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGED 171
            D S + + D      + E+E D  G+++ +D    SD  + G   ED    E  +    +
Sbjct: 4232 DESANQNPDSMNSTNIAEDEADEVGDKQLQDGQDISDIKQTG---EDTLPTEFGSINQSE 4288

Query: 172  KEEEEPVETRIDVEIPKISCDLGREM 197
            K  E   +  I+ E+P  +  +   M
Sbjct: 4289 KVFELSEDEDIEDELPDYNVKITPAM 4314


>gnl|CDD|237115 PRK12489, PRK12489, anaerobic C4-dicarboxylate transporter;
           Reviewed.
          Length = 443

 Score = 32.5 bits (75), Expect = 0.48
 Identities = 12/41 (29%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 149 ASRRGSDREDDEDKENRTETGEDKEE-EEPVETRIDVEIPK 188
           + RRG D + D + + R +  E +E       T +D ++PK
Sbjct: 192 SLRRGKDLDKDPEFQERLKDPEFREYVYGTTATLLDKKLPK 232


>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
           chain; Provisional.
          Length = 1033

 Score = 32.9 bits (75), Expect = 0.51
 Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 1/60 (1%)

Query: 129 EEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPVETRIDVEIPK 188
           EEE        E DE+   +A  R +  + D+D+    +  ED+E++E  E+    EI K
Sbjct: 1   EEEQVNTQANEEEDEE-ELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISK 59


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 32.7 bits (75), Expect = 0.51
 Identities = 17/86 (19%), Positives = 37/86 (43%), Gaps = 15/86 (17%)

Query: 111 DASDISTDDDDDGKL---------------QVVEEEGDGEGERRERDEDARSDASRRGSD 155
           + S  S DD DD                  +  EE+G  + +  + D+D   +       
Sbjct: 293 EDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLS 352

Query: 156 REDDEDKENRTETGEDKEEEEPVETR 181
            E++++++  ++  +D+EEEE  + +
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEK 378



 Score = 28.8 bits (65), Expect = 7.8
 Identities = 29/122 (23%), Positives = 52/122 (42%), Gaps = 15/122 (12%)

Query: 124 KLQVVEEE------GDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEP 177
           +L+ +E E      G+ E +  E D    +D      + +DD++        +++EEE+ 
Sbjct: 270 RLKKLEAERLRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDG 329

Query: 178 VETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLDEEGRARLK 237
           V+   D +      DL  E   V L +         D E  + + ED+E  +EE + + K
Sbjct: 330 VD---DEDEEDDDDDLEEEEEDVDLSDEEE------DEEDEDSDDEDDEEEEEEEKEKKK 380

Query: 238 LK 239
            K
Sbjct: 381 KK 382


>gnl|CDD|237171 PRK12678, PRK12678, transcription termination factor Rho;
           Provisional.
          Length = 672

 Score = 32.6 bits (75), Expect = 0.59
 Identities = 17/56 (30%), Positives = 26/56 (46%)

Query: 110 GDASDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENR 165
            D  D   DD+ + +     ++G+G G RR R    R    RRG D  ++ + E R
Sbjct: 233 RDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPELR 288



 Score = 30.6 bits (70), Expect = 2.0
 Identities = 18/67 (26%), Positives = 24/67 (35%), Gaps = 1/67 (1%)

Query: 110 GDASDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETG 169
           G   +   D DD  +    E+ GD   ER  RD   R    RR   R+   D        
Sbjct: 191 GRREERGRDGDDRDRRDRREQ-GDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGD 249

Query: 170 EDKEEEE 176
            D ++ E
Sbjct: 250 RDGDDGE 256



 Score = 29.5 bits (67), Expect = 5.2
 Identities = 14/66 (21%), Positives = 23/66 (34%), Gaps = 2/66 (3%)

Query: 118 DDDDDGKLQVVEEEGDGE--GERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
              + G+    E   D     E  ERDE  R           +  ++  R E G D ++ 
Sbjct: 144 KAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDR 203

Query: 176 EPVETR 181
           +  + R
Sbjct: 204 DRRDRR 209


>gnl|CDD|226950 COG4584, COG4584, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 278

 Score = 31.7 bits (72), Expect = 0.65
 Identities = 13/49 (26%), Positives = 18/49 (36%), Gaps = 2/49 (4%)

Query: 231 EGRARLKLKVENTIR--WRDTFNKDGDLVKESNARLVKWSDGSLSLHLG 277
            GR R K KVE  +    R  F      ++  N R + W     +   G
Sbjct: 221 PGRPRTKGKVERMVGYVARSFFLPGTLDLENLNRRALLWLADVANRRHG 269


>gnl|CDD|217503 pfam03344, Daxx, Daxx Family.  The Daxx protein (also known as the
           Fas-binding protein) is thought to play a role in
           apoptosis, but precise role played by Daxx remains to be
           determined. Daxx forms a complex with Axin.
          Length = 715

 Score = 31.8 bits (72), Expect = 0.84
 Identities = 21/118 (17%), Positives = 46/118 (38%), Gaps = 6/118 (5%)

Query: 113 SDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDK 172
             +  +++++ + +  E+E + E    E +E+     +  GS+ E +   E   +  E +
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADN--GSEEEMEGSSEGDGDGEEPE 500

Query: 173 EEEEPVETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLDE 230
           E+ E   + +            R           S +  P  PE+ + E   EE+ +E
Sbjct: 501 EDAERRNSEMAGISRMSEGQQPRGSSVQP----ESPQEEPLQPESMDAESVGEESDEE 554


>gnl|CDD|219939 pfam08619, Nha1_C, Alkali metal cation/H+ antiporter Nha1 C
           terminus.  The C terminus of the plasma membrane Nha1
           antiporter plays an important role in the immediate cell
           response to hypo-osmotic shock which prevents an
           execessive loss of ions and water. This domain is found
           with pfam00999.
          Length = 430

 Score = 31.3 bits (71), Expect = 1.3
 Identities = 21/112 (18%), Positives = 43/112 (38%), Gaps = 3/112 (2%)

Query: 73  EVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGDASDISTDDDD---DGKLQVVE 129
           + ++ +         ++D    A   +      E    D+S ++    +   +  L    
Sbjct: 300 DFIKQIRNLDPKARREVDLETNAPERVKAPARDEVDTNDSSPLTDSKSEEKLEDDLAHSL 359

Query: 130 EEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPVETR 181
              +    R+ R+  + S  SR  S R+ +++   R ++  D EEE P E R
Sbjct: 360 LGSEASERRKNREPSSSSPPSRPRSRRDSEDEDTERDDSDSDDEEETPAERR 411


>gnl|CDD|224595 COG1681, FlaB, Archaeal flagellins [Cell motility and secretion].
          Length = 209

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 7/24 (29%), Positives = 13/24 (54%)

Query: 339 TTGIKILGQVGYDPDANRYEKIKK 362
           ++GI+++G VGY  +      I  
Sbjct: 51  SSGIEVVGAVGYAGNTTPGGTIDY 74


>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
           Superfamily; Provisional.
          Length = 844

 Score = 31.0 bits (70), Expect = 1.7
 Identities = 14/64 (21%), Positives = 26/64 (40%), Gaps = 3/64 (4%)

Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
           S DDDDD +   + E+ + +       E +  D S        D D++  T+   D+  +
Sbjct: 85  SLDDDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLEL---PDRDEDADTQANNDQTND 141

Query: 176 EPVE 179
              +
Sbjct: 142 FDQD 145


>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2.  PPP4R2 (protein phosphatase 4 core
           regulatory subunit R2) is the regulatory subunit of the
           histone H2A phosphatase complex. It has been shown to
           confer resistance to the anticancer drug cisplatin in
           yeast, and may confer resistance in higher eukaryotes.
          Length = 285

 Score = 30.6 bits (69), Expect = 1.7
 Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 1/76 (1%)

Query: 113 SDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKE-NRTETGED 171
               +  D +  L+   E+   +    + D    SD   + SD E+D+D++ +  E  E 
Sbjct: 206 GLPDSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDDQDGDYVEEKEL 265

Query: 172 KEEEEPVETRIDVEIP 187
           KE+EE  ET  + E  
Sbjct: 266 KEDEEEEETEEEEEEE 281


>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
          Length = 306

 Score = 30.6 bits (69), Expect = 1.7
 Identities = 28/106 (26%), Positives = 40/106 (37%), Gaps = 19/106 (17%)

Query: 83  VVQGEDLDHHLEALVALLLKLNAEALFGDASDISTDDDDDG-KLQVVEEEGDGEGERRER 141
           VV G     H + ++AL              D++  D DD  +L   E   D  G     
Sbjct: 18  VVTGSTPTEHEQNMLAL--------------DVAARDGDDAIELAEPETSDDPYGNPDPF 63

Query: 142 DEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPVETRIDVEIP 187
            ED       R S+  ++E+     E GE+ EEEE  E   + E  
Sbjct: 64  GEDDEGRIEVRISEDGEEEE----VEEGEEDEEEEGEEESEEFEPL 105


>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic
           subunit.
          Length = 1044

 Score = 30.7 bits (69), Expect = 2.1
 Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 3/71 (4%)

Query: 123 GKLQVVEEEGDGEGERRERDEDARSDASRRG---SDREDDEDKENRTETGEDKEEEEPVE 179
           GK    E EG G  E ++R +  ++   +RG    D  DD D +   E     E  +P+ 
Sbjct: 137 GKDLEAEREGYGNAEWKDRVDKWKTRQEKRGLVNKDDSDDGDDKGDEEEYLLAEARQPLW 196

Query: 180 TRIDVEIPKIS 190
            ++ +   KI+
Sbjct: 197 RKVPIPSSKIN 207


>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor.  This
           product is encoded by astrovirus ORF2, one of the three
           astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
           undergoes an intracellular cleavage to form a 79kD
           protein. Subsequently, extracellular trypsin cleavage
           yields the three proteins forming the infectious virion.
          Length = 787

 Score = 30.5 bits (69), Expect = 2.1
 Identities = 22/94 (23%), Positives = 29/94 (30%), Gaps = 15/94 (15%)

Query: 100 LLKLNAEALFGDASDISTDDDDDGKLQVVEEEGDGEGERRERDEDARSDASR----RGSD 155
            L  +   L G       D  D  +  +  EE D E    E  ED   +  R      S 
Sbjct: 643 SLPQSRAELSGLEERAVEDAPDPYEDLISLEETDTED---ESTEDEDDELDRFDLHDSSG 699

Query: 156 REDDEDKENRTET--------GEDKEEEEPVETR 181
            E ++D EN   T        G   E    +  R
Sbjct: 700 SEPEDDDENNRVTLLSTLINQGMSVERATRITKR 733


>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2.  Transcripts
           harbouring premature signals for translation termination
           are recognised and rapidly degraded by eukaryotic cells
           through a pathway known as nonsense-mediated mRNA decay.
           In Saccharomyces cerevisiae, three trans-acting factors
           (Upf1 to Upf3) are required for nonsense-mediated mRNA
           decay.
          Length = 171

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 1/62 (1%)

Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRG-SDREDDEDKENRTETGEDKEE 174
           S  + DDG+      E D + E  + +E    D  +   SD E+++    R E   D E 
Sbjct: 3   SESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPEA 62

Query: 175 EE 176
           E 
Sbjct: 63  EA 64


>gnl|CDD|181997 PRK09620, PRK09620, hypothetical protein; Provisional.
          Length = 229

 Score = 29.9 bits (67), Expect = 2.5
 Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 4/55 (7%)

Query: 188 KISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETLDEE---GRARLKLK 239
           KIS D+   +HF K P  L  + + +DPET     + E  ++EE    RA+ +++
Sbjct: 118 KISSDIAPIIHFQKAPKVLK-QIKQWDPETVLVGFKLESDVNEEELFERAKNRME 171


>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
           Provisional.
          Length = 2849

 Score = 30.4 bits (68), Expect = 3.2
 Identities = 12/39 (30%), Positives = 21/39 (53%)

Query: 140 ERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEPV 178
           E D   R +      D ++DED ++  +  E++EEEE +
Sbjct: 142 EEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEI 180


>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
          Length = 534

 Score = 30.0 bits (67), Expect = 3.4
 Identities = 14/34 (41%), Positives = 17/34 (50%), Gaps = 5/34 (14%)

Query: 109 FGDASDISTDDDDDGKLQVVEEEGDGEGERRERD 142
           +GD SD   + DD G  Q +EEE     E  ER 
Sbjct: 495 WGDESDSGIEHDDGGVGQAIEEE-----EEEERA 523


>gnl|CDD|218538 pfam05285, SDA1, SDA1.  This family consists of several SDA1
           protein homologues. SDA1 is a Saccharomyces cerevisiae
           protein which is involved in the control of the actin
           cytoskeleton. The protein is essential for cell
           viability and is localised in the nucleus.
          Length = 317

 Score = 29.2 bits (66), Expect = 4.5
 Identities = 18/67 (26%), Positives = 34/67 (50%), Gaps = 1/67 (1%)

Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
             D DD+G+   VE + + E    E DE+ + +A+++  +  D+E  E   E   ++EE 
Sbjct: 109 DEDSDDEGEWIDVESDKEIESSDSE-DEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEA 167

Query: 176 EPVETRI 182
           E  + + 
Sbjct: 168 EAEKEKA 174



 Score = 29.2 bits (66), Expect = 4.8
 Identities = 36/143 (25%), Positives = 53/143 (37%), Gaps = 14/143 (9%)

Query: 62  EVAADQLQ----GQTEVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGD-ASDIS 116
           EV  + L+    G+   +      +   GE+        + LL K   E      A    
Sbjct: 33  EVNPELLKKKDRGKPASMGLEAGEKKGYGEEEVVDGIPGLELLEKWKEEERKKKEAEQGL 92

Query: 117 TDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDK---------ENRTE 167
             DDDD + +  E E D + +      D  SD     SD ED+E+K         ++  E
Sbjct: 93  ESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEE 152

Query: 168 TGEDKEEEEPVETRIDVEIPKIS 190
             E+ EEE   E   + E  K S
Sbjct: 153 LSEEDEEEAAEEEEAEAEKEKAS 175


>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
           Reviewed.
          Length = 460

 Score = 29.2 bits (65), Expect = 5.3
 Identities = 24/120 (20%), Positives = 41/120 (34%), Gaps = 7/120 (5%)

Query: 62  EVAADQLQGQTEVVQDLEAARVVQGEDLDHHLEALVALLLKLNAEALFGDASDISTDDDD 121
           E+AA  +      V  +      Q +D+   ++A VA              S +   D D
Sbjct: 302 ELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVA-----EVTDEVAAESVVQVADRD 356

Query: 122 DGKLQVVEEEGDGEGERRERDEDA--RSDASRRGSDREDDEDKENRTETGEDKEEEEPVE 179
                 VEE  + + ER +  + A     A +  ++      +E      E  +E EP  
Sbjct: 357 GESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEV 416


>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
          Length = 651

 Score = 29.2 bits (66), Expect = 5.7
 Identities = 13/52 (25%), Positives = 26/52 (50%), Gaps = 5/52 (9%)

Query: 332 LADRSNKTTGIKILGQVGYDPDANRYEKIKKEEEKLRQSLRKETKTKRTREK 383
           + D+ +  +  K+      D +  + EK +KE  K ++ LRK  K +  ++K
Sbjct: 535 IEDKPDGPSVWKL-----DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKK 581


>gnl|CDD|217829 pfam03985, Paf1, Paf1.  Members of this family are components of
           the RNA polymerase II associated Paf1 complex. The Paf1
           complex functions during the elongation phase of
           transcription in conjunction with Spt4-Spt5 and
           Spt16-Pob3i.
          Length = 431

 Score = 28.9 bits (65), Expect = 5.8
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 6/60 (10%)

Query: 118 DDDDDGKLQV-----VEEEGDGEGERRERDEDARSDASRR-GSDREDDEDKENRTETGED 171
           +D+D+ + Q       EE  D E E  +  ED  S++S   GSD E   DKE+ +++  +
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431


>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
           This family includes NS2 proteins from other members of
           the Orbivirus genus. NS2 is a non-specific
           single-stranded RNA-binding protein that forms large
           homomultimers and accumulates in viral inclusion bodies
           of infected cells. Three RNA binding regions have been
           identified in Bluetongue virus serotype 17 at residues
           2-11, 153-166 and 274-286. NS2 multimers also possess
           nucleotidyl phosphatase activity. The precise function
           of NS2 is not known, but it may be involved in the
           transport and condensation of viral mRNAs.
          Length = 363

 Score = 29.1 bits (65), Expect = 6.3
 Identities = 21/110 (19%), Positives = 36/110 (32%), Gaps = 10/110 (9%)

Query: 367 LRQSLRKE-TKTKRTREKGASSRLSAGYLEGDDDDDEGAI---------SLSAIKNKYNA 416
           LRQ +R+E  K K  +    + +        D D+DE  +           S    K   
Sbjct: 157 LRQKIREEREKEKEEQPMKPAFKPERWMGGPDSDEDENPLDEEAPDMTPETSKQDQKEER 216

Query: 417 KKNPAGASGAAPSRNIYSSDEDASDLELRTKKHDKPKRVLQDSDDEDEKS 466
           +               ++ +E   DL    +  +  +R   D D+  E S
Sbjct: 217 RAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDS 266


>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein.  This family consists of several
           NOA36 proteins which contain 29 highly conserved
           cysteine residues. The function of this protein is
           unknown.
          Length = 314

 Score = 28.8 bits (64), Expect = 6.6
 Identities = 14/61 (22%), Positives = 23/61 (37%), Gaps = 8/61 (13%)

Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
           S++   D K      E D +      D D   D         DD+D E+  +T  +  +E
Sbjct: 257 SSNKYGDFKGVNYGYESDDDEGSSSNDYDEEED--------GDDDDNEDNDDTNTNHGDE 308

Query: 176 E 176
           +
Sbjct: 309 D 309


>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein.  CDC45 is an essential gene
           required for initiation of DNA replication in S.
           cerevisiae, forming a complex with MCM5/CDC46.
           Homologues of CDC45 have been identified in human, mouse
           and smut fungus among others.
          Length = 583

 Score = 29.2 bits (66), Expect = 6.6
 Identities = 13/64 (20%), Positives = 27/64 (42%), Gaps = 5/64 (7%)

Query: 118 DDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEEP 177
           D D + +LQ      D   +  E D+D          + +++  K    E  +D ++++ 
Sbjct: 103 DGDIEEELQDEPRYDDAYRDLEEDDDDDEES-----DEEDEESSKSEDDEDDDDDDDDDD 157

Query: 178 VETR 181
           + TR
Sbjct: 158 IATR 161


>gnl|CDD|132321 TIGR03278, methan_mark_10, methanogenesis marker radical SAM
           protein.  Members of this protein family, to date, are
           found in a completed prokaryotic genome if and only if
           the species is one of the archaeal methanogens. It is a
           radical SAM enzyme by homology. The exact function is
           unknown, but likely is linked to methanogenesis. In most
           genomes, the member of this family is encoded by a gene
           next to, and divergently transcribed from, the methyl
           coenzyme M reductase operon.
          Length = 404

 Score = 28.9 bits (65), Expect = 6.7
 Identities = 19/120 (15%), Positives = 42/120 (35%), Gaps = 27/120 (22%)

Query: 253 DGDLVKESNARLVKW-SDGSLSLHLGSEIFDVYKQPLQGDHNHLYIRQGTGLQGQAVFRT 311
           DGD++ ++ A L  W +   + +   +            +   L +     + G      
Sbjct: 181 DGDVLWKTCADLESWGAKALILMRFANT-----------EEQGLILGNAPIIPGI----- 224

Query: 312 KLSFRPHSTESFTHRKMTLSLADRSNKTTGIKILGQVGYDPDANRYEKIKKEEEKLRQSL 371
               +PH+   F       ++   ++K   I++ G    DP+      + KE  +  + L
Sbjct: 225 ----KPHTVSEFK------NIVRETHKEFPIRVTGTPLCDPETGAPFALAKEPNEYLEIL 274


>gnl|CDD|236198 PRK08243, PRK08243, 4-hydroxybenzoate 3-monooxygenase; Validated.
          Length = 392

 Score = 28.6 bits (65), Expect = 6.9
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 70  GQTEVVQDLEAARVVQGEDL 89
           GQTEV +DL AAR+  G  +
Sbjct: 101 GQTEVTRDLMAARLAAGGPI 120


>gnl|CDD|227847 COG5560, UBP12, Ubiquitin C-terminal hydrolase [Posttranslational
           modification, protein turnover, chaperones].
          Length = 823

 Score = 29.1 bits (65), Expect = 7.5
 Identities = 23/80 (28%), Positives = 35/80 (43%), Gaps = 3/80 (3%)

Query: 169 GEDKEEEEPVETRIDVEIPKISCDLGREMHFVKLPNFLSVETRPFDPETYEDEIEDEETL 228
             DK  +E  E  + VE+ K   DL  E   V+L    S  +     ET E + + EE +
Sbjct: 574 IYDKLVKEFEELLVLVEMKKTDVDLVSEQ--VRLLREESSPSSWLKLET-EIDTKREEQV 630

Query: 229 DEEGRARLKLKVENTIRWRD 248
           +EEG+      V  +  W +
Sbjct: 631 EEEGQMNFNDAVVISCEWEE 650


>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
           [Transcription].
          Length = 392

 Score = 28.5 bits (63), Expect = 8.4
 Identities = 12/59 (20%), Positives = 22/59 (37%)

Query: 118 DDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEEE 176
            ++D        E  +     +E+ ++   +A     + + D   E   E  ED E EE
Sbjct: 273 SEEDLDVGAAEIENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEE 331


>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
          Length = 639

 Score = 28.7 bits (64), Expect = 8.6
 Identities = 16/68 (23%), Positives = 34/68 (50%)

Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
           +TDDD   +   + ++   +  + + D+  ++  S+  S  E  E  +   +T E  +E+
Sbjct: 105 ATDDDTHSQTDDIIKQVTQKAGQDDSDQQEKNSQSQSASQAESLEHVQQSAQTSEKVDEK 164

Query: 176 EPVETRID 183
           EP+ T+ D
Sbjct: 165 EPLLTKTD 172


>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
           splicing factor.  These splicing factors consist of an
           N-terminal arginine-rich low complexity domain followed
           by three tandem RNA recognition motifs (pfam00076). The
           well-characterized members of this family are auxilliary
           components of the U2 small nuclear ribonuclearprotein
           splicing factor (U2AF). These proteins are closely
           related to the CC1-like subfamily of splicing factors
           (TIGR01622). Members of this subfamily are found in
           plants, metazoa and fungi.
          Length = 509

 Score = 28.7 bits (64), Expect = 8.7
 Identities = 12/37 (32%), Positives = 13/37 (35%)

Query: 138 RRERDEDARSDASRRGSDREDDEDKENRTETGEDKEE 174
           RR RD     D  RR  +R   ED   R     D   
Sbjct: 26  RRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRS 62


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 27.8 bits (62), Expect = 9.6
 Identities = 17/71 (23%), Positives = 38/71 (53%), Gaps = 1/71 (1%)

Query: 116 STDDDDDGKLQVVEEEGDGEGERRERDEDARSDASRRGSDREDDEDKENRTETGEDKEEE 175
            +DD +  +++ V+EE        ++++   ++     S+ E++E+ E  ++  E KE E
Sbjct: 53  KSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENE-KETE 111

Query: 176 EPVETRIDVEI 186
           E  E+ ++ EI
Sbjct: 112 EKTESNVEKEI 122


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.130    0.356 

Gapped
Lambda     K      H
   0.267   0.0696    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,650,273
Number of extensions: 2498522
Number of successful extensions: 3815
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3255
Number of HSP's successfully gapped: 241
Length of query: 481
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 380
Effective length of database: 6,457,848
Effective search space: 2453982240
Effective search space used: 2453982240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 61 (27.3 bits)