RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy8413
         (796 letters)



>gnl|CDD|149958 pfam09088, MIF4G_like, MIF4G like.  Members of this family are
           involved in mediating U snRNA export from the nucleus.
           They adopt a highly helical structure, wherein the
           polypeptide chain forms a right-handed solenoid. At the
           tertiary level, the domain is composed of a superhelical
           arrangement of successive antiparallel pairs of helices.
          Length = 189

 Score =  231 bits (592), Expect = 2e-71
 Identities = 80/188 (42%), Positives = 105/188 (55%), Gaps = 23/188 (12%)

Query: 310 PGAHSIERFLIEEHLLQIIEMNYFERKDCATQLL-----------------------KYN 346
           P A SI  FL  + L  II +  F RK+CA +LL                          
Sbjct: 1   PPASSIAAFLFRDILADIINLLEFNRKECARELLDLDCFFTRKTFAPRGTSVDKLLDPPE 60

Query: 347 MKIPLEYCIVEIIFGELFRLPTPKYLEIFYGSVLIELCKMKPDTMPQVLAQATVILFMRI 406
               LE  IVE I GELF+LPTP Y  I+Y S+LIELCK+ P  +  VL +A   L+ R+
Sbjct: 61  STWKLEDVIVETILGELFQLPTPPYKPIYYHSLLIELCKLAPAAIAPVLGRAIRFLYRRL 120

Query: 407 ESMNTACFDRFVAWFAYHLSNFQFQWSWEDWESALKLDLEHPKPKFIHEVLNKCVRLSYH 466
           +S++    DRFV WF++HLSNF F+W W +W   L+L   HP+  FI EV+ K +RLSY+
Sbjct: 121 DSLDLELIDRFVDWFSHHLSNFGFRWKWTEWVPDLELPDLHPRKVFIREVIEKEIRLSYY 180

Query: 467 QRIKEIVP 474
            RIKE +P
Sbjct: 181 DRIKETLP 188


>gnl|CDD|220107 pfam09090, MIF4G_like_2, MIF4G like.  Members of this family are
           involved in mediating U snRNA export from the nucleus.
           They adopt a highly helical structure, wherein the
           polypeptide chain forms a right-handed solenoid. At the
           tertiary level, the domain is composed of a superhelical
           arrangement of successive antiparallel pairs of helices.
          Length = 250

 Score =  195 bits (498), Expect = 3e-57
 Identities = 93/277 (33%), Positives = 149/277 (53%), Gaps = 32/277 (11%)

Query: 489 FKYNQEGGEHLPGFVYAQELINAIKKKQSSPNEIMAILNKIPSSSMDEDGADPINPLKID 548
           FKY  E    LP    A++L++AI+KK+ S  EI+ ++  IP      D     N   ID
Sbjct: 1   FKYENES---LPLHEEAKQLLDAIRKKKPS-EEILELIKSIPEI--ASDHGSDFNRFAID 54

Query: 549 VFTQVLLYLGSKSFSHSFAALSKFYKVLKEHIGHTEESQSYVLKSLFELWHDHQQMMAVL 608
           +F Q LL+LGS+SFSH+ +A+ ++ +VLK     +EE Q ++++++FE W +  QM  ++
Sbjct: 55  LFVQTLLHLGSRSFSHALSAIERYKEVLKTLSAESEEKQQWIIEAVFEYWKNQPQMGFLI 114

Query: 609 IDKMLKTQLLHCSSVANWIFSKEMQPE-FTKLYVWEILHLTIKKMSKYVNRVGKELLDAK 667
           +DK+L   ++   ++  WIFS+EM     T+  VWE L  T+ K++  + RV K+L   +
Sbjct: 115 LDKLLNYGIISPIAIITWIFSEEMGNLILTRSTVWESLFRTLSKLASRLRRVVKDLKVLE 174

Query: 668 ERLKHVSSESEEESDGEGGKKNEEKISEEVVEKIEEKLEAAQADQKNLFLIIFQRFIMIL 727
           E                     +    EE +E +EEKLE A  +QK+L   I +RFI IL
Sbjct: 175 E---------------------DNSSEEEDLEFVEEKLERAINEQKSLLNTILKRFIEIL 213

Query: 728 SEHL---VRCDTDGVDFNTHW-YKWTIGRLQQVLLAH 760
            +     +  D D  D +  W ++W IGRL+ +L  +
Sbjct: 214 KDEGEPELEGDEDLPDLDLKWLFRWAIGRLKSILRKY 250


>gnl|CDD|214713 smart00543, MIF4G, Middle domain of eukaryotic initiation factor 4G
           (eIF4G).  Also occurs in NMD2p and CBP80. The domain is
           rich in alpha-helices and may contain multiple
           alpha-helical repeats. In eIF4G, this domain binds
           eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G
           domain homologues (in press).
          Length = 200

 Score =  107 bits (268), Expect = 2e-26
 Identities = 55/204 (26%), Positives = 96/204 (47%), Gaps = 20/204 (9%)

Query: 45  STTSLESHLDGLAKVLETDLATYRVKILRILTDCATKYPEKCCIYTTLVGLLNAKNYNFG 104
           S ++ ES +  L K+  +D    R  IL ++ + A + P     Y  L  LLNAKN +FG
Sbjct: 12  SPSNFESIIKELLKLNNSDKNL-RKYILELIFEKAVEEPNFIPAYARLCALLNAKNPDFG 70

Query: 105 GEFVELMVKTFK---DALKNCQWNAARYALRFLADLVNCHVISTNSLLQLL-ENMLDASK 160
              +E + + F+   ++ +          +RFL +L N  V+++  +L+LL E + D +K
Sbjct: 71  SLLLERLQEEFEKGLESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLKELLNDLTK 130

Query: 161 EDGVPNVRKDWYVYAVLSCLPWVGRELY-EKKEQALALLLLNIELYLNKRSKKHVAALQV 219
            D     R D+ V  +LS LP  G++L  EK  + L  +L  ++ YL K+ K        
Sbjct: 131 LDP---PRSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKTE------ 181

Query: 220 WSSDKPHPQEEYLDCLWAQIVKLR 243
                   +  ++  L  ++ K +
Sbjct: 182 -----LSSRLRFMLELLIELRKNK 200


>gnl|CDD|217253 pfam02854, MIF4G, MIF4G domain.  MIF4G is named after Middle domain
           of eukaryotic initiation factor 4G (eIF4G). Also occurs
           in NMD2p and CBP80. The domain is rich in alpha-helices
           and may contain multiple alpha-helical repeats. In
           eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
          Length = 198

 Score = 80.1 bits (198), Expect = 4e-17
 Identities = 40/201 (19%), Positives = 86/201 (42%), Gaps = 20/201 (9%)

Query: 47  TSLESHLDGLAKVLETDLATYRVKILRILTDCATKYPEKCCIYTTLVGLLNAKNYNFGGE 106
           ++ E  +D L K+  +D       IL ++ + A + P     Y  L   LN+KN +FG  
Sbjct: 14  SNFEKIIDELLKLNMSDKNL-LKYILELIFEKAVEEPNFIPAYARLCAGLNSKNPDFGEL 72

Query: 107 FVELMVKTFKDALKNCQ---WNAARYALRFLADLVNCHVISTNSLLQLLENMLDASKEDG 163
            +  + + F+  ++  +          +RFL +L N  V++   +L+ L+ +L++  E  
Sbjct: 73  LLNRLQEEFEKGVEEEERGNRKRRLGLVRFLGELFNFKVLTEKIILECLKELLESLTE-- 130

Query: 164 VPNVRKDWYVYAVLSCLPWVGRELY-EKKEQALALLLLNIELYLNKRSKKHVAALQVWSS 222
             + R ++ +  +L  L   G+ L  EK ++ +  +L  I+ YL                
Sbjct: 131 -EDPRDEFSLEILLLLLTTCGKLLDREKLKKLMDEILERIQKYLLSADP----------- 178

Query: 223 DKPHPQEEYLDCLWAQIVKLR 243
            +   +  ++     ++ K +
Sbjct: 179 -ELSSRLRFMLEDLIELRKNK 198


>gnl|CDD|225502 COG2952, COG2952, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 183

 Score = 34.4 bits (79), Expect = 0.12
 Identities = 19/79 (24%), Positives = 39/79 (49%), Gaps = 8/79 (10%)

Query: 644 ILHLTIKKMSKYVNRVGKELLDAKERLKHVSSESEEESDGEGGKKNEEKISE---EVVEK 700
           I H+  K  + +++    EL   +E LK +     E+S      K E  + E   E++E+
Sbjct: 8   IEHIAHKIANDFIHSKLLELKAPRELLKELIIGILEKSV-----KKENALDEQAKELLEE 62

Query: 701 IEEKLEAAQADQKNLFLII 719
             +++E  + D++ LF ++
Sbjct: 63  NTDEIEFMRMDERQLFWMV 81


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 35.0 bits (80), Expect = 0.23
 Identities = 32/190 (16%), Positives = 61/190 (32%), Gaps = 20/190 (10%)

Query: 491 YNQEGGEHLPGFVYAQELINAIKKKQSSPNEIMAILNKIPSSSMDEDGADPINPLKIDVF 550
           Y   GGEHL GF  A   I  I     S  ++  +L  +P  S              + +
Sbjct: 346 YEFRGGEHLAGFYSAFGDIKRILLFTWSFKKLGTLLPSLPGYS-------SGGMDYGEEY 398

Query: 551 TQVLLYLGSKSFSHSFAALSKFYK--VLKEHIGH-TEESQSY--VLKSLFELWHDHQQMM 605
             +L  LGS+        + K+ +  +    +       +    V       +     M 
Sbjct: 399 RSLLWELGSEVGDPDSGPVRKWMRKDLFDAKVRSGVSFGKQEEFVSDIFNITFERIHGMD 458

Query: 606 AVLIDKMLKTQLLHCSSVANWIFSKEMQPEFTKLYVWEILHLTIKKMSKYVNRVGKELLD 665
              ++   +       ++A  +  +E    F KL     +  +     +  +R       
Sbjct: 459 PTTLEDDEEDT----PALAILLGQEEA-GRFVKLCKIMRM-FSFFNSLEMFSRRTLANRM 512

Query: 666 AKERLKHVSS 675
           A   +K++SS
Sbjct: 513 AI--VKYLSS 520


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 34.0 bits (78), Expect = 0.32
 Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 3/66 (4%)

Query: 648 TIKKMSKYVNRVGKELLDAKERLKHVSSESEEESDGEGGKKNEEKISEEVVEKIEEKLEA 707
            +KK S+++N     L   KE  K V  E EEE   E  KK EE+ + +  E+++E+ E 
Sbjct: 11  LVKKHSEFIN-YPIYLWVEKEVEKEVPDEEEEEEKEE--KKEEEEKTTDKEEEVDEEEEK 67

Query: 708 AQADQK 713
            +  +K
Sbjct: 68  EEKKKK 73


>gnl|CDD|215822 pfam00257, Dehydrin, Dehydrin. 
          Length = 137

 Score = 32.5 bits (74), Expect = 0.36
 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 5/38 (13%)

Query: 668 ERLKHVSSESEEESDGEGGKKNEEKISEEVVEKIEEKL 705
            R    SS S  E DG+GG++      + + EKI+EKL
Sbjct: 58  RRSGSSSSSSSSEDDGQGGERK-----KGIKEKIKEKL 90


>gnl|CDD|240025 cd04667, Nudix_Hydrolase_10, Members of the Nudix hydrolase
           superfamily catalyze the hydrolysis of NUcleoside
           DIphosphates linked to other moieties, X. Enzymes
           belonging to this superfamily require a divalent cation,
           such as Mg2+ or Mn2+, for their activity and contain a
           highly conserved 23-residue nudix motif
           (GX5EX7REUXEEXGU, where U = I, L or V), which functions
           as a metal binding and catalytic site. Substrates of
           nudix hydrolases include intact and oxidatively damaged
           nucleoside triphosphates, dinucleoside polyphosphates,
           nucleotide-sugars and dinucleotide enzymes. These
           substrates are metabolites or cell signaling molecules
           that require regulation during different stages of the
           cell cycle or during periods of stress. In general, the
           role of the nudix hydrolase is to sanitize the
           nucleotide pools and to maintain cell viability, thereby
           serving as surveillance & "house-cleaning" enzymes.
           Substrate specificity is used to define families within
           the superfamily. Differences in substrate specificity
           are determined by the N-terminal extension or by
           residues in variable loop regions. Mechanistically,
           substrate hydrolysis occurs by a nucleophilic
           substitution reaction, with variation in the numbers and
           roles of divalent cations required.
          Length = 112

 Score = 31.5 bits (72), Expect = 0.46
 Identities = 21/63 (33%), Positives = 27/63 (42%), Gaps = 3/63 (4%)

Query: 185 RELYEKKEQALALLLLNIELYLNKRSKKH-VAALQVWSSDKPHPQEEYLDCLWAQIVKLR 243
           REL E  E  L  L L    +++  S +H V    V  S +P P  E  DC W  +  L 
Sbjct: 42  RELQE--ETGLQGLDLLYLFHVDGGSTRHHVFVASVPPSAQPKPSNEIADCRWLSLDALG 99

Query: 244 QDN 246
             N
Sbjct: 100 DLN 102


>gnl|CDD|216833 pfam01991, vATP-synt_E, ATP synthase (E/31 kDa) subunit.  This
           family includes the vacuolar ATP synthase E subunit, as
           well as the archaebacterial ATP synthase E subunit.
          Length = 195

 Score = 31.9 bits (73), Expect = 0.78
 Identities = 15/47 (31%), Positives = 28/47 (59%), Gaps = 2/47 (4%)

Query: 665 DAKERLKHVSSESEEESDGEGG--KKNEEKISEEVVEKIEEKLEAAQ 709
           +A+E+ + + +E+EEE + E     +  EK  EE+ EK E++ E  +
Sbjct: 5   EAEEKAEEIRAEAEEEFEIEKAEAVEEAEKKIEEIYEKKEKQAEMEK 51


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 32.6 bits (75), Expect = 0.86
 Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 1/69 (1%)

Query: 649 IKKMSKYVNRVGKELLDAKERLKHVSSESEEESDGEGGKKNEEKISEEVVEKIEEKLEAA 708
           IKK+ +   +  +E    K++      + EEE + E  KK EEK  EE   + E++ E  
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471

Query: 709 QAD-QKNLF 716
           +   Q  LF
Sbjct: 472 KKKKQATLF 480


>gnl|CDD|132588 TIGR03549, TIGR03549, YcaO domain protein.  This family consists of
           remarkably well-conserved proteins from gamma and beta
           Proteobacteria, heavily skewed towards organisms of
           marine environments. Its gene neighborhood is not
           conserved. This family has an OsmC-like N-terminal
           domain. It shares a YcaO domain, frequently associated
           with ATP-dependent cyclodehydration for peptide
           modification. The function is unknown. Fifteen of the
           first sixteen members of this family are from
           selenouridine-positive genomes, but this correlation may
           not be meaningful.
          Length = 718

 Score = 30.6 bits (69), Expect = 4.0
 Identities = 14/45 (31%), Positives = 24/45 (53%), Gaps = 1/45 (2%)

Query: 124 WNAARYALRFLADLVNCHVISTNSLLQLLENMLDASKEDGVPNVR 168
           W+    AL +  D++N H +   +L  LLE  L+ S+ D   ++R
Sbjct: 546 WDNTNKALDYREDILNLHRLDDEALEDLLER-LEESQLDNYTDIR 589


>gnl|CDD|132850 cd07211, Pat_PNPLA8, Patatin-like phospholipase domain containing
           protein 8.  PNPLA8 is a Ca-independent myocardial
           phospholipase which maintains mitochondrial integrity.
           PNPLA8 is also known as iPLA2-gamma. In humans, it is
           predominantly expressed in heart tissue. iPLA2-gamma can
           catalyze both phospholipase A1 and A2 reactions (PLA1
           and PLA2 respectively). This family includes PNPLA8
           (iPLA2-gamma) from Homo sapiens and iPLA2-2 from Mus
           musculus.
          Length = 308

 Score = 30.3 bits (69), Expect = 4.1
 Identities = 14/42 (33%), Positives = 21/42 (50%), Gaps = 2/42 (4%)

Query: 545 LKIDVFTQVLLYLGSKS--FSHSFAALSKFYKVLKEHIGHTE 584
           L  DVF+Q     G+     SH++     + K+LKE +G  E
Sbjct: 75  LGKDVFSQNTYISGTSRLVLSHAYYDTETWEKILKEMMGSDE 116


>gnl|CDD|239407 cd03154, TM4SF3_like_LEL, Tetraspanin, extracellular domain or
           large extracellular loop (LEL), TM4SF3_like subfamily.
           Tetraspanins are trans-membrane proteins with 4
           trans-membrane segments. Both the N- and C-termini lie
           on the intracellular side of the membrane. This
           alignment model spans the extracellular domain between
           the 3rd and 4th trans-membrane segment. Tetraspanins are
           involved in diverse processes and their various
           functions may relate to their ability to act as
           molecular facilitators. Tetraspanins associate laterally
           with one another and cluster dynamically with numerous
           parnter domains in membrane microdomains, forming a
           network of multimolecular complexes, the "tetraspanin
           web". This subfamily contaions transmembrane 4
           superfamily 3 (TM4SF3) or D6.1a and related proteins.
           D6.1a associates with alpha6beta4 integrin and supports
           cell motility, it has been ascribed a role in tumor
           progression and metastasis.
          Length = 100

 Score = 28.5 bits (64), Expect = 4.7
 Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 2/42 (4%)

Query: 683 GEGGKKNEEKISEEVVEKIEEKLEAAQADQKNL--FLIIFQR 722
           G  G   + KI  E+ EK  + L     + K++   L  FQ+
Sbjct: 1   GIVGAVYKPKIENELKEKNTKLLSLLGQNAKSVKKSLEKFQK 42


>gnl|CDD|219733 pfam08163, NUC194, NUC194 domain.  This is domain B in the
           catalytic subunit of DNA-dependent protein kinases.
          Length = 394

 Score = 30.1 bits (68), Expect = 4.8
 Identities = 22/75 (29%), Positives = 32/75 (42%), Gaps = 14/75 (18%)

Query: 608 LIDKMLKTQLLHCSSVANWIF--------SKEMQPEFTKLYVWEILHLTIKKMSKY---- 655
           L+D+ L T L HCS      F           ++  FTK   WE     IKKM  Y    
Sbjct: 7   LLDRYLVTLLWHCSLDTLVQFFSRIISDIMDIIKSRFTKSNGWEFESQIIKKMGCYKLIE 66

Query: 656 --VNRVGKELLDAKE 668
              +R+ K++L + +
Sbjct: 67  VYYSRLPKDILYSDD 81


>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain.  This alpha
           helical domain is found in a set of bacterial plasmid
           replication proteins. The domain is found to the
           C-terminus of the primase/polymerase domain. Mutants of
           this domain are defective in template binding,
           dinucleotide formation and conformation change prior to
           DNA extension.
          Length = 135

 Score = 29.0 bits (65), Expect = 5.3
 Identities = 17/61 (27%), Positives = 29/61 (47%), Gaps = 2/61 (3%)

Query: 661 KELLDAKERLKHVSSESEEESDGEGGKKNEEKISEEVVEKIEEKLEAAQADQKNLFLIIF 720
           KE  D++E  + +  E  +    +G  K  E I EEV +KI++ LE  +  +K +     
Sbjct: 4   KEEEDSEEDFEKLKEEMAKYDRFKG--KTVEAIREEVCKKIKKSLEELKEKKKAILNTAK 61

Query: 721 Q 721
            
Sbjct: 62  G 62


>gnl|CDD|227132 COG4795, PulJ, Type II secretory pathway, component PulJ
           [Intracellular trafficking and secretion].
          Length = 194

 Score = 29.3 bits (66), Expect = 5.7
 Identities = 10/42 (23%), Positives = 17/42 (40%), Gaps = 5/42 (11%)

Query: 410 NTAC----FDRFVAWFAYHLSNFQFQWSW-EDWESALKLDLE 446
            T C    +D+ +      +S F  Q    + W  AL L ++
Sbjct: 131 ATDCLGLGWDKLLDPLEDIISAFLVQGLSGKGWSPALPLAVK 172


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 30.0 bits (68), Expect = 5.8
 Identities = 15/82 (18%), Positives = 33/82 (40%), Gaps = 8/82 (9%)

Query: 643 EILHLTIKKMSKYVNRVGKELLDAKERLKHVSSESEEESDGEGGKKNEEKISEEVVEKIE 702
           + L   I    + ++++ K+L + K   +     + +E   E   K EE++         
Sbjct: 159 DDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLAR------ 212

Query: 703 EKLEAAQADQKNLFLIIFQRFI 724
             LE+ +A  +    + F+R  
Sbjct: 213 --LESKEAALEKQLRLEFEREK 232


>gnl|CDD|182123 PRK09866, PRK09866, hypothetical protein; Provisional.
          Length = 741

 Score = 29.8 bits (67), Expect = 7.5
 Identities = 22/89 (24%), Positives = 36/89 (40%), Gaps = 13/89 (14%)

Query: 661 KELLDAKERLKHVSSESEE------ESDGEGGKKNEEKISEEVVEKIEEKLEAA------ 708
           +E LD +    +V+ E         E   +  + N+ ++S E+  +IE  L +A      
Sbjct: 434 REYLDFRAHGLNVACEQLRQNIHQVEESLQLLQLNQAQVSGEIKHEIELALTSANHFLRQ 493

Query: 709 -QADQKNLFLIIFQRFIMILSEHLVRCDT 736
            Q         +FQ     LSE   RC+T
Sbjct: 494 QQDAVNAQLAALFQDDSEPLSEIRTRCET 522


>gnl|CDD|144268 pfam00613, PI3Ka, Phosphoinositide 3-kinase family, accessory
           domain (PIK domain).  PIK domain is conserved in all PI3
           and PI4-kinases. Its role is unclear but it has been
           suggested to be involved in substrate presentation.
          Length = 185

 Score = 28.8 bits (65), Expect = 8.1
 Identities = 13/59 (22%), Positives = 23/59 (38%)

Query: 179 CLPWVGRELYEKKEQALALLLLNIELYLNKRSKKHVAALQVWSSDKPHPQEEYLDCLWA 237
            L W  R       +AL  LLL+++        + ++ +  W+   P    E LD  + 
Sbjct: 31  DLLWKFRYSLMSVPKALTKLLLSVKWSDLDEVAQALSLMLKWAPIDPVDALELLDPNFP 89


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 29.4 bits (66), Expect = 8.5
 Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 2/73 (2%)

Query: 643 EILHLTIKKMSKYVNRVGKELLDAKERLKHVSSESEEESDGEGGKKNEEKISEEVVEKIE 702
           E+L   + ++SK + +  +         K  ++++EE    E  KK EE + E   EK E
Sbjct: 117 ELLEKELDELSKELQKQLQNTA-EIIEKKRENNKNEERLKFENEKKLEESL-ELEREKFE 174

Query: 703 EKLEAAQADQKNL 715
           E+L  A  D +  
Sbjct: 175 EQLHEANLDLEFK 187


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 29.2 bits (66), Expect = 8.9
 Identities = 9/53 (16%), Positives = 27/53 (50%), Gaps = 5/53 (9%)

Query: 663 LLDAKERLKHVSSESEEESDGEGGKKNEEKISEEV-----VEKIEEKLEAAQA 710
           L + +E L  ++ E  + ++ E   + EE+ +E       ++K++ +++  + 
Sbjct: 1   LKELREALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEE 53


>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).  This
           family consists of several Plasmodium falciparum SPAM
           (secreted polymorphic antigen associated with
           merozoites) proteins. Variation among SPAM alleles is
           the result of deletions and amino acid substitutions in
           non-repetitive sequences within and flanking the alanine
           heptad-repeat domain. Heptad repeats in which the a and
           d position contain hydrophobic residues generate
           amphipathic alpha-helices which give rise to helical
           bundles or coiled-coil structures in proteins. SPAM is
           an example of a P. falciparum antigen in which a
           repetitive sequence has features characteristic of a
           well-defined structural element.
          Length = 164

 Score = 28.7 bits (64), Expect = 9.0
 Identities = 20/64 (31%), Positives = 30/64 (46%)

Query: 643 EILHLTIKKMSKYVNRVGKELLDAKERLKHVSSESEEESDGEGGKKNEEKISEEVVEKIE 702
           E L   +K  S     + KE  D K+  +    E EEE + E  +  + +  EE+VE  E
Sbjct: 20  ENLLEHVKITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEE 79

Query: 703 EKLE 706
           E+ E
Sbjct: 80  EEEE 83


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.3 bits (66), Expect = 9.6
 Identities = 19/72 (26%), Positives = 35/72 (48%), Gaps = 1/72 (1%)

Query: 643 EILHLTI-KKMSKYVNRVGKELLDAKERLKHVSSESEEESDGEGGKKNEEKISEEVVEKI 701
           E+  L   +K+S+ +  V KE     E L+   SE  E+ +       EE    + +E+I
Sbjct: 162 ELFGLEKYEKLSELLKEVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEI 221

Query: 702 EEKLEAAQADQK 713
           +E+ E  + +Q+
Sbjct: 222 QEEQEEEELEQE 233


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.136    0.415 

Gapped
Lambda     K      H
   0.267   0.0693    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,240,635
Number of extensions: 4121065
Number of successful extensions: 4690
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4655
Number of HSP's successfully gapped: 73
Length of query: 796
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 691
Effective length of database: 6,280,432
Effective search space: 4339778512
Effective search space used: 4339778512
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.1 bits)