BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 037252
         (138 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|296085075|emb|CBI28490.3| unnamed protein product [Vitis vinifera]
          Length = 566

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 84/138 (60%), Gaps = 7/138 (5%)

Query: 2   NGDVAVASAYNDD-EIGIHIENHGGSGR----SGDFTDVDFLEEEPCIKCNRQRDENLLV 56
           NG+ A   A N   E  +  +  G SG     +  + ++++ ++  CIKC    +  +LV
Sbjct: 86  NGEQAHQLAPNSSGETIVGTKQSGASGSWDMSTQAWMEIEWTQQSKCIKCGEGGE--VLV 143

Query: 57  CSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRKKAKETKKQLACFID 116
           CS   C ++VHE C++C   FDD+G+FYCPYCWY+C + ++ E RK+A  +KK L+ F+D
Sbjct: 144 CSDRVCRLAVHEKCMNCSAAFDDMGDFYCPYCWYRCAIAKSNEARKRAMSSKKALSTFLD 203

Query: 117 SKSFSGDKKKENCRTDKG 134
           +K+  G+++KE  ++  G
Sbjct: 204 TKALCGNQQKEKTKSSNG 221


>gi|449453762|ref|XP_004144625.1| PREDICTED: uncharacterized protein LOC101213119 [Cucumis sativus]
 gi|449520068|ref|XP_004167056.1| PREDICTED: uncharacterized LOC101213119 [Cucumis sativus]
          Length = 510

 Score = 92.4 bits (228), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 48/123 (39%), Positives = 73/123 (59%), Gaps = 7/123 (5%)

Query: 6   AVASAYNDDEIGIHIEN---HGGSGRSGD--FTDVDFLEEEPCIKCNRQRDENLLVCSQS 60
            +AS+   D+   ++E+   H  S +  D     +D  +++ C +C+   D  LLVC++ 
Sbjct: 24  PMASSPLPDQDVPNVEDNTLHDASNKETDDVLDKIDCFQKDTCTRCDESGD--LLVCTEP 81

Query: 61  GCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRKKAKETKKQLACFIDSKSF 120
           GCPI++HE C+SC   FD+ G FYCPYC YK  L+R  ELR+K    K+ L+ FID++  
Sbjct: 82  GCPIALHELCMSCEPSFDEDGRFYCPYCSYKRALIRVNELRRKTMVAKRALSDFIDTRMV 141

Query: 121 SGD 123
            GD
Sbjct: 142 GGD 144


>gi|357462399|ref|XP_003601481.1| hypothetical protein MTR_3g082160 [Medicago truncatula]
 gi|355490529|gb|AES71732.1| hypothetical protein MTR_3g082160 [Medicago truncatula]
          Length = 351

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 35  VDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCEL 94
           VD L++  CI CN+   E LLVCSQ+ CP+SVH  C+    KFDD GNF+CPYC YK  L
Sbjct: 65  VDSLDDNICITCNKLGGE-LLVCSQTDCPVSVHVTCIGSEPKFDDSGNFFCPYCAYKRAL 123

Query: 95  MRTKELRKKAKETKKQLACFID 116
            +T+ELR+KA   KK L+ F++
Sbjct: 124 KKTRELREKAVLAKKALSSFLE 145


>gi|297739691|emb|CBI29873.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 49/121 (40%), Positives = 68/121 (56%), Gaps = 19/121 (15%)

Query: 8   ASAYNDDEIGIHIE-----------NHGGSGRSGDFTDVDFLEEEPCIKCNRQRDENLLV 56
           ++ Y+D+E GI +E           NH     +G      + E+  C+KC +  D  LLV
Sbjct: 468 STEYHDEEDGIAMERQNFLSSKCTFNHDSLSIAG------WTEQNLCMKCTK--DGQLLV 519

Query: 57  CSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRKKAKETKKQLACFID 116
           CS SGCP+ VHENCL C   FD++GNFYCP+C Y   +    E +KK    KK+LA FI+
Sbjct: 520 CSSSGCPLVVHENCLGCPPSFDNMGNFYCPFCAYSRAVSEYLESKKKVSLAKKELASFIN 579

Query: 117 S 117
           +
Sbjct: 580 A 580


>gi|147846688|emb|CAN80644.1| hypothetical protein VITISV_016915 [Vitis vinifera]
          Length = 774

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 49/121 (40%), Positives = 68/121 (56%), Gaps = 19/121 (15%)

Query: 8   ASAYNDDEIGIHIE-----------NHGGSGRSGDFTDVDFLEEEPCIKCNRQRDENLLV 56
           ++ Y+D+E GI +E           NH     +G      + E+  C+KC +  D  LLV
Sbjct: 468 STEYHDEEDGIAMERQNFLSSKCTFNHDSLSIAG------WTEQNLCMKCTK--DGQLLV 519

Query: 57  CSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRKKAKETKKQLACFID 116
           CS SGCP+ VHENCL C   FD++GNFYCP+C Y   +    E +KK    KK+LA FI+
Sbjct: 520 CSSSGCPLVVHENCLGCPPSFDNMGNFYCPFCAYSRAVSEYLESKKKVSLAKKELASFIN 579

Query: 117 S 117
           +
Sbjct: 580 A 580


>gi|224065645|ref|XP_002301900.1| predicted protein [Populus trichocarpa]
 gi|222843626|gb|EEE81173.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/125 (38%), Positives = 72/125 (57%), Gaps = 15/125 (12%)

Query: 2   NGDVAVASAYNDDEIGIHIENHGGSGRSGDFTDVDFLEEEPCIKCNRQRDENLLVCSQSG 61
           + +++  S+ +DD++G     +GG          D++E + C+ CN++    LLVC   G
Sbjct: 42  DANLSEKSSRSDDDVG-----NGG----------DWMEVDACLSCNKRGKSKLLVCCVIG 86

Query: 62  CPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRKKAKETKKQLACFIDSKSFS 121
           CP+S+HE C +  + FDD G F CPYC YK E+ R KEL +KA   KK L  FID +   
Sbjct: 87  CPVSIHEKCANFKLAFDDSGRFCCPYCSYKREVGRAKELFRKAMLAKKALLGFIDPEMVG 146

Query: 122 GDKKK 126
           G+ K+
Sbjct: 147 GEAKR 151


>gi|356546362|ref|XP_003541595.1| PREDICTED: uncharacterized protein LOC100787956 [Glycine max]
          Length = 466

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 51/78 (65%), Gaps = 4/78 (5%)

Query: 43  CIKCNRQ----RDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           CI CN +    +++ LL+CS  GCP++VH  CL+ G KFD  GNF CPYCWYK  +   +
Sbjct: 37  CIHCNNKGEEAKEDGLLICSGRGCPVAVHATCLATGPKFDGSGNFCCPYCWYKRAVDTCR 96

Query: 99  ELRKKAKETKKQLACFID 116
            LR+KA E K  L+ F+D
Sbjct: 97  RLREKALEAKGDLSRFLD 114


>gi|317106598|dbj|BAJ53106.1| JHL20J20.13 [Jatropha curcas]
          Length = 531

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 35/75 (46%), Positives = 50/75 (66%), Gaps = 3/75 (4%)

Query: 24  GGSGRSG-DFTDVDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGN 82
           GG    G D +D D+LEE+ C+ CN      LL+CS+ GCPI++H+ C+    ++D+ GN
Sbjct: 58  GGESNVGEDSSDNDWLEEKSCLMCNM--GGQLLLCSEIGCPIALHKECIVSKPRYDEEGN 115

Query: 83  FYCPYCWYKCELMRT 97
           FYCPYCW+K +L  T
Sbjct: 116 FYCPYCWFKLQLSIT 130


>gi|255571552|ref|XP_002526722.1| conserved hypothetical protein [Ricinus communis]
 gi|223533911|gb|EEF35636.1| conserved hypothetical protein [Ricinus communis]
          Length = 814

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 34/73 (46%), Positives = 47/73 (64%), Gaps = 2/73 (2%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
           C++C++  D  LLVC+  GCP  VHE CL C  KFD+ GNFYCP+C Y   + +  E +K
Sbjct: 469 CVQCSK--DGQLLVCNAVGCPFVVHEKCLGCSPKFDEKGNFYCPFCAYSYAISKYLEAKK 526

Query: 103 KAKETKKQLACFI 115
            A   +K+L+ FI
Sbjct: 527 TATLARKELSAFI 539


>gi|255539006|ref|XP_002510568.1| hypothetical protein RCOM_1598630 [Ricinus communis]
 gi|223551269|gb|EEF52755.1| hypothetical protein RCOM_1598630 [Ricinus communis]
          Length = 422

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 40/94 (42%), Positives = 55/94 (58%), Gaps = 4/94 (4%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
           C+KCN+     LL+C  +GC I +H  C+    K+D+ GNF+CPYCWYK +  R +E +K
Sbjct: 24  CLKCNKGG--KLLICCGAGCAICLHVECIPRKPKYDEEGNFHCPYCWYKLQQARAQEWKK 81

Query: 103 KAKETKKQLACFIDSKSFS--GDKKKENCRTDKG 134
            A   KK L+ F+DS+      DK K N R   G
Sbjct: 82  MALLAKKALSDFMDSRQVEVGNDKAKLNDRRING 115


>gi|356555030|ref|XP_003545842.1| PREDICTED: uncharacterized protein LOC100806195 [Glycine max]
          Length = 580

 Score = 75.1 bits (183), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 35/83 (42%), Positives = 51/83 (61%), Gaps = 2/83 (2%)

Query: 35  VDFLEEEPCIKCNRQRDE--NLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKC 92
           VD  ++  CI C+ + +E   +L+C   GCP++VH  CL    +FDD GNF CPYCWYK 
Sbjct: 68  VDIFDKTICIHCDNKGEEAEGVLICGGRGCPVAVHATCLGFEPEFDDSGNFCCPYCWYKR 127

Query: 93  ELMRTKELRKKAKETKKQLACFI 115
            +   + LR+KA + K +L+ F 
Sbjct: 128 AVDTCRRLREKAMKAKGELSRFF 150


>gi|224139856|ref|XP_002323310.1| predicted protein [Populus trichocarpa]
 gi|222867940|gb|EEF05071.1| predicted protein [Populus trichocarpa]
          Length = 827

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 32/87 (36%), Positives = 50/87 (57%), Gaps = 2/87 (2%)

Query: 32  FTDVDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYK 91
             + ++ ++  CIKC +  D  LLVC    C + +HENCL     FD+ G+FYCP+C Y 
Sbjct: 480 LAEANWTDQNLCIKCCK--DGQLLVCGAGSCSLVIHENCLVFSPHFDERGDFYCPFCAYS 537

Query: 92  CELMRTKELRKKAKETKKQLACFIDSK 118
             +    E +KKA   +K+L  FI+++
Sbjct: 538 LAISEYLEAKKKAYSARKELKLFIETR 564


>gi|297728353|ref|NP_001176540.1| Os11g0482500 [Oryza sativa Japonica Group]
 gi|255680092|dbj|BAH95268.1| Os11g0482500, partial [Oryza sativa Japonica Group]
          Length = 748

 Score = 65.9 bits (159), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 4/97 (4%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E+E CIKC +  D  LL CS  GC ++VH+ C    V FDD G FYCP C+Y       +
Sbjct: 589 EQELCIKCGK--DGQLLKCS--GCFLAVHDTCFGSSVTFDDSGQFYCPVCFYTKATEAYQ 644

Query: 99  ELRKKAKETKKQLACFIDSKSFSGDKKKENCRTDKGN 135
           + +K   E +K L+ F+  K  +   ++   R    N
Sbjct: 645 KAKKTYSEARKNLSAFLGRKQLAEQYQQAAVRQRAAN 681


>gi|125577146|gb|EAZ18368.1| hypothetical protein OsJ_33899 [Oryza sativa Japonica Group]
          Length = 887

 Score = 65.5 bits (158), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 4/97 (4%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E+E CIKC +  D  LL CS  GC ++VH+ C    V FDD G FYCP C+Y       +
Sbjct: 589 EQELCIKCGK--DGQLLKCS--GCFLAVHDTCFGSSVTFDDSGQFYCPVCFYTKATEAYQ 644

Query: 99  ELRKKAKETKKQLACFIDSKSFSGDKKKENCRTDKGN 135
           + +K   E +K L+ F+  K  +   ++   R    N
Sbjct: 645 KAKKTYSEARKNLSAFLGRKQLAEQYQQAAVRQRAAN 681


>gi|108864393|gb|ABA93678.2| expressed protein [Oryza sativa Japonica Group]
          Length = 756

 Score = 65.5 bits (158), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 4/97 (4%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E+E CIKC +  D  LL CS  GC ++VH+ C    V FDD G FYCP C+Y       +
Sbjct: 589 EQELCIKCGK--DGQLLKCS--GCFLAVHDTCFGSSVTFDDSGQFYCPVCFYTKATEAYQ 644

Query: 99  ELRKKAKETKKQLACFIDSKSFSGDKKKENCRTDKGN 135
           + +K   E +K L+ F+  K  +   ++   R    N
Sbjct: 645 KAKKTYSEARKNLSAFLGRKQLAEQYQQAAVRQRAAN 681


>gi|15223319|ref|NP_171623.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain
           [Arabidopsis thaliana]
 gi|332189125|gb|AEE27246.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain
           [Arabidopsis thaliana]
          Length = 345

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 5/112 (4%)

Query: 27  GRSGDFTDVDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCP 86
           G S  F   + L E  CI C+   D+ ++ CS + CP++VH  C+   +  +D   FYCP
Sbjct: 71  GTSDTFRSFEPLNEHACIVCDIA-DDGVVPCSGNECPLAVHRKCVE--LDCEDPATFYCP 127

Query: 87  YCWYKCELMRTKELRKKAKETKKQLACFIDSKSFSGD--KKKENCRTDKGNE 136
           YCW+K +  R+  LR +     K L  +  S+  SGD    +EN + + G++
Sbjct: 128 YCWFKEQATRSTALRTRGVAAAKTLVQYGCSELRSGDIVMTRENSQLENGSD 179


>gi|6715640|gb|AAF26467.1|AC007323_8 T25K16.14 [Arabidopsis thaliana]
          Length = 321

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 5/112 (4%)

Query: 27  GRSGDFTDVDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCP 86
           G S  F   + L E  CI C+   D+ ++ CS + CP++VH  C+   +  +D   FYCP
Sbjct: 49  GTSDTFRSFEPLNEHACIVCDIA-DDGVVPCSGNECPLAVHRKCVE--LDCEDPATFYCP 105

Query: 87  YCWYKCELMRTKELRKKAKETKKQLACFIDSKSFSGD--KKKENCRTDKGNE 136
           YCW+K +  R+  LR +     K L  +  S+  SGD    +EN + + G++
Sbjct: 106 YCWFKEQATRSTALRTRGVAAAKTLVQYGCSELRSGDIVMTRENSQLENGSD 157


>gi|15223886|ref|NP_172930.1| RING/FYVE/PHD zinc finger protein [Arabidopsis thaliana]
 gi|42571477|ref|NP_973829.1| RING/FYVE/PHD zinc finger protein [Arabidopsis thaliana]
 gi|8778233|gb|AAF79242.1|AC006917_27 F10B6.17 [Arabidopsis thaliana]
 gi|26452752|dbj|BAC43457.1| putative PhD-finger protein [Arabidopsis thaliana]
 gi|30793923|gb|AAP40414.1| putative PhD-finger protein [Arabidopsis thaliana]
 gi|332191102|gb|AEE29223.1| RING/FYVE/PHD zinc finger protein [Arabidopsis thaliana]
 gi|332191103|gb|AEE29224.1| RING/FYVE/PHD zinc finger protein [Arabidopsis thaliana]
          Length = 429

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 3/76 (3%)

Query: 40  EEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKE 99
           E  C KC  +++  LL+CS+S C   VH+ CL+C V  D+ GNF CP CWY    M   E
Sbjct: 345 ENVCWKC--EKEGTLLICSRSECAAKVHKECLNCPVNVDEYGNFLCPLCWYDRLAMEYHE 402

Query: 100 LRKKAKETKKQLA-CF 114
            +K     K++L  CF
Sbjct: 403 SQKLISCAKRRLVKCF 418


>gi|21593665|gb|AAM65632.1| PhD-finger protein, putative [Arabidopsis thaliana]
          Length = 429

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 3/76 (3%)

Query: 40  EEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKE 99
           E  C KC  +++  LL+CS+S C   VH+ CL+C V  D+ GNF CP CWY    M   E
Sbjct: 345 ENVCWKC--EKEGTLLICSRSECAAKVHKECLNCPVNVDEYGNFLCPLCWYDRLAMEYHE 402

Query: 100 LRKKAKETKKQLA-CF 114
            +K     K++L  CF
Sbjct: 403 SQKLISCAKRRLVKCF 418


>gi|297849896|ref|XP_002892829.1| hypothetical protein ARALYDRAFT_471666 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338671|gb|EFH69088.1| hypothetical protein ARALYDRAFT_471666 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 412

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 43/76 (56%), Gaps = 3/76 (3%)

Query: 40  EEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKE 99
           E  C KC  +++  LL+CS+S C   VH+ CL+C V  D+ GNF CP CWY    M   E
Sbjct: 328 ENVCWKC--EKEGTLLICSKSECAAKVHKECLNCPVNVDEGGNFLCPLCWYDRVAMEYNE 385

Query: 100 LRKKAKETKKQLA-CF 114
            ++     K++L  CF
Sbjct: 386 SKRLIGGAKRRLVKCF 401


>gi|357142398|ref|XP_003572559.1| PREDICTED: uncharacterized protein LOC100830030 [Brachypodium
           distachyon]
          Length = 953

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 45/85 (52%), Gaps = 4/85 (4%)

Query: 37  FLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMR 96
             E++ C+KC +  D  LL CS   C ++ H++C    V FDD G FYC  C+Y      
Sbjct: 587 LTEQDLCVKCGQ--DGQLLRCS--SCLLAAHDSCFGSSVTFDDSGQFYCAVCFYTKATEA 642

Query: 97  TKELRKKAKETKKQLACFIDSKSFS 121
            ++ +K   E +K L+ F+  K F+
Sbjct: 643 YQKAKKIYAEARKNLSAFLGPKQFA 667


>gi|326526823|dbj|BAK00800.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 928

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 12/88 (13%)

Query: 37  FLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMR 96
           F E++ CIKC +  D  LL CS   C +  H+ C    V  D  G FYCP C+Y     +
Sbjct: 511 FTEQDLCIKCGK--DGQLLKCS--SCSLIAHDRCFGSSVTLDVSGQFYCPVCFY----TK 562

Query: 97  TKELRKKAK----ETKKQLACFIDSKSF 120
             E  +KAK    E +K L+ F+ +K F
Sbjct: 563 ATEAYQKAKITYLEARKNLSAFLGTKQF 590


>gi|297842998|ref|XP_002889380.1| hypothetical protein ARALYDRAFT_887342 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335222|gb|EFH65639.1| hypothetical protein ARALYDRAFT_887342 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 7/99 (7%)

Query: 27  GRSGDFTDVDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENC--LSCGVKFDDVGNFY 84
           G S  +   + L E+ C+ C+   D+++  CS + CPISVH  C  L C    +D    Y
Sbjct: 77  GTSDTWRSFEPLNEDACVVCDIT-DDSVAPCSGNDCPISVHRECAELDC----EDSATSY 131

Query: 85  CPYCWYKCELMRTKELRKKAKETKKQLACFIDSKSFSGD 123
           CPYCW+K +  R+  LR       K L  +  +K  SGD
Sbjct: 132 CPYCWFKDKATRSTTLRTVGVAAAKTLDQYRCTKLKSGD 170


>gi|326488397|dbj|BAJ93867.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 869

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 29/96 (30%), Positives = 51/96 (53%), Gaps = 5/96 (5%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E++ C+KC++  D  LL CS   C ++ H +C    V FDD G+F CP C+ +      +
Sbjct: 497 EQDSCVKCSK--DGQLLKCSS--CFLAAHASCFGSSVTFDDSGHFDCPVCYCRKAAEALE 552

Query: 99  ELRKKAKETKKQLACFIDS-KSFSGDKKKENCRTDK 133
           + +K   E +K L+ F    K F+ +  +++  T +
Sbjct: 553 KAKKTYSEARKNLSVFCSGWKQFTKEFSEQSIGTQQ 588


>gi|297838557|ref|XP_002887160.1| hypothetical protein ARALYDRAFT_339050 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297333001|gb|EFH63419.1| hypothetical protein ARALYDRAFT_339050 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 302

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
           C+KC  +    LL CS  GC + VH+ CL     +DD GNFYC  C   C      + + 
Sbjct: 217 CVKC--KESGKLLFCSSDGCQVLVHQKCLDSPPAYDDAGNFYCSLCAITCASAEYVQSQD 274

Query: 103 KAKETKKQLACFIDSKSFSGDKK 125
           +  + KK+L  F+   S    KK
Sbjct: 275 EVAKAKKKLVSFLRLMSEVNKKK 297


>gi|115484315|ref|NP_001065819.1| Os11g0160700 [Oryza sativa Japonica Group]
 gi|77548797|gb|ABA91594.1| expressed protein [Oryza sativa Japonica Group]
 gi|113644523|dbj|BAF27664.1| Os11g0160700 [Oryza sativa Japonica Group]
 gi|222615565|gb|EEE51697.1| hypothetical protein OsJ_33066 [Oryza sativa Japonica Group]
          Length = 951

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 4/83 (4%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E++ CIKC +  D  LL CS   C ++ H+ C       DD G  YCP C+Y       +
Sbjct: 578 EQDLCIKCGK--DGQLLKCSS--CLLAAHDTCFGSLATLDDSGQLYCPVCFYTKATEAYQ 633

Query: 99  ELRKKAKETKKQLACFIDSKSFS 121
           + +K   E +K L+ F+  K  +
Sbjct: 634 KAKKTYSEARKNLSAFLGRKQLA 656


>gi|218185299|gb|EEC67726.1| hypothetical protein OsI_35213 [Oryza sativa Indica Group]
          Length = 951

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 4/83 (4%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E++ CIKC +  D  LL CS   C ++ H+ C       DD G  YCP C+Y       +
Sbjct: 578 EQDLCIKCGK--DGQLLKCSS--CLLAAHDTCFGSLATLDDSGQLYCPVCFYTKATEAYQ 633

Query: 99  ELRKKAKETKKQLACFIDSKSFS 121
           + +K   E +K L+ F+  K  +
Sbjct: 634 KAKKTYSEARKNLSAFLGRKQLA 656


>gi|62733346|gb|AAX95463.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 932

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 4/83 (4%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E++ CIKC +  D  LL CS   C ++ H+ C       DD G  YCP C+Y       +
Sbjct: 578 EQDLCIKCGK--DGQLLKCSS--CLLAAHDTCFGSLATLDDSGQLYCPVCFYTKATEAYQ 633

Query: 99  ELRKKAKETKKQLACFIDSKSFS 121
           + +K   E +K L+ F+  K  +
Sbjct: 634 KAKKTYSEARKNLSAFLGRKQLA 656


>gi|242060102|ref|XP_002451340.1| hypothetical protein SORBIDRAFT_04g000347 [Sorghum bicolor]
 gi|241931171|gb|EES04316.1| hypothetical protein SORBIDRAFT_04g000347 [Sorghum bicolor]
          Length = 392

 Score = 52.0 bits (123), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 36  DFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELM 95
           D  +++ CIKC +  D  LL CS  GC ++ H++C    V F+    FYCP C YK    
Sbjct: 80  DLSQQDLCIKCGK--DGQLLECS--GCSLAAHDSCFGSSVTFEGTDLFYCPRCCYKKATE 135

Query: 96  RTKELRKKAKETKKQLACFIDSKSFS 121
             ++ +K   E +++LA F  ++  S
Sbjct: 136 AYEKAKKTYYEAREKLAAFHGTEHVS 161


>gi|297810459|ref|XP_002873113.1| hypothetical protein ARALYDRAFT_487156 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318950|gb|EFH49372.1| hypothetical protein ARALYDRAFT_487156 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 427

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 48/92 (52%), Gaps = 6/92 (6%)

Query: 37  FLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGV---KFDDVGNFYCPYCWYKCE 93
            L+ + CI C    DE +  C    C +S H  CL   +     +DV N +CPYCW K  
Sbjct: 73  LLDLDACIVCEVS-DERVSRCCGVDCLLSFHGECLYAELGSSSSEDVANPFCPYCWLKIV 131

Query: 94  LMRTKELRKKAKETKKQLACFIDS--KSFSGD 123
            +++K LR+K  E +K +  ++D   KS +GD
Sbjct: 132 ALKSKTLREKTVEAEKAVFKYLDKEMKSRNGD 163


>gi|413935053|gb|AFW69604.1| putative homeodomain-like transcription factor superfamily protein
           [Zea mays]
          Length = 938

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/79 (32%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           +++ CIKC +     LL CS   C ++ H +C    V F++   FYCP C+YK      K
Sbjct: 589 QQDLCIKCGKV--GQLLECSS--CSLAAHNSCFGSSVTFEETNLFYCPVCFYKRATEAYK 644

Query: 99  ELRKKAKETKKQLACFIDS 117
           + +K   E +K +A F+ +
Sbjct: 645 KAKKTYGEARKNVAAFLGT 663


>gi|413935052|gb|AFW69603.1| putative homeodomain-like transcription factor superfamily protein
           [Zea mays]
          Length = 942

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/79 (32%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           +++ CIKC +     LL CS   C ++ H +C    V F++   FYCP C+YK      K
Sbjct: 589 QQDLCIKCGKV--GQLLECSS--CSLAAHNSCFGSSVTFEETNLFYCPVCFYKRATEAYK 644

Query: 99  ELRKKAKETKKQLACFIDS 117
           + +K   E +K +A F+ +
Sbjct: 645 KAKKTYGEARKNVAAFLGT 663


>gi|242067090|ref|XP_002454834.1| hypothetical protein SORBIDRAFT_04g038230 [Sorghum bicolor]
 gi|241934665|gb|EES07810.1| hypothetical protein SORBIDRAFT_04g038230 [Sorghum bicolor]
          Length = 926

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 28/90 (31%), Positives = 44/90 (48%), Gaps = 4/90 (4%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E+  CIKC +     LL CS   C +S HE+C +  + F+D    YCP C         K
Sbjct: 603 EQNLCIKCGK--GGQLLQCSS--CLLSAHESCFASSLTFEDPRQLYCPVCICAKATEEYK 658

Query: 99  ELRKKAKETKKQLACFIDSKSFSGDKKKEN 128
           + +K   E +K LA F+ ++      +++N
Sbjct: 659 KAKKTYIEARKSLAAFVGAEQLLKQHEQQN 688


>gi|357445323|ref|XP_003592939.1| PHD finger protein 21A [Medicago truncatula]
 gi|355481987|gb|AES63190.1| PHD finger protein 21A [Medicago truncatula]
          Length = 473

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 4/69 (5%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
           C KCN      LLVC+   CP+ +H+NCL    + +  GNF CP+C Y   +    E +K
Sbjct: 383 CSKCNEAG--QLLVCTT--CPLMLHKNCLGDSAQLEAKGNFLCPFCKYSHAISEYLEAKK 438

Query: 103 KAKETKKQL 111
            +   +K+L
Sbjct: 439 ISSSARKEL 447


>gi|225898058|dbj|BAH30361.1| hypothetical protein [Arabidopsis thaliana]
          Length = 278

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 2/83 (2%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
           C+ C  +    LL CS  GC + VH+ CL     + D G+FYC  C   C      + + 
Sbjct: 193 CVNC--KESGKLLFCSSDGCEVMVHQKCLDSPPSYHDAGDFYCSLCALTCPSAEYVQFQD 250

Query: 103 KAKETKKQLACFIDSKSFSGDKK 125
           +  + K++L  F+   S    KK
Sbjct: 251 EVAKAKRKLVSFLRLMSEVNKKK 273


>gi|15220630|ref|NP_176970.1| RING/FYVE/PHD zinc finger protein [Arabidopsis thaliana]
 gi|12324072|gb|AAG52000.1|AC012563_10 hypothetical protein; 42846-44012 [Arabidopsis thaliana]
 gi|332196618|gb|AEE34739.1| RING/FYVE/PHD zinc finger protein [Arabidopsis thaliana]
          Length = 314

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 2/83 (2%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
           C+ C  +    LL CS  GC + VH+ CL     + D G+FYC  C   C      + + 
Sbjct: 229 CVNC--KESGKLLFCSSDGCEVMVHQKCLDSPPSYHDAGDFYCSLCALTCPSAEYVQFQD 286

Query: 103 KAKETKKQLACFIDSKSFSGDKK 125
           +  + K++L  F+   S    KK
Sbjct: 287 EVAKAKRKLVSFLRLMSEVNKKK 309


>gi|7340663|emb|CAB82943.1| myb-like protein [Arabidopsis thaliana]
 gi|9758006|dbj|BAB08603.1| Myb-like protein [Arabidopsis thaliana]
 gi|41619056|gb|AAS10013.1| MYB transcription factor [Arabidopsis thaliana]
          Length = 412

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 47/97 (48%), Gaps = 10/97 (10%)

Query: 37  FLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGV---------KFDDVGNFYCPY 87
            L+ + CI C    DE +  C    C +S H  CL   +           +DV N +CPY
Sbjct: 59  LLDLDACIVCEVS-DERVSRCCGVDCLLSFHGECLYADLGSTSSSSSSSSEDVSNPFCPY 117

Query: 88  CWYKCELMRTKELRKKAKETKKQLACFIDSKSFSGDK 124
           CW K   +++K LR+K  E +K +  ++D +  S D+
Sbjct: 118 CWLKIVALKSKTLREKTVEAEKAVCKYLDKEMKSRDE 154


>gi|30680203|ref|NP_195998.2| protein TRF-like 10 [Arabidopsis thaliana]
 gi|225898885|dbj|BAH30573.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003270|gb|AED90653.1| protein TRF-like 10 [Arabidopsis thaliana]
          Length = 420

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 10/96 (10%)

Query: 37  FLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGV---------KFDDVGNFYCPY 87
            L+ + CI C    DE +  C    C +S H  CL   +           +DV N +CPY
Sbjct: 67  LLDLDACIVCEVS-DERVSRCCGVDCLLSFHGECLYADLGSTSSSSSSSSEDVSNPFCPY 125

Query: 88  CWYKCELMRTKELRKKAKETKKQLACFIDSKSFSGD 123
           CW K   +++K LR+K  E +K +  ++D +  S D
Sbjct: 126 CWLKIVALKSKTLREKTVEAEKAVCKYLDKEMKSRD 161


>gi|26450818|dbj|BAC42517.1| myb like protein [Arabidopsis thaliana]
          Length = 420

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 10/96 (10%)

Query: 37  FLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGV---------KFDDVGNFYCPY 87
            L+ + CI C    DE +  C    C +S H  CL   +           +DV N +CPY
Sbjct: 67  LLDLDACIVCEVS-DERVSRCCGVDCLLSFHGECLYADLGSTSSSSSSSSEDVSNPFCPY 125

Query: 88  CWYKCELMRTKELRKKAKETKKQLACFIDSKSFSGD 123
           CW K   +++K LR+K  E +K +  ++D +  S D
Sbjct: 126 CWLKIVALKSKTLREKTVEAEKAVCKYLDKEMKSRD 161


>gi|242061730|ref|XP_002452154.1| hypothetical protein SORBIDRAFT_04g020760 [Sorghum bicolor]
 gi|241931985|gb|EES05130.1| hypothetical protein SORBIDRAFT_04g020760 [Sorghum bicolor]
          Length = 961

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 42/82 (51%), Gaps = 4/82 (4%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E++ CIKC +    +LL CS   C +S H++C    + F+D    +CP C Y       K
Sbjct: 602 EQDLCIKCGK--GGHLLQCS--SCLLSAHDSCFGSSLTFEDPRQLHCPVCIYAKATEEYK 657

Query: 99  ELRKKAKETKKQLACFIDSKSF 120
           + +K   E +K LA F+ ++  
Sbjct: 658 KTKKTYIEARKSLAAFLGAEQL 679


>gi|357119201|ref|XP_003561334.1| PREDICTED: uncharacterized protein LOC100826621 [Brachypodium
           distachyon]
          Length = 954

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%), Gaps = 12/87 (13%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTK 98
           E+E CIKC +  D  LL CS   C ++ H++C    + F+D   F CP C+Y    ++  
Sbjct: 582 EQESCIKCGK--DGQLLKCST--CLLAAHDSCFGPSMPFEDSTQFCCPVCFY----IKAT 633

Query: 99  ELRKKAK----ETKKQLACFIDSKSFS 121
           E  KKAK    E +  L  F+ ++  +
Sbjct: 634 EAYKKAKNTYCEARMNLTAFLGTEQLA 660


>gi|413932381|gb|AFW66932.1| hypothetical protein ZEAMMB73_241747 [Zea mays]
          Length = 867

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 4/92 (4%)

Query: 36  DFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELM 95
           D  ++  CIKC       LL C+  GC ++ H +C      F+    FYCP C YK    
Sbjct: 568 DLSQQGLCIKCGE--GGQLLECN--GCFLAAHSSCFGSSATFEGTNFFYCPVCLYKKATE 623

Query: 96  RTKELRKKAKETKKQLACFIDSKSFSGDKKKE 127
             K+ +K   ET+K L  F+ +   +    K+
Sbjct: 624 AYKKAKKTYCETRKNLVAFLGTAQATNQPDKQ 655


>gi|413932382|gb|AFW66933.1| hypothetical protein ZEAMMB73_241747 [Zea mays]
 gi|413932383|gb|AFW66934.1| hypothetical protein ZEAMMB73_241747 [Zea mays]
          Length = 748

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 4/92 (4%)

Query: 36  DFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELM 95
           D  ++  CIKC       LL C+  GC ++ H +C      F+    FYCP C YK    
Sbjct: 421 DLSQQGLCIKCGE--GGQLLECN--GCFLAAHSSCFGSSATFEGTNFFYCPVCLYKKATE 476

Query: 96  RTKELRKKAKETKKQLACFIDSKSFSGDKKKE 127
             K+ +K   ET+K L  F+ +   +    K+
Sbjct: 477 AYKKAKKTYCETRKNLVAFLGTAQATNQPDKQ 508


>gi|125534385|gb|EAY80933.1| hypothetical protein OsI_36111 [Oryza sativa Indica Group]
          Length = 940

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 2/74 (2%)

Query: 64  ISVHENCLSCG--VKFDDVGNFYCPYCWYKCELMRTKELRKKAKETKKQLACFIDSKSFS 121
           +S  E C+ CG  V FDD G FYCP C+Y       ++ +K   E +K L+ F+  K  +
Sbjct: 587 LSEQELCIKCGKDVTFDDSGQFYCPVCFYTKATEAYQKAKKTYSEARKNLSAFLGRKQLA 646

Query: 122 GDKKKENCRTDKGN 135
              ++   R    N
Sbjct: 647 EQYQQAAVRQRAAN 660


>gi|413932342|gb|AFW66893.1| putative homeodomain-like transcription factor superfamily protein
           [Zea mays]
          Length = 800

 Score = 45.4 bits (106), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 49/105 (46%), Gaps = 17/105 (16%)

Query: 15  EIGIHIENHGGSGRSGDFTDVDFLE----EEPCIKCNRQRDENLLVCSQSGCPISVHENC 70
           EI +H E         D   +D LE    ++ C+KC +     LL CS+  C ++VH +C
Sbjct: 423 EIFVHFE---------DQYIIDTLEGLSQQDLCLKCGK--GGQLLECSR--CFLAVHSSC 469

Query: 71  LSCGVKFDDVGNFYCPYCWYKCELMRTKELRKKAKETKKQLACFI 115
           L     F+    FYCP C YK  +   K+  K   E +K LA  +
Sbjct: 470 LGSSATFEGTNLFYCPICSYKKAIEACKKANKTYCEARKNLAALL 514


>gi|449437938|ref|XP_004136747.1| PREDICTED: uncharacterized protein LOC101206592 [Cucumis sativus]
          Length = 562

 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWY 90
           C+KCN      LL C+   CP+ VH  CL      +D G+F CP+C +
Sbjct: 517 CVKCNEGG--QLLSCNIGDCPLVVHAKCLGSLAAMNDEGDFRCPFCLF 562


>gi|224135941|ref|XP_002322199.1| predicted protein [Populus trichocarpa]
 gi|222869195|gb|EEF06326.1| predicted protein [Populus trichocarpa]
          Length = 105

 Score = 40.4 bits (93), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%)

Query: 70  CLSCGVKFDDVGNFYCPYCWYKCELMRTKELRKKAKETKKQLACFIDS 117
           C    + +DD+G FY  Y  YK ++   K LR+KA   KK L  FIDS
Sbjct: 35  CTWFKLAYDDLGRFYYTYYSYKQQVDMAKGLREKAMFAKKALLNFIDS 82


>gi|301118602|ref|XP_002907029.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108378|gb|EEY66430.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 863

 Score = 40.4 bits (93), Expect = 0.26,   Method: Composition-based stats.
 Identities = 26/66 (39%), Positives = 35/66 (53%), Gaps = 2/66 (3%)

Query: 42  PCIKCNRQRDEN--LLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKE 99
           PC +CNR  D    L++C    C    H NCLS  +     G ++CP C  + E  R KE
Sbjct: 148 PCERCNRVDDHYSLLILCDGKNCKREYHMNCLSPPLVSVPPGEWFCPDCQKEREKEREKE 207

Query: 100 LRKKAK 105
           L+K+AK
Sbjct: 208 LQKRAK 213


>gi|391334814|ref|XP_003741795.1| PREDICTED: uncharacterized protein LOC100898311 [Metaseiulus
           occidentalis]
          Length = 1347

 Score = 40.0 bits (92), Expect = 0.31,   Method: Composition-based stats.
 Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 12/88 (13%)

Query: 38  LEEEPCIKCNRQ-RDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKC--EL 94
           +++EPC KCN+    E +L+C    C    H +CL   +    +GN+YCP C +    E 
Sbjct: 801 VDDEPCRKCNKSDHPEFILLCDS--CDAGYHMSCLKPALMVIPLGNWYCPPCEHDALIEA 858

Query: 95  MRTK------ELRKKAKE-TKKQLACFI 115
           +  K      E  K+ KE T+KQ   F+
Sbjct: 859 LNGKLSFIEAEWEKRQKEITRKQRLAFV 886


>gi|159470889|ref|XP_001693589.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283092|gb|EDP08843.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1585

 Score = 39.3 bits (90), Expect = 0.47,   Method: Composition-based stats.
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 3/47 (6%)

Query: 43   CIKCNRQRD-ENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
            C+ C R    +N+L+C   GCP + H +CL   V     G+++CP C
Sbjct: 1503 CVDCRRPHPADNMLMCD--GCPGAYHAHCLGIPVPEPGSGDWFCPMC 1547


>gi|84997071|ref|XP_953257.1| SWI/SNF-related chromatin remodelling factor (ISWI homologue)
           [Theileria annulata strain Ankara]
 gi|65304253|emb|CAI76632.1| SWI/SNF-related chromatin remodelling factor (ISWI homologue),
           putative [Theileria annulata]
          Length = 1972

 Score = 38.5 bits (88), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 41/89 (46%), Gaps = 15/89 (16%)

Query: 12  NDDEIGIHIENHGGSGRSGDFT---------DVDFLEEEPCIKCN-RQRDEN--LLVCSQ 59
           N    G +  NH       DFT         +++ + +E C  C  R RDE+  LL+C  
Sbjct: 43  NSTATGYYQNNHDNPIDLSDFTCDSCQQYAQEIEIVNDERCKICKERNRDESDILLLCD- 101

Query: 60  SGCPISVHENCLSCGVKFDDVGNFYCPYC 88
            GCP S H +CL   V+  D   +YCP C
Sbjct: 102 -GCPNSYHMSCLDLHVE-PDSEKWYCPMC 128


>gi|298708138|emb|CBJ30479.1| myst-related protein [Ectocarpus siliculosus]
          Length = 1620

 Score = 38.5 bits (88), Expect = 0.95,   Method: Composition-based stats.
 Identities = 23/65 (35%), Positives = 31/65 (47%), Gaps = 5/65 (7%)

Query: 43  CIKCNRQRD-ENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELR 101
           C  C    D + LL C   GC +  H  CL   VK    G+F+CPYC  K ++    E  
Sbjct: 614 CRVCKVDVDYDKLLKCDGPGCELEWHTYCLKPPVKTVPKGDFFCPYCKAKQKV----EFE 669

Query: 102 KKAKE 106
           KK ++
Sbjct: 670 KKRQQ 674


>gi|348665984|gb|EGZ05812.1| hypothetical protein PHYSODRAFT_551242 [Phytophthora sojae]
          Length = 895

 Score = 38.1 bits (87), Expect = 1.2,   Method: Composition-based stats.
 Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 2/49 (4%)

Query: 42  PCIKCNRQRDEN--LLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
           PC +CNR  D    L++C    C    H NCLS  ++    G+++CP C
Sbjct: 148 PCERCNRVDDHYSLLILCDGKNCKREYHMNCLSPPLESIPPGDWFCPDC 196


>gi|302816250|ref|XP_002989804.1| hypothetical protein SELMODRAFT_428350 [Selaginella moellendorffii]
 gi|300142370|gb|EFJ09071.1| hypothetical protein SELMODRAFT_428350 [Selaginella moellendorffii]
          Length = 2150

 Score = 37.7 bits (86), Expect = 1.7,   Method: Composition-based stats.
 Identities = 30/104 (28%), Positives = 48/104 (46%), Gaps = 7/104 (6%)

Query: 33  TDVDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKC 92
           +D  F E   C+ C  Q   NLL C    CP   H +CLS  +K    G + CP C  + 
Sbjct: 241 SDGSFFE---CMVC--QSGGNLLCCDH--CPRVYHLHCLSPPLKRAPTGKWRCPDCIGES 293

Query: 93  ELMRTKELRKKAKETKKQLACFIDSKSFSGDKKKENCRTDKGNE 136
           E    + +  K  ET K++    +SK+ + + +  N   +K ++
Sbjct: 294 EKPLAETVAAKNTETAKEVRSESESKNQAPNGEILNISNEKSHK 337


>gi|351706869|gb|EHB09788.1| PHD finger protein 21B [Heterocephalus glaber]
          Length = 482

 Score = 37.4 bits (85), Expect = 2.1,   Method: Composition-based stats.
 Identities = 22/61 (36%), Positives = 27/61 (44%), Gaps = 11/61 (18%)

Query: 40  EEPCIKCNRQRDE---------NLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWY 90
           E+PC K   + DE         NLL CS   CP + H  CL   +K    G + CP C  
Sbjct: 292 EDPCWKSELEHDELCAACRRGSNLLCCS--ACPGAYHLGCLHPPLKTAPKGGWLCPKCQQ 349

Query: 91  K 91
           K
Sbjct: 350 K 350


>gi|353231818|emb|CCD79173.1| putative set domain protein [Schistosoma mansoni]
          Length = 1306

 Score = 37.4 bits (85), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 45/98 (45%), Gaps = 13/98 (13%)

Query: 3    GDVAVASAYNDDEIGIHIENHG------GSGRSGDFTDVDFLEEEPCIKCNRQRDENLLV 56
              V +A+      I +H +NH       G+  S        + E+ C +C    +  L++
Sbjct: 956  SPVTMAARATSSSISLH-KNHTIQTKVEGNAESNTIGFNPAVHEDFCYRCGDGGE--LIL 1012

Query: 57   CSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCEL 94
            C +S C  S H NCL  G+    +G +YCP  W+ C+L
Sbjct: 1013 CDKSTCSKSYHLNCL--GLSVPPLGIWYCP--WHYCDL 1046


>gi|348522650|ref|XP_003448837.1| PREDICTED: PHD finger protein 21B-like [Oreochromis niloticus]
          Length = 626

 Score = 37.4 bits (85), Expect = 2.3,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 28/59 (47%), Gaps = 7/59 (11%)

Query: 40  EEPCIKCNRQRDENLLVCSQSG-------CPISVHENCLSCGVKFDDVGNFYCPYCWYK 91
           E+ C K + + D++  VC + G       CP + H  CL   +K    G +YCP C  K
Sbjct: 425 EDFCWKQDLEHDDHCAVCKEDGELQQCHNCPRAFHPTCLHPPLKTPPRGPWYCPKCQKK 483


>gi|320536731|ref|ZP_08036740.1| ankyrin repeat protein [Treponema phagedenis F0421]
 gi|320146434|gb|EFW38041.1| ankyrin repeat protein [Treponema phagedenis F0421]
          Length = 304

 Score = 37.0 bits (84), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 32/67 (47%), Gaps = 1/67 (1%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
           CIKCN + D+N   C + G   S  + C  C    +D G  +CP+C  K + +   EL K
Sbjct: 4   CIKCNAELDDNAEFCGRCGAAQSKKKTCSKCHSALED-GFQFCPHCGLKVKKITKAELNK 62

Query: 103 KAKETKK 109
             K   K
Sbjct: 63  AEKILHK 69


>gi|350645335|emb|CCD59958.1| zinc finger protein, putative [Schistosoma mansoni]
          Length = 1690

 Score = 37.0 bits (84), Expect = 2.3,   Method: Composition-based stats.
 Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 38   LEEEPCIKCNRQRDE-NLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
            +E+  C  C R+ D+ NLL+C   GC ++ H  CL   +K    G+++CP C
Sbjct: 1248 VEDARCRICRRKTDDDNLLLCD--GCNLAFHLYCLRPPLKRVPTGDWFCPTC 1297


>gi|256075776|ref|XP_002574192.1| SET domain protein [Schistosoma mansoni]
          Length = 1343

 Score = 37.0 bits (84), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 45/98 (45%), Gaps = 13/98 (13%)

Query: 3    GDVAVASAYNDDEIGIHIENHG------GSGRSGDFTDVDFLEEEPCIKCNRQRDENLLV 56
              V +A+      I +H +NH       G+  S        + E+ C +C    +  L++
Sbjct: 993  SPVTMAARATSSSISLH-KNHTIQTKVEGNAESNTIGFNPAVHEDFCYRCGDGGE--LIL 1049

Query: 57   CSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCEL 94
            C +S C  S H NCL  G+    +G +YCP  W+ C+L
Sbjct: 1050 CDKSTCSKSYHLNCL--GLSVPPLGIWYCP--WHYCDL 1083


>gi|256070387|ref|XP_002571524.1| zinc finger protein [Schistosoma mansoni]
          Length = 1690

 Score = 37.0 bits (84), Expect = 2.3,   Method: Composition-based stats.
 Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 38   LEEEPCIKCNRQRDE-NLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
            +E+  C  C R+ D+ NLL+C   GC ++ H  CL   +K    G+++CP C
Sbjct: 1248 VEDARCRICRRKTDDDNLLLCD--GCNLAFHLYCLRPPLKRVPTGDWFCPTC 1297


>gi|358255504|dbj|GAA57198.1| probable histone-lysine N-methyltransferase NSD2 [Clonorchis
            sinensis]
          Length = 1293

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 4/40 (10%)

Query: 54   LLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCE 93
            LL+C +S CP S H NCL  G+     G +YCP  W+ C+
Sbjct: 1070 LLLCDKSSCPKSFHLNCL--GLSSPPSGIWYCP--WHYCD 1105


>gi|444519036|gb|ELV12526.1| PHD finger protein 21B [Tupaia chinensis]
          Length = 489

 Score = 37.0 bits (84), Expect = 2.6,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 27/59 (45%), Gaps = 7/59 (11%)

Query: 40  EEPCIKCNRQRDENLLVCSQS-------GCPISVHENCLSCGVKFDDVGNFYCPYCWYK 91
           E+PC K     DE+   C++        GCP + H +CL   +K    G + CP C  K
Sbjct: 299 EDPCWKNEITHDEHCAACTRGANLQPCGGCPGAYHLSCLDPPLKTAPKGTWLCPKCQQK 357


>gi|170079294|ref|YP_001735932.1| putative protein serine/threonine phosphatase [Synechococcus sp.
           PCC 7002]
 gi|169886963|gb|ACB00677.1| probable protein phosphatase [Synechococcus sp. PCC 7002]
          Length = 662

 Score = 36.6 bits (83), Expect = 3.2,   Method: Composition-based stats.
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query: 2   NGDVAVASAYNDDEIGIHIENHGGSGRSGDFTDVDFLEEE 41
           +G+ A A++ ++ E    IE+ GGS  SGDF  +DF EEE
Sbjct: 339 DGETATANSGSETEDSPQIEHSGGSIASGDFEPLDFPEEE 378


>gi|47230592|emb|CAF99785.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1679

 Score = 36.2 bits (82), Expect = 3.9,   Method: Composition-based stats.
 Identities = 27/92 (29%), Positives = 37/92 (40%), Gaps = 5/92 (5%)

Query: 43   CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
            C  C +  +E+LL+    GC    H  C    +     G++YCP C  K     +    K
Sbjct: 1435 CQMCKKGDNEDLLLLC-DGCDKGCHTYCHKPKISTIPEGDWYCPSCISKA----SGSSPK 1489

Query: 103  KAKETKKQLACFIDSKSFSGDKKKENCRTDKG 134
              K   KQ+A    S    GD KK   +T  G
Sbjct: 1490 SKKTPSKQVATGGGSAKKGGDSKKNGKQTGNG 1521


>gi|378730781|gb|EHY57240.1| hypothetical protein HMPREF1120_05286 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 449

 Score = 36.2 bits (82), Expect = 4.1,   Method: Composition-based stats.
 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 2/63 (3%)

Query: 43  CIKCNRQRDENLLVCSQSGCPIS-VHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELR 101
           CI CN++    ++ C    CP    H  CL+     D+  ++YCP C  K E++    L+
Sbjct: 385 CI-CNQRSYGEMVACENDDCPYQWFHTGCLNMKKVPDEDEDWYCPTCREKPEVIEKVRLK 443

Query: 102 KKA 104
           KK 
Sbjct: 444 KKG 446


>gi|348551108|ref|XP_003461372.1| PREDICTED: PHD finger protein 21B [Cavia porcellus]
          Length = 513

 Score = 36.2 bits (82), Expect = 4.3,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 36  DFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYK 91
           +F  +E C  C R  D  LL CS   CP + H  CL   +K    G + CP C  K
Sbjct: 330 EFSHDEHCAACRRGAD--LLHCS--ACPDAYHLGCLHPPLKTAPRGGWLCPKCQQK 381


>gi|332265789|ref|XP_003281897.1| PREDICTED: leishmanolysin-like peptidase [Nomascus leucogenys]
          Length = 649

 Score = 36.2 bits (82), Expect = 4.3,   Method: Composition-based stats.
 Identities = 27/77 (35%), Positives = 37/77 (48%), Gaps = 15/77 (19%)

Query: 26  SGRSGDFTDVDFLEEEPCIKCNRQ------RDENLLVCSQSG--CPISVHENCLSCGVKF 77
           SG     +D   LE +P + C+ Q      +D + L CS++G   P+S+  N    G   
Sbjct: 529 SGEYQRSSDCRILENQPDVSCSPQGLKVWVQDTSYL-CSRAGQVLPVSIQMN----GWIH 583

Query: 78  DDVGNFYCPYCWYKCEL 94
           D  GN  CP CW  CEL
Sbjct: 584 D--GNLLCPSCWDFCEL 598


>gi|194874037|ref|XP_001973329.1| GG16034 [Drosophila erecta]
 gi|190655112|gb|EDV52355.1| GG16034 [Drosophila erecta]
          Length = 869

 Score = 36.2 bits (82), Expect = 4.4,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 26/52 (50%), Gaps = 4/52 (7%)

Query: 37 FLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
          F EEE C  C   RD   L+C  S CP   H  CL+  +K    G++ CP C
Sbjct: 5  FREEEYCRVC---RDGGDLLCCDS-CPSVYHRTCLTPPLKSIPKGDWICPRC 52


>gi|303288972|ref|XP_003063774.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454842|gb|EEH52147.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 702

 Score = 36.2 bits (82), Expect = 4.4,   Method: Composition-based stats.
 Identities = 16/50 (32%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 39  EEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
           ++ PC+ C R   E+  V  ++GCP   H  CL  G+     G+++C  C
Sbjct: 394 DDVPCLACGRADGESDFVLCENGCPTGGHYQCL--GMAAVPEGDWFCDAC 441


>gi|399218583|emb|CCF75470.1| unnamed protein product [Babesia microti strain RI]
          Length = 1643

 Score = 36.2 bits (82), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 32/60 (53%), Gaps = 5/60 (8%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
           C  C    +E LL+C   GCP S H +CL+   +  D  N+YCP C  K E  +  E+R+
Sbjct: 74  CKICKEINEETLLLCD--GCPNSYHLSCLALETE-PDADNWYCPMC--KPEDHKGHEIRR 128


>gi|405963330|gb|EKC28913.1| hypothetical protein CGI_10016416 [Crassostrea gigas]
          Length = 519

 Score = 36.2 bits (82), Expect = 4.9,   Method: Composition-based stats.
 Identities = 17/53 (32%), Positives = 27/53 (50%), Gaps = 4/53 (7%)

Query: 36  DFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
           D++E   C+ C    +E +L+C    C  S H+ C+   ++ D   N YCP C
Sbjct: 456 DYVECSQCVICLEDYEEGVLLCGLP-CKHSFHQRCIVVWLQRD---NHYCPVC 504


>gi|358374402|dbj|GAA90994.1| PHD and RING finger domain protein [Aspergillus kawachii IFO 4308]
          Length = 626

 Score = 35.8 bits (81), Expect = 5.2,   Method: Composition-based stats.
 Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 3/78 (3%)

Query: 31  DFTDVDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWY 90
           ++ D DF + +PC  C    +E LL+    GC +  H  C+  G+     G +YC  C  
Sbjct: 125 EYIDDDFADFQPCPICGDADNEELLLLC-DGCDVPSHTYCV--GLDSVPTGPWYCTRCVT 181

Query: 91  KCELMRTKELRKKAKETK 108
           +  L R+ E    + +T+
Sbjct: 182 QRRLGRSPETADWSSQTQ 199


>gi|71029212|ref|XP_764249.1| DNA-dependent helicase [Theileria parva strain Muguga]
 gi|68351203|gb|EAN31966.1| DNA-dependent helicase, putative [Theileria parva]
          Length = 2026

 Score = 35.8 bits (81), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 33/58 (56%), Gaps = 6/58 (10%)

Query: 34  DVDFLEEEPCIKCN-RQRDEN--LLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
           +V+ + +E C  C  R RDE+  LL+C   GCP S H +CL   V+  D   +YCP C
Sbjct: 74  EVEIVNDERCKICKERNRDESDILLLCD--GCPNSYHMSCLDLHVE-PDSEKWYCPMC 128


>gi|403223808|dbj|BAM41938.1| SWI/SNF-related chromatin remodelling factor [Theileria orientalis
           strain Shintoku]
          Length = 1002

 Score = 35.8 bits (81), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 6/58 (10%)

Query: 34  DVDFLEEEPCIKCN-RQRDEN--LLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
           +++ + +E C  C  R RDE+  LL+C   GCP S H +CL   V+  D   +YCP C
Sbjct: 85  ELEIVNDERCKICRERNRDESDVLLLCD--GCPNSYHMSCLELQVE-PDSEKWYCPMC 139


>gi|298710439|emb|CBJ25503.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1754

 Score = 35.4 bits (80), Expect = 6.9,   Method: Composition-based stats.
 Identities = 18/61 (29%), Positives = 30/61 (49%), Gaps = 11/61 (18%)

Query: 37  FLEEEP--------CIKC-NRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPY 87
            + EEP        C+ C +   +E L++C   GC  + H  C  CG++    G+++CP 
Sbjct: 685 LIPEEPEPEGARAVCLHCQDGGAEEQLMLCDGPGCSNAAHTFC--CGLEEVPAGDWFCPA 742

Query: 88  C 88
           C
Sbjct: 743 C 743


>gi|350631905|gb|EHA20274.1| hypothetical protein ASPNIDRAFT_55163 [Aspergillus niger ATCC 1015]
          Length = 625

 Score = 35.4 bits (80), Expect = 7.3,   Method: Composition-based stats.
 Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 3/69 (4%)

Query: 31  DFTDVDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWY 90
           ++ D DF + +PC  C    +E LL+    GC +  H  C+  G+     G +YC  C  
Sbjct: 124 EYIDDDFADFQPCPICGDADNEELLLLC-DGCDVPSHTYCV--GLDSVPTGPWYCARCVT 180

Query: 91  KCELMRTKE 99
           +  L R+ E
Sbjct: 181 QRRLGRSSE 189


>gi|317038822|ref|XP_001402254.2| PHD and RING finger domain protein [Aspergillus niger CBS 513.88]
          Length = 582

 Score = 35.4 bits (80), Expect = 7.4,   Method: Composition-based stats.
 Identities = 21/69 (30%), Positives = 33/69 (47%), Gaps = 3/69 (4%)

Query: 31  DFTDVDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWY 90
           ++ D DF + +PC  C    +E LL+    GC +  H  C+  G+     G +YC  C  
Sbjct: 124 EYIDDDFADFQPCPICGDADNEELLLLC-DGCDVPSHTYCV--GLDSVPTGPWYCARCVT 180

Query: 91  KCELMRTKE 99
           +  L R+ E
Sbjct: 181 QRRLGRSSE 189


>gi|410912905|ref|XP_003969929.1| PREDICTED: bromodomain adjacent to zinc finger domain protein 2B-like
            [Takifugu rubripes]
          Length = 2168

 Score = 35.4 bits (80), Expect = 7.7,   Method: Composition-based stats.
 Identities = 26/92 (28%), Positives = 37/92 (40%), Gaps = 5/92 (5%)

Query: 43   CIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKELRK 102
            C  C +  +E+LL+    GC    H  C    +     G++YCP C  K     +    K
Sbjct: 1921 CQMCKKGDNEDLLLLC-DGCDKGCHTYCHKPKISTIPEGDWYCPACISKA----SGSSPK 1975

Query: 103  KAKETKKQLACFIDSKSFSGDKKKENCRTDKG 134
              K   KQ+A    S    G+ KK   +T  G
Sbjct: 1976 SKKTPSKQVASSGGSAKKGGEAKKNGKQTGNG 2007


>gi|448517075|ref|XP_003867703.1| hypothetical protein CORT_0B05580 [Candida orthopsilosis Co 90-125]
 gi|380352042|emb|CCG22266.1| hypothetical protein CORT_0B05580 [Candida orthopsilosis]
          Length = 475

 Score = 35.4 bits (80), Expect = 8.2,   Method: Composition-based stats.
 Identities = 21/52 (40%), Positives = 25/52 (48%), Gaps = 3/52 (5%)

Query: 43  CIKCNRQRDENLLVCSQSGCPISVHENCLSCG-VKFDDVGNFYCPYCWYKCE 93
           C+    +R+  LL C Q  C  S H NCL       DD  N+YCP C Y  E
Sbjct: 117 CLCAITRRNSPLLPCQQ--CASSFHVNCLGISDAATDDYFNWYCPMCDYNQE 166


>gi|222637725|gb|EEE67857.1| hypothetical protein OsJ_25665 [Oryza sativa Japonica Group]
          Length = 786

 Score = 35.4 bits (80), Expect = 8.2,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 2/54 (3%)

Query: 35  VDFLEEEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYC 88
           +D  E   CI+CNR  D+NL++     C  S H  C+  G +  + GN+YC  C
Sbjct: 454 LDPYENVVCIECNRGGDDNLMLLCDI-CDSSAHTYCVGLGRQVPE-GNWYCGGC 505


>gi|357613867|gb|EHJ68756.1| hypothetical protein KGM_12662 [Danaus plexippus]
          Length = 727

 Score = 35.0 bits (79), Expect = 9.4,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 34/81 (41%), Gaps = 8/81 (9%)

Query: 40  EEPCIKCNRQRDENLLVCSQSGCPISVHENCLSCGVKFDDVGNFYCPYCWYKCELMRTKE 99
           E  C KCN       L+   S       +NC S     D +G F C  C Y+C+     +
Sbjct: 455 EYACYKCNEVFPSKRLLKQHS-------KNCESADSDLDKLGKFSCSQCAYRCQSPAILK 507

Query: 100 LRKKAKETKKQLAC-FIDSKS 119
           + ++    +K  AC F D KS
Sbjct: 508 IHERTHTGEKPYACTFCDYKS 528


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.136    0.432 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,309,606,704
Number of Sequences: 23463169
Number of extensions: 93225345
Number of successful extensions: 224164
Number of sequences better than 100.0: 388
Number of HSP's better than 100.0 without gapping: 56
Number of HSP's successfully gapped in prelim test: 332
Number of HSP's that attempted gapping in prelim test: 224026
Number of HSP's gapped (non-prelim): 447
length of query: 138
length of database: 8,064,228,071
effective HSP length: 103
effective length of query: 35
effective length of database: 9,942,488,960
effective search space: 347987113600
effective search space used: 347987113600
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 71 (32.0 bits)