BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 012466
(463 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225432860|ref|XP_002283908.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Vitis
vinifera]
gi|147794687|emb|CAN69148.1| hypothetical protein VITISV_003946 [Vitis vinifera]
gi|297737139|emb|CBI26340.3| unnamed protein product [Vitis vinifera]
Length = 457
Score = 785 bits (2028), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/461 (85%), Positives = 422/461 (91%), Gaps = 5/461 (1%)
Query: 1 MQKREQGKSGGAAGGAATPAAKRGRPFGSTSGSSGGSGSAADSAAPTTLLGPSLQVHSSF 60
MQKR+Q K GG AGGA TPAAKRGRPFGS GS+ + +AAD+AAP+TLLGPSL VHSSF
Sbjct: 1 MQKRDQSKLGGTAGGATTPAAKRGRPFGS-GGSNSAAAAAADAAAPSTLLGPSLHVHSSF 59
Query: 61 ADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID 120
ADQN+KRIVLALQSGLKSEL WA+N LTLLSFKEKDD+RKDATPLAKIPGLLDALLQVID
Sbjct: 60 ADQNNKRIVLALQSGLKSELGWAINALTLLSFKEKDDVRKDATPLAKIPGLLDALLQVID 119
Query: 121 DWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQ 180
DWRDIALPKEL+K PRAR LG NS VTGFG+E+EALGS N+ S GSGSS +++ VQ
Sbjct: 120 DWRDIALPKELAKAPRARLLGANSFVTGFGNEYEALGS-NDVL--SHPGSGSSISEASVQ 176
Query: 181 KNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCL 240
KN ++R SEWW DEDGLFNLD+EGRAEKQQCAV ASNIIRNFSFMPDNEVIMAQHRHCL
Sbjct: 177 KNTTKLRPSEWWLDEDGLFNLDEEGRAEKQQCAVAASNIIRNFSFMPDNEVIMAQHRHCL 236
Query: 241 ETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGI 300
ETVFQCIEDH+TEDEELVTNALETIVNLAPLLDLRIFSSSK SYIKIT EKRAV+AIMG+
Sbjct: 237 ETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKIT-EKRAVQAIMGM 295
Query: 301 LGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALY 360
LGS KAWHCAAAELLGRLIINPDNEPFLLPF QIHKRLVDL+SLPA DAQAAAVGALY
Sbjct: 296 LGSAVKAWHCAAAELLGRLIINPDNEPFLLPFASQIHKRLVDLLSLPAVDAQAAAVGALY 355
Query: 361 NLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAY 420
NLAEVN+DCRLKLASERWAIDRLL+VIKTPHPVPEVCRKAAMI+ESLVSEPQNR LLAY
Sbjct: 356 NLAEVNMDCRLKLASERWAIDRLLKVIKTPHPVPEVCRKAAMIIESLVSEPQNRAQLLAY 415
Query: 421 ENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
ENAFAEILFSDGR+SDTFARILYELTSRPNNK+A+ARGIWG
Sbjct: 416 ENAFAEILFSDGRHSDTFARILYELTSRPNNKMAAARGIWG 456
>gi|356500441|ref|XP_003519040.1| PREDICTED: uncharacterized protein LOC100814807 [Glycine max]
Length = 460
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/461 (81%), Positives = 409/461 (88%), Gaps = 2/461 (0%)
Query: 1 MQKREQGKSGGAAGGAATPAAKRGRPFGSTSGSSGGSGSAADSAAPTTLLGPSLQVHSSF 60
M KREQGKSGGAAGG A AKRGRPFGS + S+ + SAADSAAP+ LLGPSL VH+SF
Sbjct: 1 MLKREQGKSGGAAGGVAVTPAKRGRPFGSGNNSASAAASAADSAAPSNLLGPSLHVHNSF 60
Query: 61 ADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID 120
ADQN+KRIVLALQSGLKSELTWALN LTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID
Sbjct: 61 ADQNNKRIVLALQSGLKSELTWALNILTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID 120
Query: 121 DWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQ 180
DWRDIALPKEL+K R RTLG +S+VTGFG E++ALGS R GVGSGS+ +S Q
Sbjct: 121 DWRDIALPKELAKTTRVRTLGASSVVTGFGCEYQALGS-TGTHHRPGVGSGSAGIESTQQ 179
Query: 181 KNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCL 240
+ R SE W DED LFNLDDEGR EKQQCAV SNIIRNFSFMPDNEVIM QHRHCL
Sbjct: 180 NGVTKSRFSELWLDEDSLFNLDDEGRTEKQQCAVATSNIIRNFSFMPDNEVIMVQHRHCL 239
Query: 241 ETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGI 300
ET FQCIEDH+ EDEELVTNALETIVNLAPLLDLRIFSSSK S+IKIT EKRAV+AIMG+
Sbjct: 240 ETAFQCIEDHLVEDEELVTNALETIVNLAPLLDLRIFSSSKPSFIKIT-EKRAVQAIMGM 298
Query: 301 LGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALY 360
L S KAWHCAAAELLGRLIINPDNEPFLLPF PQIHKRL+DL+S+PA DAQAAA+GALY
Sbjct: 299 LESAVKAWHCAAAELLGRLIINPDNEPFLLPFFPQIHKRLIDLISMPALDAQAAAIGALY 358
Query: 361 NLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAY 420
NLAEVN+DCRLK+A+ERWAIDRLL+VIKTPHPVPEVCRK+AMILESLVSEPQNR LLLAY
Sbjct: 359 NLAEVNMDCRLKIANERWAIDRLLKVIKTPHPVPEVCRKSAMILESLVSEPQNRSLLLAY 418
Query: 421 ENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
ENAFAEI+F+DGRYSDTFARILYELTSRP++KVA+ARGIWG
Sbjct: 419 ENAFAEIVFTDGRYSDTFARILYELTSRPSSKVAAARGIWG 459
>gi|356536049|ref|XP_003536553.1| PREDICTED: uncharacterized protein LOC100790539 [Glycine max]
Length = 460
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/461 (83%), Positives = 417/461 (90%), Gaps = 2/461 (0%)
Query: 1 MQKREQGKSGGAAGGAATPAAKRGRPFGSTSGSSGGSGSAADSAAPTTLLGPSLQVHSSF 60
MQKREQGKSGG+AGG ATP AKRGRPFGS + SS + SAADSAAP+TLLGPSL VH+SF
Sbjct: 1 MQKREQGKSGGSAGGGATPPAKRGRPFGSGNNSSSAAASAADSAAPSTLLGPSLHVHNSF 60
Query: 61 ADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID 120
ADQN+KRIVLALQSGLK+ELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID
Sbjct: 61 ADQNNKRIVLALQSGLKNELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID 120
Query: 121 DWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQ 180
DWRDIALPKE +K R RTLG NS+V+GFGSE++ALGS R GVGSGS+ +S Q
Sbjct: 121 DWRDIALPKEFAKTTRIRTLGANSVVSGFGSEYQALGSTGTPH-RPGVGSGSAGIESTQQ 179
Query: 181 KNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCL 240
A+ R SE W DED LFNLDDEGRAEKQQCAV ASNIIRNFSFMPDNEVIMAQHRHCL
Sbjct: 180 NGMAKSRFSELWLDEDSLFNLDDEGRAEKQQCAVAASNIIRNFSFMPDNEVIMAQHRHCL 239
Query: 241 ETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGI 300
ET FQCIEDH+ EDEELVTNALETIVNLAPLLDLRIFSSSK S+IKIT EKRAV+AIMG+
Sbjct: 240 ETAFQCIEDHLVEDEELVTNALETIVNLAPLLDLRIFSSSKPSFIKIT-EKRAVQAIMGM 298
Query: 301 LGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALY 360
L S KAWHCAAAELLGRLIINPDNEPFLLPF PQIHKRL+DL+S+PA DAQAAA+GALY
Sbjct: 299 LESAVKAWHCAAAELLGRLIINPDNEPFLLPFFPQIHKRLIDLISMPALDAQAAAIGALY 358
Query: 361 NLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAY 420
NLAEVN+DCRLK+A+ERWAIDRLL+VIK PHPVPEVCRK+AMILESLVSEPQNR LLLAY
Sbjct: 359 NLAEVNMDCRLKIANERWAIDRLLKVIKMPHPVPEVCRKSAMILESLVSEPQNRSLLLAY 418
Query: 421 ENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
ENAFAEI+F+DGRYSDTFARILYELTSRP+NKVA+ARGIWG
Sbjct: 419 ENAFAEIVFTDGRYSDTFARILYELTSRPSNKVAAARGIWG 459
>gi|449432640|ref|XP_004134107.1| PREDICTED: uncharacterized protein LOC101219772 [Cucumis sativus]
gi|449516049|ref|XP_004165060.1| PREDICTED: uncharacterized LOC101219772 [Cucumis sativus]
Length = 460
Score = 740 bits (1911), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/468 (78%), Positives = 405/468 (86%), Gaps = 16/468 (3%)
Query: 1 MQKREQGKSGG-AAGGAATPAAKRGRPFGSTSGSSGGSGS----AADSAAPTTLLGPSLQ 55
MQKR+Q K GG +GGA+ P AKRGRPFGS + ++ + ++ AP+TLLGPSL
Sbjct: 1 MQKRDQNKLGGNVSGGASAPPAKRGRPFGSVNSNAAAVAAAVAAGTETLAPSTLLGPSLH 60
Query: 56 VHSSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDAL 115
+H+SFADQN+KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMR+D+TPLAKIPGLLDAL
Sbjct: 61 IHTSFADQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRRDSTPLAKIPGLLDAL 120
Query: 116 LQVIDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSI--NNAFPRSGVGSGSS 173
LQVIDDWRDIALP++L K R RTLG NS VTGFG+EFEALGS+ ++ P S V +
Sbjct: 121 LQVIDDWRDIALPRDLVKKQRVRTLGANSSVTGFGNEFEALGSVVPDSLRPSSSVSESTG 180
Query: 174 AADSLVQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIM 233
+A++ S WW +EDGLFNLDDEGRAE+QQCAV ASNI+RNFSFMP+NE IM
Sbjct: 181 --------HASKPSSRPWWLEEDGLFNLDDEGRAERQQCAVSASNILRNFSFMPENESIM 232
Query: 234 AQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRA 293
A HRH LETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSS K SYIKIT EKRA
Sbjct: 233 ALHRHTLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSLKPSYIKIT-EKRA 291
Query: 294 VEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQA 353
VEAIMG+LGS K WHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMS+PA DAQA
Sbjct: 292 VEAIMGMLGSAVKVWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSIPALDAQA 351
Query: 354 AAVGALYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQN 413
AAVGALYNL EVN+DCR+KLASERWAIDRLL+VIK PHPVPE+CRKAAMILESLVSEPQN
Sbjct: 352 AAVGALYNLVEVNMDCRIKLASERWAIDRLLKVIKMPHPVPEICRKAAMILESLVSEPQN 411
Query: 414 RVLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
R LLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVA+A+G+WG
Sbjct: 412 RGLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVAAAQGVWG 459
>gi|224099867|ref|XP_002311651.1| predicted protein [Populus trichocarpa]
gi|118487781|gb|ABK95714.1| unknown [Populus trichocarpa]
gi|222851471|gb|EEE89018.1| predicted protein [Populus trichocarpa]
Length = 453
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/463 (82%), Positives = 411/463 (88%), Gaps = 13/463 (2%)
Query: 1 MQKREQGKSGGAAGGAATPAAKRGRPFGSTSGSSGGSGSAADSAAPTTLLGPSLQVHSSF 60
MQKR+ KSGG+ GG+A PA KRGRPFGSTSGS+ S AAD AP+TLLGPSLQVH+SF
Sbjct: 1 MQKRDLNKSGGSGGGSAAPAPKRGRPFGSTSGSAAAS-FAADFVAPSTLLGPSLQVHTSF 59
Query: 61 A--DQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQV 118
A DQN+KRIVLALQSGLKSELTWALNTLTLLSFKEK+DMRKD+ LAKI GLLDALLQV
Sbjct: 60 AASDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKEDMRKDS--LAKISGLLDALLQV 117
Query: 119 IDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSL 178
IDDWRDIALPKEL K R RTLG NSLVTGFG E+EALGS N+ +SG+ D+
Sbjct: 118 IDDWRDIALPKELQKTRRVRTLGSNSLVTGFGYEYEALGS-NDNVKQSGL------TDAS 170
Query: 179 VQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
VQKN A+ R SEWW+DEDGLFNLDDEGRAEKQQCAV ASNIIRNFSFMP+NEVIMA +RH
Sbjct: 171 VQKNVAKFRPSEWWYDEDGLFNLDDEGRAEKQQCAVAASNIIRNFSFMPENEVIMAANRH 230
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIM 298
CLETVFQCIEDH TEDEELVTNALETIVNLAPLLDLRIFSSSK SYIKIT EKRAV+AIM
Sbjct: 231 CLETVFQCIEDHSTEDEELVTNALETIVNLAPLLDLRIFSSSKPSYIKIT-EKRAVQAIM 289
Query: 299 GILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGA 358
G+LGS KAWHCAAAELLGRLIINPDNEPFLLPF PQIHKRLVDLMS PA DAQAAAVGA
Sbjct: 290 GMLGSAVKAWHCAAAELLGRLIINPDNEPFLLPFFPQIHKRLVDLMSSPALDAQAAAVGA 349
Query: 359 LYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLL 418
LYNLAEVN+DCRLKLASERWA+DRLLRVI+ PHPVPEVCRKAAMILESLVSEPQN+ LLL
Sbjct: 350 LYNLAEVNMDCRLKLASERWAVDRLLRVIRAPHPVPEVCRKAAMILESLVSEPQNKALLL 409
Query: 419 AYENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
AYENAFAEILFSD RYSDTFARILYELTSRP+NK +ARG+WG
Sbjct: 410 AYENAFAEILFSDTRYSDTFARILYELTSRPSNKFTAARGVWG 452
>gi|358346987|ref|XP_003637544.1| hypothetical protein MTR_090s0013 [Medicago truncatula]
gi|355503479|gb|AES84682.1| hypothetical protein MTR_090s0013 [Medicago truncatula]
Length = 524
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/400 (81%), Positives = 361/400 (90%), Gaps = 2/400 (0%)
Query: 61 ADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID 120
+DQN+KRIVLALQSGLKSELTWALNTLTLLSFKEKD+MRKDATPLAKIPGLLDALLQVID
Sbjct: 96 SDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDEMRKDATPLAKIPGLLDALLQVID 155
Query: 121 DWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQ 180
DWRDIALPKEL + R R+LGVNS+ TGFG+E++ALGS R +GSG+++ +S Q
Sbjct: 156 DWRDIALPKELVRTTRVRSLGVNSVATGFGNEYQALGS-TGTLQRPSLGSGTASTESAQQ 214
Query: 181 KNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCL 240
A+ R SE FDEDGLFNLDDEGRAEKQQCAV ASNIIRNFSFMPDNEVIMAQ+RHC+
Sbjct: 215 NGTAKPRFSELRFDEDGLFNLDDEGRAEKQQCAVAASNIIRNFSFMPDNEVIMAQNRHCM 274
Query: 241 ETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGI 300
ET FQCIEDH+ EDEELVTNA+ETIVNLAPLLDLRIFSSSK S+IKIT EKRAV+AI+GI
Sbjct: 275 ETAFQCIEDHIVEDEELVTNAIETIVNLAPLLDLRIFSSSKPSFIKIT-EKRAVQAIIGI 333
Query: 301 LGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALY 360
L SP KAWHCAAAELLGRLIINPDNEPFLLPF P+I+K L+DL+SLPA DAQAAA+GALY
Sbjct: 334 LNSPVKAWHCAAAELLGRLIINPDNEPFLLPFFPKIYKHLIDLISLPATDAQAAAIGALY 393
Query: 361 NLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAY 420
NLAEVN+DCRL +ASERWAIDRLL+VIK PHPVPEVCRKAAMILESLVSEPQNR LLL Y
Sbjct: 394 NLAEVNMDCRLGIASERWAIDRLLKVIKAPHPVPEVCRKAAMILESLVSEPQNRTLLLVY 453
Query: 421 ENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIW 460
ENAFAEILF+D +YSDTFARILYEL+SRP +KVA+ARGIW
Sbjct: 454 ENAFAEILFTDSKYSDTFARILYELSSRPGHKVATARGIW 493
>gi|358345932|ref|XP_003637028.1| hypothetical protein MTR_067s0040, partial [Medicago truncatula]
gi|355502963|gb|AES84166.1| hypothetical protein MTR_067s0040, partial [Medicago truncatula]
Length = 444
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/402 (80%), Positives = 360/402 (89%), Gaps = 2/402 (0%)
Query: 59 SFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQV 118
+ QN+KRIVLALQSGLKSELTWALNTLTLLSFKEKD+MRKDATPLAKIPGLLDALLQV
Sbjct: 14 NLGHQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKDEMRKDATPLAKIPGLLDALLQV 73
Query: 119 IDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSL 178
IDDWRDIALPKEL + R R+LGVNS+ TGFG+E++ALGS R +GSG+++ +S
Sbjct: 74 IDDWRDIALPKELVRTTRVRSLGVNSVATGFGNEYQALGS-TGTLQRPSLGSGTASTESA 132
Query: 179 VQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
Q A+ R SE FDEDGLFNLDDEGRAEKQQCAV ASNIIRNFSFMPDNEVIMAQ+RH
Sbjct: 133 QQNGTAKPRFSELRFDEDGLFNLDDEGRAEKQQCAVAASNIIRNFSFMPDNEVIMAQNRH 192
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIM 298
C+ET FQCIEDH+ EDEELVTNA+ETIVNLAPLLDLRIFSSSK S+IKIT EKRAV+AI+
Sbjct: 193 CMETAFQCIEDHIVEDEELVTNAIETIVNLAPLLDLRIFSSSKPSFIKIT-EKRAVQAII 251
Query: 299 GILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGA 358
GIL SP KAWHCAAAELLGRLIINPDNEPFLLPF P+I+K L+DL+SLPA DAQAAA+GA
Sbjct: 252 GILNSPVKAWHCAAAELLGRLIINPDNEPFLLPFFPKIYKHLIDLISLPATDAQAAAIGA 311
Query: 359 LYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLL 418
LYNLAEVN+DCRL +ASERWAIDRLL+VIK PHPVPEVCRKAAMILESLVSEPQNR LLL
Sbjct: 312 LYNLAEVNMDCRLGIASERWAIDRLLKVIKAPHPVPEVCRKAAMILESLVSEPQNRTLLL 371
Query: 419 AYENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIW 460
YENAFAEILF+D +YSDTFARILYEL+SRP +KVA+ARGIW
Sbjct: 372 VYENAFAEILFTDSKYSDTFARILYELSSRPGHKVATARGIW 413
>gi|255551987|ref|XP_002517038.1| conserved hypothetical protein [Ricinus communis]
gi|223543673|gb|EEF45201.1| conserved hypothetical protein [Ricinus communis]
Length = 412
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/461 (75%), Positives = 377/461 (81%), Gaps = 50/461 (10%)
Query: 1 MQKREQGKSGGAAGGAATPAAKRGRPFGSTSGSSGGSGSAADSAAPTTLLGPSLQVHSSF 60
MQKREQ KSGG+AGGA P+AKRGRPFGSTS + SGS D+ AP LLGPSLQVH+SF
Sbjct: 1 MQKREQNKSGGSAGGATAPSAKRGRPFGSTSSNITSSGSGGDTVAPLNLLGPSLQVHTSF 60
Query: 61 ADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID 120
DQN+KRIVLALQSGLKSELTWALNTLTLLSFKEK+DMRKD+ L KI GLLDALLQVID
Sbjct: 61 VDQNNKRIVLALQSGLKSELTWALNTLTLLSFKEKEDMRKDS--LCKISGLLDALLQVID 118
Query: 121 DWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQ 180
DWRDIA PK+LSK PR RTLG NSLVTGFG+ +EALG +NNA
Sbjct: 119 DWRDIAPPKDLSKTPRVRTLGANSLVTGFGNGYEALG-LNNA------------------ 159
Query: 181 KNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCL 240
GR + SNIIRNFSFMP+NEVIMAQHRHCL
Sbjct: 160 -----------------------PGRMQPX-----XSNIIRNFSFMPENEVIMAQHRHCL 191
Query: 241 ETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGI 300
ETVFQCIEDH+TEDEELVTNALETIVNLAPLLDLRIFSS+K S+IKIT EKRAV+AIMG+
Sbjct: 192 ETVFQCIEDHITEDEELVTNALETIVNLAPLLDLRIFSSAKPSFIKIT-EKRAVQAIMGM 250
Query: 301 LGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALY 360
LGS +AWHCAAAELLGR+IINPDNEPFLLPFVPQIHKRLVDLMS+ A DAQAAAVGALY
Sbjct: 251 LGSAVRAWHCAAAELLGRMIINPDNEPFLLPFVPQIHKRLVDLMSIQAVDAQAAAVGALY 310
Query: 361 NLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAY 420
NLAEVN+DCRLKLASERWA+DRLL+VIK PHPVPEVCRKAAMILESLVSEPQNR LLLAY
Sbjct: 311 NLAEVNMDCRLKLASERWAVDRLLKVIKMPHPVPEVCRKAAMILESLVSEPQNRALLLAY 370
Query: 421 ENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
ENAFAEILFS+ RYSDTFARILYELTSRPNNK A+ARG+WG
Sbjct: 371 ENAFAEILFSESRYSDTFARILYELTSRPNNKFAAARGVWG 411
>gi|125559139|gb|EAZ04675.1| hypothetical protein OsI_26829 [Oryza sativa Indica Group]
Length = 460
Score = 639 bits (1648), Expect = e-180, Method: Compositional matrix adjust.
Identities = 315/450 (70%), Positives = 378/450 (84%), Gaps = 17/450 (3%)
Query: 18 TPAAKRGRPFGSTSGS----SGGSGSAADSAAPTTLLGPSLQVHSSFADQNHKRIVLALQ 73
TPA KRGRPFGST+GS + + + D+AAP L+GPSLQV ++ +DQN+KRIVLALQ
Sbjct: 19 TPA-KRGRPFGSTTGSGAAAAAAAAAIGDAAAPAALVGPSLQVLTALSDQNNKRIVLALQ 77
Query: 74 SGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSK 133
SGLKSE+ WALN LT+LSFKEKDD+R+D TPLAK+PGLLDALLQVIDDWRDIA+PK+ +K
Sbjct: 78 SGLKSEILWALNALTVLSFKEKDDLRRDTTPLAKVPGLLDALLQVIDDWRDIAMPKDHTK 137
Query: 134 GPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSG--SSAADSLVQKNAARVRSSEW 191
PR RTLGVN+ ++GFG E ++ + + + S + ADS V K RS+ +
Sbjct: 138 PPRVRTLGVNTTLSGFGHE-----NVEKVYSDTIIPSDDQTKTADSTVTKK----RSAGF 188
Query: 192 WFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHV 251
FDE+GLFN+DDEGR EKQQCAV ASNIIRNFSFMP+NE +M QHRHCLETVFQC+ED
Sbjct: 189 LFDEEGLFNVDDEGRTEKQQCAVAASNIIRNFSFMPENETVMVQHRHCLETVFQCLEDQN 248
Query: 252 TEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGILGSPFKAWHCA 311
TED+EL+TN LET+VNLAP+LDLRIFSSSK S+IKIT EKRAV+AIMG+L S + WHCA
Sbjct: 249 TEDDELITNMLETLVNLAPVLDLRIFSSSKPSFIKIT-EKRAVQAIMGMLASSIRVWHCA 307
Query: 312 AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRL 371
AAEL+GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA DAQAAA+ ALYN+AEVN+D RL
Sbjct: 308 AAELIGRLIINPDNEPFLLPAIPQIYKRLVDLLSVPAVDAQAAAISALYNVAEVNMDFRL 367
Query: 372 KLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSD 431
KLASERWA+DRLL+V+KTPHPVPEVCRKA+MI+ESLVSEPQNR+ LL +EN FAEIL S+
Sbjct: 368 KLASERWAVDRLLKVVKTPHPVPEVCRKASMIVESLVSEPQNRMHLLVHENTFAEILTSE 427
Query: 432 GRYSDTFARILYELTSRPNNKVASARGIWG 461
G+YSDTFARILYELT+RP+NKV + + IWG
Sbjct: 428 GKYSDTFARILYELTARPSNKVTAGQAIWG 457
>gi|297725863|ref|NP_001175295.1| Os07g0609766 [Oryza sativa Japonica Group]
gi|14192867|gb|AAK55772.1|AC079038_6 Unknown protein [Oryza sativa]
gi|34394199|dbj|BAC84651.1| unknown protein [Oryza sativa Japonica Group]
gi|255677964|dbj|BAH94023.1| Os07g0609766 [Oryza sativa Japonica Group]
Length = 460
Score = 639 bits (1647), Expect = e-180, Method: Compositional matrix adjust.
Identities = 315/450 (70%), Positives = 377/450 (83%), Gaps = 17/450 (3%)
Query: 18 TPAAKRGRPFGSTSGS----SGGSGSAADSAAPTTLLGPSLQVHSSFADQNHKRIVLALQ 73
TPA KRGRPFGST+GS + + + D+AAP L+GPSLQV ++ +DQN+KRIVLALQ
Sbjct: 19 TPA-KRGRPFGSTTGSGAAAAAAAAAIGDAAAPAALVGPSLQVLTALSDQNNKRIVLALQ 77
Query: 74 SGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSK 133
SGLKSE+ WALN LT+LSFKEKDD+R+D TPLAK+PGLLDALLQVIDDWRDIA+PK+ +K
Sbjct: 78 SGLKSEILWALNALTVLSFKEKDDLRRDTTPLAKVPGLLDALLQVIDDWRDIAMPKDHTK 137
Query: 134 GPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSG--SSAADSLVQKNAARVRSSEW 191
PR RTLGVN+ ++GFG E ++ + + S + ADS V K RS+ +
Sbjct: 138 PPRVRTLGVNTTLSGFGHE-----NVEKVYSDTTTPSDDQTKTADSTVTKK----RSAGF 188
Query: 192 WFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHV 251
FDE+GLFN+DDEGR EKQQCAV ASNIIRNFSFMP+NE +M QHRHCLETVFQC+ED
Sbjct: 189 LFDEEGLFNVDDEGRTEKQQCAVAASNIIRNFSFMPENETVMVQHRHCLETVFQCLEDQN 248
Query: 252 TEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGILGSPFKAWHCA 311
TED+EL+TN LET+VNLAP+LDLRIFSSSK S+IKIT EKRAV+AIMG+L S + WHCA
Sbjct: 249 TEDDELITNMLETLVNLAPVLDLRIFSSSKPSFIKIT-EKRAVQAIMGMLASSIRVWHCA 307
Query: 312 AAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRL 371
AAEL+GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA DAQAAA+ ALYN+AEVN+D RL
Sbjct: 308 AAELIGRLIINPDNEPFLLPAIPQIYKRLVDLLSVPAVDAQAAAISALYNVAEVNMDFRL 367
Query: 372 KLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSD 431
KLASERWA+DRLL+V+KTPHPVPEVCRKA+MI+ESLVSEPQNR+ LL +EN FAEIL S+
Sbjct: 368 KLASERWAVDRLLKVVKTPHPVPEVCRKASMIVESLVSEPQNRMHLLVHENTFAEILTSE 427
Query: 432 GRYSDTFARILYELTSRPNNKVASARGIWG 461
G+YSDTFARILYELT+RP+NKV + + IWG
Sbjct: 428 GKYSDTFARILYELTARPSNKVTAGQAIWG 457
>gi|226497310|ref|NP_001144242.1| uncharacterized protein LOC100277110 [Zea mays]
gi|195638962|gb|ACG38949.1| hypothetical protein [Zea mays]
Length = 456
Score = 635 bits (1637), Expect = e-179, Method: Compositional matrix adjust.
Identities = 306/447 (68%), Positives = 367/447 (82%), Gaps = 16/447 (3%)
Query: 21 AKRGRPFGSTSGSSGGSGS----AADSAAPTTLLGPSLQVHSSFADQNHKRIVLALQSGL 76
AKRGRPFGST+G + + D AP L+GPSLQV S+ +DQN+KRIVLALQSGL
Sbjct: 17 AKRGRPFGSTTGGGAAAAAAAAAVVDPGAPAALVGPSLQVLSALSDQNNKRIVLALQSGL 76
Query: 77 KSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSKGPR 136
KSE+ WALN LT+LSFKEKDD+R+DATPLAK+PGLLDALLQVID+W DI++PK+ +K PR
Sbjct: 77 KSEILWALNALTVLSFKEKDDLRRDATPLAKVPGLLDALLQVIDEWSDISMPKDHTKPPR 136
Query: 137 ARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSG--SSAADSLVQKNAARVRSSEWWFD 194
RTLG N+ ++GFG E ++ + + S S A DS V K R++ +WFD
Sbjct: 137 LRTLGANTTLSGFGQE-----NMEKVYSDTATTSNDQSKAEDSSVTKK----RAASFWFD 187
Query: 195 EDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTED 254
EDGLFN DDEGRAE+QQCA+ ASNIIRNFSFMP+NE+IM QHRHCLET+F C+E+ ED
Sbjct: 188 EDGLFNNDDEGRAERQQCAIAASNIIRNFSFMPENEIIMVQHRHCLETIFHCLENQNRED 247
Query: 255 EELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGILGSPFKAWHCAAAE 314
+EL+TN LET+VNLAP+LDLRIFSSSK S+IK+T EK A+ AIMG+L S K WHCAAAE
Sbjct: 248 DELITNMLETLVNLAPVLDLRIFSSSKPSFIKMT-EKGAIHAIMGMLSSSVKPWHCAAAE 306
Query: 315 LLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLA 374
L+GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA DAQAAAV ALYN+AEVN+DCRLKLA
Sbjct: 307 LIGRLIINPDNEPFLLPIIPQIYKRLVDLLSVPAXDAQAAAVSALYNVAEVNMDCRLKLA 366
Query: 375 SERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSDGRY 434
SERWA+DRLL+++KTPHPVPEVCRK +MILESLVSEPQNR+ LL +EN FAEIL S+G+Y
Sbjct: 367 SERWAVDRLLKIVKTPHPVPEVCRKTSMILESLVSEPQNRMHLLVHENTFAEILTSEGKY 426
Query: 435 SDTFARILYELTSRPNNKVASARGIWG 461
SDTFARI+YELT+RP+NKV S IWG
Sbjct: 427 SDTFARIVYELTARPSNKVTSGHAIWG 453
>gi|18403609|ref|NP_566721.1| leaf and flower related protein [Arabidopsis thaliana]
gi|9294188|dbj|BAB02090.1| unnamed protein product [Arabidopsis thaliana]
gi|18176265|gb|AAL60013.1| unknown protein [Arabidopsis thaliana]
gi|20465317|gb|AAM20062.1| unknown protein [Arabidopsis thaliana]
gi|332643182|gb|AEE76703.1| leaf and flower related protein [Arabidopsis thaliana]
Length = 460
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 345/464 (74%), Positives = 405/464 (87%), Gaps = 8/464 (1%)
Query: 1 MQKREQGKSGGAAGGAATPAAKRGRPFGSTSGSSGGSGSAADSAAPTT---LLGPSLQVH 57
MQKRE GKSGG +GG++ P AKRGRPFGSTS +S + +AA +A + LLGPSL VH
Sbjct: 1 MQKRELGKSGGNSGGSSGPPAKRGRPFGSTSANSAAAAAAAAAADAMSPSALLGPSLLVH 60
Query: 58 SSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
+SF +QN++RIVLALQSGLKSE+TWALNTLTLLSFKEK+D+R+D PLAKI GLLDALL
Sbjct: 61 NSFVEQNNRRIVLALQSGLKSEVTWALNTLTLLSFKEKEDIRRDVMPLAKIAGLLDALLL 120
Query: 118 VIDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADS 177
+IDDWRDIALPK+L++G R RTLG N+ VTGFG+E++AL SI G G GSSAA++
Sbjct: 121 IIDDWRDIALPKDLTRGTRVRTLGTNASVTGFGNEYDALASIQPP----GSGIGSSAAEA 176
Query: 178 LVQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
L +K+ + +SS+WW +EDGLFNLDDEGR+EKQ CA+ ASN+IRNFSFMPDNEV+MAQHR
Sbjct: 177 LGKKSTGKHQSSQWWMEEDGLFNLDDEGRSEKQMCAIAASNVIRNFSFMPDNEVVMAQHR 236
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAI 297
HCLETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS KQSYI I EK+AV+A+
Sbjct: 237 HCLETVFQCIHDHMTEDEELVTNSLETIVNLAHLMDLRIFSSLKQSYININ-EKKAVQAV 295
Query: 298 MGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVG 357
+GIL S KAW+CAAAELLGRLIINPDNEPF+ P +PQIHKRL+DL+S+ A DAQAAAVG
Sbjct: 296 VGILNSSVKAWNCAAAELLGRLIINPDNEPFISPLIPQIHKRLIDLLSIQAVDAQAAAVG 355
Query: 358 ALYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLL 417
ALYNL EVN+DCRLKLASERWA+DRLL+VIKTPHPVPEVCRKAAMILE+LVSEPQNR LL
Sbjct: 356 ALYNLVEVNMDCRLKLASERWAVDRLLKVIKTPHPVPEVCRKAAMILENLVSEPQNRGLL 415
Query: 418 LAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
LAYENAFAE+LF +G+YSD+FARILYELT+R N++VASARGIWG
Sbjct: 416 LAYENAFAELLFQEGKYSDSFARILYELTARSNSRVASARGIWG 459
>gi|21537098|gb|AAM61439.1| unknown [Arabidopsis thaliana]
Length = 460
Score = 634 bits (1634), Expect = e-179, Method: Compositional matrix adjust.
Identities = 346/464 (74%), Positives = 405/464 (87%), Gaps = 8/464 (1%)
Query: 1 MQKREQGKSGGAAGGAATPAAKRGRPFGSTSGSSGGSGSAADSAAPTT---LLGPSLQVH 57
MQKRE GKSGG +GG++ P AKRGRPFGSTS +S + +AA +A + LLGPSL VH
Sbjct: 1 MQKRELGKSGGNSGGSSGPPAKRGRPFGSTSANSAAAAAAAAAADAMSPSALLGPSLLVH 60
Query: 58 SSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
+SF +QN++RIVLALQSGLKSE+TWALNTLTLLSFKEK+D+R+D PLAKI GLLDALL
Sbjct: 61 NSFVEQNNRRIVLALQSGLKSEVTWALNTLTLLSFKEKEDIRRDVMPLAKIAGLLDALLL 120
Query: 118 VIDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADS 177
+IDDWRDIALPK+L++G R RTLG N+ VTGFG+E++AL SI G G GSSAA++
Sbjct: 121 IIDDWRDIALPKDLTRGTRVRTLGTNASVTGFGNEYDALASIQPP----GSGIGSSAAEA 176
Query: 178 LVQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
L +K+ + +SS+WW +EDGLFNLDDEGR+EKQ CA+ ASN+IRNFSFMPDNEV+MAQHR
Sbjct: 177 LGKKSTGKHQSSQWWMEEDGLFNLDDEGRSEKQMCAIAASNVIRNFSFMPDNEVVMAQHR 236
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAI 297
HCLETVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS KQSYI I EK+AV+A+
Sbjct: 237 HCLETVFQCIHDHMTEDEELVTNSLETIVNLAHLMDLRIFSSLKQSYININ-EKKAVQAV 295
Query: 298 MGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVG 357
+GIL S KAW+CAAAELLGRLIINPDNEPF+ P +PQIHKRLVDL+S+ A DAQAAAVG
Sbjct: 296 VGILDSSVKAWNCAAAELLGRLIINPDNEPFISPLIPQIHKRLVDLLSIQAVDAQAAAVG 355
Query: 358 ALYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLL 417
ALYNL EVN+DCRLKLASERWA+DRLL+VIKTPHPVPEVCRKAAMILE+LVSEPQNR LL
Sbjct: 356 ALYNLVEVNMDCRLKLASERWAVDRLLKVIKTPHPVPEVCRKAAMILENLVSEPQNRGLL 415
Query: 418 LAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
LAYENAFAE+LF +G+YSD+FARILYELT+R N++VASARGIWG
Sbjct: 416 LAYENAFAELLFQEGKYSDSFARILYELTARSNSRVASARGIWG 459
>gi|242065622|ref|XP_002454100.1| hypothetical protein SORBIDRAFT_04g024550 [Sorghum bicolor]
gi|241933931|gb|EES07076.1| hypothetical protein SORBIDRAFT_04g024550 [Sorghum bicolor]
Length = 457
Score = 633 bits (1632), Expect = e-179, Method: Compositional matrix adjust.
Identities = 304/445 (68%), Positives = 364/445 (81%), Gaps = 11/445 (2%)
Query: 21 AKRGRPFGSTSGSSGGSGS----AADSAAPTTLLGPSLQVHSSFADQNHKRIVLALQSGL 76
AKRGRPFGST+G + + D A L+GPSLQV S+ +DQN+KRIVLALQSGL
Sbjct: 17 AKRGRPFGSTTGGGAAAAAAAAAVVDPGASAALVGPSLQVLSALSDQNNKRIVLALQSGL 76
Query: 77 KSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSKGPR 136
KSE+ WALN LT+LSFKEKDD R+D TPLAK+PGLLDALLQVID+WRDI++PK+ K PR
Sbjct: 77 KSEILWALNALTVLSFKEKDDQRRDTTPLAKVPGLLDALLQVIDEWRDISMPKDHLKPPR 136
Query: 137 ARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNAARVRSSEWWFDED 196
RTLG N+ ++GFG E + + + + S DS V K R++ +WFDED
Sbjct: 137 VRTLGANTTLSGFGQ--ENMEKVYSDTATTSNNDQSKTEDSSVTKK----RAASFWFDED 190
Query: 197 GLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTEDEE 256
GLFN DDEGRAE+QQCA+ ASNIIRNFSFMP+NE+IM QHRHCLE++F C+ED ED+E
Sbjct: 191 GLFNNDDEGRAERQQCAIAASNIIRNFSFMPENEIIMVQHRHCLESIFHCLEDQNREDDE 250
Query: 257 LVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGILGSPFKAWHCAAAELL 316
L+TN +ET+VNLAP+LDLRIFSSSK S+IK+T EK AV AIMG+L S K WHCAAAEL+
Sbjct: 251 LITNMVETLVNLAPVLDLRIFSSSKPSFIKMT-EKGAVHAIMGMLSSSVKPWHCAAAELI 309
Query: 317 GRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASE 376
GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA+DAQAAAV ALYN+AEVN+DCRLKLASE
Sbjct: 310 GRLIINPDNEPFLLPVIPQIYKRLVDLLSVPAYDAQAAAVSALYNVAEVNMDCRLKLASE 369
Query: 377 RWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSDGRYSD 436
RWA+DRLL+++KTPHPVPEVCRK +MILESLVSEPQNR+ LL +EN FAEIL S+G+YSD
Sbjct: 370 RWAVDRLLKIVKTPHPVPEVCRKTSMILESLVSEPQNRMHLLVHENTFAEILTSEGKYSD 429
Query: 437 TFARILYELTSRPNNKVASARGIWG 461
TFARILYELT+RP+NKV S + IWG
Sbjct: 430 TFARILYELTARPSNKVTSGQAIWG 454
>gi|357142730|ref|XP_003572673.1| PREDICTED: uncharacterized protein LOC100822874 [Brachypodium
distachyon]
Length = 497
Score = 625 bits (1611), Expect = e-176, Method: Compositional matrix adjust.
Identities = 306/446 (68%), Positives = 367/446 (82%), Gaps = 15/446 (3%)
Query: 21 AKRGRPFGSTSGSSGGSGSAADSAAPTT---LLGPSLQVHSSFADQNHKRIVLALQSGLK 77
AKRGRPFGST+G + +AA P L+GPSLQV ++ +DQN+KRIVLALQSGLK
Sbjct: 59 AKRGRPFGSTTGGGAAAAAAAAIGDPAAPAALVGPSLQVLTALSDQNNKRIVLALQSGLK 118
Query: 78 SELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSKGPRA 137
SE+ WALN LT+LSFKEKDD+R+D TPLAK+PGLLDALLQVIDDWRDIA+PK+ +K PR
Sbjct: 119 SEILWALNALTVLSFKEKDDLRRDTTPLAKVPGLLDALLQVIDDWRDIAMPKDHTKPPRV 178
Query: 138 RTLGVNSLVTGFGSEFEALGSI--NNAFPRSGVGSGSSAADSLVQKNAARVRSSEWWFDE 195
RTLGVN+ ++GFG E +G + + A P + S DS + K RS+ + DE
Sbjct: 179 RTLGVNTTLSGFG--LENIGKVYSDTATPSN---DESKKEDSTITKK----RSAGFLLDE 229
Query: 196 DGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTEDE 255
+GLFN+DDEGR E+QQCAV ASNIIRNFSFMP+NE IM QHRHCLET+FQC+ED TED+
Sbjct: 230 EGLFNMDDEGRTERQQCAVAASNIIRNFSFMPENETIMVQHRHCLETIFQCLEDQNTEDD 289
Query: 256 ELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGILGSPFKAWHCAAAEL 315
EL+TN LET+VNLAP+LDLRIFSSSK S+I++T EK AV AIMG+L S KAWHCAAAEL
Sbjct: 290 ELITNMLETLVNLAPVLDLRIFSSSKPSFIQMT-EKSAVHAIMGMLASSVKAWHCAAAEL 348
Query: 316 LGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLAS 375
+GRLIINPDNE FLLP + QI+KRLVDL+S+PAFDAQAAAV ALYN++EVN+DCRLKLA
Sbjct: 349 IGRLIINPDNESFLLPVISQIYKRLVDLLSVPAFDAQAAAVSALYNVSEVNMDCRLKLAC 408
Query: 376 ERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSDGRYS 435
ERWAIDRLL+++K PHPVPEVCRKA +ILESLVSEPQN++ LL +EN FAEIL ++G+YS
Sbjct: 409 ERWAIDRLLKIVKAPHPVPEVCRKATVILESLVSEPQNKMHLLVHENTFAEILTTEGKYS 468
Query: 436 DTFARILYELTSRPNNKVASARGIWG 461
DTFARILYELT+RP+NKV S + IWG
Sbjct: 469 DTFARILYELTARPSNKVTSGQAIWG 494
>gi|326525347|dbj|BAK07943.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 465
Score = 620 bits (1598), Expect = e-175, Method: Compositional matrix adjust.
Identities = 300/443 (67%), Positives = 361/443 (81%), Gaps = 12/443 (2%)
Query: 23 RGRPFGSTSGSSGGSGSAADSAAPTT----LLGPSLQVHSSFADQNHKRIVLALQSGLKS 78
RGRPFGST+GS + +AA + L+GPSL V ++ +DQN+KRIVLALQSGLKS
Sbjct: 28 RGRPFGSTTGSGAAAAAAAAAVGDPAAPAALVGPSLHVLTALSDQNNKRIVLALQSGLKS 87
Query: 79 ELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSKGPRAR 138
E+ WALN LT+LSFKEKDD+R+D TPLAK+PGLLDALLQVIDDWRDIA+P++ +K PR R
Sbjct: 88 EILWALNALTVLSFKEKDDLRRDTTPLAKVPGLLDALLQVIDDWRDIAMPRDHTKPPRVR 147
Query: 139 TLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNAARVRSSEWWFDEDGL 198
TLGVN+ ++GFG E +G + + A ++ +K RS+ + FDEDGL
Sbjct: 148 TLGVNTTISGFG--LENVGKVYSDSTTPPNDQSKIEASTITKK-----RSAGFLFDEDGL 200
Query: 199 FNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTEDEELV 258
FN+DDEGR E+QQCAV ASNIIRNFSFMP+NE IM QHRHCLET+FQC+ED ED+EL+
Sbjct: 201 FNIDDEGRTERQQCAVAASNIIRNFSFMPENETIMVQHRHCLETIFQCLEDQNAEDDELI 260
Query: 259 TNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGILGSPFKAWHCAAAELLGR 318
TN LE +VNLAP+LDLRIFSSSK SYIK+ EK AV AIMG+L S KAWHCAAAEL+GR
Sbjct: 261 TNMLEALVNLAPVLDLRIFSSSKPSYIKMA-EKSAVHAIMGMLASSIKAWHCAAAELIGR 319
Query: 319 LIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASERW 378
LIINPDNE FL+P + QI++RLVDL+S+PAFDAQAAAV ALYN++EVN+DCRLKLASERW
Sbjct: 320 LIINPDNESFLVPVISQIYRRLVDLLSVPAFDAQAAAVSALYNVSEVNMDCRLKLASERW 379
Query: 379 AIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSDGRYSDTF 438
A+DRLL++IK PHPV EVCRKAA+ILESLVSEPQNR+ LL +EN FAEIL S+G+YSDTF
Sbjct: 380 AVDRLLKIIKAPHPVSEVCRKAAVILESLVSEPQNRMHLLVHENTFAEILTSEGKYSDTF 439
Query: 439 ARILYELTSRPNNKVASARGIWG 461
ARILYELT+RP+NKV S + IWG
Sbjct: 440 ARILYELTARPSNKVTSGQAIWG 462
>gi|222637437|gb|EEE67569.1| hypothetical protein OsJ_25084 [Oryza sativa Japonica Group]
Length = 410
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 293/411 (71%), Positives = 347/411 (84%), Gaps = 12/411 (2%)
Query: 53 SLQVHSSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLL 112
SL F+DQN+KRIVLALQSGLKSE+ WALN LT+LSFKEKDD+R+D TPLAK+PGLL
Sbjct: 7 SLCFFVDFSDQNNKRIVLALQSGLKSEILWALNALTVLSFKEKDDLRRDTTPLAKVPGLL 66
Query: 113 DALLQVIDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSG- 171
DALLQVIDDWRDIA+PK+ +K PR RTLGVN+ ++GFG E ++ + + S
Sbjct: 67 DALLQVIDDWRDIAMPKDHTKPPRVRTLGVNTTLSGFGHE-----NVEKVYSDTTTPSDD 121
Query: 172 -SSAADSLVQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNE 230
+ ADS V K RS+ + FDE+GLFN+DDEGR EKQQCAV ASNIIRNFSFMP+NE
Sbjct: 122 QTKTADSTVTKK----RSAGFLFDEEGLFNVDDEGRTEKQQCAVAASNIIRNFSFMPENE 177
Query: 231 VIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITRE 290
+M QHRHCLETVFQC+ED TED+EL+TN LET+VNLAP+LDLRIFSSSK S+IKIT E
Sbjct: 178 TVMVQHRHCLETVFQCLEDQNTEDDELITNMLETLVNLAPVLDLRIFSSSKPSFIKIT-E 236
Query: 291 KRAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFD 350
KRAV+AIMG+L S + WHCAAAEL+GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA D
Sbjct: 237 KRAVQAIMGMLASSIRVWHCAAAELIGRLIINPDNEPFLLPAIPQIYKRLVDLLSVPAVD 296
Query: 351 AQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSE 410
AQAAA+ ALYN+AEVN+D RLKLASERWA+DRLL+V+KTPHPVPEVCRKA+MI+ESLVSE
Sbjct: 297 AQAAAISALYNVAEVNMDFRLKLASERWAVDRLLKVVKTPHPVPEVCRKASMIVESLVSE 356
Query: 411 PQNRVLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
PQNR+ LL +EN FAEIL S+G+YSDTFARILYELT+RP+NKV + + IWG
Sbjct: 357 PQNRMHLLVHENTFAEILTSEGKYSDTFARILYELTARPSNKVTAGQAIWG 407
>gi|297831016|ref|XP_002883390.1| hypothetical protein ARALYDRAFT_479809 [Arabidopsis lyrata subsp.
lyrata]
gi|297329230|gb|EFH59649.1| hypothetical protein ARALYDRAFT_479809 [Arabidopsis lyrata subsp.
lyrata]
Length = 395
Score = 610 bits (1574), Expect = e-172, Method: Compositional matrix adjust.
Identities = 312/400 (78%), Positives = 360/400 (90%), Gaps = 5/400 (1%)
Query: 62 DQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDD 121
+QN++RIVLALQSGLKSE+TWALNTLTLLSFKEKDD+R+D TPLAKI GLLDALL +IDD
Sbjct: 1 EQNNRRIVLALQSGLKSEVTWALNTLTLLSFKEKDDIRRDVTPLAKISGLLDALLLIIDD 60
Query: 122 WRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQK 181
WRDIALPK+L++G R RTLG N+ VTGFG+E++AL SI P SG+GS SAA+ L +K
Sbjct: 61 WRDIALPKDLTRGTRVRTLGTNASVTGFGNEYDALASIQ--LPGSGIGS--SAAEGLGKK 116
Query: 182 NAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLE 241
+ + +SS+WW +EDGLFNLDDEGR+EKQ CA+ ASN+IRNFSFMPDNEV+MAQHRHCLE
Sbjct: 117 SGGKHQSSQWWMEEDGLFNLDDEGRSEKQMCAIAASNVIRNFSFMPDNEVVMAQHRHCLE 176
Query: 242 TVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGIL 301
TVFQCI DH+TEDEELVTN+LETIVNLA L+DLRIFSS KQSYI I EK+AV+A++GIL
Sbjct: 177 TVFQCIHDHMTEDEELVTNSLETIVNLAHLMDLRIFSSLKQSYININ-EKKAVQAVVGIL 235
Query: 302 GSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYN 361
S KAW+CAAAELLGRLIINPDNEPF+ P +PQIHKRLVDL+S+ A DAQAAAVGALYN
Sbjct: 236 DSSVKAWNCAAAELLGRLIINPDNEPFISPLIPQIHKRLVDLLSIQAVDAQAAAVGALYN 295
Query: 362 LAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLAYE 421
L EVN+DCRLKLASERWA+DRLL+VIKTPHPVPEVCRKAAMILE+LVSEPQNR LLLAYE
Sbjct: 296 LVEVNMDCRLKLASERWAVDRLLKVIKTPHPVPEVCRKAAMILENLVSEPQNRGLLLAYE 355
Query: 422 NAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
NAFAE+LF DG+YSD+FARILYELT+R N++VASARGIWG
Sbjct: 356 NAFAELLFQDGKYSDSFARILYELTARSNSRVASARGIWG 395
>gi|294461396|gb|ADE76259.1| unknown [Picea sitchensis]
Length = 399
Score = 581 bits (1498), Expect = e-163, Method: Compositional matrix adjust.
Identities = 288/401 (71%), Positives = 338/401 (84%), Gaps = 5/401 (1%)
Query: 62 DQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDD 121
DQN KRIV+ALQSGLKSELTWALNTLT+LSFKEK+D+RKDATPLAK+PG+LDALLQV+DD
Sbjct: 2 DQNSKRIVMALQSGLKSELTWALNTLTVLSFKEKEDLRKDATPLAKLPGVLDALLQVVDD 61
Query: 122 WRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQK 181
WRDIA P+E +K R R LG N + TGFGSE+E + S N+ P S + S S A S
Sbjct: 62 WRDIAHPREAAKMTRPRVLGANCVFTGFGSEYE-ITSPND--PLSRIRSVSDALTSEDTG 118
Query: 182 NAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLE 241
N + R S+WW+DE+GLFNLD+EGRAE+QQCAV ASN+IRNFSFMPDNE +MAQHRHCLE
Sbjct: 119 NGTKSRGSDWWYDEEGLFNLDEEGRAERQQCAVAASNVIRNFSFMPDNETVMAQHRHCLE 178
Query: 242 TVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGIL 301
T+ QC+EDH TEDEELVTNA+ETI NLAP L+LRIFSSSK S+ IT+ KRAVEAI+G+L
Sbjct: 179 TIIQCMEDHDTEDEELVTNAIETIANLAPFLNLRIFSSSKPSHAPITK-KRAVEAIVGML 237
Query: 302 GSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYN 361
SP KAWHCAAAEL+GRLI+NPDNEPF+LPF PQ++KRLVDL+S P DAQAAAV ALYN
Sbjct: 238 ESPVKAWHCAAAELVGRLIVNPDNEPFVLPFAPQVYKRLVDLLSFPGADAQAAAVAALYN 297
Query: 362 LAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEV-CRKAAMILESLVSEPQNRVLLLAY 420
LAEVN+DCRL+LA+ERWAI RLL++I++PHP+ EV CRKAA+ +ESL EPQNR LL Y
Sbjct: 298 LAEVNMDCRLRLANERWAIGRLLKIIQSPHPLSEVLCRKAALTIESLACEPQNRAQLLTY 357
Query: 421 ENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
EN FAEI FSD R SDTFARILYELTSR N+K+ + RG+WG
Sbjct: 358 ENKFAEIAFSDARLSDTFARILYELTSRANSKLGAVRGVWG 398
>gi|302802885|ref|XP_002983196.1| hypothetical protein SELMODRAFT_117806 [Selaginella moellendorffii]
gi|300148881|gb|EFJ15538.1| hypothetical protein SELMODRAFT_117806 [Selaginella moellendorffii]
Length = 462
Score = 509 bits (1312), Expect = e-142, Method: Compositional matrix adjust.
Identities = 247/413 (59%), Positives = 321/413 (77%), Gaps = 12/413 (2%)
Query: 49 LLGPSLQVHSSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKI 108
L GP++QV +FA++N KRIV+ALQSGLKSEL+WAL +L +LSFKEKDD RKDA L KI
Sbjct: 61 LTGPTVQVQLTFAEKNSKRIVMALQSGLKSELSWALTSLNVLSFKEKDDGRKDA--LVKI 118
Query: 109 PGLLDALLQVIDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGV 168
PGLLDALL VID+WRDI+ + K R RTLG++ TGFG E+E + + +
Sbjct: 119 PGLLDALLHVIDEWRDISYSSDSKKSARKRTLGLHRPTTGFGLEYEINTHHDPLYRMRTM 178
Query: 169 GSGSSAADSLVQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPD 228
+ + + +KNA +WW+DE+GLFNLD+ GRAE+QQCAV ASN++RNFSF+P+
Sbjct: 179 PERNPESVTDEKKNA------DWWWDEEGLFNLDEIGRAERQQCAVAASNVLRNFSFIPE 232
Query: 229 NEVIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKIT 288
NE+ MAQHRHCLET+ C++DH TEDEELVTNA+ETI+NLA + LRIF+ + I
Sbjct: 233 NEIHMAQHRHCLETLVCCMQDHNTEDEELVTNAVETILNLATFVVLRIFTDPSKGRIT-- 290
Query: 289 REKRAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPA 348
E+ AV+AI+ +L SP K WHC+AAELLGRL++NP+NEP+LLPF QI+KRLVD+++ PA
Sbjct: 291 -EQAAVQAIVTMLESPIKPWHCSAAELLGRLVVNPENEPYLLPFATQIYKRLVDILNFPA 349
Query: 349 FDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLV 408
D+QAAA+ ALYN AEVN+DCRL+LA+ERWA+ RLLR+++TPHP+ EV RKAA+ LESL
Sbjct: 350 SDSQAAAIAALYNFAEVNMDCRLRLANERWAVGRLLRIVQTPHPLQEVVRKAALTLESLA 409
Query: 409 SEPQNRVLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
SEP+NR +LLAYE+ FAE+ +D R SD FARIL+EL+SR +K +SARG+WG
Sbjct: 410 SEPENRSILLAYEHTFAELALTDARTSDMFARILWELSSRTGSK-SSARGVWG 461
>gi|302812016|ref|XP_002987696.1| hypothetical protein SELMODRAFT_126434 [Selaginella moellendorffii]
gi|300144588|gb|EFJ11271.1| hypothetical protein SELMODRAFT_126434 [Selaginella moellendorffii]
Length = 462
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 246/413 (59%), Positives = 320/413 (77%), Gaps = 12/413 (2%)
Query: 49 LLGPSLQVHSSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKI 108
L GP++QV +FA++N KRIV+ALQSGLKSEL+WAL +L +LSFKEKDD RKDA L KI
Sbjct: 61 LTGPTVQVQLTFAEKNSKRIVMALQSGLKSELSWALTSLNVLSFKEKDDGRKDA--LVKI 118
Query: 109 PGLLDALLQVIDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGV 168
PGLLDALL VID+WRDI+ + K R RTLG++ TGFG E+E + + +
Sbjct: 119 PGLLDALLHVIDEWRDISYSSDSKKSARKRTLGLHRPTTGFGLEYEINTHHDPLYRMRTM 178
Query: 169 GSGSSAADSLVQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPD 228
+ + + +KN +WW+DE+GLFNLD+ GRAE+QQCAV ASN++RNFSF+P+
Sbjct: 179 PERNPESVTDEKKNV------DWWWDEEGLFNLDEIGRAERQQCAVAASNVLRNFSFIPE 232
Query: 229 NEVIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKIT 288
NE+ MAQHRHCLET+ C++DH TEDEELVTNA+ETI+NLA + LRIF+ + I
Sbjct: 233 NEIHMAQHRHCLETLVCCMQDHNTEDEELVTNAVETILNLATFVVLRIFTDPSKGRIT-- 290
Query: 289 REKRAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPA 348
E+ AV+AI+ +L SP K WHC+AAELLGRL++NP+NEP+LLPF QI+KRLVD+++ PA
Sbjct: 291 -EQAAVQAIVTMLESPIKPWHCSAAELLGRLVVNPENEPYLLPFATQIYKRLVDILNFPA 349
Query: 349 FDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLV 408
D+QAAA+ ALYN AEVN+DCRL+LA+ERWA+ RLLR+++TPHP+ EV RKAA+ LESL
Sbjct: 350 SDSQAAAIAALYNFAEVNMDCRLRLANERWAVGRLLRIVQTPHPLQEVVRKAALTLESLA 409
Query: 409 SEPQNRVLLLAYENAFAEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
SEP+NR +LLAYE+ FAE+ +D R SD FARIL+EL+SR +K +SARG+WG
Sbjct: 410 SEPENRSILLAYEHTFAELALTDARTSDMFARILWELSSRTGSK-SSARGVWG 461
>gi|413922859|gb|AFW62791.1| hypothetical protein ZEAMMB73_079942, partial [Zea mays]
Length = 365
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 239/359 (66%), Positives = 289/359 (80%), Gaps = 16/359 (4%)
Query: 21 AKRGRPFGSTSGSSGGSGS----AADSAAPTTLLGPSLQVHSSFADQNHKRIVLALQSGL 76
AKRGRPFGST+G + + D AP L+GPSLQV S+ +DQN+KRIVLALQSGL
Sbjct: 17 AKRGRPFGSTTGGGAAAAAAAAAVVDPGAPAALVGPSLQVLSALSDQNNKRIVLALQSGL 76
Query: 77 KSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSKGPR 136
KSE+ WALN LT+LSFKEKDD+R+DATPLAK+PGLLDALLQVID+W DI++PK+ +K PR
Sbjct: 77 KSEILWALNALTVLSFKEKDDLRRDATPLAKVPGLLDALLQVIDEWSDISMPKDHTKPPR 136
Query: 137 ARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSG--SSAADSLVQKNAARVRSSEWWFD 194
RTLG N+ ++GFG E ++ + + S S A DS V K R++ +WFD
Sbjct: 137 LRTLGANTTLSGFGQE-----NMEKVYSDTATTSNDQSKAEDSSVTKK----RAASFWFD 187
Query: 195 EDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTED 254
EDGLFN DDEGRAE+QQCA+ ASNIIRNFSFMP+NE+IM QHRHCLET+F C+E+ ED
Sbjct: 188 EDGLFNNDDEGRAERQQCAIAASNIIRNFSFMPENEIIMVQHRHCLETIFHCLENQNRED 247
Query: 255 EELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGILGSPFKAWHCAAAE 314
+EL+TN LET+VNLAP+LDLRIFSSSK S+IK+T EK A+ AIMG+L S K WHCAAAE
Sbjct: 248 DELITNMLETLVNLAPVLDLRIFSSSKPSFIKMT-EKGAIHAIMGMLSSSVKPWHCAAAE 306
Query: 315 LLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKL 373
L+GRLIINPDNEPFLLP +PQI+KRLVDL+S+PA+DAQAAAV ALYN+AEVN+DCRLKL
Sbjct: 307 LIGRLIINPDNEPFLLPIIPQIYKRLVDLLSVPAYDAQAAAVSALYNVAEVNMDCRLKL 365
>gi|168029124|ref|XP_001767076.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681572|gb|EDQ67997.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 466
Score = 460 bits (1183), Expect = e-127, Method: Compositional matrix adjust.
Identities = 250/457 (54%), Positives = 318/457 (69%), Gaps = 23/457 (5%)
Query: 22 KRGRPFGSTSGSSG--------GSGSAADSAAPTTLLGPSLQVHSSFADQNHKRIVLALQ 73
KRGRP +G++ SG A +P LGPSLQV +FADQ+ KRIV+ALQ
Sbjct: 14 KRGRPATVNAGNNANVSGHLENASGGTAHQPSPGLNLGPSLQVVYNFADQHTKRIVMALQ 73
Query: 74 SGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSK 133
SGLK EL WA+N LTLLSFKEKDD RKDATPLAK+PGLLD+LLQVI +WRDIA + +K
Sbjct: 74 SGLKGELAWAINALTLLSFKEKDDSRKDATPLAKVPGLLDSLLQVISEWRDIAHERVFAK 133
Query: 134 GPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQ--KNAARVRSSE- 190
R R LG + TGFG E+E P + +AA+ VQ V + E
Sbjct: 134 LSRPRLLGADQPYTGFGLEYEIYS------PDDAMLHIRAAAEKNVQGGDGCDAVETKEG 187
Query: 191 WWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDH 250
W +DE+G+FNLD+ GR EKQ CAV SN++RNFSFMP+NE MAQHR CLET+ +E H
Sbjct: 188 WCWDEEGIFNLDEIGRHEKQICAVAVSNVLRNFSFMPENEASMAQHRGCLETLITVLEAH 247
Query: 251 VTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKIT----REKRAVEAIMGILGSPFK 306
TEDEELVTN++ET++NLA L +IF+++ + T EKR ++AIM +L P
Sbjct: 248 ETEDEELVTNSIETLLNLATFLCFKIFTNNGNNKASNTGSSMSEKRMIDAIMMMLSCPVI 307
Query: 307 AWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAF-DAQAAAVGALYNLAEV 365
+WHC AAELLGRL++NPDNEPFL+PFV QI RLV+L+S A DA AAAV AL+N +E+
Sbjct: 308 SWHCCAAELLGRLVVNPDNEPFLMPFVSQIFPRLVELLSTEASGDALAAAVAALHNFSEI 367
Query: 366 NVDCRLKLASERWAIDRLLRVIKTPHPVP-EVCRKAAMILESLVSEPQNRVLLLAYENAF 424
N +CRL LASERWA+ RLLRV+ + + P EVCRKAA+ LE+L SEPQNR ++L+YE++
Sbjct: 368 NTECRLGLASERWAVARLLRVVGSGNHYPQEVCRKAALTLENLASEPQNRAVMLSYESSI 427
Query: 425 AEILFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
AE+ D R SD FARIL+EL+S ++K+ S RG+WG
Sbjct: 428 AELAVVDQRMSDVFARILWELSSGISSKMGSLRGVWG 464
>gi|168048771|ref|XP_001776839.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162671843|gb|EDQ58389.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 249/454 (54%), Positives = 312/454 (68%), Gaps = 20/454 (4%)
Query: 22 KRGRPFGSTSGSSG--------GSGSAADSAAPTTLLGPSLQVHSSFADQNHKRIVLALQ 73
KRGRP + +G+S + AA +P LGPSLQV +FA+Q+ KRIV+ALQ
Sbjct: 14 KRGRPATANAGNSANHVGHVENATAGAAHQPSPGPNLGPSLQVSHNFAEQHTKRIVMALQ 73
Query: 74 SGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSK 133
SGLK EL WA+N LTLLSFKEKDD RKDATPLAK+PGLLD+LLQVI +WRDIA + +K
Sbjct: 74 SGLKGELAWAINALTLLSFKEKDDTRKDATPLAKVPGLLDSLLQVISEWRDIAHERMFAK 133
Query: 134 GPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNAARVRSSEWWF 193
R R LG + TGFG E E P + +AA+ Q A W +
Sbjct: 134 LSRPRLLGADQPYTGFGLEHEIY------IPDDALLHVRAAAERNAQGADATETKEGWCW 187
Query: 194 DEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTE 253
DE+GLFNLD+ GR EKQ CAV SN++RN SFMP+NE MAQHR CLET+ +EDH TE
Sbjct: 188 DEEGLFNLDEIGRHEKQICAVAVSNVLRNMSFMPENESFMAQHRGCLETLITVLEDHETE 247
Query: 254 DEELVTNALETIVNLAPLLDLRIFSSSKQSYIKI----TREKRAVEAIMGILGSPFKAWH 309
DEELVTN+ ET++NLAP L L+IF+++ + +KR ++AIM +L SP +WH
Sbjct: 248 DEELVTNSTETLLNLAPFLCLKIFTNNGNNKANNPGSSMSDKRMIDAIMTMLSSPLISWH 307
Query: 310 CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLP-AFDAQAAAVGALYNLAEVNVD 368
C AAELLGRL++NPDNE +LPFVP I RLV+L+S + DA AAAV AL N +E+N D
Sbjct: 308 CNAAELLGRLVVNPDNESSILPFVPLIFPRLVELISTEVSGDALAAAVAALLNFSEINTD 367
Query: 369 CRLKLASERWAIDRLLRVIKT-PHPVPEVCRKAAMILESLVSEPQNRVLLLAYENAFAEI 427
CRL+LA ERWA+ RLLRV+ + H EVCRKAA+ LESL SEPQNR++LLAYE+ AE+
Sbjct: 368 CRLRLAGERWAVGRLLRVVGSWNHHPQEVCRKAALTLESLASEPQNRIVLLAYESLIAEL 427
Query: 428 LFSDGRYSDTFARILYELTSRPNNKVASARGIWG 461
D R SD FARIL+EL+S ++K+ S RG+WG
Sbjct: 428 AVVDQRMSDVFARILWELSSGVSSKMGSIRGVWG 461
>gi|452823220|gb|EME30232.1| ARID/BRIGHT DNA binding domain containing protein [Galdieria
sulphuraria]
Length = 610
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 172/387 (44%), Gaps = 48/387 (12%)
Query: 65 HKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRD 124
++VLALQSG++ E++WAL T+ +LSF K D + P LL++L +
Sbjct: 265 RNKLVLALQSGIEDEISWALVTINVLSFDPKLDFL-----IRDYPMLLESL--------E 311
Query: 125 IALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNAA 184
I L + L RT G++ + + G+ N + + + A + V N +
Sbjct: 312 IILKEYLEDLNGCRTFGLHP----DDEDARSSGAKNKMLSAVDINTSNVIAHNNVC-NPS 366
Query: 185 RVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVF 244
+ + D +F +++ A +N +RN SF+ N A+H L T
Sbjct: 367 LQNYGDLFCTRDPIF-------LKREWNAKCVANALRNLSFIERNHPSFAKHVSLLRT-- 417
Query: 245 QCIEDHVTED--EELVTNALETIVNLAPLLDLRIFSSSK---QSYIKITREKRAVEAIMG 299
CI V+ + ++V + +T+ N+A +++++ + S I I E
Sbjct: 418 -CISIIVSSEAASQVVYDLSDTLKNIA--VEVKLNQDTLFLLDSAIAILYEWEDCTDDTR 474
Query: 300 ILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGAL 359
IL A EL+ RL NPDNE LL +I LV +S D + AVGA+
Sbjct: 475 IL---------KATELIARLCFNPDNELELLKRFDEILSLLVGFLSSENKDIRLVAVGAI 525
Query: 360 YNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRKAAMILESLVSEPQNRVLLLA 419
N + + + R+ +A + L + + P+ P +A+ + +L P NR +LL
Sbjct: 526 CNASAFDWNARVAIAKTPSVVYYLTQCLSDPYIAP----LSAITIANLAEAPSNRAILLR 581
Query: 420 YENAFAEILFSDGRYSDTFARILYELT 446
YE+ F + S+ S+ A L EL+
Sbjct: 582 YESKFVHVAMSNSAASELVACALKELS 608
>gi|403345100|gb|EJY71909.1| ARID/BRIGHT DNA binding domain containing protein [Oxytricha
trifallax]
Length = 673
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 102/449 (22%), Positives = 181/449 (40%), Gaps = 88/449 (19%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
KR+VLA +S + E+ WALNTLT+ S + + P + + + L+ I + +
Sbjct: 240 KRLVLAFESRITKEINWALNTLTIFSCNTNQNFTLENQPYL-LESISNYLVYCIQNIESL 298
Query: 126 ALPKELSKGPRARTLGVNS----LVTGFGSEFEALGS----------------------- 158
+ K + ++ V S L++G S ++LG
Sbjct: 299 SYSDPFEKKSKILSVNVPSYVDVLMSG-QSNMQSLGDHQNISQHQQFGMQKLDYKNFGQF 357
Query: 159 ------------------INNAFPRSGVGSGSSAADSLVQKNAARVRSSEWWFDEDGLFN 200
I+ + +GSG++ + ++ + AR R + E+ N
Sbjct: 358 TKEIRKREGKYEEKKKELIDEFVAQQQIGSGNALSSNMPK---ARGRKPKIKILEEQ--N 412
Query: 201 LDDEGRAEKQQCAVGASN----------------IIRNFSFMPDNEVIMAQHRHCLETVF 244
DE R ++++ I+RN SF+ NE + + L+ V
Sbjct: 413 ALDELRKKRKKLVTVLHQEVTELELIDHLRMIILIVRNLSFIRANEHHLIKCFKLLDIVT 472
Query: 245 QCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGILG-- 302
D + D E+ N L+ I NL L L S + RE V+A+ G+
Sbjct: 473 SLFVDLI--DREITFNCLDIITNLGKHLVL--------SELNCGRE--LVDALFGLFSTA 520
Query: 303 -SPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYN 361
S C E L RL ++ NE +L + + L++L+ + + + L
Sbjct: 521 DSELIVDQCV--ECLRRLSLSAGNEEYLEEVRDEDIQNLINLLMSNNIETREGCLEILCT 578
Query: 362 LAEVNVDCRLKLASERWAIDRLLRVIKTPHPVP---EVCRKAAMILESLVSEPQNRVLLL 418
+++ ++K+A ++ I+RL+ +I T P ++ + AA+ L +L P NR L++
Sbjct: 579 ISDRKTTLKVKIAHQKKCIERLIGLIATGSQTPNEEKISKLAALTLANLNLAPSNRALIV 638
Query: 419 AYENAFAEILFSDGRYSDTFARILYELTS 447
YE A I SD + A IL +L S
Sbjct: 639 PYEQELALIAASDEKTCKIIAEILGDLDS 667
>gi|119578292|gb|EAW57888.1| AT rich interactive domain 2 (ARID, RFX-like), isoform CRA_a [Homo
sapiens]
Length = 500
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 86/334 (25%), Positives = 146/334 (43%), Gaps = 50/334 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWE-SLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 310
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 311 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 367
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 368 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 417
Query: 355 AVGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 418 TLEVLYMLTEMGDVACT-KIAKVEKSIDMLVCLV 450
>gi|327272934|ref|XP_003221239.1| PREDICTED: AT-rich interactive domain-containing protein 2-like
[Anolis carolinensis]
Length = 1839
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 143/332 (43%), Gaps = 46/332 (13%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVEDNEVRDLISDRNK 255
Query: 184 ARVRSSEWWFDEDGL-----FNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
++ +SE W E L ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SQDSTSEDWIWESLLHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+L L DN + +V Q +K ++ ++LP +A
Sbjct: 369 FLKMRG----------MEILANLCKAEDNGVLICEYVDQESYKEIICHLTLPDVLLVISA 418
Query: 356 VGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGEVACTKIAKVDKSIDTLVCLV 450
>gi|126340157|ref|XP_001367064.1| PREDICTED: AT-rich interactive domain-containing protein 2
[Monodelphis domestica]
Length = 1836
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 144/333 (43%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + KN
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDKNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
++ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SQENTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQESYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|354498260|ref|XP_003511233.1| PREDICTED: AT-rich interactive domain-containing protein 2-like
[Cricetulus griseus]
Length = 1890
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 139/332 (41%), Gaps = 46/332 (13%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 219 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHAMQLEKDP--KIITLLLANAGVFDD-- 274
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E+ + + + D + +N
Sbjct: 275 ---------------TLG--SFSTVFGEEWREKTDRDFVKFWKDIVDDNEVRDLISDRNK 317
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
A +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 318 AHEDTSGEWLWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 373
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 374 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 430
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 431 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVTST 480
Query: 356 VGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ K+A +ID L+ ++
Sbjct: 481 LEVLYMLTEMGDIACTKIAKVEKSIDMLVCLV 512
>gi|301606475|ref|XP_002932851.1| PREDICTED: AT-rich interactive domain-containing protein 2-like
[Xenopus (Silurana) tropicalis]
Length = 1815
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 143/331 (43%), Gaps = 45/331 (13%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S V FG+E++ + + + D + K+
Sbjct: 213 ---------------TLGSFSAV--FGNEWKEKTDRDFVQFWKDIVEDNEVRDLIYDKST 255
Query: 184 ARVRSSE--W--WFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHC 239
++ S E W F ++D Q AV I+RN SF N ++A +R C
Sbjct: 256 SQGSSPESLWEALFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEESNVKLLAANRTC 311
Query: 240 LETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRI--FSSSKQSYIKITREKRAVEAI 297
L + H +L L+T+ N+A L L F ++ + IT+ + +
Sbjct: 312 LRFLLLSAHSHFISLRQL---GLDTLGNIAAELQLDPVDFKTTHLMFHTITKCLMSRDRF 368
Query: 298 MGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAAV 356
+ + G E+LG L DN + +V Q +K ++ +SLP + +
Sbjct: 369 LKMRG----------MEILGNLCKAEDNAVLICEYVDQDSYKEIICHLSLPDVLLVISTL 418
Query: 357 GALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
LY L E+ +K+A+ +ID L+ +I
Sbjct: 419 EVLYMLTELGEVACVKIANVERSIDTLVCLI 449
>gi|395538926|ref|XP_003771425.1| PREDICTED: AT-rich interactive domain-containing protein 2
[Sarcophilus harrisii]
Length = 1766
Score = 58.5 bits (140), Expect = 7e-06, Method: Composition-based stats.
Identities = 86/330 (26%), Positives = 142/330 (43%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 110 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 165
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + KN
Sbjct: 166 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDKNK 208
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
++ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 209 SQENTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 263
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 264 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 320
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q +K ++ ++LP +
Sbjct: 321 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQESYKEIICHLTLPDVLLVIS 370
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 371 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 400
>gi|58012117|gb|AAU20329.2| ARID2 [Homo sapiens]
Length = 1113
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVACT-KIAKVEKSIDMLVCLV 450
>gi|392349703|ref|XP_345868.4| PREDICTED: AT-rich interactive domain-containing protein 2 [Rattus
norvegicus]
Length = 1826
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 138/332 (41%), Gaps = 46/332 (13%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E+ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWREKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
A + W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 AHEDTPGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEESNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVTST 418
Query: 356 VGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDIACTKIAKVEKSIDMLVCLV 450
>gi|109096289|ref|XP_001092151.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Macaca
mulatta]
Length = 1905
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 226 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 281
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 282 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 324
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 325 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 380
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 381 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 437
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 438 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 487
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 488 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 519
>gi|344266737|ref|XP_003405436.1| PREDICTED: AT-rich interactive domain-containing protein 2-like
[Loxodonta africana]
Length = 2367
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 687 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 742
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 743 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 785
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 786 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 841
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 842 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 898
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 899 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 948
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 949 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 980
>gi|426224635|ref|XP_004006474.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Ovis
aries]
Length = 1835
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|432114543|gb|ELK36391.1| AT-rich interactive domain-containing protein 2 [Myotis davidii]
Length = 1935
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 229 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 284
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 285 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 327
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 328 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 383
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 384 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 440
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 441 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 490
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 491 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 522
>gi|297691617|ref|XP_002823175.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Pongo
abelii]
Length = 1836
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|402885721|ref|XP_003906296.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Papio
anubis]
Length = 1842
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 163 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 218
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 219 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 261
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 262 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 317
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 318 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 374
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 375 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 424
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 425 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 456
>gi|297474617|ref|XP_002687369.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Bos
taurus]
gi|296487772|tpg|DAA29885.1| TPA: brahma associated protein 170kD-like [Bos taurus]
Length = 1834
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|395841720|ref|XP_003793681.1| PREDICTED: AT-rich interactive domain-containing protein 2
[Otolemur garnettii]
Length = 1819
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 147 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 202
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 203 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 245
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 246 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 301
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 302 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 358
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 359 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 408
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 409 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 440
>gi|417406770|gb|JAA50029.1| Putative transcriptional regulator [Desmodus rotundus]
Length = 1835
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|410046813|ref|XP_003313802.2| PREDICTED: LOW QUALITY PROTEIN: AT-rich interactive
domain-containing protein 2 [Pan troglodytes]
Length = 1846
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 168 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 223
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 224 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 266
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 267 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 322
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 323 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 379
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 380 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 429
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 430 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 461
>gi|109482813|ref|XP_001059099.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Rattus
norvegicus]
Length = 1847
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 138/332 (41%), Gaps = 46/332 (13%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 178 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 233
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E+ + + + D + +N
Sbjct: 234 ---------------TLG--SFSTVFGEEWREKTDRDFVKFWKDIVDDNEVRDLISDRNK 276
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
A + W F ++D Q AV I+RN SF N ++A +R
Sbjct: 277 AHEDTPGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEESNVKLLAANRT 332
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 333 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 389
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 390 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVTST 439
Query: 356 VGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ K+A +ID L+ ++
Sbjct: 440 LEVLYMLTEMGDIACTKIAKVEKSIDMLVCLV 471
>gi|426372262|ref|XP_004053045.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Gorilla
gorilla gorilla]
Length = 1834
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|56549668|ref|NP_689854.2| AT-rich interactive domain-containing protein 2 [Homo sapiens]
gi|73921721|sp|Q68CP9.2|ARID2_HUMAN RecName: Full=AT-rich interactive domain-containing protein 2;
Short=ARID domain-containing protein 2; AltName:
Full=BRG1-associated factor 200; Short=BAF200; AltName:
Full=Zinc finger protein with activation potential;
AltName: Full=Zipzap/p200
gi|119578294|gb|EAW57890.1| AT rich interactive domain 2 (ARID, RFX-like), isoform CRA_c [Homo
sapiens]
gi|162317620|gb|AAI56212.1| AT rich interactive domain 2 (ARID, RFX-like) [synthetic construct]
gi|225000202|gb|AAI72461.1| AT rich interactive domain 2 (ARID, RFX-like) [synthetic construct]
gi|261858004|dbj|BAI45524.1| AT rich interactive domain containing protein 2 [synthetic
construct]
Length = 1835
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|397510864|ref|XP_003825805.1| PREDICTED: LOW QUALITY PROTEIN: AT-rich interactive
domain-containing protein 2 [Pan paniscus]
Length = 1835
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|351700357|gb|EHB03276.1| AT-rich interactive domain-containing protein 2 [Heterocephalus
glaber]
Length = 1815
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|73334458|gb|AAZ74794.1| zipzap protein [Homo sapiens]
Length = 1835
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|119578293|gb|EAW57889.1| AT rich interactive domain 2 (ARID, RFX-like), isoform CRA_b [Homo
sapiens]
Length = 1793
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|119578295|gb|EAW57891.1| AT rich interactive domain 2 (ARID, RFX-like), isoform CRA_d [Homo
sapiens]
Length = 1788
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|301783599|ref|XP_002927214.1| PREDICTED: AT-rich interactive domain-containing protein 2-like
[Ailuropoda melanoleuca]
gi|281351998|gb|EFB27582.1| hypothetical protein PANDA_016976 [Ailuropoda melanoleuca]
Length = 1836
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|30722343|emb|CAD91164.1| hypothetical protein [Homo sapiens]
Length = 1686
Score = 55.8 bits (133), Expect = 4e-05, Method: Composition-based stats.
Identities = 85/330 (25%), Positives = 142/330 (43%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 8 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 63
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG SL T FG E++ + + + D + +N
Sbjct: 64 ---------------TLG--SLSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 106
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 107 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 161
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 162 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 218
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 219 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 268
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 269 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 298
>gi|291392403|ref|XP_002712720.1| PREDICTED: AT rich interactive domain 2 (ARID, RFX-like)
[Oryctolagus cuniculus]
Length = 1837
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 83/334 (24%), Positives = 144/334 (43%), Gaps = 48/334 (14%)
Query: 63 QNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDW 122
+++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 159 KDYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHAMQLEKDP--KIITLLLANAGVFDD- 215
Query: 123 RDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKN 182
TLG S T FG E++ + + + D + +N
Sbjct: 216 ----------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRN 257
Query: 183 AARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ ++ W F ++D Q AV I+RN SF N ++A +R
Sbjct: 258 KSHDGTAGDWIWESLFHPPRKLGINDIEGQRVLQVAV----ILRNLSFEEGNVKLLAANR 313
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 314 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 370
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 371 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 420
Query: 355 AVGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 421 TLEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 453
>gi|118150542|ref|NP_001071231.1| AT-rich interactive domain-containing protein 2 [Danio rerio]
gi|117558565|gb|AAI27386.1| AT rich interactive domain 2 (ARID, RFX-like) [Danio rerio]
Length = 1570
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 144/330 (43%), Gaps = 41/330 (12%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P K+ LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHAMQLEKEP--KLVTLLLAHAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
+LG S V FG++++ S + V + D + K+
Sbjct: 213 ---------------SLGSFSAV--FGTDWQEKTSRDFVRFWKDVIEDNEVKDLISDKSC 255
Query: 184 ARVRSSEWWFDEDGLFNLDDE---GRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCL 240
E LF+ G E Q+ + + I+RN SF N ++A +R CL
Sbjct: 256 TPQNGLELRDICTPLFHPARNTGIGDVEGQR-VLQVAMILRNLSFEEANIKLLAANRTCL 314
Query: 241 ETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRI--FSSSKQSYIKITREKRAVEAIM 298
+ C + +L L+T+ N A L L F ++ + IT+
Sbjct: 315 RFLLLCAHCTFIQLRQL---GLDTLGNFAGELQLDPVDFRTTHLMFHTITK--------- 362
Query: 299 GILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAAVG 357
L S + A E+LG L DN + +V Q ++ ++ L++LP +A+
Sbjct: 363 -CLLSRDRFLKMRAMEILGNLSKVEDNGVLICEYVDQESYREIIALLTLPDVMLVISALE 421
Query: 358 ALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
LY LAE+ K++S +ID L+R++
Sbjct: 422 VLYQLAELGEITCSKISSVERSIDLLVRLV 451
>gi|262231796|ref|NP_780460.3| AT rich interactive domain 2 (ARID, RFX-like) [Mus musculus]
Length = 1828
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 141/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S + FG E+ + + + D + +N
Sbjct: 213 ---------------TLG--SFSSVFGEEWREKTDRDFVKFWKDIVDDNEVRDLISDRNK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
A + W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 AHEDTPGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEESNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFRTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVTST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDVLVCLV 450
>gi|403301730|ref|XP_003941536.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Saimiri
boliviensis boliviensis]
Length = 1807
Score = 55.5 bits (132), Expect = 6e-05, Method: Composition-based stats.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 128 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 183
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 184 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 226
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 227 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 281
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 282 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 338
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 339 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 388
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 389 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 418
>gi|296211402|ref|XP_002752404.1| PREDICTED: AT-rich interactive domain-containing protein 2
[Callithrix jacchus]
Length = 1852
Score = 55.1 bits (131), Expect = 7e-05, Method: Composition-based stats.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 173 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 228
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 229 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 271
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 272 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 326
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 327 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 383
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 384 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 433
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 434 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 463
>gi|440895279|gb|ELR47517.1| AT-rich interactive domain-containing protein 2, partial [Bos
grunniens mutus]
Length = 1738
Score = 55.1 bits (131), Expect = 7e-05, Method: Composition-based stats.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 62 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 117
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 118 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 160
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 161 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 215
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 216 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 272
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 273 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 322
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 323 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 352
>gi|348580727|ref|XP_003476130.1| PREDICTED: AT-rich interactive domain-containing protein 2-like
[Cavia porcellus]
Length = 1784
Score = 55.1 bits (131), Expect = 7e-05, Method: Composition-based stats.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 110 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 165
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 166 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 208
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 209 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 263
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 264 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 320
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 321 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 370
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 371 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 400
>gi|431901419|gb|ELK08445.1| AT-rich interactive domain-containing protein 2 [Pteropus alecto]
Length = 1836
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 143/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + ++
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRSK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SHEGTSGEWIWEPLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450
>gi|358412203|ref|XP_596055.6| PREDICTED: AT-rich interactive domain-containing protein 2 [Bos
taurus]
Length = 1685
Score = 55.1 bits (131), Expect = 8e-05, Method: Composition-based stats.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 8 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 63
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 64 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 106
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 107 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 161
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 162 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 218
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 219 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 268
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 269 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 298
>gi|335288684|ref|XP_003355674.1| PREDICTED: AT-rich interactive domain-containing protein 2 [Sus
scrofa]
Length = 1685
Score = 55.1 bits (131), Expect = 8e-05, Method: Composition-based stats.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 8 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 63
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 64 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 106
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 107 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 161
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 162 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 218
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 219 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 268
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 269 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 298
>gi|338726331|ref|XP_003365302.1| PREDICTED: AT-rich interactive domain-containing protein 2 isoform
2 [Equus caballus]
gi|338726333|ref|XP_003365303.1| PREDICTED: AT-rich interactive domain-containing protein 2 isoform
3 [Equus caballus]
Length = 1687
Score = 55.1 bits (131), Expect = 8e-05, Method: Composition-based stats.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 8 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 63
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 64 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 106
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 107 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 161
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 162 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 218
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 219 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 268
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 269 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 298
>gi|345792211|ref|XP_003433601.1| PREDICTED: AT-rich interactive domain-containing protein 2 isoform
1 [Canis lupus familiaris]
Length = 1687
Score = 54.7 bits (130), Expect = 1e-04, Method: Composition-based stats.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 8 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 63
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 64 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 106
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 107 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 161
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 162 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 218
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 219 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 268
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 269 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 298
>gi|355564152|gb|EHH20652.1| hypothetical protein EGK_03551 [Macaca mulatta]
Length = 1687
Score = 54.7 bits (130), Expect = 1e-04, Method: Composition-based stats.
Identities = 84/330 (25%), Positives = 141/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 8 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 63
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 64 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 106
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 107 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 161
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 162 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 218
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 219 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 268
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 269 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 298
>gi|224093694|ref|XP_002196572.1| PREDICTED: AT-rich interactive domain-containing protein 2
[Taeniopygia guttata]
Length = 1825
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 140/332 (42%), Gaps = 46/332 (13%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S V FG E++ + + D + ++
Sbjct: 213 ---------------TLGSFSAV--FGDEWKEKTDRDFVKFWKDIVEDIEVRDLISDRSK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
++ SE W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SQDIPSEEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+L L DN + +V Q +K ++ ++LP +A
Sbjct: 369 FLKMRG----------MEILANLCKAEDNGVLICEYVDQESYKEIICHLTLPDVLLVISA 418
Query: 356 VGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ K+A +ID L+ ++
Sbjct: 419 LEVLYMLTEMGEVACSKIAKVEKSIDTLVCLV 450
>gi|384244726|gb|EIE18224.1| hypothetical protein COCSUDRAFT_49334 [Coccomyxa subellipsoidea
C-169]
Length = 1019
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 4/70 (5%)
Query: 52 PSLQVHSSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGL 111
P L V + A Q + +++AL+SGL + WALN L L SF + ++ L +PGL
Sbjct: 581 PFLLVPQTLAPQQMRELIMALKSGLPECVPWALNALALASFSARPELPS----LPSLPGL 636
Query: 112 LDALLQVIDD 121
L ALLQV+ D
Sbjct: 637 LPALLQVLRD 646
>gi|118082251|ref|XP_416046.2| PREDICTED: AT-rich interactive domain-containing protein 2 [Gallus
gallus]
Length = 1830
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 140/332 (42%), Gaps = 46/332 (13%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S V FG E++ + + D + ++
Sbjct: 213 ---------------TLGSFSAV--FGEEWKEKTDRDFVKFWRDIVEDIEVRDLISDRSK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
++ SE W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SQDIPSEDWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+L L DN + +V Q +K ++ ++LP +A
Sbjct: 369 FLKMRG----------MEILANLCKAEDNGVLICEYVDQESYKEIICHLTLPDVLLVISA 418
Query: 356 VGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ K++ +ID L+ ++
Sbjct: 419 LEVLYMLTEMGEVACTKISKVEKSIDTLVCLV 450
>gi|326911459|ref|XP_003202076.1| PREDICTED: AT-rich interactive domain-containing protein 2-like
[Meleagris gallopavo]
Length = 1831
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 140/332 (42%), Gaps = 46/332 (13%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S V FG E++ + + D + ++
Sbjct: 213 ---------------TLGSFSAV--FGEEWKEKTDRDFVKFWRDIVEDIEVRDLISDRSK 255
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
++ SE W F ++D Q AV I+RN SF N ++A +R
Sbjct: 256 SQDIPSEDWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+L L DN + +V Q +K ++ ++LP +A
Sbjct: 369 FLKMRG----------MEILANLCKAEDNGVLICEYVDQESYKEIICHLTLPDVLLVISA 418
Query: 356 VGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ K++ +ID L+ ++
Sbjct: 419 LEVLYMLTEMGEVACTKISKVEKSIDTLVCLV 450
>gi|51491251|emb|CAH18689.1| hypothetical protein [Homo sapiens]
Length = 1756
Score = 52.4 bits (124), Expect = 5e-04, Method: Composition-based stats.
Identities = 83/330 (25%), Positives = 140/330 (42%), Gaps = 48/330 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SG +E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 130 DYNKLVLSLLSGPPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 185
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
TLG S T FG E++ + + + D + +N
Sbjct: 186 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 228
Query: 184 ARVRSSEWWFDEDGLFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
+ +S W E LF+ ++D Q AV I+RN SF N ++A +R
Sbjct: 229 SHEGTSGEWIWES-LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANR 283
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVE 295
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 284 TCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRD 340
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 341 RFLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIS 390
Query: 355 AVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ LY L E+ K+A +ID L+
Sbjct: 391 TLEVLYMLTEMGDVACTKIAKVEKSIDMLV 420
>gi|388856609|emb|CCF49726.1| uncharacterized protein [Ustilago hordei]
Length = 1062
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 13/75 (17%)
Query: 45 APTTLLGPSLQVHSSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATP 104
AP L GPS RI+L+L+SGL S++ WAL+ L LL+ +D +D T
Sbjct: 135 APYLLPGPS------------NRILLSLKSGLPSQIDWALSRLVLLTANHRDQAARDFT- 181
Query: 105 LAKIPGLLDALLQVI 119
+PGL DALL +
Sbjct: 182 FDSVPGLADALLSYV 196
>gi|170580091|ref|XP_001895110.1| ARID/BRIGHT DNA binding domain containing protein [Brugia malayi]
gi|158598040|gb|EDP36026.1| ARID/BRIGHT DNA binding domain containing protein [Brugia malayi]
Length = 1125
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 97/395 (24%), Positives = 156/395 (39%), Gaps = 57/395 (14%)
Query: 1 MQKREQGKSGGAAGGAAT---PAAKRGRPFGSTSGSSGGSGSAADSAAPTTLLGPSLQVH 57
+ K EQ + G A + + RG+ F S + + + A+ L P +
Sbjct: 122 LSKYEQNELVGEVDDADSDLLSSRSRGKGFSSLATADCPISTGQRQASEYFRLRPEKK-- 179
Query: 58 SSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
+ + RIV +L SGL +E+ +A+N TLLS +R A P +I LL A L
Sbjct: 180 ----EAEYDRIVKSLLSGLPNEVDFAVNVCTLLSHPGPRVLRLVAAP--QIITLLVAHLA 233
Query: 118 VIDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADS 177
+ D G RA S G F A SG+ D
Sbjct: 234 IFPD------------GDRAFYDLYKSWELVSGHNFIAF------------WSGAGITDD 269
Query: 178 LVQKNAARVRSSEWWFDEDGLF-NLDDEGRAEK------QQCAVGASNIIRNFSFMPDNE 230
V K ++ S +E +F LD E + QQ + IIRN SF N+
Sbjct: 270 EVLKLIPHIKPSMVSEEESNIFCGLDTEFQPRNVVSWRVQQVLL----IIRNLSFEVINK 325
Query: 231 VIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITRE 290
++A L+ +F C + L T AL+ + N+A +DL SS +++
Sbjct: 326 AVLAASWPLLKFLFICSN---CKWSMLRTAALDALSNIACEIDLMAEESSSTNHL----- 377
Query: 291 KRAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAF 349
++ + L S K A E+L L N NE + F+ I ++ ++S+
Sbjct: 378 --LLKTVSCCLHSDDKFRVIRALEILSGLCNNERNESLICEFLDHRILSKIFTVISVKDI 435
Query: 350 DAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ +LY ++E+ +L+ AID L+
Sbjct: 436 MMCVYTLESLYQISELGATACYQLSRFPHAIDTLV 470
>gi|324506200|gb|ADY42654.1| ARID domain-containing protein C08B11.3 [Ascaris suum]
Length = 478
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 151/370 (40%), Gaps = 57/370 (15%)
Query: 30 TSGSSGGSGSAADSAAPTTLLGPSLQVHSSF-------ADQNHKRIVLALQSGLKSELTW 82
+ G G S A + P ++ S Q + F + ++R+V +L GL +E+ +
Sbjct: 130 SRGRGKGFSSLATADCPVSV--ASRQTNDFFRVRPEKKTEAEYERLVKSLLCGLPNEVDF 187
Query: 83 ALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSKGPRARTLGV 142
A+N TLLS +R A P ++ LL A + V D G
Sbjct: 188 AVNVCTLLSHPGPRVLRLSAAP--QLITLLVAHVAVFAD-------------------GD 226
Query: 143 NSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNAARVRSSEWWFDEDGLFN-L 201
NSL + S +A G AF R G+ D + V SE ++ LF L
Sbjct: 227 NSLYDLYESWSKASGRDFIAFWR-----GAGIEDEEILALLPNVTRSEMPKEDLELFTGL 281
Query: 202 DDEGRAEK------QQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTEDE 255
+ E R QQ +I+RN SF N+ IMA L+ +F C +
Sbjct: 282 EAEFRPRDPVSWRVQQIL----SIMRNLSFEVINKPIMAASWPLLKFLFVCSN---CKWS 334
Query: 256 ELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGILGSPFKAWHCAAAEL 315
L T AL+ + N+A +DL SS +++ + R L S K + E+
Sbjct: 335 ALRTAALDALSNIASEIDLMCEESSVTNHLLLKTVSRC-------LNSSDKLQVIRSLEI 387
Query: 316 LGRLIINPDNEPFLLPFV-PQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLA 374
L L N NE L F+ +I ++ D++++ + ALY ++E+ +++
Sbjct: 388 LAGLCNNEHNESLLCEFLDTRILSKIFDVITVKDIMMCVYTLEALYQISELGASACQQVS 447
Query: 375 SERWAIDRLL 384
AID L+
Sbjct: 448 LYPRAIDTLV 457
>gi|402588804|gb|EJW82737.1| arid/bright DNA binding domain-containing protein, partial
[Wuchereria bancrofti]
Length = 556
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 96/395 (24%), Positives = 155/395 (39%), Gaps = 57/395 (14%)
Query: 1 MQKREQGKSGGAAGGAAT---PAAKRGRPFGSTSGSSGGSGSAADSAAPTTLLGPSLQVH 57
+ K EQ + G A + + RG+ F S + + + A+ P +
Sbjct: 122 LSKYEQNELVGEVDDADSDLLSSRSRGKGFSSLATADCPISTGQRQASEYFRFRPEKK-- 179
Query: 58 SSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
+ + RIV +L SGL +E+ +A+N TLLS +R A P +I LL A L
Sbjct: 180 ----EAEYDRIVKSLLSGLPNEVDFAVNVCTLLSHPGPRVLRLVAAP--QIITLLVAHLA 233
Query: 118 VIDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADS 177
+ D G RA S G F A S G+ D
Sbjct: 234 IFPD------------GDRAFYDLYKSWELVSGHNFIAFWS------------GAGITDD 269
Query: 178 LVQKNAARVRSSEWWFDEDGLF-NLDDEGRAEK------QQCAVGASNIIRNFSFMPDNE 230
V K ++ S +E +F LD E + QQ +IIRN SF N+
Sbjct: 270 EVLKLIPHIKPSTVSEEESNIFCGLDAEFQPRNVVSWRVQQVL----SIIRNLSFEVINK 325
Query: 231 VIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITRE 290
++A L+ +F C + L T AL+ + N+A +DL SS +++
Sbjct: 326 AVLAASWPLLKFLFICSN---CKWSMLRTAALDALSNIACEIDLMAEESSSTNHL----- 377
Query: 291 KRAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAF 349
++ + L S K A E+L L N NE + F+ I ++ ++S+
Sbjct: 378 --LLKTVSCCLHSDDKFRVIRALEILSGLCNNERNESLICEFLDHRILSKIFTVISVKDI 435
Query: 350 DAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLL 384
+ +LY ++E+ +L+ AID L+
Sbjct: 436 MMCVYTLESLYQISELGATACYQLSRFPHAIDTLV 470
>gi|410927538|ref|XP_003977198.1| PREDICTED: AT-rich interactive domain-containing protein 2-like,
partial [Takifugu rubripes]
Length = 1416
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 92/334 (27%), Positives = 148/334 (44%), Gaps = 49/334 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ D P K+ LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAVNVCTLLSNESKHSMQLDKDP--KLVTLLLAHAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQK-N 182
+LG S V FG++++ S + V S + + K N
Sbjct: 213 ---------------SLGSFSRV--FGTDWKERSSRDFVRFWKEVVEDSEVRELIWDKSN 255
Query: 183 AARVRSS--EWW---FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
AA+ +S E W F + D Q A+ I+RN SF N ++A +R
Sbjct: 256 AAQDGTSCEERWHSLFHPPRTHGIGDMEAQRVLQIAI----ILRNLSFEEANVKLLAANR 311
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRI--FSSSKQSYIKITREKRAVE 295
CL + C ++ +L L+T+ N+A L L F ++ + IT+
Sbjct: 312 TCLRFLLLCSHCNLISLRQL---GLDTLGNVAAELQLDPVDFRTTHLIFHTITK------ 362
Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAA 354
L S + A E+LG L DN + +V Q ++ + L++LP A
Sbjct: 363 ----CLMSRDRFLKMRAMEILGNLSKVDDNAVLICEYVDQDSYREVTMLLTLPDLMLLMA 418
Query: 355 AVGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY LA++ V C K+A +ID L+R++
Sbjct: 419 CLEVLYLLAQLGEVPCS-KIACVDRSIDLLVRLV 451
>gi|312066844|ref|XP_003136463.1| hypothetical protein LOAG_00875 [Loa loa]
Length = 1092
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 137/335 (40%), Gaps = 56/335 (16%)
Query: 65 HKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDD--- 121
+ RIV +L SGL +E+ +A+N TLLS +R A P +I LL A L + D
Sbjct: 169 YDRIVKSLLSGLPNEVDFAVNVCTLLSHPGPRVLRLVAAP--QIITLLVAHLAIFPDDDR 226
Query: 122 -WRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQ 180
+ D+ EL G F A S G+ D V
Sbjct: 227 AFYDLYKSWELVSG----------------HNFIAFWS------------GAGITDDEVL 258
Query: 181 KNAARVRSSEWWFDEDGLF-NLDDEGRAEK------QQCAVGASNIIRNFSFMPDNEVIM 233
K +R S +E +F LD E + QQ +IIRN SF N+ ++
Sbjct: 259 KLIPHIRPSMISEEESNIFCGLDTEFQPRNVVSWRVQQVL----SIIRNLSFEVINKAVL 314
Query: 234 AQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRA 293
A L+ +F C + L T AL+ + N+A +DL SS +++
Sbjct: 315 AASWPLLKFLFICSS---CKWSMLRTAALDALSNIACEIDLMAEESSSTNHL-------L 364
Query: 294 VEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVP-QIHKRLVDLMSLPAFDAQ 352
++ + L S K A E+L L N NE + F+ +I ++ ++S+
Sbjct: 365 LKTVSCCLHSDDKFRVIRALEILSGLCNNERNESLICEFLDYRILSKIFTVISVKDIMMC 424
Query: 353 AAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ +LY ++E+ +L+ AID L+ ++
Sbjct: 425 VYTLESLYQISELGATACYQLSRFPHAIDTLVSLV 459
>gi|393911788|gb|EFO27606.2| hypothetical protein LOAG_00875 [Loa loa]
Length = 1108
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 137/335 (40%), Gaps = 56/335 (16%)
Query: 65 HKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDD--- 121
+ RIV +L SGL +E+ +A+N TLLS +R A P +I LL A L + D
Sbjct: 169 YDRIVKSLLSGLPNEVDFAVNVCTLLSHPGPRVLRLVAAP--QIITLLVAHLAIFPDDDR 226
Query: 122 -WRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQ 180
+ D+ EL G F A S G+ D V
Sbjct: 227 AFYDLYKSWELVSG----------------HNFIAFWS------------GAGITDDEVL 258
Query: 181 KNAARVRSSEWWFDEDGLF-NLDDEGRAEK------QQCAVGASNIIRNFSFMPDNEVIM 233
K +R S +E +F LD E + QQ +IIRN SF N+ ++
Sbjct: 259 KLIPHIRPSMISEEESNIFCGLDTEFQPRNVVSWRVQQVL----SIIRNLSFEVINKAVL 314
Query: 234 AQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRA 293
A L+ +F C + L T AL+ + N+A +DL SS +++
Sbjct: 315 AASWPLLKFLFICSS---CKWSMLRTAALDALSNIACEIDLMAEESSSTNHL-------L 364
Query: 294 VEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVP-QIHKRLVDLMSLPAFDAQ 352
++ + L S K A E+L L N NE + F+ +I ++ ++S+
Sbjct: 365 LKTVSCCLHSDDKFRVIRALEILSGLCNNERNESLICEFLDYRILSKIFTVISVKDIMMC 424
Query: 353 AAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ +LY ++E+ +L+ AID L+ ++
Sbjct: 425 VYTLESLYQISELGATACYQLSRFPHAIDTLVSLV 459
>gi|332206942|ref|XP_003252554.1| PREDICTED: AT-rich interactive domain-containing protein 2
[Nomascus leucogenys]
Length = 1964
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 140/333 (42%), Gaps = 48/333 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L S L +E +A++ +LLS + K M+ L K P ++ LL
Sbjct: 285 DYNKLVLSLLSELPNEADFAIHVCSLLSNESKHVMQ-----LEKDPKIITLLLN------ 333
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
S G TLG S T FG E++ + + + D + +N
Sbjct: 334 --------SAGITGVTLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 383
Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
+ +S W F ++D Q AV I+RN SF N ++A +R
Sbjct: 384 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 439
Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
CL + H +L L+T+ N+A LLD F ++ + +T+ + +
Sbjct: 440 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 496
Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
+ + G E+LG L DN + +V Q ++ ++ ++LP +
Sbjct: 497 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 546
Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ LY L E+ +V C K+A +ID L+ ++
Sbjct: 547 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 578
>gi|312075050|ref|XP_003140244.1| calcium-binding protein [Loa loa]
Length = 1198
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 43/76 (56%), Gaps = 6/76 (7%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
KRI +AL+SGL++E+TWALN L ++ + D T L PGLL+ L++ +
Sbjct: 465 KRITMALRSGLETEITWALNALNIMLY----DDYAPPTILNHTPGLLNVLVEHFRALLSL 520
Query: 126 ALPK--ELSKGPRART 139
PK E+ +ART
Sbjct: 521 LYPKVFEVGAESKART 536
>gi|393909659|gb|EJD75540.1| CBR-LSS-4 protein, partial [Loa loa]
Length = 1582
Score = 47.4 bits (111), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 43/76 (56%), Gaps = 6/76 (7%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
KRI +AL+SGL++E+TWALN L ++ + D T L PGLL+ L++ +
Sbjct: 849 KRITMALRSGLETEITWALNALNIMLY----DDYAPPTILNHTPGLLNVLVEHFRALLSL 904
Query: 126 ALPK--ELSKGPRART 139
PK E+ +ART
Sbjct: 905 LYPKVFEVGAESKART 920
>gi|146181973|ref|XP_001023719.2| ARID/BRIGHT DNA binding domain containing protein [Tetrahymena
thermophila]
gi|146143978|gb|EAS03474.2| ARID/BRIGHT DNA binding domain containing protein [Tetrahymena
thermophila SB210]
Length = 651
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 90/418 (21%), Positives = 170/418 (40%), Gaps = 51/418 (12%)
Query: 63 QNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATP--LAKIPGLLDALLQVID 120
Q+ KRI LA +S ++ E+T+ALN L L S ++ L I L+ +++ +
Sbjct: 231 QDIKRISLAFESRIQDEITFALNQLLLYSVNSMHQFTLESYNHLLEGIVQYLEEIIKNVP 290
Query: 121 DWRDIALPKELSKGPRA-RTLGVNSLVTGFG-----SEFEALGSINNA-FPRSGVGSGSS 173
I KE+ PR + G S + G F+ L ++N FP + +S
Sbjct: 291 SLNKILQLKEIK--PRTLQEYGGTSDYSYSGPPPTKEAFKELCILDNYNFPICDIVLFNS 348
Query: 174 AADSLVQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIM 233
V +R + + + + L + E +Q + I+RNFSF NEV M
Sbjct: 349 QKKKEVSP-IVFLRKRKRY-NVENLKEMAGEVYLLEQTRTIFL--ILRNFSFTKSNEVYM 404
Query: 234 AQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRA 293
A++ L+ + Q + D ++ L+ + N++ + L + I K
Sbjct: 405 AKNERLLQIMIQLF--ILNSDSQITKYILDILANISKQIQL----------MNIPDYKLF 452
Query: 294 VEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQA 353
E + +L + A +++ LI +NE + F+P+ ++ LM D +
Sbjct: 453 CERLFELLNADTSDELEACVDIIRNLIAIQENESIIENFLPKYIDQVTRLMLQAQGDIRE 512
Query: 354 AAVGALYNLAEVNVDCRLKLASERWAIDRLL-----RVIKTPHPVP-------------- 394
+ L L+++ + R++LA I R++ V+K +P
Sbjct: 513 GILEFLCFLSDLRMATRVQLARHPKLIMRMVGLLSSGVMKQNRQLPSHLQQQQHNQQGEG 572
Query: 395 -----EVCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSDGRYSDTFARILYELTS 447
++ + AA+ L ++ P +R L +E + +D + + IL EL +
Sbjct: 573 ERNSEKITKLAALTLSNISLAPLSRAYLKPFERDLFVVASTDETVTKYISNILSELMN 630
>gi|432942120|ref|XP_004082969.1| PREDICTED: AT-rich interactive domain-containing protein 2-like
[Oryzias latipes]
Length = 1520
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 89/331 (26%), Positives = 152/331 (45%), Gaps = 44/331 (13%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ D P K+ LL A V DD
Sbjct: 99 DYSKLVLSLLSGLPNEVDFAVNVCTLLSNESKHVMQLDKDP--KLVTLLLAHAGVFDD-- 154
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNA-FPRSGVGSGSSAADSLVQKN 182
+LG S V FGS+++ S + F R V + L+
Sbjct: 155 ---------------SLGSFSGV--FGSDWKEKTSRDFIRFWREVVED--TEVRELIWDK 195
Query: 183 AARVRSSEWWFDEDGLFNLD-DEGRAEKQ-QCAVGASNIIRNFSFMPDNEVIMAQHRHCL 240
+ + LF+ ++G ++ + Q + + I+RN SF N ++A +R CL
Sbjct: 196 SGLTQDGTCGDRWQSLFHPPRNQGISDMEAQRVLQVAVILRNLSFEEANVKLLAANRTCL 255
Query: 241 ETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRI--FSSSKQSYIKITREKRAVEAIM 298
+ C ++ +L L+T+ N+A L L F ++ + IT+
Sbjct: 256 RFLLLCAHCNLISLRQL---GLDTLGNVAAELQLDPVDFRTTHLIFHTITK--------- 303
Query: 299 GILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAAVG 357
L S + A E+LG L DN + +V Q ++ ++ L++LP A++
Sbjct: 304 -CLMSRDRFLKMRAMEILGNLSKADDNSVLICEYVDQESYREVMMLLTLPDLMLLMASLE 362
Query: 358 ALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
LY LA++ V C K+AS +ID L+R++
Sbjct: 363 VLYLLAQLGEVSCS-KIASVDHSIDLLVRLV 392
>gi|402594686|gb|EJW88612.1| hypothetical protein WUBG_00478 [Wuchereria bancrofti]
Length = 1272
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 6/76 (7%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
KRI +AL+SGL++E TWALN L ++ + D T L PGLL+ L++ +
Sbjct: 538 KRITMALRSGLETEATWALNALNVMLY----DDYAPPTILNHTPGLLNVLVEHFRALLSL 593
Query: 126 ALPK--ELSKGPRART 139
PK E+ +ART
Sbjct: 594 LYPKVFEVGAESKART 609
>gi|343426508|emb|CBQ70037.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 1060
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVI 119
RI+L+L+SGL S++ WAL+ L L+ +D ++ T L +PGL DALL +
Sbjct: 147 RILLSLKSGLPSQIDWALSRLVQLTSNHRDQAAREFT-LDNVPGLADALLSYV 198
>gi|170584350|ref|XP_001896964.1| hypothetical protein [Brugia malayi]
gi|158595653|gb|EDP34192.1| conserved hypothetical protein [Brugia malayi]
Length = 1204
Score = 45.4 bits (106), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 37/69 (53%), Gaps = 4/69 (5%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
KRI +AL+SGL++E TWALN L ++ + D T L PGLL+ L++ +
Sbjct: 474 KRITMALRSGLETEXTWALNALNVMLY----DDYAPPTMLNHTPGLLNVLVEHFRALLSL 529
Query: 126 ALPKELSKG 134
PK G
Sbjct: 530 LYPKVFEVG 538
>gi|405967809|gb|EKC32936.1| AT-rich interactive domain-containing protein 2 [Crassostrea gigas]
Length = 1873
Score = 45.4 bits (106), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 149/338 (44%), Gaps = 54/338 (15%)
Query: 57 HSSFADQNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALL 116
HS F HK + AL SGL +E+ +A+N TLLS + + + L K LL L+
Sbjct: 201 HSDF----HK-LEKALLSGLPNEVDFAINVCTLLSSESRKSLL-----LRKAQNLLQLLM 250
Query: 117 QVIDDWRDIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGS--GSSA 174
I + D E R L ++ + + L ++NN R V + G S
Sbjct: 251 AHIGIFSDDHGTYEEVYEREWRKLSNSNFIHFW------LNTVNNETIRGLVHTSHGYSR 304
Query: 175 ADSLVQK--NAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVI 232
+ L Q+ N + EDGL D EG+ Q AV IIRN SF +N
Sbjct: 305 KELLGQEILNLGQ---------EDGLH--DREGQ-RVMQLAV----IIRNLSFEEENMKF 348
Query: 233 MAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKR 292
M+++ + C+ +L AL+++ N+A + + ++ S++
Sbjct: 349 MSENDLVFRFLMLCVHSSYGSLRQL---ALDSLGNIA---EKFVLTTDDHSHL------- 395
Query: 293 AVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDA 351
++ + L S K E+L +L + NE L + + ++ +V LM++ FD
Sbjct: 396 VLQLLKHCLSSDDKYEIVRGLEILSKLCLLDQNEDMLTDRLEEPLYADIVRLMNV--FDI 453
Query: 352 Q--AAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
Q ++ ALY L+E+ K+A+ + A+D L+ ++
Sbjct: 454 QIIVYSLEALYQLSELGEHTTTKIAAVKHAVDLLVSLL 491
>gi|393213224|gb|EJC98721.1| hypothetical protein FOMMEDRAFT_129050 [Fomitiporia mediterranea
MF3/22]
Length = 668
Score = 45.1 bits (105), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 40/70 (57%), Gaps = 10/70 (14%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDW--RD 124
R++L+L+SGL SE+TWAL L LS D L+ IPGL+DAL + +W R+
Sbjct: 33 RMLLSLRSGLDSEVTWALERLCRLSC-------NDQFQLSSIPGLVDALFE-WPEWFLRE 84
Query: 125 IALPKELSKG 134
LP LS
Sbjct: 85 YGLPSTLSSS 94
>gi|71022359|ref|XP_761409.1| hypothetical protein UM05262.1 [Ustilago maydis 521]
gi|46101278|gb|EAK86511.1| hypothetical protein UM05262.1 [Ustilago maydis 521]
Length = 1059
Score = 45.1 bits (105), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 35/53 (66%), Gaps = 1/53 (1%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVI 119
RI+L+L+SGL S++ WAL+ L L+ +D ++ T L +PGL DALL +
Sbjct: 150 RILLSLKSGLPSQIDWALSRLVSLTSGHRDQASREFT-LDSVPGLTDALLSYV 201
>gi|405952221|gb|EKC20059.1| Trithorax group protein osa [Crassostrea gigas]
Length = 2566
Score = 45.1 bits (105), Expect = 0.077, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 35/51 (68%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R+++ L+SGL +E TWAL+TL +L + DD LA +PGLL+ALL+
Sbjct: 1920 RVMMGLKSGLLAESTWALDTLNILLY---DDATVQYFNLAHLPGLLEALLE 1967
>gi|198423504|ref|XP_002123117.1| PREDICTED: similar to AT rich interactive domain 1A (Swi1 like),
partial [Ciona intestinalis]
Length = 451
Score = 45.1 bits (105), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
++++AL+SGL E TWALN LT+L + ++ + L ++PGLLDAL++
Sbjct: 218 KVIMALKSGLLVESTWALNVLTVLLYDQQTIAQFK---LVQLPGLLDALVE 265
>gi|348536584|ref|XP_003455776.1| PREDICTED: AT-rich interactive domain-containing protein 2
[Oreochromis niloticus]
Length = 1690
Score = 44.7 bits (104), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 147/338 (43%), Gaps = 57/338 (16%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ D P K+ LL A V DD
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAVNVCTLLSNESKHAMQLDKDP--KLVTLLLAHAGVFDD-- 212
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGS----------INNAFPRSGVGSGSS 173
+LG S V FG++++ S + +A R + S+
Sbjct: 213 ---------------SLGSFSGV--FGTDWKEKTSRDFVRFWKEVVEDAEVRELIWDKSN 255
Query: 174 AADSLVQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIM 233
A Q + + F + D Q AV I+RN SF N ++
Sbjct: 256 PA----QDGTSCAERWQSLFHPPRTAGISDMEAQRVLQIAV----ILRNLSFEEANVKLL 307
Query: 234 AQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRI--FSSSKQSYIKITREK 291
A +R CL + C ++ +L L+T+ N+A L L F ++ + IT+
Sbjct: 308 AANRTCLRFLLLCAHCNLISLRQL---GLDTLGNVAAELQLDPVDFRTTHLIFHTITK-- 362
Query: 292 RAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFD 350
L S + A E+LG L DN + +V Q ++ ++ L++LP
Sbjct: 363 --------CLMSRDRFLKMRAMEILGNLSKVEDNGVLICEYVDQDSYREVMLLLTLPDLM 414
Query: 351 AQAAAVGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
A++ LY LA++ + C K+A +ID L+R++
Sbjct: 415 LLMASLEVLYLLAQLGEIPCS-KIAFVDHSIDLLVRLV 451
>gi|357612802|gb|EHJ68177.1| hypothetical protein KGM_12579 [Danaus plexippus]
Length = 1236
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 42/165 (25%), Positives = 68/165 (41%), Gaps = 32/165 (19%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIA 126
RI++AL+SGL +E WAL+ L +L F DD L +PGLLD LL+
Sbjct: 806 RIMMALKSGLLAETCWALDILNILLF---DDNCIGYFGLQHMPGLLDLLLE--------- 853
Query: 127 LPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNAARV 186
G F+A + + + R + A ++++
Sbjct: 854 -----------------HFQKSLGDVFDAPATESEPW-RPALQVRDPAG--VLKRRRLED 893
Query: 187 RSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEV 231
E + ++ NL +E R + + SNI+R +F+P NE
Sbjct: 894 YEDECYTRDEPSLNLVNESRDALARRCIALSNILRGLTFVPGNEA 938
>gi|328787106|ref|XP_395512.4| PREDICTED: hypothetical protein LOC412046 [Apis mellifera]
Length = 2066
Score = 44.3 bits (103), Expect = 0.13, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
RI++AL+SGL +E WAL+ L +L F DD LA +PGLLD LL+
Sbjct: 1385 RIMMALRSGLLAESCWALDVLNILLF---DDSSVSYFGLAHLPGLLDVLLE 1432
>gi|350406792|ref|XP_003487883.1| PREDICTED: hypothetical protein LOC100748451 [Bombus impatiens]
Length = 2066
Score = 44.3 bits (103), Expect = 0.14, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
RI++AL+SGL +E WAL+ L +L F DD LA +PGLLD LL+
Sbjct: 1385 RIMMALRSGLLAESCWALDVLNILLF---DDSSVSYFGLAHLPGLLDVLLE 1432
>gi|340721281|ref|XP_003399052.1| PREDICTED: hypothetical protein LOC100651892 [Bombus terrestris]
Length = 2066
Score = 44.3 bits (103), Expect = 0.14, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
RI++AL+SGL +E WAL+ L +L F DD LA +PGLLD LL+
Sbjct: 1385 RIMMALRSGLLAESCWALDVLNILLF---DDSSVSYFGLAHLPGLLDVLLE 1432
>gi|383849904|ref|XP_003700574.1| PREDICTED: uncharacterized protein LOC100883763 [Megachile rotundata]
Length = 2067
Score = 44.3 bits (103), Expect = 0.14, Method: Composition-based stats.
Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
RI++AL+SGL +E WAL+ L +L F DD LA +PGLLD LL+
Sbjct: 1384 RIMMALRSGLLAESCWALDVLNILLF---DDSSVSYFGLAHLPGLLDVLLE 1431
>gi|345483492|ref|XP_001600023.2| PREDICTED: trithorax group protein osa [Nasonia vitripennis]
Length = 1177
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 7/68 (10%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ----VIDDW 122
RI++AL+SGL +E WAL+ L +L F DD L +PGLLD LL+ + D
Sbjct: 474 RIMMALRSGLLAESCWALDVLNILLF---DDSSVHYFGLTHLPGLLDVLLEHFSRALSDM 530
Query: 123 RDIALPKE 130
D +L E
Sbjct: 531 FDFSLIDE 538
>gi|430811478|emb|CCJ31119.1| unnamed protein product [Pneumocystis jirovecii]
Length = 284
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 38/56 (67%), Gaps = 5/56 (8%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDW 122
RI LAL+SGL SEL WAL+ L LS+++ +++R D +IP L + L++ + ++
Sbjct: 13 RIALALKSGLISELDWALHHLVRLSYEQGNNLRFD-----RIPDLAETLIKKLYEF 63
>gi|339252276|ref|XP_003371361.1| cuticle collagen rol-6 [Trichinella spiralis]
gi|316968416|gb|EFV52694.1| cuticle collagen rol-6 [Trichinella spiralis]
Length = 1465
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 6/61 (9%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIA 126
R+V+ +SGL E+TW LN L +L + DD L +PG LDAL++V W+
Sbjct: 948 RLVMVFKSGLLFEVTWGLNVLNVLLY---DDSSAPYFNLNSLPGFLDALVEV---WKQSL 1001
Query: 127 L 127
L
Sbjct: 1002 L 1002
>gi|358059718|dbj|GAA94487.1| hypothetical protein E5Q_01139 [Mixia osmundae IAM 14324]
Length = 754
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 46/96 (47%), Gaps = 6/96 (6%)
Query: 27 FGSTSGSSGGSGSAADSAAPTTLLGPSLQVHSSFADQNH-KRIVLALQSGLKSELTWALN 85
+GS G S + A T +L + DQ R+ L+L+ GL E+ +AL
Sbjct: 72 YGSGQGRQATSAAQQAPATTTQILQQRYAGQPIYNDQGSGNRLTLSLRCGLTEEVDFALG 131
Query: 86 TLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDD 121
L +S + D + PL +PG+L+ALL +I D
Sbjct: 132 RLAEISQADPDLL-----PLKDLPGMLEALLSLIQD 162
>gi|260790547|ref|XP_002590303.1| hypothetical protein BRAFLDRAFT_216118 [Branchiostoma floridae]
gi|229275495|gb|EEN46314.1| hypothetical protein BRAFLDRAFT_216118 [Branchiostoma floridae]
Length = 892
Score = 43.1 bits (100), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
RI++AL+SGL +E TWAL+TL +L F DD L +PG+++ L++
Sbjct: 294 RIMMALKSGLLAEATWALDTLNILLF---DDNTVTYFNLQHLPGMIETLME 341
>gi|307169896|gb|EFN62405.1| Trithorax group protein osa [Camponotus floridanus]
Length = 2116
Score = 43.1 bits (100), Expect = 0.33, Method: Composition-based stats.
Identities = 24/51 (47%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
RI+++L+SGL +E WAL+ L +L F DD LA +PGLLD LL+
Sbjct: 1418 RIMMSLRSGLLAESCWALDVLNILLF---DDSSVSYFGLAHLPGLLDVLLE 1465
>gi|355786025|gb|EHH66208.1| hypothetical protein EGM_03149 [Macaca fascicularis]
Length = 1687
Score = 42.7 bits (99), Expect = 0.35, Method: Composition-based stats.
Identities = 78/316 (24%), Positives = 130/316 (41%), Gaps = 48/316 (15%)
Query: 78 SELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIALPKELSKGPRA 137
+E+ +A+N TLLS + K M+ + P KI LL A V DD
Sbjct: 22 NEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD---------------- 63
Query: 138 RTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNAARVRSSEWWFDEDG 197
TLG S T FG E++ + + + D + +N + +S W E
Sbjct: 64 -TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNKSHEGTSGEWIWES- 119
Query: 198 LFN------LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHV 251
LF+ ++D Q AV I+RN SF N ++A +R CL + H
Sbjct: 120 LFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRTCLRFLLLSAHSHF 175
Query: 252 TEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEAIMGILGSPFKAWH 309
+L L+T+ N+A LLD F ++ + +T+ + + + + G
Sbjct: 176 ISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDRFLKMRG------- 225
Query: 310 CAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVD 368
E+LG L DN + +V Q ++ ++ ++LP + + LY L E+
Sbjct: 226 ---MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVISTLEVLYMLTEMGDV 282
Query: 369 CRLKLASERWAIDRLL 384
K+A +ID L+
Sbjct: 283 ACTKIAKVEKSIDMLV 298
>gi|443898106|dbj|GAC75444.1| hypothetical protein PANT_15c00080 [Pseudozyma antarctica T-34]
Length = 1043
Score = 42.7 bits (99), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 8/108 (7%)
Query: 18 TPAAKRGRPFGSTSGSSGGSGSAADSAAPTT------LLGPSLQVHSSFADQNHKRIVLA 71
TP+A+ P G GG+ SA P+ L+G + S RI+L+
Sbjct: 81 TPSARPIMPGGRLPQHLGGARMTHPSADPSASFKRPKLVGDGYEA-SYLVPGPSNRILLS 139
Query: 72 LQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVI 119
L+SGL ++ WAL+ L ++ +D ++ T L +PGL DAL+ +
Sbjct: 140 LKSGLPVQIDWALSRLVHITANNRDPAAREFT-LDSVPGLADALISYV 186
>gi|328724862|ref|XP_001946583.2| PREDICTED: hypothetical protein LOC100165557 [Acyrthosiphon pisum]
Length = 2046
Score = 42.7 bits (99), Expect = 0.40, Method: Composition-based stats.
Identities = 23/51 (45%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+ L +L F DD+ L +PGLLD LL+
Sbjct: 1481 RLMMSLRSGLLAESTWALDVLNILLF---DDVSIPFFGLTHMPGLLDVLLE 1528
>gi|47213937|emb|CAF94468.1| unnamed protein product [Tetraodon nigroviridis]
Length = 321
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 100/217 (46%), Gaps = 34/217 (15%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++ ++VL+L SGL +E+ +A+N TLLS + K M+ D P K+ LL A V DD
Sbjct: 86 DYNKLVLSLLSGLPNEVDFAVNVCTLLSNESKHSMQLDKDP--KLVTLLLAHAGVFDD-- 141
Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQK-N 182
+LG S FG++++ S + A V + + + K N
Sbjct: 142 ---------------SLG--SFSRLFGTDWKERTSRDFARFWKEVVEDNEVRELIWDKSN 184
Query: 183 AARVRSS--EWWFDEDGLFN---LDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHR 237
AA+ +S E W LF+ G E Q+ + + I+RN SF N ++A +R
Sbjct: 185 AAQDGTSCEERWH---SLFHPPRTHGIGDMEAQR-VLQIAIILRNLSFEEANVKLLAANR 240
Query: 238 HCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDL 274
CL + C ++ +L L+T+ N+A L L
Sbjct: 241 TCLRFLLLCSHCNLISLRQL---GLDTLGNVAAELQL 274
>gi|307210169|gb|EFN86842.1| Trithorax group protein osa [Harpegnathos saltator]
Length = 2098
Score = 42.0 bits (97), Expect = 0.59, Method: Composition-based stats.
Identities = 23/51 (45%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E WAL+ L +L F DD LA +PGLLD LL+
Sbjct: 1406 RLMMSLRSGLLAESCWALDVLNILLF---DDSSVSYFGLAHLPGLLDVLLE 1453
>gi|198436052|ref|XP_002127335.1| PREDICTED: similar to AT rich interactive domain 2 (ARID, RFX-like)
[Ciona intestinalis]
Length = 1197
Score = 42.0 bits (97), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 74/345 (21%), Positives = 149/345 (43%), Gaps = 51/345 (14%)
Query: 64 NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALL---QVID 120
N++R+V +L S L +E+ +A+N TLLS + + +R L + P +L+ALL ++
Sbjct: 148 NYERLVSSLISCLPNEIDFAINACTLLSGESRHVLR-----LPRCPQVLNALLLHAGIVS 202
Query: 121 DWRD--IALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFP----------RSGV 168
+ + KEL K + SL+ + + + ++ P +S V
Sbjct: 203 NCEGSYACIAKELEKNGQ-------SLIQFWVDTVDNIEVLDEMLPTYRERLKLNEKSAV 255
Query: 169 GSGSSAADSLVQKNAARVRSSEW--WFDEDGLFNLDD-EGRAEKQQCAVGASNIIRNFSF 225
S + + K +++W F L +++ EG Q + S ++ N SF
Sbjct: 256 KDDSDNKEYSLTKKQIIKPTTKWPLLFQSKRLLGIEEREG-----QRVLQISLVLLNLSF 310
Query: 226 MPDNEVIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRI--FSSSKQS 283
N V++A+ R + VF CI +E L L+ + N++ ++L +S ++ +
Sbjct: 311 EEVNLVLLAKSRELVRFVFLCIH---SEYSALKQTGLDILGNVSSRMNLNDLGYSIAQCT 367
Query: 284 YIKITREKRAVEAIMGILGSPFKAWHCAAAELLGRLI-INPDNEPFLLPFVPQIHKRLVD 342
Y +T+ + + I G E+LG L + E ++ + +
Sbjct: 368 YKTLTQCTSSADKFDRIRG----------FEILGNLCSCEANKENLIVAVTDAVISAAIL 417
Query: 343 LMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVI 387
+ L + AA+ ALY L+++ + ++ +ID ++ +I
Sbjct: 418 CLPLSDIELIVAALEALYKLSKLGEETCTRIMLIPHSIDLIVNLI 462
>gi|7495155|pir||T29265 hypothetical protein C01G8.7 - Caenorhabditis elegans
Length = 1357
Score = 42.0 bits (97), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 39/66 (59%), Gaps = 4/66 (6%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
+R++++L+SGL +E WA+N L +L + DD T L ++PGL++ +++ + I
Sbjct: 587 RRLIMSLRSGLDAEAIWAINALNVLLY---DDTNPHPT-LQQMPGLVNVIVEHLYATLSI 642
Query: 126 ALPKEL 131
P E
Sbjct: 643 MYPAEF 648
>gi|156352532|ref|XP_001622802.1| hypothetical protein NEMVEDRAFT_v1g139771 [Nematostella vectensis]
gi|156209421|gb|EDO30702.1| predicted protein [Nematostella vectensis]
Length = 595
Score = 42.0 bits (97), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 34/50 (68%), Gaps = 3/50 (6%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALL 116
R++++L+SGL +E TWAL+TL +L + D+ L+ +PGLLD LL
Sbjct: 37 RVMMSLKSGLLAETTWALDTLNILLY---DENTVGYFVLSHLPGLLDNLL 83
>gi|242005724|ref|XP_002423712.1| trithorax group protein osa, putative [Pediculus humanus corporis]
gi|212506897|gb|EEB10974.1| trithorax group protein osa, putative [Pediculus humanus corporis]
Length = 1664
Score = 42.0 bits (97), Expect = 0.71, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 33/57 (57%), Gaps = 6/57 (10%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
RI++AL+SG E TWAL+ L +L F DD L +PGLLD LL+ WR
Sbjct: 1055 RIMMALRSGQLCETTWALDVLNILLF---DDTTIAYFGLGNLPGLLDILLE---HWR 1105
>gi|392885572|ref|NP_491562.3| Protein LET-526, isoform a [Caenorhabditis elegans]
gi|351020483|emb|CCD62467.1| Protein LET-526, isoform a [Caenorhabditis elegans]
Length = 1687
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 39/66 (59%), Gaps = 4/66 (6%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
+R++++L+SGL +E WA+N L +L + DD T L ++PGL++ +++ + I
Sbjct: 917 RRLIMSLRSGLDAEAIWAINALNVLLY---DDTNPHPT-LQQMPGLVNVIVEHLYATLSI 972
Query: 126 ALPKEL 131
P E
Sbjct: 973 MYPAEF 978
>gi|453224740|ref|NP_001263448.1| Protein LET-526, isoform c [Caenorhabditis elegans]
gi|403411303|emb|CCM09387.1| Protein LET-526, isoform c [Caenorhabditis elegans]
Length = 1768
Score = 41.6 bits (96), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 39/66 (59%), Gaps = 4/66 (6%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
+R++++L+SGL +E WA+N L +L + DD T L ++PGL++ +++ + I
Sbjct: 998 RRLIMSLRSGLDAEAIWAINALNVLLY---DDTNPHPT-LQQMPGLVNVIVEHLYATLSI 1053
Query: 126 ALPKEL 131
P E
Sbjct: 1054 MYPAEF 1059
>gi|62898111|dbj|BAD96995.1| AT rich interactive domain 1A (SWI- like) isoform a variant [Homo
sapiens]
Length = 1181
Score = 41.6 bits (96), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 37/53 (69%), Gaps = 3/53 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVI 119
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 567 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVEYF 616
>gi|355669371|gb|AER94505.1| AT rich interactive domain 1A [Mustela putorius furo]
Length = 1144
Score = 41.6 bits (96), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 37/53 (69%), Gaps = 3/53 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVI 119
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 535 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVEYF 584
>gi|341882042|gb|EGT37977.1| hypothetical protein CAEBREN_07938 [Caenorhabditis brenneri]
Length = 1779
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 40/66 (60%), Gaps = 4/66 (6%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
+R++++L+SGL++E WA+N L +L + DD + L ++PGL++ +++ + I
Sbjct: 1013 RRLIMSLRSGLEAEAIWAINALNVLLY---DDTNPQPS-LQQMPGLVNVIVEHLYATLSI 1068
Query: 126 ALPKEL 131
P E
Sbjct: 1069 LFPSEF 1074
>gi|449017728|dbj|BAM81130.1| similar to DNA binding protein, dead ringer [Cyanidioschyzon
merolae strain 10D]
Length = 858
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 66/170 (38%), Gaps = 32/170 (18%)
Query: 310 CAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMS------LPAFDAQAAAVGALYNLA 363
C AAE++ + +NE ++ P + RLVD+++ L + AA+ AL +L+
Sbjct: 688 CKAAEIVAQFATRLENETIMITRFPVLLPRLVDMVAPGTGTHLASVQLSMAALDALASLS 747
Query: 364 EVNVDCRLKLASERWAID---------------RLLRVIKTPHPVPEVCR---------- 398
R ++A + LL P P R
Sbjct: 748 AFEWQARERIARTPRLVPTLVRVVAAAVAQQEPELLHRAGVPSPSETSPRVTATIGASLS 807
Query: 399 -KAAMILESLVSEPQNRVLLLAYENAFAEILFSDGRYSDTFARILYELTS 447
+AA++L +L P NR LLL YE +D A +L+EL +
Sbjct: 808 ARAALVLLNLAENPHNRRLLLPYELVLVYGAMTDKVAGTALASVLHELVA 857
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 112/280 (40%), Gaps = 61/280 (21%)
Query: 16 AATPAAKRGRPFGSTSGSSGGSGSAA-DSAAPTTLLGPSLQVHSSFADQNHKRIVLALQS 74
AAT + KR R +G S GS AP P L + + +++VL+LQS
Sbjct: 338 AATESPKRRRSALRGNGRSVGSQRGHWKERAP-----PWLPYFAENLNSEREKLVLSLQS 392
Query: 75 GLKSELTWALNTL----------TLLSFKEKDDMRKDATPLAKIPGLLDAL--------- 115
G++ E+ WAL TL T +S + + A PGLLDAL
Sbjct: 393 GVREEVRWALTTLNALCSGAPAGTSVSIGGRSTSGRLELRCALYPGLLDALNDLLDAYLE 452
Query: 116 -LQVIDDWRDIALPKEL--SKGPRARTLGVNSLVT---GFGSEFEALGSINNAFPRSGVG 169
L+ W +++P+EL +G L ++L T G G+ AL +
Sbjct: 453 DLERRRQW-SLSMPQELRECRGAHLADLSASALETLEPGAGNNERALDT----------- 500
Query: 170 SGSSAADSLVQKNAARVRSSEWWFDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDN 229
+S +Q A GL N D E+ Q ++ +N +RN SF N
Sbjct: 501 -----PESSLQGYRA------------GLMNCRDTIAKERAQYSLLVTNALRNMSFTTGN 543
Query: 230 EVIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLA 269
E+ +A H T + + H V + L+ +N+A
Sbjct: 544 EIALATHPALRSTTLRLLR-HPDTQPACVDDLLDMWLNIA 582
>gi|443694225|gb|ELT95418.1| hypothetical protein CAPTEDRAFT_226262 [Capitella teleta]
Length = 2320
Score = 41.2 bits (95), Expect = 1.2, Method: Composition-based stats.
Identities = 24/50 (48%), Positives = 33/50 (66%), Gaps = 3/50 (6%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALL 116
R+++AL+SGL +E TWAL+ LT+L + DD L +PGLLD LL
Sbjct: 1510 RLMMALRSGLLAESTWALDVLTVLLY---DDSTVMWFGLQNLPGLLDVLL 1556
>gi|324500620|gb|ADY40285.1| ARID domain-containing protein C08B11.3 [Ascaris suum]
Length = 860
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 76/168 (45%), Gaps = 11/168 (6%)
Query: 218 NIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIF 277
+I+RN SF N+ IMA L+ +F C + L T AL+ + N+A +DL
Sbjct: 31 SIMRNLSFEVINKPIMAASWPLLKFLFVCSN---CKWSALRTAALDALSNIASEIDLMCE 87
Query: 278 SSSKQSYIKITREKRAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFV-PQI 336
SS +++ + R L S K + E+L L N NE L F+ +I
Sbjct: 88 ESSVTNHLLLKTVSRC-------LNSSDKLQVIRSLEILAGLCNNEHNESLLCEFLDTRI 140
Query: 337 HKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLL 384
++ D++++ + ALY ++E+ +++ AID L+
Sbjct: 141 LSKIFDVITVKDIMMCVYTLEALYQISELGASACQQVSLYPRAIDTLV 188
>gi|308499769|ref|XP_003112070.1| CRE-LET-526 protein [Caenorhabditis remanei]
gi|308268551|gb|EFP12504.1| CRE-LET-526 protein [Caenorhabditis remanei]
Length = 1740
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 40/66 (60%), Gaps = 4/66 (6%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDI 125
+R++++L+SGL++E WA+N L +L + DD + L ++PGL++ +++ + I
Sbjct: 973 RRLIMSLRSGLEAEAIWAINALNVLLY---DDTNPQPS-LQQMPGLVNVIVEHLYATLSI 1028
Query: 126 ALPKEL 131
P E
Sbjct: 1029 LYPSEF 1034
>gi|389584972|dbj|GAB67703.1| hypothetical protein PCYB_122700 [Plasmodium cynomolgi strain B]
Length = 1283
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 60/255 (23%), Positives = 103/255 (40%), Gaps = 45/255 (17%)
Query: 171 GSSAADSLVQKNAARVRS-SEWWFDEDGLFNLDDEGRAEKQQCAV-----GASNIIRNFS 224
G + L +KN + S W F++D F L C + G I++
Sbjct: 882 GECNKNVLSEKNVYKPFSFHRWEFNKDSKFGL----------CGIDLNIIGTILILKTIH 931
Query: 225 FMPDNEVIMAQHRHCLETVFQCI----EDHVTED-EELVTNALETIVNLAPLLDLRIFSS 279
F+ D E ++ R CLE F + ++H+ E++ AL I NL+ L+ L +
Sbjct: 932 FIIDEETLLELFRECLENSFNSVTSIYKEHMKNHPEDMFNGALYLIKNLSLLIYLFYKVT 991
Query: 280 SKQSYIKITREKRAVEAIMGILGSPFKAW----HCAAAELLGRLIINPDNEPFLLPFVPQ 335
+++ E ++ G+ K W H + + G + +NE +L F
Sbjct: 992 KDHEFVRFYLHSGLTEQLL--FGASPKEWIMESHSGSGKKEGDK-VGAENENSILCFF-- 1046
Query: 336 IHKRLVDLMSLPAFDAQ----AAAVGALYN--LAEVNVDCR-----LKLASERWAIDRLL 384
KR+ ++ S D Q AA A+YN +A V+ C L + + A+++
Sbjct: 1047 --KRVYNMSSQD--DVQNRILAAFNDAIYNFTVATVSRVCSPLIKILSMEYKESALEKEE 1102
Query: 385 RVIKTPHPVPEVCRK 399
+ KT H RK
Sbjct: 1103 EIKKTTHDFLNAKRK 1117
>gi|324499695|gb|ADY39876.1| Trithorax group protein osa [Ascaris suum]
Length = 1799
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 42/80 (52%), Gaps = 8/80 (10%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATP--LAKIPGLLDALLQVIDDWR 123
+R+ +AL+SGL++E TWALN L L + DDM A P L P LL+ +++
Sbjct: 1037 RRLTMALRSGLETETTWALNALNALLY---DDM---AVPLNLNHSPALLNVIVEHFRAQL 1090
Query: 124 DIALPKELSKGPRARTLGVN 143
I PK G ++ V+
Sbjct: 1091 AILFPKIFKVGRESKVRTVD 1110
>gi|22760714|dbj|BAC11306.1| unnamed protein product [Homo sapiens]
Length = 754
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 140 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 187
>gi|320163317|gb|EFW40216.1| hypothetical protein CAOG_00741 [Capsaspora owczarzaki ATCC 30864]
Length = 994
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 35/54 (64%), Gaps = 2/54 (3%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID 120
R++ ALQS L +E+ WA N L ++S + D+ D P A +PGL++ +LQ ++
Sbjct: 388 RVLQALQSTLPNEIDWAFNVLVVMS-HDVCDVTLDICP-ATVPGLIELVLQQVE 439
>gi|321459926|gb|EFX70974.1| hypothetical protein DAPPUDRAFT_327651 [Daphnia pulex]
Length = 1815
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R+ LAL+SGL +E TWA++ L +L + DD LA +PGL + LL+
Sbjct: 1098 RVYLALKSGLLAESTWAIDVLNVLLY---DDSSVHYFSLAYMPGLTEVLLE 1145
>gi|354492435|ref|XP_003508354.1| PREDICTED: AT-rich interactive domain-containing protein 1A
[Cricetulus griseus]
Length = 2087
Score = 40.0 bits (92), Expect = 2.4, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + DD L+++PGLL+ L++
Sbjct: 1477 RVMMSLKSGLLAESTWALDTINILLY---DDNSIMTFSLSQLPGLLELLVE 1524
>gi|344245847|gb|EGW01951.1| AT-rich interactive domain-containing protein 1A [Cricetulus griseus]
Length = 1892
Score = 40.0 bits (92), Expect = 2.4, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + DD L+++PGLL+ L++
Sbjct: 1282 RVMMSLKSGLLAESTWALDTINILLY---DDNSIMTFSLSQLPGLLELLVE 1329
>gi|431891218|gb|ELK02095.1| AT-rich interactive domain-containing protein 1A [Pteropus alecto]
Length = 2008
Score = 40.0 bits (92), Expect = 2.4, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + DD L+++PGLL+ L++
Sbjct: 1395 RVMMSLKSGLLAESTWALDTINILLY---DDNSIMTFSLSQLPGLLELLVE 1442
>gi|426222752|ref|XP_004005548.1| PREDICTED: AT-rich interactive domain-containing protein 1A [Ovis
aries]
Length = 1934
Score = 40.0 bits (92), Expect = 2.4, Method: Composition-based stats.
Identities = 20/51 (39%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + DD L+++PGLL+ L++
Sbjct: 1321 RVMMSLKSGLLAESTWALDTINILLY---DDNSIMTFSLSQLPGLLELLVE 1368
>gi|351697861|gb|EHB00780.1| AT-rich interactive domain-containing protein 1A [Heterocephalus
glaber]
Length = 1990
Score = 40.0 bits (92), Expect = 2.4, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1377 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1424
>gi|328866273|gb|EGG14658.1| hypothetical protein DFA_10916 [Dictyostelium fasciculatum]
Length = 579
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 27/110 (24%), Positives = 53/110 (48%), Gaps = 4/110 (3%)
Query: 339 RLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVIKTP---HPVPE 395
RL++L+ P D Q +++ +YNL + + R+ + + + ++ H + E
Sbjct: 467 RLIELLGHPNNDIQLSSLNVIYNLIKTSQKSRIMICHISGLVRHVCNLLSYKPGDHQM-E 525
Query: 396 VCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSDGRYSDTFARILYEL 445
+ +KAA +L E N +LL +E +I SD +++ IL +L
Sbjct: 526 MGKKAATLLSYFSREQLNIPVLLQFETLLTQIALSDNPHTEIVTNILIKL 575
>gi|449550952|gb|EMD41916.1| hypothetical protein CERSUDRAFT_147324 [Ceriporiopsis subvermispora
B]
Length = 636
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 35/58 (60%), Gaps = 8/58 (13%)
Query: 65 HKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDW 122
+ R++LAL+SG+ +E+TWAL+ L L E+ +R IPGL DAL + DW
Sbjct: 38 NNRMLLALRSGIDTEVTWALDRLCRLCNNEQFLLR-------AIPGLTDALFE-WPDW 87
>gi|390335034|ref|XP_785628.3| PREDICTED: armadillo repeat-containing protein 2-like
[Strongylocentrotus purpuratus]
Length = 886
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 82/171 (47%), Gaps = 13/171 (7%)
Query: 219 IIRNFSFMPDNEVIMAQHRHCLETVFQCIEDH-VTEDEELVTNALETIVNLAPLLDLRIF 277
++ N S PD ++A + C++ + Q +E ++++EELV NAL TI NL+ D+ I
Sbjct: 619 VVANLSINPDIGPLIAANETCVDLLMQVLESKDISQNEELVLNALITINNLS-FYDI-IN 676
Query: 278 SSSKQSYIKITREKRAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIH 337
S+ + I+I R ++ L + A+ + G L + D FL ++
Sbjct: 677 SAIVERQIQIAR------LLLKQLVTDHHEGMIEASRVFGNLSRSIDIRNFLT--AKKVD 728
Query: 338 KRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVIK 388
+ +V L+ + + G L NL +VD R L E + +L+ V++
Sbjct: 729 EMMVTLLDSGNREFVYTSCGVLINLM-ADVDRRPMLKRE-GGVSKLIDVLR 777
>gi|456753261|gb|JAA74134.1| AT rich interactive domain 1A (SWI-like) tv1 [Sus scrofa]
Length = 1953
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1340 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1387
>gi|441671986|ref|XP_004093174.1| PREDICTED: LOW QUALITY PROTEIN: AT-rich interactive domain-containing
protein 1A [Nomascus leucogenys]
Length = 1843
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1229 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1276
>gi|440905964|gb|ELR56280.1| AT-rich interactive domain-containing protein 1A [Bos grunniens
mutus]
Length = 1906
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1293 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1340
>gi|397476267|ref|XP_003846154.1| PREDICTED: LOW QUALITY PROTEIN: AT-rich interactive domain-containing
protein 1A, partial [Pan paniscus]
Length = 2057
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1443 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1490
>gi|390465563|ref|XP_002807024.2| PREDICTED: LOW QUALITY PROTEIN: AT-rich interactive domain-containing
protein 1A, partial [Callithrix jacchus]
Length = 2024
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1389 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1436
>gi|350585782|ref|XP_003127781.3| PREDICTED: AT-rich interactive domain-containing protein 1A, partial
[Sus scrofa]
Length = 1499
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1041 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1088
>gi|348570742|ref|XP_003471156.1| PREDICTED: AT-rich interactive domain-containing protein 1A [Cavia
porcellus]
Length = 1973
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1355 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1402
>gi|52139164|gb|AAH82554.1| AT rich interactive domain 1A (SWI-like) [Mus musculus]
Length = 1902
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1291 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1338
>gi|426328508|ref|XP_004025294.1| PREDICTED: AT-rich interactive domain-containing protein 1A [Gorilla
gorilla gorilla]
Length = 1685
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1071 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1118
>gi|8489817|gb|AAF75765.1|AF265208_1 SWI-SNF complex protein p270 [Homo sapiens]
Length = 1927
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1313 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1360
>gi|417515629|gb|JAA53631.1| AT-rich interactive domain-containing protein 1A isoform a, partial
[Sus scrofa]
Length = 1911
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1298 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1345
>gi|410966438|ref|XP_003989740.1| PREDICTED: AT-rich interactive domain-containing protein 1A [Felis
catus]
Length = 1683
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1070 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1117
>gi|403257395|ref|XP_003921305.1| PREDICTED: AT-rich interactive domain-containing protein 1A [Saimiri
boliviensis boliviensis]
Length = 1682
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1069 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1116
>gi|355758492|gb|EHH61486.1| hypothetical protein EGM_20831, partial [Macaca fascicularis]
Length = 1906
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1292 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1339
>gi|296490111|tpg|DAA32224.1| TPA: AT rich interactive domain 1A-like [Bos taurus]
Length = 2092
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1479 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1526
>gi|281351602|gb|EFB27186.1| hypothetical protein PANDA_001156 [Ailuropoda melanoleuca]
Length = 1904
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1291 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1338
>gi|157817412|ref|NP_001100105.1| AT-rich interactive domain-containing protein 1A [Rattus norvegicus]
gi|149024190|gb|EDL80687.1| AT rich interactive domain 1A (Swi1 like) (predicted) [Rattus
norvegicus]
Length = 1911
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1299 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1346
>gi|148698102|gb|EDL30049.1| mCG20806 [Mus musculus]
Length = 1955
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1344 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1391
>gi|14150461|gb|AAK54504.1|AF268912_1 Osa1 nuclear protein [Mus musculus]
Length = 1902
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1291 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1338
>gi|14150463|gb|AAK54505.1|AF268913_1 OSA1 nuclear protein [Homo sapiens]
Length = 1685
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1071 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1118
>gi|13195757|gb|AAG17549.2|AF219114_1 chromatin remodelling factor p250 [Homo sapiens]
Length = 1939
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1325 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1372
>gi|22597104|gb|AAN03446.1|AF521670_1 SWI/SNF chromatin remodeling complex subunit OSA1 [Homo sapiens]
Length = 1999
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1385 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1432
>gi|432940003|ref|XP_004082669.1| PREDICTED: AT-rich interactive domain-containing protein 1B-like
[Oryzias latipes]
Length = 1232
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 36/53 (67%), Gaps = 3/53 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVI 119
R++++L+SGL +E TWAL+T+ +L + DD + L+++PG L+ +++
Sbjct: 643 RVMMSLKSGLLAETTWALDTINILLY---DDSTVASFNLSQLPGFLELIVEYF 692
>gi|21264565|ref|NP_006006.3| AT-rich interactive domain-containing protein 1A isoform a [Homo
sapiens]
gi|73920185|sp|O14497.3|ARI1A_HUMAN RecName: Full=AT-rich interactive domain-containing protein 1A;
Short=ARID domain-containing protein 1A; AltName:
Full=B120; AltName: Full=BRG1-associated factor 250;
Short=BAF250; AltName: Full=BRG1-associated factor 250a;
Short=BAF250A; AltName: Full=Osa homolog 1; Short=hOSA1;
AltName: Full=SWI-like protein; AltName: Full=SWI/SNF
complex protein p270; AltName: Full=SWI/SNF-related,
matrix-associated, actin-dependent regulator of chromatin
subfamily F member 1; AltName: Full=hELD
gi|119628200|gb|EAX07795.1| AT rich interactive domain 1A (SWI- like), isoform CRA_c [Homo
sapiens]
gi|119628201|gb|EAX07796.1| AT rich interactive domain 1A (SWI- like), isoform CRA_c [Homo
sapiens]
Length = 2285
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1671 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1718
>gi|410222734|gb|JAA08586.1| AT rich interactive domain 1A (SWI-like) [Pan troglodytes]
gi|410305710|gb|JAA31455.1| AT rich interactive domain 1A (SWI-like) [Pan troglodytes]
Length = 2287
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1673 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1720
>gi|410222732|gb|JAA08585.1| AT rich interactive domain 1A (SWI-like) [Pan troglodytes]
Length = 2286
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1672 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1719
>gi|410032530|ref|XP_003949382.1| PREDICTED: AT-rich interactive domain-containing protein 1A [Pan
troglodytes]
Length = 2288
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1674 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1721
>gi|410032528|ref|XP_513235.4| PREDICTED: AT-rich interactive domain-containing protein 1A isoform 6
[Pan troglodytes]
Length = 2285
Score = 40.0 bits (92), Expect = 2.5, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1671 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1718
>gi|301755052|ref|XP_002913404.1| PREDICTED: AT-rich interactive domain-containing protein 1A-like
[Ailuropoda melanoleuca]
Length = 1983
Score = 40.0 bits (92), Expect = 2.6, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1370 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1417
>gi|11320942|gb|AAG33967.1|AF231056_1 BRG1-Associated Factor 250a [Homo sapiens]
Length = 2285
Score = 40.0 bits (92), Expect = 2.6, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1671 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1718
>gi|363742215|ref|XP_417693.3| PREDICTED: AT-rich interactive domain-containing protein 1A [Gallus
gallus]
Length = 1737
Score = 40.0 bits (92), Expect = 2.6, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1127 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1174
>gi|417414095|gb|JAA53348.1| Putative swi-snf chromatin-remodeling complex protein, partial
[Desmodus rotundus]
Length = 2253
Score = 40.0 bits (92), Expect = 2.6, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1641 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1688
>gi|449680112|ref|XP_002157646.2| PREDICTED: trithorax group protein osa-like [Hydra magnipapillata]
Length = 782
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 36/57 (63%), Gaps = 6/57 (10%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
RI+++L+SGL +E +WA++TL +L D+ L ++PGLLD L+ D +R
Sbjct: 178 RIMMSLKSGLVAECSWAIDTLNILL---SDNKTITYFHLTQLPGLLDTLM---DHYR 228
>gi|449488871|ref|XP_004174432.1| PREDICTED: LOW QUALITY PROTEIN: AT-rich interactive domain-containing
protein 1A [Taeniopygia guttata]
Length = 1896
Score = 40.0 bits (92), Expect = 2.7, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1285 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1332
>gi|395730950|ref|XP_003780665.1| PREDICTED: LOW QUALITY PROTEIN: AT-rich interactive domain-containing
protein 1A [Pongo abelii]
Length = 2144
Score = 40.0 bits (92), Expect = 2.7, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1553 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1600
>gi|297282618|ref|XP_002808326.1| PREDICTED: LOW QUALITY PROTEIN: AT-rich interactive domain-containing
protein 1A-like [Macaca mulatta]
Length = 2224
Score = 40.0 bits (92), Expect = 2.8, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1673 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1720
>gi|410032532|ref|XP_001144752.3| PREDICTED: AT-rich interactive domain-containing protein 1A isoform 1
[Pan troglodytes]
Length = 2068
Score = 40.0 bits (92), Expect = 2.8, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1454 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1501
>gi|329664977|ref|NP_001192714.1| AT-rich interactive domain-containing protein 1A [Bos taurus]
Length = 2286
Score = 40.0 bits (92), Expect = 2.8, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1673 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1720
>gi|21264575|ref|NP_624361.1| AT-rich interactive domain-containing protein 1A isoform b [Homo
sapiens]
gi|119628198|gb|EAX07793.1| AT rich interactive domain 1A (SWI- like), isoform CRA_a [Homo
sapiens]
Length = 2068
Score = 40.0 bits (92), Expect = 2.8, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1454 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1501
>gi|124249109|ref|NP_001074288.1| AT-rich interactive domain-containing protein 1A [Mus musculus]
gi|288561880|sp|A2BH40.1|ARI1A_MOUSE RecName: Full=AT-rich interactive domain-containing protein 1A;
Short=ARID domain-containing protein 1A; AltName:
Full=BRG1-associated factor 250; Short=BAF250; AltName:
Full=BRG1-associated factor 250a; Short=BAF250A; AltName:
Full=Osa homolog 1; AltName: Full=SWI-like protein;
AltName: Full=SWI/SNF complex protein p270; AltName:
Full=SWI/SNF-related, matrix-associated, actin-dependent
regulator of chromatin subfamily F member 1
Length = 2283
Score = 40.0 bits (92), Expect = 2.8, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1672 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1719
>gi|395854766|ref|XP_003799850.1| PREDICTED: AT-rich interactive domain-containing protein 1A isoform 1
[Otolemur garnettii]
Length = 2280
Score = 40.0 bits (92), Expect = 2.8, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1668 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1715
>gi|390604576|gb|EIN13967.1| hypothetical protein PUNSTDRAFT_56613 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 633
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 10/92 (10%)
Query: 27 FGSTSGSSGGSGSAADSAAPTTLLGPSLQVHSSFADQN-HKRIVLALQSGLKSELTWALN 85
+G+TS ++ S A P P + ++ + R++L+L+SG+ +E+ WAL+
Sbjct: 5 YGATSYAARTSSYVQYPARPVP--PPKDDYERWYTEERPNNRMLLSLRSGIPTEVRWALD 62
Query: 86 TLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
L LS EK +R PGL DAL +
Sbjct: 63 RLCRLSHNEKFSLR-------SYPGLTDALFE 87
>gi|359319068|ref|XP_852546.3| PREDICTED: AT-rich interactive domain-containing protein 1A isoform 1
[Canis lupus familiaris]
Length = 2284
Score = 40.0 bits (92), Expect = 2.8, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1671 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1718
>gi|330843385|ref|XP_003293636.1| hypothetical protein DICPUDRAFT_99722 [Dictyostelium purpureum]
gi|325076013|gb|EGC29838.1| hypothetical protein DICPUDRAFT_99722 [Dictyostelium purpureum]
Length = 521
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 53/115 (46%), Gaps = 4/115 (3%)
Query: 334 PQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPV 393
P + RLV+L+ Q + L +LA+ + ++ + + L+ ++ T P
Sbjct: 402 PNLISRLVELLGHTNQSIQLICLQILLSLAKASQKLKITICHYPGLVRHLINLL-THKPA 460
Query: 394 ---PEVCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSDGRYSDTFARILYEL 445
EV ++AA +L L E N +LL YE A++ +D +SD IL L
Sbjct: 461 GENAEVSKRAATLLSRLSKESYNISILLPYETILAQVALTDNPHSDVITDILVRL 515
>gi|402853529|ref|XP_003891445.1| PREDICTED: AT-rich interactive domain-containing protein 1A [Papio
anubis]
Length = 2069
Score = 39.7 bits (91), Expect = 3.0, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1455 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1502
>gi|320169581|gb|EFW46480.1| hypothetical protein CAOG_04448 [Capsaspora owczarzaki ATCC 30864]
Length = 1486
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 35/57 (61%), Gaps = 8/57 (14%)
Query: 66 KRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDW 122
++++ +L SG+ ++ TWALN LT+ F ++D LA+ PGL+D L +D W
Sbjct: 1065 QKLLNSLSSGMLADSTWALNALTVWLFTDQDTFA-----LAEFPGLVDIL---VDHW 1113
>gi|301610311|ref|XP_002934686.1| PREDICTED: AT-rich interactive domain-containing protein 1A-like
isoform 2 [Xenopus (Silurana) tropicalis]
Length = 1832
Score = 39.7 bits (91), Expect = 3.0, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1241 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1288
>gi|291399519|ref|XP_002716168.1| PREDICTED: AT rich interactive domain 1A [Oryctolagus cuniculus]
Length = 2212
Score = 39.7 bits (91), Expect = 3.0, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1598 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1645
>gi|395854768|ref|XP_003799851.1| PREDICTED: AT-rich interactive domain-containing protein 1A isoform 2
[Otolemur garnettii]
Length = 2063
Score = 39.7 bits (91), Expect = 3.1, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1451 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1498
>gi|359319070|ref|XP_003638989.1| PREDICTED: AT-rich interactive domain-containing protein 1A [Canis
lupus familiaris]
Length = 2067
Score = 39.7 bits (91), Expect = 3.1, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1454 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1501
>gi|301610309|ref|XP_002934685.1| PREDICTED: AT-rich interactive domain-containing protein 1A-like
isoform 1 [Xenopus (Silurana) tropicalis]
Length = 2055
Score = 39.7 bits (91), Expect = 3.1, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L+++PGLL+ L++
Sbjct: 1464 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1511
>gi|291227342|ref|XP_002733645.1| PREDICTED: AT rich interactive domain 1A-like [Saccoglossus
kowalevskii]
Length = 2269
Score = 39.7 bits (91), Expect = 3.1, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 33/51 (64%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R+++ L+SGL +E TWA++ L++L F DD L +PGLL+ LL+
Sbjct: 1679 RVMMCLKSGLLAESTWAIDVLSILLF---DDNTVTYFQLQHLPGLLEILLE 1726
>gi|47230314|emb|CAG10728.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1556
Score = 39.7 bits (91), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 36/53 (67%), Gaps = 3/53 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVI 119
R++++L+SGL +E TWAL+T+ +L + DD + L+++PG L+ +++
Sbjct: 915 RVMMSLKSGLLAESTWALDTINILLY---DDSTVASFNLSQLPGFLELIVEYF 964
>gi|402218780|gb|EJT98855.1| vacuolar protein 8 [Dacryopinax sp. DJM-731 SS1]
Length = 593
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 17/133 (12%)
Query: 246 CIEDHVTEDEELV----TNALETIVNLAPLLDLRIFSSSKQSYIKITR--EKR------- 292
C+ + T DE + AL + LA D+R+ ++ + + +T E R
Sbjct: 155 CVTNLATHDENKTKIAKSGALVPLTRLARSKDMRVQRNATGALLNMTHSDENRQQLVNAG 214
Query: 293 AVEAIMGILGSPFK--AWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFD 350
A+ ++G+L SP ++C A L + ++ +N L P++ + LV LM P+
Sbjct: 215 AIPVLVGLLSSPDTDVQYYCTTA--LSNIAVDANNRKKLAQTEPKLVQSLVALMDSPSLK 272
Query: 351 AQAAAVGALYNLA 363
Q A AL NLA
Sbjct: 273 VQCQAALALRNLA 285
>gi|328862239|gb|EGG11340.1| hypothetical protein MELLADRAFT_102234 [Melampsora larici-populina
98AG31]
Length = 808
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 46/84 (54%), Gaps = 7/84 (8%)
Query: 41 ADSAAPTTLLGPSLQVHSSFADQN--HKRIVLALQSGLKSELTWALNTLTLLSFKEKDDM 98
+++ P L +L + DQ RI+L+++SG+ E+ + L L SF E + +
Sbjct: 78 SNNTLPFQKLNAALNGTPHYLDQPGPQNRILLSIKSGIPEEIDYGLEILLAGSFYEPECI 137
Query: 99 RKDATPLAKIPGLLDALLQVIDDW 122
+ L++ PGL+++LL +ID +
Sbjct: 138 Q-----LSRFPGLIESLLSLIDQY 156
>gi|395521845|ref|XP_003765025.1| PREDICTED: AT-rich interactive domain-containing protein 1A
[Sarcophilus harrisii]
Length = 1969
Score = 39.7 bits (91), Expect = 3.6, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L ++PGLL+ L++
Sbjct: 1356 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LGQLPGLLELLVE 1403
>gi|345305497|ref|XP_001506053.2| PREDICTED: AT-rich interactive domain-containing protein 2
[Ornithorhynchus anatinus]
Length = 1683
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 54/220 (24%), Positives = 95/220 (43%), Gaps = 26/220 (11%)
Query: 176 DSLVQKNAARVRSSEW-W---FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEV 231
D + +N + S EW W F ++D Q AV I+RN SF N
Sbjct: 44 DLISDRNKSPGTSQEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVK 99
Query: 232 IMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITR 289
++A +R CL + H +L L+T+ N+A LLD F ++ + +T+
Sbjct: 100 LLAANRTCLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTK 156
Query: 290 EKRAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPA 348
+ + + + G E+LG L DN + +V Q ++ ++ ++LP
Sbjct: 157 CLMSRDRFLKMRG----------MEILGNLCKAEDNGVLICEYVDQESYREIICHLTLPD 206
Query: 349 FDAQAAAVGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
+ + LY L E+ +V C K+A +ID L+ ++
Sbjct: 207 VLLVISTLEVLYMLTEMGDVAC-TKIAKVDKSIDMLVCLV 245
>gi|334328301|ref|XP_003341063.1| PREDICTED: LOW QUALITY PROTEIN: AT-rich interactive domain-containing
protein 1A-like [Monodelphis domestica]
Length = 2299
Score = 39.3 bits (90), Expect = 4.0, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + + M + L ++PGLL+ L++
Sbjct: 1689 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LGQLPGLLELLVE 1736
>gi|301603789|ref|XP_002931553.1| PREDICTED: AT-rich interactive domain-containing protein 1B-like
[Xenopus (Silurana) tropicalis]
Length = 2200
Score = 39.3 bits (90), Expect = 4.2, Method: Composition-based stats.
Identities = 19/51 (37%), Positives = 35/51 (68%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + DD L+++PG L+ L++
Sbjct: 1591 RVMMSLKSGLLAESTWALDTINILLY---DDSTVATFNLSQLPGFLELLVE 1638
>gi|256082673|ref|XP_002577578.1| hypothetical protein [Schistosoma mansoni]
gi|353233329|emb|CCD80684.1| hypothetical protein Smp_156170 [Schistosoma mansoni]
Length = 2565
Score = 38.9 bits (89), Expect = 5.1, Method: Composition-based stats.
Identities = 21/57 (36%), Positives = 37/57 (64%), Gaps = 6/57 (10%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
++++AL+SGL +E++WALN L +L +D+ D + +PGL+ +L +D WR
Sbjct: 1441 KLLMALRSGLTAEVSWALNCLNILL---RDENGMDFIIPSALPGLITSL---VDLWR 1491
>gi|326676953|ref|XP_698079.3| PREDICTED: AT-rich interactive domain-containing protein 1B [Danio
rerio]
Length = 2121
Score = 38.9 bits (89), Expect = 5.5, Method: Composition-based stats.
Identities = 18/51 (35%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + DD + L+++PG L+ +++
Sbjct: 1499 RVMMSLKSGLLAESTWALDTINILLY---DDNTVSSFGLSQLPGFLELIVE 1546
>gi|241829658|ref|XP_002414770.1| brahma/SWI2-related protein BRG-1 [Ixodes scapularis]
gi|215508982|gb|EEC18435.1| brahma/SWI2-related protein BRG-1 [Ixodes scapularis]
Length = 1372
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 34/51 (66%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R+++ L+SGL +E TWAL+ L++L + DD L+ +PGLL+ L++
Sbjct: 801 RLMMCLKSGLLAESTWALDVLSVLLY---DDATVLYFGLSHLPGLLETLME 848
>gi|354549332|gb|AER27741.1| Arid1b [Danio rerio]
Length = 1840
Score = 38.9 bits (89), Expect = 6.1, Method: Composition-based stats.
Identities = 18/51 (35%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + DD + L+++PG L+ +++
Sbjct: 1218 RVMMSLKSGLLAESTWALDTINILLY---DDNTVSSFGLSQLPGFLELIVE 1265
>gi|354549334|gb|AER27742.1| Arid1b [Danio rerio]
Length = 1840
Score = 38.9 bits (89), Expect = 6.1, Method: Composition-based stats.
Identities = 18/51 (35%), Positives = 36/51 (70%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + DD + L+++PG L+ +++
Sbjct: 1218 RVMMSLKSGLLAESTWALDTINILLY---DDNTVSSFGLSQLPGFLELIVE 1265
>gi|224003787|ref|XP_002291565.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973341|gb|EED91672.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 837
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 36/149 (24%), Positives = 62/149 (41%), Gaps = 2/149 (1%)
Query: 280 SKQSYIKITREKRAVEAIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKR 339
+K++ I RE VEA++G+ H A +L L +P N L+ ++
Sbjct: 618 NKENAYNIAREDVLVEALIGVSTKHSSLSHARAISILAHLTRHPKNCHHLVFKCAKLLPM 677
Query: 340 LVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVIKTPHPVPEVCRK 399
L + S P + + A+ +L NL+ ++ CR +A I L EV
Sbjct: 678 LQNATSSPEGETRKYALCSLQNLS-MDKSCRAPIAHTPKMIVSLAERCSKKETKEEVL-A 735
Query: 400 AAMILESLVSEPQNRVLLLAYENAFAEIL 428
A L++L EP N + +N ++
Sbjct: 736 AVAALQNLSDEPANLIQFTIVQNCIGTLI 764
>gi|331223423|ref|XP_003324384.1| hypothetical protein PGTG_05190 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309303374|gb|EFP79965.1| hypothetical protein PGTG_05190 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 860
Score = 38.5 bits (88), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 32/54 (59%), Gaps = 5/54 (9%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVID 120
RI+L+++S + + +AL L SF E D PL + PGL+DALL +I+
Sbjct: 103 RILLSIKSTIPEDTDYALEVLIAGSFYEPD-----LIPLPRFPGLIDALLDLIE 151
>gi|443924240|gb|ELU43293.1| vacuolar protein 8 [Rhizoctonia solani AG-1 IA]
Length = 680
Score = 38.5 bits (88), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 55/124 (44%), Gaps = 8/124 (6%)
Query: 246 CIEDHVTEDEELV----TNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGIL 301
C+ + T DE + AL + LA D+R+ ++ + A+ ++G+L
Sbjct: 170 CVTNLATHDENKTMIAKSGALVPLTRLARSKDMRVQRNATDENRQQLVNAGAIPVLVGLL 229
Query: 302 GSPFK--AWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGAL 359
SP ++C A L + ++ N L P++ + LV LM P+ Q A AL
Sbjct: 230 NSPDTDVQYYCTTA--LSNIAVDAANRKKLASSEPKLVQSLVALMDSPSLKVQCQAALAL 287
Query: 360 YNLA 363
NLA
Sbjct: 288 RNLA 291
>gi|348518429|ref|XP_003446734.1| PREDICTED: AT-rich interactive domain-containing protein 1B-like
[Oreochromis niloticus]
Length = 2215
Score = 38.1 bits (87), Expect = 8.9, Method: Composition-based stats.
Identities = 18/51 (35%), Positives = 35/51 (68%), Gaps = 3/51 (5%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
R++++L+SGL +E TWAL+T+ +L + DD + L ++PG L+ +++
Sbjct: 1620 RVMMSLKSGLLAESTWALDTINILLY---DDSTVGSFSLPQLPGFLELIVE 1667
>gi|47207505|emb|CAF92773.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1644
Score = 38.1 bits (87), Expect = 9.2, Method: Composition-based stats.
Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 67 RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWRDIA 126
R++++L+SGL +E TWAL+T+ +L + + + L PG +LQ + RD+
Sbjct: 1017 RVMMSLKSGLLAESTWALDTINILLYDDSSISTFNLCQLPGFPGAGGGVLQTLPH-RDLW 1075
Query: 127 LPKELSKGPRART 139
P+ + G R RT
Sbjct: 1076 HPEGVRSG-RPRT 1087
>gi|281211075|gb|EFA85241.1| hypothetical protein PPL_02241 [Polysphondylium pallidum PN500]
Length = 530
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 52/107 (48%), Gaps = 2/107 (1%)
Query: 338 KRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVI--KTPHPVPE 395
+RL++L+ A D Q ++ L NL +++ R+ + + L ++ KT E
Sbjct: 419 ERLIELLGHYANDIQLLSLNILCNLIKISQKSRIMICHCPGLLRHLCNLLAYKTGDHNME 478
Query: 396 VCRKAAMILESLVSEPQNRVLLLAYENAFAEILFSDGRYSDTFARIL 442
+ +KAA L + EP N L+ +E A+I +D ++D IL
Sbjct: 479 MGKKAAAFLSQVSKEPLNLPALMPFEPTLAQIGLTDNPHTDVIISIL 525
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,082,279,292
Number of Sequences: 23463169
Number of extensions: 292850053
Number of successful extensions: 1302862
Number of sequences better than 100.0: 287
Number of HSP's better than 100.0 without gapping: 47
Number of HSP's successfully gapped in prelim test: 240
Number of HSP's that attempted gapping in prelim test: 1301922
Number of HSP's gapped (non-prelim): 961
length of query: 463
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 317
effective length of database: 8,933,572,693
effective search space: 2831942543681
effective search space used: 2831942543681
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)