RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy11399
(865 letters)
>gnl|CDD|240015 cd04657, Piwi_ago-like, Piwi_ago-like: PIWI domain, Argonaute-like
subfamily. Argonaute is the central component of the
RNA-induced silencing complex (RISC) and related
complexes. The PIWI domain is the C-terminal portion of
Argonaute and consists of two subdomains, one of which
provides the 5' anchoring of the guide RNA and the
other, the catalytic site for slicing.
Length = 426
Score = 322 bits (827), Expect = e-102
Identities = 127/279 (45%), Positives = 167/279 (59%), Gaps = 30/279 (10%)
Query: 448 YVHEFGLTISNSMMEVRGRVLPPPKLQYGGRMSSITAQVNSMSSQGLFASKQQALPNSGV 507
Y+ EFG+++S M+ V GRVLPPPKL+YG + P +G
Sbjct: 2 YLKEFGISVSKEMITVPGRVLPPPKLKYGDS-------------------SKTVPPRNGS 42
Query: 508 WDMRGKQFYTGVEIRVWAIACFAPQRTVRED--SLRSFTQQLQKISNDAGMPIIGQPCFC 565
W++RGK+F G IR WA+ FA R RE+ LR+F QL K AG+ I
Sbjct: 43 WNLRGKKFLEGGPIRSWAVLNFAGPRRSREERADLRNFVDQLVKTVIGAGINIT-----T 97
Query: 566 KYATGPDQVEPMFRYLK-STFQALQLVVVVLPGK-TPVYAEVKRVGDTVLGMATQCVQAK 623
A+ +VE +F LK + + QLV+V+LP K + +Y +KR+ DT LG+ TQCV AK
Sbjct: 98 AIASVEGRVEELFAKLKQAKGEGPQLVLVILPKKDSDIYGRIKRLADTELGIHTQCVLAK 157
Query: 624 NVNK-TSPQTLSNLCLKINVKLGGINSILVPSIRPKVFNEPVIFMGADVTHPPAGDN-KK 681
V K +PQ +N+ LKIN+KLGGIN L P IRP + EP + +GADVTHP GD
Sbjct: 158 KVTKKGNPQYFANVALKINLKLGGINHSLEPDIRPLLTKEPTMVLGADVTHPSPGDPAGA 217
Query: 682 PSIAAVVGSMDAHPSRYAATVRVQQHRQEIIQELSSMVR 720
PSIAAVV S+D H ++Y A+VR+Q HRQEII +L SMVR
Sbjct: 218 PSIAAVVASVDWHLAQYPASVRLQSHRQEIIDDLESMVR 256
Score = 224 bits (573), Expect = 3e-65
Identities = 74/104 (71%), Positives = 83/104 (79%)
Query: 720 RLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNH 779
R F D+ + GK+GN+P GT VD GITHP EFDFYLCSH GIQGT+RP+HYHVLWD+
Sbjct: 323 RFFPTDEDDADGKNGNVPPGTVVDRGITHPREFDFYLCSHAGIQGTARPTHYHVLWDEIG 382
Query: 780 FDADELQCLTYQLCHTYVRCTRSVSIPAPAYYAHLVAFRARYHL 823
F ADELQ LTY LC+TY RCTRSVSIP PAYYAHL A RAR +L
Sbjct: 383 FTADELQTLTYNLCYTYARCTRSVSIPPPAYYAHLAAARARCYL 426
>gnl|CDD|215631 PLN03202, PLN03202, protein argonaute; Provisional.
Length = 900
Score = 317 bits (814), Expect = 8e-95
Identities = 229/789 (29%), Positives = 361/789 (45%), Gaps = 136/789 (17%)
Query: 39 PGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRPIVLRA 98
PP PV PP VP I +P +P LPM R G +G+ I L
Sbjct: 3 ALPPPPPVVPPNVVP--------IKLEPTKKPSKPKRLPM----ARRGFGSKGQKIQLLT 50
Query: 99 NHFQISM--PRGFVHHYEIQIQ-PDKCP---RKVNREIIETMVHAYSKIFGALKPVFDGR 152
NHF++S+ P G HY + + D P + + R++I+ + YS +DG
Sbjct: 51 NHFKVSVNNPDGHFFHYSVSLTYEDGRPVDGKGIGRKVIDKVQETYSSDLAGKDFAYDGE 110
Query: 153 NNLYTRDPLPIGNDRMELEVTL--------------------PGEGKDRV--------FR 184
+L+T LP +++E V L G + R F+
Sbjct: 111 KSLFTVGALP--QNKLEFTVVLEDVSSNRNNGNGSPVGNGSPNGGDRKRSRRPYQSKTFK 168
Query: 185 VAIKWQAQVSLFALEEALEGRTRQIPFDAILALDVVMR-HLPSMTYTPVGRSFFSTPDGY 243
V I + A++ + A+ AL G+ + DA+ LD+++R H V +SFF
Sbjct: 169 VEISFAAKIPMQAIANALRGQESENSQDALRVLDIILRQHAAKQGCLLVRQSFFHNDPKN 228
Query: 244 YHPLGGGREVWFGFHQSVRPSQWKMMLNIDVSATAFYKSQPVIEFMCEVLDIRDVNEQRK 303
+ LGGG GFH S R +Q + LNIDVS T + PV++F+ ++RD +
Sbjct: 229 FVDLGGGVLGCRGFHSSFRTTQGGLSLNIDVSTTMIVQPGPVVDFLIANQNVRDPFQ--- 285
Query: 304 PLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTRRPAQMQSFPLQLENG-----QT 358
D + K + +K L+++++ +Y++ ++ +P + Q+F L+ NG +T
Sbjct: 286 --IDWSKAK--RMLKNLRVKVSPSNQ---EYKITGLSEKPCKEQTFSLKQRNGNGNEVET 338
Query: 359 VECTVAKYFLDKYKMKLRFP-HLPCLQVGQEHKHTYLPLEVCNIVAGQRCIKKLTDMQTS 417
VE TV YF+ ++LR+ LPC+ VG+ + TY P+E+C++V+ QR K L+ +Q S
Sbjct: 339 VEITVYDYFVKHRGIELRYSGDLPCINVGKPKRPTYFPIELCSLVSLQRYTKALSTLQRS 398
Query: 418 TMIKATARSAPDREREIQSLVKRADFNNDAYVHEFGLTISNSMMEVRGRVLPPPKLQYGG 477
++++ + + +R + + +K ++++ D + G++IS+ +V GRVLP PKL+ G
Sbjct: 399 SLVEKSRQKPQERMKVLTDALKSSNYDADPMLRSCGISISSQFTQVEGRVLPAPKLKVG- 457
Query: 478 RMSSITAQVNSMSSQGLFASKQQALPNSGVWDMRGKQFYTGVEIRVWAIACFAPQRTVRE 537
+ + F P +G W+ K+ +I WA+ F + R
Sbjct: 458 ------------NGEDFF-------PRNGRWNFNNKKLVEPTKIERWAVVNF----SARC 494
Query: 538 DSLRSFTQQLQKISNDAGMPIIG-------QPCFCKYATGPDQVEPMFRYLKSTFQAL-Q 589
D R + L K G+ I P F + A P +VE MF ++S Q
Sbjct: 495 DI-RHLVRDLIKCGEMKGINIEPPFDVFEENPQF-RRAPPPVRVEKMFEQIQSKLPGPPQ 552
Query: 590 LVVVVLPGK--TPVYAEVKRVGDTVLGMATQCVQAKNVNKTSPQTLSNLCLKINVKLGGI 647
++ +LP + + +Y K+ + G+ TQC+ VN Q L+N+ LKIN KLGG+
Sbjct: 553 FLLCILPERKNSDIYGPWKKKNLSEFGIVTQCIAPTRVND---QYLTNVLLKINAKLGGL 609
Query: 648 NSILV----PSIRPKVFNEPVIFMGADVTHPPAGDNKKPSIAAVVGSMDAHP--SRYAAT 701
NS+L PSI P V P I +G DV+H G + PSIAAVV S P SRY A+
Sbjct: 610 NSLLAIEHSPSI-PLVSKVPTIILGMDVSHGSPGQSDVPSIAAVVSSRQ-WPLISRYRAS 667
Query: 702 VRVQQHRQEIIQELSSMVRLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQG 761
VR Q + E+I L P G D GI DFY S
Sbjct: 668 VRTQSPKVEMIDSLFK-------------------PVGDKDDDGIIRELLLDFYTSS--- 705
Query: 762 IQGTSRPSH 770
G +P
Sbjct: 706 --GKRKPEQ 712
Score = 118 bits (298), Expect = 3e-27
Identities = 45/88 (51%), Positives = 59/88 (67%)
Query: 729 QSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNHFDADELQCL 788
Q+G N+P GT VD I HP DFY+C+H G+ GT+RP+HYHVL D+ F AD+LQ L
Sbjct: 766 QAGSPDNVPPGTVVDNKICHPRNNDFYMCAHAGMIGTTRPTHYHVLLDEIGFSADDLQEL 825
Query: 789 TYQLCHTYVRCTRSVSIPAPAYYAHLVA 816
+ L + Y R T ++S+ AP YAHL A
Sbjct: 826 VHSLSYVYQRSTTAISVVAPVCYAHLAA 853
>gnl|CDD|214930 smart00950, Piwi, This domain is found in the protein Piwi and its
relatives. The function of this domain is the dsRNA
guided hydrolysis of ssRNA. Determination of the crystal
structure of Argonaute reveals that PIWI is an RNase H
domain, and identifies Argonaute as Slicer, the enzyme
that cleaves mRNA in the RNAi RISC complex.. In
addition, Mg+2 dependence and production of 3'-OH and 5'
phosphate products are shared characteristics of RNaseH
and RISC. The PIWI domain core has a tertiary structure
belonging to the RNase H family of enzymes. RNase H fold
proteins all have a five-stranded mixed beta-sheet
surrounded by helices. By analogy to RNase H enzymes
which cleave single-stranded RNA guided by the DNA
strand in an RNA/DNA hybrid, the PIWI domain can be
inferred to cleave single-stranded RNA, for example
mRNA, guided by double stranded siRNA.
Length = 301
Score = 258 bits (662), Expect = 1e-79
Identities = 118/308 (38%), Positives = 153/308 (49%), Gaps = 80/308 (25%)
Query: 590 LVVVVLPG--KTPVYAEVKRVGDTVLGMATQCVQAKNVNK-----TSPQTLSNLCLKINV 642
L+VV+LPG KT +Y E+K+ +T LG+ TQCVQAK ++K Q L+N+ LKIN
Sbjct: 1 LIVVILPGEKKTDLYHEIKKYLETKLGVPTQCVQAKTLDKVSKRRKLKQYLTNVALKINA 60
Query: 643 KLGGINSILVPSIRPKVFNEPVIFMGADVTHPPAGDNK--KPSIAAVVGS-MDAHPSRYA 699
KLGGIN +L P + +P + +G DV+HP AG PS+AA V S + Y
Sbjct: 61 KLGGINWVLDV---PPIPLKPTLIIGIDVSHPSAGKGGSVAPSVAAFVASGNYLSGNFYQ 117
Query: 700 ATVRVQ--------------------------------------QHRQEIIQELSSM--- 718
A VR Q Q +Q + E+ ++
Sbjct: 118 AFVREQGSRQLKEILREALKKYYKSNRKRLPDRIVVYRDGVSEGQFKQVLEYEVKAIKKA 177
Query: 719 ----------------------VRLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYL 756
R F D N+P GT VD IT P +DFYL
Sbjct: 178 CKELGPDYKPKLTVIVVQKRHHTRFFPEDG----NGRVNVPPGTVVDSVITSPEWYDFYL 233
Query: 757 CSHQGIQGTSRPSHYHVLWDDNHFDADELQCLTYQLCHTYVRCTRSVSIPAPAYYAHLVA 816
SH G+QGT+RP+HY VL+D+ + D DELQ LTY+LCH Y R TR VS+PAP YYAHL+A
Sbjct: 234 VSHAGLQGTARPTHYTVLYDEGNLDPDELQRLTYKLCHLYYRSTRPVSLPAPVYYAHLLA 293
Query: 817 FRARYHLV 824
RAR L
Sbjct: 294 KRARQLLH 301
>gnl|CDD|239208 cd02826, Piwi-like, Piwi-like: PIWI domain. Domain found in
proteins involved in RNA silencing. RNA silencing refers
to a group of related gene-silencing mechanisms mediated
by short RNA molecules, including siRNAs, miRNAs, and
heterochromatin-related guide RNAs. The central
component of the RNA-induced silencing complex (RISC)
and related complexes is Argonaute. The PIWI domain is
the C-terminal portion of Argonaute and consists of two
subdomains, one of which provides the 5' anchoring of
the guide RNA and the other, the catalytic site for
slicing. This domain is also found in closely related
proteins, including the Piwi subfamily, where it is
believed to perform a crucial role in germline cells,
via a similar mechanism.
Length = 393
Score = 225 bits (575), Expect = 6e-66
Identities = 108/434 (24%), Positives = 165/434 (38%), Gaps = 116/434 (26%)
Query: 461 MEVRGRVLPPPKLQYGGRMSSITAQVNSMSSQGLFASKQQALPNSGVWDMRGKQFYTGVE 520
+ ++GRVLP P+ I + + + G F + V R ++
Sbjct: 3 LILKGRVLPKPQ---------ILFKNKFLRNIGPFEKPAKITNPVAVIAFRNEEVDD--L 51
Query: 521 IRVWAIACFAPQRTVREDSLRSFTQQLQKISNDAGMPIIGQPCFCKYATGPDQVEPMFRY 580
++ A AC ++E + S+ + + + +
Sbjct: 52 VKRLADACRQLGMKIKEIPIVSWIED-----------------------LNNSFKDLKSV 88
Query: 581 LKSTFQA-LQLVVVVLPGK-TPVYAEVKRVGDTVLGMATQCVQAKNVNKTS--PQTLSNL 636
K+ +A +QLV+ +L K P++ E+KR+ + +Q +Q K K QTL NL
Sbjct: 89 FKNAIKAGVQLVIFILKEKKPPLHDEIKRLEAKS-DIPSQVIQLKTAKKMRRLKQTLDNL 147
Query: 637 CLKINVKLGGINSILVPSIRPKVFNEPVIFMGADVTHPPAGDNKKPSIAAVVGSMDAH-- 694
K+N KLGGIN IL P + IF+G DV+HP + VG
Sbjct: 148 LRKVNSKLGGINYILDS---PVKLFKSDIFIGFDVSHPDRR--TVNGGPSAVGFAANLSN 202
Query: 695 -----------PSRYAATV--------------------------------------RVQ 705
PSR RV+
Sbjct: 203 HTFLGGFLYVQPSREVKLQDLGEVIKKCLDGFKKSTGEGLPEKIVIYRDGVSEGEFKRVK 262
Query: 706 QHRQEIIQELSSM------------------VRLFCADKKEQSGKSGNIPAGTTVDIGIT 747
+ +EII+E + R F +K G N GT VD IT
Sbjct: 263 EEVEEIIKEACEIEESYRPKLVIIVVQKRHNTRFFPNEKN---GGVQNPEPGTVVDHTIT 319
Query: 748 HPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNHFDADELQCLTYQLCHTYVRCTRSVSIPA 807
P +FYL SH QGT +P+ Y V+++D ++ +EL+ LTY LC T+ +S+PA
Sbjct: 320 SPGLSEFYLASHVARQGTVKPTKYTVVFNDKNWSLNELEILTYILCLTHQNVYSPISLPA 379
Query: 808 PAYYAHLVAFRARY 821
P YYAH +A R R
Sbjct: 380 PLYYAHKLAKRGRN 393
>gnl|CDD|216915 pfam02171, Piwi, Piwi domain. This domain is found in the protein
Piwi and its relatives. The function of this domain is
the dsRNA guided hydrolysis of ssRNA. Determination of
the crystal structure of Argonaute reveals that PIWI is
an RNase H domain, and identifies Argonaute as Slicer,
the enzyme that cleaves mRNA in the RNAi RISC complex.
In addition, Mg+2 dependence and production of 3'-OH and
5' phosphate products are shared characteristics of
RNaseH and RISC. The PIWI domain core has a tertiary
structure belonging to the RNase H family of enzymes.
RNase H fold proteins all have a five-stranded mixed
beta-sheet surrounded by helices. By analogy to RNase H
enzymes which cleave single-stranded RNA guided by the
DNA strand in an RNA/DNA hybrid, the PIWI domain can be
inferred to cleave single-stranded RNA, for example
mRNA, guided by double stranded siRNA.
Length = 296
Score = 154 bits (392), Expect = 7e-42
Identities = 52/105 (49%), Positives = 65/105 (61%), Gaps = 3/105 (2%)
Query: 720 RLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNH 779
R F + K + N P GT VD IT P +DFYLCSH G QGT +P+ Y VL+D+
Sbjct: 195 RFFASGK---RDGAQNPPPGTVVDDVITSPEYYDFYLCSHAGRQGTVKPTKYTVLYDEIG 251
Query: 780 FDADELQCLTYQLCHTYVRCTRSVSIPAPAYYAHLVAFRARYHLV 824
+ELQ LTY+LC+ Y R R VS+PAP YYAH +A R R +L
Sbjct: 252 LSPEELQQLTYKLCYMYQRVFRPVSLPAPVYYAHKLAKRGRNNLK 296
Score = 129 bits (327), Expect = 3e-33
Identities = 54/133 (40%), Positives = 79/133 (59%), Gaps = 7/133 (5%)
Query: 590 LVVVVLPG-KTPVYAEVKRVGDTVLGMATQCVQAKNV-NKTSPQTLSNLCLKINVKLGGI 647
L+VV+LP Y +K+ +T LG+ TQC++ K +T PQTL+N+ LKINVKLGG+
Sbjct: 1 LIVVILPDKNKDNYHSIKKYLETELGIPTQCIRLKTALKRTKPQTLTNVLLKINVKLGGL 60
Query: 648 NSILVPSIRPKVFNEPVIFMGADVTHPPAGDNKKPSIAAVVGSMDAHPSRYAATVRVQQH 707
N + I PK+ + +G D++H G + PS+ V SMD HP +YA VR Q
Sbjct: 61 N-YWIVEIPPKI----DVIIGFDISHGTGGTDDNPSVVGFVASMDKHPQKYAGGVRYQAS 115
Query: 708 RQEIIQELSSMVR 720
QE+I+ L ++
Sbjct: 116 GQELIEPLKEIIL 128
>gnl|CDD|239212 cd02846, PAZ_argonaute_like, PAZ domain, argonaute_like subfamily.
Argonaute is part of the RNA-induced silencing complex
(RISC), and is an endonuclease that plays a key role in
the RNA interference pathway. The PAZ domain has been
named after the proteins Piwi,Argonaut, and Zwille. PAZ
is found in two families of proteins that are essential
components of RNA-mediated gene-silencing pathways,
including RNA interference, the Piwi and Dicer families.
PAZ functions as a nucleic acid binding domain, with a
strong preference for single-stranded nucleic acids (RNA
or DNA) or RNA duplexes with single-stranded 3'
overhangs. It has been suggested that the PAZ domain
provides a unique mode for the recognition of the two
3'-terminal nucleotides in single-stranded nucleic acids
and buries the 3' OH group, and that it might recognize
characteristic 3' overhangs in siRNAs within RISC
(RNA-induced silencing) and other complexes.
Length = 114
Score = 146 bits (370), Expect = 4e-41
Identities = 54/121 (44%), Positives = 77/121 (63%), Gaps = 7/121 (5%)
Query: 282 SQPVIEFMCEVLDIRDVNEQRKPLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTR 341
+QPVIEF+ E L L+D+ R K K +KGLK+E+TH G RKY++ ++
Sbjct: 1 AQPVIEFLKEFLGFDTPL----GLSDNDRRKLKKALKGLKVEVTHRGNTNRKYKIKGLSA 56
Query: 342 RPAQMQSFPLQLENGQTVECTVAKYFLDKYKMKLRFPHLPCLQVGQEHKHTYLPLEVCNI 401
PA Q+F L E +VA YF +KY ++L++P+LPCLQVG++ K YLP+E+CNI
Sbjct: 57 EPASQQTFEL---KDGEKEISVADYFKEKYNIRLKYPNLPCLQVGRKGKPNYLPMELCNI 113
Query: 402 V 402
V
Sbjct: 114 V 114
>gnl|CDD|216914 pfam02170, PAZ, PAZ domain. This domain is named PAZ after the
proteins Piwi Argonaut and Zwille. This domain is found
in two families of proteins that are involved in
post-transcriptional gene silencing. These are the Piwi
family and the Dicer family, that includes the Carpel
factory protein. The function of the domains is unknown
but has been suggested to mediate complex formation
between proteins of the Piwi and Dicer families by
hetero-dimerisation. The three-dimensional structure of
this domain has been solved. The PAZ domain is composed
of two subdomains. One subdomain is similar to the OB
fold, albeit with a different topology. The OB-fold is
well known as a single-stranded nucleic acid binding
fold. The second subdomain is composed of a beta-hairpin
followed by an alpha-helix. The PAZ domains shows
low-affinity nucleic acid binding and appears to
interact with the 3' ends of single-stranded regions of
RNA in the cleft between the two subdomains. PAZ can
bind the characteristic two-base 3' overhangs of siRNAs,
indicating that although PAZ may not be a primary
nucleic acid binding site in Dicer or RISC, it may
contribute to the specific and productive incorporation
of siRNAs and miRNAs into the RNAi pathway.
Length = 114
Score = 137 bits (346), Expect = 5e-38
Identities = 53/127 (41%), Positives = 75/127 (59%), Gaps = 13/127 (10%)
Query: 283 QPVIEFMCEVLDIRDVNEQRKPLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTRR 342
QPV++F+ D+ QR PL D R KFTK +KGL +E T+ R YR+ +T
Sbjct: 1 QPVLDFL------TDILTQRTPLGDDFREKFTKALKGLIVETTYN---NRTYRIDGITWD 51
Query: 343 PAQMQSFPLQLENGQTVECTVAKYFLDKYKMKLRFPHLPCLQVGQEHKHTYLPLEVCNIV 402
P +FPL+ E +VA+YF +KY + L++P+LP L VG++ K YLP E+C I
Sbjct: 52 PTPNSTFPLK----DGGEISVAEYFKEKYNITLKYPNLPLLVVGRKKKPNYLPPELCFIT 107
Query: 403 AGQRCIK 409
GQR +K
Sbjct: 108 GGQRYMK 114
>gnl|CDD|239207 cd02825, PAZ, PAZ domain, named PAZ after the proteins Piwi
Argonaut and Zwille. PAZ is found in two families of
proteins that are essential components of RNA-mediated
gene-silencing pathways, including RNA interference, the
piwi and Dicer families. PAZ functions as a nucleic-acid
binding domain, with a strong preference for
single-stranded nucleic acids (RNA or DNA) or RNA
duplexes with single-stranded 3' overhangs. It has been
suggested that the PAZ domain provides a unique mode for
the recognition of the two 3'-terminal nucleotides in
single-stranded nucleic acids and buries the 3' OH
group, and that it might recognize characteristic 3'
overhangs in siRNAs within RISC (RNA-induced silencing)
and other complexes. This parent model also contains
structures of an archaeal PAZ domain.
Length = 115
Score = 115 bits (289), Expect = 3e-30
Identities = 48/123 (39%), Positives = 63/123 (51%), Gaps = 12/123 (9%)
Query: 283 QPVIEFMCEVLDIRDVNEQRKPLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTRR 342
PVIE MC+ R+++ PL DS R +FTKE+KGLK+E TH + R YR TR
Sbjct: 2 DPVIETMCKFPKDREIDT---PLLDSPREEFTKELKGLKVEDTHNP-LNRVYRPDGETRL 57
Query: 343 PAQMQSFPLQLENGQTVECTVAKYFLDKYKMKLRFPHLPCLQVGQEHK---HTYLPLEVC 399
A P QL++ E T A YF ++Y + L + P L V K LP E+C
Sbjct: 58 KA-----PSQLKHSDGKEITFADYFKERYNLTLTDLNQPLLIVKFSSKKSYSILLPPELC 112
Query: 400 NIV 402
I
Sbjct: 113 VIT 115
>gnl|CDD|240016 cd04658, Piwi_piwi-like_Euk, Piwi_piwi-like_Euk: PIWI domain,
Piwi-like subfamily found in eukaryotes. This domain is
found in Piwi and closely related proteins, where it is
believed to perform a crucial role in germline cells,
via RNA silencing. RNA silencing refers to a group of
related gene-silencing mechanisms mediated by short RNA
molecules, including siRNAs, miRNAs, and
heterochromatin-related guide RNAs. The mechanism in
Piwi is believed to be similar to that in Argonaute, the
central component of the RNA-induced silencing complex
(RISC). The PIWI domain is the C-terminal portion of
Argonaute and consists of two subdomains, one of which
provides the 5' anchoring of the guide RNA and the
other, the catalytic site for slicing.
Length = 448
Score = 105 bits (264), Expect = 7e-24
Identities = 43/98 (43%), Positives = 54/98 (55%), Gaps = 4/98 (4%)
Query: 720 RLFCADKKEQSGKSGNIPAGTTVDIGITHPTEFDFYLCSHQGIQGTSRPSHYHVLWDDNH 779
R F + N P GT VD IT P +DF+L S QGT P+HY+VL+D
Sbjct: 351 RFF----NQGGNNFSNPPPGTVVDSEITKPEWYDFFLVSQSVRQGTVTPTHYNVLYDTTG 406
Query: 780 FDADELQCLTYQLCHTYVRCTRSVSIPAPAYYAHLVAF 817
D LQ LTY+LCH Y + S+ +PAP YAH +AF
Sbjct: 407 LKPDHLQRLTYKLCHLYYNWSGSIRVPAPCQYAHKLAF 444
Score = 95.4 bits (238), Expect = 1e-20
Identities = 73/312 (23%), Positives = 136/312 (43%), Gaps = 49/312 (15%)
Query: 418 TMIKATARSAPDREREIQSLVKRADFNNDAYVHE----FGLTISNSMMEVRGRVLPPPKL 473
+ + T + +R I+ ++R + V E +G+ + ++ ++++GRVLPP ++
Sbjct: 5 ELAEHTKLNPKERYDTIRQFIQR--IQKNPSVQELLKKWGIELDSNPLKIQGRVLPPEQI 62
Query: 474 QYGGRMSSITAQVNSMSSQGLFASKQQALPNSGVWDMRGKQFYTGVEIRVWAIACFAPQR 533
G + A ++R + Y V + W + P R
Sbjct: 63 IMGNVFVY---------------ANSNADWKR---EIRNQPLYDAVNLNNWVL--IYPSR 102
Query: 534 TVREDSLRSFTQQLQKISNDAGMPIIGQPCFCKYATGPDQVEPMFRYLKSTFQAL-QLVV 592
RE SF Q L++++ G+ I P K D++E R LK F++ QLVV
Sbjct: 103 DQRE--AESFLQTLKQVAGPMGIQIS-PPKIIKVKD--DRIETYIRALKDAFRSDPQLVV 157
Query: 593 VVLPG-KTPVYAEVKRVGDTVLGMATQCVQAKNVNKTSPQTLSNLCLKI----NVKLGGI 647
++LPG K +Y +K+ + +Q + ++ + K + L ++ KI N KLGGI
Sbjct: 158 IILPGNKKDLYDAIKKFCCVECPVPSQVITSRTLKK--KKNLRSIASKIALQINAKLGGI 215
Query: 648 NSILVP-SIRPKVF-NEPVIFMGADVTHPPAGDNKKPSIAAVVGSMDAHPSRYAATVRVQ 705
P ++ F + + +G DV H K S+ V S++ +++ + Q
Sbjct: 216 -----PWTVEIPPFILKNTMIVGIDVYHDTITKKK--SVVGFVASLNKSITKWFSKYISQ 268
Query: 706 -QHRQEIIQELS 716
+ ++EII L
Sbjct: 269 VRGQEEIIDSLG 280
>gnl|CDD|219976 pfam08699, DUF1785, Domain of unknown function (DUF1785). This
region is found in argonaute proteins and often
co-occurs with pfam02179 and pfam02171.
Length = 52
Score = 89.5 bits (223), Expect = 5e-22
Identities = 32/53 (60%), Positives = 36/53 (67%), Gaps = 1/53 (1%)
Query: 230 TPVGRSFFSTPDGYYHPLGGGREVWFGFHQSVRPSQWKMMLNIDVSATAFYKS 282
VGRSFFS G+ LGGG E W GF QSVRP+Q + LN+DVS TAFYK
Sbjct: 1 IAVGRSFFSISFGH-RDLGGGLEAWRGFFQSVRPTQGGLSLNVDVSTTAFYKP 52
>gnl|CDD|198017 smart00949, PAZ, This domain is named PAZ after the proteins Piwi
Argonaut and Zwille. This domain is found in two
families of proteins that are involved in
post-transcriptional gene silencing. These are the Piwi
family and the Dicer family, that includes the Carpel
factory protein. The function of the domains is unknown
but has been suggested to mediate complex formation
between proteins of the Piwi and Dicer families by
hetero-dimerisation. The three-dimensional structure of
this domain has been solved. The PAZ domain is composed
of two subdomains. One subdomain is similar to the OB
fold, albeit with a different topology. The OB-fold is
well known as a single-stranded nucleic acid binding
fold. The second subdomain is composed of a beta-hairpin
followed by an alpha-helix. The PAZ domains shows
low-affinity nucleic acid binding and appears to
interact with the 3' ends of single-stranded regions of
RNA in the cleft between the two subdomains. PAZ can
bind the characteristic two-base 3' overhangs of siRNAs,
indicating that although PAZ may not be a primary
nucleic acid binding site in Dicer or RISC, it may
contribute to the specific and productive incorporation
of siRNAs and miRNAs into the RNAi pathway.
Length = 138
Score = 54.6 bits (132), Expect = 8e-09
Identities = 29/145 (20%), Positives = 54/145 (37%), Gaps = 18/145 (12%)
Query: 290 CEVLDIRDVNEQRKPLTDSQRVKFTKEIKGLKIEITHCGTMKRKYRVCNVTRRPAQMQSF 349
VLD + + + + K++KGL + + + YR+ ++ A +F
Sbjct: 1 ETVLDFM-RQLPSQGNRSNFQDRCAKDLKGLIVLTRY---NNKTYRIDDIDWNLAPKSTF 56
Query: 350 PLQLENGQTVECTVAKYFLDKYKMKLRFPHLPCL--------QVGQEHKHTYLPLEVCNI 401
+G E T +Y+ KY + +R P+ P L + + LP E+C I
Sbjct: 57 EKS--DGS--EITFVEYYKQKYNITIRDPNQPLLVSRPKRRRNQNGKGEPVLLPPELCFI 112
Query: 402 VA-GQRCIKKLTDM-QTSTMIKATA 424
R K M + + +
Sbjct: 113 TGLTDRMRKDFMLMKSIADRTRLSP 137
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 50.3 bits (120), Expect = 2e-06
Identities = 26/59 (44%), Positives = 26/59 (44%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGR 89
A AP P AP PP P APP P P A PP P P PP P P P GR
Sbjct: 52 AAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEPGR 110
Score = 44.9 bits (106), Expect = 9e-05
Identities = 21/82 (25%), Positives = 25/82 (30%), Gaps = 6/82 (7%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGR 92
A AP P PP P PP P P P P PN
Sbjct: 48 PSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPP--PPPVDPNAPPPPA 105
Query: 93 PIVLRANH----FQISMPRGFV 110
P R ++ F +P G+V
Sbjct: 106 PEPGRIDNAVGGFSYVVPAGWV 127
Score = 32.2 bits (73), Expect = 1.1
Identities = 13/39 (33%), Positives = 14/39 (35%)
Query: 36 PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
P A P P PP + A A PP PP P
Sbjct: 34 PATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAP 72
Score = 31.0 bits (70), Expect = 2.3
Identities = 15/33 (45%), Positives = 15/33 (45%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIP 63
A PPP P APP P P A P P G I
Sbjct: 80 APPPPPADPNAPPPPPVDPNAPPPPAPEPGRID 112
Score = 31.0 bits (70), Expect = 2.8
Identities = 13/34 (38%), Positives = 15/34 (44%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPP 64
A PPP P APP P PG + G + P
Sbjct: 90 APPPPPVDPNAPPPPAPEPGRIDNAVGGFSYVVP 123
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 43.8 bits (103), Expect = 5e-04
Identities = 16/76 (21%), Positives = 20/76 (26%)
Query: 18 GNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
+ F G P P + P P P APP +P +P P P
Sbjct: 391 AATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPP 450
Query: 78 MFNCPRRPNLGREGRP 93
P P
Sbjct: 451 ERQPPAPATEPAPDDP 466
Score = 38.4 bits (89), Expect = 0.019
Identities = 17/63 (26%), Positives = 19/63 (30%), Gaps = 2/63 (3%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
AG P P P AP PP + P A P P P + P L
Sbjct: 2764 AGPPAPAPPAAPAAG--PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
Query: 91 GRP 93
P
Sbjct: 2822 ASP 2824
Score = 36.1 bits (83), Expect = 0.12
Identities = 16/63 (25%), Positives = 19/63 (30%), Gaps = 2/63 (3%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
+P P AP P P P P +P P + PRR R
Sbjct: 2613 PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRAR--RL 2670
Query: 91 GRP 93
GR
Sbjct: 2671 GRA 2673
Score = 35.7 bits (82), Expect = 0.13
Identities = 17/42 (40%), Positives = 18/42 (42%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
A PP P PP P APP P P P PP P +P
Sbjct: 2899 ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
Score = 34.9 bits (80), Expect = 0.25
Identities = 15/46 (32%), Positives = 17/46 (36%)
Query: 32 GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
P P APP P P P P PP +P P+ P P
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
Score = 34.9 bits (80), Expect = 0.26
Identities = 16/60 (26%), Positives = 18/60 (30%), Gaps = 9/60 (15%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPG---------AMGLIPPNQPPQPVQPPELP 77
+ A P A GA P P A P A+ P P P PP P
Sbjct: 261 VGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAP 320
Score = 33.8 bits (77), Expect = 0.53
Identities = 19/51 (37%), Positives = 22/51 (43%), Gaps = 8/51 (15%)
Query: 31 AGAPPPGAPGAPPGP-----VAPPGAVPAPPG-AMGLIP--PNQPPQPVQP 73
AG PPP P A P V PP P P A+ P+ PPQ +P
Sbjct: 2548 AGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARP 2598
Score = 33.4 bits (76), Expect = 0.67
Identities = 13/70 (18%), Positives = 14/70 (20%)
Query: 4 PFPTTPDISNDLNFGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIP 63
P + P A A PG A P P G P
Sbjct: 2712 PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP 2771
Query: 64 PNQPPQPVQP 73
P P
Sbjct: 2772 PAAPAAGPPR 2781
Score = 33.4 bits (76), Expect = 0.69
Identities = 18/54 (33%), Positives = 18/54 (33%), Gaps = 3/54 (5%)
Query: 23 TCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPG--AMGLIPPNQPPQPVQPP 74
T G T A PPP P A P PPG A P P P P
Sbjct: 2691 TVGSLTSLADPPPPPPTPEPAPHALVS-ATPLPPGPAAARQASPALPAAPAPPA 2743
Score = 33.0 bits (75), Expect = 0.96
Identities = 15/44 (34%), Positives = 15/44 (34%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
P P PP P PP PP PP QPP P
Sbjct: 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP 2945
Score = 33.0 bits (75), Expect = 0.99
Identities = 21/63 (33%), Positives = 25/63 (39%), Gaps = 7/63 (11%)
Query: 35 PPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRPI 94
P P PP P P G AP G + PP++ P P RP + R RP
Sbjct: 2838 APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPA-------APARPPVRRLARPA 2890
Query: 95 VLR 97
V R
Sbjct: 2891 VSR 2893
Score = 32.6 bits (74), Expect = 1.1
Identities = 17/43 (39%), Positives = 17/43 (39%), Gaps = 4/43 (9%)
Query: 33 APPPGAPG--APPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP 73
AP PPGP A A PA P A PP P P P
Sbjct: 2711 APHALVSATPLPPGPAAARQASPALPAAP--APPAVPAGPATP 2751
Score = 32.6 bits (74), Expect = 1.3
Identities = 15/49 (30%), Positives = 15/49 (30%), Gaps = 5/49 (10%)
Query: 31 AGAPPPGAPGAPPGPVAP-----PGAVPAPPGAMGLIPPNQPPQPVQPP 74
A P AP P P P P PP G P P P P
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
Score = 32.6 bits (74), Expect = 1.3
Identities = 23/69 (33%), Positives = 24/69 (34%), Gaps = 15/69 (21%)
Query: 34 PPPGAPGAPPGPVAPP-------GAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN 86
P P APP P P PAPP P QPP P P + R
Sbjct: 420 PTPVPASAPPPPATPLPSAEPGSDDGPAPP------PERQPPAPATEPAPDDPDDATRKA 473
Query: 87 LG--REGRP 93
L RE RP
Sbjct: 474 LDALRERRP 482
Score = 32.2 bits (73), Expect = 1.5
Identities = 23/79 (29%), Positives = 26/79 (32%), Gaps = 21/79 (26%)
Query: 31 AGAPPPGAP--------------GAPPGPVAPPGAVPAPPGAM-------GLIPPNQPPQ 69
A PP +P P PP AV AP A+ G +PP Q
Sbjct: 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835
Query: 70 PVQPPELPMFNCPRRPNLG 88
P PP P P P G
Sbjct: 2836 PTAPPPPPGPPPPSLPLGG 2854
Score = 31.8 bits (72), Expect = 1.8
Identities = 15/63 (23%), Positives = 19/63 (30%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
+ A P AP PP AV + L P P P P+ P +
Sbjct: 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
Query: 91 GRP 93
P
Sbjct: 2930 QPP 2932
Score = 31.8 bits (72), Expect = 2.0
Identities = 19/69 (27%), Positives = 23/69 (33%), Gaps = 7/69 (10%)
Query: 32 GAPP----PGAPGAPPGP---VAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRR 84
APP P AP G APP +P A PP+ P +P P P
Sbjct: 2590 DAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPP 2649
Query: 85 PNLGREGRP 93
+ P
Sbjct: 2650 ERPRDDPAP 2658
Score = 31.8 bits (72), Expect = 2.3
Identities = 15/61 (24%), Positives = 18/61 (29%), Gaps = 12/61 (19%)
Query: 39 PGAPPG------PVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGR 92
P APP PV G P PP+ P P+ P + N
Sbjct: 2589 PDAPPQSARPRAPVDDRGDPRGPA------PPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642
Query: 93 P 93
P
Sbjct: 2643 P 2643
Score = 31.4 bits (71), Expect = 2.6
Identities = 20/61 (32%), Positives = 21/61 (34%), Gaps = 2/61 (3%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
P P P PP V P A P A P PP QP P P P+L
Sbjct: 2796 ESLPSPWDPADPPAAVLAPA--AALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG 2853
Query: 91 G 91
G
Sbjct: 2854 G 2854
Score = 30.7 bits (69), Expect = 4.5
Identities = 17/56 (30%), Positives = 19/56 (33%), Gaps = 1/56 (1%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAV-PAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
AG PG P P P G PAPP A PP + +P P
Sbjct: 2746 AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801
Score = 30.7 bits (69), Expect = 4.7
Identities = 13/61 (21%), Positives = 16/61 (26%), Gaps = 17/61 (27%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMG-----------------LIPPNQPPQPVQPPEL 76
PPP P P P P P GA+ +P P +
Sbjct: 2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASST 2993
Query: 77 P 77
P
Sbjct: 2994 P 2994
Score = 30.7 bits (69), Expect = 4.9
Identities = 17/78 (21%), Positives = 21/78 (26%), Gaps = 5/78 (6%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGR 92
A P + + P P PPQP P P P P R
Sbjct: 2887 ARPAVSRSTESFALPPDQPERPPQP-----QAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
Query: 93 PIVLRANHFQISMPRGFV 110
P+ + P G V
Sbjct: 2942 PLAPTTDPAGAGEPSGAV 2959
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 38.5 bits (90), Expect = 0.002
Identities = 19/44 (43%), Positives = 21/44 (47%), Gaps = 5/44 (11%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
PPP P APP PP PA P + P PP P + P LP
Sbjct: 64 PPPSEPAAPPPDPEPPVPGPAGPPS-----PLAPPAPARKPPLP 102
Score = 35.8 bits (83), Expect = 0.016
Identities = 16/56 (28%), Positives = 21/56 (37%), Gaps = 4/56 (7%)
Query: 26 GQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLI----PPNQPPQPVQPPELP 77
G + C P P PP P V P + + PP++P P PE P
Sbjct: 24 GFSSCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPP 79
Score = 32.7 bits (75), Expect = 0.23
Identities = 14/37 (37%), Positives = 15/37 (40%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQ 69
APPP PGP PP + P A P PQ
Sbjct: 71 APPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRPQ 107
Score = 31.2 bits (71), Expect = 0.60
Identities = 13/51 (25%), Positives = 15/51 (29%)
Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
G PP P A P P + P PP P+ PP
Sbjct: 44 GDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPA 94
Score = 29.6 bits (67), Expect = 2.1
Identities = 15/55 (27%), Positives = 16/55 (29%), Gaps = 2/55 (3%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
G P P PP V PP PP+ P P P P P
Sbjct: 43 VGDPRPPVVDTPPP--VSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPLAPPAP 95
Score = 28.1 bits (63), Expect = 7.1
Identities = 11/26 (42%), Positives = 12/26 (46%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPP 56
PP P PP P+APP PP
Sbjct: 75 DPEPPVPGPAGPPSPLAPPAPARKPP 100
Score = 28.1 bits (63), Expect = 8.3
Identities = 10/25 (40%), Positives = 11/25 (44%)
Query: 32 GAPPPGAPGAPPGPVAPPGAVPAPP 56
P P P +P P AP P PP
Sbjct: 79 PVPGPAGPPSPLAPPAPARKPPLPP 103
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 41.1 bits (97), Expect = 0.002
Identities = 14/55 (25%), Positives = 14/55 (25%), Gaps = 4/55 (7%)
Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAV----PAPPGAMGLIPPNQPPQPVQPPE 75
G A P A A P P A A P P P P P
Sbjct: 388 AGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPP 442
Score = 41.1 bits (97), Expect = 0.003
Identities = 15/45 (33%), Positives = 15/45 (33%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
P P P PP P P P PP P P P P E
Sbjct: 737 DPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781
Score = 36.9 bits (86), Expect = 0.054
Identities = 15/48 (31%), Positives = 15/48 (31%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
A AP A P P P PAPP G P P P
Sbjct: 417 PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSA 464
Score = 35.3 bits (82), Expect = 0.15
Identities = 15/61 (24%), Positives = 15/61 (24%)
Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRR 84
A A P A P A P AP A P PP P P
Sbjct: 399 PSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP 458
Query: 85 P 85
Sbjct: 459 A 459
Score = 34.2 bits (79), Expect = 0.31
Identities = 16/44 (36%), Positives = 16/44 (36%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
A PP AP AP P A PAP A QPP
Sbjct: 679 AAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP 722
Score = 34.2 bits (79), Expect = 0.40
Identities = 16/47 (34%), Positives = 17/47 (36%), Gaps = 1/47 (2%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A P AP P P AP A P+P G PP P P
Sbjct: 421 AAPAPAAAPQPAPAP-APAPAPPSPAGNAPAGGAPSPPPAAAPSAQP 466
Score = 33.4 bits (77), Expect = 0.58
Identities = 14/45 (31%), Positives = 16/45 (35%), Gaps = 4/45 (8%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
PP GAP P PP PA A P P + E+
Sbjct: 747 DPPDPAGAPAQPPPPPAPAPAAAPA----AAPPPSPPSEEEEMAE 787
Score = 33.4 bits (77), Expect = 0.64
Identities = 9/53 (16%), Positives = 10/53 (18%), Gaps = 2/53 (3%)
Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP--PQPVQPPE 75
P A + PA P PQP P
Sbjct: 384 RLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA 436
Score = 33.0 bits (76), Expect = 0.71
Identities = 15/50 (30%), Positives = 15/50 (30%), Gaps = 6/50 (12%)
Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
G AG P AP P P AP A P P PP
Sbjct: 667 DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAP------AQPAPAPAATPP 710
Score = 32.7 bits (75), Expect = 0.99
Identities = 13/47 (27%), Positives = 13/47 (27%), Gaps = 1/47 (2%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A P P P P A PAP P P P P
Sbjct: 458 PAAAPSAQPAPAPAAAPEPTAAPAPAPPAA-PAPAAAPAAPAAPAAP 503
Score = 32.7 bits (75), Expect = 1.0
Identities = 16/48 (33%), Positives = 16/48 (33%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
GAP P AP AP A P A P P P P P
Sbjct: 451 GGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA 498
Score = 32.7 bits (75), Expect = 1.1
Identities = 12/44 (27%), Positives = 13/44 (29%), Gaps = 1/44 (2%)
Query: 33 APPPGAPGAP-PGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
AP AP P P P + A G P P P
Sbjct: 424 APAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPA 467
Score = 31.5 bits (72), Expect = 2.3
Identities = 13/43 (30%), Positives = 13/43 (30%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
P AP P AP PA A P V PE
Sbjct: 609 PEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPE 651
Score = 31.5 bits (72), Expect = 2.6
Identities = 12/45 (26%), Positives = 14/45 (31%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
AP P AP AP G A + QP Q +
Sbjct: 682 PPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQ 726
Score = 31.1 bits (71), Expect = 2.8
Identities = 13/47 (27%), Positives = 14/47 (29%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A A +P P A P PA P PP P P
Sbjct: 448 APAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAP 494
Score = 31.1 bits (71), Expect = 2.9
Identities = 11/54 (20%), Positives = 11/54 (20%), Gaps = 5/54 (9%)
Query: 22 FTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
A G P A PA P A P P P
Sbjct: 584 VEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAA-----PAAPAAPAPAGA 632
Score = 31.1 bits (71), Expect = 3.1
Identities = 19/54 (35%), Positives = 19/54 (35%), Gaps = 1/54 (1%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP-PQPVQPPELPMFNCPRRP 85
AP P APP P A AP P QP P P PE P P
Sbjct: 432 APAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPP 485
Score = 31.1 bits (71), Expect = 3.2
Identities = 13/45 (28%), Positives = 14/45 (31%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
PP G P AP A P A P P+P P
Sbjct: 437 PAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPA 481
Score = 30.3 bits (69), Expect = 5.1
Identities = 14/49 (28%), Positives = 14/49 (28%), Gaps = 4/49 (8%)
Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP 73
A A P GA A P P P QPPQ Q
Sbjct: 683 PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADD----PAAQPPQAAQG 727
Score = 30.0 bits (68), Expect = 6.6
Identities = 13/55 (23%), Positives = 14/55 (25%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
A P AP P P +P G PA QP P
Sbjct: 427 AAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPA 481
Score = 30.0 bits (68), Expect = 6.9
Identities = 12/47 (25%), Positives = 12/47 (25%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A AP P AP P GA P P P
Sbjct: 426 AAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAA 472
Score = 30.0 bits (68), Expect = 7.2
Identities = 14/51 (27%), Positives = 19/51 (37%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRR 84
P P A P P P AP A PP++ + + M + RR
Sbjct: 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRR 799
Score = 29.6 bits (67), Expect = 8.1
Identities = 13/28 (46%), Positives = 13/28 (46%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMG 60
AP P A AP A P A AP GA
Sbjct: 481 APAPPAAPAPAAAPAAPAAPAAPAGADD 508
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 35.5 bits (82), Expect = 0.019
Identities = 17/44 (38%), Positives = 17/44 (38%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
P G APP PP PAPPG PP P E P
Sbjct: 60 PAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPP 103
Score = 32.4 bits (74), Expect = 0.26
Identities = 15/45 (33%), Positives = 16/45 (35%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A PP PP APPG P PP + P P P
Sbjct: 66 AAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPGPP 110
>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 484
Score = 38.0 bits (89), Expect = 0.019
Identities = 26/140 (18%), Positives = 39/140 (27%), Gaps = 9/140 (6%)
Query: 30 CAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP-------PELPMFNCP 82
A A P A P V+ P A A P + + + P P P
Sbjct: 57 AAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAP 116
Query: 83 RRPNLGREGRPIVLRANHFQISMPRGFVHHYEIQIQPDKCPRKVNREIIETMVHAYSKIF 142
P R + PR V D ++ R + +T++ +
Sbjct: 117 AAPAERAAAENAARRLARAAAAAPRPRVPADAAAAVADAVKARIERIVNDTVMQELRSLR 176
Query: 143 GALKPVFDG--RNNLYTRDP 160
G L+ RDP
Sbjct: 177 GMLEEQLASLAWGERQRRDP 196
Score = 31.1 bits (71), Expect = 2.7
Identities = 15/64 (23%), Positives = 18/64 (28%), Gaps = 2/64 (3%)
Query: 36 PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRPIV 95
P A AP PP A PA P V+ + R + R P
Sbjct: 53 PPAAAAPAAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLT--AQREQLVARAAAPAA 110
Query: 96 LRAN 99
A
Sbjct: 111 PEAQ 114
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 38.2 bits (89), Expect = 0.020
Identities = 14/45 (31%), Positives = 15/45 (33%)
Query: 32 GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL 76
PPG G P P P A P LI P +PP
Sbjct: 700 PGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGA 744
Score = 29.4 bits (66), Expect = 9.6
Identities = 16/62 (25%), Positives = 19/62 (30%), Gaps = 3/62 (4%)
Query: 35 PPGAPGAPPGPVAPPGAVPA---PPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREG 91
G G P P P +PA P + P +P P P P G G
Sbjct: 650 RGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPTAHHAALRAPQAPRPGGPPGGGGG 709
Query: 92 RP 93
P
Sbjct: 710 LP 711
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 38.1 bits (88), Expect = 0.024
Identities = 18/54 (33%), Positives = 19/54 (35%), Gaps = 4/54 (7%)
Query: 28 TKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP----PQPVQPPELP 77
PP APG P A PG P A G P P P PP+ P
Sbjct: 730 APGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAP 783
Score = 37.7 bits (87), Expect = 0.033
Identities = 19/54 (35%), Positives = 21/54 (38%), Gaps = 4/54 (7%)
Query: 28 TKCAGAPPPGAPGAPPGPVAPPGA-VPAPPGAMGLIPPNQP---PQPVQPPELP 77
PP APG P A PGA P PP P +P P P PP+
Sbjct: 750 APGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803
Score = 37.4 bits (86), Expect = 0.035
Identities = 22/62 (35%), Positives = 23/62 (37%), Gaps = 2/62 (3%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-PPQPVQPPE-LPMFNCPRRPNLGREG 91
PPP AP P APPG P A G P P +PP P P GR
Sbjct: 696 PPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 755
Query: 92 RP 93
P
Sbjct: 756 PP 757
Score = 35.8 bits (82), Expect = 0.12
Identities = 18/61 (29%), Positives = 19/61 (31%), Gaps = 3/61 (4%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-PPQPVQPPELPMFNCPRRPNLGREGR 92
P A G P A PG P A G P P +PP RP G
Sbjct: 716 RPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA--PGRARPPAAAPGA 773
Query: 93 P 93
P
Sbjct: 774 P 774
Score = 32.3 bits (73), Expect = 1.4
Identities = 29/89 (32%), Positives = 32/89 (35%), Gaps = 18/89 (20%)
Query: 23 TCGGQTKCAGAPPP-----GAPGAP---PGPVAPPGAVPAPPGAMGLIPPNQ-PPQPVQ- 72
G A AP APGAP P P APP P GA PP Q P +Q
Sbjct: 750 APGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQL 809
Query: 73 -PPELPMFNCPRRPNLG-------REGRP 93
P P P + L + GRP
Sbjct: 810 MPRAAPGQQGPTKQILRQLLTGGVKRGRP 838
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 37.8 bits (88), Expect = 0.024
Identities = 20/94 (21%), Positives = 30/94 (31%), Gaps = 2/94 (2%)
Query: 26 GQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
Q+ A P P P +V PP A+ + PP+ PQ V+P +
Sbjct: 403 SQSSAAAQPSAPQSATQPAGTPPTVSVD-PPAAVPVNPPSTAPQAVRPAQFKEEKKIPVS 461
Query: 86 NLGREGRPIVLRANHFQISMPRGFVHHYEIQIQP 119
+ G LR + G + Q
Sbjct: 462 KVSSLGPS-TLRPIQEKAEQATGNIKEAPTGTQK 494
>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
Validated.
Length = 598
Score = 37.6 bits (88), Expect = 0.031
Identities = 23/93 (24%), Positives = 27/93 (29%), Gaps = 15/93 (16%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN---- 86
GAP A A PG A A P A+ +P P R N
Sbjct: 405 GGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPAAAPQPAVRLNSFED 464
Query: 87 ---LGREGRPIVLRANHFQISMPRGFVH--HYE 114
L E R I L+A V +E
Sbjct: 465 IVALAEEKRDIKLKA------AVERDVRLVRFE 491
Score = 32.2 bits (74), Expect = 1.5
Identities = 11/63 (17%), Positives = 13/63 (20%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE 90
+P G G P G AP A P +P
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPPAPA 450
Query: 91 GRP 93
P
Sbjct: 451 AAP 453
Score = 31.0 bits (71), Expect = 3.2
Identities = 7/46 (15%), Positives = 7/46 (15%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL 76
PP G P A P L
Sbjct: 388 QEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAAL 433
Score = 30.6 bits (70), Expect = 4.0
Identities = 13/53 (24%), Positives = 15/53 (28%), Gaps = 1/53 (1%)
Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A A P A AP P A+ A P A P P +
Sbjct: 407 APGAPAAAAAPGAAAAAPAAG-GPAAALAAVPDAAAAAAAPPAPAAAPQPAVR 458
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 37.4 bits (87), Expect = 0.035
Identities = 14/47 (29%), Positives = 14/47 (29%)
Query: 28 TKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
K P AP A P A PA A P PP P
Sbjct: 378 KKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPA 424
Score = 35.5 bits (82), Expect = 0.14
Identities = 16/47 (34%), Positives = 16/47 (34%), Gaps = 2/47 (4%)
Query: 33 APPPGA--PGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A P A A P APP V AP A P P V P
Sbjct: 405 AAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAP 451
Score = 34.7 bits (80), Expect = 0.21
Identities = 14/50 (28%), Positives = 15/50 (30%)
Query: 26 GQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
APP AP AP A AP A + P PE
Sbjct: 409 AAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPE 458
Score = 32.0 bits (73), Expect = 1.5
Identities = 16/53 (30%), Positives = 20/53 (37%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
A P AP A P AP A A P A+ L P + +P+ P
Sbjct: 419 AAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPA 471
Score = 32.0 bits (73), Expect = 1.5
Identities = 14/53 (26%), Positives = 14/53 (26%), Gaps = 5/53 (9%)
Query: 31 AGAPPPG-----APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
A A P A P A P A A A P P PP
Sbjct: 370 AEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAP 422
Score = 31.2 bits (71), Expect = 3.1
Identities = 16/45 (35%), Positives = 17/45 (37%), Gaps = 3/45 (6%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
A P A P P A P A + P A P PP PV P
Sbjct: 389 PAAAPVAQAAAAPAPAAAPAAAASAPAA---PPAAAPPAPVAAPA 430
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 37.4 bits (87), Expect = 0.035
Identities = 32/116 (27%), Positives = 39/116 (33%), Gaps = 22/116 (18%)
Query: 23 TCGGQTKCAGAPPPGAPGAPPGPVAPP--GAVPAPPGAMGLIPPNQPPQPVQPPELPMFN 80
+ GAP P AP APP APP A PA P A P PP PP
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADP 440
Query: 81 CPR-----------------RPNLG---REGRPIVLRANHFQISMPRGFVHHYEIQ 116
+P LG +G P+ + A +I P G +Q
Sbjct: 441 AAAASAGDRWRAFVAFVKGKKPALGASLEQGSPLGVSAGLLEIGFPEGSFELSAMQ 496
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 36.8 bits (85), Expect = 0.054
Identities = 21/75 (28%), Positives = 29/75 (38%), Gaps = 1/75 (1%)
Query: 4 PFPTTPDISNDLNFGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIP 63
P P S+ L S + + P P P P APP ++PAPP +
Sbjct: 193 SLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPS-PAQPSAPPASIPAPPIPPVIQY 251
Query: 64 PNQPPQPVQPPELPM 78
PP P P +P+
Sbjct: 252 VAPPPVPPPQPIIPI 266
Score = 30.3 bits (68), Expect = 5.4
Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 6/51 (11%)
Query: 35 PPGAPGAPP-GPVAPPGAVPAPPGAMGLIPPNQ-----PPQPVQPPELPMF 79
PP + APP PV A P P +IP P P ++P++
Sbjct: 236 PPASIPAPPIPPVIQYVAPPPVPPPQPIIPIQHIRAVTGETPANPRDIPLW 286
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 36.8 bits (85), Expect = 0.059
Identities = 13/47 (27%), Positives = 14/47 (29%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A P P A A AP A + PP PPE
Sbjct: 461 AARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507
Score = 34.5 bits (79), Expect = 0.29
Identities = 12/44 (27%), Positives = 12/44 (27%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
P AP A P A P A P A P P
Sbjct: 446 GAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAP 489
Score = 34.1 bits (78), Expect = 0.42
Identities = 18/57 (31%), Positives = 19/57 (33%), Gaps = 10/57 (17%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP-------PQPVQPP---ELP 77
A AP P AP P A P P A P + P PP ELP
Sbjct: 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELP 503
Score = 33.3 bits (76), Expect = 0.68
Identities = 14/50 (28%), Positives = 14/50 (28%), Gaps = 3/50 (6%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAP---PGAMGLIPPNQPPQPVQPPELP 77
A PG P P P A PA P A G P P
Sbjct: 436 ARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAP 485
Score = 32.9 bits (75), Expect = 0.92
Identities = 19/67 (28%), Positives = 21/67 (31%), Gaps = 2/67 (2%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN 86
Q A A P A AP P A P A AP A P P + +
Sbjct: 384 QPAPAAAAPAAAAPAPAAPPAAPAA--APAAAAAARAVAAAPARRSPAPEALAAARQASA 441
Query: 87 LGREGRP 93
G G P
Sbjct: 442 RGPGGAP 448
Score = 31.0 bits (70), Expect = 3.0
Identities = 13/68 (19%), Positives = 16/68 (23%), Gaps = 1/68 (1%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIP-PNQPPQPVQPPELPMFNCPRRPNLGREG 91
P G + P L P P P P PR P G
Sbjct: 513 QPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
Query: 92 RPIVLRAN 99
P + +
Sbjct: 573 LPDMFDGD 580
Score = 31.0 bits (70), Expect = 3.2
Identities = 12/48 (25%), Positives = 13/48 (27%), Gaps = 1/48 (2%)
Query: 31 AGAPPPGA-PGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
+P P A A PG PAP A P P
Sbjct: 425 RRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPV 472
Score = 31.0 bits (70), Expect = 3.3
Identities = 10/48 (20%), Positives = 11/48 (22%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
Q G AP P A PA G + P
Sbjct: 438 QASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAP 485
Score = 29.5 bits (66), Expect = 9.6
Identities = 16/57 (28%), Positives = 16/57 (28%), Gaps = 1/57 (1%)
Query: 22 FTCGGQTKCAGAPPP-GAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
F G AG AP A P P A A AP A P P
Sbjct: 363 FRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAV 419
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 36.7 bits (85), Expect = 0.059
Identities = 18/61 (29%), Positives = 22/61 (36%), Gaps = 3/61 (4%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRP 93
PP P PG A PP L+P Q PQ Q P F + L ++ R
Sbjct: 277 PPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRG--PQFR-EQLVQLSQQQRE 333
Query: 94 I 94
Sbjct: 334 A 334
Score = 35.5 bits (82), Expect = 0.13
Identities = 18/55 (32%), Positives = 19/55 (34%), Gaps = 9/55 (16%)
Query: 32 GAPPPGAPGAPPGPVAPPGAVPAPPGAMGL---------IPPNQPPQPVQPPELP 77
G PP A GP P P PP P P QP PP+LP
Sbjct: 183 GMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLP 237
Score = 33.2 bits (76), Expect = 0.68
Identities = 20/113 (17%), Positives = 31/113 (27%), Gaps = 7/113 (6%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP----PQPVQP--PELPMFNCPRR 84
P + PP P PP PP PP P Q LP P+
Sbjct: 245 QPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNAPLPPPQQPQL 304
Query: 85 PNLGREGRPIVLRANHFQISMPRGFVHHYEIQIQPDKCPRKVNREIIETMVHA 137
L ++ + + + E Q + K +I+ +
Sbjct: 305 LPLVQQPQGQQRGPQFREQLVQLSQQQR-EALSQEEAKRAKRRHKIVSLSKYN 356
Score = 32.4 bits (74), Expect = 1.4
Identities = 12/42 (28%), Positives = 13/42 (30%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
AP P P P P + PP P QPP
Sbjct: 222 APSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPP 263
Score = 31.7 bits (72), Expect = 2.4
Identities = 20/58 (34%), Positives = 23/58 (39%), Gaps = 4/58 (6%)
Query: 31 AGAPPPGAPGAPPGP---VAPPGAVPAPPGAMGLIP-PNQPPQPVQPPELPMFNCPRR 84
PPG P P G V P +PAP A P P Q PQ P + P F +
Sbjct: 196 PPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQ 253
Score = 31.3 bits (71), Expect = 2.5
Identities = 13/63 (20%), Positives = 16/63 (25%), Gaps = 2/63 (3%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL-PMFNCPRRPNLGREG 91
P P + P A G PP P P PE P+
Sbjct: 171 QLPQPPQQVLPQGMPPRQAAFPQQGPPE-QPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQ 229
Query: 92 RPI 94
P+
Sbjct: 230 PPL 232
Score = 30.9 bits (70), Expect = 3.6
Identities = 15/95 (15%), Positives = 20/95 (21%), Gaps = 8/95 (8%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQ--PVQPPELPMFNCPRR 84
Q + P PP V P G P PP QPP +
Sbjct: 162 QLQQRQQAP--QLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQF 219
Query: 85 PNLGREGRPIVLRANHFQISMPRGFVHHYEIQIQP 119
+P+ + Q
Sbjct: 220 LP----APSQAPAQPPLPPQLPQQPPPLQQPQFPG 250
Score = 29.7 bits (67), Expect = 8.2
Identities = 13/55 (23%), Positives = 17/55 (30%), Gaps = 2/55 (3%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ--PPQPVQPPELPMFNCPRRPN 86
P P P P P + + PP P Q QPP+ P+
Sbjct: 228 AQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQP 282
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 36.0 bits (83), Expect = 0.086
Identities = 23/67 (34%), Positives = 24/67 (35%), Gaps = 11/67 (16%)
Query: 19 NSIFTCGGQTKCAGAPPPGAPGAP-PGPVA---PP---GAVPAP----PGAMGLIPPNQP 67
N + G A PPG G P PG PP AVP P PG PP
Sbjct: 126 NPVTATGEPVPQMPASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTAFYPGNGVTPPPQVT 185
Query: 68 PQPVQPP 74
Q V P
Sbjct: 186 YQSVPVP 192
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 36.0 bits (83), Expect = 0.10
Identities = 25/120 (20%), Positives = 33/120 (27%), Gaps = 17/120 (14%)
Query: 2 AIPFPTTPDISNDLNFGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGL 61
A P P D++ LN A A GAP P P P
Sbjct: 354 APPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPEPAR---- 409
Query: 62 IPPNQPPQPVQPPELPMFNCPRRPNLGREGRPIVLRANHFQISMPRGFVHHYEIQIQPDK 121
+ P V P P R + VL A Q+ R F+ ++
Sbjct: 410 --SAEAPPLVAPAAAPAGLALRWRD--------VLAALKMQL---RAFLREARPHVEEGY 456
>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
Length = 94
Score = 33.2 bits (75), Expect = 0.12
Identities = 17/40 (42%), Positives = 18/40 (45%), Gaps = 7/40 (17%)
Query: 37 GAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL 76
AP PP P AP P PP PP PP PV P +
Sbjct: 5 RAP-KPPEPPAPLPPAPVPP------PPPAPPAPVPEPTV 37
Score = 28.6 bits (63), Expect = 3.7
Identities = 12/26 (46%), Positives = 12/26 (46%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGA 58
AP P P APP PV P P A
Sbjct: 19 APVPPPPPAPPAPVPEPTVKPVNAEA 44
Score = 28.6 bits (63), Expect = 3.7
Identities = 14/23 (60%), Positives = 14/23 (60%), Gaps = 1/23 (4%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAP 55
AP P AP PP P APP VP P
Sbjct: 14 APLPPAP-VPPPPPAPPAPVPEP 35
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 35.7 bits (82), Expect = 0.14
Identities = 14/26 (53%), Positives = 15/26 (57%)
Query: 48 PPGAVPAPPGAMGLIPPNQPPQPVQP 73
PPG P PP G PP+QPP P P
Sbjct: 5 PPGNPPPPPPPPGFEPPSQPPPPPPP 30
>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597). This
family of proteins is found in bacteria, eukaryotes and
viruses. Proteins in this family are typically between
126 and 281 amino acids in length. The function of this
domain is unknown. The structure of this domain has
been found to contain five helices with a long flexible
loop between helices one and two.
Length = 124
Score = 33.5 bits (77), Expect = 0.15
Identities = 12/40 (30%), Positives = 14/40 (35%), Gaps = 5/40 (12%)
Query: 19 NSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGA 58
N+IF + A A P A P A A PA
Sbjct: 7 NAIFG-----EAAAAAPAPAAAPATAPAAAAAAAPAATPP 41
Score = 30.8 bits (70), Expect = 1.1
Identities = 10/34 (29%), Positives = 10/34 (29%)
Query: 38 APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPV 71
A A P P A P PA A V
Sbjct: 13 AAAAAPAPAAAPATAPAAAAAAAPAATPPAAASV 46
Score = 29.2 bits (66), Expect = 4.1
Identities = 6/25 (24%), Positives = 6/25 (24%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAP 55
A AP A A A
Sbjct: 21 AAAPATAPAAAAAAAPAATPPAAAS 45
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 35.6 bits (83), Expect = 0.15
Identities = 14/49 (28%), Positives = 14/49 (28%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
A A P AP A P APP A A P P
Sbjct: 62 PAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAA 110
Score = 34.1 bits (79), Expect = 0.35
Identities = 13/45 (28%), Positives = 14/45 (31%), Gaps = 1/45 (2%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
A A A P A A PAP P PP+P
Sbjct: 48 AAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPA-APPKPAAAAA 91
Score = 33.7 bits (78), Expect = 0.55
Identities = 13/45 (28%), Positives = 13/45 (28%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
P AP APP P A A AP P E
Sbjct: 72 PAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116
Score = 33.3 bits (77), Expect = 0.71
Identities = 11/47 (23%), Positives = 11/47 (23%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A P A P P P A PA P P
Sbjct: 58 PAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAA 104
Score = 32.9 bits (76), Expect = 0.91
Identities = 12/50 (24%), Positives = 12/50 (24%)
Query: 28 TKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A A A AP P A A P P A P
Sbjct: 58 PAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAA 107
Score = 32.9 bits (76), Expect = 1.0
Identities = 12/44 (27%), Positives = 12/44 (27%), Gaps = 4/44 (9%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
A A A AP PA P P P PP
Sbjct: 45 TAAAAAAAAAASAPAAAPAAKAPAAPAP----APPAAAAPAAPP 84
Score = 31.8 bits (73), Expect = 1.9
Identities = 12/51 (23%), Positives = 13/51 (25%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
+ A AP P A AP P P A A P
Sbjct: 65 KAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
Score = 31.0 bits (71), Expect = 4.0
Identities = 12/46 (26%), Positives = 13/46 (28%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQ 72
A P AP P A A APP A P +
Sbjct: 71 APAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116
Score = 29.9 bits (68), Expect = 8.5
Identities = 11/39 (28%), Positives = 13/39 (33%)
Query: 36 PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
PG+ AP A A + P A P P P
Sbjct: 38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPA 76
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 33.6 bits (77), Expect = 0.16
Identities = 18/46 (39%), Positives = 21/46 (45%), Gaps = 2/46 (4%)
Query: 31 AGAPPPGAPGAPP--GPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
A +P P P PP P PP A P PP+ PPQP + P
Sbjct: 96 AQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141
Score = 33.3 bits (76), Expect = 0.22
Identities = 18/50 (36%), Positives = 19/50 (38%), Gaps = 5/50 (10%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-----PPQPVQPPELP 77
PP AP P PP + PA P A P P P QPP P
Sbjct: 92 TPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141
Score = 32.9 bits (75), Expect = 0.25
Identities = 16/44 (36%), Positives = 17/44 (38%), Gaps = 3/44 (6%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
P P P PP +P AVP PP P PP P P
Sbjct: 85 PAPPEPVTPPTAQSPAPAVPTPPPTST---PAVPPAPAAAVPAP 125
>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP). This entry
represents proteins encoded by genes which are always
found in type III secretion operons, although their
function in the processes of secretion and virulence is
unclear. Hpa stands for Hrp-associated gene, where Hrp
stands for hypersensitivity response and virulence. see
also PMID:18584024.
Length = 185
Score = 34.0 bits (78), Expect = 0.17
Identities = 14/43 (32%), Positives = 17/43 (39%)
Query: 30 CAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQ 72
APP A A P P PAPP ++ PQP+
Sbjct: 30 GTPAPPGPAEDAHPEFPERPRDAPAPPAPPRATDGDRDPQPLA 72
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 34.7 bits (80), Expect = 0.18
Identities = 14/42 (33%), Positives = 19/42 (45%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
PP +P +P P PPG PP + +PP +P P
Sbjct: 222 DTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPS 263
Score = 32.3 bits (74), Expect = 1.1
Identities = 17/60 (28%), Positives = 18/60 (30%), Gaps = 5/60 (8%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAV-----PAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
+ PG P P P P PAP PP P P P P P P
Sbjct: 187 PPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPP 246
Score = 30.8 bits (70), Expect = 2.6
Identities = 9/56 (16%), Positives = 10/56 (17%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN 86
A P P P + PP P P P
Sbjct: 177 ADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPT 232
>gnl|CDD|189968 pfam01391, Collagen, Collagen triple helix repeat (20 copies).
Members of this family belong to the collagen
superfamily. Collagens are generally extracellular
structural proteins involved in formation of connective
tissue structure. The alignment contains 20 copies of
the G-X-Y repeat that forms a triple helix. The first
position of the repeat is glycine, the second and third
positions can be any residue but are frequently proline
and hydroxyproline. Collagens are post translationally
modified by proline hydroxylase to form the
hydroxyproline residues. Defective hydroxylation is the
cause of scurvy. Some members of the collagen
superfamily are not involved in connective tissue
structure but share the same triple helical structure.
Length = 60
Score = 31.3 bits (72), Expect = 0.21
Identities = 26/50 (52%), Positives = 26/50 (52%), Gaps = 4/50 (8%)
Query: 31 AGAP-PPGAPGAPPGPVAPPGAV--PAPPGAMGLIPPNQPPQPVQPPELP 77
G P PPG PG PPGP PPG P PPG G P PP P PP P
Sbjct: 3 PGPPGPPGPPG-PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGPP 51
Score = 27.5 bits (62), Expect = 5.3
Identities = 17/29 (58%), Positives = 17/29 (58%), Gaps = 4/29 (13%)
Query: 31 AGAP-PPGAPGAP--PGPVAPPGAVPAPP 56
G P PPG PG P PGP PGA P PP
Sbjct: 33 PGPPGPPGPPGPPGPPGPPGAPGA-PGPP 60
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 32.9 bits (75), Expect = 0.21
Identities = 9/34 (26%), Positives = 11/34 (32%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPP 64
G + APP P P P A G+
Sbjct: 8 GGVSSGSSAPAPPAGPGPGPNAPPAPAAPGVDSS 41
Score = 28.2 bits (63), Expect = 8.3
Identities = 12/40 (30%), Positives = 16/40 (40%), Gaps = 2/40 (5%)
Query: 23 TCGGQTKCAGAPPPGAPGAPPGPVAPPGA--VPAPPGAMG 60
T G + + PP PG P P A V + G+ G
Sbjct: 7 TGGVSSGSSAPAPPAGPGPGPNAPPAPAAPGVDSSAGSSG 46
>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
(DUF2076). This domain, found in various hypothetical
prokaryotic proteins, has no known function. The domain,
however, is found in various periplasmic ligand-binding
sensor proteins.
Length = 234
Score = 33.9 bits (78), Expect = 0.23
Identities = 15/51 (29%), Positives = 19/51 (37%), Gaps = 1/51 (1%)
Query: 17 FGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP 67
F + +F GG + A P P APP P+ GA QP
Sbjct: 82 FLSGMF-GGGAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQP 131
Score = 30.4 bits (69), Expect = 3.1
Identities = 10/43 (23%), Positives = 11/43 (25%)
Query: 32 GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
G P P A P P G+ G QP
Sbjct: 88 GGGAPRPPPAAPAVQPPAPPARPGWGSGGPSQQGAGQQPGYAQ 130
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 34.8 bits (80), Expect = 0.26
Identities = 15/45 (33%), Positives = 16/45 (35%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A AP A P A A +PP PP P Q E P
Sbjct: 48 AKAAEQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPAQEGEAP 92
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 34.4 bits (79), Expect = 0.28
Identities = 14/54 (25%), Positives = 14/54 (25%), Gaps = 7/54 (12%)
Query: 31 AGAPPPGAPGAPPG--PVAPPGAVPAP-----PGAMGLIPPNQPPQPVQPPELP 77
A P GAP G P GAVPAP P
Sbjct: 357 AFEPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAAL 410
Score = 32.9 bits (75), Expect = 0.86
Identities = 17/51 (33%), Positives = 19/51 (37%), Gaps = 1/51 (1%)
Query: 26 GQTKCAGAPPPGAPGAPPGPVAPPGAVPAP-PGAMGLIPPNQPPQPVQPPE 75
G A A P AP A P P AP AV P P A P + +
Sbjct: 568 GARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQ 618
Score = 30.6 bits (69), Expect = 4.3
Identities = 21/76 (27%), Positives = 23/76 (30%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGR 92
AP AP A P PA PP P P P P G
Sbjct: 493 APRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAAA 552
Query: 93 PIVLRANHFQISMPRG 108
VLR ++S RG
Sbjct: 553 LDVLRNAGMRVSSDRG 568
Score = 29.8 bits (67), Expect = 7.6
Identities = 13/48 (27%), Positives = 13/48 (27%), Gaps = 5/48 (10%)
Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQ 72
G A GA GA P A A A P P P
Sbjct: 392 GASAVPAVTAVTGAAGAALAPKAAAAAAATRAEA-----PPAAPAPPA 434
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 34.3 bits (78), Expect = 0.30
Identities = 22/69 (31%), Positives = 28/69 (40%), Gaps = 4/69 (5%)
Query: 32 GAPPPGAPGA-PPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL-PMFNCPRRPNLGR 89
P AP PGP+ P P PP + + PP V P E+ P F R + R
Sbjct: 359 PPLPQAAPAVVKPGPMEIP--TPVPPPGLAIPSLVAPPGLVAPTEINPSFLASPRKKMKR 416
Query: 90 EGRPIVLRA 98
E P+ A
Sbjct: 417 EKLPVTFGA 425
Score = 31.2 bits (70), Expect = 2.6
Identities = 12/45 (26%), Positives = 16/45 (35%), Gaps = 3/45 (6%)
Query: 37 GAPGAPPGPVAPPGAVPA-PPGAMGLIPPN--QPPQPVQPPELPM 78
G VP P A ++ P + P PV PP L +
Sbjct: 343 GNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVPPPGLAI 387
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 34.1 bits (78), Expect = 0.31
Identities = 15/59 (25%), Positives = 17/59 (28%), Gaps = 7/59 (11%)
Query: 29 KCAGAPPPGAPGAPPG--PVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
K P AP +P PP AP P E P + PRR
Sbjct: 11 KGEAEQPAPAPPSPAAAPAPPPPAKTAAPA-----TKAAAPAAAAPRAEKPKKDKPRRE 64
Score = 29.9 bits (67), Expect = 7.7
Identities = 9/50 (18%), Positives = 12/50 (24%)
Query: 44 GPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRP 93
G P P P A PP R ++ +P
Sbjct: 12 GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKP 61
>gnl|CDD|218350 pfam04959, ARS2, Arsenite-resistance protein 2. Arsenite is a
carcinogenic compound which can act as a co-mutagen by
inhibiting DNA repair. Arsenite-resistance protein 2 is
thought to play a role in arsenite resistance.
Length = 211
Score = 33.2 bits (76), Expect = 0.35
Identities = 20/70 (28%), Positives = 22/70 (31%), Gaps = 6/70 (8%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPG-----AMGLIPPNQPPQPVQPPELPMFNCPRRPNL 87
A P P PP P P G P PG L+P QP P+ P
Sbjct: 126 ALPEIKPLQPPKP-DPGGLAPGLPGYPPQTPQALMPYGQPRPPMMGYGRGGPPFPPNQYG 184
Query: 88 GREGRPIVLR 97
G G R
Sbjct: 185 GGRGNYDEFR 194
>gnl|CDD|235826 PRK06549, PRK06549, acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 130
Score = 32.1 bits (73), Expect = 0.43
Identities = 13/44 (29%), Positives = 14/44 (31%), Gaps = 5/44 (11%)
Query: 32 GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
GAP A P P + P VP Q PQP
Sbjct: 22 GAPAQAAA--PAQPASTPVPVPTEASPQ---VEAQAPQPAAAAG 60
>gnl|CDD|165142 PHA02777, PHA02777, major capsid L1 protein; Provisional.
Length = 555
Score = 33.9 bits (77), Expect = 0.43
Identities = 18/83 (21%), Positives = 24/83 (28%), Gaps = 10/83 (12%)
Query: 25 GGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRR 84
GG + AGA P P G + P +P A L + P P+ R
Sbjct: 65 GGTSLGAGAIGSARPSIPAGAIGPGDIIPIDAEAGALAEEIEL-----LPMAPISGRARD 119
Query: 85 PNLGREGRPIVLRANHFQISMPR 107
P I +P
Sbjct: 120 PF-----GEGFGEEPFDPIGIPP 137
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 33.6 bits (77), Expect = 0.44
Identities = 22/72 (30%), Positives = 25/72 (34%), Gaps = 6/72 (8%)
Query: 31 AGAPPPG----APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPN 86
A AP P AP P A P P PP PP+PV PP P
Sbjct: 373 AAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHT--PESAPK 430
Query: 87 LGREGRPIVLRA 98
L R P+ +
Sbjct: 431 LTRAAIPVDEKP 442
Score = 32.5 bits (74), Expect = 1.00
Identities = 15/48 (31%), Positives = 16/48 (33%), Gaps = 3/48 (6%)
Query: 33 APPPGAPGAP-PGPVAP-PGAVPAPPGAMGLIPPNQPPQP-VQPPELP 77
AP P P A P PV P P P A P + P P
Sbjct: 365 APQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPV 412
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 33.8 bits (77), Expect = 0.44
Identities = 30/133 (22%), Positives = 42/133 (31%), Gaps = 16/133 (12%)
Query: 36 PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGRE--GRP 93
P PP P PG + + PPN PP P P + P + + P
Sbjct: 355 PDESNVPPNPPNVPGGSNSEFSSDVENPPN-PPNPDIPEQEPNIPEDSNKEVPEDVPMEP 413
Query: 94 IVLRANHFQISMPRGFVHHYEIQIQPDKCPRKVNREIIETMVHAYSKIFGALKPVFDG-R 152
R N+F P+ +P+ N +I + K V G R
Sbjct: 414 EDDRDNNFNE--PK----------KPENKGDGQNEPVIPKPLDNERDQSNKNKQVNPGNR 461
Query: 153 NNLYTRDPLPIGN 165
+N R P G
Sbjct: 462 HNSEDRYTRPHGR 474
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 33.3 bits (76), Expect = 0.45
Identities = 17/52 (32%), Positives = 20/52 (38%), Gaps = 6/52 (11%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQP-PQPVQPPE--LPMFNCP 82
PPP P +PP A PAP PP P P + P+ P P
Sbjct: 155 PPPPHAMPPA---SPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKSP 203
Score = 32.1 bits (73), Expect = 1.1
Identities = 15/44 (34%), Positives = 15/44 (34%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
PPP AP P P PA P A P P P P
Sbjct: 144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASP 187
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 33.3 bits (76), Expect = 0.47
Identities = 10/53 (18%), Positives = 14/53 (26%), Gaps = 8/53 (15%)
Query: 28 TKCAGAPPPGAPGAPPGPVA------PPGAVPAPPGAMGLIPPNQPPQPVQPP 74
APP P + P P G AP A+ + +
Sbjct: 80 GTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAV--EEKSPSEESATAT 130
>gnl|CDD|218646 pfam05590, DUF769, Xylella fastidiosa protein of unknown function
(DUF769). This family consists of several
uncharacterized hypothetical proteins of unknown
function from Xylella fastidiosa, the organism that
causes Pierce's disease in plants.
Length = 255
Score = 33.3 bits (76), Expect = 0.49
Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 5/45 (11%)
Query: 225 PSMTYTPVGRSFFSTPDGYYHPL-----GGGREVWFGFHQSVRPS 264
S Y P G + PDG + PL GG ++ FG + S
Sbjct: 100 TSNEYQPDGDTVLRNPDGKFDPLCTAEFVGGNDIQFGIRSAASES 144
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 33.1 bits (76), Expect = 0.49
Identities = 13/42 (30%), Positives = 17/42 (40%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
PP GA A A ++ A P P PV+PP+
Sbjct: 67 PPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVEPPK 108
Score = 31.5 bits (72), Expect = 1.6
Identities = 15/45 (33%), Positives = 17/45 (37%), Gaps = 4/45 (8%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
APP P PV PP P P +P Q V+ P P
Sbjct: 91 APPNTPVEPEPAPVEPPKPKPVEK----PKPKPKPQQKVEAPPAP 131
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 32.2 bits (73), Expect = 0.52
Identities = 23/62 (37%), Positives = 24/62 (38%), Gaps = 7/62 (11%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPP-----GAMGLIPPNQPP--QPVQPPELPMFNCPRRPN 86
P PG G PP P PG PP G G+ PP P P PP P P P
Sbjct: 89 PLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPP 148
Query: 87 LG 88
G
Sbjct: 149 FG 150
Score = 31.8 bits (72), Expect = 0.74
Identities = 22/47 (46%), Positives = 23/47 (48%), Gaps = 5/47 (10%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A PPP P P GP PP P PGA G+ PP PP PP P
Sbjct: 107 AAVPPP--PPLPGGPGVPPPP-PPFPGAPGIPPP--PPGMGSPPPPP 148
Score = 29.1 bits (65), Expect = 5.6
Identities = 17/37 (45%), Positives = 17/37 (45%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQP 70
PPP PGAP P PPG PP G P P P
Sbjct: 124 PPPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAPVLP 160
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 33.3 bits (77), Expect = 0.55
Identities = 13/43 (30%), Positives = 15/43 (34%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPE 75
A P P PP AP + A + P Q P PP
Sbjct: 362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPA 404
Score = 32.9 bits (76), Expect = 0.91
Identities = 11/48 (22%), Positives = 14/48 (29%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
PP A A P +PP P Q P +P+
Sbjct: 368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPL 415
Score = 32.5 bits (75), Expect = 1.1
Identities = 15/47 (31%), Positives = 16/47 (34%), Gaps = 1/47 (2%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A A A AP VAPP A PP P P P+
Sbjct: 376 APAASAQATAAPTAAVAPPQAPAVPP-PPASAPQQAPAVPLPETTSQ 421
Score = 30.6 bits (70), Expect = 4.2
Identities = 12/44 (27%), Positives = 13/44 (29%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
A P P P P A A AP A+ P P
Sbjct: 363 APLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASA 406
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 33.5 bits (76), Expect = 0.56
Identities = 16/63 (25%), Positives = 23/63 (36%), Gaps = 3/63 (4%)
Query: 23 TCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-PPQP--VQPPELPMF 79
T Q+ AP P + PP G + L P+ PPQP + ++P
Sbjct: 248 TASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPL 307
Query: 80 NCP 82
P
Sbjct: 308 PLP 310
Score = 32.0 bits (72), Expect = 1.9
Identities = 22/71 (30%), Positives = 26/71 (36%), Gaps = 12/71 (16%)
Query: 35 PPGAPGAPPGPVAPPGAVPAPP--GAMGLIPPNQPPQPV----------QPPELPMFNCP 82
PPGA AP P P A PP + P QP QP P LP + P
Sbjct: 184 PPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPP 243
Query: 83 RRPNLGREGRP 93
+P + P
Sbjct: 244 LQPQTASQQSP 254
Score = 29.7 bits (66), Expect = 7.9
Identities = 13/45 (28%), Positives = 16/45 (35%), Gaps = 1/45 (2%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGL-IPPNQPPQPVQPP 74
A PPP P + P + PPG+ P P PP
Sbjct: 464 AIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPP 508
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing
protein (DUF2360). This is the conserved 140 amino
acid region of a family of proteins conserved from
nematodes to humans. One C. elegans member is annotated
as a Daf-16-dependent longevity protein 1 but this
could not be confirmed. The function is unknown.
Length = 147
Score = 32.0 bits (73), Expect = 0.59
Identities = 11/44 (25%), Positives = 12/44 (27%), Gaps = 1/44 (2%)
Query: 35 PPGAPGAPPGPVAPP-GAVPAPPGAMGLIPPNQPPQPVQPPELP 77
PP A G P PP A PP+ P
Sbjct: 53 VTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAE 96
Score = 29.7 bits (67), Expect = 3.8
Identities = 14/53 (26%), Positives = 17/53 (32%), Gaps = 6/53 (11%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPR 83
G PPP P PP PA P P P+ + + PR
Sbjct: 68 NGGPPPPPPARAEAASPPPPEAPAEP------PAEPEPEAPAENTVTVAKDPR 114
>gnl|CDD|237802 PRK14723, flhF, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 767
Score = 33.2 bits (76), Expect = 0.60
Identities = 13/48 (27%), Positives = 14/48 (29%), Gaps = 3/48 (6%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
APP P P A+ AP A I P V P
Sbjct: 45 DEDLGAVAASAQAYAPPAPAPLPAALVAPAPAAASIA---APAAVPAP 89
>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model describes a subset of pyruvate dehydrogenase
complex dihydrolipoamide acetyltransferase specifically
close by both phylogenetic and per cent identity (UPGMA)
trees. Members of this set include two or three copies
of the lipoyl-binding domain. E. coli AceF is a member
of this model, while mitochondrial and some other
bacterial forms belong to a separate model [Energy
metabolism, Pyruvate dehydrogenase].
Length = 546
Score = 33.3 bits (76), Expect = 0.68
Identities = 18/75 (24%), Positives = 21/75 (28%), Gaps = 5/75 (6%)
Query: 21 IFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP-ELPMF 79
+ G T P A A P A A P A Q P P Q + P
Sbjct: 188 TLSVAGSTPATAPAPASAQPAAQSPAATQPEPAAAPAA----AKAQAPAPQQAGTQNPAK 243
Query: 80 NCPRRPNLGREGRPI 94
P + R R
Sbjct: 244 VDHAAPAVRRLAREF 258
>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
Length = 466
Score = 33.0 bits (75), Expect = 0.77
Identities = 21/79 (26%), Positives = 24/79 (30%), Gaps = 5/79 (6%)
Query: 24 CGGQTKCAGA---PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP--M 78
C G GA PG P P P G P PP+ P
Sbjct: 20 CAGAGAPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGPPKSPPAP 79
Query: 79 FNCPRRPNLGREGRPIVLR 97
F C R L R G + +R
Sbjct: 80 FRCKRPDVLARYGSRVQIR 98
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 32.9 bits (75), Expect = 0.83
Identities = 17/31 (54%), Positives = 17/31 (54%), Gaps = 3/31 (9%)
Query: 31 AGAPPPGA-PGAPPGPVAPP--GAVPAPPGA 58
A APPPGA P APP V PP PA P
Sbjct: 911 ASAPPPGAGPPAPPQAVPPPRTTQPPAAPPR 941
Score = 32.9 bits (75), Expect = 0.93
Identities = 17/38 (44%), Positives = 19/38 (50%), Gaps = 4/38 (10%)
Query: 38 APGAPPGPVAPPGAV-PAPPGAMGLIPPNQPPQPVQPP 74
APG PPGA PAPP A +PP + QP P
Sbjct: 905 APGGDAASAPPPGAGPPAPPQA---VPPPRTTQPPAAP 939
Score = 30.2 bits (68), Expect = 5.4
Identities = 17/45 (37%), Positives = 17/45 (37%), Gaps = 4/45 (8%)
Query: 39 PGAPPGPVA-PPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCP 82
PG G VA PG A G PP PPQ V PP
Sbjct: 897 PGT--GRVATAPGGDAASAPPPGAGPP-APPQAVPPPRTTQPPAA 938
>gnl|CDD|183659 PRK12659, PRK12659, putative monovalent cation/H+ antiporter
subunit C; Reviewed.
Length = 117
Score = 30.8 bits (70), Expect = 0.86
Identities = 17/31 (54%), Positives = 18/31 (58%), Gaps = 4/31 (12%)
Query: 21 IFTCGGQTKCAGAPP--PGAPGAPPGPVAPP 49
IFT G T+ GAPP P AP GPVA P
Sbjct: 43 IFTVGRLTR--GAPPLVPEGATAPAGPVADP 71
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 32.7 bits (74), Expect = 0.97
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 4/41 (9%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
P PG PG+ P G +P+PP M P+ PQ Q P
Sbjct: 435 PSPGGPGSQPPQSVSGGMIPSPPALM----PSPSPQMSQSP 471
>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
Length = 547
Score = 32.5 bits (75), Expect = 1.00
Identities = 15/44 (34%), Positives = 15/44 (34%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
A AP A A P A A PAP A P P P
Sbjct: 196 AAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAP 239
Score = 31.7 bits (73), Expect = 1.7
Identities = 10/44 (22%), Positives = 10/44 (22%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
A P A AP A AP A P
Sbjct: 194 VAAAAPAAAAAPAAAAPAAAAAAAPAPAPAAAAAPAAAAPAAAA 237
>gnl|CDD|217814 pfam03957, Jun, Jun-like transcription factor.
Length = 236
Score = 31.9 bits (72), Expect = 1.0
Identities = 19/80 (23%), Positives = 20/80 (25%), Gaps = 23/80 (28%)
Query: 23 TCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPP-----------------GAMGL---- 61
P A A PG + G P PP GAM
Sbjct: 139 NSAAAAASGAGGPTAAAMAGPGGLFAGGLAPEPPVYANLSTFNPGATATAYGAMTEGFAA 198
Query: 62 --IPPNQPPQPVQPPELPMF 79
P PP V PP L
Sbjct: 199 PPHPHANPPGAVGPPRLQAL 218
>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
Provisional.
Length = 464
Score = 32.6 bits (75), Expect = 1.1
Identities = 15/48 (31%), Positives = 17/48 (35%), Gaps = 1/48 (2%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP-PELP 77
AGA P A A A A A A P P V P++P
Sbjct: 87 AGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPDIP 134
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 32.6 bits (74), Expect = 1.1
Identities = 21/51 (41%), Positives = 22/51 (43%), Gaps = 7/51 (13%)
Query: 33 APPPGAPGAP--PGPVAP--PGAVPA--PPGAMGLIPPNQPPQPVQPPELP 77
AP P P PGP P PGA PA P A P QPP + LP
Sbjct: 394 APEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVA-RSAPLP 443
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 32.8 bits (75), Expect = 1.1
Identities = 18/71 (25%), Positives = 23/71 (32%), Gaps = 6/71 (8%)
Query: 29 KCAGAPPPGAPGAPPGPVAPPGAVPA----PPGAMGL--IPPNQPPQPVQPPELPMFNCP 82
+ G+P P +PP A P AV + A P P PP P + P
Sbjct: 139 RPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP 198
Query: 83 RRPNLGREGRP 93
R R
Sbjct: 199 PAAASPRPPRR 209
Score = 31.7 bits (72), Expect = 2.4
Identities = 16/49 (32%), Positives = 19/49 (38%), Gaps = 1/49 (2%)
Query: 31 AGAPPPGAPGAPPG-PVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
G+P P P +P P PP A P P A L +P PP
Sbjct: 103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAAS 151
Score = 30.5 bits (69), Expect = 5.5
Identities = 11/37 (29%), Positives = 11/37 (29%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQP 70
PPP P A P P P PP P
Sbjct: 117 PPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPP 153
Score = 30.1 bits (68), Expect = 7.4
Identities = 13/45 (28%), Positives = 14/45 (31%), Gaps = 1/45 (2%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIP-PNQPPQP 70
GA P A P P P P + L P P P P
Sbjct: 879 PRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDP 923
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human
Gammaherpesvirus that infects and establishes latency
in B lymphocytes in vivo. The latent membrane protein 2
(LMP2) gene is expressed in latently infected B cells
and encodes two protein isoforms, LMP2A and LMP2B, that
are identical except for an additional N-terminal 119
aa cytoplasmic domain which is present in the LMP2A
isoform. LMP2A is thought to play a key role in either
the establishment or the maintenance of latency and/or
the reactivation of productive infection from the
latent state. The significance of LMP2B and its role in
pathogenesis remain unclear.
Length = 489
Score = 32.5 bits (74), Expect = 1.2
Identities = 14/52 (26%), Positives = 20/52 (38%), Gaps = 4/52 (7%)
Query: 32 GAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP-PELPMFNCP 82
GA P + G P G + P P + G ++P PV + P P
Sbjct: 10 GAGGPRSHGGPDG--DEGDSNPYYPSSFGS-SWDRPGPPVPEDYDAPSHRPP 58
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 32.0 bits (73), Expect = 1.4
Identities = 11/30 (36%), Positives = 11/30 (36%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMG 60
A AP PP P A P PA G
Sbjct: 294 QPAAAAAAPAPPPNPPATPPEPPARRGRGS 323
>gnl|CDD|236131 PRK07920, PRK07920, lipid A biosynthesis lauroyl acyltransferase;
Provisional.
Length = 298
Score = 31.8 bits (73), Expect = 1.4
Identities = 14/37 (37%), Positives = 16/37 (43%), Gaps = 10/37 (27%)
Query: 501 ALPNSGVWDMRGKQFYTGVEIRVWAIACFAPQRTVRE 537
ALP+SG WDM G W + P TV E
Sbjct: 113 ALPHSGNWDMAG----------AWLVQHHGPFTTVAE 139
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 32.1 bits (73), Expect = 1.5
Identities = 12/45 (26%), Positives = 15/45 (33%), Gaps = 5/45 (11%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
P P PG P P P A+ N +PP P+
Sbjct: 423 PTPMGPGGPLRPNGLA-----PMNAVRAPSRNAQNAAQKPPMQPV 462
Score = 31.3 bits (71), Expect = 2.6
Identities = 15/52 (28%), Positives = 16/52 (30%), Gaps = 6/52 (11%)
Query: 25 GGQTKCAGAPPPGAPGAPPG------PVAPPGAVPAPPGAMGLIPPNQPPQP 70
GG + G P A AP P V PP L PQP
Sbjct: 429 GGPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQP 480
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 31.4 bits (70), Expect = 1.7
Identities = 23/66 (34%), Positives = 28/66 (42%), Gaps = 17/66 (25%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAP--PGAMGLIPPNQP---PQP------------VQPPEL 76
PP AP PP V P A PAP P + PP P P+P + PP+
Sbjct: 117 PPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAPAAKPIFLHNQLPPPDY 176
Query: 77 PMFNCP 82
P +CP
Sbjct: 177 PAASCP 182
Score = 30.2 bits (67), Expect = 4.5
Identities = 20/52 (38%), Positives = 20/52 (38%), Gaps = 2/52 (3%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAP-PGAMGLIPPNQPPQPVQP-PELPMFNCP 82
AP P P P AP PAP P P PP V P P P CP
Sbjct: 90 APAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACP 141
>gnl|CDD|223037 PHA03301, PHA03301, envelope glycoprotein L; Provisional.
Length = 226
Score = 31.4 bits (71), Expect = 1.7
Identities = 12/40 (30%), Positives = 15/40 (37%), Gaps = 4/40 (10%)
Query: 35 PPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
P AP P A P P A P + P+P + P
Sbjct: 179 PLSAPDDEASPQPKSLATPPPVAA----PSRRTPRPRRKP 214
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 31.7 bits (71), Expect = 1.8
Identities = 14/47 (29%), Positives = 19/47 (40%)
Query: 43 PGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGR 89
PG + VPAP A PP P P P+ + + LG+
Sbjct: 100 PGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAVQPAILGQ 146
Score = 29.4 bits (65), Expect = 8.7
Identities = 15/43 (34%), Positives = 15/43 (34%), Gaps = 6/43 (13%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP 73
A AP P AP P P PA P P VQP
Sbjct: 105 AAAPVP-APAPAPAAAPPAAPAPAAD-----TPAAPIPDAVQP 141
>gnl|CDD|216868 pfam02084, Bindin, Bindin.
Length = 239
Score = 31.0 bits (70), Expect = 2.0
Identities = 17/69 (24%), Positives = 18/69 (26%), Gaps = 10/69 (14%)
Query: 25 GGQTKCAGAPPPG--APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCP 82
GG G P A GPV G A GA G PV P
Sbjct: 17 GGNYPAPGQPAQQGYANQGMGGPVGGGGGPGAGGGAPG--------GPVGGGGGGSGGPP 68
Query: 83 RRPNLGREG 91
+ E
Sbjct: 69 GGGEVAGEA 77
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 30.2 bits (68), Expect = 2.1
Identities = 23/56 (41%), Positives = 25/56 (44%), Gaps = 7/56 (12%)
Query: 34 PPPG-APGAPPGPVAP---PGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
PPPG P PP P P P PPGA+ PP P P P + P PRR
Sbjct: 79 PPPGSTP-VPPPGPQPGYNPADYPPPPGAV--PPPQNYPYPPGPGQDPYAPRPRRA 131
>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
[Intracellular trafficking and secretion].
Length = 360
Score = 31.3 bits (71), Expect = 2.1
Identities = 16/60 (26%), Positives = 21/60 (35%), Gaps = 2/60 (3%)
Query: 21 IFTCGGQTKCAGAPPPG--APGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
I G + + PP G PP P P PP + L P P Q V+ +
Sbjct: 44 IALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDLGAPVLPDQQVEEAKDQP 103
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 31.6 bits (71), Expect = 2.4
Identities = 20/76 (26%), Positives = 23/76 (30%), Gaps = 14/76 (18%)
Query: 25 GGQTKCAGAPPPGAPGA----PPGPVAPPGAVPA-----PPGAMGLIPPNQPPQPV---- 71
+ A P + GA PG P G PA A P+ P P
Sbjct: 378 ASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWD 437
Query: 72 -QPPELPMFNCPRRPN 86
PP P P RP
Sbjct: 438 DAPPAPPRSGIPPRPA 453
Score = 30.4 bits (68), Expect = 4.4
Identities = 17/49 (34%), Positives = 19/49 (38%)
Query: 26 GQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
G T + AP AP A P P P P P G+ P P P P
Sbjct: 413 GMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASP 461
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 31.1 bits (70), Expect = 2.8
Identities = 14/70 (20%), Positives = 14/70 (20%), Gaps = 2/70 (2%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAV--PAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLG 88
A P A P P P PQ PE P P G
Sbjct: 206 ATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGG 265
Query: 89 REGRPIVLRA 98
A
Sbjct: 266 EAPPANATPA 275
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 31.0 bits (70), Expect = 2.9
Identities = 10/26 (38%), Positives = 10/26 (38%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPP 56
G P A A G APP PP
Sbjct: 510 TGKPAAAASPAGGGANAPPAKPVKPP 535
Score = 29.8 bits (67), Expect = 7.6
Identities = 12/30 (40%), Positives = 12/30 (40%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMG 60
A AP AP P P V AP A G
Sbjct: 399 APAPSAAAPEPKHQPAPEPRPVLAPTPASG 428
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.1 bits (71), Expect = 2.9
Identities = 16/47 (34%), Positives = 17/47 (36%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A APPP P PP PA A PP P +P P
Sbjct: 545 ATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKD 591
Score = 30.7 bits (70), Expect = 4.3
Identities = 11/52 (21%), Positives = 15/52 (28%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
T P AP P P P + + + PP P+ P
Sbjct: 536 PTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPS 587
>gnl|CDD|220596 pfam10138, Tellurium_res, Tellurium resistance protein. Members
of this family confer resistance to the metalloid
element tellurium and its salts.
Length = 98
Score = 28.9 bits (65), Expect = 3.0
Identities = 9/22 (40%), Positives = 9/22 (40%)
Query: 36 PGAPGAPPGPVAPPGAVPAPPG 57
P AP PP P P A P
Sbjct: 1 PAAPVPPPAPAPPAPAPPPAAP 22
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 30.7 bits (68), Expect = 3.1
Identities = 15/48 (31%), Positives = 17/48 (35%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
A A P A AP APP APP P P + P+
Sbjct: 302 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAAPV 349
Score = 30.7 bits (68), Expect = 3.1
Identities = 15/47 (31%), Positives = 16/47 (34%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A APP A P APP APP P P + P
Sbjct: 267 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAP 313
Score = 30.3 bits (67), Expect = 5.2
Identities = 15/47 (31%), Positives = 16/47 (34%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A APP A AP A P APP P P + P
Sbjct: 232 AAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPP 278
Score = 30.3 bits (67), Expect = 5.2
Identities = 14/47 (29%), Positives = 15/47 (31%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A A P A P APP APP P P + P
Sbjct: 246 AAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPP 292
Score = 29.9 bits (66), Expect = 5.6
Identities = 15/47 (31%), Positives = 16/47 (34%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A A P A AP APP APP P P + P
Sbjct: 239 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPP 285
Score = 29.9 bits (66), Expect = 5.6
Identities = 15/47 (31%), Positives = 16/47 (34%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A APP A P APP APP P P + P
Sbjct: 253 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAP 299
Score = 29.5 bits (65), Expect = 7.9
Identities = 15/47 (31%), Positives = 16/47 (34%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A APP A P APP APP P P + P
Sbjct: 260 AAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAP 306
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 30.1 bits (68), Expect = 3.3
Identities = 13/48 (27%), Positives = 18/48 (37%), Gaps = 1/48 (2%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
PP P P P P + + P QP P+ P+LP+
Sbjct: 104 PLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPL-LPDLPL 150
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 30.5 bits (69), Expect = 3.3
Identities = 14/47 (29%), Positives = 15/47 (31%), Gaps = 1/47 (2%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPP-NQPPQPVQPPELPM 78
A P A P P VPA P P PV+ P M
Sbjct: 123 ADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSM 169
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 30.5 bits (69), Expect = 3.8
Identities = 20/81 (24%), Positives = 24/81 (29%), Gaps = 7/81 (8%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP------NL 87
P P PP P PP P P + P EL MF+
Sbjct: 268 EPEPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEEL-MFDAVEADLPDNILAT 326
Query: 88 GREGRPIVLRANHFQISMPRG 108
+ + RA Q S RG
Sbjct: 327 LQTVQRRRGRAGGEQKSNHRG 347
>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding. This region
tends to appear at the N-terminus of proteins also
containing DNA-binding HMG (high mobility group) boxes
(pfam00505) and appears to bind the armadillo repeat of
CTNNB1 (beta-catenin), forming a stable complex.
Signaling by Wnt through TCF/LCF is involved in
developmental patterning, induction of neural tissues,
cell fate decisions and stem cell differentiation.
Isoforms of HMG T-cell factors lacking the N-terminal
CTNNB1-binding domain cannot fulfill their role as
transcriptional activators in T-cell differentiation.
Length = 200
Score = 29.9 bits (67), Expect = 3.8
Identities = 14/45 (31%), Positives = 15/45 (33%), Gaps = 9/45 (20%)
Query: 32 GAPPPGAPGA---------PPGPVAPPGAVPAPPGAMGLIPPNQP 67
G PPP P PP P P PG +G IP
Sbjct: 155 GTPPPHLPYDVDPKTGIPRPPHPPDISPFYPLSPGGVGQIPHPLG 199
>gnl|CDD|227315 COG4982, COG4982, 3-oxoacyl-[acyl-carrier protein].
Length = 866
Score = 30.6 bits (69), Expect = 4.2
Identities = 12/38 (31%), Positives = 15/38 (39%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQP 70
P P A P AP A P A G P ++P +
Sbjct: 18 PPAPAASAPAPAAAAPAPVAAAAPAAAGPRPDDEPFKA 55
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 30.7 bits (69), Expect = 4.4
Identities = 12/43 (27%), Positives = 14/43 (32%), Gaps = 2/43 (4%)
Query: 32 GAPPPGAPGAPPGPVAPPGAVPAP--PGAMGLIPPNQPPQPVQ 72
P G P + P P P G GL+ P P P
Sbjct: 384 PQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGT 426
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 30.6 bits (69), Expect = 4.5
Identities = 16/55 (29%), Positives = 22/55 (40%), Gaps = 6/55 (10%)
Query: 29 KCAGAPPPGAP------GAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
A P P G PVAP + P A G P ++PP+ + PE+
Sbjct: 123 YGKQAGPITNPNVKRRTGVGLPPVAPAASPALKPTANGKRPSSKPPKSIMSPEVK 177
>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
Length = 423
Score = 30.4 bits (69), Expect = 4.7
Identities = 12/54 (22%), Positives = 15/54 (27%), Gaps = 1/54 (1%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAP-PGAVPAPPGAMGLIPPNQPPQPVQPPELPMF 79
+ AG P A A PA +P QP + EL
Sbjct: 191 EVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATEEELIAD 244
>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
Length = 290
Score = 30.0 bits (67), Expect = 4.7
Identities = 10/40 (25%), Positives = 10/40 (25%), Gaps = 4/40 (10%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQP 70
A PG GP G P PP P
Sbjct: 97 PEASGPGPARGARGPAGSRGRGRRAESP----SPRDPPNP 132
>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
This family consists of several brain acid soluble
protein 1 (BASP1) or neuronal axonal membrane protein
NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
calmodulin-binding protein of unknown function.
Length = 233
Score = 29.8 bits (66), Expect = 5.0
Identities = 18/62 (29%), Positives = 22/62 (35%), Gaps = 4/62 (6%)
Query: 23 TCGGQTKCAGAPP----PGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
+TK AP P + A P P A AP P P + V+P E P
Sbjct: 163 AAAQETKSDAAPASDSKPSSSEAAPSSKETPAATEAPSSTAKASAPAAPAEEVKPSEAPA 222
Query: 79 FN 80
N
Sbjct: 223 AN 224
>gnl|CDD|234797 PRK00575, tatA, twin arginine translocase protein A; Provisional.
Length = 92
Score = 28.2 bits (63), Expect = 5.3
Identities = 10/43 (23%), Positives = 11/43 (25%), Gaps = 7/43 (16%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQP 73
A A A P PV PA Q +P
Sbjct: 56 AAAAQAPYQVATPTPVQSQRVDPAA-------ASGQDSTEARP 91
>gnl|CDD|215643 PLN03241, PLN03241, magnesium chelatase subunit H; Provisional.
Length = 1315
Score = 30.2 bits (68), Expect = 5.4
Identities = 15/40 (37%), Positives = 18/40 (45%), Gaps = 3/40 (7%)
Query: 41 APPGPVAPPGAVPAPPG---AMGLIPPNQPPQPVQPPELP 77
APP V PP P P A+GL P++ Q E P
Sbjct: 257 APPSTVTPPSYAPPPLVETPALGLYHPDRERQQAPYFESP 296
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 29.9 bits (67), Expect = 5.5
Identities = 14/53 (26%), Positives = 15/53 (28%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRP 85
PPP P PP PP PPQ Q + P P
Sbjct: 122 YPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAP 174
Score = 29.5 bits (66), Expect = 7.9
Identities = 19/76 (25%), Positives = 24/76 (31%), Gaps = 5/76 (6%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP-----MFNCPRRPNLG 88
PPP A AP P P P + P +QPP F P +P+
Sbjct: 166 PPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPY 225
Query: 89 REGRPIVLRANHFQIS 104
G P + F
Sbjct: 226 MYGGPGGRPNSGFPSG 241
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as
a single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through
DAZ repeats.
Length = 136
Score = 29.0 bits (65), Expect = 5.8
Identities = 15/49 (30%), Positives = 16/49 (32%), Gaps = 6/49 (12%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQ-----PPQPVQPP 74
A P AP P + P A G PP P PV PP
Sbjct: 22 AQMPQASAP-YPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPVYPP 69
>gnl|CDD|193426 pfam12953, DUF3842, Domain of unknown function (DUF3842). This
short protein is found mainly in firmicute bacteria. It
is functionally uncharacterized.
Length = 131
Score = 28.6 bits (65), Expect = 6.0
Identities = 18/51 (35%), Positives = 25/51 (49%), Gaps = 5/51 (9%)
Query: 414 MQTSTMIKATARSAPDREREIQSLVKRADFNNDAYVHEFGLTISNSMM-EV 463
+ TS M+KA A E I K+AD V G+ I+NSM+ E+
Sbjct: 38 IATSAMLKAGANRGATGENAIVVNAKKADI----IVGPIGIVIANSMLGEI 84
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins
is found in bacteria. Proteins in this family are
typically between 115 and 181 amino acids in length.
There are two completely conserved residues (F and G)
that may be functionally important.
Length = 110
Score = 28.5 bits (64), Expect = 6.1
Identities = 15/47 (31%), Positives = 17/47 (36%), Gaps = 1/47 (2%)
Query: 35 PPGAPGAPPGPVAPPGAVPAPPGAMGLIP-PNQPPQPVQPPELPMFN 80
P PP P +PP P P Q PQ QPP+ F
Sbjct: 23 QPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPPQFSSFL 69
>gnl|CDD|234228 TIGR03483, FtsZ_alphas_C, cell division protein FtsZ,
alphaProteobacterial C-terminal extension. This model
describes a domain found as a C-terminal extension to
the cell division protein FtsZ in many but not all
members of the alphaProteobacteria [Cellular processes,
Cell division].
Length = 121
Score = 28.7 bits (64), Expect = 6.1
Identities = 14/53 (26%), Positives = 20/53 (37%), Gaps = 5/53 (9%)
Query: 41 APPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPMFNCPRRPNLGREGRP 93
AP + P P P A + P+ ++ PRR L +GRP
Sbjct: 50 APAAHLEPAPRAPQAPPAHPYNENRRAF----APQASIY-APRRGGLDDQGRP 97
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 29.7 bits (67), Expect = 6.3
Identities = 16/44 (36%), Positives = 16/44 (36%), Gaps = 6/44 (13%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
A PP A PP PV P P P IP PV P
Sbjct: 56 ADLEPPQAVQPPPEPVVEPEPEPEP------IPEPPKEAPVVIP 93
>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
Length = 653
Score = 30.1 bits (68), Expect = 6.4
Identities = 14/28 (50%), Positives = 14/28 (50%), Gaps = 1/28 (3%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGA 58
G P G PG PG P GA PA GA
Sbjct: 617 GGGMPGGMPGGMPG-GMPGGAGPAGAGA 643
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 29.6 bits (66), Expect = 6.4
Identities = 20/51 (39%), Positives = 20/51 (39%), Gaps = 5/51 (9%)
Query: 31 AGAPPPGAPGAPPGPVAPPGAVPAPPGAMG---LIPPNQPPQPVQPPELPM 78
A PP AP P PV P A P L P Q PQPV P M
Sbjct: 17 ANVPPAAAPTPQPNPVIQPQAPVQPGQPGAPQQLAIPTQQPQPV--PTSAM 65
Score = 29.3 bits (65), Expect = 8.0
Identities = 15/36 (41%), Positives = 17/36 (47%), Gaps = 3/36 (8%)
Query: 36 PGAPGAPPGPVAPPGAVPAPPGAMGLIPP--NQPPQ 69
AP A P P APP A A P A+ + PP P
Sbjct: 71 QQAP-AQPAPAAPPAAGAALPEALEVPPPPAFTPNG 105
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 29.9 bits (67), Expect = 6.8
Identities = 24/91 (26%), Positives = 31/91 (34%), Gaps = 27/91 (29%)
Query: 25 GGQTKCAGAPPPGAP--------------GAPPGPVAPPGAV--PAPPGAMGLIPPNQPP 68
GG++K P P + PP P A AV APP A N+P
Sbjct: 50 GGRSKKENEPAPPSTMIATNTKPFHPAPIDVPPDPPAAQEAVQPTAPPSAQSEPERNEP- 108
Query: 69 QPVQPPELPMF-------NCPRRPNLGREGR 92
+P E P+F +R G R
Sbjct: 109 ---RPEETPIFAYSSGDQGGSKRAGHGDTDR 136
>gnl|CDD|179538 PRK03100, PRK03100, sec-independent translocase; Provisional.
Length = 136
Score = 28.4 bits (64), Expect = 7.3
Identities = 17/50 (34%), Positives = 21/50 (42%), Gaps = 9/50 (18%)
Query: 19 NSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPP 68
+S+FT G G P +P A PAPP A L P +PP
Sbjct: 90 DSLFT--GNFDQTGNGRPTPS-------SPDAAGPAPPDATPLAPGERPP 130
>gnl|CDD|221868 pfam12938, M_domain, M domain of GW182.
Length = 238
Score = 29.5 bits (66), Expect = 7.4
Identities = 15/42 (35%), Positives = 15/42 (35%), Gaps = 4/42 (9%)
Query: 34 PPPGAPGAPPGPVAPP--GAVPAPPGAMGLIPPNQPPQPVQP 73
PPP G G P G P A I PN Q QP
Sbjct: 75 PPPLGNGGGSGAGGPGPVGGGGGPGVAPNNIQPNA--QAQQP 114
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 29.0 bits (65), Expect = 7.5
Identities = 16/52 (30%), Positives = 20/52 (38%), Gaps = 1/52 (1%)
Query: 23 TCGGQTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPP 74
T Q PG P+ P +PAPP ++ PP QP PP
Sbjct: 125 TPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQY-PPPGLLQPQGPP 175
>gnl|CDD|216860 pfam02063, MARCKS, MARCKS family.
Length = 296
Score = 29.4 bits (65), Expect = 7.5
Identities = 11/44 (25%), Positives = 12/44 (27%), Gaps = 4/44 (9%)
Query: 34 PPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELP 77
P A + AP A P P PPQ P
Sbjct: 251 PAEEAGASSAAQEAPAAEQEAAPAE----EPAAPPQEACSESSP 290
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 30.1 bits (67), Expect = 7.6
Identities = 17/54 (31%), Positives = 21/54 (38%), Gaps = 3/54 (5%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPP-QPVQPPELPMFNCPRRP 85
P P PVAP P + P Q P QPV P P + P++P
Sbjct: 772 VAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ--PQYQQPQQP 823
>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 155
Score = 28.6 bits (65), Expect = 8.4
Identities = 16/51 (31%), Positives = 16/51 (31%), Gaps = 5/51 (9%)
Query: 33 APPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPEL-----PM 78
A P AP A AP A PA A P P E PM
Sbjct: 36 AAAPVAPVAQQAAAAPVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPM 86
>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
Length = 1560
Score = 29.7 bits (67), Expect = 8.6
Identities = 18/45 (40%), Positives = 22/45 (48%), Gaps = 4/45 (8%)
Query: 63 PPNQPPQPVQPPELPMFNCPRRPNLGREGRPIVLRANHFQISMPR 107
P + P PVQ L R REG P+VLR F+I+ PR
Sbjct: 1296 PTSAAPHPVQAGSLVFEGVQMRY---REGLPLVLRGVSFRIA-PR 1336
>gnl|CDD|221440 pfam12144, Med12-PQL, Eukaryotic Mediator 12 catenin-binding
domain. This domain is found in eukaryotes, and is
typically between 325 and 354 amino acids in length.
Both development and carcinogenesis are driven by
signal transduction within the canonical
Wnt/beta-catenin pathway through both programmed and
unprogrammed changes in gene transcription.
Beta-catenin physically and functionally targets this
PQL (proline-, glutamine-, leucine-rich) region of the
Med12 subunit of Mediator to activate transcription.
The beta-catenin transactivation domain binds directly
to isolated Med12 and intact Mediator both in vitro and
in vivo, and Mediator is recruited to Wnt-responsive
genes in a beta-catenin-dependent manner.
Length = 204
Score = 28.8 bits (64), Expect = 8.8
Identities = 17/48 (35%), Positives = 20/48 (41%), Gaps = 8/48 (16%)
Query: 47 APPGAVPAPPGAMGLIPPNQP--------PQPVQPPELPMFNCPRRPN 86
AP G +P AM + NQP P +P PM P RPN
Sbjct: 8 APYGRLPYGQQAMNMYTQNQPLPPGGPGLEPPYRPARNPMNKMPVRPN 55
>gnl|CDD|221040 pfam11235, Med25_SD1, Mediator complex subunit 25 synapsin 1. The
overall function of the full-length Med25 is efficiently
to coordinate the transcriptional activation of RAR/RXR
(retinoic acid receptor/retinoic X receptor) in higher
eukaryotic cells. Human Med25 consists of several
domains with different binding properties, the
N-terminal, VWA, domain, this SD1 - synapsin 1 - domain
from residues 229-381, a PTOV(B) or ACID domain from
395-545, an SD2 domain from residues 564-645 and a
C-terminal NR box-containing domain (646-650) from
646-747. This The function of the SD domains is unclear.
Length = 168
Score = 28.6 bits (63), Expect = 9.0
Identities = 19/50 (38%), Positives = 21/50 (42%), Gaps = 9/50 (18%)
Query: 35 PPGAPGAPPGPVAP---------PGAVPAPPGAMGLIPPNQPPQPVQPPE 75
PPG PGAP A PG+ AP +PP QP QP P
Sbjct: 104 PPGPPGAPKPSPASQLSLVTTVSPGSGLAPVLTQQQVPPQQPQQPSMVPT 153
Score = 28.6 bits (63), Expect = 9.7
Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 4/48 (8%)
Query: 36 PGAPGAPPGPVAPPGAVPAPP----GAMGLIPPNQPPQPVQPPELPMF 79
P G+ PGP+ V PP L P P PV PP++ +
Sbjct: 2 PVGGGSVPGPLQSKQPVSLPPAAVLPPQSLPAPQNPLPPVTPPQMQVP 49
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 29.2 bits (65), Expect = 9.4
Identities = 20/55 (36%), Positives = 21/55 (38%), Gaps = 8/55 (14%)
Query: 32 GAPPPGAPGAPPGPVAPP--------GAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
G P P P PPGP PP PP G PP P+ PP PM
Sbjct: 163 GPPSPWNPRFPPGPPPPPPGFGRHGEKPSGWPPFLSGWPPPFPLGPPMIPPPPPM 217
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 29.6 bits (66), Expect = 9.7
Identities = 12/52 (23%), Positives = 16/52 (30%), Gaps = 1/52 (1%)
Query: 27 QTKCAGAPPPGAPGAPPGPVAPPGAVPAPPGAMGLIPPNQPPQPVQPPELPM 78
+ + P P P PP + P Q VQPP+L
Sbjct: 565 RVRERWRPAPWTP-NPPRSPSQMSVRDRLARLRAEAQPYQASVEVQPPQLTQ 615
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 29.5 bits (66), Expect = 9.8
Identities = 13/50 (26%), Positives = 15/50 (30%), Gaps = 12/50 (24%)
Query: 4 PFPTTPDISNDLNFGNSIFTCGGQTKCAGAPPPGAPGAPPGPVAPPGAVP 53
P TP +D AGAP AP P P P +
Sbjct: 168 PPGGTPPPDDDEGDE------------AGAPATPAPPLHPPPAPHPHPIA 205
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.136 0.418
Gapped
Lambda K H
0.267 0.0717 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 44,348,042
Number of extensions: 4362570
Number of successful extensions: 6910
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5806
Number of HSP's successfully gapped: 422
Length of query: 865
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 760
Effective length of database: 6,280,432
Effective search space: 4773128320
Effective search space used: 4773128320
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.1 bits)