RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9257
(530 letters)
>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
signaling molecules, often in tandem arrangements. May
be responsible for specific protein-protein
interactions, as most PDZ domains bind C-terminal
polypeptides, and binding to internal (non-C-terminal)
polypeptides and even to lipids has been demonstrated.
In this subfamily of PDZ domains an N-terminal
beta-strand forms the peptide-binding groove base, a
circular permutation with respect to PDZ domains found
in proteases.
Length = 82
Score = 81.5 bits (202), Expect = 4e-19
Identities = 28/83 (33%), Positives = 46/83 (55%), Gaps = 4/83 (4%)
Query: 347 IRELVLCKDASGLIGLRVCAINE---GVFVCLVERGSPASLVGLRFGDQILSINGETVAG 403
+R + L KD G +G + + G+FV VE G PA GLR GD+IL +NG +V G
Sbjct: 1 VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGGLRVGDRILEVNGVSVEG 60
Query: 404 YNMKQVHKILKAAPVNNIRVVIR 426
++ ++LK + + + + +R
Sbjct: 61 LTHEEAVELLKNSG-DEVTLTVR 82
Score = 70.7 bits (174), Expect = 3e-15
Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 7/82 (8%)
Query: 431 ERTVTLHRDSSGHFGFHFNKGQ-------IVSLVKDSSAARNGLLVNHNILEVNGANVVG 483
RTVTL +D G GF G+ + + A R GL V ILEVNG +V G
Sbjct: 1 VRTVTLRKDPGGGLGFSLRGGKDSGGGIFVSRVEPGGPAERGGLRVGDRILEVNGVSVEG 60
Query: 484 LKDKELLEAMEKGGSVLTLTLI 505
L +E +E ++ G +TLT+
Sbjct: 61 LTHEEAVELLKNSGDEVTLTVR 82
>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2. Also
called DHR (Dlg homologous region) or GLGF (relatively
well conserved tetrapeptide in these domains). Some PDZs
have been shown to bind C-terminal polypeptides; others
appear to bind internal (non-C-terminal) polypeptides.
Different PDZs possess different binding specificities.
Length = 85
Score = 70.5 bits (173), Expect = 3e-15
Identities = 28/86 (32%), Positives = 39/86 (45%), Gaps = 5/86 (5%)
Query: 347 IRELVLCKDASGLIGLRVCAINE---GVFVCLVERGSPASLVGLRFGDQILSINGETVAG 403
R + L K GL G + + GV V V GSPA+ GLR GD IL +NG +V G
Sbjct: 2 PRLVELEKGGGGL-GFSLVGGKDEGGGVVVSSVVPGSPAAKAGLRVGDVILEVNGTSVEG 60
Query: 404 YNMKQVHKILKAAPVNNIRVVIRDRP 429
+ +LK A + + +
Sbjct: 61 LTHLEAVDLLKKAG-GKVTLTVLRGG 85
Score = 61.6 bits (150), Expect = 4e-12
Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 8/83 (9%)
Query: 431 ERTVTLHRDSSGHFGFHFNKGQ-------IVSLVKDSSAARNGLLVNHNILEVNGANVVG 483
R V L + G GF G+ + S+V S AA+ GL V ILEVNG +V G
Sbjct: 2 PRLVELEKGGGG-LGFSLVGGKDEGGGVVVSSVVPGSPAAKAGLRVGDVILEVNGTSVEG 60
Query: 484 LKDKELLEAMEKGGSVLTLTLIP 506
L E ++ ++K G +TLT++
Sbjct: 61 LTHLEAVDLLKKAGGKVTLTVLR 83
>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region)
or GLGF (after a conserved sequence motif). Many PDZ
domains bind C-terminal polypeptides, though binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. Heterodimerization through
PDZ-PDZ domain interactions adds to the domain's
versatility, and PDZ domain-mediated interactions may be
modulated dynamically through target phosphorylation.
Some PDZ domains play a role in scaffolding
supramolecular complexes. PDZ domains are found in
diverse signaling proteins in bacteria, archebacteria,
and eurkayotes. This CD contains two distinct structural
subgroups with either a N- or C-terminal beta-strand
forming the peptide-binding groove base. The circular
permutation placing the strand on the N-terminus appears
to be found in Eumetazoa only, while the C-terminal
variant is found in all three kingdoms of life, and
seems to co-occur with protease domains. PDZ domains
have been named after PSD95(post synaptic density
protein), DlgA (Drosophila disc large tumor suppressor),
and ZO1, a mammalian tight junction protein.
Length = 70
Score = 69.6 bits (171), Expect = 5e-15
Identities = 22/60 (36%), Positives = 32/60 (53%)
Query: 367 INEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIR 426
GV V VE GSPA GL+ GD IL++NG V ++ V ++LK + + +R
Sbjct: 11 TEGGVVVLSVEPGSPAERAGLQAGDVILAVNGTDVKNLTLEDVAELLKKEVGEKVTLTVR 70
Score = 45.0 bits (107), Expect = 2e-06
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 453 IVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAM-EKGGSVLTLTLI 505
++S+ S A R GL IL VNG +V L +++ E + ++ G +TLT+
Sbjct: 17 VLSVEPGSPAERAGLQAGDVILAVNGTDVKNLTLEDVAELLKKEVGEKVTLTVR 70
>gnl|CDD|238488 cd00988, PDZ_CTP_protease, PDZ domain of C-terminal processing-,
tail-specific-, and tricorn proteases, which function in
posttranslational protein processing, maturation, and
disassembly or degradation, in Bacteria, Archaea, and
plant chloroplasts. May be responsible for substrate
recognition and/or binding, as most PDZ domains bind
C-terminal polypeptides, and binding to internal
(non-C-terminal) polypeptides and even to lipids has
been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 85
Score = 58.0 bits (141), Expect = 7e-11
Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 358 GLIGLRVCAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAP 417
G IGL + + G+ + V GSPA+ G++ GD I++I+GE V G +++ V K+L+
Sbjct: 2 GGIGLELKYDDGGLVITSVLPGSPAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLRGKA 61
Query: 418 VNNIRV-VIRDRPLERTVTLHRDS 440
+R+ + R R VTL R
Sbjct: 62 GTKVRLTLKRGDGEPREVTLTRLK 85
Score = 38.4 bits (90), Expect = 6e-04
Identities = 16/64 (25%), Positives = 32/64 (50%), Gaps = 1/64 (1%)
Query: 444 FGFHFNKGQIVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAM-EKGGSVLTL 502
+ I S++ S AA+ G+ I+ ++G V GL +++++ + K G+ + L
Sbjct: 8 LKYDDGGLVITSVLPGSPAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAGTKVRL 67
Query: 503 TLIP 506
TL
Sbjct: 68 TLKR 71
>gnl|CDD|238487 cd00987, PDZ_serine_protease, PDZ domain of tryspin-like serine
proteases, such as DegP/HtrA, which are oligomeric
proteins involved in heat-shock response, chaperone
function, and apoptosis. May be responsible for
substrate recognition and/or binding, as most PDZ
domains bind C-terminal polypeptides, though binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 90
Score = 52.6 bits (127), Expect = 8e-09
Identities = 22/70 (31%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 368 NEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNN-IRVVIR 426
+GV V V+ GSPA+ GL+ GD IL++NG+ V ++ + + L + + + +
Sbjct: 23 TKGVLVASVDPGSPAAKAGLKPGDVILAVNGKPVK--SVADLRRALAELKPGDKVTLTVL 80
Query: 427 DRPLERTVTL 436
E TVT+
Sbjct: 81 RGGKELTVTV 90
Score = 34.9 bits (81), Expect = 0.011
Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 453 IVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAME--KGGSVLTLTLI 505
+ S+ S AA+ GL IL VNG V + D L A+ K G +TLT++
Sbjct: 28 VASVDPGSPAAKAGLKPGDVILAVNGKPVKSVAD--LRRALAELKPGDKVTLTVL 80
>gnl|CDD|201332 pfam00595, PDZ, PDZ domain (Also known as DHR or GLGF). PDZ
domains are found in diverse signaling proteins.
Length = 80
Score = 51.5 bits (124), Expect = 1e-08
Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 6/82 (7%)
Query: 349 ELVLCKDASGLIGLRV---CAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGY- 404
E+ L K G +G + + G+FV V G A GL+ GD+ILSING+ +
Sbjct: 1 EVTLEKSGRGGLGFSLVGGSDGDPGIFVSEVLPGGAAEAGGLQEGDRILSINGQDLENLS 60
Query: 405 NMKQVHKILKAAPVNNIRVVIR 426
+ + V + + + I
Sbjct: 61 HDEAVLALKG--SGGEVTLTIL 80
Score = 47.2 bits (113), Expect = 4e-07
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 7/80 (8%)
Query: 433 TVTLHRDSSGHFGF------HFNKGQIVS-LVKDSSAARNGLLVNHNILEVNGANVVGLK 485
VTL + G GF + G VS ++ +A GL IL +NG ++ L
Sbjct: 1 EVTLEKSGRGGLGFSLVGGSDGDPGIFVSEVLPGGAAEAGGLQEGDRILSINGQDLENLS 60
Query: 486 DKELLEAMEKGGSVLTLTLI 505
E + A++ G +TLT++
Sbjct: 61 HDEAVLALKGSGGEVTLTIL 80
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 56.9 bits (137), Expect = 2e-08
Identities = 57/253 (22%), Positives = 78/253 (30%), Gaps = 28/253 (11%)
Query: 36 PPPPVYQTSSHTSPNAYNNPPSPY------------HSANPPPYYQHPSVPSHPSVPTFP 83
PPPP+ + +P+ PP P + PP P P P
Sbjct: 2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD--P 2608
Query: 84 SAPIYPSAPPQESHS---PYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYP---S 137
P PS P ++H+ P AN P P P +P + S P
Sbjct: 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRAR 2668
Query: 138 VPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSV 197
+ ++ P P + A LTS P P P + P
Sbjct: 2669 RLGRAAQASSPPQRPRRRAARPTVGSLTSLADPP----PPPPTPEPAPHALVSATPLPPG 2724
Query: 198 QPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPS-VPPAIHDAT--RST 254
+ A P P P+ A P+ P A + PTTA P+ PPA A R
Sbjct: 2725 PAAARQASPALPAAPAPPAV-PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL 2783
Query: 255 PSPIIAPSVPPTN 267
P +A
Sbjct: 2784 TRPAVASLSESRE 2796
Score = 56.5 bits (136), Expect = 3e-08
Identities = 57/247 (23%), Positives = 79/247 (31%), Gaps = 17/247 (6%)
Query: 36 PPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVP---TFPSAPIYPSAP 92
P PP SP A P P + PP + P S P S+P
Sbjct: 2624 PDPP----PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP 2679
Query: 93 PQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMP 152
PQ + + P P P A P QA P+ P
Sbjct: 2680 PQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP 2739
Query: 153 NQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQ--PPAYSVQPSHNTAYPTAPL 210
P G P T A A P P A P+AP PP +P+ + +
Sbjct: 2740 APPAVPAG--PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
Query: 211 VPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDT 270
+PS P+ PPA + + P + A +PP T + P+ P PP
Sbjct: 2798 LPS--PWDPADPPAAVLAPAAALPPAASPAGPLPP----PTSAQPTAPPPPPGPPPPSLP 2851
Query: 271 IRSALVP 277
+ ++ P
Sbjct: 2852 LGGSVAP 2858
Score = 55.3 bits (133), Expect = 6e-08
Identities = 50/275 (18%), Positives = 81/275 (29%), Gaps = 35/275 (12%)
Query: 29 TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSV-----PTFP 83
+ PP SP+ N P P+ PP + P+ V
Sbjct: 2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL 2670
Query: 84 SAPIYPSAPPQ------------------ESHSPYYYSDIRANVPTGGSPVPSYPQGSYP 125
S+PPQ + P + + +P+P P +
Sbjct: 2671 GRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQ 2730
Query: 126 SNPYQ----AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINP 181
++P A + P+ P P P P P P + + +
Sbjct: 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
Query: 182 LASNKPSAPQPPAYSVQP--------SHNTAYPTAPLVPSTIATAPSAPPANIEDATRST 233
L+ ++ S P P + P + A A +P + P+APP S
Sbjct: 2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSL 2850
Query: 234 HPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNR 268
+ AP +RS + AP+ PP R
Sbjct: 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR 2885
Score = 53.0 bits (127), Expect = 3e-07
Identities = 55/238 (23%), Positives = 86/238 (36%), Gaps = 23/238 (9%)
Query: 29 TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIY 88
TS + PPPP + + H +A PP P + P P+ P+ P+VP P+ P
Sbjct: 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP--ALPAAPAPPAVPAGPATPGG 2753
Query: 89 PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTY 148
P+ P + T G P P+ P P+ + + P+V + ++ +
Sbjct: 2754 PARPARPP-------------TTAGPPAPAPPAA--PAAGPPRRLTRPAVASLSESRESL 2798
Query: 149 PSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
PS + P ++ + + P+ P S +P+AP PP PS A
Sbjct: 2799 PSPWDPADPPAAVLAPAAALPPAASPAG-PLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
Query: 209 PLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPT 266
P +P A P PA+ +T S P P PP
Sbjct: 2858 PGGDVRRRPPSRSPAAK---PAAPARPPVRRLAR--PAVSRSTESFALPPDQPERPPQ 2910
Score = 50.7 bits (121), Expect = 2e-06
Identities = 53/257 (20%), Positives = 75/257 (29%), Gaps = 10/257 (3%)
Query: 29 TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIY 88
S+ PP P + + A PP+ P P+ P + P P+ P
Sbjct: 2715 LVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
Query: 89 PSAPPQESHSPYYYSDIRANVPTGGSP-----VPSYPQGSYPSNPYQAQNSYPSVP---N 140
P+A P + + + + + SP P+ + P A + P P
Sbjct: 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
Query: 141 QPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPS 200
QP AP P P G + S P A +P + +V S
Sbjct: 2835 QPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRS 2894
Query: 201 HNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIA 260
++ P P P APP P P PP T P A
Sbjct: 2895 -TESFALPPDQPERPPQ-PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGA 2952
Query: 261 PSVPPTNRDTIRSALVP 277
ALVP
Sbjct: 2953 GEPSGAVPQPWLGALVP 2969
Score = 49.2 bits (117), Expect = 6e-06
Identities = 49/232 (21%), Positives = 71/232 (30%), Gaps = 15/232 (6%)
Query: 25 SKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPS 84
S + S SP P ++ +P A P + PPP P+ P P P PS
Sbjct: 2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849
Query: 85 APIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQA 144
P+ S P D+R P+ + + P+ P + + P+V ++
Sbjct: 2850 LPLGGSVAPG--------GDVRRRPPSRSPA----AKPAAPARPPVRRLARPAVSRSTES 2897
Query: 145 PNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTA 204
P P + P P Q P P P P P A + P+
Sbjct: 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP---PPPRPQPPLAPTTDPAGAGE 2954
Query: 205 YPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
A P A P + A A S PP + S
Sbjct: 2955 PSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
Score = 47.6 bits (113), Expect = 2e-05
Identities = 52/275 (18%), Positives = 84/275 (30%), Gaps = 48/275 (17%)
Query: 34 SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPP----------PYYQHPSVPSHPSVPTFP 83
+ PP + + + + + + PSP+ A+PP P P+ P P P
Sbjct: 2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP 2836
Query: 84 SAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQ 143
+AP P PP S +P GGS P P P ++ + P+ P +P
Sbjct: 2837 TAPPPPPGPPPPS------------LPLGGSVAPGGDVRRRP--PSRSPAAKPAAPARPP 2882
Query: 144 A-----PNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQ 198
P S + P+ Q P +P P PP
Sbjct: 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR---- 2938
Query: 199 PSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI 258
P PL P+T P + P VP + P P
Sbjct: 2939 -------PQPPLAPTTDPAGAGEPSGAV--------PQPWLGALVPGRVAVPRFRVPQPA 2983
Query: 259 IAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASE 293
+ P ++ + + S + + L ++
Sbjct: 2984 PSREAPASSTPPLTGHSLSRVSSWASSLALHEETD 3018
Score = 47.6 bits (113), Expect = 2e-05
Identities = 51/244 (20%), Positives = 73/244 (29%), Gaps = 24/244 (9%)
Query: 36 PPPPVYQTSSHT-SPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQ 94
P PVY+ + P A P P P P P P+ + I P P
Sbjct: 2475 PGAPVYRRPAEARFPFAAGAAPDPGGGGPPDP--------DAPPAPSRLAPAILPDEPVG 2526
Query: 95 ESHSPYYYSDIR-----ANVPTGGSP-------VPSYPQGSYPSNPYQAQNSYPSVPNQP 142
E P + IR A+ G P P+ P S P + S P+V ++
Sbjct: 2527 EPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRA 2586
Query: 143 QAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHN 202
+ P+ P + +AP + + S P P +P P A P
Sbjct: 2587 RRPDA-PPQSARPRAPVD--DRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPP 2643
Query: 203 TAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPS 262
P A + P R+ ++ A S S P
Sbjct: 2644 PTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
Query: 263 VPPT 266
PPT
Sbjct: 2704 PPPT 2707
Score = 37.2 bits (86), Expect = 0.028
Identities = 46/255 (18%), Positives = 68/255 (26%), Gaps = 30/255 (11%)
Query: 17 TIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSH 76
T Q+E F ++ S + P A PP + P+
Sbjct: 230 TYLQDEPFVERRVVISHPLRGDIAAPAPPPVVGEGADRAPETARGATGPPPPPEAAAPNG 289
Query: 77 PSVP-----------TFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYP 125
+ P + P P PP D SP+P P+ YP
Sbjct: 290 AAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAMEVVSPLP-RPRQHYP 348
Query: 126 SNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASN 185
P + + T PS L A S + S + P A
Sbjct: 349 L----------GFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHAATPFARG 398
Query: 186 KPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDAT-------RSTHPTTA 238
Q + P+ + P VP++ P+ P + E + P
Sbjct: 399 PGGDDQTRPAAPVPA-SVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAP 457
Query: 239 TAPSVPPAIHDATRS 253
P DATR
Sbjct: 458 ATEPAPDDPDDATRK 472
Score = 36.1 bits (83), Expect = 0.053
Identities = 47/194 (24%), Positives = 55/194 (28%), Gaps = 29/194 (14%)
Query: 75 SHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNS 134
S FP AP+Y + R G +P P P P
Sbjct: 2467 SLLLGELFPGAPVYRRPA-----------EARFPFAAGAAPDPGGGGPPDPDAPPAPSRL 2515
Query: 135 YPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPA 194
P++ P P P P L G L S P PL P P P
Sbjct: 2516 APAI--LPDEPVGEPVHPRMLTWIRGLEELASDDAGD------PPPPLPPAAP--PAAPD 2565
Query: 195 YSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRST 254
SV P P+ P V S A P APP +S P P
Sbjct: 2566 RSVPPPRPAPRPSEPAVTSR-ARRPDAPP-------QSARPRAPVDDRGDPRGPAPPSPL 2617
Query: 255 PSPIIAPSVPPTNR 268
P AP PP +
Sbjct: 2618 PPDTHAPDPPPPSP 2631
Score = 33.4 bits (76), Expect = 0.41
Identities = 51/262 (19%), Positives = 70/262 (26%), Gaps = 23/262 (8%)
Query: 25 SKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPS 84
S+ + + PPV + + + + P PP Q P P P P
Sbjct: 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
Query: 85 APIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQA 144
P P PP P PT P G+ P A + +
Sbjct: 2928 QPQPPPPPPPRPQPPL--------APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV 2979
Query: 145 PNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTA 204
P PS + + S+ +S AS LA ++ + P PP Q
Sbjct: 2980 PQPAPSREAPASSTPPLTGHSLSRVSSWAS------SLALHEETDP-PPVSLKQTLWPPD 3032
Query: 205 YPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVP 264
S + D P R +PS P
Sbjct: 3033 DTEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPL 3092
Query: 265 PTN--------RDTIRSALVPL 278
N R T RSAL L
Sbjct: 3093 SANAALSRRYVRSTGRSALAVL 3114
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 56.3 bits (136), Expect = 3e-08
Identities = 49/264 (18%), Positives = 67/264 (25%), Gaps = 29/264 (10%)
Query: 19 AQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSA-NPPPYYQHPSVPSHP 77
S + +S SP Y+ S+ Y P S P + +
Sbjct: 85 PSVGPDSDLSQKTSTFSPCQSGYEASTDPE---YIPDLQPDPSLWGTAPKPEPQPPQAPE 141
Query: 78 SVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPS 137
S P P P +E + +P P PQG P Q P
Sbjct: 142 SQP-QPQTPAQKMLSLEEVEAQLQQRQQAPQLPQP--PQQVLPQGMPPRQAAFPQQGPPE 198
Query: 138 VP-NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYS 196
P PQ P +P Q + A P P P P P
Sbjct: 199 QPPGYPQPPQGHPEQVQPQQFLPA-------PSQAPAQPPLPPQL-----PQQPPPLQQP 246
Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
P + P P P P P A P P+ P + +
Sbjct: 247 QFPGLSQQMPPPPPQPPQQQQQPPQPQA---------QPPPQNQPTPHPGLPQGQNAPLP 297
Query: 257 PIIAPSVPPTNRDTIRSALVPLYP 280
P P + P + P +
Sbjct: 298 PPQQPQLLPLVQQPQGQQRGPQFR 321
Score = 42.8 bits (101), Expect = 4e-04
Identities = 30/175 (17%), Positives = 36/175 (20%), Gaps = 22/175 (12%)
Query: 35 SPPPPVYQTSSHTSPNAYNNPPS--PYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAP 92
P + + P P P Q P P+ P P
Sbjct: 179 VLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAP--SQAPAQPPLPPQL 236
Query: 93 PQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAP--NTYPS 150
PQ P P Q P P Q Q Q P N
Sbjct: 237 PQ------------QPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTP 284
Query: 151 MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAY 205
P Q N P Q LP P + + + A
Sbjct: 285 HPGLPQGQNAPLPPPQQPQ----LLPLVQQPQGQQRGPQFREQLVQLSQQQREAL 335
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 52.7 bits (127), Expect = 4e-07
Identities = 37/215 (17%), Positives = 53/215 (24%), Gaps = 10/215 (4%)
Query: 30 SSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYP 89
++ PP P +S P P +P A P + P+ + AP
Sbjct: 594 AAGGEGPPAP----ASSGPPEEAARPAAPAAPA-APAAPAPAGAAAAPAEASAAPAPGVA 648
Query: 90 SAPPQESH--SPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
+ H P G P+ P + A P T
Sbjct: 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT 708
Query: 148 YPSMPNQLQAPNGFYP-LTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYP 206
P+ A +S + A P P+ P + P PA P
Sbjct: 709 PPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAA 768
Query: 207 TAPLVPSTIATAPSAPPANIEDATRSTHPTTATAP 241
P + PS ED S
Sbjct: 769 APAAAPP--PSPPSEEEEMAEDDAPSMDDEDRRDA 801
Score = 37.3 bits (87), Expect = 0.020
Identities = 35/215 (16%), Positives = 52/215 (24%), Gaps = 26/215 (12%)
Query: 51 AYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVP 110
A P S+ PP P+ P+ P+ P P+ +AP + S +P
Sbjct: 595 AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAP----------- 643
Query: 111 TGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQA 170
+ + P + A P AP +Q A
Sbjct: 644 -APGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPA 702
Query: 171 SQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDAT 230
P+A P + P+ + A P D
Sbjct: 703 P--------------APAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP 748
Query: 231 RSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
A P P A + P PS
Sbjct: 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
Score = 35.7 bits (83), Expect = 0.060
Identities = 26/199 (13%), Positives = 43/199 (21%), Gaps = 13/199 (6%)
Query: 71 PSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQ 130
P P+ P P+AP + A P G + P+ +
Sbjct: 597 GEGPPAPASSGPPEEAARPAAPAAPAAPA-------APAPAGAAAAPAEASAAPAPGV-- 647
Query: 131 AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAP 190
+ S ++ + A A+ AP
Sbjct: 648 ----AAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP 703
Query: 191 QPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDA 250
P A + P + A + P P A
Sbjct: 704 APAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA 763
Query: 251 TRSTPSPIIAPSVPPTNRD 269
+P AP P + +
Sbjct: 764 PAPAAAPAAAPPPSPPSEE 782
Score = 32.3 bits (74), Expect = 0.68
Identities = 21/139 (15%), Positives = 35/139 (25%), Gaps = 13/139 (9%)
Query: 136 PSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAY 195
+ AP+ + P AP P ++ A A+ P + P +P A
Sbjct: 390 GAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAP 449
Query: 196 SVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTP 255
+ P+ A P A A + A +
Sbjct: 450 AGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAP--------- 500
Query: 256 SPIIAPSVPPTNRDTIRSA 274
A + T+R
Sbjct: 501 ----AAPAGADDAATLRER 515
Score = 31.5 bits (72), Expect = 1.2
Identities = 34/145 (23%), Positives = 44/145 (30%), Gaps = 18/145 (12%)
Query: 106 RANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLT 165
R GG+ P+ S + A + + A + P AP
Sbjct: 383 RRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAP------- 435
Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPAN 225
+ A N A PS P A S QP P A T A AP+ P A
Sbjct: 436 ---APAPAPPSPAGNAPAGGAPSPPPAAAPSAQP---APAPAAA-PEPTAAPAPAPPAA- 487
Query: 226 IEDATRSTHPTTATAPSVPPAIHDA 250
+ P AP+ P DA
Sbjct: 488 ---PAPAAAPAAPAAPAAPAGADDA 509
Score = 30.3 bits (69), Expect = 2.9
Identities = 16/100 (16%), Positives = 27/100 (27%), Gaps = 1/100 (1%)
Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPAN 225
++ + P + +P+AP PA P+ A AP S A P +
Sbjct: 594 AAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA-AAAPAEASAAPAPGVAAPEH 652
Query: 226 IEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
+ A A + P +
Sbjct: 653 HPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAA 692
Score = 30.0 bits (68), Expect = 3.6
Identities = 22/127 (17%), Positives = 34/127 (26%), Gaps = 14/127 (11%)
Query: 34 SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
+ ++ + +P +A P P P+ P PS A
Sbjct: 393 APAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452
Query: 94 QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPN 153
S P + PS P+ + + P AP P+ P
Sbjct: 453 APSPPP--------------AAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA 498
Query: 154 QLQAPNG 160
AP G
Sbjct: 499 APAAPAG 505
Score = 29.6 bits (67), Expect = 4.8
Identities = 25/138 (18%), Positives = 42/138 (30%), Gaps = 19/138 (13%)
Query: 89 PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQP-QAPNT 147
+ P + +P + + P A P+ QP AP
Sbjct: 390 GAGAPAAAAPS----------AAAAAPAAAPAPAAAA--PAAAAAPAPAAAPQPAPAPAP 437
Query: 148 YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPT 207
P+ P+ + A+ ++ P P P A+ +P+A PA P+ P
Sbjct: 438 APAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA-PAAAPEPTAAPAPAPPAAPA-----PA 491
Query: 208 APLVPSTIATAPSAPPAN 225
A AP+
Sbjct: 492 AAPAAPAAPAAPAGADDA 509
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 50.6 bits (121), Expect = 1e-06
Identities = 18/115 (15%), Positives = 23/115 (20%), Gaps = 14/115 (12%)
Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQ 169
P G P + P P P PN M Q+
Sbjct: 403 QGPQQQFNGQPLG----WPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAA--QK 456
Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
+ YP P+ P P + SA P
Sbjct: 457 PPMQPVMYP--------PNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQ 503
Score = 33.2 bits (76), Expect = 0.38
Identities = 28/158 (17%), Positives = 46/158 (29%), Gaps = 20/158 (12%)
Query: 17 TIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVP-S 75
+AQ + + F P + Q SP PY+ P Q P
Sbjct: 359 ALAQRKEQRRAHLQDQFMQLQPRMRQLPM-GSPMGGAMGQPPYYGQGP--QQQFNGQPLG 415
Query: 76 HPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSY 135
P + P+ P+ P P + P + + A + + + P P +Y
Sbjct: 416 WPRMSMMPT-PMGPGGPLR----PNGLAPMNAVRAPSRNAQNAAQK--PPMQPVMYPPNY 468
Query: 136 PS--VPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQAS 171
S + P + S Q + AS
Sbjct: 469 QSLPLSQDLPQPQSTASQGGQN-------KKLAQVLAS 499
Score = 31.3 bits (71), Expect = 1.6
Identities = 19/140 (13%), Positives = 34/140 (24%), Gaps = 21/140 (15%)
Query: 140 NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQAS-----------LPYPINPLASNKPS 188
Q Q M + + G P Q + +P P+ P +P+
Sbjct: 376 MQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPN 435
Query: 189 APQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIH 248
P PS N + P+ + P + P
Sbjct: 436 GLAPMNAVRAPSRNAQNAAQKPPMQPVMYPPNY----------QSLPLSQDLPQPQSTAS 485
Query: 249 DATRSTPSPIIAPSVPPTNR 268
++ + S P +
Sbjct: 486 QGGQNKKLAQVLASATPQMQ 505
>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant
zinc metalloprotases, presumably membrane-associated or
integral membrane proteases, which may be involved in
signalling and regulatory mechanisms. May be responsible
for substrate recognition and/or binding, as most PDZ
domains bind C-terminal polypeptides, and binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 79
Score = 44.9 bits (107), Expect = 2e-06
Identities = 17/62 (27%), Positives = 33/62 (53%), Gaps = 4/62 (6%)
Query: 376 VERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIR--DRPLERT 433
V GSPA+ GL+ GD+IL+ING+ + + + + ++ P + + + + T
Sbjct: 19 VVPGSPAAKAGLKAGDRILAINGQKIK--SWEDLVDAVQENPGKPLTLTVERNGETITLT 76
Query: 434 VT 435
+T
Sbjct: 77 LT 78
Score = 33.0 bits (76), Expect = 0.052
Identities = 22/73 (30%), Positives = 29/73 (39%), Gaps = 10/73 (13%)
Query: 444 FGFHFNKGQIVSLVKDSSAARNGLLVNHNILEVNG----------ANVVGLKDKELLEAM 493
G + I +V S AA+ GL IL +NG V K L +
Sbjct: 7 PGGPPIEPVIGEVVPGSPAAKAGLKAGDRILAINGQKIKSWEDLVDAVQENPGKPLTLTV 66
Query: 494 EKGGSVLTLTLIP 506
E+ G +TLTL P
Sbjct: 67 ERNGETITLTLTP 79
>gnl|CDD|223864 COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer
membrane].
Length = 406
Score = 48.9 bits (117), Expect = 4e-06
Identities = 26/85 (30%), Positives = 46/85 (54%), Gaps = 7/85 (8%)
Query: 360 IGLRVCAINEG-VFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPV 418
IG+ + + G V V GSPA+ G++ GD I+ I+G++V G ++ + K+++ P
Sbjct: 102 IGIELQMEDIGGVKVVSPIDGSPAAKAGIKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPG 161
Query: 419 NNIRVVIR----DRPLERTVTLHRD 439
+ + I +P TVTL R+
Sbjct: 162 TKVTLTILRAGGGKPF--TVTLTRE 184
Score = 44.6 bits (106), Expect = 9e-05
Identities = 16/55 (29%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 453 IVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAM-EKGGSVLTLTLIP 506
+VS + S AA+ G+ I++++G +V G+ E ++ + K G+ +TLT++
Sbjct: 116 VVSPIDGSPAAKAGIKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPGTKVTLTILR 170
>gnl|CDD|233695 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ
family. This family consists of a set proteins various
designated DegP, heat shock protein HtrA, and protease
DO. The ortholog in Pseudomonas aeruginosa is designated
MucD and is found in an operon that controls mucoid
phenotype. This family also includes the DegQ (HhoA)
paralog in E. coli which can rescue a DegP mutant, but
not the smaller DegS paralog, which cannot. Members of
this family are located in the periplasm and have
separable functions as both protease and chaperone.
Members have a trypsin domain and two copies of a PDZ
domain. This protein protects bacteria from thermal and
other stresses and may be important for the survival of
bacterial pathogens.// The chaperone function is
dominant at low temperatures, whereas the proteolytic
activity is turned on at elevated temperatures [Protein
fate, Protein folding and stabilization, Protein fate,
Degradation of proteins, peptides, and glycopeptides].
Length = 428
Score = 46.8 bits (112), Expect = 2e-05
Identities = 43/163 (26%), Positives = 60/163 (36%), Gaps = 34/163 (20%)
Query: 368 NEGVFVCLVERGSPASLVGLRFGDQILSINGETV----AGYNMKQVHKILKAAPVNNIRV 423
G V V GSPA GL+ GD I S+NG+ + K K +
Sbjct: 256 QRGALVAQVLPGSPAEKAGLKAGDVITSVNGKPISSFADLRRAIGTLKPGKKVTLG---- 311
Query: 424 VIRD-RPLERTVTLHRD-------SSGHFGFHF------------NKGQ-----IVSLVK 458
++R + TVTL S+ G KG + +V
Sbjct: 312 ILRKGKEKTITVTLGASPEEQASSSNPFLGLTVANLSPEIRKELRLKGDVKGVVVTKVVS 371
Query: 459 DSSAARNGLLVNHNILEVNGANVVGLKD-KELLEAMEKGGSVL 500
S AAR GL IL VN V + + +++L +KGG V
Sbjct: 372 GSPAARAGLQPGDVILSVNQQPVSSVAELRKVLARAKKGGRVA 414
Score = 29.5 bits (67), Expect = 4.9
Identities = 16/64 (25%), Positives = 29/64 (45%), Gaps = 3/64 (4%)
Query: 444 FGFHFNKGQIVS-LVKDSSAARNGLLVNHNILEVNGANVVGLKD-KELLEAMEKGGSVLT 501
G +G +V+ ++ S A + GL I VNG + D + + ++ G V T
Sbjct: 251 LGLEKQRGALVAQVLPGSPAEKAGLKAGDVITSVNGKPISSFADLRRAIGTLKPGKKV-T 309
Query: 502 LTLI 505
L ++
Sbjct: 310 LGIL 313
>gnl|CDD|221961 pfam13180, PDZ_2, PDZ domain.
Length = 81
Score = 42.2 bits (100), Expect = 2e-05
Identities = 31/83 (37%), Positives = 46/83 (55%), Gaps = 6/83 (7%)
Query: 358 GLIGLRVCAI-NEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQ-VHKILKA 415
G IG+RV GV V V+ GSPA+ GL+ GD ILSI+G+ V ++ + + IL
Sbjct: 1 GDIGVRVVQNEGTGVTVVSVKEGSPAAKAGLKPGDIILSIDGKKVN--SLTELIEVILNG 58
Query: 416 APVNNIRV-VIRD-RPLERTVTL 436
P + +++ V RD + V L
Sbjct: 59 KPGDTVKLTVYRDGKKKTVEVKL 81
Score = 29.1 bits (66), Expect = 0.98
Identities = 21/58 (36%), Positives = 28/58 (48%), Gaps = 5/58 (8%)
Query: 449 NKGQIVSLVKDSS-AARNGLLVNHNILEVNGANVVGLKDKELLEAMEKG--GSVLTLT 503
G V VK+ S AA+ GL IL ++G V L EL+E + G G + LT
Sbjct: 12 GTGVTVVSVKEGSPAAKAGLKPGDIILSIDGKKVNSL--TELIEVILNGKPGDTVKLT 67
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 46.2 bits (109), Expect = 4e-05
Identities = 49/245 (20%), Positives = 74/245 (30%), Gaps = 48/245 (19%)
Query: 39 PVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHS 98
PV + +P A PY++ P Q P P P +A +
Sbjct: 400 PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ 459
Query: 99 PYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSY---PSVPNQPQAPNTYPSMPNQL 155
Y +Y Q + YQ P V +P T P+ P
Sbjct: 460 STY------------QTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARP--- 504
Query: 156 QAPNGFYPLTSSQQASQAS--------LPYPINPLASNKPSAPQPPAYSVQPSHNTAYPT 207
P ++ ++A + +P P+ K S P +V P
Sbjct: 505 --PLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPP-------- 554
Query: 208 APLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI----IAPSV 263
V + A +P A + ++ AT + T A A P A P P I P +
Sbjct: 555 ---VEAAAAVSPLA--SGVKKATLA---TGAAATVAAPVFSLANSGGPRPQVKEGIGPQL 606
Query: 264 PPTNR 268
P R
Sbjct: 607 PRPKR 611
Score = 45.5 bits (107), Expect = 8e-05
Identities = 44/245 (17%), Positives = 69/245 (28%), Gaps = 25/245 (10%)
Query: 28 TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
T + S+++P PV QT S + P+ P P P + P +P
Sbjct: 328 TATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPE--GYPQQSQ 385
Query: 88 YPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
Y A P ++ ++ P Q Y + + P P+ P
Sbjct: 386 Y--AQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQP-- 441
Query: 148 YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPT 207
Q + + S+ Q Q P + P V+P
Sbjct: 442 VAGNAWQAEEQQSTFAPQSTYQTEQT-YQQPAAQEPLYQQPQPVEQQPVVEP-------- 492
Query: 208 APLVPSTIATAPSAPPANI-----EDATRSTHPTTATAPSVPPAIHDATRSTPS--PIIA 260
P T P+ PP E R A +P + + S
Sbjct: 493 ---EPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSV 549
Query: 261 PSVPP 265
+VPP
Sbjct: 550 AAVPP 554
Score = 40.5 bits (94), Expect = 0.003
Identities = 35/136 (25%), Positives = 45/136 (33%), Gaps = 8/136 (5%)
Query: 57 SPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPV 116
P P YQ P P P P P AP + P + PV
Sbjct: 754 QPQQPVAPQQQYQQPQQPVAP--QPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 811
Query: 117 PSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLP 176
PQ P P Q Y P QP AP ++ + L NG S+ + + P
Sbjct: 812 APQPQYQQPQQPVAPQPQYQQ-PQQPVAPQPQDTLLHPLLMRNG-----DSRPLHKPTTP 865
Query: 177 YPINPLASNKPSAPQP 192
P L + PS +P
Sbjct: 866 LPSLDLLTPPPSEVEP 881
Score = 38.5 bits (89), Expect = 0.010
Identities = 40/133 (30%), Positives = 45/133 (33%), Gaps = 18/133 (13%)
Query: 109 VPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQ 168
V PV Q P P Q Y P QP AP P Q AP Y
Sbjct: 752 VQQPQQPVAPQQQYQQPQQPVAPQPQYQQ-PQQPVAPQPQYQQPQQPVAPQPQYQQPQQP 810
Query: 169 QASQASLPYPINPLAS-------NKPSAPQPPAYSVQP-------SHNTAYPTAPLVPST 214
A Q P P+A +P APQP + P S PT PL PS
Sbjct: 811 VAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPL-PSL 869
Query: 215 IATAPSAPPANIE 227
P PP+ +E
Sbjct: 870 DLLTP--PPSEVE 880
Score = 35.8 bits (82), Expect = 0.066
Identities = 36/196 (18%), Positives = 57/196 (29%), Gaps = 24/196 (12%)
Query: 6 PSLEDMTFHQLTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHS--AN 63
AQ Y++ + Q S +P + Y A
Sbjct: 416 AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQ-QSTFAPQSTYQTEQTYQQPAAQ 474
Query: 64 PPPYYQHPSVPSHPSVPTFPSA-PIYPSAPPQESHSPYYYSDI-------RANVPTGGSP 115
P Y Q V P V P P+ PP YY+ ++ R + P
Sbjct: 475 EPLYQQPQPVEQQPVVEPEPVVEETKPARPPL-----YYFEEVEEKRAREREQLAAWYQP 529
Query: 116 VPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASL 175
+P + P S +VP A P +A + + A++
Sbjct: 530 IPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKA--------TLATGAAATV 581
Query: 176 PYPINPLASNKPSAPQ 191
P+ LA++ PQ
Sbjct: 582 AAPVFSLANSGGPRPQ 597
Score = 30.4 bits (68), Expect = 2.8
Identities = 31/136 (22%), Positives = 40/136 (29%), Gaps = 13/136 (9%)
Query: 89 PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTY 148
P+ S + +S ++A + G P + P Q + QPQ P
Sbjct: 716 PAGANPFSLDDFEFSPMKALLDDGPHE-PLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAP 774
Query: 149 PSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
Q Q P P QQ Q P P Q P V P P
Sbjct: 775 QPQYQQPQQP--VAPQPQYQQPQQPVAPQP----------QYQQPQQPVAPQPQYQQPQQ 822
Query: 209 PLVPSTIATAPSAPPA 224
P+ P P P A
Sbjct: 823 PVAPQPQYQQPQQPVA 838
Score = 28.9 bits (64), Expect = 7.9
Identities = 21/84 (25%), Positives = 24/84 (28%), Gaps = 5/84 (5%)
Query: 20 QNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHP-- 77
Q +Q P P YQ P P P YQ P P P
Sbjct: 757 QPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQ-QPQQPVAPQPQYQQPQQPVAPQP 815
Query: 78 --SVPTFPSAPIYPSAPPQESHSP 99
P P AP PQ+ +P
Sbjct: 816 QYQQPQQPVAPQPQYQQPQQPVAP 839
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 45.9 bits (109), Expect = 4e-05
Identities = 36/172 (20%), Positives = 53/172 (30%), Gaps = 20/172 (11%)
Query: 132 QNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQ 191
+ + + P AP QA+ A P A++ P+AP
Sbjct: 367 AAAAEAAAPAEKKTPARPEAAAPAAAP--------VAQAAAAPAPAAAPAAAASAPAAPP 418
Query: 192 PPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDAT 251
A + A A + A AP + A + AP PA+ A
Sbjct: 419 AAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAP--EPAVASAA 476
Query: 252 RSTPSPIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENA 303
+ + A + PT + A V A+E I A ARE A
Sbjct: 477 PAPAAAPAAARLTPTEEGDVWHATVQQL----------AAAEAITALARELA 518
Score = 33.1 bits (76), Expect = 0.40
Identities = 27/149 (18%), Positives = 42/149 (28%), Gaps = 23/149 (15%)
Query: 65 PPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSY 124
P + P+ P P A +AP ++ + A P + P+ P +
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAA----PAPAAAPAAAASAPAAPPAAA 421
Query: 125 PSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLAS 184
P P A + AP + A +A
Sbjct: 422 PPAPVAAPAAAAPAAAPAAAP-------------------AAVALAPAPPAQAAPETVAI 462
Query: 185 NKPSAPQPPAYSVQPSHNTAYPTAPLVPS 213
AP+P S P+ A A L P+
Sbjct: 463 PVRVAPEPAVASAAPAPAAAPAAARLTPT 491
Score = 33.1 bits (76), Expect = 0.44
Identities = 21/124 (16%), Positives = 35/124 (28%)
Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQ 169
P + + + P+ P A + V AP + AP
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP 425
Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDA 229
+ + P A+ + PA Q + T + P + + PA A
Sbjct: 426 VAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAA 485
Query: 230 TRST 233
R T
Sbjct: 486 ARLT 489
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 44.3 bits (105), Expect = 9e-05
Identities = 20/118 (16%), Positives = 27/118 (22%), Gaps = 6/118 (5%)
Query: 77 PSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYP 136
V T S +P + S + G P P S + P
Sbjct: 161 ADVATTNSDNSFPGEDADPAS----ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAP 216
Query: 137 SVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPA 194
S P + S N P G L P + + P P
Sbjct: 217 SSFQSDTPPPSPESPTNP-SPPPGPAAPPPPPVQQVPPLSTA-KPTPPSASATPAPIG 272
Score = 43.1 bits (102), Expect = 2e-04
Identities = 27/110 (24%), Positives = 36/110 (32%), Gaps = 11/110 (10%)
Query: 21 NEYFSKQTTSSSFSSPPPPVYQTS--SHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPS 78
N + + +S S PP S SP + PS P +Q + P P
Sbjct: 170 NSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPE 229
Query: 79 VPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNP 128
PT PS P P+APP + P+ P S P
Sbjct: 230 SPTNPSPPPGPAAPPPPPVQQVPPL---------STAKPTPPSASATPAP 270
Score = 42.0 bits (99), Expect = 4e-04
Identities = 27/139 (19%), Positives = 38/139 (27%), Gaps = 20/139 (14%)
Query: 104 DIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYP 163
D + P ++P +S P VP+ P P S P+ P
Sbjct: 160 DADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSS-PSDSSLP----- 213
Query: 164 LTSSQQASQASLPYPINPLASNKPSAP-QPPAYSVQPSHNTAYPTAP-LVPSTIATAPSA 221
P P + + P +P P S P P VP P+
Sbjct: 214 ------------PAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTP 261
Query: 222 PPANIEDATRSTHPTTATA 240
P A+ A A
Sbjct: 262 PSASATPAPIGGITLDDDA 280
Score = 41.2 bits (97), Expect = 0.001
Identities = 23/127 (18%), Positives = 33/127 (25%)
Query: 49 PNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRAN 108
PN A+ S P + P S PS+ P P D +
Sbjct: 148 PNPGPPLDEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSP 207
Query: 109 VPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQ 168
+ P PS Q P ++ + P P L P S+
Sbjct: 208 SDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASAT 267
Query: 169 QASQASL 175
A +
Sbjct: 268 PAPIGGI 274
Score = 37.4 bits (87), Expect = 0.015
Identities = 35/130 (26%), Positives = 41/130 (31%), Gaps = 21/130 (16%)
Query: 22 EYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPT 81
E TT+S S P S+ S P S + P + PS PS S+P
Sbjct: 159 EDADVATTNSDNSFPGEDADPASASPSDP----PSSSPGVPSFPSPPEDPSSPSDSSLPP 214
Query: 82 FPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPS--VP 139
PS+ S P S PT SP P P P Q S P
Sbjct: 215 APSSF--QSDTPPPSPES----------PTNPSPPPGPAA---PPPPPVQQVPPLSTAKP 259
Query: 140 NQPQAPNTYP 149
P A T
Sbjct: 260 TPPSASATPA 269
Score = 37.0 bits (86), Expect = 0.019
Identities = 22/131 (16%), Positives = 35/131 (26%), Gaps = 18/131 (13%)
Query: 137 SVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYS 196
+ + N++P + + + +S +P P + PS P
Sbjct: 161 ADVATTNSDNSFPGEDADPASASP----SDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA- 215
Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
PS + P S T PS PP + + P TP
Sbjct: 216 --PSSFQSDTPPPSPESP--TNPSPPPGPAAPPPPPVQQVPPLSTAKP---------TPP 262
Query: 257 PIIAPSVPPTN 267
A P
Sbjct: 263 SASATPAPIGG 273
Score = 35.8 bits (83), Expect = 0.048
Identities = 22/67 (32%), Positives = 25/67 (37%), Gaps = 2/67 (2%)
Query: 30 SSSFSSPPPPVYQTSSH--TSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
S S PP P S SP + NP P A PPP P + PT PSA
Sbjct: 207 PSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASA 266
Query: 88 YPSAPPQ 94
P+
Sbjct: 267 TPAPIGG 273
Score = 31.2 bits (71), Expect = 1.3
Identities = 24/114 (21%), Positives = 37/114 (32%), Gaps = 3/114 (2%)
Query: 171 SQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDAT 230
+ + +P S PP+ S + P P PS ++ P AP + + T
Sbjct: 166 TNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSD-SSLPPAPS-SFQSDT 223
Query: 231 RSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVP-PTNRDTIRSALVPLYPSLD 283
P + T PS PP + S PT + +LD
Sbjct: 224 PPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLD 277
Score = 30.0 bits (68), Expect = 2.8
Identities = 15/59 (25%), Positives = 20/59 (33%), Gaps = 2/59 (3%)
Query: 34 SSPPPPVY-QTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSA 91
S PPP ++ + P PP P PP P+ PS + P I
Sbjct: 221 SDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASA-TPAPIGGITLDD 278
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 44.5 bits (105), Expect = 1e-04
Identities = 29/190 (15%), Positives = 51/190 (26%), Gaps = 2/190 (1%)
Query: 90 SAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYP 149
+ P + +P A P +P P+ P + + P A + +
Sbjct: 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPE 431
Query: 150 SMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAP 209
++ QA + + A+ P A+ P A + A AP
Sbjct: 432 ALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAP 491
Query: 210 --LVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTN 267
P P + + PA D + + AP+ P
Sbjct: 492 ADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAP 551
Query: 268 RDTIRSALVP 277
R + V
Sbjct: 552 RAAAATEPVV 561
Score = 44.1 bits (104), Expect = 2e-04
Identities = 33/194 (17%), Positives = 51/194 (26%), Gaps = 3/194 (1%)
Query: 137 SVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYS 196
AP P+ A P + A + P+ P +
Sbjct: 373 GPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEA 432
Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
+ + + P+ +AP A A P A A + P A P+
Sbjct: 433 LAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPA 492
Query: 257 PIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAAGNTV 316
PP A G S A A + A E + A
Sbjct: 493 D---DDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAP 549
Query: 317 ARQAPRDSCTMIAP 330
A +A + ++AP
Sbjct: 550 APRAAAATEPVVAP 563
Score = 41.0 bits (96), Expect = 0.001
Identities = 39/231 (16%), Positives = 55/231 (23%), Gaps = 20/231 (8%)
Query: 34 SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
+ P + +P A + A PP P+ + A P
Sbjct: 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPA---APAAAPAAAAAARAVAAAPARRSP 428
Query: 94 QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPN 153
+P P+ + A P AP
Sbjct: 429 APEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAA 488
Query: 154 QLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPS 213
A + P + P + AP P TA P
Sbjct: 489 PAPADDDPPPWEELPPEFASPAPAQPDA-------APAGWVAESIPDPATADPDDAF--E 539
Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVP 264
T+A AP+A PA AT P V P A+ S + P
Sbjct: 540 TLAPAPAAAPA--------PRAAAATEPVVAPRPPRASASGLPDMFDGDWP 582
Score = 36.0 bits (83), Expect = 0.054
Identities = 32/153 (20%), Positives = 42/153 (27%), Gaps = 6/153 (3%)
Query: 112 GGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQAS 171
GG+ + A + P AP P+ P AP + A
Sbjct: 370 GGAGPATAAAAPVAQPAPAAAAPAAAAP----APAAPPAAPAA--APAAAAAARAVAAAP 423
Query: 172 QASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATR 231
P P A+ + SA P A P A P+ P A A A
Sbjct: 424 ARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARA 483
Query: 232 STHPTTATAPSVPPAIHDATRSTPSPIIAPSVP 264
+ A A PP + SP A
Sbjct: 484 APAAAPAPADDDPPPWEELPPEFASPAPAQPDA 516
Score = 35.2 bits (81), Expect = 0.089
Identities = 28/186 (15%), Positives = 48/186 (25%), Gaps = 2/186 (1%)
Query: 149 PSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
P P +Q A A+ P P + P+AP + + A A
Sbjct: 365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPA 424
Query: 209 PLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNR 268
P+ A A + + A A A P A + P
Sbjct: 425 RRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAA 484
Query: 269 DTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAAGNTVARQAPRDSCTMI 328
A P + E AS + A +++ + T+
Sbjct: 485 PAAAPAPADDDPPPWEELPPEFAS--PAPAQPDAAPAGWVAESIPDPATADPDDAFETLA 542
Query: 329 APLSSS 334
+++
Sbjct: 543 PAPAAA 548
Score = 33.3 bits (76), Expect = 0.34
Identities = 27/105 (25%), Positives = 37/105 (35%), Gaps = 3/105 (2%)
Query: 161 FYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPS 220
F P S A A+ P+A P+A P A + P+ A P A + A A +
Sbjct: 363 FRPGQSGGGAGPATAAAA--PVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVA 420
Query: 221 APPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
A PA A A + P A P+ A + P
Sbjct: 421 AAPARRSPA-PEALAAARQASARGPGGAPAPAPAPAAAPAAAARP 464
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 43.9 bits (103), Expect = 2e-04
Identities = 61/245 (24%), Positives = 80/245 (32%), Gaps = 23/245 (9%)
Query: 38 PPVYQTSSHTSPNAYNNPPSPYHSANPPPY-YQHPSVPSHPSVPTFPSAP-IYPSAPPQE 95
P T+S + +A PS + P P+ Q P P+ S SAP +P
Sbjct: 576 PLTSPTTSQLASSA----PSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPI 631
Query: 96 SHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQL 155
P I NV P P P + +P QP +P Q
Sbjct: 632 PMRPLRMQPITFNVL--VFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQ- 688
Query: 156 QAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTI 215
AP P P P+ P A+ P Q PA + + A P
Sbjct: 689 WAPGTMQP--------PPRAPTPMRPPAA-PPGRAQRPAAATGRARPPAAAPGRARPPAA 739
Query: 216 ATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSAL 275
A + PPA R+ P A + PPA A P+P P PP + R A
Sbjct: 740 APGRARPPAAA--PGRARPPAAAPGRARPPA---AAPGAPTPQPPPQAPPAPQQRPRGAP 794
Query: 276 VPLYP 280
P P
Sbjct: 795 TPQPP 799
Score = 29.6 bits (66), Expect = 5.6
Identities = 41/190 (21%), Positives = 52/190 (27%), Gaps = 29/190 (15%)
Query: 56 PSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSP 115
P ++ P +Q P V P PT+ P P + P P
Sbjct: 640 PITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANT---MLPIQWAPGTMQP 696
Query: 116 VPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASL 175
P P P P A P Q A T + P P A +
Sbjct: 697 PPRAPT---PMRPPAA----PPGRAQRPAAATGRARPPAAAPGRARPP----AAAPGRAR 745
Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHP 235
P P + P+A A + P P P APPA + P
Sbjct: 746 PPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP---------PQAPPAPQQ------RP 790
Query: 236 TTATAPSVPP 245
A P PP
Sbjct: 791 RGAPTPQPPP 800
>gnl|CDD|232883 TIGR00225, prc, C-terminal peptidase (prc). A C-terminal peptidase
with different substrates in different species including
processing of D1 protein of the photosystem II reaction
center in higher plants and cleavage of a peptide of 11
residues from the precursor form of penicillin-binding
protein in E.coli E.coli and H influenza have the most
distal branch of the tree and their proteins have an
N-terminal 200 amino acids that show no homology to
other proteins in the database [Protein fate,
Degradation of proteins, peptides, and glycopeptides,
Protein fate, Protein modification and repair].
Length = 334
Score = 43.1 bits (102), Expect = 2e-04
Identities = 19/80 (23%), Positives = 39/80 (48%)
Query: 360 IGLRVCAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVN 419
IG++V + + + GSPA G++ GD+I+ ING++VAG ++ +++
Sbjct: 53 IGIQVGMDDGEIVIVSPFEGSPAEKAGIKPGDKIIKINGKSVAGMSLDDAVALIRGKKGT 112
Query: 420 NIRVVIRDRPLERTVTLHRD 439
+ + I + +T
Sbjct: 113 KVSLEILRAGKSKPLTFTLK 132
Score = 38.5 bits (90), Expect = 0.007
Identities = 14/55 (25%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 453 IVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAME-KGGSVLTLTLIP 506
IVS + S A + G+ I+++NG +V G+ + + + K G+ ++L ++
Sbjct: 66 IVSPFEGSPAEKAGIKPGDKIIKINGKSVAGMSLDDAVALIRGKKGTKVSLEILR 120
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 43.3 bits (103), Expect = 3e-04
Identities = 30/164 (18%), Positives = 47/164 (28%), Gaps = 9/164 (5%)
Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPL-TSSQ 168
P P P P S + P+ P P P PL ++
Sbjct: 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTS 420
Query: 169 QASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIED 228
Q A + K +P A S N+A V + AP
Sbjct: 421 QLLAARQQLQ-RAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAY 479
Query: 229 ATRSTHPTTATAPSV--PPAI-----HDATRSTPSPIIAPSVPP 265
++T+P V P A+ H+ T + + A ++
Sbjct: 480 RWKATNPVEVKKEPVATPKALKKALEHEKTPELAAKLAAEAIER 523
Score = 34.5 bits (80), Expect = 0.16
Identities = 32/169 (18%), Positives = 48/169 (28%), Gaps = 21/169 (12%)
Query: 55 PPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGS 114
P P SA P Q + P+ P PP +P P
Sbjct: 368 PEVPPQSAAPAASAQ---ATAAPTAAVAPPQAPAVPPPPAS--AP-------QQAPA-VP 414
Query: 115 PVPSYPQGSYPSNPYQ-AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQA 173
+ Q Q AQ + + ++P A + + + L+ P S+ + + A
Sbjct: 415 LPETTSQLLAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPA 474
Query: 174 SLPY----PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATA 218
NP+ K P A H T L A A
Sbjct: 475 KKEAYRWKATNPVEVKKEPVATPKALKKALEHE---KTPELAAKLAAEA 520
Score = 33.7 bits (78), Expect = 0.24
Identities = 33/147 (22%), Positives = 47/147 (31%), Gaps = 6/147 (4%)
Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP-PANIEDATRSTHPT 236
P PL P P A + TA PTA + P P P A + T
Sbjct: 361 PAAPLPE--PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPET 418
Query: 237 TATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELIN 296
T+ + + A +T + P+ R SAL L LE A
Sbjct: 419 TSQLLAARQQLQRAQGATKAKKSEPAAASRARPV-NSALERLASVRPAPSALEKAPAKKE 477
Query: 297 AYARE--NAANERESDAAGNTVARQAP 321
AY + N ++ A ++A
Sbjct: 478 AYRWKATNPVEVKKEPVATPKALKKAL 504
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 42.5 bits (100), Expect = 5e-04
Identities = 41/202 (20%), Positives = 53/202 (26%), Gaps = 22/202 (10%)
Query: 28 TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
T S P PP + P PY Y + P +P P
Sbjct: 87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPY-----EGYGGPRADDRPPGLPRQDQLPT 141
Query: 88 YPSAPPQESHSP-----------YYYSDIRANVPTG---GSPVPSYPQGSYPSNPYQAQN 133
A P P Y + R P SP P+ PY A
Sbjct: 142 ARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGR 201
Query: 134 S-YPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQ--QASQASLPYPINPLASNKPSAP 190
Y P P + + P + + P+ +PSAP
Sbjct: 202 PEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGPPERDDAPVVPIRPSAP 261
Query: 191 QPPAYSVQPSHNTAYPTAPLVP 212
P A P+ PTA L P
Sbjct: 262 GPLAAQPAPAPGPGEPTARLNP 283
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 42.1 bits (99), Expect = 6e-04
Identities = 23/96 (23%), Positives = 36/96 (37%), Gaps = 6/96 (6%)
Query: 178 PINPLASNKPSAPQPPAY---SVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTH 234
I P+ + +APQP A S PS ++A S A + P + D +
Sbjct: 378 HIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVS-VDPPAAVP 436
Query: 235 --PTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNR 268
P + +V PA + P ++ P T R
Sbjct: 437 VNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLR 472
Score = 37.4 bits (87), Expect = 0.017
Identities = 23/119 (19%), Positives = 36/119 (30%), Gaps = 2/119 (1%)
Query: 26 KQTTSSSFSSP-PPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPS 84
KQ F+ P P ++ SP+ + + SA + P+ P
Sbjct: 376 KQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAV 435
Query: 85 APIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQ 143
PS PQ + + + V S PS + QA + P Q
Sbjct: 436 PVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQE-KAEQATGNIKEAPTGTQ 493
Score = 32.8 bits (75), Expect = 0.46
Identities = 15/68 (22%), Positives = 22/68 (32%), Gaps = 6/68 (8%)
Query: 206 PTAPLVPSTIATAPSAPPANIEDATRS------THPTTATAPSVPPAIHDATRSTPSPII 259
P + P A + P+ A+ S +A + PA T S P
Sbjct: 375 PKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAA 434
Query: 260 APSVPPTN 267
P PP+
Sbjct: 435 VPVNPPST 442
Score = 31.3 bits (71), Expect = 1.5
Identities = 15/92 (16%), Positives = 27/92 (29%), Gaps = 7/92 (7%)
Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAY------SVQPSHNTAYPTAPLVPSTIATAP 219
++ Q S A+ P +S P + S + P+ P + A
Sbjct: 388 AAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAA-VPVNPPSTAPQA 446
Query: 220 SAPPANIEDATRSTHPTTATAPSVPPAIHDAT 251
P E+ ++ PS I +
Sbjct: 447 VRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKA 478
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 41.5 bits (97), Expect = 9e-04
Identities = 54/215 (25%), Positives = 79/215 (36%), Gaps = 30/215 (13%)
Query: 55 PPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYY-YSDIRANVPTGG 113
P + P P P P PS P P P A SPY Y D++ PT
Sbjct: 330 PKESDAADGPKPVPTKPVTPEAPSPPI-EEEPPQPKAVVPRPLSPYTAYEDLKP--PT-- 384
Query: 114 SPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQA 173
SP+P+ P S S+ + P+ P+ +P + ++P A +A
Sbjct: 385 SPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQ------------VEA 432
Query: 174 SLPYPINPLAS----NKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDA 229
P++P A P++P P A + ++ + P VP T AP DA
Sbjct: 433 KKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDT------APATAATDA 486
Query: 230 TRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVP 264
P P P A++D + SP A V
Sbjct: 487 A--APPPANMRPLSPYAVYDDLKPPTSPSPAAPVG 519
Score = 31.8 bits (72), Expect = 0.99
Identities = 54/235 (22%), Positives = 75/235 (31%), Gaps = 31/235 (13%)
Query: 26 KQTTSSSFSSPPPPVYQTSSHTSP-NAYNN-----PPSPYHSANPPPYYQHPSVPSHPSV 79
+ + PP P SP AY + P P ++ P + + P+
Sbjct: 350 EAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTSPIPTPPSSSPASSKSVDAVAKPAE 409
Query: 80 PTFPSAPIYPSAPPQESH-----------SPYY-YSDIRANVPTGGSPVPSYPQGSYPSN 127
P +P S P+ SPY Y D++ PT SP P+ P G PS
Sbjct: 410 PDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKP--PT--SPSPTAPTGVSPS- 464
Query: 128 PYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKP 187
+S SVP P + P PL+ P +P A+
Sbjct: 465 ----VSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSP-AAPVG 519
Query: 188 SAPQPPAYSVQPSHNTAYPTAPLVPSTIATA---PSAPPANIEDATRSTHPTTAT 239
V N+A PTA A P +P ED T PT +
Sbjct: 520 KVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPYTMYEDLKPPTSPTPSP 574
Score = 28.7 bits (64), Expect = 8.4
Identities = 24/87 (27%), Positives = 30/87 (34%), Gaps = 1/87 (1%)
Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPS-APPANIEDATRSTHPT 236
P+ L + PS PP S PT P+ P + PP R P
Sbjct: 315 PMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPY 374
Query: 237 TATAPSVPPAIHDATRSTPSPIIAPSV 263
TA PP T + SP + SV
Sbjct: 375 TAYEDLKPPTSPIPTPPSSSPASSKSV 401
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 40.9 bits (95), Expect = 0.001
Identities = 43/254 (16%), Positives = 70/254 (27%), Gaps = 10/254 (3%)
Query: 28 TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
T+ + + P T+ TS P+P P+ + T P
Sbjct: 500 TSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTE 559
Query: 88 YPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
+ + + T G S + +S P + P
Sbjct: 560 ESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLL 619
Query: 148 YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLAS------NKPSAPQPPAYSVQ--- 198
+ P + P S P P S N ++ P V
Sbjct: 620 TSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGM 679
Query: 199 PSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI 258
P+ N P+AP T ++ T+ T +T A + P A R+TP
Sbjct: 680 PNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNA 739
Query: 259 IAPSVPPTNRDTIR 272
PP+ +R
Sbjct: 740 TTYL-PPSTSSKLR 752
Score = 30.1 bits (67), Expect = 3.7
Identities = 35/186 (18%), Positives = 53/186 (28%), Gaps = 9/186 (4%)
Query: 106 RANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLT 165
+A T Y + A + PS+P P + P P G T
Sbjct: 424 KAPDTTKSVIF-VYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPT-SGTPTGTTSST 481
Query: 166 SSQQASQASLPYPINPLA------SNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAP 219
+ S S P A P+A P + N PT P+V TA
Sbjct: 482 LPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPT-PIVIGVTTTAT 540
Query: 220 SAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVPLY 279
S P ++ T +P T + A + + ++ P
Sbjct: 541 SPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGI 600
Query: 280 PSLDNY 285
PS +
Sbjct: 601 PSSSHS 606
>gnl|CDD|177681 PLN00049, PLN00049, carboxyl-terminal processing protease;
Provisional.
Length = 389
Score = 40.1 bits (94), Expect = 0.002
Identities = 20/62 (32%), Positives = 35/62 (56%)
Query: 379 GSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRDRPLERTVTLHR 438
G PA+ G+R GD IL+I+G + G ++ + L+ +++ + +R P R VTL R
Sbjct: 112 GGPAARAGIRPGDVILAIDGTSTEGLSLYEAADRLQGPEGSSVELTLRRGPETRLVTLTR 171
Query: 439 DS 440
+
Sbjct: 172 EK 173
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 40.4 bits (94), Expect = 0.002
Identities = 57/245 (23%), Positives = 85/245 (34%), Gaps = 21/245 (8%)
Query: 39 PVYQTSSHTSPNAYNNP-PSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESH 97
P S P+++ P S PP P PS P + P+ PI H
Sbjct: 305 PPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKH 364
Query: 98 -----SPYYYSDIRANVPTGGS--PVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPS 150
P + + +N+P + P+ S P PS P Q Q + P+
Sbjct: 365 PPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMP----QSQPLQSVPA 420
Query: 151 MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPL 210
P L P S+ S P +P A + ++ PA PS T+ P AP
Sbjct: 421 QPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAP- 479
Query: 211 VPSTIATAPSAPPANIEDATRSTHPTTATAPSV-----PPAIHDATRSTPSPIIAPSVPP 265
A++ S PP + ++ P + P + S P P +PS P
Sbjct: 480 ---PRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPEP 536
Query: 266 TNRDT 270
T +T
Sbjct: 537 TVVNT 541
Score = 39.7 bits (92), Expect = 0.004
Identities = 47/225 (20%), Positives = 63/225 (28%), Gaps = 13/225 (5%)
Query: 20 QNEYFSKQTTSSSFSSP-PPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPS 78
NE S + P PP Q + PP+P A PP + P+
Sbjct: 159 DNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQP 218
Query: 79 VPTFPSAPIY-PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPS 137
P + I PS PQ SP+ ++ + SP P P +P + +
Sbjct: 219 QQPSPLSLISAPSLHPQRLPSPH--PPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMP 276
Query: 138 VPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSV 197
Q N Q Q+ LP P + Q
Sbjct: 277 HALQQGPVFLQHPSSNPPQP-------FGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQ 329
Query: 198 QPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPS 242
QP P AP +P P P HP PS
Sbjct: 330 QPPREQPLPPAPSMPHI--KPPPTTPIPQLPNQSHKHPPHLQGPS 372
Score = 38.9 bits (90), Expect = 0.007
Identities = 34/161 (21%), Positives = 51/161 (31%), Gaps = 5/161 (3%)
Query: 31 SSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSA--PIY 88
S + PP + Q+ S + + + P+ QHP P +
Sbjct: 416 QSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTST 475
Query: 89 PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTY 148
P+APP+ S + P P P P S P P++P+
Sbjct: 476 PAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPPRSPSPE 535
Query: 149 PS---MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNK 186
P+ P+ FY S A PLAS+K
Sbjct: 536 PTVVNTPSHASQSARFYKHLDRGYNSCARTDLYFTPLASSK 576
Score = 32.7 bits (74), Expect = 0.64
Identities = 43/205 (20%), Positives = 68/205 (33%), Gaps = 11/205 (5%)
Query: 132 QNSYPSVPN-QPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAP 190
++S PS+P+ Q ++ S QL P G + A+ A P P A P P
Sbjct: 148 RSSSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVP--P 205
Query: 191 QPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDA 250
Q + QP A P ++ +APS P + P TA+ S P
Sbjct: 206 QGSPIAAQP----APQPQQPSPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPA--P 259
Query: 251 TRSTPSPIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESD 310
+ P PP + + +PS + +A + + A
Sbjct: 260 SSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHT 319
Query: 311 AAGNTVA--RQAPRDSCTMIAPLSS 333
+ +Q PR+ AP
Sbjct: 320 PPSQSALQPQQPPREQPLPPAPSMP 344
>gnl|CDD|223821 COG0750, COG0750, Predicted membrane-associated Zn-dependent
proteases 1 [Cell envelope biogenesis, outer membrane].
Length = 375
Score = 39.8 bits (93), Expect = 0.003
Identities = 32/150 (21%), Positives = 59/150 (39%), Gaps = 7/150 (4%)
Query: 376 VERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRDRPLERTVT 435
V S A+L GLR GD+I++++GE VA ++ + + A V N+ ++ R
Sbjct: 136 VAPKSAAALAGLRPGDRIVAVDGEKVASWDDVRRLLVAAAGDVFNLLTILVIRL--DGEA 193
Query: 436 LHRDSSGHFGFHFNKGQIVSLVKDSSAARNGLLVNH--NILEVNGANVVGLKD-KELLEA 492
+ I D A + + N G +V + + A
Sbjct: 194 HAVAAEIIKSLGLTPVVIPLKPGDKIVAVDVGAIGLSPNGEPDVGKVLVKYGPLEAVGLA 253
Query: 493 MEKGGSVLTLTLIPTSMYQHMMKKMSFNLV 522
+EK G ++ LTL + + + +S +
Sbjct: 254 VEKTGRLVKLTL--KMLKKLITGDLSLKNL 281
>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
Length = 332
Score = 39.3 bits (92), Expect = 0.004
Identities = 34/188 (18%), Positives = 58/188 (30%), Gaps = 20/188 (10%)
Query: 23 YFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTF 82
YFS +++F+ PP S+ + P Y PPP+ Q + S
Sbjct: 33 YFSN---ANTFTRTRPPSRPISNEEADQPNTLNPQSYVETTPPPFQQPQTEESESE---- 85
Query: 83 PSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQP 142
+ ++S +I+ +P S P P Q Q P+ N
Sbjct: 86 --NEVQIQQEVEQS-----VDEIKITLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVA 138
Query: 143 QAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQP-SH 201
PN + SS + + L + S+ + ++P +
Sbjct: 139 SMTIEETQSPNV-----PIEGINSSSEQLRVELAELAAEIYSDASHRVELAKNFMEPQAE 193
Query: 202 NTAYPTAP 209
A P A
Sbjct: 194 TEAQPEAT 201
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 39.4 bits (92), Expect = 0.004
Identities = 26/155 (16%), Positives = 45/155 (29%), Gaps = 6/155 (3%)
Query: 112 GGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQAS 171
+ G+ A P+A + + +AP ++
Sbjct: 382 APGARAAAAVGASAVPAVTAVTGAAGAALAPKA--AAAAAATRAEAPPAAPAPPATADRG 439
Query: 172 QASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATR 231
+ A A + + A P A ++ + + P A E A R
Sbjct: 440 DDAADGD----APVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPR 495
Query: 232 STHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPT 266
+ P+ AT +VP A A S A + P
Sbjct: 496 AAAPSAATPAAVPDARAPAAASREDAPAAAAPPAP 530
Score = 37.5 bits (87), Expect = 0.019
Identities = 26/177 (14%), Positives = 43/177 (24%), Gaps = 3/177 (1%)
Query: 64 PPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGS 123
P V P A A + + + A P + + +
Sbjct: 367 APGGGVPARVAGAVPAPG-ARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEA 425
Query: 124 YPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLA 183
P+ P + + A + + A S P +
Sbjct: 426 PPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAP 485
Query: 184 SNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATA 240
+ P P A + + T A+ AP A A + PT A A
Sbjct: 486 PDAAFEPAPRAAAPSAA--TPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAA 540
Score = 31.4 bits (71), Expect = 1.3
Identities = 24/156 (15%), Positives = 38/156 (24%), Gaps = 10/156 (6%)
Query: 89 PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTY 148
+ + +P +A + + P P + A
Sbjct: 402 VTGAAGAALAP------KAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANA 455
Query: 149 PSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
+ + P S AS + P + P A P A + + P A
Sbjct: 456 RASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAA 515
Query: 209 PLVPSTIAT----APSAPPANIEDATRSTHPTTATA 240
A AP A P A + A A
Sbjct: 516 ASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAA 551
Score = 31.0 bits (70), Expect = 1.9
Identities = 41/180 (22%), Positives = 55/180 (30%), Gaps = 12/180 (6%)
Query: 144 APNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNT 203
AP+ Y L F P + A +P A P A A
Sbjct: 341 APDEYAGFTMTLLRMLAFEPAVTGGGAPGGGVP-ARVAGAVPAPGARAAAAVGAS----- 394
Query: 204 AYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSV 263
A P V A + A ATR+ P A AP D +P+ A +
Sbjct: 395 AVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKAN 454
Query: 264 PPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAAGNTVARQAPRD 323
+ D+ R P D+ AS+ A +AA E AA + A A
Sbjct: 455 ARASADS-RCDERDAQPPADSGSASAPASD-----APPDAAFEPAPRAAAPSAATPAAVP 508
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 38.9 bits (91), Expect = 0.005
Identities = 20/87 (22%), Positives = 30/87 (34%), Gaps = 3/87 (3%)
Query: 163 PLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP 222
PL +S A+ P P P+ + ++ P + A + PS +A
Sbjct: 163 PLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAAT 222
Query: 223 PANIEDA---TRSTHPTTATAPSVPPA 246
PA A + PT S P A
Sbjct: 223 PAPAAPATPDGAAPLPTDQAGVSTPAA 249
Score = 35.0 bits (81), Expect = 0.082
Identities = 25/106 (23%), Positives = 34/106 (32%), Gaps = 12/106 (11%)
Query: 126 SNPYQAQNSYPSVP----------NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASL 175
S+ +QNS SVP P + P Q P + Q ++
Sbjct: 150 SSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAV 209
Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSA 221
P A+ +A PA P PT ST A P+A
Sbjct: 210 VAPSQ--ANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNA 253
Score = 33.1 bits (76), Expect = 0.31
Identities = 20/113 (17%), Positives = 34/113 (30%), Gaps = 10/113 (8%)
Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPAN 225
S++ + + P++ + P+ PA T P T A A + PA
Sbjct: 151 SAELSQNSGQSVPLDTSTTTDPATTPAPA--------APVDTTPTNSQTPAVATAPAPAV 202
Query: 226 IEDATRSTHP--TTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALV 276
P + P AT +P+ + +ALV
Sbjct: 203 DPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNALV 255
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 38.8 bits (90), Expect = 0.005
Identities = 40/195 (20%), Positives = 55/195 (28%), Gaps = 26/195 (13%)
Query: 27 QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
Q+ + P Y S P P + P P Q P + S P P
Sbjct: 108 QSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQ----QPQPPPQVPQQQQYQSPPQQPQYQ 163
Query: 87 IYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPN 146
P Q V Y SYP N + P P+
Sbjct: 164 QNPPPQAQS----------APQVSGLYPEESPYQPQSYPPNEPLPSSMAMQPPYSGAPPS 213
Query: 147 TYPSMPNQLQA---------PNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSV 197
P Q PN +P SQ Y + +K + +Y+
Sbjct: 214 QQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPPSKGNHGSVASYAP 273
Query: 198 Q---PSHNTAYPTAP 209
Q S++TAYP+ P
Sbjct: 274 QGSSQSYSTAYPSLP 288
Score = 37.2 bits (86), Expect = 0.015
Identities = 34/163 (20%), Positives = 46/163 (28%), Gaps = 7/163 (4%)
Query: 35 SPPPPVYQTSSHTSPNA----YNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPS 90
PPP Y P P + P +P + P+ P+ S
Sbjct: 141 PQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSS 200
Query: 91 APPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPS 150
Q +S S P + P G S Q PS + + P
Sbjct: 201 MAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPP 260
Query: 151 MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
+ + P SSQ S A YP P A+ P A
Sbjct: 261 SKGNHGSVASYAPQGSSQSYSTA---YPSLPAATVLPQALPMS 300
Score = 36.1 bits (83), Expect = 0.034
Identities = 45/191 (23%), Positives = 53/191 (27%), Gaps = 11/191 (5%)
Query: 12 TFHQLTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPP--YYQ 69
+FH + Q + +P V P S PP Y
Sbjct: 41 SFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPS 100
Query: 70 HPS---VPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPS 126
V S P PT P YP PP + P S P
Sbjct: 101 QLPPQQVQSVPQQPTPQQEPYYP--PPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQ 158
Query: 127 NPYQAQNSYPSVPNQPQAPNTYPS-MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASN 185
P QN P + PQ YP P Q Q+ PL SS PY P +
Sbjct: 159 QPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQP---PYSGAPPSQQ 215
Query: 186 KPSAPQPPAYS 196
PQP Y
Sbjct: 216 FYGPPQPSPYM 226
Score = 34.5 bits (79), Expect = 0.12
Identities = 34/151 (22%), Positives = 44/151 (29%), Gaps = 17/151 (11%)
Query: 27 QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
Q PP Q + P A + P PY P+ P + P
Sbjct: 146 QVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPNEPLPSSMAMQP 205
Query: 87 IYPSAPPQE------SHSPYYYSDI----RANVPTGGSPVPSYPQGSYPSNPYQAQN--- 133
Y APP + SPY Y + P+G P PS Q Y +
Sbjct: 206 PYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGYSGPPPSKGNH 265
Query: 134 ----SYPSVPNQPQAPNTYPSMPNQLQAPNG 160
SY + YPS+P P
Sbjct: 266 GSVASYAPQGSSQSYSTAYPSLPAATVLPQA 296
Score = 29.9 bits (67), Expect = 3.8
Identities = 35/170 (20%), Positives = 48/170 (28%), Gaps = 5/170 (2%)
Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQ 169
P P + PS+ Y +Q V + PQ P P P+ P + Q
Sbjct: 79 PQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPT--PQQEPYYPPPSQPQPPPAQQP 136
Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPS--HNTAYPTAPLVPSTIATAPSAPPANIE 227
+Q P P P S PQ P Y P +A + L P P + P N E
Sbjct: 137 QAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYPPN-E 195
Query: 228 DATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVP 277
S + + P + N P
Sbjct: 196 PLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPP 245
>gnl|CDD|238490 cd00990, PDZ_glycyl_aminopeptidase, PDZ domain associated with
archaeal and bacterial M61 glycyl-aminopeptidases. May
be responsible for substrate recognition and/or binding,
as most PDZ domains bind C-terminal polypeptides, and
binding to internal (non-C-terminal) polypeptides and
even to lipids has been demonstrated. In this subfamily
of protease-associated PDZ domains a C-terminal
beta-strand is presumed to form the peptide-binding
groove base, a circular permutation with respect to PDZ
domains found in Eumetazoan signaling proteins.
Length = 80
Score = 35.5 bits (82), Expect = 0.006
Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 6/78 (7%)
Query: 361 GLRVCAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNN 420
GL + V V SPA GL GD+++++NG V ++ + +
Sbjct: 4 GLTLDKEEGLGKVTFVRDDSPADKAGLVAGDELVAVNGWRVDALQD----RLKEYQAGDP 59
Query: 421 IRVVI--RDRPLERTVTL 436
+ + + DR +E +TL
Sbjct: 60 VELTVFRDDRLIEVPLTL 77
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 39.0 bits (91), Expect = 0.007
Identities = 39/237 (16%), Positives = 64/237 (27%), Gaps = 24/237 (10%)
Query: 34 SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
++ + PP P A P+ P P+ P+ P
Sbjct: 51 AAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPG 110
Query: 94 QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPN 153
S P P SP PS P + P +P + P
Sbjct: 111 PSSPDP------PPPTPPPASPPPSPA----PDLSEMLRPVGSPGPPPAASPPAAGASPA 160
Query: 154 QLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPS 213
+ + + + QA+LP + PS+P P +
Sbjct: 161 AVASD--------AASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSP 212
Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHD------ATRSTPSPIIAPSVP 264
A+A S PA A +++ + S + P+PI P+
Sbjct: 213 ISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRI 269
Score = 36.3 bits (84), Expect = 0.053
Identities = 39/244 (15%), Positives = 62/244 (25%), Gaps = 14/244 (5%)
Query: 28 TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
S PPPP P + P+P S P P+ S
Sbjct: 108 PPGPSSPDPPPPT------PPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAA 161
Query: 88 YPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
S + S SP P S P + S P A +
Sbjct: 162 VASDAASSRQAALPLSSPEETARAPSSPPAEPP-PSTPPAAASPRPPRRSSPISASASSP 220
Query: 148 YPSMPNQLQAPNGF--YPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAY 205
P+ G +SS+ + P PL P + + +
Sbjct: 221 APAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPS- 279
Query: 206 PTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAP----SVPPAIHDATRSTPSPIIAP 261
S+ + +P + P++ A S + +T S+
Sbjct: 280 SRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGA 339
Query: 262 SVPP 265
+V P
Sbjct: 340 AVSP 343
Score = 35.9 bits (83), Expect = 0.056
Identities = 37/203 (18%), Positives = 51/203 (25%), Gaps = 10/203 (4%)
Query: 81 TFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPN 140
P A +A S S A V + P P
Sbjct: 22 PRPPATPGDAADDLLSGSQGQLVSDSA--ELAAVTVVAGAAACDRFEPPTGPPPGPGTEA 79
Query: 141 QPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPS 200
+ P+ AP S +S P P S P PA +
Sbjct: 80 PANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPP--PPTPPPASPPPSPAPDLSEM 137
Query: 201 HNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHP------TTATAPSVPPAIHDATRST 254
P ++ A ++P A DA S TA + + ST
Sbjct: 138 LRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPST 197
Query: 255 PSPIIAPSVPPTNRDTIRSALVP 277
P +P P + SA P
Sbjct: 198 PPAAASPRPPRRSSPISASASSP 220
Score = 35.1 bits (81), Expect = 0.098
Identities = 44/250 (17%), Positives = 66/250 (26%), Gaps = 18/250 (7%)
Query: 34 SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
S PP S ++ + P +A+ S S S + P P
Sbjct: 203 SPRPPRRSSPISASASSP--APAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRP 260
Query: 94 QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPN 153
P + I G P S S ++ + PS P AP++ + +
Sbjct: 261 APITLP---TRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSS 317
Query: 154 QLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPS 213
+ TSS S P S PS +PP + S + S
Sbjct: 318 SSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSS 377
Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRS 273
A+A +V + P P P +
Sbjct: 378 PAASAGRPTRR-------------RARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASG 424
Query: 274 ALVPLYPSLD 283
A YP L
Sbjct: 425 AFYARYPLLT 434
Score = 29.8 bits (67), Expect = 5.4
Identities = 29/166 (17%), Positives = 48/166 (28%), Gaps = 8/166 (4%)
Query: 29 TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTF-PSAPI 87
+S ++ P SS +SP + PSP + P + S S S+
Sbjct: 271 EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTS 330
Query: 88 YPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNT 147
S + + S R+ P+ P PS+P + + + +
Sbjct: 331 SSSESSRGAAVSPGPSPSRSPSPSRPPPPAD------PSSPRKRPRPSRAPSSPAASAGR 384
Query: 148 YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
P+ A G + A P P A A
Sbjct: 385 -PTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYAR 429
>gnl|CDD|182262 PRK10139, PRK10139, serine endoprotease; Provisional.
Length = 455
Score = 37.6 bits (87), Expect = 0.014
Identities = 39/160 (24%), Positives = 72/160 (45%), Gaps = 24/160 (15%)
Query: 367 INEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRV-VI 425
+ G FV V S ++ G++ GD I S+NG+ + + + +I P +++ ++
Sbjct: 288 VQRGAFVSEVLPNSGSAKAGVKAGDIITSLNGKPLNSF-AELRSRIATTEPGTKVKLGLL 346
Query: 426 RD-RPLERTVTLHRDSSGHF----------GFHFNKGQ---------IVSLVKDSSAARN 465
R+ +PLE VTL +S G + GQ I +VK S AA+
Sbjct: 347 RNGKPLEVEVTLDTSTSSSASAEMITPALQGATLSDGQLKDGTKGIKIDEVVKGSPAAQA 406
Query: 466 GLLVNHNILEVNGANVVGLKDKELLEAMEKGGSVLTLTLI 505
GL + I+ VN V + E+ + + +++ L ++
Sbjct: 407 GLQKDDVIIGVNRDRVNSI--AEMRKVLAAKPAIIALQIV 444
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 37.9 bits (88), Expect = 0.014
Identities = 36/156 (23%), Positives = 48/156 (30%), Gaps = 26/156 (16%)
Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTT 237
P+ KP+A P V+P+ AP A A + PP T
Sbjct: 362 PVPAPQPAKPTAAAPSP--VRPT------PAPSTRPKAAAAANIPPK-------EPVRET 406
Query: 238 ATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINA 297
AT P VPP + P P S P R I P Y +
Sbjct: 407 ATPPPVPPR----PVAPPVPHTPESAPKLTRAAIPVDEKPKYTP-------PAPPKEEEK 455
Query: 298 YARENAANERESDAAGNTVARQAPRDSCTMIAPLSS 333
+ + +A + R P S + A LSS
Sbjct: 456 ALIADGDVLEQLEAIWKQILRDVPPRSPAVQALLSS 491
Score = 37.5 bits (87), Expect = 0.017
Identities = 20/96 (20%), Positives = 22/96 (22%), Gaps = 6/96 (6%)
Query: 136 PSVPNQPQAPNTYPSMPN--QLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
P +P A P P P + P P AP P
Sbjct: 365 APQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR--PVAPPVP 422
Query: 194 AYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDA 229
S A V P APP E A
Sbjct: 423 H--TPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKA 456
Score = 34.0 bits (78), Expect = 0.19
Identities = 18/90 (20%), Positives = 25/90 (27%)
Query: 31 SSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPS 90
+ P P + +P+ P+P P T P P P
Sbjct: 358 ALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPV 417
Query: 91 APPQESHSPYYYSDIRANVPTGGSPVPSYP 120
APP RA +P P + P
Sbjct: 418 APPVPHTPESAPKLTRAAIPVDEKPKYTPP 447
Score = 30.2 bits (68), Expect = 3.5
Identities = 20/84 (23%), Positives = 26/84 (30%), Gaps = 9/84 (10%)
Query: 163 PLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP 222
S + + A P A+N P P + P P AP VP P
Sbjct: 374 AAPSPVRPTPAPSTRPKAAAAANIPPKE-PVRETATPPPVPPRPVAPPVP-------HTP 425
Query: 223 PANIEDATRSTHPTTATAPSVPPA 246
+ TR+ P PPA
Sbjct: 426 ES-APKLTRAAIPVDEKPKYTPPA 448
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and
is typically between 384 and 430 amino acids in length.
Length = 414
Score = 37.5 bits (87), Expect = 0.015
Identities = 39/193 (20%), Positives = 57/193 (29%), Gaps = 22/193 (11%)
Query: 84 SAPIYPSAPPQESHSPYYYSDI------RANVPTG--GSPVPSYPQGSYPSNPYQAQNSY 135
+ P P PQ P + D+ RA+V G S P G + P + +
Sbjct: 159 NKPESPKHKPQPPQQPEWKKDLSRLRQSRASVDLGRTNSFKEVTPVGLMRTPPPGSHSKS 218
Query: 136 PSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAY 195
PS P P++ + + S A P S + + P
Sbjct: 219 PSKSGIPDLPSS-RDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAPPIDTTE 277
Query: 196 SVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVP-PAIHDATRST 254
S P A S A+A P+ P+ P + + R
Sbjct: 278 EELKS-----PEASPKESEEASARKRSPSL-------LSPSPKAESPKPLASPGKSPRDP 325
Query: 255 PSPIIAPSVPPTN 267
SP P PP N
Sbjct: 326 LSPRPKPQSPPVN 338
>gnl|CDD|226483 COG3975, COG3975, Predicted protease with the C-terminal PDZ domain
[General function prediction only].
Length = 558
Score = 37.8 bits (88), Expect = 0.015
Identities = 24/90 (26%), Positives = 34/90 (37%), Gaps = 17/90 (18%)
Query: 361 GLRVCAINEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMK-----QVH----- 410
GL+V + + V G PA GL GD+I++ING + K QVH
Sbjct: 454 GLKVKSEGGHEKITFVFPGGPAYKAGLSPGDKIVAINGISDQLDRYKVNDKIQVHVFREG 513
Query: 411 -------KILKAAPVNNIRVVIRDRPLERT 433
K+ I + I DR +
Sbjct: 514 RLREFLVKLGGDPTAQYIILPIGDRNPAQL 543
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 37.3 bits (87), Expect = 0.015
Identities = 23/115 (20%), Positives = 32/115 (27%), Gaps = 4/115 (3%)
Query: 110 PTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQ 169
PQ Y PY + V P+A P Q + Q
Sbjct: 78 AQEHEAARPSPQHQYQP-PYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQP 136
Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
+ L P++P APQP + QP+ P P+ P
Sbjct: 137 QPEQPLQQPVSPQV---APAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPV 188
Score = 35.8 bits (83), Expect = 0.053
Identities = 27/136 (19%), Positives = 39/136 (28%), Gaps = 11/136 (8%)
Query: 60 HSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSY 119
H N P + PS P P Y SA P++ P P
Sbjct: 69 HRVNHAPANAQEHEAARPS-PQHQYQPPYASAQPRQPVQQ---------PPEAQVPPQHA 118
Query: 120 PQGSYP-SNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYP 178
P+ + P P Q P Q P + P + P + Q ++
Sbjct: 119 PRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQ 178
Query: 179 INPLASNKPSAPQPPA 194
P+A P +P
Sbjct: 179 PEPVAEPAPVMDKPKR 194
Score = 33.5 bits (77), Expect = 0.28
Identities = 22/112 (19%), Positives = 33/112 (29%)
Query: 34 SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
+ P Q P+ + PY SA P Q P P A P
Sbjct: 71 VNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQ 130
Query: 94 QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAP 145
Q ++ P ++ V +P P + Q + P QP+
Sbjct: 131 QPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPV 182
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 37.7 bits (87), Expect = 0.019
Identities = 40/178 (22%), Positives = 62/178 (34%), Gaps = 49/178 (27%)
Query: 23 YFSKQTTSSSFSSPPPPVYQTSSHTSP----NAYNNPP---SPYHSANPPPYYQHPSVPS 75
+ +++ +S+P + + YNNPP +PY +NPP +P +
Sbjct: 404 QSNAGFSNAGYSNPG---NSNPGYNNAPNSNTPYNNPPNSNTPY--SNPP--NSNPPYSN 456
Query: 76 HPSVPTFPSAPIYPSAPP---QESHSPYY--YSDIRANVPTGGSPVPSYPQGSYPSNPYQ 130
P T S +APP ++ HS Y+ Y AN P P
Sbjct: 457 LPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLP--------------- 501
Query: 131 AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPS 188
NQP A N + + N + P P S+ PY N + P+
Sbjct: 502 -------TANQPAANNFHGAAGNSVGNPFASRPFGSA--------PYGGNAATTADPN 544
Score = 28.9 bits (64), Expect = 8.4
Identities = 36/148 (24%), Positives = 53/148 (35%), Gaps = 23/148 (15%)
Query: 112 GGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQAS 171
G P + SY Y + + N S P N ++ ++
Sbjct: 380 AGGP---HSNASYNCAAYSNAAQSNAAQSNAGFSNAGYSNP-----GNSNPGYNNAPNSN 431
Query: 172 QASLPYPINPLASNKPSAPQP---PAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIED 228
PY NP SN P + P P YS P NT Y APL +APP++ +D
Sbjct: 432 T---PYN-NPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPL--------SNAPPSSAKD 479
Query: 229 ATRSTHPTTATAPSVPPAIHDATRSTPS 256
+ H + PA + T + P+
Sbjct: 480 HHSAYHAAYQHRAANQPAANLPTANQPA 507
>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically
periplasmic, contain C-terminal PDZ domain
[Posttranslational modification, protein turnover,
chaperones].
Length = 347
Score = 36.8 bits (85), Expect = 0.026
Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 1/69 (1%)
Query: 368 NEGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRD 427
G V V GSPA+ G++ GD I ++NG+ VA V + P + + + +
Sbjct: 269 AAGAVVLGVLPGSPAAKAGIKAGDIITAVNGKPVA-SLSDLVAAVASNRPGDEVALKLLR 327
Query: 428 RPLERTVTL 436
ER + +
Sbjct: 328 GGKERELAV 336
Score = 30.6 bits (69), Expect = 2.2
Identities = 26/102 (25%), Positives = 38/102 (37%), Gaps = 13/102 (12%)
Query: 415 AAPVNNIRVVIRDRPLERTVTLHRDSSG----------HFGFHFNKGQIV-SLVKDSSAA 463
A PVN + V+ + L + R G G G +V ++ S AA
Sbjct: 227 AIPVNLVAPVLDE--LISKGKVVRGYLGVIGEPLTADIALGLPVAAGAVVLGVLPGSPAA 284
Query: 464 RNGLLVNHNILEVNGANVVGLKDKELLEAMEKGGSVLTLTLI 505
+ G+ I VNG V L D A + G + L L+
Sbjct: 285 KAGIKAGDIITAVNGKPVASLSDLVAAVASNRPGDEVALKLL 326
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 37.2 bits (87), Expect = 0.028
Identities = 16/83 (19%), Positives = 24/83 (28%)
Query: 182 LASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAP 241
A P + P + + A A + AP+ P P A A
Sbjct: 33 FADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA 92
Query: 242 SVPPAIHDATRSTPSPIIAPSVP 264
+ PA A + +P A
Sbjct: 93 AAAPAAPPAAAAAAAPAAAAVED 115
Score = 33.7 bits (78), Expect = 0.28
Identities = 20/74 (27%), Positives = 26/74 (35%), Gaps = 3/74 (4%)
Query: 196 SVQPS-HN--TAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATR 252
SV PS Y T A A +A A+ A + A AP+ P A A
Sbjct: 24 SVDPSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAP 83
Query: 253 STPSPIIAPSVPPT 266
P+ A + P
Sbjct: 84 PKPAAAAAAAAAPA 97
Score = 33.3 bits (77), Expect = 0.36
Identities = 20/87 (22%), Positives = 28/87 (32%), Gaps = 5/87 (5%)
Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTT 237
P + A +A A S + A A P+ A A A P +
Sbjct: 38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP-----KPAAAAAA 92
Query: 238 ATAPSVPPAIHDATRSTPSPIIAPSVP 264
A AP+ PPA A + + P
Sbjct: 93 AAAPAAPPAAAAAAAPAAAAVEDEVTP 119
Score = 32.9 bits (76), Expect = 0.47
Identities = 17/86 (19%), Positives = 23/86 (26%)
Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDA 229
A A+ +A + + A P + AP P A A
Sbjct: 34 ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA 93
Query: 230 TRSTHPTTATAPSVPPAIHDATRSTP 255
P A A + P A TP
Sbjct: 94 AAPAAPPAAAAAAAPAAAAVEDEVTP 119
Score = 32.2 bits (74), Expect = 0.97
Identities = 16/95 (16%), Positives = 27/95 (28%), Gaps = 8/95 (8%)
Query: 180 NPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTAT 239
P ++ P+A A + + A A +A PA P A
Sbjct: 37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPA--------APPKPAA 88
Query: 240 APSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSA 274
A + A + + A + +R A
Sbjct: 89 AAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPLRGA 123
Score = 29.5 bits (67), Expect = 6.6
Identities = 22/83 (26%), Positives = 29/83 (34%), Gaps = 3/83 (3%)
Query: 163 PLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP 222
+ A+ A+ P A+ AP PA A P AP P+ A A +AP
Sbjct: 40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPA---PAPPAAAAPAAPPKPAAAAAAAAAP 96
Query: 223 PANIEDATRSTHPTTATAPSVPP 245
A A + A V P
Sbjct: 97 AAPPAAAAAAAPAAAAVEDEVTP 119
>gnl|CDD|236802 PRK10942, PRK10942, serine endoprotease; Provisional.
Length = 473
Score = 36.3 bits (84), Expect = 0.041
Identities = 19/49 (38%), Positives = 31/49 (63%), Gaps = 2/49 (4%)
Query: 369 EGVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAP 417
+GV V V+ G+PA+ +GL+ GD I+ N + V N+ ++ KIL + P
Sbjct: 408 KGVVVDNVKPGTPAAQIGLKKGDVIIGANQQPVK--NIAELRKILDSKP 454
Score = 32.4 bits (74), Expect = 0.65
Identities = 42/159 (26%), Positives = 68/159 (42%), Gaps = 33/159 (20%)
Query: 370 GVFVCLVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRV------ 423
G FV V S A+ G++ GD I S+NG+ ++ + + A V + V
Sbjct: 312 GAFVSQVLPNSSAAKAGIKAGDVITSLNGKPISSFAALR-------AQVGTMPVGSKLTL 364
Query: 424 -VIRD-RPLERTVTLHRDS----------SGHFGFHFN-----KGQIVSLVK-DSSAARN 465
++RD +P+ V L + S +G G + KG +V VK + AA+
Sbjct: 365 GLLRDGKPVNVNVELQQSSQNQVDSSNIFNGIEGAELSNKGGDKGVVVDNVKPGTPAAQI 424
Query: 466 GLLVNHNILEVNGANVVGLKDKELLEAMEKGGSVLTLTL 504
GL I+ N V + EL + ++ SVL L +
Sbjct: 425 GLKKGDVIIGANQQPVKNI--AELRKILDSKPSVLALNI 461
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 35.9 bits (83), Expect = 0.047
Identities = 32/203 (15%), Positives = 47/203 (23%), Gaps = 25/203 (12%)
Query: 35 SPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQ 94
S P + S P+ S P S + + +P+ P
Sbjct: 260 SAEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQS---YVPQSSDSRPSCFPA--PS 314
Query: 95 ESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQ 154
+ + N PQ P A S +AP PS Q
Sbjct: 315 TTQPT----FLPPNTNKKAKRD-RRPQMVTPKQEGGAAVSQNHDGGTVRAPRGRPSGSGQ 369
Query: 155 LQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPST 214
N + + S A+ A P VQ A P P +
Sbjct: 370 SPPSNSPLLSSLADTPSGAAH-----------QPASLLPPAVVQQQLEDASDKQPPTPGS 418
Query: 215 ----IATAPSAPPANIEDATRST 233
P+ + R
Sbjct: 419 SLVPQPDEQELGPSVMALLDRDQ 441
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 36.2 bits (83), Expect = 0.047
Identities = 24/101 (23%), Positives = 37/101 (36%), Gaps = 8/101 (7%)
Query: 175 LPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTH 234
+PY A N S P S S+ T + PST+ ++PPA + +T
Sbjct: 555 VPYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDS---PSTVVAPSTSPPAGHLGSPPATP 611
Query: 235 PTTATAPSVPPAIH-----DATRSTPSPIIAPSVPPTNRDT 270
+ + PPA H S S I S + ++
Sbjct: 612 SKIVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPES 652
>gnl|CDD|238491 cd00991, PDZ_archaeal_metalloprotease, PDZ domain of archaeal zinc
metalloprotases, presumably membrane-associated or
integral membrane proteases, which may be involved in
signalling and regulatory mechanisms. May be responsible
for substrate recognition and/or binding, as most PDZ
domains bind C-terminal polypeptides, and binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 79
Score = 32.7 bits (75), Expect = 0.059
Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 4/57 (7%)
Query: 452 QIVSLVKDSSAARNGLLVNHNILEVNGANVVGLKDKELLEAMEKG--GSVLTLTLIP 506
IV ++ S A L I +NG + L+D +EA++ G V+T+T++P
Sbjct: 13 VIVGVIVGSPAENAVLHTGDVIYSINGTPITTLED--FMEALKPTKPGEVITVTVLP 67
Score = 30.8 bits (70), Expect = 0.28
Identities = 15/35 (42%), Positives = 17/35 (48%)
Query: 370 GVFVCLVERGSPASLVGLRFGDQILSINGETVAGY 404
GV + V GSPA L GD I SING +
Sbjct: 11 GVVIVGVIVGSPAENAVLHTGDVIYSINGTPITTL 45
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 35.1 bits (81), Expect = 0.083
Identities = 25/148 (16%), Positives = 40/148 (27%), Gaps = 21/148 (14%)
Query: 62 ANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQ 121
+ P P P + P+ Q+ ++P P +P+ P
Sbjct: 52 SPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPA---ANPPATFALPAGPA 108
Query: 122 GSY----PSNPYQAQNSYPSVPNQPQAPNTYPS-----MPNQLQAPNGFYPLTSSQQASQ 172
G P Y Q P + Q A + QLQ Y +S Q
Sbjct: 109 GPTIQTEPGQLYPVQ--VPVMVTQNPANSPLDQPAQQRALQQLQQR---YGAPASGQLPS 163
Query: 173 ASLPYPINPLASNKPSAPQPPAYSVQPS 200
+ +++ Q P P
Sbjct: 164 Q----QQSAQKNDESQLQQQPNGETPPQ 187
Score = 33.9 bits (78), Expect = 0.16
Identities = 27/134 (20%), Positives = 38/134 (28%), Gaps = 7/134 (5%)
Query: 140 NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQ- 198
PQAP +P L P L + Q P A+N P+ PA
Sbjct: 51 PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGP 110
Query: 199 -----PSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATA-PSVPPAIHDATR 252
P +V A +P PA + A A +P A +
Sbjct: 111 TIQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQK 170
Query: 253 STPSPIIAPSVPPT 266
+ S + T
Sbjct: 171 NDESQLQQQPNGET 184
Score = 33.6 bits (77), Expect = 0.23
Identities = 30/150 (20%), Positives = 41/150 (27%), Gaps = 12/150 (8%)
Query: 83 PSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNP--------YQAQNS 134
P AP P A + + +P G + P GS +NP A +
Sbjct: 53 PQAP-PPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPT 111
Query: 135 YPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPA 194
+ P Q P P M Q A + + Q Y AS + + Q A
Sbjct: 112 IQTEPGQL-YPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRY--GAPASGQLPSQQQSA 168
Query: 195 YSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
S P P A
Sbjct: 169 QKNDESQLQQQPNGETPPQQTDGAGDDESE 198
Score = 32.8 bits (75), Expect = 0.38
Identities = 26/144 (18%), Positives = 39/144 (27%), Gaps = 6/144 (4%)
Query: 34 SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
S PP P A + P+ + P + P P+ P
Sbjct: 52 SPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPT 111
Query: 94 QESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSY-----PSVPNQPQAPNTY 148
++ Y + T Q + Q Q Y +P+Q Q+
Sbjct: 112 IQTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKN 171
Query: 149 PSMPNQLQAPNGFYPLTSSQQASQ 172
Q Q PNG P + A
Sbjct: 172 DESQLQQQ-PNGETPPQQTDGAGD 194
Score = 30.9 bits (70), Expect = 1.8
Identities = 14/73 (19%), Positives = 18/73 (24%), Gaps = 2/73 (2%)
Query: 153 NQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVP 212
+Q + S QA P A Q Q HNT + P
Sbjct: 40 SQSGVAEFPW--DPSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPTGSPAANP 97
Query: 213 STIATAPSAPPAN 225
P+ P
Sbjct: 98 PATFALPAGPAGP 110
Score = 30.1 bits (68), Expect = 2.7
Identities = 20/148 (13%), Positives = 35/148 (23%), Gaps = 16/148 (10%)
Query: 27 QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
T + S ++ PP + + + P P + T A
Sbjct: 87 NTPTGSPAANPPATFALPAGPAGPTIQTEPG------------QLYPVQVPVMVTQNPAN 134
Query: 87 IYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPN 146
P Q+ ++ S Q S N P+ PQ +
Sbjct: 135 SPLDQPAQQRALQ----QLQQRYGAPASGQLPSQQQSAQKNDESQLQQQPNGETPPQQTD 190
Query: 147 TYPSMPNQLQAPNGFYPLTSSQQASQAS 174
++ T Q+ A
Sbjct: 191 GAGDDESEALVRLREADGTLEQRIKGAE 218
>gnl|CDD|234035 TIGR02860, spore_IV_B, stage IV sporulation protein B. SpoIVB, the
stage IV sporulation protein B of endospore-forming
bacteria such as Bacillus subtilis, is a serine
proteinase, expressed in the spore (rather than mother
cell) compartment, that participates in a proteolytic
activation cascade for Sigma-K. It appears to be
universal among endospore-forming bacteria and occurs
nowhere else [Cellular processes, Sporulation and
germination].
Length = 402
Score = 35.0 bits (81), Expect = 0.10
Identities = 17/64 (26%), Positives = 30/64 (46%), Gaps = 4/64 (6%)
Query: 380 SPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIR--DRPLERTVTLH 437
SP G++ GD+IL INGE + NM + ++ A + + I + +E +
Sbjct: 124 SPGEEAGIQIGDRILKINGEKIK--NMDDLANLINKAGGEKLTLTIERGGKIIETVIKPV 181
Query: 438 RDSS 441
+D
Sbjct: 182 KDKE 185
>gnl|CDD|163521 TIGR03809, TIGR03809, TIGR03809 family protein. This protein
family contains proteins with a median length of about
175, including a strongly conserved N-terminal region of
about 55 amino acids, a conserved extreme C-terminal
region of about 15 amino acids, and highly variable
sequence in between the two. Members are found
invariably with a member of family TIGR03808.
Length = 168
Score = 33.3 bits (76), Expect = 0.16
Identities = 21/90 (23%), Positives = 34/90 (37%), Gaps = 7/90 (7%)
Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHP 235
P ++ L NK P+ P Y+ P + A P++ PA +
Sbjct: 65 PVDLSWLGRNKKLPPRAPFYAADE------TVEPRRFAPPAALPASAPAIVA-LPEEAAA 117
Query: 236 TTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
AP VP +DA+ + + + P V P
Sbjct: 118 EPRAAPDVPAEPNDASDADVAEVAEPDVVP 147
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 33.7 bits (77), Expect = 0.20
Identities = 27/152 (17%), Positives = 34/152 (22%), Gaps = 10/152 (6%)
Query: 53 NNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTG 112
PY V P F PQ + P Y S
Sbjct: 46 KRDDDPYDEVADDEGVGEVRVVRKNEAPQFTQEHEAARQSPQHQYQPEYASA-------- 97
Query: 113 GSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQ 172
PQ S+P +Q Q P P Q P+ S+
Sbjct: 98 -QIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEE-ARQPVLPQEAPAPQPVHSAAPQPA 155
Query: 173 ASLPYPINPLASNKPSAPQPPAYSVQPSHNTA 204
P P +P PA V+
Sbjct: 156 VQTVQPAVPEQQVQPEEVVEPAPEVKRPPRKD 187
>gnl|CDD|232801 TIGR00054, TIGR00054, RIP metalloprotease RseP. Members of this
nearly universal bacterial protein family are regulated
intramembrane proteolysis (RIP) proteases. Older and
synonymous gene symbols include yaeL in E. coli, mmpA in
Caulobacter crescentus, etc. This family includes a
region that hits the PDZ domain, found in a number of
proteins targeted to the membrane by binding to a
peptide ligand. The N-terminal region of this family
contains a perfectly conserved motif HEXGH as found in a
number of metalloproteinases, where the Glu is the
active site and the His residues coordinate the metal
cation [Protein fate, Degradation of proteins, peptides,
and glycopeptides].
Length = 419
Score = 34.0 bits (78), Expect = 0.20
Identities = 32/147 (21%), Positives = 58/147 (39%), Gaps = 21/147 (14%)
Query: 375 LVERGSPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAA---PVNNIRVVIRDRPLE 431
L+++ S A G+ GD+ILS+NG + G+ K V + + P+ I + E
Sbjct: 133 LLDKNSIALEAGIEPGDEILSVNGNKIPGF--KDVRQQIADIAGEPMVEILAERENWTFE 190
Query: 432 RTVTLHRDSSGHFGFHFNKGQIVSLVKDSSAARNGLLVNHNILEVNGANVVGLKD----- 486
L + + + +S A + GL I +NG + D
Sbjct: 191 VMKELIP------RGPKIEPVLSDVTPNSPAEKAGLKEGDYIQSINGEKLRSWTDFVSAV 244
Query: 487 -----KELLEAMEKGGSVLTLTLIPTS 508
K + +E+ G L+++L P +
Sbjct: 245 KENPGKSMDIKVERNGETLSISLTPEA 271
Score = 29.8 bits (67), Expect = 3.7
Identities = 20/72 (27%), Positives = 27/72 (37%), Gaps = 10/72 (13%)
Query: 380 SPASLVGLRFGDQILSINGETVAGYNMKQVHKILKAAPVNN-----IRVVIRDRPLERTV 434
SPA GL+ GD I SINGE ++ + A N I+V L ++
Sbjct: 213 SPAEKAGLKEGDYIQSINGEK-----LRSWTDFVSAVKENPGKSMDIKVERNGETLSISL 267
Query: 435 TLHRDSSGHFGF 446
T G
Sbjct: 268 TPEAKGKIGIGI 279
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 33.5 bits (76), Expect = 0.21
Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 7/86 (8%)
Query: 164 LTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAP-SAP 222
+T++ QA+ A+ P +S K APQ S + + P A PST AP P
Sbjct: 21 ITTTTQAANATTP------SSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTP 74
Query: 223 PANIEDATRSTHPTTATAPSVPPAIH 248
A +T+ P + T VP I+
Sbjct: 75 NATTPSSTKVETPQSPTTKQVPTEIN 100
Score = 31.1 bits (70), Expect = 1.2
Identities = 20/67 (29%), Positives = 30/67 (44%), Gaps = 2/67 (2%)
Query: 156 QAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTI 215
QA N P ++ +A Q++ P +KP+A PP+ V+ T T P ST
Sbjct: 26 QAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTP--SSTK 83
Query: 216 ATAPSAP 222
P +P
Sbjct: 84 VETPQSP 90
Score = 28.8 bits (64), Expect = 7.4
Identities = 15/71 (21%), Positives = 29/71 (40%)
Query: 188 SAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAI 247
+A P + V+ +T T P + A + P +E ++ + TT ++ V
Sbjct: 29 NATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVETPQ 88
Query: 248 HDATRSTPSPI 258
T+ P+ I
Sbjct: 89 SPTTKQVPTEI 99
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 33.8 bits (77), Expect = 0.23
Identities = 25/132 (18%), Positives = 38/132 (28%), Gaps = 18/132 (13%)
Query: 43 TSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYY 102
T+S T+P+ + P Q + P+ P P P + P S
Sbjct: 350 TASLTAPSRVLAAAAKVAVIAAP---QTHTGPADRQRPQRP--DGIPYSVPARSP----- 399
Query: 103 SDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFY 162
T PVP + +PY Q+ S +P P P N P
Sbjct: 400 -------MTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGP-VPPQPTNPYVMPISMA 451
Query: 163 PLTSSQQASQAS 174
+ +
Sbjct: 452 NMVYPGHPQEHG 463
Score = 29.6 bits (66), Expect = 4.8
Identities = 19/62 (30%), Positives = 24/62 (38%)
Query: 35 SPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQ 94
+ PPV Q + NP SP S P P P P++P V A + PQ
Sbjct: 401 TAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHPQ 460
Query: 95 ES 96
E
Sbjct: 461 EH 462
Score = 29.2 bits (65), Expect = 6.4
Identities = 17/92 (18%), Positives = 25/92 (27%), Gaps = 2/92 (2%)
Query: 27 QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
QT + P ++ P P + Y P +P P P
Sbjct: 373 QTHTGPADRQRPQRPDGIPYSVPARS--PMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGP 430
Query: 87 IYPSAPPQESHSPYYYSDIRANVPTGGSPVPS 118
P + +PY AN+ G P
Sbjct: 431 EPVGPVPPQPTNPYVMPISMANMVYPGHPQEH 462
Score = 28.8 bits (64), Expect = 9.0
Identities = 17/102 (16%), Positives = 24/102 (23%), Gaps = 16/102 (15%)
Query: 115 PVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQAS 174
+ P P+ +PQ P+ P + P+
Sbjct: 362 AAAKVAVIAAPQTHTG-----PADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLV 416
Query: 175 LPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIA 216
PY PQ P S P P P P +
Sbjct: 417 SPYN-----------PQSPGTSYGPEPVGPVPPQPTNPYVMP 447
>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 944
Score = 33.9 bits (78), Expect = 0.24
Identities = 23/140 (16%), Positives = 37/140 (26%), Gaps = 1/140 (0%)
Query: 65 PPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSY 124
PP P+ S P+ S + + S A + +G +P P Y
Sbjct: 648 PPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAALASGSAPAPPPVPDPY 707
Query: 125 PSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLAS 184
P++ S + P + A N Q Q + AS
Sbjct: 708 DRPPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPAS 767
Query: 185 NKPSAPQPPAYSVQPSHNTA 204
+ Q + N
Sbjct: 768 -TTALTQTSSEVQDTELNLV 786
>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
factor N terminal domain. The N terminus of the PEA3
transcription factors is implicated in transactivation
and in inhibition of DNA binding. Transactivation is
potentiated by activation of the Ras/MAP kinase and
protein kinase A signalling cascades. The N terminal
region contains conserved MAP kinase phosphorylation
sites.
Length = 336
Score = 33.3 bits (76), Expect = 0.27
Identities = 29/122 (23%), Positives = 38/122 (31%), Gaps = 11/122 (9%)
Query: 23 YFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTF 82
Y K + +PP + P P +++PP P P P
Sbjct: 122 YDRKPASGFKPPTPPSTPCSPVNPQETVRQLQPSGPLSNSSPP----SPHTPLPNQSPLP 177
Query: 83 PSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNP-YQAQNSYPSVPNQ 141
P S+ P E S+ P P +GS P Y Q S P VP
Sbjct: 178 PPMSSPDSSYPSEHRFQRQLSEPCLPFP------PPPGRGSRDGRPPYHRQMSEPLVPYP 231
Query: 142 PQ 143
PQ
Sbjct: 232 PQ 233
>gnl|CDD|238486 cd00986, PDZ_LON_protease, PDZ domain of ATP-dependent LON serine
proteases. Most PDZ domains bind C-terminal
polypeptides, though binding to internal
(non-C-terminal) polypeptides and even to lipids has
been demonstrated. In this bacterial subfamily of
protease-associated PDZ domains a C-terminal beta-strand
is thought to form the peptide-binding groove base, a
circular permutation with respect to PDZ domains found
in Eumetazoan signaling proteins.
Length = 79
Score = 30.9 bits (70), Expect = 0.27
Identities = 12/34 (35%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 368 NEGVFVCLVERGSPASLVGLRFGDQILSINGETV 401
GV+V V G PA+ L+ GD I++++G+
Sbjct: 7 YHGVYVTSVVEGMPAAGK-LKAGDHIIAVDGKPF 39
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 33.4 bits (77), Expect = 0.31
Identities = 19/92 (20%), Positives = 29/92 (31%), Gaps = 8/92 (8%)
Query: 180 NPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTAT 239
+ ASN P PP S P T P + T P PP + ++ + A
Sbjct: 514 SGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPAD 573
Query: 240 APSVPPAIHDATRSTPSPIIAPSVPPTNRDTI 271
+ PP P ++ + I
Sbjct: 574 SSPPPP--------IPEEPTPSPTKDSSPEEI 597
Score = 32.2 bits (74), Expect = 0.84
Identities = 19/69 (27%), Positives = 26/69 (37%), Gaps = 1/69 (1%)
Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTT 237
I+ +A+ A PA + P P+AP AT PS PPA T
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAP-KTKQAATTPSPPPAKASPPIPVPAEPT 422
Query: 238 ATAPSVPPA 246
+P+ P
Sbjct: 423 EPSPTPPAN 431
Score = 31.9 bits (73), Expect = 1.1
Identities = 13/64 (20%), Positives = 16/64 (25%)
Query: 165 TSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
+ A LP P P P P A + P P I P+ P
Sbjct: 530 SPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPT 589
Query: 225 NIED 228
Sbjct: 590 KDSS 593
Score = 30.3 bits (69), Expect = 3.1
Identities = 14/78 (17%), Positives = 23/78 (29%), Gaps = 3/78 (3%)
Query: 206 PTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
+ P+ P+ + + P T A + P PI P+ P
Sbjct: 366 SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPP---PAKASPPIPVPAEPT 422
Query: 266 TNRDTIRSALVPLYPSLD 283
T + PSL+
Sbjct: 423 EPSPTPPANAANAPPSLN 440
Score = 29.5 bits (67), Expect = 5.8
Identities = 22/81 (27%), Positives = 25/81 (30%), Gaps = 1/81 (1%)
Query: 187 PSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPA 246
PSA + N P P PSAP T S P A+ P PA
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQ-PSAPKTKQAATTPSPPPAKASPPIPVPA 419
Query: 247 IHDATRSTPSPIIAPSVPPTN 267
TP A + P N
Sbjct: 420 EPTEPSPTPPANAANAPPSLN 440
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 33.2 bits (76), Expect = 0.33
Identities = 49/273 (17%), Positives = 77/273 (28%), Gaps = 41/273 (15%)
Query: 15 QLTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVP 74
QL + + Q T S + PP +TS+ N P VP
Sbjct: 97 QLGLLGVKAKDSQFTVVSQAKKSPPASKTSTP---------------MNTSE----PLVP 137
Query: 75 SHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNS 134
H S PS PS S + + + SP SY SY S+ +S
Sbjct: 138 GHSSFSDSPSRSASPSRKFSPSSTIQQSPQLTPSNK-PASPSSSYQSPSYSSSLGPVNSS 196
Query: 135 YPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPA 194
N+ ++ P + A + + + S+ PP
Sbjct: 197 ----GNRSNLRSS-PWALRSSGDKKDITTDEKYLETFLAEVDEEQHMITSSAGKNATPPE 251
Query: 195 YSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRST 254
T PS + +A DA RS + S P ST
Sbjct: 252 --------TINSFGSSSPSFWNYSRNAS-----DAARSL-KKRSYQLSPSPVPSKQKAST 297
Query: 255 PSPIIAPSVPPTNRDTIRSAL--VPLYPSLDNY 285
P + ++ V + P ++ +
Sbjct: 298 SPKKGEGEPPNMSLESASEVFKRVGVLPQIEQW 330
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 33.4 bits (76), Expect = 0.34
Identities = 24/122 (19%), Positives = 38/122 (31%), Gaps = 9/122 (7%)
Query: 206 PTAPLVPSTIATAPS---APPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAP- 261
+ PS + AP ++ R P AP PA + S P+P P
Sbjct: 376 NDGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPP 435
Query: 262 -----SVPPTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAAGNTV 316
+PP+ + + + P +D +R+ SDAA V
Sbjct: 436 VARSAPLPPSPQASAPRNVASGKPGVDLGSWQGKFMNFTRNGSRKQPVQASSSDAAQTGV 495
Query: 317 AR 318
Sbjct: 496 FE 497
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 33.4 bits (76), Expect = 0.35
Identities = 25/92 (27%), Positives = 35/92 (38%), Gaps = 5/92 (5%)
Query: 35 SPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQ 94
S P PV Q + + PP P + P Q P S + PS PQ
Sbjct: 412 SSPSPVPQV---QTNQSMPQPPQPSVPSPGGPGSQPPQ--SVSGGMIPSPPALMPSPSPQ 466
Query: 95 ESHSPYYYSDIRANVPTGGSPVPSYPQGSYPS 126
S SP I+ ++ + G P+ + Q S S
Sbjct: 467 MSQSPASQRTIQQDMVSPGGPLNTPGQSSVNS 498
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 33.1 bits (76), Expect = 0.37
Identities = 29/111 (26%), Positives = 49/111 (44%), Gaps = 2/111 (1%)
Query: 134 SYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
S P+ +Q A ++ S +Q P+ P ++ S +SL + + L K S+P PP
Sbjct: 4 SNPTSRSQSHASSS-SSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPP 62
Query: 194 AYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATA-PSV 243
+ ++ P+ T P + A A A + + HP TA P+V
Sbjct: 63 SVTLPPAATTQTPQLNPLQRAAAAALDAVESALVSHLERQHPLPKTADPAV 113
Score = 29.3 bits (66), Expect = 5.7
Identities = 21/70 (30%), Positives = 26/70 (37%), Gaps = 3/70 (4%)
Query: 25 SKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPS 84
S+ SSS SS S TSP PS + P P + S PS P+
Sbjct: 10 SQSHASSSSSSSSQS--SPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSS-PSPPSVTL 66
Query: 85 APIYPSAPPQ 94
P + PQ
Sbjct: 67 PPAATTQTPQ 76
>gnl|CDD|215602 PLN03143, PLN03143, nudix hydrolase; Provisional.
Length = 291
Score = 32.9 bits (75), Expect = 0.39
Identities = 16/74 (21%), Positives = 26/74 (35%), Gaps = 10/74 (13%)
Query: 160 GFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAP 219
F L S A LP + ++ S+ P +S+ P P P + AP
Sbjct: 1 RFTLLPSRPLA----LPSRMAHKEASSSSSSSPLTHSI------TLPGQPGQPVLVVAAP 50
Query: 220 SAPPANIEDATRST 233
++ A S+
Sbjct: 51 GISSSDFRKAIDSS 64
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 33.1 bits (75), Expect = 0.39
Identities = 56/275 (20%), Positives = 68/275 (24%), Gaps = 42/275 (15%)
Query: 32 SFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANP---PPYYQHPSVPSHPSVPTFPSAPIY 88
S S P T P P P P + P P P P P P
Sbjct: 537 SKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKK 596
Query: 89 PSAP-----PQESHSPYY--YSDIRANVPTGGSPVP--SYPQGSYPSNPYQAQNSYPSVP 139
P P P SP DI + SP P PS+P + + P +
Sbjct: 597 PKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEG--PKII 654
Query: 140 NQPQAPNTYPSMPNQLQAPNGFYPLTSSQQA----------------SQASLPYPINPLA 183
P+ P + P P + FY A S P P
Sbjct: 655 KSPKPPKS-PKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGT 713
Query: 184 SNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSV 243
P PP P PP E+ T H T A P
Sbjct: 714 PFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPP--EEERTF-FHETPADTPLP 770
Query: 244 --------PPAIHDATRSTPSPIIAPSVPPTNRDT 270
IH T + P P + D
Sbjct: 771 DILAEEFKEEDIHAETGEPDEAMKRPDSPSEHEDK 805
Score = 30.4 bits (68), Expect = 2.8
Identities = 36/218 (16%), Positives = 57/218 (26%), Gaps = 20/218 (9%)
Query: 25 SKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQ--HPSVPSHPSVPTF 82
K+ S+ + P P + P SP PPP + P P P +
Sbjct: 597 PKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKS 656
Query: 83 PSAPIYPSAPPQESHSPYYYSD------------IRANVPTGGSPVPSYPQGSYPSNPYQ 130
P P P P +Y D + + P P+
Sbjct: 657 PKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFT 716
Query: 131 AQNSYPSV-PNQPQAPNTYPSMPNQLQ-APNGFY--PLTSSQQASQASLPYPINPLASNK 186
P P + P P+ Q F+ P + P+ + + +
Sbjct: 717 TPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEE 776
Query: 187 PSAPQPPAYSVQPSHNTAYPTAPL--VPSTIATAPSAP 222
A + +P P +P PS P
Sbjct: 777 FKEEDIHAETGEPDEAMKRPDSPSEHEDKPPGDHPSLP 814
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 31.2 bits (71), Expect = 0.42
Identities = 16/79 (20%), Positives = 23/79 (29%), Gaps = 3/79 (3%)
Query: 84 SAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQ 143
P + PY +P P S PQ YP Q Q
Sbjct: 4 QQNTNQYPPQNQQQQPYQQQPYHQQMP---PPPYSPPQQQQGHFMPPQPQPYPKQSPQQQ 60
Query: 144 APNTYPSMPNQLQAPNGFY 162
P + S +Q + +G +
Sbjct: 61 QPPQFSSFLSQFKNSDGTF 79
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 32.1 bits (73), Expect = 0.43
Identities = 21/75 (28%), Positives = 28/75 (37%), Gaps = 6/75 (8%)
Query: 74 PSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQN 133
P P + S + + PQ + Y G P+P PQ P+ PY Q
Sbjct: 109 PQRPVMTRATSTTVQTTPLPQPPSTAPSYP---GPQYQGYHPMP--PQPGMPAPPYSLQY 163
Query: 134 SYPSVPNQPQAPNTY 148
P + QPQ P Y
Sbjct: 164 PPPGLL-QPQGPPPY 177
Score = 29.8 bits (67), Expect = 2.5
Identities = 15/46 (32%), Positives = 17/46 (36%)
Query: 115 PVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNG 160
P P SYP YQ + P P P P + P L P G
Sbjct: 128 PQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGLLQPQG 173
Score = 28.6 bits (64), Expect = 5.2
Identities = 18/69 (26%), Positives = 24/69 (34%), Gaps = 2/69 (2%)
Query: 35 SPPPPVYQTSSHTSPNA-YNNPPSPYHSANPPPYYQHPSVPSHPSVPTFP-SAPIYPSAP 92
P + + +S T PPS S P Y + +P P +P P S P
Sbjct: 109 PQRPVMTRATSTTVQTTPLPQPPSTAPSYPGPQYQGYHPMPPQPGMPAPPYSLQYPPPGL 168
Query: 93 PQESHSPYY 101
Q P Y
Sbjct: 169 LQPQGPPPY 177
Score = 27.9 bits (62), Expect = 8.9
Identities = 16/46 (34%), Positives = 17/46 (36%), Gaps = 1/46 (2%)
Query: 29 TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVP 74
S+ S P P YQ H P P PY PPP P P
Sbjct: 130 PPSTAPSYPGPQYQGY-HPMPPQPGMPAPPYSLQYPPPGLLQPQGP 174
>gnl|CDD|182723 PRK10779, PRK10779, zinc metallopeptidase RseP; Provisional.
Length = 449
Score = 32.7 bits (75), Expect = 0.51
Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 10/67 (14%)
Query: 452 QIVSLVKDSSAARNGLLVNHNILEVNGANVVGLKD----------KELLEAMEKGGSVLT 501
+ + +S+A++ GL I++V+G + + K L +E+ GS L+
Sbjct: 224 VLAEVQPNSAASKAGLQAGDRIVKVDGQPLTQWQTFVTLVRDNPGKPLALEIERQGSPLS 283
Query: 502 LTLIPTS 508
LTL P S
Sbjct: 284 LTLTPDS 290
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 32.9 bits (75), Expect = 0.54
Identities = 17/79 (21%), Positives = 23/79 (29%), Gaps = 10/79 (12%)
Query: 197 VQPSHNTAYPTAPLVP-STIATAP----SAPPANIEDATRSTHPTTATAPSVPPAIHDAT 251
V + T+ P + AT P A A + TA D T
Sbjct: 32 VTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTT 91
Query: 252 RSTPSPIIAPSVPPTNRDT 270
S A +V P +T
Sbjct: 92 TSE-----ARTVTPAATET 105
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 32.6 bits (74), Expect = 0.54
Identities = 13/72 (18%), Positives = 22/72 (30%), Gaps = 1/72 (1%)
Query: 187 PSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPA 246
P+ P+P + T P+ P + + A + AP P
Sbjct: 205 PATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGG 264
Query: 247 IHDA-TRSTPSP 257
+TP+P
Sbjct: 265 GEAPPANATPAP 276
>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 263 and 318 amino acids in length.
There is a conserved SFT sequence motif. MSA-2 is a
plasma membrane glycoprotein which can be found in
Babesia bovis species.
Length = 201
Score = 31.7 bits (72), Expect = 0.64
Identities = 16/50 (32%), Positives = 20/50 (40%), Gaps = 5/50 (10%)
Query: 114 SPVPSYPQG---SYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNG 160
SP S QG + PS+ N+ S P P P P P + NG
Sbjct: 143 SPKHSSSQGTSTTRPSDGSATPNT--SAPPTPGNPAAQPEKPAETPKGNG 190
>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
Length = 292
Score = 32.0 bits (72), Expect = 0.77
Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 4/85 (4%)
Query: 181 PLASNKPSAPQPPAYSVQPSHNTAYPTAPLVP--STIATAPSAPPANIEDATRSTHPTTA 238
P+ + P A P P ++ A P P VP + A A + P + D ++
Sbjct: 97 PVELSTPVARAPQVIDTAPPNDVATPV-PSVPEATAEAAAKAGPQDDEADGEQADEAPAH 155
Query: 239 TAPSVP-PAIHDATRSTPSPIIAPS 262
SVP A + T + P
Sbjct: 156 NPESVPTRAARETTEANRPTATPPQ 180
>gnl|CDD|221404 pfam12067, Sox_C_TAD, Sox C-terminal transactivation domain. This
domain is found at the C-terminus of the Sox family of
transcription factors. It is found associated with
pfam00505. It binds to the Armadillo repeats (pfam00514)
in Catenin beta-1 (CTNNB1), which is involved in
transcriptional regulation. It functions as a
transactivating domain (TAD).
Length = 197
Score = 31.2 bits (71), Expect = 0.79
Identities = 23/116 (19%), Positives = 35/116 (30%), Gaps = 14/116 (12%)
Query: 29 TSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIY 88
+ +F SPP Q P YN+ +P ++ P Q + P PS
Sbjct: 35 SEPAFFSPPC---QEDCQMMPYGYNSSYAPQNAHAPALLRQMAVTENIPQGSPAPSIMGC 91
Query: 89 PSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQA 144
P+ P Y Q S P ++Q ++ QA
Sbjct: 92 PTPPQMYYGQMY----------VPECAKHHPVQLGQLSPPPESQ-HLDTLDQLQQA 136
>gnl|CDD|235904 PRK06995, flhF, flagellar biosynthesis regulator FlhF; Validated.
Length = 484
Score = 31.9 bits (73), Expect = 0.82
Identities = 23/100 (23%), Positives = 32/100 (32%), Gaps = 4/100 (4%)
Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLV-PSTIATAPSAPPANIEDATRSTH 234
P P A+ P A PA +P+ A P LV + TA A +
Sbjct: 54 PAAAAPAAAQPPPAA-APAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPE 112
Query: 235 PTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSA 274
AP+ A +A R A + P +A
Sbjct: 113 AQAPAAPAERAAAENAARRLARA--AAAAPRPRVPADAAA 150
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 32.1 bits (73), Expect = 0.86
Identities = 26/126 (20%), Positives = 34/126 (26%), Gaps = 8/126 (6%)
Query: 107 ANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVP--NQPQAP--NTYPSMPNQLQAPNGFY 162
+ G P P + P VP P PS + P
Sbjct: 628 GEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGD-PVRPT 686
Query: 163 PLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQ---PSHNTAYPTAPLVPSTIATAP 219
++ +A QA P P P PA + S A PTAP P
Sbjct: 687 AHHAALRAPQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQ 746
Query: 220 SAPPAN 225
+ N
Sbjct: 747 ADGAEN 752
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 30.6 bits (69), Expect = 0.87
Identities = 15/47 (31%), Positives = 19/47 (40%)
Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
P P P Q PA +V T+ P P P+ AP+A P
Sbjct: 84 PPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPP 130
>gnl|CDD|238322 cd00577, PCNA, Proliferating Cell Nuclear Antigen (PCNA) domain
found in eukaryotes and archaea. These polymerase
processivity factors play a role in DNA replication and
repair. PCNA encircles duplex DNA in its central
cavity, providing a DNA-bound platform for the
attachment of the polymerase. The trimeric PCNA ring is
structurally similar to the dimeric ring formed by the
DNA polymerase processivity factors in bacteria (beta
subunit DNA polymerase III holoenzyme) and in
bacteriophages (catalytic subunits in T4 and RB69). This
structural correspondence further substantiates the
mechanistic connection between eukaryotic and
prokaryotic DNA replication that has been suggested on
biochemical grounds. PCNA is also involved with
proteins involved in cell cycle processes such as DNA
repair and apoptosis. Many of these proteins contain a
highly conserved motif known as the PIP-box (PCNA
interacting protein box) which contains the sequence
Qxx[LIM]xxF[FY]. .
Length = 248
Score = 31.4 bits (72), Expect = 0.90
Identities = 26/139 (18%), Positives = 42/139 (30%), Gaps = 27/139 (19%)
Query: 321 PRDSCTMIAPLSSSGSGLKKAYVTNGIRELVLCKDASGLIGLRVCAINEGVFVCLVERGS 380
P L S LK +R+L D+ + + A +G
Sbjct: 125 PELEYDATVTLPSDE--LKDI-----VRDLESISDS-----VTISASKDGFKFSAEGELG 172
Query: 381 PASLVGLRFG---DQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRDRPLERTVTLH 437
AS+ L + + + Y++K + KAAP+++ VTL
Sbjct: 173 GASVTLLPKDSDLLVTIECSEPVSSTYSLKYLKDFTKAAPLSD------------KVTLS 220
Query: 438 RDSSGHFGFHFNKGQIVSL 456
S G F L
Sbjct: 221 FGSDGPLSLEFKIADGGHL 239
>gnl|CDD|222843 PHA02030, PHA02030, hypothetical protein.
Length = 336
Score = 31.5 bits (71), Expect = 1.1
Identities = 17/68 (25%), Positives = 26/68 (38%)
Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
++P A P VP+ A A SA + A + + P VP +
Sbjct: 255 IKPKSKAAGSNLPAVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLPDVPQVA 314
Query: 257 PIIAPSVP 264
P+ AP+ P
Sbjct: 315 PVAAPAAP 322
Score = 29.6 bits (66), Expect = 3.9
Identities = 15/66 (22%), Positives = 27/66 (40%), Gaps = 2/66 (3%)
Query: 27 QTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAP 86
+ + S+ P V ++ + A + P P + P P P+ P P+ P
Sbjct: 271 NVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVLPDVPQVAPVAA--PAAPEVPAVP 328
Query: 87 IYPSAP 92
+ P+AP
Sbjct: 329 VVPAAP 334
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 31.2 bits (71), Expect = 1.2
Identities = 22/89 (24%), Positives = 26/89 (29%), Gaps = 1/89 (1%)
Query: 143 QAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHN 202
Q P +P Q P QQ +Q P P + P+ VQP
Sbjct: 97 QQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQP-PATTAQPQPVTPPRQTTAPVQPQTP 155
Query: 203 TAYPTAPLVPSTIATAPSAPPANIEDATR 231
T P P T A A E R
Sbjct: 156 APVRTQPAAPVTQAVEAPKVEAEKEKEQR 184
>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 559
Score = 31.5 bits (71), Expect = 1.2
Identities = 23/107 (21%), Positives = 30/107 (28%), Gaps = 3/107 (2%)
Query: 168 QQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIE 227
Q+A + + A+ P APQ P P H +A S SA I
Sbjct: 52 QRALETARSDTPATAAAPAP-APQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMI- 109
Query: 228 DATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSA 274
A + + P A S PSP T
Sbjct: 110 -AAMALRQPVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQ 155
Score = 31.1 bits (70), Expect = 1.4
Identities = 27/142 (19%), Positives = 41/142 (28%), Gaps = 20/142 (14%)
Query: 136 PSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAY 195
+ P T P+ P S +Q ++ I +A +P + A
Sbjct: 66 AAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMALRQPVSVPRQAP 125
Query: 196 SVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTH-------------PTTATAPS 242
+ P + P+ A A A + A R H TTA P
Sbjct: 126 AAAPVRAASIPSP-------AAQALAHAAAVRTAPRQEHALSAVPEQLFADFLTTAPVPR 178
Query: 243 VPPAIHDATRSTPSPIIAPSVP 264
P P P IA ++
Sbjct: 179 APVQAPVVAAPAPVPAIAAALA 200
Score = 31.1 bits (70), Expect = 1.6
Identities = 40/224 (17%), Positives = 64/224 (28%), Gaps = 48/224 (21%)
Query: 55 PPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIR-------- 106
+P +A P P P PT P+AP++ + + +
Sbjct: 60 SDTPATAAAPAPA---------PQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIA 110
Query: 107 -------ANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPN 159
+VP S PS QA +V P+ + ++P QL A
Sbjct: 111 AMALRQPVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLFA-- 168
Query: 160 GFYPLTSSQQASQASLPYPINPLASNKPSAPQP------PAYSVQPSHNTAYPTAPLVPS 213
+ A +P AP P A++ +
Sbjct: 169 --------DFLTTAPVPRAPVQAPVVAAPAPVPAIAAALAAHAAYAQDDDEQLDDDGFDL 220
Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSP 257
A PPA + P AP+ P A+ + P+P
Sbjct: 221 DDALPQILPPAAL--------PPIVVAPAAPAALAAVAAAAPAP 256
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 31.8 bits (72), Expect = 1.2
Identities = 17/71 (23%), Positives = 24/71 (33%), Gaps = 12/71 (16%)
Query: 214 TIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRS 273
+ATAP A + P P PP P P+ PP D +
Sbjct: 901 RVATAPGGDAA--------SAPPPGAGPPAPPQ----AVPPPRTTQPPAAPPRGPDVPPA 948
Query: 274 ALVPLYPSLDN 284
A+ L +L +
Sbjct: 949 AVAELRETLAD 959
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 31.2 bits (70), Expect = 1.2
Identities = 29/110 (26%), Positives = 41/110 (37%), Gaps = 9/110 (8%)
Query: 143 QAPNTYPS-MPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSH 201
QA NT P +APN TS+ + + S P + N + P P Q +
Sbjct: 64 QAANTRQERTPKLEKAPNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTT 123
Query: 202 NTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDAT 251
+ P T T P P N +ST T +P++ A D T
Sbjct: 124 ESTTPK------TKVTTP--PSTNTPQPMQSTKSDTPQSPTIKQAQTDMT 165
Score = 29.3 bits (65), Expect = 5.4
Identities = 19/90 (21%), Positives = 32/90 (35%), Gaps = 14/90 (15%)
Query: 139 PNQPQAPNT-----YPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQP- 192
P +APNT S ++ P + + ++ P P + + P
Sbjct: 74 PKLEKAPNTNEEKTSASKIEKISQP----KQEEQKSLNISATPAPKQEQSQTTTESTTPK 129
Query: 193 PAYSVQPSHNTAYPTAPLVPSTIATAPSAP 222
+ PS NT P + ST + P +P
Sbjct: 130 TKVTTPPSTNTPQP----MQSTKSDTPQSP 155
>gnl|CDD|173162 PRK14699, PRK14699, replication factor A; Provisional.
Length = 484
Score = 31.5 bits (71), Expect = 1.2
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 293 ELINAYARENAANERESDAAGN 314
ELINAYARENA ++ GN
Sbjct: 238 ELINAYARENAFTQKVELQVGN 259
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 31.6 bits (71), Expect = 1.3
Identities = 48/222 (21%), Positives = 67/222 (30%), Gaps = 40/222 (18%)
Query: 34 SSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPY----YQHPSVPSHPSVPTFPSAPIYP 89
+ PP S H P A P Y PP YQ P A +
Sbjct: 739 APPPSHQAPYSGHEEPQAQQAPYPGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGYAGPWG 798
Query: 90 SAP--PQESHSPYYYSDIRANVPTGG--SPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAP 145
P+ HS Y+S + G +P P + + + Q+ P
Sbjct: 799 LRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAGHGQDQVSQFP--HLQS 856
Query: 146 NTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAY 205
T P Q P Y Q S PS ++
Sbjct: 857 ETGPPRLQLSQVPQLPYS---------------------------QTLVSSSAPSWSSPQ 889
Query: 206 PTAPLVPSTIATAPSAPPANIEDATRSTHPTTATA-PSVPPA 246
P AP+ P I T PP ++D+ ++ TA PS+P A
Sbjct: 890 PRAPIRP--IPTRFPPPPMPLQDSMAVGCDSSGTACPSMPFA 929
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 31.7 bits (72), Expect = 1.3
Identities = 23/112 (20%), Positives = 32/112 (28%), Gaps = 17/112 (15%)
Query: 160 GFYPLTSSQQASQASLPYPINPLASNKPS---------APQPPAYSVQPSHNTAYPTAPL 210
F +TS+Q + + + L S A Q A P A P
Sbjct: 13 AFEVITSAQLSRIGRIERELRELRSLVAEGAAGPVAKAAEQMAA----PEAAEAAPLPAA 68
Query: 211 VPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPS 262
S + P + A PS PA A P+ + PS
Sbjct: 69 AESIASPEVPPPVP--PAPAQEGEAPAAEQPSAVPAPSAA--PAPAEPVEPS 116
>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486). This
family is made up of members from various Burkholderia
spp. The function is unknown.
Length = 245
Score = 31.0 bits (70), Expect = 1.3
Identities = 24/90 (26%), Positives = 30/90 (33%), Gaps = 7/90 (7%)
Query: 175 LPYPINPLASNKPSAPQPPAYSVQPSH---NTAYPTAPLVPSTIATAPSAPPANIE--DA 229
+P P P + P P A APL + A P APPA DA
Sbjct: 78 VPTPHVPAVALPGDTDAPAEPGAAPHVVAERAAAMQAPLPSALAADDPQAPPAGATAADA 137
Query: 230 TRSTHPTT--ATAPSVPPAIHDATRSTPSP 257
+ T A + PPA A S +
Sbjct: 138 GDAAPDATPPAAGDASPPAAAQAAASAAAA 167
>gnl|CDD|226164 COG3638, COG3638, ABC-type phosphate/phosphonate transport system,
ATPase component [Inorganic ion transport and
metabolism].
Length = 258
Score = 31.1 bits (71), Expect = 1.3
Identities = 21/87 (24%), Positives = 36/87 (41%), Gaps = 24/87 (27%)
Query: 446 FHFNKGQIVSLVKDSSAARNGLL--VNHNI------LEVNGANVVGLKDKELLEAMEKGG 497
N+G++V+++ S A ++ LL +N + + NG + LK KEL + G
Sbjct: 25 LEINQGEMVAIIGPSGAGKSTLLRSLNGLVDPTSGEILFNGVQITKLKGKELRKLRRDIG 84
Query: 498 SVLTLTLIPTSMYQHMMKKMSFNLVKD 524
+ +Q FNLV
Sbjct: 85 MI----------FQQ------FNLVPR 95
>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
activation/proliferation-associated protein-1 C term.
This family of proteins is found in eukaryotes. Proteins
in this family are typically between 343 and 708 amino
acids in length. This family is the C terminal region of
caprin-1. Caprin-1 is a protein involved in regulating
cellular proliferation. In mutated phenotypes, the G1
phase of the cell cycle is greatly lengthened, impairing
normal proliferation. The C terminal region of caprin-1
contains RGG motifs which are characteristic of RNA
binding domains. It is possible that caprin-1 functions
through an RNA binding mechanism.
Length = 319
Score = 31.1 bits (70), Expect = 1.4
Identities = 37/158 (23%), Positives = 52/158 (32%), Gaps = 19/158 (12%)
Query: 28 TTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPI 87
+ +S + PP+YQ S P +P P ++ Q P+ S P+
Sbjct: 70 SPTSEGYTSSPPLYQPSHTAEPRPQTDPIDPIQASMSLNSEQTPTSSSLPAAS------- 122
Query: 88 YPSAPPQESHSPYYYSDIRAN---------VPTGGSPVPSYPQGSYPSNPYQAQNSY-PS 137
Q P + S I N V +PVP + Q Q SY
Sbjct: 123 -QPQVFQTGSKPLHSSGINVNAAPFQSMQTVFNMNAPVPPVNEPESLKQQNQYQASYNQG 181
Query: 138 VPNQPQAPNTYPSM-PNQLQAPNGFYPLTSSQQASQAS 174
+QPQ P + QLQ G + Q AS
Sbjct: 182 FSSQPQHPVEQTELQQEQLQTVVGSFHSQDQQAHQAAS 219
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 31.0 bits (70), Expect = 1.5
Identities = 19/84 (22%), Positives = 32/84 (38%), Gaps = 11/84 (13%)
Query: 181 PLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPS-APPANIEDATRSTHPTTAT 239
P AP + P+ + P +P A+ S AP + +E+ + S TAT
Sbjct: 77 PKTGTGKVAPPAATPTSAPTPTPSPPASP------ASGMSAAPASAVEEKSPSEESATAT 130
Query: 240 APSVPPAIHDATRSTPSPIIAPSV 263
AP P + + A ++
Sbjct: 131 APESP----STSVPSSGSDAASTL 150
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 31.2 bits (70), Expect = 1.5
Identities = 28/123 (22%), Positives = 50/123 (40%), Gaps = 2/123 (1%)
Query: 150 SMPNQLQAPNGFYPLTSSQQASQASLPYP-INPLASNKPSAPQPPAYSVQPSHNTAYPTA 208
++P P+ +S+ S A++P P + P + SA A +V S P+
Sbjct: 298 TVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSD 357
Query: 209 PLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS-PIIAPSVPPTN 267
+P T+A + P +T ++ ++ + T S P+ P V PT+
Sbjct: 358 VTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTS 417
Query: 268 RDT 270
RD
Sbjct: 418 RDA 420
>gnl|CDD|227395 COG5063, CTH1, CCCH-type Zn-finger protein [General function
prediction only].
Length = 351
Score = 31.2 bits (70), Expect = 1.5
Identities = 26/143 (18%), Positives = 40/143 (27%), Gaps = 4/143 (2%)
Query: 16 LTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPS 75
TI+ SS ++S SP Y P ++ P Y+ + S
Sbjct: 61 ETISHLSASLDNKEQSSSEKRQKAEDSSASSNSPQNYPWSPRGSNANKPYGLYKTEMLRS 120
Query: 76 HPSVPTFPSAPIYPSAPPQESHS----PYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQA 131
+P P A + S Y V VP + + S P
Sbjct: 121 STEIPYCRYPDKNPFAHSKAISSLAQTHPKYKTESLEVFINPGYVPYSKRCCFISLPLTD 180
Query: 132 QNSYPSVPNQPQAPNTYPSMPNQ 154
N P +P++ S
Sbjct: 181 INLQPLSQRKPKSGKNCTSYTLG 203
>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
This family consists of several MGC-24 (or Cd164
antigen) proteins from eukaryotic organisms.
MGC-24/CD164 is a sialomucin expressed in many normal
and cancerous tissues. In humans, soluble and
transmembrane forms of MGC-24 are produced by
alternative splicing.
Length = 187
Score = 30.4 bits (68), Expect = 1.6
Identities = 11/82 (13%), Positives = 22/82 (26%), Gaps = 1/82 (1%)
Query: 193 PAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATR 252
P S + + S + PT +P+ H +
Sbjct: 82 PEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTTSHH-HVTSE 140
Query: 253 STPSPIIAPSVPPTNRDTIRSA 274
+ + P+ P + T +A
Sbjct: 141 PKTNTTVTPTSQPDRKSTFDAA 162
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 29.3 bits (66), Expect = 1.6
Identities = 21/83 (25%), Positives = 26/83 (31%), Gaps = 4/83 (4%)
Query: 13 FHQ--LTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQH 70
F Q +A+ ++ + S PPP Y P PY PPPYY
Sbjct: 23 FGQAMERLAERAQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPY--GPPPPYYPA 80
Query: 71 PSVPSHPSVPTFPSAPIYPSAPP 93
P P P PP
Sbjct: 81 PPGVYPTPPPPNSGYMADPQEPP 103
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 30.9 bits (70), Expect = 1.7
Identities = 29/109 (26%), Positives = 39/109 (35%), Gaps = 24/109 (22%)
Query: 71 PSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQ 130
P + VP P++P P PQ ++P + P GS P YP N
Sbjct: 127 PVTATGEPVPQMPASPPGPEGEPQPGNTPVSF-------PPQGSVAVPPPTAFYPGNGVT 179
Query: 131 --AQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPY 177
Q +Y SVP +PN++Q Y SLPY
Sbjct: 180 PPPQVTYQSVP-----------VPNRIQRKTFTYNE----GKKGPSLPY 213
Score = 30.6 bits (69), Expect = 2.5
Identities = 24/98 (24%), Positives = 33/98 (33%), Gaps = 5/98 (5%)
Query: 151 MPNQLQAPNGFYPLTSSQQASQASLPYPI---NPLASNKPSAPQPPAYSVQPSHNTA-YP 206
+ Q++A + + Q P P N P + PP SV TA YP
Sbjct: 116 LAEQVKALGANPVTATGEPVPQMPASPPGPEGEPQPGNTPVS-FPPQGSVAVPPPTAFYP 174
Query: 207 TAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVP 244
+ P T S P N T+ PS+P
Sbjct: 175 GNGVTPPPQVTYQSVPVPNRIQRKTFTYNEGKKGPSLP 212
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 31.2 bits (71), Expect = 1.8
Identities = 19/132 (14%), Positives = 34/132 (25%), Gaps = 7/132 (5%)
Query: 134 SYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPP 193
YP V +PQ + P+ + + A P P+ + P
Sbjct: 844 RYPVV--RPQDVQVEEQREAEEVQVQ---PVVAEVPVAAAVEPVVSAPVVEAVAEVVEEP 898
Query: 194 AYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRS 253
+P P IA + P I ++ + A P + +
Sbjct: 899 VVVAEPQPEEVVVVETTHPEVIAAPVTEQPQVITESDVAVAQEVAEHAE--PVVEPQDET 956
Query: 254 TPSPIIAPSVPP 265
A +
Sbjct: 957 ADIEEAAETAEV 968
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 30.8 bits (69), Expect = 1.8
Identities = 35/157 (22%), Positives = 46/157 (29%), Gaps = 7/157 (4%)
Query: 8 LEDMTFHQLTIAQNEYFSKQTTSSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPP-- 65
LE+ + Q + S+Q + P T T P N +P P
Sbjct: 103 LEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQPAGVNVANNPQPQVQPQFG 162
Query: 66 PYYQHPSVPSHPSVPTFPSA---PIYPSAPPQESHSPYYYSDIRANVPTGGSPVPSYPQG 122
P Q P P P+ P + PP P N G P P + Q
Sbjct: 163 PNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPPNQMRPGFNPMPGMPPRPGFNQN 222
Query: 123 SYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPN 159
P + + P P T P PN Q P
Sbjct: 223 P-NMMPNMNRPGFRPQPGGFNHPGT-PMGPNMQQRPG 257
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 29.5 bits (66), Expect = 1.9
Identities = 19/68 (27%), Positives = 26/68 (38%), Gaps = 2/68 (2%)
Query: 34 SSPPPPVYQTSSHTSPNAYNNPP--SPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSA 91
P PPV + + N + PP +P P P Y P P P YP
Sbjct: 59 YPPSPPVSDPRYYPNSNYFPPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPG 118
Query: 92 PPQESHSP 99
P Q+ ++P
Sbjct: 119 PGQDPYAP 126
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 30.8 bits (69), Expect = 2.0
Identities = 12/47 (25%), Positives = 16/47 (34%), Gaps = 7/47 (14%)
Query: 49 PNAYNNPPSPYHSANPP--PYYQHPSVPSHPSVPTFPSAPIYPSAPP 93
+ PP PY +A P+Y P + P P AP
Sbjct: 210 GQSMYQPPGPYPNAMVGRQPFYPQPGAVAGP-----PKRRGGHKAPR 251
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 30.8 bits (69), Expect = 2.0
Identities = 27/103 (26%), Positives = 37/103 (35%), Gaps = 7/103 (6%)
Query: 165 TSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA 224
T Q Q + P +S P+ P P S PS + AP P P P
Sbjct: 399 TPGTQGPQGTAPAAGMTPSSAAPATPAP---SAAPSPRVPWDDAPPAPPRSGIPPRPAPR 455
Query: 225 NIEDATRSTHP-TTATAPSVPPAIHDATRSTPSPIIAPSVPPT 266
E + P + A+A PP + D + + PS P T
Sbjct: 456 MPEASPVPGAPDSVASASDAPPTLGDPSDTAEH---TPSGPRT 495
Score = 30.4 bits (68), Expect = 3.0
Identities = 31/133 (23%), Positives = 42/133 (31%), Gaps = 17/133 (12%)
Query: 30 SSSFSSPPPPVYQTSSHTSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYP 89
S ++ P P Q T+P A P S + P P VP + P P + I P
Sbjct: 391 SGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPP 450
Query: 90 SAPPQESHSPYYYSDIRANVPTGGSPVPSYPQGSYPSNPYQAQNSYPS--VPNQPQAPNT 147
P+ SPVP P ++ PS + P P T
Sbjct: 451 RPAPRMP---------------EASPVPGAPDSVASASDAPPTLGDPSDTAEHTPSGPRT 495
Query: 148 YPSMPNQLQAPNG 160
+ Q NG
Sbjct: 496 WDGFLEFCQGRNG 508
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 30.8 bits (69), Expect = 2.0
Identities = 27/156 (17%), Positives = 49/156 (31%), Gaps = 15/156 (9%)
Query: 126 SNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASN 185
+ + + S+P+ P+Q + + +Q T++ +S P+ +
Sbjct: 177 TTSWPTEVSHPTYPSQVTPQSQPATQGHQ----------TATANQRLSSTE-PVGTQGTT 225
Query: 186 KPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPP 245
S P P PS + PST + S + + T A S
Sbjct: 226 TSSNP-EPQTEPPPSQRGPSGSPQHPPSTTSQDQST---TGDGQEHTQRRKTPPATSNRR 281
Query: 246 AIHDATRSTPSPIIAPSVPPTNRDTIRSALVPLYPS 281
+ H P+ + PT R T + P
Sbjct: 282 SPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPH 317
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 30.3 bits (68), Expect = 2.0
Identities = 28/142 (19%), Positives = 40/142 (28%), Gaps = 14/142 (9%)
Query: 137 SVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYS 196
S P + + QA N P A S + P++S +P A
Sbjct: 37 SSSEAPASAEKSIDLNGATQA-NAQQPAPGPTSAENTSQDLSLPPISSTPTQGQEPLAQD 95
Query: 197 VQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTA--TAPSVPPAIHDATRST 254
Q + A P A ST PT AP ++ A R
Sbjct: 96 GQQRVEV-----QGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVAPVRNASVPTAERPA 150
Query: 255 PS------PIIAPSVPPTNRDT 270
+ + P+V P T
Sbjct: 151 ITRPVRAQAVSEPAVEPKAAKT 172
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 30.9 bits (69), Expect = 2.2
Identities = 28/137 (20%), Positives = 48/137 (35%), Gaps = 5/137 (3%)
Query: 108 NVPTGGSPVPSYPQGSYPS-NPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTS 166
N+P + Q P+ P+QA + P P + + Q P
Sbjct: 25 NIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDP 84
Query: 167 SQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANI 226
+ QA+ P +A +AP+P A S A+ ++ A+ P A+
Sbjct: 85 APAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAH- 143
Query: 227 EDATRSTHPTTATAPSV 243
T+ + P A S+
Sbjct: 144 ---TQHSPPPFAYTRSM 157
Score = 29.3 bits (65), Expect = 5.3
Identities = 21/106 (19%), Positives = 32/106 (30%), Gaps = 8/106 (7%)
Query: 169 QASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPST--------IATAPS 220
A+L I + +A Q P + P + P V T +A AP+
Sbjct: 16 NLIIANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPT 75
Query: 221 APPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPT 266
+ D + H + AP A A P A +
Sbjct: 76 PAASEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQ 121
>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit SEC24/subunit
SFB2/subunit SFB3 [Intracellular trafficking and
secretion].
Length = 861
Score = 30.9 bits (70), Expect = 2.3
Identities = 28/168 (16%), Positives = 50/168 (29%), Gaps = 8/168 (4%)
Query: 118 SYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPY 177
YPQ + A + + P++ A + M S +Q QA+
Sbjct: 8 VYPQAQSQVHTGAASSKKSARPHRAYANFSAGQMGMP-PYTTPPLQQQSRRQIDQAATAM 66
Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIED-ATRSTHPT 236
+ PS P S Q + + + T P P ED +
Sbjct: 67 HNTGANNPAPSVMSPAFQSQQKFSSPYGGSMADGTAPKPTNPLVPVDLFEDQPPPISDLF 126
Query: 237 TATAPSVPP----AIHDATRSTPSPIIAPSVP--PTNRDTIRSALVPL 278
P VPP + + + ++ P D ++ + +P
Sbjct: 127 LPPPPIVPPLTTNFVGSEQSNCSPKYVRSTMYAIPETNDLLKKSKIPF 174
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein
TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ
homolog and probable assembly protein of the Mycoplasma
terminal organelle. The terminal organelle is involved
in both cytadherence and gliding motility [Cellular
processes, Chemotaxis and motility].
Length = 871
Score = 30.6 bits (68), Expect = 2.4
Identities = 18/111 (16%), Positives = 29/111 (26%), Gaps = 7/111 (6%)
Query: 117 PSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLP 176
P P S P++ P P + + + P
Sbjct: 242 PQDDSEDDYVIPDAEIISSPTLEVTAPKEVEQPLQPEPVDE-------ETVAETKAEEEP 294
Query: 177 YPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIE 227
P + + SAP+ P N TI+T P P ++
Sbjct: 295 QPTQTVETKPTSAPESTVEENLPEINQPTQAVQPTSETISTTPVEPTDQLK 345
>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
subunit [Transcription].
Length = 548
Score = 30.4 bits (68), Expect = 2.5
Identities = 27/113 (23%), Positives = 37/113 (32%), Gaps = 11/113 (9%)
Query: 146 NTYPSMPNQLQ--APNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNT 203
Y M ++Q + N P + Q S +S+ S K APQ V S
Sbjct: 191 TIYEDMGCEIQPSSSNNEAPKEGNNQTSLSSIRSSKKQERSPKKKAPQ---RDVSISDRA 247
Query: 204 AYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS 256
P AP V S + S P T T ++T TP+
Sbjct: 248 TTPIAPGVESASQSISSTPTPVS------TDTPLHTVKDDSIKFDNSTLGTPT 294
>gnl|CDD|182820 PRK10898, PRK10898, serine endoprotease; Provisional.
Length = 353
Score = 30.4 bits (69), Expect = 2.6
Identities = 16/51 (31%), Positives = 24/51 (47%), Gaps = 12/51 (23%)
Query: 369 EGVFVCLVERGSPASLVGLRFGDQILSING-------ETVAGYNMKQVHKI 412
+G+ V V PA+ G++ D I+S+N ET M QV +I
Sbjct: 279 QGIVVNEVSPDGPAAKAGIQVNDLIISVNNKPAISALET-----MDQVAEI 324
>gnl|CDD|203523 pfam06815, RVT_connect, Reverse transcriptase connection domain.
This domain is known as the connection domain. This
domain lies between the thumb and palm domains.
Length = 102
Score = 28.3 bits (64), Expect = 2.7
Identities = 6/29 (20%), Positives = 10/29 (34%)
Query: 394 LSINGETVAGYNMKQVHKILKAAPVNNIR 422
+ G+ Y + Q K LK +
Sbjct: 13 IQKLGDGQWTYQIYQEKKPLKTGKYAKQK 41
>gnl|CDD|237011 PRK11892, PRK11892, pyruvate dehydrogenase subunit beta;
Provisional.
Length = 464
Score = 30.3 bits (69), Expect = 3.0
Identities = 18/70 (25%), Positives = 26/70 (37%), Gaps = 1/70 (1%)
Query: 207 TAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPS-PIIAPSVPP 265
+A + A A A A A + A AP+ P A + P P V
Sbjct: 83 SASDAGAAPAAAAEAAAAAPAAAAAAAAKKAAPAPAAPAAPAAEVAADPDIPAGTEMVTM 142
Query: 266 TNRDTIRSAL 275
T R+ +R A+
Sbjct: 143 TVREALRDAM 152
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 30.3 bits (68), Expect = 3.0
Identities = 26/139 (18%), Positives = 41/139 (29%), Gaps = 15/139 (10%)
Query: 49 PNAYNNP-PSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRA 107
P+ +P P P + P ++ P VP + P P P S+S +
Sbjct: 323 PDNPQDPVPPPNEGKDGNPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENP 382
Query: 108 NVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSS 167
P P P Q N + + P P P + N P
Sbjct: 383 PNPPN------------PDIPEQEPN--IPEDSNKEVPEDVPMEPEDDRDNNFNEPKKPE 428
Query: 168 QQASQASLPYPINPLASNK 186
+ + P PL + +
Sbjct: 429 NKGDGQNEPVIPKPLDNER 447
>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
[Intracellular trafficking and secretion].
Length = 360
Score = 30.1 bits (68), Expect = 3.1
Identities = 18/122 (14%), Positives = 27/122 (22%), Gaps = 5/122 (4%)
Query: 164 LTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSH--NTAYPTAPLVPSTIATAPSA 221
++ + P N P P P + P + P P A
Sbjct: 45 ALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDLGAPVLPDQQVEEAKDQPR 104
Query: 222 P-PANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRD--TIRSALVPL 278
A AT + + A A + P A+ P
Sbjct: 105 RLRAAELAATSGSRVESDRAVGRVRAALANAAPAAAAPPPAGQPSGQSAKEDFAGAVNPT 164
Query: 279 YP 280
P
Sbjct: 165 QP 166
>gnl|CDD|236797 PRK10927, PRK10927, essential cell division protein FtsN;
Provisional.
Length = 319
Score = 30.0 bits (67), Expect = 3.4
Identities = 22/101 (21%), Positives = 30/101 (29%), Gaps = 6/101 (5%)
Query: 128 PYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKP 187
P Q Q + Q + T + QA+ P P S
Sbjct: 146 PEQRQQTLQRQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAA----PVQAQPRQSKPA 201
Query: 188 SAPQPPAYSVQ-PSHNTAYPTAPLVPSTIATAPSAPPANIE 227
S QP +Q P+H TA + + A AP E
Sbjct: 202 STQQPYQDLLQTPAHTTAQSKPQQA-APVTRAADAPKPTAE 241
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 28.5 bits (64), Expect = 3.4
Identities = 16/86 (18%), Positives = 23/86 (26%), Gaps = 3/86 (3%)
Query: 184 SNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSV 243
S+ S P+ P P P ++ PP + A P+
Sbjct: 26 SSCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAG 85
Query: 244 PPAIHDATRSTP-SPIIAPSVPPTNR 268
PP+ P P P R
Sbjct: 86 PPS--PLAPPAPARKPPLPPPRPQRR 109
>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein. This family consists of
Ebola and Marburg virus nucleoproteins. These proteins
are responsible for encapsidation of genomic RNA. It has
been found that nucleoprotein DNA vaccines can offer
protection from the virus.
Length = 717
Score = 29.9 bits (67), Expect = 4.3
Identities = 15/65 (23%), Positives = 23/65 (35%), Gaps = 4/65 (6%)
Query: 17 TIAQNEYFSKQTTSSSFSSPPPPVYQTSS--HTSPNAYNNPPSPYHSAN--PPPYYQHPS 72
+ ++Q + +PP PVY+ P+ NP P S + S
Sbjct: 566 SDDDPGSDNEQGVDLTEVAPPAPVYRDEKEQDEIPHPAQNPQDPTGSIGNVDSDILRSNS 625
Query: 73 VPSHP 77
PS P
Sbjct: 626 KPSAP 630
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 29.5 bits (66), Expect = 4.4
Identities = 14/47 (29%), Positives = 19/47 (40%), Gaps = 2/47 (4%)
Query: 53 NNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSP 99
PPS + N P+ HP+ P P + P+APP P
Sbjct: 59 PAPPSTMIATNTKPF--HPAPIDVPPDPPAAQEAVQPTAPPSAQSEP 103
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 29.8 bits (67), Expect = 4.4
Identities = 19/110 (17%), Positives = 33/110 (30%), Gaps = 7/110 (6%)
Query: 170 ASQASLPYPINPLASNKPSAPQPPAYSVQPSHNT-AYPTAPLVPSTIATAPSAPPANIED 228
A A P +A+ P AP P + + + + P+ ATA AP + +
Sbjct: 338 ALLALGGAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDP 397
Query: 229 ATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALVPL 278
R + P + RD + + + L
Sbjct: 398 RPRGPPAPEPARSAEAPP------LVAPAAAPAGLALRWRDVLAALKMQL 441
>gnl|CDD|237524 PRK13826, PRK13826, Dtr system oriT relaxase; Provisional.
Length = 1102
Score = 29.7 bits (67), Expect = 4.5
Identities = 39/121 (32%), Positives = 51/121 (42%), Gaps = 24/121 (19%)
Query: 335 GSGLKKAYVTNGI----RELVLCKDASGLIGLR--VCAINEGVFVCLVERGSPASLVG-- 386
GS LK V + I R+ K L LR V +NE LVERG +VG
Sbjct: 567 GSRLKAEAVESLIADWNRDYDPTKTTLILAHLRRDVRMLNEMARAKLVERG----IVGEG 622
Query: 387 ---------LRF--GDQILSINGETVAGYNMKQVHKILKAAPVNNIRVVIRDRPLERTVT 435
RF GDQI+ + E G + K+++AAP N I I + R VT
Sbjct: 623 HAFRTADGERRFAAGDQIVFLKNEGSLGVKNGMIGKVVEAAP-NRIVAEIGEGEHRRQVT 681
Query: 436 L 436
+
Sbjct: 682 V 682
>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus. This family represents
the C-terminus of eukaryotic neogenin precursor
proteins, which contains several potential
phosphorylation sites. Neogenin is a member of the N-CAM
family of cell adhesion molecules (and therefore
contains multiple copies of pfam00047 and pfam00041) and
is closely related to the DCC tumour suppressor gene
product - these proteins may play an integral role in
regulating differentiation programmes and/or cell
migration events within many adult and embryonic
tissues.
Length = 295
Score = 29.5 bits (66), Expect = 4.6
Identities = 34/169 (20%), Positives = 48/169 (28%), Gaps = 25/169 (14%)
Query: 120 PQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPI 179
P S P P + + ++ N PQ P PS + L
Sbjct: 86 PMDSQPPQPVVSAHPIHTLDN-PQYPGILPSPRCGYLHHQFSLRPMPFSTLTVQRLYQHG 144
Query: 180 NPLAS-----NKPSAPQPPAYSVQPSHNTAYPTAPLVP-----------STIATAPSAPP 223
+ S P P PA + S+ + +P S A A
Sbjct: 145 DRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPALPASM 204
Query: 224 ANIEDATRSTHPTTATAPSVPP--------AIHDATRSTPSPIIAPSVP 264
+ IE ST T P++P RS P+ PS P
Sbjct: 205 STIEPKLPSTPLLTQQGPTLPKHSVKTASVGTLGRARSPLLPVTVPSAP 253
>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
This family consists of a series of primate specific
nuclear pore complex interacting protein (NPIP)
sequences. The function of this family is unknown but is
well conserved from African apes to humans.
Length = 350
Score = 29.4 bits (65), Expect = 5.1
Identities = 24/109 (22%), Positives = 44/109 (40%), Gaps = 2/109 (1%)
Query: 167 SQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAP-PAN 225
++ Q P + + N S PP + P +A P+A T A P P +
Sbjct: 241 NRMGHQPPPPTQQHSIIDNSLSLKTPPECLLTPLPPSALPSADDNLKTPAECLLRPLPPS 300
Query: 226 IEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPS-VPPTNRDTIRS 273
+D + P +PP++ D + + PS +PP+ D +++
Sbjct: 301 ADDNIKECPLALKPPPPLPPSVDDNLKECLLAPLPPSALPPSVDDNLKT 349
>gnl|CDD|217931 pfam04153, NOT2_3_5, NOT2 / NOT3 / NOT5 family. NOT1, NOT2,
NOT3, NOT4 and NOT5 form a nuclear complex that
negatively regulates the basal and activated
transcription of many genes. This family includes NOT2,
NOT3 and NOT5.
Length = 131
Score = 28.1 bits (63), Expect = 5.2
Identities = 14/49 (28%), Positives = 19/49 (38%), Gaps = 7/49 (14%)
Query: 50 NAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHS 98
++ +PPSP + PP Y P + YP PP S S
Sbjct: 3 PSFASPPSPSDARERPPQYDIPECYLNTP-------SSYPQNPPPLSPS 44
>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9. The KAR9 protein in
Saccharomyces cerevisiae is a cytoskeletal protein
required for karyogamy, correct positioning of the
mitotic spindle and for orientation of cytoplasmic
microtubules. KAR9 localises at the shmoo tip in mating
cells and at the tip of the growing bud in anaphase.
Length = 626
Score = 29.5 bits (66), Expect = 5.5
Identities = 30/191 (15%), Positives = 49/191 (25%), Gaps = 18/191 (9%)
Query: 1 MSSIYPSLE-DMTFHQLTIAQNEYFSKQTTSS--SFSSPPPPVYQTSSHTSPNAYNNPPS 57
+I P+L I+ E SS PP + S H N + N
Sbjct: 440 RMNIKPTLASIPDEKPSNISVFEDSETSPNSSTLLRDPPPKKCGEESGHLPNNPFFNKLK 499
Query: 58 PYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRANVPTGGSPVP 117
S+ PP + + PS P S +R +G P
Sbjct: 500 LTLSSIPPLSPRQSIITLPT-----------PSRPASRISS----LSLRLGSYSGSIVSP 544
Query: 118 SYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPY 177
+ SV + ++ F +++ +SLP
Sbjct: 545 PPYPTLVSRKGAAGLSFNRSVSDIEGERIGRYNLLPTRIPALPFKAESTTSSRRSSSLPS 604
Query: 178 PINPLASNKPS 188
P +
Sbjct: 605 PTGVIGFPGSV 615
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 29.4 bits (66), Expect = 5.9
Identities = 22/142 (15%), Positives = 36/142 (25%), Gaps = 9/142 (6%)
Query: 116 VPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASL 175
+P Q S QA P Q P ++ L +QA
Sbjct: 54 MPKVFQKSSAPRQIQAA------PPQALNPTAAAPQSSRGPTLRELLALPEDDGETQAPE 107
Query: 176 PYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIAT---APSAPPANIEDATRS 232
P + + + + I + PP + E+ T S
Sbjct: 108 SSPSARRLTRSEGVARHEMEDLAGRPVVKPDADRQLRQDILNKSSSSRRPPVSKEEGTSS 167
Query: 233 THPTTATAPSVPPAIHDATRST 254
P TA A + + +
Sbjct: 168 KMPATALASAALFKDDEIRQEV 189
>gnl|CDD|236863 PRK11153, metN, DL-methionine transporter ATP-binding subunit;
Provisional.
Length = 343
Score = 29.0 bits (66), Expect = 5.9
Identities = 22/84 (26%), Positives = 37/84 (44%), Gaps = 24/84 (28%)
Query: 446 FHFNKGQIVSLVKDSSAARNGLLVNHNILE--------VNGANVVGLKDKELLEAMEKGG 497
H G+I ++ S A ++ L+ N+LE V+G ++ L +KEL +A + G
Sbjct: 26 LHIPAGEIFGVIGASGAGKSTLIRCINLLERPTSGRVLVDGQDLTALSEKELRKARRQIG 85
Query: 498 SVLTLTLIPTSMYQHMMKKMSFNL 521
+ +QH FNL
Sbjct: 86 MI----------FQH------FNL 93
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 29.1 bits (65), Expect = 6.1
Identities = 20/94 (21%), Positives = 24/94 (25%), Gaps = 10/94 (10%)
Query: 106 RANVPTGGSPVPSYPQGSYPSNPYQAQNSYPSVPNQPQAPNTYPSMPNQLQAPNGFYPLT 165
N+P P P P S S P ++ N F P
Sbjct: 179 GGNIPGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSF----------NPFLPGP 228
Query: 166 SSQQASQASLPYPINPLASNKPSAPQPPAYSVQP 199
S Q S P P+ PP QP
Sbjct: 229 SPAQPSAPPASIPAPPIPPVIQYVAPPPVPPPQP 262
>gnl|CDD|215141 PLN02252, PLN02252, nitrate reductase [NADPH].
Length = 888
Score = 29.3 bits (66), Expect = 6.3
Identities = 18/83 (21%), Positives = 31/83 (37%), Gaps = 8/83 (9%)
Query: 210 LVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRD 269
LV + A + SA + + ++ A+ P + P ++ + D
Sbjct: 591 LVTTGAAASSSASSHPLSAISTASALAAASPAPGRPVALNPREKIPCRLVEKI--SLSHD 648
Query: 270 TI--RSALVPLYPSLDNYMGLEV 290
R AL PS D+ +GL V
Sbjct: 649 VRLFRFAL----PSEDHVLGLPV 667
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 29.3 bits (66), Expect = 6.7
Identities = 19/66 (28%), Positives = 22/66 (33%), Gaps = 1/66 (1%)
Query: 216 ATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSAL 275
A AP + P A AP VPPA + P AP+ P RSA
Sbjct: 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAAR-PAPAPAPPAAAAPPARSAD 439
Query: 276 VPLYPS 281
S
Sbjct: 440 PAAAAS 445
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as a
single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through DAZ
repeats.
Length = 136
Score = 27.9 bits (62), Expect = 6.8
Identities = 23/77 (29%), Positives = 27/77 (35%), Gaps = 8/77 (10%)
Query: 49 PNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSPYYYSDIRAN 108
P AY+ P + P S P P A + PQ SH P Y I
Sbjct: 5 PPAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMA-VGPQSSHPPMAYYPI--- 60
Query: 109 VPTGGSPVPSYPQGSYP 125
G+P P YP GS
Sbjct: 61 ----GAPPPVYPPGSTV 73
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 29.3 bits (65), Expect = 6.9
Identities = 27/174 (15%), Positives = 45/174 (25%), Gaps = 39/174 (22%)
Query: 178 PINPLASNKPSAPQPPAYSVQPSHNTAYPTA----------------PLVPSTIATAPSA 221
P + L + P A +V + TA A P PS+
Sbjct: 284 PPDALLQPATVSAIPAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATPSSSLPTDIG 343
Query: 222 PPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI----------------IAPSVP- 264
A + A + + P + P+P+ +AP+
Sbjct: 344 NKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVPPPGLAIPSLVAPPGLVAPTEIN 403
Query: 265 ------PTNRDTIRSALVPLYPSLDNYMGLEVASELINAYARENAANERESDAA 312
P + V D E + E + + A E+ AA
Sbjct: 404 PSFLASPRKKMKREKLPVTFGALDDTLAWKEPSKEDQTSEDGKMLAIMGEAAAA 457
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 28.9 bits (65), Expect = 7.7
Identities = 13/58 (22%), Positives = 17/58 (29%), Gaps = 2/58 (3%)
Query: 208 APLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIAPSVPP 265
APL S I T + P A A + T A + P + P
Sbjct: 113 APL--SEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKP 168
Score = 28.9 bits (65), Expect = 8.6
Identities = 12/83 (14%), Positives = 17/83 (20%), Gaps = 8/83 (9%)
Query: 161 FYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPS 220
+ + A+ A A + PT P AP+
Sbjct: 115 LSEIDTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPA 174
Query: 221 APPANIEDATRSTHPTTATAPSV 243
A P P P
Sbjct: 175 AKP--------PPTPVARADPRE 189
>gnl|CDD|173288 PRK14827, PRK14827, undecaprenyl pyrophosphate synthase;
Provisional.
Length = 296
Score = 28.8 bits (64), Expect = 8.0
Identities = 18/63 (28%), Positives = 28/63 (44%), Gaps = 6/63 (9%)
Query: 139 PNQPQAPNTYPSMPNQLQAPNGFYPLTSSQQASQASLPYPINPLASNKPSAPQPPAYSVQ 198
P P AP+ YP+ P++ P F L + P +K +AP+ PA +
Sbjct: 14 PQLPPAPDDYPTFPDKSTWPVVFPELPPAPYGGPCRPPQHT-----SKAAAPRIPADRL- 67
Query: 199 PSH 201
P+H
Sbjct: 68 PNH 70
>gnl|CDD|183690 PRK12698, flgH, flagellar basal body L-ring protein; Reviewed.
Length = 224
Score = 28.5 bits (64), Expect = 8.2
Identities = 15/43 (34%), Positives = 20/43 (46%), Gaps = 11/43 (25%)
Query: 76 HPSVPTFPS-APIYPSAPP----------QESHSPYYYSDIRA 107
+ P AP+YP APP Q+S + YSDI+A
Sbjct: 21 EKPIADDPYYAPVYPEAPPTKIAATGSIFQDSQANSLYSDIKA 63
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 28.9 bits (65), Expect = 8.2
Identities = 12/53 (22%), Positives = 17/53 (32%)
Query: 47 TSPNAYNNPPSPYHSANPPPYYQHPSVPSHPSVPTFPSAPIYPSAPPQESHSP 99
+ N P + + +H S S PS P + PP S S
Sbjct: 289 ADSSVEANGVEPEPTGSVSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSH 341
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 28.7 bits (64), Expect = 8.3
Identities = 26/118 (22%), Positives = 40/118 (33%), Gaps = 15/118 (12%)
Query: 171 SQASLPYPINPLASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA------ 224
++ + P N L + +AP P A + +P H P A P A
Sbjct: 381 TRLDMSIPANLLHNAPQAAPAPSAAAPEPKH--QPAPEPRPVLAPTPASGEPNAAAVRSM 438
Query: 225 --NIEDATRSTHPTTAT--APSVPPAIHDATR--STPSPIIAPSVPPT-NRDTIRSAL 275
+ D R TT A + A+ T + S +A + N D + AL
Sbjct: 439 WSTVRDKVRQRSRTTEVMLAGATVRALEGNTLVLTHESAPLARRLSEQRNADVLAEAL 496
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 28.9 bits (64), Expect = 8.3
Identities = 26/100 (26%), Positives = 35/100 (35%), Gaps = 12/100 (12%)
Query: 165 TSSQQASQASLPYPINPL--ASNKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATA--PS 220
T Q AS A++P SN+ + PP TA TA +I TA
Sbjct: 405 TPVQTASAAAMPSEGKTAGPVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTASEAE 464
Query: 221 APPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPIIA 260
PP N ++ T S P + +PI A
Sbjct: 465 TPPENQVSKNKAADNETDAPLSEVP--------SENPIQA 496
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 28.7 bits (64), Expect = 8.4
Identities = 19/67 (28%), Positives = 23/67 (34%), Gaps = 8/67 (11%)
Query: 199 PSHNTAYPTAPLVPSTIATAPSAPPANIEDATRSTHPTTATAPSVPPAIHDATRSTPSPI 258
P+ A P P P + A A AP A P A P D + P P
Sbjct: 34 PATANADPAPPPPPPSTAAAAPAPAAPP--------PPPPPAAPPAPQPDDPNAAPPPPP 85
Query: 259 IAPSVPP 265
P+ PP
Sbjct: 86 ADPNAPP 92
>gnl|CDD|178125 PLN02509, PLN02509, cystathionine beta-lyase.
Length = 464
Score = 28.8 bits (64), Expect = 9.0
Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 1/33 (3%)
Query: 33 FSSPPPPVYQTSSHTSPNAYNNPPSPY-HSANP 64
F + P+YQT++ P+A N P Y S NP
Sbjct: 101 FDAMSTPLYQTATFKQPSAIENGPYDYTRSGNP 133
>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
Length = 171
Score = 27.8 bits (62), Expect = 9.1
Identities = 18/84 (21%), Positives = 26/84 (30%), Gaps = 10/84 (11%)
Query: 185 NKPSAPQPPAYSVQPSHNTAYPTAPLVPSTIATAPSAPPA---NIEDATRSTHPTTATAP 241
+ S +P A S+ + T P+ T T + PA E + P A
Sbjct: 50 EQSSDAKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAAR 109
Query: 242 SVPPAIHDATRSTPSPIIAPSVPP 265
V P + P PP
Sbjct: 110 EVKTV-------VPVSSVTPVTPP 126
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 28.4 bits (63), Expect = 9.3
Identities = 20/87 (22%), Positives = 28/87 (32%), Gaps = 8/87 (9%)
Query: 33 FSSPPPPVYQTSSHTSPNAYNNPPSPYHSA---NPPPYYQHPSVPSHPSVPTFPSAPIYP 89
SPPPP S + + SP+ N P P + S+P P+
Sbjct: 295 IRSPPPPGRLRSPPPLHASDSPVASPHVGELGVNFSPLAFRPGLTRTKSLPRTPNPF--- 351
Query: 90 SAPPQESHSPYYYSDIRANVPTGGSPV 116
E P+ D R + T V
Sbjct: 352 --HRLEYILPFNERDERESSTTRDVHV 376
>gnl|CDD|235777 PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier
protein subunit; Validated.
Length = 155
Score = 27.8 bits (63), Expect = 9.7
Identities = 8/39 (20%), Positives = 14/39 (35%)
Query: 238 ATAPSVPPAIHDATRSTPSPIIAPSVPPTNRDTIRSALV 276
A + PA A + P+ A + + S +V
Sbjct: 49 AAPVAAAPAAAAAAAAAPAAAPAAAAAEAEGHVVTSPMV 87
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.311 0.128 0.375
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,525,205
Number of extensions: 2537514
Number of successful extensions: 5181
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3504
Number of HSP's successfully gapped: 650
Length of query: 530
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 429
Effective length of database: 6,457,848
Effective search space: 2770416792
Effective search space used: 2770416792
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 61 (27.4 bits)