RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6065
(316 letters)
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 280 bits (718), Expect = 8e-94
Identities = 118/316 (37%), Positives = 152/316 (48%), Gaps = 18/316 (5%)
Query: 13 KAIQHFLNVASEHDNRDLTVSYWCRLHALQKALKIDKSSDGAKMLLLKLMDWLEQEKKTH 72
K+IQ +L A E + D V+Y+CRL+A+Q LKI L KL+D LE KK
Sbjct: 1 KSIQPYLKRAQELEKADPVVAYYCRLYAVQLILKIKLKDGEVTKFLTKLLDQLEAFKKEL 60
Query: 73 RDNDTIMNETAAQAYIENYALKLFQWADSMDRASTFNKNVVKAFYTASILMDVLQTFGET 132
DND I +E AAQAY+EN+ALKLF AD DRA KN K FY AS +VL FGE
Sbjct: 61 GDNDAITDEVAAQAYVENFALKLFNRADKQDRAGRATKNTAKTFYAASTFFEVLAIFGEL 120
Query: 133 SEDIEQNKKYAKWKAAYIHNCLKSGETPIPGPP-----ESGEGSEGPSLSSQNSNDQLPS 187
E++ Q KYAKWKAA IH LK GE P PGPP E + + S +S D P+
Sbjct: 121 DEEVAQKIKYAKWKAARIHKALKEGEDPNPGPPLDEEDEDADVATTNSDNSFPGEDADPA 180
Query: 188 PPRENSNEDVTPSFPAPTSSGGT-----LPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP 242
+ +P P+ S S P P + + + P P
Sbjct: 181 SASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGP 240
Query: 243 AVTPSSP-------STPAPQPAMPALVTQPEQEISYIEAELINKCQKYIKFASSALNYDD 295
A P P ST P P + P I+ ++ + I K QK+ K+A SALN++D
Sbjct: 241 AAPPPPPVQQVPPLSTAKPTPPSASATPAPIGGIT-LDDDAIAKAQKHAKWAISALNFED 299
Query: 296 YKEAKSNLIKVLNILN 311
A L K L +L
Sbjct: 300 VPTAVKELRKALKLLG 315
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 44.0 bits (104), Expect = 9e-05
Identities = 27/113 (23%), Positives = 41/113 (36%), Gaps = 9/113 (7%)
Query: 153 CLKSGETPIPGPPESGEGSEG------PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTS 206
C PI P E S P +S +S+ + SP S+ P+ +P +
Sbjct: 255 CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRA 314
Query: 207 SGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
S + S S + S S G ++ PSP+ +PS P P
Sbjct: 315 SSSSSSSRES---SSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSS 364
Score = 40.5 bits (95), Expect = 0.001
Identities = 26/112 (23%), Positives = 34/112 (30%), Gaps = 7/112 (6%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTP-------SFPAPTSSG 208
P P S PS S + S+ + + + SS
Sbjct: 278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSR 337
Query: 209 GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
G SP P P PS P S P P+ PSSP+ A +P
Sbjct: 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRR 389
Score = 39.4 bits (92), Expect = 0.002
Identities = 28/111 (25%), Positives = 38/111 (34%), Gaps = 7/111 (6%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLP-SPPRENSNEDVTPS----FPAPTSSGGT 210
P PGP +E S + + + P SP RE S PS P
Sbjct: 67 PPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP 126
Query: 211 LPSPPS-MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
PSP + + L GP + SPA S ++ Q A+P
Sbjct: 127 PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASS-RQAALPL 176
Score = 36.3 bits (84), Expect = 0.024
Identities = 26/108 (24%), Positives = 32/108 (29%), Gaps = 8/108 (7%)
Query: 158 ETPIPGPPESGEGSEGPSLS-------SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGT 210
P PP S S P LS S P S V + +
Sbjct: 116 PPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALP 175
Query: 211 LPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAV-TPSSPSTPAPQPA 257
L SP + P P P+ S P +P S S +P PA
Sbjct: 176 LSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPA 223
Score = 35.9 bits (83), Expect = 0.030
Identities = 24/103 (23%), Positives = 29/103 (28%), Gaps = 14/103 (13%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
P P G G+E P S + S AP S P P
Sbjct: 65 FEPPTGPPPGPGTEAP-------------ANESRSTPTWSLSTLAPASPAREGSPTPPGP 111
Query: 219 QNLPG-PSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
+ P P P PS S + P P P + PA
Sbjct: 112 SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPA 154
Score = 35.1 bits (81), Expect = 0.047
Identities = 23/106 (21%), Positives = 35/106 (33%), Gaps = 7/106 (6%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGT--LPS 213
S P P S S +++ D S+ D + S + G P
Sbjct: 203 SPRPPRRSSPISASASSPAPAPGRSAAD-----DAGASSSDSSSSESSGCGWGPENECPL 257
Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
P P LP GPS P+ + + +P+P P+ P
Sbjct: 258 PRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSP 303
Score = 33.6 bits (77), Expect = 0.14
Identities = 23/99 (23%), Positives = 29/99 (29%), Gaps = 3/99 (3%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
G G E + + LP+ E S + S P P SS P
Sbjct: 242 SESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSS--SPRERSPSPS 299
Query: 222 PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
P SGP P S S SS S+ +
Sbjct: 300 PSSPGSGPAPSSPRASSSSSS-SRESSSSSTSSSSESSR 337
Score = 32.8 bits (75), Expect = 0.27
Identities = 23/110 (20%), Positives = 31/110 (28%), Gaps = 7/110 (6%)
Query: 162 PGPPESGEGSEGP------SLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
P PP P LS P PP S + PA +S
Sbjct: 114 PDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPA-AGASPAAVASDAASSRQA 172
Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
++P + P + P P P PA P + + A P
Sbjct: 173 ALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAP 222
Score = 32.5 bits (74), Expect = 0.34
Identities = 27/117 (23%), Positives = 37/117 (31%), Gaps = 7/117 (5%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSP--PRENSNEDVTPSFPAPTSSGGTLPS 213
S +P P P S G S S +S++ P PT
Sbjct: 216 SASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW 275
Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEIS 270
+ P S S P PS PSP+ +P S P+ A + + E S
Sbjct: 276 NGPSSRPGPASSSSSPRERSPS----PSPS-SPGSGPAPSSPRASSSSSSSRESSSS 327
Score = 32.5 bits (74), Expect = 0.38
Identities = 27/128 (21%), Positives = 38/128 (29%), Gaps = 17/128 (13%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDV-TPSFPAPTSSGGTLPSPPSMPQN 220
PP + S + + + R P P P + G P +
Sbjct: 836 ARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAA 895
Query: 221 LPGPSISGPLIPGPSIPSFPSPA-----------VTPSSPSTPAPQPAMPALVTQPEQEI 269
P + + P P + P P V P TPAP A A PE
Sbjct: 896 PPAGAPAPRPRPAPRVKLGPMPPGGPDPRGGFRRVPPGDLHTPAPSAAALAAYCPPE--- 952
Query: 270 SYIEAELI 277
+ AEL+
Sbjct: 953 --VVAELV 958
Score = 31.7 bits (72), Expect = 0.65
Identities = 19/101 (18%), Positives = 27/101 (26%), Gaps = 5/101 (4%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
+ G ++ S + S + + P S T +P S P
Sbjct: 137 MLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQ----AALPLS-SPEETARAPSSPPA 191
Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
P + P P S P A S P A A
Sbjct: 192 EPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDA 232
Score = 31.3 bits (71), Expect = 0.77
Identities = 18/108 (16%), Positives = 28/108 (25%), Gaps = 6/108 (5%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
S S PP S+E + S PA +
Sbjct: 813 SRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSES-SKSKPAAAGGRARGKNGR 871
Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
P+ + G P + + A P+ P P+PA +
Sbjct: 872 RRPRPPEPRARPGAAAPPKA-----AAAAPPAGAPAPRPRPAPRVKLG 914
Score = 29.8 bits (67), Expect = 2.3
Identities = 20/117 (17%), Positives = 28/117 (23%), Gaps = 13/117 (11%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRE----NSNEDVTPSFPAPTSSGGTLPSPP-- 215
P E+ G S + D + + D P G PP
Sbjct: 789 PVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPA 848
Query: 216 -------SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
S P G + P P + + P A P A +P
Sbjct: 849 RSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRP 905
Score = 28.2 bits (63), Expect = 8.3
Identities = 14/91 (15%), Positives = 22/91 (24%), Gaps = 3/91 (3%)
Query: 173 GPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSP---PSMPQNLPGPSISGP 229
P+ + + P P + + + G SGP
Sbjct: 776 EPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGP 835
Query: 230 LIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
P + P + SS S PA
Sbjct: 836 ARPPGAAARPPPARSSESSKSKPAAAGGRAR 866
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 41.3 bits (96), Expect = 4e-04
Identities = 23/109 (21%), Positives = 43/109 (39%), Gaps = 4/109 (3%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSP---PRENSNEDVTPSFPAPTSSGGTLPSPPS 216
P P P ++ + P+++ ++ + P P ++E P+ PAP + P P
Sbjct: 42 PAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPA-PAPHQAASRAPDPAV 100
Query: 217 MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
PQ P + + +PA +S ++ P PA + P
Sbjct: 101 APQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHSPP 149
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 41.1 bits (96), Expect = 6e-04
Identities = 29/114 (25%), Positives = 38/114 (33%), Gaps = 4/114 (3%)
Query: 160 PIPGPPESGEGSEGPS--LSSQNSNDQLPSPPRENSNEDVTP-SFPAPTSSGGTLPSPPS 216
P P GS P + + + + P + V + PA + S + PP
Sbjct: 2844 GPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPD 2903
Query: 217 MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEIS 270
P+ P P P P P P P P P P P P A P E S
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA-PTTDPAGAGEPS 2956
Score = 38.4 bits (89), Expect = 0.005
Identities = 26/101 (25%), Positives = 34/101 (33%), Gaps = 4/101 (3%)
Query: 165 PESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGP 224
P S + P+ ++ +L P S E SF P P P + P P P
Sbjct: 2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTE----SFALPPDQPERPPQPQAPPPPQPQP 2921
Query: 225 SISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
P P P P P P + + PA V QP
Sbjct: 2922 QPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
Score = 37.2 bits (86), Expect = 0.012
Identities = 21/91 (23%), Positives = 32/91 (35%), Gaps = 2/91 (2%)
Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAV 244
LP+ R ++ TP F +P ++P P+ + P P+ P P+
Sbjct: 380 LPTRKRRSARHAATP-FARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSA 438
Query: 245 TPSSPSTPAPQPA-MPALVTQPEQEISYIEA 274
P S PAP P P +A
Sbjct: 439 EPGSDDGPAPPPERQPPAPATEPAPDDPDDA 469
Score = 36.5 bits (84), Expect = 0.019
Identities = 26/114 (22%), Positives = 36/114 (31%), Gaps = 7/114 (6%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
T S PS P +P+ P P + P+ P P
Sbjct: 2784 TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS-AQPTAPPPP 2842
Query: 219 QNLPGPSIS--GPLIPGPSIPSFPSPAVTPSSPSTPAPQP----AMPALVTQPE 266
P PS+ G + PG + P + P+ PA P A PA+ E
Sbjct: 2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE 2896
Score = 36.5 bits (84), Expect = 0.020
Identities = 21/78 (26%), Positives = 25/78 (32%), Gaps = 6/78 (7%)
Query: 186 PSPPRENSNEDVTPSFPAPTSSG------GTLPSPPSMPQNLPGPSISGPLIPGPSIPSF 239
P+ P + V P PAP S P P P GP+ PS
Sbjct: 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSP 2616
Query: 240 PSPAVTPSSPSTPAPQPA 257
P P P+P PA
Sbjct: 2617 LPPDTHAPDPPPPSPSPA 2634
Score = 36.5 bits (84), Expect = 0.021
Identities = 22/115 (19%), Positives = 34/115 (29%), Gaps = 10/115 (8%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTS----------SGGTL 211
P PP G P ++ + + P + + P S +L
Sbjct: 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL 2798
Query: 212 PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
PSP +P + P+ P P T + P+ P P P P
Sbjct: 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG 2853
Score = 36.5 bits (84), Expect = 0.022
Identities = 25/126 (19%), Positives = 38/126 (30%), Gaps = 17/126 (13%)
Query: 157 GETPIPGPPESGEGSEGPSLS-----SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTL 211
+P P + +S + S P + +S
Sbjct: 2642 PPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701
Query: 212 PSPPSMPQNLPGPSISG-PLIPGPSI-----------PSFPSPAVTPSSPSTPAPQPAMP 259
P PP P+ P +S PL PGP+ P+ P+ P++P PA P
Sbjct: 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
Query: 260 ALVTQP 265
P
Sbjct: 2762 TTAGPP 2767
Score = 36.1 bits (83), Expect = 0.024
Identities = 21/104 (20%), Positives = 34/104 (32%), Gaps = 6/104 (5%)
Query: 157 GETPIPGPP-ESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
G PP +G + P + + + P S + S P+P P+ P
Sbjct: 2753 GPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD-----PADP 2807
Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
P ++ P +P S T P P P++P
Sbjct: 2808 PAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
Score = 36.1 bits (83), Expect = 0.028
Identities = 25/115 (21%), Positives = 34/115 (29%), Gaps = 9/115 (7%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTP---SFPAPTSSGGTLPSPPS 216
P P+ S P+ + + + PP E +D P S P G S
Sbjct: 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASS 2678
Query: 217 MPQNLPGPSISGPLIP------GPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
PQ + + P P P PA +TP P A P
Sbjct: 2679 PPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP 2733
Score = 36.1 bits (83), Expect = 0.030
Identities = 28/91 (30%), Positives = 38/91 (41%), Gaps = 13/91 (14%)
Query: 173 GPSLSSQNSNDQLPSP-PRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLI 231
G S + P+P P ++ TP P P ++ P+ P+ P P +GP
Sbjct: 2693 GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP--APPAVPAGPAT 2750
Query: 232 PGPSIPSFPSPAVTPSSPSTPA--PQPAMPA 260
PG PA P+ P T A P PA PA
Sbjct: 2751 PG-------GPA-RPARPPTTAGPPAPAPPA 2773
Score = 34.5 bits (79), Expect = 0.080
Identities = 30/135 (22%), Positives = 42/135 (31%), Gaps = 30/135 (22%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPA--PTSSGGTLPSPPS 216
TP+P P + + P P+ PA PT++G P+PP+
Sbjct: 2719 TPLPPGPAAARQASPAL-----PAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773
Query: 217 MPQNLPGP----------SISGPLIPGPSIPSFPSPAVTPSSPSTPA----------P-- 254
P P S S +P P P+ P AV + + P P
Sbjct: 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
Query: 255 -QPAMPALVTQPEQE 268
QP P P
Sbjct: 2834 AQPTAPPPPPGPPPP 2848
Score = 34.1 bits (78), Expect = 0.12
Identities = 27/127 (21%), Positives = 37/127 (29%), Gaps = 18/127 (14%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
PP S S+++ P R + P P P P PP P
Sbjct: 2879 ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR 2938
Query: 222 PGPSI---SGPLIPGPSIPSFPSPA---------------VTPSSPSTPAPQPAMPALVT 263
P P + + P G + P P V +PS AP + P L
Sbjct: 2939 PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTG 2998
Query: 264 QPEQEIS 270
+S
Sbjct: 2999 HSLSRVS 3005
Score = 33.4 bits (76), Expect = 0.19
Identities = 26/103 (25%), Positives = 31/103 (30%), Gaps = 7/103 (6%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
P P + S + PP D P P + P P P
Sbjct: 2873 AKPAAPARPPVRRLARPAVSRSTESFALPP------DQPERPPQPQAPPPPQPQPQPPPP 2926
Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
P P P P P + PA PS PQP + ALV
Sbjct: 2927 PQPQPPPPPPPRPQPPLAPTTDPAGAG-EPSGAVPQPWLGALV 2968
Score = 33.4 bits (76), Expect = 0.21
Identities = 16/60 (26%), Positives = 21/60 (35%), Gaps = 2/60 (3%)
Query: 200 SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
S P P + +PP P P PS GP+ P P + P+ P A
Sbjct: 414 SVPTPAPTPVPASAPP--PPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATR 471
Score = 33.0 bits (75), Expect = 0.25
Identities = 23/103 (22%), Positives = 29/103 (28%), Gaps = 3/103 (2%)
Query: 160 PIPGPPESGEGSEGPSLSSQN-SNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
P S E P + Q P PP+ P P P P PP P
Sbjct: 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPRPQPPLAP 2945
Query: 219 QNLPGP-SISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
P +P P + + V P P P+ A
Sbjct: 2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREA 2988
Score = 31.4 bits (71), Expect = 0.67
Identities = 28/120 (23%), Positives = 34/120 (28%), Gaps = 27/120 (22%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
P GPP SLS S + LPSP PA + P+ P
Sbjct: 2775 PAAGPPRRLTRPAVASLSE--SRESLPSPWD-----------PADPPAAVLAPAAALPPA 2821
Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSP--------------STPAPQPAMPALVTQP 265
P + P P+ P P PS P A PA +P
Sbjct: 2822 ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARP 2881
Score = 31.4 bits (71), Expect = 0.71
Identities = 22/121 (18%), Positives = 32/121 (26%), Gaps = 18/121 (14%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVT-PSFPAPTSSGGTLPSPPSMP 218
P P E + D P+P R + A SS P +
Sbjct: 2631 PSPAANEPDPHPPPTVPPPERPRDD-PAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR 2689
Query: 219 QNL---------PGPSISGPLIPGPSIPSFPSPAVT-------PSSPSTPAPQPAMPALV 262
+ P P + P + + P P P+ P+ PAP
Sbjct: 2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA 2749
Query: 263 T 263
T
Sbjct: 2750 T 2750
Score = 30.7 bits (69), Expect = 1.2
Identities = 26/117 (22%), Positives = 38/117 (32%), Gaps = 17/117 (14%)
Query: 160 PIPGPPESGEGSEG--PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
P P P E S P Q++ P P ++ + P+ P+P P PP
Sbjct: 2573 PAPRPSEPAVTSRARRPDAPPQSAR---PRAPVDDRGDPRGPAPPSPLPPDTHAPDPPP- 2628
Query: 218 PQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTP-------APQPAMPALVTQPEQ 267
P PS + P+ P P P+ A + A + P Q
Sbjct: 2629 ----PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ 2681
Score = 30.7 bits (69), Expect = 1.2
Identities = 21/108 (19%), Positives = 32/108 (29%), Gaps = 6/108 (5%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
PP+ E P P PP + P P P + T P+ P
Sbjct: 2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
Query: 220 NLPGPSISGPLIPG------PSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
G L+PG +P P+S + P ++ +
Sbjct: 2957 GAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004
Score = 29.1 bits (65), Expect = 4.4
Identities = 24/120 (20%), Positives = 41/120 (34%), Gaps = 17/120 (14%)
Query: 118 TASILMDVLQTFGETSEDIEQNKKYAKWKAAYIHNCLKSGETPIPGPPES-GEGSEGPSL 176
A + L +G SE Q++ + + + H G+ P PP GEG++
Sbjct: 214 PADLTAAALHLYG-ASETYLQDEPFVERRVVISHPL--RGDIAAPAPPPVVGEGADRAPE 270
Query: 177 SSQNSNDQLPSPPRENSNEDVTPSFPAPTSSG-------GTLPSPPSMPQNLPGPSISGP 229
+++ + P P E P+ A G G + P+ P P
Sbjct: 271 TARGATGPPPPP------EAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDA 324
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 40.0 bits (94), Expect = 0.001
Identities = 19/90 (21%), Positives = 33/90 (36%), Gaps = 8/90 (8%)
Query: 177 SSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSI 236
S++ S + S P + S + PAP + P N P+++ P P++
Sbjct: 151 SAELSQNSGQSVPLDTSTTTDPATTPAPA------APVDTTPTNSQTPAVATA--PAPAV 202
Query: 237 PSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
+ V PS + PA P+
Sbjct: 203 DPQQNAVVAPSQANVDTAATPAPAAPATPD 232
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 40.4 bits (95), Expect = 0.001
Identities = 19/125 (15%), Positives = 39/125 (31%), Gaps = 2/125 (1%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPA--PTSSGGTLPS 213
P P P + G+ + + + + + + A P+ +
Sbjct: 681 PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIE 273
P P + P P+ + P P P+ + PS P+ + M + +
Sbjct: 741 LPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRD 800
Query: 274 AELIN 278
AE +
Sbjct: 801 AEEVA 805
Score = 39.2 bits (92), Expect = 0.002
Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 2/105 (1%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
P P + P+ ++ P+P + S +P + P
Sbjct: 409 APAPAAAA-PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPA-AAPSAQP 466
Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
P + P P+ P+ P+PA P++P+ PA T
Sbjct: 467 APAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAAT 511
Score = 37.7 bits (88), Expect = 0.007
Identities = 25/113 (22%), Positives = 39/113 (34%), Gaps = 3/113 (2%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
+ + P+ ++ + P+P PAP S G P+
Sbjct: 394 PAAAAPSAAAAAPAAAPAPA-AAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452
Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
+ P P + S P P+ P+ A P+ P+ PAP A PA P
Sbjct: 453 A-PSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAA-PAAPAAPAAP 503
Score = 36.9 bits (86), Expect = 0.012
Identities = 20/122 (16%), Positives = 33/122 (27%), Gaps = 7/122 (5%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
P + + + +P + TP P +
Sbjct: 670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPP-AGQADDPAAQPPQAAQGA 728
Query: 220 NLPGPSISG--PLIPGPSIPSFPSPAVTPS----SPSTPAPQPAMPALVTQPEQEISYIE 273
+ P P+ PL P P P P+ A +P+ A A P E+E +
Sbjct: 729 SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788
Query: 274 AE 275
Sbjct: 789 DA 790
Score = 36.5 bits (85), Expect = 0.019
Identities = 16/126 (12%), Positives = 30/126 (23%)
Query: 143 AKWKAAYIHNCLKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFP 202
A + + P P + + + ++ S P +
Sbjct: 602 APASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPD 661
Query: 203 APTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
A G P P+ + P+ + PA P++ A
Sbjct: 662 ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP 721
Query: 263 TQPEQE 268
Q Q
Sbjct: 722 PQAAQG 727
Score = 35.0 bits (81), Expect = 0.060
Identities = 22/120 (18%), Positives = 38/120 (31%), Gaps = 14/120 (11%)
Query: 163 GPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPS--------- 213
P + G P+ +S ++ P + PA ++ S
Sbjct: 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAA 649
Query: 214 PPSMPQNLPGPSIS-----GPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
P P+++ P S P G + P+ P PA P++P+ PA
Sbjct: 650 PEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATP 709
Score = 34.6 bits (80), Expect = 0.064
Identities = 22/106 (20%), Positives = 29/106 (27%), Gaps = 7/106 (6%)
Query: 163 GPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP 222
G G+ + S + +P P+ A + P P P P
Sbjct: 385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPA----AAAPAAAAAPAPAAA-PQPAPAPAPAP 439
Query: 223 GPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
P S P PA PS+ PAP A P
Sbjct: 440 APP-SPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEP-TAAPAPA 483
Score = 32.7 bits (75), Expect = 0.28
Identities = 18/117 (15%), Positives = 33/117 (28%), Gaps = 12/117 (10%)
Query: 155 KSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSF------------P 202
+ P E S P+ ++++ P
Sbjct: 624 PAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP 683
Query: 203 APTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
AP + P+ + Q P P+ + P + P A +S +PA +P
Sbjct: 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
Score = 27.6 bits (62), Expect = 9.9
Identities = 9/68 (13%), Positives = 17/68 (25%), Gaps = 6/68 (8%)
Query: 203 APTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
+ G P+ + P+ P+ + A +P+ PA
Sbjct: 385 LGVAGGAGAPAAAAPSAAAAAPA------AAPAPAAAAPAAAAAPAPAAAPQPAPAPAPA 438
Query: 263 TQPEQEIS 270
P
Sbjct: 439 PAPPSPAG 446
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 39.3 bits (91), Expect = 0.002
Identities = 34/123 (27%), Positives = 47/123 (38%), Gaps = 12/123 (9%)
Query: 157 GETPIPG---PPESGEGSEGP-SLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLP 212
G+ IPG P G + P S +S + S +S+ T P+ + G L
Sbjct: 546 GDAWIPGKGVPYIPGLFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLG 605
Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYI 272
SPP+ P + PS S P SP+ TPSSP + + PE I
Sbjct: 606 SPPATPSKIVSPSTSPP------ASHLGSPSTTPSSPESSIKVASTET--ASPESSIKVA 657
Query: 273 EAE 275
E
Sbjct: 658 STE 660
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 39.1 bits (91), Expect = 0.002
Identities = 29/114 (25%), Positives = 38/114 (33%), Gaps = 2/114 (1%)
Query: 154 LKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSP-PRENSNEDVTPSFPAPTSSGGTLP 212
LK +PIP PP S S + + P P SN L
Sbjct: 380 LKPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPL- 438
Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
SP + ++L P+ P P PS S + P+ P T A A P
Sbjct: 439 SPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPA 492
Score = 32.2 bits (73), Expect = 0.36
Identities = 23/108 (21%), Positives = 36/108 (33%), Gaps = 4/108 (3%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
P P + PS+SS +S +P + D AP + SP ++
Sbjct: 449 PPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAA----APPPANMRPLSPYAVYD 504
Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
+L P+ P P + + V S P A QP+
Sbjct: 505 DLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKP 552
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 38.1 bits (88), Expect = 0.005
Identities = 27/107 (25%), Positives = 40/107 (37%), Gaps = 11/107 (10%)
Query: 163 GPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP 222
SG +EGP+ + +P P + P+ SS P+ P+ P P
Sbjct: 378 ASAPSGSAAEGPASGGAAT---IP-TPGTQGPQGTAPAAGMTPSSA--APATPA-PSAAP 430
Query: 223 GPSI----SGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
P + + P P IP P+P + +SP AP A P
Sbjct: 431 SPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPP 477
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 37.3 bits (87), Expect = 0.007
Identities = 22/97 (22%), Positives = 35/97 (36%), Gaps = 5/97 (5%)
Query: 161 IPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN 220
+P P+ G PS+ + Q P NS AP +S G L +P ++
Sbjct: 194 LPARPDLSGGMGTPSVQPAPA-PQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLA-- 250
Query: 221 LPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
PG P + P+ A ++P A +
Sbjct: 251 -PGVLEGSEPTPTAPSSA-PATAPAAAAPQAAATSSS 285
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 37.2 bits (86), Expect = 0.009
Identities = 23/91 (25%), Positives = 32/91 (35%), Gaps = 12/91 (13%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
P P G SL+ S+ PR + N + PA P + P
Sbjct: 187 QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPA---------QPSAPP 237
Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSP 249
++P P I P+I + P P P P P
Sbjct: 238 ASIPAPPI-PPVIQYVAPP--PVPPPQPIIP 265
Score = 35.7 bits (82), Expect = 0.027
Identities = 28/79 (35%), Positives = 35/79 (44%), Gaps = 4/79 (5%)
Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN--LPGPSISGPLIPGPSIPSFPSP 242
P PP S AP++S P + N LPGPS + P P SIP+ P P
Sbjct: 187 QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPIP 246
Query: 243 AVTP--SSPSTPAPQPAMP 259
V + P P PQP +P
Sbjct: 247 PVIQYVAPPPVPPPQPIIP 265
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 37.1 bits (86), Expect = 0.012
Identities = 27/142 (19%), Positives = 40/142 (28%), Gaps = 20/142 (14%)
Query: 164 PPESGEGSEGPSLSSQNSNDQLPS---PPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN 220
+ P + + Q + S AP S+ +PP++ +
Sbjct: 371 GGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVD 430
Query: 221 LPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA-----LVTQPEQE----ISY 271
P P+ P PS AV P+ P +P QE +
Sbjct: 431 PPAAV---PVNP----PSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG 483
Query: 272 IEAELINKCQKYIKFASSALNY 293
E QK I F L Y
Sbjct: 484 NIKEAPTGTQKEI-FTEEDLQY 504
Score = 32.8 bits (75), Expect = 0.25
Identities = 18/111 (16%), Positives = 32/111 (28%), Gaps = 1/111 (0%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
P P S + PS S ++ Q +P P+ + + P + PQ
Sbjct: 386 PAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQ 445
Query: 220 NLPGPSISGPLIPGPS-IPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
+ S + S + P + T ++EI
Sbjct: 446 AVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQKEI 496
Score = 31.7 bits (72), Expect = 0.54
Identities = 16/93 (17%), Positives = 23/93 (24%), Gaps = 8/93 (8%)
Query: 192 NSNEDVTPSF--PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSP 249
+D + T P+ P S S + PS P A P+
Sbjct: 364 QKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGT 423
Query: 250 ST------PAPQPAMPALVTQPEQEISYIEAEL 276
PA P P + + E
Sbjct: 424 PPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEK 456
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 37.0 bits (85), Expect = 0.012
Identities = 35/116 (30%), Positives = 50/116 (43%), Gaps = 9/116 (7%)
Query: 155 KSGETPIPGPPESGEGS---EGPSLSSQNSNDQLPSPPR--ENSNEDVTPSFPAPTSSGG 209
K+ E P P GEG EG S S++ N++ S P+ + N +PS P+P +
Sbjct: 103 KTQELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNES 162
Query: 210 TLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
S S Q L P P I P + +P+ P +PS A P + QP
Sbjct: 163 D--SDSSAQQQLLQPQ-GPPSIQVPPGAA-LAPSAPPPTPSAQAVPPQGSPIAAQP 214
Score = 33.9 bits (77), Expect = 0.13
Identities = 28/122 (22%), Positives = 35/122 (28%), Gaps = 19/122 (15%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
P P P PSL Q +LPSP P ++ P PP+
Sbjct: 215 APQPQQPSPLSLISAPSLHPQ----RLPSPHPPLQ----------PQTASQQSPQPPAPS 260
Query: 219 QNLPGPSISGPLIPGPSI-----PSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIE 273
P S GP P P P+ P P A P + Q S+
Sbjct: 261 SRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTP 320
Query: 274 AE 275
Sbjct: 321 PS 322
Score = 33.5 bits (76), Expect = 0.16
Identities = 31/109 (28%), Positives = 40/109 (36%), Gaps = 5/109 (4%)
Query: 164 PPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAP--TSSGGTLPSPPSMPQNL 221
P S SQ LPS + +S+ + S P LP PSMP
Sbjct: 288 HPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMPHIK 347
Query: 222 PGPSISGPLIPGPS---IPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
P P+ P +P S P P+ P PS P PA+ L + P
Sbjct: 348 PPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLPTH 396
Score = 32.7 bits (74), Expect = 0.24
Identities = 28/110 (25%), Positives = 36/110 (32%), Gaps = 7/110 (6%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSND-QLPSPPRENSNEDVTPSFPA----PTSSGGT 210
S P P ES S Q + PP P P+ P
Sbjct: 151 SPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSPI 210
Query: 211 LPSPPSMPQNLPGPSI-SGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
P PQ S+ S P + +PS P P + P + S +PQP P
Sbjct: 211 AAQPAPQPQQPSPLSLISAPSLHPQRLPS-PHPPLQPQTASQQSPQPPAP 259
>gnl|CDD|223029 PHA03264, PHA03264, envelope glycoprotein D; Provisional.
Length = 416
Score = 36.1 bits (83), Expect = 0.021
Identities = 27/114 (23%), Positives = 33/114 (28%), Gaps = 21/114 (18%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAP----TSSGGTLPSP 214
PP GS P P R + + P T G P P
Sbjct: 262 KGYEPPPAPSGGSPAP-----------PGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEP 310
Query: 215 PSMPQNLPGPSISGPLIPGPSI------PSFPSPAVTPSSPSTPAPQPAMPALV 262
G GP P P PS + P +P+TPA A P +V
Sbjct: 311 AGRDGAAGGEPKPGPPRPAPDADRPEGWPSLEAITFPPPTPATPAVPRARPVIV 364
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 36.2 bits (84), Expect = 0.022
Identities = 16/104 (15%), Positives = 33/104 (31%), Gaps = 2/104 (1%)
Query: 165 PESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGP 224
P + + P+ + + +P + PA + P P
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP 425
Query: 225 SISGPLI-PGPSIPSFPSP-AVTPSSPSTPAPQPAMPALVTQPE 266
+ P + + P+ A+ P+ P+ AP+ + PE
Sbjct: 426 VAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPE 469
Score = 29.3 bits (66), Expect = 3.2
Identities = 19/110 (17%), Positives = 30/110 (27%), Gaps = 14/110 (12%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
P P + S + P P + PA + L P
Sbjct: 401 PAPAAAPAAAASAPAA-----PPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQA 455
Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
+I + P P++ S+ PA PA L E ++
Sbjct: 456 APETVAIPVRVAPEPAVA---------SAAPAPAAAPAAARLTPTEEGDV 496
Score = 28.5 bits (64), Expect = 5.9
Identities = 11/67 (16%), Positives = 20/67 (29%), Gaps = 2/67 (2%)
Query: 204 PTSSGGTLPSPPSMPQNLPGPSISGPL-IPGPSIPSFPSPAVTPSSPSTPAPQP-AMPAL 261
P ++ P + + + P+ A ++ + AP A PA
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP 425
Query: 262 VTQPEQE 268
V P
Sbjct: 426 VAAPAAA 432
Score = 28.1 bits (63), Expect = 6.3
Identities = 10/75 (13%), Positives = 20/75 (26%), Gaps = 2/75 (2%)
Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLI--PGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
PA + +P + P P+ + + P+ PA + +
Sbjct: 381 PARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAA 440
Query: 260 ALVTQPEQEISYIEA 274
A +A
Sbjct: 441 APAAVALAPAPPAQA 455
>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2.
Length = 466
Score = 36.1 bits (84), Expect = 0.022
Identities = 21/99 (21%), Positives = 29/99 (29%), Gaps = 7/99 (7%)
Query: 174 PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISG----- 228
S + D L + DV + S Q + S +
Sbjct: 343 HSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTEDFSTSQLVSSSSRTSTTNTT 402
Query: 229 -PLIPGPSIPSFPSPAVT-PSSPSTPAPQPAMPALVTQP 265
PL P +P + P + P SP T P P L T
Sbjct: 403 IPLSSTPDVPVYYGPDIVLPESPGTTPIVPVPPDLPTVI 441
Score = 35.0 bits (81), Expect = 0.052
Identities = 17/82 (20%), Positives = 22/82 (26%), Gaps = 14/82 (17%)
Query: 171 SEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPL 230
L + + D S +S+ T + P SS P GP
Sbjct: 371 YSDDLLLDETTEDFSTSQLVSSSSRTSTTNTTIPLSSTPD------------VPVYYGPD 418
Query: 231 IPGPSIPSFPSPAVTPSSPSTP 252
I P P P P P
Sbjct: 419 IVLPESPGTTPI--VPVPPDLP 438
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 35.9 bits (83), Expect = 0.027
Identities = 17/110 (15%), Positives = 29/110 (26%), Gaps = 8/110 (7%)
Query: 179 QNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPS 238
P + P+ AP ++ P QN P++ P+ S
Sbjct: 414 LGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRA---PSRNAQNAAQKPPMQPVMYPPNYQS 470
Query: 239 FPSPAVTPSSPSTPAPQPAMPALVTQ-----PEQEISYIEAELINKCQKY 283
P P ST + L P+ + + L +
Sbjct: 471 LPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQVLGERLFPLVEAI 520
>gnl|CDD|222843 PHA02030, PHA02030, hypothetical protein.
Length = 336
Score = 35.7 bits (82), Expect = 0.027
Identities = 20/93 (21%), Positives = 36/93 (38%), Gaps = 9/93 (9%)
Query: 168 GEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSIS 227
G S+ + LP+ P ++ + P ++ + PS+PQ +
Sbjct: 249 GGEDLIIKPKSKAAGSNLPAVPNVAADAGSAAAPAVPAAAAAVAQAAPSVPQVPNVAVL- 307
Query: 228 GPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
P +P +P P++P PA P +PA
Sbjct: 308 ------PDVP-QVAPVAAPAAPEVPA-VPVVPA 332
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 35.5 bits (82), Expect = 0.039
Identities = 24/125 (19%), Positives = 37/125 (29%), Gaps = 9/125 (7%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
+P P + + P LS Q P PP+ + P P + P P
Sbjct: 234 PQLPQQPPPLQQPQFPGLSQQ----MPPPPPQPPQQQ---QQPPQPQAQPPPQNQPTPHP 286
Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELIN 278
G + P P P P + + L Q + +S EA+
Sbjct: 287 GLPQGQNAPLPPPQQP--QLLPLVQQPQGQQRGPQFREQLVQLSQQQREALSQEEAKRAK 344
Query: 279 KCQKY 283
+ K
Sbjct: 345 RRHKI 349
Score = 33.2 bits (76), Expect = 0.17
Identities = 22/123 (17%), Positives = 32/123 (26%), Gaps = 15/123 (12%)
Query: 160 PIPGPPESGEG--SEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPS---- 213
P P PP++ E S +++ + ++ P P G P
Sbjct: 132 PEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAF 191
Query: 214 ----PPSMPQNLPGPSISGPLIPGPS-----IPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
PP P P P P P P+ P P P
Sbjct: 192 PQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGL 251
Query: 265 PEQ 267
+Q
Sbjct: 252 SQQ 254
Score = 28.6 bits (64), Expect = 6.2
Identities = 13/68 (19%), Positives = 15/68 (22%), Gaps = 1/68 (1%)
Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPS-ISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
P LP+P P P P + P P P P Q
Sbjct: 208 QGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQ 267
Query: 258 MPALVTQP 265
P
Sbjct: 268 QPPQPQAQ 275
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 34.9 bits (80), Expect = 0.048
Identities = 20/74 (27%), Positives = 29/74 (39%), Gaps = 2/74 (2%)
Query: 197 VTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIP-SFPSPAVTPSSPSTPAPQ 255
V P PTS+ PSPP+ P + + + + S + A S S P+
Sbjct: 84 VAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVPSSG 143
Query: 256 P-AMPALVTQPEQE 268
A LV E+E
Sbjct: 144 SDAASTLVVGSERE 157
Score = 33.7 bits (77), Expect = 0.097
Identities = 16/57 (28%), Positives = 25/57 (43%)
Query: 204 PTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
P + G + P + P + P P+ S P P + + P+ AV SPS + P
Sbjct: 77 PKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPE 133
>gnl|CDD|236382 PRK09111, PRK09111, DNA polymerase III subunits gamma and tau;
Validated.
Length = 598
Score = 34.5 bits (80), Expect = 0.062
Identities = 13/68 (19%), Positives = 19/68 (27%), Gaps = 1/68 (1%)
Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
P GG P P + + P+ A+ P A A PA
Sbjct: 391 PPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAA-VPDAAAAAAAPPAP 449
Query: 262 VTQPEQEI 269
P+ +
Sbjct: 450 AAAPQPAV 457
Score = 34.1 bits (79), Expect = 0.089
Identities = 10/84 (11%), Positives = 21/84 (25%), Gaps = 2/84 (2%)
Query: 183 DQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP 242
+ P P + P + GP+ + +P + + P
Sbjct: 388 QEGPPSPGGGGGGPPGGGGAPGAPAAAAAPGAAAAAPAAGGPAAALAAVPDAAAAAAAPP 447
Query: 243 AVTPSSPSTPAPQ--PAMPALVTQ 264
A + + AL +
Sbjct: 448 APAAAPQPAVRLNSFEDIVALAEE 471
>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter. The Epithelial Na+
Channel (ENaC) Family (TC 1.A.06)The ENaC family
consists of sodium channels from animals and has no
recognizable homologues in other eukaryotes or bacteria.
The vertebrate ENaC proteins from epithelial cells
cluster tightly together on the phylogenetic tree:
voltage-insensitive ENaC homologues are also found in
the brain. Eleven sequenced C. elegans proteins,
including the degenerins, are distantly related to the
vertebrate proteins as well as to each other. At least
some ofthese proteins form part of a mechano-transducing
complex for touch sensitivity. Other members of the ENaC
family, the acid-sensing ion channels, ASIC1-3,are homo-
or hetero-oligomeric neuronal H+-gated channels that
mediate pain sensation in response to tissue acidosis.
The homologous Helix aspersa(FMRF-amide)-activated Na+
channel is the first peptide neurotransmitter-gated
ionotropic receptor to be sequenced.Mammalian ENaC is
important for the maintenance of Na+ balance and the
regulation of blood pressure. Three homologous ENaC
subunits, a, b and g, havebeen shown to assemble to form
the highly Na+-selective channel.This model is designed
from the vertebrate members of the ENaC family
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 595
Score = 34.3 bits (79), Expect = 0.072
Identities = 15/53 (28%), Positives = 20/53 (37%)
Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSF 239
R + A T L PP+ P LP P SG +PG P++
Sbjct: 540 QRRRGPPYAEPPEPVSADTPPSLQLDDPPTFPSALPLPHASGLSLPGTPPPNY 592
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 33.3 bits (76), Expect = 0.072
Identities = 17/66 (25%), Positives = 24/66 (36%), Gaps = 8/66 (12%)
Query: 200 SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
++ G S P P P P++PS P+P + PAP A
Sbjct: 33 TYEVEAKGLGIDLSAVQEQVPTPAP------APAPAVPSAPTP--VAPAAPAPAPASAGE 84
Query: 260 ALVTQP 265
+VT P
Sbjct: 85 NVVTAP 90
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 34.4 bits (79), Expect = 0.081
Identities = 17/107 (15%), Positives = 30/107 (28%), Gaps = 2/107 (1%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP--Q 219
P PP + + + + + + +S D + P S + P+ + P
Sbjct: 430 PAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA 489
Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
P P + P P+ S PA P
Sbjct: 490 FEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPT 536
Score = 33.3 bits (76), Expect = 0.19
Identities = 18/66 (27%), Positives = 25/66 (37%), Gaps = 2/66 (3%)
Query: 202 PAPTSSGGTL-PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
PA T G P + +P P G S AVT ++ + AP+ A A
Sbjct: 360 PAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGAS-AVPAVTAVTGAAGAALAPKAAAAA 418
Query: 261 LVTQPE 266
T+ E
Sbjct: 419 AATRAE 424
Score = 29.8 bits (67), Expect = 2.2
Identities = 15/80 (18%), Positives = 26/80 (32%), Gaps = 8/80 (10%)
Query: 189 PRENSNEDVTPSFPAPTSSGG------TLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP 242
+ + P+ PAP ++ +P N + S P+
Sbjct: 417 AAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGS 476
Query: 243 AVTPSSPSTPAP--QPAMPA 260
A P+S + P +PA A
Sbjct: 477 ASAPASDAPPDAAFEPAPRA 496
Score = 27.9 bits (62), Expect = 9.6
Identities = 11/83 (13%), Positives = 21/83 (25%), Gaps = 1/83 (1%)
Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGP-SIPSFPSPAVT 245
+P + + + + S P G S P+ +P
Sbjct: 429 APAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDA 488
Query: 246 PSSPSTPAPQPAMPALVTQPEQE 268
P+ A P+ P+
Sbjct: 489 AFEPAPRAAAPSAATPAAVPDAR 511
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 33.4 bits (77), Expect = 0.092
Identities = 26/115 (22%), Positives = 37/115 (32%), Gaps = 13/115 (11%)
Query: 161 IPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN 220
IP P+ G+ E + + LP+ P E + E V A S +PP+ P
Sbjct: 41 IPLVPKPGDRDEPDMMPAATQ--ALPTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVE 98
Query: 221 LPGPSISGPLI---PGPSIPSFPSPAVTPSSPSTPAPQPAMP--------ALVTQ 264
+ P P P V P P+P + A V Q
Sbjct: 99 PEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAAPTGKAYVVQ 153
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 33.9 bits (78), Expect = 0.099
Identities = 16/66 (24%), Positives = 25/66 (37%), Gaps = 2/66 (3%)
Query: 198 TPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFP--SPAVTPSSPSTPAPQ 255
+ + + +SS + P + P+ + L+ PSI P S PS PA
Sbjct: 12 SHASSSSSSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAAT 71
Query: 256 PAMPAL 261
P L
Sbjct: 72 TQTPQL 77
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 33.6 bits (77), Expect = 0.12
Identities = 16/95 (16%), Positives = 23/95 (24%), Gaps = 4/95 (4%)
Query: 183 DQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP 242
PSP R P A + P + P P +P +
Sbjct: 373 AAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLT 432
Query: 243 AVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELI 277
P P P E+E + I +
Sbjct: 433 RAAIPVDEKPKYTPPAP----PKEEEKALIADGDV 463
Score = 33.6 bits (77), Expect = 0.14
Identities = 21/89 (23%), Positives = 30/89 (33%), Gaps = 22/89 (24%)
Query: 201 FPAPTSSGGTLP----------SPPSMPQ-NLPGPSISGPLIPGPSIPSFPSPAVT---- 245
F T+S G LP P PQ P + P+ P P+ + P A
Sbjct: 339 FQLRTTSYGQLPLELAVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIP 398
Query: 246 -------PSSPSTPAPQPAMPALVTQPEQ 267
++P P+P P + PE
Sbjct: 399 PKEPVRETATPPPVPPRPVAPPVPHTPES 427
Score = 30.5 bits (69), Expect = 1.2
Identities = 22/116 (18%), Positives = 34/116 (29%), Gaps = 7/116 (6%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
P+P P + + PS +N + P P + P
Sbjct: 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPP 420
Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEA 274
+P S P + +IP P TP PAP + + +EA
Sbjct: 421 --VPHTPESAPKLTRAAIPVDEKPKYTP-----PAPPKEEEKALIADGDVLEQLEA 469
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 32.0 bits (73), Expect = 0.12
Identities = 17/60 (28%), Positives = 20/60 (33%)
Query: 200 SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
S +P S PP P P+ +P PA P P P P PA P
Sbjct: 27 SCFSPPESAHPDDPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGP 86
Score = 31.6 bits (72), Expect = 0.17
Identities = 17/71 (23%), Positives = 22/71 (30%), Gaps = 6/71 (8%)
Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTP 246
PP P P S+ LP P P P P P+ P + P
Sbjct: 39 DPPPVGDPRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPE------PPVPGPAGPPSPLAP 92
Query: 247 SSPSTPAPQPA 257
+P+ P P
Sbjct: 93 PAPARKPPLPP 103
Score = 27.7 bits (62), Expect = 4.1
Identities = 17/75 (22%), Positives = 25/75 (33%), Gaps = 8/75 (10%)
Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSP-----AVTPSSPSTPA 253
+ P G P + P P + ++P PS P+ P P P+ P +P
Sbjct: 34 SAHPDDPPPVGD---PRPPVVDTPPPVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSPL 90
Query: 254 PQPAMPALVTQPEQE 268
PA P
Sbjct: 91 APPAPARKPPLPPPR 105
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 33.7 bits (77), Expect = 0.15
Identities = 15/99 (15%), Positives = 34/99 (34%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
GP + + + P+ E++ P F +P + +
Sbjct: 464 PAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVA 523
Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
+++P P+ + P ++ P+ A P + + P A
Sbjct: 524 ESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA 562
Score = 32.5 bits (74), Expect = 0.28
Identities = 20/104 (19%), Positives = 36/104 (34%), Gaps = 4/104 (3%)
Query: 159 TPIPGPPESGEGSEGPSLS---SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
P P P + + + + + + SP E S P + P+P
Sbjct: 396 APAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPA 455
Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
+ P P+ +GP + + P+ A P++ PA P
Sbjct: 456 AAPAAAARPAAAGPRPVAAAAAAAPARA-APAAAPAPADDDPPP 498
Score = 31.4 bits (71), Expect = 0.78
Identities = 17/113 (15%), Positives = 35/113 (30%), Gaps = 1/113 (0%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
+ P P + + ++ + + R + S PAP + +
Sbjct: 383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRS-PAPEALAAARQASA 441
Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
P P P+ + P + + ++ + AP A PA P +
Sbjct: 442 RGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADD 494
Score = 30.2 bits (68), Expect = 1.6
Identities = 19/93 (20%), Positives = 30/93 (32%), Gaps = 12/93 (12%)
Query: 178 SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIP 237
Q+ P+ P+ AP ++ +PP+ P P+ + + P
Sbjct: 366 GQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPA--AAPAAAAAARAVAAAP 423
Query: 238 SFPSPAVTP----------SSPSTPAPQPAMPA 260
+ SPA PAP PA A
Sbjct: 424 ARRSPAPEALAAARQASARGPGGAPAPAPAPAA 456
Score = 29.8 bits (67), Expect = 1.9
Identities = 17/113 (15%), Positives = 32/113 (28%), Gaps = 2/113 (1%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
G + P+ ++ + P+ P + A + P
Sbjct: 369 GGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP 428
Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPS-SPSTPAPQPAMPALVTQPEQ 267
+ P+ L + PG + P+PA P+ + A P A
Sbjct: 429 A-PEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAP 480
Score = 29.5 bits (66), Expect = 2.6
Identities = 17/87 (19%), Positives = 24/87 (27%), Gaps = 7/87 (8%)
Query: 186 PSPPRENSNEDVTP---SFPAPTSSGGTLPSPP--SMPQNL--PGPSISGPLIPGPSIPS 238
+ PR + + PA + PP +P P P+ G S
Sbjct: 466 AAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAES 525
Query: 239 FPSPAVTPSSPSTPAPQPAMPALVTQP 265
P PA + PA A
Sbjct: 526 IPDPATADPDDAFETLAPAPAAAPAPR 552
Score = 28.7 bits (64), Expect = 4.4
Identities = 17/111 (15%), Positives = 32/111 (28%), Gaps = 9/111 (8%)
Query: 159 TPIPGPPESGEGSEG-PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
P + G + ++ P+ +++D P P SP
Sbjct: 457 APAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELP----PEFASPAPA 512
Query: 218 PQNLPGPSISGPLIPGPSI----PSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
+ IP P+ +F + A P++ P A +V
Sbjct: 513 QPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
Score = 28.3 bits (63), Expect = 6.7
Identities = 15/70 (21%), Positives = 25/70 (35%), Gaps = 1/70 (1%)
Query: 200 SFPAPTSSGGTLPSPP-SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
+F S GG P+ + P P P+ + P P+ + P+ + + A A
Sbjct: 362 AFRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAA 421
Query: 259 PALVTQPEQE 268
P E
Sbjct: 422 APARRSPAPE 431
Score = 27.9 bits (62), Expect = 8.8
Identities = 11/95 (11%), Positives = 23/95 (24%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
+ + P+ ++LP + + + P+
Sbjct: 477 AAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDD 536
Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
+ + P P + P V P P A
Sbjct: 537 AFETLAPAPAAAPAPRAAAATEPVVAPRPPRASAS 571
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 33.4 bits (76), Expect = 0.15
Identities = 20/78 (25%), Positives = 29/78 (37%), Gaps = 1/78 (1%)
Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
PR ++ + TP+ TS T PS P + P G P P
Sbjct: 209 RPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTT-PEAEGTPAPPTPGGGEA 267
Query: 246 PSSPSTPAPQPAMPALVT 263
P + +TPAP+ + L
Sbjct: 268 PPANATPAPEASRYELTV 285
Score = 31.8 bits (72), Expect = 0.47
Identities = 31/119 (26%), Positives = 41/119 (34%), Gaps = 7/119 (5%)
Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLP---SP 214
P G + EG + S + S PR + P+ P PT P
Sbjct: 165 AFPAEGTLAAPPLGEGSADGSCDP-ALPLSAPRLGPADVFVPATPRPTPRTTASPETTPT 223
Query: 215 PSMPQNLPGPSISGP--LIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISY 271
PS + P +I P I P + P TP+ P TP A PA T + Y
Sbjct: 224 PSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPP-TPGGGEAPPANATPAPEASRY 281
>gnl|CDD|219406 pfam07420, DUF1509, Protein of unknown function (DUF1509). This
family consists of several uncharacterized viral
proteins from the Marek's disease-like viruses. Members
of this family are typically around 400 residues in
length. The function of this family is unknown.
Length = 377
Score = 33.1 bits (75), Expect = 0.17
Identities = 26/99 (26%), Positives = 33/99 (33%), Gaps = 22/99 (22%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
P PGP P+LS+ D P T+ G PSPP
Sbjct: 201 PQPGPSGQNRSPRTPTLSNVRVLDA-----------------PVATNRGEA-PSPPRTDT 242
Query: 220 NLPGPSISGPLIPG----PSIPSFPSPAVTPSSPSTPAP 254
P P+I+GP PS P P + + P P
Sbjct: 243 LDPDPAIAGPSRAVNRTPSPRPSSPPPEIDEEYNAQPDP 281
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 33.1 bits (75), Expect = 0.18
Identities = 23/119 (19%), Positives = 45/119 (37%), Gaps = 3/119 (2%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENS-NEDVTPSFPAPTSSGG--TLP 212
+ +TP+ + SEG + ++ + PP E++ +E T + A TS+ T
Sbjct: 402 TAQTPVQTASAAAMPSEGKTAGPVSNQENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTAS 461
Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISY 271
+ P+N + + + PS ++P+ A + A E Y
Sbjct: 462 EAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEPFYGY 520
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 33.0 bits (75), Expect = 0.19
Identities = 22/116 (18%), Positives = 33/116 (28%), Gaps = 9/116 (7%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENS------NEDVTPSFPAPTSSGGTLP 212
PI P + S S+ E++ + D S + T + +
Sbjct: 87 APITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSADCNDSEQSSTPNLNSSD 146
Query: 213 SPPSMPQNLPGPSISGPLIPGP---SIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
+ S LP S+ P S S PS + P A A P
Sbjct: 147 TSTSSSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKSAEPTAAPQAKPELP 202
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 32.7 bits (75), Expect = 0.20
Identities = 23/112 (20%), Positives = 34/112 (30%), Gaps = 6/112 (5%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLP----SPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
E + + P + PR+ + P + P+P +
Sbjct: 70 RVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPV 129
Query: 218 PQNLPGPSISGPLIPG--PSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
Q P PL P + P P + P+ A QPA P QPE
Sbjct: 130 QQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEP 181
Score = 30.8 bits (70), Expect = 0.93
Identities = 22/96 (22%), Positives = 28/96 (29%), Gaps = 13/96 (13%)
Query: 186 PSPPRENSNEDVTPSF---PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPG-------PS 235
P+ +E+ +P P S+ P +P P P
Sbjct: 75 PANAQEHEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAY 134
Query: 236 IPSFPSPAVTPSSPS-TPAPQPAM--PALVTQPEQE 268
P P P SP PAPQP P Q Q
Sbjct: 135 QPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQP 170
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 32.6 bits (74), Expect = 0.20
Identities = 16/62 (25%), Positives = 22/62 (35%)
Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
P+ P P S P+ P P P+ P + P P P++P P P
Sbjct: 41 PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNA 100
Query: 259 PA 260
P
Sbjct: 101 PP 102
Score = 32.6 bits (74), Expect = 0.23
Identities = 15/67 (22%), Positives = 20/67 (29%)
Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
PAP + + P P P +P P+ P+ P P P P
Sbjct: 41 PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNA 100
Query: 262 VTQPEQE 268
P E
Sbjct: 101 PPPPAPE 107
Score = 29.9 bits (67), Expect = 1.7
Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 3/71 (4%)
Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
P+PP + PA P+ P PQ P + P P + P P V
Sbjct: 41 PAPPPPPPSTAAAAPAPAA-PPPPPPPAAPPAPQ--PDDPNAAPPPPPADPNAPPPPPVD 97
Query: 246 PSSPSTPAPQP 256
P++P PAP+P
Sbjct: 98 PNAPPPPAPEP 108
Score = 29.1 bits (65), Expect = 2.8
Identities = 20/92 (21%), Positives = 33/92 (35%), Gaps = 5/92 (5%)
Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFP---- 240
+ P N++ P P+ ++ +PP P P+ P P + P P
Sbjct: 31 IALPATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPA-PQPDDPNAAPPPPPADPN 89
Query: 241 SPAVTPSSPSTPAPQPAMPALVTQPEQEISYI 272
+P P P+ P P P + SY+
Sbjct: 90 APPPPPVDPNAPPPPAPEPGRIDNAVGGFSYV 121
>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 211
Score = 32.5 bits (74), Expect = 0.21
Identities = 18/83 (21%), Positives = 29/83 (34%), Gaps = 9/83 (10%)
Query: 189 PRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTP-- 246
PR ++ P +P S+ +P P PS P P S + P
Sbjct: 90 PRRTEPQEPDPLDESPASAAPVASAPA------PAPSPQSP-KPASRRASRDMRRIAPFG 142
Query: 247 SSPSTPAPQPAMPALVTQPEQEI 269
+ S PA + A + + E+
Sbjct: 143 MNASAPAQEAAQASSDEDEDAEL 165
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 32.9 bits (76), Expect = 0.23
Identities = 15/61 (24%), Positives = 22/61 (36%), Gaps = 1/61 (1%)
Query: 210 TLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
LP P PQ+ P+ S P+ P A P APQ A + + ++
Sbjct: 364 PLPEPEVPPQSAA-PAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQL 422
Query: 270 S 270
Sbjct: 423 L 423
Score = 30.2 bits (69), Expect = 1.6
Identities = 14/109 (12%), Positives = 30/109 (27%), Gaps = 3/109 (2%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
P+P P + + P+ S+Q + + + P AP + +
Sbjct: 363 APLPEPEVPPQSAA-PAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQ 421
Query: 219 QNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
L G + PA + + + ++ P
Sbjct: 422 --LLAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSA 468
Score = 29.8 bits (68), Expect = 2.3
Identities = 10/63 (15%), Positives = 18/63 (28%)
Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
P+ P P + P+ + P P + +P+ P P+
Sbjct: 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTS 420
Query: 259 PAL 261
L
Sbjct: 421 QLL 423
>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
Length = 292
Score = 32.4 bits (73), Expect = 0.23
Identities = 16/81 (19%), Positives = 30/81 (37%), Gaps = 4/81 (4%)
Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
+PP + + PS P T+ P + P S+P+ + T
Sbjct: 113 TAPPNDVATP--VPSVPEATAEAAAKAGPQDDEADGEQAD-EAPAHNPESVPTRAARETT 169
Query: 246 PSSPSTPA-PQPAMPALVTQP 265
++ T PQ + ++T+P
Sbjct: 170 EANRPTATPPQSSSALVITKP 190
Score = 28.9 bits (64), Expect = 3.6
Identities = 17/77 (22%), Positives = 27/77 (35%), Gaps = 5/77 (6%)
Query: 159 TPIPGPPESG--EGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPS 216
TP+P PE+ ++ + +Q P N T + T + +PP
Sbjct: 121 TPVPSVPEATAEAAAKAGPQDDEADGEQADEAPAHNPESVPTRAARETTEANRPTATPPQ 180
Query: 217 MPQNLPGPSISGPLIPG 233
L I+ PL G
Sbjct: 181 SSSAL---VITKPLRSG 194
>gnl|CDD|216078 pfam00716, Peptidase_S21, Assemblin (Peptidase family S21).
Length = 326
Score = 32.4 bits (74), Expect = 0.25
Identities = 21/98 (21%), Positives = 36/98 (36%), Gaps = 6/98 (6%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
PG S + + S S Q + P + ++ S P + G + +P +
Sbjct: 235 PGTAPSFDATPSVSPSGQPLSPAAPPGTSSVAGTALSAS---PAALFGDMVY---VPLDA 288
Query: 222 PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
++G P P P+P + P+ P PA P
Sbjct: 289 YNQLLAGQAFNQPPDPQGPAPPAELAPPAPAPPPPANP 326
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 32.4 bits (73), Expect = 0.27
Identities = 18/95 (18%), Positives = 30/95 (31%), Gaps = 11/95 (11%)
Query: 165 PESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL-PG 223
P + E S + S P +E + PAP + ++ P
Sbjct: 80 PNTNEEKTSASKIEKISQ-----PKQEEQKSLNISATPAPKQE-----QSQTTTESTTPK 129
Query: 224 PSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
++ P P + + TP SP+ Q M
Sbjct: 130 TKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDM 164
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 31.7 bits (72), Expect = 0.28
Identities = 16/50 (32%), Positives = 20/50 (40%), Gaps = 3/50 (6%)
Query: 212 PSPPSMPQNLPGPSISGPLI-PGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
P P M + + PL P + PS+P P PQP MPA
Sbjct: 109 PQRPVMTRATSTTVQTTPLPQPPSTAPSYPGP--QYQGYHPMPPQPGMPA 156
Score = 30.2 bits (68), Expect = 1.1
Identities = 19/69 (27%), Positives = 25/69 (36%), Gaps = 2/69 (2%)
Query: 192 NSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPST 251
+T + + LP PPS + PGP G P P P P+P + P
Sbjct: 109 PQRPVMTRATSTTVQTT-PLPQPPSTAPSYPGPQYQG-YHPMPPQPGMPAPPYSLQYPPP 166
Query: 252 PAPQPAMPA 260
QP P
Sbjct: 167 GLLQPQGPP 175
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 32.4 bits (74), Expect = 0.28
Identities = 24/141 (17%), Positives = 37/141 (26%), Gaps = 21/141 (14%)
Query: 131 ETSEDIEQNKKYAKWKAAYIHNCLKSGET-----PIPGPPESGEGSEGPSLSSQNSNDQL 185
E+ E A+ S T + P E+G S G +S
Sbjct: 19 ESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDLYPPRETG--SGGGVATSTI----- 71
Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
+ PR + T P + LP PGP+ P + T
Sbjct: 72 YTVPRPPRGPEQTLDKPDSLPASRELP---------PGPTPVPPGGFRGASSPRLGADST 122
Query: 246 PSSPSTPAPQPAMPALVTQPE 266
P + A + +
Sbjct: 123 SPRFLYQVNFPVILAPIGESN 143
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 32.6 bits (75), Expect = 0.29
Identities = 10/83 (12%), Positives = 21/83 (25%), Gaps = 1/83 (1%)
Query: 195 EDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
A ++ +P + P + + + + P PA ++
Sbjct: 39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAA-AAPA 97
Query: 255 QPAMPALVTQPEQEISYIEAELI 277
P A P E +
Sbjct: 98 APPAAAAAAAPAAAAVEDEVTPL 120
Score = 30.6 bits (70), Expect = 1.2
Identities = 12/80 (15%), Positives = 25/80 (31%)
Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
+ + + AP ++ P + + P P + + +PA
Sbjct: 40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAP 99
Query: 246 PSSPSTPAPQPAMPALVTQP 265
P++ + AP A P
Sbjct: 100 PAAAAAAAPAAAAVEDEVTP 119
Score = 30.2 bits (69), Expect = 1.8
Identities = 8/60 (13%), Positives = 15/60 (25%)
Query: 201 FPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPA 260
F P+ + + + P+P + + P P A A
Sbjct: 33 FADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA 92
Score = 27.9 bits (63), Expect = 8.1
Identities = 16/102 (15%), Positives = 33/102 (32%), Gaps = 22/102 (21%)
Query: 163 GPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP 222
GP + + + ++ ++ +P + P+ PAP P+ P
Sbjct: 37 GPGSTAAPTAAAAAAAAAASAPAAAPAAK------APAAPAPAPPAAAAPAAP------- 83
Query: 223 GPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
P P+ + + A + A PA A+ +
Sbjct: 84 ---------PKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 32.7 bits (74), Expect = 0.31
Identities = 22/81 (27%), Positives = 32/81 (39%), Gaps = 4/81 (4%)
Query: 178 SQNSNDQLPSPPRENSNEDVTP-SFPAPTSSGGTLPSPPS---MPQNLPGPSISGPLIPG 233
+N +P PP+ + P S P + SGG +PSPP+ P S +
Sbjct: 419 QVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQMSQSPASQRTIQ 478
Query: 234 PSIPSFPSPAVTPSSPSTPAP 254
+ S P TP S +P
Sbjct: 479 QDMVSPGGPLNTPGQSSVNSP 499
Score = 31.1 bits (70), Expect = 0.98
Identities = 18/62 (29%), Positives = 25/62 (40%), Gaps = 4/62 (6%)
Query: 213 SPPSMPQNLPGPSISG----PLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
P M P P + P P PS+PS P P + P+ PAL+ P +
Sbjct: 407 QPGMMSSPSPVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQSVSGGMIPSPPALMPSPSPQ 466
Query: 269 IS 270
+S
Sbjct: 467 MS 468
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 32.1 bits (73), Expect = 0.36
Identities = 15/68 (22%), Positives = 19/68 (27%), Gaps = 8/68 (11%)
Query: 205 TSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPST------PAPQPAM 258
S G PP + P P + P P P +PA
Sbjct: 89 DPSAGEPAPPPPHARRTSEPE--LPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAY 146
Query: 259 PALVTQPE 266
PA +PE
Sbjct: 147 PAYQQRPE 154
Score = 31.7 bits (72), Expect = 0.50
Identities = 23/113 (20%), Positives = 29/113 (25%), Gaps = 9/113 (7%)
Query: 154 LKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPS-PPRENSNEDVTPSFPAPTSSGGTLP 212
+ G P + + + P R + P + P P
Sbjct: 170 QRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRP 229
Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
PP P GP P V P PS P P A PA P
Sbjct: 230 EPP--------PGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGP 274
Score = 30.9 bits (70), Expect = 1.0
Identities = 26/112 (23%), Positives = 36/112 (32%), Gaps = 10/112 (8%)
Query: 161 IPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSS--GGTLPSPPSMP 218
I P +GE + P + + S +LP P R + P + LP +P
Sbjct: 86 ITVDPSAGEPAPPPPHARRTSEPELPRPGRRPY-----EGYGGPRADDRPPGLPRQDQLP 140
Query: 219 QNLPG-PSISGPLIPGP-SIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
P P+ PG + P A PA PEQE
Sbjct: 141 TARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPA-SYAPEQE 191
Score = 30.2 bits (68), Expect = 1.5
Identities = 21/122 (17%), Positives = 37/122 (30%), Gaps = 17/122 (13%)
Query: 156 SGETPIPGP-PESGEGSEGPS--------LSSQNSNDQLPSPPRENSNEDVTPSFPAPTS 206
+GE P P E P ++D+ P PR++ P++PA
Sbjct: 92 AGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQ- 150
Query: 207 SGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
P P + +P +P+S + + P +PE
Sbjct: 151 -------QRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPE 203
Query: 267 QE 268
+
Sbjct: 204 YD 205
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 32.1 bits (73), Expect = 0.36
Identities = 16/78 (20%), Positives = 22/78 (28%), Gaps = 5/78 (6%)
Query: 196 DVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSF-----PSPAVTPSSPS 250
V P PAP L ++L + G +P F PA P+ +
Sbjct: 352 AVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPEPARSA 411
Query: 251 TPAPQPAMPALVTQPEQE 268
P A A
Sbjct: 412 EAPPLVAPAAAPAGLALR 429
Score = 30.6 bits (69), Expect = 1.3
Identities = 19/108 (17%), Positives = 30/108 (27%), Gaps = 13/108 (12%)
Query: 148 AYIHNCLKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSS 207
A +H L G P G + P+ +Q N + S+
Sbjct: 334 ALLHALLALGGAPSEGVAAVAPPAPAPADLTQRLNRL-------------EKEVRSLRSA 380
Query: 208 GGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQ 255
+ P P GP P P+ + P V P++
Sbjct: 381 PTAAATAAGAPLPDFDPRPRGPPAPEPARSAEAPPLVAPAAAPAGLAL 428
Score = 28.3 bits (63), Expect = 6.6
Identities = 17/83 (20%), Positives = 25/83 (30%), Gaps = 2/83 (2%)
Query: 168 GEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFP--APTSSGGTLPSPPSMPQNLPGPS 225
EG + + D R S P A T++G LP P+ P P
Sbjct: 347 SEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAGAPLPDFDPRPRGPPAPE 406
Query: 226 ISGPLIPGPSIPSFPSPAVTPSS 248
+ P + +PA
Sbjct: 407 PARSAEAPPLVAPAAAPAGLALR 429
>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein. SOG2 proteins in
Saccharomyces cerevisiae are involved in cell separation
and cytokinesis.
Length = 419
Score = 32.0 bits (73), Expect = 0.41
Identities = 21/113 (18%), Positives = 34/113 (30%), Gaps = 5/113 (4%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQN-----SNDQLPSPPRENSNEDVTPSFPAPTSSGGT 210
+ + P + S SS +D+L + S P S+ T
Sbjct: 144 AWSSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTT 203
Query: 211 LPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
L SP ++ P S S + SS +TP + + T
Sbjct: 204 LESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRSTPT 256
Score = 30.1 bits (68), Expect = 1.6
Identities = 23/89 (25%), Positives = 29/89 (32%), Gaps = 2/89 (2%)
Query: 171 SEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPL 230
SS S P+ SN VT P P S+G + S S NL S
Sbjct: 187 RGRRPSSSPRSLSN-PTTLESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATP 245
Query: 231 IPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
G S S P+ + + S A
Sbjct: 246 RSGESFRSTPT-SGSSSINPVSGLDEAEE 273
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 32.0 bits (73), Expect = 0.44
Identities = 20/86 (23%), Positives = 26/86 (30%), Gaps = 15/86 (17%)
Query: 184 QLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPG---------- 233
QLP P + P P LP P PSI P P
Sbjct: 358 QLPPPTK-----RRLRLLPVPPPKVQALPLTALAPLVRHSPSIPLPHPPSALPSHVGASS 412
Query: 234 PSIPSFPSPAVTPSSPSTPAPQPAMP 259
P + S+P+P P++P
Sbjct: 413 SKHHRLPPSVLPGPRLSSPSPSPSLP 438
Score = 28.9 bits (65), Expect = 4.1
Identities = 19/82 (23%), Positives = 30/82 (36%), Gaps = 7/82 (8%)
Query: 179 QNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQN-------LPGPSISGPLI 231
+N LPSP + + P + P P+ ++ LP PS L+
Sbjct: 222 ENQLHPLPSPQPVRHSPGILGPSPLHPHTTRPSPPRPAFSRSPSSPLSPLPRPSTRRGLL 281
Query: 232 PGPSIPSFPSPAVTPSSPSTPA 253
P P +P + P + S P
Sbjct: 282 PNPRLPRASRGHLPPPTSSAPP 303
Score = 27.8 bits (62), Expect = 8.4
Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 1/59 (1%)
Query: 198 TPSFPAPTSSG-GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQ 255
+ P PT LP PP Q LP +++ + PSIP P+ PS + +
Sbjct: 356 SSQLPPPTKRRLRLLPVPPPKVQALPLTALAPLVRHSPSIPLPHPPSALPSHVGASSSK 414
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 32.1 bits (73), Expect = 0.45
Identities = 22/152 (14%), Positives = 37/152 (24%), Gaps = 28/152 (18%)
Query: 114 KAFYTASILMDVLQTFGETSEDIEQNKKYAKWKAAYIHNCLKSGETPIPGP-PESGEGSE 172
+ F A++ ++ F + Y ++G +PGP P
Sbjct: 620 RTFGRATVGEMIISGFPPVFKTALPRPDYN--------RGGEAGGPGVPGPVPVGMPAHT 671
Query: 173 GPSLSSQ-------NSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPS 225
++ P+ P P GG LP PP +
Sbjct: 672 ARPSRVARGDPVRPTAHHAALRAPQ-------APRPGGPPGGGGGLPPPP-----DLPAA 719
Query: 226 ISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
I S +P + A
Sbjct: 720 AGPAPCGSSLIASPTAPPEPEPPGAEQADGAE 751
Score = 30.5 bits (69), Expect = 1.4
Identities = 25/122 (20%), Positives = 34/122 (27%), Gaps = 12/122 (9%)
Query: 156 SGETPIPG-PPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTL--- 211
GE I G PP P + P + P+ + G +
Sbjct: 627 VGEMIISGFPPVFKTALPRPDYNRGGEAGGPGVPGPVPVGMPAHTARPSRVARGDPVRPT 686
Query: 212 -----PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
P P+ GP G +P P P P+ A S+ P P P
Sbjct: 687 AHHAALRAPQAPRP-GGPPGGGGGLPPP--PDLPAAAGPAPCGSSLIASPTAPPEPEPPG 743
Query: 267 QE 268
E
Sbjct: 744 AE 745
>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
secretion system, contains a FHA domain [Intracellular
trafficking, secretion, and vesicular transport;
Signal transduction mechanisms].
Length = 430
Score = 31.7 bits (72), Expect = 0.46
Identities = 21/111 (18%), Positives = 29/111 (26%), Gaps = 9/111 (8%)
Query: 161 IPGPPESGEGSEGPS-LSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
I P G P L+ N P +E + G PS +
Sbjct: 136 IEPEPRLPPGLTDPLQLADPVPNGFSADPLSALQSESLIA----QPDPAGGAPSIS---R 188
Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEIS 270
N P S P G FP P ++P P + +
Sbjct: 189 NSEAPL-SDPASAGGIDTPFPDDLDVPPIAASPPGPQEGPTISASQPAQRD 238
>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
factor. RNA polymerase I-specific
transcription-initiation factor Rrn6 and Rrn7 represent
components of a multisubunit transcription factor
essential for the initiation of rDNA transcription by
Pol I. These proteins are found in fungi.
Length = 753
Score = 32.0 bits (73), Expect = 0.47
Identities = 18/99 (18%), Positives = 29/99 (29%), Gaps = 10/99 (10%)
Query: 174 PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPT-----SSGGTLPSPPSMPQNLPGPSISG 228
P + +D + P + + PT T S+ + P P
Sbjct: 639 PGDDPSDVDDSQATQPDVTDSSQLESQSQIPTIRSSQQVSQTRKGGSSVVPSAPAP---- 694
Query: 229 PLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
L P+ S + P S S +P +Q E
Sbjct: 695 RLAQSSQPPTSQSSSDLPPSSSQAFSLSDLPMQ-SQSES 732
>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
Length = 694
Score = 31.8 bits (72), Expect = 0.49
Identities = 19/97 (19%), Positives = 30/97 (30%), Gaps = 16/97 (16%)
Query: 164 PPESG-EGSEGPSLSSQNSNDQLP-----SPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
P G EG + + LP E S++ V P A P S+
Sbjct: 317 PDTEGREGDDLAPTAVCTDAGLLPDTPLLPDAPEGSSDPVVPVAAAT-------PVDASL 369
Query: 218 PQNLPGPSISGPLIP--GPSIPSFPSPAVTPSSPSTP 252
P G + + + P+ A T + + P
Sbjct: 370 PDVRTGTAPTSLANVSHADPAVAQPTQAATL-AGAAP 405
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 32.0 bits (72), Expect = 0.50
Identities = 19/106 (17%), Positives = 27/106 (25%), Gaps = 5/106 (4%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
P P G + + + P + P P P
Sbjct: 701 PTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA 760
Query: 222 PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
PG P + P P+P P +P P +P P Q
Sbjct: 761 PGR-----ARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQ 801
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.9 bits (73), Expect = 0.51
Identities = 22/111 (19%), Positives = 32/111 (28%), Gaps = 13/111 (11%)
Query: 176 LSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPS 235
L SQ+ + + P P S P +PQ P P P+
Sbjct: 510 LESQSGSASNTAKTP-----------PPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPT 558
Query: 236 IPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELINKCQKYIKF 286
S A + P P +P T + S E E+ + F
Sbjct: 559 ATQASSNAPAQIPADSS-PPPPIPEEPTPSPTKDSSPE-EIDKAAKNLADF 607
Score = 30.3 bits (69), Expect = 1.6
Identities = 20/81 (24%), Positives = 27/81 (33%), Gaps = 3/81 (3%)
Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAV 244
LPS +E S PA + PP+ Q + P P P
Sbjct: 360 LPSA---FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIP 416
Query: 245 TPSSPSTPAPQPAMPALVTQP 265
P+ P+ P+P P A P
Sbjct: 417 VPAEPTEPSPTPPANAANAPP 437
Score = 29.9 bits (68), Expect = 2.1
Identities = 19/72 (26%), Positives = 22/72 (30%), Gaps = 10/72 (13%)
Query: 209 GTLPSPPSMPQNLPGPSISG---------PLIPGPSIPSFPSPAVTPSSPSTPAPQPA-M 258
G LPS + P PS P A TPS P A P +
Sbjct: 358 GLLPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPV 417
Query: 259 PALVTQPEQEIS 270
PA T+P
Sbjct: 418 PAEPTEPSPTPP 429
Score = 28.0 bits (63), Expect = 7.2
Identities = 16/96 (16%), Positives = 25/96 (26%), Gaps = 8/96 (8%)
Query: 174 PSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPG 233
PS + + P P S+ T + + P P
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATT-PSPPPAK-------AS 412
Query: 234 PSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
P IP P +P A + + Q+I
Sbjct: 413 PPIPVPAEPTEPSPTPPANAANAPPSLNLEELWQQI 448
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 31.6 bits (71), Expect = 0.52
Identities = 22/95 (23%), Positives = 33/95 (34%), Gaps = 4/95 (4%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQ---LPSPPRENSNEDVTPSFPAPTSSGGTLPSPP 215
P+P P GSE S +S +SN + L + + E+ T T +LP
Sbjct: 186 APLPVSPRGDSGSENHSSASSHSNSEDCSLSAGFSNSGGEEFTRRSTDFTRRQLSLPDRQ 245
Query: 216 SMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS 250
P LP + +GP + P
Sbjct: 246 HQPA-LPRQNSAGPQRRVDQPSPPGGGSHRGRIPP 279
Score = 27.8 bits (61), Expect = 8.9
Identities = 16/75 (21%), Positives = 23/75 (30%), Gaps = 4/75 (5%)
Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSF---PAPTSSGGTLPSP 214
+ + P G GS + + LPS S+E +SS G P
Sbjct: 259 QRRVDQPSPPGGGSHRGRIPPSLLS-SLPSEGSMLSSEWPQSGARPRQQSSSSKGDSPEL 317
Query: 215 PSMPQNLPGPSISGP 229
+L PS
Sbjct: 318 RPAAGHLQQPSPVNM 332
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 31.6 bits (72), Expect = 0.53
Identities = 16/74 (21%), Positives = 24/74 (32%), Gaps = 16/74 (21%)
Query: 175 SLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGP 234
S SS S+ SP N + + PQ++ P IS
Sbjct: 82 SGSSSLSSGNQSSPSAAN---------NTSDGHDASGVKNTAPPQDISAPPISPT----- 127
Query: 235 SIPSFPSPAVTPSS 248
P+ +P TP+
Sbjct: 128 --PTQAAPPQTPNG 139
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 31.6 bits (71), Expect = 0.58
Identities = 21/107 (19%), Positives = 34/107 (31%), Gaps = 6/107 (5%)
Query: 168 GEGSEGPSLSSQNSNDQLPSPPRE---NSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGP 224
P+ SQ + P+ +N+ ++ + P T T +P PQ P P
Sbjct: 181 PTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPE--PQTEPPP 238
Query: 225 SISGPLIPGPSIPSFPSPAVTPSSPS-TPAPQPAMPALVTQPEQEIS 270
S GP PS S + + + P + S
Sbjct: 239 SQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHS 285
>gnl|CDD|237695 PRK14376, PRK14376, hypothetical protein; Provisional.
Length = 176
Score = 30.7 bits (69), Expect = 0.63
Identities = 13/51 (25%), Positives = 15/51 (29%)
Query: 204 PTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
P S+GG P P P GP P AV +P
Sbjct: 99 PLSAGGYDPVPGEAPGTSVGPGTVRKPRGRTRTAGGPVGAVDAGTPHRSRR 149
>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1. This is
the N-terminal 200 residues of a set of proteins
conserved from yeasts to humans. Most of the proteins
in this entry have an RhoGEF pfam00621 domain at their
C-terminal end.
Length = 158
Score = 30.1 bits (68), Expect = 0.75
Identities = 23/69 (33%), Positives = 29/69 (42%), Gaps = 13/69 (18%)
Query: 8 CPDSLKAIQHFLNVASE---HDNRDLTVSYWCRLHALQKALKIDKSSDGAKMLLLKLMDW 64
DSL + L V E DN L+ Y R L+K A+ LL+ D
Sbjct: 37 NLDSLASENQELEVELELLREDNERLSTQY-EREKELRKQ---------AEQKLLEFEDE 86
Query: 65 LEQEKKTHR 73
LEQEKK +
Sbjct: 87 LEQEKKELQ 95
>gnl|CDD|237940 PRK15313, PRK15313, autotransport protein MisL; Provisional.
Length = 955
Score = 31.3 bits (70), Expect = 0.75
Identities = 17/55 (30%), Positives = 23/55 (41%), Gaps = 5/55 (9%)
Query: 212 PSPPSMPQN----LPGPSISGPLIPGPSIPSFPSPAV-TPSSPSTPAPQPAMPAL 261
P P +P +P P I P+ P P P P P + P P P+P P +
Sbjct: 560 PDEPIIPDPVDPVIPDPVIPDPVDPDPVDPVIPDPVIPDPVDPDPVDPEPVDPVI 614
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 30.9 bits (69), Expect = 0.77
Identities = 17/54 (31%), Positives = 25/54 (46%)
Query: 211 LPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
LP+ PSMP L + P P+ +PA +P+ P P PA++ Q
Sbjct: 93 LPAMPSMPGLLGAAAPVPAPAPAPAAAPPAAPAPAADTPAAPIPDAVQPAILGQ 146
>gnl|CDD|218232 pfam04731, Caudal_act, Caudal like protein activation region. This
family consists of the amino termini of proteins
belonging to the caudal-related homeobox protein family.
This region is thought to mediate transcription
activation. The level of activation caused by mouse Cdx2
is affected by phosphorylation at serine 60 via the
mitogen-activated protein kinase pathway. Caudal family
proteins are involved in the transcriptional regulation
of multiple genes expressed in the intestinal
epithelium, and are important in differentiation and
maintenance of the intestinal epithelial lining. Caudal
proteins always have a homeobox DNA binding domain
(pfam00046).
Length = 135
Score = 30.1 bits (68), Expect = 0.77
Identities = 21/65 (32%), Positives = 29/65 (44%), Gaps = 4/65 (6%)
Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGP-SIPSFPSPAVT 245
PPRE+ + P P+++ T SP M + P S P PGP + S +V
Sbjct: 53 GPPREDWS--AYGPGPGPSATAATG-SPGQMAYSPPDYSPVHPPGPGPGLLQPADSGSVE 109
Query: 246 PSSPS 250
SPS
Sbjct: 110 QLSPS 114
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.1 bits (70), Expect = 0.78
Identities = 20/79 (25%), Positives = 24/79 (30%), Gaps = 7/79 (8%)
Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSIS----GPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
P+P S P +LP P P PG PSPA P +P P
Sbjct: 382 PSPAGSPDVKKKAP--EPDLPQPDRHPGPAKPEAPGARPAELPSPASAP-TPEQQPPVAR 438
Query: 258 MPALVTQPEQEISYIEAEL 276
L P+ A
Sbjct: 439 SAPLPPSPQASAPRNVASG 457
Score = 30.3 bits (68), Expect = 1.4
Identities = 25/95 (26%), Positives = 32/95 (33%), Gaps = 20/95 (21%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFP-APTSSGGTLPSPPS- 216
+P P + E LP P R P+ P AP + LPSP S
Sbjct: 383 SPAGSPDVKKKAPEPD----------LPQPDRHPG-----PAKPEAPGARPAELPSPASA 427
Query: 217 -MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS 250
P+ P + S PL P P + V P
Sbjct: 428 PTPEQQPPVARSAPLPPSPQ--ASAPRNVASGKPG 460
Score = 29.1 bits (65), Expect = 3.8
Identities = 22/86 (25%), Positives = 28/86 (32%), Gaps = 4/86 (4%)
Query: 179 QNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPS 238
+N PSP + P P P+ P P P S P P
Sbjct: 375 RNDGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPE--- 431
Query: 239 FPSPAVTPSSPSTPAPQPAMPALVTQ 264
P V S+P P+PQ + P V
Sbjct: 432 -QQPPVARSAPLPPSPQASAPRNVAS 456
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 31.2 bits (70), Expect = 0.86
Identities = 24/112 (21%), Positives = 35/112 (31%), Gaps = 12/112 (10%)
Query: 197 VTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS-TPAPQ 255
T S+ AP P S+ P+++ +PGP P + P+ Q
Sbjct: 329 ATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQT---GEPVIAPAPEGYPQQSQ 385
Query: 256 PAMPALV--------TQPEQEISYIEAELINKCQKYIKFASSALNYDDYKEA 299
A PA+ QP+Q AE + Y Y A
Sbjct: 386 YAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPA 437
Score = 30.4 bits (68), Expect = 1.7
Identities = 22/99 (22%), Positives = 28/99 (28%), Gaps = 2/99 (2%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNL 221
P + + + Q Q P P+ + P P P P P
Sbjct: 747 PIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQ 806
Query: 222 P-GPSISGPLIPGPSIPSFPSPAVT-PSSPSTPAPQPAM 258
P P P P P P P P P P PQ +
Sbjct: 807 PQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTL 845
Score = 27.7 bits (61), Expect = 9.8
Identities = 29/122 (23%), Positives = 43/122 (35%), Gaps = 8/122 (6%)
Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
+ P+ P+ + + + Q Q P P+ + P P P P P
Sbjct: 782 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQ---PQQPVA 838
Query: 218 PQNLPGPSISGPLI--PGPSIP-SFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEA 274
PQ P ++ PL+ G S P P+ + TP P P EQ +EA
Sbjct: 839 PQ--PQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTFALEQMARLVEA 896
Query: 275 EL 276
L
Sbjct: 897 RL 898
>gnl|CDD|132858 cd07219, Pat_PNPLA1, Patatin-like phospholipase domain containing
protein 1. Members of this family share a patatin
domain, initially discovered in potato tubers. Some
members of PNPLA1 subfamily do not have the lipase
consensus sequence Gly-X-Ser-X-Gly which is essential
for hydrolase activity. This family includes PNPLA1
from Homo sapiens and Gallus gallus. Currently, there is
no literature available on the physiological role,
structure, or enzymatic activity of PNPLA1. It is
expressed in various human tissues in low mRNA levels.
Length = 382
Score = 31.0 bits (70), Expect = 0.89
Identities = 20/88 (22%), Positives = 34/88 (38%), Gaps = 12/88 (13%)
Query: 173 GPSLSSQNSNDQLPSPPREN----SNEDVTPSFPAPTSSGG-TLPSPPSMPQNLP--GPS 225
G + ++ L P+ + + ++P AP S+ + SP S P + PS
Sbjct: 296 GQASLEESWQPHLARAPKGDGRGLHDPPLSPPLAAPESTAEWVVESPVSSPASPLESSPS 355
Query: 226 ISGPLIPGPSIPSFPS----PAVTPSSP 249
+ G L S P+ P+ TP
Sbjct: 356 LPGSLTDLSPA-SLPAVHSLPSSTPGLS 382
>gnl|CDD|225599 COG3057, SeqA, Negative regulator of replication initiationR [DNA
replication, recombination, and repair].
Length = 181
Score = 30.2 bits (68), Expect = 0.97
Identities = 19/72 (26%), Positives = 33/72 (45%), Gaps = 10/72 (13%)
Query: 241 SPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELINKCQKYIKFASSALNYDDYKEAK 300
S A ++P+T P+ A+PA V EA+ + ++ L D+Y E K
Sbjct: 36 SAASQTAAPATKEPRVAVPAKV----------EAKPTKTIKDKVRAMRELLLSDEYAEQK 85
Query: 301 SNLIKVLNILNT 312
+ + + IL+T
Sbjct: 86 RAVNRFMLILST 97
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 30.8 bits (69), Expect = 0.97
Identities = 28/118 (23%), Positives = 41/118 (34%), Gaps = 21/118 (17%)
Query: 162 PGPPESGEGSEGPSLSSQNSNDQLPSPPRE-----NSNEDVTPSFPAPTSSGGTLPSPPS 216
PGP + + S+ P+LS + + P P++ + P T P P
Sbjct: 561 PGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRP-------TRPKSPK 613
Query: 217 MPQNL--------PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
+P+ L P S P P PS P P +P P P P P+
Sbjct: 614 LPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKP-PKSPKPPFDPK 670
Score = 30.4 bits (68), Expect = 1.5
Identities = 27/119 (22%), Positives = 37/119 (31%), Gaps = 9/119 (7%)
Query: 155 KSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTL-PS 213
K E P GP SG + P + S + E P G P+
Sbjct: 506 KHDEPP-EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPA 564
Query: 214 PPSMPQNLPGPSI--SGPLIPG----PSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
P +P S P P P P P + P+ P P +P L+ P+
Sbjct: 565 KEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRP-KSPKLPELLDIPK 622
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 29.5 bits (66), Expect = 1.3
Identities = 31/121 (25%), Positives = 39/121 (32%), Gaps = 14/121 (11%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSG------- 208
G IP PP + P +P PP + P P P +G
Sbjct: 43 PGGAAIPPPPPLPGVAGIPPPPPLPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPP 102
Query: 209 ----GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
+P PP +P GP + P P P P P P SP P +PA
Sbjct: 103 LPGGAAVPPPPPLP---GGPGVPPPPPPFPGAPGIPPPPPGMGSPPPPPFGFGVPAAPVL 159
Query: 265 P 265
P
Sbjct: 160 P 160
Score = 29.5 bits (66), Expect = 1.4
Identities = 30/109 (27%), Positives = 35/109 (32%), Gaps = 6/109 (5%)
Query: 157 GETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPS 216
G IP PP G P +P PP + P P P +P PP
Sbjct: 8 GGVGIPPPPPLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIPP--PPPLPGVAGIPPPPP 65
Query: 217 MPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
LPG + P P P P P P P P P +P P
Sbjct: 66 ----LPGATAIPPPPPLPGAAGIPPPPPLPGGAGIPPPPPPLPGGAAVP 110
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 29.4 bits (66), Expect = 1.3
Identities = 12/48 (25%), Positives = 19/48 (39%)
Query: 212 PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
P P Q+ + P P++P P+ AV + + P P P
Sbjct: 90 PVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQP 137
Score = 29.0 bits (65), Expect = 1.5
Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 1/54 (1%)
Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQ 267
PP+ P+ + P+ P P+ P +PAV P +P+ P PA + P Q
Sbjct: 84 PPAPPEPVTPPTAQSPAPAVPTPPPTSTPAV-PPAPAAAVPAPAAAPPPSDPPQ 136
Score = 29.0 bits (65), Expect = 2.0
Identities = 14/53 (26%), Positives = 18/53 (33%)
Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
PAP + P P S P +P + P+PA P P P
Sbjct: 85 PAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQP 137
Score = 27.5 bits (61), Expect = 5.3
Identities = 13/54 (24%), Positives = 18/54 (33%)
Query: 212 PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
P P P P P +P P S P+ P++ P+ QP
Sbjct: 84 PPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQP 137
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 29.8 bits (67), Expect = 1.3
Identities = 22/79 (27%), Positives = 29/79 (36%), Gaps = 24/79 (30%)
Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPA-VT 245
S P +++ T S+P P+ S SP PG SI S PS V
Sbjct: 77 SVPSGDTS---TYSYPVPSPS----YSPS----------------PGSSIQSLPSTTGVR 113
Query: 246 PSSPSTPAPQPAMPALVTQ 264
P S + A LV +
Sbjct: 114 PQSSAENANSEKRKLLVIR 132
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 29.5 bits (65), Expect = 1.4
Identities = 25/98 (25%), Positives = 33/98 (33%), Gaps = 6/98 (6%)
Query: 168 GEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSIS 227
G+G +G + + ++N P E E P P G P P + P P +
Sbjct: 42 GKGEKGTKVGADDTNGTDPDDEPEEEEE------PEPEEEGEEEPEPEEEGEEEPEPEET 95
Query: 228 GPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQP 265
G P P P P P P P A V P
Sbjct: 96 GEEEPEPEPEPEPEPEPEPEPEPEPEPGAATLKSVALP 133
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 30.1 bits (68), Expect = 1.4
Identities = 17/74 (22%), Positives = 24/74 (32%), Gaps = 6/74 (8%)
Query: 194 NEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPA 253
P A P P Q LP + P+ A P++ + P
Sbjct: 51 PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQ-----QQHNTPTGSPAANPPATFALP- 104
Query: 254 PQPAMPALVTQPEQ 267
PA P + T+P Q
Sbjct: 105 AGPAGPTIQTEPGQ 118
>gnl|CDD|150071 pfam09279, efhand_like, Phosphoinositide-specific phospholipase
C, efhand-like. Members of this family are
predominantly found in phosphoinositide-specific
phospholipase C. They adopt a structure consisting of a
core of four alpha helices, in an EF like fold, and are
required for functioning of the enzyme.
Length = 83
Score = 28.3 bits (64), Expect = 1.5
Identities = 13/42 (30%), Positives = 20/42 (47%), Gaps = 5/42 (11%)
Query: 50 SSDGAKMLLLKLMDWLEQEKKTHRDNDTIMNETAAQAYIENY 91
SS+ + +L +L +E+K D + E A A IE Y
Sbjct: 10 SSNKGVLTAEELRRFLNEEQK-----DPRLTEEQALAIIEKY 46
>gnl|CDD|218658 pfam05616, Neisseria_TspB, Neisseria meningitidis TspB protein.
This family consists of several Neisseria meningitidis
TspB virulence factor proteins.
Length = 502
Score = 30.0 bits (67), Expect = 1.8
Identities = 21/71 (29%), Positives = 29/71 (40%), Gaps = 1/71 (1%)
Query: 196 DVTP-SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
D+TP S AP + SP P N P P + P P +P P + P
Sbjct: 318 DLTPGSAEAPEAQPLPEVSPAENPANNPNPRENPGTRPNPEPDPDLNPDANPDTDGQPGT 377
Query: 255 QPAMPALVTQP 265
+P PA+ +P
Sbjct: 378 RPDSPAVPDRP 388
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 30.0 bits (67), Expect = 1.8
Identities = 17/85 (20%), Positives = 23/85 (27%), Gaps = 15/85 (17%)
Query: 176 LSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPS 235
L +N+ +P PP+ P P P + G P P PG
Sbjct: 193 LHDRNNWPTIPLPPQMAGQSMYQPPGPYPNAMVGRQPFYPQ---------------PGAV 237
Query: 236 IPSFPSPAVTPSSPSTPAPQPAMPA 260
+ A Q A A
Sbjct: 238 AGPPKRRGGHKAPRGHRASQAAGQA 262
>gnl|CDD|221759 pfam12764, Gly-rich_Ago1, Glycine-rich region of argonaut. This
domain is often found at the very N-terminal of
argonaut-like proteins.
Length = 102
Score = 28.4 bits (63), Expect = 1.9
Identities = 14/73 (19%), Positives = 23/73 (31%), Gaps = 4/73 (5%)
Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS----TPAPQPA 257
P G P S P + + + + T +PS + P +
Sbjct: 14 PPQQGGRGGGGGGRGGGSTGGPPRPSVPELHQATQAPYQAAVSTQPAPSEASSSSQPPES 73
Query: 258 MPALVTQPEQEIS 270
VTQ Q++S
Sbjct: 74 SSLQVTQQFQQLS 86
>gnl|CDD|182059 PRK09752, PRK09752, adhesin; Provisional.
Length = 1250
Score = 30.0 bits (67), Expect = 1.9
Identities = 19/50 (38%), Positives = 22/50 (44%), Gaps = 10/50 (20%)
Query: 213 SPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
+PPS P P P P P+ P P TP TPA QP + A V
Sbjct: 919 TPPSPPDPDPTPD------PDPT----PDPDPTPDPEPTPAYQPVLNAKV 958
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
(DUF2360). This is the conserved 140 amino acid region
of a family of proteins conserved from nematodes to
humans. One C. elegans member is annotated as a
Daf-16-dependent longevity protein 1 but this could not
be confirmed. The function is unknown.
Length = 147
Score = 28.9 bits (65), Expect = 2.0
Identities = 15/61 (24%), Positives = 19/61 (31%)
Query: 205 TSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQ 264
+ T P P S N P S P +PA P+ P AP +
Sbjct: 53 VTVQTTPPPPASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENTVTVAKD 112
Query: 265 P 265
P
Sbjct: 113 P 113
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human Gammaherpesvirus
that infects and establishes latency in B lymphocytes in
vivo. The latent membrane protein 2 (LMP2) gene is
expressed in latently infected B cells and encodes two
protein isoforms, LMP2A and LMP2B, that are identical
except for an additional N-terminal 119 aa cytoplasmic
domain which is present in the LMP2A isoform. LMP2A is
thought to play a key role in either the establishment
or the maintenance of latency and/or the reactivation of
productive infection from the latent state. The
significance of LMP2B and its role in pathogenesis
remain unclear.
Length = 489
Score = 29.8 bits (67), Expect = 2.1
Identities = 19/94 (20%), Positives = 27/94 (28%), Gaps = 27/94 (28%)
Query: 154 LKSGETPIPGPPESGEGSEGP----SLSSQNSNDQLPSPPRENSNEDVTPSFPAPT---- 205
L +G G P+ EG P S S P P ++ P +
Sbjct: 9 LGAGGPRSHGGPDGDEGDSNPYYPSSFGSSWDRPGPPVPEDYDAPSHRPPPYGGSNGDRH 68
Query: 206 -------------------SSGGTLPSPPSMPQN 220
+ GG LP PP P++
Sbjct: 69 GGYQPLGQQDPSLYAGLGQNGGGGLPPPPYSPRD 102
>gnl|CDD|227676 COG5386, COG5386, Cell surface protein [Cell envelope biogenesis,
outer membrane].
Length = 352
Score = 29.5 bits (66), Expect = 2.2
Identities = 9/37 (24%), Positives = 13/37 (35%)
Query: 220 NLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQP 256
PG + P P+P TP+ P P+
Sbjct: 187 AATPNDDKNVAQPGTTTPKTPTPQTTPNKPKVENPKT 223
>gnl|CDD|240420 PTZ00441, PTZ00441, sporozoite surface protein 2 (SSP2);
Provisional.
Length = 576
Score = 29.5 bits (66), Expect = 2.3
Identities = 26/90 (28%), Positives = 31/90 (34%), Gaps = 13/90 (14%)
Query: 182 NDQLPSPPRENSNEDVTPS----FPAPTSSGGTLPSPPSMPQNLPGPSISG--------P 229
N Q P PP N +D P+ FP + P P N+PG S S P
Sbjct: 325 NPQDPVPP-PNEGKDGNPNEENLFPPGDDEVPDESNVPPNPPNVPGGSNSEFSSDVENPP 383
Query: 230 LIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
P P IP S+ P P P
Sbjct: 384 NPPNPDIPEQEPNIPEDSNKEVPEDVPMEP 413
>gnl|CDD|218158 pfam04580, Pox_D3, Chordopoxvirinae D3 protein. Chordopoxvirinae
D3 protein conserved region. Region occupies entire
length of D3 protein.
Length = 246
Score = 29.2 bits (66), Expect = 2.4
Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 2/50 (4%)
Query: 265 PEQEISYIEAELINKCQKYIK--FASSALNYDDYKEAKSNLIKVLNILNT 312
PE EI LI + + K S +NY +K S L ++LN + +
Sbjct: 133 PEYEIFLTNPFLIKQNRDIFKNLILRSEINYFIFKTKNSPLKELLNHIVS 182
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 29.2 bits (65), Expect = 2.4
Identities = 25/98 (25%), Positives = 35/98 (35%), Gaps = 13/98 (13%)
Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQ-LPSPPRENSNEDVTPSFPAPTSSGGTLPSPPS 216
E + S + S+ S S + + P P N P FP P PP
Sbjct: 132 EANVNETESSTDESDRSSHSHEVRSKSNFPMGPPSPWN----PRFPPGPP-----PPPPG 182
Query: 217 MPQNLPGPSISGPLIPG--PSIPSFPSPAVTPSSPSTP 252
++ PS P + G P P P P + P P +P
Sbjct: 183 FGRHGEKPSGWPPFLSGWPPPFPLGP-PMIPPPPPMSP 219
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 29.4 bits (65), Expect = 2.4
Identities = 35/128 (27%), Positives = 48/128 (37%), Gaps = 11/128 (8%)
Query: 134 EDIEQNKKYA-KWKAAYIHNCLKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPREN 192
D++ KY+ K Y + LK+ + P SG+ PS + P+
Sbjct: 45 ADVDPLDKYSVKEAGRYYQSRLKANSACMQRP--SGQSPLAPSPACAAPAPACPACAPAA 102
Query: 193 SNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTP 252
VT PAP T P+ P P P P P+ P PS P +P P
Sbjct: 103 PAPAVTCPAPAPACPPATAPTCP------PPAVCPAPARPAPACP--PSTRQCPPAPPLP 154
Query: 253 APQPAMPA 260
P+PA A
Sbjct: 155 TPKPAPAA 162
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and
is typically between 384 and 430 amino acids in length.
Length = 414
Score = 29.4 bits (66), Expect = 2.5
Identities = 27/120 (22%), Positives = 38/120 (31%), Gaps = 14/120 (11%)
Query: 155 KSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSP-PRENSNEDVTPSFPAPTSSGGTLPS 213
KSG +P +S + +S D S P+ D AP T
Sbjct: 221 KSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEKAPPID--TTEE 278
Query: 214 PPSMPQNLPG-----------PSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
P+ P PS+ P S SP +P P +P P+P P +
Sbjct: 279 ELKSPEASPKESEEASARKRSPSLLSPSPKAESPKPLASPGKSPRDPLSPRPKPQSPPVN 338
Score = 29.0 bits (65), Expect = 3.2
Identities = 29/130 (22%), Positives = 42/130 (32%), Gaps = 8/130 (6%)
Query: 146 KAAYIHNCLKSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPT 205
KA ++ + L E+P P + LS + R NS ++VTP T
Sbjct: 150 KATWLESALNKPESPKHKPQPPQQPEWKKDLSRLRQSRASVDLGRTNSFKEVTPVGLMRT 209
Query: 206 SSGGTLPSPPSMPQNLPGPSISG--------PLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
G+ PS PS P S+ + S A P P
Sbjct: 210 PPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEKSSAPKPRETLDPKSPEK 269
Query: 258 MPALVTQPEQ 267
P + T E+
Sbjct: 270 APPIDTTEEE 279
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 29.8 bits (66), Expect = 2.5
Identities = 30/129 (23%), Positives = 40/129 (31%), Gaps = 19/129 (14%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQ--------NSNDQLPSPPRENSNEDVTPSFPAPTSS 207
G T P +G S + S Q N+N + + VT SS
Sbjct: 533 IGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSS 592
Query: 208 GGT-LPSPPSMPQNLP---GPSISGPLIPGPSIPSFPSPAVTPSSPST-------PAPQP 256
+ P PS + P S + L TPS PST P P P
Sbjct: 593 PTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGP 652
Query: 257 AMPALVTQP 265
+ V+ P
Sbjct: 653 GTTSQVSGP 661
Score = 28.6 bits (63), Expect = 6.0
Identities = 18/106 (16%), Positives = 33/106 (31%), Gaps = 5/106 (4%)
Query: 156 SGETPIPGPPESGE--GSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPS 213
+G T P SG G+ +L S + N+ +S T +
Sbjct: 460 TGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKT 519
Query: 214 PPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
+ P P + G + S P+ + + ++P P
Sbjct: 520 SDTPNATSPTPIVIGVT---TTATSPPTGTTSVPNATSPQVTEESP 562
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 29.6 bits (66), Expect = 2.5
Identities = 19/97 (19%), Positives = 31/97 (31%), Gaps = 18/97 (18%)
Query: 158 ETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSM 217
+ + P++ G ++ Q P P + + S PA + P +
Sbjct: 365 KVAVIAAPQTHTGP---------ADRQRPQRP-----DGIPYSVPARSPMTA----YPPV 406
Query: 218 PQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
PQ P + P P S+ V P P P
Sbjct: 407 PQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNP 443
Score = 29.6 bits (66), Expect = 2.6
Identities = 21/114 (18%), Positives = 28/114 (24%), Gaps = 11/114 (9%)
Query: 161 IPGPPESGEGSEGPSLSSQNS-NDQLPSPPRENSNEDVTPSFPAPTSSGGTLPS-----P 214
I G E + SL++ + + + T + PA P
Sbjct: 338 INGLKAHNEILKTASLTAPSRVLAAAAKVAVIAAPQ--THTGPADRQRPQRPDGIPYSVP 395
Query: 215 PSMPQNLPGPSISGPLIPGPSIPSFPSPAVT--PSSPSTPAPQPAMPALVTQPE 266
P P PG P P T P P P P P
Sbjct: 396 ARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVP-PQPTNPYVMPI 448
>gnl|CDD|137339 PRK09499, sifB, secreted effector protein SifB; Provisional.
Length = 316
Score = 29.2 bits (65), Expect = 2.6
Identities = 31/112 (27%), Positives = 48/112 (42%), Gaps = 21/112 (18%)
Query: 61 LMDWLEQEKK----------------THRDNDTIMNETAAQAYIENYALKLFQWADSMDR 104
L DW EQE+K T D D I N+ AA+ IEN LK + +
Sbjct: 192 LCDWKEQERKAAISSRINLGIAQAGVTAID-DAIKNKIAAKV-IENTNLKNAAF-EPNYA 248
Query: 105 ASTFNKNVVKAFYTASILMDVLQTFGETSEDIEQNKKYAKWKAAYIHNCLKS 156
S+ + V + ILM++L+ +S + + ++ A +HN L S
Sbjct: 249 QSSVTQIVYSCLFKNEILMNMLEE--SSSHGLLCLNELTEYVALQVHNSLFS 298
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 29.6 bits (66), Expect = 2.7
Identities = 28/104 (26%), Positives = 37/104 (35%), Gaps = 4/104 (3%)
Query: 162 PGPPESGEGS--EGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQ 219
P PPES S + PS Q S++ P P ++E S P S T PP P
Sbjct: 149 PAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPP 208
Query: 220 NLPGP--SISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
+ PG S + P P+ P+ P P
Sbjct: 209 DEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRS 252
>gnl|CDD|225145 COG2236, COG2236, Predicted phosphoribosyltransferases [General
function prediction only].
Length = 192
Score = 28.9 bits (65), Expect = 2.7
Identities = 15/73 (20%), Positives = 23/73 (31%), Gaps = 4/73 (5%)
Query: 43 KALKIDKSSD-GAKMLLLK--LMDWLEQEKKTHRDNDTIMNETAAQAYIENYALKLFQWA 99
K L +D D G + L L E +T + + +A +F W
Sbjct: 89 KVLIVDDIVDTGETLELALEELKKLAPAEVRTAVLQY-KKSPDYYGEEVTAWAWIIFPWN 147
Query: 100 DSMDRASTFNKNV 112
D A K +
Sbjct: 148 RKEDLADLILKIL 160
>gnl|CDD|224433 COG1516, FliS, Flagellin-specific chaperone FliS [Cell motility and
secretion / Intracellular trafficking and secretion /
Posttranslational modification, protein turnover,
chaperones].
Length = 132
Score = 28.4 bits (64), Expect = 2.9
Identities = 9/36 (25%), Positives = 19/36 (52%)
Query: 276 LINKCQKYIKFASSALNYDDYKEAKSNLIKVLNILN 311
L K++K A A+ +D +E ++ K ++I+
Sbjct: 27 LYEGALKFLKRAKEAIEQEDIEEKNESIDKAIDIIT 62
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 29.1 bits (65), Expect = 3.0
Identities = 21/117 (17%), Positives = 40/117 (34%), Gaps = 15/117 (12%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQL------------PSPPRENSNEDVTPSFPAPTSS 207
+ PP S ++G +Q+ ++ P + +N VT + P ++
Sbjct: 75 DLSLPPISSTPTQGQEPLAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPAT 134
Query: 208 GGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP-QPAMPALVT 263
P + P+I+ P+ P ++ +T A Q A PA
Sbjct: 135 --VAPVRNASVPTAERPAITRPVRAQAVSEPAVEPKAAKTATATEAKVQTASPAQTP 189
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 29.0 bits (65), Expect = 3.2
Identities = 15/67 (22%), Positives = 19/67 (28%), Gaps = 11/67 (16%)
Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAM 258
P T+ P Q P P + L+P P PS
Sbjct: 110 LGEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEP-----------PSYEETIKPG 158
Query: 259 PALVTQP 265
PA V +P
Sbjct: 159 PAPVEEP 165
Score = 27.8 bits (62), Expect = 7.4
Identities = 16/104 (15%), Positives = 30/104 (28%), Gaps = 6/104 (5%)
Query: 164 PPESGEGSEGPSLSS-QNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP 222
E + +++ + P P E P P+ + P+P P +
Sbjct: 110 LGEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSM 169
Query: 223 GPSISGPLIPG----PSIPSFPSPAVTPSSPSTPAPQPAMPALV 262
++ P P P P P V P + + +
Sbjct: 170 AIAVPAIDTPVTLELPPAPQ-PPPPVVPQPSTMVVHRRSRIKRT 212
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 29.0 bits (65), Expect = 3.2
Identities = 25/98 (25%), Positives = 39/98 (39%), Gaps = 15/98 (15%)
Query: 204 PTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
T++G +P P+ P PGP G PG + SFP S + P P P
Sbjct: 128 VTATGEPVPQMPASP---PGPE--GEPQPGNTPVSFPP----QGSVAVPPPTAFYPGNGV 178
Query: 264 QPEQEISYIEAELINKCQKYIKFASSALNYDDYKEAKS 301
P +++Y + N+ Q Y++ K+ S
Sbjct: 179 TPPPQVTYQSVPVPNRIQ------RKTFTYNEGKKGPS 210
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 28.3 bits (63), Expect = 3.2
Identities = 15/60 (25%), Positives = 18/60 (30%), Gaps = 4/60 (6%)
Query: 158 ETPIPGPPESGEGSEGPSLSSQ-NSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPS 216
ET S S S S + S P R + D T FP PT+
Sbjct: 58 ETAASSKAPS---SSSKSSSQETISIPPTPPARRPSRRWDQTGRFPNPTTGAKPTLRAAR 114
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 29.3 bits (65), Expect = 3.2
Identities = 13/67 (19%), Positives = 23/67 (34%)
Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPAL 261
P+S+ + +P P++ + S + + AV SS +P
Sbjct: 304 VTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGT 363
Query: 262 VTQPEQE 268
V P E
Sbjct: 364 VALPAAE 370
Score = 27.8 bits (61), Expect = 8.5
Identities = 21/95 (22%), Positives = 31/95 (32%), Gaps = 5/95 (5%)
Query: 161 IPGPPESGEGSEGPSLSS-QNSNDQLPSPPRENSNEDVTPSFPAP-TSSGGTLPSPPSMP 218
IP P S ++ Q S L S S+ + + P P P S
Sbjct: 323 IPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTT 382
Query: 219 QNLP---GPSISGPLIPGPSIPSFPSPAVTPSSPS 250
+ G S P S+P+ P+ + S S
Sbjct: 383 ETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTS 417
>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 263 and 318 amino acids in length.
There is a conserved SFT sequence motif. MSA-2 is a
plasma membrane glycoprotein which can be found in
Babesia bovis species.
Length = 201
Score = 28.6 bits (64), Expect = 3.5
Identities = 16/50 (32%), Positives = 19/50 (38%), Gaps = 2/50 (4%)
Query: 199 PSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSS 248
P +SS GT + PS P+ S P PG PA TP
Sbjct: 141 YYSPKHSSSQGTSTTRPSDGSA--TPNTSAPPTPGNPAAQPEKPAETPKG 188
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 29.3 bits (65), Expect = 3.5
Identities = 21/81 (25%), Positives = 29/81 (35%), Gaps = 2/81 (2%)
Query: 187 SPPRENSNEDVTPSFPAPTSSGGTLPSP-PSMPQNLPGPSISGPL-IPGPSIPSFPSPAV 244
SP +S+ A SS P +PQ P GP+ IP P P +
Sbjct: 330 SPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVPPPGLAIPS 389
Query: 245 TPSSPSTPAPQPAMPALVTQP 265
+ P AP P+ + P
Sbjct: 390 LVAPPGLVAPTEINPSFLASP 410
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 28.7 bits (64), Expect = 3.5
Identities = 25/69 (36%), Positives = 27/69 (39%), Gaps = 1/69 (1%)
Query: 186 PSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVT 245
PSPPR S TP S T PSPP P +GP P P VT
Sbjct: 9 PSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCP-AGVT 67
Query: 246 PSSPSTPAP 254
SS + P P
Sbjct: 68 FSSSAPPRP 76
Score = 28.3 bits (63), Expect = 6.0
Identities = 21/91 (23%), Positives = 29/91 (31%), Gaps = 9/91 (9%)
Query: 159 TPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMP 218
+P P S P + + PSPP P P +P P
Sbjct: 8 SPSPPRRPSPPRPTPPRSPDASPEETPPSPP--------GPGAEPPPGRAAGPAAPRRRP 59
Query: 219 QNLP-GPSISGPLIPGPSIPSFPSPAVTPSS 248
+ P G + S P P + +PA TP
Sbjct: 60 RGCPAGVTFSSSAPPRPPLGLDDAPAATPPP 90
>gnl|CDD|177556 PHA03195, PHA03195, tegument protein VP11/12; Provisional.
Length = 746
Score = 29.2 bits (65), Expect = 3.7
Identities = 16/70 (22%), Positives = 22/70 (31%), Gaps = 2/70 (2%)
Query: 155 KSGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSP 214
++G P G S P LP+PP + P LP+
Sbjct: 415 RTGSCPPQGATFGRNPSGFPGQFCCPPQGPLPAPPNSKTRGTFRRPRPGSVRGSRQLPAS 474
Query: 215 PSMPQNLPGP 224
P P N+ P
Sbjct: 475 P--PSNIVSP 482
>gnl|CDD|218994 pfam06333, Med13_C, Mediator complex subunit 13 C-terminal.
Mediator is a large complex of up to 33 proteins that is
conserved from plants through fungi to humans - the
number and representation of individual subunits varying
with species. It is arranged into four different
sections, a core, a head, a tail and a kinase-activity
part, and the number of subunits within each of these is
what varies with species. Overall, Mediator regulates
the transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Med13 is part of the
ancillary kinase module, together with Med12, CDK8 and
CycC, which in yeast is implicated in transcriptional
repression, though most of this activity is likely
attributable to the CDK8 kinase. The large Med12 and
Med13 proteins are required for specific developmental
processes in Drosophila, zebrafish, and Caenorhabditis
elegans but their biochemical functions are not
understood.
Length = 406
Score = 28.8 bits (65), Expect = 3.8
Identities = 19/94 (20%), Positives = 31/94 (32%), Gaps = 16/94 (17%)
Query: 192 NSNEDVTPSFPAPTSSGGTL-PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPS 250
+++ + P P S S PQ+ P SFPSP ++P+
Sbjct: 226 DTDPKLVPDPPTQPIPFSRFNTSNSSTPQSTPCTL------------SFPSPDQFGNAPT 273
Query: 251 TP--APQPAMPALVTQPEQ-EISYIEAELINKCQ 281
P VT P S ++ L++
Sbjct: 274 PPLGENGNDNAPPVTPPGSPSESDPDSILVDIDD 307
>gnl|CDD|188060 TIGR00531, BCCP, acetyl-CoA carboxylase, biotin carboxyl carrier
protein. This model is designed to identify biotin
carboxyl carrier protein as a peptide of acetyl-CoA
carboxylase. Scoring below the trusted cutoff is a
related protein encoded in a region associated with
polyketide synthesis in the prokaryote Saccharopolyspora
hirsuta, and a reported chloroplast-encoded biotin
carboxyl carrier protein that may be highly derived from
the last common ancestral sequence. Scoring below the
noise cutoff are biotin carboxyl carrier domains of
other enzymes such as pyruvate carboxylase.The gene name
is accB or fabE [Fatty acid and phospholipid metabolism,
Biosynthesis].
Length = 156
Score = 28.3 bits (63), Expect = 4.0
Identities = 8/46 (17%), Positives = 14/46 (30%), Gaps = 1/46 (2%)
Query: 223 GPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
+ + + P P+ S PAP PA ++
Sbjct: 36 AAAAKKSAVQQAAAPV-PAQVPAAPSAQAPAPAVCAPAPAKADKKG 80
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the
mating type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi
have evolved a unique mating system, termed tetrapolar
or bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is
thus unique to the mushroom fungi. This domain is
C-terminal to the homeodomain transcription factor
region.
Length = 418
Score = 28.6 bits (64), Expect = 4.6
Identities = 23/106 (21%), Positives = 35/106 (33%), Gaps = 8/106 (7%)
Query: 156 SGETPIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPT-------SSG 208
S + PPE+ S S + S P + + TP+ AP+ S G
Sbjct: 133 SSSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDTAPSGKRKRRLSDG 192
Query: 209 GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAP 254
LP+P PQ P +P + + + S S
Sbjct: 193 FQLPAPKR-PQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLL 237
>gnl|CDD|227889 COG5602, SIN3, Histone deacetylase complex, SIN3 component
[Chromatin structure and dynamics].
Length = 1163
Score = 28.7 bits (64), Expect = 4.7
Identities = 23/106 (21%), Positives = 34/106 (32%), Gaps = 1/106 (0%)
Query: 182 NDQLPSPPR-ENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFP 240
N LPS + E S S T+ G L SPP P P++PS
Sbjct: 188 NTFLPSGYKIEGSLPQPNGSRLHVTTPQGPLSSPPVQSSYYVAPCNHDQRTSHPTLPSDS 247
Query: 241 SPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELINKCQKYIKF 286
P + S + Q ++ ++ N + Y F
Sbjct: 248 QPEPSAPSHMPSDARGKHQVDFNQAIIFVNKVKVRFQNNPEMYYDF 293
>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
[Intracellular trafficking and secretion].
Length = 360
Score = 28.6 bits (64), Expect = 4.7
Identities = 14/74 (18%), Positives = 19/74 (25%), Gaps = 7/74 (9%)
Query: 185 LPSPPRENSNEDVTPSF-PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPA 243
++ N PS T LP P +P LP P +P
Sbjct: 45 ALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLPVDL------GAPVLPDQQVEE 98
Query: 244 VTPSSPSTPAPQPA 257
A + A
Sbjct: 99 AKDQPRRLRAAELA 112
>gnl|CDD|218056 pfam04388, Hamartin, Hamartin protein. This family includes the
hamartin protein which is thought to function as a
tumour suppressor. The hamartin protein interacts with
the tuberin protein pfam03542. Tuberous sclerosis
complex (TSC) is an autosomal dominant disorder and is
characterized by the presence of hamartomas in many
organs, such as brain, skin, heart, lung, and kidney. It
is caused by mutation either TSC1 or TSC2 tumour
suppressor gene. TSC1 encodes a protein, hamartin,
containing two coiled-coil regions, which have been
shown to mediate binding to tuberin. The TSC2 gene codes
for tuberin pfam03542. These two proteins function
within the same pathway(s) regulating cell cycle, cell
growth, adhesion, and vesicular trafficking.
Length = 667
Score = 28.8 bits (64), Expect = 4.7
Identities = 28/145 (19%), Positives = 45/145 (31%), Gaps = 13/145 (8%)
Query: 119 ASILMDVLQTFGETSEDIEQNKKYAKWKAAYIHNCLKSGETPIPGPPESG-EGSEGPSLS 177
A + +D +T E ++ Y + + I P + GS + +
Sbjct: 259 AKLSLDPTETSSEDGYSFSRSSAYP-NSSPRQ-----ALPPSISLPQNTSTSGSLHSAQT 312
Query: 178 SQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLP-GPSISGPLIPGPSI 236
S+ N + +D S + S P L PS PG +
Sbjct: 313 SRRPNTTFDKAASSGT-KDSLWSPSSLCGMATPPSSIGMSPLILSLSPSHLSGRAPGTTG 371
Query: 237 PSFPSPAVTPSSPSTPAPQPAMPAL 261
P+S STP+ P P L
Sbjct: 372 SGKG----EPASESTPSTSPPPPGL 392
>gnl|CDD|215496 PLN02918, PLN02918, pyridoxine (pyridoxamine) 5'-phosphate oxidase.
Length = 544
Score = 28.7 bits (64), Expect = 4.8
Identities = 16/73 (21%), Positives = 25/73 (34%), Gaps = 2/73 (2%)
Query: 200 SFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMP 259
P +L S PS Q PS G +P P + + + P A +P
Sbjct: 21 PISPPPPHSSSLSSSPSPTQRFLTPS-QGSRLP-PRRRALCTKSQDPRWRRAMASLAVIP 78
Query: 260 ALVTQPEQEISYI 272
+ +SY+
Sbjct: 79 NMQDSGSPPLSYL 91
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 28.6 bits (64), Expect = 4.8
Identities = 16/62 (25%), Positives = 22/62 (35%), Gaps = 3/62 (4%)
Query: 207 SGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
G P + P + P P + P P A + + PAP PA PA P
Sbjct: 379 RGAPAPPSAAWGAPTPAAP-AAP--PPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPA 435
Query: 267 QE 268
+
Sbjct: 436 RS 437
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 28.9 bits (64), Expect = 5.1
Identities = 23/116 (19%), Positives = 38/116 (32%), Gaps = 23/116 (19%)
Query: 161 IPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSP------ 214
+PG + G + P +P+P + V P + + G +P
Sbjct: 484 LPGVVQDGRPACAP----------VPAP----AGPIVRPWEASLSQVPGVAFAPVMPQPM 529
Query: 215 PSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTP---APQPAMPALVTQPEQ 267
P P +P ++ P+ P P + + P T P P P P Q
Sbjct: 530 PVEPVPVPTVALERPVCPAPPLIAMQGPGETSGIVRVRERWRPAPWTPNPPRSPSQ 585
>gnl|CDD|237182 PRK12727, PRK12727, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 559
Score = 28.4 bits (63), Expect = 5.1
Identities = 15/80 (18%), Positives = 27/80 (33%), Gaps = 5/80 (6%)
Query: 185 LPSPPRENSNEDVTPSFPAPTSSGGTLPSP----PSMPQNLPGPSISGPLIPGPSIPSFP 240
+P R S A ++ T P ++P+ L ++ +P + +
Sbjct: 126 AAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLFADFLTTAPVPRAPVQAPV 185
Query: 241 SPAVTPSSPSTPAPQPAMPA 260
A P P+ A A A
Sbjct: 186 VAAPAP-VPAIAAALAAHAA 204
>gnl|CDD|132971 cd06640, STKc_MST4, Catalytic domain of the Protein
Serine/Threonine Kinase, Mammalian Ste20-like protein
kinase 4. Serine/threonine kinases (STKs), mammalian
Ste20-like protein kinase 4 (MST4) subfamily, catalytic
(c) domain. STKs catalyze the transfer of the
gamma-phosphoryl group from ATP to serine/threonine
residues on protein substrates. The MST4 subfamily is
part of a larger superfamily that includes the catalytic
domains of other protein STKs, protein tyrosine kinases,
RIO kinases, aminoglycoside phosphotransferase, choline
kinase, and phosphoinositide 3-kinase. MST4 is sometimes
referred to as MASK (MST3 and SOK1-related kinase). It
plays a role in mitogen-activated protein kinase (MAPK)
signaling during cytoskeletal rearrangement,
morphogenesis, and apoptosis. It influences cell growth
and transformation by modulating the extracellular
signal-regulated kinase (ERK) pathway. MST4 may also
play a role in tumor formation and progression. It
localizes in the Golgi apparatus by interacting with the
Golgi matrix protein GM130 and may play a role in cell
migration.
Length = 277
Score = 28.1 bits (62), Expect = 5.6
Identities = 27/90 (30%), Positives = 38/90 (42%), Gaps = 18/90 (20%)
Query: 3 IQLPPCPDSLKAIQHFLNVASEHDNRDLTVSY---------WCRLHALQKALKIDKSSDG 53
I L D ++ IQ + V S+ D+ +T Y W + L +D G
Sbjct: 37 IDLEEAEDEIEDIQQEITVLSQCDSPYVTKYYGSYLKGTKLWIIMEYLGGGSALDLLRAG 96
Query: 54 -------AKML--LLKLMDWLEQEKKTHRD 74
A ML +LK +D+L EKK HRD
Sbjct: 97 PFDEFQIATMLKEILKGLDYLHSEKKIHRD 126
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 28.1 bits (62), Expect = 5.6
Identities = 25/94 (26%), Positives = 33/94 (35%), Gaps = 11/94 (11%)
Query: 173 GPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPSISGPLIP 232
P L+ +N + P N + P P P P PQ L P+ P
Sbjct: 9 PPGLAGLVANVPPAAAPTPQPNPVIQPQAPVQ-------PGQPGAPQQLAIPTQQ----P 57
Query: 233 GPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPE 266
P S +P V +P+ PAP A PE
Sbjct: 58 QPVPTSAMTPHVVQQAPAQPAPAAPPAAGAALPE 91
>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein.
Length = 283
Score = 28.0 bits (62), Expect = 6.2
Identities = 13/56 (23%), Positives = 24/56 (42%)
Query: 224 PSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEISYIEAELINK 279
P +S PS + PS + S+ +P P + + P+ + + +INK
Sbjct: 170 PDVSSTSEHIPSSNTSPSTSEPVSTLRSPTPSSKSSSSLGLPDLSSTSDWSGMINK 225
>gnl|CDD|212073 cd11504, SLC5sbd_SMVT, Na(+)/multivitamin cotransporter SMVT and
related proteins; solute-binding domain. This
multivitamin transporter SMVT (product of the SLC5A6
gene) transports biotin, pantothenic acid and lipoate,
and is essential for mediating biotin uptake into
mammalian cells. SMVT is expressed in the placenta,
intestine, heart, brain, lung, liver, kidney and
pancreas. Biotin may regulate its own cellular uptake
through participation in holocarboxylase
synthetase-dependent chromatin remodeling events at SMVT
promoter loci. The cis regulatory elements, Kruppel-like
factor 4 and activator protein-2, regulate the activity
of the human SMVT promoter in the intestine.
Glycosylation of the hSMVT is important for its
transport function. This subgroup belongs to the solute
carrier 5 (SLC5) transporter family.
Length = 527
Score = 28.3 bits (63), Expect = 6.4
Identities = 12/55 (21%), Positives = 23/55 (41%), Gaps = 3/55 (5%)
Query: 209 GTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVT 263
G+ P P I+ + P + + +P ++P+ P P P + AL +
Sbjct: 450 GSFVFRMYSP--TKSPPINDTIFPNVTNATAATPTTLFATPTDPRP-PGLQALYS 501
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 28.3 bits (63), Expect = 6.5
Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 3/52 (5%)
Query: 212 PSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTP--APQPAMPAL 261
P P + ++ P + P+ P P+ A PS+ P AP PA P
Sbjct: 64 PLPAAAE-SIASPEVPPPVPPAPAQEGEAPAAEQPSAVPAPSAAPAPAEPVE 114
>gnl|CDD|187859 cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Protein of this family often fused to HTH domain; Some
proteins could have an additional fusion with
RecB-family nuclease domain; Core domain appears to have
a Rossmann-like fold; loosely associated with CRISPR/Cas
systems; also known as DxTHG family.
Length = 400
Score = 28.1 bits (63), Expect = 7.0
Identities = 14/49 (28%), Positives = 20/49 (40%), Gaps = 1/49 (2%)
Query: 265 PEQEISYIEAELINKCQKYIKFASSALNYDDYKEAKSNLIKVLNILNTG 313
P EISY++ + INK + K E +IK + L G
Sbjct: 193 PVTEISYLKEKFINKYKDNNKDLKRYFKK-ALNEGLEKIIKFASALYNG 240
>gnl|CDD|215732 pfam00121, TIM, Triosephosphate isomerase.
Length = 242
Score = 27.6 bits (62), Expect = 7.1
Identities = 13/35 (37%), Positives = 17/35 (48%), Gaps = 10/35 (28%)
Query: 127 QTFGETSEDIEQNKKYAKWKAAYIHNCLKSGETPI 161
Q FGET E + KK + LK+G TP+
Sbjct: 97 QYFGETDEIV--AKK--------VKAALKNGLTPV 121
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 28.1 bits (63), Expect = 7.5
Identities = 11/56 (19%), Positives = 20/56 (35%), Gaps = 7/56 (12%)
Query: 202 PAPTSSGGTLPSPPSMPQNLPGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPA 257
PA + P+P M S++ P + + P P++P P +
Sbjct: 272 PATAA-----PAPSRMTDTNDSKSVTSQ--PAAAAAAPAPPPNPPATPPEPPARRG 320
>gnl|CDD|221617 pfam12517, DUF3720, Protein of unknown function (DUF3720). This
domain family is found in eukaryotes, and is
approximately 100 amino acids in length. There are two
completely conserved A residues that may be functionally
important.
Length = 98
Score = 26.6 bits (59), Expect = 7.9
Identities = 16/64 (25%), Positives = 25/64 (39%), Gaps = 5/64 (7%)
Query: 166 ESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGGTLPSPPSMPQNLPGPS 225
E+G+ G +++ + P +N T AP GG + P+ PGP
Sbjct: 7 EAGQELSGVPDPLEDTKESSP----DNKLGANTEGSGAPGLGGGGVAGQPA-AALPPGPG 61
Query: 226 ISGP 229
SG
Sbjct: 62 GSGT 65
>gnl|CDD|235665 PRK05996, motB, flagellar motor protein MotB; Validated.
Length = 423
Score = 27.7 bits (62), Expect = 8.1
Identities = 9/57 (15%), Positives = 19/57 (33%), Gaps = 1/57 (1%)
Query: 213 SPPSMPQNLPGPSISGPLIPGPS-IPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQE 268
P + + + L PG + + + + T + + P P A + E
Sbjct: 182 DPDFWSKQVEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATE 238
>gnl|CDD|240628 cd05303, PGDH_2, Phosphoglycerate dehydrogenase (PGDH) NAD-binding
and catalytic domains. Phosphoglycerate dehydrogenase
(PGDH) catalyzes the initial step in the biosynthesis of
L-serine from D-3-phosphoglycerate. PGDH comes in 3
distinct structural forms, with this first group being
related to 2-hydroxy acid dehydrogenases, sharing
structural similarity to formate and glycerate
dehydrogenases. PGDH in E. coli and Mycobacterium
tuberculosis form tetramers, with subunits containing a
Rossmann-fold NAD binding domain. Formate/glycerate and
related dehydrogenases of the D-specific 2-hydroxyacid
dehydrogenase superfamily include groups such as formate
dehydrogenase, glycerate dehydrogenase, L-alanine
dehydrogenase, and S-Adenosylhomocysteine Hydrolase.
Despite often low sequence identity, these proteins
typically have a characteristic arrangement of 2 similar
subdomains of the alpha/beta Rossmann fold NAD+ binding
form. The NAD+ binding domain is inserted within the
linear sequence of the mostly N-terminal catalytic
domain, which has a similar domain structure to the
internal NAD binding domain. Structurally, these domains
are connected by extended alpha helices and create a
cleft in which NAD is bound, primarily to the C-terminal
portion of the 2nd (internal) domain. Some related
proteins have similar structural subdomain but with a
tandem arrangement of the catalytic and NAD-binding
subdomains in the linear sequence.
Length = 301
Score = 27.5 bits (62), Expect = 8.7
Identities = 13/39 (33%), Positives = 20/39 (51%), Gaps = 2/39 (5%)
Query: 272 IEAELINKCQ--KYIKFASSALNYDDYKEAKSNLIKVLN 308
+ E+I+ + K I A L+ D + AK IKV+N
Sbjct: 53 VTKEVIDAAKNLKIIARAGVGLDNIDVEYAKKKGIKVIN 91
>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus. This family represents
the C-terminus of eukaryotic neogenin precursor
proteins, which contains several potential
phosphorylation sites. Neogenin is a member of the N-CAM
family of cell adhesion molecules (and therefore
contains multiple copies of pfam00047 and pfam00041) and
is closely related to the DCC tumour suppressor gene
product - these proteins may play an integral role in
regulating differentiation programmes and/or cell
migration events within many adult and embryonic
tissues.
Length = 295
Score = 27.6 bits (61), Expect = 9.0
Identities = 27/121 (22%), Positives = 34/121 (28%), Gaps = 16/121 (13%)
Query: 160 PIPGPPESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSF-----PAPTSSGGTLPSP 214
PI PS + Q P S V + S P P
Sbjct: 100 PIHTLDNPQYPGILPSPRCGYLHHQFSLRPMPFSTLTVQRLYQHGDRAESVESVRQTPEP 159
Query: 215 PSMPQNLPGPSISGPLIPGPSIP-----------SFPSPAVTPSSPSTPAPQPAMPALVT 263
P +P S + P SIP SF PA+ S + P+ P L
Sbjct: 160 PYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLKSFAVPALPASMSTIEPKLPSTPLLTQ 219
Query: 264 Q 264
Q
Sbjct: 220 Q 220
>gnl|CDD|202284 pfam02561, FliS, Flagellar protein FliS. FliS is coded for by the
FliD operon and is transcribed in conjunction with FliD
and FliT, however this protein has no known function.
Length = 122
Score = 26.7 bits (59), Expect = 9.0
Identities = 9/35 (25%), Positives = 18/35 (51%)
Query: 276 LINKCQKYIKFASSALNYDDYKEAKSNLIKVLNIL 310
L +K K + A+ + +D ++ N+ K NI+
Sbjct: 25 LYDKAIKALVRAAEFIENEDIEKKNENISKAQNII 59
>gnl|CDD|234178 TIGR03354, VI_FHA, type VI secretion system FHA domain protein.
Members of this protein family are FHA
(forkhead-associated) domain-containing proteins that
are part of type VI secretion loci in a considerable
number of bacteria, most of which are known pathogens.
Species include Pseudomonas aeruginosa PAO1, Aeromonas
hydrophila, Yersinia pestis, Burkholderia mallei, etc
[Protein fate, Protein and peptide secretion and
trafficking, Cellular processes, Pathogenesis].
Length = 396
Score = 27.7 bits (62), Expect = 9.3
Identities = 13/75 (17%), Positives = 21/75 (28%), Gaps = 10/75 (13%)
Query: 157 GETPIPGP-------PESGEGSEGPSLSSQNSNDQLPSPPRENSNEDVTPSFPAPTSSGG 209
G P P P P E S + + L +P + + +
Sbjct: 125 GGPPTPDPAPLAQLDPLKALDQEPLSAADLDD---LSAPLFPPLDARLPAFAAPIDAEPT 181
Query: 210 TLPSPPSMPQNLPGP 224
+P +P P P
Sbjct: 182 MVPPFVPLPAPEPAP 196
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 27.9 bits (62), Expect = 9.5
Identities = 12/48 (25%), Positives = 17/48 (35%)
Query: 222 PGPSISGPLIPGPSIPSFPSPAVTPSSPSTPAPQPAMPALVTQPEQEI 269
PG + PG P+ P P + PA P P + E+
Sbjct: 906 PGGDAASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAEL 953
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.128 0.375
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,659,244
Number of extensions: 1446068
Number of successful extensions: 3371
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2381
Number of HSP's successfully gapped: 507
Length of query: 316
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 219
Effective length of database: 6,635,264
Effective search space: 1453122816
Effective search space used: 1453122816
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 59 (26.7 bits)