RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780772|ref|YP_003065185.1| surface antigen (D15) [Candidatus Liberibacter asiaticus str. psy62] (781 letters) >gnl|CDD|163210 TIGR03303, OM_YaeT, outer membrane protein assembly complex, YaeT protein. Members of this protein family are the YaeT protein of the YaeT/YfiO complex for assembling proteins into the outer membrane of Gram-negative bacteria. This protein is similar in sequence and function to a non-essential paralog, YtfM, that is also in the Omp85 family. Members of this family typically have five tandem copies of the surface antigen variable number repeat (pfam07244), followed by an outer membrane beta-barrel domain (pfam01103), while the YtfM family typically has a single pfam07244 repeat. Length = 741 Score = 566 bits (1460), Expect = e-161 Identities = 251/748 (33%), Positives = 380/748 (50%), Gaps = 23/748 (3%) Query: 47 IVRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSNVKIKIVDSVL 106 +V+ I + G V + +L+ +PV VG +ISD +D A+K +YA GYF +VKI+ VL Sbjct: 2 VVKDIRVEGLQRVSEGTVLNYLPVKVGDTISDEKIDEAIKALYATGYFEDVKIEREGGVL 61 Query: 107 IIDLIERKIINHLFFSGNNNLKDDQLKMIVRS-RSAAAYDEDTVNADVHNIKQAYASIGY 165 +I + ER IIN + FSGN +K DQLK + + ++ + D +K+ Y S G Sbjct: 62 VIKVKERPIINSIVFSGNKEIKKDQLKKALVGIKKGEIFNRAKLEKDEKALKEFYRSRGK 121 Query: 166 LNVMVKVQHHSISPTTLNITYVIEEGVKAKINSIRFVGNKNYSHARLERVISIRTSGYFS 225 N V+ + + +++ + I+EG KAKI I FVGNK +S + L V +++ S ++S Sbjct: 122 YNAKVEAKVTPLPRNRVDLEFNIKEGKKAKIKKINFVGNKAFSDSELRDVFALKESNFWS 181 Query: 226 -FGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKV-SSQVLFDKQKSGYVLIFQIDEGEIY 283 F D YSK+++ D E +R+FY +RGY KV S+QV K G + + I EGE Y Sbjct: 182 WFTGDDKYSKQKLEADLELLRSFYLNRGYLDFKVESTQVSITPDKKGVYITYNIKEGEQY 241 Query: 284 TVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFVRVKTR 343 G ++I+ L ++ L L++I+ G ++N ++ + + I F V R Sbjct: 242 KFGEVTIEGDLIGPDEE-LKKLLKIKKGEVFNRSKVTKIVKAIKDLLGEKGYAFANVNPR 300 Query: 344 INRDFAKRIVDIEYLIDQGSPLYVKRIEIEGNDQSYDSVIRRELELSEGDPINYSMIERA 403 + + VD+ + +D G +YV+RI I GN ++ D VIRREL EGD + S I+R+ Sbjct: 301 PQINDENKTVDLTFKVDPGKRVYVRRINISGNTRTRDEVIRRELRQLEGDWYSLSKIKRS 360 Query: 404 KRRIMATGYFSEVNISQLPANDVSDYVILRVSVKQLSAGSVGIATNYEVDKGMGVEGHID 463 KRR+ GYF VNI +P D V L V VK+ GS+ Y G+ I Sbjct: 361 KRRLERLGYFETVNIETVPVGS-PDQVDLNVKVKEQPTGSISFGVGYGSSSGLSFNASIS 419 Query: 464 DNNFFGQGYRARLAAGFGRHAVQNYTFSVEDPYFLGSPISAGFDLQKTH-----LEDGSL 518 + NF G G R L+A +Y+ S DPYF +S GF + + Sbjct: 420 ERNFLGTGNRLSLSANKSSL-STSYSLSFTDPYFTDDGVSLGFSIFYSETDRNYKNFSDY 478 Query: 519 DINDESAAVRMIVPITESISTSFKYDLRFLQYGAISEKEKIPSIYTTLIEH--GKFSSHS 576 ++ + PITE + S Y + I S + I+ GKF S Sbjct: 479 KTKTYGGSINLGYPITEYLRVSLGYG---YEQNKIKNSSDSDSSASYFIKEQGGKFIDSS 535 Query: 577 ISQSIIYNTLDNPIVPRKGMLISSSYDYAGFGGDSQYHRIGSRASYFYLLSDDSDIVGSL 636 +S Y+TLD+ P KG + S ++AG GGD +Y+++ + Y+ LS + D V SL Sbjct: 536 LSYGWSYDTLDSGYFPTKGSIQRLSQEFAGPGGDLKYYKLTYDSEYYIPLSKEDDWVLSL 595 Query: 637 RF--GYGCVIPSNKNLQLFDQFSVSSNYYLRGFAYKGIGPR--VDKKYAIGGKIYSSASA 692 R GYG K+L +++F +RGF GIGPR D +IGG ++A+ Sbjct: 596 RGRLGYGNGY-GGKDLPFYERFYAGGIGSVRGFESNGIGPRDINDSGDSIGGNAMATANV 654 Query: 693 AVSFPMPLVPERAGLRGAFFVDSATLYANHVALGADKLEGNDSFWRVSTGVEIMWNSPLG 752 + FP+P +PE GLRG+ F D+ ++ D +DS R S GV + W SP+G Sbjct: 655 ELIFPLPFLPEDNGLRGSVFFDAGNVWGTDQKKEGD--YSDDSSLRASVGVGLRWISPMG 712 Query: 753 MMGVYYGIPLRHREGDKIQQFGFRIGNR 780 + Y PL+ ++GDK Q F F IG Sbjct: 713 PLRFSYAKPLKKKKGDKTQSFQFSIGTT 740 >gnl|CDD|182941 PRK11067, PRK11067, outer membrane protein assembly factor YaeT; Provisional. Length = 803 Score = 271 bits (695), Expect = 4e-73 Identities = 209/797 (26%), Positives = 353/797 (44%), Gaps = 60/797 (7%) Query: 34 IFGLSAVYGSNTSIVRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGY 93 +F + VYG+ +V+ I G V L +PV VG +++D D+ + ++ ++A G Sbjct: 12 LFSSATVYGAEGFVVKDIHFEGLQRVAVGAALLSMPVRVGDTVNDEDISNTIRALFATGN 71 Query: 94 FSNVKIKIVDSVLIIDLIERKIINHLFFSGNNNLKDDQLKMIVRS---RSAAAYDEDTVN 150 F +V++ + L++ + ER I + FSGN ++KDD LK + + R A D T++ Sbjct: 72 FEDVRVLRDGNTLLVQVKERPTIASITFSGNKSVKDDMLKQNLEASGVRVGEALDRTTLS 131 Query: 151 ADVHNIKQAYASIGYLNVMVKVQHHSISPTTLNITYVIEEGVKAKINSIRFVGNKNYSHA 210 ++ Y S+G + VK + +++ V EGV AKI I VGN +S Sbjct: 132 NIEKGLEDFYYSVGKYSASVKAVVTPLPRNRVDLKLVFTEGVSAKIQQINIVGNHAFSTD 191 Query: 211 RLERVISIRTS-GYFSFGKTDVYSKERMSFDEEAIRAFYHDRGYAAVKV-SSQVLFDKQK 268 L +R +++ Y K++++ D E +R++Y DRGYA + S+QV K Sbjct: 192 ELISRFQLRDEVPWWNVVGDRKYQKQKLAGDLETLRSYYLDRGYARFNIDSTQVSLTPDK 251 Query: 269 SGYVLIFQIDEGEIYTVGNISIQSTL----QEIQKKTLLSLIRIRSGNLYNPQEIKESSE 324 G + I EG+ Y + + + L EI+K L +I G LYN ++ + + Sbjct: 252 KGIYITVNITEGDQYKLSGVQVSGNLAGHSAEIEK-----LTKIEPGELYNGTKVTKMED 306 Query: 325 KISKYFFSGERPFVRVKTRINRDFAKRIVDIEYLIDQGSPLYVKRIEIEGNDQSYDSVIR 384 I K + RV+T+ + A + V + +D G+ YV++I EGND S D+V+R Sbjct: 307 DIKKLLGRYGYAYPRVQTQPEINDADKTVKLHVNVDAGNRFYVRKIRFEGNDTSKDAVLR 366 Query: 385 RELELSEGDPINYSMIERAKRRIMATGYFSEVNISQLPANDVSDYVILRVSVKQLSAGSV 444 RE+ EG + ++E+ K R+ G+F V++ D V + VK+ + GS Sbjct: 367 REMRQMEGAWLGSDLVEQGKERLNRLGFFETVDVDTQRVPGSPDQVDVVYKVKERNTGSF 426 Query: 445 GIATNYEVDKGMGVEGHIDDNNFFGQGYRARLAAGFGRHAVQNYT-FSVEDPYFLGSPIS 503 Y + G+ + + +N+ G G + ++ Q Y SV DPYF +S Sbjct: 427 NFGVGYGTESGVSFQAGVQQDNWLGTGNSVGING--TKNDYQTYAELSVTDPYFTVDGVS 484 Query: 504 AGFDLQKTHLEDGSLDIND---ESAAVRMIV--PITE--SISTSFKY---DLRFLQ-YGA 552 G + E D++D +S + + PI E S+ Y L +Q A Sbjct: 485 LGGRIFYNDFEADDADLSDYTNKSYGTDVTLGFPINENNSLRAGLGYVHNSLSNMQPQVA 544 Query: 553 ISEKEKIPSIYTTLIEHGKFSSH--SISQSIIYNTLDNPIVPRKGMLISSSYDYAGFGGD 610 + + F + + + YN LD P G ++ + G D Sbjct: 545 MWRYLYSMGENPSSDSDNSFKTDDFTFNYGWTYNNLDRGYFPTAGNRVNLTGKVTIPGSD 604 Query: 611 SQYHRIGSRASYFYLLSDDSDIV--GSLRFGYGCVIPSNKNLQLFDQFSVSSNYYLRGFA 668 ++Y+++ + + L DD V G R GYG + K + ++ F + +RGF Sbjct: 605 NEYYKVTLDTAQYVPLDDDHKWVLLGRGRLGYGDGL-GGKEMPFYENFYAGGSSTVRGFQ 663 Query: 669 YKGIGPRV--------------DKKY-----AIGGKIYSSASAAVSFPMPLVPERAG--L 707 IGP+ K A+GG + AS + P P + ++ + Sbjct: 664 SNTIGPKAVYYPGQASNYTQDGAKDLCKSDDAVGGNAMAVASLELITPTPFISDKYANSV 723 Query: 708 RGAFFVDSATL----YANHVALGADKLE--GNDSFWRVSTGVEIMWNSPLGMMGVYYGIP 761 R +FF D T+ + N A + S R+S G+ + W SPLG + Y P Sbjct: 724 RTSFFWDMGTVWDTNWDNTAYTRAAGYPDYSDPSNIRMSAGIALQWMSPLGPLVFSYAQP 783 Query: 762 LRHREGDKIQQFGFRIG 778 + +GDK +QF F IG Sbjct: 784 FKKYDGDKAEQFQFNIG 800 >gnl|CDD|148699 pfam07244, Surf_Ag_VNR, Surface antigen variable number repeat. This family is found primarily in bacterial surface antigens, normally as variable number repeats at the N-terminus. The C-terminus of these proteins is normally represented by pfam01103. The alignment centres on a -GY- or -GF- motif. Some members of this family are found in the mitochondria. It is predicted to have a mixed alpha/beta secondary structure. Length = 78 Score = 51.5 bits (124), Expect = 8e-07 Identities = 23/88 (26%), Positives = 36/88 (40%), Gaps = 11/88 (12%) Query: 194 AKINSIRFVGN-KNYSHARLERVISIRTSGYFSFGKTDVYSKERMSFDEEAIRAFYHDRG 252 KI I GN K L R + ++ DV+++E++ D E + +Y G Sbjct: 1 YKIKDINIEGNLKRTKDEVLRRELKLKEG--------DVFNREKLEEDIERLYDYYGALG 52 Query: 253 YAAVKVSSQVLFDKQKSGYVLIFQIDEG 280 YA V + + L F +DEG Sbjct: 53 YAFADVKVEP--EPDPKTVDLTFNVDEG 78 Score = 50.4 bits (121), Expect = 2e-06 Identities = 26/72 (36%), Positives = 35/72 (48%), Gaps = 6/72 (8%) Query: 366 YVKRIEIEGNDQSY-DSVIRRELELSEGDPINYSMIERAKRRI----MATGY-FSEVNIS 419 +K I IEGN + D V+RREL+L EGD N +E R+ A GY F++V + Sbjct: 2 KIKDINIEGNLKRTKDEVLRRELKLKEGDVFNREKLEEDIERLYDYYGALGYAFADVKVE 61 Query: 420 QLPANDVSDYVI 431 P D Sbjct: 62 PEPDPKTVDLTF 73 Score = 47.7 bits (114), Expect = 1e-05 Identities = 18/78 (23%), Positives = 39/78 (50%), Gaps = 2/78 (2%) Query: 115 IINHLFFSGNNN-LKDDQLKMIVRSRSAAAYDEDTVNADVHNIKQAYASIGYLNVMVKVQ 173 I + GN KD+ L+ ++ + ++ + + D+ + Y ++GY VKV+ Sbjct: 2 KIKDINIEGNLKRTKDEVLRRELKLKEGDVFNREKLEEDIERLYDYYGALGYAFADVKVE 61 Query: 174 HHSISPTTLNITYVIEEG 191 P T+++T+ ++EG Sbjct: 62 -PEPDPKTVDLTFNVDEG 78 Score = 46.9 bits (112), Expect = 2e-05 Identities = 18/80 (22%), Positives = 45/80 (56%), Gaps = 2/80 (2%) Query: 283 YTVGNISIQSTLQEIQKKTLLSLIRIRSGNLYNPQEIKESSEKISKYFFSGERPFVRVKT 342 Y + +I+I+ L+ + + L ++++ G+++N ++++E E++ Y+ + F VK Sbjct: 1 YKIKDINIEGNLKRTKDEVLRRELKLKEGDVFNREKLEEDIERLYDYYGALGYAFADVKV 60 Query: 343 RINRDFAKRIVDIEYLIDQG 362 D + VD+ + +D+G Sbjct: 61 EPEPD--PKTVDLTFNVDEG 78 >gnl|CDD|130065 TIGR00992, 3a0901s03IAP75, chloroplast envelope protein translocase, IAP75 family. Two families of proteins are involved in the chloroplast envelope import appartus.They are the three proteins of the outer membrane (TOC) and four proteins in the inner membrane (TIC). This family is specific for the TOC IAP75 protein. Length = 718 Score = 46.1 bits (109), Expect = 3e-05 Identities = 36/144 (25%), Positives = 61/144 (42%), Gaps = 9/144 (6%) Query: 372 IEGNDQSYDSVIRREL--ELSEGDPINYSMIERAKRRIMATGYFSEVNISQLPANDVSDY 429 +EGN + D V+ REL +L GD N + A + I + G FS + ++ P Sbjct: 281 VEGNTR--DPVVTRELPKQLKPGDVFNIEAGKTALQNINSLGLFSNIEVNPRPDEMNEGE 338 Query: 430 VILRVSVKQLSAGSVGIATNYEVDKGMGVEGHIDDNNFFGQ-GYRARLAAGFGRHAVQNY 488 +I+ + +K+L S ++T + + G G + + G + R G R + Sbjct: 339 IIVEIKLKELEQKSAEVSTEWSIVPGRGGRPTLASSQPGGTITFEHRNLQGLNR----SL 394 Query: 489 TFSVEDPYFLGSPISAGFDLQKTH 512 SV FL F ++ TH Sbjct: 395 GGSVTTSNFLNPQDDLLFKVEYTH 418 >gnl|CDD|178683 PLN03138, PLN03138, Protein TOC75; Provisional. Length = 796 Score = 33.3 bits (76), Expect = 0.25 Identities = 23/88 (26%), Positives = 43/88 (48%), Gaps = 4/88 (4%) Query: 372 IEGNDQSYDSVIRREL--ELSEGDPINYSMIERAKRRIMATGYFSEVNISQLPANDVSDY 429 +EGN Q +I REL +L +G N ++A R I + FS + ++ P Sbjct: 359 VEGNTQ--LPIIDRELPKQLRQGHIFNIEAGKQALRNINSLALFSNIEVNPRPDEKNEGG 416 Query: 430 VILRVSVKQLSAGSVGIATNYEVDKGMG 457 +++ + +K+L S ++T + + G G Sbjct: 417 IVVEIKLKELEPKSAEVSTEWSIVPGRG 444 >gnl|CDD|184070 PRK13473, PRK13473, gamma-aminobutyraldehyde dehydrogenase; Provisional. Length = 475 Score = 31.8 bits (73), Expect = 0.61 Identities = 18/61 (29%), Positives = 27/61 (44%), Gaps = 17/61 (27%) Query: 48 VRRIEIRGATNVGKEVILS--------------RIPVVVGQSISDADLDHAVKNIYAMGY 93 VR + + G+ GK V+ + + PV+V DADLD V+ I GY Sbjct: 217 VRMVSLTGSIATGKHVLSAAADSVKRTHLELGGKAPVIV---FDDADLDAVVEGIRTFGY 273 Query: 94 F 94 + Sbjct: 274 Y 274 >gnl|CDD|173541 PTZ00348, PTZ00348, tyrosyl-tRNA synthetase; Provisional. Length = 682 Score = 30.6 bits (69), Expect = 1.7 Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 7/56 (12%) Query: 109 DLIERK----IINHLFFSGNNNLKDDQLKMIVRSRSAAAYDEDTVNADVHNIKQAY 160 DLI RK I++H +G LK Q KM +A + EDT I+QAY Sbjct: 197 DLIGRKLKPVILSHHMLAG---LKQGQAKMSKSDPDSAIFMEDTEEDVARKIRQAY 249 >gnl|CDD|132488 TIGR03447, mycothiol_MshC, cysteine--1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase. Members of this protein family are MshC, l-cysteine:1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase, an enzyme that uses ATP to ligate a Cys residue to a mycothiol precursor molecule, in the second to last step in mycothiol biosynthesis. This enzyme shows considerable homology to Cys--tRNA ligases, and many instances are misannotated as such. Mycothiol is found in Mycobacterium tuberculosis, Corynebacterium glutamicum, Streptomyces coelicolor, and various other members of the Actinobacteria. Mycothiol is an analog to glutathione. Length = 411 Score = 30.1 bits (68), Expect = 2.2 Identities = 34/127 (26%), Positives = 50/127 (39%), Gaps = 16/127 (12%) Query: 460 GHIDDNNFFGQGYRARLAAGFGRHAVQNYTFSVEDPYFLGSPISAGFDLQKTHLEDGSLD 519 GH F R AG H VQN T V+DP F G D + E G+ Sbjct: 53 GHAATYLTFDLVNRVWRDAGHRVHYVQNVT-DVDDPLF-ERAERDGVDWR----ELGTSQ 106 Query: 520 IN---DESAAVRMIVP-----ITESISTSFKYDLRFLQYGA--ISEKEKIPSIYTTLIEH 569 I+ ++ A+R++ P ESI + + L GA E + P +Y ++ Sbjct: 107 IDLFREDMEALRVLPPRDYIGAVESIDEVIEMVEKLLAAGAAYEVEGPEYPDVYFSIEAT 166 Query: 570 GKFSSHS 576 +F S Sbjct: 167 EQFGYES 173 >gnl|CDD|177917 PLN02278, PLN02278, succinic semialdehyde dehydrogenase. Length = 498 Score = 29.7 bits (67), Expect = 2.8 Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 17/62 (27%) Query: 47 IVRRIEIRGATNVGKEVI------LSRI--------PVVVGQSISDADLDHAVKNIYAMG 92 VR+I G+T VGK+++ + R+ P +V DADLD AVK A Sbjct: 239 KVRKITFTGSTAVGKKLMAGAAATVKRVSLELGGNAPFIV---FDDADLDVAVKGALASK 295 Query: 93 YF 94 + Sbjct: 296 FR 297 >gnl|CDD|149506 pfam08478, POTRA_1, POTRA domain, FtsQ-type. FtsQ/DivIB bacterial division proteins (pfam03799) contain an N-terminal POTRA domain (for polypeptide-transport-associated domain). This is found in different types of proteins, usually associated with a transmembrane beta-barrel. FtsQ/DivIB may have chaperone-like roles, which has also been postulated for the POTRA domain in other contexts. Length = 67 Score = 29.4 bits (67), Expect = 3.1 Identities = 14/67 (20%), Positives = 31/67 (46%), Gaps = 1/67 (1%) Query: 48 VRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNIYAMGYFSNVKI-KIVDSVL 106 +R++E+ G +V E I + + G S DL+ + + + + + + + L Sbjct: 1 IRQVEVSGNKHVSAEEIRKALGIQKGTSFFSVDLNAIEDRLEKLPWIKSASVRRQWPNTL 60 Query: 107 IIDLIER 113 I ++ER Sbjct: 61 EIRVVER 67 >gnl|CDD|131337 TIGR02284, TIGR02284, conserved hypothetical protein. Members of this protein family are found mostly in the Proteobacteria, although one member is found in the the marine planctomycete Pirellula sp. strain 1. The function is unknown. Length = 139 Score = 29.4 bits (66), Expect = 3.2 Identities = 16/57 (28%), Positives = 24/57 (42%), Gaps = 7/57 (12%) Query: 109 DLIERKIINHLFF-SGNNNLKDDQLKMIVRSRSAAAYDEDTVNADVHNIKQAYASIG 164 DLIE I F +KD +L + R + +A V ++Q AS+G Sbjct: 7 DLIEISIDGKDGFEESAEEVKDPELATLFRRIA------GEKSAIVSELQQVVASLG 57 >gnl|CDD|185597 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional. Length = 651 Score = 29.6 bits (67), Expect = 3.4 Identities = 10/29 (34%), Positives = 18/29 (62%) Query: 1 MHKSTEDFRRIKRLLEKYFPRSFQMGFII 29 M KS ++F I++ L KY R ++ F++ Sbjct: 316 MSKSLKNFITIRQALSKYTARQIRLLFLL 344 >gnl|CDD|183848 PRK13024, PRK13024, bifunctional preprotein translocase subunit SecD/SecF; Reviewed. Length = 755 Score = 29.4 bits (67), Expect = 3.6 Identities = 16/68 (23%), Positives = 29/68 (42%), Gaps = 9/68 (13%) Query: 26 GFIILFYAIFGLSAVYG-SNTSIVRRI----EIRGATNVGKEVILSRIPVVVGQSISDAD 80 ++I F + L +Y S + + I +++G G EV+L P G+ Sbjct: 5 RWVIFFLTLLLLIGIYYASPKFVTKNINLGLDLQG----GFEVLLEVSPADKGKKADREA 60 Query: 81 LDHAVKNI 88 LD V+ + Sbjct: 61 LDSTVEVL 68 >gnl|CDD|150150 pfam09381, Porin_OmpG, Outer membrane protein G (OmpG). Porins are channel proteins in the outer membrane of gram negative bacteria which mediate the uptake of molecules required for growth and survival. Escherichia coli OmpG forms a 14 stranded beta-barrel and in contrast to most porins, appears to function as a monomer. The central pore of OmpG is wider than other E. coli porins and it is speculated that it may form a non-specific channel for the transport of larger oligosaccharides. Length = 280 Score = 29.3 bits (65), Expect = 3.9 Identities = 14/47 (29%), Positives = 24/47 (51%) Query: 443 SVGIATNYEVDKGMGVEGHIDDNNFFGQGYRARLAAGFGRHAVQNYT 489 +V + NY +++G ++ ++ F Q RA L G H+V YT Sbjct: 164 TVALRVNYYLERGFNMDDSRNNGEFSTQEIRAYLPLTLGNHSVTPYT 210 >gnl|CDD|162254 TIGR01213, TIGR01213, conserved hypothetical protein TIGR01213. Members of this family show twilight-zone similarity to several predicted RNA pseudouridine synthases. All trusted members of this family are archaeal. Several eukaryotic homologs lack N-terminal homology including two CXXC motifs. Length = 388 Score = 29.3 bits (66), Expect = 4.0 Identities = 18/60 (30%), Positives = 25/60 (41%), Gaps = 3/60 (5%) Query: 316 PQEIKESSEKISKYFFSGERPFVRVKTRINRDFAKRIVD-IEYLIDQGSPLYVKRIEIEG 374 P ++ E E+I K F SG +K NR+ K V +D P V +E E Sbjct: 92 PPDVIELEEEIRKEFGSGGGE--SIKREFNREVGKLFVKRTGKEVDFERPDLVIMVEFEY 149 >gnl|CDD|162158 TIGR01015, hmgA, homogentisate 1,2-dioxygenase. Missing in human disease alkaptonuria. Length = 429 Score = 29.0 bits (65), Expect = 4.6 Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 1/29 (3%) Query: 539 TSFKYDL-RFLQYGAISEKEKIPSIYTTL 566 +KYDL RF ++S PSI+T L Sbjct: 269 VPYKYDLKRFNVINSVSFDHPDPSIFTVL 297 >gnl|CDD|177778 PLN00184, PLN00184, aquaporin NIP1; Provisional. Length = 296 Score = 28.8 bits (64), Expect = 4.9 Identities = 19/62 (30%), Positives = 34/62 (54%), Gaps = 7/62 (11%) Query: 21 RSFQMGFIILFYAIFGLSAVYGSNTSIVRRIEIRGATNVGKEVILSRIPVVVGQSISDAD 80 ++F M FI+ FY +F +S V N +I E+ G +G V+L+ V++ +S A Sbjct: 179 QAFVMEFIVTFYLMFIISGVATDNRAIG---ELAGLA-IGSTVLLN---VLIAAPVSSAS 231 Query: 81 LD 82 ++ Sbjct: 232 MN 233 >gnl|CDD|117056 pfam08479, POTRA_2, POTRA domain, ShlB-type. The POTRA domain (for polypeptide-transport-associated domain) is found towards the N-terminus of ShlB family proteins (pfam03865). ShlB is important in the secretion and activation of the haemolysin ShlA. It has been postulated that the POTRA domain has a chaperone-like function over ShlA; it may fold back into the C-terminal beta-barrel channel. Length = 76 Score = 28.3 bits (64), Expect = 8.7 Identities = 17/73 (23%), Positives = 34/73 (46%), Gaps = 8/73 (10%) Query: 48 VRRIEIRGATNVGKEVILSRIPVVVGQSISDADLDHAVKNI----YAMGY-FSNVKI--- 99 ++ IE+ G T + + + + +G+ + ADL+ K + GY S V + Sbjct: 3 IKGIELEGNTLLPPADLAALLQPYLGRCLGLADLNQLAKALTNAYIERGYVTSRVYLPPQ 62 Query: 100 KIVDSVLIIDLIE 112 + D VL + ++E Sbjct: 63 DLSDGVLRLTVLE 75 >gnl|CDD|180212 PRK05702, flhB, flagellar biosynthesis protein FlhB; Reviewed. Length = 359 Score = 28.2 bits (64), Expect = 8.8 Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 5/32 (15%) Query: 383 IRRELELSEGDP-----INYSMIERAKRRIMA 409 ++ E + SEGDP I E A+RR+MA Sbjct: 227 VKDEHKQSEGDPEVKGRIRQLQREMARRRMMA 258 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.320 0.138 0.390 Gapped Lambda K H 0.267 0.0677 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 12,943,707 Number of extensions: 877139 Number of successful extensions: 1655 Number of sequences better than 10.0: 1 Number of HSP's gapped: 1614 Number of HSP's successfully gapped: 41 Length of query: 781 Length of database: 5,994,473 Length adjustment: 101 Effective length of query: 680 Effective length of database: 3,812,065 Effective search space: 2592204200 Effective search space used: 2592204200 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 61 (27.4 bits)