Draft Genome of the Filarial Nematode Parasite Brugia malayi

I've Read This
  • 1 View
REPORTS
    GenBank (EF588824 to EF588901); the Brugia malayi contigs are available in GenBank (DS237653, DS238272, DS238705, DS239028, DS239057, DS239291, DS239377, and DS239315); the Brugia malayi scaffolds are available in GenBank (AAQA01000958, AAQA01000097, AAQA01001500, AAQA01000425, AAQA01001819, AAQA01001498, AAQA01000384, AAQA01001952, AAQA01000736, AAQA01000571, and AAQA01000369); and the microarray primers sequences that were used for RT-PCR are deposited in ArrayExpress (A-TIGR-28). This work was supported by an NSF grant to J.H.W. and H.T., a National Institute of Allergy and Infectious Diseases grant to E.G., and New England Biolabs Incorporated support to B.E.S. Fig. S1 Tables S1 to S5 References 13 March 2007; accepted 2 July 2007 Published online 30 August 2007; 10.1126/science.1142490 Include this information when citing this paper.
    
    Supporting Online Material
    www.sciencemag.org/cgi/content/full/1142490/DC1 Materials and Methods
    
    Draft Genome of the Filarial Nematode Parasite Brugia malayi
    Elodie Ghedin,1,2# Shiliang Wang,2 David Spiro,2 Elisabet Caler,2 Qi Zhao,2 Jonathan Crabtree,2 Jonathan E. Allen,2* Arthur L. Delcher,2† David B. Guiliano,3 Diego Miranda-Saavedra,4‡ Samuel V. Angiuoli,2 Todd Creasy,2 Paolo Amedeo,2 Brian Haas,2 Najib M. El-Sayed,2§ Jennifer R. Wortman,2 Tamara Feldblyum,2 Luke Tallon,2 Michael Schatz,2† Martin Shumway,2 Hean Koo,2 Steven L. Salzberg,2† Seth Schobel,2 Mihaela Pertea,2† Mihai Pop,2† Owen White,2 Geoffrey J. Barton,4 Clotilde K. S. Carlow,5 Michael J. Crawford,6 Jennifer Daub,7|| Matthew W. Dimmic,6 Chris F. Estes,8 Jeremy M. Foster,5 Mehul Ganatra,5 William F. Gregory,7 Nicholas M. Johnson,9 Jinming Jin,10 Richard Komuniecki,11 Ian Korf,12 Sanjay Kumar,5 Sandra Laney,13 Ben-Wen Li,14 Wen Li,13 Tim H. Lindblom,8 Sara Lustigman,15 Dong Ma,5 Claude V. Maina,5 David M. A. Martin,4 James P. McCarter,6,16 Larry McReynolds,10 Makedonka Mitreva,16 Thomas B. Nutman,17 John Parkinson,18 José M. Peregrín-Alvarez,1 Catherine Poole,5 Qinghu Ren,2 Lori Saunders,13 Ann E. Sluder,19 Katherine Smith,11 Mario Stanke,20 Thomas R. Unnasch,21 Jenna Ware,5 Aguan D. Wei,22 Gary Weil,14 Deryck J. Williams,7 Yinhua Zhang,5 Steven A. Williams,13 Claire Fraser-Liggett,2¶ Barton Slatko,5 Mark L. Blaxter,7 Alan L. Scott23 Parasitic nematodes that cause elephantiasis and river blindness threaten hundreds of millions of people in the developing world. We have sequenced the ~90 megabase (Mb) genome of the human filarial parasite Brugia malayi and predict ~11,500 protein coding genes in 71 Mb of robustly assembled sequence. Comparative analysis with the free-living, model nematode Caenorhabditis elegans revealed that, despite these genes having maintained little conservation of local synteny during ~350 million years of evolution, they largely remain in linkage on chromosomal units. More than 100 conserved operons were identified. Analysis of the predicted proteome provides evidence for adaptations of B. malayi to niches in its human and vector hosts and insights into the molecular basis of a mutualistic relationship with its Wolbachia endosymbiont. These findings offer a foundation for rational drug design. he phylum Nematoda is speciose and abundant and, although most species are free-living, many are parasitic. Over onethird of all humans, mainly in the developing world, carry a nematode infection. Parasitic worms typically cause chronic, debilitating infections that are often difficult to treat and that, despite the high cost to human health, have been neglected in biomedical research. Current knowledge of nematode molecular genetics and developmental biology is largely based on extensive studies of the free-living, bacteriovorous species Caenorhabditis elegans. Here, we present the initial analysis of the genome of the human filarial parasite Brugia malayi. Brugia malayi is endemic in Southeast Asia and Indonesia. Like other filarial nematodes, B. malayi develops through four larval stages into an adult male or female (fig. S1), entirely within one of two host species—a mosquito vector (Culex, Aedes, and Anopheles) and humans, where adult worms can live for more than a decade. B. malayi was chosen for wholegenome sequencing (1) because it is the only
    
    T
    
    major human filarial pathogen that can be maintained in small laboratory animals. Most filarial nematodes, including B. malayi, carry three genomes: nuclear, mitochondrial (available at GenBank, accession no. AF538716), and that of an alphaproteobacterial endosymbiont, Wolbachia. We present here the draft assembly and annotated genome of the TRS strain of B. malayi. We provide comparative analyses with Caenorhabditis and another well-annotated member of the superphylum Ecdysozoa, Drosophila melanogaster, to further illuminate the origins of novelty and loss of ancestral characters in the model species and the parasite. Comparative genome analysis reveals key features of Nematoda that define the scope of molecular diversity that has contributed to the success of the phylum. The analysis also uncovers adaptations that appear to have evolved in the B. malayi genome in response to the pressures of parasitism and to the presence of the parasite’s Wolbachia endosymbiont, wBm. The B. malayi nuclear genome is organized as five chromosomes (2), including an XY sexVOL 317 SCIENCE
    
    determination pair, and has been estimated to be 80 to 100 megabases (Mb) (3, 4). The sequence of the B. malayi nuclear genome was obtained to ~9× coverage with the use of whole-genome shotgun (WGS) sequencing (1, 5). The sequences were assembled into scaffolds totaling ~71 Mb of data with a further ~17.5 Mb of contigs not integrated into any scaffold (orphan contigs). The repeat content of the B. malayi genome, estimated at ~15% (1), may have contributed significantly to assembly difficulties (5, 6). From these sequence data, we estimate that the B. malayi genome is 90 to 95 Mb (Table 1) (5). In comparison, the C. elegans genome is 100 Mb and the Caenorhabditis briggsae genome 104 Mb. The overall G + C content (30.5%) is lower than that of C. elegans (35.4%) or C. briggsae (37.4%) (6). The complement of protein-coding genes was derived by automated gene prediction from the ~71-Mb assembly and by manual annotation of selected gene families (table S1). The 11,515 robustly predicted gene-coding regions occupy ~32% of the sequence at an average density of 162 genes/Mb (Table 1). After inclusion of genes estimated to be found in the unannotated portion of the genomic sequence (5), we infer that B. malayi has between 14,500 and 17,800 protein-coding genes, agreeing with previous estimates (7). Even the higher estimate is lower than the 19,762 (WormBase data release WS133) and 19,507 (6) genes reported for C. elegans and C. briggsae, respectively, which suggests that parasitic nematode genomes have fewer genes than their free-living counterparts, echoing a pattern observed in bacterial pathogens. For the six scaffolds longer than 1 Mb, totaling ~25 Mb of the genome, the arrangement of B. malayi genes was compared with that of their C. elegans orthologs (Fig. 1). Linkage is in general conserved: For large regions of the B. malayi genome, orthologs map predominantly to one (or, in the case of scaffold 14972, two) C. elegans chromosome(s) (Fig. 1, A to C), which indicates maintenance of linkage of these genes despite ~350 million years of separation (8). However, local gene order is not conserved (Fig. 1D). The largest, 6.5-Mb scaffold contains interdigitating blocks of genes that map to chromosomes 4 and X of C. elegans, which suggests there were ancient breakage and fusion events between linkage groups. These data support a model where within-linkage group rearrangements have been many times more common than between-linkage group transloca-
    
    1756
    
    21 SEPTEMBER 2007
    
    www.sciencemag.org
    
    Downloaded from www.sciencemag.org on September 25, 2007
    
    REPORTS
    tions (7, 9), a pattern that may be typical of nematode genomes (6, 10). Operons are a common form of gene organization in bacteria and some protozoa, but in Metazoa, operons have been identified only in nematodes, platyhelminths, and urochordates (11, 12). Using 1000 base pairs (bp) as the upDivision of Infectious Diseases, University of Pittsburgh School of Medicine, Pittsburgh, PA 15261, USA. 2The Institute for Genomic Research (TIGR) and the J. Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850, USA. 3Division of Cell and Molecular Biology, Biochemistry Building, Imperial College London, Exhibition Road, South Kensington, London, SW7 2AZ, UK. 4School of Life Sciences Research, University of Dundee, Dow Street, Dundee, DD1 5EH, UK. 5Division of Parasitology, New England BioLabs, Inc., 240 County Road, Ipswich, MA 01938, USA. 6Divergence, Inc., 893 North Warson Road, St. Louis, MO 63141, USA. 7Institute of Evolutionary Biology and Institute of Immunology and Infection Research, University of Edinburgh, Edinburgh EH9 3JT, UK. 8Division of Science, Lyon College, 2300 Highland Road, Batesville, AR 72501, USA. 9School of Biochemistry and Molecular Biology, The Australian National University, Canberra, ACT 0200, Australia. 10Division of RNA Biology, New England Biolabs, Inc., 240 County Road, Ipswich, MA 01938, USA. 11Department of Biological Sciences, University of Toledo, 2801 West Bancroft Street, Toledo, OH 43606–3390, USA. 12Genome Center, Section of Molecular Biology, Division of Biological Sciences, University of California at Davis, Davis, CA 95616, USA. 13Clark Science Center, Smith College, Northampton, MA 01063, USA. 14 Infectious Diseases Division, Washington University School of Medicine, 660 South Euclid Avenue, St. Louis, MO 63110, USA. 15Laboratory of Molecular Parasitology, Lindsley F. Kimball Research Institute, New York Blood Center, 310 East 67th Street, New York, NY 10021, USA. 16 Genome Sequencing Center, Washington University School of Medicine, 4444 Forest Park Avenue, St. Louis, MO 63108, USA. 17Laboratory of Parasitic Diseases, National Institute of Allergy and Infectious Diseases, National Institutes of Health, 6610 Rockledge Drive, Bethesda, MD 20892–6612, USA. 18Program in Molecular Structure and Function, Hospital for Sick Children, TMDT Building, 101 College Street, 15th Floor, East Tower, Toronto, ON, Canada, M5G 1L7. 19Cambria Biosciences, 8A Henshaw Street, Woburn, MA 01801, USA. 20Department of Bioinformatics, University of Göttingen, Goldschmidtstrasse 1, 37077 Göttingen, Germany. 21Division of Geographic Medicine, University of Alabama at Birmingham, BBRB 203, 1530 Third Avenue South, Birmingham, AL 35294–2170, USA. 22Department of Anatomy and Neurobiology, Washington University School of Medicine, 660 South Euclid Avenue, St. Louis, MO 63110, USA. 23 Department of Molecular Microbiology and Immunology, Bloomberg School of Public Health, Johns Hopkins University, 615 North Wolfe, Baltimore, MD 21205, USA. *Present address: Lawrence Livermore National Laboratory, 7000 East Avenue, Livermore, CA 94550, USA. †Present address: Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA. ‡Present address: Cambridge Institute for Medical Research, Wellcome Trust/MRC Building, Addenbrooke's Hospital, Hills Road, Cambridge CB2 0XY, UK. §Present address: Department of Cell Biology and Molecular Genetics, 2105 H. J. Patterson Hall, and Center for Bioinformatics and Computational Biology, 3115 Biomolecular Sciences Building, University of Maryland, College Park, MD 20742, USA. ∥Present address: The Wellcome Trust Sanger Institute, Hinxton Genome Campus, Cambridge CB10 1SA, UK. ¶Present address: Institute for Genome Sciences, University of Maryland School of Medicine, 800 West Baltimore Street, Baltimore, MD 21201, USA. #To whom correspondence should be addressed. E-mail: GhedinE@dom.pitt.edu
    1
    
    Table 1. General features and genome content of B. malayi, with C. elegans values for comparison. Scaffolds are also called super-contigs, they are assembled into large regions of ordered sequence fragments separated by gaps. Orphan-contigs are sequence fragments (contigs) that do not have information allowing them to be linked to other fragments and into scaffolds. Singletons are sequences that cannot be assembled into sequence fragments (contigs). N50 is a length-weighted average of contig or scaffold size, such that the average nucleotide in an assembly will appear in a contig (or scaffold) of N50 size or greater.
    Features Overall Estimated size of genome (Mb) Total number of bp of assembled sequence (bp) Number of scaffolds N50 of scaffolds (bp) Maximum length of scaffold (bp) Number of bp assembled into scaffolds (bp) Number of orphan contigs Number of bp assembled into orphan contigs (bp) Number of singletons Number of bp in singletons (bp) Protein-coding regions Percent of genome containing protein-coding sequence (%) Number of gene models Number of proteins Max/average protein length (amino acids) Gene density (genes per Mb) Number of exons Mean/median exon size (bp) Mean/median number of exons per gene Number of bp included in exons Number of introns Mean/median intron size (bp) Number of bp included in introns Mean length of intergenic region (bp) Overall G + C content (%) Exons, G + C content (%) Introns, G + C content (%) Intergenic regions, G + C content (%) Non–protein coding genes Transfer RNA (tRNA) genes (+ tRNA pseudogenes) 5S ribosomal RNA (found in scaffolds and orphan contigs)
    *This number includes seven pseudogenes.
    †
    
    B. malayi 90–95 88,363,057 8,180 93,771 6,534,162 70,837,048 18,868 17,526,009 176,099 108,289,205 17.84 11,515* 11,508 (9,839)† 9,445/371 162 83,672 159/140 7.27/5 13,282,846 72,157 311/219 22,512,502 3,783 30.5 39.6 27.6 30.9 ~233 (+26) ~400
    
    C. elegans
    
    18,563/440 228 307/147 6.38/6
    
    320/68 2218 35.4 42.9 29.1 32.5
    
    The number of proteins 100 amino acids long or larger.
    
    www.sciencemag.org
    
    SCIENCE
    
    VOL 317
    
    21 SEPTEMBER 2007
    
    1757
    
    Downloaded from www.sciencemag.org on September 25, 2007
    
    per limit of intergenic spacing, 838 potential operons (5), containing ~1800 genes (16% of the total; 2 to 5 genes per operon), were found in the assembled genome (Fig. 2 and fig. S2). Of these putative operons, only 10% of the gene pairs were also in operons in C. elegans (table S2). To obtain an estimate of the core complement of proteins that defines the phylum Nematoda, we compared the proteomes of B. malayi, C. elegans, and C. briggsae. Comparisons with the arthropod D. melanogaster were also made to help define a list of lineagerestricted genes. We identified 3979 sets of orthologs with representatives in all four species and 1726 sets of orthologs limited to the three nematode species (fig. S3A; tables S3 to S8). The average pairwise identity of B. malayi proteins with orthologs from either caenorhabditid species is ~ 48%. The genes conserved in nematodes but absent from the fly include cathepsin Z–like cysteine proteases, major sperm proteins, and cuticle collagens, as well
    
    as several families of unknown function. In addition, these orthologs were significantly enriched (2.4- to 4.4-fold; P > 0.0017) for genes with RNA interference (RNAi) phenotypes in C. elegans (fig. S4), which is consistent with a gene set essential to the core of nematode biochemisty and cell biology. These lineagerestricted families may define a molecular “bauplan” of Nematoda. As noted above, the B. malayi genome appears to have fewer genes than C. elegans. On examination, much of the disparity in gene numbers can be accounted for by the extent to which gene families in Brugia and Caenorhabditis have undergone lineage-specific expansion. More than 8% of the 5780 B. malayi–C. elegans ortholog clusters were expanded in C. elegans (fig. S3C). Comparing the occurrence of protein domains in B. malayi, C. elegans, C. briggsae, and D. melanogaster (figs. S3B and S5 and table S9) revealed, to our surprise, that B. malayi is in some ways more similar to the fly than to
    
    REPORTS
    Fig. 1. Genome organization of B. malayi in comparison with C. elegans. (A) Genes on the eight largest B. malayi scaffolds. The upper part of each scaffold cartoon represents genes in conserved operons (black) or B. malayi–specific operons (red) or genes not in operons (gray). The lower part indicates genes with orthologs in C. elegans, colored to indicate the C. elegans chromosome on which the ortholog is located. (B) Numbers of genes on the longest B. malayi scaffolds that have orthologs on each of the six C. elegans chromosomes. (C) Distribution on the six chromosomes of C. elegans of orthologs of B. malayi genes. Orthologs map to both the conserved, central cores of the autosomes and to the less well conserved genes on the autosome arms. (D) Relative arrangements of the genes on B. malayi scaffold 14979 and their orthologs on C. elegans chromosome 3. Forwardand reverse-strand genes are distinguished (forward on top, reverse below). Red rectangles correspond to putative B. malayi operons, and black rectangles correspond to conserved operon structures.
    
    the model worms. For example, B. malayi and D. melanogaster have similar numbers of genes of the most abundant domains, whereas several of the most abundant domains in the caenorhabditid nematodes rank much lower in or are absent from the filarial or fly genomes (fig. S5). For domains with high abundance in all four species, C. elegans tends to have 1.5- to 2-fold as many instances as do B. malayi or D. melanogaster (fig. S5). The distinctive biology of B. malayi is likely to be underpinned by novel proteins with unique functions. After extensive comparative analyses, 20% of the predicted proteins were found to be B. malayi-specific (fig. S6 and table S10). More than one-third of the 1977 hypothetical proteins found only in B. malayi were confirmed by B. malayi expressed sequence tags. These genes constitute an interesting list of initial candidates for functional studies of putatively filaria-specific gene products. The drugs used for treatment of filarial parasites, although effective in the short-term control of worm burden and transmission, require extended courses of treatment that have tradition-
    
    ally compromised their long-term effectiveness. Recently, issues of the emergence of drug resistance have become a concern (13, 14). From the genome sequence we can identify several systems likely to be fruitful targets for the discovery of additional drug targets. (i) Molting: The B. malayi genome contains many homologs of genes that encode molecules required for molting in C. elegans (15) including proteases, protease inhibitors, nuclear hormone receptors (NRs), cuticular collagens, and chitinases (table S11). (ii) Nuclear receptors: Twenty-seven members of the NR family were identified in the B. malayi genome including orthologs of Ecr (not present in the caenorhabditids) and other NRs acting in the D. melanogaster ecdysone-response cascade (table S12). (iii) Collagens and collagen processing: B. malayi has ~82 genes that encode for a collagen repeat (including cuticular collagens and basement membrane collagens) (table S13), which is less than half the number of collagens found in the C. elegans genome (~180). It also encodes enzymes important for cuticular collagen processing such as blisterase-like proteases, protease inhibitors, tyrosinases, mixed-function oxiVOL 317 SCIENCE
    
    dases, and peptidyl-prolyl isomerase (table S1). (iv) Neuronal signaling: Seven putative biogenic amine heterotrimeric guanosine 5′-triphosphate– binding protein (G protein)–coupled receptors, 44 Cys-loop receptors, and 36 genes encoding potassium channels (table S14) were identified in B. malayi, a number of which are orthologs of C. elegans genes that can be mutated to give paralytic or uncoordinated phenotypes. (v) The B. malayi kinome: The B. malayi genome encodes ~205 conventional and ~10 atypical protein kinases (Table 2), of which 142 appear to be of fundamental importance based on the severity of their RNAi phenotypes in C. elegans (table S15). (vi) Reliance on host and endosymbiont metabolism: As 9 of 10 enzymes required for de novo purine synthesis, 6 of 7 genes required for heme biosynthesis, and all 5 enzymes required for de novo riboflavin biosynthesis are absent from the B. malayi genome, the worm may be forced to meet requirements for these key metabolic factors by active uptake of host-supplied molecules (16) or through reliance on wBm, which has complete purine, heme, and riboflavin synthesis pathways (17).
    
    1758
    
    21 SEPTEMBER 2007
    
    www.sciencemag.org
    
    Downloaded from www.sciencemag.org on September 25, 2007
    
    REPORTS
    A
    (abundant larval transcript) family of proteins have been implicated as virulence factors through their ability to modulate macrophage function (25). The B. malayi genome sequence revealed an unexpected diversity of 13 ALT genes (table S17), most of which are expressed by adult parasites. Note that the ALTs represent one of the few gene families that are expanded in B. malayi but not in C. elegans (which has only one member). The innate immune systems encoded in the B. malayi genome have a complexity comparable to those of C. elegans (26) and include thioester proteins, scavenger receptors, C-type lectins, and galectins. However, both nematodes lack the peptidoglycan-recognition and lipopolysaccharide-binding proteins found in arthropods. Although there are orthologs of some components of the DAF-2, TGFb, and p38 mitogen-activated protein (MAP) kinase signaling cascades in B. malayi and C. elegans, there is no evidence for the nuclear factor kB and Dif pathways. None of the small antibacterial peptides described from C. elegans and Ascaris suum (27) were identified in B. malayi, which suggests that the parasite might have a unique set of small peptide effectors or may lack this effector arm altogether. B. malayi gene products implicated in defense against and interaction with mammalian and insect immune systems were found, including seven genes encoding antioxidants deployed at the cuticle surface, where they may protect against oxyradical attack (28) (table S17). Four representative organisms involved in the maintenance of Brugian filariasis have now been sequenced: the nematode parasite; its Wolbachia endosymbiont, wBm; mosquito vectors (Aedes and Anopheles); and the human host. Together these present opportunities for a systems-based approach to understanding the molecular basis of parasitism and for identification of targets for intervention. In addition, defining the molecular mechanisms that allow filarial worms to persist for decades in an immunologically competent host may yield new strategies for the control of autoimmunity and the management of transplanted tissues. The differences in genome content and organization between Caenorhabditis and B. malayi underscore the importance of obtaining additional genome data from representative species from across the diversity of the Nematoda (29). The ability to carry out large-scale comparative genomics within Nematoda will be key in defining molecules and pathways unique to nematode development and parasitism that can serve as the targets for the next generation of antinematode drugs and vaccines.
    References and Notes
    1. E. Ghedin, S. Wang, J. M. Foster, B. E. Slatko, Trends Parasitol. 20, 151 (2004). 2. Y. Sakaguchi, I. Tada, L. R. Ash, Y. Aoki, J. Parasitol. 69, 1090 (1983). 3. B. K. Sim, J. Shah, D. F. Wirth, W. F. Piessens, Ciba Found. Symp. 127, 107 (1987).
    SCAFFOLD 14719 CONTIG 1381139
    
    Bm1_24730 E02H1.3
    
    Bm1_24725 R02D3.5
    
    Bm1_24720 E02H1.5
    
    10000 Bm1_24715 Bm1_24710 E02H1.6 E02H1.8
    
    5000
    
    CEOP2436
    5000
    
    CEOP2436
    
    Effective RNAi by soaking worms in doublestranded RNA (dsRNA) has been demonstrated in B. malayi adults (18). We therefore expected to find components of the RNAi pathway in the genome (table S16). However, some genes necessary for systemic RNAi in C. elegans appear to be absent from B. malayi, including sid-1, a membrane channel that transfers dsRNA molecules from a source cell to neighboring cells (19); sid-2; sid-3; and rsd-6. The presence of a putative drsh-1 ortholog suggests that B. malayi is also capable of microRNA processing. The effectiveness of RNAi in B. malayi implies either that these genes are rapidly evolving or are not required in B. malayi or that alternate pathways for siRNA transfer exist. Improvement of RNAi protocols for filarial nematodes would offer an attractive testing platform for verifying candidate drug targets. Mapping B. malayi genes onto the C. elegans protein-protein interaction network (20) reveals an interesting pattern of evolutionarily conserved relations within the context of interconnected functional modules (figs. S7 and S8). Of 957 B. malayi genes that could be mapped, only 30 were found to be nematode-specific (supporting
    
    online text), revealing the overall conserved nature of the protein interaction network (21). Given the low level of sequence similarity between the two nematodes, the identification of conserved functional modules indicates that results from investigations of these complexes within C. elegans may be effectively translated to B. malayi. B. malayi interacts with two hosts during its life cycle and is thought to have evolved mechanisms to suppress, subvert, or exploit host defense systems (22). Comparison of sequences of predicted proteins of B. malayi to that of interleukins, chemokines, and other signaling molecules from humans identified intriguing candidates including two genes encoding members of the macrophage migration inhibition (MIF) family of signaling molecules (23), transforming growth factor beta (TGFb) homologs (including Bm-tgh-1) (24), and a member of the PDZ domain/interleukin 16 family (table S17). These proteins may be immune modulators that promote parasite survival or growth and differentiation factors important in parasite development. In addition, members of the ALT SCIENCE VOL 317
    
    www.sciencemag.org
    
    21 SEPTEMBER 2007
    
    1759
    
    Downloaded from www.sciencemag.org on September 25, 2007
    
    Fig. 2. Operon diversity in the genome of Brugia malayi. The cartoon shows the structure of four B putative B. malayi operons (scale Bm1_51345 Bm1_51350 is shown in base pairs). Gene transcriptional orientations are F53F4.11 F53F4.14 shown with colored arrows, and the C. elegans chromosome on ~3 kb CEOP5396 CEOP5392 which the ortholog is contained is indicated (II, green; IV, dark purple; V, light purple, or none, grey), along with the peptide C identification number (table S2). C. elegans orthologs are aligned Bm1_39535 Bm1_39540 Bm1_39545 below the B. malayi contigs, and their gene names and operon Y55F3AM.10 Y55F3AM.13 Y55F3AM.1 identifiers are shown. The angled double bars show gaps in the ~25 kb ~600bp C. elegans genomic DNA; the relative size of the gap is indicated. (A) Four predicted B. malayi genes, annotated as “hypothetical,” have D orthologs in a C. elegans operon. Bm1_40305 Bm1_40310 The C. elegans ortholog of the F42C5.9 second gene is found on a different chromosome; the ortholog of E02H1.4 has not been identified within the B. malayi genome sequence. (B) In C. elegans, the orthologs of these genes are found in operons adjacent to each other but in opposing transcriptional orientations. (C) The C. elegans orthologs of genes found in this operon are relatively close to each other but are not in an operon. (D) The C. elegans ortholog of the first gene is not in the operon while the downstream gene (independently verified GenBank accession CAD22104) has no identifiable C. elegans ortholog.
    SCAFFOLD 14992 CONTIG 1384561 SCAFFOLD 14972 CONTIG 1384551
    
    5000
    
    SCAFFOLD 14972 CONTIG 1384551
    
    5000
    
    REPORTS
    Table 2. The kinomes of the nematodes B. malayi, C. elegans, and C. briggsae in comparison with that of Homo sapiens. Eukaryotic protein kinases are mainly defined on the basis of sequence similarity of their catalytic domains, plus knowledge of accessory domains and any known modes of regulation. Conventional protein kinases (EPKs) include AGC [adenosine 3′,5′-monophosphate–dependent protein kinase/protein kinase G/protein kinase C] kinases regulated by cyclic-nucleotide and calcium-phospholipid binding; casein kinase 1 and close relatives (CK1); calmodulin-dependent kinases (CaMK), cyclin-dependent kinases (CDK)/
    Protein kinases EPKs AGC CAMK CK1 CMGC RGC STE TK TKL Total APKs PIKK Alpha PDHK RIO Total
    
    mitogen-activated protein kinases/glycogen synthase kinases/CDK-like kinases (CMGC); receptor guanylate cyclase (RGC); a group including many kinases functioning in MAP kinase cascades (STE); tyrosine kinases (TK); tyrosine kinase-like kinases (TKL). Atypical protein kinases (APKs) include Alpha (exemplified by myosin heavy chain kinase of Dictyostelium discoideum); phosphatidylinositol 3-kinase–related kinases (PIKK); pyruvate dehydrogenase kinases (PHDK); “right open reading frame” (RIO) [named as such as it was one of two adjacent genes that were found to be transcribed divergently from the same intergenic region].
    Kinases shared by all 3 nematodes 19 23 13 28 4 16 21 9 133 4 1 1 3 9
    
    Organism H. sapiens 84 98 12 70 5 61 93 55 478 6 6 5 3 20 C. elegans 35 63 91 56 27 35 96 22 425 5 1 1 3 10 C. briggsae 46 69 77 60 24 27 73 21 397 4 1 1 3 9 B. malayi 22 41 31 33 4 27 35 12 205 5 1 1 3 10
    
    4. L. A. McReynolds, S. M. DeSimone, S. A. Williams, Proc. Natl. Acad. Sci. U.S.A. 83, 797 (1986). 5. Materials and methods are available as supporting material on Science Online. 6. L. D. Stein et al., PLoS Biol. 1, E45 (2003). 7. C. Whitton, J. Daub, M. Thompson, M. Blaxter, Methods Mol. Biol. 270, 75 (2004). 8. J. R. Vanfleteren et al., Mol. Phylogenet. Evol. 3, 92 (1994). 9. D. B. Guiliano et al., Genome Biol. 3, RESEARCH0057.1 (2002). 10. K. Z. Lee, A. Eizinger, R. Nandakumar, S. C. Schuster, R. J. Sommer, Nucleic Acids Res. 31, 2553 (2003). 11. T. Blumenthal, Brief. Funct. Genomic. Proteomic. 3, 199 (2004). 12. D. B. Guiliano, M. L. Blaxter, PLOS Genetics 2, e198 (2006). 13. K. Awadzi et al., Ann. Trop. Med. Parasitol. 98, 231 (2004). 14. A. E. Schwab, D. A. Boakye, D. Kyelem, R. K. Prichard, Am. J. Trop. Med. Hyg. 73, 234 (2005). 15. A. R. Frand, S. Russel, G. Ruvkun, PLoS Biol. 3, e312 (2005). 16. S. N. Chen, R. E. Howells, Exp. Parasitol. 51, 296 (1981). 17. J. Foster et al., PLoS Biol. 3, e121 (2005). 18. A. A. Aboobaker, M. L. Blaxter, Mol. Biochem. Parasitol. 129, 41 (2003). 19. W. M. Winston, C. Molodowitch, C. P. Hunter, Science 295, 2456 (2002). 20. S. Li et al., Science 303, 540 (2004). 21. R. Sharan et al., Proc. Natl. Acad. Sci. U.S.A. 102, 1974 (2005). 22. J. Murray, W. F. Gregory, N. Gomez-Escobar, A. K. Atmadja, R. M. Maizels, Mol. Biochem. Parasitol. 118, 89 (2001). 23. D. V. Pastrana et al., Infect. Immun. 66, 5955 (1998). 24. N. Gomez-Escobar, E. Lewis, R. M. Maizels, Exp. Parasitol. 88, 200 (1998). 25. N. Gomez-Escobar et al., BMC Biol. 3, 8 (2005). 26. S. Alper, S. J. McBride, B. Lackford, J. H. Freedman, D. A. Schwartz, Mol. Cell. Biol. 27, 5544 (2007). 27. Y. Kato, S. Komatsu, J. Biol. Chem. 271, 30493 (1996). 28. X. Ou, L. Tang, M. McCrossan, K. Henkle-Duhrsen, M. E. Selkirk, Exp. Parasitol. 80, 515 (1995). 29. M. L. Blaxter, Adv. Parasitol. 54, 101 (2003). 30. Funding for this project was provided by a grant from
    
    the National Institute for Allergy and Infectious Diseases, NIH (NIAID/NIH U01-AI50903) awarded to E.G. and A.L.S. We would like to acknowledge our colleagues in the Filarial Genome Consortium and the filarial research community for their continued support and encouragement. The Filarial Genome Consortium was initiated by grants from the United Nations Special Programme for Research and Training in Tropical Diseases (TDR), which is cosponsored by the U.N. Children’s Fund (UNICEF), U.N. Development Programme (UNDP), World Bank, and World Health Organization (WHO) (T23/79/152 to A.L.S.; T23/79/153 to B.S.; and T23/79/157 to S.A.W.). This whole-genome shotgun project has been deposited at the DNA Databank of Japan (DDBJ), European Molecular Biology Laboratory (EMBL), and GenBank under the
    
    project accession AAQA00000000. The version described in this paper is the first version AAQA01000000. The data are also available in WormBase release WS175.
    
    Supporting Online Material
    www.sciencemag.org/cgi/content/full/317/5845/1756/DC1 Materials and Methods SOM Text Figs. S1 to S8 Tables S1 to S17 References 8 May 2007; accepted 14 August 2007 10.1126/science.1145406
    
    UHRF1 Plays a Role in Maintaining DNA Methylation in Mammalian Cells
    Magnolia Bostick,1* Jong Kyong Kim,2* Pierre-Olivier Estève ,2 Amander Clark,1 Sriharsa Pradhan,2† Steven E. Jacobsen1,3† Epigenetic inheritance in mammals relies in part on robust propagation of DNA methylation patterns throughout development. We show that the protein UHRF1 (ubiquitin-like, containing PHD and RING finger domains 1), also known as NP95 in mouse and ICBP90 in human, is required for maintaining DNA methylation. UHRF1 colocalizes with the maintenance DNA methyltransferase protein DNMT1 throughout S phase. UHRF1 appears to tether DNMT1 to chromatin through its direct interaction with DNMT1. Furthermore UHRF1 contains a methyl DNA binding domain, the SRA (SET and RING associated) domain, that shows strong preferential binding to hemimethylated CG sites, the physiological substrate for DNMT1. These data suggest that UHRF1 may help recruit DNMT1 to hemimethylated DNA to facilitate faithful maintenance of DNA methylation.
    
    C
    
    ytosine methylation is an epigenetic mark used for the silencing of transposable elements and for the regulation of VOL 317 SCIENCE
    
    development (1, 2). Once established, DNA methylation is often stable through mitosis, in part because CG methylation is faithfully main-
    
    1760
    
    21 SEPTEMBER 2007
    
    www.sciencemag.org
    
    Downloaded from www.sciencemag.org on September 25, 2007

Readers

Recent searches finding this paper
sexvol via Bing
 

Academia © 2009