How many genes does arabidopsis have




















Two-dimensional RFLP analyses reveal megabase-sized clusters of rRNA gene variants in Arabidopsis thaliana , suggesting local spreading of variants as the mode for gene homogenization during concerted evolution. Fransz, P. Cytogenetics for the model system Arabidopsis thaliana. Richards, E. Isolation of a higher eukarotic telomere from Arabidopsis thaliana.

Cell 53 , — Round, E. Arabidopsis thaliana centromere regions: genetic map positions and repetitive DNA structure. The complete sequence of a heterochromatic island from a higher eukaryote. Integrated cytogenetic map of chromosome arm 4S of A. Paulsen, I. Jr Microbial genome analyses: comparative transport capabilities in eighteen prokaryotes.

Jr Unified inventory of established and putative transporters encoded within the complete genome of Saccharomyces cerevisiae. FEBS Lett. Hirsch, R. D, Spalding, E. A role for the AKT1 potassium channel in plant nutrition. Slayman, C. Depolarization of the plasma membrane of Neurospora during active transport of glucose: evidence for a proton-dependent cotransport system.

USA 71 , — Ryan, C. Systemin: a polypeptide signal for plant defensive genes. Eisen, J. A phylogenomic study of DNA repair genes, proteins, and processes.

Selby, C. Structure and function of transcription-repair coupling factor. Structural domains and binding properties. Britt, A. Molecular genetics of DNA repair in higher plants. Dangl, M. Response to Aravind, L. Second Family of Histone Deacetylases.

Science , Cao, X. Conserved plant genes with similarity to mammalian de novo DNA methyltransferases. Henikoff, S. A DNA methyltransferase homologue with a chromodomain exists in multiple polymorphic forms in Arabidopsis. Hung, M. Drosophila proteins related to vertebrate DNA 5-cytosine methyltransferases.

Proc Natl Acad. Dalmay, T. An RNA-dependent-RNA polymerase in Arabidopsis is required for post transcriptional gene silencing mediated by a transgene but not by a virus—the truth. Riechmann, J. A genomic perspective on plant transcription factors. Plant Biol. Liljegren, S. Pelaz, S. Canaday, J.

Higher plant cells: gamma-tubulin and microtubule nucleation in the absence of centrosomes. Bassham, D. Unique features of the plant vacuolar sorting machinery. Cell Biol. Zheng, Z. The Rrop GTPase switch turns on polar growth in pollen.

Triggering the cell cycle in plants. Trends Cell Biol. Heese, M. Cytokinesis in flowering plants: cellular process and developmental integration. Plants, animals, and the logic of development. Wang, D. Divergence time estimates for the early history of animal phyla and the origin of plants, animals and fungi. B Bio. Torii, K. Receptor kinase activation and signal transduction in plants: an emerging picture. Yeh, K. USA 95 , — McCarty, D. Conservation and innovation in plant signaling pathways.

Weiss, C. USA 91 , — Bowler, C. Cyclic GMP and calcium mediate phytochrome phototransduction. Cell 77 , 73—81 Stepanova, A. Ethylene signaling: from mutants to molecules. Urao, T. Two-component systems in plant signal transduction. Makino, S. Genes encoding pseudo-response regulators: Insight into His-to-Asp phosphorelay and circadian rhythm in Arabidopsis thaliana. Plant Cell Physiol. D'Agostino, I. Phosphorelay signal transduction: the emerging family of plant response regulators.

Trends Biol. Strayer, C. Cloning of the Arabidopsis clock gene TOC1, an autoregulatory response regulator homologue. Stahl, E. Plant-Pathogen arms races at the molecular level. McDowell, J. Signal transduction in the plant innate immune response. Trends Biochem. Van der Biezen, E. Plant disease-resistance proteins and the gene-for-gene concept. Trends Biochem Sci. Belvin, M. A conserved signaling pathway: the Drosophila toll-dorsal pathway.

Uren, A. Identification of paracaspases and metacaspases: Two ancient families of caspase-like proteins, one of which plays a key role in MALT lymphoma. Cell 6 , — Fankhauser, C. Light control of plant development.

Briggs, W. Blue-light photoreceptors in higher plants. Christie, J. LOV light, oxygen, or voltage domains of the blue-light photoreceptor phototropin nph1 : binding sites for the chromophore flavin mononucleotide. Golbeck, J. Structure and function of photosystem I.

Maier, R. Complete sequence of the maize chloroplast genome: gene content, hotspots of divergence and fine tuning of genetic information by transcript editing. Buchanan, B. Mekhedov, S. Toward a functional catalog of the plant genome. A survey of genes for lipid biosynthesis. Somerville, C. Photorespiration deficient mutants of Arabidopsis thaliana lacking mitochrondrial serine transhydroxymethylase activity. Richmond, T. The cellulose synthase superfamily.

Plant Physiol , — Carpita, N. Vergara C: A recipe for cellulose. De Vries, H. Sur la loi de disjonction des hybrides. Paris , — Alonso-Blanco, C. Naturally occurring variation in Arabidopsis : an underexploited resource for plant genetics. Chory, J. Functional genomics and the virtual plant. A blueprint for understanding how plants are built and how to improve them.

Plant Physiology , — Burge, C. Prediction of complete gene structures in human genomic DNA. Lukashin, A. Uberbacher, E. Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach.

USA 88 , — Salzberg, S. Interpolated Markov models for eukaryotic gene finding. Genomics 59 , 24 —31 Hebsgaard, S. Splice site prediction in Arabidopsis thaliana DNA by combining local and global sequence information. Brendel, V. Prediction of locally optimal splice sites in plant pre-mRNA with applications to gene identification in Arabidopsis thaliana genomic DNA.

Quackenbush, J. The TIGR gene indices: reconstruction and representation of expressed gene sequences. Huang, X. A tool for analyzing and annotating genomic sequences. Genomics 46 , 37—45 Altschul, S. Basic local alignment search tool. Morgenstern, B. Murzin, A. SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Emanuelsson, O. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. Download references. Umwelt u. The authors wish to thank E. Magnien, D. Nasser and J. Watson for their continual support and encouragement. Reprints and Permissions. The Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Download citation. Received : 20 October Accepted : 15 November Issue Date : 14 December Anyone you share the following link with will be able to read this content:.

Sorry, a shareable link is not currently available for this article. Provided by the Springer Nature SharedIt content-sharing initiative. Genome Biology Algorithms for Molecular Biology BMC Plant Biology By submitting a comment you agree to abide by our Terms and Community Guidelines.

If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate. Advanced search. Skip to main content Thank you for visiting nature. Download PDF. Abstract The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions.

You have full access to this article via your institution. Main The plant and animal kingdoms evolved independently from unicellular eukaryotes and represent highly contrasting life forms. The Arabidopsis Genome Initiative Three groups contributed to the work reported here. European Chromosome 3 Sequencing Consortium M. Rounsley, D. Bush, S.

Subramaniam, I. Levin, S. Comparative analysis of the genomes of A. Schmidt, A. Acarkan, I. Integration of the three genomes in the plant cell: the extent of protein and nucleic acid traffic between nucleus, plastids and mitochondria F. Quetier, A. Brennicke, J. Transposable elements T. Bureau, B. Legault, Q. Le, N. Agrawal, Z. Yu, R. Copenhaver, S. Luo, C.

Pikaard, D. Membrane transport I. Paulsen, M. DNA repair and recombination A. Britt, J. Gene regulation D. Selinger, R. Pandey, D. Mount, V. Chandler, R. Jorgensen, C. Cellular organization G. Development E. Signal transduction J. Ecker, A. Recognition of and response to pathogens J. Dangl, J. Photomorphogenesis and photosynthesis M. Chen, J. Metabolism C. Overview of sequencing strategy We used large-insert bacterial artificial chromosome BAC , phage P1 and transformation-competent artificial chromosome TAC libraries 9 , 10 , 11 , 12 as the primary substrates for sequencing.

Table 1 Summary statistics of the Arabidopsis genome Full size table. In the main, this is due to the increased number of EST sequences deposited in GenBank, particularly those originating from high-throughput sequencing technologies 8.

While the total number of genes continues to rise with each release Table 1 new genes were added for the latest release , the most common updates are refinements to existing gene structures. For the TAIR7 release 10 gene structures were updated, of which gene models had updated coding exons. A total of 14 exons were modified and new exons were incorporated. There were 41 gene splits and 34 gene merges. Whereas the original Arabidopsis genome annotation focused solely on identifying protein-coding genes, subsequent re-annotations have added additional gene classes i.

However, certain gene classes may still be underrepresented. Not all such transcripts will be of functional significance but at least some are likely to represent unannotated protein-coding genes, ncRNAs or transposable elements. The problem of false-negative prediction is particularly serious for smaller CDSs where the decreasing signal-to-noise ratio makes distinguishing real genes from biologically meaningless open reading frames ORFs a more problematic issue In order to reduce the number of short false-positive gene predictions TIGR applied a minimum cut-off of amino acids 5.

Unannotated protein-coding genes are therefore likely to fall below this threshold. Some previously unannotated small genes including cysteine-rich peptides 12 , 13 have now been incorporated into the most recent genome releases TAIR6 and TAIR7 while others are currently being evaluated for potential inclusion in the forthcoming release In addition many unannotated transcripts excluded from earlier releases due to a lack of clear coding potential or assignment within one of the small RNA classes have now been incorporated.

One hundred and eighteen of these gene models overlap antisense to existing protein-coding genes and may represent natural antisense genes. While our latest annotation includes new splice variants, numerous sequences in GenBank that represent alternatively spliced transcripts with disrupted ORFs are not presently incorporated into TAIR gene structures.

Many of these transcripts contain retained introns or alternative splice sites that generate premature termination codons PTCs and heavily truncated peptides. It remains an open question if such transcripts encode functional peptides. Some will undoubtedly represent mistakes by the splicing machinery or cloning and sequencing errors, while others may have regulatory roles via the nonsense-mediated decay NMD pathway, a surveillance mechanism that selectively degrades nonsense mRNAs.

Coupling of regulated alternative splicing and NMD may therefore provide an alternative mechanism for regulating protein expression We plan to incorporate and categorize these and other unusual transcripts encoding fused proteins or dicistronic transcripts containing two complete ORFs in future releases. At the level of gene function, we continue to refine our annotations using information from the literature and community-submitted annotations, as well as sequence similarity and other computational methods.

Approximately genes currently have similarity only to uncharacterized proteins i. Future TAIR genome releases will contain corrections to the chromosome sequences based on newly available sequence data, improved annotation of endogenous transposable elements and pseudogenes, more complete annotation of splice variants including those lacking coding potential and utilization of genome comparisons to further refine the existing gene structures, particularly those with little or no transcript support.

We plan to produce new genome releases at least once per year, with the next release TAIR8 expected in early National Center for Biotechnology Information , U. Journal List Nucleic Acids Res v. Nucleic Acids Res. Published online Nov 5. Author information Article notes Copyright and License information Disclaimer. This article has been cited by other articles in PMC.

Open in a separate window. He wrote a more easily found one in Ann. Both go through some of the early history of the use of Arabidopsis in the laboratory, though the longer one has all the details. In Methods in Arabidopsis Research , eds C. These accessions are quite variable in terms of form and development e. Researchers around the world are using these differences in natural accessions to uncover the complex genetic interactions such as those underlying plant responses to environment and evolution of morphological traits.

While many collections of natural accessions may not meet a strict definition of an ecotype, they are commonly referred to as ecotypes in the scientific literature. PNG image of world wide distribution , from Jonothan Clarke. This figure was produced by Jonathan Clarke for his Ph. This map was based on, i. World map showing the geographical distribution longitude, latitude, elevation of more than 30 Arabidopsis ecotypes.

Follow the links to view distribution and other links. Arabidopsis Genome Initiative. Nucleic Acids Res.. Genome-wide patterns of genetic variation in worldwide Arabidopsis thaliana accessions from the RegMap panel. Common sequence polymorphisms shaping genetic diversity in Arabidopsis thaliana. Genome-wide association study of phenotypes in Arabidopsis thaliana inbred lines. Coding genes 27, Non coding genes 5, Small non coding genes 1, Long non coding genes 3, A transcript is the operational unit of a gene.

In a genomic context, transcripts consist of one or more exons, with adjoining exons being separated by introns. Get help Using this website Documentation Adding custom tracks Downloading data.



0コメント

  • 1000 / 1000