Data Release Policy for the C. neoformans Genome Project

(UPDATE as of February 2003)

Our funding agency, the NIAID, has recently (November 18, 2002) reiterated the most essential point of our policy: "For genomic data, details should be provided in the plan to assure that sequence data release and usage will support the principle that high quality, primary genomic sequencing data should be rapidly and freely released to the broad scientific community without restriction prior to publication and reconcile the interests of the scientific community for early data release and those of the sequencing centers. Plan must recognize the widely accepted ethic in the scientific community that those who generate the primary genomic data and preliminary genome-wide data analysis should have priority to publish the work in a peer-reviewed journal in a timely manner" (emphasis added; OMB No. 0990-0115). Please remember that a "timely manner" includes time for finishing.

We ask that users of our unpublished sequence data respect our (SGTC and TIGR) intention to publish the complete, accurate, and annotated sequence of the C. neoformans genome, along with our large-scale interpretation of that genome sequence, as soon as finishing and annotation are completed. In this context, "large-scale" refers to regions the size of the whole genome or individual chromosome arms. Examples of large-scale interpretations include (but are not limited to) identification of regions of evolutionary conservation across the genome and/or individual chromosomes, and identification of complete sets of genomic features such as genes, gene families, biochemical and metabolic pathways, repeat structures, and G+C content. To avoid any possible misunderstanding, please email Dr. Richard Hyman before undertaking any such analysis, because there will be serious consequences for any violators of our data release policy (see Hyman letter to the editor of Science). With this single exception, the pre-publication data are available for all other scientific uses (e.g., array design) as follows:

  1. Publications of results should refer to the specific version or date of the data release and include the following citation: "C. neoformans Genome Project, Stanford Genome Technology Center, funded by the NIAID/NIH under cooperative agreement AI47087, and The Institute for Genomic Research, funded by the NIAID/NIH under cooperative agreement U01 AI48594."

  2. Users are free to download the C. neoformans genome sequence for their own use and that of others within their research environment. This data release policy must be displayed to all users of the downloaded data.

  3. We explicitly request that users not serve our C. neoformans genome sequence data to external users. For an exception to our request, you must receive explicit written permission from Dr. Richard Hyman.

By proceeding into the C. neoformans genome sequence data, you are acknowledging that you have read, understood, and will abide by the C. neoformans Genome Project data release policy.