Assembly Release QA Steps: Difference between revisions
Line 22: | Line 22: | ||
When a developer is ready for a new assembly to be released, the QA team (usually an individual of) will QA and release the assembly. ย | When a developer is ready for a new assembly to be released, the QA team (usually an individual of) will QA and release the assembly. ย | ||
This wiki section exists as a guide for the assembly QA and release process. | This wiki section exists as a guide for the assembly QA and release process. | ||
===<span style="color:blue">Change happens=== | ===<span style="color:blue">Change happens=== |
Revision as of 16:35, 18 October 2016
Welcome to the Assembly Release: QA Guide ๐
Page created Fall. 2016 by Cath, Jairo, and ChrisV.
This page is currently a draft in progress.
For now, use Releasing_an_assembly instead.
Introduction
When a developer is ready for a new assembly to be released, the QA team (usually an individual of) will QA and release the assembly. This wiki section exists as a guide for the assembly QA and release process.
Change happens
Collaboration rocks. Keep me updated!
For the UCSC Genome Browser QA Team, there are two types of genome assemblies:
- New species: Assembly for a species that is not already exisiting as a browser.
- New version for exisiting species: Assembly version for a species that already exists as a browser.
When a new or updated assembly is ready to QA, the QA team should perform the following steps, outlined in this guide.
Do storks bring new assemblies in?
While genbank and refseq assemblies can be claimed to be 'identical' that just means they use the same sequence. The names for everything are different, aptMan1 has contig names of the format NW_013982187v1 which is a RefSeq identifier.
- RefSeq assemblies:
- use accession ID: GCF_000002315.4 (e.g., galGal5)
- are delivered with chrMt (if they exisit)
- are delivered with NCBI gene predictions
- Genbank assemblies:
- use accession ID: GCA_000001305.2
- delivered without a chrMt.
- do not have gene predictions.
For the UCSC Genome Browser, it is preferable to use RefSeq assemblies (in part due to 'more data'). This is a "learn as we go" direction; historically GeneBank was preferred.
๐ต Ready to get started? Let's go to Assembly QA Part 1: DEV Steps