IMGT: Difference between revisions
From genomewiki
Jump to navigationJump to search
No edit summary |
No edit summary |
||
Line 1: | Line 1: | ||
* The main reference database for immune | * Basic problem: If you have an immune receptor sequence, e.g. antibody or T-Cell receptor, how do you call it? | ||
* Just like genomes: We need "reference" sequences and people that choose them | |||
* The main reference database for immune loci is IMGT.org | |||
* It consists of databases, tools and additional files (e.g. ontology, some notes, etc) | * It consists of databases, tools and additional files (e.g. ontology, some notes, etc) | ||
** LIGM-DB: Nucleotide sequences of all kinds. Can be C, V, J, D, combinations thereof. | ** LIGM-DB: Nucleotide sequences of all kinds. Can be C, V, J, D, or combinations thereof. | ||
*** Seems to be pulling in from Genbank and change their annotations slightly with different keywords | *** Seems to be pulling in from Genbank and change their annotations slightly with different keywords | ||
*** Automatic transformation | *** Automatic transformation or manual annotation? | ||
*** Looks like they could have a BLAST-based pipeline that annotates the different fragments | |||
*** Annotates Genbank sequences with the CVDJ-numbers | |||
*** Example: Genbank AF139844 | *** Example: Genbank AF139844 | ||
**** in IMGT: http://imgt.org/cgi-bin/IMGTlect.jv?query=201+AF139844 | **** in IMGT: http://imgt.org/cgi-bin/IMGTlect.jv?query=201+AF139844 | ||
Line 10: | Line 14: | ||
**** Annotations that they add: [http://imgt.org/download/LIGM-DB/ftable_doc.html Ftable] | **** Annotations that they add: [http://imgt.org/download/LIGM-DB/ftable_doc.html Ftable] | ||
** MHC-DB | ** MHC-DB | ||
*** Also pulls in from Genbank, | *** Also pulls in from Genbank, changes their identifiers | ||
*** E.g. [http://www.ebi.ac.uk/Tools/dbfetch/emblfetch?GQ240381 GQ240381] is annotated as a sequence of [http://srs.ebi.ac.uk/srsbin/cgi-bin/wgetz?-id+2ke121D6Z1q+-e+ | *** Maps Genbank sequences to MHC-"clusters" of sequences | ||
*** E.g. [http://www.ebi.ac.uk/Tools/dbfetch/emblfetch?GQ240381 GQ240381] is annotated as a sequence of HLA06329 [http://srs.ebi.ac.uk/srsbin/cgi-bin/wgetz?-id+2ke121D6Z1q+-e+IMGTHLA-ID:HLA06329] | |||
** GENE-DB: A list of categorized VDJC sequences and all their alleles | |||
***Naming is <locus><type><number>-<number>*allel | |||
*** e.g. [http://www.imgt.org/IMGT_GENE-DB/GENElect?query=2+IGHV1-18&species=Homo+sapiens IGHV1-18] | |||
** Repertoire: all loci with descriptions, e.g. human IGH, human IGK, etc. | |||
*** Example human IGH [http://www.imgt.org/textes/IMGTrepertoire/LocusGenes/chromosomes/human/IGH/Hu_IGHchrom14.html] |
Revision as of 02:12, 12 July 2011
- Basic problem: If you have an immune receptor sequence, e.g. antibody or T-Cell receptor, how do you call it?
- Just like genomes: We need "reference" sequences and people that choose them
- The main reference database for immune loci is IMGT.org
- It consists of databases, tools and additional files (e.g. ontology, some notes, etc)
- LIGM-DB: Nucleotide sequences of all kinds. Can be C, V, J, D, or combinations thereof.
- Seems to be pulling in from Genbank and change their annotations slightly with different keywords
- Automatic transformation or manual annotation?
- Looks like they could have a BLAST-based pipeline that annotates the different fragments
- Annotates Genbank sequences with the CVDJ-numbers
- Example: Genbank AF139844
- in IMGT: http://imgt.org/cgi-bin/IMGTlect.jv?query=201+AF139844
- on NCBI: http://www.ncbi.nlm.nih.gov/nuccore/4732152?report=graph
- Both look very similar, almost identical information
- Annotations that they add: Ftable
- MHC-DB
- GENE-DB: A list of categorized VDJC sequences and all their alleles
- Naming is <locus><type><number>-<number>*allel
- e.g. IGHV1-18
- Repertoire: all loci with descriptions, e.g. human IGH, human IGK, etc.
- Example human IGH [2]
- LIGM-DB: Nucleotide sequences of all kinds. Can be C, V, J, D, or combinations thereof.