1 Feb 15:47
Re: How to get from gi/ref/gb to genomic coordinates ?
Cui, Wenwu (NIH/NLM/NCBI) [C] <cuiw <at> ncbi.nlm.nih.gov>
2007-02-01 14:47:38 GMT
2007-02-01 14:47:38 GMT
This is a simple test from gene ID 3632373 (protein is 46100068) to
contig coordinates:
perl -MLWP::Simple -e 'map {print $_, "\n" if
/<(Gene-source_src.*?>)(.*)?<$1/} (split "\n",
get(q{http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi?db=gene&i
d=3632373&retmode=xml}))'
You need to translate protein id to gene id though.
If the genome is available at Map Viewer, try (the contig name is
NW_101115 from last step)
http://www.ncbi.nlm.nih.gov/mapview/maps.cgi?taxid=5270&gnl=NW_101115&MA
PS=genes&cmd=txt
Wenwu Cui, PhD
-----Original Message-----
From: Rainer Machne [mailto:raim <at> tbi.univie.ac.at]
Sent: Wednesday, January 31, 2007 4:10 PM
To: bioperl-l <at> lists.open-bio.org
Subject: [Bioperl-l] How to get from gi/ref/gb to genomic coordinates ?
Dear Bioperl list,
hoping not be on the wrong email list, i would have a short question:
Is there a standard way or are there nice (Bioperl) tools to come from a
gene id (gi) other ids (see below) to the genomic coordinates of the
(Continue reading)
Thank you very much!
Rainer
Barry Moore wrote:
> Rainer,
>
> We use a perl library called CGL written by Mark Yandell and colleagues
> (which in turn uses Chris Mungal's BioChaos and Unflattener.pm referred
> to by Jason) for this type of task. The basic pipeline is convert
> GenBank files to Chaos XML, then use CGL with those XML files to get a
> nice object oriented access to exons, transcripts, proteins,
> coordinates and more for of those genes. I am currently using this
> with good success on most GenBank genomes (unfortunately I haven't been
> working with the fungal genomes, but it should work fine). The Ensembl
> API provides similar functionality for Ensembl genomes - but not very
> many fungi there.
RSS Feed