5 Feb 2008 14:36
[Bug 2443] New: Specifying the alphabet in Bio.SeqIO.parse()
<bugzilla-daemon <at> portal.open-bio.org>
2008-02-05 13:36:16 GMT
2008-02-05 13:36:16 GMT
http://bugzilla.open-bio.org/show_bug.cgi?id=2443 Summary: Specifying the alphabet in Bio.SeqIO.parse() Product: Biopython Version: 1.44 Platform: All OS/Version: All Status: NEW Severity: enhancement Priority: P2 Component: Main Distribution AssignedTo: biopython-dev <at> biopython.org ReportedBy: biopython-bugzilla <at> maubp.freeserve.co.uk Currently when reading sequences using Bio.SeqIO, unless the alphabet can be determined from the file format, all the records have a generic alphabet. This can be a handicap if later on you want to work with "strict" functions which check for a particular alphabet (e.g. a gapped alphabet when working with alignments), or perhaps the Bio.Translate module. For an example of this, see Dalloliogm's question on the SeqIO wiki talk page, http://biopython.org/wiki/Talk:SeqIO Currently the user may need to use a tedious work around to override the alphabet of each sequence, e.g. from Bio import SeqIO from Bio.Alphabet import generic_dna records = list(SeqIO.parse(open("data.txt"), "fasta"))(Continue reading)
RSS Feed