1 Jan 21:17
Re: acquiring a local refseq + index
Erik <er <at> xs4all.nl>
2007-01-01 20:17:42 GMT
2007-01-01 20:17:42 GMT
> Agree with Hilmar, in that we need examples. Another problematic one was NC_004822 - however, most other problems I referred to were ones that did *not* stop de DBD indexing. (Of course, I do not know how many error-throwing entries there still are in the files that are not yet indexed: ca 75%). The most common error was pollution of the 'binomial' name with classification lines as a result of faulty parsing. If Bio::Species is deprecated (I hadn't noticed that before) then these problems are of course correspondingly less important. >> If you are referring to your submitted bug: > http://bugzilla.open-bio.org/show_bug.cgi?id=2167 Yes, the above was one that stopped refseq indexing (there is one more that I will stick into bugzilla in a minute). Thanks for the commit. > > we could add this in as long as it passes (I'll try giving it a > workout with my local bacterial seqs tonight or tomorrow). However, > in the not-too-distant future your patch would likely be rendered > obsolete, as any parsing in Bio::SeqIO modules pertaining to > Bio::Species-related matters will be deprecated in favor of simple > parsing (more foolproof, less uncertainty) and Bio::Taxon (which has > optional db lookups using NCBI Taxonomy). Bio::Species and anything > related to it are considered marked for deprecation. Fair warning... What does simple parsing mean? Just returning the whole ORGANISM string, and leaving further parsing to application side?(Continue reading)
RSS Feed