Re: The unique constraint on the analysis table
Robin Houston <rh11 <at> sanger.ac.uk>
2008-11-13 15:28:21 GMT
I don't know how common our situation is, but our database has dozens
of different organisms and we won't typically run analyses across the
entire database at once. So we could run an analysis of, say,
Plasmodium against a particular database, and also an analysis of
Trypanosoma against the same database.
The current uniqueness constraint only kicks in if the sourcename is
specified. Because (NULL = NULL) is not true, you can have many rows
with the same program and programversion, provided that none of them
has a sourcename. Quite often at the moment we're loading data from
legacy files in which the source database is not specified, so this is
our commonest case at present and avoids the uniqueness problem, but
it's hard to believe this was the intention.
Is there any reason the uniqueness constraint should not simply be
dropped?
Robin
On 13 Nov 2008, at 15:00, Scott Cain wrote:
> Hi Robin,
>
> I haven't really thought about it, but perhaps the unique constraint
> should be on (program, programversion, sourcename, sourceversion) to
> allow for blast results on the same database obtained on different
> dates. Also, you could encode information about the blast run in the
> sourcename if you were going to do more than one blast against the
> same database using different parameters, like "nr_high_sensitivity"
> and "nr_low_sensitivity". I don't know if I really like that though.
>
> Any one else have thoughts (FlyBase, I'm looking at you all
>
> Scott
>
>
> On Thu, Nov 13, 2008 at 9:19 AM, Robin Houston <rh11 <at> sanger.ac.uk>
> wrote:
>> A row in the analysis table is supposed to represent a single run of
>> an algorithm (e.g. BLAST), as I understand it.
>>
>> If that's right, then why is there a UNIQUE constraint on (program,
>> programversion, sourcename)? It's certainly
>> possible to do more than one BLAST against the same database.
>>
>> Robin
>>
>>
>> --
>> The Wellcome Trust Sanger Institute is operated by Genome Research
>> Limited, a charity registered in England with number 1021457 and a
>> company registered in England with number 2742969, whose registered
>> office is 215 Euston Road, London, NW1 2BE.
>>
>> -------------------------------------------------------------------------
>> This SF.Net email is sponsored by the Moblin Your Move Developer's
>> challenge
>> Build the coolest Linux based applications with Moblin SDK & win
>> great prizes
>> Grand prize is a trip for two to an Open Source event anywhere in
>> the world
>> http://moblin-contest.org/redirect.php?banner_id=100&url=/
>> _______________________________________________
>> Gmod-schema mailing list
>> Gmod-schema <at> lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/gmod-schema
>>
>
>
>
> --
> ------------------------------------------------------------------------
> Scott Cain, Ph. D. scott at
> scottcain dot net
> GMOD Coordinator (http://gmod.org/) 216-392-3087
> Ontario Institute for Cancer Research
--
--
The Wellcome Trust Sanger Institute is operated by Genome Research
Limited, a charity registered in England with number 1021457 and a
company registered in England with number 2742969, whose registered
office is 215 Euston Road, London, NW1 2BE.
-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/