Marco Roos | 5 Jun 2008 12:55
Picon
Picon
Favicon

WikiProteins and community annotation

Hello,

I have mentioned Barend Mons and WikiProteins a few times. His paper about it has recently come out:
'Calling on a million minds for community annotation in WikiProteins' (http://genomebiology.com/2008/9/5/R89)

Regards,
Marco.

-- Marco Roos Faculty of Science University of Amsterdam Kruislaan 403, room F1.02 1098 SJ Amsterdam tel. +31 (0) 20 525 7522 http://home.medewerker.uva.nl/m.roos1 (includes links to social networks) Note the change of e-mail address to M.Roos1-V/Xpf/srekw@public.gmane.org
_______________________________________________
Myexperiment-discuss mailing list
Myexperiment-discuss@...
http://lists.nongnu.org/mailman/listinfo/myexperiment-discuss
M. Scott Marshall | 10 Jun 2008 10:24
Picon
Picon
Favicon

Wrapping a Runner with a web service

Hi Marco,

As we were recently discussing, I think that 
myExperiment/Taverna/BioCatalogue would be interesting to the Pistoia 
project[1], which will assemble data standards, services, and ontologies 
for the Pharmaceutical Drug Discovery workflow. Note carefully that the 
participants of this project are the Heads of (Chem)IT for several not 
terribly small Pharma companies.

While formulating my ideas about this to pose to Pistoia members, I came 
up with a technical question and would like to know who can help me with 
the answer. The question is about plans for creating web service 
wrappers for workflow Runners on myExperiment. This is now within reach, 
isn't it? Or is this desire-induced delusion on my part? The 
ramifications are HUGE: composition of workflows via web service 
interfaces can become trivial and myExperiment could provide web (REST) 
access to executing them.

Best,
Scott

[1] http://sourceforge.net/projects/pistoia

P.S. Don't think that I can post this to the myGrid list. Is it 
appropriate?

--

-- 
M. Scott Marshall
http://staff.science.uva.nl/~marshall
http://adaptivedisclosure.org
Jiten Bhagat | 10 Jun 2008 12:48
Picon
Picon

Re: Wrapping a Runner with a web service

Hi Scott,

We've had a brief discussion on this at the myGrid meeting with the view 
that this may not be myExperiment specific. Could you also post this to 
the Taverna-hackers 
(http://lists.sourceforge.net/mailman/listinfo/taverna-hackers) mailing 
list (but feel free to keep myExperiment-discuss cc'ed).

Thanks,
Jits

M. Scott Marshall wrote:

> Hi Marco,
>
> As we were recently discussing, I think that 
> myExperiment/Taverna/BioCatalogue would be interesting to the Pistoia 
> project[1], which will assemble data standards, services, and 
> ontologies for the Pharmaceutical Drug Discovery workflow. Note 
> carefully that the participants of this project are the Heads of 
> (Chem)IT for several not terribly small Pharma companies.
>
> While formulating my ideas about this to pose to Pistoia members, I 
> came up with a technical question and would like to know who can help 
> me with the answer. The question is about plans for creating web 
> service wrappers for workflow Runners on myExperiment. This is now 
> within reach, isn't it? Or is this desire-induced delusion on my part? 
> The ramifications are HUGE: composition of workflows via web service 
> interfaces can become trivial and myExperiment could provide web 
> (REST) access to executing them.
>
> Best,
> Scott
>
> [1] http://sourceforge.net/projects/pistoia
>
> P.S. Don't think that I can post this to the myGrid list. Is it 
> appropriate?
>
M. Scott Marshall | 10 Jun 2008 15:00
Picon
Picon
Favicon

Wrapping a Runner with a web service

[I've been asked by Jits to post this to Taverna-hackers and keep 
myExperiment-discuss cc'ed.]

Hi Marco,

As we were recently discussing, I think that
myExperiment/Taverna/BioCatalogue would be interesting to the Pistoia
project[1], which will assemble data standards, services, and ontologies
for the Pharmaceutical Drug Discovery workflow. Note carefully that the
participants of this project are the Heads of (Chem)IT for several not
terribly small Pharma companies.

While formulating my ideas about this to pose to Pistoia members, I came
up with a technical question and would like to know who can help me with
the answer. The question is about plans for creating web service
wrappers for workflow Runners on myExperiment. This is now within reach,
isn't it? Or is this desire-induced delusion on my part? The
ramifications are HUGE: composition of workflows via web service
interfaces can become trivial and myExperiment could provide web (REST)
access to executing them.

Best,
Scott

[1] http://sourceforge.net/projects/pistoia

--

-- 
M. Scott Marshall
http://staff.science.uva.nl/~marshall
http://adaptivedisclosure.org
Tom Oinn | 10 Jun 2008 15:27
Favicon

Re: [Taverna-hackers] Wrapping a Runner with a web service

Hi Scott, all,

On the face of it a workflow is an entity that consumes some data, 
performs some processing and returns some results, having potentially 
also had a side effect on its execution environment. At this level it 
seems perfectly plausible that we can turn a workflow into a service.

The problems start to emerge when we match the full capabilities of the 
workflow system against the limitations of 'vanilla' web services.

1) Workflows can run for a long time. Potentially they can run for 
months; if you were to expose a workflow of that kind as a service you'd 
have to move beyond the synchronous invocation pattern that a 
conventional SOAP service supports.

	- there are existing ways to do this of course, we're not the only ones 
with long running services, but they immediately move you out of the 
'really simple service' realm. Whether you do this with e.g. WSRF or 
with a custom service interface (so your service interface is to the 
workflow *engine* not to the workflow itself) is another choice to make.

2) Workflows can consume and emit very large data. Web services have no 
consistant mechanism to handle this - there are approaches to do with 
attachments and there are ad hoc mechanisms such as returning URLs to 
results and accepting the same as inputs.

Both these issues may or may not be a problem in any given case, and 
there are certainly simple workflows (short running, small data) which 
could be wrapped up in a naive synchronous service interface. Exactly 
what values of 'short' and 'small' we use here is a gray area, but it's 
not going to give you much room to work in.

Another option is to create a hybrid system, where you force all data to 
be passed by reference (HTTP URLs are pretty good for this). Taverna can 
internally handle this, and we can even force the workflow to return 
results in that form so your service logic wouldn't have to do very 
much. This works around the data size issue (as URLs are 'small') but 
not the run time problems.

The run length issues could be mitigated by adopting the typical 
'submit, poll, get results, destroy' pattern. Of course, you'd have to 
have some way of identifying the workflow instance you wanted to 
interact with as part of the call (or in WS or HTTP headers).

There is, however, a final problem with 'really' making these things 
available as services. In T2 the 'workflow engine' is really a set of 
components such as security agents, reference management systems and the 
like. To run a workflow you need to assemble and mutually configure 
these components - in some cases there is an obvious trivial assembly 
which could be done implicitly, but in general you're looking at 
configuring this federation prior to workflow launch - this would 
immediately make the service taverna-specific.

There's nothing to say, of course, that these approaches are mutually 
exclusive. We could envisage a heirarchy of functionality where as you 
move up the heirarchy you get increased control (configuration, 
monitoring, support for long running workflows) at the cost of increased 
interface complexity. At the top of this heirarchy would be the full 
blown peer management service framework, at the bottom would be vanilla 
synchronous web services (with no exposure of the workflow system at 
all). Any given application would live somewhere between these two extremes.

Tom
James Howison | 15 Jun 2008 16:31
Gravatar

strategies for excluding username/password from logbook?

Howdy,

We are preparing some workflows which require access to password  
protected databases.  The data providers issue user/pass to  
researchers on request, mostly to monitor usage and load by account.   
We'd like the workflows to work against the databases, but when other  
researchers use them to have them provide their user/id.  However we'd  
also like to publish the logbook results for the workflows with  
publications so that others can see all the intermediates etc (and  
eventually use the ontology elements to discover data/point or  
variable usage etc).  Where ever possible we're trying to use standard  
components, like the SQL lookup component.

We'd like the user/pass not to show up in the results.

Currently I see three options:

user/pass as workflow inputs
user/pass as default values
user/pass as "ask for value" pop-up results

But all these show up in the logbook results.

I could wrap the components that need user/pass in a custom beanshell  
component and read them from the file system, but I think that limits  
the portability pretty badly.  Even if there was a user properties  
file that could be read from inside a component that would limit the  
ability to use the standard SQL component (which takes user/pass as  
input and therefore always records it).

Is there a standard/recommended strategy for this?  I searched the  
mailing list and myexperiment, and the strategy I saw most is to have  
user/pass as input variables, passed through to where they are needed;  
but that will record them in the workflow runs and make the runs less  
shareable.

Perhaps one could mark input/output ports as "to be obfuscated" for  
workflow/logbook records?  Of course one would want to avoid that in  
general, but user/pass (especially pass :) is a special case, I think.

Thanks,
James
Håkon Sagehaug | 16 Jun 2008 09:43
Picon
Picon

Problems installing myExperience

Hi

I wanted to test out myExperience on my own machien for a project and downloaded the source from svn, but when I wanted to build it I could not find sufficient information. In the file install.txt the 3'rd point

 copy vendor/plugins/sitealizer/lib/config.yml.pre to
      vendor/plugins/sitealizer/lib/config.yml

The folder sitealizer does not exist. And what commnad should i use to run/build it go.sh?? If I run go.sh I get the following error message

./script/../config/boot.rb:9:in `require': no such file to load -- rubygems (LoadError)
        from ./script/../config/boot.rb:9
        from ./script/server:2:in `require'
        from ./script/server:2


cheers, Håkon

--
Håkon Sagehaug, Software Developer
Parallab, Bergen Center for Computational Science (BCCS)
UNIFOB AS (University of Bergen Research Company)

_______________________________________________
Myexperiment-discuss mailing list
Myexperiment-discuss@...
http://lists.nongnu.org/mailman/listinfo/myexperiment-discuss
Linde, A.E. | 17 Jun 2008 13:41
Picon
Picon

THE ELSEVIER GRAND CHALLENGE

Anything myExperiment could offer?
 
Abstract deadline is July 15th.
 
t.
_______________________________________________
Myexperiment-discuss mailing list
Myexperiment-discuss@...
http://lists.nongnu.org/mailman/listinfo/myexperiment-discuss
Carole Goble | 17 Jun 2008 22:02
Picon
Favicon

Re: THE ELSEVIER GRAND CHALLENGE

Tony

thanks for the reminder

Carole

> Anything myExperiment could offer?
>   http://www.elseviergrandchallenge.com/
>  
> Abstract deadline is July 15th.
>  
> t.
> ------------------------------------------------------------------------
>
> _______________________________________________
> Myexperiment-discuss mailing list
> Myexperiment-discuss@...
> http://lists.nongnu.org/mailman/listinfo/myexperiment-discuss
>   
Stian Soiland-Reyes | 18 Jun 2008 10:31
Picon
Favicon
Gravatar

Re: strategies for excluding username/password from logbook?

On Sun, Jun 15, 2008 at 3:31 PM, James Howison <james@...> wrote:

> We'd like the user/pass not to show up in the results.
>
> Currently I see three options:
>
> user/pass as workflow inputs
> user/pass as default values
> user/pass as "ask for value" pop-up results
>
> But all these show up in the logbook results.

Hi!

Taverna 2 will come with support for a security agent, which would be
able to provide username/password (and similar authorisation
information) "on the side" of the workflow instead of exposing them as
values.

However, we're also redoing the logbook for Taverna 2, and the data
structures are most probably going to look a bit different - for
instance to properly handle iteration.

When would you need this for..?

--

-- 
Stian Soiland-Reyes, myGrid team
School of Computer Science
The University of Manchester
http://www.mygrid.org.uk/dev/blog/

Gmane