Lisa Dusseault | 2 Feb 19:45
Favicon

Lisa's Apps area Activity Report for Jan 2009


News, Updates
Quite a few BOFs getting requested for the next meeting, see http://trac.tools.ietf.org/bof/trac/wiki for more details on each:
 - OAUTH - charter revised recently on list
 - YAM, BOF on advancing core email standards
 - MMOX, BOF on Massive Multiplayer Online Interactions
 - XMPP, revising base specs and doing some transport and security stuff
 - Interest in other HTTP topics: Cookies, Origin header -- not sure if there will be a BOF around this

The BOF approval call is Feb 5 and scheduling of these BOFs will happen after that.


Document Status and Progress
Active documents, my action:

Active documents, waiting on other:
 - draft-reschke-webdav-post (Exp): Cyrus Daboo is doing shepherd review and writeup
 - drat-montemurro-gsma-imei-urn (Exp): Waiting on revision from authors
 - draft-ietf-sieve-mime-loop (PS): Waiting on authors to respond to GenArt review
 - draft-ietf-usefor-usepro (PS): Waiting for last WG issue to get resolved
 - draft-ietf-calsify-2446bis (PS): Waiting for a revision
 - draft-ietf-calsify-rfc2445bis (PS): Waiting for a revision
 - draft-snell-atompub-bidi (PS): Waiting for a revision
 - draft-wilde-sms-uri (PS): Waiting for a revision

Finished processing -- new in RFC Ed queue and new RFCs
 - draft-melnikov-sieve-imapext-metadata (PS): approved, announcement sent
 - draft-freed-sieve-ihave (PS): In RFC Ed queue
 - draft-ietf-sieve-managesieve (PS): In RFC Ed queue
 - draft-kucherawy-sender-auth-header (PS): In RFC Ed queue

WG Status

ALTO, HTTPBIS, IDNABIS and SIEVE are meeting in SF / IETF74

ALTO: mostly quiet 
CALSIFY: Working through open issues
HTTPBIS: Interesting issues and great discussion on "Origin" header or other fix/replacement to "Referer" header
IDNABIS: Great discussion on goals and tradeoffs e.g. simplicity and invariability of assignments
SIEVE: Published several documents and pushing more through
USEFOR: Prompted the WG last week to close their last one or two issues, to finish their last doc

--- Scanned by M+ Guardian Messaging Firewall --- Messaging Architects sponsors The Spamhaus Project.

_______________________________________________
Apps-Discuss mailing list
Apps-Discuss <at> ietf.org
https://www.ietf.org/mailman/listinfo/apps-discuss
Mark Nottingham | 10 Feb 13:06
Favicon
Gravatar

Fwd: New Version Notification for draft-nottingham-site-meta-01

FYI.

http://www.ietf.org/internet-drafts/draft-nottingham-site-meta-01.txt

Begin forwarded message:

> From: IETF I-D Submission Tool <idsubmission <at> ietf.org>
> Date: 10 February 2009 11:03:13 PM
> To: mnot <at> mnot.net
> Cc: eran <at> hueniverse.com
> Subject: New Version Notification for draft-nottingham-site-meta-01
>
>
> A new version of I-D, draft-nottingham-site-meta-01.txt has been  
> successfuly submitted by Mark Nottingham and posted to the IETF  
> repository.
>
> Filename:	 draft-nottingham-site-meta
> Revision:	 01
> Title:		 Host Metadata for the Web
> Creation_date:	 2009-02-10
> WG ID:		 Independent Submission
> Number_of_pages: 12
>
> Abstract:
> This memo describes a method for locating host-specific metadata for
> the Web.
>
>
>
> The IETF Secretariat.
>
>

--
Mark Nottingham     http://www.mnot.net/
Thomas Roessler | 10 Feb 14:38
Picon
Favicon

host-meta file format comments (draft-nottingham-site-meta-01)

Going through draft-nottingham-site-meta-01:
> 3. The host-meta File Format

> As with HTTP headers, field-names are not case-sensitive,  
> unrecognised field-names SHOULD be silently ignored when parsing  
> this format, and ordering of fields SHOULD NOT be considered  
> significant unless specified otherwise. Additionally, although the  
> syntax does not explicitly allow empty lines between fields, parsers  
> SHOULD silently discard them (i.e., be permissive in what they  
> accept). Field content is constrained by the specification indicated  
> by its associated field-name.

What's the cost of just permitting empty lines between fields in the  
sytnax vs having the current "SHOULD parse"?  The current text sounds  
like a gratuitous interop problem to me.
> 5. Minting New meta-fields

> Applications that wish to mint new meta-fields for use in the host-  
> meta format MUST register them in the host-meta field-registry,  
> following the procedures in Section 7.2. Field-names MUST conform to  
> the field-name ABNF Section 3, and field-value syntax MUST be well-  
> defined (e.g., using ABNF, or a reference to the syntax of an  
> existing header field-value). Field-values SHOULD use the ISO-859-1  
> character encoding. If a field-value applies to a scope other than  
> the entire authority, that scope MUST be well-defined.

Editorial nit: ISO-8859-1 is missing an 8 here.
More substantially, is there any particular reason to not just go with  
utf-8 here?  After all, the content type is *appplication*/host-meta  
anyway.

--
Thomas Roessler, W3C  <tlr <at> w3.org>
Thomas Roessler | 10 Feb 15:57
Picon
Favicon

Origin vs Authority; use of HTTPS (draft-nottingham-site-meta-01)

Reading draft-nottingham-site-meta-01...

> 4. Discovering host-meta Files

> The metadata for a given authority can be discovered by  
> dereferencing the path /host-meta on the same authority. For  
> example, for an HTTP URI [RFC2616], the following request would  
> obtain metadata for the authority "www.example.com:80";

Editorial nit: That semicolon wants to be a colon.

> GET /host-meta HTTP/1.1
> Host: www.example.com

It is somewhat unclear what the scope of the host-meta file is, or  
more precisely, how the URI for the host-meta file is derived from the  
URI of the resource that the metadata apply to.

Section 4 seems to suggest that the URI is maybe generated by  
dereferencing the relative URI reference /host-meta using the  
resource's URI as the base URI, but it doesn't say that clearly; the  
use of "authority" suggests that the choice of the protocol is  
actually up to the implementation.

 From the previous apps-discuss thread, it seems like the main use  
case for permitting metadata to leak across schemes (and therefore,  
typically ports -- though ports and schemes are strictly speaking  
orthogonal) lies with URI schemes that do not have a resource  
retrieval operation readily available, e.g., mailto.

On the other hand, I'm extremely wary about anything near HTTP that  
might tear down origin boundaries without a great deal of care.  E.g.,  
a purely authority-based approach might permit metadata to leak from  
the HTTP part of a site (where no integrity protection is given) into  
its HTTPS part (where integrity protection and authenticity of data is  
deemed important), possibly permitting attacks against web  
applications that are ostensibly protected -- as is alluded to in the  
security considerations.

The obvious solution to that part of the puzzle is to let the  
mechanism default to the same URI scheme, unless there is a specific  
convention to the contrary.  That should cover any URI schemes for  
which a safe retrieval operation is defined (HTTP, HTTPS, FTP come to  
mind).

For other URI schemes, one could either punt on this issue completely,  
define a default fall-back to HTTP (or HTTPS, depending on which of  
the two better matches the security properties of the protocol in  
question), or actually say explicitly what's the correct scheme.

Thoughts?

--
Thomas Roessler, W3C  <tlr <at> w3.org>
Thomas Roessler | 11 Feb 01:48
Picon
Favicon

Re: Origin vs Authority; use of HTTPS (draft-nottingham-site-meta-01)

On 11 Feb 2009, at 01:31, Mark Nottingham wrote:

> Gentle reminder; the draft asks for discussion on www-talk. Sending  
> followups there (I should have mentioned this in the announcement,  
> sorry)...

(and I should read instructions.... apologies)

>> The obvious solution to that part of the puzzle is to let the  
>> mechanism default to the same URI scheme, unless there is a  
>> specific convention to the contrary.  That should cover any URI  
>> schemes for which a safe retrieval operation is defined (HTTP,  
>> HTTPS, FTP come to mind).
>
> I'm happy to clarify this by either adding scheme/protocol to the  
> (host, port) tuple (although we'll probably have to come up with a  
> different term than "authority"; PLEASE don't say "endpoint" ;),  
> which will affect both the default scoping of application as well as  
> the discovery mechanism, or just limiting it to discovery.

I'd use the (scheme, host, port) triple to identify the endpoints that  
we're dealing with here, both for scope and discovery. Adam Barth's  
draft-abarth-origin gives a canonicalization procedure for these  
tuples.  That will be useful when the tuples derived from different  
URIs need to be compared, to determine whether one is in the same site  
metadata scope as the other.

Calling that kind of triple "an origin" seems fine, and is consistent  
with the usage of that word in draft-abarth-origin and elsewhere.

The benefit of using the triple for both discovery and scope is that  
you don't acquire yet another possible cross-origin channel in the  
browser.

>> For other URI schemes, one could either punt on this issue  
>> completely, define a default fall-back to HTTP (or HTTPS, depending  
>> on which of the two better matches the security properties of the  
>> protocol in question), or actually say explicitly what's the  
>> correct scheme.
>
> I'm inclined to punt on it. Default fall-back to HTTP makes too many  
> assumptions.

Same inclination here, actually.
Thomas Roessler | 11 Feb 02:05
Picon
Favicon

Re: host-meta file format comments (draft-nottingham-site-meta-01)

(diverting to www-talk, too...)

On 11 Feb 2009, at 01:20, Mark Nottingham wrote:

> Yeah, I'm not completely happy with it yet. The thought was that  
> since blank lines don't introduce ambiguity here, they're not  
> harmful. OTOH one of my goals for the format is to allow existing  
> HTTP header and MIME parsers (e.g., in Python) to be used on the  
> format, and they very well may barf on a blank line.

Well, they'll barf on blank lines and declare the header over;  
changing that within the parser (or just restarting it on the rest of  
the file) should be relatively cheap.

BTW, I notice that this draft is silent on the HTTP header syntax's  
combining feature for multiple occurences of the same field (last  
paragraph of 4.2, RFC 2616); I suspect that to be one of the more  
likely causes for surprises if HTTP header parsers are re-used.  (No  
such risk with MIME parsers.)

Finally, why disallow whitespace stuffed folding?  It's pretty useful  
to make long lines editable, and I suspect that we're assuming /host- 
meta to be the product of some human with emacs in their hands. ;-)   
Implementing it is easy, and a given if existing parsers are used.

> So, the right thing to do might be to explicitly disallow them, both  
> in BNF and prose. Eran, thoughts?

I'd just prefer to not have the BNF say "no empty lines", and then  
have prose that says the opposite, but with a SHOULD.

>>> 5. Minting New meta-fields
>>
>>> Applications that wish to mint new meta-fields for use in the  
>>> host- meta format MUST register them in the host-meta field- 
>>> registry, following the procedures in Section 7.2. Field-names  
>>> MUST conform to the field-name ABNF Section 3, and field-value  
>>> syntax MUST be well- defined (e.g., using ABNF, or a reference to  
>>> the syntax of an existing header field-value). Field-values SHOULD  
>>> use the ISO-859-1 character encoding. If a field-value applies to  
>>> a scope other than the entire authority, that scope MUST be well- 
>>> defined.
>>
>> Editorial nit: ISO-8859-1 is missing an 8 here.
>
> That one always gets me, thanks.
>
>> More substantially, is there any particular reason to not just go  
>> with utf-8 here?  After all, the content type is *appplication*/ 
>> host-meta anyway.
>
> Same as above; allowing existing parsers and serialisation libraries  
> to be used. That said, there have been many arguments in HTTPbis  
> that existing libraries won't harm non-ASCII characters in transit,  
> but IIRC no one has actually gone out and surveyed what they do...

That suggests that it's a coin toss, unless the mythical "someone"  
does that work.  May I, in that event, suggest that we use a coin  
biased in favor of broader internationalization, i.e., UTF-8?
Thomas Roessler | 11 Feb 02:28
Picon
Favicon

Re: host-meta file format comments (draft-nottingham-site-meta-01)

On 11 Feb 2009, at 02:18, Mark Nottingham wrote:

[ASCII vs UTF-8]

> OTOH we're talking about a SHOULD here. Maybe it just needs more  
> careful guidance; i.e., that you should stick to ASCII unless you're  
> conveying elements for presentation to end users.

Well, one point to consider is how you expect IRIs and IRI references  
to be represented.

There's one school of thought (more common in the IETF crowd) that  
says that these should be convereted to ASCII early, and therefore  
shouldn't occur here.

The other school of thought (more common at W3C) says that they're  
fine in the places where XML and other document formats have always  
accepted URIs, and therefore should be representable in this spot.

There are some properties of the direction that the IDNA update effort  
is going into that suggest that the IETF school of thought is less  
likely to cause interoperability problems.

The other question is what the cost of violating this SHOULD is.   
Assume that some people have a really good reason to violate an ASCII  
or ISO-8859-1 SHOULD, and actually go for UTF-8.  You now get mixed  
character sets in a single metadata file.  I'm not sure that's  
desirable...

(BTW, are we just going down the rathole of defining yet another tag- 
value format that's subtly different?  Maybe the spec should just say  
"use HTTP header format, but with UTF-8", or "use RFC 822, but with  
UTF-8".)

--
Thomas Roessler, W3C  <tlr <at> w3.org>
Eran Hammer-Lahav | 11 Feb 08:21
Favicon
Gravatar

RE: host-meta file format comments (draft-nottingham-site-meta-01)

(not sure how my work email got into this thread... but please replace it with this one)

> From: Mark Nottingham [mailto:mnot <at> yahoo-inc.com]
> Sent: Tuesday, February 10, 2009 4:21 PM
>
> On 11/02/2009, at 12:38 AM, Thomas Roessler wrote:
>
> >> As with HTTP headers, field-names are not case-sensitive,
> >> unrecognised field-names SHOULD be silently ignored when parsing
> >> this format, and ordering of fields SHOULD NOT be considered
> >> significant unless specified otherwise. Additionally, although the
> >> syntax does not explicitly allow empty lines between fields,
> >> parsers SHOULD silently discard them (i.e., be permissive in what
> >> they accept). Field content is constrained by the specification
> >> indicated by its associated field-name.
> >
> > What's the cost of just permitting empty lines between fields in the
> > sytnax vs having the current "SHOULD parse"?  The current text
> > sounds like a gratuitous interop problem to me.
>
> Yeah, I'm not completely happy with it yet. The thought was that since
> blank lines don't introduce ambiguity here, they're not harmful. OTOH
> one of my goals for the format is to allow existing HTTP header and
> MIME parsers (e.g., in Python) to be used on the format, and they very
> well may barf on a blank line.
>
> So, the right thing to do might be to explicitly disallow them, both
> in BNF and prose. Eran, thoughts?

I wanted to either allow in both or explicitly disallow in both. Allowing them has the advantage of
disabling the ability to stick other payload after the line break. But I see the logic in following the HTTP
header structure which was my original inspiration for this general structure. I would say I am neutral
with slight lean towards disallowing.

EHL
Eran Hammer-Lahav | 13 Feb 08:18
Favicon
Gravatar

FW: I-D Action:draft-hammer-discovery-02.txt

Please discuss on the www-talk <at> w3.org list.

For those who have read previous revisions (thanks!), please note that except for Appendix B, the rest of the spec was significantly changed and a fresh read is recommended.

Thanks,

EHL


------ Forwarded Message
From: <Internet-Drafts <at> ietf.org>
Reply-To: <internet-drafts <at> ietf.org>
Date: Fri, 13 Feb 2009 00:15:02 -0700
To: <i-d-announce <at> ietf.org>
Subject: I-D Action:draft-hammer-discovery-02.txt


A New Internet-Draft is available from the on-line Internet-Drafts directories.

        Title           : Link-based Resource Descriptor Discovery
        Author(s)       : E. Hammer-Lahav
        Filename        : draft-hammer-discovery-02.txt
        Pages           : 25
        Date            : 2009-02-12

This memo describes a process for obtaining information about a
resource identified by a URI.  The 'information about a resource', a
resource descriptor, provides machine-readable information that aims
to increase interoperability and enhance the interaction with the
resource.  This memo only defines the process for locating and
obtaining the descriptor, but leaves the descriptor format and its
interpretation out of scope.

A URL for this Internet-Draft is:
http://www.ietf.org/internet-drafts/draft-hammer-discovery-02.txt

Internet-Drafts are also available by anonymous FTP at:
ftp://ftp.ietf.org/internet-drafts/

Below is the data which will enable a MIME compliant mail reader
implementation to automatically retrieve the ASCII version of the
Internet-Draft.


------ End of Forwarded Message
Attachment (ATT00001.txt): application/octet-stream, 258 bytes
_______________________________________________
Apps-Discuss mailing list
Apps-Discuss <at> ietf.org
https://www.ietf.org/mailman/listinfo/apps-discuss
Adam Barth | 10 Feb 17:58

Re: Origin vs Authority; use of HTTPS (draft-nottingham-site-meta-01)

Wow, this draft is scary.  I haven't seen the prior discussion of this
draft, but we should learn from the mistakes of Flash's
crossdomain.xml policy file.  In particular, you should require that
the host-meta file should be served with a specific mime type (ignore
the response if the mime type is wrong.  This protects servers that
let users upload content from having attackers upload a bogus
host-meta file.

Also, if you want this feature to be useful for Web browsers, you
should align the scope of the host-meta file with the notion or origin
(not authority).  Section 4 seems to imply that the scope is
"www.example.com:80" but Section 6 implies the scope is
"https://www.example.com".  In fact, computing the origin of a URL
correctly is more complex than this draft assumes.  For details, see
my origin draft.

That said, I think host-meta would be super useful if specified correctly.

Adam

On Tue, Feb 10, 2009 at 6:57 AM, Thomas Roessler <tlr <at> w3.org> wrote:
> Reading draft-nottingham-site-meta-01...
>
>> 4. Discovering host-meta Files
>
>> The metadata for a given authority can be discovered by dereferencing the
>> path /host-meta on the same authority. For example, for an HTTP URI
>> [RFC2616], the following request would obtain metadata for the authority
>> "www.example.com:80";
>
> Editorial nit: That semicolon wants to be a colon.
>
>> GET /host-meta HTTP/1.1
>> Host: www.example.com
>
> It is somewhat unclear what the scope of the host-meta file is, or more
> precisely, how the URI for the host-meta file is derived from the URI of the
> resource that the metadata apply to.
>
> Section 4 seems to suggest that the URI is maybe generated by dereferencing
> the relative URI reference /host-meta using the resource's URI as the base
> URI, but it doesn't say that clearly; the use of "authority" suggests that
> the choice of the protocol is actually up to the implementation.
>
> From the previous apps-discuss thread, it seems like the main use case for
> permitting metadata to leak across schemes (and therefore, typically ports
> -- though ports and schemes are strictly speaking orthogonal) lies with URI
> schemes that do not have a resource retrieval operation readily available,
> e.g., mailto.
>
> On the other hand, I'm extremely wary about anything near HTTP that might
> tear down origin boundaries without a great deal of care.  E.g., a purely
> authority-based approach might permit metadata to leak from the HTTP part of
> a site (where no integrity protection is given) into its HTTPS part (where
> integrity protection and authenticity of data is deemed important), possibly
> permitting attacks against web applications that are ostensibly protected --
> as is alluded to in the security considerations.
>
> The obvious solution to that part of the puzzle is to let the mechanism
> default to the same URI scheme, unless there is a specific convention to the
> contrary.  That should cover any URI schemes for which a safe retrieval
> operation is defined (HTTP, HTTPS, FTP come to mind).
>
> For other URI schemes, one could either punt on this issue completely,
> define a default fall-back to HTTP (or HTTPS, depending on which of the two
> better matches the security properties of the protocol in question), or
> actually say explicitly what's the correct scheme.
>
> Thoughts?
>
> --
> Thomas Roessler, W3C  <tlr <at> w3.org>
>
>

Gmane