hatom-issues: Difference between revisions

From Microformats Wiki
Jump to navigation Jump to search
(Raised issue: entry id)
Line 68: Line 68:


Template section: if there is something clearly from an Atom ''Entry'' that you'd like in hAtom 0.2, use this section as a template and replicate it in place here. See the hAtom 0.1 section below for more details.
Template section: if there is something clearly from an Atom ''Entry'' that you'd like in hAtom 0.2, use this section as a template and replicate it in place here. See the hAtom 0.1 section below for more details.
== Entry <i>id</i> (atom:<i>id</i>) ==
<small>2006-04-01 raised by [[User:RobertBachmann|Robert Bachmann]]</small>
atom:<i>id</i> is required for atom:entry. Thus it should be available in hAtom to.
The Entry permalink should be used as the entry id.


== Author ==
== Author ==

Revision as of 14:17, 1 April 2006

hAtom 0.2

This section is for discussing what you'd like to see in the next version of hAtom, i.e. 0.2.

Feed XXX (atom:xxx)

Template section: if there is something clearly from an Atom Feed that you'd like in hAtom 0.2, use this section as a template and replicate it in place here. See the hAtom 0.1 section below for more details.

Feed id (atom:id)

2006-04-01 raised by Robert Bachmann

atom:id is required for atom:feed. Thus it should be available in hAtom to. The Feed permalink should be used as the feed id.

Feed permalink (atom:permalink)

2006-04-01 raised by Robert Bachmann

I'm proposing the following rules:

  • a Feed Permalink element is identified by rel-bookmark at the feed level.
  • a Feed SHOULD have a Feed Permalink
  • a Feed Permalink element represents the concept of an Atom link in a feed.
  • if the Feed Permalink is missing, use the URI of the page; if the Feed has an "id" attribute, add that as a fragment to the page URI

Feed updated (atom:updated)

2006-04-01 raised by Robert Bachmann

atom:updated is required for atom:feed. Thus it should be available in hAtom to. I'm proposing the following rules:

  • The Feed Updated element is identified by the class name updated at the feed level.
  • If no element with the class name updated is present, use the youngest updated from the feed's entries.

Algorithm:

$a = array();
for each $entry in $feed {
    if ($entry.updated)
      $a.add(pad_datetime($entry.updated))
    else
      $a.add(pad_datetime($entry.published))
  }
$a.sort_by( datetime_to_utc($element) )
$feed_updated = $a[0];

Feed title (atom:title)

2006-04-01 raised by Robert Bachmann

atom:title is required for atom:feed. Thus it should be available in hAtom to.

I'm proposing the following rules:

  • a Feed Title element is identified by the class name entry-title
  • a Feed SHOULD have an Feed Title
  • a Feed Title element represents the concept of an Atom feed title
  • if the Feed Title is missing, use
    • the first <h#> element in the Feed, or
    • the <title> of the page, or
    • assume it is the empty string
  • 2006-04-01 ChrisCasciano - I think that the fall back to using the first h# on the page is dangerous.. depending on the pge it may be something that changes often (first h# is a post title) or is otherwise ambiguous. I would think using <title> before h# would be prefered if not the most common desire of the page author.

Entry XXX (atom:xxx)

Template section: if there is something clearly from an Atom Entry that you'd like in hAtom 0.2, use this section as a template and replicate it in place here. See the hAtom 0.1 section below for more details.

Entry id (atom:id)

2006-04-01 raised by Robert Bachmann

atom:id is required for atom:entry. Thus it should be available in hAtom to.

The Entry permalink should be used as the entry id.

Author

author as an hcard is too much to require

The following 3 items were extracted from the conversation starting on irc with logs available starting around here

  • Fil If, for example, you are programming an "aggregator" of news syndicated from many sources like in Sedna, chances are that you don't control what "authors" look like; they can be nicely microformated (if coming from an mf-enabled system), but most probably they will be internally represented by a string that contains, in some random order, a name, and/or an email, and so on. If you want to pass on this information in an hAtom feed, you can't possibly reformat it to an hCard. But you still want to pass it on in a <div class="author"> element.
    • Tantek I don't believe the "can't possibly" statement. Please provide a URL to a concrete example that you think you can't possibly reformat into an hCard so we can all take a look.
    • ChrisCasciano details of Fil's extraction in irc logs including sting data passed to his app in the form of "Béatrice XXXXXXX beatrice.xxxxxx@@zzzzzzzzz.com"
    • Fil the example url was given up there (Sedna); note that the author information comes from syndication links; nobody is going to edit them to outline what is the name, what is the email and so on, as everything is flowing through automatically... so here the "author" data is dirty, and will not be cleaned into an hCard. We can force it to be in an hCard but it will be meaningless if the source (original data) wasn't built on an mf-enabled software.
  • pnhChris i don't disagree.. the field often comes from places too dumb to follow these rules well; even cases like wordpress that allow users to present their name 1 of 6 or 8 difference ways (from username to LF, FN) .. its not just writing a template to output as hatom at that point... you have to go further upstream where the string to be displayed is chosen .. I also think its pointless to have 10 vcards on the same page whose only data is a generic name like "Chris"
    • Tantek 10 vcards that are the same is pointless yes, but identifying who the author of 10 posts are is not pointless - that's the difference.
    • ChrisCasciano Agreed, but I still have concerns that "author" in hAtom does not always make for good hCards, though the situations where it does is optimal. My comments in the conversation were old comments I've made before over concerns and hardships or the lack of desire to make crappy data more portable, in neither of these cases do I think my two comments alone provide reasons to make change from the hAtom 0.1 spec
  • Fil for the moment, to comply losely with hAtom 0.1, I will use <span class="author"><span class="vcard"><span class="fn">My Name</span></span></span> ; but it's not good
    • Tantek You can actually simplify that (one fewer span) with: <span class="author vcard"><span class="fn">My Name</span></span>

Other Questions and Issues

General comments, modeling issues, algorithm issues, should have issues, etc. go here.

Entry Updated Required? -- Blogger Issue

The hAtom 0.1 spec states if there is no Entry Updated element...the page is invalid hAtom I have a real problem with this because I work with Blogger, where we cannot output datetime-design-pattern-compatible datestrings for our posts... We can output some different human-readable formats and we can output a nanosecond unix-timestamp, but the template tags will not output YYYY-MM-DDTHH:MM:SS+ZZ:ZZ no matter what you do... so how are we to resolve this so that Blogger blogs can use hAtom? -- singpolyma 05:45, 28 Mar 2006 (PST)

  • DavidJanes: I'm not sure if anything can be done. My thought right now is just to put "x-posted" (or whatever) as the semantic class name and eventually hope that they'll adopted something more flexible. A while back there was a proposal that ABBR be used to describe a parsable version of the date string (for example title="day month-name year, hour12:minute ampm tz") but I think this is asking too much from creators and parsers.

hAtom 0.1

This section is more or less closed, as hAtom 0.1 is out the door. If there are open issues that you are championing that didn't make it into hAtom 0.1, move them up above to the hAtom 0.2 section

This page documents the issues that have been raised regarding the hAtom draft during the course of its development, and the resolutions of those issues (often with accompanying opinions).


Contributors

  • Danny Ayers
  • Robert Bachmann
  • Paul Bryson
  • Benjamin Carlyle
  • Chris Casciano
  • Tantek Çelik
  • David Janes
  • Ryan King
  • Kevin Marks
  • Scott Reynen
  • Brian

Feed (atom:feed)

RyanKing: STATUS: RESOLVED - 'hfeed' and not required (a la hcalendar)

Initial proposal

atomfeed (or rather, "atom-entry")

  • DannyAyers: But what does 'feed' mean in the context of a HTML page? Doesn't the <head> element cover the corresponding semantics?
  • DavidJanes: It is possible, somewhat common, and documented, that multiple feeds can appear on a single page, so it's insufficient to depend on the header, even though this may be the default case. You'll note that I've left out documenting a lot of concepts relating to feeds at a conceptual level, except for noting they exist because I think this is a bit of a swamp that's going to need more thinking
  • DavidJanes: I'm going to more explicitly recognize that the XHTML document may act as an implicit feed in many cases
  • DavidJanes: A Feed is a group of related Entries; what defines the relationship is entirely up to the author of the blog, except to note that if they were to place them together in the same Atom syndication feed, you'd do the same in the XHTML
  • Dr. Ernie 16:59, 25 Oct 2005 (PDT): This makes sense to me, the way vcalendar is optional since vevent is usually sufficient.
  • Tantek: Ernie is precisely correct. The vevent/vcalendar :: entry/feed analogy is precisely correct.
  • DannyAyers: The multi-feed point makes sense, but if this data appears on a regular HTML page the question remains, does "feed" make sense? (Maybe just naming aesthetics again)
  • David Janes I'm thinking about it more -- I think so, just to split the content of the webpage up (as opposed to blogrolls, headers, footers, etc.) --
  • Tantek: Agreed with David. Not only does it make sense, it is a bad idea to consider renaming something like that for "aesthetics".
  • Tantek: Per the root-class-name naming practices, we should seriously consider a more "unique" name, e.g. some possibilities:
    • atom-feed
    • hfeed

Alternatives

The above proposal was not fully accepted and some other possibilities were proposed:

  • feed (Atom consistency)
  • atom-feed (Atom consistency with prefix)
  • hfeed (h* uF consistency)
    • +1 DavidJanes
    • +1 Tantek
    • +1 BenjaminCarlyle
    • +1 MarkRickerby
    • +1 DannyAyers

Discussion

The feed is a root class name of hAtom, similar to "vcalendar" in hCalendar, thus it should be fairly unique, per the root class name naming-principles. - Tantek

Entry (atom:entry)

RyanKing: STATUS - RESOLVED - 'hentry'

Initial Proposal

atomentry (or rather, "atom-entry")

  • DannyAyers: Why not simply "entry"? The parallel to Atom is clear, but in the context of a Web page, why add the reference? In case maybe you want to try for something approaching a string that won't get confused, my feeling is: forget it. Stick to the local semantics and let the doc-level (or HTML5 div level?) profile attribute disambiguate. Or to put it another way, it's premature to see a need at that point.
  • I (David Janes) chose the "atom" prefix:
    • to disambiguate; it is just too likely that "entry" or "feed" would appear on a random webpage in some other context. My preference would be to have a declarative statement in the XHTML header which would render this argument moot, but at this point the community seems cool on the concept.
    • to follow the naming pattern seen in the other compound microformats (hCard, hCalendar, etc.)
    • because Entrys will not be required to be in Feeds (these rules and the reasons where this can happen will be forthcoming), I choose to disambiguate both
      • I don't like the analogy; I think this is more useful than just Atom, so it should be made generic. Dr. Ernie 16:59, 25 Oct 2005 (PDT)
      • DannyAyers: My point exactly, but it wouldn't be the end of the world if the prefix was there - not really more than aesthetics...
      • STATUS - RESOLVED. We're going with "entry".
      • Tantek: This is actually difficult to consider outside the following issue. In particular, if "entry" is to serve as a potential root class name (similar to "vevent", which may be a root of an hCalendar event, or may be present in the context of a "vcalendar"), then we should strongly consider "uniquifying" it per our root-class-name practices. Possibilities to consider:
        • atom-entry
        • hentry
        • vjournal (from RFC 2445 and thus borrowed in effect from hCalendar)

Alternatives

The above proposal was not fully accepted. Other alternatives:

Discussion

  • Tantek: Since feed is optional in hAtom (thereby implying the context of the entire XHTML document as the feed), similar to how "vcalendar" is optional in hCalendar (thereby implying a vcalendar context for the entire document), the entry can also be a root class name, similar to "vevent" in hCalendar, thus it should be fairly unique, per the root class name naming-principles.

    If we are deliberately rejecting "vjournal", then we may want to exclude the entire "vjournal" object (and any vjournal specific properties) from hCalendar so that we don't accidentally have two blog post microformats.(RyanKing added this to hcalendar-issues)

    Having analyzed the list of vjournal properties and their semantics and compared them with the list of Atom elements and their semantics, I greatly prefer the list and semantics from Atom over vjournal. Thus I would be ok with excluding vjournal from hCalendar, and pointing folks to use hAtom instead, even in the context of an hCalendar element that would otherwise be outputting vjournal entries. To that extent, once hAtom has stabilized, we should develop a mapping between vjournal properties and hAtom properties so that hAtom inside an hCalendar could be converted into BEGIN:VJOURNAL...END:VJOURNAL objects in an iCalendar/ics stream, as well as allowing for the opposite, so that one could even use an iCalendar-compliant authoring tool to create hAtom via the journal feature of said tool.

Entry Title (atom:title)

RyanKing: STATUS - RESOLVED - going with 'entry-title, to be consistent with 'entry-content'

proposals

The title class is defined by hCard to mean "job title". Possible alternatives include (Please add to list):

  • summary, as used by hReview, hCalendar, VJOURNAL
    • Tantek: Though I agree with the reuse, in this context, it may be confusing for those reading/familiar-with the Atom specification. We may want to avoid the use of 'summary' entirely within hAtom.
    • -1 KevinMarks (clashes with atom)
  • Subject, as used by SMTP email
    • -1 RyanKing - different semantics, doesn't fit
  • heading
    • -1 RyanKing - a replication of <h*> semantics in html
  • entry-title
  • headline
  • title (Atom consistency)
    • -1 Tantek. Already defined to mean something else in hCard. The same term should not be used to mean different things.
  • entry-title (Atom consistency, avoid hCard conflict)
  • fn (attempt to re-use from hCard and hReview)

Discussion

  • BenjaminCarlyle: If one were to review a blog entry with hReview we would fill out the "fn" field with the atom:entry/title. This suggests to me that fn may be sufficient for this title usage. headline is more semantically specific, and does seem appropriate. It may be a line-ball call as to whether a new term is required, or whether the atom:entry context is sufficient to indicate the fn is also a headline.
  • BenjaminCarlyle: Are we considering atom:feed/title in this discussion? There is some suggestion that atom:title should be "fn", separate to any value of atom:entry/title.
  • DavidJanes: vcard defines "FN" to be "to specify the formatted text corresponding to the name of the object the vCard represents". If we reject FN, are we not making too subtle a distinction that the atom:title isn't the name of the post? I'll also note that the domain experts believe that the atom:title of an entry is pretty well the same sort of thing as the atom:title of a feed.
  • Tantek: First, I have re-evaluated using "fn" for feed:title per the information from Benjamin, David and others. See this discussion for details.

    Second, I now agree with DavidJanes and the domain experts that the title of a feed is very similar (if not nearly identical) in semantics to the title of an entry, neither of which can really be considered a name.

    Thus I am -1-ing "fn" for title for entry or feed since it doesn't mean the same thing.

  • DavidJanes: to summarize (I think), Tantek argues on the link above that atom:title can and does include more than the name.
  • DavidJanes: we're now at the point where FN is the title of a movie, a DVD, and a book, but not the atom:title of an entry and definitely not the atom:title of a feed.
  • BenjaminCarlyle: Entry and feed titles are both usually used as the name of the entry of feed, however examples exist where the entry title is changed for republication or is an auto-generated string (eg date). Headline is a good substitute at the entry level, and has a clear analogue in print.

    If headline is selected for entry a different term would be required for feed. Headline cannot meaningfully be used for a feed title any more than the name of a newspaper can be called a headline. Working back from the newspaper analogue, I am aware of the use of both name or title to describe the analogous text. In the absence of evidence that a feed's desired title is ever anything but a human-created name for the blog, my support falls behind fn for feed title only. The danger remains that someone will supply non-name data as "fn" in order to "get it into the atom:title element". For this reason I remain open to further naming suggestions and to any example in the wild where this might already occur.

    There has been some discussion that because the two are a single term in atom the domain experts consider the semantics to be the same. I suggest differently. The double use of title is inherited from rss, and has always been disambiguated by context. rfc4287 defines title as "a Text construct that conveys a human-readable title for an entry or feed", which conveys no useful semantics. Everything in a microformat is human-readable, and it isn't suprising that the semantics of title are equivalent to "title". To be honest, I would guess that the domain experts didn't give this issue a second thought.

  • DavidJanes: RESOLVED Let's go with "headline". I'm not in love with it but so it goes. My thinking on this at this point is we won't find a good word that covers atom:entry/title and atom:feed/title and I like the idea of a (somewhat) domain specific word that captures the concept and (especially a big point for me now) it will make mixing hAtom with other uFs a little nicer.
  • PaulBryson: I like entry-title for it's clarity. Unfortunately, I also feel that hyphenating names together in a string adds unnecessary complexity. In this case, it also adds a specificity that could be detrimental in the element's reuse. Headline feels redundant with "heading", which is what the element should be. Regardless, this is probably the best of the available choices.

Entry Content (atom:content)

STATUS - RESOLVED going with entry-content


Discussion

  • Tantek - It turns out there is actually a very fine semantic distinction between the way "description" is used in vCalendar, hCalendar, xFolk, hReview, and what "content" means. In short, those other microformats are all "about" something else, whether an actual event in spacetime, or another item. Whereas in hAtom is the thing itself. The feed is the data is the item. Thus it makes sense use a different class name than "description". Based on our naming-principles, lacking an existing microformat term for this, we should use a term from a standard. Since Atom uses "content", that is the logical name to bring over and use, whether or not it is "perfect" to capture the semantic we are trying to capture.
  • BenjaminCarlyle: We may also have to consider forms of blogs that carry other media. An <a rel="content" href="..."/> form of content may also have to be considered, although this could still be embedded in a very short html content block. I'm not quite ready to commit to "content" yet, but I agree that description may be a little weak.
  • ChrisCasciano - I'd be a bit cautious about equating usage of the content class in the wild with the specific usage you'd adopt here -- that of the content of a particular item or entry. As a deveoper I know I've used the term content to designate larger page sections or as synonym for content body (or that which is not header, nav or footer). In most cases my usage has been via ID which is safe (though perhaps confusing usages of similar terms) but I'm certain I've also used it as a class to free up ID for more specific information on larger sites.
  • Tantek: Chris Casciano is right. Not only that, but note the Google HTML survey of about a billion documents found that many web authors use "content" as a class name already, for whatever purpose they are intending. I have changed my vote to -1 for "content".
  • Tantek: I have added a few proposed alternatives based on discussions with various folks. I also checked synonyms for content but didn't find anything worth proposing. I have split my vote among the new alternatives for now.
  • ChrisCasciano - added hcontent per irc conversation a few nights ago. Not necessarily my favorite, but it should probably be on the table for discussion.
  • KevinMarks - I think entry-content is OK - if we go by existing practice in blogs, post-body or post are common.
  • ChrisCasciano - I'm behind entry-content as the least bad choice I've thought over.. atom-content doesn't 'read' generic enough for my tastes ('is it content for the page, or something just for atom export')

Entry Summary (atom:summary)

STATUS - RESOLVED - going with 'entry-summary'

The summary class is defined by vCalendar, iCalendar, hCalendar, and also hReview, to mean "summary or title". Possible alternatives include (add to list):

  • description, as used by VJOURNAL. It may be possible to interpret description as text longer than summary which is about the entry content. The hierarchy of detail would be summary (atom:title) -> description (atom:summary) -> content (atom:content)
    • Kevin Marks: description is used ambiguously by RSS to mean 'content' or 'summary', and by hReview and hCalendar to mean 'content'. Doing this would recreate that ambiguity needlessly, when Atom distinguishes it clearly.
    • Tantek: Kevin's right, and not only that, "description" does NOT mean summary in VJOURNAL. "description" means "full description" in vCalendar, iCalendar, hCalendar, and also hReview. We must NOT use "description" to mean summary.
  • summary (re-use from and consistency with Atom)
  • content-summary (Atom consistency avoiding hCalendar conflict)
  • partial-description
  • excerpt
    • +1 Tantek
    • +1 BenjaminCarlyle
    • +1 DavidJanes, my only concern being that they're not always excerpts
  • abstract
    • +1 KevinMarks
    • +1 Ernest Prabhakar: this is what my blog software calls it, and how I use it in my own blogs

Discussion

  • Tantek: Excerpt is by far the most frequent (>80%) use of summary, thus it makes sense to name it as such.
  • Kevin Marks: Disagree - Atom allows summary to be distinct from content, though this is less usual. However, by using a class that means summary (eg abstract) we can convey an excerpt by making it wholly within 'atom:content', or a separate abstract by putting it within the entry but not within 'content'
  • BenjaminCarlyle: I have been trying to convince myself that atom:summary differs semantically from iCalendar summary. The "summary or subject" wording from rfc2445 is problematic, and it seems earlier microformats have taken the "subject" side. If we were to start from rfc2445 alone we might go the other way. In the end, though, webster.com defines summary as "covering the main points succinctly". atom:summary is not really consistent with that definition, so I'll swing my weight behind excerpt. On the subject of abstract, I think the semantics are such that "abstract" and "exerpt" are distinct (non-overlapping) sets. webster.com defines abstract as "a summary of points (as of a writing) usually presented in skeletal form". An exerpt is not a summary of points, and a summary of points is not an excerpt. I think tantek is simply suggesting that the 80% win in this case.
  • Tantek: Benjamin is correct. The vast majority (easily 80%+) of summaries in Atom, when they exist are excerpts.

    In addition:

    • WordPress user interface calls it "excerpt"
    • MovableType user interface calls it "excerpt"
    Thus, based on the principle of user-centered design (an instance of humans first, machines second) as well, in that a user *typing* into the "Excerpt:" field in the UI of their blogging tool, is communicating to the interface that "This is the excerpt of my blog post", "excerpt" is actually a BETTER name for this element than summary, or anything else for that matter. Atom should have chosen "excerpt" as well based on this reason alone.
  • ScottReynen: I think there's a chance Tantek is mistaking cause and effect. Perhaps >80% of summaries are excerpts because two of the most popular publishing tools label the summaries as excerpts. Maybe we should be more sure WordPress and Movable type aren't actually confusing authors by using excerpt before following those examples.
  • ChrisCasciano: The Textpattern interface also calls this field an excerpt.

Entry Permalink (atom:link)

STATUS - RESOLVED - 'bookmark'

  • rel="bookmark" (HTML consitency)
    • +2 DavidJanes
    • +1 Tantek
    • +1 BenjaminCarlyle
    • +1 KevinMarks

Discussion

  • KevinMarks: I know this maps through to the atom name, but rel="bookmark" is the established standard for permalinks, and is included in the w3c list of rel's, so there is an Occam's Razor case for using this.
  • DavidJanes: I'd like input from everyone in this -- I'm torn really. Once I knock this thing into more of a complete state, I'll throw this out onto the mailing list for discussion
  • DavidJanes Also, "link" is horribly generic and is in fact modified through the "rel" attribute in Atom.
  • Tantek: Agreed with what Kevin wrote. Also, rel="link" doesn't actually make sense when you do the analysis as described in the rel-faq. The destination of the link is not really a "link" itself with respect to the current document/file.
  • David Janes: OK, I'm happy with this.STATUS - RESOLVED. We are using rel="bookmark".
  • BenjaminCarlyle: No real controversy here, unless you want to start giving blog entries or feeds vcards. A vcard could contain entry or feed title as fn, as well as url.
    • RyanKing non-issue, you can always use both.

Entry Published (atom:published)

Discussion

  • BenjaminCarlyle: I would still like to see a clear engagement with last-modified before voting on this one.
  • Tantek: last-modified reflects the last time the page/file was actually modified, most likely by the user. IMHO it is a 1:1 mapping of the "Date Modified" of a file in a file system. It is a direct mapping of what date is shown for HTTP directory listings.

    published is defined in Atom quite differently from that, and among the alternatives it seems best to take the name from Atom precisely.

  • BenjaminCarlyle: From the last-modified-brainstorming purpose statement, emphasis added. "To specify the date of publication and the date of modification of a web page (or a part thereof)"
  • Tantek: Note that Atom chose to drop "created" which is much more reflective of what current file systems etc. support.

    The concept of "published" is distinct from a generic "created" notion, in that it indicates when the content was made public or made available to readers (even on intranets) which is often very different than when the author started typing the entry or even first saved the entry.

  • DavidJanes: It's simple, it's clear, it's not being used it's not being used already. We can make last-modified consistent afterwards
  • RyanKing: I'm a bit wary of using someing so generic as 'published' for this. I need to go back throught blog-post-examples to see what conventions we have.
  • Tantek: I have the same concerns as Ryan, and in addition, it may be useful from a parsing perspective to adopt a dt prefix convention for ISO8601 typed properties.

Entry Updated (atom:updated)

STATUS - RESOLVED - 'updated'

  • updated (Atom consistency)
    • +1 Tantek
    • +1 DavidJanes
  • dtupdated (Atom consistency with dt unofficial pattern)
    • +½ Paul Bryson, Not as human readable
    • +0.5 Tantek (want to consider it, while we can)
  • last-modified
    • VJOURNAL LAST-MODIFIED (also HTTP)
    • dtstamp
    • dtupdated

Discussion

  • PaulBryson: I would prefer to maintain some consistency with already existing date naming conventions, but acknowledge that these aren't as clearly human readable as they could be.
  • BenjaminCarlyle: I would still like to see a clear engagement with last-modified before voting on this one.
  • Tantek: See discussion for published. Updated is closer to last-modified than published is, however, upon careful reading of the definition of updated in Atom, it is clear that the user has the option of not changing the updated date even if they change the entry, e.g. by fixing a spelling error or something. Thus there is an implied stronger meaning of "this entry has been semantically changed" that is a different enough semantic from last-modified as to justify a new name, and among the alternatives it seems best to take the name from Atom precisely.
  • BenjaminCarlyle: From last-modified-brainstorming semantics:

    "Since both Atom and HTTP define the last-modified date (or its equivalent) as a "user-defined" value, this microformat should have the same semantics. In other words, the value should represent the last instance that the resource was changed in a way deemed significant to the publisher/author, which is not neccessarily the same as a file-system modified date-time."

  • Tantek: They are both user defined values but *different* user defined values.

    It is VERY important to note this distinction because Atom chose to note it.

    In the 99% case, file-system, web-server (HTTP) context, the last-modified date reflects the last time the *user* modified the file or page, WITHOUT consideration for whether or not the user wanted that change to reflect a change in the last-modified date.

    Atom specifically allows for the exception that a user might not update the "updated" date, even when they change the underlying blog post, spelling corrections or whatever.

    This is in stark contrast to the traditional application model, where in a word processor, even if you change one character and save, you change the file system last-modified date, and hence the HTTP last-modified headers.

  • DavidJanes: we can make last-modified consistent afterwards

go back throught blog-post-examples to see what conventions we have.

  • Tantek: Similar to comments on "published", it may be useful from a parsing perspective to adopt a dt prefix convention for ISO8601 typed properties.

Entry Author (atom:author)

STATUS - RESOLVED - 'author' required, should use <address>

Discussion

  • BenjaminCarlyle: I think an author concept is generally useful to microformats, so long as you can make it clear whether it is the author of the uf wrapper or the author of the uf content that is being described. I think any wavering over whether author and contributor are both required is probably a step outside the atom specification. This may be worthwhile, with an xfn-style external definition that could relate a person to a work... or even a rel-tag-based relationship. Can room be left open for both of these possibilities for future expansion, while still providing a clear author -> atom:author translation?
  • Tantek: My point is that in practice (>80% case again), contributor is not used. Thus we should exclude it from hAtom in the first version. However, I am ok with reserving contributor with the intent that if it does somehow take off, we can add it later.
  • RyanKing is <address> not sufficient for 'author' semantics?
  • DimitriGlazkov I don't believe it is. The author of the feed and the author of the page (which is what <address> implies could be two different things. I just ran into this problem when trying to mark up a feed inside of a post.

Entry Contributor (atom:contributor)

    • -1 Tantek (see Discussion)
  • contributor (Atom consistency)
    • +1 Tantek
    • +1 DavidJanes

Discussion

  • Tantek: I recommend we postpone contributor from hAtom first version (thus the -1 before any choices), since the 80% case does not need "contributor". We should reserve the name so we can add it later if we need it (thus the +1 on "contributor").
  • DavidJanes: RESOLUTION: DEFERRED

Entry Geo (geo:Point)

  • Brian: GeoRSS is away to embed geo-position information into an entry, it is NOT part of Atom nor is this directly part of hAtom. This is an addition that can add value to a post. Microformats has already defined a way to add geo position data into HTML it is possible to combine the two in a single entry.

GeoRSS Resources

Questions and Comments

Limitations

  • There seems to be nothing in the hAtom specification to supply metadata for the blog (title, description, url, feedurl). There is nothing defined for the encapsulation of comments, comment counts, or links to comment sections. The microformat would be much more useful with these capabilities added.-- singpolyma 03:35, 3 Jan 2006 (PST)
    • We've deliberately restricted this to being a "blog post" microformat at this point to make the problem manageable. Once the core elements are defined, we will consider extended the spec to cover as much as Atom does. Also note that microformats are compositable, thus, all these things could potentially be defined elsewhere with detriment to this standard. -- DavidJanes
    • RyanKing: STATUS:DEFERRED/REJECTED: As David says, our scope is limited. After we can establish the core specification of hAtom, we'll look at adding more properties.

Relationship to hReview definitions needs clarification

[DavidJanes?] hAtom will define terminology for the general act of publication that overlaps with hReview's terminology for the specific act of publishing a review of something. The following terms could be pushed back into hReview:

  • atom:published -> hReview dtreviewed
  • atom:author -> hReview reviewer

Tantek: "Pushed back" is the wrong direction here.

The right direction is "re-use" by new proposals/drafts. If you see anything in hReview that appears to overlap this new specification, the first thing to do is to see if you can reuse those terms from hReview in this new specification, not vice versa.

In addition, "published" does not mean the same as "dtreviewed" (you might write a restaurant review just after you eat there, but not actually "publish" it until later). "reviewer" is also a more precise semantic than "author", thus the two should not be collapsed.

hCards

DavidJanes: Should hCards be required for the <address> of the Entry Poster? MAY, MUST, SHOULD? Your thoughts please.

RESOLVED: MUST use hCard for author.

  • Robert Bachmann: “MUST” or at least “SHOULD” because atom:author is specified as "The 'atom:author' element is a Person construct that indicates the author of the entry or feed." and <address>’s semantics are too loose to describe an Atom person construct but using <address class="vcard"> we would have pretty good 1:1 mappings:
    • atom:name ↔ hCard’s FN
    • atom:email ↔ hCard’s EMAIL
    • atom:uri ↔ hCard’s URI
  • STATUS - OPEN. "MAY" is the answer.
  • Tantek: I think this should be MUST. Atom should have referenced vCard for these semantics and made the mistake of making up their own terms. Let's undo that mistake with hAtom. Also, hReview 0.3 has made hCard a MUST for the "reviewer" property, based on experience and feedback. Thus we may want to just follow suit with hAtom as well.
  • DavidJanes: I had based the behavior on hReview 0.2. The problem is getting meaningful information into the blog templates and also I would appeal to parsimony, that is:
    <div class="author">bonehead</div>

    has an assumed defined mapping to

    <div class="author vcard"><span class="fn">bonehead</div></div>

    Since in many cases we're not going to get much more information than that, why add the verbosity? I note an analogous situation in hCard, where N.* are not required because they can be inferred algorithmically.

Comparisons

This seems precisely analogous to S5:

  • atomentry <-> slide
  • content <-> slidecontent
  • summary <-> handout

I'm all for NOT boiling the ocean, but these really seem like the same cup of tea.

--Ernie Prabhakar

  • David Janes: See the #Purpose section above. Basically that drove the design decision for the naming

    STATUS - REJECTED. We're sticking with atom terminology (entry, content, summary).

  • Tantek: As far as the analogy to S5, yes, there is an analogy, but that doesn't make them the same. The semantics that are represented are different enough to let these evolve independently and see if content authors want them to converge or not. Note that you can overlay hAtom and S5 in the same markup. Anyone that is serious about converging these should *try* using both at the same time in a *real* slide presentation example and report back their experience.

Repeated Elements

We allow certain elements to be repeated, such as Entry Permalink, Entry Published and Entry Title, even though there can be at most one real value. We provide "disambiguation" rules for sorting out which is the real value. See here, here, here and here.

Your thoughts please... -- David Janes

STATUS - RESOLVED. The spec has explicit rules for disambiguating all these items if they appear multiple times.

Opaqueness

If you have concerns about opaqueness, that is, stopping interpretation below certain hAtom elements, raise them here.

Opaqueness of other microformat elements

How would we handle a case where someone wanted to provide a vcard under the class~=entry element for an individual who was neither author or contributor? Consider the hypothetical case where someone wanted to list their "muse" alongside article author and contributors. If this vcard included a title it might be included accidentally as an <atom:title>.

To summarise, Is it possible that other microformats found under the class~=entry or class~=feed elements need to be considered opaque?

-- BenjaminCarlyle

  • David Janes: The issue of "muse" and such is somewhat out of scope. However, I grasp your larger point -- what if we wanted to extend or compositie hAtom in the future. Given the 80-20 rule right now, my feeling is to set aside the problem and if it arises, define a class~="opaque" element. --
  • Tantek: See the mfo-examples document, and add further thoughts on this matter there.

Opaqueness of summary and content

DavidJanes?: What one publisher considers the entry content may differ from another publisher's point of view. Is the content simply a div that does not contain any author/updated/published metadata etc, or could some of that metadata be relevant to the content as well as the entry? Consider updated. last-modified-brainstorming introduces an idea of using <ins> and <del> elements to indicate update time. Updates are also often included in entry content with further information. This suggests to me that the line of opaqueness is blurry.

Perhaps content and summary should not be opaque, and instead rely on the mfo proposal to avoid parsing into microformats below the content level. This approach would allow a single div to contain both "entry" and "content" classes should all metadata be considered content by the author, or would permit any other subset of the metadata to be considered content without repeating one's self.

Consider also the "read more"-style blog. The following nesting of div elements is illegal under current opacity rules:

<div class="content"><div class="summary">......

A further example is provided by _fil_ on #microformats, who uses the rel-tag microformat within his atom:content to be handled as tags in his feed reader.

Identification

The current spec under Schema:Nomenclature:Entry includes the text: "if practical, also define id="unique-identifier" to the Entry" What should be done with this id by parsers? How does this interact (if at all) with the interpretation of a rel=bookmark within the entry?

Also, how should a feed <id> element be filled out from a hAtom source document? Is a rel=bookmark at the feed level required?

The id elements in atom are supposed to survive all future movements of the blog to new hosting arrangements and the like. Are current feed URLs or even rel=bookmarks solid enough?

STATUS - OPEN.

HTML Title

Atom permits title to be either plain text or html. hAtom2Atom.xsl currently uses a plain text translation, and some feed readers seem not to handle html titles well (liferea does not normalize-whitespace, for example). Should a hAtom title element become a plain text or a html atom title? If so, should a subset of html be passed through rather than all html (including id, etc)?

rel-tag

Should hAtom use rel-tag for atom category elements? -- DavidJanes

  • Tantek: IMHO yes.
  • A version of this is currently implemented in hAtom2Atom.xsl, but the interpretation of rel-tag is not straightforward.
  • rel-tag uses the last path segment of a URI as its tag, for example <a href="http://apple.com/ipod" rel="tag">iPod</a>. Human-friendly content is permitted within the anchor. Atom defines three attributes on a category element. "term" is the category in use. "scheme" is a namespace for this category. "label" is a human-friendly text-only version of the category.
  • This looks like a clear mapping to me - term is last path segment; scheme is the tagspace and label is the text within the anchor? The problem is if the scheme + tag is not a true URL but a URI. So for your example, term is 'ipod, scheme is 'http://apple.com/' and label is iPod. Kevin Marks 15:03, 31 Dec 2005 (PST)
  • hAtom2Atom.xsl does not currently supply a scheme. Label is taken from the content of the anchor tag, and no special handling for content such as the title attribute of an img element is performed. Term is the portion of the href after the last slash character.
  • rel-tag permits url encoding for IRIs, as well as conversion of spaces to plus (+) characters. It is unclear whether the conversion of rel-tag data to atom:category/@term should attempt to reverse any such encoding. The handling of plus characters may be especially difficult to reverse (are the plus characters, or spaces?).
  • They are spaces. If you want plus characters use %2B Perhaps I should add this to rel-tag. Kevin Marks 15:03, 31 Dec 2005 (PST)

Excess disambiguation rules?

Disambiguation rules apply to feed and entry title, and hAtom2Atom.xsl implements these. Rules also apply to permalink, published, and updated. These are currently not implemented. If they appear multiple times in the source document they are repeated multiple times.

It is clear that the data relating to these fields may be repeated within a hAtom entry, however the class notation may not. Only one element need be marked with rel="bookmark". Only one need be marked published, and one updated. Should the disambiguation rules be removed and only one element be allowed for each value, or is there value to the publisher in marking different elements with the hAtom class names?

Dependencies

mfo

Does this specification depend on acceptance of a hAtom-compatible mfo? See mfo-examples.

Is atom:content necessary?

Atom's structure is built up around separating content and other metadata. atom:updated, atom:author, and the like are separate from atom:content any may contain repeated data. Microformats are built around bringing the content and the metadata back together. Is there are genuine use case for identifying only part of the atom entry as content? Presumably the whole html entry is fit for human consumption, or it wouldn't be part of a microformatted web page. Could that whole html snippet be used as the content?

Published as default value for atom:updated

It seems to be common practice to include an "updated" section within the main blog content to track updates to an atom:entry as they occur. It is less common to include a value for atom:published within atom:content. atom:published is usually provided by a machine, but atom:updated is often provided by a human.

I suggest that if a value of published exists but no value for updated exists that the required updated field be filled out from the optional published field. I think this would make changing the required value of updated easier for publishers. Also, several updates may occur to a single entry. I suggest that a disambiguation rule be applied such that the the latest timestamp of any updated field be used if several exist. The overal parser semantics would therefore be:

  1. If multiple updated fields exist, choose the most recent one.
  2. If only one updated field exists, choose that value.
  3. If no updated field exists but a published field exists, use the published value for atom:updated.
+ 1 Robert Bachmann

Designating the page author

(2006-02-07 raised by Robert Bachmann)

“[I]f an Entry has 0 Entry Author elements, the "logical Entry Author" is assumed to be the author of the XHTML page”

  • How do I designate the page author(s)?
    • class="author" outside of the hAtom entry?
    • <address class="author"> outside of the hAtom entry?
    • <address> outside of the hAtom feed (i.e. at the page level)?
  • How do I designate the feed author(s)?

(2006-02-13 example by Chris Casciano) There is a live case showing this issue at http://chunkysoup.net - The posts are now hatom'd but since I am the only author the individual entries do not repeast the info with each entry. I do have an hcard with my (the page author's) information in the fotter of the page, but at the moment it is not designated via the <address> element due to sematics/content. FWIW, it is also outside of the block designated as the hfeed.

Proposal

  • If no author is found at the entry level try to use the author(s) at the feed level (i.e: class="author" at the feed level)
  • If no author is found at the feed level try to use all <address>’s outside of the feed as authors.

Entry Updated Required? -- Blogger

  • 2006-03-06 raised by singpolyma.
    1. The hAtom 0.1 spec states if there is no Entry Updated element...the page is invalid hAtom I have a real problem with this because I work with Blogger, where we cannot output datetime-design-pattern-compatible datestrings for our posts... We can output some different human-readable formats and we can output a nanosecond unix-timestamp, but the template tags will not output YYYY-MM-DDTHH:MM:SS+ZZ:ZZ no matter what you do... so how are we to resolve this so that Blogger blogs can use hAtom? I ask primarily because I am wanting to update my XOXO Blog Format, which is based on hAtom, to comply with the new version of the standard -- and all my test-cases are on Blogger blogs...

See Also

Template

Please use this format (copy and paste this to the end of the list to add your issues):

  • YYYY-MM-DD raised by YOURNAME.
    1. Issue 1: Here is the first issue I have.
    2. Issue 2: Here is the second issue I have.