[microformats-discuss] When we say "XHTML" do we mean...

Mark Pilgrim pilgrim at gmail.com
Mon Oct 17 07:06:01 PDT 2005


On 10/17/05, Ryan Tomayko <rtomayko at gmail.com> wrote:
> Speaking of bozo bits, I was going to ask Mark P. how he was attacking this
> problem in his implementations atop Firefox/Greasemonkey but the beauty
> there is that you can ride on the liberalness provided by the browser's
> parser, right? (You weren't going to tell anyone were you.) I'm assuming

All of my test cases
(http://diveintomark.org/projects/greasemonkey/hcard/tests/) are
served as application/xhtml+xml, so I've honestly never tried it on
tag soup markup.

> that in the event that Mark's DOM-walking code comes across a
> namespace-unqualified or, <gasp>, upper-case element that looks a lot like
> hCard, that it's not going to care because Mark's goal is to extract
> information as opposed to enforcing rules that 99% of the web ignore? Mark,

No, my goal was to write a test suite that accurately reflected hcard,
hcard-examples, and hcard-parsing, and then to write a parser that
passed those tests.  As soon as it passed the last test, I stopped
coding.  Literally.

> am I correct that your microformat implementations will handle the dirtiest,
> soupiest, unbalancedist markup the web has to offer?

I have no idea.  I would guess that there are some subtle surprises
lurking somewhere.  I have no plans to test with invalid markup.

--
Cheers,
-Mark

