[uf-discuss] generic microformat parsing heuristics?

brian suda brian.suda at gmail.com
Mon Nov 7 11:52:07 PST 2005

To build a generic microformat parser, you would first need to build a
generic XMDP parser. Before you can parse the XHTML and find property
names, you need to learn from the XMDP profile what to look for and
where to look. I started to work on a universal XMDP parser, but ran
into problems. This is an old discussion from the list[1,2]. If you have
further specific questions, look through the archives first, and post a
message if the answer is not there. I know there is also a way to encode
RDF in XHTML that takes a slightly different approach than microformats
to finding and encoding data.
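
A minimal sketch of the two-step idea above, in Python. It is only an
illustration under stated assumptions, not the parser mentioned in this
message: it assumes the XMDP profile is well-formed XHTML with a
<dl class="profile"> whose "class" entry lists the property names as
nested <dt id="..."> elements, and the helper names (xmdp_class_names,
find_properties) and the sample PROFILE and PAGE are made up for the
example. It also takes every value from the element text, so it ignores
per-property rules such as reading a URL from href.

```python
import xml.etree.ElementTree as ET


def local(tag):
    """Strip an XML namespace prefix such as '{http://...}dl' -> 'dl'."""
    return tag.rsplit("}", 1)[-1]


def xmdp_class_names(profile_xml):
    """Step 1: read the XMDP profile and collect the class names it defines.

    Assumption: class values are the <dt id="..."> elements nested inside
    the <dd> that follows <dt id="class"> in a <dl class="profile">.
    """
    root = ET.fromstring(profile_xml)
    names = set()
    for dl in root.iter():
        if local(dl.tag) != "dl" or "profile" not in dl.get("class", "").split():
            continue
        current = None
        for child in dl:
            if local(child.tag) == "dt":
                current = child.get("id")
            elif local(child.tag) == "dd" and current == "class":
                names.update(
                    dt.get("id")
                    for dt in child.iter()
                    if local(dt.tag) == "dt" and dt.get("id")
                )
    return names


def find_properties(page_xml, names):
    """Step 2: walk the XHTML page and report elements using those names."""
    root = ET.fromstring(page_xml)
    found = []
    for el in root.iter():
        for token in el.get("class", "").split():
            if token in names:
                # Simplification: always use the text content as the value.
                found.append((token, "".join(el.itertext()).strip()))
    return found


# Hypothetical sample data, only to exercise the two functions.
PROFILE = """<dl class="profile">
  <dt id="class">class</dt>
  <dd>
    <dl>
      <dt id="vcard">vcard</dt><dd>A contact card.</dd>
      <dt id="fn">fn</dt><dd>Formatted name.</dd>
      <dt id="url">url</dt><dd>Home page.</dd>
    </dl>
  </dd>
</dl>"""

PAGE = """<div class="vcard">
  <a class="url fn" href="http://example.org/">Jane Doe</a>
</div>"""

if __name__ == "__main__":
    for name, value in find_properties(PAGE, xmdp_class_names(PROFILE)):
        print(name, value)
```

The profile tells the parser which names exist, but not how to read each
value or how properties nest, so a real generic parser would need more
than this flat scan.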


[1] -
[2] -

Phil Dawes wrote:

> Hi Tantek,
> Tantek Çelik wrote:
> > Phil,
> >
> > Take a look at hCard parsing:
> >
> >  http://microformats.org/wiki/hcard-parsing
> >
> > Much of what is embodied there generalizes to other microformats.
> >
> Excellent - many thanks.
> Out of interest, do you think that a generic microformats parser _can_
> be written?
> (e.g. something that could parse hcard, hcal et al out of xhtml without
> prior knowledge of their precise schemas?)
> I ask because we're starting to wonder about embedding our own internal
> microformats into webapps at work[1] (e.g. maybe for financial reference
> data), but we'd want to be able to use off-the-shelf generic tools to
> parse, aggregate and query the custom data.
> Thanks again,
> Phil
> [1] http://www.drkw.com/
> _______________________________________________
> microformats-discuss mailing list
> microformats-discuss at microformats.org
> http://microformats.org/mailman/listinfo/microformats-discuss
