[uf-discuss] Universal Feed Parser 4.2 will support microformats

David Janes -- BlogMatrix davidjanes at blogmatrix.com
Sun Jan 15 05:11:54 PST 2006


Mark Pilgrim wrote:
> I just checked support for basic microformat parsing into feedparser.py CVS.
> 
> Currently supported:
> - rel=tag (maps to 'tags', like atom:category, rss:category, dc:subject, etc.)
> - rel=enclosure (maps to 'enclosures', like rss:enclosure and
> atom:link[@rel=enclosure])
> - XFN
> 
> To David: sorry, I decided against using your Almost Universal
> Microformat Parser because it requires well-formed XHTML, which is
> unacceptable for my needs.  

*sniff* :-)

I hear what you're saying though. It does use TIDY to clean up malformed 
HTML but since I assume you're going though each entry and pulling out 
the contents, that would be somewhat expensive.

> I may, however, try to adapt your code to
> handle more complicated microformats like hCard, which would be quite
> messy to support with my current SAX-based approach.

Regards, etc...
David



More information about the microformats-discuss mailing list