[uf-discuss] Universal Feed Parser 4.2 will support microformats
David Janes -- BlogMatrix
davidjanes at blogmatrix.com
Sun Jan 15 05:11:54 PST 2006
Mark Pilgrim wrote:
> I just checked support for basic microformat parsing into feedparser.py CVS.
> Currently supported:
> - rel=tag (maps to 'tags', like atom:category, rss:category, dc:subject, etc.)
> - rel=enclosure (maps to 'enclosures', like rss:enclosure and
> - XFN
> To David: sorry, I decided against using your Almost Universal
> Microformat Parser because it requires well-formed XHTML, which is
> unacceptable for my needs.
I hear what you're saying, though. It does use TIDY to clean up malformed
HTML, but since I assume you're going through each entry and pulling out
the contents, running TIDY on every entry would be somewhat expensive.
> I may, however, try to adapt your code to
> handle more complicated microformats like hCard, which would be quite
> messy to support with my current SAX-based approach.
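
For illustration, here is a minimal sketch of event-driven rel=tag
extraction that tolerates markup which is not well-formed. It is not
feedparser's code: it assumes Python 3 and the standard library's
html.parser module, and the names RelTagCollector and extract_tags, along
with the sample markup, are made up for this example. Following the rel-tag
convention, the tag name is taken from the last path segment of the link URL.

```python
from html.parser import HTMLParser
from urllib.parse import urlsplit, unquote


class RelTagCollector(HTMLParser):
    """Hypothetical helper: collect rel="tag" links from messy HTML."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attributes = dict(attrs)
        # rel is a space-separated list, e.g. rel="nofollow tag".
        rel_values = (attributes.get("rel") or "").lower().split()
        href = attributes.get("href")
        if "tag" in rel_values and href:
            # The tag name is the last path segment of the URL.
            path = urlsplit(href).path.rstrip("/")
            self.tags.append(unquote(path.rsplit("/", 1)[-1]))


def extract_tags(html):
    """Return the rel-tag names found in an HTML fragment."""
    collector = RelTagCollector()
    collector.feed(html)
    collector.close()
    return collector.tags


# Unclosed <p>, <b> and <a> elements: not well-formed, but still parseable.
sample = (
    '<p>Filed under <a href="http://example.com/tags/python" rel="tag">Python'
    '<p><b>and <a rel="nofollow tag" '
    'href="http://example.com/tags/micro%20formats/">uf</a>'
)
print(extract_tags(sample))
```

A flat, single-element format like rel=tag fits this event-driven style
well. A nested format such as hCard would need extra state to track which
properties belong to which enclosing element, which is where a tree-based
parser becomes attractive.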