<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://microformats.org/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=MaxVoelkel</id>
	<title>Microformats Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://microformats.org/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=MaxVoelkel"/>
	<link rel="alternate" type="text/html" href="https://microformats.org/wiki/Special:Contributions/MaxVoelkel"/>
	<updated>2026-06-10T10:08:56Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.38.4</generator>
	<entry>
		<id>https://microformats.org/wiki/index.php?title=parsing-microformats&amp;diff=2532</id>
		<title>parsing-microformats</title>
		<link rel="alternate" type="text/html" href="https://microformats.org/wiki/index.php?title=parsing-microformats&amp;diff=2532"/>
		<updated>2005-10-27T12:08:37Z</updated>

		<summary type="html">&lt;p&gt;MaxVoelkel: Added CyberNeko link http://people.apache.org/~andyc/neko/doc/html/&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Microformat Parsing =&lt;br /&gt;
&lt;br /&gt;
Microformat parsing mechanisms that depend on documents having even minimal xml properties like well-formedness may fail when consuming non-well-formed content.  [http://tidy.sourceforge.net/ Tidy] or even better [http://people.apache.org/~andyc/neko/doc/html/ CyberNeko] may be a useful work around.&lt;br /&gt;
In particular  [http://suda.co.uk/projects/X2V/ Brian Suda's frequently cited X2V hCard and hCalendar discovery and transformation prototypes] use XSLT, and &amp;quot;tidy&amp;quot; any non-well-formed input before processing it.&lt;br /&gt;
&lt;br /&gt;
Most microformats tend to be agnostic about things like exact element type used.&lt;br /&gt;
&lt;br /&gt;
Developers can use tools like XPATH that assume well-formedness on well-formed content (from the web or by using tidy).  Mark Pilgrim's example [http://sourceforge.net/projects/feedparser/ universal feed parser] suggests that it may be possible to sanitize user html to an extent that it is suitable for later processing as xml.&lt;br /&gt;
&lt;br /&gt;
==== See Also ====&lt;br /&gt;
&lt;br /&gt;
* [[xmdp-brainstorming]]&lt;/div&gt;</summary>
		<author><name>MaxVoelkel</name></author>
	</entry>
</feed>