parsers: Difference between revisions
(added Optimus, heading ids, shortened heading (put expansion in prose text)) |
|||
Line 6: | Line 6: | ||
== Javascript == | == Javascript == | ||
=== Sumo === | === Sumo === | ||
* [http://www.danwebb.net/2007/2/9/sumo-a-generic-microformats-parser-for-javascript Sumo! A Generic Microformats Parser For JavaScript] | * [http://www.danwebb.net/2007/2/9/sumo-a-generic-microformats-parser-for-javascript Sumo! A Generic Microformats Parser For JavaScript] | ||
== Perl == | == Perl == | ||
=== Data::Microformat === | === Data::Microformat === | ||
* [http://search.cpan.org/~ussjoin/Data-Microformat-0.01/lib/Data/Microformat/hCard.pm Data::Microformat] is a CPAN module to parse and create hCard, adr, and geo. | * [http://search.cpan.org/~ussjoin/Data-Microformat-0.01/lib/Data/Microformat/hCard.pm Data::Microformat] is a CPAN module to parse and create hCard, adr, and geo. | ||
Line 17: | Line 15: | ||
===Swignition === | ===Swignition === | ||
* [http://buzzword.org.uk/swignition/ Swignition] is a parser for both “upper case Semantic Web” (RDF, RDFa) and “lower case semantic web” (microformats) technologies. It includes modules for exporting parsed data in a variety of formats, including RDF, vCard, iCalendar, Atom and KML. | * [http://buzzword.org.uk/swignition/ Swignition] is a parser for both “upper case Semantic Web” (RDF, RDFa) and “lower case semantic web” (microformats) technologies. It includes modules for exporting parsed data in a variety of formats, including RDF, vCard, iCalendar, Atom and KML. | ||
** By Toby Inkster | ** By Toby Inkster | ||
=== Text::Microformat === | === Text::Microformat === | ||
[http://code.google.com/p/ufperl/ Google Code] | [http://code.google.com/p/ufperl/ Google Code] | ||
== PHP == | == PHP == | ||
=== XMFP === | |||
=== XMFP | <span id="xmfp">[http://code.google.com/p/xmfp/ XMFP]</span> (eXtensible MicroFormats Parser for PHP 5) by [http://www.metonymie.com Emiliano Martínez Luque] is a set of PHP 5 classes providing a simple API for extracting Microformated Content either from a URI or a String representing HTML/XML. It can return the results as a PHP associative array, a JSON definition or an XML representation of the data. It supports most of the currently accepted microformats and can be easily extended to add new ones, it also has full support for the include pattern and provides basic validation of microformated data. | ||
[http://code.google.com/p/xmfp/ XMFP] by [http://www.metonymie.com Emiliano Martínez Luque] is a set of PHP 5 classes providing a simple API for extracting Microformated Content either from a URI or a String representing HTML/XML. It can return the results as a PHP associative array, a JSON definition or an XML representation of the data. It supports most of the currently accepted microformats and can be easily extended to add new ones, it also has full support for the include pattern and provides basic validation of microformated data. | |||
=== hKit Microformats Toolkit for PHP5 === | === hKit Microformats Toolkit for PHP5 === | ||
Line 44: | Line 38: | ||
== Python == | == Python == | ||
===AUMP=== | ===AUMP=== | ||
* [http://aump.googlecode.com AUMP] is a parser written by David Janes. It supports hCard, hCalendar, hAtom, hReview and hListing. | * [http://aump.googlecode.com AUMP] is a parser written by David Janes. It supports hCard, hCalendar, hAtom, hReview and hListing. | ||
** Uses Python's [http://docs.python.org/library/xml.dom.minidom.html xml.dom.minidom] after cleaning input through [http://www.w3.org/People/Raggett/tidy/ HTML Tidy]. | ** Uses Python's [http://docs.python.org/library/xml.dom.minidom.html xml.dom.minidom] after cleaning input through [http://www.w3.org/People/Raggett/tidy/ HTML Tidy]. | ||
===Microtron=== | ===Microtron=== | ||
* [http://github.com/amccollum/microtron Microtron] is intended to be a general-purpose microformat parser/transformer. It can operate on the definition file included in [[Optimus]], making it a close replacement for certain tasks, and can easily be extended with new formats without modifying the source. The primary advantages are speed (> 100x faster that [[Optimus]] for some operations), simplicity (single file) and small code size (currently < 150 lines). | * [http://github.com/amccollum/microtron Microtron] is intended to be a general-purpose microformat parser/transformer. It can operate on the definition file included in [[Optimus]], making it a close replacement for certain tasks, and can easily be extended with new formats without modifying the source. The primary advantages are speed (> 100x faster that [[Optimus]] for some operations), simplicity (single file) and small code size (currently < 150 lines). | ||
** Requires [http://codespeak.net/lxml/ lxml] and [http://pypi.python.org/pypi/isodate/ isodate] | ** Requires [http://codespeak.net/lxml/ lxml] and [http://pypi.python.org/pypi/isodate/ isodate] | ||
== Ruby == | == Ruby == | ||
=== mofo ruby microformats parser=== | === mofo ruby microformats parser=== | ||
* [http://mofo.rubyforge.org/ mofo], [http://groups.google.com/group/mofo-rb mofo Google Group], [http://github.com/defunkt/mofo/tree/master GitHub repository] | * [http://mofo.rubyforge.org/ mofo], [http://groups.google.com/group/mofo-rb mofo Google Group], [http://github.com/defunkt/mofo/tree/master GitHub repository] | ||
Line 73: | Line 63: | ||
== XSLT == | == XSLT == | ||
===Optimus=== | |||
* [[Optimus]] is open source XSLT that parses microformats, converts them into JSON or XML, and validates them too. | |||
===X2V=== | ===X2V=== | ||
* [http://hg.microformats.org/ XSLT for extracting microformats] | * [[X2V]] is [http://hg.microformats.org/ open source XSLT for extracting microformats]. | ||
** by Brian Suda | |||
== see also == | == see also == | ||
Line 80: | Line 74: | ||
* [[implementors]] | * [[implementors]] | ||
* [[user-interface]] | * [[user-interface]] | ||
* [[open-source]] |
Revision as of 04:10, 31 August 2009
Microformats Parsers
This page lists libraries that consume, transform or convert microformats. This is only a partial list. If you know of other such tools for microformats, please add them and list what specific microformats they support.
For now, this page copies (rather than moves) information from the existing Implementations page.
Javascript
Sumo
Perl
Data::Microformat
- Data::Microformat is a CPAN module to parse and create hCard, adr, and geo.
- By Brendan O'Connor / Six Apart
Swignition
- Swignition is a parser for both “upper case Semantic Web” (RDF, RDFa) and “lower case semantic web” (microformats) technologies. It includes modules for exporting parsed data in a variety of formats, including RDF, vCard, iCalendar, Atom and KML.
- By Toby Inkster
Text::Microformat
PHP
XMFP
XMFP (eXtensible MicroFormats Parser for PHP 5) by Emiliano Martínez Luque is a set of PHP 5 classes providing a simple API for extracting microformatted content from either a URI or a string representing HTML/XML. It can return the results as a PHP associative array, a JSON definition or an XML representation of the data. It supports most of the currently accepted microformats and can be easily extended to add new ones. It also has full support for the include pattern and provides basic validation of microformatted data.
hKit Microformats Toolkit for PHP5
hKit Microformats Toolkit for PHP5 as announced by Drew McLellan. See also hKit on this wiki.
PHP Microformats parser
Microformats parser is a PHP package for extracting the microformats data embedded in HTML. The gathered data is stored as an xArray of objects, one for each microformat type container found. Announcement. The parser supports most of hCard (missing SOUND), hCalendar, hReview (missing item info; the spec needs some clarification) and rel elements, according to their respective specifications on the microformats wiki.
Transformr
A simple set of XSLT and PHP tools for transforming microformats, available from Google Code and as a web service.
hCard Validator
The source code of the hCard Validator contains XSLT and PHP code for hCard and the include pattern.
Python
AUMP
- AUMP is a parser written by David Janes. It supports hCard, hCalendar, hAtom, hReview and hListing.
- Uses Python's xml.dom.minidom after cleaning input through HTML Tidy.
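The minidom-based approach mentioned above can be illustrated with a minimal sketch. This is not AUMP's code or API: the function names (`has_class`, `text_of`, `find_by_class`, `parse_hcards`) and the sample markup are hypothetical, only three hCard properties are handled, and the input is assumed to be well-formed XHTML already, so the HTML Tidy cleaning step is left out.

```python
from xml.dom import minidom


def has_class(element, name):
    """Return True if the element's class attribute contains the token."""
    return name in element.getAttribute("class").split()


def text_of(node):
    """Concatenate the text of a node and all of its descendants."""
    if node.nodeType == node.TEXT_NODE:
        return node.data
    return "".join(text_of(child) for child in node.childNodes)


def find_by_class(root, name):
    """Return descendant elements, in document order, carrying the class token."""
    return [el for el in root.getElementsByTagName("*") if has_class(el, name)]


def parse_hcards(xhtml):
    """Extract fn, org and url from every element with class 'vcard'.

    Hypothetical helper for illustration; assumes well-formed input.
    """
    doc = minidom.parseString(xhtml)
    cards = []
    for card in find_by_class(doc, "vcard"):
        entry = {}
        for prop in ("fn", "org", "url"):
            matches = find_by_class(card, prop)
            if not matches:
                continue
            el = matches[0]
            # For url, prefer the href attribute over the link text.
            if prop == "url" and el.hasAttribute("href"):
                entry[prop] = el.getAttribute("href")
            else:
                entry[prop] = text_of(el).strip()
        cards.append(entry)
    return cards


SAMPLE = """<div class="vcard">
  <a class="fn url" href="http://example.com/">Jane Doe</a>
  <span class="org">Example Org</span>
</div>"""

if __name__ == "__main__":
    print(parse_hcards(SAMPLE))
```

Real-world HTML is rarely well-formed, which is presumably why a cleaning pass comes first: `minidom.parseString` raises an error on markup that is not valid XML.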
Microtron
- Microtron is intended to be a general-purpose microformat parser/transformer. It can operate on the definition file included in Optimus, making it a close replacement for certain tasks, and can easily be extended with new formats without modifying the source. The primary advantages are speed (> 100x faster than Optimus for some operations), simplicity (single file) and small code size (currently < 150 lines).
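The idea of a definition-driven parser, where formats are described as data and new ones are added without touching the parsing code, might look roughly like the sketch below. It does not use Microtron's API or the Optimus definition file format: the `DEFINITIONS` table, the `classes` and `extract` functions, and the sample markup are all assumptions made for illustration, and the standard library's `xml.etree.ElementTree` stands in for lxml.

```python
import xml.etree.ElementTree as ET

# Hypothetical definition table: root class name -> property class names.
# A real definition file would carry far more detail than this.
DEFINITIONS = {
    "vcard": ["fn", "org"],
    "geo": ["latitude", "longitude"],
}


def classes(el):
    """Return the list of class tokens on an element."""
    return el.get("class", "").split()


def extract(xhtml, definitions):
    """Find every format listed in `definitions` and collect its properties."""
    root = ET.fromstring(xhtml)
    results = []
    for el in root.iter():
        for fmt, props in definitions.items():
            if fmt not in classes(el):
                continue
            item = {"type": fmt}
            for prop in props:
                # Take the first descendant carrying the property class.
                for child in el.iter():
                    if child is not el and prop in classes(child):
                        item[prop] = "".join(child.itertext()).strip()
                        break
            results.append(item)
    return results


SAMPLE = (
    '<div><span class="geo">'
    '<span class="latitude">37.77</span>, '
    '<span class="longitude">-122.41</span>'
    "</span></div>"
)

if __name__ == "__main__":
    print(extract(SAMPLE, DEFINITIONS))
    # Supporting another format only requires a new table entry.
    extended = dict(DEFINITIONS, adr=["locality"])
    print(extract('<p class="adr"><span class="locality">Paris</span></p>', extended))
```

Because `extract` only reads the table, adding a format is a data change rather than a code change, which is the property the description above attributes to Microtron.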
Ruby
mofo ruby microformats parser
- mofo, mofo Google Group, GitHub repository
- by Chris Wanstrath
- Uses the Hpricot HTML/XML parser
Microformat Parser for Ruby
uformats
scrAPI
XSLT
Optimus
- Optimus is open source XSLT that parses microformats, converts them into JSON or XML, and validates them too.
X2V
- X2V is open source XSLT for extracting microformats.
- by Brian Suda