parsers: Difference between revisions
m (→HTML::Microformats: add link to release announcement) |
m (→HTML::Microformats: add output formats, v 0.101 and 0.102.) |
||
Line 28: | Line 28: | ||
=== HTML::Microformats === | === HTML::Microformats === | ||
Perhaps we can capture and update this info on a page like [[perl-html-microformats-parser]]. | Perhaps we can capture and update this info on a page like [[perl-html-microformats-parser]]. | ||
[http://search.cpan.org/~tobyink/HTML-Microformats/ HTML::Microformats] is a CPAN module that has support for: | |||
* input: | |||
** rel: [[rel-enclosure]], [[rel-license]], [[rel-tag]], [[VoteLinks]], [[XFN]]. | ** rel: [[rel-enclosure]], [[rel-license]], [[rel-tag]], [[VoteLinks]], [[XFN]]. | ||
** class: [[adr]], [[figure]], [[geo]], [[hAtom]], [[hAudio]], [[User:TobyInk/hcalendar-1.1|hCalendar]], [[hCard]], [[hListing]], [[hNews]], [[hProduct]], [[hRecipe]], [[hResume]], [[hReview]], [[hReview-aggregate]], [[xFolk]], [[XMDP]], [[XOXO]]. | ** class: [[adr]], [[figure]], [[geo]], [[hAtom]], [[hAudio]], [[User:TobyInk/hcalendar-1.1|hCalendar]], [[hCard]], [[hListing]], [[hNews]], [[hProduct]], [[hRecipe]], [[hResume]], [[hReview]], [[hReview-aggregate]], [[xFolk]], [[XMDP]], [[XOXO]]. | ||
*** highly experimental: [[measure]], [[species]]. | *** highly experimental: [[measure]], [[species]]. | ||
** poshformats: [http://ocoins.info/ OpenURL COinS]. | ** poshformats: [http://ocoins.info/ OpenURL COinS]. | ||
** By [[User:TobyInk|Toby Inkster]]. | * output: | ||
** RDF: RDF/XML, Turtle, N-Triples, RDF/JSON, etc. | |||
** JSON | |||
** domain specific: vCard (3.0, 4.0 and XML), iCalendar, Atom, KML. | |||
* By [[User:TobyInk|Toby Inkster]]. | |||
Versions and releases notes: | Versions and releases notes: | ||
* 2011-02-05 0.102 | |||
* 2010-12-22 0.101 | |||
* 2010-12-16 [http://microformats.org/discuss/mail/microformats-discuss/2010-December/013363.html 0.100] | * 2010-12-16 [http://microformats.org/discuss/mail/microformats-discuss/2010-December/013363.html 0.100] | ||
* 2010-10-18 0.00_13 | * 2010-10-18 0.00_13 |
Revision as of 22:23, 6 February 2011
<entry-title>Microformats Parsers</entry-title>
- shortlinks
- http://ufs.cc/w/parsers (preferred)
- http://tr.im/ufprs
This page lists libraries that consume, transform or convert microformats. This is only a partial list. If you know of other such tools for microformats, please add them and list what specific microformats they support.
Alphabetical listing by programming language:
.Net
UfXtract is an open source .Net microformats parser. It can parse microformats from URLs or HTML strings. The extracted data can be used directly in .Net or converted into JSON, JSON-P or XML. Currently Supports 16 microformats and can easily be extended with new definitions.
Java
- Any23 (Anything to Triples) parses microformats into RDF triples
- org.microformats.hCard by Reinier Zwitserloot
Javascript
Microformat Shiv
- Microformat Shiv The microformat shiv provides a simple to use, light-weight, cross-browser microformat API. It even works with modern mobile browsers and is backward compatible with Internet Explore 6. Based on the microformat API for Mozilla's extensions. This is the same code base as the popular Operater plug-in.
Sumo
Perl
Data::Microformat
- Data::Microformat is a CPAN module to parse and create hCard, adr, and geo.
- By Brendan O'Connor / Six Apart
HTML::Microformats
Perhaps we can capture and update this info on a page like perl-html-microformats-parser. HTML::Microformats is a CPAN module that has support for:
- input:
- output:
- RDF: RDF/XML, Turtle, N-Triples, RDF/JSON, etc.
- JSON
- domain specific: vCard (3.0, 4.0 and XML), iCalendar, Atom, KML.
- By Toby Inkster.
Versions and releases notes:
- 2011-02-05 0.102
- 2010-12-22 0.101
- 2010-12-16 0.100
- 2010-10-18 0.00_13
- 2010-06-25 0.00_12
- 2010-06-23 0.00_11
- 2010-05-13 0.00_10
- 2010-05-12 0.00_09
- 2010-04-29 0.00_08
- 2010-04-28 0.00_07
- 2010-04-16 0.00_06
- 2010-04-16 0.00_05
- 2010-03-20 0.00_04
- 2010-03-09 0.00_03
- 2010-02-28 0.00_02
- 2010-02-24 0.00_01
- 2010-02-20 0.00_00
- see also the "Changes" file included in the CPAN distribution.
XML::Atom::Microformats
XML::Atom::Microformats provides the same functionality for Atom. It finds microformats in Atom entry content elements.
Swignition
- Swignition is a parser for both “upper case Semantic Web” (RDF, RDFa) and “lower case semantic web” (microformats) technologies. It includes modules for exporting parsed data in a variety of formats, including RDF, vCard, iCalendar, Atom and KML.
- By Toby Inkster
- Active development has moved to HTML::Microformats (see above).
Text::Microformat
- Text::Microformat is a microformats parser hosted on Google Code that supports:
PHP
XMFP
XMFP (eXtensible MicroFormats Parser for PHP 5) by Emiliano Martínez Luque is a set of PHP 5 classes providing a simple API for extracting Microformated Content either from a URI or a String representing HTML/XML. It can return the results as a PHP associative array, a JSON definition or an XML representation of the data. It supports most of the currently accepted microformats and can be easily extended to add new ones, it also has full support of the include pattern and provides basic validation of microformated data.
hKit Microformats Toolkit for PHP5
hKit Microformats Toolkit for PHP5 as announced by Drew McLellan. See also hKit on this wiki.
PHP Microformats parser
Microformats parser is a PHP package for extracting the microformats data embedded into HTML. The gathered data is stored as an xArray of objects - one for each microformat type container found. Announcement. The parser supports most of the hCard (missing SOUND), hCalendar, hReview (missing item info; spec really needs some clarification) and rel elements, according to their respective specification on microformats Wiki.
Transformr
A Simple set of XSLT and PHP tools for Transforming Microformats Source available from github Live webservice available at microform.at.
hCard Validator
Source code of the hCard Validator contains XSLT and PHP code for hCard and include microformats.
ARC2
ARC2 is a semantic web toolkit which includes support for hCard, adr, geo, XFN, hCalendar, hAtom, hResume, hReview, xFolk, rel-license and rel-tag. It's tri-licensed under the GPL 2 and 3, and the W3C Software licence.
Python
AUMP
- AUMP is a parser written by David Janes. It supports hCard, hCalendar, hAtom, hReview and hListing.
- Uses Python's xml.dom.minidom after cleaning input through HTML Tidy.
Microtron
- Microtron is intended to be a general-purpose microformat parser/transformer.
It can operate on the definition file included in Optimus, making it a close replacement for certain tasks, and can easily be extended with new formats without modifying the source. The primary advantages are speed (> 100x faster that Optimus for some operations), simplicity (single file) and small code size (currently < 150 lines).
Ruby
Prism
- Prism
- by Mark Wunsch
- Library and command line tool for parsing POSH/Microformats
- Uses the Nokogiri HTML, XML, SAX, and Reader parser
mofo ruby microformats parser
- mofo, mofo Google Group, GitHub repository
- by Chris Wanstrath
- Uses the Hpricot HTML/XML parser
Microformat Parser for Ruby
uformats
scrAPI
XSLT
Optimus
- Optimus is open source XSLT that parses microformats, converts them into JSON or XML, and validates them too.
X2V
- X2V is open source XSLT for extracting microformats.
- by Brian Suda
editing this page
For now, this page copies (rather than moves) information from the existing Implementations page.