[uf-new] Microformat for Datasets

info at csarven.ca info at csarven.ca
Mon Mar 25 11:21:10 PDT 2013


On 25.03.2013 13:12, Jordan Mendelson wrote:
> Hello,
>
> Has there been any work towards a microformat for datasets like what
> you'd find at http://data.gov, http:///commoncrawl.org, etc?
>
> Open data is becoming more common and there is a lot of metadata
> surrounding it (url, format of the data, size of dataset, when it was
> published, when it was updated, description, sample data,
> license/terms of use, contributors, geo (if data relates to an area),
> etc and really no way to easily find it outside some very incomplete
> directories.
>
> With a microformat, one might actually be able to build a decent
> search engine to help people who are searching for datasets for use 
> in
> research, commerce, etc.
>
> My organization publishes several hundred TB of web crawl data and at
> a recent talk at Strata, someone asked me about a microformat for
> datasets. I feel like if there isn't one started yet, one needs to be
> started.
>
>
> Jordan
> _______________________________________________
> microformats-new mailing list
> microformats-new at microformats.org
> http://microformats.org/mailman/listinfo/microformats-new

I won't bug you with the "guidelines" on how to come up with new 
microformats because you can dig that up from the wiki and then 
collaborate with the community.

If you do end up developing a new microformats for datasets, I suggest 
to take a look at VoID [1] and DCAT [2] (RDF vocabularies) as well.

[1] http://www.w3.org/TR/void/
[2] http://www.w3.org/TR/vocab-dcat/

-Sarven


More information about the microformats-new mailing list