species

From Microformats Wiki
Revision as of 09:46, 1 February 2007 by AndyMabbett (talk | contribs) (→‎Proposal: typo)
Jump to navigation Jump to search

Species

For the latest ideas, and to make comments, please see species-brainstorming.
Note: the original name of the proposed microformat, "species", is likely to change, probably to "biota" or "taxon". The former has been retained here, to avoid having to make many repetitive and perhaps redundant edits

Introduction

People use the vernacular AND taxonomic names of species in everyday speech and writing - just read or watch any populist gardening magazine or television programme.

Consider this list: "Blackbird", "poodle", "T Rex", "potato", "French Marigold", "Wisteria", "E. Coli", "HIV", "Rubella" and "human being".

"T Rex" is "Tyrannosaurus rex"; "E. Coli" is "Escherichia coli"; "HIV" is "Human immunodeficiency virus" and "Rubella" is "Rubella virus". All are the taxonomic (or scientific) names of unique species.

"Wisteria" is a taxonomic genus.

"Blackbird"; "poodle"; "potato"; "French Marigold" and "human being" (arguments about Neanderthals not withstanding) are vernacular (or common) names, but still refer to individual species.

The scientific naming of organisms is a part of biodiversity informatics - "the application of information technology to the domain of biodiversity".

Also, for interest, see Note 4, on Robert Rothenburg's April 1998 paper on Dictionaries. It's interesting that microformats have given us the first three missing items - and were now debating the fourth!

Proposal

Imagine viewing a web page with a reference to a species - and being able to use an add-on to you browser to be taken directly to information about that species, on, say, Wikipedia, or Wikispecies, or Google Images, or another site, such as in an academic database, of your choosing.

Your software would automatically know to search site A if the scientific name referred to a moth, site B for a bird, and site C for a plant - and you could set your preferences as to which sites those were to be, and in which order two or more were to be searched (e.g. for moths, try UK Moths first, if not found try The Global Lepidoptera Names Index).

Or supposing someone writes a long, chronologically-ordered web page about all the birds, insects, mammals and plants they saw on a wildlife safari, with lots of prose description about the paces where they saw them and the people they were with, but you want to extract a list of species, sorted into alphabetical order within taxonomic class (birds first, then insects then...) or in taxonomic order.

Those are just two of the things a "species" microformat might do for you.

Another benefit would be that user-agents could be instructed to treat text marked up in this way as not being in the base language of the document or element in which they occur - pronunciation should be as for Latin, they should not be translated (e.g. where a component word happens also to be a valid word in that language) and should not be spell-checked , or be spell-checked with a specialised dictionary. (identified as a need in this 2003 ietf-languages discussion of language values for taxonomic names).

Existing taxonomies

The proposal respects all existing biological taxonomies, and is not intended to change or supplant any of them - it is intended merely to provide webmasters (from personal hobby sites to major academic databases; from news outlets to retail organisations) with a method of either:

  1. marking-up a taxonomical name (or taxon-common name pair) in such a way that its components can be recognised by computers or
  2. marking up a common name, so as to associate with it a taxonomical name, in such a way that the latter's components can be recognised by computers.

Embedding within other microformats

The proposed plant microformat (with care regime, supplier, etc.), hlisting, recipe or hReview (and possibly others) could contain a scientific name microformat, in the same way that an hCalendar can contain an hCard.

See also: species-brainstorming#Future development

References

Contributors & Supporters

See also

Here's some work-in-progress:

Implementations (pending)

  • Taxon Checker - a software tool which, given a common name, searches for the relevant taxonomic data and outputs the selected species' details as (among other options) an HTML fragment. It is intended to provide templates for outputting such fragments with "species" microformat markup, once this proposal is implemented.