representative-object-brainstorming: Difference between revisions
m (→Methods of Prioritisation: Fixed source snippet) |
m (Replace <entry-title> with {{DISPLAYTITLE:}}) |
||
(2 intermediate revisions by one other user not shown) | |||
Line 1: | Line 1: | ||
{{DISPLAYTITLE:Microformat Objects Representing an Entire Page}} | |||
Pages may (and often do) contain multiple microformats; objects of different vocabularies, multiple objects from the same vocabulary, and objects from other sources of structure data. There are use cases (such as Search Engine Results) that want to use microformats in the page to better represent the page, but must work out which object is the most important, the one that really represents ''the page'', as opposed to being an incidental piece of data. | Pages may (and often do) contain multiple microformats; objects of different vocabularies, multiple objects from the same vocabulary, and objects from other sources of structure data. There are use cases (such as Search Engine Results) that want to use microformats in the page to better represent the page, but must work out which object is the most important, the one that really represents ''the page'', as opposed to being an incidental piece of data. | ||
Line 30: | Line 30: | ||
Many microformats including [[hCalendar]], [[hAtom]], [[hReview]] include sub-properties which are themselves microformats (<code>author</code>, <code>organizer</code>, <code>agent</code> etc.) Although parsable as standalone microformats as well, when used directly as a component of another microformat, they should be deprioritized. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | Many microformats including [[hCalendar]], [[hAtom]], [[hReview]] include sub-properties which are themselves microformats (<code>author</code>, <code>organizer</code>, <code>agent</code> etc.) Although parsable as standalone microformats as well, when used directly as a component of another microformat, they should be deprioritized. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | ||
'''This needs to be supported by real-world examples. Please add some!''' | |||
<div class="discussion"> | <div class="discussion"> | ||
Line 40: | Line 42: | ||
In a blogging context these entries are chronological content, their content is passing through the page as more content is written. As such, microformats nested inside entries could be deprioritized. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | In a blogging context these entries are chronological content, their content is passing through the page as more content is written. As such, microformats nested inside entries could be deprioritized. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | ||
'''This needs to be supported by real-world examples. Please add some!''' | |||
<div class="discussion"> | <div class="discussion"> | ||
Line 52: | Line 56: | ||
Some weight could be given to any object that links identifies the same domain in its <code>url</code> property. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | Some weight could be given to any object that links identifies the same domain in its <code>url</code> property. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | ||
'''This needs to be supported by real-world examples. Please add some!''' | |||
<div class="discussion"> | <div class="discussion"> | ||
Line 60: | Line 66: | ||
Since the <code>address</code> element may only be used by a person or organisation that is responsible for the page, the present of this on or within an hCard should add weight to that hCard. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | Since the <code>address</code> element may only be used by a person or organisation that is responsible for the page, the present of this on or within an hCard should add weight to that hCard. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | ||
'''This needs to be supported by real-world examples. Please add some!''' | |||
<div class="discussion"> | <div class="discussion"> | ||
Line 68: | Line 76: | ||
If authors could add a <code>representative</code> classname (or functional equivalent) to any microformat on a page, it could indicate to the parser that the object described is the definitive object over all others, and bypass heuristics. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | If authors could add a <code>representative</code> classname (or functional equivalent) to any microformat on a page, it could indicate to the parser that the object described is the definitive object over all others, and bypass heuristics. --[[User:BenWard|BenWard]] 06:39, 25 June 2009 (UTC) | ||
'''This needs to be supported by real-world examples. Please add some!''' | |||
<div class="discussion"> | <div class="discussion"> | ||
* … | * … | ||
</div> | </div> | ||
==Related Pages== | |||
* [[hcards-and-pages]] | |||
* [[representative-hcard]] |
Latest revision as of 16:32, 18 July 2020
Pages may (and often do) contain multiple microformats; objects of different vocabularies, multiple objects from the same vocabulary, and objects from other sources of structure data. There are use cases (such as Search Engine Results) that want to use microformats in the page to better represent the page, but must work out which object is the most important, the one that really represents the page, as opposed to being an incidental piece of data.
This page is for brainstorming methods of ranking all the structured data objects in a page, prioritizing and de-prioritizing them according to conditions, such that a consumer tool could pick the item ranked highest to confidently represent the page.
Examples of Problems
Note that the similarity of some of these problems highlights how subtle
- A personal blog homepage contains an hCard for the author and hAtom blog entries. You would represent the page using the hCard.
- A group blog contains hCards for the authors and hAtom entries. You would represent the page with information about the feed.
Please add more
Methods of Prioritisation
When you document a possible technique for analyzing/prioritizing each object, please give it a new heading. Follow this template for each new idea:
===Summary of Technique===
Detailed description. Provide some indication of the weight of this technique. --~~~~
<div class="discussion">
* …
</div>
Deprioritize Compound Objects
Many microformats including hCalendar, hAtom, hReview include sub-properties which are themselves microformats (author
, organizer
, agent
etc.) Although parsable as standalone microformats as well, when used directly as a component of another microformat, they should be deprioritized. --BenWard 06:39, 25 June 2009 (UTC)
This needs to be supported by real-world examples. Please add some!
- …
Deprioritize Objects Contained in hAtom Entries
Since hAtom entries represent articles, the content of each hentry
may contain other microformat objects — blog posts about an event or another person for example may contain hCalendar and hCard microformats.
In a blogging context these entries are chronological content, their content is passing through the page as more content is written. As such, microformats nested inside entries could be deprioritized. --BenWard 06:39, 25 June 2009 (UTC)
This needs to be supported by real-world examples. Please add some!
- …
Prioritize hCards with rel=me
or any object with a uid
property on the same domain
See also, representative-hcard for ways of working out which hCard is representative of a page, when compared to others (such as a blog author hcard in relation to hcards of article commentors)
Where an object is in a page with a property uid
pointing to the same domain as the current page, and/or making a rel=me
link to the same domain (hCards only), that object should be weighted in favour.
Some weight could be given to any object that links identifies the same domain in its url
property. --BenWard 06:39, 25 June 2009 (UTC)
This needs to be supported by real-world examples. Please add some!
- …
Prioritize hCards nested inside or containing address
elements
Since the address
element may only be used by a person or organisation that is responsible for the page, the present of this on or within an hCard should add weight to that hCard. --BenWard 06:39, 25 June 2009 (UTC)
This needs to be supported by real-world examples. Please add some!
- …
Create a new microformat for authors to explicitly publish the representative object for their page
If authors could add a representative
classname (or functional equivalent) to any microformat on a page, it could indicate to the parser that the object described is the definitive object over all others, and bypass heuristics. --BenWard 06:39, 25 June 2009 (UTC)
This needs to be supported by real-world examples. Please add some!
- …